warp-os 1.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (49) hide show
  1. package/CHANGELOG.md +327 -0
  2. package/LICENSE +21 -0
  3. package/README.md +308 -0
  4. package/VERSION +1 -0
  5. package/agents/warp-browse.md +715 -0
  6. package/agents/warp-build-code.md +1299 -0
  7. package/agents/warp-orchestrator.md +515 -0
  8. package/agents/warp-plan-architect.md +929 -0
  9. package/agents/warp-plan-brainstorm.md +876 -0
  10. package/agents/warp-plan-design.md +1458 -0
  11. package/agents/warp-plan-onboarding.md +732 -0
  12. package/agents/warp-plan-optimize-adversarial.md +81 -0
  13. package/agents/warp-plan-optimize.md +354 -0
  14. package/agents/warp-plan-scope.md +806 -0
  15. package/agents/warp-plan-security.md +1274 -0
  16. package/agents/warp-plan-testdesign.md +1228 -0
  17. package/agents/warp-qa-debug-adversarial.md +90 -0
  18. package/agents/warp-qa-debug.md +793 -0
  19. package/agents/warp-qa-test-adversarial.md +89 -0
  20. package/agents/warp-qa-test.md +1054 -0
  21. package/agents/warp-release-update.md +1189 -0
  22. package/agents/warp-setup.md +1216 -0
  23. package/agents/warp-upgrade.md +334 -0
  24. package/bin/cli.js +44 -0
  25. package/bin/hooks/_warp_html.sh +291 -0
  26. package/bin/hooks/_warp_json.sh +67 -0
  27. package/bin/hooks/consistency-check.sh +92 -0
  28. package/bin/hooks/identity-briefing.sh +89 -0
  29. package/bin/hooks/identity-foundation.sh +37 -0
  30. package/bin/install.js +343 -0
  31. package/dist/warp-browse/SKILL.md +727 -0
  32. package/dist/warp-build-code/SKILL.md +1316 -0
  33. package/dist/warp-orchestrator/SKILL.md +527 -0
  34. package/dist/warp-plan-architect/SKILL.md +943 -0
  35. package/dist/warp-plan-brainstorm/SKILL.md +890 -0
  36. package/dist/warp-plan-design/SKILL.md +1473 -0
  37. package/dist/warp-plan-onboarding/SKILL.md +742 -0
  38. package/dist/warp-plan-optimize/SKILL.md +364 -0
  39. package/dist/warp-plan-scope/SKILL.md +820 -0
  40. package/dist/warp-plan-security/SKILL.md +1286 -0
  41. package/dist/warp-plan-testdesign/SKILL.md +1244 -0
  42. package/dist/warp-qa-debug/SKILL.md +805 -0
  43. package/dist/warp-qa-test/SKILL.md +1070 -0
  44. package/dist/warp-release-update/SKILL.md +1211 -0
  45. package/dist/warp-setup/SKILL.md +1229 -0
  46. package/dist/warp-upgrade/SKILL.md +345 -0
  47. package/package.json +40 -0
  48. package/shared/project-hooks.json +32 -0
  49. package/shared/tier1-engineering-constitution.md +176 -0
package/README.md ADDED
@@ -0,0 +1,308 @@
1
+ # WarpOS — A development operating system for Claude Code
2
+
3
+ > The goal of this project is to reduce the friction between you and a completed idea, while still allowing you to make the decisions that only you can make. Everything in this repo works together to achieve this - to find the line between the problems that can *only* be solved by you, and the problems that can be delegated to Claude Code. The goal here is reduce the barriers that hinder your progress so you can step back and delegate. Here, you are the architect, not the developer, and the whole design philosophy is - *the architect doesn't lay bricks*.
4
+
5
+ ## What Is Warp?
6
+
7
+ Warp is a development operating system with hardwired anti-slop guardrails that transforms Claude Code from a coding assistant into a verified engineering pipeline. Other AI coding tools say "smarter prompts" or "better models." Warp says "we don't trust the AI either — that's why the guardrails can't hallucinate." You architect. Warp engineers. Trustless verification.
8
+
9
+ ## Quick Start
10
+
11
+ ### Requirements
12
+
13
+ - **Claude Code** — the CLI, VS Code extension, JetBrains extension, desktop app, or claude.ai/code
14
+ - **Git** — for cloning and for skills that read git history
15
+
16
+ ### Install
17
+
18
+ ```bash
19
+ npx warp-os install
20
+ ```
21
+
22
+ One command. Works on Windows, Mac, and Linux. No symlinks, no Developer Mode, no admin required.
23
+
24
+ Copies 16 skills and 19 agent definitions to `~/.claude/`, installs hook scripts and the engineering foundation to `~/.warp/`, and sets up [claude-mem](https://github.com/thedotmack/claude-mem) for persistent cross-session memory. Hooks are activated per-project by `/warp-setup` — not globally.
25
+
26
+ Requires Node.js 18+. Run the same command anytime to upgrade.
27
+
28
+ ### Setup (per project)
29
+
30
+ ```bash
31
+ # Open your project in Claude Code, then:
32
+ /warp-setup
33
+ ```
34
+
35
+ Setup detects your stack, installs missing verification tools, configures hooks, sets up MCP servers, activates the orchestrator, and for existing projects automatically analyzes your codebase.
36
+
37
+ **After setup, close and restart Claude Code.** The orchestrator activates on the next session — it won't take effect until you restart.
38
+
39
+ ### Upgrade
40
+
41
+ ```bash
42
+ npx warp-os install
43
+ ```
44
+
45
+ Same command as install — detects existing installation and overwrites with the latest version. Or from inside Claude Code: `/warp-upgrade`. Version tracked at `~/.warp/version`.
46
+
47
+ ## How It Works
48
+
49
+ One command (`/warp-setup`) configures everything — tool detection, hooks, project analysis, and the orchestrator. After a restart, the Warp orchestrator becomes your default session agent. You talk to it. It runs skills directly in your context — you collaborate in real-time on every decision. For QA, it runs dual-mode: a collaborative pass with you plus an independent adversarial pass that catches what you both missed. You make the decisions. Warp does everything else.
50
+
51
+ The guardrails are structural, not optional. Every TDD cycle hits an L1 deterministic gate (linters, type-check, credential scanning) that blocks on failure. The AI fixes the issue or it doesn't proceed. No exceptions, no reasoning around it. Figma MCP provides design specs, Chrome extension gives live visual collaboration, and Context7 prevents hallucinated APIs with live library docs.
52
+
53
+ **Every skill also works standalone.** You don't need the pipeline to get value. Drop `/warp-plan-architect` into any conversation to think through a system design. Run `/warp-qa-debug` when something breaks. Use `/warp-plan-security` before any deploy. The pipeline is there when you want it — the skills are useful the moment you install them.
54
+
55
+ ## How Do I Actually Use This?
56
+
57
+ After setup, the orchestrator handles routing. You don't need to memorize skill names — just tell it what you want to do.
58
+
59
+ **"I have an idea."**
60
+ Run `/warp-setup` in an empty project folder. Restart. The orchestrator routes you to brainstorm, then scope, architect, design, test specs, security audit. By the time planning is done, every build cycle is mapped out. Say "let's build" and the orchestrator runs build-code directly — you collaborate on every TDD cycle.
61
+
62
+ **"I have an existing project and want Warp to help."**
63
+ Run `/warp-setup` in your project folder. Setup detects your stack, configures verification hooks, and automatically analyzes your entire codebase. Restart your terminal. The orchestrator knows your project and is ready to plan, build, or debug.
64
+
65
+ **"Something is broken and I've been staring at it for an hour."**
66
+ Tell the orchestrator "debug this" — or run `/warp-qa-debug` directly. It systematically isolates the root cause using binary search, hypothesis testing, and scope locks. For complex bugs with multiple theories, it uses `/fork` to test hypotheses in parallel.
67
+
68
+ **"I just want to ship this PR."**
69
+ Tell the orchestrator "ship it" — or run `/warp-release-update` directly. Diff review, version bump, PR creation, deploy, canary monitoring, documentation update.
70
+
71
+ **"I stepped away for a week."**
72
+ Just open Claude Code in the project. The orchestrator auto-loads, shows pipeline state, and orients you to where you left off. [claude-mem](https://github.com/thedotmack/claude-mem) captures everything automatically — every tool use, every decision, every file change — compresses it with AI, and injects relevant context from past sessions on startup.
73
+
74
+ ## What Makes Warp Different?
75
+
76
+ There are other great projects for Claude Code on Github — [Superpowers](https://github.com/obra/superpowers) for subagent based TDD, [gstack](https://garryslist.org) for virtual engineering teams. Warp builds on their work (and credits them). The difference is scope and verification philosophy: Warp isn't a collection of skills thrown together — it's an operating system where skills run directly with the user present, every output is verified by deterministic tools the AI cannot influence, and dual-mode QA catches blind spots through independent adversarial review.
77
+
78
+ ### The Guardrails
79
+
80
+ The core premise: AI-generated code is guilty until proven innocent. Every verification layer in Warp exists because trusting AI output at face value is how slop ships.
81
+
82
+ **Three bias levels.** Warp classifies every verification by how much you should trust it. Level 1: deterministic tools — binary pass/fail, highest trust. Level 2: AI anchored to external sources (Context7 live docs, API contracts) — medium trust. Level 3: AI evaluating AI — lowest trust. The rule: every gate that *can* be Level 1 *must* be Level 1. Level 3 is never the only layer.
83
+
84
+ **Minimal hooks — 4 hooks, not 15.** Guards and QA automation were removed — with direct-first execution, the user is present and sees everything. The remaining hooks handle session bootstrap only:
85
+
86
+ | Hook | What it does |
87
+ |---|---|
88
+ | `identity-foundation.sh` | First-run welcome, injects foundation into orchestrator context |
89
+ | `identity-briefing.sh` | Welcome banner with branch, pipeline state, next cycle |
90
+ | `consistency-check.sh` | CLAUDE.md staleness + TODOS.md validation on Stop |
91
+ | PowerShell blocker (inline) | Rejects PowerShell tool calls, forces Bash |
92
+
93
+ Every hook sources a shared JSON parsing library (`_warp_json.sh`) — no `jq` dependency. Hooks degrade gracefully: warn visibly, never crash, never fail silently.
94
+
95
+ **Project-fitted leashing.** Warp doesn't ship a generic tool list. It detects *your* stack and builds a custom gate for *your* project. A Python project gets `ruff`, `mypy`, `pytest`. A TypeScript project gets `eslint`, `tsc`, `vitest`. Tools are added as your codebase grows. The leash is fitted to your project, out of the box.
96
+
97
+ **Test design separation.** The skill that writes test specs (testdesign) never sees the implementation. The skill that implements code (build-code) works from those specs. A fresh-context subagent verifies test independence. The exam writer and the answer checker never see each other's work. This prevents the most common AI testing failure: tests that pass because they were written to match the code, not the spec.
98
+
99
+ **Live library docs via MCP + API doc registry.** Context7 provides version-specific documentation for 9,000+ libraries during builds. Figma MCP bridges design-to-code with 16 tools (read design specs, write frames, extract tokens, visual diff). Every dependency is registered in `warp-tools.json` with a doc source. Build-code checks the registry during setup and queries docs inline — no hook needed.
100
+
101
+ **Chrome as collaboration surface.** During UI builds, the Chrome extension gives both user and Claude a shared live view of the app. Either party can flag issues. Console access, visual verification, and GIF capture are built into the build cycle. Adversarial QA uses headless `/browse` instead — different tool, different perspective.
102
+
103
+ ## Skills
104
+
105
+ 16 skills organized into five groups. Every skill works standalone or as part of the pipeline.
106
+
107
+ | Group | Skills | What they do |
108
+ |-------|--------|-------------|
109
+ | **Plan** | `brainstorm` `onboarding` `scope` `architect` `design` `testdesign` `security` `optimize` | Figure out what to build — from idea to optimization strategy |
110
+ | **Build** | `build-code` | Write code with TDD + trustless verification gates |
111
+ | **QA** | `qa-test` `qa-debug` | Test and debug |
112
+ | **Release** | `release-update` | Ship and reflect — diff review, docs, push, deploy, retro |
113
+ | **Root** | `setup` `upgrade` `orchestrate` `save` `browse` | Pipeline brain, sessions, and utilities |
114
+
115
+ ### Plan Skills
116
+
117
+ Plan skills figure out *what* to build before a single line of code is written. They work like a product team in a box — brainstorming, scoping, designing architecture, creating a visual system, writing test specs, running a security audit, and optimizing the strategy. Each produces a structured artifact that the next reads. Sequential: architect → design → security → testdesign → optimize.
118
+
119
+ | Skill | What it does |
120
+ |-------|-------------|
121
+ | `warp-plan-brainstorm` | Problem discovery, user needs, constraints (new projects) |
122
+ | `warp-plan-onboarding` | Deep codebase analysis + layer mapping (existing projects) |
123
+ | `warp-plan-scope` | Defines scope — what to build, what to cut, 5-star product thinking |
124
+ | `warp-plan-architect` | Component boundaries, data flow, API contracts, failure modes |
125
+ | `warp-plan-design` | Visual system, UX strategy, screen specs, developer handoff |
126
+ | `warp-plan-testdesign` | Test-first specification, acceptance criteria, edge case enumeration |
127
+ | `warp-plan-security` | Secrets audit, dependency supply chain, OWASP, STRIDE threat modeling |
128
+ | `warp-plan-optimize` | Three modes: optimize (performance + workflow), delight (UX moments), or full (both) |
129
+
130
+ ### Build Skills
131
+
132
+ Build-code runs direct — the user collaborates in real-time on every TDD cycle (red-green-refactor). After every green cycle, the L1 gate runs all deterministic tools. Context7 MCP provides live library docs. Chrome extension provides live visual collaboration for UI work. Figma MCP provides design specs and token extraction. For multi-cycle work, re-invoke `/build` or use `/loop`.
133
+
134
+ | Skill | What it does |
135
+ |-------|-------------|
136
+ | `warp-build-code` | One TDD cycle + L1 gate + Context7 docs + Chrome + Figma |
137
+
138
+ ### QA Skills
139
+
140
+ QA runs dual-mode: a direct collaborative pass + an adversarial dispatch with clean context. Findings are auto-diffed — blind spots (found only by the adversarial pass) are the highest-value output.
141
+
142
+ | Skill | What it does |
143
+ |-------|-------------|
144
+ | `warp-qa-test` | Three-tier testing with health scoring |
145
+ | `warp-qa-debug` | Root cause investigation with parallel hypothesis testing via /fork |
146
+
147
+ ### Release Skills
148
+
149
+ | Skill | What it does |
150
+ |-------|-------------|
151
+ | `warp-release-update` | Ship + reflect. Diff review, docs update, push gate, version bump, deploy, canary, retro. All pushes go through this skill. |
152
+
153
+ ### Root Skills
154
+
155
+ | Skill | What it does |
156
+ |-------|-------------|
157
+ | `warp-setup` | One-command project setup: detection, hooks, MCP, agent activation, auto-onboarding |
158
+ | `warp-upgrade` | One-command upgrade from inside Claude Code: pull, build, install, changelog, migration |
159
+ | `warp-orchestrator` | Pipeline brain — auto-activates, runs skills direct, orchestrates dual-mode QA, presents gates |
160
+ | `warp-browse` | Headless browser for screenshots and testing |
161
+
162
+ ## The Pipeline
163
+
164
+ Skills chain together. Each reads what the previous wrote:
165
+
166
+ ```mermaid
167
+ flowchart TD
168
+ subgraph PLAN["**PLAN** — Figure out what to build"]
169
+ direction TB
170
+ BS["brainstorm\n*new projects*"]
171
+ OB["onboarding\n*existing (auto)*"]
172
+ SC["scope"]
173
+ AR["architect"]
174
+ DE["design"]
175
+ BP["testdesign"]
176
+ SE["security"]
177
+ OP["optimize"]
178
+
179
+ BS --> SC
180
+ OB --> SC
181
+ SC --> AR
182
+ AR --> DE
183
+ DE --> SE
184
+ SE --> BP
185
+ BP --> OP
186
+ end
187
+
188
+ subgraph BUILD["**BUILD + QA** — Direct collaboration + adversarial QA"]
189
+ direction TB
190
+ subgraph CYCLE["Each Cycle (direct)"]
191
+ BC["build-code\n*TDD + Chrome + Figma*"]
192
+ QQ["L1 gate\n*linters, type-check, gitleaks*"]
193
+ BC --> QQ
194
+ end
195
+ subgraph DUALQA["Dual-Mode QA"]
196
+ QD["direct pass\n*collaborative, Chrome*"]
197
+ QA["adversarial pass\n*clean context, /browse*"]
198
+ QC["comparison\n*blind spots + confirmed*"]
199
+ QD --> QC
200
+ QA --> QC
201
+ end
202
+
203
+ CYCLE -.-> DUALQA
204
+ end
205
+
206
+ subgraph RELEASE["**RELEASE** — Ship and reflect"]
207
+ LA["update\n*ship mode + retro mode*"]
208
+ end
209
+
210
+ OP --> BUILD
211
+ PHASE --> LA
212
+
213
+ subgraph AUTO["**BUILD (DIRECT)**"]
214
+ direction LR
215
+ CO["/build-code\n*TDD + Chrome + Figma*"]
216
+ end
217
+
218
+ subgraph ANYTIME["**ANYTIME**"]
219
+ direction LR
220
+ DI["debug\n*root cause + /fork*"]
221
+ UT["setup · save · browse\nupgrade"]
222
+ end
223
+
224
+ style PLAN fill:#1a1a2e,stroke:#4a4a6a,color:#e0e0e0
225
+ style BUILD fill:#16213e,stroke:#4a4a6a,color:#e0e0e0
226
+ style RELEASE fill:#0f3460,stroke:#4a4a6a,color:#e0e0e0
227
+ style AUTO fill:#1a1a2e,stroke:#4a4a6a,color:#e0e0e0
228
+ style ANYTIME fill:#1a1a2e,stroke:#4a4a6a,color:#e0e0e0
229
+ style CYCLE fill:#1b2838,stroke:#3a5a7a,color:#e0e0e0
230
+ style SUBPHASE fill:#1b2838,stroke:#3a5a7a,color:#e0e0e0
231
+ style PHASE fill:#1b2838,stroke:#3a5a7a,color:#e0e0e0
232
+ style QF fill:#2d4a22,stroke:#4a7a3a,color:#e0e0e0
233
+ ```
234
+
235
+ ## Settings
236
+
237
+ Warp stores preferences at `~/.warp/settings.json`. All features default to ON. Disable any individually:
238
+
239
+ ```bash
240
+ mkdir -p ~/.warp
241
+ echo '{"welcome_banner": false}' > ~/.warp/settings.json
242
+ ```
243
+
244
+ Available settings: `welcome_banner`, `completion_banner`, `first_run_intro`, `decision_tally`, `pipeline_progress`, `model_warnings`, `artifact_preview`, `help_on_request`, `quicksave`, `milestone_celebrations`, `pseudocode_mode`
245
+
246
+ ## Architecture
247
+
248
+ ```
249
+ WarpOS/
250
+ ├── shared/ ← Shared foundation (edit here)
251
+ │ ├── tier1-engineering-constitution.md ← Tier 1: Engineering foundation (all agents, LOCKED)
252
+ │ ├── tier2-verification.md ← Tier 2: Adversarial verification (4 adversarial agents only)
253
+ │ ├── artifact-schemas.md ← Pipeline artifact schemas
254
+ │ └── glossary-template.md ← Project glossary template
255
+ ├── src/ ← Skill sources (edit here)
256
+ │ ├── pipeline/ ← 9 chained skills (plan + build + QA + release)
257
+ │ ├── standalone/ ← 1 skill (security)
258
+ │ ├── meta/ ← Setup, save, orchestrate, upgrade
259
+ │ ├── utils/ ← Browse
260
+ │ └── adversarial/ ← 3 adversarial agent source files
261
+ ├── agents/ ← Generated (T1 + source; adversarial: T2 + source)
262
+ ├── bin/ ← verify.sh + warp-upgrade.sh + hooks/
263
+ │ └── hooks/ ← 3 hook scripts + 2 shared utilities
264
+ ├── dist/ ← Generated compiled skills (DO NOT EDIT)
265
+ ├── warp-*/ ← Generated (repo root, for global install)
266
+ ├── build.sh ← Dual output: skills (T1 + source) + agents (T1 + source; adversarial: T2 + source)
267
+ ├── install.sh ← Skills + agents + hooks + Tier 1 + global config
268
+ ├── VERSION ← Semantic version
269
+ ├── CHANGELOG.md ← Release history
270
+ └── DESIGN.md ← Terminal output design system
271
+ ```
272
+
273
+ **Two-tier authority model:**
274
+ - **Tier 1** — Engineering foundation. Loads for every skill and agent. Immutable. Core principles, bias classification, quality gates, UX patterns.
275
+ - **Tier 2** — Adversarial verification. Loads for the 4 adversarial agent definitions only. Bias model, verification hierarchy, severity tags, findings format.
276
+
277
+ Skill-specific content (role, phases, cognitive patterns, calibration examples) lives in `src/` source files, not in tiers.
278
+
279
+ Edit in `shared/` and `src/`. Everything in `dist/`, `agents/`, and root-level `warp-*/` is generated by `build.sh`.
280
+
281
+ ## Contributing
282
+
283
+ ```bash
284
+ git clone https://github.com/WolfOnWings/WarpOS.git
285
+ cd WarpOS
286
+
287
+ # Edit source files in shared/ or src/
288
+ # Build, verify, and check structural integrity
289
+ ./build.sh
290
+ ./build.sh --verify-only
291
+ ./bin/verify.sh
292
+ ```
293
+
294
+ *A note on tokens: Warp skills are deep — thousands of lines of cognitive patterns, quality gates, and calibration examples. This is the tradeoff: shallow instructions are cheaper, but what you give is what you get.*
295
+
296
+ ## Built On
297
+
298
+ Warp synthesizes ideas, patterns, and techniques from these projects:
299
+
300
+ **Major** — [gstack](https://garryslist.org) · [Superpowers](https://github.com/obra/superpowers) · [everything-claude-code](https://github.com/affaan-m/everything-claude-code)
301
+
302
+ **Significant** — [Trail of Bits](https://github.com/trailofbits) (security audit methodology)
303
+
304
+ **Supporting** — [Anthropic Cookbook](https://github.com/anthropics/anthropic-cookbook) · [Context7 MCP](https://github.com/upstash/context7)
305
+
306
+ ## License
307
+
308
+ [MIT](LICENSE) — WolfOnWings, 2026
package/VERSION ADDED
@@ -0,0 +1 @@
1
+ 1.1.0