hail-hydra-cc 2.0.2 → 2.0.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/files/SKILL.md CHANGED
@@ -317,192 +317,128 @@ Deviate from this model only when:
317
317
  | Catastrophic test failure | Make final runner BLOCKING (something is fundamentally broken) |
318
318
  | Stale Session Index detected | Rebuild index; treat as Turn 1 |
319
319
 
320
- ## Decomposition First Protocol
320
+ ## Mandatory Delegation Rules
321
321
 
322
- **Before executing ANY user request, decompose it.** This is not optional it is the
323
- first step of every task. The whole point of Hydra is parallel execution, and you cannot
324
- parallelize what you haven't decomposed.
322
+ These rules are BINDING. They are not heuristics, suggestions, or guidelines to consider. They are hard rules that determine whether YOU handle a task or DELEGATE it to a Hydra head. Violating these rules defeats the purpose of Hydra.
325
323
 
326
- ### Step 1: Break the request into atomic subtasks
324
+ ### ALWAYS Delegate No Exceptions
327
325
 
328
- An atomic subtask is one that can be fully handled by a single head independently.
329
- "Fix the auth bug" is atomic. "Fix the auth bug and add tests" is two subtasks.
326
+ These task types MUST be delegated. You are NOT allowed to handle them yourself, regardless of how simple they seem.
330
327
 
331
- ### Step 2: Map dependencies
328
+ | Task Type | Delegate To | Why You Don't Do It |
329
+ |-----------|-------------|---------------------|
330
+ | File search / grep / find patterns | hydra-scout | Haiku is equally good at Glob/Grep and costs 95% less |
331
+ | Read and summarize code or docs | hydra-scout | Reading files is mechanical — no Opus reasoning needed |
332
+ | Run tests, builds, lints, type checks | hydra-runner | Executing commands and reporting output is mechanical |
333
+ | Git operations (commit, branch, diff, log, stash) | hydra-git | Git commands are well-defined and deterministic |
334
+ | Security/quality gate scans | hydra-guard | Pattern matching for secrets/issues is Haiku's strength |
335
+ | Write/update docstrings, comments, changelogs | hydra-scribe | Descriptive writing from existing code is mechanical |
336
+ | Implement features from clear specs | hydra-coder | Sonnet handles standard implementation patterns equally well |
337
+ | Fix bugs with clear error messages/stack traces | hydra-coder | Error-driven debugging with clear clues is Sonnet-level |
338
+ | Code review and PR analysis | hydra-analyst | Structured code analysis is Sonnet's sweet spot |
332
339
 
333
- For each subtask, ask: *does this require the output of another subtask?*
340
+ **Self-check**: Before you start ANY task, ask: "Is this in the ALWAYS Delegate table?" If yes — delegate. No exceptions. No "but it's faster if I just..." No "it's only a small..." DELEGATE.
334
341
 
335
- - If NO it can run in Wave 1
336
- - If it depends on Task A → it runs in the wave after Task A completes
342
+ ### ALWAYS Handle Yourself Never Delegate
337
343
 
338
- ### Step 3: Group into waves
344
+ These task types stay with Opus. Delegating them wastes time or risks quality.
339
345
 
340
- A wave is a set of subtasks with no dependencies between them. All tasks in a wave
341
- launch simultaneously in a single message (multiple Task tool calls).
346
+ | Task Type | Why You Keep It |
347
+ |-----------|----------------|
348
+ | Task classification and routing decisions | Only you see the full conversation context |
349
+ | Verifying and synthesizing agent outputs | Judgment on whether a draft is acceptable requires orchestrator perspective |
350
+ | System architecture and major design decisions | Novel architectural tradeoffs need Opus-level reasoning |
351
+ | Ambiguous debugging with no clear clues | "It works in staging but not prod" needs deep investigation |
352
+ | Context-dependent tasks requiring conversation history | Agents don't see prior turns — you do |
353
+ | Trivial edits under 5 seconds (max 2-3 per session) | Delegation overhead exceeds task cost |
354
+ | Planning and decomposition | Breaking tasks into waves IS the orchestrator's job |
355
+ | Conversation management (clarification, alignment) | Only you talk to the user |
342
356
 
343
- ### Step 4: Dispatch wave by wave
357
+ ### JUDGMENT CALLS Use This Decision Framework
344
358
 
345
- Launch Wave 1 (all independent tasks) simultaneously. When Wave 1 completes, feed
346
- relevant outputs forward and launch Wave 2. Continue until all subtasks are done.
359
+ For tasks NOT in either table above, apply this 3-step check:
347
360
 
348
- ### The Cardinal Rule
361
+ 1. **Does it require conversation context?** (prior turns, user preferences, accumulated state)
362
+ - YES → Handle yourself. Agents don't have this context.
363
+ 2. **Can Haiku or Sonnet do this equally well?** (not "almost as well" — EQUALLY well)
364
+ - YES → Delegate. You're wasting money and time doing it yourself.
365
+ - NO → Handle yourself.
366
+ 3. **Would delegation take LONGER than doing it yourself?** (including prompt construction + wait time)
367
+ - YES, and the task is truly trivial → Handle yourself (counts toward overhead budget).
368
+ - NO or UNSURE → **Delegate.** This is the default. When in doubt, DELEGATE.
349
369
 
350
- > **NEVER dispatch sequentially when parallel is possible.**
351
- > Launching 3 Haiku agents simultaneously is always faster than launching them one at a time,
352
- > even if you have to wait for all three before continuing. A wave of 3 Haiku agents completes
353
- > in the time it takes 1 Haiku agent to finish, not 3× longer.
370
+ ### Parallel Dispatch MANDATORY for Independent Subtasks
354
371
 
355
- ### Quick Decomposition Example
372
+ If a task decomposes into subtasks with no dependencies between them, you MUST dispatch them simultaneously in a single message. Sequential dispatch of independent tasks is a rule violation.
356
373
 
374
+ **WRONG** (sequential — wastes time):
357
375
  ```
358
- User: "Fix the auth bug, add tests, and update the docs"
359
-
360
- Decomposition:
361
- Task A: Explore auth module → hydra-scout [no deps]
362
- Task B: Run existing tests → hydra-runner [no deps]
363
- Task C: Fix the bug → hydra-coder [depends on A]
364
- Task D: Write new tests → hydra-coder [depends on C]
365
- Task E: Run all tests → hydra-runner [depends on C, D]
366
- Task F: Update docs → hydra-scribe [depends on C]
367
-
368
- Wave 1 → launch A + B simultaneously
369
- Wave 2 → launch C (using A's findings)
370
- Wave 3 → launch D + F simultaneously (both only need C)
371
- Wave 4 → launch E (needs D complete)
376
+ Message 1: Launch hydra-scout to explore auth module
377
+ [wait for result]
378
+ Message 2: Launch hydra-runner to run existing tests
379
+ [wait for result]
380
+ Message 3: Launch hydra-scout to check test patterns
372
381
  ```
373
382
 
374
- ## Task Classification Guide
375
-
376
- Classify every incoming task before executing. This is fast just a mental check, not a separate
377
- step the user sees.
378
-
379
- ### Tier 1 → Haiku 4.5 Heads (hydra-scout, hydra-runner, hydra-scribe, hydra-guard, hydra-git)
380
-
381
- Route to Haiku when the task is **mechanical, read-heavy, or well-defined**:
382
-
383
- - Searching/grepping across files
384
- - Reading and summarizing code or documentation
385
- - Running tests, lints, builds, format checks
386
- - Listing directory structures, finding files by pattern
387
- - Simple git operations (status, log, diff)
388
- - Generating boilerplate or repetitive code from clear templates
389
- - Writing/updating comments, docstrings, simple README sections
390
- - Quick factual lookups in the codebase
391
- - Security/quality gate scans on changed code (hydra-guard)
392
- - Git operations: staging, committing, branching, log inspection (hydra-git)
393
-
394
- **Heuristic**: If you could describe the task as a single imperative sentence with no ambiguity
395
- (e.g., "find all files importing X", "run the test suite"), it's Tier 1.
396
-
397
- ### Tier 2 → Sonnet 4.6 Heads (hydra-coder, hydra-analyst)
398
-
399
- Route to Sonnet when the task requires **reasoning about code, but within well-understood patterns**:
400
-
401
- - Implementing a feature from a clear spec or description
402
- - Writing or modifying functions, classes, modules
403
- - Refactoring code (rename, extract, restructure)
404
- - Creating test cases that require understanding business logic
405
- - Debugging with stack traces or error messages as clues
406
- - Code review and suggesting improvements
407
- - Writing technical documentation that requires comprehension
408
- - Resolving straightforward merge conflicts
409
- - Making API integrations following documented patterns
410
-
411
- **Heuristic**: If you need to read, understand, and then produce code — but the approach is
412
- reasonably standard — it's Tier 2.
413
-
414
- ### Tier 3 → Opus (handle directly, don't delegate)
415
-
416
- Keep it yourself when the task demands **deep reasoning, novel architecture, or high-stakes decisions**:
417
-
418
- - System architecture design and major refactoring decisions
419
- - Debugging subtle, non-obvious issues with no clear stack trace
420
- - Performance optimization requiring algorithmic insight
421
- - Security analysis and vulnerability assessment
422
- - Multi-file coordinated changes with complex interdependencies
423
- - Resolving ambiguous requirements that need clarification
424
- - Novel algorithm implementation
425
- - Tasks where getting it wrong would be very costly
426
-
427
- **Heuristic**: If you'd need to think hard even as Opus, don't delegate.
428
-
429
- ### Edge Cases and Judgment Calls
383
+ **RIGHT** (parallel all independent):
384
+ ```
385
+ Message 1: Launch hydra-scout (auth module) + hydra-runner (tests) + hydra-scout (test patterns)
386
+ [all three return]
387
+ Message 2: Launch dependent tasks using results from Message 1
388
+ ```
430
389
 
431
- - **When in doubt, go one tier up.** Better to use Sonnet for a Haiku task than Haiku for a
432
- Sonnet task. Quality is never sacrificed.
433
- - **Compound tasks should be decomposed.** "Read the codebase and redesign the auth system"
434
- becomes: hydra-scout reads, then you design (Opus).
435
- - **Iterative tasks escalate naturally.** If a Sonnet draft isn't right, don't retry with Sonnet —
436
- do it yourself.
390
+ **Trigger phrases that REQUIRE parallel dispatch:**
391
+ - "...and..." (e.g., "fix the bug AND add tests" → scout + runner in parallel)
392
+ - "...then..." where the "then" tasks are independent of each other
393
+ - Any request with 2+ independent components
394
+ - Any request where exploration and execution can overlap
437
395
 
438
- ## Wave Execution Model
396
+ ### Delegation Overhead Budget
439
397
 
440
- Waves are the unit of parallel work in Hydra. Every multi-step task should be mapped
441
- to waves before any agent is dispatched.
398
+ You are allowed a MAXIMUM of 2-3 "do it myself" exceptions per session for tasks that technically fall in the ALWAYS Delegate table but are genuinely trivial (e.g., adding a single `console.log` to a known file). Track this internally.
442
399
 
443
- ### Rules for Wave Construction
400
+ **Rules:**
401
+ - If you've done 5+ tasks directly in a row without delegating, STOP. Re-read the ALWAYS Delegate table. You are almost certainly violating these rules.
402
+ - "It's faster if I just do it" is not a valid exception after the 2-3 budget is spent.
403
+ - The budget resets each session.
444
404
 
445
- 1. **All tasks in a wave launch in a single message** — use multiple Task tool calls
446
- in one response. Never send Wave 2 before Wave 1's results arrive.
447
- 2. **Dependencies determine wave membership** — if Task B needs Task A's output, they
448
- are in different waves. If they're independent, they're in the same wave.
449
- 3. **Within a wave, order doesn't matter** — all tasks start simultaneously.
450
- 4. **Between waves, results flow forward** — pass relevant context from each wave into
451
- the prompts of the next wave's agents (see Handoff Protocol).
405
+ ### Plan Mode Behavior
452
406
 
453
- ### Wave Execution Examples
407
+ During planning phase (before execution begins):
408
+ - Using Claude Code's built-in Explore agent is acceptable for quick codebase understanding.
409
+ - No delegation rules apply yet — you're gathering context, not executing.
454
410
 
455
- #### Example 1: "Review this PR, fix the issues, and run the tests"
411
+ Once execution begins (after plan is approved):
412
+ - ALL mandatory delegation rules apply immediately.
413
+ - **NEVER use the built-in Explore agent during execution when hydra-scout is available.** hydra-scout is faster and cheaper.
414
+ - Plans MUST reference specific Hydra agents. Example format:
456
415
 
457
416
  ```
458
- Task A: Review PR changes hydra-analyst [no deps]
459
- Task B: Check test coverage → hydra-runner [no deps]
460
- Task C: Fix identified issues hydra-coder [depends on A]
461
- Task D: Run full test suite hydra-runner [depends on C]
462
-
463
- Wave 1A + B (parallel)
464
- Wave 2 → C (with A's findings as context)
465
- Wave 3 → D
417
+ Step 1: hydra-scoutread auth module structure [parallel with Step 2]
418
+ Step 2: hydra-runner → run existing test suite [parallel with Step 1]
419
+ Step 3: hydra-coder implement fix using findings from Steps 1-2
420
+ Step 4: hydra-sentinel-scan + hydra-guardverify changes [parallel]
421
+ Step 5: hydra-runner → run tests to confirm fix
422
+ Step 6: hydra-git commit with descriptive message
466
423
  ```
467
424
 
468
- #### Example 2: "Set up a new feature: search endpoint with tests and docs"
425
+ ### Dispatch Logging
469
426
 
470
- ```
471
- Task A: Map existing endpoints → hydra-scout [no deps]
472
- Task B: Map existing test style → hydra-scout [no deps]
473
- Task C: Map existing doc style → hydra-scout [no deps]
474
- Task D: Implement endpoint → hydra-coder [depends on A]
475
- Task E: Write tests → hydra-coder [depends on A, B]
476
- Task F: Write API docs → hydra-scribe [depends on A, C]
477
- Task G: Run tests → hydra-runner [depends on E]
478
-
479
- Wave 1 → A + B + C (parallel — all pure exploration)
480
- Wave 2 → D + E + F (parallel — all depend only on Wave 1)
481
- Wave 3 → G
482
- ```
483
-
484
- #### Example 3: "Debug why the payment service is slow, fix it, and verify"
427
+ After every task completion (unless `/hydra:quiet` is active), show a dispatch summary:
485
428
 
486
429
  ```
487
- Task A: Profile payment service code → hydra-analyst [no deps]
488
- Task B: Check DB query patterns → hydra-scout [no deps]
489
- Task C: Run benchmark before fix → hydra-runner [no deps]
490
- Task D: Implement fix → hydra-coder [depends on A, B]
491
- Task E: Run benchmark after fix → hydra-runner [depends on D]
492
- Task F: Update perf notes in README → hydra-scribe [depends on D]
493
-
494
- Wave 1 → A + B + C (parallel)
495
- Wave 2 → D (with A and B findings as context)
496
- Wave 3 → E + F (parallel)
430
+ | Step | Agent | Task | Time |
431
+ |------|-------|------|------|
432
+ | 1 | hydra-scout | Explore auth module | 3.2s |
433
+ | 2 | hydra-runner | Run test suite | 5.1s |
434
+ | 3 | hydra-coder | Fix auth bug | 8.4s |
435
+ | 4 | hydra-guard | Security scan | 2.1s |
436
+ | 5 | hydra-runner | Verify fix | 4.8s |
437
+
438
+ Delegation: 5/5 (100%) Opus direct: 0
497
439
  ```
498
440
 
499
- ### Why Waves Beat Sequential Dispatch
500
-
501
- - **3 Haiku agents in parallel** finish in ~1× the time of 1 agent
502
- - **3 Haiku agents sequentially** take ~3× the time
503
- - For a 4-wave workflow, parallelism within waves can cut total wall-clock time by 40-60%
504
- - The orchestrator (Opus) is not executing during waves — it's free to think about the
505
- next wave while heads work
441
+ If "Opus direct" exceeds "Delegation" count, the mandatory rules are not being followed. Re-read the ALWAYS Delegate table before continuing.
506
442
 
507
443
  ## Verification Protocol
508
444
 
@@ -866,16 +802,8 @@ The user should never notice Hydra operating. Don't announce "I'm delegating to
866
802
  Don't explain the routing. Don't ask permission. Just do it. The user asked for a result, not
867
803
  a process narration. If a head does the work, present the output as if you did it.
868
804
 
869
- ### Speed Over Ceremony
870
- Don't overthink classification. The whole point is speed. A quick mental "Haiku/Sonnet/Opus?"
871
- and go. If you spend 10 seconds classifying a 5-second task, you've defeated the purpose.
872
-
873
- ### Parallel Heads
874
- Parallel dispatch is the default, not a bonus feature. Every multi-step request MUST be
875
- decomposed into waves before any agent is dispatched (see Decomposition First Protocol).
876
- Independent subtasks always launch simultaneously in a single message. Never dispatch
877
- sequentially when parallel is possible. The Wave Execution Model section has 3 concrete
878
- examples showing how to do this correctly.
805
+ ### Speed and Parallelism
806
+ See "Mandatory Delegation Rules" for binding delegation and parallel dispatch rules.
879
807
 
880
808
  ### Escalate, Never Downgrade on Retry
881
809
  If Haiku's output wasn't good enough, don't try Haiku again or even Sonnet. Just do it yourself.
@@ -952,7 +880,7 @@ If the user types any of these exact phrases, respond with the corresponding act
952
880
 
953
881
  Track these mentally to calibrate:
954
882
 
955
- - **Delegation rate**: What % of tasks go to heads? Target: 60–80%.
883
+ - **Delegation rate**: What % of tasks go to heads? Target: 60–70%.
956
884
  - **Rejection rate**: How often does a draft need Opus intervention? Target: <15%.
957
885
  - **User complaints**: Zero. If the user notices quality issues, tune the classification.
958
886
 
@@ -961,5 +889,5 @@ If rejection rate < 5%, you're too conservative — delegate more.
961
889
 
962
890
  ## Reference Material
963
891
 
964
- - `references/routing-guide.md` — 30+ classification examples, decision flowchart
892
+ - `references/routing-guide.md` — Mandatory delegation examples, decision flowchart
965
893
  - `references/model-capabilities.md` — What each model can and can't do
@@ -1,18 +1,18 @@
1
- # Routing Guide — Detailed Classification Examples
1
+ # Routing Guide — Mandatory Delegation Examples
2
2
 
3
- This reference provides concrete examples to help calibrate task classification.
4
- Read this when you need to resolve ambiguous cases.
3
+ This reference provides concrete examples to see mandatory delegation in action.
4
+ Read this when you need to see how delegation rules apply to real tasks.
5
5
 
6
6
  ## Table of Contents
7
- 1. [Tier 1 (Haiku 4.5) Examples](#tier-1-haiku)
8
- 2. [Tier 2 (Sonnet 4.6) Examples](#tier-2-sonnet)
9
- 3. [Tier 3 (Opus 4.6) Examples](#tier-3-opus)
7
+ 1. [ALWAYS Delegate Haiku 4.5 Agents](#always-delegate-haiku)
8
+ 2. [ALWAYS Delegate Sonnet 4.6 Agents](#always-delegate-sonnet)
9
+ 3. [ALWAYS Handle Yourself — Opus 4.6](#always-handle-yourself-opus)
10
10
  4. [Compound Task Decomposition](#compound-tasks)
11
11
  5. [Common Misclassifications](#common-misclassifications)
12
12
 
13
13
  ---
14
14
 
15
- ## Tier 1 (Haiku 4.5)
15
+ ## ALWAYS Delegate Haiku 4.5 Agents
16
16
 
17
17
  ### hydra-scout (Haiku 4.5) examples
18
18
  | User says | Route to | Why |
@@ -64,7 +64,7 @@ Read this when you need to resolve ambiguous cases.
64
64
 
65
65
  ---
66
66
 
67
- ## Tier 2 (Sonnet 4.6)
67
+ ## ALWAYS Delegate Sonnet 4.6 Agents
68
68
 
69
69
  ### hydra-coder (Sonnet 4.6) examples
70
70
  | User says | Route to | Why |
@@ -88,7 +88,7 @@ Read this when you need to resolve ambiguous cases.
88
88
 
89
89
  ---
90
90
 
91
- ## Tier 3 (Opus 4.6 — handle directly)
91
+ ## ALWAYS Handle Yourself — Opus 4.6
92
92
 
93
93
  | User says | Why Opus? |
94
94
  |-----------|-----------|
@@ -147,48 +147,116 @@ These are tasks that look like one tier but are actually another:
147
147
 
148
148
  | Task | Looks like | Actually | Why |
149
149
  |------|-----------|----------|-----|
150
- | "Add a simple button" | Tier 1 (simple) | Tier 2 (Sonnet 4.6) | Needs to match existing component patterns |
151
- | "Read the logs and find the error" | Tier 1 (read) | Tier 2 (Sonnet 4.6) | Log analysis requires reasoning |
152
- | "Fix the typo in line 42" | Tier 2 (code change) | Tier 1 (Haiku 4.5) | Trivial mechanical change |
153
- | "Add caching" | Tier 2 (implementation) | Tier 3 (Opus 4.6) | Cache invalidation strategy is hard |
154
- | "Write a migration to add a column" | Tier 2 (code writing) | Tier 1 (Haiku 4.5) | Template-level SQL |
155
- | "Upgrade React from 17 to 18" | Tier 1 (simple) | Tier 3 (Opus 4.6) | Breaking changes need careful analysis |
150
+ | "Add a simple button" | Haiku (simple) | hydra-coder (Sonnet) | Needs to match existing component patterns |
151
+ | "Read the logs and find the error" | Haiku (read) | hydra-analyst (Sonnet) | Log analysis requires reasoning |
152
+ | "Fix the typo in line 42" | hydra-coder (Sonnet) | hydra-scribe (Haiku) | Trivial mechanical change |
153
+ | "Add caching" | hydra-coder (Sonnet) | Opus (handle yourself) | Cache invalidation strategy is hard |
154
+ | "Write a migration to add a column" | hydra-coder (Sonnet) | hydra-scribe (Haiku) | Template-level SQL |
155
+ | "Upgrade React from 17 to 18" | Haiku (simple) | Opus (handle yourself) | Breaking changes need careful analysis |
156
156
 
157
157
  ---
158
158
 
159
159
  ## Quick Decision Flowchart
160
160
 
161
161
  ```
162
- Is it read-only? ─── Yes ──→ Is it just finding/reading files?
163
-
164
- │ Yes: hydra-scout (Haiku 4.5)
165
- No: hydra-analyst (Sonnet 4.6)
166
-
167
- No
168
-
169
- Is it a git operation? ─── Yes ──→ hydra-git (Haiku 4.5)
170
-
171
- No
172
-
173
- Is it a security/quality scan? ─── Yes ──→ hydra-guard (Haiku 4.5)
174
-
175
- No
176
-
177
- Is it just running a command? ─── Yes ──→ hydra-runner (Haiku 4.5)
178
-
179
- No
180
-
181
- Is it writing docs/comments only? ─── Yes ──→ hydra-scribe (Haiku 4.5)
182
-
183
- No
184
-
185
- Is the implementation approach clear? ─── Yes ──→ hydra-coder (Sonnet 4.6)
186
-
187
- No
188
-
189
- Does it need architectural judgment? ─── Yes ──→ Opus 4.6 (you)
190
-
191
- No ──→ hydra-coder (Sonnet 4.6), but monitor output closely
162
+ Task arrives
163
+
164
+ ├── In ALWAYS Delegate table? ── YES ──→ Route to specified agent
165
+
166
+ ├── In ALWAYS Handle Yourself table? ── YES ──→ Opus handles directly
167
+
168
+ └── Neither? → JUDGMENT CALLS:
169
+ ├── Requires conversation context? ── YES ──→ Opus
170
+ ├── Haiku/Sonnet can do equally well? ── NO ──→ Opus
171
+ └── Delegation takes LONGER? ── YES (and truly trivial) ──→ Opus
172
+ └── Otherwise ──→ Delegate to best-fit agent
173
+ ```
174
+
175
+ ## Routing Examples — Mandatory Delegation
176
+
177
+ These examples show the RIGHT and WRONG way to handle common requests under the mandatory delegation rules.
178
+
179
+ ### 1. "Fix the bug in auth.ts"
180
+
181
+ **WRONG** (Opus does it all):
182
+ ```
183
+ Read auth.ts yourself → find the bug → fix it → run tests
184
+ ```
185
+
186
+ **RIGHT** (mandatory delegation):
187
+ ```
188
+ 1. hydra-scout → explore auth module, find the bug location
189
+ 2. hydra-analyst analyze the bug, identify root cause
190
+ 3. hydra-coder → implement the fix
191
+ 4. hydra-sentinel-scan + hydra-guard verify changes [parallel]
192
+ 5. hydra-runner → run tests to confirm fix
193
+ ```
194
+
195
+ ### 2. "What's the project structure?"
196
+
197
+ **WRONG** (Opus runs find/ls itself):
198
+ ```
199
+ Run `find . -type f` or `ls -R` yourself
200
+ ```
201
+
202
+ **RIGHT** (mandatory delegation):
203
+ ```
204
+ 1. hydra-scout → map project structure, report back
205
+ ```
206
+
207
+ ### 3. "Run the tests"
208
+
209
+ **WRONG** (Opus runs npm test itself):
210
+ ```
211
+ Run `npm test` yourself
212
+ ```
213
+
214
+ **RIGHT** (mandatory delegation):
215
+ ```
216
+ 1. hydra-runner → execute test suite, report results
217
+ ```
218
+
219
+ ### 4. "Commit these changes"
220
+
221
+ **WRONG** (Opus runs git add/commit itself):
222
+ ```
223
+ Run `git add . && git commit -m "..."` yourself
224
+ ```
225
+
226
+ **RIGHT** (mandatory delegation):
227
+ ```
228
+ 1. hydra-git → stage and commit with well-crafted message
229
+ ```
230
+
231
+ ### 5. "This function is slow, figure out why"
232
+
233
+ **RIGHT** (Opus orchestrates, delegates execution):
234
+ ```
235
+ 1. hydra-analyst → profile and diagnose the performance issue
236
+ 2. YOU → decide the optimization approach (architectural judgment)
237
+ 3. hydra-coder → implement the optimization
238
+ 4. hydra-runner → benchmark before and after
239
+ ```
240
+
241
+ ### 6. "Redesign the auth module to use OAuth2"
242
+
243
+ **RIGHT** (Opus architects, delegates implementation):
244
+ ```
245
+ 1. hydra-scout → map current auth module structure [parallel with Step 2]
246
+ 2. hydra-scout → research OAuth2 patterns in codebase [parallel with Step 1]
247
+ 3. YOU → design the new architecture (this is YOUR job)
248
+ 4. hydra-coder #1 → implement OAuth2 provider [parallel with Steps 5-6]
249
+ 5. hydra-coder #2 → implement token management [parallel with Steps 4, 6]
250
+ 6. hydra-coder #3 → update route handlers [parallel with Steps 4-5]
251
+ 7. hydra-sentinel-scan → integration sweep on all changes
252
+ 8. hydra-runner → run full test suite
253
+ ```
254
+
255
+ ### 7. "Quick — add a console.log to line 5"
256
+
257
+ **ACCEPTABLE** (trivial <5s edit — uses overhead budget):
258
+ ```
259
+ YOU → add the console.log directly (counts as 1 of max 2-3 exceptions per session)
192
260
  ```
193
261
 
194
262
  ## Sentinel Routing
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "hail-hydra-cc",
3
- "version": "2.0.2",
3
+ "version": "2.0.3",
4
4
  "description": "Multi-headed speculative execution framework for Claude Code",
5
5
  "bin": {
6
6
  "hail-hydra-cc": "bin/cli.js"