vector-cadence-skills 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (36) hide show
  1. package/CHANGELOG.md +10 -0
  2. package/CONTRIBUTING.md +69 -0
  3. package/DOCUMENTATION.md +1081 -0
  4. package/README.md +146 -0
  5. package/examples/bug-debug-lifecycle.md +39 -0
  6. package/examples/feature-lifecycle.md +54 -0
  7. package/examples/harness-subagent-plan.md +39 -0
  8. package/examples/vertical-slices.md +43 -0
  9. package/package.json +39 -0
  10. package/references/architecture-overview.md +279 -0
  11. package/references/artifact-model.md +20 -0
  12. package/references/final-skill-quality-bar.md +33 -0
  13. package/references/minimal-mode.md +50 -0
  14. package/references/source-integration-map.md +159 -0
  15. package/references/subagent-policy.md +30 -0
  16. package/references/tiering.md +23 -0
  17. package/scripts/validate-skills.mjs +187 -0
  18. package/skills/vc-align/SKILL.md +124 -0
  19. package/skills/vc-architecture/SKILL.md +131 -0
  20. package/skills/vc-debug/SKILL.md +154 -0
  21. package/skills/vc-execute/SKILL.md +166 -0
  22. package/skills/vc-handoff/SKILL.md +127 -0
  23. package/skills/vc-harness-architect/SKILL.md +138 -0
  24. package/skills/vc-learn/SKILL.md +123 -0
  25. package/skills/vc-orient/SKILL.md +108 -0
  26. package/skills/vc-plan/SKILL.md +156 -0
  27. package/skills/vc-prototype/SKILL.md +103 -0
  28. package/skills/vc-review/SKILL.md +156 -0
  29. package/skills/vc-setup/SKILL.md +126 -0
  30. package/skills/vc-skill-author/SKILL.md +131 -0
  31. package/skills/vc-slice/SKILL.md +121 -0
  32. package/skills/vc-triage/SKILL.md +149 -0
  33. package/skills.json +98 -0
  34. package/templates/AGENTS.md +82 -0
  35. package/templates/CONTEXT.md +23 -0
  36. package/templates/vc-budget.yml +4 -0
@@ -0,0 +1,131 @@
1
+ ---
2
+ name: vc-architecture
3
+ description: Improve application code architecture by finding deep-module opportunities, better seams, safer refactor paths, and testability improvements. Use when working on module boundaries, refactor strategy, interface design, architecture friction, or making code more maintainable and agent-navigable. Use vc-harness-architect instead for Vector Cadence/pi harness, subagent, extension, provider, or tool architecture.
4
+ ---
5
+
6
+ # Vector Cadence Architecture
7
+
8
+ ## Purpose
9
+
10
+ Improve application code architecture. This skill finds friction and proposes deeper modules, clearer seams, safer interfaces, and refactor plans.
11
+
12
+ ## Operating model
13
+
14
+ Ground architecture recommendations in observed code friction, tests, domain language, and ADRs. Prefer changes that improve locality, leverage, and testability.
15
+
16
+ ## When to use
17
+
18
+ Use for:
19
+
20
+ - module boundary problems,
21
+ - refactor strategy,
22
+ - interface/API shape decisions inside an app,
23
+ - hard-to-test code,
24
+ - shallow pass-through modules,
25
+ - tangled responsibility or duplicated invariants.
26
+
27
+ ## When to skip
28
+
29
+ Skip when:
30
+
31
+ - the request is about Vector Cadence/pi harness, subagents, extensions, providers, code search, or Taste (`vc-harness-architect`),
32
+ - the user only needs a code map (`vc-orient`),
33
+ - the architecture direction is known and needs planning (`vc-plan`).
34
+
35
+ ## Vocabulary
36
+
37
+ Use these terms:
38
+
39
+ - **Module** — anything with an interface and implementation.
40
+ - **Interface** — everything callers must know: types, invariants, errors, ordering, config.
41
+ - **Implementation** — code hidden behind the interface.
42
+ - **Depth** — behavior/leverage hidden behind a smaller interface.
43
+ - **Seam** — where behavior can change without editing callers in place.
44
+ - **Adapter** — concrete implementation at a seam.
45
+ - **Locality** — related change/bugs/knowledge concentrated in one place.
46
+ - **Leverage** — many callers benefit from one deep implementation.
47
+
48
+ Deletion test: if deleting a module makes complexity vanish, it was likely pass-through; if deleting it spreads complexity across callers, it was earning its keep.
49
+
50
+ ## Workflow
51
+
52
+ ### 1. Explore with domain grounding
53
+
54
+ Read:
55
+
56
+ - `CONTEXT.md`,
57
+ - relevant ADRs,
58
+ - project instructions,
59
+ - target code and callers,
60
+ - tests around the area,
61
+ - prior solutions/knowledge if relevant.
62
+
63
+ ### 2. Find friction
64
+
65
+ Look for:
66
+
67
+ - shallow modules,
68
+ - concepts spread across many files,
69
+ - tests forced to mock internals,
70
+ - no public seam for real behavior,
71
+ - duplicated invariants,
72
+ - hard-to-reason side effects,
73
+ - leaky interfaces.
74
+
75
+ ### 3. Present candidates
76
+
77
+ For each candidate include:
78
+
79
+ - files/modules,
80
+ - observed friction,
81
+ - why the current interface is shallow or leaky,
82
+ - proposed deeper module or seam,
83
+ - locality/leverage benefit,
84
+ - testing improvement,
85
+ - ADR/domain conflicts,
86
+ - recommendation strength: Strong, Worth exploring, or Speculative.
87
+
88
+ Do not finalize interfaces until the user chooses a candidate.
89
+
90
+ ### 4. Design interface when needed
91
+
92
+ For a chosen candidate:
93
+
94
+ 1. Gather required behaviors and invariants.
95
+ 2. Create 2-3 meaningfully different interface shapes.
96
+ 3. Compare depth, misuse risk, testability, migration cost, and domain fit.
97
+ 4. Recommend one.
98
+ 5. Decide whether an ADR is warranted.
99
+
100
+ ### 5. Plan refactor path
101
+
102
+ If implementation work should proceed, produce or route to a plan with:
103
+
104
+ - current friction,
105
+ - desired seam/module,
106
+ - compatibility strategy,
107
+ - characterization tests,
108
+ - vertical migration slices,
109
+ - rollback/stop conditions,
110
+ - docs/ADR updates.
111
+
112
+ ## Output
113
+
114
+ ```markdown
115
+ ## Architecture Summary
116
+
117
+ **Top finding:** ...
118
+ **Recommended direction:** ...
119
+ **Why this improves locality/leverage/testability:** ...
120
+ **Risks:** ...
121
+ **Artifacts to write:** plan | ADR | CONTEXT update | issues
122
+ **Recommended next skill:** vc-plan | vc-slice | vc-execute
123
+ ```
124
+
125
+ ## Guardrails
126
+
127
+ - Do not recommend architecture from taste alone.
128
+ - Do not relitigate ADRs without real friction.
129
+ - Do not create seams only for hypothetical future adapters.
130
+ - Do not confuse product scope decisions with code architecture decisions.
131
+ - Do not use this skill for harness architecture; use `vc-harness-architect`.
@@ -0,0 +1,154 @@
1
+ ---
2
+ name: vc-debug
3
+ description: Diagnose and fix bugs with a feedback-loop-first process, environment sanity checks, causal-chain gates, test-first fixes, and structured handoff. Use when handling hard bugs, failing tests, regressions, stack traces, flaky behavior, performance problems, or repeated failed fixes.
4
+ ---
5
+
6
+ # Vector Cadence Debug
7
+
8
+ ## Purpose
9
+
10
+ Find the root cause before fixing. This skill prevents patching symptoms by requiring a reproduction loop, evidence-backed hypotheses, and a complete causal chain.
11
+
12
+ ## Operating model
13
+
14
+ Build a fast feedback loop, reproduce the reported symptom, verify environment sanity, test ranked hypotheses one at a time, then fix test-first only after the causal chain is clear.
15
+
16
+ ## When to use
17
+
18
+ Use for:
19
+
20
+ - failing tests,
21
+ - stack traces,
22
+ - production or user-reported bugs,
23
+ - regressions,
24
+ - flaky or non-deterministic behavior,
25
+ - performance regressions,
26
+ - “I tried fixes but it still fails”.
27
+
28
+ ## When to skip
29
+
30
+ Skip when:
31
+
32
+ - the task is a new feature (`vc-align` or `vc-plan`),
33
+ - root cause is already proven and only implementation remains (`vc-execute`),
34
+ - the user only wants codebase orientation (`vc-orient`).
35
+
36
+ ## Inputs
37
+
38
+ Use:
39
+
40
+ - issue or bug report,
41
+ - full comment thread if available,
42
+ - stack traces/logs,
43
+ - reproduction steps,
44
+ - environment details,
45
+ - prior failed attempts,
46
+ - linked PRs/deployments.
47
+
48
+ ## Workflow
49
+
50
+ ### 1. Build the feedback loop
51
+
52
+ Prefer:
53
+
54
+ 1. failing test at the real seam,
55
+ 2. focused command or CLI repro,
56
+ 3. HTTP/curl script,
57
+ 4. headless browser script,
58
+ 5. captured trace replay,
59
+ 6. throwaway harness,
60
+ 7. fuzz/property loop,
61
+ 8. bisection/differential loop,
62
+ 9. structured human-in-the-loop script.
63
+
64
+ The loop should be fast, specific, deterministic or high-reproduction-rate, and agent-runnable. If no loop is possible, stop and ask for missing access/artifacts/setup.
65
+
66
+ ### 2. Verify environment sanity
67
+
68
+ Check likely false leads:
69
+
70
+ - branch and uncommitted changes,
71
+ - dependencies,
72
+ - runtime version,
73
+ - env vars,
74
+ - stale build artifacts,
75
+ - local services,
76
+ - config differences.
77
+
78
+ ### 3. Reproduce the real symptom
79
+
80
+ Confirm the loop matches the user’s reported failure, not a nearby unrelated failure. Capture the exact signal.
81
+
82
+ ### 4. Trace the code path
83
+
84
+ Trace backward from symptom to where valid state becomes invalid. Use observed values, not assumptions. For performance, measure before changing code.
85
+
86
+ ### 5. Form hypotheses
87
+
88
+ Create 3-5 ranked hypotheses. Each includes:
89
+
90
+ - suspected cause and location,
91
+ - supporting observation,
92
+ - causal chain,
93
+ - prediction for uncertain links.
94
+
95
+ Review what has already been ruled out before forming new hypotheses.
96
+
97
+ ### 6. Probe one variable
98
+
99
+ Use debugger/REPL, targeted logs, temporary assertions, focused test variations, or bisect/diff probes. Tag temporary logs with a unique prefix and remove them before finishing.
100
+
101
+ Never “log everything and grep”.
102
+
103
+ ### 7. Pass the causal-chain gate
104
+
105
+ Do not fix until you can explain:
106
+
107
+ ```text
108
+ trigger → code path → invalid state starts → symptom appears
109
+ ```
110
+
111
+ If stuck after 2-3 hypotheses, diagnose why: wrong mental model, environment mismatch, design problem, contradictory evidence, or insufficient observability.
112
+
113
+ ### 8. Fix test-first
114
+
115
+ If fixing:
116
+
117
+ 1. Check workspace safety.
118
+ 2. Write or keep a failing regression test at the correct seam.
119
+ 3. Verify it fails for the right reason.
120
+ 4. Apply the minimal root-cause fix.
121
+ 5. Verify the test passes.
122
+ 6. Re-run the original loop.
123
+ 7. Run broader affected tests.
124
+ 8. Self-review the diff.
125
+
126
+ If no correct test seam exists, document that and recommend `vc-architecture` after the immediate fix/diagnosis.
127
+
128
+ ### 9. Cleanup and prevention
129
+
130
+ Remove debug logs, delete or mark throwaway harnesses, record the confirmed root cause, and offer `vc-learn` only for reusable lessons.
131
+
132
+ ## Output
133
+
134
+ ```markdown
135
+ ## Debug Summary
136
+
137
+ **Problem:** ...
138
+ **Reproduction:** ...
139
+ **Root cause:** ...
140
+ **Causal chain:** ...
141
+ **Fix:** ... or Diagnosis only
142
+ **Tests:** ...
143
+ **Prevention:** ...
144
+ **Confidence:** High | Medium | Low
145
+ **Recommended next skill:** vc-execute | vc-review | vc-architecture | vc-learn
146
+ ```
147
+
148
+ ## Guardrails
149
+
150
+ - Do not hypothesize without a feedback loop unless impossible, and then say so.
151
+ - Do not fix before the causal-chain gate.
152
+ - Do not batch unrelated changes.
153
+ - Do not retry variants of a failed hypothesis without invalidating it.
154
+ - Do not leave instrumentation behind.
@@ -0,0 +1,166 @@
1
+ ---
2
+ name: vc-execute
3
+ description: Execute a clear plan, issue, or agent brief with workspace safety, TDD, scoped subagent coordination, continuous verification, and tiered review gates. Use when implementing approved work, running an AFK loop, or turning a plan/issue into tested code.
4
+ ---
5
+
6
+ # Vector Cadence Execute
7
+
8
+ ## Purpose
9
+
10
+ Implement approved work safely. This skill assumes the target is clear enough to code; if not, route back to alignment, planning, triage, or debugging.
11
+
12
+ ## Operating model
13
+
14
+ Use branch/worktree safety, task tracking, one-behavior-at-a-time TDD for behavior changes, continuous validation, and scoped subagents only when their file access is safe.
15
+
16
+ ## When to use
17
+
18
+ Use for:
19
+
20
+ - implementing a reviewed plan,
21
+ - executing a `ready-for-agent` issue,
22
+ - completing an agent brief,
23
+ - running a bounded AFK implementation loop.
24
+
25
+ ## When to skip
26
+
27
+ Skip when:
28
+
29
+ - scope is vague (`vc-align` or `vc-plan`),
30
+ - a bug lacks root cause (`vc-debug`),
31
+ - a human design/security/privacy decision is unresolved,
32
+ - workspace safety cannot be established.
33
+
34
+ ## Inputs
35
+
36
+ Accept:
37
+
38
+ - plan path,
39
+ - issue reference,
40
+ - agent brief,
41
+ - small direct task.
42
+
43
+ ## Workflow
44
+
45
+ ### 1. Triage input
46
+
47
+ Classify:
48
+
49
+ | Input | Action |
50
+ |---|---|
51
+ | trivial direct edit | implement with focused verification |
52
+ | ready-for-agent issue | execute from brief |
53
+ | plan file | build task list from units |
54
+ | vague/high-risk prompt | route to align/plan |
55
+ | bug without diagnosis | route to debug |
56
+
57
+ ### 2. Check workspace safety
58
+
59
+ Before editing:
60
+
61
+ - check branch and uncommitted changes,
62
+ - avoid overwriting user work,
63
+ - avoid committing to default branch unless policy/user permits,
64
+ - prefer feature branch or isolated worktree for non-trivial work.
65
+
66
+ ### 3. Build task list
67
+
68
+ Preserve source trace:
69
+
70
+ - U-IDs,
71
+ - requirements,
72
+ - dependencies,
73
+ - execution notes,
74
+ - likely files,
75
+ - test scenarios,
76
+ - verification criteria.
77
+
78
+ Track progress in the task tracker or summary, not by editing the plan body.
79
+
80
+ ### 4. Choose execution strategy
81
+
82
+ | Strategy | Use when |
83
+ |---|---|
84
+ | Inline | 1-2 small tasks |
85
+ | Serial subagents | dependent tasks or large context |
86
+ | Parallel read-only subagents | research/review/test probes |
87
+ | Parallel write subagents | isolated worktrees or disjoint file scopes |
88
+
89
+ If the harness does not provide subagents, execute inline or serially yourself.
90
+
91
+ Never let shared-checkout subagents stage, commit, or run whole-suite tests concurrently.
92
+
93
+ ### 5. Implement behavior with TDD
94
+
95
+ For behavior-bearing work:
96
+
97
+ ```text
98
+ choose one behavior
99
+ → write one failing public-interface test
100
+ → verify it fails for the right reason
101
+ → implement minimal code
102
+ → verify green
103
+ → repeat
104
+ → refactor only while green
105
+ ```
106
+
107
+ Test observable behavior through public interfaces. Avoid tests coupled to private implementation. Add integration coverage for callbacks, persistence, permissions, retries, multi-interface parity, and external contract behavior when relevant.
108
+
109
+ Skip strict test-first for pure styling, mechanical renames, config-only changes, or cases where characterization-first is safer.
110
+
111
+ ### 6. Verify continuously
112
+
113
+ Before marking a task complete, check:
114
+
115
+ - what callbacks/middleware/jobs/events fire,
116
+ - whether tests exercise the real chain,
117
+ - whether failure can leave orphaned state,
118
+ - whether other interfaces expose the behavior,
119
+ - whether auth, privacy, payments, or external contracts are touched.
120
+
121
+ Run focused tests first, then broader affected checks as confidence grows.
122
+
123
+ ### 7. Commit only if allowed
124
+
125
+ If user/project/harness policy permits commits, commit complete logical units only when tests pass and the message describes real value. Do not create WIP commits or stage unrelated files.
126
+
127
+ ### 8. Apply review gate
128
+
129
+ | Tier | Gate |
130
+ |---|---|
131
+ | T1 | focused tests + self-review |
132
+ | T2 | affected tests + lightweight review |
133
+ | T3 | broader validation + security/correctness/performance review + human pause before merge |
134
+
135
+ ## Stop conditions
136
+
137
+ Stop when:
138
+
139
+ - acceptance criteria are ambiguous,
140
+ - three fix attempts fail,
141
+ - tests fail for unknown reasons after focused diagnosis,
142
+ - implementation reveals an unplanned product/architecture decision,
143
+ - subagent outputs conflict or risk lost work,
144
+ - high-risk behavior differs from the plan.
145
+
146
+ ## Output
147
+
148
+ ```markdown
149
+ ## Execution Summary
150
+
151
+ **Implemented:** ...
152
+ **Source:** ...
153
+ **Tests run:** ...
154
+ **Review gate:** ...
155
+ **Commits:** ...
156
+ **Open risks / follow-ups:** ...
157
+ **Recommended next skill:** vc-review | vc-learn
158
+ ```
159
+
160
+ ## Guardrails
161
+
162
+ - Do not code from vague intent.
163
+ - Do not refactor while tests are red.
164
+ - Do not let subagents write overlapping files in one checkout.
165
+ - Do not leave debug/prototype artifacts unmarked.
166
+ - Do not claim validation passed unless it did.
@@ -0,0 +1,127 @@
1
+ ---
2
+ name: vc-handoff
3
+ description: Compact session, plan, debug, review, implementation, or subagent state into a durable handoff for another agent or future session. Use when pausing work, switching agents, context is getting full, resuming later, or preparing bounded subagent/orchestrator context.
4
+ ---
5
+
6
+ # Vector Cadence Handoff
7
+
8
+ ## Purpose
9
+
10
+ Transfer useful state without dumping the whole conversation. A fresh agent should know what happened, what matters, and exactly what to do next.
11
+
12
+ ## Operating model
13
+
14
+ Reference existing artifacts instead of duplicating them, preserve failed attempts, redact sensitive data, and end with a concrete next action.
15
+
16
+ ## When to use
17
+
18
+ Use when:
19
+
20
+ - context is getting full,
21
+ - switching agents or harnesses,
22
+ - pausing work,
23
+ - handing review findings to a resolver,
24
+ - preparing a subagent brief,
25
+ - leaving a debug or implementation session mid-stream.
26
+
27
+ ## When to skip
28
+
29
+ Skip when:
30
+
31
+ - the task is complete and summarized normally,
32
+ - there are no decisions, artifacts, or next actions worth preserving,
33
+ - a source artifact already fully captures the state.
34
+
35
+ ## Handoff types
36
+
37
+ | Type | Use for |
38
+ |---|---|
39
+ | Alignment | after alignment before planning |
40
+ | Plan | after planning before execution |
41
+ | Execution | mid-implementation |
42
+ | Debug | diagnosis or stuck investigation |
43
+ | Review | unresolved findings |
44
+ | Harness | agent/tool/harness decisions |
45
+ | Subagent brief | bounded delegated task |
46
+
47
+ ## Workflow
48
+
49
+ ### 1. Gather state
50
+
51
+ Collect only what matters:
52
+
53
+ - user goal,
54
+ - current branch/worktree,
55
+ - relevant artifact paths,
56
+ - decisions made,
57
+ - open questions,
58
+ - files changed or relevant,
59
+ - tests/commands run and results,
60
+ - blockers,
61
+ - next recommended action.
62
+
63
+ ### 2. Redact and de-risk
64
+
65
+ Remove secrets, tokens, PII, private customer data, raw sensitive logs, and irrelevant conversation. Reference secure locations instead of copying sensitive details.
66
+
67
+ ### 3. Write compact handoff
68
+
69
+ Default local path:
70
+
71
+ ```text
72
+ docs/handoffs/YYYY-MM-DD-<slug>-handoff.md
73
+ ```
74
+
75
+ If temporary-only is requested, provide in chat or outside the repo.
76
+
77
+ ## Templates
78
+
79
+ ### General handoff
80
+
81
+ ```markdown
82
+ # Handoff — <topic>
83
+
84
+ ## Goal
85
+ ## Current state
86
+ ## Decisions made
87
+ ## Artifacts
88
+ ## Files changed / relevant
89
+ ## Validation so far
90
+ ## Open questions / blockers
91
+ ## Next recommended action
92
+ ## Warnings
93
+ ```
94
+
95
+ ### Subagent brief
96
+
97
+ ```markdown
98
+ # Subagent Brief — <task>
99
+
100
+ ## Role
101
+ ## Task
102
+ ## Context artifacts
103
+ ## Allowed tools / permissions
104
+ ## Files in scope
105
+ ## Files out of scope
106
+ ## Output contract
107
+ ## Constraints
108
+ ```
109
+
110
+ ## Output
111
+
112
+ ```markdown
113
+ ## Handoff Ready
114
+
115
+ **Path:** ... or provided in chat
116
+ **Type:** ...
117
+ **Next action:** ...
118
+ **Risks:** ...
119
+ **Recommended next skill:** depends on handoff type
120
+ ```
121
+
122
+ ## Guardrails
123
+
124
+ - Do not duplicate large artifacts already saved.
125
+ - Do not include secrets or raw private data.
126
+ - Do not hide failed attempts.
127
+ - Do not produce a vague “continue from here” handoff.
@@ -0,0 +1,138 @@
1
+ ---
2
+ name: vc-harness-architect
3
+ description: Design or evolve the Vector Cadence/pi harness, including wrapper CLI, extensions, subagents, provider/cache strategy, skill injection, code search, permission gates, and optional Taste integration. Use when working on harness architecture rather than application code architecture; use vc-architecture for app modules and refactors.
4
+ ---
5
+
6
+ # Vector Cadence Harness Architect
7
+
8
+ ## Purpose
9
+
10
+ Design the Vector Cadence/pi harness and its extension ecosystem. This skill handles tool/runtime architecture, not ordinary application feature work.
11
+
12
+ ## Operating model
13
+
14
+ Prefer a thin pi-first wrapper, modular packages, primitive extension tools, read-only subagents first, explicit privacy boundaries, and measurement before performance/cache claims.
15
+
16
+ ## When to use
17
+
18
+ Use for:
19
+
20
+ - Vector Cadence CLI/wrapper design,
21
+ - skill injection strategy,
22
+ - pi extensions,
23
+ - subagent runtime design,
24
+ - DeepSeek/provider/cache telemetry,
25
+ - code search tools,
26
+ - permission gates,
27
+ - optional Taste/preference integration,
28
+ - harness tests and packaging.
29
+
30
+ ## When to skip
31
+
32
+ Skip when:
33
+
34
+ - improving application module boundaries (`vc-architecture`),
35
+ - implementing a planned harness slice (`vc-execute`),
36
+ - writing a prompt-only skill (`vc-skill-author`).
37
+
38
+ ## Default package architecture
39
+
40
+ ```text
41
+ @your-scope/vc-core # shared types/config/utils
42
+ @your-scope/vc-skills # markdown skills
43
+ @your-scope/vc-extensions # pi extensions/tools
44
+ @your-scope/vc-cli # thin wrapper CLI
45
+ ```
46
+
47
+ Use bare pi as substrate. Borrow little-coder patterns. Avoid a deep fork unless extension/wrapper paths cannot support the requirement.
48
+
49
+ ## Workflow
50
+
51
+ ### 1. Classify harness area
52
+
53
+ | Area | Examples |
54
+ |---|---|
55
+ | Wrapper CLI | launch, config, curated/composable flags |
56
+ | Skill injection | selection, prompt chunks, stable ordering |
57
+ | Subagents | RPC subprocess, SDK child session, external service |
58
+ | Provider/cache | DeepSeek config, telemetry, Reasonix boundary |
59
+ | Code search | regex, symbol index, semantic search |
60
+ | Taste/preferences | explicit commands, optional prompt summary |
61
+ | Permission gates | approvals, filesystem scopes, dangerous commands |
62
+ | Testing | fake pi objects, extension registration, CLI smoke tests |
63
+
64
+ ### 2. Choose integration level
65
+
66
+ | Need | Proper layer |
67
+ |---|---|
68
+ | instructions/checklist | skill package |
69
+ | deterministic helper | script |
70
+ | new tool/behavior | extension package |
71
+ | install/default UX | wrapper CLI |
72
+ | cache-first request construction | harness core, SDK loop, or backend |
73
+
74
+ Do not solve extension problems with markdown-only instructions.
75
+
76
+ ### 3. Apply agent-native checks
77
+
78
+ Evaluate:
79
+
80
+ - action parity,
81
+ - primitive tool granularity,
82
+ - composability,
83
+ - dynamic context injection,
84
+ - shared workspace/files,
85
+ - completion signals,
86
+ - permission and approval gates,
87
+ - recovery/resume,
88
+ - agent-native tests.
89
+
90
+ ### 4. Design subagents safely
91
+
92
+ Default rollout:
93
+
94
+ 1. read-only research/review,
95
+ 2. test-running with bounded commands,
96
+ 3. serial implementation,
97
+ 4. parallel write only with isolated worktrees or disjoint file scopes.
98
+
99
+ Parent orchestrator owns final merge, validation, conflict handling, and user summary.
100
+
101
+ ### 5. Design provider/cache features honestly
102
+
103
+ DeepSeek/Reasonix strategy should progress from provider support to telemetry to prompt stability to optional Reasonix backend. Do not promise Reasonix-level prefix-cache behavior without loop-level control and measured telemetry.
104
+
105
+ ### 6. Plan tests
106
+
107
+ At minimum verify:
108
+
109
+ - extension registration,
110
+ - schema stability,
111
+ - permission enforcement,
112
+ - timeout/cancellation,
113
+ - output contracts,
114
+ - CLI help/config loading,
115
+ - curated/composable behavior.
116
+
117
+ ## Output
118
+
119
+ ```markdown
120
+ ## Harness Architecture Recommendation
121
+
122
+ **Area:** ...
123
+ **Integration level:** skill | script | extension | wrapper | SDK loop | backend
124
+ **Recommended design:** ...
125
+ **Why:** ...
126
+ **Risks:** ...
127
+ **First vertical slice:** ...
128
+ **Tests:** ...
129
+ **Recommended next skill:** vc-plan | vc-skill-author | vc-execute
130
+ ```
131
+
132
+ ## Guardrails
133
+
134
+ - Do not fork pi/little-coder unless extension/wrapper paths cannot work.
135
+ - Do not allow parallel write subagents in one checkout.
136
+ - Do not imply markdown skills can create tools by themselves.
137
+ - Do not send Taste/telemetry data without opt-in.
138
+ - Do not vary stable prompt prefixes unnecessarily in DeepSeek mode.