@fredericboyer/dev-team 0.8.1 → 0.10.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (45) hide show
  1. package/dist/create-agent.js +20 -6
  2. package/dist/create-agent.js.map +1 -1
  3. package/dist/init.d.ts +8 -1
  4. package/dist/init.js +71 -5
  5. package/dist/init.js.map +1 -1
  6. package/dist/status.js +12 -6
  7. package/dist/status.js.map +1 -1
  8. package/dist/update.d.ts +6 -0
  9. package/dist/update.js +107 -0
  10. package/dist/update.js.map +1 -1
  11. package/package.json +2 -2
  12. package/templates/CLAUDE.md +25 -11
  13. package/templates/agent-memory/dev-team-beck/MEMORY.md +20 -6
  14. package/templates/agent-memory/dev-team-borges/MEMORY.md +20 -6
  15. package/templates/agent-memory/dev-team-brooks/MEMORY.md +20 -6
  16. package/templates/agent-memory/dev-team-conway/MEMORY.md +20 -6
  17. package/templates/agent-memory/dev-team-deming/MEMORY.md +21 -7
  18. package/templates/agent-memory/dev-team-drucker/MEMORY.md +20 -6
  19. package/templates/agent-memory/dev-team-hamilton/MEMORY.md +20 -6
  20. package/templates/agent-memory/dev-team-knuth/MEMORY.md +20 -6
  21. package/templates/agent-memory/dev-team-mori/MEMORY.md +20 -6
  22. package/templates/agent-memory/dev-team-szabo/MEMORY.md +20 -6
  23. package/templates/agent-memory/dev-team-tufte/MEMORY.md +20 -6
  24. package/templates/agent-memory/dev-team-voss/MEMORY.md +20 -6
  25. package/templates/agents/dev-team-beck.md +3 -0
  26. package/templates/agents/dev-team-borges.md +119 -11
  27. package/templates/agents/dev-team-brooks.md +10 -0
  28. package/templates/agents/dev-team-conway.md +3 -0
  29. package/templates/agents/dev-team-deming.md +3 -0
  30. package/templates/agents/dev-team-drucker.md +114 -2
  31. package/templates/agents/dev-team-hamilton.md +3 -0
  32. package/templates/agents/dev-team-knuth.md +10 -0
  33. package/templates/agents/dev-team-mori.md +3 -0
  34. package/templates/agents/dev-team-szabo.md +10 -0
  35. package/templates/agents/dev-team-tufte.md +3 -0
  36. package/templates/agents/dev-team-voss.md +3 -0
  37. package/templates/dev-team-learnings.md +3 -1
  38. package/templates/dev-team-metrics.md +18 -0
  39. package/templates/hooks/dev-team-post-change-review.js +71 -0
  40. package/templates/skills/dev-team-assess/SKILL.md +20 -0
  41. package/templates/skills/dev-team-audit/SKILL.md +1 -1
  42. package/templates/skills/dev-team-review/SKILL.md +36 -3
  43. package/templates/skills/dev-team-task/SKILL.md +30 -10
  44. package/templates/{skills → workflow-skills}/dev-team-security-status/SKILL.md +1 -1
  45. /package/templates/{skills → workflow-skills}/dev-team-merge/SKILL.md +0 -0
@@ -19,9 +19,9 @@ This project uses [dev-team](https://github.com/dev-team) — adversarial AI age
19
19
  | `@dev-team-brooks` | Architect & Quality Reviewer | Architectural review, coupling, ADR compliance, quality attributes (performance, maintainability, scalability) |
20
20
  | `@dev-team-conway` | Release Manager | Versioning, changelog, release readiness, semver validation |
21
21
  | `@dev-team-drucker` | Team Lead / Orchestrator | Auto-delegates to specialists, manages review loops, resolves conflicts |
22
- | `@dev-team-borges` | Librarian | End-of-task/review/audit memory review, cross-agent coherence, system improvement |
22
+ | `@dev-team-borges` | Librarian | End-of-task memory extraction, cross-agent coherence, system improvement |
23
23
 
24
- ### Workflow
24
+ ### Capabilities
25
25
 
26
26
  For automatic delegation, use `@dev-team-drucker` — it analyzes the task and routes to the right specialist.
27
27
 
@@ -34,15 +34,17 @@ For non-trivial work: explore the area first, then implement, then review.
34
34
  - **Hamilton** — auto-flagged when infrastructure/operations files change (Dockerfile, docker-compose, CI workflows, Terraform, Helm, k8s, health checks, monitoring config, .env templates, etc.)
35
35
  - **Voss** — auto-flagged when app config/data files change (.env, config, migrations, database, etc.)
36
36
  - **Deming** — auto-flagged when tooling files change (eslint, CI workflows, package.json, etc.)
37
- - **Tufte** — auto-flagged when documentation files change (.md, /docs/, README, etc.) AND when significant implementation files change (src/, templates/agents/, templates/skills/, templates/hooks/, bin/, package.json) to detect doc-code drift
37
+ - **Tufte** — auto-flagged when documentation files change (.md, /docs/, README, etc.) AND when significant implementation files change to detect doc-code drift
38
38
  - **Brooks** — auto-flagged when any non-test implementation code changes (quality attributes) and when architectural boundaries are touched (/adr/, /core/, /domain/, /lib/, build config, etc.)
39
39
  - **Conway** — auto-flagged when release artifacts change (package.json, changelog, version files, release/publish/deploy workflows, etc.)
40
40
 
41
41
  **End-of-workflow agents:**
42
- - **Borges** — mandatory at end of every `/dev-team:task`, `/dev-team:review`, `/dev-team:audit`, and `/dev-team:assess`. Reviews memory freshness, cross-agent coherence, and system improvement opportunities.
42
+ - **Borges** — mandatory at end of every `/dev-team:task`, `/dev-team:review`, `/dev-team:audit`, and `/dev-team:assess`. Extracts structured memory entries, reviews cross-agent coherence, and identifies system improvement opportunities.
43
43
 
44
44
  **Orchestration:**
45
- - **Drucker** — delegates tasks to the right implementing agent and spawns reviewers. Szabo, Knuth, and Brooks review all code changes. Brooks covers both structural review and quality attribute assessment (performance, maintainability, scalability).
45
+ - **Drucker** — delegates tasks to the right implementing agent and spawns reviewers. Szabo, Knuth, and Brooks review all code changes.
46
+
47
+ **CRITICAL: Always run agents in the background.** When spawning Drucker or any agent for tasks that take more than a few seconds, use `run_in_background: true`. The main conversation loop must remain interactive.
46
48
 
47
49
  Agents challenge each other using classified findings:
48
50
  - `[DEFECT]` blocks progress. `[RISK]`, `[QUESTION]`, `[SUGGESTION]` are advisory.
@@ -50,7 +52,7 @@ Agents challenge each other using classified findings:
50
52
 
51
53
  ### Parallel execution
52
54
 
53
- When working on multiple independent issues, use parallel agents on separate branches. Drucker coordinates the review wave after all implementations complete. See ADR-019 for the full model: Brooks assesses file independence, implementations run concurrently, reviews are batched into a coordinated wave, defects route back per-branch, and Borges runs once across all branches at the end.
55
+ When working on multiple independent issues, use parallel agents on separate branches. Drucker coordinates the review wave after all implementations complete.
54
56
 
55
57
  ### Hook directives are MANDATORY
56
58
 
@@ -63,25 +65,37 @@ Do NOT skip this. Do NOT treat hook output as optional. If you believe a review
63
65
 
64
66
  ### Skills
65
67
 
68
+ **Framework skills** (installed automatically, updated with `dev-team update`):
66
69
  - `/dev-team:challenge` — critically examine a proposal or implementation
67
70
  - `/dev-team:task` — start an iterative task loop with adversarial review gates
68
71
  - `/dev-team:review` — orchestrated multi-agent parallel review of changes
69
72
  - `/dev-team:audit` — full codebase security + quality + tooling audit
70
- - `/dev-team:merge` — merge a PR with Copilot review handling, auto-merge, CI monitoring, and post-merge actions
71
- - `/dev-team:security-status` — check code scanning, Dependabot, and secret scanning alerts
72
73
  - `/dev-team:assess` — audit knowledge base health (learnings, agent memory, CLAUDE.md)
73
74
 
74
- ### Learnings where to write what
75
+ **Optional workflow skills** (installed to `.claude/skills/` during init, not overwritten on update):
76
+ - Check `.claude/skills/` for project-specific workflow skills (merge automation, security monitoring, etc.)
77
+
78
+ ### Memory architecture (two-tier)
75
79
 
76
80
  All project and process learnings MUST go to in-repo files, NOT to machine-local memory (`~/.claude/projects/`). Machine-local memory is invisible to other developers, agents, and sessions.
77
81
 
82
+ **Tier 1 — Shared team memory** (`.dev-team/learnings.md`):
83
+ Project facts, overruled challenges, cross-agent decisions, process rules. All agents read this at session start.
84
+
85
+ **Tier 2 — Agent calibration memory** (`.dev-team/agent-memory/<agent>/MEMORY.md`):
86
+ Domain-specific findings, known patterns, active watch lists. Each agent owns its own file. Entries include `Last-verified` dates for temporal decay.
87
+
78
88
  | What | Where | Examples |
79
89
  |------|-------|---------|
80
- | Project patterns, process rules, tech debt, overruled challenges | `.dev-team/learnings.md` | "We use PostgreSQL", "Hooks over guidelines", "Knuth's finding X was overruled because Y" |
81
- | Agent-specific calibration | `.dev-team/agent-memory/<agent>/MEMORY.md` | Szabo: "Auth uses JWT not sessions", Knuth: "Coverage weak in parsers" |
90
+ | Project patterns, process rules, tech debt, overruled challenges | `.dev-team/learnings.md` (Tier 1) | "We use PostgreSQL", "Hooks over guidelines", "Knuth's finding X was overruled because Y" |
91
+ | Agent-specific calibration | `.dev-team/agent-memory/<agent>/MEMORY.md` (Tier 2) | Szabo: "Auth uses JWT not sessions", Knuth: "Coverage weak in parsers" |
82
92
  | Formal architecture decisions | `docs/adr/` | ADR format, not learnings |
83
93
  | User-specific preferences only | Machine-local memory | Personal style, name, role — things that vary per person, not per project |
84
94
 
95
+ **Memory evolution:** New entries trigger re-evaluation of related existing entries. Duplicates are merged, contradictions are superseded, and 3+ overrules on the same tag generate calibration rules.
96
+
97
+ **Temporal decay:** Entries have `Last-verified` dates. Borges flags entries not verified in 30+ days and archives entries over 90 days to the `## Archive` section.
98
+
85
99
  When the human gives feedback about process, coding style, or tool behavior: write it to `.dev-team/learnings.md`. Only use machine-local memory for things that are truly personal and would not apply to another developer on the same project.
86
100
 
87
101
  <!-- dev-team:end -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Beck (Test Implementer)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Test Patterns and Conventions
5
-
6
-
7
- ## Framework and Runner Notes
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Borges (Librarian)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Memory Health Status
5
-
6
-
7
- ## System Improvement Log
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Recommendations accepted/deferred — tunes what to flag over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Brooks (Architect)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Project Conventions
5
-
6
-
7
- ## Patterns to Watch For
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Conway (Release Manager)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Project Conventions
5
-
6
-
7
- ## Patterns to Watch For
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
- # Agent Memory: Deming (Tooling & DX Optimizer)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
1
+ # Agent Memory: Deming (Tooling Optimizer)
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Tooling Decisions
5
-
6
-
7
- ## Hook Effectiveness
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Drucker (Orchestrator)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Delegation Patterns
5
-
6
-
7
- ## Conflict Resolution Log
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Delegation decisions that worked well or poorly — tunes routing over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Hamilton (Infrastructure Engineer)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Project Conventions
5
-
6
-
7
- ## Patterns to Watch For
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Knuth (Quality Auditor)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Coverage Gaps Identified
5
-
6
-
7
- ## Recurring Boundary Conditions
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Mori (Frontend/UI Engineer)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Project Conventions
5
-
6
-
7
- ## Patterns to Watch For
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Szabo (Security Auditor)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Trust Boundaries Mapped
5
-
6
-
7
- ## Known Attack Surfaces
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Tufte (Documentation Engineer)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Project Conventions
5
-
6
-
7
- ## Patterns to Watch For
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -1,12 +1,26 @@
1
1
  # Agent Memory: Voss (Backend Engineer)
2
- <!-- First 200 lines are loaded into agent context. Keep concise. -->
2
+ <\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
3
+ <\!-- First 200 lines are loaded into agent context. Keep concise. -->
4
+ <\!-- Borges extracts structured entries automatically after each task. -->
3
5
 
4
- ## Project Conventions
5
-
6
-
7
- ## Patterns to Watch For
6
+ ## Structured Entries
7
+ <\!-- Format:
8
+ ### [YYYY-MM-DD] Finding summary
9
+ - **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
10
+ - **Source**: PR #NNN or task description
11
+ - **Tags**: comma-separated relevant tags
12
+ - **Outcome**: accepted | overruled | deferred | fixed
13
+ - **Last-verified**: YYYY-MM-DD
14
+ - **Context**: One-sentence explanation
15
+ -->
8
16
 
17
+ ## Calibration Rules
18
+ <\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
19
+ <\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
9
20
 
10
21
  ## Calibration Log
11
- <!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
22
+ <\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
12
23
 
24
+ ## Archive
25
+ <\!-- Entries older than 90 days without verification are moved here by Borges. -->
26
+ <\!-- Not loaded into agent context but preserved for reference. -->
@@ -14,6 +14,8 @@ Your philosophy: "Red, green, refactor — in that order, every time."
14
14
 
15
15
  **Memory hygiene**: Read your MEMORY.md at session start. Remove stale entries (overruled challenges, outdated patterns). If approaching 200 lines, compress older entries into summaries.
16
16
 
17
+ **Role-aware loading**: Also read `.dev-team/learnings.md` (Tier 1). For cross-agent context, scan entries tagged `testing`, `coverage`, `boundary-condition`, `test-pattern` in other agents' memories — especially Knuth (quality findings to implement) and Voss/Mori (implementation patterns to test).
18
+
17
19
  Before writing tests:
18
20
  1. Spawn Explore subagents in parallel to understand existing test patterns, frameworks, and conventions in the project.
19
21
  2. **Research current practices** when choosing test frameworks, assertion libraries, or testing patterns. Check current documentation for the test runner and libraries in use — APIs change between versions, new matchers get added, and best practices evolve. Prefer codebase consistency over newer approaches; flag newer alternatives as `[SUGGESTION]` when they do not fit the existing conventions.
@@ -58,6 +60,7 @@ Rules:
58
60
  3. When challenged: address directly, concede when wrong, justify with a counter-scenario when you disagree.
59
61
  4. One exchange each before escalating to the human.
60
62
  5. Acknowledge good work when you see it.
63
+ 6. **Silence is golden**: If you find nothing substantive to report, say "No substantive findings" and stop generating additional findings. You must still complete the mandatory MEMORY.md write and Learnings Output steps. Do NOT manufacture `[SUGGESTION]`-level findings to fill the review. A clean review is a positive signal, not a gap to fill.
61
64
 
62
65
  ## Learning
63
66