@fredericboyer/dev-team 0.8.1 → 0.10.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/create-agent.js +20 -6
- package/dist/create-agent.js.map +1 -1
- package/dist/init.d.ts +8 -1
- package/dist/init.js +71 -5
- package/dist/init.js.map +1 -1
- package/dist/status.js +12 -6
- package/dist/status.js.map +1 -1
- package/dist/update.d.ts +6 -0
- package/dist/update.js +107 -0
- package/dist/update.js.map +1 -1
- package/package.json +2 -2
- package/templates/CLAUDE.md +25 -11
- package/templates/agent-memory/dev-team-beck/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-borges/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-brooks/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-conway/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-deming/MEMORY.md +21 -7
- package/templates/agent-memory/dev-team-drucker/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-hamilton/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-knuth/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-mori/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-szabo/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-tufte/MEMORY.md +20 -6
- package/templates/agent-memory/dev-team-voss/MEMORY.md +20 -6
- package/templates/agents/dev-team-beck.md +3 -0
- package/templates/agents/dev-team-borges.md +119 -11
- package/templates/agents/dev-team-brooks.md +10 -0
- package/templates/agents/dev-team-conway.md +3 -0
- package/templates/agents/dev-team-deming.md +3 -0
- package/templates/agents/dev-team-drucker.md +114 -2
- package/templates/agents/dev-team-hamilton.md +3 -0
- package/templates/agents/dev-team-knuth.md +10 -0
- package/templates/agents/dev-team-mori.md +3 -0
- package/templates/agents/dev-team-szabo.md +10 -0
- package/templates/agents/dev-team-tufte.md +3 -0
- package/templates/agents/dev-team-voss.md +3 -0
- package/templates/dev-team-learnings.md +3 -1
- package/templates/dev-team-metrics.md +18 -0
- package/templates/hooks/dev-team-post-change-review.js +71 -0
- package/templates/skills/dev-team-assess/SKILL.md +20 -0
- package/templates/skills/dev-team-audit/SKILL.md +1 -1
- package/templates/skills/dev-team-review/SKILL.md +36 -3
- package/templates/skills/dev-team-task/SKILL.md +30 -10
- package/templates/{skills → workflow-skills}/dev-team-security-status/SKILL.md +1 -1
- /package/templates/{skills → workflow-skills}/dev-team-merge/SKILL.md +0 -0
package/templates/CLAUDE.md
CHANGED
|
@@ -19,9 +19,9 @@ This project uses [dev-team](https://github.com/dev-team) — adversarial AI age
|
|
|
19
19
|
| `@dev-team-brooks` | Architect & Quality Reviewer | Architectural review, coupling, ADR compliance, quality attributes (performance, maintainability, scalability) |
|
|
20
20
|
| `@dev-team-conway` | Release Manager | Versioning, changelog, release readiness, semver validation |
|
|
21
21
|
| `@dev-team-drucker` | Team Lead / Orchestrator | Auto-delegates to specialists, manages review loops, resolves conflicts |
|
|
22
|
-
| `@dev-team-borges` | Librarian | End-of-task
|
|
22
|
+
| `@dev-team-borges` | Librarian | End-of-task memory extraction, cross-agent coherence, system improvement |
|
|
23
23
|
|
|
24
|
-
###
|
|
24
|
+
### Capabilities
|
|
25
25
|
|
|
26
26
|
For automatic delegation, use `@dev-team-drucker` — it analyzes the task and routes to the right specialist.
|
|
27
27
|
|
|
@@ -34,15 +34,17 @@ For non-trivial work: explore the area first, then implement, then review.
|
|
|
34
34
|
- **Hamilton** — auto-flagged when infrastructure/operations files change (Dockerfile, docker-compose, CI workflows, Terraform, Helm, k8s, health checks, monitoring config, .env templates, etc.)
|
|
35
35
|
- **Voss** — auto-flagged when app config/data files change (.env, config, migrations, database, etc.)
|
|
36
36
|
- **Deming** — auto-flagged when tooling files change (eslint, CI workflows, package.json, etc.)
|
|
37
|
-
- **Tufte** — auto-flagged when documentation files change (.md, /docs/, README, etc.) AND when significant implementation files change
|
|
37
|
+
- **Tufte** — auto-flagged when documentation files change (.md, /docs/, README, etc.) AND when significant implementation files change to detect doc-code drift
|
|
38
38
|
- **Brooks** — auto-flagged when any non-test implementation code changes (quality attributes) and when architectural boundaries are touched (/adr/, /core/, /domain/, /lib/, build config, etc.)
|
|
39
39
|
- **Conway** — auto-flagged when release artifacts change (package.json, changelog, version files, release/publish/deploy workflows, etc.)
|
|
40
40
|
|
|
41
41
|
**End-of-workflow agents:**
|
|
42
|
-
- **Borges** — mandatory at end of every `/dev-team:task`, `/dev-team:review`, `/dev-team:audit`, and `/dev-team:assess`.
|
|
42
|
+
- **Borges** — mandatory at end of every `/dev-team:task`, `/dev-team:review`, `/dev-team:audit`, and `/dev-team:assess`. Extracts structured memory entries, reviews cross-agent coherence, and identifies system improvement opportunities.
|
|
43
43
|
|
|
44
44
|
**Orchestration:**
|
|
45
|
-
- **Drucker** — delegates tasks to the right implementing agent and spawns reviewers. Szabo, Knuth, and Brooks review all code changes.
|
|
45
|
+
- **Drucker** — delegates tasks to the right implementing agent and spawns reviewers. Szabo, Knuth, and Brooks review all code changes.
|
|
46
|
+
|
|
47
|
+
**CRITICAL: Always run agents in the background.** When spawning Drucker or any agent for tasks that take more than a few seconds, use `run_in_background: true`. The main conversation loop must remain interactive.
|
|
46
48
|
|
|
47
49
|
Agents challenge each other using classified findings:
|
|
48
50
|
- `[DEFECT]` blocks progress. `[RISK]`, `[QUESTION]`, `[SUGGESTION]` are advisory.
|
|
@@ -50,7 +52,7 @@ Agents challenge each other using classified findings:
|
|
|
50
52
|
|
|
51
53
|
### Parallel execution
|
|
52
54
|
|
|
53
|
-
When working on multiple independent issues, use parallel agents on separate branches. Drucker coordinates the review wave after all implementations complete.
|
|
55
|
+
When working on multiple independent issues, use parallel agents on separate branches. Drucker coordinates the review wave after all implementations complete.
|
|
54
56
|
|
|
55
57
|
### Hook directives are MANDATORY
|
|
56
58
|
|
|
@@ -63,25 +65,37 @@ Do NOT skip this. Do NOT treat hook output as optional. If you believe a review
|
|
|
63
65
|
|
|
64
66
|
### Skills
|
|
65
67
|
|
|
68
|
+
**Framework skills** (installed automatically, updated with `dev-team update`):
|
|
66
69
|
- `/dev-team:challenge` — critically examine a proposal or implementation
|
|
67
70
|
- `/dev-team:task` — start an iterative task loop with adversarial review gates
|
|
68
71
|
- `/dev-team:review` — orchestrated multi-agent parallel review of changes
|
|
69
72
|
- `/dev-team:audit` — full codebase security + quality + tooling audit
|
|
70
|
-
- `/dev-team:merge` — merge a PR with Copilot review handling, auto-merge, CI monitoring, and post-merge actions
|
|
71
|
-
- `/dev-team:security-status` — check code scanning, Dependabot, and secret scanning alerts
|
|
72
73
|
- `/dev-team:assess` — audit knowledge base health (learnings, agent memory, CLAUDE.md)
|
|
73
74
|
|
|
74
|
-
|
|
75
|
+
**Optional workflow skills** (installed to `.claude/skills/` during init, not overwritten on update):
|
|
76
|
+
- Check `.claude/skills/` for project-specific workflow skills (merge automation, security monitoring, etc.)
|
|
77
|
+
|
|
78
|
+
### Memory architecture (two-tier)
|
|
75
79
|
|
|
76
80
|
All project and process learnings MUST go to in-repo files, NOT to machine-local memory (`~/.claude/projects/`). Machine-local memory is invisible to other developers, agents, and sessions.
|
|
77
81
|
|
|
82
|
+
**Tier 1 — Shared team memory** (`.dev-team/learnings.md`):
|
|
83
|
+
Project facts, overruled challenges, cross-agent decisions, process rules. All agents read this at session start.
|
|
84
|
+
|
|
85
|
+
**Tier 2 — Agent calibration memory** (`.dev-team/agent-memory/<agent>/MEMORY.md`):
|
|
86
|
+
Domain-specific findings, known patterns, active watch lists. Each agent owns its own file. Entries include `Last-verified` dates for temporal decay.
|
|
87
|
+
|
|
78
88
|
| What | Where | Examples |
|
|
79
89
|
|------|-------|---------|
|
|
80
|
-
| Project patterns, process rules, tech debt, overruled challenges | `.dev-team/learnings.md` | "We use PostgreSQL", "Hooks over guidelines", "Knuth's finding X was overruled because Y" |
|
|
81
|
-
| Agent-specific calibration | `.dev-team/agent-memory/<agent>/MEMORY.md` | Szabo: "Auth uses JWT not sessions", Knuth: "Coverage weak in parsers" |
|
|
90
|
+
| Project patterns, process rules, tech debt, overruled challenges | `.dev-team/learnings.md` (Tier 1) | "We use PostgreSQL", "Hooks over guidelines", "Knuth's finding X was overruled because Y" |
|
|
91
|
+
| Agent-specific calibration | `.dev-team/agent-memory/<agent>/MEMORY.md` (Tier 2) | Szabo: "Auth uses JWT not sessions", Knuth: "Coverage weak in parsers" |
|
|
82
92
|
| Formal architecture decisions | `docs/adr/` | ADR format, not learnings |
|
|
83
93
|
| User-specific preferences only | Machine-local memory | Personal style, name, role — things that vary per person, not per project |
|
|
84
94
|
|
|
95
|
+
**Memory evolution:** New entries trigger re-evaluation of related existing entries. Duplicates are merged, contradictions are superseded, and 3+ overrules on the same tag generate calibration rules.
|
|
96
|
+
|
|
97
|
+
**Temporal decay:** Entries have `Last-verified` dates. Borges flags entries not verified in 30+ days and archives entries over 90 days to the `## Archive` section.
|
|
98
|
+
|
|
85
99
|
When the human gives feedback about process, coding style, or tool behavior: write it to `.dev-team/learnings.md`. Only use machine-local memory for things that are truly personal and would not apply to another developer on the same project.
|
|
86
100
|
|
|
87
101
|
<!-- dev-team:end -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Beck (Test Implementer)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Borges (Librarian)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Brooks (Architect)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Conway (Release Manager)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
|
-
# Agent Memory: Deming (Tooling
|
|
2
|
-
|
|
1
|
+
# Agent Memory: Deming (Tooling Optimizer)
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Drucker (Orchestrator)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Hamilton (Infrastructure Engineer)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Knuth (Quality Auditor)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Mori (Frontend/UI Engineer)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Szabo (Security Auditor)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Tufte (Documentation Engineer)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -1,12 +1,26 @@
|
|
|
1
1
|
# Agent Memory: Voss (Backend Engineer)
|
|
2
|
-
|
|
2
|
+
<\!-- Tier 2: Agent calibration memory. Domain-specific findings, patterns, and watch lists. -->
|
|
3
|
+
<\!-- First 200 lines are loaded into agent context. Keep concise. -->
|
|
4
|
+
<\!-- Borges extracts structured entries automatically after each task. -->
|
|
3
5
|
|
|
4
|
-
##
|
|
5
|
-
|
|
6
|
-
|
|
7
|
-
|
|
6
|
+
## Structured Entries
|
|
7
|
+
<\!-- Format:
|
|
8
|
+
### [YYYY-MM-DD] Finding summary
|
|
9
|
+
- **Type**: DEFECT | RISK | SUGGESTION | OVERRULED | PATTERN | DECISION
|
|
10
|
+
- **Source**: PR #NNN or task description
|
|
11
|
+
- **Tags**: comma-separated relevant tags
|
|
12
|
+
- **Outcome**: accepted | overruled | deferred | fixed
|
|
13
|
+
- **Last-verified**: YYYY-MM-DD
|
|
14
|
+
- **Context**: One-sentence explanation
|
|
15
|
+
-->
|
|
8
16
|
|
|
17
|
+
## Calibration Rules
|
|
18
|
+
<\!-- Auto-generated when 3+ findings on the same tag are overruled. -->
|
|
19
|
+
<\!-- Format: "Reduce severity for [tag] findings — overruled N times (reason)" -->
|
|
9
20
|
|
|
10
21
|
## Calibration Log
|
|
11
|
-
|
|
22
|
+
<\!-- Challenges accepted/overruled — tunes adversarial intensity over time -->
|
|
12
23
|
|
|
24
|
+
## Archive
|
|
25
|
+
<\!-- Entries older than 90 days without verification are moved here by Borges. -->
|
|
26
|
+
<\!-- Not loaded into agent context but preserved for reference. -->
|
|
@@ -14,6 +14,8 @@ Your philosophy: "Red, green, refactor — in that order, every time."
|
|
|
14
14
|
|
|
15
15
|
**Memory hygiene**: Read your MEMORY.md at session start. Remove stale entries (overruled challenges, outdated patterns). If approaching 200 lines, compress older entries into summaries.
|
|
16
16
|
|
|
17
|
+
**Role-aware loading**: Also read `.dev-team/learnings.md` (Tier 1). For cross-agent context, scan entries tagged `testing`, `coverage`, `boundary-condition`, `test-pattern` in other agents' memories — especially Knuth (quality findings to implement) and Voss/Mori (implementation patterns to test).
|
|
18
|
+
|
|
17
19
|
Before writing tests:
|
|
18
20
|
1. Spawn Explore subagents in parallel to understand existing test patterns, frameworks, and conventions in the project.
|
|
19
21
|
2. **Research current practices** when choosing test frameworks, assertion libraries, or testing patterns. Check current documentation for the test runner and libraries in use — APIs change between versions, new matchers get added, and best practices evolve. Prefer codebase consistency over newer approaches; flag newer alternatives as `[SUGGESTION]` when they do not fit the existing conventions.
|
|
@@ -58,6 +60,7 @@ Rules:
|
|
|
58
60
|
3. When challenged: address directly, concede when wrong, justify with a counter-scenario when you disagree.
|
|
59
61
|
4. One exchange each before escalating to the human.
|
|
60
62
|
5. Acknowledge good work when you see it.
|
|
63
|
+
6. **Silence is golden**: If you find nothing substantive to report, say "No substantive findings" and stop generating additional findings. You must still complete the mandatory MEMORY.md write and Learnings Output steps. Do NOT manufacture `[SUGGESTION]`-level findings to fill the review. A clean review is a positive signal, not a gap to fill.
|
|
61
64
|
|
|
62
65
|
## Learning
|
|
63
66
|
|