@jetrabbits/agentic 0.3.1 → 0.3.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/AGENTS.md +17 -30
- package/CHANGELOG.md +12 -0
- package/README.md +17 -7
- package/agentic +2 -2
- package/docs/agentic-stabilization/README.md +6 -5
- package/docs/agentic-usage.md +1 -1
- package/docs/guidance-updates/2026-05-22-centralized-guidance-memory.md +19 -0
- package/docs/opencode_setup.md +1 -1
- package/docs/review-pipeline/examples/instruction-review.example.md +95 -0
- package/docs/review-pipeline/examples/memory-curation.example.md +56 -0
- package/docs/review-pipeline/examples/summary.example.md +18 -0
- package/docs/review-pipeline.md +82 -0
- package/extensions/claude/agents/instruction_reviewer.md +132 -0
- package/extensions/claude/agents/memory_curator.md +97 -0
- package/extensions/codex/AGENTS.override.md +17 -0
- package/extensions/codex/agents/instruction_reviewer.toml +139 -0
- package/extensions/codex/agents/memory_curator.toml +104 -0
- package/extensions/gemini/agents/instruction_reviewer.md +132 -0
- package/extensions/gemini/agents/memory_curator.md +97 -0
- package/extensions/opencode/agents/developer.md +1 -1
- package/extensions/opencode/agents/devops-engineer.md +1 -1
- package/extensions/opencode/agents/instruction_reviewer.md +133 -0
- package/extensions/opencode/agents/memory_curator.md +98 -0
- package/extensions/opencode/agents/team-lead.md +1 -1
- package/extensions/opencode/opencode.json +27 -23
- package/package.json +1 -1
- package/scripts/generate_how_to_use_agentic_gif.py +565 -0
- package/extensions/opencode/plugins/agent-model-mapper.ts +0 -117
package/AGENTS.md
CHANGED
|
@@ -1,34 +1,17 @@
|
|
|
1
1
|
# AGENTS — root guidance
|
|
2
2
|
|
|
3
|
-
## Dynamic loading
|
|
3
|
+
## Dynamic guidance loading
|
|
4
4
|
|
|
5
|
-
The
|
|
6
|
-
|
|
5
|
+
The loaded guidance set is project-specific and may change per task. Do not assume the statically listed files are
|
|
6
|
+
complete; first discover optional guidance under the target project's `.agent/` directory.
|
|
7
7
|
|
|
8
|
-
Discover and load
|
|
8
|
+
Discover and load guidance in this order:
|
|
9
9
|
|
|
10
|
-
|
|
11
|
-
|
|
12
|
-
|
|
13
|
-
|
|
14
|
-
|
|
15
|
-
├── workflows/
|
|
16
|
-
└── prompts/
|
|
17
|
-
```
|
|
18
|
-
|
|
19
|
-
## Guidance chain
|
|
20
|
-
|
|
21
|
-
1. Project `.agent/` baseline
|
|
22
|
-
2. `.agent/rules/*` — load all
|
|
23
|
-
3. `.agent/skills/*/SKILL.md` — load only the skill matching the current task
|
|
24
|
-
4. `.agent/workflows/*` — load the workflow matching the triggered command
|
|
25
|
-
|
|
26
|
-
**Discovery patterns:**
|
|
27
|
-
|
|
28
|
-
- `.agent/rules/*.md`
|
|
29
|
-
- `.agent/skills/*/SKILL.md`
|
|
30
|
-
- `.agent/workflows/*.md`
|
|
31
|
-
- `.agent/prompts/*.md`
|
|
10
|
+
1. `.agent/*.md` — project baseline files, when present
|
|
11
|
+
2. `.agent/rules/*.md` — all project rules
|
|
12
|
+
3. `.agent/skills/*/SKILL.md` — only the skill matching the current task
|
|
13
|
+
4. `.agent/workflows/*.md` — only the workflow matching the triggered command
|
|
14
|
+
5. `.agent/prompts/*.md` — only when explicitly requested or referenced by loaded guidance
|
|
32
15
|
|
|
33
16
|
Prefer relative paths in references inside markdown files.
|
|
34
17
|
|
|
@@ -73,13 +56,17 @@ Cross-cutting practices that apply to every project regardless of area.
|
|
|
73
56
|
### Documentation of Behavior Changes
|
|
74
57
|
|
|
75
58
|
- Any behavior change captured in Markdown artifacts must be documented under the project `docs/` directory.
|
|
76
|
-
- Use documentation paths that match the change type, for example `docs/<feature>/README.md` for feature behavior and
|
|
77
|
-
-
|
|
78
|
-
-
|
|
59
|
+
- Use documentation paths that match the change type, for example `docs/<feature>/README.md` for feature behavior and
|
|
60
|
+
`docs/incidents/<date>-<workload>-root-cause.md` for incident root cause reports.
|
|
61
|
+
- Create or update the relevant `docs/` artifact in the same change set; do not leave behavior changes documented only
|
|
62
|
+
in workflow outputs, tickets, or PR comments.
|
|
63
|
+
- Apply the `product-owner` role to confirm that docs describe the user-facing behavior, acceptance criteria, and
|
|
64
|
+
operational constraints of the change.
|
|
79
65
|
|
|
80
66
|
### MCP Memory Providers
|
|
81
67
|
|
|
82
|
-
See [MEMORY.md](MEMORY.md) for the full protocol: provider roles, Context7 usage, MemPalace session-start queries,
|
|
68
|
+
See [MEMORY.md](MEMORY.md) for the full protocol: provider roles, Context7 usage, MemPalace session-start queries,
|
|
69
|
+
fact-writing triggers, tool call examples, and fallback order.
|
|
83
70
|
|
|
84
71
|
### Code Style
|
|
85
72
|
|
package/CHANGELOG.md
CHANGED
|
@@ -1,5 +1,17 @@
|
|
|
1
1
|
# Changelog
|
|
2
2
|
|
|
3
|
+
## v0.3.3
|
|
4
|
+
|
|
5
|
+
- Updated MemPalace project initialization to pipe explicit confirmation (`echo "Y" | mempalace init ...`) for non-interactive setup robustness.
|
|
6
|
+
- Removed `agent-model-mapper` from OpenCode plugin registration.
|
|
7
|
+
- Deleted obsolete OpenCode plugin source `extensions/opencode/plugins/agent-model-mapper.ts`.
|
|
8
|
+
|
|
9
|
+
## v0.3.2
|
|
10
|
+
|
|
11
|
+
- Added optional post-task specialist agents `instruction_reviewer` and `memory_curator` outside the mandatory SDLC role matrix.
|
|
12
|
+
- Added review pipeline guidance, `.reviews/<task-id>/` output conventions, and documented example instruction/memory review reports.
|
|
13
|
+
- Registered the new specialists in OpenCode role configuration and extended deterministic install/model-mapper coverage.
|
|
14
|
+
|
|
3
15
|
## v0.3.1
|
|
4
16
|
|
|
5
17
|
- Added project-level OpenCode plugin settings in `.agentic.json`, including Telegram `botToken` and `chatId` when `telegram-notification` is enabled.
|
package/README.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
# Agent Intelligence Configuration (agentic)
|
|
2
2
|
|
|
3
|
-
> **18 areas · 10 Software specs · 8 DevOps specs · 7 SDLC agents · 105+ skills · 73+ workflows**
|
|
3
|
+
> **18 areas · 10 Software specs · 8 DevOps specs · 7 SDLC agents + 2 specialists · 105+ skills · 73+ workflows**
|
|
4
4
|
|
|
5
5
|
A unified catalog of agentic specializations and the `agentic` CLI. Install orchestrator-ready rules, skills, workflows,
|
|
6
6
|
and prompts into any project — and run a full SDLC agent team out of the box.
|
|
@@ -77,14 +77,14 @@ agent-guides/
|
|
|
77
77
|
│ └── database-ops/ # PostgreSQL, Redis, migrations, backup/restore
|
|
78
78
|
├── extensions/
|
|
79
79
|
│ ├── opencode/ # OpenCode agent definitions, commands, skills
|
|
80
|
-
│ │ └── agents/ #
|
|
80
|
+
│ │ └── agents/ # SDLC agents + optional specialists for .opencode/agents/
|
|
81
81
|
│ ├── claude/ # Claude Code configs
|
|
82
|
-
│ │ └── agents/ #
|
|
82
|
+
│ │ └── agents/ # SDLC agents + optional specialists for .claude/agents/
|
|
83
83
|
│ ├── antigravity/ # Antigravity platform configs
|
|
84
84
|
│ ├── codex/ # Codex custom agents and override configs
|
|
85
|
-
│ │ └── agents/ #
|
|
85
|
+
│ │ └── agents/ # SDLC agents + optional specialists for .codex/agents/
|
|
86
86
|
│ └── gemini/ # Gemini-specific configs
|
|
87
|
-
│ │ └── agents/ #
|
|
87
|
+
│ │ └── agents/ # SDLC agents + optional specialists for .gemini/agents/
|
|
88
88
|
├── areas/template/ # Authoring templates — start here for new content
|
|
89
89
|
├── docs/ # Setup and usage guides
|
|
90
90
|
├── AGENTS.md # Root agent guidance (loaded into every project)
|
|
@@ -140,8 +140,8 @@ guidance bundle.
|
|
|
140
140
|
|
|
141
141
|
## SDLC Agent team
|
|
142
142
|
|
|
143
|
-
The same 7-agent team works across **Claude Code**, **OpenCode**, **Codex**, and any tool that supports agent or
|
|
144
|
-
subagent files.
|
|
143
|
+
The same 7-agent SDLC team works across **Claude Code**, **OpenCode**, **Codex**, and any tool that supports agent or
|
|
144
|
+
subagent files. Agentic also ships optional post-task review specialists for instruction quality and memory hygiene.
|
|
145
145
|
|
|
146
146
|
| Agent | Role | Invoke when |
|
|
147
147
|
|:------------------|:-----------------------------------------------|:----------------------------------------------|
|
|
@@ -156,6 +156,16 @@ subagent files.
|
|
|
156
156
|
Each agent has a `vibe` (one-line personality), `Identity`, `Communication Style`, `Success Metrics`, and explicit
|
|
157
157
|
`Boundaries` — so roles never overlap and handoffs are always documented.
|
|
158
158
|
|
|
159
|
+
Optional specialist agents run outside the mandatory SDLC role matrix:
|
|
160
|
+
|
|
161
|
+
| Agent | Role | Invoke when |
|
|
162
|
+
|:-----------------------|:-------------------------------------------------|:-------------------------------------------------|
|
|
163
|
+
| `instruction_reviewer` | Post-task instruction effectiveness review | Instructions, tool use, or role guidance changed |
|
|
164
|
+
| `memory_curator` | Post-task memory hygiene recommendations | Durable facts or memory quality need review |
|
|
165
|
+
|
|
166
|
+
See [Review Pipeline](docs/review-pipeline.md) for the guidance-mode pipeline and `.reviews/<task-id>/` output
|
|
167
|
+
convention.
|
|
168
|
+
|
|
159
169
|
| Platform | Agent path | Format | Guide |
|
|
160
170
|
|:------------|:--------------------------------|:-------------------------------|:------------------------------------------------------------------------------------------------|
|
|
161
171
|
| Claude Code | `project/.claude/agents/*.md` | Markdown with YAML frontmatter | [Claude Code subagents](https://docs.claude.com/en/api/agent-sdk/subagents) |
|
package/agentic
CHANGED
|
@@ -1946,7 +1946,7 @@ print_mempalace_project_setup_instructions() {
|
|
|
1946
1946
|
out "2) Install MemPalace:"
|
|
1947
1947
|
out " pip install mempalace"
|
|
1948
1948
|
out "3) Initialize the project memory taxonomy without LLM calls:"
|
|
1949
|
-
out " mempalace init \"$PROJECT_DIR\" --yes --no-llm"
|
|
1949
|
+
out " echo \"N\" | mempalace init \"$PROJECT_DIR\" --yes --no-llm"
|
|
1950
1950
|
out "4) Mine project knowledge into its isolated wing:"
|
|
1951
1951
|
out " mempalace mine \"$PROJECT_DIR\" --wing \"$project_wing\""
|
|
1952
1952
|
if [[ -d "$PROJECT_DIR/docs" ]]; then
|
|
@@ -2242,7 +2242,7 @@ initialize_mempalace_project() {
|
|
|
2242
2242
|
return 1
|
|
2243
2243
|
fi
|
|
2244
2244
|
|
|
2245
|
-
if ! run_mempalace_command "MemPalace init" mempalace init "$PROJECT_DIR" --yes --no-llm; then
|
|
2245
|
+
if ! run_mempalace_command "MemPalace init" bash -lc "echo \"N\" | mempalace init \"$PROJECT_DIR\" --yes --no-llm"; then
|
|
2246
2246
|
print_mempalace_project_setup_instructions
|
|
2247
2247
|
return 1
|
|
2248
2248
|
fi
|
|
@@ -5,10 +5,10 @@
|
|
|
5
5
|
- Post-install doctor checks run independently for `codex`, `opencode`, `claude`, and `gemini`.
|
|
6
6
|
- `AGENTIC_DOCTOR_TIMEOUT_SECONDS` defaults to `10`; a timeout is reported as a doctor failure and install continues.
|
|
7
7
|
- Codex doctor runs non-interactively with `--ephemeral`, `--sandbox workspace-write`, and the same lightweight smoke prompt as other supported doctor targets.
|
|
8
|
-
- OpenCode uses `agent-model-mapper` instead of the removed `model-checker` artifacts.
|
|
8
|
+
- OpenCode uses an install-time `agent-model-mapper` flow instead of the removed `model-checker` artifacts.
|
|
9
9
|
- `agent-model-mapper` writes `.opencode/opencode.json` during interactive install only after confirmation.
|
|
10
|
-
- `agent-model-mapper` uses `fzf` for install-time model dropdowns when available
|
|
11
|
-
-
|
|
10
|
+
- `agent-model-mapper` uses `fzf` for install-time model dropdowns when available.
|
|
11
|
+
- No runtime OpenCode plugin is shipped for `agent-model-mapper`, so OpenCode startup never opens `fzf`, asks questions, or writes project files.
|
|
12
12
|
- Context7 offers an interactive key mode: configure without a key or enter `CONTEXT7_API_KEY` for the selected target configs.
|
|
13
13
|
- OpenCode MemPalace setup writes `mempalace-mcp` config and initializes/mines project memory into a project-specific wing without LLM calls.
|
|
14
14
|
- Telegram notification credentials are read from project `.agentic.json` when the plugin is enabled.
|
|
@@ -24,8 +24,9 @@
|
|
|
24
24
|
- Hung agent doctor commands time out and do not stop remaining selected agents from running.
|
|
25
25
|
- Doctor output includes timeout duration, exit status, and per-agent elapsed time.
|
|
26
26
|
- `extensions/opencode/plugins/model-checker.ts` and `model-checker.json` are absent.
|
|
27
|
-
- `extensions/opencode/
|
|
28
|
-
-
|
|
27
|
+
- `extensions/opencode/plugins/agent-model-mapper.ts` is absent.
|
|
28
|
+
- `extensions/opencode/opencode.json` does not list `agent-model-mapper`.
|
|
29
|
+
- Install-time model mapper execution still writes confirmed role model mappings.
|
|
29
30
|
- Telegram plugin tests prove environment-only credentials and no secret output.
|
|
30
31
|
- Real blackbox tests print created files, managed guidance sources, and MCP config evidence, then save instruction evidence to a temp file without printing Telegram secrets.
|
|
31
32
|
|
package/docs/agentic-usage.md
CHANGED
|
@@ -169,7 +169,7 @@ When `opencode` is selected, interactive installs ask whether to enable Telegram
|
|
|
169
169
|
~/.config/agentic/opencode-plugins.json
|
|
170
170
|
```
|
|
171
171
|
|
|
172
|
-
Non-interactive installs create a disabled config when no config exists. Interactive installs ask for Telegram `botToken` and `chatId` when `telegram-notification` is selected. Those credentials are written to the target project `.agentic.json` under `settings.opencode_plugins.telegram`, not to `~/.config/agentic/opencode-plugins.json`. Treat `.agentic.json` as plaintext secret-bearing project config when Telegram is enabled and do not commit it to public repositories. When enabled, `agent-model-mapper` runs during interactive `agentic install`/`agentic tui`, uses `fzf` as a dropdown picker when available, and writes `.opencode/opencode.json` only after a Confirm action. OpenCode startup
|
|
172
|
+
Non-interactive installs create a disabled config when no config exists. Interactive installs ask for Telegram `botToken` and `chatId` when `telegram-notification` is selected. Those credentials are written to the target project `.agentic.json` under `settings.opencode_plugins.telegram`, not to `~/.config/agentic/opencode-plugins.json`. Treat `.agentic.json` as plaintext secret-bearing project config when Telegram is enabled and do not commit it to public repositories. When enabled, `agent-model-mapper` runs during interactive `agentic install`/`agentic tui`, uses `fzf` as a dropdown picker when available, and writes `.opencode/opencode.json` only after a Confirm action. OpenCode startup does not load a mapper runtime plugin or prompt for model mapping.
|
|
173
173
|
|
|
174
174
|
## Context7
|
|
175
175
|
|
|
@@ -0,0 +1,19 @@
|
|
|
1
|
+
# Centralized guidance loading and memory writes
|
|
2
|
+
|
|
3
|
+
## User-facing behavior
|
|
4
|
+
|
|
5
|
+
Agent guidance loading rules are defined in the root `AGENTS.md` instead of being repeated in each `areas/**/AGENTS.md` specialization index. Area files now focus on scope, inherited constraints, overrides, and spec maps.
|
|
6
|
+
|
|
7
|
+
`MEMORY.md` now explicitly tells agents to use `mempalace_store` proactively for durable project facts when those facts are discovered, decided, or corrected.
|
|
8
|
+
|
|
9
|
+
## Acceptance criteria
|
|
10
|
+
|
|
11
|
+
- Root `AGENTS.md` contains the canonical guidance chain and `.agent/**/*.md` discovery patterns.
|
|
12
|
+
- Area specialization `AGENTS.md` files do not repeat `## Guidance chain` or `## Discovery patterns`.
|
|
13
|
+
- `areas/template/AGENTS.tmpl.md` does not reintroduce the duplicated sections for future specs.
|
|
14
|
+
- `MEMORY.md` includes a concise `mempalace_store` example with wing, optional confirmed room, text, and tags.
|
|
15
|
+
|
|
16
|
+
## Operational constraints
|
|
17
|
+
|
|
18
|
+
- Token-budget reporting uses a dependency-free estimate of `ceil(chars / 4)` unless a tokenizer dependency is intentionally added later.
|
|
19
|
+
- Validation continues to run through Makefile targets: `make lint` and `make build`.
|
package/docs/opencode_setup.md
CHANGED
|
@@ -46,7 +46,7 @@ The runtime plugin reads credentials from the project `.agentic.json`; it does n
|
|
|
46
46
|
|
|
47
47
|
Non-interactive `agentic install` defaults optional plugins to disabled when no config exists.
|
|
48
48
|
|
|
49
|
-
`agent-model-mapper` reads roles from target `.opencode/agents/*.md` and discovers model names from `~/.config/opencode/opencode.json`, then adds models from active providers in `~/.local/share/opencode/auth.json` using non-deprecated entries in `~/.cache/opencode/models.json`. When enabled, interactive `agentic install`/`agentic tui` prompts for a main and fallback model per role, using `fzf` as a dropdown picker when available, and writes `.opencode/opencode.json` only after a Confirm action. OpenCode startup never opens `fzf` or waits for model input
|
|
49
|
+
`agent-model-mapper` reads roles from target `.opencode/agents/*.md` and discovers model names from `~/.config/opencode/opencode.json`, then adds models from active providers in `~/.local/share/opencode/auth.json` using non-deprecated entries in `~/.cache/opencode/models.json`. When enabled, interactive `agentic install`/`agentic tui` prompts for a main and fallback model per role, using `fzf` as a dropdown picker when available, and writes `.opencode/opencode.json` only after a Confirm action. OpenCode startup never opens `fzf` or waits for model input because no mapper runtime plugin is shipped or registered.
|
|
50
50
|
|
|
51
51
|
For OpenCode targets, `agentic` writes generated operating guidance to `.opencode/AGENTS.md`. If OpenCode is installed
|
|
52
52
|
alongside another agent target, root `AGENTS.md` is generated as well for the non-OpenCode target.
|
|
@@ -0,0 +1,95 @@
|
|
|
1
|
+
# Instruction Effectiveness Review
|
|
2
|
+
|
|
3
|
+
## Summary
|
|
4
|
+
|
|
5
|
+
The instruction set helped the task stay inside the existing extension layout and prevented the new roles from being
|
|
6
|
+
added to the mandatory SDLC matrix. Tool discipline was mostly strong because repository facts were discovered before
|
|
7
|
+
editing. The main gap was that review pipeline behavior had to be inferred from docs rather than a dedicated guidance
|
|
8
|
+
section. No code quality findings are included because they are outside this role's scope.
|
|
9
|
+
|
|
10
|
+
## Scores
|
|
11
|
+
|
|
12
|
+
| Category | Score 0-10 | Notes |
|
|
13
|
+
|---|---:|---|
|
|
14
|
+
| Clarity | 8 | Role boundaries were clear after reading README and SDLC rules. |
|
|
15
|
+
| Usefulness | 8 | Existing extension patterns made implementation straightforward. |
|
|
16
|
+
| Tool discipline | 8 | File inspection was targeted and avoided repeated broad loops. |
|
|
17
|
+
| Memory discipline | 7 | Memory rules existed, but post-task curation was not documented. |
|
|
18
|
+
| Ambiguity resistance | 7 | The repo lacked a review pipeline section, causing one product decision. |
|
|
19
|
+
| Token efficiency | 7 | Some duplicate role text is necessary for installed agents. |
|
|
20
|
+
| Overall | 8 | Minor instruction additions are enough. |
|
|
21
|
+
|
|
22
|
+
## Effective instructions
|
|
23
|
+
|
|
24
|
+
| Instruction | Impact | Evidence |
|
|
25
|
+
|---|---|---|
|
|
26
|
+
| Keep SDLC roles one-to-one | Prevented specialist roles from replacing core SDLC owners. | `sdlc-role-responsibilities.md` keeps the mandatory matrix unchanged. |
|
|
27
|
+
| Discover project guidance before implementation | Found the extension-based agent layout. | Existing files live under `extensions/*/agents`. |
|
|
28
|
+
|
|
29
|
+
## Harmful instructions
|
|
30
|
+
|
|
31
|
+
| Instruction | Problem | Evidence |
|
|
32
|
+
|---|---|---|
|
|
33
|
+
| None | No instruction directly caused task failure. | The task completed with scoped docs and tests. |
|
|
34
|
+
|
|
35
|
+
## Missing instructions
|
|
36
|
+
|
|
37
|
+
| Missing instruction | Why needed | Suggested text |
|
|
38
|
+
|---|---|---|
|
|
39
|
+
| Post-task review pipeline guidance | Future agents need to know when specialists run and where reports go. | Add a review pipeline section that lists optional roles and `.reviews/<task-id>/` output paths. |
|
|
40
|
+
|
|
41
|
+
## Redundant instructions
|
|
42
|
+
|
|
43
|
+
| Instruction | Reason |
|
|
44
|
+
|---|---|
|
|
45
|
+
| Repeated role boundaries across extension files | Required because each installed agent file must be self-contained. |
|
|
46
|
+
|
|
47
|
+
## Tool usage findings
|
|
48
|
+
|
|
49
|
+
| Tool | Calls | Useful | Waste | Notes |
|
|
50
|
+
|---|---:|---:|---:|---|
|
|
51
|
+
| `rg` | 4 | 4 | 0 | Located role and installer references quickly. |
|
|
52
|
+
| `sed` | 5 | 5 | 0 | Confirmed local file formats before edits. |
|
|
53
|
+
| `apply_patch` | 3 | 3 | 0 | Added and updated tracked files. |
|
|
54
|
+
|
|
55
|
+
## Suggested edits
|
|
56
|
+
|
|
57
|
+
### Remove
|
|
58
|
+
|
|
59
|
+
```md
|
|
60
|
+
None.
|
|
61
|
+
```
|
|
62
|
+
|
|
63
|
+
### Replace
|
|
64
|
+
|
|
65
|
+
```md
|
|
66
|
+
The same 7-agent team works across supported IDEs.
|
|
67
|
+
```
|
|
68
|
+
|
|
69
|
+
with:
|
|
70
|
+
|
|
71
|
+
```md
|
|
72
|
+
The same 7-agent SDLC team works across supported IDEs, with optional post-task review specialists.
|
|
73
|
+
```
|
|
74
|
+
|
|
75
|
+
### Add
|
|
76
|
+
|
|
77
|
+
```md
|
|
78
|
+
Use `instruction_reviewer` and `memory_curator` after non-trivial tasks when instruction quality or memory hygiene needs review.
|
|
79
|
+
```
|
|
80
|
+
|
|
81
|
+
## Estimated waste
|
|
82
|
+
|
|
83
|
+
| Metric | Estimate |
|
|
84
|
+
|---|---:|
|
|
85
|
+
| Extra tokens | 500 |
|
|
86
|
+
| Extra tool calls | 1 |
|
|
87
|
+
| Extra retries | 0 |
|
|
88
|
+
| Extra runtime | 2 minutes |
|
|
89
|
+
|
|
90
|
+
## Final recommendation
|
|
91
|
+
|
|
92
|
+
Minor edits
|
|
93
|
+
|
|
94
|
+
The instruction set is generally effective. Add explicit review pipeline guidance so future runs do not have to infer
|
|
95
|
+
how specialist agents should be used.
|
|
@@ -0,0 +1,56 @@
|
|
|
1
|
+
# Memory Curation Report
|
|
2
|
+
|
|
3
|
+
## Summary
|
|
4
|
+
|
|
5
|
+
The task introduced a durable convention: post-task review specialists are optional and remain outside the mandatory
|
|
6
|
+
SDLC role matrix. That convention is likely to help future agent-system changes. Temporary test output, command logs,
|
|
7
|
+
and generated report examples should not be stored as memory. No automatic memory write is recommended without user or
|
|
8
|
+
orchestrator approval.
|
|
9
|
+
|
|
10
|
+
## Store
|
|
11
|
+
|
|
12
|
+
| Priority | Fact | Reason | Suggested memory text |
|
|
13
|
+
|---|---|---|---|
|
|
14
|
+
| High | `instruction_reviewer` and `memory_curator` are optional post-task specialists, not SDLC owners. | Prevents future role-boundary drift. | Agentic treats `instruction_reviewer` and `memory_curator` as optional post-task review specialists outside the mandatory SDLC role matrix. |
|
|
15
|
+
| Medium | Review artifacts use `.reviews/<task-id>/` or timestamp fallback. | Helps future tasks place reports consistently. | Post-task review reports should be written under `.reviews/<task-id>/`, or `.reviews/YYYY-MM-DD-HHMMSS/` when no task id exists. |
|
|
16
|
+
|
|
17
|
+
## Update
|
|
18
|
+
|
|
19
|
+
| Existing memory | Replace with | Reason |
|
|
20
|
+
|---|---|---|
|
|
21
|
+
| None | None | No stale memory was identified. |
|
|
22
|
+
|
|
23
|
+
## Merge
|
|
24
|
+
|
|
25
|
+
| Memory A | Memory B | Merged memory | Reason |
|
|
26
|
+
|---|---|---|---|
|
|
27
|
+
| None | None | None | No duplicate memory was identified. |
|
|
28
|
+
|
|
29
|
+
## Ignore
|
|
30
|
+
|
|
31
|
+
| Fact | Reason |
|
|
32
|
+
|---|---|
|
|
33
|
+
| Exact shell output from test runs | Temporary logs are low-value memory. |
|
|
34
|
+
| Generated example report wording | Generated code/docs examples should remain in files, not memory. |
|
|
35
|
+
| One-time task status | Current task state is transient. |
|
|
36
|
+
|
|
37
|
+
## Delete candidates
|
|
38
|
+
|
|
39
|
+
| Memory | Reason |
|
|
40
|
+
|---|---|
|
|
41
|
+
| None | No delete candidate was found. |
|
|
42
|
+
|
|
43
|
+
## Contradictions
|
|
44
|
+
|
|
45
|
+
| Memory | New information | Resolution |
|
|
46
|
+
|---|---|---|
|
|
47
|
+
| None | None | No contradiction found. |
|
|
48
|
+
|
|
49
|
+
## Final recommendation
|
|
50
|
+
|
|
51
|
+
Store count: 2
|
|
52
|
+
Update count: 0
|
|
53
|
+
Merge count: 0
|
|
54
|
+
Delete candidate count: 0
|
|
55
|
+
Memory quality score: 8/10
|
|
56
|
+
Store only the two durable conventions. Ignore logs, generated examples, and current task progress.
|
|
@@ -0,0 +1,18 @@
|
|
|
1
|
+
# Review Summary
|
|
2
|
+
|
|
3
|
+
## Instruction review
|
|
4
|
+
|
|
5
|
+
Overall score: 8/10
|
|
6
|
+
Recommendation: Minor edits
|
|
7
|
+
|
|
8
|
+
## Memory curation
|
|
9
|
+
|
|
10
|
+
Store: 2
|
|
11
|
+
Update: 0
|
|
12
|
+
Delete candidates: 0
|
|
13
|
+
|
|
14
|
+
## Action items
|
|
15
|
+
|
|
16
|
+
- [ ] Document optional post-task specialists in README.
|
|
17
|
+
- [ ] Keep specialist roles outside the mandatory SDLC matrix.
|
|
18
|
+
- [ ] Store only durable review-pipeline conventions, not logs or generated examples.
|
|
@@ -0,0 +1,82 @@
|
|
|
1
|
+
# Review Pipeline
|
|
2
|
+
|
|
3
|
+
Agentic ships two optional post-task specialist agents:
|
|
4
|
+
|
|
5
|
+
- `instruction_reviewer`: reviews how instructions affected task execution.
|
|
6
|
+
- `memory_curator`: recommends long-term memory store, update, merge, ignore, and delete-candidate actions.
|
|
7
|
+
|
|
8
|
+
These agents are outside the mandatory SDLC role matrix. They do not replace `product-owner`, `pm`, `team-lead`,
|
|
9
|
+
`developer`, `qa`, `designer`, or `devops-engineer`.
|
|
10
|
+
|
|
11
|
+
## Guidance-mode integration
|
|
12
|
+
|
|
13
|
+
Agentic currently provides guidance and IDE agent definitions for the review pipeline. It does not run a generic
|
|
14
|
+
post-task review runner. The parent or orchestrating agent should call the specialists after task execution when the
|
|
15
|
+
task size and risk justify the extra review.
|
|
16
|
+
|
|
17
|
+
Small tasks may skip this pipeline.
|
|
18
|
+
|
|
19
|
+
```yaml
|
|
20
|
+
review_pipeline:
|
|
21
|
+
enabled: true
|
|
22
|
+
default:
|
|
23
|
+
- qa
|
|
24
|
+
- instruction_reviewer
|
|
25
|
+
- memory_curator
|
|
26
|
+
task_types:
|
|
27
|
+
agent_system:
|
|
28
|
+
- qa
|
|
29
|
+
- instruction_reviewer
|
|
30
|
+
- memory_curator
|
|
31
|
+
docs:
|
|
32
|
+
- instruction_reviewer
|
|
33
|
+
- memory_curator
|
|
34
|
+
code:
|
|
35
|
+
- qa
|
|
36
|
+
- instruction_reviewer
|
|
37
|
+
- memory_curator
|
|
38
|
+
```
|
|
39
|
+
|
|
40
|
+
`tool_optimizer` may be added to `agent_system` tasks in projects that install such a role. This repository does not
|
|
41
|
+
ship a `tool_optimizer` role.
|
|
42
|
+
|
|
43
|
+
## Output files
|
|
44
|
+
|
|
45
|
+
When the orchestrating agent writes review artifacts, use this layout:
|
|
46
|
+
|
|
47
|
+
```text
|
|
48
|
+
.reviews/<task-id>/
|
|
49
|
+
├── instruction-review.md
|
|
50
|
+
├── memory-curation.md
|
|
51
|
+
└── summary.md
|
|
52
|
+
```
|
|
53
|
+
|
|
54
|
+
If the task id is unavailable, use a timestamp in `YYYY-MM-DD-HHMMSS` format, for example:
|
|
55
|
+
|
|
56
|
+
```text
|
|
57
|
+
.reviews/2026-05-26-153000/
|
|
58
|
+
```
|
|
59
|
+
|
|
60
|
+
The specialist agents only produce Markdown reports. They do not write memory automatically and do not create review
|
|
61
|
+
files unless the parent task explicitly grants file-writing scope.
|
|
62
|
+
|
|
63
|
+
Example reports live under `docs/review-pipeline/examples/`.
|
|
64
|
+
|
|
65
|
+
## Report boundaries
|
|
66
|
+
|
|
67
|
+
`instruction_reviewer` reviews instruction effects only:
|
|
68
|
+
|
|
69
|
+
- `AGENTS.md`, `MEMORY.md`, role prompts, workflows, and tool guidance
|
|
70
|
+
- instruction clarity, usefulness, conflicts, redundancy, and missing rules
|
|
71
|
+
- repeated search loops, unnecessary memory lookups, unnecessary MCP calls, and token/tool waste
|
|
72
|
+
|
|
73
|
+
It must not review code quality or product requirements.
|
|
74
|
+
|
|
75
|
+
`memory_curator` reviews memory hygiene only:
|
|
76
|
+
|
|
77
|
+
- durable project facts, conventions, workflows, decisions, constraints, and rationale
|
|
78
|
+
- duplicate, stale, contradictory, or low-value memory candidates
|
|
79
|
+
- store/update/merge/ignore/delete recommendations
|
|
80
|
+
|
|
81
|
+
It must not store temporary logs, one-time commands, transient errors, generated code, secrets, temporary URLs, noisy
|
|
82
|
+
debug output, or current task state.
|
|
@@ -0,0 +1,132 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: instruction_reviewer
|
|
3
|
+
description: Use this agent after task execution to review how AGENTS.md, MEMORY.md, role prompts, and tool-use instructions affected the run. It does not review code quality or product requirements.
|
|
4
|
+
---
|
|
5
|
+
|
|
6
|
+
# Instruction Reviewer
|
|
7
|
+
|
|
8
|
+
You are Instruction Reviewer.
|
|
9
|
+
Your job is to evaluate how agent instructions affected task execution.
|
|
10
|
+
You do NOT review code quality.
|
|
11
|
+
You do NOT review product requirements.
|
|
12
|
+
You do NOT rewrite the implementation unless an instruction directly caused a problem.
|
|
13
|
+
|
|
14
|
+
Analyze:
|
|
15
|
+
- AGENTS.md
|
|
16
|
+
- MEMORY.md
|
|
17
|
+
- role prompts
|
|
18
|
+
- task description
|
|
19
|
+
- execution log
|
|
20
|
+
- tool calls
|
|
21
|
+
- final diff
|
|
22
|
+
- test results
|
|
23
|
+
- review artifacts
|
|
24
|
+
|
|
25
|
+
Focus on:
|
|
26
|
+
- instruction clarity
|
|
27
|
+
- instruction usefulness
|
|
28
|
+
- instruction conflicts
|
|
29
|
+
- redundant rules
|
|
30
|
+
- missing rules
|
|
31
|
+
- excessive tool usage
|
|
32
|
+
- repeated search loops
|
|
33
|
+
- unnecessary memory lookups
|
|
34
|
+
- unnecessary MCP calls
|
|
35
|
+
- token waste
|
|
36
|
+
- context reuse
|
|
37
|
+
|
|
38
|
+
Output only a markdown report.
|
|
39
|
+
Use this structure:
|
|
40
|
+
|
|
41
|
+
# Instruction Effectiveness Review
|
|
42
|
+
|
|
43
|
+
## Summary
|
|
44
|
+
|
|
45
|
+
Brief 3-5 sentence summary.
|
|
46
|
+
|
|
47
|
+
## Scores
|
|
48
|
+
|
|
49
|
+
| Category | Score 0-10 | Notes |
|
|
50
|
+
|---|---:|---|
|
|
51
|
+
| Clarity | | |
|
|
52
|
+
| Usefulness | | |
|
|
53
|
+
| Tool discipline | | |
|
|
54
|
+
| Memory discipline | | |
|
|
55
|
+
| Ambiguity resistance | | |
|
|
56
|
+
| Token efficiency | | |
|
|
57
|
+
| Overall | | |
|
|
58
|
+
|
|
59
|
+
## Effective instructions
|
|
60
|
+
|
|
61
|
+
| Instruction | Impact | Evidence |
|
|
62
|
+
|---|---|---|
|
|
63
|
+
| | | |
|
|
64
|
+
|
|
65
|
+
## Harmful instructions
|
|
66
|
+
|
|
67
|
+
| Instruction | Problem | Evidence |
|
|
68
|
+
|---|---|---|
|
|
69
|
+
| | | |
|
|
70
|
+
|
|
71
|
+
## Missing instructions
|
|
72
|
+
|
|
73
|
+
| Missing instruction | Why needed | Suggested text |
|
|
74
|
+
|---|---|---|
|
|
75
|
+
| | | |
|
|
76
|
+
|
|
77
|
+
## Redundant instructions
|
|
78
|
+
|
|
79
|
+
| Instruction | Reason |
|
|
80
|
+
|---|---|
|
|
81
|
+
| | |
|
|
82
|
+
|
|
83
|
+
## Tool usage findings
|
|
84
|
+
|
|
85
|
+
| Tool | Calls | Useful | Waste | Notes |
|
|
86
|
+
|---|---:|---:|---:|---|
|
|
87
|
+
| | | | | |
|
|
88
|
+
|
|
89
|
+
## Suggested edits
|
|
90
|
+
|
|
91
|
+
### Remove
|
|
92
|
+
|
|
93
|
+
```md
|
|
94
|
+
...
|
|
95
|
+
```
|
|
96
|
+
|
|
97
|
+
### Replace
|
|
98
|
+
|
|
99
|
+
```md
|
|
100
|
+
...
|
|
101
|
+
```
|
|
102
|
+
|
|
103
|
+
with:
|
|
104
|
+
|
|
105
|
+
```md
|
|
106
|
+
...
|
|
107
|
+
```
|
|
108
|
+
|
|
109
|
+
### Add
|
|
110
|
+
|
|
111
|
+
```md
|
|
112
|
+
...
|
|
113
|
+
```
|
|
114
|
+
|
|
115
|
+
## Estimated waste
|
|
116
|
+
|
|
117
|
+
| Metric | Estimate |
|
|
118
|
+
|---|---:|
|
|
119
|
+
| Extra tokens | |
|
|
120
|
+
| Extra tool calls | |
|
|
121
|
+
| Extra retries | |
|
|
122
|
+
| Extra runtime | |
|
|
123
|
+
|
|
124
|
+
## Final recommendation
|
|
125
|
+
|
|
126
|
+
Choose one:
|
|
127
|
+
|
|
128
|
+
- Keep as-is
|
|
129
|
+
- Minor edits
|
|
130
|
+
- Significant rewrite
|
|
131
|
+
|
|
132
|
+
Explain in 2-5 sentences.
|