agent-method 1.5.1 → 1.5.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +245 -143
- package/bin/{agent-method.js → wwa.js} +12 -4
- package/lib/cli/check.js +71 -71
- package/lib/cli/init.js +107 -17
- package/lib/cli/pipeline.js +1 -1
- package/lib/cli/refine.js +202 -202
- package/lib/cli/route.js +1 -1
- package/lib/cli/scan.js +28 -28
- package/lib/cli/serve.js +23 -0
- package/lib/cli/status.js +61 -61
- package/lib/cli/upgrade.js +149 -146
- package/lib/cli/watch.js +32 -0
- package/lib/init.js +296 -240
- package/lib/mcp-server.js +524 -0
- package/lib/pipeline.js +1 -1
- package/lib/registry.js +1 -1
- package/lib/watcher.js +165 -0
- package/package.json +8 -5
- package/templates/README.md +13 -9
- package/templates/entry-points/.cursorrules +3 -3
- package/templates/entry-points/AGENT.md +3 -3
- package/templates/entry-points/CLAUDE.md +3 -3
- package/templates/full/.cursorrules +3 -3
- package/templates/full/AGENT.md +3 -3
- package/templates/full/CLAUDE.md +3 -3
- package/templates/full/SESSION-LOG.md +66 -5
- package/templates/starter/.cursorrules +3 -3
- package/templates/starter/AGENT.md +3 -3
- package/templates/starter/CLAUDE.md +3 -3
- package/templates/starter/SESSION-LOG.md +66 -5
package/package.json
CHANGED
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
{
|
|
2
2
|
"name": "agent-method",
|
|
3
|
-
"version": "1.5.
|
|
4
|
-
"description": "CLI tools for the
|
|
3
|
+
"version": "1.5.5",
|
|
4
|
+
"description": "CLI tools for the wwa methodology — registry-driven routing, validation, and project setup for AI-agent-assisted development",
|
|
5
5
|
"keywords": [
|
|
6
6
|
"ai-agents",
|
|
7
7
|
"prompt-engineering",
|
|
@@ -12,9 +12,10 @@
|
|
|
12
12
|
],
|
|
13
13
|
"type": "module",
|
|
14
14
|
"license": "MIT",
|
|
15
|
-
"author": "
|
|
15
|
+
"author": "wwa contributors",
|
|
16
16
|
"bin": {
|
|
17
|
-
"
|
|
17
|
+
"wwa": "bin/wwa.js",
|
|
18
|
+
"agent-method": "bin/wwa.js"
|
|
18
19
|
},
|
|
19
20
|
"files": [
|
|
20
21
|
"bin/",
|
|
@@ -27,13 +28,15 @@
|
|
|
27
28
|
"node": ">=18.0.0"
|
|
28
29
|
},
|
|
29
30
|
"dependencies": {
|
|
31
|
+
"@modelcontextprotocol/sdk": "^1.27.1",
|
|
30
32
|
"chalk": "^5.4.0",
|
|
33
|
+
"chokidar": "^4.0.3",
|
|
31
34
|
"commander": "^12.0.0",
|
|
32
35
|
"inquirer": "^9.0.0",
|
|
33
36
|
"js-yaml": "^4.1.0"
|
|
34
37
|
},
|
|
35
38
|
"repository": {
|
|
36
39
|
"type": "git",
|
|
37
|
-
"url": "https://github.com/
|
|
40
|
+
"url": "https://github.com/anthropics/wwa"
|
|
38
41
|
}
|
|
39
42
|
}
|
package/templates/README.md
CHANGED
|
@@ -251,7 +251,7 @@ The methodology works without any tooling. For teams that want additional valida
|
|
|
251
251
|
```bash
|
|
252
252
|
npx agent-method # zero-install (Node.js 18+)
|
|
253
253
|
npm install -g agent-method # permanent install
|
|
254
|
-
pip install
|
|
254
|
+
pip install wwa-tools # Python alternative
|
|
255
255
|
```
|
|
256
256
|
|
|
257
257
|
### Developer commands
|
|
@@ -271,10 +271,10 @@ pip install agent-method-tools # Python alternative
|
|
|
271
271
|
Use friendly names everywhere — all commands accept aliases:
|
|
272
272
|
|
|
273
273
|
```bash
|
|
274
|
-
|
|
275
|
-
|
|
276
|
-
|
|
277
|
-
|
|
274
|
+
wwa init code # software project
|
|
275
|
+
wwa init context # analytical/prompt project (e.g. PromptStudy)
|
|
276
|
+
wwa init data # data index/querying project (e.g. SysMLv2)
|
|
277
|
+
wwa init mix # multi-type project
|
|
278
278
|
```
|
|
279
279
|
|
|
280
280
|
### Advanced: pipeline subcommands
|
|
@@ -283,11 +283,15 @@ For debugging routing logic: `npx agent-method pipeline classify|select|resolve|
|
|
|
283
283
|
|
|
284
284
|
### Dependencies
|
|
285
285
|
|
|
286
|
+
**Node.js (npx / npm)**:
|
|
287
|
+
- Node.js 18+
|
|
288
|
+
- commander ^12.0, js-yaml ^4.1, inquirer ^9.0, chalk ^5.0
|
|
289
|
+
|
|
290
|
+
**Python (pip)**:
|
|
286
291
|
- Python 3.9+
|
|
287
|
-
- PyYAML >= 6.0
|
|
288
|
-
- Click >= 8.0
|
|
292
|
+
- PyYAML >= 6.0, Click >= 8.0
|
|
289
293
|
|
|
290
294
|
### Future enhancements
|
|
291
295
|
|
|
292
|
-
- MCP server: `pip install
|
|
293
|
-
- Registry watcher: `pip install
|
|
296
|
+
- MCP server: `pip install wwa-tools[mcp]` — exposes pipeline as agent-callable tools
|
|
297
|
+
- Registry watcher: `pip install wwa-tools[watch]` — proactive validation on file changes
|
|
@@ -36,7 +36,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
36
36
|
| Project structure | .context/BASE.md (codebase map), this file (if new query types needed) |
|
|
37
37
|
| Intelligence layer file exceeds 300 lines | Restructure into index + components subdirectory (keep active content, archive completed sections) |
|
|
38
38
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
39
|
-
| Session close | SESSION-LOG.md (append
|
|
39
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
40
40
|
|
|
41
41
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
42
42
|
|
|
@@ -70,7 +70,7 @@ method_version: 1.5
|
|
|
70
70
|
|
|
71
71
|
## CLI tools (optional)
|
|
72
72
|
|
|
73
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
73
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
74
74
|
|
|
75
75
|
| When you want to... | Run |
|
|
76
76
|
|---------------------|-----|
|
|
@@ -95,7 +95,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
95
95
|
- Surface uncertainty as open questions in STATE.md — never guess silently
|
|
96
96
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
97
97
|
- Propose plans and wait for approval — the human controls direction
|
|
98
|
-
- At session close, append a
|
|
98
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
99
99
|
|
|
100
100
|
## Do not
|
|
101
101
|
|
|
@@ -36,7 +36,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
36
36
|
| Project structure | .context/BASE.md (codebase map), this file (if new query types needed) |
|
|
37
37
|
| Intelligence layer file exceeds 300 lines | Restructure into index + components subdirectory (keep active content, archive completed sections) |
|
|
38
38
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
39
|
-
| Session close | SESSION-LOG.md (append
|
|
39
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
40
40
|
|
|
41
41
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
42
42
|
|
|
@@ -70,7 +70,7 @@ method_version: 1.5
|
|
|
70
70
|
|
|
71
71
|
## CLI tools (optional)
|
|
72
72
|
|
|
73
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
73
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
74
74
|
|
|
75
75
|
| When you want to... | Run |
|
|
76
76
|
|---------------------|-----|
|
|
@@ -95,7 +95,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
95
95
|
- Surface uncertainty as open questions in STATE.md — never guess silently
|
|
96
96
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
97
97
|
- Propose plans and wait for approval — the human controls direction
|
|
98
|
-
- At session close, append a
|
|
98
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
99
99
|
|
|
100
100
|
## Do not
|
|
101
101
|
|
|
@@ -36,7 +36,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
36
36
|
| Project structure | .context/BASE.md (codebase map), this file (if new query types needed) |
|
|
37
37
|
| Intelligence layer file exceeds 300 lines | Restructure into index + components subdirectory (keep active content, archive completed sections) |
|
|
38
38
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
39
|
-
| Session close | SESSION-LOG.md (append
|
|
39
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
40
40
|
|
|
41
41
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
42
42
|
|
|
@@ -70,7 +70,7 @@ method_version: 1.5
|
|
|
70
70
|
|
|
71
71
|
## CLI tools (optional)
|
|
72
72
|
|
|
73
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
73
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
74
74
|
|
|
75
75
|
| When you want to... | Run |
|
|
76
76
|
|---------------------|-----|
|
|
@@ -95,7 +95,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
95
95
|
- Surface uncertainty as open questions in STATE.md — never guess silently
|
|
96
96
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
97
97
|
- Propose plans and wait for approval — the human controls direction
|
|
98
|
-
- At session close, append a
|
|
98
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
99
99
|
|
|
100
100
|
## Do not
|
|
101
101
|
|
|
@@ -41,7 +41,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
41
41
|
| File split, created, deleted, or renamed | .context/REGISTRY.md (file tree, topic index, structural log) |
|
|
42
42
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
43
43
|
| Lifecycle stage change | PROJECT-PROFILE.md (update stage + date), STATE.md (record decision) |
|
|
44
|
-
| Session close | SESSION-LOG.md (append
|
|
44
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
45
45
|
|
|
46
46
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
47
47
|
|
|
@@ -87,7 +87,7 @@ method_version: 1.5
|
|
|
87
87
|
|
|
88
88
|
## CLI tools (optional)
|
|
89
89
|
|
|
90
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
90
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
91
91
|
|
|
92
92
|
| When you want to... | Run |
|
|
93
93
|
|---------------------|-----|
|
|
@@ -113,7 +113,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
113
113
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
114
114
|
- Propose plans and wait for approval — the human controls direction
|
|
115
115
|
- SUMMARY.md entries follow audit trail format: date, plan, outcome, files, decisions, next
|
|
116
|
-
- At session close, append a
|
|
116
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
117
117
|
|
|
118
118
|
## Do not
|
|
119
119
|
|
package/templates/full/AGENT.md
CHANGED
|
@@ -41,7 +41,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
41
41
|
| File split, created, deleted, or renamed | .context/REGISTRY.md (file tree, topic index, structural log) |
|
|
42
42
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
43
43
|
| Lifecycle stage change | PROJECT-PROFILE.md (update stage + date), STATE.md (record decision) |
|
|
44
|
-
| Session close | SESSION-LOG.md (append
|
|
44
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
45
45
|
|
|
46
46
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
47
47
|
|
|
@@ -87,7 +87,7 @@ method_version: 1.5
|
|
|
87
87
|
|
|
88
88
|
## CLI tools (optional)
|
|
89
89
|
|
|
90
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
90
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
91
91
|
|
|
92
92
|
| When you want to... | Run |
|
|
93
93
|
|---------------------|-----|
|
|
@@ -113,7 +113,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
113
113
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
114
114
|
- Propose plans and wait for approval — the human controls direction
|
|
115
115
|
- SUMMARY.md entries follow audit trail format: date, plan, outcome, files, decisions, next
|
|
116
|
-
- At session close, append a
|
|
116
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
117
117
|
|
|
118
118
|
## Do not
|
|
119
119
|
|
package/templates/full/CLAUDE.md
CHANGED
|
@@ -41,7 +41,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
41
41
|
| File split, created, deleted, or renamed | .context/REGISTRY.md (file tree, topic index, structural log) |
|
|
42
42
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
43
43
|
| Lifecycle stage change | PROJECT-PROFILE.md (update stage + date), STATE.md (record decision) |
|
|
44
|
-
| Session close | SESSION-LOG.md (append
|
|
44
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
45
45
|
|
|
46
46
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
47
47
|
|
|
@@ -87,7 +87,7 @@ method_version: 1.5
|
|
|
87
87
|
|
|
88
88
|
## CLI tools (optional)
|
|
89
89
|
|
|
90
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
90
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
91
91
|
|
|
92
92
|
| When you want to... | Run |
|
|
93
93
|
|---------------------|-----|
|
|
@@ -113,7 +113,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
113
113
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
114
114
|
- Propose plans and wait for approval — the human controls direction
|
|
115
115
|
- SUMMARY.md entries follow audit trail format: date, plan, outcome, files, decisions, next
|
|
116
|
-
- At session close, append a
|
|
116
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
117
117
|
|
|
118
118
|
## Do not
|
|
119
119
|
|
|
@@ -1,8 +1,9 @@
|
|
|
1
1
|
# Session Log
|
|
2
2
|
|
|
3
|
-
Append-only session observation log for case study data collection. Each session adds a
|
|
3
|
+
Append-only session observation log for case study data collection. Each session adds a metrics entry at close. High-effort tasks log immediately at task completion. This file is read-only during extraction (never during normal work).
|
|
4
4
|
|
|
5
|
-
<!-- AGENT INSTRUCTION: At the end of every session
|
|
5
|
+
<!-- AGENT INSTRUCTION: At the end of every session — or immediately after any high-effort task —
|
|
6
|
+
append a new entry below using the format in "Entry format".
|
|
6
7
|
Do NOT read this file during normal work — only append to it.
|
|
7
8
|
Do NOT modify or delete previous entries.
|
|
8
9
|
When this file exceeds 300 lines, archive older entries to session-log/batch-{N}.md -->
|
|
@@ -17,9 +18,52 @@ Append-only session observation log for case study data collection. Each session
|
|
|
17
18
|
| Extension(s) | {code-project / data-exploration / analytical-system / none} |
|
|
18
19
|
| Observation started | {date} |
|
|
19
20
|
|
|
21
|
+
## Effort classification
|
|
22
|
+
|
|
23
|
+
| Effort | Description | When to log |
|
|
24
|
+
|--------|-------------|-------------|
|
|
25
|
+
| **low** | Quick answer, single file or no changes, <5 min | At session close |
|
|
26
|
+
| **medium** | Multi-step work, several file changes, 5-30 min | At session close |
|
|
27
|
+
| **high** | Complex multi-file changes, architecture decisions, extensive debugging, 30+ min | Immediately at task completion |
|
|
28
|
+
|
|
29
|
+
## Assessment scales
|
|
30
|
+
|
|
31
|
+
**Ambiguity** (agent-assessed — how clear was the user's request?):
|
|
32
|
+
- **low**: Clear, specific request with sufficient context
|
|
33
|
+
- **medium**: Request understood but required interpretation or assumptions
|
|
34
|
+
- **high**: Vague or ambiguous, required significant clarification
|
|
35
|
+
|
|
36
|
+
**Context level** (agent-assessed — how much project context was loaded?):
|
|
37
|
+
- **very low**: No project files loaded, answered from general knowledge
|
|
38
|
+
- **low**: Entry point only
|
|
39
|
+
- **medium**: Entry point + STATE.md + 1-2 project files
|
|
40
|
+
- **high**: Entry point + STATE.md + specialist context + multiple project files
|
|
41
|
+
- **very high**: Extensive project context, multiple specialists, cross-file analysis
|
|
42
|
+
|
|
43
|
+
**User response** (agent-observed — how did the user respond to the result?):
|
|
44
|
+
- **accepted**: User proceeded to next step without changes
|
|
45
|
+
- **edited**: User manually modified the agent's output
|
|
46
|
+
- **revised**: User asked agent to redo or revise the result
|
|
47
|
+
- **rejected**: User said no or declined the result entirely
|
|
48
|
+
- **redirected**: User changed approach or gave new instructions
|
|
49
|
+
|
|
50
|
+
**Refinement magnitude** (for medium/high effort only — how much changed from first attempt to final?):
|
|
51
|
+
- **none**: Accepted as-is, 0% changed
|
|
52
|
+
- **minor**: Small fixes — typos, naming, formatting (<10% changed)
|
|
53
|
+
- **moderate**: Logic or structural changes, added/removed sections (10–50% changed)
|
|
54
|
+
- **major**: Significant rework of approach or content (50–80% changed)
|
|
55
|
+
- **rework**: Mostly rewritten, original approach abandoned (>80% changed)
|
|
56
|
+
|
|
57
|
+
**Delta categories** (what kinds of changes were needed — select all that apply):
|
|
58
|
+
- **accuracy**: Factual errors or incorrect implementation
|
|
59
|
+
- **completeness**: Missing parts, incomplete coverage
|
|
60
|
+
- **approach**: Wrong method or strategy
|
|
61
|
+
- **scope**: Over-scoped or under-scoped
|
|
62
|
+
- **style**: Formatting, naming, conventions
|
|
63
|
+
|
|
20
64
|
## Observation checklist
|
|
21
65
|
|
|
22
|
-
At session close, reflect on these before writing the
|
|
66
|
+
At session close (or high-effort task completion), reflect on these before writing the entry:
|
|
23
67
|
1. Which workflow did this session follow?
|
|
24
68
|
2. Which query types were encountered?
|
|
25
69
|
3. Which features visibly activated? (context loading, cascade, decision recording, scoping)
|
|
@@ -27,15 +71,32 @@ At session close, reflect on these before writing the micro-entry:
|
|
|
27
71
|
5. Were any decisions deferred instead of recorded immediately?
|
|
28
72
|
6. Was there friction with any methodology rule?
|
|
29
73
|
7. Any degradation signals? (HAI-05: cascade misses, instruction loss, shallow context)
|
|
74
|
+
8. How much effort did this task require? (low / medium / high)
|
|
75
|
+
9. How ambiguous was the user's request? (low / medium / high)
|
|
76
|
+
10. How much project context was loaded? (very low / low / medium / high / very high)
|
|
77
|
+
11. Approximate token usage and time spent?
|
|
78
|
+
12. How did the user respond to the result? (accepted / edited / revised / rejected / redirected)
|
|
79
|
+
13. For medium/high effort: how many revision cycles before acceptance?
|
|
80
|
+
14. What magnitude of change between first attempt and final result? (none / minor / moderate / major / rework)
|
|
81
|
+
15. What categories of refinement were needed? (accuracy / completeness / approach / scope / style)
|
|
30
82
|
|
|
31
|
-
##
|
|
83
|
+
## Entry format
|
|
32
84
|
|
|
33
|
-
<!-- Append new entries below. Format:
|
|
85
|
+
<!-- Append new entries below. Format: -->
|
|
86
|
+
<!--
|
|
34
87
|
### S{N} — {YYYY-MM-DD} — {brief title}
|
|
35
88
|
Model: {model} | Profile: {profile} | Workflow: {WF-XX}
|
|
89
|
+
Effort: {low / medium / high} | Ambiguity: {low / medium / high} | Context: {very low / low / medium / high / very high}
|
|
90
|
+
Tokens: ~{N}k | Time: ~{N} min
|
|
36
91
|
Queries: {query types encountered}
|
|
37
92
|
Features: {feature IDs activated}
|
|
38
93
|
Cascades: {triggered}/{expected} | Decisions: {count}
|
|
94
|
+
Response: {accepted / edited / revised / rejected / redirected}
|
|
95
|
+
Revisions: {0 | count of revision cycles} | Magnitude: {none / minor / moderate / major / rework}
|
|
96
|
+
Delta: {n/a | categories: accuracy, completeness, approach, scope, style} | Survival: ~{N}%
|
|
97
|
+
Delta notes: {n/a | brief description of what changed between first attempt and final}
|
|
39
98
|
Friction: {none | brief description}
|
|
40
99
|
Finding: {none | observation with methodology implication}
|
|
41
100
|
-->
|
|
101
|
+
|
|
102
|
+
## Session entries
|
|
@@ -39,7 +39,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
39
39
|
| Intelligence layer file exceeds 300 lines | Restructure into index + components subdirectory (keep active content, archive completed sections) |
|
|
40
40
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
41
41
|
| Lifecycle stage change | PROJECT-PROFILE.md (update stage + date), STATE.md (record decision) |
|
|
42
|
-
| Session close | SESSION-LOG.md (append
|
|
42
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
43
43
|
|
|
44
44
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
45
45
|
|
|
@@ -74,7 +74,7 @@ method_version: 1.5
|
|
|
74
74
|
|
|
75
75
|
## CLI tools (optional)
|
|
76
76
|
|
|
77
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
77
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
78
78
|
|
|
79
79
|
| When you want to... | Run |
|
|
80
80
|
|---------------------|-----|
|
|
@@ -99,7 +99,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
99
99
|
- Surface uncertainty as open questions in STATE.md — never guess silently
|
|
100
100
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
101
101
|
- Propose plans and wait for approval — the human controls direction
|
|
102
|
-
- At session close, append a
|
|
102
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
103
103
|
|
|
104
104
|
## Do not
|
|
105
105
|
|
|
@@ -39,7 +39,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
39
39
|
| Intelligence layer file exceeds 300 lines | Restructure into index + components subdirectory (keep active content, archive completed sections) |
|
|
40
40
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
41
41
|
| Lifecycle stage change | PROJECT-PROFILE.md (update stage + date), STATE.md (record decision) |
|
|
42
|
-
| Session close | SESSION-LOG.md (append
|
|
42
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
43
43
|
|
|
44
44
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
45
45
|
|
|
@@ -74,7 +74,7 @@ method_version: 1.5
|
|
|
74
74
|
|
|
75
75
|
## CLI tools (optional)
|
|
76
76
|
|
|
77
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
77
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
78
78
|
|
|
79
79
|
| When you want to... | Run |
|
|
80
80
|
|---------------------|-----|
|
|
@@ -99,7 +99,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
99
99
|
- Surface uncertainty as open questions in STATE.md — never guess silently
|
|
100
100
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
101
101
|
- Propose plans and wait for approval — the human controls direction
|
|
102
|
-
- At session close, append a
|
|
102
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
103
103
|
|
|
104
104
|
## Do not
|
|
105
105
|
|
|
@@ -39,7 +39,7 @@ When a file changes, check this table and update dependent files in the same res
|
|
|
39
39
|
| Intelligence layer file exceeds 300 lines | Restructure into index + components subdirectory (keep active content, archive completed sections) |
|
|
40
40
|
| New domain area | .context/BASE.md, consider new .context/ specialist, this file (if new scoping row) |
|
|
41
41
|
| Lifecycle stage change | PROJECT-PROFILE.md (update stage + date), STATE.md (record decision) |
|
|
42
|
-
| Session close | SESSION-LOG.md (append
|
|
42
|
+
| Session close or high-effort task completion | SESSION-LOG.md (append metrics entry — effort, ambiguity, context level, tokens, time, user response, refinement delta, workflow, features, cascades, friction, findings) |
|
|
43
43
|
|
|
44
44
|
<!-- INSTRUCTION: Add project-specific cascade rules below the universal ones above. -->
|
|
45
45
|
|
|
@@ -74,7 +74,7 @@ method_version: 1.5
|
|
|
74
74
|
|
|
75
75
|
## CLI tools (optional)
|
|
76
76
|
|
|
77
|
-
Available via `npx agent-method` (zero-install) or `pip install
|
|
77
|
+
Available via `npx agent-method` (zero-install) or `pip install wwa-tools`:
|
|
78
78
|
|
|
79
79
|
| When you want to... | Run |
|
|
80
80
|
|---------------------|-----|
|
|
@@ -99,7 +99,7 @@ Available via `npx agent-method` (zero-install) or `pip install agent-method-too
|
|
|
99
99
|
- Surface uncertainty as open questions in STATE.md — never guess silently
|
|
100
100
|
- Keep intelligence layer files under 300 lines — split into index + components subdirectory when exceeded
|
|
101
101
|
- Propose plans and wait for approval — the human controls direction
|
|
102
|
-
- At session close, append a
|
|
102
|
+
- At session close or after any high-effort task, append a metrics entry to SESSION-LOG.md — include effort level, question ambiguity, context level, estimated tokens, time, user response (accepted/edited/revised/rejected/redirected), and for medium/high effort tasks: revision count, refinement magnitude (none/minor/moderate/major/rework), delta categories, and survival rate. Never skip, never read previous entries during normal work
|
|
103
103
|
|
|
104
104
|
## Do not
|
|
105
105
|
|
|
@@ -1,8 +1,9 @@
|
|
|
1
1
|
# Session Log
|
|
2
2
|
|
|
3
|
-
Append-only session observation log for case study data collection. Each session adds a
|
|
3
|
+
Append-only session observation log for case study data collection. Each session adds a metrics entry at close. High-effort tasks log immediately at task completion. This file is read-only during extraction (never during normal work).
|
|
4
4
|
|
|
5
|
-
<!-- AGENT INSTRUCTION: At the end of every session
|
|
5
|
+
<!-- AGENT INSTRUCTION: At the end of every session — or immediately after any high-effort task —
|
|
6
|
+
append a new entry below using the format in "Entry format".
|
|
6
7
|
Do NOT read this file during normal work — only append to it.
|
|
7
8
|
Do NOT modify or delete previous entries.
|
|
8
9
|
When this file exceeds 300 lines, archive older entries to session-log/batch-{N}.md -->
|
|
@@ -17,9 +18,52 @@ Append-only session observation log for case study data collection. Each session
|
|
|
17
18
|
| Extension(s) | {code-project / data-exploration / analytical-system / none} |
|
|
18
19
|
| Observation started | {date} |
|
|
19
20
|
|
|
21
|
+
## Effort classification
|
|
22
|
+
|
|
23
|
+
| Effort | Description | When to log |
|
|
24
|
+
|--------|-------------|-------------|
|
|
25
|
+
| **low** | Quick answer, single file or no changes, <5 min | At session close |
|
|
26
|
+
| **medium** | Multi-step work, several file changes, 5-30 min | At session close |
|
|
27
|
+
| **high** | Complex multi-file changes, architecture decisions, extensive debugging, 30+ min | Immediately at task completion |
|
|
28
|
+
|
|
29
|
+
## Assessment scales
|
|
30
|
+
|
|
31
|
+
**Ambiguity** (agent-assessed — how clear was the user's request?):
|
|
32
|
+
- **low**: Clear, specific request with sufficient context
|
|
33
|
+
- **medium**: Request understood but required interpretation or assumptions
|
|
34
|
+
- **high**: Vague or ambiguous, required significant clarification
|
|
35
|
+
|
|
36
|
+
**Context level** (agent-assessed — how much project context was loaded?):
|
|
37
|
+
- **very low**: No project files loaded, answered from general knowledge
|
|
38
|
+
- **low**: Entry point only
|
|
39
|
+
- **medium**: Entry point + STATE.md + 1-2 project files
|
|
40
|
+
- **high**: Entry point + STATE.md + specialist context + multiple project files
|
|
41
|
+
- **very high**: Extensive project context, multiple specialists, cross-file analysis
|
|
42
|
+
|
|
43
|
+
**User response** (agent-observed — how did the user respond to the result?):
|
|
44
|
+
- **accepted**: User proceeded to next step without changes
|
|
45
|
+
- **edited**: User manually modified the agent's output
|
|
46
|
+
- **revised**: User asked agent to redo or revise the result
|
|
47
|
+
- **rejected**: User said no or declined the result entirely
|
|
48
|
+
- **redirected**: User changed approach or gave new instructions
|
|
49
|
+
|
|
50
|
+
**Refinement magnitude** (for medium/high effort only — how much changed from first attempt to final?):
|
|
51
|
+
- **none**: Accepted as-is, 0% changed
|
|
52
|
+
- **minor**: Small fixes — typos, naming, formatting (<10% changed)
|
|
53
|
+
- **moderate**: Logic or structural changes, added/removed sections (10–50% changed)
|
|
54
|
+
- **major**: Significant rework of approach or content (50–80% changed)
|
|
55
|
+
- **rework**: Mostly rewritten, original approach abandoned (>80% changed)
|
|
56
|
+
|
|
57
|
+
**Delta categories** (what kinds of changes were needed — select all that apply):
|
|
58
|
+
- **accuracy**: Factual errors or incorrect implementation
|
|
59
|
+
- **completeness**: Missing parts, incomplete coverage
|
|
60
|
+
- **approach**: Wrong method or strategy
|
|
61
|
+
- **scope**: Over-scoped or under-scoped
|
|
62
|
+
- **style**: Formatting, naming, conventions
|
|
63
|
+
|
|
20
64
|
## Observation checklist
|
|
21
65
|
|
|
22
|
-
At session close, reflect on these before writing the
|
|
66
|
+
At session close (or high-effort task completion), reflect on these before writing the entry:
|
|
23
67
|
1. Which workflow did this session follow?
|
|
24
68
|
2. Which query types were encountered?
|
|
25
69
|
3. Which features visibly activated? (context loading, cascade, decision recording, scoping)
|
|
@@ -27,15 +71,32 @@ At session close, reflect on these before writing the micro-entry:
|
|
|
27
71
|
5. Were any decisions deferred instead of recorded immediately?
|
|
28
72
|
6. Was there friction with any methodology rule?
|
|
29
73
|
7. Any degradation signals? (HAI-05: cascade misses, instruction loss, shallow context)
|
|
74
|
+
8. How much effort did this task require? (low / medium / high)
|
|
75
|
+
9. How ambiguous was the user's request? (low / medium / high)
|
|
76
|
+
10. How much project context was loaded? (very low / low / medium / high / very high)
|
|
77
|
+
11. Approximate token usage and time spent?
|
|
78
|
+
12. How did the user respond to the result? (accepted / edited / revised / rejected / redirected)
|
|
79
|
+
13. For medium/high effort: how many revision cycles before acceptance?
|
|
80
|
+
14. What magnitude of change between first attempt and final result? (none / minor / moderate / major / rework)
|
|
81
|
+
15. What categories of refinement were needed? (accuracy / completeness / approach / scope / style)
|
|
30
82
|
|
|
31
|
-
##
|
|
83
|
+
## Entry format
|
|
32
84
|
|
|
33
|
-
<!-- Append new entries below. Format:
|
|
85
|
+
<!-- Append new entries below. Format: -->
|
|
86
|
+
<!--
|
|
34
87
|
### S{N} — {YYYY-MM-DD} — {brief title}
|
|
35
88
|
Model: {model} | Profile: {profile} | Workflow: {WF-XX}
|
|
89
|
+
Effort: {low / medium / high} | Ambiguity: {low / medium / high} | Context: {very low / low / medium / high / very high}
|
|
90
|
+
Tokens: ~{N}k | Time: ~{N} min
|
|
36
91
|
Queries: {query types encountered}
|
|
37
92
|
Features: {feature IDs activated}
|
|
38
93
|
Cascades: {triggered}/{expected} | Decisions: {count}
|
|
94
|
+
Response: {accepted / edited / revised / rejected / redirected}
|
|
95
|
+
Revisions: {0 | count of revision cycles} | Magnitude: {none / minor / moderate / major / rework}
|
|
96
|
+
Delta: {n/a | categories: accuracy, completeness, approach, scope, style} | Survival: ~{N}%
|
|
97
|
+
Delta notes: {n/a | brief description of what changed between first attempt and final}
|
|
39
98
|
Friction: {none | brief description}
|
|
40
99
|
Finding: {none | observation with methodology implication}
|
|
41
100
|
-->
|
|
101
|
+
|
|
102
|
+
## Session entries
|