@techwavedev/agi-agent-kit 1.2.6 β†’ 1.2.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -14,12 +14,12 @@ allowed-tools: Read, Glob, Grep
14
14
 
15
15
  ### When to Trigger
16
16
 
17
- | Pattern | Action |
18
- |---------|--------|
19
- | "Build/Create/Make [thing]" without details | πŸ›‘ ASK 3 questions |
20
- | Complex feature or architecture | πŸ›‘ Clarify before implementing |
21
- | Update/change request | πŸ›‘ Confirm scope |
22
- | Vague requirements | πŸ›‘ Ask purpose, users, constraints |
17
+ | Pattern | Action |
18
+ | ------------------------------------------- | ---------------------------------- |
19
+ | "Build/Create/Make [thing]" without details | πŸ›‘ ASK 3 questions |
20
+ | Complex feature or architecture | πŸ›‘ Clarify before implementing |
21
+ | Update/change request | πŸ›‘ Confirm scope |
22
+ | Vague requirements | πŸ›‘ Ask purpose, users, constraints |
23
23
 
24
24
  ### 🚫 MANDATORY: 3 Questions Before Implementation
25
25
 
@@ -38,12 +38,12 @@ allowed-tools: Read, Glob, Grep
38
38
 
39
39
  ### Core Principles
40
40
 
41
- | Principle | Meaning |
42
- |-----------|---------|
43
- | **Questions Reveal Consequences** | Each question connects to an architectural decision |
44
- | **Context Before Content** | Understand greenfield/feature/refactor/debug context first |
45
- | **Minimum Viable Questions** | Each question must eliminate implementation paths |
46
- | **Generate Data, Not Assumptions** | Don't guessβ€”ask with trade-offs |
41
+ | Principle | Meaning |
42
+ | ---------------------------------- | ---------------------------------------------------------- |
43
+ | **Questions Reveal Consequences** | Each question connects to an architectural decision |
44
+ | **Context Before Content** | Understand greenfield/feature/refactor/debug context first |
45
+ | **Minimum Viable Questions** | Each question must eliminate implementation paths |
46
+ | **Generate Data, Not Assumptions** | Don't guessβ€”ask with trade-offs |
47
47
 
48
48
  ### Question Generation Process
49
49
 
@@ -62,6 +62,7 @@ allowed-tools: Read, Glob, Grep
62
62
  **Question:** [Clear question]
63
63
 
64
64
  **Why This Matters:**
65
+
65
66
  - [Architectural consequence]
66
67
  - [Affects: cost/complexity/timeline/scale]
67
68
 
@@ -83,19 +84,19 @@ allowed-tools: Read, Glob, Grep
83
84
 
84
85
  ### Status Board Format
85
86
 
86
- | Agent | Status | Current Task | Progress |
87
- |-------|--------|--------------|----------|
87
+ | Agent | Status | Current Task | Progress |
88
+ | ------------ | ---------- | ------------------ | ------------ |
88
89
  | [Agent Name] | βœ…πŸ”„β³βŒβš οΈ | [Task description] | [% or count] |
89
90
 
90
91
  ### Status Icons
91
92
 
92
- | Icon | Meaning | Usage |
93
- |------|---------|-------|
94
- | βœ… | Completed | Task finished successfully |
95
- | πŸ”„ | Running | Currently executing |
96
- | ⏳ | Waiting | Blocked, waiting for dependency |
97
- | ❌ | Error | Failed, needs attention |
98
- | ⚠️ | Warning | Potential issue, not blocking |
93
+ | Icon | Meaning | Usage |
94
+ | ---- | --------- | ------------------------------- |
95
+ | βœ… | Completed | Task finished successfully |
96
+ | πŸ”„ | Running | Currently executing |
97
+ | ⏳ | Waiting | Blocked, waiting for dependency |
98
+ | ❌ | Error | Failed, needs attention |
99
+ | ⚠️ | Warning | Potential issue, not blocking |
99
100
 
100
101
  ---
101
102
 
@@ -114,12 +115,12 @@ allowed-tools: Read, Glob, Grep
114
115
 
115
116
  ### Error Categories
116
117
 
117
- | Category | Response Strategy |
118
- |----------|-------------------|
119
- | **Port Conflict** | Offer alternative port or close existing |
120
- | **Dependency Missing** | Auto-install or ask permission |
121
- | **Build Failure** | Show specific error + suggested fix |
122
- | **Unclear Error** | Ask for specifics: screenshot, console output |
118
+ | Category | Response Strategy |
119
+ | ---------------------- | --------------------------------------------- |
120
+ | **Port Conflict** | Offer alternative port or close existing |
121
+ | **Dependency Missing** | Auto-install or ask permission |
122
+ | **Build Failure** | Show specific error + suggested fix |
123
+ | **Unclear Error** | Ask for specifics: screenshot, console output |
123
124
 
124
125
  ---
125
126
 
@@ -140,24 +141,65 @@ allowed-tools: Read, Glob, Grep
140
141
 
141
142
  ## Communication Principles
142
143
 
143
- | Principle | Implementation |
144
- |-----------|----------------|
145
- | **Concise** | No unnecessary details, get to point |
146
- | **Visual** | Use emojis (βœ…πŸ”„β³βŒ) for quick scanning |
147
- | **Specific** | "~2 minutes" not "wait a bit" |
148
- | **Alternatives** | Offer multiple paths when stuck |
149
- | **Proactive** | Suggest next step after completion |
144
+ | Principle | Implementation |
145
+ | ---------------- | ---------------------------------------- |
146
+ | **Concise** | No unnecessary details, get to point |
147
+ | **Visual** | Use emojis (βœ…πŸ”„β³βŒ) for quick scanning |
148
+ | **Specific** | "~2 minutes" not "wait a bit" |
149
+ | **Alternatives** | Offer multiple paths when stuck |
150
+ | **Proactive** | Suggest next step after completion |
150
151
 
151
152
  ---
152
153
 
153
154
  ## Anti-Patterns (AVOID)
154
155
 
155
- | Anti-Pattern | Why |
156
- |--------------|-----|
156
+ | Anti-Pattern | Why |
157
+ | ----------------------------------------- | ---------------------------- |
157
158
  | Jumping to solutions before understanding | Wastes time on wrong problem |
158
- | Assuming requirements without asking | Creates wrong output |
159
- | Over-engineering first version | Delays value delivery |
160
- | Ignoring constraints | Creates unusable solutions |
161
- | "I think" phrases | Uncertainty β†’ Ask instead |
159
+ | Assuming requirements without asking | Creates wrong output |
160
+ | Over-engineering first version | Delays value delivery |
161
+ | Ignoring constraints | Creates unusable solutions |
162
+ | "I think" phrases | Uncertainty β†’ Ask instead |
163
+
164
+ ---
165
+
166
+ ## β›” HARD-GATE: No Code Before Design Approval
167
+
168
+ > Adapted from obra/superpowers.
169
+
170
+ Do NOT write any code, scaffold any project, or take any implementation action until you have presented a design and the user has approved it. This applies to EVERY project regardless of perceived simplicity.
171
+
172
+ **"Simple" projects are where unexamined assumptions cause the most wasted work.** The design can be short (a few sentences for truly simple projects), but you MUST present it and get approval.
173
+
174
+ ---
175
+
176
+ ## Propose Approaches
177
+
178
+ Before settling on a design:
179
+
180
+ 1. **Propose 2-3 different approaches** with trade-offs
181
+ 2. **Lead with your recommendation** and explain why
182
+ 3. **Present options conversationally** β€” not exhaustive specs
183
+ 4. **Wait for user preference** before detailing the chosen approach
184
+
185
+ ---
186
+
187
+ ## Design Document
188
+
189
+ After the user approves the design:
190
+
191
+ 1. Write the validated design to `docs/plans/YYYY-MM-DD-<topic>-design.md` (or project root as `{task-slug}.md`)
192
+ 2. Commit the design document
193
+ 3. **Transition to implementation** β€” invoke `plan-writing` skill to create detailed implementation plan
194
+
195
+ > πŸ”΄ **The terminal state of brainstorming is always `plan-writing`.** Do NOT invoke any implementation skill directly from brainstorming.
162
196
 
163
197
  ---
198
+
199
+ ## Integration
200
+
201
+ | Skill | Relationship |
202
+ | ------------------------- | ----------------------------------------- |
203
+ | `plan-writing` | Next step after design approval |
204
+ | `executing-plans` | Executes the plan created by plan-writing |
205
+ | `test-driven-development` | Referenced in plan tasks |
@@ -0,0 +1,181 @@
1
+ ---
2
+ name: executing-plans
3
+ description: Structured plan execution with batch checkpoints or subagent-per-task with two-stage review. Use when you have a written implementation plan to execute methodically.
4
+ version: 1.0.0
5
+ ---
6
+
7
+ # Executing Plans
8
+
9
+ > Adapted from obra/superpowers β€” fitted to the agi multi-platform architecture.
10
+
11
+ ## Overview
12
+
13
+ Load a plan, review it critically, then execute tasks using one of two strategies. Report for review between batches.
14
+
15
+ **Core principle:** Batch execution with quality gates. Never skip verification.
16
+
17
+ ---
18
+
19
+ ## When to Use
20
+
21
+ | Scenario | Strategy |
22
+ | ----------------------------------------- | ------------------------------------------------------- |
23
+ | Have a plan, tasks are mostly independent | **Subagent-Driven** (two-stage review per task) |
24
+ | Have a plan, prefer human checkpoints | **Batch Execution** (3 tasks at a time, review between) |
25
+ | No plan exists | STOP β†’ Use `plan-writing` skill first |
26
+
27
+ ---
28
+
29
+ ## The Process
30
+
31
+ ### Step 1: Load and Review Plan
32
+
33
+ 1. Read the plan file
34
+ 2. Review critically β€” identify questions or concerns
35
+ 3. If concerns: **Raise them with the user before starting**
36
+ 4. If clear: Create task tracker and proceed
37
+
38
+ > πŸ”΄ **VIOLATION:** Starting execution with unresolved questions = failed execution.
39
+
40
+ ### Step 2: Choose Execution Mode
41
+
42
+ **Option A β€” Batch Execution (human checkpoints):**
43
+
44
+ - Execute first 3 tasks
45
+ - Report what was done + verification output
46
+ - Wait for feedback β†’ apply changes β†’ next batch
47
+ - Best for: high-risk changes, unfamiliar codebases
48
+
49
+ **Option B β€” Subagent-Driven (two-stage review):**
50
+
51
+ - Fresh context per task (no context pollution)
52
+ - Implementer β†’ Spec Reviewer β†’ Code Quality Reviewer chain
53
+ - Faster iteration, review is automated
54
+ - Best for: independent tasks, well-defined plan
55
+
56
+ ### Step 3: Execute Tasks
57
+
58
+ **For each task:**
59
+
60
+ 1. Mark as `[/]` in-progress
61
+ 2. Follow each step exactly (plan has granular steps)
62
+ 3. Run verifications as specified in the plan
63
+ 4. Mark as `[x]` completed
64
+
65
+ ### Step 4: Report (Batch Mode)
66
+
67
+ After each batch of 3 tasks:
68
+
69
+ ```markdown
70
+ ## Batch N Complete
71
+
72
+ ### Implemented
73
+
74
+ - Task X: [what was done]
75
+ - Task Y: [what was done]
76
+ - Task Z: [what was done]
77
+
78
+ ### Verification Output
79
+
80
+ [Paste actual command output]
81
+
82
+ ### Status
83
+
84
+ Ready for feedback.
85
+ ```
86
+
87
+ ### Step 5: Complete Development
88
+
89
+ After all tasks complete and verified:
90
+
91
+ - Run full verification suite (`verify_all.py` or project test suite)
92
+ - Use `verification-before-completion` skill before claiming done
93
+ - Present summary and next steps
94
+
95
+ ---
96
+
97
+ ## Two-Stage Review Protocol (Subagent-Driven Mode)
98
+
99
+ For each task, three roles execute in sequence:
100
+
101
+ ### 1. Implementer
102
+
103
+ - Reads the task from the plan (full task text provided, never the plan file)
104
+ - Asks clarifying questions if anything is unclear
105
+ - Implements following TDD: write test β†’ verify fail β†’ implement β†’ verify pass β†’ commit
106
+ - Self-reviews before handoff
107
+
108
+ ### 2. Spec Compliance Reviewer
109
+
110
+ Reviews against the plan requirements:
111
+
112
+ | Check | Pass | Fail |
113
+ | ----------------------------- | ---- | ----------------------------- |
114
+ | All requirements implemented? | βœ… | ❌ List missing items |
115
+ | Nothing extra added? | βœ… | ❌ List additions not in spec |
116
+ | Tests cover the requirement? | βœ… | ❌ List gaps |
117
+
118
+ **If issues found:** Implementer fixes β†’ re-review until βœ…
119
+
120
+ ### 3. Code Quality Reviewer
121
+
122
+ Reviews implementation quality:
123
+
124
+ | Check | Pass | Fail |
125
+ | -------------------------------------- | ---- | --------------------- |
126
+ | Clean, readable code? | βœ… | ❌ List issues |
127
+ | No magic numbers, good naming? | βœ… | ❌ List specifics |
128
+ | Edge cases handled? | βœ… | ❌ List missing cases |
129
+ | Tests are meaningful (not mock-heavy)? | βœ… | ❌ List concerns |
130
+
131
+ **If issues found:** Implementer fixes β†’ re-review until βœ…
132
+
133
+ > πŸ”΄ **Order matters:** Spec compliance FIRST, then code quality. Never reverse.
134
+
135
+ ---
136
+
137
+ ## Red Flags β€” STOP Immediately
138
+
139
+ - Starting implementation on main/master without user consent
140
+ - Skipping either review stage (spec OR quality)
141
+ - Proceeding with unfixed issues
142
+ - Guessing when blocked instead of asking
143
+ - Making the implementer read the full plan file (provide task text directly)
144
+ - Accepting "close enough" on spec compliance
145
+ - Moving to next task with open review issues
146
+
147
+ ---
148
+
149
+ ## When to Stop and Ask
150
+
151
+ **STOP executing when:**
152
+
153
+ - Hit a blocker mid-batch (missing dependency, test fails, instruction unclear)
154
+ - Plan has critical gaps preventing progress
155
+ - You don't understand an instruction
156
+ - Verification fails repeatedly (3+ times β†’ question architecture)
157
+
158
+ **Ask for clarification rather than guessing.**
159
+
160
+ ---
161
+
162
+ ## Platform Adaptation
163
+
164
+ | Platform | Subagent-Driven | Batch Execution |
165
+ | ----------------------------- | ------------------------------------- | --------------------------------- |
166
+ | **Claude Code** (Agent Teams) | Teammates as implementer/reviewers | Lead executes batches |
167
+ | **Claude Code** (Subagents) | `Task()` tool for each role | Direct execution with checkpoints |
168
+ | **Gemini / Antigravity** | Sequential persona switching per role | Direct execution with checkpoints |
169
+ | **Kiro IDE** | Autonomous agent tasks | Direct execution with PR reviews |
170
+
171
+ ---
172
+
173
+ ## Integration
174
+
175
+ | Skill | Relationship |
176
+ | -------------------------------- | ------------------------------------ |
177
+ | `plan-writing` | Creates the plan this skill executes |
178
+ | `test-driven-development` | TDD cycle used by implementers |
179
+ | `verification-before-completion` | Gate before claiming tasks complete |
180
+ | `parallel-agents` | Platform detection for subagent mode |
181
+ | `brainstorming` | Design phase before plan creation |
@@ -29,6 +29,49 @@ Agent stores in Qdrant β†’ cache for future use
29
29
  Agent responds to user with synthesized answer
30
30
  ```
31
31
 
32
+ ## Quick Start
33
+
34
+ > [!IMPORTANT]
35
+ > **Step 1: MCP Server Required.** The NotebookLM MCP server must be configured in your AI host. It is bundled with many setups, but verify it's running.
36
+
37
+ ### 1. Check if MCP is configured
38
+
39
+ The agent should call `get_health`. If the tool exists, the MCP server is active.
40
+
41
+ - βœ… `status: "ok"` β†’ MCP is running
42
+ - ❌ Tool not found β†’ Add the MCP server to your host config (see [MCP Server Setup](#mcp-server-setup))
43
+
44
+ ### 2. Authenticate (one-time)
45
+
46
+ ```
47
+ Agent calls: get_health
48
+ If authenticated: false β†’
49
+ Agent calls: setup_auth (opens a browser window)
50
+ User logs into Google account
51
+ Agent calls: get_health to verify β†’ authenticated: true βœ…
52
+ ```
53
+
54
+ > [!TIP]
55
+ > Auth is saved to disk. You only need to log in once. If it expires, the agent will detect it and propose `re_auth`.
56
+
57
+ ### 3. Add a Notebook
58
+
59
+ ```
60
+ User: "Here is my NotebookLM: https://notebooklm.google.com/notebook/..."
61
+ Agent calls: ask_question(notebook_url=URL, question="What is the content? What topics?")
62
+ Agent uses answer to fill: name, description, topics
63
+ Agent calls: add_notebook(url, name, description, topics)
64
+ ```
65
+
66
+ ### 4. Query
67
+
68
+ ```
69
+ User: "Research [topic] from my notebook"
70
+ Agent calls: ask_question(notebook_id="my-notebook", question="...")
71
+ ```
72
+
73
+ That's it. The agent handles everything else autonomously.
74
+
32
75
  ## MCP Tools Reference
33
76
 
34
77
  The agent has direct access to these tools. Use them autonomously.
@@ -71,11 +114,24 @@ The agent has direct access to these tools. Use them autonomously.
71
114
 
72
115
  ## Autonomous Workflow
73
116
 
117
+ ### Auth Gate (Mandatory First Step)
118
+
119
+ > [!CAUTION]
120
+ > **ALWAYS check auth before any NotebookLM operation.** If `authenticated: false`, propose `setup_auth` to the user before proceeding. Never silently fail.
121
+
122
+ ```
123
+ get_health β†’ authenticated?
124
+ β†’ true: proceed to step 1
125
+ β†’ false: tell user "NotebookLM needs authentication. A browser will open for Google login."
126
+ β†’ setup_auth β†’ get_health β†’ verify authenticated: true
127
+ β†’ if still false: propose cleanup_data(preserve_library=true) + setup_auth
128
+ ```
129
+
74
130
  ### On Any Research Request:
75
131
 
76
132
  1. **Check Qdrant first** β€” `memory_manager.py auto --query "..."`. If cache hit, return immediately.
77
133
 
78
- 2. **Check auth** β€” `get_health`. If not authenticated, run `setup_auth` and tell user a browser will open.
134
+ 2. **Auth gate** β€” `get_health`. If not authenticated, run `setup_auth` and tell user a browser will open. **Do not proceed without auth.**
79
135
 
80
136
  3. **Resolve notebook** β€” `list_notebooks`. If user mentions a topic, `search_notebooks`. If no notebooks exist, ask user for a NotebookLM URL and `add_notebook`.
81
137
 
@@ -445,3 +445,79 @@ After all agents/teammates complete, synthesize:
445
445
  5. **Single synthesis** β€” One unified report, not separate outputs
446
446
  6. **Verify changes** β€” Always include test-engineer for code modifications
447
447
  7. **Avoid file conflicts** β€” Assign non-overlapping file scopes to parallel agents
448
+
449
+ ---
450
+
451
+ ## Focused Agent Prompt Structure
452
+
453
+ > Adapted from obra/superpowers β€” applies when dispatching parallel agents for independent problems.
454
+
455
+ Good agent prompts are:
456
+
457
+ 1. **Focused** β€” One clear problem domain
458
+ 2. **Self-contained** β€” All context needed to understand the problem
459
+ 3. **Specific about output** β€” What should the agent return?
460
+
461
+ ```markdown
462
+ [Clear problem description with specific scope]
463
+
464
+ Context:
465
+
466
+ - [Error messages, test names, or specific symptoms]
467
+ - [Relevant file paths]
468
+
469
+ Your task:
470
+
471
+ 1. [Specific investigation step]
472
+ 2. [Root cause analysis]
473
+ 3. [Fix with constraints]
474
+
475
+ Constraints:
476
+
477
+ - [What NOT to change]
478
+ - [Scope boundaries]
479
+
480
+ Return: Summary of what you found and what you fixed.
481
+ ```
482
+
483
+ ### Common Mistakes
484
+
485
+ | ❌ Bad | βœ… Good |
486
+ | ------------------------------------------- | ---------------------------------------------- |
487
+ | "Fix all the tests" (too broad) | "Fix agent-tool-abort.test.ts" (focused scope) |
488
+ | "Fix the race condition" (no context) | Paste error messages and test names |
489
+ | No constraints (agent refactors everything) | "Do NOT change production code" |
490
+ | "Fix it" (vague output) | "Return summary of root cause and changes" |
491
+
492
+ ---
493
+
494
+ ## When NOT to Use Parallel Agents
495
+
496
+ | Scenario | Why | Do Instead |
497
+ | ------------------------- | ------------------------------------------- | -------------------------------- |
498
+ | **Related failures** | Fixing one might fix others | Investigate together first |
499
+ | **Need full context** | Understanding requires seeing entire system | Single agent investigates all |
500
+ | **Exploratory debugging** | You don't know what's broken yet | Use `systematic-debugging` skill |
501
+ | **Shared state** | Agents would interfere (editing same files) | Sequential execution |
502
+
503
+ ---
504
+
505
+ ## Review and Integrate Protocol
506
+
507
+ After all agents/teammates complete:
508
+
509
+ 1. **Review each summary** β€” Understand what changed
510
+ 2. **Check for conflicts** β€” Did agents edit overlapping code?
511
+ 3. **Run full test suite** β€” Verify all fixes work together
512
+ 4. **Spot check** β€” Agents can make systematic errors
513
+ 5. **Use `verification-before-completion`** β€” Evidence before claiming success
514
+
515
+ ---
516
+
517
+ ## Integration
518
+
519
+ | Skill | Relationship |
520
+ | -------------------------------- | --------------------------------------- |
521
+ | `executing-plans` | Plan execution with subagent modes |
522
+ | `systematic-debugging` | For investigation before parallel fixes |
523
+ | `verification-before-completion` | Gate after integration |