npm - @techwavedev/agi-agent-kit - Versions diffs - 1.2.6 → 1.2.8 - Mend

@techwavedev/agi-agent-kit 1.2.6 → 1.2.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (13) hide show

package/CHANGELOG.md +51 -0
package/README.md +51 -21
package/package.json +1 -1
package/templates/base/CHANGELOG.md +315 -0
package/templates/base/README.md +51 -21
package/templates/skills/knowledge/brainstorming/SKILL.md +82 -40
package/templates/skills/knowledge/executing-plans/SKILL.md +181 -0
package/templates/skills/knowledge/notebooklm-rag/SKILL.md +57 -1
package/templates/skills/knowledge/parallel-agents/SKILL.md +76 -0
package/templates/skills/knowledge/plan-writing/SKILL.md +96 -21
package/templates/skills/knowledge/systematic-debugging/SKILL.md +189 -84
package/templates/skills/knowledge/test-driven-development/SKILL.md +235 -0
package/templates/skills/knowledge/verification-before-completion/SKILL.md +157 -0

package/CHANGELOG.md CHANGED Viewed

@@ -5,6 +5,57 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [1.2.8] - 2026-02-14
+### Added
+- **Superpowers Adaptation**: Adapted best patterns from [obra/superpowers](https://github.com/obra/superpowers) into the agi framework, extending it with structured execution, TDD enforcement, and verification gates while preserving all existing multi-platform, memory, and agent capabilities.
+  **New Skills:**
+  - **`executing-plans`**: Structured plan execution with two modes — Batch (3 tasks + human checkpoint) or Subagent-Driven (fresh agent per task + two-stage review: spec compliance then code quality). Platform-adaptive across Claude Code, Gemini, Kiro, and Opencode.
+  - **`test-driven-development`**: Iron-law TDD enforcement with RED-GREEN-REFACTOR cycle, rationalization prevention table, and verification checklist. "No production code without a failing test first."
+  - **`verification-before-completion`**: Universal evidence gate — no completion claims without fresh verification output. Integrates with agi audit scripts (`checklist.py`, `security_scan.py`, `lint_runner.py`, etc.).
+### Changed
+- **`systematic-debugging`**: Replaced lightweight 110-line version with comprehensive 4-phase methodology (Root Cause Investigation → Pattern Analysis → Hypothesis Testing → Implementation). Includes iron law, multi-component evidence gathering, rationalization table, and real-world impact stats (95% vs 40% first-time fix rate).
+- **`plan-writing`**: Added TDD step structure template (write test → verify fail → implement → verify pass → commit), bite-sized task granularity (2-5 min per step), and execution handoff to `executing-plans` skill.
+- **`brainstorming`**: Added HARD-GATE (no code before design approval), propose 2-3 approaches step, and design document saving with handoff to `plan-writing`.
+- **`orchestrator`** agent: Added two-stage review protocol (spec compliance then code quality), execution mode selection (batch vs subagent), verification gate, and referenced skills table.
+- **`parallel-agents`**: Added focused agent prompt structure template, explicit "when NOT to use" heuristic, and review-and-integrate protocol with `verification-before-completion` gate.
+- **`debugger`** agent: Added enhanced skills integration table referencing `systematic-debugging`, `test-driven-development`, and `verification-before-completion`.
+- **`SKILLS_CATALOG.md`**: Regenerated with 45 template skills (3 new: `executing-plans`, `test-driven-development`, `verification-before-completion`).
+### Why This Is More Complete Than Superpowers
+> All superpowers patterns were adopted. The agi framework extends them with capabilities superpowers does not have.
+| Capability                    |  obra/superpowers   |                  agi Framework                  |
+| ----------------------------- | :-----------------: | :---------------------------------------------: |
+| TDD Enforcement               |         ✅          |                  ✅ (adapted)                   |
+| Plan Execution with Review    |         ✅          |        ✅ (adapted + platform-adaptive)         |
+| Systematic Debugging          |         ✅          |   ✅ (adapted + `debugger` agent integration)   |
+| Verification Gates            |         ✅          |      ✅ (adapted + agi script integration)      |
+| Two-Stage Code Review         |         ✅          |         ✅ (adapted into orchestrator)          |
+| Git Worktrees                 |         ✅          |           ➖ (standard branches used)           |
+| Multi-Platform Orchestration  | ❌ Claude Code only | ✅ 4 platforms (Claude, Gemini, Kiro, Opencode) |
+| Semantic Memory (Qdrant)      |         ❌          |            ✅ 90-100% token savings             |
+| 19 Specialist Agents          |         ❌          |          ✅ Domain-specific boundaries          |
+| Agent Boundary Enforcement    |         ❌          |         ✅ File-type ownership protocol         |
+| Dynamic Question Generation   |         ❌          |         ✅ Trade-off tables, priorities         |
+| 12 Audit/Verification Scripts |         ❌          |     ✅ Security, lint, UX, SEO, Lighthouse      |
+| Memory-First Protocol         |         ❌          |          ✅ Auto cache-hit before work          |
+| Skill Creator + Catalog       |         ❌          |            ✅ 45+ composable skills             |
+| Platform Setup Wizard         |         ❌          |         ✅ One-shot auto-configuration          |
+## [1.2.7] - 2026-02-11
+### Security & Maintenance
+- **Repository Sanitization**: Removed all internal development configurations (`.agent`, `.claude`, `.gemini`) and private skills from the public repository.
+- **Workflow Hardening**: Enforced fork-and-PR workflow in `CONTRIBUTING.md`.
+- **Community Standards**: Added issue templates and PR templates.
 ## [1.2.6] - 2026-02-10
 ### Added

package/README.md CHANGED Viewed

@@ -7,7 +7,7 @@
 `@techwavedev/agi-agent-kit` is a modular, deterministic framework designed to bridge the gap between LLM reasoning and reliable production execution. It scaffolds a "3-Layer Architecture" (Intent → Orchestration → Execution) that forces agents to use tested scripts rather than hallucinating code.
-**v1.2.6** — Now with platform-adaptive orchestration and integrated semantic memory across Claude Code, Kiro IDE, Gemini, and Opencode.
+**v1.2.8** — Now with structured plan execution, TDD enforcement, verification gates (adapted from [obra/superpowers](https://github.com/obra/superpowers)), plus platform-adaptive orchestration and semantic memory across Claude Code, Kiro IDE, Gemini, and Opencode.
 ---
@@ -48,16 +48,40 @@ This checks Qdrant, Ollama, embedding models, and collections — auto-fixing an
 | Feature                       | Description                                                                 |
 | ----------------------------- | --------------------------------------------------------------------------- |
 | **Deterministic Execution**   | Separates business logic (Python scripts) from AI reasoning (Directives)    |
-| **Modular Skill System**      | 56 plug-and-play skills that can be added or removed instantly              |
+| **Modular Skill System**      | 45+ plug-and-play skills that can be added or removed instantly             |
+| **Structured Plan Execution** | Batch or subagent-driven execution with two-stage review (spec + quality)   |
+| **TDD Enforcement**           | Iron-law RED-GREEN-REFACTOR cycle — no production code without failing test |
+| **Verification Gates**        | Evidence before claims — no completion without fresh verification output    |
 | **Platform-Adaptive**         | Auto-detects and optimizes for Claude Code, Kiro IDE, Gemini, and Opencode  |
 | **Multi-Agent Orchestration** | Agent Teams, subagents, Powers, or sequential personas — adapts to platform |
 | **Semantic Memory**           | Built-in Qdrant-powered memory with 95% token savings via caching           |
-| **Deep RAG (NotebookLM)**     | Opt-in autonomous research via Google NotebookLM + Gemini, fully MCP-driven |
 | **Self-Healing Workflows**    | Agents read error logs, patch scripts, and update directives automatically  |
 | **One-Shot Setup**            | Platform detection + project stack scan + auto-configuration in one command |
 ---
+## 🆚 How This Compares to Superpowers
+The agi framework adopts all best patterns from [obra/superpowers](https://github.com/obra/superpowers) and extends them with capabilities superpowers does not have:
+| Capability                   | obra/superpowers |         agi Framework          |
+| ---------------------------- | :--------------: | :----------------------------: |
+| TDD Enforcement              |        ✅        |           ✅ Adapted           |
+| Plan Execution + Review      |        ✅        | ✅ Adapted + platform-adaptive |
+| Systematic Debugging         |        ✅        | ✅ Adapted + `debugger` agent  |
+| Verification Gates           |        ✅        | ✅ Adapted + 12 audit scripts  |
+| Two-Stage Code Review        |        ✅        |  ✅ Adapted into orchestrator  |
+| Multi-Platform Orchestration |  ❌ Claude only  |         ✅ 4 platforms         |
+| Semantic Memory (Qdrant)     |        ❌        |    ✅ 90-100% token savings    |
+| 19 Specialist Agents         |        ❌        |      ✅ Domain boundaries      |
+| Agent Boundary Enforcement   |        ❌        |     ✅ File-type ownership     |
+| Dynamic Question Generation  |        ❌        |   ✅ Trade-offs + priorities   |
+| Memory-First Protocol        |        ❌        |       ✅ Auto cache-hit        |
+| Skill Creator + Catalog      |        ❌        |    ✅ 45+ composable skills    |
+| Platform Setup Wizard        |        ❌        |       ✅ One-shot config       |
+---
 ## 🌐 Platform Support
 The framework automatically detects your AI coding environment and activates the best available features:
@@ -276,24 +300,30 @@ Use these keywords, commands, and phrases to trigger specific capabilities:
 ### Skill Trigger Keywords (Natural Language)
-| Category          | Trigger Words / Phrases                                               | Skill Activated                     |
-| ----------------- | --------------------------------------------------------------------- | ----------------------------------- |
-| **Memory**        | "don't use cache", "no cache", "skip memory", "fresh"                 | Memory opt-out                      |
-| **Research**      | "research my docs", "deep search", "@notebooklm", "query my notebook" | `notebooklm-rag` (Deep RAG)         |
-| **Documentation** | "update docs", "regenerate catalog", "sync documentation"             | `documentation`                     |
-| **Quality**       | "lint", "format", "check", "validate", "static analysis"              | `lint-and-validate`                 |
-| **Testing**       | "write tests", "run tests", "TDD", "test coverage"                    | `testing-patterns` / `tdd-workflow` |
-| **Architecture**  | "design system", "architecture decision", "ADR", "trade-off"          | `architecture`                      |
-| **Security**      | "security scan", "vulnerability", "audit", "OWASP"                    | `red-team-tactics`                  |
-| **Performance**   | "lighthouse", "bundle size", "core web vitals", "profiling"           | `performance-profiling`             |
-| **Design**        | "design UI", "color scheme", "typography", "layout"                   | `frontend-design`                   |
-| **Deployment**    | "deploy", "rollback", "release", "CI/CD"                              | `deployment-procedures`             |
-| **API**           | "REST API", "GraphQL", "tRPC", "API design"                           | `api-patterns`                      |
-| **Database**      | "schema design", "migration", "query optimization"                    | `database-design`                   |
-| **Planning**      | "plan this", "break down", "task list", "requirements"                | `plan-writing`                      |
-| **Brainstorming** | "explore options", "what are the approaches", "pros and cons"         | `brainstorming`                     |
-| **Code Review**   | "review this", "code quality", "best practices"                       | `code-review-checklist`             |
-| **i18n**          | "translate", "localization", "RTL", "locale"                          | `i18n-localization`                 |
+| Category           | Trigger Words / Phrases                                                | Skill Activated                     |
+| ------------------ | ---------------------------------------------------------------------- | ----------------------------------- |
+| **Memory**         | "don't use cache", "no cache", "skip memory", "fresh"                  | Memory opt-out                      |
+| **Research**       | "research my docs", "check my notebooks", "deep search", "@notebooklm" | `notebooklm-rag`                    |
+| **Documentation**  | "update docs", "regenerate catalog", "sync documentation"              | `documentation`                     |
+| **Quality**        | "lint", "format", "check", "validate", "static analysis"               | `lint-and-validate`                 |
+| **Testing**        | "write tests", "run tests", "TDD", "test coverage"                     | `testing-patterns` / `tdd-workflow` |
+| **TDD**            | "test first", "red green refactor", "failing test"                     | `test-driven-development`           |
+| **Plan Execution** | "execute plan", "run the plan", "batch execution"                      | `executing-plans`                   |
+| **Verification**   | "verify", "prove it works", "evidence", "show me the output"           | `verification-before-completion`    |
+| **Debugging**      | "debug", "root cause", "investigate", "why is this failing"            | `systematic-debugging`              |
+| **Architecture**   | "design system", "architecture decision", "ADR", "trade-off"           | `architecture`                      |
+| **Security**       | "security scan", "vulnerability", "audit", "OWASP"                     | `red-team-tactics`                  |
+| **Performance**    | "lighthouse", "bundle size", "core web vitals", "profiling"            | `performance-profiling`             |
+| **Design**         | "design UI", "color scheme", "typography", "layout"                    | `frontend-design`                   |
+| **Deployment**     | "deploy", "rollback", "release", "CI/CD"                               | `deployment-procedures`             |
+| **API**            | "REST API", "GraphQL", "tRPC", "API design"                            | `api-patterns`                      |
+| **Database**       | "schema design", "migration", "query optimization"                     | `database-design`                   |
+| **Planning**       | "plan this", "break down", "task list", "requirements"                 | `plan-writing`                      |
+| **Brainstorming**  | "explore options", "what are the approaches", "pros and cons"          | `brainstorming`                     |
+| **Code Review**    | "review this", "code quality", "best practices"                        | `code-review-checklist`             |
+| **i18n**           | "translate", "localization", "RTL", "locale"                           | `i18n-localization`                 |
+| **AWS**            | "terraform", "EKS", "Lambda", "S3", "CloudFront"                       | `aws` / `aws-terraform`             |
+| **Infrastructure** | "Consul", "service mesh", "OpenSearch"                                 | `consul` / `opensearch`             |
 ### Memory System Commands

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@techwavedev/agi-agent-kit",
-  "version": "1.2.6",
+  "version": "1.2.8",
   "description": "Enterprise-Grade Agentic Framework - Modular skill-based AI assistant toolkit with deterministic execution, semantic memory, and platform-adaptive orchestration.",
   "bin": {
     "agi-agent-kit": "./bin/init.js"

package/templates/base/CHANGELOG.md ADDED Viewed

@@ -0,0 +1,315 @@
+# Changelog
+All notable changes to this project will be documented in this file.
+The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [1.2.8] - 2026-02-14
+### Added
+- **Superpowers Adaptation**: Adapted best patterns from [obra/superpowers](https://github.com/obra/superpowers) into the agi framework, extending it with structured execution, TDD enforcement, and verification gates while preserving all existing multi-platform, memory, and agent capabilities.
+  **New Skills:**
+  - **`executing-plans`**: Structured plan execution with two modes — Batch (3 tasks + human checkpoint) or Subagent-Driven (fresh agent per task + two-stage review: spec compliance then code quality). Platform-adaptive across Claude Code, Gemini, Kiro, and Opencode.
+  - **`test-driven-development`**: Iron-law TDD enforcement with RED-GREEN-REFACTOR cycle, rationalization prevention table, and verification checklist. "No production code without a failing test first."
+  - **`verification-before-completion`**: Universal evidence gate — no completion claims without fresh verification output. Integrates with agi audit scripts (`checklist.py`, `security_scan.py`, `lint_runner.py`, etc.).
+### Changed
+- **`systematic-debugging`**: Replaced lightweight 110-line version with comprehensive 4-phase methodology (Root Cause Investigation → Pattern Analysis → Hypothesis Testing → Implementation). Includes iron law, multi-component evidence gathering, rationalization table, and real-world impact stats (95% vs 40% first-time fix rate).
+- **`plan-writing`**: Added TDD step structure template (write test → verify fail → implement → verify pass → commit), bite-sized task granularity (2-5 min per step), and execution handoff to `executing-plans` skill.
+- **`brainstorming`**: Added HARD-GATE (no code before design approval), propose 2-3 approaches step, and design document saving with handoff to `plan-writing`.
+- **`orchestrator`** agent: Added two-stage review protocol (spec compliance then code quality), execution mode selection (batch vs subagent), verification gate, and referenced skills table.
+- **`parallel-agents`**: Added focused agent prompt structure template, explicit "when NOT to use" heuristic, and review-and-integrate protocol with `verification-before-completion` gate.
+- **`debugger`** agent: Added enhanced skills integration table referencing `systematic-debugging`, `test-driven-development`, and `verification-before-completion`.
+- **`SKILLS_CATALOG.md`**: Regenerated with 45 template skills (3 new: `executing-plans`, `test-driven-development`, `verification-before-completion`).
+### Why This Is More Complete Than Superpowers
+> All superpowers patterns were adopted. The agi framework extends them with capabilities superpowers does not have.
+| Capability                    |  obra/superpowers   |                  agi Framework                  |
+| ----------------------------- | :-----------------: | :---------------------------------------------: |
+| TDD Enforcement               |         ✅          |                  ✅ (adapted)                   |
+| Plan Execution with Review    |         ✅          |        ✅ (adapted + platform-adaptive)         |
+| Systematic Debugging          |         ✅          |   ✅ (adapted + `debugger` agent integration)   |
+| Verification Gates            |         ✅          |      ✅ (adapted + agi script integration)      |
+| Two-Stage Code Review         |         ✅          |         ✅ (adapted into orchestrator)          |
+| Git Worktrees                 |         ✅          |           ➖ (standard branches used)           |
+| Multi-Platform Orchestration  | ❌ Claude Code only | ✅ 4 platforms (Claude, Gemini, Kiro, Opencode) |
+| Semantic Memory (Qdrant)      |         ❌          |            ✅ 90-100% token savings             |
+| 19 Specialist Agents          |         ❌          |          ✅ Domain-specific boundaries          |
+| Agent Boundary Enforcement    |         ❌          |         ✅ File-type ownership protocol         |
+| Dynamic Question Generation   |         ❌          |         ✅ Trade-off tables, priorities         |
+| 12 Audit/Verification Scripts |         ❌          |     ✅ Security, lint, UX, SEO, Lighthouse      |
+| Memory-First Protocol         |         ❌          |          ✅ Auto cache-hit before work          |
+| Skill Creator + Catalog       |         ❌          |            ✅ 45+ composable skills             |
+| Platform Setup Wizard         |         ❌          |         ✅ One-shot auto-configuration          |
+## [1.2.7] - 2026-02-11
+### Security & Maintenance
+- **Repository Sanitization**: Removed all internal development configurations (`.agent`, `.claude`, `.gemini`) and private skills from the public repository.
+- **Workflow Hardening**: Enforced fork-and-PR workflow in `CONTRIBUTING.md`.
+- **Community Standards**: Added issue templates and PR templates.
+## [1.2.6] - 2026-02-10
+### Added
+- **notebooklm-rag (Deep RAG)**: MCP-first autonomous knowledge backend powered by Google NotebookLM + Gemini. The agent fully manages NotebookLM via MCP tools — authentication, library CRUD, querying with auto follow-ups, and Qdrant caching for token savings and cross-session context keeping. Includes fallback Python scripts (Patchright browser automation) when MCP is unavailable. Opt-in for users with Google accounts; default RAG remains `qdrant-memory`. Based on [PleasePrompto/notebooklm-skill](https://github.com/PleasePrompto/notebooklm-skill) (MIT).
+### Removed
+- **notebooklm-mcp**: Merged into `notebooklm-rag`. The comprehensive Deep RAG skill supersedes the basic MCP connector.
+### Fixed
+- **npmignore**: Fixed deep-path glob patterns (`**/__pycache__/`, `**/*.pyc`, `**/data/`) preventing compiled Python files and sensitive data from leaking into the published NPM package.
+## [1.2.5] - 2026-02-09
+### Fixed
+- **Documentation Updates**: README updated to reflect current version and removed internal skill references.
+## [1.2.4] - 2026-02-09
+### Fixed
+- **Minor bug fixes**: Addressed issues from v1.2.3 release.
+## [1.2.3] - 2026-02-09
+### Added
+- **Auto Python Virtual Environment**: `npx init` now auto-creates `.venv/` and installs all dependencies — eliminates the `externally-managed-environment` error on macOS.
+- **Auto Platform Setup**: `npx init` runs `platform_setup.py --auto` after venv creation, pre-configuring platform-specific settings (`.claude/settings.json`, `.claude/skills/`) and showing remaining manual steps.
+- **Activation Reference Table** (README): Comprehensive reference with 4 sections — 14 slash commands, 8 `@agent` mentions, 18 natural language trigger keyword categories, and 6 memory system commands.
+- **Semantic Memory Section** (README): Setup guide with Qdrant + Ollama commands, token savings table (90-100% savings), and usage examples.
+- **`pyright`** added to `requirements.txt` for Python type checking out of the box.
+### Changed
+- **`requirements.txt`**: Expanded from 6 to 12 packages across 5 organized sections (Core, Memory, Embeddings, Cloud, Testing & Auditing). Every dependency is documented with inline comments.
+- **`bin/init.js`**: Added `setupPythonEnv()` and `runPlatformSetup()` functions. Uses `child_process.execSync` for venv creation and `pip install`. Cross-platform support (macOS/Linux/Windows paths).
+- **README Prerequisites**: No longer tells users to `pip install` manually — now references auto-created `.venv` with activation command.
+- **Post-install message**: Step 1 is now "Activate the Python environment" instead of "Install Python dependencies."
+- **Templates synced**: `requirements.txt`, `README.md` copied to `templates/base/` for NPX distribution.
+- **Documentation SKILL.md**: `Last Updated` timestamp refreshed to 2026-02-09.
+- **SKILLS_CATALOG.md**: Regenerated with 56 skills.
+### Fixed
+- **`ModuleNotFoundError: No module named 'yaml'`**: `pyyaml` was missing from `requirements.txt` — `system_checkup.py` crashed on fresh installs. Now included in core dependencies.
+- **Missing dependencies**: `gitpython`, `ollama`, `sentence-transformers`, `playwright` were used by skills but not listed in `requirements.txt`. All now included.
+- **`--auto-apply` flag**: `init.js` called `platform_setup.py` with non-existent `--auto-apply` flag — fixed to use existing `--auto` flag.
+## [1.2.2] - 2026-02-09
+### Added
+- **Platform-Adaptive Multi-Agent Orchestration** (`parallel-agents` v2.0):
+  - **Strategy A: Claude Code Agent Teams** — True parallel multi-agent orchestration via tmux/in-process sessions. Requires `CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1`.
+  - **Strategy B: Claude Code Subagents** — Background/foreground task orchestration via `Task()` tool with `context: fork`.
+  - **Strategy C: Sequential Personas** — Universal fallback for Gemini, Opencode, and other platforms using `@agent` persona switching.
+  - **Strategy D: Kiro Autonomous Agent** — Async parallel task execution in sandboxed environments with PR-based delivery and cross-repo coordination.
+  - Automatic platform detection and strategy selection.
+  - Subagent configuration reference (frontmatter, tools, permissions, model, memory).
+  - Full Claude Code subagent lifecycle documentation (create, configure, invoke, manage).
+- **Kiro IDE Powers & Autonomous Agent Support**:
+  - Full Kiro Powers documentation: `POWER.md` anatomy, frontmatter keywords activation, onboarding sections, steering files, `mcp.json` configuration with auto-namespacing.
+  - Kiro Hooks system (`.kiro/hooks/`) with JSON structure and automated behaviors.
+  - Kiro Autonomous Agent: sandboxed execution, PR-based reviews, cross-repo coordination, learning from code reviews.
+  - 12 curated launch partner Powers documented (Figma, Supabase, Stripe, Neon, Netlify, Postman, Strands, Datadog, Dynatrace, AWS CDK, Terraform, Aurora).
+  - **Antigravity ↔ Kiro mapping guide**: Skill-to-Power conversion workflow, directory mapping, frontmatter translation.
+- **Plugin & Extension Auto-Discovery** (`plugin-discovery` v1.1.0 — NEW SKILL):
+  - Cross-platform extension auto-discovery for Claude Code, Kiro IDE, Gemini, and Opencode.
+  - Claude Code: Plugin marketplace guide (official + custom), LSP plugins by language, service integrations, scopes, subagent/skill discovery.
+  - Kiro: Full Powers discovery, installation guide (IDE, kiro.dev, GitHub, local path), setup checklist.
+  - Gemini/Opencode: Skills catalog discovery, MCP server detection.
+  - Cross-platform compatibility map (11 features × 4 platforms).
+- **One-Shot Platform Setup Wizard** (`platform_setup.py`):
+  - Auto-detects platform: Claude Code, Kiro IDE, Gemini, Opencode.
+  - Scans project tech stack: languages (JS/TS/Python/Go/Rust/Ruby), frameworks (Next.js/React/Vue/Express/Angular/Svelte/Astro/React Native), services (GitHub/GitLab/Docker/Vercel/Netlify/Stripe/Supabase/Firebase/Terraform).
+  - Generates prioritized recommendations with `🔴 High` / `🟡 Medium` / `🟢 Low` indicators.
+  - Auto-applies configurable settings (Agent Teams, directory creation, hook setup) with single `Y/n` confirmation.
+  - Shows manual instructions for platform-only actions (plugin installs, Power installs).
+  - 4 modes: interactive, `--auto`, `--dry-run`, `--json`.
+  - New `/setup` workflow with `// turbo` annotation for auto-run.
+- **Intelligent Routing v2.0** (`intelligent-routing` v2.0):
+  - Platform detection at session start (Claude Code, Kiro IDE, Gemini, Opencode).
+  - Proactive capability announcements per platform.
+  - Team Leader mode for Claude Code with Agent Teams enabled.
+  - Powers-driven orchestration mode for Kiro IDE.
+  - Proactive feature recommendations: suggests Agent Teams, plugins, Powers, Autonomous Agent when not enabled.
+  - Multi-domain task routing adapts to best available parallelism strategy.
+- **Memory System Integration** (`session_boot.py` — NEW SCRIPT):
+  - `execution/session_boot.py` — Single entry point for session initialization. Checks Qdrant, Ollama, embedding models, and collections in one command. `--auto-fix` flag auto-pulls missing models and creates collections.
+  - **Session Boot Protocol** added as the **first section** in `AGENTS.md` — agents run `python3 execution/session_boot.py --auto-fix` before any work begins.
+  - `AGENTS.md` Memory-First section rewritten with explicit CLI commands: `memory_manager.py auto`, `store`, `cache-store` with decision tree table.
+  - `platform_setup.py` now detects full memory system: Qdrant status, Ollama status, embedding model presence, collection existence, point counts.
+  - Memory recommendations engine: suggests starting Qdrant/Ollama, pulling embedding model, or initializing collections when issues are detected.
+  - Memory report section (🧠) in platform setup output shows live system status.
+### Changed
+- **`parallel-agents`**: Rewritten from v1.0 to v2.0 — now platform-adaptive with 4 orchestration strategies instead of 1.
+- **`intelligent-routing`**: Rewritten from v1.0 to v2.0 — now includes platform detection, proactive recommendations, and Kiro support.
+- **Cross-Platform Compatibility Map**: Updated with 2 new rows (Dynamic MCP Loading, Cross-Repo Tasks) and accurate Kiro feature coverage.
+- **`package.json`**: Added `kiro`, `opencode`, and `platform-adaptive` keywords for NPM discoverability.
+- **Templates**: All modified skills (`parallel-agents`, `intelligent-routing`, `plugin-discovery`) and scripts (`platform_setup.py`) synced to `templates/skills/knowledge/` for NPX distribution.
+- **SKILLS_CATALOG.md**: Regenerated with 56 skills, including the new `plugin-discovery` skill.
+- **`bin/init.js`**: NPX init now copies `execution/` scripts (session_boot.py, session_init.py, memory_manager.py) and `directives/` (memory_integration.md) to scaffolded projects. Update function also refreshes these files.
+- **Post-install message**: Now shows `session_boot.py --auto-fix` as step 3 after installation.
+- **`/setup-memory` workflow**: Fixed stale references, now uses correct scripts (session_init.py, memory_manager.py).
+- **`/setup` workflow**: Added memory system troubleshooting section with Docker/Ollama/collection init commands.
+### Fixed
+- **Stale script references**: `/setup-memory` workflow referenced non-existent `init_memory_system.py` and `memory_middleware.py` — now correctly uses `session_init.py` and `memory_manager.py`.
+- **Memory system unused**: Despite being installed, no agent instructions explicitly told agents WHEN to call the memory scripts. Now enforced via Session Boot Protocol at top of AGENTS.md.
+- **`platform_setup.py` missing memory**: Setup wizard reported platform features but ignored Qdrant/Ollama status — now detects and recommends fixes.
+- **`info` action missing**: Platform setup wizard didn't handle informational recommendations (e.g., "collections are empty") — added `info` action type with ℹ️ status icon.
+- **Templates missing execution scripts**: NPX init created the `execution/` directory but didn't copy any scripts into it — now ships session_boot.py, session_init.py, and memory_manager.py.
+## [1.1.7] - 2026-02-07
+### Added
+- **NotebookLM RAG Skill** (EC Scoped): New `notebooklm-rag` skill providing deep-search RAG capabilities powered by Google NotebookLM + Gemini 2.5. Complementary to `qdrant-memory` for explicit, document-grounded research.
+  - **B.L.A.S.T. Protocol**: Structured research workflow (Browse → Load → Ask → Synthesize → Transfer).
+  - **4 Research Modes**: Quick (single query), Deep (3-5 iterative), Cross-Ref (multi-notebook), Plan (doc-grounded planning).
+  - **Confidence Classification**: Automatic HIGH/MEDIUM/LOW categorization with source attribution.
+  - `scripts/research_query.py` — Generates structured research questions by mode.
+  - `scripts/research_report.py` — Formats findings into Markdown/JSON reports with confidence levels and knowledge gap analysis.
+  - `scripts/preflight_check.py` — Pre-flight validation checklist for MCP server, auth, and library.
+  - `references/research_patterns.md` — Workflow templates, query optimization, rate limit strategies.
+- **Auto-Update in Skill Creator**: `init_skill.py` now auto-runs `update_catalog.py` and `sync_docs.py` after creating a new skill.
+  - Default ON — use `--no-auto-update` to skip.
+  - New `--skills-dir` flag for specifying catalog update target directory.
+  - Non-fatal: graceful warnings if documentation skill is not installed.
+### Changed
+- **NotebookLM MCP Skill**: Migrated to `PleasePrompto/notebooklm-mcp` browser-automated implementation with full auth, library management, and stealth mode support.
+- **Skill Creator Documentation** (`SKILL_skillcreator.md`):
+  - Step 3: Documents new auto-update behavior and `--no-auto-update` / `--skills-dir` flags.
+  - Step 6: Notes that manual catalog update is now only needed for modifications/deletions.
+- **SKILLS_CATALOG.md**: Regenerated with 55 skills including `notebooklm-rag`.
+### Fixed
+- **`init_skill.py` argument parsing**: Migrated from fragile positional parsing (`sys.argv`) to robust `argparse` with proper help text and validation.
+- **`sync_docs.py` invocation**: Fixed `--update-catalog` flag to pass `"true"` as required value (not just the bare flag).
+## [1.1.6] - 2026-02-07
+### Added
+- **37 Knowledge Skills Promoted to Root**: All knowledge-pack skills now live at `skills/` root alongside core skills for unified access:
+  - `api-patterns`, `app-builder`, `architecture`, `bash-linux`, `behavioral-modes`, `brainstorming`, `clean-code`, `code-review-checklist`, `database-design`, `deployment-procedures`, `documentation-templates`, `frontend-design`, `game-development`, `geo-fundamentals`, `i18n-localization`, `intelligent-routing`, `lint-and-validate`, `mcp-builder`, `mobile-design`, `nextjs-best-practices`, `nodejs-best-practices`, `parallel-agents`, `performance-profiling`, `plan-writing`, `powershell-windows`, `python-patterns`, `react-patterns`, `red-team-tactics`, `seo-fundamentals`, `server-management`, `systematic-debugging`, `tailwind-patterns`, `tdd-workflow`, `testing-patterns`, `vulnerability-scanner`, `webapp-testing`.
+- **NotebookLM MCP Skill**: New `notebooklm-mcp` skill to interact with Google NotebookLM via Model Context Protocol, with documentation, example script, and reference assets.
+- **Opencode Support**: Added `OPENCODE.md` configuration file for Opencode editor compatibility with `opencode-antigravity-auth` plugin.
+- **Execution Scripts**: Two new deterministic execution scripts:
+  - `memory_manager.py` — Centralized memory management for Qdrant operations.
+  - `session_init.py` — Session initialization and context bootstrapping.
+- **Stitch Skills in Templates**: `design-md`, `react-components`, and `stitch-loop` now included in knowledge templates (`templates/skills/knowledge/`) for NPX distribution.
+- **Self-Update in Templates**: `self-update` skill with `update_kit.py` added to knowledge templates.
+- **Knowledge SKILLS_CATALOG**: New comprehensive `SKILLS_CATALOG.md` inside `templates/skills/knowledge/` for template-level skill discovery.
+### Changed
+- **AGENTS.md**: Added "Getting Started" section with installation, dependency, and update instructions.
+- **Memory Integration Directive**: Complete rewrite of `directives/memory_integration.md` — simplified from 211 lines to 95 lines with clearer goal/inputs/execution structure and Ollama embedding documentation.
+- **Skill Creator**: Updated `SKILL_skillcreator.md` with refined skill creation guidelines.
+- **SKILLS_CATALOG.md**: Expanded with 560+ lines of new skill entries covering all promoted knowledge skills.
+- **Documentation Skill**: Updated `skills/documentation/SKILL.md` with improved detection and sync logic.
+- **Template Hierarchy Restructured**: Shifted standalone template skills (`design-md`, `react-components`, `stitch-loop`) into `templates/skills/knowledge/` for consistent NPX packaging.
+### Improved
+- **Core Skill Definitions**: Refined SKILL.md files across `pdf-reader`, `qdrant-memory`, and `webcrawler` with updated descriptions, triggers, and references.
+- **Qdrant Memory Scripts**: Updated all 7 scripts (`benchmark_token_savings.py`, `embedding_utils.py`, `hybrid_search.py`, `init_collection.py`, `memory_retrieval.py`, `semantic_cache.py`, `test_skill.py`) with improved patterns.
+- **Knowledge Skill Documentation**: Mass update of SKILL.md definitions across 36+ knowledge skills with enhanced frontmatter, triggers, and reference content in templates.
+- **Audit Scripts**: Updated `ux_audit.py`, `security_scan.py`, `lighthouse_audit.py`, `test_runner.py`, `playwright_runner.py`, and `seo_checker.py` in templates.
+### Fixed
+- **Version Bump**: Package version correctly set to `1.1.6` in `package.json`.
+- **Template Cleanup**: Removed duplicate standalone Stitch skill templates (`templates/skills/design-md/`, `templates/skills/react-components/`, `templates/skills/stitch-loop/`), consolidated into `templates/skills/knowledge/`.
+- **Removed `verify_public_release.py`**: Deprecated top-level verification script removed (functionality integrated into CI/CD pipeline).
+## [1.1.5] - 2026-01-27
+### Added
+- **Stitch Skills Suite**: Added three new skills from Google Stitch:
+  - `design-md`: Analyzes Stitch projects to synthesize semantic `DESIGN.md` systems.
+  - `react-components`: Converts Stitch screens into modular, validating React components.
+  - `stitch-loop`: Orchestrates autonomous iterative website building loops.
+- **Memory Integration**: Integrated `qdrant-memory` into the Stitch skills suite for:
+  - Storing design systems for future retrieval (`design-md`).
+  - Retrieving code patterns and interfaces (`react-components`).
+  - Leveraging past decisions and project context (`stitch-loop`).
+## [1.1.2] - 2026-01-23
+### Added
+- **Self-Update Skill**: New `self-update` skill with `update_kit.py` script for easy framework updates.
+- **System Checkup**: New `system_checkup.py` execution script to verify agents, skills, workflows, and scripts.
+- **Workflows**: Added `/checkup` and `/update` workflows for quick access.
+### Changed
+- Updated `README.md` with comprehensive Quick Start, Commands, and Architecture sections.
+- Updated `SKILLS_CATALOG.md` to include self-update skill.
+## [1.1.01] - 2026-01-23
+### Fixed
+- Stabilized Full Suite activation.
+- Micro-bump for polish releases.
+## [1.1.0] - 2026-01-23
+### Added
+- **Knowledge Pack**: Imported 36+ new skills from `agents-web` including:
+  - `api-patterns`: Best practices for REST/GraphQL/tRPC.
+  - `frontend-design`: UX/UI principles and audit tools.
+  - `security-auditor`: Vulnerability scanning and red-team tactics.
+  - `mobile-design`: iOS/Android development patterns.
+- **Agent Roles**: Added `project-planner`, `orchestrator`, and `security-auditor` agent personas.
+- **Rules**: Added global `deployment_policy` and `clean-code` standards.
+### Changed
+- Refactored `init` command to support `knowledge` pack.
+- Enhanced `AGENTS.md` with new agent capabilities.
+- Updated `SKILLS_CATALOG.md` with full list of new skills.
+## [1.0.1] - 2026-01-23
+### Fixed
+- **Security**: Removed all references to private infrastructure from public templates.
+- **Safety**: Added `verify_public_release.py` to prevent accidental publication of private secrets.
+- **Menu**: Fixed `init` menu showing internal options.
+## [0.1.0] - 2026-01-23
+### Initial Release
+- Core framework with `webcrawler`, `pdf-reader`, and `qdrant-memory`.
+- CLI tool `agi-agent-kit` for scaffolding.

package/templates/base/README.md CHANGED Viewed

@@ -7,7 +7,7 @@
 `@techwavedev/agi-agent-kit` is a modular, deterministic framework designed to bridge the gap between LLM reasoning and reliable production execution. It scaffolds a "3-Layer Architecture" (Intent → Orchestration → Execution) that forces agents to use tested scripts rather than hallucinating code.
-**v1.2.6** — Now with platform-adaptive orchestration and integrated semantic memory across Claude Code, Kiro IDE, Gemini, and Opencode.
+**v1.2.8** — Now with structured plan execution, TDD enforcement, verification gates (adapted from [obra/superpowers](https://github.com/obra/superpowers)), plus platform-adaptive orchestration and semantic memory across Claude Code, Kiro IDE, Gemini, and Opencode.
 ---
@@ -48,16 +48,40 @@ This checks Qdrant, Ollama, embedding models, and collections — auto-fixing an
 | Feature                       | Description                                                                 |
 | ----------------------------- | --------------------------------------------------------------------------- |
 | **Deterministic Execution**   | Separates business logic (Python scripts) from AI reasoning (Directives)    |
-| **Modular Skill System**      | 56 plug-and-play skills that can be added or removed instantly              |
+| **Modular Skill System**      | 45+ plug-and-play skills that can be added or removed instantly             |
+| **Structured Plan Execution** | Batch or subagent-driven execution with two-stage review (spec + quality)   |
+| **TDD Enforcement**           | Iron-law RED-GREEN-REFACTOR cycle — no production code without failing test |
+| **Verification Gates**        | Evidence before claims — no completion without fresh verification output    |
 | **Platform-Adaptive**         | Auto-detects and optimizes for Claude Code, Kiro IDE, Gemini, and Opencode  |
 | **Multi-Agent Orchestration** | Agent Teams, subagents, Powers, or sequential personas — adapts to platform |
 | **Semantic Memory**           | Built-in Qdrant-powered memory with 95% token savings via caching           |
-| **Deep RAG (NotebookLM)**     | Opt-in autonomous research via Google NotebookLM + Gemini, fully MCP-driven |
 | **Self-Healing Workflows**    | Agents read error logs, patch scripts, and update directives automatically  |
 | **One-Shot Setup**            | Platform detection + project stack scan + auto-configuration in one command |
 ---
+## 🆚 How This Compares to Superpowers
+The agi framework adopts all best patterns from [obra/superpowers](https://github.com/obra/superpowers) and extends them with capabilities superpowers does not have:
+| Capability                   | obra/superpowers |         agi Framework          |
+| ---------------------------- | :--------------: | :----------------------------: |
+| TDD Enforcement              |        ✅        |           ✅ Adapted           |
+| Plan Execution + Review      |        ✅        | ✅ Adapted + platform-adaptive |
+| Systematic Debugging         |        ✅        | ✅ Adapted + `debugger` agent  |
+| Verification Gates           |        ✅        | ✅ Adapted + 12 audit scripts  |
+| Two-Stage Code Review        |        ✅        |  ✅ Adapted into orchestrator  |
+| Multi-Platform Orchestration |  ❌ Claude only  |         ✅ 4 platforms         |
+| Semantic Memory (Qdrant)     |        ❌        |    ✅ 90-100% token savings    |
+| 19 Specialist Agents         |        ❌        |      ✅ Domain boundaries      |
+| Agent Boundary Enforcement   |        ❌        |     ✅ File-type ownership     |
+| Dynamic Question Generation  |        ❌        |   ✅ Trade-offs + priorities   |
+| Memory-First Protocol        |        ❌        |       ✅ Auto cache-hit        |
+| Skill Creator + Catalog      |        ❌        |    ✅ 45+ composable skills    |
+| Platform Setup Wizard        |        ❌        |       ✅ One-shot config       |
+---
 ## 🌐 Platform Support
 The framework automatically detects your AI coding environment and activates the best available features:
@@ -276,24 +300,30 @@ Use these keywords, commands, and phrases to trigger specific capabilities:
 ### Skill Trigger Keywords (Natural Language)
-| Category          | Trigger Words / Phrases                                               | Skill Activated                     |
-| ----------------- | --------------------------------------------------------------------- | ----------------------------------- |
-| **Memory**        | "don't use cache", "no cache", "skip memory", "fresh"                 | Memory opt-out                      |
-| **Research**      | "research my docs", "deep search", "@notebooklm", "query my notebook" | `notebooklm-rag` (Deep RAG)         |
-| **Documentation** | "update docs", "regenerate catalog", "sync documentation"             | `documentation`                     |
-| **Quality**       | "lint", "format", "check", "validate", "static analysis"              | `lint-and-validate`                 |
-| **Testing**       | "write tests", "run tests", "TDD", "test coverage"                    | `testing-patterns` / `tdd-workflow` |
-| **Architecture**  | "design system", "architecture decision", "ADR", "trade-off"          | `architecture`                      |
-| **Security**      | "security scan", "vulnerability", "audit", "OWASP"                    | `red-team-tactics`                  |
-| **Performance**   | "lighthouse", "bundle size", "core web vitals", "profiling"           | `performance-profiling`             |
-| **Design**        | "design UI", "color scheme", "typography", "layout"                   | `frontend-design`                   |
-| **Deployment**    | "deploy", "rollback", "release", "CI/CD"                              | `deployment-procedures`             |
-| **API**           | "REST API", "GraphQL", "tRPC", "API design"                           | `api-patterns`                      |
-| **Database**      | "schema design", "migration", "query optimization"                    | `database-design`                   |
-| **Planning**      | "plan this", "break down", "task list", "requirements"                | `plan-writing`                      |
-| **Brainstorming** | "explore options", "what are the approaches", "pros and cons"         | `brainstorming`                     |
-| **Code Review**   | "review this", "code quality", "best practices"                       | `code-review-checklist`             |
-| **i18n**          | "translate", "localization", "RTL", "locale"                          | `i18n-localization`                 |
+| Category           | Trigger Words / Phrases                                                | Skill Activated                     |
+| ------------------ | ---------------------------------------------------------------------- | ----------------------------------- |
+| **Memory**         | "don't use cache", "no cache", "skip memory", "fresh"                  | Memory opt-out                      |
+| **Research**       | "research my docs", "check my notebooks", "deep search", "@notebooklm" | `notebooklm-rag`                    |
+| **Documentation**  | "update docs", "regenerate catalog", "sync documentation"              | `documentation`                     |
+| **Quality**        | "lint", "format", "check", "validate", "static analysis"               | `lint-and-validate`                 |
+| **Testing**        | "write tests", "run tests", "TDD", "test coverage"                     | `testing-patterns` / `tdd-workflow` |
+| **TDD**            | "test first", "red green refactor", "failing test"                     | `test-driven-development`           |
+| **Plan Execution** | "execute plan", "run the plan", "batch execution"                      | `executing-plans`                   |
+| **Verification**   | "verify", "prove it works", "evidence", "show me the output"           | `verification-before-completion`    |
+| **Debugging**      | "debug", "root cause", "investigate", "why is this failing"            | `systematic-debugging`              |
+| **Architecture**   | "design system", "architecture decision", "ADR", "trade-off"           | `architecture`                      |
+| **Security**       | "security scan", "vulnerability", "audit", "OWASP"                     | `red-team-tactics`                  |
+| **Performance**    | "lighthouse", "bundle size", "core web vitals", "profiling"            | `performance-profiling`             |
+| **Design**         | "design UI", "color scheme", "typography", "layout"                    | `frontend-design`                   |
+| **Deployment**     | "deploy", "rollback", "release", "CI/CD"                               | `deployment-procedures`             |
+| **API**            | "REST API", "GraphQL", "tRPC", "API design"                            | `api-patterns`                      |
+| **Database**       | "schema design", "migration", "query optimization"                     | `database-design`                   |
+| **Planning**       | "plan this", "break down", "task list", "requirements"                 | `plan-writing`                      |
+| **Brainstorming**  | "explore options", "what are the approaches", "pros and cons"          | `brainstorming`                     |
+| **Code Review**    | "review this", "code quality", "best practices"                        | `code-review-checklist`             |
+| **i18n**           | "translate", "localization", "RTL", "locale"                           | `i18n-localization`                 |
+| **AWS**            | "terraform", "EKS", "Lambda", "S3", "CloudFront"                       | `aws` / `aws-terraform`             |
+| **Infrastructure** | "Consul", "service mesh", "OpenSearch"                                 | `consul` / `opensearch`             |
 ### Memory System Commands