npm - beth-copilot - Versions diffs - 1.1.0 → 2.0.0 - Mend

beth-copilot 1.1.0 → 2.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (223) hide show

package/CHANGELOG.md CHANGED Viewed

@@ -6,6 +6,44 @@ All notable changes to Beth are documented here. Format based on [Keep a Changel
 ---
+## [Unreleased]
+## [2.0.0] - 2026-03-16
+### Breaking Changes
+- **Beads removed — Backlog.md is the sole task tracker.** The entire beads/Dolt database layer has been removed. All agent instructions, hooks, and CLI commands now use `backlog` CLI exclusively. If you were using `bd` commands in scripts, they will no longer be referenced by Beth agents.
+- **`npx beth-copilot close` command removed** (~560 lines deleted). This command enforced beads-specific close logic (blocker deps, child issues, mandatory test subtasks via `bd`). The workflow is now handled by `backlog task edit BETH-X -s "Done" --plain`.
+### Added
+- **Backlog.md CLI integration** — All agent instructions now reference `backlog task create`, `backlog task edit`, `backlog task list`, `backlog board`, and `backlog overview` commands. The `--plain` flag is enforced everywhere to prevent TUI mode in agent contexts.
+- **`npx beth-copilot update` command** — Updates project files to latest templates without full re-init. Supports `--check-only` for dry-run inspection.
+- **Behavioral skill tests** — 302 E2E skill routing tests across 3 files validating deterministic hook injection, trigger coverage, and mapping completeness.
+- **SubagentStart/SubagentStop hook enforcement** — `inject-skills.mjs` deterministically maps agent types to required skills. `verify-skills.mjs` gates subagent completion on both skill compliance and task tracking.
+- **Hub-and-spoke agent coordination** — Replaced 15 lateral handoffs across 6 agents with single "Escalate to Beth" handoff per agent. All agents now report to Beth.
+- **Community skills** — Added brainstorming, framer-components, frontend-design, proof, rclone, feature-video, and other community-contributed skill modules.
+- **27+ Azure skills** — Full Azure skill suite: compute, storage, AI, messaging, diagnostics, compliance, RBAC, cost optimization, cloud migration, resource lookup/visualizer, Entra ID, Copilot SDK, Foundry, and more.
+- **860 tests** — Up from 438 in v1.1.0. Comprehensive coverage for CLI commands, skill routing, hook injection, pipeline integration, and path validation.
+### Changed
+- **Agent instructions rewritten for Backlog.md** — All 7 agent files (`beth.agent.md`, `developer.agent.md`, etc.) and `AGENTS.md` updated to reference `backlog` CLI instead of `bd` commands.
+- **Hook enforcement updated** — `verify-skills.mjs` now checks for `backlog task edit` compliance instead of `bd` commands.
+- **Templates synced** — All template files in `templates/` now match live `.github/` configuration.
+- **Test framework consolidated** — Migrated from mixed `node:test`/vitest to vitest-only imports across all test files.
+### Removed
+- **`beth-copilot close` command** — Entire close command and its 560-line implementation deleted.
+- **Beads stub functions** — Removed `bd`-related stubs from `bin/cli.js`.
+- **Dead code cleanup** — Removed 8 redundant/deprecated skills, unused `bs-buster` dependency, dead `bin/lib` files, legacy test scripts, empty barrel exports, and orphaned documentation.
+- **Dolt/beads references** — Purged from all production source code, agent instructions, templates, and documentation.
+### Fixed
+- **Template drift** — Templates now stay in sync with live `.github/` config via the `update` command.
+- **Duplicate tools in beth.agent.md** — Removed duplicate tool entries in frontmatter.
+- **Dead pathValidation.ts exports** — Cleaned unused exports that inflated the public API surface.
+- **Pre-push guard test isolation** — Removed unused `child_process` mock that caused CI failures.
+---
 ## [1.1.0] - 2026-03-10
 ### Added
@@ -13,7 +51,7 @@ All notable changes to Beth are documented here. Format based on [Keep a Changel
 - **`npx beth-copilot close` enforcement** — 3-layer close enforcement: (1) open blocker dependencies via `bd dep list`, (2) open children via `bd children`, (3) mandatory test subtasks (unit/e2e/security) for epics. `--force` bypasses all checks.
 - **Pre-push hook** — Git pre-push hook enforcing branch discipline: blocks pushes from `main`/`master` (exit 1), warns on non-epic branch names. Pure shell hook (no Node overhead). Auto-installed during `npx beth-copilot init`. Bypass with `BETH_SKIP_PUSH_GUARD=1`.
 - **Quality gate infrastructure** — `npm run test:gate` generates markdown test reports to `docs/test-reports/`. `scripts/quality-gate.mjs` runs vitest + legacy tests, parses results, generates report, exits non-zero on failure.
-- **Comprehensive CLI test suite** — 7 new test files: `close.e2e.test.ts`, `pre-push-guard.e2e.test.ts`, `quickstart-expanded.e2e.test.ts`, `cli-edge-cases.e2e.test.ts`, `framework-isolation.test.ts`, `init-logic.e2e.test.ts`, `doctor.e2e.test.ts`. 438 tests total (up from 485).
+- **Comprehensive CLI test suite** — 7 new test files: `close.e2e.test.ts`, `pre-push-guard.e2e.test.ts`, `quickstart-expanded.e2e.test.ts`, `cli-edge-cases.e2e.test.ts`, `framework-isolation.test.ts`, `init-logic.e2e.test.ts`, `doctor.e2e.test.ts`. 438 tests total.
 - **Doctor: Dolt database hygiene** — `checkDoltDatabases()` detects orphaned `*test*` databases and warns when user DB count exceeds threshold. Extracted `parseDoltDatabases()` with 18 unit tests.
 - **Session startup drift-prevention** — Mandatory 4-step session startup checklist in AGENTS.md: check uncommitted changes, unpushed commits, spot-check closed work, sync beads state.
 - **Beads disaster recovery docs** — `docs/BD-BACKUP-PARSER-FAILURE.md` with exact parser error, root cause, repro steps, and 3 recovery paths.

package/README.md CHANGED Viewed

@@ -21,12 +21,12 @@ She commands seven specialized agents, each with their own expertise, tools, and
 | Layer | What It Does | Status |
 |-------|-------------|--------|
 | **Copilot Agents** | `.agent.md` definitions running in VS Code Agent Mode | Live |
-| **CLI Toolchain** | `beth init`, `beth doctor`, `beth quickstart` — TypeScript commands | Live |
+| **CLI Toolchain** | `beth init`, `beth doctor`, `beth close`, `beth land` — TypeScript commands | Live |
 | **Orchestration Engine** | Fan-out routing, tool calling loop, subagent spawning, handoffs | Live |
 | **Tool Abstraction** | 6 CLI tools + MCP bridge — uniform interface for all agent capabilities | Live |
 | **LLM Provider** | Azure OpenAI with Entra ID auth, streaming, retry, tool calling | Live |
-**814 tests.** 813 pass, 1 skip, 0 fail.
+**478 tests.** 477 pass, 1 skip, 0 fail.
 ---
@@ -55,11 +55,11 @@ flowchart LR
 | **LLM Provider** | Azure OpenAI via `openai` SDK | Entra ID auth (no API keys), streaming + tool calling |
 | **Auth** | `@azure/identity` DefaultAzureCredential | az login, managed identity, VS Code creds |
 | **Frontmatter** | `gray-matter` | Parses `.agent.md` and `SKILL.md` YAML |
-| **Testing** | Node.js built-in test runner | 814 tests — unit, integration, E2E |
-| **Task Tracking** | beads (`bd` CLI) | Dependency-aware issue tracking for agents |
-| **Package Manager** | pnpm | Lockfile committed |
+| **Testing** | vitest + Node.js test runner | 478 tests — unit, integration, E2E |
+| **Task Tracking** | Backlog.md (`backlog` CLI) | Markdown-based task tracking for agents and humans |
+| **Package Manager** | npm | Lockfile committed |
-**Production dependencies:** 1 (`gray-matter`). That's it. Minimal attack surface by design.
+**Production dependencies:** 1 (`gray-matter`). Minimal attack surface by design.
 ---
@@ -80,8 +80,8 @@ Then open VS Code, switch Copilot Chat to **Agent mode**, and type `@Beth`.
 **Verify everything works:**
 ```bash
-beth doctor       # Health check: Node.js, beads, agents, skills
-beth quickstart   # Init + doctor + beads setup in one shot
+beth doctor       # Health check: Node.js, agents, skills
+beth quickstart   # Init + doctor in one shot
 ```
 For detailed setup (prerequisites, task tracking, MCP servers): [docs/INSTALLATION.md](docs/INSTALLATION.md)
@@ -92,19 +92,20 @@ For detailed setup (prerequisites, task tracking, MCP servers): [docs/INSTALLATI
 | Command | What It Does |
 |---------|-------------|
-| `beth init` | Install agents, skills, VS Code settings, beads tracking |
+| `beth init` | Install agents, skills, VS Code settings, Backlog.md tracking, pre-push hook |
 | `beth init --force` | Overwrite existing files |
-| `beth doctor` | Validate Node.js ≥18, beads CLI, agents frontmatter, skills directories |
-| `beth quickstart` | Run init + doctor + beads init in one shot |
+| `beth doctor` | Validate Node.js ≥18, agents frontmatter, skills |
+| `beth quickstart` | Run init + doctor in one shot |
+| `beth land` | Automate session completion: tests, commit, push, verify sync |
 | `beth help` | Show all commands and options |
-**Flags:** `--force`, `--skip-backlog`, `--skip-mcp`, `--skip-beads`, `--verbose`
+**Flags:** `--force`, `--skip-backlog`, `--skip-mcp`, `--verbose`, `--skip-tests`, `--message/-m`, `--dry-run`
 ---
 ## Agent Orchestration
-Beth doesn't micromanage. She delegates to specialists over **subagent** and **handoff** channels, tracks dependencies with beads, and holds every agent accountable.
+Beth doesn't micromanage. She delegates to specialists over **subagent** and **handoff** channels, tracks work in Backlog.md, and holds every agent accountable.
 ### The Family
@@ -118,17 +119,23 @@ Beth doesn't micromanage. She delegates to specialists over **subagent** and **h
 | **@tester** | The Enforcer | Quality assurance, accessibility, performance |
 | **@security-reviewer** | The Bodyguard | OWASP, compliance, threat modeling |
-### Delegation Model
+### Delegation Model (Hub-and-Spoke)
 ```mermaid
 flowchart LR
     Beth["@Beth"] -->|subagent| PM["PM"] & UX["UX"] & Dev["Dev"] & Sec["Sec"] & Test["Test"] & Res["Research"]
-    PM -.->|handoff| UX & Dev
-    Dev -.->|handoff| Test & UX
+    PM -.->|escalate| Beth
+    UX -.->|escalate| Beth
+    Dev -.->|escalate| Beth
+    Sec -.->|escalate| Beth
+    Test -.->|escalate| Beth
+    Res -.->|escalate| Beth
     style Beth fill:#1e3a5f,color:#fff
 ```
+All agents escalate exclusively to Beth — no lateral handoffs. Beth routes, agents execute.
 ### Subagent vs Handoff
 | Mechanism | Control | Use When |
@@ -216,24 +223,37 @@ Full details: [docs/MCP-SETUP.md](docs/MCP-SETUP.md)
 ## Skills (On-Demand Knowledge)
-Skills are domain-knowledge modules that agents load automatically when trigger phrases match. Each skill lives in `.github/skills/<name>/SKILL.md`.
+Skills are domain-knowledge modules that agents load automatically when trigger phrases match. Each skill lives in `.github/skills/<name>/SKILL.md` or `.github/prompts/<name>/PROMPT.md`.
 | Skill | Triggers On | Used By |
 |-------|------------|---------|
 | **PRD Generation** | "create a prd", "product requirements" | Product Manager |
-| **Framer Components** | "framer component", "property controls" | UX Designer |
+| **UI UX Pro Max** | "design system", "color palette", "style guide" | UX Designer, Developer |
+| **Web Design Guidelines** | "review my UI", "check accessibility" | UX Designer, Tester |
+| **Framer Components** | "framer component", "property controls" | UX Designer, Developer |
 | **React/Next.js Best Practices** | React performance, Next.js patterns | Developer |
-| **Web Design Guidelines** | "review my UI", "check accessibility" | UX Designer |
 | **shadcn/ui** | "shadcn", "ui component" | Developer |
 | **Security Analysis** | "security review", "OWASP", "threat model" | Security Reviewer |
 | **Azure Operations** | Azure resource management | Developer |
 | **Web Search** | Internet research via Brave | Researcher |
+### Design & UI Skills
+Three complementary skills cover the full design-to-code pipeline. They don't overlap — each solves a different problem.
+| Skill | What It Does | When You Need It |
+|-------|-------------|------------------|
+| **[UI UX Pro Max](https://github.com/nextlevelbuilder/ui-ux-pro-max-skill)** | Design system generator — picks styles, colors, typography, and layout patterns from a searchable database of 67 styles, 161 color palettes, 57 font pairings, and 161 industry-specific reasoning rules. | Starting a new project or page. "What should this look like?" |
+| **Web Design Guidelines** | Code auditor — fetches live [Vercel Web Interface Guidelines](https://github.com/vercel-labs/web-interface-guidelines) and checks your actual files for accessibility, focus, form, and performance violations with `file:line` output. | Reviewing implemented code. "Is this built correctly?" |
+| **Framer Components** | Framer platform SDK reference — `addPropertyControls`, `ControlType`, code overrides, `RenderTarget`, auto-sizing, and Framer Motion integration. | Building custom components inside Framer. "How do I make this work in Framer?" |
+**Typical flow:** UI UX Pro Max generates the design system → Developer builds it → Web Design Guidelines audits the result. Framer Components is loaded only when targeting the Framer platform.
 ---
 ## How It Works
-Beth runs inside VS Code Copilot Agent Mode. The `@Beth` agent parses requests, delegates to specialist agents via subagent spawning, and tracks work through beads.
+Beth runs inside VS Code Copilot Agent Mode. The `@Beth` agent parses requests, delegates to specialist agents via subagent spawning, and tracks work through Backlog.md.
 ```mermaid
 flowchart LR
@@ -249,12 +269,12 @@ flowchart LR
 **Key capabilities:**
 - **Agent routing** — `@mention` parsing, subagent spawning, handoff chains
 - **Skill injection** — Domain knowledge loaded on trigger phrases
-- **Task tracking** — beads (`bd`) for epics, subtasks, dependencies
+- **Task tracking** — Backlog.md (`backlog`) for tasks, milestones, and progress
 - **MCP integration** — Optional external tool servers (shadcn, Playwright, Azure)
 ```
 @Beth implement the login page
-→ Beth routes to @developer, tracks work in beads
+→ Beth routes to @developer, tracks work in Backlog.md
 @Beth review this PR for security vulnerabilities
 → Beth routes to @security-reviewer, injects security-analysis skill
@@ -269,7 +289,7 @@ flowchart LR
 ## Tool Abstraction Layer
-A uniform interface for all agent capabilities — file I/O, terminal, search, beads, subagent spawning, and MCP server tools. Tools expose OpenAI-compatible function calling schemas so the LLM can invoke them directly.
+A uniform interface for all agent capabilities — file I/O, terminal, search, task tracking, subagent spawning, and MCP server tools. Tools expose OpenAI-compatible function calling schemas so the LLM can invoke them directly.
 | Tool | What It Does | Key Features |
 |------|-------------|-------------- |
@@ -277,7 +297,7 @@ A uniform interface for all agent capabilities — file I/O, terminal, search, b
 | **editFile** | Atomic string replacement | Single-match enforcement, whitespace-safe |
 | **search** | Ripgrep search | Node.js fallback, regex support, file filtering |
 | **terminal** | Execute shell commands | `execFile('/bin/sh')` — no shell injection, timeouts |
-| **beads** | Issue tracking | `bd create`, `npx beth-copilot close`, `bd list` via CLI wrapper |
+| **backlog** | Task tracking | `backlog task create`, `backlog board`, `backlog task edit` via CLI |
 | **subagent** | Spawn nested agents | Returns structured result for orchestrator to process |
 | **MCP Bridge** | External tool servers | JSON-RPC 2.0 over stdio, JSONC config, namespaced tools |
@@ -310,14 +330,14 @@ flowchart LR
     CLI --> Doctor["doctor"]
     CLI --> QS["quickstart"]
     Init --> Templates[".agent.md · SKILL.md · settings"]
-    Doctor --> Checks["Node ≥18 · beads · agents · skills"]
+    Doctor --> Checks["Node ≥18 · agents · skills"]
     QS --> Init & Doctor
 ```
 **Commands:**
-- `beth init` — Scaffold agents, skills, VS Code settings, beads tracking
-- `beth doctor` — Validate Node.js, beads CLI, agent frontmatter, skill directories
-- `beth quickstart` — Run init + doctor + beads init in one shot
+- `beth init` — Scaffold agents, skills, VS Code settings, Backlog.md tracking
+- `beth doctor` — Validate Node.js, agent frontmatter, skill directories
+- `beth quickstart` — Run init + doctor in one shot
 ---
@@ -358,7 +378,7 @@ beth/
 │   │   │   ├── editFile.ts         # Atomic string replacement
 │   │   │   ├── search.ts           # Ripgrep with Node.js fallback
 │   │   │   ├── terminal.ts         # Secure command execution
-│   │   │   ├── beads.ts            # Issue tracking via bd CLI
+│   │   │   ├── backlog.ts           # Task tracking via backlog CLI
 │   │   │   └── subagent.ts         # Agent spawning interface
 │   │   └── mcp/
 │   │       ├── client.ts           # JSON-RPC 2.0 over stdio
@@ -399,7 +419,7 @@ beth/
 | editFile | 30+ | String replacement, single-match enforcement |
 | search | 30+ | Ripgrep, Node.js fallback, regex, file filtering |
 | terminal | 30+ | Command execution, timeouts, output capture |
-| beads | 30+ | bd CLI wrapper, create/close/list/ready |
+| backlog | 30+ | Backlog.md CLI wrapper, task tracking |
 | subagent | 30+ | Spawn interface, result marking, agent validation |
 | MCP client | 30+ | JSON-RPC 2.0, protocol handshake, tool listing |
 | MCP bridge | 30+ | JSONC parsing, tool namespacing, error handling |
@@ -500,37 +520,6 @@ Is it magic? No. It's just competence with very good hair.
 - **Node.js** ≥ 18
 - **VS Code** with GitHub Copilot extension
 - **GitHub Copilot Chat** in Agent mode
-- [**beads**](https://github.com/steveyegge/beads) for task tracking (`bd` CLI)
-### Installing Beads
-```bash
-curl -fsSL https://raw.githubusercontent.com/steveyegge/beads/main/scripts/install.sh | bash
-```
-**CGO Troubleshooting (Linux/WSL):** Beads uses Dolt (a Git-for-data database) which requires CGO. If `bd init` or `bd doctor` fails with CGO-related errors:
-```bash
-# Install C compiler toolchain (required for CGO)
-sudo apt-get update && sudo apt-get install -y build-essential gcc
-# Verify CGO is available
-export CGO_ENABLED=1
-go env CGO_ENABLED  # should print 1
-# Re-install beads
-curl -fsSL https://raw.githubusercontent.com/steveyegge/beads/main/scripts/install.sh | bash
-```
-**Common beads issues:**
-- `bd: command not found` — Add `~/.local/bin` to your PATH: `export PATH="$HOME/.local/bin:$PATH"`
-- `bd doctor` warnings about metadata — Run `bd doctor --fix` to auto-repair
-- Dolt migration errors — Delete `.beads/` and re-initialize with `bd init`
-```bash
-# Verify beads is working
-bd doctor
-```
 ### Optional: MCP Servers
@@ -542,7 +531,7 @@ See [MCP Integrations](#mcp-integrations) above or [docs/MCP-SETUP.md](docs/MCP-
 | Doc | Purpose |
 |-----|---------|
-| [Installation Guide](docs/INSTALLATION.md) | Full setup: prerequisites, VS Code config, beads |
+| [Installation Guide](docs/INSTALLATION.md) | Full setup: prerequisites, VS Code config, Backlog.md |
 | [MCP Setup](docs/MCP-SETUP.md) | Optional server integrations |
 | [CLI Architecture](docs/CLI-ARCHITECTURE.md) | Dual-interface design, implementation phases |
 | [System Flow](docs/SYSTEM-FLOW.md) | Agent orchestration diagrams |

package/assets/beth-questioning.png CHANGED Viewed

File without changes

package/assets/yellowstone-beth.png CHANGED Viewed

File without changes