npm - agentic-qe - Versions diffs - 3.5.4 → 3.5.5 - Mend

agentic-qe 3.5.4 → 3.5.5


Files changed (70)

package/.claude/skills/debug-loop/SKILL.md ADDED

@@ -0,0 +1,61 @@
+---
+name: debug-loop
+description: Hypothesis-driven autonomous debugging with real command validation
+trust_tier: 3
+domain: debugging
+---
+# Debug Loop
+Autonomous hypothesis-driven debugging against real data. No guessing, no simulating.
+## Arguments
+- `<symptom>` — Description of the bug or unexpected behavior. If omitted, prompt the user.
+## Phases
+### Phase 1 — Reproduce
+Run the exact command that shows the bug. Capture and display the REAL output. Confirm the bug is visible.
+If the bug cannot be reproduced, stop and explain what was tried.
+### Phase 2 — Hypothesize and Test (up to 5 iterations)
+For each iteration:
+1. State a specific hypothesis (e.g., "the query targets v2 tables but data is in v3 tables")
+2. Run a REAL command to test it (e.g., `sqlite3 [db path] '.tables'` then `SELECT COUNT(*) FROM [table]`)
+3. Record whether the hypothesis was confirmed or rejected
+4. If rejected, form the next hypothesis based on what you learned
+**Do NOT make code changes until you have a confirmed root cause.**
+Important checks:
+- Always check both v2 and v3 SQLite tables when data issues are suspected
+- Check dependency versions (e.g., sqlite3 vs better-sqlite3)
+- Check for hardcoded values that may have been missed
+### Phase 3 — Fix
+Make the minimal targeted fix. Explain:
+- What the root cause was
+- What you're changing and why
+- What the blast radius is (which other code paths are affected)
+Before applying, grep for ALL instances of the problematic pattern across the entire codebase.
+### Phase 4 — Verify
+Run the SAME reproduction command from Phase 1. The output must now show correct values. If it doesn't, go back to Phase 2.
+Show before/after output comparison.
+### Phase 5 — Regression
+```bash
+npm test
+```
+Run the full test suite. If tests fail, fix them before committing.
+## Rules
+- NEVER guess or simulate output — always run real commands
+- NEVER make code changes before confirming root cause
+- Always check for the pattern across the entire codebase, not just one file
+- If blocked after 5 hypotheses, stop and ask the user for guidance
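
Phases 1 and 4 of the debug-loop skill above depend on re-running the same reproduction command and comparing real output before and after the fix. A minimal sketch of that comparison follows. The helper names `capture` and `compare_runs` and the snapshot files are hypothetical and are not part of the agentic-qe package.

```bash
# Hypothetical helpers, not part of agentic-qe.

# Run a reproduction command and store its combined stdout/stderr in a file.
capture() {
  local out_file="$1"; shift
  "$@" >"$out_file" 2>&1
  # Always succeed so that a failing repro command does not abort the session.
  return 0
}

# Print a unified before/after diff; succeed only if the output changed.
compare_runs() {
  local before="$1" after="$2"
  if diff -u "$before" "$after"; then
    echo "Output unchanged: the fix did not alter behavior" >&2
    return 1
  fi
  return 0
}
```

In use, `capture before.txt` would wrap the Phase 1 command and `capture after.txt` the identical command in Phase 4. A changed output only shows that behavior moved; whether the new values are correct still has to be judged from the diff.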

package/.claude/skills/pr-review/SKILL.md ADDED

@@ -0,0 +1,61 @@
+---
+name: pr-review
+description: Scope-aware GitHub PR review with user-friendly tone and trust tier validation
+trust_tier: 3
+domain: code-review
+---
+# PR Review Workflow
+Review pull requests with correct AQE scope boundaries, clear communication, and actionable feedback.
+## Arguments
+- `<pr-number>` — GitHub PR number to review. If omitted, prompt the user.
+## Steps
+### 1. Read the Full Diff
+```bash
+gh pr diff <pr-number>
+gh pr view <pr-number>
+```
+Read the complete diff and PR description. Do not skim — read every changed file.
+### 2. Scope Check
+- Only analyze AQE/QE skills (NOT Claude Flow platform skills)
+- Platform skills to EXCLUDE: v3-*, flow-nexus-*, agentdb-*, reasoningbank-*, swarm-advanced, swarm-orchestration
+- If the PR touches skills, verify the count/scope matches expectations (~63 AQE skills)
+- Flag any platform skill changes that may have leaked into an AQE-focused PR
+### 3. Summarize Changes
+Write a user-friendly summary of what changed and why:
+- Focus on outcomes, not implementation details
+- Avoid overly technical jargon
+- Keep it to 3-5 bullet points
+### 4. Trust Tier Validation
+For any skill changes, validate trust_tier assignments:
+- **tier 3** = has eval infrastructure (evals/, schemas/, scripts/)
+- **tier 2** = tested but no eval framework
+- **tier 1** = untested
+- Flag inconsistencies (e.g., a skill with evals at tier 2 should be tier 3)
+### 5. Code Quality Review
+Check for:
+- Hardcoded version strings
+- Production safety concerns (adapter changes, breaking changes)
+- Missing test coverage for new code
+- Security issues (exposed secrets, injection risks)
+### 6. Post Review
+```bash
+gh pr review <pr-number> --comment --body "review comments"
+```
+## Communication Rules
+- Keep tone constructive and actionable
+- Be outcome-focused: what should the author do, not what's wrong
+- Group related comments together instead of posting many small ones
+- If approving with minor suggestions, use APPROVE with comments, not REQUEST_CHANGES
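
The scope check and trust tier validation in the pr-review skill above can be approximated with a few shell predicates. This is a sketch under two assumptions: that "has eval infrastructure" means all three of `evals/`, `schemas/`, and `scripts/` exist inside the skill directory, and that the declared tier is passed in by the caller. The function names are hypothetical and are not part of the package.

```bash
# Hypothetical helpers, not part of agentic-qe.

# Succeed if the skill name matches a platform pattern the review should exclude.
is_platform_skill() {
  case "$1" in
    v3-*|flow-nexus-*|agentdb-*|reasoningbank-*|swarm-advanced|swarm-orchestration) return 0 ;;
    *) return 1 ;;
  esac
}

# Succeed if the skill directory contains evals/, schemas/ and scripts/
# (assumed definition of "eval infrastructure").
has_eval_infra() {
  local dir="$1"
  [ -d "$dir/evals" ] && [ -d "$dir/schemas" ] && [ -d "$dir/scripts" ]
}

# Report whether a declared trust_tier agrees with the directory layout.
check_tier() {
  local dir="$1" declared="$2"
  if has_eval_infra "$dir" && [ "$declared" -lt 3 ]; then
    echo "FLAG: $dir has eval infrastructure but declares tier $declared"
  elif ! has_eval_infra "$dir" && [ "$declared" -eq 3 ]; then
    echo "FLAG: $dir declares tier 3 without eval infrastructure"
  else
    echo "OK: $dir"
  fi
}
```

The layout alone cannot distinguish tier 2 from tier 1, since "tested" is not visible in the directory structure, so the sketch only flags mismatches around tier 3.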

package/.claude/skills/release/SKILL.md ADDED

@@ -0,0 +1,333 @@
+---
+name: release
+description: End-to-end npm release workflow with verification gates and hardcoded-version protection
+trust_tier: 3
+domain: release-management
+---
+# Release Workflow
+Execute a safe, verified npm release for the `agentic-qe` monorepo. STOP after each phase for user confirmation.
+## Architecture
+This project has a **dual-package structure** — both must stay in sync:
+| File | Package Name | Role |
+|------|-------------|------|
+| `package.json` (root) | `agentic-qe` | **Published to npm** — the installable package |
+| `v3/package.json` | `@agentic-qe/v3` | **Actual implementation** — CLI, MCP, domains |
+- Root `prepublishOnly` hook runs `cd v3 && npm run build`
+- `npm publish` runs from the **root** directory
+- GitHub Actions triggers on `release: published` event via `.github/workflows/npm-publish.yml`
+## Arguments
+- `<version>` — Target version (e.g., `3.5.5`). If omitted, prompt the user.
+## Steps
+### 1. Pre-Flight: Ensure Clean State
+```bash
+git status
+git branch --show-current
+```
+Verify working directory is clean and you know which branch you're on. If there are uncommitted changes, stop and ask the user. The release prep happens on the **current working branch** — we PR to main later.
+**STOP — confirm clean state and current branch.**
+### 2. Version Audit
+Read the current version from `package.json` (source of truth). Then grep the ENTIRE codebase for hardcoded version strings: the current version, old versions, and any version-like patterns. Check that both package.json files are in sync.
+```bash
+# Read current version
+node -p "require('./package.json').version"
+node -p "require('./v3/package.json').version"
+# Search for version strings in source files
+grep -rn --include="*.ts" --include="*.js" --include="*.json" '"3\.[0-9]\+\.[0-9]\+"' . \
+  --exclude-dir=node_modules --exclude-dir=dist --exclude-dir=.git
+```
+If any stale version strings are found, fix them ALL before continuing. Both `package.json` and `v3/package.json` must match.
+**STOP — show findings and wait for user confirmation.**
+### 3. Bump Version
+Update both package.json files to the target version:
+```bash
+# Bump v3 first (no git tag — we tag manually)
+cd /workspaces/agentic-qe-new/v3 && npm version <version> --no-git-tag-version
+# Bump root to match
+cd /workspaces/agentic-qe-new && npm version <version> --no-git-tag-version
+```
+Verify both match:
+```bash
+grep '"version"' package.json v3/package.json
+```
+**STOP — confirm versions match.**
+### 4. Update CHANGELOG
+Add a new section to `v3/CHANGELOG.md` following Keep a Changelog format:
+```markdown
+## [<version>] - YYYY-MM-DD
+### Added
+- ...
+### Fixed
+- ...
+### Changed
+- ...
+```
+Write user-friendly descriptions focused on value, not implementation details.
+**STOP — show changelog entry for review.**
+### 5. Build
+```bash
+npm run build
+```
+This runs `cd v3 && npm run build` which executes `tsc && build:cli && build:mcp`. Verify zero errors. If build fails, diagnose and fix.
+**STOP — show build output.**
+### 6. Type Check
+```bash
+npm run typecheck
+```
+This runs `cd v3 && tsc --noEmit`. Must produce zero errors.
+**STOP — show type check output.**
+### 7. Unit Tests
+```bash
+cd /workspaces/agentic-qe-new/v3 && npx vitest run tests/unit/
+```
+Run REAL tests against the actual codebase. Do NOT simulate, mock, or skip any tests. ALL unit tests must pass.
+**STOP — show test results.**
+### 8. Artifact & Integration Verification
+This is the critical pre-release gate. Verify the built package works end-to-end as a user would experience it.
+#### 8a. Verify Build Artifacts
+```bash
+# Verify v3/dist/ exists with expected bundles
+ls -la v3/dist/cli/bundle.js
+ls -la v3/dist/index.js
+ls -la v3/dist/mcp/
+```
+All three must exist: CLI bundle, main entry point, MCP server.
+#### 8b. Test `aqe init --auto` in a Fresh Project
+```bash
+# Create a temporary test project
+mkdir -p /tmp/aqe-release-test && cd /tmp/aqe-release-test
+# Run init using the LOCAL build (not published version)
+node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js init --auto
+```
+Verify init completes without errors and creates the expected project structure (`.agentic-qe/` directory, config files).
+#### 8c. Verify CLI
+```bash
+# Version output
+node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js --version
+# Doctor check
+node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js doctor
+```
+Both must succeed without errors.
+#### 8d. Verify MCP Tools
+```bash
+# Verify MCP server can start and list tools
+node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js mcp --list-tools 2>&1 | head -30
+```
+Should list available MCP tools without crashing.
+#### 8e. Verify Self-Learning & Fleet Capabilities
+```bash
+cd /tmp/aqe-release-test
+# Verify memory/learning subsystem
+node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js memory list 2>&1 | head -10
+# Verify the agent subsystem responds (lists agents; does not spawn one)
+node /workspaces/agentic-qe-new/v3/dist/cli/bundle.js agent list 2>&1 | head -10
+```
+These should respond (even if empty results) without errors, confirming the subsystems initialize properly.
+#### 8f. Cleanup
+```bash
+rm -rf /tmp/aqe-release-test
+```
+**STOP — show all verification results. Every check must pass before continuing.**
+### 9. Local CI Test Suite
+Run the same tests that CI runs on PRs (`optimized-ci.yml`) and during publish (`npm-publish.yml`). Skip e2e browser tests unless the user explicitly requests them.
+```bash
+cd /workspaces/agentic-qe-new/v3
+# Journey tests (highest-value signal, from optimized-ci.yml)
+npm run test:journeys
+# Code Intelligence tests (MinCut/Graph algorithms, from optimized-ci.yml)
+npm run test:code-intelligence
+# Contract tests (skipped only if the script is not defined; failures still surface)
+npm run test:contracts --if-present
+# Infrastructure tests (from optimized-ci.yml)
+npm run test:infrastructure --if-present
+# Regression tests (from optimized-ci.yml)
+npm run test:regression --if-present
+# Performance gates (from optimized-ci.yml)
+npm run performance:gate
+# Full test:ci suite (from npm-publish.yml — excludes browser/e2e)
+npm run test:ci
+```
+All mandatory test suites must pass. If any fail, diagnose and fix before continuing.
+**STOP — show all test results.**
+### 10. Commit & Create PR to Main
+```bash
+cd /workspaces/agentic-qe-new
+# Stage version bump + changelog
+git add package.json v3/package.json v3/CHANGELOG.md
+# Also stage any files that were fixed during version audit
+git status
+git commit -m "chore(release): bump version to v<version>"
+git push origin <current-branch>
+# Create PR to main
+gh pr create \
+  --base main \
+  --title "chore(release): v<version>" \
+  --body "$(cat <<'EOF'
+## Release v<version>
+### Verification Checklist
+- [x] Both package.json versions match
+- [x] Build succeeds (tsc + CLI + MCP bundles)
+- [x] Type check passes
+- [x] All unit tests pass
+- [x] `aqe init --auto` works in fresh project
+- [x] CLI commands functional
+- [x] MCP tools load correctly
+- [x] Self-learning subsystem initializes
+- [x] Journey tests pass
+- [x] Code Intelligence tests pass
+- [x] Performance gates pass
+- [x] Full test:ci suite passes
+See [CHANGELOG](v3/CHANGELOG.md) for details.
+EOF
+)"
+```
+**STOP — show PR URL and wait for CI checks to pass.**
+### 11. Merge PR
+Once CI passes on the PR:
+```bash
+gh pr merge <pr-number> --merge
+```
+Pull the merged main:
+```bash
+git checkout main && git pull origin main
+```
+**STOP — confirm merge successful.**
+### 12. Create GitHub Release
+```bash
+gh release create v<version> \
+  --title "v<version>: <Short Title>" \
+  --notes "$(cat <<'EOF'
+## What's New
+<User-friendly summary — focus on value, not technical details>
+## Getting Started
+\`\`\`bash
+npx agentic-qe init --auto
+\`\`\`
+See [CHANGELOG](v3/CHANGELOG.md) for full details.
+EOF
+)"
+```
+This automatically:
+- Creates a git tag `v<version>`
+- Triggers the `npm-publish.yml` GitHub Actions workflow
+**STOP — show release URL.**
+### 13. Monitor Publish Workflow
+```bash
+# Watch the GitHub Actions workflow
+gh run list --workflow=npm-publish.yml --limit 1
+gh run watch <run-id>
+```
+The workflow:
+1. Checks out code, installs deps
+2. Runs `npm run typecheck`
+3. Runs `npm run build`
+4. Verifies `v3/dist/` exists with CLI and MCP bundles
+5. Runs `npm run test:ci` (unit tests, excludes browser/e2e)
+6. Verifies package version matches git tag
+7. Runs `npm publish --access public --provenance`
+Wait for the workflow to succeed. If it fails, diagnose from the logs:
+```bash
+gh run view <run-id> --log-failed
+```
+**STOP — confirm workflow succeeded.**
+### 14. Post-Publish Verification
+```bash
+npm view agentic-qe@<version> name version
+```
+Confirm the published version matches. Test install:
+```bash
+npx agentic-qe@<version> --version
+```
+## Rules
+- **Both package.json files must match** — root and v3
+- Never hardcode version strings — always read from package.json
+- Always run REAL tests, never simulated
+- Publish happens via GitHub Actions, not locally (uses `--provenance` for attestation)
+- Release notes must be **user-friendly** — focus on value, not implementation internals
+- If anything unexpected is found at any step, stop and explain before proceeding
+- Never push tags or create releases without user confirmation
+- The `prepublishOnly` hook must `cd v3` before building — this is handled by root package.json
+- **No e2e browser tests** unless user explicitly requests them
+- All verification (step 8) must pass before creating the PR — this catches issues before they reach main
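
The release skill above has two version gates: every version field must agree before the PR, and the package version must match the git tag before publish. They could be scripted roughly as follows. This is a sketch, not the project's actual tooling: `json_field`, `versions_in_sync`, and `tag_matches_version` are hypothetical names, and the field lookup is a line-based `sed` match rather than a real JSON parser.

```bash
# Hypothetical helpers, not part of agentic-qe.

# Print the first string value found for a key in a JSON file.
# Line-based sed match, not a JSON parser: assumes key and value share a line.
json_field() {
  local file="$1" key="$2"
  sed -n "s/.*\"$key\"[[:space:]]*:[[:space:]]*\"\([^\"]*\)\".*/\1/p" "$file" | head -n 1
}

# Succeed only if every "file:key" pair holds the expected version.
versions_in_sync() {
  local expected="$1"; shift
  local pair file key actual status=0
  for pair in "$@"; do
    file="${pair%%:*}"
    key="${pair##*:}"
    actual="$(json_field "$file" "$key")"
    if [ "$actual" != "$expected" ]; then
      echo "STALE: $file $key=$actual (expected $expected)"
      status=1
    fi
  done
  return "$status"
}

# Succeed if a git tag such as v3.5.5 corresponds to package version 3.5.5.
tag_matches_version() {
  [ "${1#v}" = "$2" ]
}
```

A call such as `versions_in_sync "<version>" package.json:version v3/package.json:version` covers the dual-package rule. Adding a `fleetVersion` pair for the skills manifest would catch the kind of drift this release corrects, where the manifest still read `3.5.0` in the 3.5.4 package. The manifest's path inside the repository is not shown here, so it would need to be confirmed first.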

package/.claude/skills/skills-manifest.json CHANGED

@@ -903,7 +903,7 @@
   },
   "metadata": {
     "generatedBy": "Agentic QE Fleet",
-    "fleetVersion": "3.5.0",
+    "fleetVersion": "3.5.5",
     "manifestVersion": "1.3.0",
     "lastUpdated": "2026-02-04T00:00:00.000Z",
     "contributors": [

package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "agentic-qe",
-  "version": "3.5.4",
+  "version": "3.5.5",
   "description": "Agentic Quality Engineering V3 - Domain-Driven Design Architecture with 12 Bounded Contexts, O(log n) coverage analysis, ReasoningBank learning, 51 specialized QE agents, mathematical Coherence verification, deep Claude Flow integration",
   "main": "./v3/dist/index.js",
   "types": "./v3/dist/index.d.ts",

package/v3/CHANGELOG.md CHANGED

@@ -5,6 +5,24 @@ All notable changes to Agentic QE will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [3.5.5] - 2026-02-06
+### Added
+- **Ghost Intent Coverage Analysis (ADR-059)** - Detect untested behavioral intents ("phantom gaps") that traditional line/branch coverage misses. Surfaces hidden coverage gaps in error handling, edge cases, and implicit behavioral contracts.
+- **Semantic Anti-Drift Middleware (ADR-060)** - Event pipeline middleware that attaches semantic fingerprints to domain events and verifies payload integrity across agent-hop boundaries using cosine similarity checks.
+- **Asymmetric Learning Rates (ADR-061)** - 10:1 penalty-to-reward ratio for learning from failures vs successes. Failed patterns are quarantined with exponential confidence decay; successful patterns earn gradual trust through consecutive wins.
+### Fixed
+- **Dynamic package version in init** - `aqe init --auto` now reads version from package.json instead of hardcoded `3.0.0` (#240)
+- **Init upgrade path** - Resolved issues with `aqe init --auto` when upgrading existing projects (#239)
+### Changed
+- **CLAUDE.md** - Added insights-driven rules and custom skills for improved agent coordination
+- **Release workflow** - Updated `/release` skill with artifact verification gates and working-branch-first PR flow
 ## [3.5.2] - 2026-02-05
 ### Added