npm - qaa-agent - Versions diffs - 1.6.1 → 1.6.3 - Mend

qaa-agent 1.6.1 → 1.6.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (73) hide show

package/.claude/commands/create-test.md +164 -164
package/.claude/commands/qa-audit.md +37 -37
package/.claude/commands/qa-blueprint.md +54 -54
package/.claude/commands/qa-fix.md +36 -36
package/.claude/commands/qa-from-ticket.md +24 -24
package/.claude/commands/qa-gap.md +20 -20
package/.claude/commands/qa-map.md +47 -47
package/.claude/commands/qa-pom.md +36 -36
package/.claude/commands/qa-pr.md +23 -23
package/.claude/commands/qa-pyramid.md +37 -37
package/.claude/commands/qa-report.md +38 -38
package/.claude/commands/qa-research.md +33 -33
package/.claude/commands/qa-start.md +22 -22
package/.claude/commands/qa-testid.md +19 -19
package/.claude/commands/qa-validate.md +42 -42
package/.claude/commands/update-test.md +58 -58
package/.claude/settings.json +20 -20
package/.claude/skills/qa-bug-detective/SKILL.md +122 -122
package/.claude/skills/qa-learner/SKILL.md +150 -150
package/.claude/skills/qa-repo-analyzer/SKILL.md +88 -88
package/.claude/skills/qa-self-validator/SKILL.md +109 -109
package/.claude/skills/qa-template-engine/SKILL.md +113 -113
package/.claude/skills/qa-testid-injector/SKILL.md +93 -93
package/.claude/skills/qa-workflow-documenter/SKILL.md +87 -87
package/.mcp.json +8 -8
package/CHANGELOG.md +71 -71
package/CLAUDE.md +553 -553
package/agents/qa-pipeline-orchestrator.md +1378 -1378
package/agents/qaa-analyzer.md +524 -524
package/agents/qaa-bug-detective.md +446 -446
package/agents/qaa-codebase-mapper.md +935 -935
package/agents/qaa-e2e-runner.md +415 -415
package/agents/qaa-executor.md +651 -651
package/agents/qaa-planner.md +390 -390
package/agents/qaa-project-researcher.md +319 -319
package/agents/qaa-scanner.md +424 -424
package/agents/qaa-testid-injector.md +585 -585
package/agents/qaa-validator.md +452 -452
package/bin/install.cjs +198 -198
package/bin/lib/commands.cjs +709 -709
package/bin/lib/config.cjs +307 -307
package/bin/lib/core.cjs +497 -497
package/bin/lib/frontmatter.cjs +299 -299
package/bin/lib/init.cjs +989 -989
package/bin/lib/milestone.cjs +241 -241
package/bin/lib/model-profiles.cjs +60 -60
package/bin/lib/phase.cjs +911 -911
package/bin/lib/roadmap.cjs +306 -306
package/bin/lib/state.cjs +748 -748
package/bin/lib/template.cjs +222 -222
package/bin/lib/verify.cjs +842 -842
package/bin/qaa-tools.cjs +607 -607
package/docs/COMMANDS.md +341 -341
package/docs/DEMO.md +182 -182
package/docs/TESTING.md +156 -156
package/package.json +41 -41
package/templates/failure-classification.md +391 -391
package/templates/gap-analysis.md +409 -409
package/templates/pr-template.md +48 -48
package/templates/qa-analysis.md +381 -381
package/templates/qa-audit-report.md +465 -465
package/templates/qa-repo-blueprint.md +636 -636
package/templates/scan-manifest.md +312 -312
package/templates/test-inventory.md +582 -582
package/templates/testid-audit-report.md +354 -354
package/templates/validation-report.md +243 -243
package/workflows/qa-analyze.md +296 -296
package/workflows/qa-from-ticket.md +536 -536
package/workflows/qa-gap.md +303 -303
package/workflows/qa-pr.md +389 -389
package/workflows/qa-start.md +1168 -1168
package/workflows/qa-testid.md +356 -356
package/workflows/qa-validate.md +295 -295

package/.claude/commands/update-test.md CHANGED Viewed

@@ -1,58 +1,58 @@
-# Update and Improve Existing Tests
-Audit existing test files and apply targeted improvements. NEVER deletes or rewrites working tests without user approval. Surgical: add, fix, improve -- never replace.
-## Usage
-/update-test <path-to-tests> [--scope fix|improve|add|full]
-- path-to-tests: directory or specific test files to improve
-- --scope: what to do (default: full)
-  - fix: repair broken tests only
-  - improve: upgrade locators, assertions, POM structure
-  - add: add missing test cases without modifying existing
-  - full: audit everything, then improve with approval
-## What It Produces
-- QA_AUDIT_REPORT.md -- current quality assessment
-- Improved test files (after user approval of audit findings)
-## Instructions
-1. Read `CLAUDE.md` -- quality gates, locator tiers, assertion rules, POM rules.
-2. Invoke validator agent in audit mode first:
-Task(
-  prompt="
-    <objective>Audit existing test quality and produce QA_AUDIT_REPORT.md</objective>
-    <execution_context>@agents/qaa-validator.md</execution_context>
-    <files_to_read>
-    - CLAUDE.md
-    </files_to_read>
-    <parameters>
-    user_input: $ARGUMENTS
-    mode: audit
-    </parameters>
-  "
-)
-3. Present audit results and wait for user approval.
-4. Invoke executor agent to apply approved improvements:
-Task(
-  prompt="
-    <objective>Apply approved improvements to existing tests without deleting working tests</objective>
-    <execution_context>@agents/qaa-executor.md</execution_context>
-    <files_to_read>
-    - CLAUDE.md
-    - .qa-output/QA_AUDIT_REPORT.md
-    </files_to_read>
-    <parameters>
-    user_input: $ARGUMENTS
-    mode: update
-    </parameters>
-  "
-)
-$ARGUMENTS
+# Update and Improve Existing Tests
+Audit existing test files and apply targeted improvements. NEVER deletes or rewrites working tests without user approval. Surgical: add, fix, improve -- never replace.
+## Usage
+/update-test <path-to-tests> [--scope fix|improve|add|full]
+- path-to-tests: directory or specific test files to improve
+- --scope: what to do (default: full)
+  - fix: repair broken tests only
+  - improve: upgrade locators, assertions, POM structure
+  - add: add missing test cases without modifying existing
+  - full: audit everything, then improve with approval
+## What It Produces
+- QA_AUDIT_REPORT.md -- current quality assessment
+- Improved test files (after user approval of audit findings)
+## Instructions
+1. Read `CLAUDE.md` -- quality gates, locator tiers, assertion rules, POM rules.
+2. Invoke validator agent in audit mode first:
+Task(
+  prompt="
+    <objective>Audit existing test quality and produce QA_AUDIT_REPORT.md</objective>
+    <execution_context>@agents/qaa-validator.md</execution_context>
+    <files_to_read>
+    - CLAUDE.md
+    </files_to_read>
+    <parameters>
+    user_input: $ARGUMENTS
+    mode: audit
+    </parameters>
+  "
+)
+3. Present audit results and wait for user approval.
+4. Invoke executor agent to apply approved improvements:
+Task(
+  prompt="
+    <objective>Apply approved improvements to existing tests without deleting working tests</objective>
+    <execution_context>@agents/qaa-executor.md</execution_context>
+    <files_to_read>
+    - CLAUDE.md
+    - .qa-output/QA_AUDIT_REPORT.md
+    </files_to_read>
+    <parameters>
+    user_input: $ARGUMENTS
+    mode: update
+    </parameters>
+  "
+)
+$ARGUMENTS

package/.claude/settings.json CHANGED Viewed

@@ -1,20 +1,20 @@
-{
-  "permissions": {
-    "allow": [
-      "Bash(*)",
-      "Read",
-      "Write",
-      "Edit",
-      "Glob",
-      "Grep",
-      "Agent",
-      "WebFetch",
-      "WebSearch",
-      "NotebookEdit",
-      "mcp__playwright__*"
-    ]
-  },
-  "env": {
-    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
-  }
-}
+{
+  "permissions": {
+    "allow": [
+      "Bash(*)",
+      "Read",
+      "Write",
+      "Edit",
+      "Glob",
+      "Grep",
+      "Agent",
+      "WebFetch",
+      "WebSearch",
+      "NotebookEdit",
+      "mcp__playwright__*"
+    ]
+  },
+  "env": {
+    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
+  }
+}

package/.claude/skills/qa-bug-detective/SKILL.md CHANGED Viewed

@@ -1,122 +1,122 @@
----
-name: qa-bug-detective
-description: QA Bug Detective. Runs generated tests and classifies failures as APPLICATION BUG, TEST CODE ERROR, ENVIRONMENT ISSUE, or INCONCLUSIVE with evidence and confidence levels. Use when user wants to run tests and classify results, investigate test failures, determine if failures are bugs or test issues, debug failing tests, triage test results, or understand why tests are failing. Triggers on "run tests", "classify failures", "why is this failing", "test failures", "debug tests", "triage results", "is this a bug or test error", "investigate failures".
----
-# QA Bug Detective
-## Purpose
-Run generated tests and classify every failure into one of four categories with evidence and confidence levels. Auto-fix TEST CODE ERRORS when confidence is HIGH.
-## Classification Decision Tree
-```
-Test fails
-├── Syntax/import error in TEST file?
-│   └── YES → TEST CODE ERROR
-├── Error occurs in PRODUCTION code path?
-│   ├── Known bug / unexpected behavior? → APPLICATION BUG
-│   └── Code works as designed but test expectation wrong? → TEST CODE ERROR
-├── Connection refused / timeout / missing env var?
-│   └── YES → ENVIRONMENT ISSUE
-└── Can't determine?
-    └── INCONCLUSIVE
-```
-## Classification Categories
-### APPLICATION BUG
-- Error manifests in production code (not test code)
-- Stack trace points to src/ or app/ code
-- Behavior contradicts documented requirements or API contracts
-- **Action**: Report only. NEVER auto-fix application code.
-### TEST CODE ERROR
-- Import/require fails (wrong path, missing module)
-- Selector doesn't match current DOM
-- Assertion expects wrong value (test written incorrectly)
-- Missing await, wrong API usage, stale fixture reference
-- **Action**: Auto-fix if HIGH confidence. Report if MEDIUM or lower.
-### ENVIRONMENT ISSUE
-- Connection refused (database, API, external service)
-- Timeout waiting for resource
-- Missing environment variable
-- File/directory not found (test infrastructure)
-- **Action**: Report with suggested resolution steps.
-### INCONCLUSIVE
-- Error is ambiguous
-- Could be multiple root causes
-- Insufficient data to classify
-- **Action**: Report with what's known, request more info.
-## Evidence Requirements
-Every classification MUST include:
-1. **File path**: Exact file where error occurs
-2. **Line number**: Specific line of failure
-3. **Error message**: Complete error text
-4. **Code snippet**: The specific code proving the classification
-5. **Confidence level**: HIGH / MEDIUM-HIGH / MEDIUM / LOW
-6. **Reasoning**: Why this classification, not another
-## Confidence Levels
-| Level | Definition |
-|-------|------------|
-| HIGH | Clear evidence in one direction, no ambiguity |
-| MEDIUM-HIGH | Strong evidence but minor ambiguity |
-| MEDIUM | Evidence points one way but alternatives exist |
-| LOW | Insufficient data, multiple possible causes |
-## Auto-Fix Rules
-Only auto-fix when:
-- Classification = TEST CODE ERROR
-- Confidence = HIGH
-- Fix is mechanical (import path, selector, assertion value, config)
-Fix types:
-- Import path corrections
-- Selector updates (match current DOM/data-testid)
-- Assertion value updates (match current actual behavior)
-- Config fixes (baseURL, timeout values)
-- Missing await keywords
-- Fixture path corrections
-**NEVER auto-fix**: Application bugs, environment issues, anything with confidence < HIGH.
-## Output: FAILURE_CLASSIFICATION_REPORT.md
-```markdown
-# Failure Classification Report
-## Summary
-| Classification | Count | Auto-Fixed | Needs Attention |
-|---------------|-------|-----------|----------------|
-| APPLICATION BUG | N | 0 | N |
-| TEST CODE ERROR | N | N | N |
-| ENVIRONMENT ISSUE | N | 0 | N |
-| INCONCLUSIVE | N | 0 | N |
-## Detailed Analysis
-### Failure 1: [test name]
-- **Classification**: [category]
-- **Confidence**: [level]
-- **File**: [path]:[line]
-- **Error**: [message]
-- **Evidence**: [code snippet + reasoning]
-- **Action Taken**: [auto-fixed / reported]
-- **Resolution**: [what was fixed / what needs human attention]
-```
-## Quality Gate
-- [ ] Every failure classified with evidence
-- [ ] Confidence level assigned to each
-- [ ] No application bugs auto-fixed
-- [ ] Auto-fixes only applied at HIGH confidence
-- [ ] FAILURE_CLASSIFICATION_REPORT.md produced
+---
+name: qa-bug-detective
+description: QA Bug Detective. Runs generated tests and classifies failures as APPLICATION BUG, TEST CODE ERROR, ENVIRONMENT ISSUE, or INCONCLUSIVE with evidence and confidence levels. Use when user wants to run tests and classify results, investigate test failures, determine if failures are bugs or test issues, debug failing tests, triage test results, or understand why tests are failing. Triggers on "run tests", "classify failures", "why is this failing", "test failures", "debug tests", "triage results", "is this a bug or test error", "investigate failures".
+---
+# QA Bug Detective
+## Purpose
+Run generated tests and classify every failure into one of four categories with evidence and confidence levels. Auto-fix TEST CODE ERRORS when confidence is HIGH.
+## Classification Decision Tree
+```
+Test fails
+├── Syntax/import error in TEST file?
+│   └── YES → TEST CODE ERROR
+├── Error occurs in PRODUCTION code path?
+│   ├── Known bug / unexpected behavior? → APPLICATION BUG
+│   └── Code works as designed but test expectation wrong? → TEST CODE ERROR
+├── Connection refused / timeout / missing env var?
+│   └── YES → ENVIRONMENT ISSUE
+└── Can't determine?
+    └── INCONCLUSIVE
+```
+## Classification Categories
+### APPLICATION BUG
+- Error manifests in production code (not test code)
+- Stack trace points to src/ or app/ code
+- Behavior contradicts documented requirements or API contracts
+- **Action**: Report only. NEVER auto-fix application code.
+### TEST CODE ERROR
+- Import/require fails (wrong path, missing module)
+- Selector doesn't match current DOM
+- Assertion expects wrong value (test written incorrectly)
+- Missing await, wrong API usage, stale fixture reference
+- **Action**: Auto-fix if HIGH confidence. Report if MEDIUM or lower.
+### ENVIRONMENT ISSUE
+- Connection refused (database, API, external service)
+- Timeout waiting for resource
+- Missing environment variable
+- File/directory not found (test infrastructure)
+- **Action**: Report with suggested resolution steps.
+### INCONCLUSIVE
+- Error is ambiguous
+- Could be multiple root causes
+- Insufficient data to classify
+- **Action**: Report with what's known, request more info.
+## Evidence Requirements
+Every classification MUST include:
+1. **File path**: Exact file where error occurs
+2. **Line number**: Specific line of failure
+3. **Error message**: Complete error text
+4. **Code snippet**: The specific code proving the classification
+5. **Confidence level**: HIGH / MEDIUM-HIGH / MEDIUM / LOW
+6. **Reasoning**: Why this classification, not another
+## Confidence Levels
+| Level | Definition |
+|-------|------------|
+| HIGH | Clear evidence in one direction, no ambiguity |
+| MEDIUM-HIGH | Strong evidence but minor ambiguity |
+| MEDIUM | Evidence points one way but alternatives exist |
+| LOW | Insufficient data, multiple possible causes |
+## Auto-Fix Rules
+Only auto-fix when:
+- Classification = TEST CODE ERROR
+- Confidence = HIGH
+- Fix is mechanical (import path, selector, assertion value, config)
+Fix types:
+- Import path corrections
+- Selector updates (match current DOM/data-testid)
+- Assertion value updates (match current actual behavior)
+- Config fixes (baseURL, timeout values)
+- Missing await keywords
+- Fixture path corrections
+**NEVER auto-fix**: Application bugs, environment issues, anything with confidence < HIGH.
+## Output: FAILURE_CLASSIFICATION_REPORT.md
+```markdown
+# Failure Classification Report
+## Summary
+| Classification | Count | Auto-Fixed | Needs Attention |
+|---------------|-------|-----------|----------------|
+| APPLICATION BUG | N | 0 | N |
+| TEST CODE ERROR | N | N | N |
+| ENVIRONMENT ISSUE | N | 0 | N |
+| INCONCLUSIVE | N | 0 | N |
+## Detailed Analysis
+### Failure 1: [test name]
+- **Classification**: [category]
+- **Confidence**: [level]
+- **File**: [path]:[line]
+- **Error**: [message]
+- **Evidence**: [code snippet + reasoning]
+- **Action Taken**: [auto-fixed / reported]
+- **Resolution**: [what was fixed / what needs human attention]
+```
+## Quality Gate
+- [ ] Every failure classified with evidence
+- [ ] Confidence level assigned to each
+- [ ] No application bugs auto-fixed
+- [ ] Auto-fixes only applied at HIGH confidence
+- [ ] FAILURE_CLASSIFICATION_REPORT.md produced