npm - @ai-qa/workflow - Versions diffs - 2.0.14 → 2.0.16 - Mend

@ai-qa/workflow 2.0.14 → 2.0.16

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (14) hide show

package/.env +1 -0
package/.github/agents/playwright-test-generator.agent.md +22 -7
package/.github/agents/playwright-test-healer.agent.md +2 -1
package/.github/agents/playwright-test-planner.agent.md +9 -2
package/.qa-workflow.json +2 -1
package/README.md +60 -14
package/ai-qa-workflow.js +2 -1
package/install.js +29 -49
package/opencode.json +8 -8
package/package.json +11 -2
package/playwright.config.ts +20 -0
package/prompts/QAe2eprompt.md +12 -1
package/scripts/executor.js +59 -4
package/scripts/utils.js +1 -1

package/.env ADDED Viewed

	@@ -0,0 +1 @@
1	+ APPLITOOLS_API_KEY="5kKZ0PuYgV6jObYXU7sE102waJHMWNPy4PYJCgZJPjjdI110"

package/.github/agents/playwright-test-generator.agent.md CHANGED Viewed

@@ -27,27 +27,42 @@ mcp-servers:
 You are a Playwright test generator. Create robust E2E tests from test plans.
+## Token Efficiency Rules (CRITICAL)
+- **Read `.qa-context/selectors.json` first** — the Planner already captured selectors. Start from those, do NOT blindly re-explore.
+- **Batch verification pass** — before writing tests, navigate ONCE in a single session to quickly verify all selectors from `selectors.json` are still valid. Use lightweight `locator.isVisible()` checks, NOT full snapshots. Fix broken selectors in `selectors.json` immediately.
+- **If framework is dynamic (Ionic/Angular/etc)** — the DOM re-renders frequently. The verification pass is essential. Read `.qa-workflow.json` → `test.stableSelectors` flag: if `false`, always do a verification pass before writing tests.
+- **Snapshot depth ≤ 3** — `browser_snapshot` with depth 3 max for any navigation.
+- **Skip Applitools if not configured** — if `APPLITOOLS_API_KEY` is not set, skip all visual testing code entirely (no `applitools/check` calls).
 ## Before Starting: Read Context
 1. Read `.qa-context/pipeline.json` to see the current story and phase state
-2. Read `.qa-context/selectors.json` to use the most reliable selectors
-3. Read `.qa-context/heal-history.json` to see what's been flaky before
-4. Read `.qa-context/auth.json` and `.auth/credentials.json` to get login selectors and credentials
+2. Read `.qa-workflow.json` → check `test.stableSelectors` — if `false`, the app uses dynamic rendering (Ionic/Angular/etc), so always run the verification pass
+3. Read `.qa-context/selectors.json` — **this is your primary selector source**, use it as the starting point
+4. Read `.qa-context/heal-history.json` to see what's been flaky before
+5. Read `.qa-context/auth.json` and `.auth/credentials.json` to get login selectors and credentials
 ## Auth Protocol
 1. If auth is configured, generate an `auth.setup.ts` file using `auth-manager.js`'s `generateSetupCode()` output
 2. Configure `playwright.config.ts` to use `projects` with `dependencies: ['./auth.setup.ts']`
 3. Each test should assume the user is already logged in — no login steps in individual tests
+## Selector Verification Pass (Token-Efficient)
+Before writing any test files, do a single batch verification session:
+1. Navigate to each page that has selectors in `selectors.json`
+2. For each selector, run `page.locator(selector).isVisible()` — this is fast and lightweight (no snapshot needed)
+3. If a selector fails, use `browser_snapshot` depth 2 to capture current DOM, then find the new selector
+4. Update `selectors.json` with the corrected selectors immediately
+5. This is **ONE session** for all pages — batch all verifications, do not navigate per test file
 ## After Completing: Update Context
 1. Run: `node ai-qa-workflow.js context generate <story-name>` to mark phase complete
-2. Add any new selectors you discovered to `.qa-context/selectors.json`
+2. Update all verified/corrected selectors in `.qa-context/selectors.json`
 3. Update `docs/application-context.md` with new stable selectors found
 For each test:
 1. Read the test plan scenario
-2. Use Playwright to manually execute steps in real-time
-3. Capture selectors and locators from actual page interaction
-4. Generate the test file using `generator_write_test`
+2. Read selectors from `.qa-context/selectors.json` (already verified in the batch pass)
+3. Generate the test file using `generator_write_test`
 Always prefer: data-testid > aria-label > role > text > xpath (last resort)

package/.github/agents/playwright-test-healer.agent.md CHANGED Viewed

@@ -47,13 +47,14 @@ You are the Playwright Test Healer. Debug and fix failing tests systematically.
 Protocol (token-efficient):
 1. Run only failing tests with `test_run`
-2. Debug with `test_debug` — examine the error state
+2. Debug with `test_debug` — examine the error state (uses snapshots with depth ≤ 3 automatically)
 3. Classify the failure:
    - Selector broken → propose 1-3 line fix
    - Timing issue → propose adding 1 wait
    - App bug → mark `test.fixme()`, log defect, move on
 4. Max 1 fix attempt per test. If still failing after fix, it's a defect.
 5. Never rewrite entire files — targeted edits only.
+6. No screenshots — reference failures by path only; skip `browser_take_screenshot`.
 ## ⛔ Approval Gate
 Before applying any fix, **STOP** and present your diagnosis to the user:

package/.github/agents/playwright-test-planner.agent.md CHANGED Viewed

@@ -36,6 +36,13 @@ mcp-servers:
 You are an expert web test planner. You explore web apps and create detailed test plans.
+## Token Efficiency Rules (CRITICAL)
+- **Explore once, plan everything** — navigate the app in ONE continuous session, record all flows and selectors, then stop. Do NOT re-navigate per scenario.
+- **Snapshot depth ≤ 3** — always use `browser_snapshot` with depth 3 max. Full DOM trees (depth 10+) waste 50-100K tokens per snapshot.
+- **No screenshots during exploration** — reference pages by path only; skip `browser_take_screenshot` unless explicitly requested.
+- **Batch critical paths** — group navigation: explore all pages of the same feature in sequence before moving to the next feature.
+- **Skip non-critical pages** — don't explore every sub-page. Focus on: auth, main flows, edge cases. Skip static content, footers, about pages.
 ## Before Starting: Read Context
 1. Read `.qa-context/pipeline.json` to see what's been done and what's pending
 2. Read `.qa-context/selectors.json` to reuse known stable selectors
@@ -48,14 +55,14 @@ You are an expert web test planner. You explore web apps and create detailed tes
 3. After exploring the login page, save the form structure to `.qa-context/auth.json` using `auth-manager.js`
 4. If the user provides credentials, save them to `.auth/credentials.json` using `auth-manager.js`
-1. Navigate and explore the application
+1. Explore the application in ONE pass — navigate all critical pages, capture selectors, map flows
 2. Map user flows and identify critical paths
 3. Design comprehensive scenarios (happy path, edge cases, error handling)
 4. Save test plan using `planner_save_plan`
 ## After Completing: Update Context
 1. Run: `node ai-qa-workflow.js context plan <story-name>` to mark phase complete
-2. If you discovered new selectors, add them to `.qa-context/selectors.json`
+2. Save all discovered selectors to `.qa-context/selectors.json` — the Generator will reuse these and skip re-navigation
 3. Read `docs/application-context.md` first, then enrich it with new findings
 Each scenario must include: title, steps, expected outcomes, success criteria.

package/.qa-workflow.json CHANGED Viewed

@@ -13,7 +13,8 @@
   "test": {
     "timeout": 120000,
     "retries": 0,
-    "workers": 1
+    "workers": 1,
+    "stableSelectors": true
   },
   "auth": {
     "user": "",

package/README.md CHANGED Viewed

@@ -157,8 +157,16 @@ After installation and running `npm run qa:init`, open the project in your AI ed
 The very first thing you should say to the AI agent:
-> **"Read router.md and follow the QA workflow for my-story.md"**
+> **"Run the environment check and show me the status report"**
+The AI will check all 10 preconditions and report what's ready ✅ and what's missing ❌.
+Then wait for your instructions.
+### Option B — Go straight to QA workflow (if you already have a user story)
+> **"Read router.md and follow the QA workflow for user-story/my-story.md"**
+The AI will run the environment check (if not done), then proceed through Pla
 (Replace `my-story.md` with the name of your user story file in `user-story/`.)
 > **📖 Need more prompts?** See `prompting_template.md` for the full conversation script — approval responses, healing prompts, report prompts, and an example session.
@@ -259,10 +267,11 @@ The AI updates `docs/application-context.md` with:
 | Component | Needed for | Install |
 |-----------|-----------|---------|
 | **Node.js 18+** | Running the pipeline | — |
-| **Playwright** | Test execution | `npm install @playwright/test` |
+| **Playwright** | Test execution | Pre-installed via `npm install` |
 | **Chromium** | Running tests | `npx playwright install chromium` |
-| **Playwright MCP** | AI browser automation | `npm install -D @playwright/mcp` |
-| **Applitools MCP** | Visual testing (screenshot comparison) | `npm install -D @applitools/mcp` + `APPLITOOLS_API_KEY` |
+| **Playwright MCP** | AI browser automation | Pre-installed via `npm install` |
+| **Applitools Eyes** | Visual testing + MCP | Pre-installed via `npm install` + `APPLITOOLS_API_KEY` |
+| **Allure** | Rich test reports | Pre-installed via `npm install` |
 | **GitHub MCP** | AI creating PRs/issues | `npm install -D @modelcontextprotocol/server-github` + `GITHUB_TOKEN` |
 ### Install into a project
@@ -282,21 +291,25 @@ node install.js ../my-project --yes
 npx @ai-qa/workflow update --yes
 ```
+> **Note:** The installer creates a `.env` template (`APPLITOOLS_API_KEY=""` and `GITHUB_TOKEN=""`...) only on fresh installs. If the `.env` file was not created (e.g., during an update), create it manually in your project root with those two keys.
 ---
 ## Visual Testing (Applitools)
-The template supports **Applitools MCP** for automated visual testing.
+The template supports **Applitools Eyes** for automated visual testing via two components:
+- `@applitools/mcp` — MCP server for AI-driven visual testing
+- `@applitools/eyes-playwright` — Playwright integration for test-level visual assertions
+Both are pre-installed in `package.json` — no extra install needed.
 If `APPLITOOLS_API_KEY` is configured in your environment, the AI agent automatically adds visual checkpoints to critical pages during test generation. It captures screenshots of pages like login, dashboard, and checkout, and compares them against baselines to detect visual regressions.
 ### Setup
-```bash
-# 1. Install Applitools MCP
-npm install -D @applitools/mcp
+Set your API key (get it from https://applitools.com):
-# 2. Set your API key (get it from https://applitools.com)
+```bash
 # Option A: Export in terminal
 export APPLITOOLS_API_KEY=votre_clé_ici
@@ -306,6 +319,14 @@ echo "APPLITOOLS_API_KEY=votre_clé_ici" >> .env
 The AI will detect the key during its environment check and use Applitools automatically. If the key is not set, visual testing is skipped entirely — no errors, no blocks.
+## Allure Reports
+Allure test reports are pre-configured:
+- `allure-playwright` reporter generates raw results during `npm run qa:execute`
+- The report is auto-generated as HTML after each test run
+- View via `npm run dashboard` or open `allure-report/index.html`
+- Manual regeneration: `npm run qa:report:allure`
 ---
 ## Commands
@@ -563,8 +584,33 @@ The AI never:
 ## Token Efficiency
-1. Body text assertions over complex selectors
-2. Screenshots off by default (only on failure)
-3. Healer makes 1 fix attempt max per test
-4. Real bugs classified immediately — no retries
-5. AI edits 1-3 lines maximum per fix
+The framework is designed to minimize token usage during AI-driven testing. These rules are embedded in the agent definitions and prompts:
+### Browser Navigation Rules
+- **Planner explores once** — navigates the app in ONE session and saves all selectors to `.qa-context/selectors.json`.
+- **Verification pass (not re-exploration)** — the Generator reads `selectors.json`, then does a single lightweight batch session to verify selectors via `locator.isVisible()` checks (not full snapshots). This catches stale selectors from dynamic frameworks (Ionic, Angular) without re-exploring every page.
+- **`test.stableSelectors` flag** — set to `false` in `.qa-workflow.json` for dynamic frameworks. The Generator always runs the verification pass when this is `false`.
+- **Snapshot depth ≤ 3** — `browser_snapshot` capped at depth 3. Full DOM trees (depth 10+) can consume 50-100K tokens per snapshot.
+- **Batch all navigation** — all page visits happen in one session per phase, never per scenario.
+- **Skip non-critical pages** — static content, footers, about/legal pages are skipped.
+- **No screenshots during exploration** — pages referenced by path only.
+### Selector Reuse
+- Planner captures selectors once → saved to `.qa-context/selectors.json`
+- Generator runs a **single batch verification pass** — lightweight `isVisible()` checks, not navigation per test
+- Healer reads existing selectors + healing history → avoids re-discovering failed locators
+- Every discovered or corrected selector is saved so no agent ever re-explores the same element
+### Execution & Healing
+- Max 1 fix attempt per test — no endless retries
+- Failures classified immediately (selector / timing / bug) — no ambiguous loops
+- Targeted 1-3 line edits only — never rewrite entire files
+- `test.fixme()` marks defects — no retrying known bugs
+### Context Management
+- Error messages truncated to 200 chars
+- Files cached in memory between reads
+- Screenshots referenced by path, not embedded
+- Directory listings done once per pipeline run
+For details, see the agent files in `.github/agents/` and the `TOKEN & EFFICIENCY RULES` section in `prompts/QAe2eprompt.md`.

package/ai-qa-workflow.js CHANGED Viewed

@@ -118,7 +118,8 @@ function cmdExecute() {
     console.log(`\n  Next step: Auto-heal failures:`);
     console.log(`  node ai-qa-workflow.js heal ${result.runId}`);
   } else {
-    console.log(`\n  Next step: Generate report:`);
+    console.log(`\n  ✓ Allure report auto-generated: allure-report/index.html`);
+    console.log(`  Next step: Generate detailed report:`);
     console.log(`  node ai-qa-workflow.js report ${result.runId}`);
   }
 }

package/install.js CHANGED Viewed

@@ -24,6 +24,7 @@ const USER_DIRS = new Set([
   'test-results',
   '.qa-context',
   '.auth',
+  '.env'
 ]);
 // Template files to copy/update
@@ -42,6 +43,10 @@ const QA_ITEMS = [
      { src: '.github/copilot-instructions.md', dest: '.github/copilot-instructions.md' },
       { src: 'router.md', dest: 'router.md' },
       { src: 'opencode.json', dest: 'opencode.json' },
+    { src: 'playwright.config.ts', dest: 'playwright.config.ts' },
+    { src: 'cli.js', dest: 'cli.js' },
+    { src: 'package.json', dest: 'package.json' },
+    { src: 'install.js', dest: 'install.js' },
     ];
@@ -53,23 +58,6 @@ const UPDATE_ITEMS = QA_ITEMS.filter(item => {
 const DIRS_TO_CREATE = ['user-story', 'specs', 'tests', 'test-results', '.qa-context', '.auth', 'templates'];
-const NPM_SCRIPTS = {
-  'qa': 'node ai-qa-workflow.js',
-  'qa:init': 'node ai-qa-workflow.js init',
-  'qa:plan': 'node ai-qa-workflow.js plan',
-  'qa:generate': 'node ai-qa-workflow.js generate',
-  'qa:execute': 'node ai-qa-workflow.js execute',
-  'qa:heal': 'node ai-qa-workflow.js heal',
-  'qa:retry': 'node ai-qa-workflow.js heal',
-  'qa:report': 'node ai-qa-workflow.js report',
-  'qa:report:allure': 'node ai-qa-workflow.js report:allure',
-  'qa:status': 'node ai-qa-workflow.js status',
-  'qa:list': 'node ai-qa-workflow.js list',
-  'dashboard': 'cd qa-dashboard && npm start',
-  'dashboard:dev': 'cd qa-dashboard && npx nodemon app.js',
-  'dashboard:stop': 'npx kill-port 4000',
-};
 const BANNER = `
   ╔══════════════════════════════════════════╗
   ║    AI QA Pipeline Installer v2.0         ║
@@ -106,21 +94,6 @@ function countFiles(dir) {
   return count;
 }
-function addNpmScripts(pkgPath, overwrite) {
-  if (!fs.existsSync(pkgPath)) return;
-  let pkg = JSON.parse(fs.readFileSync(pkgPath, 'utf-8'));
-  if (!pkg.scripts) pkg.scripts = {};
-  let changed = 0;
-  for (const [key, val] of Object.entries(NPM_SCRIPTS)) {
-    if (!pkg.scripts[key] || overwrite) {
-      pkg.scripts[key] = val;
-      changed++;
-    }
-  }
-  fs.writeFileSync(pkgPath, JSON.stringify(pkg, null, 2) + '\n');
-  return changed;
-}
 function ask(query) {
   const rl = require('readline').createInterface({ input: process.stdin, output: process.stdout });
   return new Promise(resolve => rl.question(query, a => { rl.close(); resolve(a.toLowerCase()); }));
@@ -162,12 +135,17 @@ async function install(targetPath, mode) {
     }
   }
-  // 3. Add/update npm scripts
-  console.log(`\n  ── Step 3: NPM Scripts ──`);
-  const pkgPath = path.join(targetPath, 'package.json');
-  const changed = addNpmScripts(pkgPath, isUpdate);
-  if (changed > 0) console.log(`  ✓ ${isUpdate ? 'Updated' : 'Added'} ${changed} npm scripts (qa:*, dashboard)`);
-  else console.log(`  • Scripts already configured`);
+  // 3. Create .env template (fresh install only)
+  console.log(`\n  ── Step 3: Environment ──`);
+  if (!isUpdate) {
+    const envPath = path.join(targetPath, '.env');
+    if (!fs.existsSync(envPath)) {
+      fs.writeFileSync(envPath, 'APPLITOOLS_API_KEY=""\nGITHUB_TOKEN=""\n');
+      console.log(`  ✓ .env (template created)`);
+    } else {
+      console.log(`  • .env (exists, kept as-is)`);
+    }
+  }
   // 4. Dashboard
   const dashboardSrc = path.join(TEMPLATE_DIR, 'qa-dashboard');
@@ -211,13 +189,13 @@ async function install(targetPath, mode) {
     }
   }
-  // 6. Install Playwright (fresh only)
+  // 6. Install all dependencies (fresh only)
   if (!isUpdate) {
     console.log(`\n  ── Step 6: Dependencies ──`);
     if (!fs.existsSync(path.join(targetPath, 'node_modules', '@playwright'))) {
-      console.log(`  → Installing @playwright/test...`);
-      try { execSync('npm install @playwright/test', { cwd: targetPath, stdio: 'pipe', timeout: 120000 }); console.log(`  ✓ Playwright installed`); } catch (e) { console.log(`  ⚠  npm install failed: npm install @playwright/test`); }
-    } else console.log(`  • Playwright already installed`);
+      console.log(`  → Installing all dependencies (Playwright, Allure, Applitools)...`);
+      try { execSync('npm install', { cwd: targetPath, stdio: 'pipe', timeout: 180000 }); console.log(`  ✓ All dependencies installed`); } catch (e) { console.log(`  ⚠  npm install failed: npm install`); }
+    } else console.log(`  • Dependencies already installed`);
   }
   // Summary
@@ -233,9 +211,11 @@ async function install(targetPath, mode) {
     console.log(`  Restart the dashboard if it was running.\n`);
   } else {
     console.log(`  Files: ~${totalFiles} scripts + ${dashboardCount} dashboard files`);
+    console.log(`  Dependencies: @playwright/test, allure-playwright, @applitools/eyes-playwright, @applitools/mcp`);
     console.log(`  Next:\n`);
+    console.log(`  npm install          Install all dependencies`);
     console.log(`  npm run qa:init      Initialize pipeline (config + dirs + auth)`);
-    console.log(`  npm run qa:execute   Run Playwright tests`);
+    console.log(`  npm run qa:execute   Run Playwright tests (auto-generates Allure report)`);
     console.log(`  npm run qa:status    Check pipeline state`);
     console.log(`  npm run dashboard    Start dashboard (port 4000)\n`);
   }
@@ -279,12 +259,12 @@ async function main() {
   // Parse flags and target
   const nonFlagArgs = args.filter(a => !a.startsWith('-'));
-  if (IS_SELF || args.includes('init')) {
-    targetPath = process.cwd();
-    mode = 'install';
-  } else if (args.includes('update')) {
-    targetPath = process.cwd();
-    mode = 'update';
+  if (args.includes('update')) {
+      targetPath = process.cwd();
+      mode = 'update';
+  } else if (IS_SELF || args.includes('init')) {
+      targetPath = process.cwd();
+      mode = 'install';
   } else if (IS_UPDATE) {
     targetPath = path.resolve(nonFlagArgs[0] || process.cwd());
     mode = 'update';

package/opencode.json CHANGED Viewed

@@ -14,14 +14,14 @@
       "description": "GitHub integration - PRs, issues, commits. Requires GITHUB_TOKEN env var. Install: npm install -D @modelcontextprotocol/server-github"
     },
      "applitools-mcp": {
-      "type": "local",
-      "command": ["npx", "-y", "@applitools/mcp@latest"],
-      "enabled": true,
-       "description": "Visual testing - requires APPLITOOLS_API_KEY env var",
-       "env": {
-    "APPLITOOLS_API_KEY": "${APPLITOOLS_API_KEY}"
-  }
-    }
+       "type": "local",
+       "command": ["npx", "-y", "@applitools/mcp@latest"],
+       "enabled": true,
+        "description": "Visual testing via Applitools Eyes - requires APPLITOOLS_API_KEY env var. Installed during npm setup: @applitools/mcp + @applitools/eyes-playwright",
+        "env": {
+     "APPLITOOLS_API_KEY": "${APPLITOOLS_API_KEY}"
+   }
+     }
   },
   "agent": {
     "qa-planner": {

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@ai-qa/workflow",
-  "version": "2.0.14",
+  "version": "2.0.16",
   "description": "AI QA Workflow Template — transforms any AI agent into an autonomous QA engineer. AI explores, plans, generates tests, and heals. Scripts execute and report.",
   "keywords": [
     "qa",
@@ -23,6 +23,7 @@
     "ai-qa-workflow.js",
     "cli.js",
     "install.js",
+    "playwright.config.ts",
     "README.md",
     "PROJECT_GUIDE.md",
     "prompting_template.md",
@@ -32,7 +33,8 @@
     "opencode.json",
     ".qa-workflow.json",
     "router.md",
-    ".github/"
+    ".github/",
+    ".env"
   ],
   "license": "MIT",
   "author": "AI QA Workflow",
@@ -58,5 +60,12 @@
     "dashboard": "cd qa-dashboard && npm start",
     "dashboard:dev": "cd qa-dashboard && npx nodemon app.js",
     "dashboard:stop": "npx kill-port 4000"
+  },
+  "devDependencies": {
+    "@playwright/test": "^1.52.0",
+    "allure-playwright": "^3.2.1",
+    "allure-commandline": "^2.33.0",
+    "@applitools/eyes-playwright": "^1.42.0",
+    "@applitools/mcp": "^1.5.0"
   }
 }

package/playwright.config.ts ADDED Viewed

@@ -0,0 +1,20 @@
+import { defineConfig, devices } from '@playwright/test';
+export default defineConfig({
+  testDir: './tests',
+  fullyParallel: true,
+  forbidOnly: !!process.env.CI,
+  retries: process.env.CI ? 2 : 0,
+  workers: process.env.CI ? 1 : undefined,
+  use: {
+    baseURL: process.env.APP_URL || 'http://localhost:3000',
+    trace: 'on-first-retry',
+    screenshot: 'only-on-failure',
+  },
+  projects: [
+    {
+      name: 'chromium',
+      use: { ...devices['Desktop Chrome'] },
+    },
+  ],
+});

package/prompts/QAe2eprompt.md CHANGED Viewed

@@ -475,6 +475,18 @@ Use this knowledge to improve future test generation and healing.
 ---------------------------------------------------
 ## TOKEN & EFFICIENCY RULES (CRITICAL)
+### Browser Navigation Rules
+- **Explore once, plan everything** — navigate the app in ONE continuous session per phase. Do NOT re-navigate per scenario.
+- **Snapshot depth ≤ 3** — always use `browser_snapshot` with max depth 3. Full DOM trees (depth 10+) waste 50-100K tokens per snapshot.
+- **Batch all navigation** — group page visits by feature. Visit all pages of a feature in sequence, then move to the next.
+- **Skip non-critical pages** — static content, footers, about pages, legal pages. Focus on auth, main flows, edge cases.
+- **No screenshots during exploration or test logic** — reference pages by path only. Screenshots only on test failure via Playwright config.
+### Selector Reuse (Biggest Token Saver)
+- **Generator reads `selectors.json` first** — the Planner already captured selectors. The Generator writes tests without re-navigating.
+- **Only navigate if selectors are missing** — if `.qa-context/selectors.json` is empty for a page, navigate ONCE (depth ≤ 3), save selectors, then proceed.
+- **Save every discovered selector** — update `.qa-context/selectors.json` so no agent ever re-discovers the same element.
 ### Execution Rules
 - **MAX 1 healing attempt per test** — never loop more than once
 - **Classify failures immediately**: selector issue → fix; app bug → mark `test.fixme()` + document defect
@@ -484,7 +496,6 @@ Use this knowledge to improve future test generation and healing.
 ### Code Generation Rules
 - **Never rewrite entire files** — use targeted edits only (change 1-3 lines max)
 - **Use body text checks**: `page.textContent('body')` instead of fragile element selectors
-- **No screenshots during test logic** — capture only on failure via Playwright config
 - **One assertion per logical check** — no redundant assertions
 - **Skip image-heavy tests** if not critical to the feature
 - **Reuse existing helper functions** — don't create new patterns

package/scripts/executor.js CHANGED Viewed

@@ -1,4 +1,4 @@
-const { DIRS, CONFIG, ensureDir, timestamp, log, writeMarkdown } = require('./utils');
+const { ROOT, DIRS, CONFIG, ensureDir, timestamp, log, writeMarkdown } = require('./utils');
 const context = require('./context-manager');
 const path = require('path');
 const fs = require('fs');
@@ -35,7 +35,12 @@ function executeTests(testName, options = {}) {
     }
   }
-  args.push('--reporter', 'list,json');
+  const allureAvailable = (() => {
+    try { require.resolve('allure-playwright/package.json'); return true; } catch { return false; }
+  })();
+  const reporters = ['list', 'json'];
+  if (allureAvailable) reporters.push('allure-playwright');
+  args.push('--reporter', reporters.join(','));
   if (headed) args.push('--headed');
   if (retries > 0) args.push('--retries', retries.toString());
@@ -52,7 +57,7 @@ function executeTests(testName, options = {}) {
   try {
     const stdout = execSync(`npx ${args.join(' ')}`, {
-      cwd: DIRS.ROOT,
+      cwd: ROOT,
       encoding: 'utf-8',
       timeout: 300000,
       maxBuffer: 10 * 1024 * 1024,
@@ -61,6 +66,8 @@ function executeTests(testName, options = {}) {
     fs.writeFileSync(outputPath, stdout, 'utf-8');
+    const passedTests = extractPassedTests(stdout);
     const result = {
       runId,
       story: storyName,
@@ -71,7 +78,7 @@ function executeTests(testName, options = {}) {
       timestamp: new Date().toISOString(),
       output: stdout.substring(0, 50000),
       failedTests: [],
-      passedTests: [],
+      passedTests,
     };
     writeMarkdown(resultPath, JSON.stringify(result, null, 2));
@@ -82,6 +89,8 @@ function executeTests(testName, options = {}) {
     }
     log('EXECUTOR', `Tests passed (${result.duration}ms)`);
+    generateAllureReport();
     return result;
   } catch (err) {
     const stderr = err.stderr || '';
@@ -91,6 +100,7 @@ function executeTests(testName, options = {}) {
     fs.writeFileSync(outputPath, combinedOutput, 'utf-8');
     const failedTests = extractFailedTests(combinedOutput);
+    const passedTests = extractPassedTests(combinedOutput);
     const result = {
       runId,
@@ -103,6 +113,7 @@ function executeTests(testName, options = {}) {
       error: err.message,
       output: combinedOutput.substring(0, 50000),
       failedTests,
+      passedTests,
     };
     writeMarkdown(resultPath, JSON.stringify(result, null, 2));
@@ -113,6 +124,8 @@ function executeTests(testName, options = {}) {
     }
     log('EXECUTOR', `Tests failed (${result.duration}ms) - ${failedTests.length} failure(s)`);
+    generateAllureReport();
     return result;
   }
 }
@@ -148,6 +161,48 @@ function extractFailedTests(output) {
   return failed;
 }
+function extractPassedTests(output) {
+  const passed = [];
+  const lines = output.split('\n');
+  for (const line of lines) {
+    const passMatch = line.match(/\s+√\s+\d+\)\s+\[(.+?)\]\s+(.+)/);
+    if (passMatch) {
+      passed.push({ file: passMatch[1], test: passMatch[2].trim() });
+      continue;
+    }
+    const passSimple = line.match(/\s+√\s+(.+)/);
+    if (passSimple && !line.includes('ms')) {
+      passed.push({ test: passSimple[1].trim() });
+    }
+  }
+  return passed;
+}
+function generateAllureReport() {
+  const allureResults = DIRS.allureResults;
+  const allureReport = path.join(ROOT, 'allure-report');
+  if (!fs.existsSync(allureResults) || fs.readdirSync(allureResults).length === 0) return;
+  log('EXECUTOR', 'Auto-generating Allure report...');
+  try {
+    const { execSync } = require('child_process');
+    execSync(`npx allure generate "${allureResults}" --clean -o "${allureReport}"`, {
+      cwd: ROOT,
+      encoding: 'utf-8',
+      timeout: 60000,
+      stdio: 'pipe',
+    });
+    log('EXECUTOR', `Allure report ready: allure-report/index.html`);
+  } catch (err) {
+    // Allure CLI not available — non-fatal
+    log('EXECUTOR', 'Allure CLI not available, skipping HTML report generation');
+  }
+}
 if (require.main === module) {
   const testName = process.argv[2];
   const headed = process.argv.includes('--headed');

package/scripts/utils.js CHANGED Viewed

@@ -111,7 +111,7 @@ function loadConfig() {
   const defaults = {
     project: { name: 'my-project', description: '', url: 'http://localhost:3000', environment: 'Development' },
     browser: { type: 'chromium', cdpPort: 9222, headed: false },
-    test: { timeout: 120000, retries: 0, workers: 1 },
+    test: { timeout: 120000, retries: 0, workers: 1, stableSelectors: true },
     auth: { user: '', credentials: {} },
   };