qfai 1.8.3 → 1.8.4

@@ -30,8 +30,8 @@ Do not rely on a CLI entrypoint or package runtime loop.
  ## CRITICAL CONSTRAINTS (Read First)
 
  - Scope is all specs from `.qfai/specs/spec-*`.
- - The AI evaluator sub-agent performs visual evaluation. QFAI does not score visual quality. (spec-0012)
- - Playwright CLI (`playwright-cli`) is the sole standard browser tool. Playwright MCP, Node Playwright direct invocation, and screenshot-capture shell scripts are not used. (spec-0012)
+ - The AI evaluator sub-agent performs visual evaluation. QFAI does not score visual quality. (per the resolved primary prototyping spec — see `qfai prototyping show-spec`)
+ - Playwright CLI (`playwright-cli`) is the sole standard browser tool. Playwright MCP, Node Playwright direct invocation, and screenshot-capture shell scripts are not used. (per the resolved primary prototyping spec)
  - QFAI pre-assigns evidence paths. The evaluator MUST use the paths in the command plan (`review-bundle.json` → `command-plans.json`); it MUST NOT invent paths.
  - For every declared screen and every active candidate in every round, 4 evidence artifacts are mandatory:
  - screenshot: `.qfai/evidence/prototyping/rounds/<round>/candidates/<candidate-id>/<screen-id>.png`
@@ -42,7 +42,7 @@ Do not rely on a CLI entrypoint or package runtime loop.
  - Canonical latest HTML path: `.qfai/evidence/prototyping/html/<screen-id>.html`
  - Canonical latest paths MUST mirror the latest accepted winner/polish artifacts.
  - If any of the 4 artifacts is missing for a declared screen, the round is incomplete; rerun is mandatory, not waiver.
- - Mode differences are limited to `maxCycles` only (low-cost=1, standard=3, full-harness=20). Every other gate, obligation, reviewer severity, and completion criterion is identical across modes. (spec-0012)
+ - Mode differences are limited to `maxCycles` only (low-cost=1, standard=3, full-harness=20). Every other gate, obligation, reviewer severity, and completion criterion is identical across modes. (per the resolved primary prototyping spec)
  - DONE is forbidden until `qfai validate --profile prototyping --fail-on error` passes and `/qfai-verify` can approve the run.
  - Supported UI prototyping surfaces are `web`, `mobile`, `desktop`, and `mixed`.
  - `cli`, API-only, backend-only, and `ui_bearing: false` classifications are not prototyping execution targets.
@@ -60,7 +60,7 @@ Generate multiple design directions, converge on a winner, extract the selected
  ## Surface / Mode
 
  - surface / mode routing uses `standard` as the default execution path.
- - **Mode Invariant (spec-0012)**: modes differ only by `maxCycles`. Review gate, evidence requirements, reviewer severity, best-of-history, breakthrough detection, and completion criteria are identical across modes.
+ - **Mode Invariant**: modes differ only by `maxCycles`. Review gate, evidence requirements, reviewer severity, best-of-history, breakthrough detection, and completion criteria are identical across modes.
  - `low-cost`: `maxCycles = 1`
  - `standard`: `maxCycles = 3` (default)
  - `full-harness`: `maxCycles = 20`
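
The mode invariant in this hunk reduces to a single lookup table. A minimal Python sketch, assuming hypothetical names (`MAX_CYCLES`, `DEFAULT_MODE`, `resolve_max_cycles`) that are not part of QFAI; only the three mode names, their `maxCycles` values, and the `standard` default come from the text above:

```python
# Hypothetical illustration; not QFAI's actual implementation.
# Mode names and budgets are taken from the diffed skill text.
MAX_CYCLES = {"low-cost": 1, "standard": 3, "full-harness": 20}
DEFAULT_MODE = "standard"


def resolve_max_cycles(mode=None):
    """Return the cycle budget for a mode, falling back to the default mode."""
    selected = DEFAULT_MODE if mode is None else mode
    if selected not in MAX_CYCLES:
        raise ValueError(f"unknown prototyping mode: {selected!r}")
    return MAX_CYCLES[selected]
```

Because the budget is the only per-mode value, every other gate can stay outside this table, which is what the invariant requires.
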
@@ -70,7 +70,12 @@ Generate multiple design directions, converge on a winner, extract the selected
 
  Read and follow these references before execution:
 
- - `.qfai/specs/spec-0012/01_Spec.md` — primary SSOT for the prototyping harness
+ - **Primary SSOT for the prototyping harness**: resolve at runtime by running
+ `qfai prototyping show-spec` from the repo root. The output gives you the
+ resolved spec ID and `01_Spec.md` path (configured via
+ `qfai.config.yaml: prototyping.primarySpecId`, or auto-detected via the
+ `surface_type: ui-bearing` marker in `01_Spec.md`). Do not assume any
+ particular spec ID exists — read whatever `show-spec` returns.
  - `.qfai/assistant/skills/qfai-prototyping/references/evidence-requirements.md`
  - `.qfai/assistant/skills/qfai-prototyping/references/iteration-cycle.md`
  - `.qfai/assistant/skills/qfai-prototyping/references/l1-review-guide.md`
@@ -161,7 +166,7 @@ Before launching the evaluator, prepare the round-scoped artifacts via QFAI (not
  - QFAI produces:
  - `.qfai/evidence/prototyping/rounds/<rN>/command-plans.json` — the candidate-aware Playwright CLI command plans
  - `.qfai/evidence/prototyping/rounds/<rN>/review-bundle.json` — the evaluator input bundle (candidates, axisDefs, designSystemChecklist, commandPlanRef)
- - Do not invent evidence paths. Paths are fixed by QFAI per spec-0012.
+ - Do not invent evidence paths. Paths are fixed by QFAI per the resolved primary prototyping spec.
 
  ### Step 5 — AI Evaluator Executes the Command Plans and Captures Evidence
 
@@ -220,7 +225,7 @@ At least one full post-selection polish loop is mandatory. Each polish loop must
 
  ## Cycle Gate
 
- - Completion requires at least one `polish` cycle after winner selection (spec-0012). This applies to all modes.
+ - Completion requires at least one `polish` cycle after winner selection (per the resolved primary prototyping spec). This applies to all modes.
  - The same gate applies in every mode; modes differ only in `maxCycles` (low-cost=1, standard=3, full-harness=20).
  - If the polish-cycle budget is exhausted before the gate is satisfied, the run does NOT complete. The evaluator returns `REVISE` and the developer may re-run at a higher mode.
  - Any phase transition to completion must pass through the cycle gate and the reviewer gate.
@@ -47,7 +47,7 @@ Optional evidence is not allowed.
 
  ## Mode invariant
 
- Evidence requirements are identical for all modes (low-cost / standard / full-harness) per spec-0012. Modes differ only by `maxCycles` (1 / 3 / 20). Choosing a lower mode does NOT reduce evidence obligations.
+ Evidence requirements are identical for all modes (low-cost / standard / full-harness) per the resolved primary prototyping spec (run `qfai prototyping show-spec` to see which one applies). Modes differ only by `maxCycles` (1 / 3 / 20). Choosing a lower mode does NOT reduce evidence obligations.
 
  ## Validate gate expectations
 
@@ -1,6 +1,6 @@
  # Reviewer Gate
 
- The reviewer is an independent gate, not the implementation author. The reviewer gate applies identically to all modes (spec-0012); modes differ only in `maxCycles`.
+ The reviewer is an independent gate, not the implementation author. The reviewer gate applies identically to all modes (per the resolved primary prototyping spec); modes differ only in `maxCycles`.
 
  ## Reviewer must verify
 
@@ -22,6 +22,8 @@ hard_floors:
  min_score: 80
  - id: conceptFit
  min_score: 85
+ - id: originality
+ min_score: 80
  absorbable_categories:
  - layout
  - interaction
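
Each `hard_floors` entry in this hunk pairs an axis `id` with a `min_score`. One plausible reading, sketched in Python: a floor fails when the axis score is missing or below `min_score`. The function name, the shape of the score dictionary, and the comparison rule itself are illustrative assumptions, not confirmed by the package; only the two floors whose `id` is visible in the hunk are listed.

```python
# Hypothetical illustration of checking scores against hard floors.
# Only the floors whose id appears in this hunk are included.
HARD_FLOORS = [
    {"id": "conceptFit", "min_score": 85},
    {"id": "originality", "min_score": 80},  # added in 1.8.4
]


def failing_floors(scores, floors=HARD_FLOORS):
    """Return the ids of floors whose score is missing or below min_score."""
    failed = []
    for floor in floors:
        score = scores.get(floor["id"])
        # Assumption: a missing score counts as a failure, not a pass.
        if score is None or score < floor["min_score"]:
            failed.append(floor["id"])
    return failed
```

Under these assumptions, `failing_floors({"conceptFit": 90, "originality": 79})` returns `["originality"]`, so a candidate that passed every floor in 1.8.3 could fail in 1.8.4 on originality alone.
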
@@ -125,9 +125,7 @@ screens:
 
  ## Example
 
- - Copy-ready repository sample:
- `docs/examples/ui-contract.good.yaml`
- - Also available from prototyping skill template:
+ - Copy-ready sample bundled with this package:
  `../../assistant/skills/qfai-prototyping/templates/contracts/ui-0001-order-mockable.yaml`
 
  ## FAQ (Typical failures)