@ryuenn3123/agentic-senior-core 3.0.17 → 3.0.19

Files changed (34)
  1. package/.agent-context/prompts/bootstrap-design.md +16 -7
  2. package/.agent-context/rules/frontend-architecture.md +5 -5
  3. package/.agent-context/state/memory-continuity-benchmark.json +1 -1
  4. package/.cursorrules +1 -1
  5. package/.gemini/instructions.md +1 -1
  6. package/.github/copilot-instructions.md +1 -1
  7. package/.instructions.md +1 -1
  8. package/.windsurfrules +1 -1
  9. package/AGENTS.md +1 -1
  10. package/lib/cli/project-scaffolder/design-contract.mjs +363 -314
  11. package/lib/cli/project-scaffolder/prompt-builders.mjs +28 -22
  12. package/lib/cli/project-scaffolder/storage.mjs +0 -2
  13. package/package.json +2 -2
  14. package/scripts/frontend-usability-audit.mjs +19 -8
  15. package/scripts/mcp-server/constants.mjs +60 -0
  16. package/scripts/mcp-server/tool-registry.mjs +149 -0
  17. package/scripts/mcp-server/tools.mjs +446 -0
  18. package/scripts/mcp-server.mjs +23 -661
  19. package/scripts/release-gate/audit-checks.mjs +426 -0
  20. package/scripts/release-gate/constants.mjs +53 -0
  21. package/scripts/release-gate/runtime.mjs +63 -0
  22. package/scripts/release-gate/static-checks.mjs +182 -0
  23. package/scripts/release-gate.mjs +12 -793
  24. package/scripts/ui-design-judge/constants.mjs +24 -0
  25. package/scripts/ui-design-judge/design-execution-summary.mjs +233 -0
  26. package/scripts/ui-design-judge/git-input.mjs +131 -0
  27. package/scripts/ui-design-judge/prompting.mjs +73 -0
  28. package/scripts/ui-design-judge/providers.mjs +102 -0
  29. package/scripts/ui-design-judge/reporting.mjs +181 -0
  30. package/scripts/ui-design-judge/rubric-calibration.mjs +211 -0
  31. package/scripts/ui-design-judge/rubric-goldset.json +188 -0
  32. package/scripts/ui-design-judge.mjs +105 -774
  33. package/scripts/ui-rubric-calibration.mjs +35 -0
  34. package/scripts/validate/config.mjs +69 -16
package/.agent-context/prompts/bootstrap-design.md CHANGED
@@ -34,9 +34,12 @@ The agent must:
  18. Design continuity is opt-in. If the user does not explicitly ask for continuity with an older system, prefer fresh synthesis from the current repo and brief.
  19. Accessibility must be split into a hard compliance floor and an advisory readability layer. Use WCAG 2.2 AA as the blocking baseline, and use APCA only as advisory perceptual tuning. APCA must never waive a WCAG failure.
  20. Accessibility planning must cover more than color contrast. It must explicitly address focus visibility, focus appearance, target size, keyboard access, accessible authentication, and dynamic state/status access.
- 21. Hybrid visual QA must stay deterministic-first. Define screenshot baseline expectations, dynamic-content masking rules, stability thresholds, viewport coverage, and long-page capture strategy before escalating any visual drift to a semantic judge.
- 22. Do not assume one screenshot is enough for a long page. Require above-fold capture, full-page capture when stable, and anchor-based section or tiled-scroll captures when deep content would otherwise be missed.
- 23. Deterministic visual QA must distinguish rendering noise from meaningful layout or styling drift. If a semantic judge is used later, it should only review diffs that already exceeded the deterministic threshold or lost required coverage.
+ 21. Structured design execution must stay representation-first. Define a surface plan, component graph, content-priority map, viewport mutation plan, and interaction-state matrix before relying on semantic review.
+ 22. Do not depend on screenshot capture, browser automation, or image diff artifacts as the default path. The contract must be strong enough to guide precise UI from repo evidence, component logic, and user intent alone.
+ 23. Semantic review should judge contract fidelity, distinctiveness, hierarchy, state behavior, and viewport mutation directly from the contract and changed UI code.
+ 24. Distinctive design review must use a stable review rubric. The contract should define how to judge distinctiveness, contract fidelity, visual consistency, heuristic UX quality, and motion discipline without collapsing those into one vague taste score.
+ 25. Genericity findings must name the actual drift signal. Do not say "generic" without tying it to a rubric dimension or explicit anti-pattern.
+ 26. Separate taste from failure. A bold design is valid when it still follows the contract, serves the product, and respects accessibility and runtime constraints.
 
  Required `docs/DESIGN.md` sections:
  1. Design Intent and Product Personality
@@ -69,7 +72,9 @@ Required `docs/design-intent.json` fields:
  - `motionSystem`
  - `componentMorphology`
  - `accessibilityPolicy`
- - `visualQaPolicy`
+ - `designExecutionPolicy`
+ - `designExecutionHandoff`
+ - `reviewRubric`
  - `contextHygiene`
  - `experiencePrinciples`
  - `forbiddenPatterns`
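The field change in this hunk can be checked mechanically. The following is a minimal sketch, assuming `docs/design-intent.json` has already been parsed into a plain object. The helper name `findMissingFields` is hypothetical and not part of the package, and the list covers only the fields visible in this hunk, not the full required set.

```javascript
// Hypothetical helper, not part of the package. The field names are the
// ones visible in this hunk; the complete required list is longer.
const REQUIRED_FIELDS = [
  "motionSystem",
  "componentMorphology",
  "accessibilityPolicy",
  "designExecutionPolicy",
  "designExecutionHandoff",
  "reviewRubric",
  "contextHygiene",
  "experiencePrinciples",
  "forbiddenPatterns",
];

// Return the required top-level fields that are absent from a parsed
// design-intent object. A field set to null or undefined counts as missing.
function findMissingFields(designIntent, requiredFields = REQUIRED_FIELDS) {
  return requiredFields.filter(
    (field) => designIntent == null || designIntent[field] == null
  );
}

// Usage: a contract that still carries the removed `visualQaPolicy` field
// and none of its three replacements.
const staleContract = {
  motionSystem: {},
  componentMorphology: {},
  accessibilityPolicy: {},
  visualQaPolicy: {},
  contextHygiene: {},
  experiencePrinciples: [],
  forbiddenPatterns: [],
};
console.log(findMissingFields(staleContract));
```

A contract written against 3.0.17 would report the three new fields as missing, which is one way to spot documents that need regenerating after the upgrade.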
@@ -85,7 +90,9 @@ Output:
  - `docs/design-intent.json` must include deterministic fields for `colorTruth.format`, `colorTruth.allowHexDerivatives`, and `crossViewportAdaptation.mutationRules.mobile/tablet/desktop`.
  - `docs/design-intent.json` must also include `motionSystem` and `componentMorphology` so future UI work preserves state behavior and purposeful motion without collapsing into generic static output.
  - `docs/design-intent.json` must also include `accessibilityPolicy` so the hard compliance floor, advisory contrast model, and blocking-vs-advisory checks stay machine-readable.
- - `docs/design-intent.json` must also include `visualQaPolicy` so deterministic screenshot expectations, masking rules, viewport coverage, long-page capture strategy, stability thresholds, and semantic-escalation boundaries stay machine-readable.
+ - `docs/design-intent.json` must also include `designExecutionPolicy` so structured handoff rules, representation strategy, semantic review focus, and non-screenshot execution boundaries stay machine-readable.
+ - `docs/design-intent.json` must also include `designExecutionHandoff` so surface plans, component graph relationships, content priority, viewport mutation, interaction states, and signature move rationale are explicit before implementation begins.
+ - `docs/design-intent.json` must also include `reviewRubric` so distinctiveness, genericity drift, taste-vs-failure boundaries, and motion discipline are judged with stable dimensions instead of ad hoc opinion.
  - `docs/design-intent.json` must include `contextHygiene` so valid design sources, tainted carryover sources, and continuity rules are machine-readable.
  - If onboarding or detector evidence exists, preserve it under `repoEvidence.designEvidenceSummary` instead of throwing away the machine-readable snapshot of the current UI system.
  - Token intent must stay structure-first: primitive tokens hold raw values, semantic tokens carry purpose, and component tokens consume semantic tokens instead of bypassing them with raw values.
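The structure-first token rule at the end of this hunk can be illustrated with three small lookup tables. This is only a sketch: every token name and value below is invented for illustration, and the package may represent tokens differently.

```javascript
// Illustrative token tables; every name and value here is hypothetical.
const primitiveTokens = { "blue-600": "#2563eb", "gray-900": "#111827" };
const semanticTokens = {
  "action-primary": "blue-600",
  "text-default": "gray-900",
};
const componentTokens = {
  "button-background": "action-primary",
  "button-label": "#ffffff", // bypasses the semantic layer with a raw value
};

// List component tokens whose value is not the name of a semantic token.
function findLayerViolations(components, semantics) {
  return Object.entries(components)
    .filter(
      ([, value]) => !Object.prototype.hasOwnProperty.call(semantics, value)
    )
    .map(([name]) => name);
}

// Resolve a component token to its raw value: component -> semantic -> primitive.
// Returns undefined when any layer is skipped or missing.
function resolveToken(name) {
  const semanticName = componentTokens[name];
  const primitiveName = semanticTokens[semanticName];
  return primitiveTokens[primitiveName];
}

console.log(findLayerViolations(componentTokens, semanticTokens));
console.log(resolveToken("button-background"));
```

Here `button-label` is reported because it holds a raw hex value instead of pointing at a semantic token, which is the bypass the rule forbids.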
@@ -94,10 +101,12 @@ Output:
  - Motion guidance must preserve creativity: allow meaningful animation, define reduced-motion behavior, and optimize choreography instead of suppressing it by default.
  - Accessibility guidance must split hard compliance from advisory tuning: treat WCAG 2.2 AA as the minimum blocking floor and APCA as advisory perceptual guidance for readability nuance, especially in typography and dark mode.
  - Accessibility scope must include focus visibility, focus appearance, target size, accessible authentication, keyboard access, use-of-color-only failures, and dynamic status/state access.
- - Visual QA guidance must define deterministic-first screenshot review, noise thresholds, dynamic masking categories, mobile/tablet/desktop coverage, long-page capture strategy, and when semantic review is allowed to intervene.
+ - Structured design execution guidance must define the surface plan, component graph, content-priority map, viewport mutation plan, interaction-state matrix, and semantic review focus without relying on screenshot capture.
+ - Structured design execution must include an explicit structured handoff in `docs/design-intent.json`, not just policy booleans. The handoff should be detailed enough that a future agent can implement the UI without falling back to generic layout defaults.
+ - The review rubric must define stable dimensions, genericity signals, valid bold signals, and reporting rules that force the agent to explain why something is generic or valid.
  - Color direction must come from the current project context. Similarity to prior unrelated projects is drift unless the brief or repo evidence explicitly supports it.
  - If no approved reference system exists, synthesize the design from zero using current product context, constraints, and content only.
  - Explicitly record which sources are allowed to shape the visual language and which sources are tainted unless the user opts into continuity.
- - The resulting system should feel authored and recognizable in screenshots, not politely interchangeable with common template kits.
+ - The resulting system should feel authored and recognizable in implementation, not politely interchangeable with common template kits.
  - Use practical, modern, accessible language grounded in the project, not generic SaaS defaults or copycat brand systems.
  - Wait for user approval before generating Figma or code assets.
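The reporting rule added in this file (a genericity finding must name its drift signal and tie it to a rubric dimension or an explicit anti-pattern) can be enforced with a small check. The sketch below assumes a finding is a plain object with `dimension`, `antiPattern`, and `reason` properties. That shape, the camel-case dimension identifiers, and the function name are all assumptions, since the diff names the dimensions only in prose.

```javascript
// Dimension identifiers are hypothetical spellings of the five dimensions
// the prompt lists: distinctiveness, contract fidelity, visual consistency,
// heuristic UX quality, and motion discipline.
const RUBRIC_DIMENSIONS = [
  "distinctiveness",
  "contractFidelity",
  "visualConsistency",
  "heuristicUxQuality",
  "motionDiscipline",
];

// Return a list of problems with a review finding; an empty list means the
// finding is reportable under the rule sketched above.
function validateFinding(finding) {
  const problems = [];
  const hasDimension = RUBRIC_DIMENSIONS.includes(finding.dimension);
  const hasAntiPattern =
    typeof finding.antiPattern === "string" &&
    finding.antiPattern.trim() !== "";
  if (!hasDimension && !hasAntiPattern) {
    problems.push("tie the finding to a rubric dimension or an anti-pattern");
  }
  const reason =
    typeof finding.reason === "string" ? finding.reason.trim() : "";
  if (reason === "" || reason.toLowerCase() === "generic") {
    problems.push("name the concrete drift signal instead of 'generic'");
  }
  return problems;
}

console.log(validateFinding({ reason: "generic" }));
console.log(
  validateFinding({
    dimension: "distinctiveness",
    reason: "Hero section reuses an unmodified three-card feature grid",
  })
);
```

The first call returns two problems and the second returns none, which mirrors the prompt's distinction between a vague taste verdict and a finding that explains itself.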
package/.agent-context/rules/frontend-architecture.md CHANGED
@@ -35,12 +35,12 @@ Mandatory behavior when triggered:
  - Hard accessibility checks must cover more than contrast alone. They must include focus visibility, focus appearance, target size, keyboard access, use-of-color-only failures, accessible authentication, and status or dynamic state access.
  - Keep accessibility compatible with expressive design. Fix the violation without flattening the interface into generic low-risk layouts unless that is the only safe option.
 
- ## Hybrid Visual QA Boundaries (Mandatory)
+ ## Structured Design Execution Boundaries (Mandatory)
 
- - Visual QA must be deterministic-first. Run screenshot or pixel-diff style checks before escalating to any semantic judge.
- - Treat dynamic masking as explicit policy, not ad hoc cleanup. Time, randomized content, live counters, streaming media, and other unstable regions must be masked deliberately and documented.
- - Define stability thresholds so the system can separate tiny rendering noise from meaningful layout or styling drift.
- - Required visual coverage must include mobile, tablet, and desktop unless the product scope explicitly excludes one of those surfaces.
+ - UI review must stay representation-first. The contract should define a surface plan, component graph, content-priority map, viewport mutation plan, interaction-state matrix, and task-flow narrative before implementation drifts into styling guesswork.
+ - Semantic review should judge contract fidelity, distinctiveness, hierarchy, component-state behavior, and cross-viewport mutation directly from the contract plus changed UI code.
+ - Do not make screenshot capture, browser automation, or pixel diff artifacts a baseline dependency for design quality in this repo.
+ - Required coverage must still include mobile, tablet, and desktop behavior unless the product scope explicitly excludes one of those surfaces.
  - Escalate to semantic review only when deterministic evidence shows meaningful drift, missing required viewport coverage, or another contract-critical visual failure.
  - Do not use semantic review to invent aesthetic problems when deterministic evidence says the surface is stable.
 
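The accessibility rules in this file keep WCAG 2.2 AA as the blocking floor and APCA as advisory only. The sketch below shows one way that split could look for text contrast. It uses the standard WCAG relative-luminance and contrast-ratio formulas with the 4.5:1 AA threshold for normal-size text. The APCA result is passed in as a boolean because no APCA math is attempted here, and the function and property names are hypothetical.

```javascript
// Linearize one 8-bit sRGB channel (WCAG 2.x relative luminance definition).
function linearize(channel) {
  const c = channel / 255;
  return c <= 0.03928 ? c / 12.92 : Math.pow((c + 0.055) / 1.055, 2.4);
}

// Relative luminance of an [r, g, b] color with 0-255 channels.
function relativeLuminance([r, g, b]) {
  return 0.2126 * linearize(r) + 0.7152 * linearize(g) + 0.0722 * linearize(b);
}

// WCAG contrast ratio between two colors, always >= 1.
function contrastRatio(foreground, background) {
  const a = relativeLuminance(foreground);
  const b = relativeLuminance(background);
  return (Math.max(a, b) + 0.05) / (Math.min(a, b) + 0.05);
}

// Hypothetical gate: `apcaAdvisoryPass` is supplied by the caller.
// The blocking flag depends on WCAG alone, so an APCA pass can never
// waive a WCAG failure; APCA only contributes an advisory flag.
function evaluateTextContrast({ foreground, background, apcaAdvisoryPass }) {
  const ratio = contrastRatio(foreground, background);
  return {
    ratio,
    blocking: ratio < 4.5, // AA threshold for normal-size text
    advisory: !apcaAdvisoryPass,
  };
}

const result = evaluateTextContrast({
  foreground: [119, 119, 119], // #777777
  background: [255, 255, 255],
  apcaAdvisoryPass: true,
});
console.log(result.blocking, result.advisory);
```

In the usage example the gray text falls just under 4.5:1 on white, so the result is blocking even though the APCA input reports a pass. Contrast is only one of the hard checks listed above; focus, target size, and keyboard access would need their own checks.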
package/.agent-context/state/memory-continuity-benchmark.json CHANGED
@@ -1,5 +1,5 @@
  {
- "generatedAt": "2026-04-22T23:58:57.575Z",
+ "generatedAt": "2026-04-23T10:22:34.642Z",
  "reportName": "memory-continuity-benchmark",
  "schemaVersion": "1.0.0",
  "passed": true,
package/.cursorrules CHANGED
@@ -1,6 +1,6 @@
  # AGENTIC-SENIOR-CORE DYNAMIC GOVERNANCE RULESET
 
- Generated by Agentic-Senior-Core CLI v3.0.17
+ Generated by Agentic-Senior-Core CLI v3.0.19
  Timestamp: 2026-04-22T12:30:18.799Z
  Selected policy file: .agent-context/policies/llm-judge-threshold.json
 
package/.gemini/instructions.md CHANGED
@@ -2,7 +2,7 @@
 
  Adapter Mode: thin
  Adapter Source: .instructions.md
- Canonical Snapshot SHA256: ee64f53171dcf88e10c6f6aae98d75cc541e13880255f0e519fc82736a4af5f4
+ Canonical Snapshot SHA256: 3ddc44d1c3cad20aa06e31c45b5d7289b1b4cde46decb668b0347817222fb022
 
  Canonical policy source: [.instructions.md](../.instructions.md).
 
package/.github/copilot-instructions.md CHANGED
@@ -2,7 +2,7 @@
 
  Adapter Mode: thin
  Adapter Source: .instructions.md
- Canonical Snapshot SHA256: ee64f53171dcf88e10c6f6aae98d75cc541e13880255f0e519fc82736a4af5f4
+ Canonical Snapshot SHA256: 3ddc44d1c3cad20aa06e31c45b5d7289b1b4cde46decb668b0347817222fb022
 
  The canonical policy source for this repository is [.instructions.md](../.instructions.md).
 
package/.instructions.md CHANGED
@@ -136,7 +136,7 @@ Meta-prompts that provide complete workflows for common scenarios:
  For UI-only requests, keep context isolated: load `bootstrap-design.md` and `frontend-architecture.md` first, and do not eagerly load unrelated backend-only rules such as `database-design.md`, `docker-runtime.md`, or `microservices.md` unless the request explicitly crosses those boundaries.
  For design work, treat only the current repo evidence, the current brief, current project docs, and explicitly approved reference systems as valid style context. Do not let prior-chat or unrelated-project visual memory steer the result unless the user explicitly asks for continuity.
  For accessibility in UI work, treat WCAG 2.2 AA as the hard compliance floor and APCA as advisory perceptual tuning only.
- For visual QA in UI work, treat deterministic screenshot or diff evidence as the first layer, then escalate only meaningful visual drift to a semantic judge.
+ For UI review, use structured design execution evidence first: `designExecutionPolicy`, `docs/DESIGN.md`, `repoEvidence.designEvidenceSummary`, and the changed UI diff. Do not require screenshot capture as a baseline dependency.
 
  ---
 
package/.windsurfrules CHANGED
@@ -1,6 +1,6 @@
  # AGENTIC-SENIOR-CORE DYNAMIC GOVERNANCE RULESET
 
- Generated by Agentic-Senior-Core CLI v3.0.17
+ Generated by Agentic-Senior-Core CLI v3.0.19
  Timestamp: 2026-04-22T12:30:18.799Z
  Selected policy file: .agent-context/policies/llm-judge-threshold.json
 
package/AGENTS.md CHANGED
@@ -2,7 +2,7 @@
 
  Adapter Mode: thin
  Adapter Source: .instructions.md
- Canonical Snapshot SHA256: ee64f53171dcf88e10c6f6aae98d75cc541e13880255f0e519fc82736a4af5f4
+ Canonical Snapshot SHA256: 3ddc44d1c3cad20aa06e31c45b5d7289b1b4cde46decb668b0347817222fb022
 
  This file is an adapter entrypoint for agent discovery.
  The canonical policy source is [.instructions.md](.instructions.md).