@trohde/earos 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (135) hide show
  1. package/README.md +156 -0
  2. package/assets/init/.agents/skills/earos-artifact-gen/SKILL.md +106 -0
  3. package/assets/init/.agents/skills/earos-artifact-gen/references/interview-guide.md +313 -0
  4. package/assets/init/.agents/skills/earos-artifact-gen/references/output-guide.md +367 -0
  5. package/assets/init/.agents/skills/earos-assess/SKILL.md +212 -0
  6. package/assets/init/.agents/skills/earos-assess/references/calibration-benchmarks.md +160 -0
  7. package/assets/init/.agents/skills/earos-assess/references/output-templates.md +311 -0
  8. package/assets/init/.agents/skills/earos-assess/references/scoring-protocol.md +281 -0
  9. package/assets/init/.agents/skills/earos-calibrate/SKILL.md +153 -0
  10. package/assets/init/.agents/skills/earos-calibrate/references/agreement-metrics.md +188 -0
  11. package/assets/init/.agents/skills/earos-calibrate/references/calibration-protocol.md +263 -0
  12. package/assets/init/.agents/skills/earos-create/SKILL.md +257 -0
  13. package/assets/init/.agents/skills/earos-create/references/criterion-writing-guide.md +268 -0
  14. package/assets/init/.agents/skills/earos-create/references/dependency-rules.md +193 -0
  15. package/assets/init/.agents/skills/earos-create/references/rubric-interview-guide.md +123 -0
  16. package/assets/init/.agents/skills/earos-create/references/validation-checklist.md +238 -0
  17. package/assets/init/.agents/skills/earos-profile-author/SKILL.md +251 -0
  18. package/assets/init/.agents/skills/earos-profile-author/references/criterion-writing-guide.md +280 -0
  19. package/assets/init/.agents/skills/earos-profile-author/references/design-methods.md +158 -0
  20. package/assets/init/.agents/skills/earos-profile-author/references/profile-checklist.md +173 -0
  21. package/assets/init/.agents/skills/earos-remediate/SKILL.md +118 -0
  22. package/assets/init/.agents/skills/earos-remediate/references/output-template.md +199 -0
  23. package/assets/init/.agents/skills/earos-remediate/references/remediation-patterns.md +330 -0
  24. package/assets/init/.agents/skills/earos-report/SKILL.md +85 -0
  25. package/assets/init/.agents/skills/earos-report/references/portfolio-template.md +181 -0
  26. package/assets/init/.agents/skills/earos-report/references/single-artifact-template.md +168 -0
  27. package/assets/init/.agents/skills/earos-review/SKILL.md +130 -0
  28. package/assets/init/.agents/skills/earos-review/references/challenge-patterns.md +163 -0
  29. package/assets/init/.agents/skills/earos-review/references/output-template.md +180 -0
  30. package/assets/init/.agents/skills/earos-template-fill/SKILL.md +177 -0
  31. package/assets/init/.agents/skills/earos-template-fill/references/evidence-writing-guide.md +186 -0
  32. package/assets/init/.agents/skills/earos-template-fill/references/section-rubric-mapping.md +200 -0
  33. package/assets/init/.agents/skills/earos-validate/SKILL.md +113 -0
  34. package/assets/init/.agents/skills/earos-validate/references/fix-patterns.md +281 -0
  35. package/assets/init/.agents/skills/earos-validate/references/validation-checks.md +287 -0
  36. package/assets/init/.claude/CLAUDE.md +4 -0
  37. package/assets/init/AGENTS.md +293 -0
  38. package/assets/init/CLAUDE.md +635 -0
  39. package/assets/init/README.md +507 -0
  40. package/assets/init/calibration/gold-set/.gitkeep +0 -0
  41. package/assets/init/calibration/results/.gitkeep +0 -0
  42. package/assets/init/core/core-meta-rubric.yaml +643 -0
  43. package/assets/init/docs/consistency-report.md +325 -0
  44. package/assets/init/docs/getting-started.md +194 -0
  45. package/assets/init/docs/profile-authoring-guide.md +51 -0
  46. package/assets/init/docs/terminology.md +126 -0
  47. package/assets/init/earos.manifest.yaml +104 -0
  48. package/assets/init/evaluations/.gitkeep +0 -0
  49. package/assets/init/examples/aws-event-driven-order-processing/artifact.yaml +2056 -0
  50. package/assets/init/examples/aws-event-driven-order-processing/evaluation.yaml +973 -0
  51. package/assets/init/examples/aws-event-driven-order-processing/report.md +244 -0
  52. package/assets/init/examples/example-solution-architecture.evaluation.yaml +136 -0
  53. package/assets/init/examples/multi-cloud-data-analytics/artifact.yaml +715 -0
  54. package/assets/init/overlays/data-governance.yaml +94 -0
  55. package/assets/init/overlays/regulatory.yaml +154 -0
  56. package/assets/init/overlays/security.yaml +92 -0
  57. package/assets/init/profiles/adr.yaml +225 -0
  58. package/assets/init/profiles/capability-map.yaml +223 -0
  59. package/assets/init/profiles/reference-architecture.yaml +426 -0
  60. package/assets/init/profiles/roadmap.yaml +205 -0
  61. package/assets/init/profiles/solution-architecture.yaml +227 -0
  62. package/assets/init/research/architecture-assessment-rubrics-research.docx +0 -0
  63. package/assets/init/research/architecture-assessment-rubrics-research.md +566 -0
  64. package/assets/init/research/reference-architecture-research.md +751 -0
  65. package/assets/init/standard/EAROS.md +1426 -0
  66. package/assets/init/standard/schemas/artifact.schema.json +1295 -0
  67. package/assets/init/standard/schemas/artifact.uischema.json +65 -0
  68. package/assets/init/standard/schemas/evaluation.schema.json +284 -0
  69. package/assets/init/standard/schemas/rubric.schema.json +383 -0
  70. package/assets/init/templates/evaluation-record.template.yaml +58 -0
  71. package/assets/init/templates/new-profile.template.yaml +65 -0
  72. package/bin.js +188 -0
  73. package/dist/assets/_basePickBy-BVu6YmSW.js +1 -0
  74. package/dist/assets/_baseUniq-CWRzQDz_.js +1 -0
  75. package/dist/assets/arc-CyDBhtDM.js +1 -0
  76. package/dist/assets/architectureDiagram-2XIMDMQ5-BH6O4dvN.js +36 -0
  77. package/dist/assets/blockDiagram-WCTKOSBZ-2xmwdjpg.js +132 -0
  78. package/dist/assets/c4Diagram-IC4MRINW-BNmPRFJF.js +10 -0
  79. package/dist/assets/channel-CiySTNoJ.js +1 -0
  80. package/dist/assets/chunk-4BX2VUAB-DGQTvirp.js +1 -0
  81. package/dist/assets/chunk-55IACEB6-DNMAQAC_.js +1 -0
  82. package/dist/assets/chunk-FMBD7UC4-BJbVTQ5o.js +15 -0
  83. package/dist/assets/chunk-JSJVCQXG-BCxUL74A.js +1 -0
  84. package/dist/assets/chunk-KX2RTZJC-H7wWZOfz.js +1 -0
  85. package/dist/assets/chunk-NQ4KR5QH-BK4RlTQF.js +220 -0
  86. package/dist/assets/chunk-QZHKN3VN-0chxDV5g.js +1 -0
  87. package/dist/assets/chunk-WL4C6EOR-DexfQ-AV.js +189 -0
  88. package/dist/assets/classDiagram-VBA2DB6C-D7luWJQn.js +1 -0
  89. package/dist/assets/classDiagram-v2-RAHNMMFH-D7luWJQn.js +1 -0
  90. package/dist/assets/clone-ylgRbd3D.js +1 -0
  91. package/dist/assets/cose-bilkent-S5V4N54A-DS2IOCfZ.js +1 -0
  92. package/dist/assets/cytoscape.esm-CyJtwmzi.js +331 -0
  93. package/dist/assets/dagre-KLK3FWXG-BbSoTTa3.js +4 -0
  94. package/dist/assets/defaultLocale-DX6XiGOO.js +1 -0
  95. package/dist/assets/diagram-E7M64L7V-C9TvYgv0.js +24 -0
  96. package/dist/assets/diagram-IFDJBPK2-DowUMWrg.js +43 -0
  97. package/dist/assets/diagram-P4PSJMXO-BL6nrnQF.js +24 -0
  98. package/dist/assets/erDiagram-INFDFZHY-rXPRl8VM.js +70 -0
  99. package/dist/assets/flowDiagram-PKNHOUZH-DBRM99-W.js +162 -0
  100. package/dist/assets/ganttDiagram-A5KZAMGK-INcWFsBT.js +292 -0
  101. package/dist/assets/gitGraphDiagram-K3NZZRJ6-DMwpfE91.js +65 -0
  102. package/dist/assets/graph-DLQn37b-.js +1 -0
  103. package/dist/assets/index-BFFITMT8.js +650 -0
  104. package/dist/assets/index-H7f6VTz1.css +1 -0
  105. package/dist/assets/infoDiagram-LFFYTUFH-B0f4TWRM.js +2 -0
  106. package/dist/assets/init-Gi6I4Gst.js +1 -0
  107. package/dist/assets/ishikawaDiagram-PHBUUO56-CsU6XimZ.js +70 -0
  108. package/dist/assets/journeyDiagram-4ABVD52K-CQ7ibNib.js +139 -0
  109. package/dist/assets/kanban-definition-K7BYSVSG-DzEN7THt.js +89 -0
  110. package/dist/assets/katex-B1X10hvy.js +261 -0
  111. package/dist/assets/layout-C0dvb42R.js +1 -0
  112. package/dist/assets/linear-j4a8mGj7.js +1 -0
  113. package/dist/assets/mindmap-definition-YRQLILUH-DP8iEuCf.js +68 -0
  114. package/dist/assets/ordinal-Cboi1Yqb.js +1 -0
  115. package/dist/assets/pieDiagram-SKSYHLDU-BpIAXgAm.js +30 -0
  116. package/dist/assets/quadrantDiagram-337W2JSQ-DrpXn5Eg.js +7 -0
  117. package/dist/assets/requirementDiagram-Z7DCOOCP-Bg7EwHlG.js +73 -0
  118. package/dist/assets/sankeyDiagram-WA2Y5GQK-BWagRs1F.js +10 -0
  119. package/dist/assets/sequenceDiagram-2WXFIKYE-q5jwhivG.js +145 -0
  120. package/dist/assets/stateDiagram-RAJIS63D-B_J9pE-2.js +1 -0
  121. package/dist/assets/stateDiagram-v2-FVOUBMTO-Q_1GcybB.js +1 -0
  122. package/dist/assets/timeline-definition-YZTLITO2-dv0jgQ0z.js +61 -0
  123. package/dist/assets/treemap-KZPCXAKY-Dt1dkIE7.js +162 -0
  124. package/dist/assets/vennDiagram-LZ73GAT5-BdO5RgRZ.js +34 -0
  125. package/dist/assets/xychartDiagram-JWTSCODW-CpDVe-8v.js +7 -0
  126. package/dist/index.html +23 -0
  127. package/export-docx.js +1583 -0
  128. package/init.js +353 -0
  129. package/manifest-cli.mjs +207 -0
  130. package/package.json +83 -0
  131. package/schemas/artifact.schema.json +1295 -0
  132. package/schemas/artifact.uischema.json +65 -0
  133. package/schemas/evaluation.schema.json +284 -0
  134. package/schemas/rubric.schema.json +383 -0
  135. package/serve.js +238 -0
@@ -0,0 +1,244 @@
1
+ # EaROS Architecture Assessment Report
2
+
3
+ **Artifact:** Event-Driven Order Processing Platform on AWS
4
+ **Artifact ID:** RA-AWS-ORDER-001 · Version 1.0.0
5
+ **Artifact Type:** Reference Architecture
6
+ **Rubrics Applied:** EAROS-CORE-002 v2.0.0 + EAROS-REFARCH-001 v2.0.0
7
+ **Evaluation Date:** 2026-03-20
8
+ **Evaluators:** EaROS Evaluator Agent + EaROS Challenger Agent (claude-sonnet-4-6)
9
+ **Evaluation Mode:** Agent (DAG, all 8 steps completed)
10
+
11
+ ---
12
+
13
+ ## Executive Summary
14
+
15
+ | | |
16
+ |---|---|
17
+ | **Overall Status** | **PASS** |
18
+ | **Overall Score** | **3.73 / 4.0** |
19
+ | **Pass Threshold** | 3.2 / 4.0 |
20
+ | **Gate Failures** | None |
21
+ | **Criteria Scored** | 19 of 19 (10 core + 9 profile) |
22
+ | **N/A Criteria** | 0 |
23
+ | **Confidence** | High |
24
+
25
+ This reference architecture is a **gold-standard implementation** that passes EaROS assessment with a score of 3.73/4.0 — well above the 3.2 pass threshold. It is recommended as the **calibration benchmark** for all future EaROS reference architecture assessments. No gate criteria failed. No dimension scored below 2.0.
26
+
27
+ ---
28
+
29
+ ## Traffic-Light Dashboard
30
+
31
+ | Criterion | Name | Score | Status |
32
+ |-----------|------|-------|--------|
33
+ | STK-01 | Stakeholder and purpose fit | 4 / 4 | 🟢 |
34
+ | STK-02 | Concern-to-view mapping | 4 / 4 | 🟢 |
35
+ | SCP-01 | Scope and boundary clarity ⚠️ CRITICAL GATE | 4 / 4 | 🟢 |
36
+ | CVP-01 | Viewpoint appropriateness | 4 / 4 | 🟢 |
37
+ | TRC-01 | Traceability to drivers ⚠️ MAJOR GATE | 4 / 4 | 🟢 |
38
+ | CON-01 | Internal consistency | 4 / 4 | 🟢 |
39
+ | RAT-01 | Risks, assumptions, constraints, trade-offs | 4 / 4 | 🟢 |
40
+ | CMP-01 | Standards and policy compliance ⚠️ CRITICAL GATE | 4 / 4 | 🟢 |
41
+ | ACT-01 | Actionability | 4 / 4 | 🟢 |
42
+ | MNT-01 | Maintainability and stewardship | 4 / 4 | 🟢 |
43
+ | RA-VIEW-01 | Architecture views completeness ⚠️ MAJOR GATE | 4 / 4 | 🟢 |
44
+ | RA-VIEW-02 | Machine-readable diagrams | 3 / 4 | 🟡 |
45
+ | RA-DEC-01 | Key decisions documented ⚠️ MAJOR GATE | 4 / 4 | 🟢 |
46
+ | RA-DEC-02 | Prescriptiveness classification | 4 / 4 | 🟢 |
47
+ | RA-OPS-01 | Operational readiness ⚠️ MAJOR GATE | 4 / 4 | 🟢 |
48
+ | RA-IMP-01 | Implementation artifacts | 4 / 4 | 🟢 |
49
+ | RA-IMP-02 | Getting-started guide | 3 / 4 | 🟡 |
50
+ | RA-QA-01 | Quality attributes ⚠️ MAJOR GATE | 4 / 4 | 🟢 |
51
+ | RA-REU-01 | Reusability and evolution | 4 / 4 | 🟢 |
52
+
53
+ **Legend:** 🟢 Score ≥ 3 · 🟡 Score 2–2.9 · 🟠 Score 1–1.9 · 🔴 Score 0 or gate failure
54
+
55
+ ---
56
+
57
+ ## Dimension Summary
58
+
59
+ | Dimension | Name | Weight | Score | Weighted |
60
+ |-----------|------|--------|-------|---------|
61
+ | D1 | Stakeholder and purpose fit | 1.0 | 4.0 | 4.0 |
62
+ | D2 | Scope and boundary clarity | 1.0 | 4.0 | 4.0 |
63
+ | D3 | Concern coverage and viewpoint appropriateness | 1.0 | 4.0 | 4.0 |
64
+ | D4 | Traceability to drivers, requirements, and principles | 1.0 | 4.0 | 4.0 |
65
+ | D5 | Internal consistency and integrity | 1.0 | 4.0 | 4.0 |
66
+ | D6 | Risks, assumptions, constraints, and trade-offs | 1.0 | 4.0 | 4.0 |
67
+ | D7 | Standards and policy compliance | 1.0 | 4.0 | 4.0 |
68
+ | D8 | Actionability and implementation relevance | 1.0 | 4.0 | 4.0 |
69
+ | D9 | Artifact maintainability and stewardship | 1.0 | 4.0 | 4.0 |
70
+ | RA-D1 | Architecture views and completeness | 1.2 | 3.5 | 4.2 |
71
+ | RA-D2 | Prescriptiveness and decision guidance | 1.0 | 4.0 | 4.0 |
72
+ | RA-D3 | Operational readiness | 1.0 | 4.0 | 4.0 |
73
+ | RA-D4 | Implementation actionability | 1.2 | 3.5 | 4.2 |
74
+ | RA-D5 | Quality attribute specification | 1.0 | 4.0 | 4.0 |
75
+ | RA-D6 | Reusability and evolution | 0.8 | 4.0 | 3.2 |
76
+
77
+ **Overall weighted score: 3.73 / 4.0**
78
+ Pass threshold: 3.2 — **exceeded by 0.53 points**
79
+
80
+ ---
81
+
82
+ ## Gate Check
83
+
84
+ | Gate | Criterion | Severity | Score | Result |
85
+ |------|-----------|----------|-------|--------|
86
+ | Scope reviewability | SCP-01 | CRITICAL | 4 | ✅ PASS |
87
+ | Mandatory compliance | CMP-01 | CRITICAL | 4 | ✅ PASS |
88
+ | Stakeholder fit | STK-01 | MAJOR | 4 | ✅ PASS |
89
+ | Traceability | TRC-01 | MAJOR | 4 | ✅ PASS |
90
+ | Architecture views | RA-VIEW-01 | MAJOR | 4 | ✅ PASS |
91
+ | Decision documentation | RA-DEC-01 | MAJOR | 4 | ✅ PASS |
92
+ | Operational readiness | RA-OPS-01 | MAJOR | 4 | ✅ PASS |
93
+ | Quality attributes | RA-QA-01 | MAJOR | 4 | ✅ PASS |
94
+
95
+ **All gates: PASS. No gate failures.**
96
+
97
+ ---
98
+
99
+ ## Three Evaluation Types
100
+
101
+ ### 1. Artifact Quality — 4.0 / 4.0 (Exceptional)
102
+
103
+ The artifact is complete, coherent, internally consistent, and fit for its stated purpose as both an Architecture Board submission and a golden-path reference for development teams.
104
+
105
+ **Strengths:**
106
+ - Seven named stakeholders with individual concern statements and a 15-entry reading guide — navigable by any audience
107
+ - 12 explicit in-scope items and 8 explicit out-of-scope items with rationale — no ambiguity about scope
108
+ - 10-entry element catalog establishes canonical component names; 24-term glossary normalises terminology; all five views use consistent names
109
+ - All core rubric criteria score 4/4 — this is the highest artifact quality achievable
110
+
111
+ **Notable evidence:** The reading guide section_map explicitly maps every section to an audience and concern, satisfying STK-02 at score 4. The scope assumptions each carry a consequence_if_violated field — a level of rigour rarely seen in reference architecture documents.
112
+
113
+ ---
114
+
115
+ ### 2. Architectural Fitness — 3.73 / 4.0 (Very Strong)
116
+
117
+ The event-driven architecture on AWS Lambda, EventBridge, DynamoDB, and SQS is well-suited to the stated requirements. The architecture is production-proven on AWS and the technology choices are well-calibrated to the team size, load profile, and compliance requirements.
118
+
119
+ **Strengths:**
120
+ - Five full ADRs with 2–3 alternatives, explicit trade-off statements (what was sacrificed, what was gained), consequences for adopting teams, and specific revisit conditions
121
+ - Bidirectional traceability: all 5 drivers link to architecture responses and ADR IDs; all ADRs carry driver_refs arrays
122
+ - Eight quality attributes with measurable numeric targets, validation strategies, CI/CD fitness functions (Gatling fails the build), and TOGAF-format quality scenarios
123
+ - Complete operational model: 5 SLOs with error budgets, 11 key metrics, CDK-provisioned P1/P2/P3 alerting, 4 scaling policies, 8-step DR runbook
124
+
125
+ **Two minor gaps (both on the roadmap for v1.1.0):**
126
+ - **RA-VIEW-02 (score 3):** Five Mermaid diagrams in separate source files rather than a unified Structurizr DSL model. A Structurizr workspace would make all views model-generated from a single source of truth.
127
+ - **RA-IMP-02 (score 3):** Getting-started time of 4–8 hours for engineers new to CDK. Score 4 requires < 1-hour automated onboarding. A Backstage Software Template is planned for v1.1.0.
128
+
129
+ ---
130
+
131
+ ### 3. Governance Fit — 4.0 / 4.0 (Excellent)
132
+
133
+ PCI-DSS Level 1 compliance is mapped through 14 compliance entries covering all 9 material requirement areas. GDPR and enterprise security standards are addressed. Zero exceptions — full compliance for all stated applicable standards.
134
+
135
+ **Strengths:**
136
+ - 14 compliance mapping entries: each names a specific control, the specific design element satisfying it, evidence location, and owner — compliance by evidence, not by assertion
137
+ - Governance outcome explicit (approved), decision statement includes effective date (Q2 2026), 5 next actions are owned and dated
138
+ - No exceptions register — architecture achieves full compliance for PCI-DSS, GDPR, and enterprise standards without requiring exception approvals
139
+
140
+ **One improvement opportunity:**
141
+ - ISO-27001 is listed as an applicable standard but only the most material controls are mapped. An ISO-27001 Annex A mapping appendix in v1.1 would strengthen completeness.
142
+
143
+ ---
144
+
145
+ ## Key Findings
146
+
147
+ ### What this artifact does exceptionally well
148
+
149
+ **1. Decision documentation** — The 5 ADRs in this artifact represent the highest standard of architectural decision documentation. Each documents context, 2–3 alternatives with explicit pros/cons, chosen option, rationale referencing specific drivers and principles, trade-offs accepted (not just acknowledged), consequences for adopting teams, and specific revisit conditions. This is the pattern all EaROS reference architectures should follow.
150
+
151
+ **2. Operational completeness** — Most reference architectures address build-time concerns and leave operational readiness as future work. This artifact provides: 5 SLOs with error budgets, 11 named key metrics, CDK-provisioned dashboards and alerting (P1/P2/P3 tiers), 4 scaling policies with numeric thresholds, and an 8-step DR runbook with tested RTO/RPO targets. The operational model is provisioned by CDK alongside the service — not added after.
152
+
153
+ **3. Compliance by evidence** — The 14 compliance mapping entries demonstrate how to satisfy CMP-01 at score 4. Rather than asserting compliance, each entry names the specific design element (e.g. "Each Lambda function has a least-privilege IAM role") and the evidence location. Auditors can verify each control without interpretation.
154
+
155
+ **4. Bidirectional traceability** — All 5 business drivers have architecture_response fields linking to named components and ADR IDs. All 5 ADRs carry driver_refs arrays linking back to the motivating drivers. This bidirectional traceability is rare and highly valued by governance reviewers.
156
+
157
+ **5. Prescriptiveness clarity** — The component_classification section gives development teams clear guidance: 9 mandatory components (no substitution without exception), 2 recommended, 1 optional. Three formal extension points define permitted variations with explicit constraints. Teams know exactly what they must, should, and may do.
158
+
159
+ ---
160
+
161
+ ## Recommended Actions
162
+
163
+ The following actions are recommended to close the two score-3 gaps. Both are already in the v1.1.0 evolution roadmap.
164
+
165
+ ### Priority 1 — Migrate to Structurizr DSL (RA-VIEW-02: 3 → 4)
166
+
167
+ Consolidate the five independent Mermaid diagram files into a Structurizr DSL workspace where all views are generated from a single architecture model. This eliminates the risk of diagram drift (a common cause of CON-01 regression) and achieves the highest standard for diagram-as-code architecture documentation.
168
+
169
+ **Impact:** RA-VIEW-02 increases from 3 to 4. RA-D1 weighted score increases from 4.2 to 4.8. Overall score increases from 3.73 to approximately 3.80.
170
+ **Effort:** Medium (3–5 days for a Structurizr workspace covering all five views).
171
+ **Owner:** Enterprise Architecture.
172
+ **Target:** v1.1.0 (2026-09-20).
173
+
174
+ ### Priority 2 — Implement Backstage Software Template (RA-IMP-02: 3 → 4)
175
+
176
+ Create a Backstage Software Template that provisions a complete new order service project in the platform developer portal. Target: < 1 hour from template instantiation to first successful deployment in sandbox.
177
+
178
+ **Impact:** RA-IMP-02 increases from 3 to 4. RA-D4 weighted score increases from 4.2 to 4.8. Overall score increases to approximately 3.87.
179
+ **Effort:** Medium (2–3 days for template creation + Backstage integration).
180
+ **Owner:** Order Platform Engineering.
181
+ **Target:** v1.1.0 (2026-09-20).
182
+
183
+ ### Priority 3 — ISO-27001 Annex A mapping appendix (CMP-01 enhancement)
184
+
185
+ Add an ISO-27001 Annex A control mapping appendix in v1.1. This does not change the CMP-01 score (already 4) but strengthens the compliance documentation for audit purposes and reduces QSA preparation effort.
186
+
187
+ **Impact:** No score change; audit readiness improvement.
188
+ **Effort:** Low (1–2 days with Security Architecture input).
189
+ **Owner:** Information Security.
190
+ **Target:** v1.1.0 (2026-09-20).
191
+
192
+ ### Priority 4 — Mermaid node ID mapping (CON-01 cosmetic)
193
+
194
+ Add a comment block at the top of each Mermaid diagram_source clarifying the abbreviated node ID to full component name mapping (e.g. `# OrderSvc = Order Service Lambda`). Eliminates the minor discrepancy noted in the challenger review.
195
+
196
+ **Impact:** No score change; clarity improvement.
197
+ **Effort:** Trivial (< 1 hour).
198
+ **Owner:** Enterprise Architecture.
199
+ **Target:** v1.0.1 patch.
200
+
201
+ ---
202
+
203
+ ## Projected Score After Recommended Actions
204
+
205
+ | Action | RA-VIEW-02 | RA-IMP-02 | Overall Score |
206
+ |--------|-----------|-----------|---------------|
207
+ | Current (v1.0.0) | 3 | 3 | **3.73** |
208
+ | After Priority 1 (Structurizr) | 4 | 3 | ~3.80 |
209
+ | After Priority 2 (Backstage) | 4 | 4 | ~3.87 |
210
+ | After Priority 3 (ISO-27001) | 4 | 4 | ~3.87 (no score Δ) |
211
+
212
+ At v1.1.0 with all priority actions complete, the projected overall score is **3.87 / 4.0** — one of the highest scores achievable for a real-world reference architecture.
213
+
214
+ ---
215
+
216
+ ## Calibration Note
217
+
218
+ This artifact and its evaluation record are designated as the **gold-standard calibration benchmark** for EaROS reference architecture assessments. Future evaluators should:
219
+
220
+ 1. Score this artifact independently against EAROS-CORE-002 + EAROS-REFARCH-001 before conducting production evaluations
221
+ 2. Target agreement with the scores in `evaluation.yaml` within ±1 point on each criterion
222
+ 3. Compute Cohen's κ against these reference scores; target κ > 0.70
223
+ 4. Resolve disagreements against the level descriptors in the rubric, not against this report
224
+
225
+ The two score-3 criteria (RA-VIEW-02, RA-IMP-02) are calibration-relevant: they demonstrate that a strong artifact can have principled gaps that are recognised and scored correctly, rather than being inflated to 4 across the board. Evaluators who score all 19 criteria as 4 on this artifact should review their calibration — two criteria are intentionally evidence-constrained at score 3.
226
+
227
+ ---
228
+
229
+ ## Evaluation Metadata
230
+
231
+ | Field | Value |
232
+ |-------|-------|
233
+ | Evaluation ID | EVAL-RA-AWS-001 |
234
+ | DAG steps completed | All 8 (including challenge_pass and calibration) |
235
+ | Rubric lock version | EAROS-CORE-002 v2.0.0 + EAROS-REFARCH-001 v2.0.0 |
236
+ | Calibration applied | Yes |
237
+ | Challenger verdict | No score changes; 4 challenges reviewed, all upheld |
238
+ | Evidence class breakdown | 19/19 criteria: observed (direct quotes) |
239
+ | Evidence sufficiency | 17/19 sufficient; 2/19 sufficient with noted gaps |
240
+
241
+ ---
242
+
243
+ *Generated by EaROS evaluator agent (claude-sonnet-4-6) · 2026-03-20*
244
+ *Conforms to evaluation.schema.json · Rubric version EAROS-CORE-002 v2.0.0*
@@ -0,0 +1,136 @@
1
+ kind: evaluation
2
+ evaluation_id: EVAL-SOL-0001
3
+ rubric_id: EAROS-SOL-001
4
+ rubric_version: 1.0.0
5
+ artifact_id: SOL-ART-042
6
+ artifact_type: solution_architecture
7
+ artifact_ref:
8
+ id: SOL-ART-042
9
+ title: Payments API Modernization Solution Architecture
10
+ artifact_type: solution_architecture
11
+ owner: Payments Domain Architecture
12
+ uri: repo://architecture/payments/solution-architecture.md
13
+ evaluation_date: '2026-03-16'
14
+ evaluation_mode: hybrid
15
+ evaluated_by:
16
+ - role: evaluator
17
+ actor: human
18
+ identity: EA reviewer
19
+ - role: evaluator
20
+ actor: agent
21
+ identity: EaROS evaluator
22
+ - role: challenger
23
+ actor: agent
24
+ identity: EaROS challenger
25
+ overall_status: conditional_pass
26
+ overall_score: 3.0
27
+ gate_failures: []
28
+ criterion_results:
29
+ - criterion_id: STK-01
30
+ score: 3
31
+ judgment_type: observed
32
+ confidence: high
33
+ evidence_sufficiency: sufficient
34
+ evidence_refs:
35
+ - location: Section 1.0 Audience and Purpose
36
+ excerpt: Architecture board, engineering lead, operations, and security are named.
37
+ rationale: Stakeholders and decision purpose are explicit, though not all stakeholder
38
+ concerns are mapped in detail.
39
+ missing_information:
40
+ - Concern-to-view matrix
41
+ recommended_actions:
42
+ - Add stakeholder-concern-view table
43
+ - criterion_id: SCP-01
44
+ score: 3
45
+ judgment_type: observed
46
+ confidence: high
47
+ evidence_sufficiency: sufficient
48
+ evidence_refs:
49
+ - location: Section 1.2 Scope
50
+ excerpt: Customer onboarding and payment initiation are in scope; ledger modernization
51
+ is out of scope.
52
+ rationale: Scope is clear enough for review.
53
+ missing_information: []
54
+ recommended_actions: []
55
+ - criterion_id: TRC-01
56
+ score: 2
57
+ judgment_type: mixed
58
+ confidence: medium
59
+ evidence_sufficiency: partial
60
+ evidence_refs:
61
+ - location: Section 2.1 Drivers
62
+ excerpt: Availability and time-to-market are listed as drivers.
63
+ rationale: Drivers are listed but traceability from drivers to architectural decisions
64
+ is weak.
65
+ missing_information:
66
+ - Decision-to-requirement traceability
67
+ recommended_actions:
68
+ - Add a design decision traceability table
69
+ - criterion_id: SOL-01
70
+ score: 3
71
+ judgment_type: observed
72
+ confidence: medium
73
+ evidence_sufficiency: sufficient
74
+ evidence_refs:
75
+ - location: Appendix A Option Comparison
76
+ excerpt: Managed gateway, custom gateway, and phased hybrid were compared.
77
+ rationale: The optioning is meaningful and decision-relevant.
78
+ missing_information:
79
+ - Explicit cost sensitivity assumptions
80
+ recommended_actions:
81
+ - Document cost assumptions
82
+ - criterion_id: SOL-02
83
+ score: 2
84
+ judgment_type: mixed
85
+ confidence: medium
86
+ evidence_sufficiency: partial
87
+ evidence_refs:
88
+ - location: Section 3.4 NFRs
89
+ excerpt: Availability and performance targets are stated.
90
+ rationale: NFRs are stated, but resilience and failover mechanisms are not described
91
+ with enough depth.
92
+ missing_information:
93
+ - Quality attribute scenarios
94
+ - Mapping from resilience goals to mechanisms
95
+ recommended_actions:
96
+ - Add quality attribute scenario table
97
+ - Explain failover and degradation behavior
98
+ dimension_results:
99
+ - dimension_id: D1
100
+ score: 3.0
101
+ weighted_score: 3.0
102
+ summary: Purpose and audience are clear, but concern mapping needs improvement.
103
+ - dimension_id: D2
104
+ score: 3.0
105
+ weighted_score: 3.0
106
+ summary: Scope is reviewable and bounded.
107
+ - dimension_id: D4
108
+ score: 2.0
109
+ weighted_score: 2.0
110
+ summary: Traceability is the main weakness.
111
+ - dimension_id: SD1
112
+ score: 3.0
113
+ weighted_score: 3.0
114
+ summary: Alternatives and rationale are present.
115
+ - dimension_id: SD2
116
+ score: 2.0
117
+ weighted_score: 2.0
118
+ summary: NFR treatment is insufficiently concrete for a strong pass.
119
+ summary:
120
+ strengths:
121
+ - Clear review purpose and bounded scope
122
+ - Meaningful option comparison
123
+ - Artifact is actionable enough for controlled progression
124
+ weaknesses:
125
+ - Weak traceability from drivers to decisions
126
+ - Insufficiently concrete quality attribute response
127
+ risks:
128
+ - Operational resilience may be under-designed
129
+ - Governance approval may be delayed without stronger NFR evidence
130
+ next_actions:
131
+ - Add traceability matrix
132
+ - Add quality attribute scenario and mechanism mapping
133
+ - Re-review after revision
134
+ decision_narrative: The solution architecture is reviewable and directionally sound,
135
+ but it does not yet provide strong enough evidence on traceability and quality
136
+ attribute treatment for an unconditional pass.