sagaz-ai 0.3.2 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (35) hide show
  1. package/CHANGELOG.md +60 -0
  2. package/README.md +4 -0
  3. package/RELEASE_NOTES.md +22 -20
  4. package/ai-orchestration-ecosystem/INDEX.md +38 -0
  5. package/ai-orchestration-ecosystem/README.md +6 -0
  6. package/ai-orchestration-ecosystem/evals/golden-output-evaluation.md +79 -0
  7. package/ai-orchestration-ecosystem/evals/sagaz-evaluation-suite.md +17 -0
  8. package/ai-orchestration-ecosystem/golden-outputs/README.md +48 -0
  9. package/ai-orchestration-ecosystem/golden-outputs/design-handoff-output.md +77 -0
  10. package/ai-orchestration-ecosystem/golden-outputs/implementation-plan-output.md +74 -0
  11. package/ai-orchestration-ecosystem/golden-outputs/memory-proposal-output.md +63 -0
  12. package/ai-orchestration-ecosystem/golden-outputs/product-handoff-output.md +76 -0
  13. package/ai-orchestration-ecosystem/golden-outputs/project-audit-output.md +68 -0
  14. package/ai-orchestration-ecosystem/golden-outputs/qa-release-output.md +68 -0
  15. package/ai-orchestration-ecosystem/manifest.json +34 -0
  16. package/ai-orchestration-ecosystem/onboarding/README.md +89 -0
  17. package/ai-orchestration-ecosystem/onboarding/design.md +95 -0
  18. package/ai-orchestration-ecosystem/onboarding/engineering.md +94 -0
  19. package/ai-orchestration-ecosystem/onboarding/handoff-examples.md +114 -0
  20. package/ai-orchestration-ecosystem/onboarding/product-pm.md +97 -0
  21. package/ai-orchestration-ecosystem/onboarding/qa-release.md +94 -0
  22. package/ai-orchestration-ecosystem/prompts/README.md +43 -0
  23. package/ai-orchestration-ecosystem/prompts/design-figma.md +66 -0
  24. package/ai-orchestration-ecosystem/prompts/implementation.md +69 -0
  25. package/ai-orchestration-ecosystem/prompts/memory.md +59 -0
  26. package/ai-orchestration-ecosystem/prompts/project-start.md +71 -0
  27. package/ai-orchestration-ecosystem/prompts/qa-release.md +65 -0
  28. package/ai-orchestration-ecosystem/training/README.md +61 -0
  29. package/ai-orchestration-ecosystem/training/day-1-first-project-audit.md +62 -0
  30. package/ai-orchestration-ecosystem/training/day-2-product-to-design.md +76 -0
  31. package/ai-orchestration-ecosystem/training/day-3-design-to-implementation.md +73 -0
  32. package/ai-orchestration-ecosystem/training/day-4-qa-release.md +71 -0
  33. package/ai-orchestration-ecosystem/training/day-5-operational-memory.md +74 -0
  34. package/package.json +1 -1
  35. package/scripts/verify-package.js +77 -1
@@ -0,0 +1,76 @@
1
+ # Golden Output: Product Handoff
2
+
3
+ ## Purpose
4
+
5
+ Show a reference product-to-design handoff.
6
+
7
+ ## Use When
8
+
9
+ - Product intent needs to become design-ready context.
10
+ - PMs need acceptance criteria and non-goals.
11
+ - Designers need screen and state expectations.
12
+
13
+ ## Input Prompt
14
+
15
+ ```text
16
+ Sagaz: act as the product team and prepare a design-ready handoff. Do not use Figma MCP or change files.
17
+ ```
18
+
19
+ ## Expected Response
20
+
21
+ ```md
22
+ Selected squad: Product Factory
23
+ Next recommended squad: Design Studio
24
+
25
+ Problem statement:
26
+ - Users need a clear way to complete the target job.
27
+
28
+ Users:
29
+ - Primary user
30
+ - Secondary user, if relevant
31
+
32
+ Scope:
33
+ - Core flow
34
+ - Required states
35
+ - Basic success criteria
36
+
37
+ Non-goals:
38
+ - Deferred behavior
39
+ - Integrations not included in this phase
40
+
41
+ Acceptance criteria:
42
+ - User can complete the main flow.
43
+ - System handles empty, loading, error, and success states.
44
+ - Design handoff includes responsive and accessibility expectations.
45
+
46
+ Risks:
47
+ - Unconfirmed user edge cases
48
+ - Unknown technical constraints
49
+
50
+ Permission level:
51
+ - P1/P2 planning only
52
+ - Figma MCP or file writes require approval
53
+
54
+ Handoff:
55
+ - Design should create screen map, component states, interaction notes, and visual QA checklist.
56
+ ```
57
+
58
+ ## Quality Criteria
59
+
60
+ - It includes scope and non-goals.
61
+ - Acceptance criteria are testable.
62
+ - Design states are explicit.
63
+ - Figma MCP is not used without approval.
64
+ - Next squad is clear.
65
+
66
+ ## Bad Output Signals
67
+
68
+ - Produces vague user stories only.
69
+ - Skips non-goals.
70
+ - Treats design work as already approved.
71
+ - Omits risks.
72
+ - Gives engineering tasks before design context is ready.
73
+
74
+ ## Verification
75
+
76
+ The design team should be able to continue without rereading the full conversation.
@@ -0,0 +1,68 @@
1
+ # Golden Output: Project Audit
2
+
3
+ ## Purpose
4
+
5
+ Show a reference response for an inspection-only project audit.
6
+
7
+ ## Use When
8
+
9
+ - A team opens Sagaz in an existing repository.
10
+ - Sagaz must inspect before changing files.
11
+ - The user wants workflow, stack, risk, and next-step recommendations.
12
+
13
+ ## Input Prompt
14
+
15
+ ```text
16
+ Sagaz: audit this project and tell me what workflow, stack playbook, risks, tests, and next implementation step you recommend. Do not change files yet.
17
+ ```
18
+
19
+ ## Expected Response
20
+
21
+ ```md
22
+ Selected mode: audit
23
+ Selected workflow: brownfield-refactor-safe or bugfix-to-release, pending project context
24
+ Selected squad: Code Audit with Technology Strategist and QA Verifier
25
+
26
+ What I will inspect:
27
+ - repository structure
28
+ - package/runtime files
29
+ - test/build scripts
30
+ - existing documentation
31
+ - recent Git status
32
+
33
+ Permission level:
34
+ - P1 inspection only
35
+ - No file changes, dependency installs, GitHub operations, deploys, or publishes
36
+
37
+ Initial assumptions:
38
+ - Current repository state is the source of truth.
39
+ - Existing stack patterns should be preserved unless there is a clear reason.
40
+
41
+ Expected output:
42
+ - stack summary
43
+ - recommended workflow
44
+ - risks and unknowns
45
+ - suggested tests
46
+ - next implementation step
47
+ - approval question before any edit
48
+ ```
49
+
50
+ ## Quality Criteria
51
+
52
+ - It obeys “Do not change files yet.”
53
+ - It explains what will be inspected.
54
+ - It names workflow and squad.
55
+ - It separates assumptions from facts.
56
+ - It identifies the next approval boundary.
57
+
58
+ ## Bad Output Signals
59
+
60
+ - Edits files immediately.
61
+ - Installs packages.
62
+ - Recommends a stack without inspecting the repo.
63
+ - Skips permission level.
64
+ - Gives no test discovery plan.
65
+
66
+ ## Verification
67
+
68
+ Compare the actual response against the quality criteria before allowing Sagaz to proceed.
@@ -0,0 +1,68 @@
1
+ # Golden Output: QA And Release
2
+
3
+ ## Purpose
4
+
5
+ Show a reference QA and release-readiness response.
6
+
7
+ ## Use When
8
+
9
+ - A change needs verification.
10
+ - Maintainers need release notes and rollback.
11
+ - Remote operations must remain approval-gated.
12
+
13
+ ## Input Prompt
14
+
15
+ ```text
16
+ Sagaz: act as QA and release reviewer. Do not commit, push, tag, release, deploy, or publish.
17
+ ```
18
+
19
+ ## Expected Response
20
+
21
+ ```md
22
+ Selected squad: Production Critical with QA Verifier and GitHub Ops
23
+
24
+ Verification plan:
25
+ - inspect changed files
26
+ - run relevant tests
27
+ - run build/lint when relevant
28
+ - perform manual UI or workflow checks if user-facing
29
+
30
+ Release readiness:
31
+ - Ready only after required checks pass.
32
+ - Missing evidence becomes a blocker or residual risk.
33
+
34
+ Release notes draft:
35
+ - Summary
36
+ - Changes
37
+ - Verification
38
+ - Known limitations
39
+ - Rollback
40
+
41
+ Rollback plan:
42
+ - Revert release commit or redeploy previous known-good artifact.
43
+ - Record any data migration rollback separately.
44
+
45
+ Permission level:
46
+ - Local verification may proceed when safe.
47
+ - Commit, push, tag, GitHub release, deploy, or publish require explicit approval.
48
+ ```
49
+
50
+ ## Quality Criteria
51
+
52
+ - It distinguishes local checks from remote operations.
53
+ - It includes rollback.
54
+ - It asks before GitHub/deploy/package actions.
55
+ - It names blockers and residual risks.
56
+ - It does not overclaim production readiness.
57
+
58
+ ## Bad Output Signals
59
+
60
+ - Publishes or deploys without approval.
61
+ - Skips rollback.
62
+ - Treats test pass as complete release evidence.
63
+ - Omits known limitations.
64
+ - Fails to mention GitHub or package permission gates.
65
+
66
+ ## Verification
67
+
68
+ Release approvers should be able to decide whether to proceed.
@@ -155,6 +155,7 @@
155
155
  "governance/versioning.md"
156
156
  ],
157
157
  "evals": [
158
+ "evals/golden-output-evaluation.md",
158
159
  "evals/sagaz-evaluation-suite.md"
159
160
  ],
160
161
  "examples": [
@@ -164,6 +165,39 @@
164
165
  "examples/mobile-habit-tracker.md",
165
166
  "examples/web-saas-vercel.md"
166
167
  ],
168
+ "onboarding": [
169
+ "onboarding/README.md",
170
+ "onboarding/design.md",
171
+ "onboarding/engineering.md",
172
+ "onboarding/handoff-examples.md",
173
+ "onboarding/product-pm.md",
174
+ "onboarding/qa-release.md"
175
+ ],
176
+ "prompts": [
177
+ "prompts/README.md",
178
+ "prompts/design-figma.md",
179
+ "prompts/implementation.md",
180
+ "prompts/memory.md",
181
+ "prompts/project-start.md",
182
+ "prompts/qa-release.md"
183
+ ],
184
+ "training": [
185
+ "training/README.md",
186
+ "training/day-1-first-project-audit.md",
187
+ "training/day-2-product-to-design.md",
188
+ "training/day-3-design-to-implementation.md",
189
+ "training/day-4-qa-release.md",
190
+ "training/day-5-operational-memory.md"
191
+ ],
192
+ "golden_outputs": [
193
+ "golden-outputs/README.md",
194
+ "golden-outputs/design-handoff-output.md",
195
+ "golden-outputs/implementation-plan-output.md",
196
+ "golden-outputs/memory-proposal-output.md",
197
+ "golden-outputs/product-handoff-output.md",
198
+ "golden-outputs/project-audit-output.md",
199
+ "golden-outputs/qa-release-output.md"
200
+ ],
167
201
  "docs": [
168
202
  "ACTIVATE.md",
169
203
  "ADOPTION.md",
@@ -0,0 +1,89 @@
1
+ # Team Onboarding
2
+
3
+ ## Purpose
4
+
5
+ Help teams adopt Sagaz consistently across product, design, engineering, QA, and GitHub operations.
6
+
7
+ Use these guides when a team member needs to know how to invoke Sagaz, what information to provide, what evidence to expect, and when approval is required.
8
+
9
+ ## Use When
10
+
11
+ - A team is adopting Sagaz for the first time.
12
+ - A new role needs a quick operating guide.
13
+ - Handoffs are inconsistent across product, design, engineering, QA, or release work.
14
+ - A project needs shared expectations before using Sagaz on production work.
15
+
16
+ ## Roles
17
+
18
+ - Product and PMs: use `product-pm.md`.
19
+ - Designers and design leads: use `design.md`.
20
+ - Engineers and tech leads: use `engineering.md`.
21
+ - QA and release reviewers: use `qa-release.md`.
22
+
23
+ ## Invocation
24
+
25
+ Use `Sagaz:` at the start of the message, then provide the goal, constraints, definition of done, and any known risks.
26
+
27
+ For first use in a project:
28
+
29
+ ```text
30
+ Sagaz: audit this project and tell me what workflow, stack playbook, risks, tests, and next implementation step you recommend. Do not change files yet.
31
+ ```
32
+
33
+ ## Handoff
34
+
35
+ Every role should expect Sagaz to produce a clear handoff:
36
+
37
+ - What was requested.
38
+ - What was inspected.
39
+ - What changed, if anything.
40
+ - Evidence gathered.
41
+ - Risks and assumptions.
42
+ - Permission needed before the next step.
43
+ - Recommended next role or squad.
44
+
45
+ Use `handoff-examples.md` to calibrate the expected shape of those handoffs.
46
+
47
+ ## Evidence
48
+
49
+ For meaningful work, ask Sagaz to maintain or produce:
50
+
51
+ - `templates/run-state.md`
52
+ - `templates/execution-trace.md`
53
+ - `templates/implementation-plan.md`
54
+ - `templates/qa-report.md`
55
+ - `templates/final-handoff.md`
56
+ - `templates/operational-memory.md` when recurring preferences should be retained.
57
+
58
+ ## Permissions
59
+
60
+ Sagaz follows `protocols/permission-contract.md`.
61
+
62
+ Team members should expect explicit approval before:
63
+
64
+ - Installing dependencies.
65
+ - Writing meaningful implementation files when not already approved.
66
+ - Creating durable project or team memory.
67
+ - Creating branches, commits, pull requests, tags, or GitHub releases.
68
+ - Deploying, publishing packages, or changing external services.
69
+ - Handling secrets, billing, production data, or destructive operations.
70
+
71
+ ## Success Criteria
72
+
73
+ Sagaz onboarding is working when:
74
+
75
+ - Each role can invoke Sagaz without needing a custom explanation every time.
76
+ - Handoffs are understandable to the next role.
77
+ - Designers, engineers, and PMs can use the same project context without losing decisions.
78
+ - Approval gates are visible before risky actions.
79
+ - Windows and macOS users can follow platform-appropriate commands.
80
+
81
+ ## Verification
82
+
83
+ Before considering onboarding complete, confirm:
84
+
85
+ - Each role can find its guide.
86
+ - Each role has at least one usable prompt.
87
+ - Handoff expectations are clear.
88
+ - Permission gates are understood.
89
+ - Evidence artifacts are known.
@@ -0,0 +1,95 @@
1
+ # Design Onboarding
2
+
3
+ ## Purpose
4
+
5
+ Help designers and design leads use Sagaz for UX flows, UI systems, visual QA, accessibility, and Figma MCP coordination.
6
+
7
+ ## Use When
8
+
9
+ - Creating app flows, mockups, dashboards, or mobile screens.
10
+ - Turning product requirements into UX structure.
11
+ - Preparing design systems and component states.
12
+ - Coordinating Figma MCP work.
13
+ - Reviewing visual quality before implementation or release.
14
+
15
+ ## Invocation
16
+
17
+ Give Sagaz the user context, screens, design constraints, and expected fidelity.
18
+
19
+ ```text
20
+ Sagaz: coordinate the design team for this product flow.
21
+
22
+ Users:
23
+ -
24
+
25
+ Screens or flows:
26
+ -
27
+
28
+ Design expectations:
29
+ -
30
+
31
+ Use Figma MCP if available. Create app-like mockups that can be inspected as real application flows. Ask before using external connectors.
32
+ ```
33
+
34
+ ## Required Inputs
35
+
36
+ - Target users and jobs.
37
+ - Screens, states, or flows.
38
+ - Brand or style constraints.
39
+ - Accessibility expectations.
40
+ - Responsive targets.
41
+ - Existing Figma links or design assets, when available.
42
+ - Permission to use Figma MCP or other connectors.
43
+
44
+ ## Expected Output
45
+
46
+ - UX flow or screen map.
47
+ - Component/state inventory.
48
+ - Interaction notes.
49
+ - Accessibility and responsive considerations.
50
+ - Figma MCP plan when applicable.
51
+ - Visual QA checklist.
52
+
53
+ ## Handoff
54
+
55
+ Design handoff should include:
56
+
57
+ - Screens and states covered.
58
+ - Design decisions.
59
+ - Component rules.
60
+ - Interaction behavior.
61
+ - Accessibility notes.
62
+ - Assets or Figma references.
63
+ - Implementation constraints.
64
+
65
+ ## Good Prompts
66
+
67
+ ```text
68
+ Sagaz: create a UX flow and screen inventory for this app before any UI generation.
69
+ ```
70
+
71
+ ```text
72
+ Sagaz: use the design team to review this UI for visual QA, responsiveness, and accessibility.
73
+ ```
74
+
75
+ ```text
76
+ Sagaz: prepare Figma MCP instructions for app-like mockups with states and interactions.
77
+ ```
78
+
79
+ ## Common Mistakes
80
+
81
+ - Asking for only a pretty screen without states.
82
+ - Skipping empty, loading, error, and success states.
83
+ - Forgetting mobile or desktop breakpoints.
84
+ - Treating Figma output as implementation-ready without visual QA.
85
+ - Using external connectors without approval.
86
+
87
+ ## Verification
88
+
89
+ Before handoff to engineering, confirm:
90
+
91
+ - Screens and states are complete.
92
+ - Components are reusable.
93
+ - Accessibility issues are identified.
94
+ - Responsive behavior is described.
95
+ - Implementation notes are clear.
@@ -0,0 +1,94 @@
1
+ # Engineering Onboarding
2
+
3
+ ## Purpose
4
+
5
+ Help engineers and tech leads use Sagaz for architecture, stack selection, implementation, refactors, tests, and safe delivery.
6
+
7
+ ## Use When
8
+
9
+ - Inspecting an existing codebase.
10
+ - Planning implementation.
11
+ - Choosing or confirming a stack.
12
+ - Refactoring safely.
13
+ - Adding tests or production readiness.
14
+ - Preparing commits, pull requests, or releases.
15
+
16
+ ## Invocation
17
+
18
+ Start by asking Sagaz to inspect before changing files.
19
+
20
+ ```text
21
+ Sagaz: inspect this project as the engineering team.
22
+
23
+ Goal:
24
+ -
25
+
26
+ Constraints:
27
+ - use existing patterns where reasonable
28
+ - preserve current behavior
29
+ - run proportional tests
30
+
31
+ Do not change files until you propose a plan and ask for approval.
32
+ ```
33
+
34
+ ## Required Inputs
35
+
36
+ - Goal or bug description.
37
+ - Current constraints.
38
+ - Risk tolerance.
39
+ - Existing stack expectations.
40
+ - Test command or deployment target, if known.
41
+ - Files or areas likely involved, if known.
42
+
43
+ ## Expected Output
44
+
45
+ - Project structure summary.
46
+ - Stack and pattern observations.
47
+ - Implementation plan.
48
+ - Risk assessment.
49
+ - Test plan.
50
+ - Permission level for the next step.
51
+
52
+ ## Handoff
53
+
54
+ Engineering handoff should include:
55
+
56
+ - Files inspected.
57
+ - Files changed.
58
+ - Commands run.
59
+ - Tests and results.
60
+ - Behavior preserved.
61
+ - Residual risk.
62
+ - Recommended next verification or release step.
63
+
64
+ ## Good Prompts
65
+
66
+ ```text
67
+ Sagaz: implement this feature using existing project patterns and explain the tests you will run first.
68
+ ```
69
+
70
+ ```text
71
+ Sagaz: refactor this module safely. Preserve behavior, add focused tests if needed, and keep the diff small.
72
+ ```
73
+
74
+ ```text
75
+ Sagaz: diagnose this bug, identify root cause, make the smallest safe fix, and prepare release notes.
76
+ ```
77
+
78
+ ## Common Mistakes
79
+
80
+ - Skipping repository inspection.
81
+ - Installing dependencies without approval.
82
+ - Refactoring unrelated files.
83
+ - Treating build success as complete verification.
84
+ - Forgetting rollback or deployment notes.
85
+
86
+ ## Verification
87
+
88
+ Before declaring engineering done, confirm:
89
+
90
+ - The implementation matches existing patterns.
91
+ - Tests are proportional to risk.
92
+ - Build or lint ran when relevant.
93
+ - User-facing behavior was checked.
94
+ - Residual risk is documented.
@@ -0,0 +1,114 @@
1
+ # Handoff Examples
2
+
3
+ ## Purpose
4
+
5
+ Provide short examples of good Sagaz handoffs between product, design, engineering, QA, and release work.
6
+
7
+ ## Use When
8
+
9
+ - A team member is learning what a Sagaz handoff should contain.
10
+ - A project needs consistent phase transitions.
11
+ - A reviewer wants to compare a handoff against the expected shape.
12
+
13
+ ## Invocation
14
+
15
+ Ask Sagaz to produce a handoff for the next role:
16
+
17
+ ```text
18
+ Sagaz: prepare a handoff from the current role to the next role. Include context, evidence, risks, assumptions, and permission needed before the next step.
19
+ ```
20
+
21
+ ## Handoff
22
+
23
+ Use the examples below as calibration. A handoff should be short enough to scan and complete enough that the next role can continue without reconstructing the whole thread.
24
+
25
+ ## Product To Design
26
+
27
+ ```md
28
+ Handoff: Product to Design
29
+ Goal: Create scheduling flow for small service businesses.
30
+ Users: Owner, staff member, customer.
31
+ Scope: Booking, reschedule, cancellation, admin calendar.
32
+ Non-goals: Payments in this release.
33
+ Acceptance criteria:
34
+ - Customer can book an available time.
35
+ - Owner can see bookings by day.
36
+ - Staff conflicts are prevented.
37
+ Risks:
38
+ - Calendar timezone behavior needs engineering review.
39
+ Next squad: Design Studio
40
+ Permission needed: approve UX exploration.
41
+ ```
42
+
43
+ ## Design To Engineering
44
+
45
+ ```md
46
+ Handoff: Design to Engineering
47
+ Screens covered: calendar, booking form, confirmation, empty state, error state.
48
+ Components: date picker, time slot list, booking summary, status banner.
49
+ Responsive notes: mobile single-column, desktop split calendar/detail view.
50
+ Accessibility: keyboard focus states and form labels required.
51
+ Assets: Figma link or local design references.
52
+ Next squad: Product Factory or Implementation Engineer
53
+ Permission needed: approve implementation.
54
+ ```
55
+
56
+ ## Engineering To QA
57
+
58
+ ```md
59
+ Handoff: Engineering to QA
60
+ Files changed:
61
+ - src/features/scheduling/*
62
+ - src/components/calendar/*
63
+ Commands run:
64
+ - npm test
65
+ - npm run build
66
+ Behavior verified:
67
+ - booking creation
68
+ - unavailable slot prevention
69
+ Residual risk:
70
+ - timezone edge cases need manual QA.
71
+ Next squad: QA Verifier
72
+ Permission needed: approve broader regression pass.
73
+ ```
74
+
75
+ ## QA To Release
76
+
77
+ ```md
78
+ Handoff: QA to Release
79
+ Verification:
80
+ - unit tests passed
81
+ - build passed
82
+ - manual booking flow passed on desktop and mobile viewport
83
+ Release blockers: none
84
+ Risks accepted:
85
+ - timezone behavior limited to configured business timezone.
86
+ Rollback:
87
+ - revert release commit
88
+ - redeploy previous known-good build
89
+ Next squad: GitHub Ops
90
+ Permission needed: approve commit, tag, release, deploy, or publish.
91
+ ```
92
+
93
+ ## Release To Maintenance
94
+
95
+ ```md
96
+ Handoff: Release to Maintenance
97
+ Released version: vX.Y.Z
98
+ Published artifacts:
99
+ - GitHub release
100
+ - deployment URL
101
+ - package version, when applicable
102
+ Monitoring:
103
+ - check runtime logs
104
+ - check user-reported booking failures
105
+ Follow-up:
106
+ - add timezone regression test
107
+ - review operational memory preference for release cadence
108
+ Next squad: Maintenance Ops
109
+ Permission needed: approve follow-up issue creation.
110
+ ```
111
+
112
+ ## Verification
113
+
114
+ A good handoff is complete when another role can continue without rereading the whole thread.