@tgoodington/intuition 11.4.0 → 11.5.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (131) hide show
  1. package/package.json +1 -2
  2. package/skills/intuition-enuncia-compose/SKILL.md +5 -10
  3. package/skills/intuition-enuncia-design/SKILL.md +0 -4
  4. package/skills/intuition-enuncia-discovery/SKILL.md +1 -7
  5. package/skills/intuition-enuncia-execute/SKILL.md +0 -4
  6. package/skills/intuition-enuncia-handoff/SKILL.md +0 -4
  7. package/skills/intuition-enuncia-initialize/references/claude_template.md +20 -0
  8. package/skills/intuition-enuncia-start/SKILL.md +0 -4
  9. package/skills/intuition-enuncia-verify/SKILL.md +0 -4
  10. package/docs/archive/ARCHITECTURE_OVERVIEW.txt +0 -405
  11. package/docs/archive/INSTALLATION.md +0 -431
  12. package/docs/archive/PROJECT_CONTEXT.md +0 -361
  13. package/docs/archive/QUICK_TEST_CHECKLIST.md +0 -467
  14. package/docs/archive/SKILL_INTERACTION_GUIDE.md +0 -993
  15. package/docs/archive/TESTING_README.md +0 -215
  16. package/docs/archive/TESTING_SUMMARY.md +0 -781
  17. package/docs/archive/WALDO_V3_COMPLETE_DOCUMENTATION.md +0 -538
  18. package/docs/archive/WALDO_V3_DESIGN_SUMMARY.md +0 -449
  19. package/docs/archive/intuition-architecture.md +0 -342
  20. package/docs/archive/intuition-workflow.md +0 -210
  21. package/docs/archive/intuition_design_skill_spec.md +0 -219
  22. package/docs/archive/v7_design_spec.md +0 -1111
  23. package/docs/archive/v7_plan.md +0 -339
  24. package/docs/archive/v9-design/decision-framework-direction.md +0 -142
  25. package/docs/archive/v9-design/decision-framework-implementation.md +0 -114
  26. package/docs/archive/v9-design/domain-adaptive-team-architecture.md +0 -1016
  27. package/docs/archive/v9-test/SESSION_SUMMARY.md +0 -117
  28. package/docs/archive/v9-test/TEST_PLAN.md +0 -119
  29. package/docs/archive/v9-test/blueprints/legal-analyst.md +0 -166
  30. package/docs/archive/v9-test/output/07_cover_letter.md +0 -41
  31. package/docs/archive/v9-test/phase2/mock_plan.md +0 -89
  32. package/docs/archive/v9-test/phase2/producers.json +0 -32
  33. package/docs/archive/v9-test/phase2/specialists/database-architect.specialist.md +0 -10
  34. package/docs/archive/v9-test/phase2/specialists/financial-analyst.specialist.md +0 -10
  35. package/docs/archive/v9-test/phase2/specialists/legal-analyst.specialist.md +0 -10
  36. package/docs/archive/v9-test/phase2/specialists/technical-writer.specialist.md +0 -10
  37. package/docs/archive/v9-test/phase2/team_assignment.json +0 -61
  38. package/docs/archive/v9-test/phase3/blueprints/legal-analyst.md +0 -840
  39. package/docs/archive/v9-test/phase3/legal-analyst-full.specialist.md +0 -111
  40. package/docs/archive/v9-test/phase3/project_context/nh_landlord_tenant_notes.md +0 -35
  41. package/docs/archive/v9-test/phase3/project_context/property_facts.md +0 -32
  42. package/docs/archive/v9-test/phase3b/blueprints/legal-analyst.md +0 -1715
  43. package/docs/archive/v9-test/phase3b/legal-analyst.specialist.md +0 -153
  44. package/docs/archive/v9-test/phase3b/scratch/legal-analyst-stage1.md +0 -270
  45. package/docs/archive/v9-test/phase4/TEST_PLAN.md +0 -32
  46. package/docs/archive/v9-test/phase4/blueprints/financial-analyst-T2.md +0 -538
  47. package/docs/archive/v9-test/phase4/blueprints/legal-analyst-T4.md +0 -253
  48. package/docs/archive/v9-test/phase4/cross-blueprint-check.md +0 -280
  49. package/docs/archive/v9-test/phase4/scratch/financial-analyst-T2-stage1.md +0 -67
  50. package/docs/archive/v9-test/phase4/scratch/legal-analyst-T4-stage1.md +0 -54
  51. package/docs/archive/v9-test/phase4/specialists/financial-analyst.specialist.md +0 -156
  52. package/docs/archive/v9-test/phase4/specialists/legal-analyst.specialist.md +0 -153
  53. package/docs/archive/v9-test/phase5/TEST_PLAN.md +0 -35
  54. package/docs/archive/v9-test/phase5/blueprints/code-architect-hw-vetter.md +0 -375
  55. package/docs/archive/v9-test/phase5/output/04_compliance_checklist.md +0 -149
  56. package/docs/archive/v9-test/phase5/output/hardware-vetter-SKILL-v2.md +0 -561
  57. package/docs/archive/v9-test/phase5/output/hardware-vetter-SKILL.md +0 -459
  58. package/docs/archive/v9-test/phase5/producers/code-writer.producer.md +0 -49
  59. package/docs/archive/v9-test/phase5/producers/document-writer.producer.md +0 -62
  60. package/docs/archive/v9-test/phase5/regression-comparison-v2.md +0 -60
  61. package/docs/archive/v9-test/phase5/regression-comparison.md +0 -197
  62. package/docs/archive/v9-test/phase5/review-5A-specialist.md +0 -213
  63. package/docs/archive/v9-test/phase5/specialist-test/TEST_PLAN.md +0 -60
  64. package/docs/archive/v9-test/phase5/specialist-test/blueprint-comparison.md +0 -252
  65. package/docs/archive/v9-test/phase5/specialist-test/blueprints/code-architect-hw-vetter.md +0 -916
  66. package/docs/archive/v9-test/phase5/specialist-test/scratch/code-architect-stage1.md +0 -427
  67. package/docs/archive/v9-test/phase5/specialists/code-architect.specialist.md +0 -168
  68. package/docs/archive/v9-test/phase5b/TEST_PLAN.md +0 -219
  69. package/docs/archive/v9-test/phase5b/blueprints/5B-10-stage2-with-decisions.md +0 -286
  70. package/docs/archive/v9-test/phase5b/decisions/5B-2-accept-all-decisions.json +0 -68
  71. package/docs/archive/v9-test/phase5b/decisions/5B-3-promote-decisions.json +0 -70
  72. package/docs/archive/v9-test/phase5b/decisions/5B-4-individual-decisions.json +0 -68
  73. package/docs/archive/v9-test/phase5b/decisions/5B-5-triage-decisions.json +0 -110
  74. package/docs/archive/v9-test/phase5b/decisions/5B-6-fallback-decisions.json +0 -40
  75. package/docs/archive/v9-test/phase5b/decisions/5B-8-partial-decisions.json +0 -46
  76. package/docs/archive/v9-test/phase5b/decisions/5B-9-complete-decisions.json +0 -54
  77. package/docs/archive/v9-test/phase5b/scratch/code-architect-stage1.md +0 -133
  78. package/docs/archive/v9-test/phase5b/specialists/code-architect.specialist.md +0 -202
  79. package/docs/archive/v9-test/phase5b/stage1-many-decisions.md +0 -139
  80. package/docs/archive/v9-test/phase5b/stage1-no-assumptions.md +0 -70
  81. package/docs/archive/v9-test/phase5b/stage1-with-assumptions.md +0 -86
  82. package/docs/archive/v9-test/phase5b/test-5B-1-results.md +0 -157
  83. package/docs/archive/v9-test/phase5b/test-5B-10-results.md +0 -130
  84. package/docs/archive/v9-test/phase5b/test-5B-2-results.md +0 -75
  85. package/docs/archive/v9-test/phase5b/test-5B-3-results.md +0 -104
  86. package/docs/archive/v9-test/phase5b/test-5B-4-results.md +0 -114
  87. package/docs/archive/v9-test/phase5b/test-5B-5-results.md +0 -126
  88. package/docs/archive/v9-test/phase5b/test-5B-6-results.md +0 -60
  89. package/docs/archive/v9-test/phase5b/test-5B-7-results.md +0 -141
  90. package/docs/archive/v9-test/phase5b/test-5B-8-results.md +0 -115
  91. package/docs/archive/v9-test/phase5b/test-5B-9-results.md +0 -76
  92. package/docs/archive/v9-test/producers/document-writer.producer.md +0 -62
  93. package/docs/archive/v9-test/specialists/legal-analyst.specialist.md +0 -58
  94. package/docs/project_notes/.project-memory-state.json +0 -100
  95. package/docs/project_notes/archive/trunk-v9.2-complete/.gitkeep +0 -0
  96. package/docs/project_notes/archive/trunk-v9.2-complete/.planning_research/decision_file_naming.md +0 -15
  97. package/docs/project_notes/archive/trunk-v9.2-complete/.planning_research/decisions_log.md +0 -32
  98. package/docs/project_notes/archive/trunk-v9.2-complete/.planning_research/orientation.md +0 -51
  99. package/docs/project_notes/archive/trunk-v9.2-complete/audit/plan-rename-hitlist.md +0 -654
  100. package/docs/project_notes/archive/trunk-v9.2-complete/blueprint-conflicts.md +0 -109
  101. package/docs/project_notes/archive/trunk-v9.2-complete/blueprints/database-architect.md +0 -416
  102. package/docs/project_notes/archive/trunk-v9.2-complete/blueprints/devops-infrastructure.md +0 -514
  103. package/docs/project_notes/archive/trunk-v9.2-complete/blueprints/technical-writer.md +0 -788
  104. package/docs/project_notes/archive/trunk-v9.2-complete/build_brief.md +0 -119
  105. package/docs/project_notes/archive/trunk-v9.2-complete/build_report.md +0 -250
  106. package/docs/project_notes/archive/trunk-v9.2-complete/detail_brief.md +0 -94
  107. package/docs/project_notes/archive/trunk-v9.2-complete/plan.md +0 -182
  108. package/docs/project_notes/archive/trunk-v9.2-complete/planning_brief.md +0 -96
  109. package/docs/project_notes/archive/trunk-v9.2-complete/prompt_brief.md +0 -60
  110. package/docs/project_notes/archive/trunk-v9.2-complete/prompt_output.json +0 -98
  111. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/database-architect-decisions.json +0 -72
  112. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/database-architect-research-plan.md +0 -10
  113. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/database-architect-stage1.md +0 -226
  114. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/devops-infrastructure-decisions.json +0 -71
  115. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/devops-infrastructure-research-plan.md +0 -7
  116. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/devops-infrastructure-stage1.md +0 -164
  117. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/technical-writer-decisions.json +0 -88
  118. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/technical-writer-research-plan.md +0 -7
  119. package/docs/project_notes/archive/trunk-v9.2-complete/scratch/technical-writer-stage1.md +0 -266
  120. package/docs/project_notes/archive/trunk-v9.2-complete/team_assignment.json +0 -108
  121. package/docs/project_notes/archive/trunk-v9.2-complete/test_brief.md +0 -75
  122. package/docs/project_notes/archive/trunk-v9.2-complete/test_report.md +0 -26
  123. package/docs/project_notes/archive/trunk-v9.2-complete/verification/devops-infrastructure-verification.md +0 -172
  124. package/docs/project_notes/branches/.gitkeep +0 -0
  125. package/docs/project_notes/bugs.md +0 -41
  126. package/docs/project_notes/decisions.md +0 -147
  127. package/docs/project_notes/issues.md +0 -101
  128. package/docs/project_notes/key_facts.md +0 -88
  129. package/docs/project_notes/trunk/.gitkeep +0 -0
  130. package/docs/project_notes/trunk/discovery_brief.md +0 -40
  131. package/docs/project_notes/v9.2-optimization-plan.md +0 -193
@@ -1,467 +0,0 @@
1
- # Quick Testing Checklist - Waldo v3 Validation
2
-
3
- **Use this checklist to verify critical functionality during testing.**
4
-
5
- **Version:** 3.0.0 | **Date:** February 5, 2026
6
-
7
- ---
8
-
9
- ## Pre-Testing Validation
10
-
11
- - [ ] Version in package.json is 3.0.0
12
- - [ ] All skill files present (start, initialize, discovery, handoff, plan, execute)
13
- - [ ] WALDO_V3_COMPLETE_DOCUMENTATION.md exists and is readable
14
- - [ ] .claude/USER_PROFILE.json template exists
15
- - [ ] Skills can be activated (npm install -g intuition)
16
-
17
- ---
18
-
19
- ## Test Session 1: Waldo v3 Core Features
20
-
21
- ### Setup
22
- - [ ] Create test project directory
23
- - [ ] Run `/intuition-initialize` to set up project memory
24
- - [ ] Verify docs/project_notes/ created with all templates
25
-
26
- ### Dialogue Mode Selection
27
- - [ ] Run `/intuition-discovery`
28
- - [ ] Waldo greets with mode selection question
29
- - [ ] Guided mode option available
30
- - [ ] Open-Ended mode option available
31
- - [ ] Can select mode without error
32
- - [ ] Mode stored in .project-memory-state.json > discovery.dialogue_mode
33
-
34
- ### Research Agent Launch
35
- - [ ] User describes context
36
- - [ ] Waldo identifies domain from context
37
- - [ ] Waldo launches 3 research tasks (check Task tool calls)
38
- - [ ] All 3 research tasks run in parallel (not sequential)
39
- - [ ] Research completes while dialogue continues
40
- - [ ] Research findings are available in state
41
-
42
- ### Guided Mode Dialogue
43
- - [ ] Questions use AskUserQuestion format
44
- - [ ] 2-4 options presented per question
45
- - [ ] "Other" option always available
46
- - [ ] Never more than 2 questions per turn
47
- - [ ] Questions informed by research findings
48
- - [ ] "Yes, and..." building evident in dialogue
49
- - [ ] Gentle steering (if applicable) is non-prescriptive
50
-
51
- ### Open-Ended Mode Dialogue
52
- - [ ] Questions are conversational (no options)
53
- - [ ] Natural dialogue flow
54
- - [ ] Never more than 2 questions per turn
55
- - [ ] Questions informed by research findings
56
- - [ ] Covers same GAPP dimensions as Guided mode
57
- - [ ] Same depth and rigor as Guided mode
58
-
59
- ### Discovery Completion
60
- - [ ] All GAPP dimensions explored (Problem, Goals, Context, Motivation)
61
- - [ ] Assumptions documented with confidence levels
62
- - [ ] User profile information discovered naturally
63
- - [ ] discovery_brief.md created with complete structure
64
- - [ ] discovery_output.json created with structured data
65
- - [ ] user_profile_learnings populated in discovery_output.json
66
-
67
- ---
68
-
69
- ## Test Session 2: Handoff Orchestration
70
-
71
- ### User Profile Extraction & Merging
72
- - [ ] Handoff reads discovery_output.json
73
- - [ ] Extracts user_profile_learnings
74
- - [ ] .claude/USER_PROFILE.json created or updated
75
- - [ ] Null fields populated from discovery learnings
76
- - [ ] Existing fields NOT overwritten (unless high confidence)
77
- - [ ] last_updated timestamp updated
78
- - [ ] projects_contributed_to includes current project
79
- - [ ] confidence_scores updated for each property
80
-
81
- ### Memory Updates
82
- - [ ] key_facts.md updated with discovered facts
83
- - [ ] Facts include dates and sources
84
- - [ ] decisions.md updated with new ADRs (if applicable)
85
- - [ ] ADRs include Context/Decision/Consequences
86
- - [ ] issues.md updated with work logged
87
- - [ ] All updates use proper formatting
88
-
89
- ### Brief Generation
90
- - [ ] planning_brief.md created (if discovery→planning)
91
- - [ ] planning_brief includes all required sections
92
- - [ ] Brief is focused and actionable for Magellan
93
- - [ ] execution_brief.md created (if planning→execution)
94
- - [ ] execution_brief includes all required sections
95
- - [ ] Brief is focused and actionable for Faraday
96
-
97
- ### State Transition
98
- - [ ] workflow.status updated correctly
99
- - [ ] Timestamps for phase starts/completes updated
100
- - [ ] discovery.completed = true (after discovery handoff)
101
- - [ ] planning.started = true (after discovery handoff)
102
- - [ ] All state changes consistent
103
-
104
- ---
105
-
106
- ## Test Session 3: Dual Dialogue Mode Equivalence
107
-
108
- ### Setup
109
- - [ ] Run /intuition-discovery in Guided mode (Project A)
110
- - [ ] Run /intuition-discovery in Open-Ended mode (Project B)
111
- - [ ] Use same context/topic for both
112
-
113
- ### Comparison
114
- - [ ] discovery_brief.md content is substantially same
115
- - [ ] GAPP coverage is equivalent
116
- - [ ] Assumptions are equivalent
117
- - [ ] Tone differs (options vs. conversational)
118
- - [ ] Depth and rigor are equivalent
119
- - [ ] Both route to /intuition-handoff
120
-
121
- ---
122
-
123
- ## Test Session 4: User Profile Persistence
124
-
125
- ### Setup
126
- - [ ] Complete discovery in Project A
127
- - [ ] Handoff merges profile
128
- - [ ] Check .claude/USER_PROFILE.json has data
129
-
130
- ### Verify Persistence
131
- - [ ] Start Project B
132
- - [ ] .claude/USER_PROFILE.json already populated
133
- - [ ] Magellan reads existing profile (if testing planning)
134
- - [ ] Faraday reads existing profile (if testing execution)
135
- - [ ] Run discovery in Project B
136
- - [ ] New learnings merged into profile
137
- - [ ] Profile completeness increases
138
- - [ ] Projects list includes both projects
139
-
140
- ---
141
-
142
- ## Test Session 5: Five-Skill Coordination
143
-
144
- ### Full Flow (Discovery → Plan → Execution)
145
- - [ ] /intuition-start shows discovery not started
146
- - [ ] /intuition-discovery creates discovery outputs
147
- - [ ] /intuition-start shows discovery complete
148
- - [ ] /intuition-handoff processes discovery
149
- - [ ] /intuition-start shows planning ready
150
- - [ ] /intuition-plan reads planning_brief.md
151
- - [ ] /intuition-plan creates plan.md
152
- - [ ] /intuition-plan routes to user approval
153
- - [ ] /intuition-handoff processes planning
154
- - [ ] /intuition-start shows execution ready
155
- - [ ] /intuition-execute reads execution_brief.md
156
- - [ ] /intuition-execute completes workflow
157
-
158
- ### State Consistency
159
- - [ ] .project-memory-state.json updated at each phase
160
- - [ ] No data loss between phases
161
- - [ ] File paths consistent
162
- - [ ] Timestamps accurate and ordered
163
-
164
- ---
165
-
166
- ## Test Session 6: Resume Capability
167
-
168
- ### Interrupt & Resume Discovery
169
- - [ ] Start /intuition-discovery
170
- - [ ] Partial dialogue (2-3 turns)
171
- - [ ] Interrupt (don't complete)
172
- - [ ] Check state tracking
173
- - [ ] Run /intuition-discovery again
174
- - [ ] Waldo resumes from last turn
175
- - [ ] Can complete discovery
176
- - [ ] Quality score shows same coverage
177
-
178
- ### Interrupt & Resume Handoff
179
- - [ ] Run /intuition-handoff (partial execution simulated)
180
- - [ ] Interrupt mid-way through updates
181
- - [ ] Run /intuition-handoff again
182
- - [ ] Resumes and completes
183
- - [ ] No duplicate updates in memory
184
-
185
- ---
186
-
187
- ## Test Session 7: File Management & State
188
-
189
- ### File Creation & Integrity
190
- - [ ] discovery_brief.md well-formed markdown
191
- - [ ] discovery_output.json valid JSON
192
- - [ ] planning_brief.md well-formed markdown
193
- - [ ] plan.md well-formed markdown
194
- - [ ] execution_brief.md well-formed markdown
195
- - [ ] .project-memory-state.json valid JSON (v3 schema)
196
- - [ ] USER_PROFILE.json valid JSON
197
-
198
- ### File Organization
199
- - [ ] All discovery files in docs/project_notes/
200
- - [ ] USER_PROFILE.json in .claude/
201
- - [ ] No files created outside expected locations
202
- - [ ] File naming conventions followed
203
-
204
- ### Timestamp Accuracy
205
- - [ ] Timestamps in ISO 8601 format
206
- - [ ] Ordered correctly (start < complete)
207
- - [ ] Consistent across related files
208
-
209
- ---
210
-
211
- ## Test Session 8: Cross-Sector Testing
212
-
213
- ### Test Domain 1: e-commerce
214
- - [ ] Run /intuition-discovery with e-commerce context
215
- - [ ] Research agents find e-commerce relevant practices
216
- - [ ] Questions are domain-appropriate
217
- - [ ] Profile learnings are domain-agnostic
218
-
219
- ### Test Domain 2: EdTech
220
- - [ ] Run /intuition-discovery with EdTech context
221
- - [ ] Research agents find EdTech relevant practices
222
- - [ ] Questions are domain-appropriate
223
- - [ ] Profile learnings are domain-agnostic
224
-
225
- ### Test Domain 3: Infrastructure/DevOps
226
- - [ ] Run /intuition-discovery with DevOps context
227
- - [ ] Research agents find infrastructure practices
228
- - [ ] Questions are domain-appropriate
229
- - [ ] Profile learnings are domain-agnostic
230
-
231
- **Result:** Waldo adapts to different domains without modification
232
-
233
- ---
234
-
235
- ## Test Session 9: Edge Cases
236
-
237
- ### Missing Files
238
- - [ ] Run handoff without discovery_output.json
239
- - [ ] Handoff falls back to reading discovery_brief.md
240
- - [ ] Completes with available data
241
- - [ ] Doesn't error out
242
-
243
- ### Poor Quality Output
244
- - [ ] Simulate incomplete discovery brief
245
- - [ ] Handoff documents as-is (doesn't "fix")
246
- - [ ] Quality score is low but handoff completes
247
- - [ ] User can request re-discovery
248
-
249
- ### Confidence Scoring in Profile Merge
250
- - [ ] Low confidence property discovered
251
- - [ ] Profile property still null
252
- - [ ] New low confidence value NOT overwritten
253
- - [ ] High confidence property discovered
254
- - [ ] Profile property with old value IS overwritten
255
-
256
- ### Corrupted State File
257
- - [ ] Corrupt .project-memory-state.json intentionally
258
- - [ ] Skill detects corruption
259
- - [ ] Falls back to file existence checking
260
- - [ ] Offers recovery option
261
-
262
- ---
263
-
264
- ## Test Session 10: Personalization (Magellan & Faraday)
265
-
266
- ### Magellan Reads Profile
267
- - [ ] Run /intuition-plan
268
- - [ ] Magellan reads .claude/USER_PROFILE.json
269
- - [ ] Plan detail adjusts based on user seniority
270
- - [ ] Communication matches user preference
271
- - [ ] Complexity matches user expertise
272
-
273
- ### Faraday Reads Profile
274
- - [ ] Run /intuition-execute
275
- - [ ] Faraday reads .claude/USER_PROFILE.json
276
- - [ ] Execution communication adjusted for user
277
- - [ ] Team size considered in delegation
278
- - [ ] Authority level respected in decisions
279
-
280
- ### Personalization Increases Over Time
281
- - [ ] Cycle 1: Profile is sparse, limited personalization
282
- - [ ] Cycle 2: Profile has more data, better personalization
283
- - [ ] Cycle 3: Profile is comprehensive, highly personalized
284
-
285
- ---
286
-
287
- ## Test Session 11: Quality & User Experience
288
-
289
- ### Wise Confidant Model
290
- - [ ] Waldo feels knowledgeable (not neutral)
291
- - [ ] Research insights come through in dialogue
292
- - [ ] User feels guided by peer, not interrogated
293
- - [ ] Trust builds through conversation
294
-
295
- ### "Yes, and..." Building
296
- - [ ] Waldo expands on user's ideas
297
- - [ ] Never negates or challenges user
298
- - [ ] Feels collaborative, not critical
299
- - [ ] Steers gently toward efficiency
300
-
301
- ### Gentle Steering
302
- - [ ] When flagging inefficiency, tone is respectful
303
- - [ ] User feels aware, not lectured
304
- - [ ] Steering is collaborative
305
- - [ ] User retains agency and decision-making
306
-
307
- ### Question Quality
308
- - [ ] Questions are thoughtful and relevant
309
- - [ ] 1-2 per turn is comfortable (not overwhelming)
310
- - [ ] Build naturally on previous responses
311
- - [ ] Guide toward deeper understanding
312
-
313
- ---
314
-
315
- ## Critical Gate: Security Review
316
-
317
- ### Faraday Security Gate
318
- - [ ] /intuition-execute includes security review step
319
- - [ ] Security review is MANDATORY (not optional)
320
- - [ ] No execution completion without security review
321
- - [ ] Security review status is tracked in state
322
- - [ ] All code passes security review before completion
323
-
324
- ---
325
-
326
- ## Regression Testing (v2 → v3 Compatibility)
327
-
328
- ### Project Memory Still Works
329
- - [ ] Existing bugs.md can be updated
330
- - [ ] Existing decisions.md can be updated
331
- - [ ] Existing issues.md can be updated
332
- - [ ] Existing key_facts.md can be updated
333
- - [ ] Old project memory accessible and usable
334
-
335
- ### Workflow Still Works
336
- - [ ] Discovery still produces outputs
337
- - [ ] Planning still creates structured plan
338
- - [ ] Execution still implements tasks
339
- - [ ] No breaking changes to workflow
340
-
341
- ### Resume Still Works
342
- - [ ] Can resume interrupted discovery
343
- - [ ] Can resume interrupted planning
344
- - [ ] Can resume interrupted execution
345
- - [ ] State preservation is reliable
346
-
347
- ---
348
-
349
- ## Success Criteria
350
-
351
- ### Functional Success
352
- - [x] All items in "Critical Gate" section pass
353
- - [x] All items in "Test Session 1-11" sections pass
354
- - [x] No critical errors during full workflow
355
-
356
- ### Quality Success
357
- - [x] Wise confidant model is evident
358
- - [x] "Yes, and..." building is noticeable
359
- - [x] Gentle steering is effective and respectful
360
- - [x] Questions feel natural and informed
361
- - [x] User feels like thinking partner is engaged
362
-
363
- ### Architecture Success
364
- - [x] File-based handoffs work correctly
365
- - [x] State management is reliable
366
- - [x] Memory consistency maintained
367
- - [x] Resume capability functional
368
- - [x] No data loss between phases
369
-
370
- ### v3 Specific Success
371
- - [x] Dual dialogue modes work
372
- - [x] User profile persistence works
373
- - [x] Profile merging uses correct logic
374
- - [x] Handoff orchestration is smooth
375
- - [x] Research agents launch and integrate findings
376
-
377
- ---
378
-
379
- ## Test Report Template
380
-
381
- ```markdown
382
- # Waldo v3 Testing Report
383
-
384
- **Date:** [Date]
385
- **Tester:** [Name]
386
- **Version Tested:** 3.0.0
387
-
388
- ## Overview
389
- [Summary of what was tested]
390
-
391
- ## Test Results
392
-
393
- ### Test Session 1: Waldo v3 Core Features
394
- - [ ] Dialogue mode selection ............ [PASS/FAIL]
395
- - [ ] Research agent launch ............. [PASS/FAIL]
396
- - [ ] Guided mode dialogue .............. [PASS/FAIL]
397
- - [ ] Open-ended mode dialogue .......... [PASS/FAIL]
398
- - [ ] Discovery completion .............. [PASS/FAIL]
399
-
400
- ### Test Session 2: Handoff Orchestration
401
- - [ ] User profile extraction ........... [PASS/FAIL]
402
- - [ ] Memory updates .................... [PASS/FAIL]
403
- - [ ] Brief generation .................. [PASS/FAIL]
404
- - [ ] State transition .................. [PASS/FAIL]
405
-
406
- [Continue for all test sessions]
407
-
408
- ## Issues Found
409
-
410
- ### Critical Issues
411
- [List any critical blockers]
412
-
413
- ### High Priority Issues
414
- [List important issues]
415
-
416
- ### Medium Priority Issues
417
- [List normal issues]
418
-
419
- ### Low Priority Issues
420
- [List minor issues]
421
-
422
- ## Recommendations
423
-
424
- [Testing findings and suggestions]
425
-
426
- ## Sign-Off
427
-
428
- - [x] Ready for production deployment
429
- - [ ] Needs additional testing
430
- - [ ] Blockers must be resolved
431
- ```
432
-
433
- ---
434
-
435
- ## Notes for Testers
436
-
437
- 1. **File Locations:** Always check `docs/project_notes/` for project-specific memory and `.claude/` for global profile
438
-
439
- 2. **State Tracking:** Review `.project-memory-state.json` after each phase to verify state consistency
440
-
441
- 3. **Mode Testing:** Test both Guided and Open-Ended modes with same context to verify equivalence
442
-
443
- 4. **Profile Building:** Use multiple projects to verify profile persistence and merging logic
444
-
445
- 5. **Research Integration:** Check that research findings are evident in question formulation (not obvious, but present)
446
-
447
- 6. **Documentation:** Refer to:
448
- - WALDO_V3_COMPLETE_DOCUMENTATION.md (full implementation)
449
- - SKILL_INTERACTION_GUIDE.md (how skills work together)
450
- - skills/intuition-discovery/references/waldo_core.md (detailed patterns)
451
- - skills/intuition-handoff/references/handoff_core.md (orchestration logic)
452
-
453
- 7. **When Tests Fail:**
454
- - Check state file first (often the root cause)
455
- - Verify file paths and locations
456
- - Review skill output for error messages
457
- - Consult documentation for expected behavior
458
- - Document exact failure scenario in test report
459
-
460
- ---
461
-
462
- **Testing Complete Checklist:**
463
- - [ ] All test sessions executed
464
- - [ ] Results documented
465
- - [ ] Issues categorized by severity
466
- - [ ] Test report generated
467
- - [ ] Go/no-go decision made