npm - sofia-cli - Versions diffs - 0.1.1 - Mend

sofia-cli 0.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (435)

package/.github/agents/copilot-instructions.md +39 -0
package/.github/agents/speckit.analyze.agent.md +184 -0
package/.github/agents/speckit.checklist.agent.md +294 -0
package/.github/agents/speckit.clarify.agent.md +181 -0
package/.github/agents/speckit.constitution.agent.md +84 -0
package/.github/agents/speckit.implement.agent.md +135 -0
package/.github/agents/speckit.plan.agent.md +90 -0
package/.github/agents/speckit.specify.agent.md +258 -0
package/.github/agents/speckit.tasks.agent.md +137 -0
package/.github/agents/speckit.taskstoissues.agent.md +30 -0
package/.github/copilot-instructions.md +257 -0
package/.github/prompts/speckit.analyze.prompt.md +3 -0
package/.github/prompts/speckit.checklist.prompt.md +3 -0
package/.github/prompts/speckit.clarify.prompt.md +3 -0
package/.github/prompts/speckit.constitution.prompt.md +3 -0
package/.github/prompts/speckit.implement.prompt.md +3 -0
package/.github/prompts/speckit.plan.prompt.md +3 -0
package/.github/prompts/speckit.specify.prompt.md +3 -0
package/.github/prompts/speckit.tasks.prompt.md +3 -0
package/.github/prompts/speckit.taskstoissues.prompt.md +3 -0
package/.github/workflows/ci.yml +38 -0
package/.prettierrc +6 -0
package/.specify/memory/constitution.md +181 -0
package/.specify/scripts/bash/check-prerequisites.sh +166 -0
package/.specify/scripts/bash/common.sh +156 -0
package/.specify/scripts/bash/create-new-feature.sh +297 -0
package/.specify/scripts/bash/setup-plan.sh +61 -0
package/.specify/scripts/bash/update-agent-context.sh +810 -0
package/.specify/templates/agent-file-template.md +28 -0
package/.specify/templates/checklist-template.md +40 -0
package/.specify/templates/constitution-template.md +50 -0
package/.specify/templates/plan-template.md +113 -0
package/.specify/templates/spec-template.md +115 -0
package/.specify/templates/tasks-template.md +251 -0
package/.vscode/mcp.json +42 -0
package/.vscode/settings.json +19 -0
package/CODE_OF_CONDUCT.md +128 -0
package/LICENSE +21 -0
package/README.md +213 -0
package/dist/src/cli/developCommand.js +240 -0
package/dist/src/cli/directCommands.js +143 -0
package/dist/src/cli/envLoader.js +16 -0
package/dist/src/cli/exportCommand.js +53 -0
package/dist/src/cli/index.js +203 -0
package/dist/src/cli/ioContext.js +109 -0
package/dist/src/cli/preflight.js +57 -0
package/dist/src/cli/statusCommand.js +110 -0
package/dist/src/cli/workshopCommand.js +400 -0
package/dist/src/develop/checkpointState.js +86 -0
package/dist/src/develop/codeGenerator.js +319 -0
package/dist/src/develop/dynamicScaffolder.js +226 -0
package/dist/src/develop/githubMcpAdapter.js +122 -0
package/dist/src/develop/index.js +15 -0
package/dist/src/develop/mcpContextEnricher.js +195 -0
package/dist/src/develop/pocScaffolder.js +542 -0
package/dist/src/develop/ralphLoop.js +659 -0
package/dist/src/develop/templateRegistry.js +364 -0
package/dist/src/develop/testRunner.js +202 -0
package/dist/src/logging/logger.js +58 -0
package/dist/src/loop/conversationLoop.js +227 -0
package/dist/src/loop/phaseSummarizer.js +87 -0
package/dist/src/mcp/mcpManager.js +267 -0
package/dist/src/mcp/mcpTransport.js +391 -0
package/dist/src/mcp/retryPolicy.js +47 -0
package/dist/src/mcp/webSearch.js +254 -0
package/dist/src/phases/contextSummarizer.js +101 -0
package/dist/src/phases/discoveryEnricher.js +156 -0
package/dist/src/phases/phaseExtractors.js +222 -0
package/dist/src/phases/phaseHandlers.js +328 -0
package/dist/src/prompts/design.md +51 -0
package/dist/src/prompts/develop-boundary.md +51 -0
package/dist/src/prompts/develop.md +111 -0
package/dist/src/prompts/discover.md +58 -0
package/dist/src/prompts/ideate.md +56 -0
package/dist/src/prompts/plan.md +51 -0
package/dist/src/prompts/promptLoader.js +167 -0
package/dist/src/prompts/promptLoader.ts +198 -0
package/dist/src/prompts/select.md +47 -0
package/dist/src/prompts/summarize/README.md +8 -0
package/dist/src/prompts/summarize/design-summary.md +37 -0
package/dist/src/prompts/summarize/develop-summary.md +25 -0
package/dist/src/prompts/summarize/ideate-summary.md +27 -0
package/dist/src/prompts/summarize/plan-summary.md +27 -0
package/dist/src/prompts/summarize/select-summary.md +21 -0
package/dist/src/prompts/system.md +28 -0
package/dist/src/sessions/exportPaths.js +22 -0
package/dist/src/sessions/exportWriter.js +406 -0
package/dist/src/sessions/sessionManager.js +81 -0
package/dist/src/sessions/sessionStore.js +65 -0
package/dist/src/shared/activitySpinner.js +91 -0
package/dist/src/shared/copilotClient.js +129 -0
package/dist/src/shared/data/cards.json +1249 -0
package/dist/src/shared/data/cardsLoader.js +51 -0
package/dist/src/shared/errorClassifier.js +120 -0
package/dist/src/shared/events.js +28 -0
package/dist/src/shared/markdownRenderer.js +34 -0
package/dist/src/shared/schemas/session.js +265 -0
package/dist/src/shared/tableRenderer.js +20 -0
package/dist/src/vendor/chalk.js +2 -0
package/dist/src/vendor/cli-table3.js +3 -0
package/dist/src/vendor/commander.js +2 -0
package/dist/src/vendor/marked-terminal.js +3 -0
package/dist/src/vendor/marked.js +2 -0
package/dist/src/vendor/ora.js +2 -0
package/dist/src/vendor/pino.js +2 -0
package/dist/src/vendor/zod.js +2 -0
package/dist/tests/e2e/developE2e.spec.js +126 -0
package/dist/tests/e2e/developFailureE2e.spec.js +247 -0
package/dist/tests/e2e/developPty.spec.js +75 -0
package/dist/tests/e2e/discoveryWebSearchRelevance.spec.js +84 -0
package/dist/tests/e2e/harness.spec.js +83 -0
package/dist/tests/e2e/mcpLive.spec.js +120 -0
package/dist/tests/e2e/newSession.e2e.spec.js +177 -0
package/dist/tests/e2e/ralphLoopEnrichmentComparison.spec.js +62 -0
package/dist/tests/e2e/workiqEnrichment.spec.js +56 -0
package/dist/tests/e2e/zavaSimulation.spec.js +452 -0
package/dist/tests/fixtures/test-fixture-project/src/add.js +3 -0
package/dist/tests/fixtures/test-fixture-project/tests/failing.test.js +6 -0
package/dist/tests/fixtures/test-fixture-project/tests/hanging.test.js +8 -0
package/dist/tests/fixtures/test-fixture-project/tests/passing.test.js +10 -0
package/dist/tests/fixtures/test-fixture-project/vitest.config.js +6 -0
package/dist/tests/integration/autoStartConversation.spec.js +138 -0
package/dist/tests/integration/defaultCommand.spec.js +147 -0
package/dist/tests/integration/directCommandNonTty.spec.js +224 -0
package/dist/tests/integration/directCommandTty.spec.js +151 -0
package/dist/tests/integration/discoveryEnrichmentFlow.spec.js +175 -0
package/dist/tests/integration/exportArtifacts.spec.js +202 -0
package/dist/tests/integration/exportFallbackFlow.spec.js +99 -0
package/dist/tests/integration/mcpDegradationFlow.spec.js +190 -0
package/dist/tests/integration/mcpTransportFlow.spec.js +139 -0
package/dist/tests/integration/newSessionFlow.spec.js +343 -0
package/dist/tests/integration/pocGithubMcp.spec.js +186 -0
package/dist/tests/integration/pocLocalFallback.spec.js +171 -0
package/dist/tests/integration/pocScaffold.spec.js +163 -0
package/dist/tests/integration/ralphLoopFlow.spec.js +359 -0
package/dist/tests/integration/ralphLoopPartial.spec.js +368 -0
package/dist/tests/integration/resumeAndBacktrack.spec.js +247 -0
package/dist/tests/integration/spinnerLifecycle.spec.js +220 -0
package/dist/tests/integration/summarizationFlow.spec.js +115 -0
package/dist/tests/integration/testRunnerReal.spec.js +52 -0
package/dist/tests/integration/webSearchAgent.spec.js +128 -0
package/dist/tests/live/copilotSdkLive.spec.js +107 -0
package/dist/tests/live/zavaFullWorkshop.spec.js +392 -0
package/dist/tests/setup/loadEnv.js +3 -0
package/dist/tests/unit/cli/developCommand.spec.js +567 -0
package/dist/tests/unit/cli/directCommands.spec.js +279 -0
package/dist/tests/unit/cli/envLoader.spec.js +58 -0
package/dist/tests/unit/cli/ioContext.spec.js +119 -0
package/dist/tests/unit/cli/preflight.spec.js +108 -0
package/dist/tests/unit/cli/statusCommand.spec.js +111 -0
package/dist/tests/unit/cli/workshopClientFallback.spec.js +80 -0
package/dist/tests/unit/cli/workshopCommand.spec.js +329 -0
package/dist/tests/unit/config/vitestEnvSetup.spec.js +13 -0
package/dist/tests/unit/develop/checkpointState.spec.js +315 -0
package/dist/tests/unit/develop/codeGenerator.spec.js +355 -0
package/dist/tests/unit/develop/githubMcpAdapter.spec.js +231 -0
package/dist/tests/unit/develop/mcpContextEnricher.spec.js +433 -0
package/dist/tests/unit/develop/outputValidator.spec.js +119 -0
package/dist/tests/unit/develop/pocScaffolder.spec.js +353 -0
package/dist/tests/unit/develop/ralphLoop.spec.js +1248 -0
package/dist/tests/unit/develop/templateRegistry.spec.js +85 -0
package/dist/tests/unit/develop/testRunner.spec.js +249 -0
package/dist/tests/unit/infraBicep.spec.js +92 -0
package/dist/tests/unit/infraDeploy.spec.js +82 -0
package/dist/tests/unit/infraTeardown.spec.js +63 -0
package/dist/tests/unit/logging/logger.spec.js +43 -0
package/dist/tests/unit/loop/conversationLoop.spec.js +592 -0
package/dist/tests/unit/loop/phaseSummarizer.spec.js +141 -0
package/dist/tests/unit/loop/streamingMarkdown.spec.js +147 -0
package/dist/tests/unit/mcp/mcpManager.spec.js +279 -0
package/dist/tests/unit/mcp/mcpTransport.spec.js +529 -0
package/dist/tests/unit/mcp/retryPolicy.spec.js +218 -0
package/dist/tests/unit/mcp/timeoutValidation.spec.js +46 -0
package/dist/tests/unit/mcp/webSearch.spec.js +567 -0
package/dist/tests/unit/phases/contextSummarizer.spec.js +140 -0
package/dist/tests/unit/phases/discoveryEnricher.repeatCalls.spec.js +93 -0
package/dist/tests/unit/phases/discoveryEnricher.spec.js +411 -0
package/dist/tests/unit/phases/phaseExtractors.spec.js +352 -0
package/dist/tests/unit/phases/phaseHandlers.spec.js +425 -0
package/dist/tests/unit/prompts/promptLoader.spec.js +118 -0
package/dist/tests/unit/schemas/pocSchemas.spec.js +412 -0
package/dist/tests/unit/schemas/session.spec.js +257 -0
package/dist/tests/unit/sessions/exportPaths.spec.js +31 -0
package/dist/tests/unit/sessions/exportWriter.spec.js +655 -0
package/dist/tests/unit/sessions/sessionManager.spec.js +151 -0
package/dist/tests/unit/sessions/sessionStore.spec.js +116 -0
package/dist/tests/unit/shared/activitySpinner.spec.js +175 -0
package/dist/tests/unit/shared/cardsLoader.spec.js +76 -0
package/dist/tests/unit/shared/copilotClient.spec.js +155 -0
package/dist/tests/unit/shared/errorClassifier.spec.js +131 -0
package/dist/tests/unit/shared/events.spec.js +55 -0
package/dist/tests/unit/shared/markdownRenderer.spec.js +35 -0
package/dist/tests/unit/shared/markdownRendererChunks.spec.js +70 -0
package/dist/tests/unit/shared/tableRenderer.spec.js +34 -0
package/dist/vitest.config.js +14 -0
package/dist/vitest.live.config.js +18 -0
package/docs/README.md +35 -0
package/docs/architecture.md +169 -0
package/docs/cli-usage.md +207 -0
package/docs/environment.md +66 -0
package/docs/export-format.md +146 -0
package/docs/session-model.md +113 -0
package/eslint.config.js +35 -0
package/infra/deploy.sh +193 -0
package/infra/gather-env.sh +211 -0
package/infra/main.bicep +90 -0
package/infra/main.bicepparam +18 -0
package/infra/resources.bicep +134 -0
package/infra/teardown.sh +114 -0
package/package.json +63 -0
package/specs/001-cli-workshop-rebuild/checklists/requirements.md +35 -0
package/specs/001-cli-workshop-rebuild/contracts/cli.md +59 -0
package/specs/001-cli-workshop-rebuild/contracts/export-summary-json.md +23 -0
package/specs/001-cli-workshop-rebuild/contracts/session-json.md +30 -0
package/specs/001-cli-workshop-rebuild/data-model.md +210 -0
package/specs/001-cli-workshop-rebuild/plan.md +361 -0
package/specs/001-cli-workshop-rebuild/quickstart.md +83 -0
package/specs/001-cli-workshop-rebuild/research.md +116 -0
package/specs/001-cli-workshop-rebuild/spec.md +240 -0
package/specs/001-cli-workshop-rebuild/tasks.md +476 -0
package/specs/002-poc-generation/contracts/poc-output.md +172 -0
package/specs/002-poc-generation/contracts/ralph-loop.md +113 -0
package/specs/002-poc-generation/data-model.md +172 -0
package/specs/002-poc-generation/plan.md +109 -0
package/specs/002-poc-generation/quickstart.md +97 -0
package/specs/002-poc-generation/research.md +786 -0
package/specs/002-poc-generation/spec.md +81 -0
package/specs/002-poc-generation/tasks-fix.md +198 -0
package/specs/002-poc-generation/tasks.md +252 -0
package/specs/003-mcp-transport-integration/checklists/requirements.md +37 -0
package/specs/003-mcp-transport-integration/contracts/context-enricher.md +220 -0
package/specs/003-mcp-transport-integration/contracts/discovery-enricher.md +267 -0
package/specs/003-mcp-transport-integration/contracts/github-adapter.md +149 -0
package/specs/003-mcp-transport-integration/contracts/mcp-transport.md +288 -0
package/specs/003-mcp-transport-integration/data-model.md +326 -0
package/specs/003-mcp-transport-integration/plan.md +114 -0
package/specs/003-mcp-transport-integration/quickstart.md +311 -0
package/specs/003-mcp-transport-integration/research.md +395 -0
package/specs/003-mcp-transport-integration/spec.md +234 -0
package/specs/003-mcp-transport-integration/tasks.md +324 -0
package/specs/003-next-spec-gaps.md +150 -0
package/specs/004-dev-resume-hardening/checklists/requirements.md +37 -0
package/specs/004-dev-resume-hardening/contracts/cli.md +160 -0
package/specs/004-dev-resume-hardening/data-model.md +321 -0
package/specs/004-dev-resume-hardening/plan.md +107 -0
package/specs/004-dev-resume-hardening/quickstart.md +115 -0
package/specs/004-dev-resume-hardening/research.md +142 -0
package/specs/004-dev-resume-hardening/spec.md +221 -0
package/specs/004-dev-resume-hardening/tasks.md +333 -0
package/specs/005-ai-search-deploy/checklists/requirements.md +39 -0
package/specs/005-ai-search-deploy/contracts/web-search-tool.md +241 -0
package/specs/005-ai-search-deploy/data-model.md +130 -0
package/specs/005-ai-search-deploy/plan.md +93 -0
package/specs/005-ai-search-deploy/quickstart.md +96 -0
package/specs/005-ai-search-deploy/research.md +187 -0
package/specs/005-ai-search-deploy/spec.md +143 -0
package/specs/005-ai-search-deploy/tasks.md +284 -0
package/specs/006-workshop-extraction-fixes/checklists/requirements.md +61 -0
package/specs/006-workshop-extraction-fixes/contracts/summarization-and-export.md +131 -0
package/specs/006-workshop-extraction-fixes/data-model.md +149 -0
package/specs/006-workshop-extraction-fixes/plan.md +123 -0
package/specs/006-workshop-extraction-fixes/quickstart.md +101 -0
package/specs/006-workshop-extraction-fixes/research.md +143 -0
package/specs/006-workshop-extraction-fixes/spec.md +210 -0
package/specs/006-workshop-extraction-fixes/tasks.md +316 -0
package/src/cli/developCommand.ts +308 -0
package/src/cli/directCommands.ts +195 -0
package/src/cli/envLoader.ts +17 -0
package/src/cli/exportCommand.ts +65 -0
package/src/cli/index.ts +249 -0
package/src/cli/ioContext.ts +139 -0
package/src/cli/preflight.ts +86 -0
package/src/cli/statusCommand.ts +118 -0
package/src/cli/workshopCommand.ts +496 -0
package/src/develop/checkpointState.ts +121 -0
package/src/develop/codeGenerator.ts +402 -0
package/src/develop/dynamicScaffolder.ts +284 -0
package/src/develop/githubMcpAdapter.ts +199 -0
package/src/develop/index.ts +34 -0
package/src/develop/mcpContextEnricher.ts +279 -0
package/src/develop/pocScaffolder.ts +646 -0
package/src/develop/ralphLoop.ts +1044 -0
package/src/develop/templateRegistry.ts +427 -0
package/src/develop/testRunner.ts +276 -0
package/src/logging/logger.ts +73 -0
package/src/loop/conversationLoop.ts +355 -0
package/src/loop/phaseSummarizer.ts +114 -0
package/src/mcp/mcpManager.ts +365 -0
package/src/mcp/mcpTransport.ts +562 -0
package/src/mcp/retryPolicy.ts +87 -0
package/src/mcp/webSearch.ts +388 -0
package/src/originalPrompts/design_thinking.md +178 -0
package/src/originalPrompts/design_thinking_persona.md +76 -0
package/src/originalPrompts/document_generator_example.md +77 -0
package/src/originalPrompts/document_generator_persona.md +47 -0
package/src/originalPrompts/facilitator_persona.md +125 -0
package/src/originalPrompts/guardrails.md +47 -0
package/src/phases/contextSummarizer.ts +154 -0
package/src/phases/discoveryEnricher.ts +223 -0
package/src/phases/phaseExtractors.ts +247 -0
package/src/phases/phaseHandlers.ts +450 -0
package/src/prompts/design.md +51 -0
package/src/prompts/develop-boundary.md +51 -0
package/src/prompts/develop.md +111 -0
package/src/prompts/discover.md +58 -0
package/src/prompts/ideate.md +56 -0
package/src/prompts/plan.md +51 -0
package/src/prompts/promptLoader.ts +198 -0
package/src/prompts/select.md +47 -0
package/src/prompts/summarize/README.md +8 -0
package/src/prompts/summarize/design-summary.md +37 -0
package/src/prompts/summarize/develop-summary.md +25 -0
package/src/prompts/summarize/ideate-summary.md +27 -0
package/src/prompts/summarize/plan-summary.md +27 -0
package/src/prompts/summarize/select-summary.md +21 -0
package/src/prompts/system.md +28 -0
package/src/sessions/exportPaths.ts +28 -0
package/src/sessions/exportWriter.ts +490 -0
package/src/sessions/sessionManager.ts +119 -0
package/src/sessions/sessionStore.ts +69 -0
package/src/shared/activitySpinner.ts +108 -0
package/src/shared/copilotClient.ts +291 -0
package/src/shared/data/cards.json +1249 -0
package/src/shared/data/cardsLoader.ts +70 -0
package/src/shared/errorClassifier.ts +160 -0
package/src/shared/events.ts +103 -0
package/src/shared/markdownRenderer.ts +44 -0
package/src/shared/schemas/session.ts +346 -0
package/src/shared/tableRenderer.ts +28 -0
package/src/types/marked-terminal.d.ts +5 -0
package/src/vendor/chalk.ts +2 -0
package/src/vendor/cli-table3.ts +3 -0
package/src/vendor/commander.ts +2 -0
package/src/vendor/marked-terminal.ts +3 -0
package/src/vendor/marked.ts +2 -0
package/src/vendor/ora.ts +2 -0
package/src/vendor/pino.ts +3 -0
package/src/vendor/zod.ts +3 -0
package/tests/e2e/developE2e.spec.ts +152 -0
package/tests/e2e/developFailureE2e.spec.ts +289 -0
package/tests/e2e/developPty.spec.ts +86 -0
package/tests/e2e/discoveryWebSearchRelevance.spec.ts +103 -0
package/tests/e2e/harness.spec.ts +104 -0
package/tests/e2e/mcpLive.spec.ts +149 -0
package/tests/e2e/newSession.e2e.spec.ts +245 -0
package/tests/e2e/ralphLoopEnrichmentComparison.spec.ts +70 -0
package/tests/e2e/workiqEnrichment.spec.ts +72 -0
package/tests/e2e/zava-assessment/agent-interaction-script.md +258 -0
package/tests/e2e/zava-assessment/company-profile.md +98 -0
package/tests/e2e/zava-assessment/expected-results-checklist.md +454 -0
package/tests/e2e/zavaSimulation.spec.ts +511 -0
package/tests/fixtures/completedSession.json +141 -0
package/tests/fixtures/test-fixture-project/package-lock.json +1585 -0
package/tests/fixtures/test-fixture-project/package.json +12 -0
package/tests/fixtures/test-fixture-project/src/add.ts +3 -0
package/tests/fixtures/test-fixture-project/tests/failing.test.ts +7 -0
package/tests/fixtures/test-fixture-project/tests/hanging.test.ts +9 -0
package/tests/fixtures/test-fixture-project/tests/passing.test.ts +13 -0
package/tests/fixtures/test-fixture-project/vitest.config.ts +7 -0
package/tests/integration/autoStartConversation.spec.ts +168 -0
package/tests/integration/defaultCommand.spec.ts +179 -0
package/tests/integration/directCommandNonTty.spec.ts +260 -0
package/tests/integration/directCommandTty.spec.ts +185 -0
package/tests/integration/discoveryEnrichmentFlow.spec.ts +209 -0
package/tests/integration/exportArtifacts.spec.ts +232 -0
package/tests/integration/exportFallbackFlow.spec.ts +115 -0
package/tests/integration/mcpDegradationFlow.spec.ts +231 -0
package/tests/integration/mcpTransportFlow.spec.ts +178 -0
package/tests/integration/newSessionFlow.spec.ts +406 -0
package/tests/integration/pocGithubMcp.spec.ts +224 -0
package/tests/integration/pocLocalFallback.spec.ts +205 -0
package/tests/integration/pocScaffold.spec.ts +220 -0
package/tests/integration/ralphLoopFlow.spec.ts +430 -0
package/tests/integration/ralphLoopPartial.spec.ts +416 -0
package/tests/integration/resumeAndBacktrack.spec.ts +278 -0
package/tests/integration/spinnerLifecycle.spec.ts +270 -0
package/tests/integration/summarizationFlow.spec.ts +135 -0
package/tests/integration/testRunnerReal.spec.ts +63 -0
package/tests/integration/webSearchAgent.spec.ts +155 -0
package/tests/live/copilotSdkLive.spec.ts +149 -0
package/tests/live/zavaFullWorkshop.spec.ts +515 -0
package/tests/setup/loadEnv.ts +5 -0
package/tests/unit/cli/developCommand.spec.ts +679 -0
package/tests/unit/cli/directCommands.spec.ts +325 -0
package/tests/unit/cli/envLoader.spec.ts +73 -0
package/tests/unit/cli/ioContext.spec.ts +148 -0
package/tests/unit/cli/preflight.spec.ts +125 -0
package/tests/unit/cli/statusCommand.spec.ts +134 -0
package/tests/unit/cli/workshopClientFallback.spec.ts +100 -0
package/tests/unit/cli/workshopCommand.spec.ts +378 -0
package/tests/unit/config/vitestEnvSetup.spec.ts +24 -0
package/tests/unit/develop/checkpointState.spec.ts +378 -0
package/tests/unit/develop/codeGenerator.spec.ts +447 -0
package/tests/unit/develop/githubMcpAdapter.spec.ts +283 -0
package/tests/unit/develop/mcpContextEnricher.spec.ts +564 -0
package/tests/unit/develop/outputValidator.spec.ts +134 -0
package/tests/unit/develop/pocScaffolder.spec.ts +451 -0
package/tests/unit/develop/ralphLoop.spec.ts +1439 -0
package/tests/unit/develop/templateRegistry.spec.ts +106 -0
package/tests/unit/develop/testRunner.spec.ts +294 -0
package/tests/unit/infraBicep.spec.ts +116 -0
package/tests/unit/infraDeploy.spec.ts +102 -0
package/tests/unit/infraTeardown.spec.ts +77 -0
package/tests/unit/logging/logger.spec.ts +50 -0
package/tests/unit/loop/conversationLoop.spec.ts +719 -0
package/tests/unit/loop/phaseSummarizer.spec.ts +169 -0
package/tests/unit/loop/streamingMarkdown.spec.ts +180 -0
package/tests/unit/mcp/mcpManager.spec.ts +336 -0
package/tests/unit/mcp/mcpTransport.spec.ts +689 -0
package/tests/unit/mcp/retryPolicy.spec.ts +278 -0
package/tests/unit/mcp/timeoutValidation.spec.ts +55 -0
package/tests/unit/mcp/webSearch.spec.ts +718 -0
package/tests/unit/phases/contextSummarizer.spec.ts +158 -0
package/tests/unit/phases/discoveryEnricher.repeatCalls.spec.ts +125 -0
package/tests/unit/phases/discoveryEnricher.spec.ts +512 -0
package/tests/unit/phases/phaseExtractors.spec.ts +406 -0
package/tests/unit/phases/phaseHandlers.spec.ts +483 -0
package/tests/unit/prompts/promptLoader.spec.ts +144 -0
package/tests/unit/schemas/pocSchemas.spec.ts +457 -0
package/tests/unit/schemas/session.spec.ts +328 -0
package/tests/unit/sessions/exportPaths.spec.ts +38 -0
package/tests/unit/sessions/exportWriter.spec.ts +737 -0
package/tests/unit/sessions/sessionManager.spec.ts +174 -0
package/tests/unit/sessions/sessionStore.spec.ts +136 -0
package/tests/unit/shared/activitySpinner.spec.ts +211 -0
package/tests/unit/shared/cardsLoader.spec.ts +89 -0
package/tests/unit/shared/copilotClient.spec.ts +185 -0
package/tests/unit/shared/errorClassifier.spec.ts +152 -0
package/tests/unit/shared/events.spec.ts +71 -0
package/tests/unit/shared/markdownRenderer.spec.ts +42 -0
package/tests/unit/shared/markdownRendererChunks.spec.ts +83 -0
package/tests/unit/shared/tableRenderer.spec.ts +38 -0
package/tsconfig.json +20 -0
package/vitest.config.ts +15 -0
package/vitest.live.config.ts +19 -0

package/specs/002-poc-generation/research.md ADDED

@@ -0,0 +1,786 @@
+# Research: PoC Generation & Ralph Loop
+**Feature ID**: 002-poc-generation
+**Date**: 2026-02-27
+**Status**: Complete
+---
+## Topic 1: Ralph Loop Pattern
+### Findings
+The Ralph Loop is an **autonomous, iterative code-generation-test-refine** pattern originally conceived by Geoffrey Huntley ([ghuntley.com/ralph](https://ghuntley.com/ralph/)) and formalized as a Claude Code plugin at [`anthropics/claude-plugins-official/plugins/ralph-loop`](https://github.com/anthropics/claude-plugins-official/tree/main/plugins/ralph-loop).
+#### Canonical Pattern
+The core concept is simple: **a `while true` loop that repeatedly feeds the same prompt to an LLM**, where the LLM's work persists across iterations via the filesystem.
+```
+┌───────────────────────────────────────────┐
+│              Ralph Loop                   │
+│                                           │
+│   ┌─────────┐     ┌────────────────┐      │
+│   │ Prompt  │────▶│ LLM works on   │      │
+│   │ (fixed) │     │ task, modifies │      │
+│   └─────────┘     │ files, runs    │      │
+│       ▲           │ tests          │      │
+│       │           └───────┬────────┘      │
+│       │                   │               │
+│       │           ┌───────▼────────┐      │
+│       │           │ Check exit     │      │
+│       │           │ conditions     │      │
+│       │           └───────┬────────┘      │
+│       │                   │               │
+│       │         ┌─────────┴─────────┐     │
+│       │     CONTINUE             STOP     │
+│       │         │                   │     │
+│       └─────────┘           ┌───────▼──┐  │
+│                             │ Complete │  │
+│                             └──────────┘  │
+└───────────────────────────────────────────┘
+```
+#### Iteration Steps (per the canonical implementation)
+1. **LLM receives the SAME prompt** every iteration (the prompt never changes)
+2. **LLM works on the task** — generates/modifies code, runs tests, reviews output
+3. **LLM tries to exit** — considers itself "done" for this pass
+4. **Stop hook intercepts** — checks termination conditions
+5. **If not complete** — blocks exit, feeds the same prompt back, increments iteration counter
+6. **Self-reference** — LLM sees its previous work in files and git history
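The six steps above can be sketched as a plain loop. This is an illustrative TypeScript rendering under stated assumptions, not the plugin's own code: `canonicalRalph`, `RunAgent`, and `LoopOutcome` are hypothetical names, and the agent call is injected so the loop can be exercised without an LLM.

```typescript
// Hypothetical agent call: receives the fixed prompt, works on files, returns its final text.
type RunAgent = (prompt: string, iteration: number) => Promise<string>;

interface LoopOutcome {
  iterations: number;
  reason: 'promise' | 'max_iterations';
}

async function canonicalRalph(
  prompt: string,
  completionPromise: string,
  maxIterations: number,
  runAgent: RunAgent,
): Promise<LoopOutcome> {
  for (let iteration = 1; iteration <= maxIterations; iteration++) {
    // Steps 1-3: the same prompt goes in every time; progress lives on disk.
    const output = await runAgent(prompt, iteration);
    // Step 4: the "stop hook" looks for the promise tag in the agent's output.
    if (output.includes(`<promise>${completionPromise}</promise>`)) {
      return { iterations: iteration, reason: 'promise' };
    }
    // Step 5: not complete, so fall through and feed the same prompt back.
  }
  return { iterations: maxIterations, reason: 'max_iterations' };
}
```

Nothing from one iteration's output is passed to the next call; any carry-over has to come from what the agent wrote to the filesystem (step 6).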
+#### Termination Conditions
+The canonical implementation uses three termination mechanisms:
+| Condition | Mechanism | Priority |
+|-----------|-----------|----------|
+| **Completion promise** | LLM outputs `<promise>EXACT_TEXT</promise>` tag; stop hook does exact string match | Primary (semantic) |
+| **Max iterations** | Counter in state file; stop hook checks `iteration >= max_iterations` | Safety net |
+| **State file removal** | User runs `/cancel-ralph` or hook detects corruption | Manual override |
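A minimal sketch of how the three conditions in the table might be evaluated together. The order of the checks and the names `RalphState`, `StopReason`, and `checkTermination` are assumptions for illustration; the table only fixes the mechanisms, not how they are sequenced.

```typescript
interface RalphState {
  iteration: number;
  maxIterations: number;
  completionPromise: string;
}

type StopReason = 'state_removed' | 'promise' | 'max_iterations';

// Returns why the loop should stop, or null when it should continue.
// A null state stands for a removed (or unreadable) state file.
function checkTermination(state: RalphState | null, lastOutput: string): StopReason | null {
  if (state === null) return 'state_removed';
  // Exact string match on the text inside the first <promise> tag.
  const match = /<promise>([\s\S]*?)<\/promise>/.exec(lastOutput);
  if (match?.[1] === state.completionPromise) return 'promise';
  if (state.iteration >= state.maxIterations) return 'max_iterations';
  return null;
}
```

Because the match is exact, a promise that differs only in letter case or whitespace does not end the loop.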
+#### State File Format
+```markdown
+---
+active: true
+iteration: 1
+max_iterations: 10
+completion_promise: "All tests passing"
+started_at: "2026-02-27T14:30:00Z"
+---
+Build a REST API for todos.
+When complete:
+- All CRUD endpoints working
+- Tests passing (coverage > 80%)
+- Output: <promise>All tests passing</promise>
+```
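To show how the state file splits into metadata and prompt, here is a small parser sketch. It handles only the flat `key: value` front matter shown above; `parseStateFile` and `ParsedState` are hypothetical names, and a real YAML parser would be the safer choice for anything richer.

```typescript
interface ParsedState {
  fields: Record<string, string | number | boolean>;
  prompt: string;
}

function parseStateFile(text: string): ParsedState {
  // Front matter sits between the first two `---` lines; the rest is the prompt.
  const match = /^---\n([\s\S]*?)\n---\n?([\s\S]*)$/.exec(text);
  if (match === null) throw new Error('State file has no front matter block');
  const fields: Record<string, string | number | boolean> = {};
  for (const line of (match[1] ?? '').split('\n')) {
    const sep = line.indexOf(':'); // split on the first colon only (timestamps contain more)
    if (sep === -1) continue;
    const key = line.slice(0, sep).trim();
    const raw = line.slice(sep + 1).trim();
    if (raw === 'true' || raw === 'false') fields[key] = raw === 'true';
    else if (/^-?\d+$/.test(raw)) fields[key] = Number(raw);
    else fields[key] = raw.replace(/^"(.*)"$/, '$1'); // strip surrounding double quotes
  }
  return { fields, prompt: (match[2] ?? '').trim() };
}
```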
+#### Feedback Mechanism
+**Key insight**: The feedback is NOT output-to-input piping. Instead:
+- The prompt stays the same every iteration
+- The LLM's work **persists in files on disk**
+- Each iteration, the LLM **reads its own prior work** from the filesystem
+- This creates a self-referential improvement loop via file-system state
+The stop hook outputs a JSON `block` decision:
+```json
+{
+  "decision": "block",
+  "reason": "<the original prompt text>",
+  "systemMessage": "🔄 Ralph iteration 5 | To stop: output <promise>DONE</promise>"
+}
+```
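The shape of that decision can be captured in a small typed builder. This is a sketch only: `BlockDecision` and `buildBlockDecision` are hypothetical names, and the field set is taken from the JSON example above rather than from any hook schema.

```typescript
interface BlockDecision {
  decision: 'block';
  reason: string;
  systemMessage: string;
}

function buildBlockDecision(prompt: string, nextIteration: number, promise: string): BlockDecision {
  return {
    decision: 'block',
    reason: prompt, // the original prompt is fed back unchanged
    systemMessage: `🔄 Ralph iteration ${nextIteration} | To stop: output <promise>${promise}</promise>`,
  };
}
```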
+#### Adaptation for sofIA
+For sofIA's Develop phase, we need to **internalize** the Ralph loop rather than using external bash hooks. Key differences from the canonical pattern:
+| Aspect | Canonical (Claude Code) | sofIA Adaptation |
+|--------|------------------------|-------------------|
+| Loop mechanism | Bash `while true` / Stop hook | TypeScript `while` loop in `ralphLoop.ts` |
+| Feedback | File system persistence | File system + structured `PocIteration` in session |
+| Prompt | Fixed markdown file | Dynamic, enriched with test failure context |
+| Termination | Promise tag + max iterations | Tests passing + max iterations + user abort |
+| State | `.claude/ralph-loop.local.md` | `WorkshopSession.poc.iterations[]` |
+**Critical enhancement**: Unlike the canonical Ralph loop where the prompt never changes, sofIA's adaptation should **inject test failure output** into subsequent prompts. This is closer to how the `skill-creator` plugin's `run_loop.py` works — evaluation results from iteration N feed into the improvement prompt for iteration N+1.
+### Decision
+Implement a **modified Ralph loop** in `src/develop/ralphLoop.ts` with these iteration steps:
+1. **Generate/refine code** — Send prompt + test failures to LLM, write output files
+2. **Run tests** — Execute test runner, capture structured results
+3. **Evaluate termination** — Check: tests pass? max iterations? user abort? stuck detection?
+4. **Record iteration** — Persist `PocIteration` to session
+5. **Loop or exit** — Feed failures as context into next iteration, or finalize
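A minimal sketch of these five steps, assuming injected dependencies so the loop can run without an LLM or a real test runner. All names here (`modifiedRalph`, `LoopDeps`, `IterationRecord`, `buildPrompt`) are hypothetical and are not taken from `ralphLoop.ts`; stuck detection is omitted to keep the sketch short.

```typescript
interface TestSummary {
  passed: number;
  failed: number;
  failures: { name: string; message: string }[];
}

interface IterationRecord {
  iteration: number;
  outcome: 'tests-passing' | 'failing';
  failed: number;
}

// Hypothetical dependencies, injected for testability.
interface LoopDeps {
  generate: (prompt: string) => Promise<void>; // asks the LLM and writes files to disk
  runTests: () => Promise<TestSummary>;
  shouldAbort: () => boolean; // user abort signal
}

// Appends the previous iteration's failures to the base prompt, if there were any.
function buildPrompt(base: string, last: TestSummary | null): string {
  if (last === null || last.failures.length === 0) return base;
  const details = last.failures.map((f) => `- ${f.name}: ${f.message}`).join('\n');
  return `${base}\n\nThe previous iteration left these tests failing:\n${details}`;
}

async function modifiedRalph(
  base: string,
  maxIterations: number,
  deps: LoopDeps,
): Promise<IterationRecord[]> {
  const history: IterationRecord[] = [];
  let last: TestSummary | null = null;
  for (let i = 1; i <= maxIterations && !deps.shouldAbort(); i++) {
    await deps.generate(buildPrompt(base, last)); // 1. generate/refine code
    last = await deps.runTests(); // 2. run tests
    const passing = last.failed === 0; // 3. evaluate termination
    history.push({ iteration: i, outcome: passing ? 'tests-passing' : 'failing', failed: last.failed }); // 4. record
    if (passing) break; // 5. exit, or loop with failures as context
  }
  return history;
}
```

The first iteration sees only the base prompt; later iterations see it plus the failure list, which is the difference from the canonical fixed-prompt loop.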
+### Rationale
+- Internalizing the loop (vs. external bash) gives us structured state tracking, session persistence, and the ability to enrich prompts with failure context
+- Adding test-failure injection improves convergence speed vs. plain prompt repetition
+- Keeping the `max_iterations` safety net and adding stuck-detection (same failures N times) prevents infinite loops
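One way the stuck-detection check could look, as a hedged sketch: compare the set of failing test names across the last N iterations. The default threshold of 3 and the names `failureSignature` and `isStuck` are assumptions; the text above only says "same failures N times".

```typescript
// Order-independent fingerprint of the failing test names.
function failureSignature(failingTests: string[]): string {
  return [...failingTests].sort().join('\n');
}

// True when the last `threshold` iterations all failed with the same non-empty set of tests.
// `history` holds one array of failing test names per iteration, oldest first.
function isStuck(history: string[][], threshold = 3): boolean {
  if (threshold < 1 || history.length < threshold) return false;
  const recent = history.slice(-threshold).map(failureSignature);
  const first = recent[0];
  return first !== undefined && first !== '' && recent.every((sig) => sig === first);
}
```

Comparing names rather than messages is a deliberate simplification here: messages often contain line numbers or values that change between iterations even when no real progress is made.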
+### Alternatives Considered
+1. **External bash loop wrapping `sofiacli`** — Rejected: loses session integration, no structured state, platform-specific
+2. **Pure prompt repetition (canonical Ralph)** — Rejected: slower convergence without failure context injection
+3. **LangGraph-style state machine** — Rejected: over-engineered for this use case, adds a heavy dependency
+---
+## Topic 2: Test Runner Invocation from Node.js
+### Findings
+#### Approach: `child_process.spawn` with JSON reporter
+```typescript
+import { spawn } from 'node:child_process';
+interface TestResult {
+  passed: number;
+  failed: number;
+  skipped: number;
+  duration: number;
+  failures: TestFailure[];
+}
+interface TestFailure {
+  name: string;
+  message: string;
+  stack?: string;
+}
+async function runTests(cwd: string, timeout = 60_000): Promise<TestResult> {
+  return new Promise((resolve, reject) => {
+    const child = spawn('npx', ['vitest', 'run', '--reporter=json'], {
+      cwd,
+      stdio: ['ignore', 'pipe', 'pipe'],
+      timeout,
+      env: { ...process.env, CI: '1', NO_COLOR: '1' },
+    });
+    const stdoutChunks: Buffer[] = [];
+    const stderrChunks: Buffer[] = [];
+    child.stdout.on('data', (chunk) => stdoutChunks.push(chunk));
+    child.stderr.on('data', (chunk) => stderrChunks.push(chunk));
+    child.on('close', (code) => {
+      const stdout = Buffer.concat(stdoutChunks).toString();
+      const stderr = Buffer.concat(stderrChunks).toString();
+      try {
+        const json = JSON.parse(stdout);
+        resolve(parseVitestJson(json));
+      } catch {
+        // Fallback: parse exit code
+        resolve({
+          passed: code === 0 ? 1 : 0,
+          failed: code === 0 ? 0 : 1,
+          skipped: 0,
+          duration: 0,
+          failures: code !== 0
+            ? [{ name: 'unknown', message: stderr || stdout }]
+            : [],
+        });
+      }
+    });
+    child.on('error', (err) => {
+      reject(new Error(`Test runner failed to start: ${err.message}`));
+    });
+  });
+}
+```
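The block above calls `parseVitestJson` without defining it. The following is one possible sketch, not the project's implementation. It assumes the Jest-compatible report shape: `numPassedTests`, `numFailedTests`, and `testResults[]` are named in this document, while `numPendingTests`, `assertionResults`, `fullName`, `failureMessages`, `startTime`, and `endTime` are assumed from that format and should be checked against the installed Vitest version.

```typescript
interface TestFailure {
  name: string;
  message: string;
  stack?: string;
}

interface TestResult {
  passed: number;
  failed: number;
  skipped: number;
  duration: number;
  failures: TestFailure[];
}

function asNumber(value: unknown): number {
  return typeof value === 'number' && Number.isFinite(value) ? value : 0;
}

// Defensive mapping: any missing or mistyped field degrades to 0 / empty instead of throwing.
function parseVitestJson(json: unknown): TestResult {
  const report = (typeof json === 'object' && json !== null ? json : {}) as Record<string, unknown>;
  const rawFiles = report['testResults'];
  const files: unknown[] = Array.isArray(rawFiles) ? rawFiles : [];
  const failures: TestFailure[] = [];
  let duration = 0;
  for (const file of files) {
    const f = (file ?? {}) as Record<string, unknown>;
    // Assumption: duration is the sum of per-file wall-clock times in milliseconds.
    duration += Math.max(0, asNumber(f['endTime']) - asNumber(f['startTime']));
    const rawAssertions = f['assertionResults'];
    const assertions: unknown[] = Array.isArray(rawAssertions) ? rawAssertions : [];
    for (const assertion of assertions) {
      const a = (assertion ?? {}) as Record<string, unknown>;
      if (a['status'] !== 'failed') continue;
      const fullName = a['fullName'];
      const rawMessages = a['failureMessages'];
      const messages: string[] = Array.isArray(rawMessages) ? rawMessages.map(String) : [];
      failures.push({
        name: typeof fullName === 'string' ? fullName : 'unknown',
        message: messages.join('\n'),
      });
    }
  }
  return {
    passed: asNumber(report['numPassedTests']),
    failed: asNumber(report['numFailedTests']),
    skipped: asNumber(report['numPendingTests']),
    duration,
    failures,
  };
}
```

Parsing defensively keeps the caller's `try`/`catch` fallback reserved for output that is not JSON at all, rather than for reports that are merely missing a field.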
+#### spawn vs exec
+| Factor | `spawn` | `exec` |
+|--------|---------|--------|
+| Buffer limit | **No limit** (streams) | 1MB default `maxBuffer` |
+| Streaming | Yes, output can be processed in real time | No, waits for completion |
+| Timeout | Built-in `timeout` option | Built-in `timeout` option |
+| Signal handling | Direct `child.kill()` | Same via returned child |
+| **Verdict** | **Preferred** | Acceptable for small output |
+Use `spawn` because test output can be large (especially with failure stacks).
+#### JSON Reporters by Test Runner
+| Runner | JSON Flag | Output |
+|--------|-----------|--------|
+| **Vitest** | `--reporter=json` | `{ numPassedTests, numFailedTests, testResults[] }` |
+| **Jest** | `--json` | Same format (Vitest's JSON reporter emits Jest-compatible output) |
+| **Node test runner** | `--test-reporter=tap` | TAP output (parse with `tap-parser`) |
+| **TAP** | Various | Use `tap-parser` npm package to parse |
+**Recommendation**: Use Vitest JSON reporter since the project already uses Vitest. Fall back to exit-code parsing if JSON fails.
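The `parseVitestJson` helper called from `runTests` is not shown in this section. The following is a minimal sketch of what it might look like, assuming a Jest-style report. Only `numPassedTests`, `numFailedTests`, and `testResults` come from the table above; `numPendingTests`, `assertionResults`, `failureMessages`, `fullName`, `startTime`, and `endTime` are assumptions that should be checked against the Vitest version in use.

```typescript
interface TestFailure { name: string; message: string; stack?: string }
interface TestResult {
  passed: number;
  failed: number;
  skipped: number;
  duration: number; // milliseconds
  failures: TestFailure[];
}

// Assumed report shape; every field is optional so malformed input degrades gracefully.
interface JestStyleAssertion {
  fullName?: string;
  title?: string;
  status?: string;
  failureMessages?: string[];
}
interface JestStyleFile {
  startTime?: number;
  endTime?: number;
  assertionResults?: JestStyleAssertion[];
}
interface JestStyleReport {
  numPassedTests?: number;
  numFailedTests?: number;
  numPendingTests?: number;
  testResults?: JestStyleFile[];
}

function parseVitestJson(report: JestStyleReport): TestResult {
  const failures: TestFailure[] = [];
  let duration = 0;
  for (const file of report.testResults ?? []) {
    // Sum per-file wall time when both timestamps are present
    if (typeof file.startTime === 'number' && typeof file.endTime === 'number') {
      duration += Math.max(0, file.endTime - file.startTime);
    }
    for (const assertion of file.assertionResults ?? []) {
      if (assertion.status !== 'failed') continue;
      const text = (assertion.failureMessages ?? []).join('\n');
      const [firstLine = ''] = text.split('\n');
      const failure: TestFailure = {
        name: assertion.fullName ?? assertion.title ?? 'unknown',
        message: firstLine,
      };
      if (text) failure.stack = text; // keep the full text for the prompt context
      failures.push(failure);
    }
  }
  return {
    passed: report.numPassedTests ?? 0,
    failed: report.numFailedTests ?? failures.length,
    skipped: report.numPendingTests ?? 0,
    duration,
    failures,
  };
}
```

Treating every field as optional means a truncated or unexpected report yields zeroed counts instead of throwing, so the exit-code fallback only has to cover output that is not JSON at all.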
+#### Timeout Handling
+```typescript
+const timeout = 60_000;
+const child = spawn('npx', ['vitest', 'run', '--reporter=json'], {
+  cwd,
+  timeout,                // Send killSignal after 60s
+  killSignal: 'SIGTERM',  // Graceful first
+});
+// Belt-and-suspenders: hard kill after grace period.
+// `child.killed` only reports that a signal was sent, so check for an actual exit instead.
+const hardKill = setTimeout(() => {
+  if (child.exitCode === null && child.signalCode === null) child.kill('SIGKILL');
+}, timeout + 5_000);
+child.on('close', () => clearTimeout(hardKill));
+```
+#### Environment Variables
+Set these to prevent interactive/hanging behavior:
+```typescript
+env: {
+  ...process.env,
+  CI: '1',           // Disable watch mode, interactive prompts
+  NO_COLOR: '1',     // Clean output for parsing
+  FORCE_COLOR: '0',  // Redundant safety
+}
+```
+### Decision
+Use `child_process.spawn` with Vitest's `--reporter=json` flag. Capture stdout/stderr separately. Apply a configurable timeout (default 60s) with belt-and-suspenders hard kill. Parse JSON output into a `TestResult` object; fall back to exit-code parsing on malformed output.
+### Rationale
+- `spawn` handles arbitrarily large output without buffering issues
+- JSON reporter gives structured results without regex parsing
+- Vitest is already the project's test runner, so the JSON format is well-understood
+- Separate stdout/stderr capture allows clean JSON parsing even when warnings appear on stderr
+### Alternatives Considered
+1. **`exec` with `maxBuffer`** — Rejected: risk of truncation on large test output
+2. **TAP protocol** — Rejected: requires additional parser dependency; Vitest's JSON is sufficient
+3. **Vitest Node API** — Rejected: tightly couples to Vitest version; `spawn` is runner-agnostic
+4. **`node:test` built-in runner** — Rejected: less mature, fewer features than Vitest for this use case
+---
+## Topic 3: GitHub MCP Repo Creation
+### Findings
+The GitHub MCP server at `https://api.githubcopilot.com/mcp/` provides tools via the Model Context Protocol. Based on the MCP standard and GitHub's documentation, the available tools include:
+#### Available Tools (relevant subset)
+| Tool | Description |
+|------|-------------|
+| `create_repository` | Create a new GitHub repository |
+| `create_or_update_file` | Create or update a single file in a repo |
+| `push_files` | Push multiple files in a single commit |
+| `create_branch` | Create a new branch |
+| `create_pull_request` | Open a PR |
+| `search_repositories` | Search existing repos |
+| `get_file_contents` | Read file from repo |
+| `list_branches` | List branches |
+#### Tool Calling Pattern via Copilot SDK
+The Copilot SDK routes MCP tool calls automatically when MCP servers are configured. The flow is:
+```
+ConversationSession.send(prompt)
+  → SDK resolves MCP servers from config
+  → LLM decides to call a tool (e.g., create_repository)
+  → SDK routes to GitHub MCP server
+  → Server executes against GitHub API
+  → Result returned as ToolResult event
+```
+In sofIA's architecture, MCP tools are invoked **indirectly** — the LLM decides which tools to call based on the system prompt. The `developPocPrompt` would instruct the LLM to:
+1. Check if a repo already exists (or use local fallback)
+2. Create the repo with `create_repository`
+3. Push scaffold files with `push_files` or `create_or_update_file`
+4. Create a branch for the PoC work
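As an illustration of the four steps above, here is a minimal sketch of how they might be rendered into prompt text. The helper name `buildRepoInstructions` and its options are hypothetical and not part of sofIA or the Copilot SDK; only the tool names are taken from the table above.

```typescript
// Hypothetical helper: renders the repo-creation steps into system-prompt text.
interface RepoPromptOptions {
  repoName: string;         // assumed naming scheme, e.g. sofia-poc-<sessionId>
  githubAvailable: boolean; // result of an availability check
}

function buildRepoInstructions(opts: RepoPromptOptions): string {
  if (!opts.githubAvailable) {
    // Local fallback: no MCP tool names are mentioned, so the model cannot try to call them
    return 'GitHub MCP tools are unavailable. Scaffold the PoC on the local filesystem instead.';
  }
  const steps = [
    `Check whether a repository named "${opts.repoName}" already exists (search_repositories).`,
    'If it does not exist, create it with create_repository.',
    'Push the scaffold files with push_files, or create_or_update_file for a single file.',
    'Create a branch for the PoC work with create_branch.',
  ];
  return steps.map((step, i) => `${i + 1}. ${step}`).join('\n');
}
```

Keeping the instructions in one function makes the prompt text unit-testable even though the tool calls themselves remain LLM-mediated.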
+#### Direct MCP Invocation (Alternative)
+For more deterministic control, sofIA could call MCP tools directly without going through the LLM:
+```typescript
+// Hypothetical direct MCP tool call via SDK
+// The Copilot SDK's CopilotSession may expose tool invocation
+const result = await sdkSession.invokeTool('create_repository', {
+  name: `sofia-poc-${sessionId}`,
+  description: 'PoC generated by sofIA workshop',
+  private: true,
+  auto_init: true,
+});
+```
+However, the current `@github/copilot-sdk` API uses `sendAndWait`, which routes through the LLM. Direct tool invocation would require using the MCP protocol directly (e.g., `@modelcontextprotocol/sdk`).
+#### Availability Detection
+```typescript
+// Check if GitHub MCP is available before attempting repo creation
+const mcpManager = new McpManager(config);
+const githubAvailable = mcpManager.isAvailable('github');
+if (!githubAvailable) {
+  // Fall back to local scaffolding (D-003)
+  return scaffoldLocally(session, pocDir);
+}
+```
+### Decision
+Use **LLM-mediated MCP tool calls** for GitHub repo creation (the LLM decides when/how to call GitHub MCP tools based on the develop prompt). Add explicit availability detection via `McpManager.isAvailable('github')` to enable graceful fallback to local scaffolding. Do NOT attempt direct MCP protocol calls — keep the architecture aligned with how the Copilot SDK works.
+### Rationale
+- The Copilot SDK already handles MCP routing; adding a parallel MCP client adds complexity
+- LLM-mediated calls allow the model to adapt to errors (e.g., repo already exists, permission denied)
+- Graceful fallback to local scaffolding (D-003) ensures the feature works without GitHub MCP
+- The `McpManager` already has the detection infrastructure
+### Alternatives Considered
+1. **Direct MCP protocol client** (`@modelcontextprotocol/sdk`) — Rejected: adds a dependency, duplicates SDK functionality, and the control flow becomes harder to test
+2. **GitHub REST API directly** — Rejected: requires separate auth, loses MCP abstraction, doesn't benefit from SDK's tool routing
+3. **GitHub CLI (`gh repo create`)** — Rejected: requires `gh` installed, additional auth setup, not composable
+---
+## Topic 4: Local Filesystem PoC Scaffolding
+### Findings
+#### File Tree Generation Pattern
+Recommended approach: **Programmatic generation from in-memory template descriptors**, not template engines.
+```typescript
+interface ScaffoldFile {
+  relativePath: string;
+  content: string | ((ctx: ScaffoldContext) => string);
+}
+interface ScaffoldContext {
+  projectName: string;
+  sessionId: string;
+  description: string;
+  techStack: string;
+  architectureNotes?: string;
+}
+const SCAFFOLD_FILES: ScaffoldFile[] = [
+  {
+    relativePath: 'package.json',
+    content: (ctx) => JSON.stringify({
+      name: ctx.projectName,
+      version: '0.1.0',
+      scripts: { test: 'vitest run', build: 'tsc' },
+    }, null, 2),
+  },
+  {
+    relativePath: 'README.md',
+    content: (ctx) => `# ${ctx.projectName}\n\n${ctx.description}\n`,
+  },
+  {
+    relativePath: 'tsconfig.json',
+    content: JSON.stringify({
+      compilerOptions: { target: 'ES2022', module: 'nodenext', strict: true, outDir: 'dist' },
+      include: ['src'],
+    }, null, 2),
+  },
+  {
+    relativePath: 'src/index.ts',
+    content: '// Entry point — generated by sofIA\n',
+  },
+  {
+    relativePath: 'tests/smoke.test.ts',
+    content: (ctx) => `import { describe, it, expect } from 'vitest';\n\ndescribe('${ctx.projectName}', () => {\n  it('should be truthy', () => {\n    expect(true).toBe(true);\n  });\n});\n`,
+  },
+];
+```
+#### Idempotency Strategy
+```typescript
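+import { access, mkdir, writeFile } from 'node:fs/promises';
+import { dirname, join } from 'node:path';
+async function scaffold(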
+  outputDir: string,
+  files: ScaffoldFile[],
+  ctx: ScaffoldContext,
+  options: { overwrite?: boolean } = {},
+): Promise<string[]> {
+  const written: string[] = [];
+  await mkdir(outputDir, { recursive: true });
+  for (const file of files) {
+    const fullPath = join(outputDir, file.relativePath);
+    const dir = dirname(fullPath);
+    await mkdir(dir, { recursive: true });
+    // Idempotency: skip existing files unless overwrite is true
+    if (!options.overwrite) {
+      try {
+        await access(fullPath);
+        continue; // File exists, skip
+      } catch {
+        // File doesn't exist, proceed
+      }
+    }
+    const content = typeof file.content === 'function'
+      ? file.content(ctx)
+      : file.content;
+    await writeFile(fullPath, content, 'utf-8');
+    written.push(file.relativePath);
+  }
+  return written;
+}
+```
+#### Platform-Safe Path Handling
+```typescript
+import { join, resolve, normalize, sep } from 'node:path';
+// ALWAYS use path.join(), never string concatenation
+const pocDir = join('.', 'poc', sessionId);  // ✅
+// const pocDir = `./poc/${sessionId}`;       // ❌ hard-codes the path separator
+// Normalize user-provided paths
+const safePath = normalize(userPath);
+// Prevent path traversal
+function isSafePath(base: string, target: string): boolean {
+  const resolvedBase = resolve(base);
+  const resolvedTarget = resolve(base, target);
+  // Compare against base + separator so a sibling like '/base-evil' is not accepted for '/base'
+  return resolvedTarget === resolvedBase || resolvedTarget.startsWith(resolvedBase + sep);
+}
+```
+### Decision
+Use **programmatic generation from typed template descriptors** (no template engine dependency). Implement idempotency via "skip existing files unless `--overwrite`" semantics. Use `node:path` functions exclusively for all path operations. Output directory: `./poc/<sessionId>/`.
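The skip-existing semantics can also be expressed as a pure planning step that is testable without touching the disk. The sketch below is illustrative only: `planScaffold` and `PlannedWrite` are hypothetical names, and `ScaffoldContext` is trimmed down to two fields for brevity.

```typescript
interface ScaffoldContext { projectName: string; description: string }
interface ScaffoldFile {
  relativePath: string;
  content: string | ((ctx: ScaffoldContext) => string);
}
interface PlannedWrite { relativePath: string; content: string }

// Pure planning step: decides what would be written, without any filesystem access.
function planScaffold(
  files: ScaffoldFile[],
  ctx: ScaffoldContext,
  existing: ReadonlySet<string>, // relative paths already present in the output directory
  overwrite = false,
): PlannedWrite[] {
  return files
    .filter((file) => overwrite || !existing.has(file.relativePath))
    .map((file) => ({
      relativePath: file.relativePath,
      content: typeof file.content === 'function' ? file.content(ctx) : file.content,
    }));
}

const files: ScaffoldFile[] = [
  { relativePath: 'README.md', content: (ctx) => `# ${ctx.projectName}\n\n${ctx.description}\n` },
  { relativePath: 'src/index.ts', content: '// Entry point\n' },
];
const ctx: ScaffoldContext = { projectName: 'demo-poc', description: 'Example PoC' };

const firstRun = planScaffold(files, ctx, new Set());
const secondRun = planScaffold(files, ctx, new Set(['README.md', 'src/index.ts']));
console.log(firstRun.length, secondRun.length); // 2 0
```

Running the plan twice against the same directory yields an empty second plan, which is the idempotency property the decision relies on.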
+### Rationale
+- Template descriptors are fully typed, testable, and don't require a runtime parser
+- Skip-existing-files idempotency is simpler and safer than diff-and-merge
+- `node:path` handles platform differences automatically
+- Keeping scaffolds as code (not files on disk) avoids packaging/distribution issues
+### Alternatives Considered
+1. **Template engine (Handlebars, EJS)** — Rejected: adds dependency, requires template files to ship, harder to type-check
+2. **Yeoman/Plop generators** — Rejected: heavy dependencies, CLI-centric design doesn't compose well
+3. **Copy directory tree from `templates/`** — Rejected: requires shipping template files, variable substitution still needed
+4. **Git clone template repo** — Partially viable for GitHub MCP path but adds network dependency
+---
+## Topic 5: Autonomous Loop vs Interactive Loop
+### Findings
+The current `ConversationLoop` class is fundamentally **interactive**:
+- It calls `this.io.readInput()` in a `while` loop waiting for user text
+- It uses `DecisionGate` to ask the user what to do next
+- It checks for `done` / empty input to break
+An autonomous Ralph loop needs to:
+- Supply its own "input" (the prompt + test failure context)
+- Never block waiting for user input
+- Terminate based on programmatic conditions (tests passing, max iterations)
+- Still produce streaming output for visibility
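To make the first requirement concrete, here is a minimal sketch of how the loop might build its own input from the previous test run. `buildIterationPrompt` and its parameters are hypothetical; the `TestResult` shape mirrors the one used by the test runner in Topic 2.

```typescript
interface TestFailure { name: string; message: string; stack?: string }
interface TestResult {
  passed: number;
  failed: number;
  skipped: number;
  duration: number;
  failures: TestFailure[];
}

// Hypothetical helper: turns the last test run into the next autonomous "input".
function buildIterationPrompt(basePrompt: string, result: TestResult, maxFailures = 5): string {
  if (result.failed === 0) return basePrompt;
  // Cap the number of listed failures so the prompt stays bounded
  const shown = result.failures.slice(0, maxFailures);
  const lines = shown.map((f, i) => `${i + 1}. ${f.name}: ${f.message}`);
  const omitted = result.failures.length - shown.length;
  if (omitted > 0) lines.push(`(${omitted} more failures omitted)`);
  return [
    basePrompt,
    '',
    `The previous iteration left ${result.failed} failing test(s). Fix them without breaking passing tests:`,
    ...lines,
  ].join('\n');
}
```

Capping the listed failures is one way to keep the injected context small; the cap of five is an arbitrary default for illustration.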
+#### Architecture Options Analysis
+##### Option A: Subclass ConversationLoop
+```typescript
+class AutonomousLoop extends ConversationLoop {
+  override async run(): Promise<WorkshopSession> {
+    // Override the main loop behavior
+  }
+}
+```
+**Pros**: Reuses streaming/rendering code
+**Cons**: `ConversationLoop.run()` is monolithic; overriding it means reimplementing most of the logic. Fragile inheritance.
+##### Option B: New standalone `RalphLoop` class
+```typescript
+class RalphLoop {
+  constructor(private options: RalphLoopOptions) {}
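+  async run(): Promise<PocDevelopmentState> {
+    // Loop state; `prompt` and `maxIterations` are assumed to come from this.options
+    const { prompt, maxIterations } = this.options;
+    let iteration = 0;
+    let testsPass = false;
+    let failures: TestResult['failures'] = [];
+    while (iteration < maxIterations && !testsPass) {
+      const code = await this.generate(prompt, failures);
+      await this.writeFiles(code);
+      const results = await this.runTests();
+      failures = results.failures;
+      testsPass = results.failed === 0; // without this update the loop always runs to maxIterations
+      iteration++;
+    }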
+  }
+}
+```
+**Pros**: Clean separation of concerns; purpose-built for the autonomous case
+**Cons**: Duplicates streaming/rendering logic from ConversationLoop
+##### Option C: Parameterize ConversationLoop with a "driver"
+```typescript
+interface LoopDriver {
+  getNextInput(session: WorkshopSession, lastResponse: string): Promise<string | null>;
+  shouldContinue(session: WorkshopSession): boolean;
+}
+class InteractiveDriver implements LoopDriver {
+  async getNextInput() { return this.io.readInput(); }
+  shouldContinue() { return true; } // User controls via "done"
+}
+class AutonomousDriver implements LoopDriver {
+  async getNextInput(session, lastResponse) {
+    const testResults = await this.runTests(session.poc.repoPath);
+    if (testResults.allPassing) return null; // Signal done
+    return formatFailurePrompt(testResults);
+  }
+  shouldContinue(session) {
+    return session.poc.iterations.length < this.maxIterations;
+  }
+}
+```
+**Pros**: Follows the Open/Closed principle (after a one-time refactor, new loop behaviors are added as drivers without further changes to ConversationLoop); drivers are easy to test independently
+**Cons**: ConversationLoop needs refactoring to accept a driver; the streaming and turn-management code becomes shared
+##### Option D: Compose ConversationLoop as inner component
+```typescript
+class RalphLoop {
+  async run(): Promise<PocDevelopmentState> {
+    for (let i = 0; i < maxIterations; i++) {
+      // Use ConversationLoop for a single LLM turn
+      const loop = new ConversationLoop({
+        client: this.client,
+        io: this.createAutoIO(prompt),
+        session: this.session,
+        phaseHandler: this.handler,
+        initialMessage: prompt,
+      });
+      this.session = await loop.run();
+      // Run tests
+      const results = await this.runTests();
+      if (results.allPassing) break;
+      prompt = enrichPromptWithFailures(prompt, results);
+    }
+  }
+  private createAutoIO(prompt: string): LoopIO {
+    return {
+      write: (text) => this.outputHandler(text),
+      writeActivity: (text) => this.outputHandler(text),
+      readInput: async () => null,  // Immediately signal "done"
+      showDecisionGate: async () => ({ choice: 'continue' }),
+      isJsonMode: false,
+      isTTY: false,
+    };
+  }
+}
+```
+**Pros**: Reuses ConversationLoop's streaming exactly; no modification to existing code; each LLM turn is isolated
+**Cons**: Creates a new ConversationLoop per iteration (minor overhead); ConversationLoop does more than needed per call (signal handlers, etc.)
+### Decision
+**Option D: Compose ConversationLoop as inner component** for the initial implementation, with a path to evolve toward Option C.
+The `RalphLoop` class is the outer orchestrator. For each iteration, it creates a `ConversationLoop` with an auto-completing `LoopIO` (returns `null` from `readInput` immediately after the initial message is sent) and uses `initialMessage` to inject the prompt. This approach:
+1. Reuses all existing streaming/rendering infrastructure
+2. Requires zero changes to `ConversationLoop`
+3. Each iteration is isolated (clean session state handoff)
+4. The auto-completing `LoopIO` is trivially testable
+The `RalphLoop.run()` method owns the outer iteration, test execution, and termination logic.
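The outer iteration and termination logic can be sketched independently of `ConversationLoop` by injecting the two operations it depends on. This is a minimal sketch under stated assumptions: `runOuterLoop`, `OuterLoopDeps`, `runTurn`, and the reduced `TestResult` shape are hypothetical stand-ins, not the project's actual API.

```typescript
interface TestResult {
  passed: number;
  failed: number;
  failures: { name: string; message: string }[];
}
type TerminationReason = 'tests-passing' | 'max-iterations';

interface OuterLoopDeps {
  // Stand-in for one ConversationLoop run with the auto-completing LoopIO
  runTurn: (prompt: string) => Promise<void>;
  // Stand-in for the spawn-based test runner
  runTests: () => Promise<TestResult>;
}

async function runOuterLoop(
  basePrompt: string,
  maxIterations: number,
  deps: OuterLoopDeps,
): Promise<{ iterations: number; reason: TerminationReason }> {
  let prompt = basePrompt;
  for (let i = 1; i <= maxIterations; i++) {
    await deps.runTurn(prompt);
    const results = await deps.runTests();
    if (results.failed === 0) return { iterations: i, reason: 'tests-passing' };
    // Feed the failures back into the next turn
    const summary = results.failures.map((f) => `- ${f.name}: ${f.message}`).join('\n');
    prompt = `${basePrompt}\n\nFailing tests from iteration ${i}:\n${summary}`;
  }
  return { iterations: maxIterations, reason: 'max-iterations' };
}
```

Because both dependencies are plain async functions, the termination logic can be exercised with stubs and no LLM or child process.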
+### Rationale
+- Minimizes risk: `ConversationLoop` is battle-tested and unchanged
+- The `LoopIO` mock pattern is simple: `readInput: async () => null`
+- Each iteration gets a fresh LLM session, which keeps earlier turns from accumulating and reduces the risk of context window overflow
+- The composition pattern naturally supports the spec's requirement for multiple iteration records
+### Alternatives Considered
+See Options A–C above. Option C (driver pattern) is the best long-term architecture but requires refactoring `ConversationLoop.run()`, which is out of scope for feature 002's initial implementation.
+---
+## Topic 6: PocDevelopmentState Schema Extensions
+### Findings
+The current schema is minimal:
+```typescript
+// Current (from session.ts)
+export const pocIterationSchema = z.object({
+  iteration: z.number(),
+  startedAt: z.string(),
+  endedAt: z.string().optional(),
+  changesSummary: z.string().optional(),
+  testsRun: z.array(z.string()).optional(),
+});
+export const pocDevelopmentStateSchema = z.object({
+  repoPath: z.string().optional(),
+  iterations: z.array(pocIterationSchema),
+  finalStatus: z.enum(['success', 'failed']).optional(),
+});
+```
+This is insufficient for a working Ralph loop. The following extensions are needed:
+#### Per-Iteration Extensions
+```typescript
+export const testResultSchema = z.object({
+  passed: z.number(),
+  failed: z.number(),
+  skipped: z.number(),
+  duration: z.number(),             // milliseconds
+  failures: z.array(z.object({
+    testName: z.string(),
+    message: z.string(),
+    stack: z.string().optional(),
+  })),
+});
+export const pocIterationSchema = z.object({
+  iteration: z.number(),
+  startedAt: z.string(),             // ISO-8601
+  endedAt: z.string().optional(),    // ISO-8601
+  changesSummary: z.string().optional(),
+  // NEW: Structured test results
+  testResults: testResultSchema.optional(),
+  // NEW: Files touched in this iteration
+  filesChanged: z.array(z.string()).optional(),  // relative paths
+  // NEW: Prompt context tracking (for audit)
+  promptTokensUsed: z.number().optional(),
+  responseTokensUsed: z.number().optional(),
+  // NEW: Iteration outcome classification
+  outcome: z.enum([
+    'tests-passing',     // All tests pass — can terminate
+    'tests-improving',   // Fewer failures than previous iteration
+    'tests-regressing',  // More failures than previous iteration
+    'tests-stuck',       // Same failures as previous iteration
+    'error',             // Runtime error (test runner crash, timeout)
+  ]).optional(),
+  // DEPRECATED: replaced by testResults
+  testsRun: z.array(z.string()).optional(),
+});
+```
+#### Overall State Extensions
+```typescript
+export const pocDevelopmentStateSchema = z.object({
+  repoPath: z.string().optional(),       // local path or GitHub URL
+  iterations: z.array(pocIterationSchema),
+  finalStatus: z.enum(['success', 'failed', 'partial', 'aborted']).optional(),
+  // NEW: Technology context
+  techStack: z.string().optional(),         // e.g., "Node.js + TypeScript + Express"
+  templateUsed: z.string().optional(),      // e.g., "node-ts-api"
+  // NEW: Timing
+  totalDuration: z.number().optional(),     // total ms across all iterations
+  // NEW: Configuration used
+  maxIterations: z.number().optional(),     // configured limit
+  testCommand: z.string().optional(),       // e.g., "npm test"
+  // NEW: Source tracking
+  repoSource: z.enum(['github-mcp', 'local', 'existing']).optional(),
+  // NEW: Termination reason
+  terminationReason: z.enum([
+    'tests-passing',
+    'max-iterations',
+    'user-abort',
+    'stuck-detected',       // same failures for N consecutive iterations
+    'error',
+  ]).optional(),
+  // NEW: Summary for export/audit
+  finalTestResults: testResultSchema.optional(),
+});
+```
+#### Audit Trail Compliance
+The schema supports audit requirements through:
+1. **Per-iteration `testResults`** — exact pass/fail counts and failure messages recorded
+2. **`outcome` classification** — machine-readable iteration assessment
+3. **`filesChanged`** — what was modified (without storing full diffs, which could be large)
+4. **`terminationReason`** — why the loop stopped
+5. **Token usage** — cost tracking per iteration
+6. **Timestamps** — `startedAt`/`endedAt` on each iteration plus `totalDuration`
+What we deliberately **exclude** from the schema (stored elsewhere or not at all):
+- Full file contents (too large for JSON state; stored on disk)
+- Full LLM conversation history (already in `turns[]`)
+- Secrets/tokens (security policy)
+### Decision
+Extend `PocDevelopmentState` and `PocIteration` as described above. Add the new `TestResult` schema. Expand `finalStatus` to include `'partial'` and `'aborted'`. Add `terminationReason`, `repoSource`, `techStack`, `templateUsed`, `totalDuration`, `maxIterations`, `testCommand`, and `finalTestResults` to the state. Add `testResults`, `filesChanged`, `promptTokensUsed`, `responseTokensUsed`, and `outcome` to iterations. Keep `testsRun` for backward compatibility but mark as deprecated.
+### Rationale
+- Structured `TestResult` enables the Ralph loop to programmatically compare iterations and detect stuck states
+- `outcome` classification enables the termination logic to be data-driven
+- `terminationReason` + `repoSource` satisfy D-005 auditability requirements
+- Token usage tracking enables cost monitoring for workshop facilitators
+- Backward compatibility with existing `testsRun` field prevents breaking existing sessions
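As an illustration of how the structured results could drive `outcome` and stuck detection, here is a minimal sketch. `classifyOutcome` and `isStuck` are hypothetical helpers, and comparing failure counts (not the identity of the failing tests) is a simplifying assumption; a stricter version could compare sets of `testName` values.

```typescript
type Outcome =
  | 'tests-passing'
  | 'tests-improving'
  | 'tests-regressing'
  | 'tests-stuck'
  | 'error';

// Only the failure count is needed for this simplified comparison.
interface TestCounts { failed: number }

function classifyOutcome(
  current: TestCounts | undefined,
  previous?: TestCounts,
): Outcome | undefined {
  if (!current) return 'error';                 // runner crashed or timed out
  if (current.failed === 0) return 'tests-passing';
  if (!previous) return undefined;              // no baseline yet; the schema field is optional
  if (current.failed < previous.failed) return 'tests-improving';
  if (current.failed > previous.failed) return 'tests-regressing';
  return 'tests-stuck';
}

// True when the last `threshold` iterations were all classified as stuck.
function isStuck(outcomes: ReadonlyArray<Outcome | undefined>, threshold: number): boolean {
  if (threshold <= 0 || outcomes.length < threshold) return false;
  return outcomes.slice(-threshold).every((o) => o === 'tests-stuck');
}
```

With these two functions, a `'stuck-detected'` termination reduces to calling `isStuck` on the recorded iteration outcomes with the configured N.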
+### Alternatives Considered
+1. **Minimal extension (just add `testResults`)** — Rejected: insufficient for termination logic and audit trail
+2. **Separate `RalphLoopState` schema** — Rejected: the PoC state and Ralph loop state are the same thing; splitting adds indirection
+3. **Store full diffs per iteration** — Rejected: too large for JSON session files; incompatible with the lightweight session model
+---
+## Summary of Decisions
+| # | Topic | Decision |
+|---|-------|----------|
+| 1 | Ralph Loop Pattern | Modified Ralph loop with test-failure injection; internal TypeScript loop, not external bash |
+| 2 | Test Runner | `spawn` + Vitest `--reporter=json` + 60s timeout + belt-and-suspenders kill |
+| 3 | GitHub MCP | LLM-mediated MCP tool calls with `McpManager` availability detection; local fallback |
+| 4 | Local Scaffolding | Programmatic typed template descriptors; skip-existing idempotency; `node:path` for safety |
+| 5 | Loop Architecture | Compose: `RalphLoop` owns iteration, uses `ConversationLoop` per turn with auto-completing IO |
+| 6 | Schema Extensions | Full extension of `PocDevelopmentState` + `PocIteration` + new `TestResult` schema |