npm - sofia-cli - Versions diffs - 0.1.1 - Mend

sofia-cli 0.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (435)

package/.github/agents/copilot-instructions.md +39 -0
package/.github/agents/speckit.analyze.agent.md +184 -0
package/.github/agents/speckit.checklist.agent.md +294 -0
package/.github/agents/speckit.clarify.agent.md +181 -0
package/.github/agents/speckit.constitution.agent.md +84 -0
package/.github/agents/speckit.implement.agent.md +135 -0
package/.github/agents/speckit.plan.agent.md +90 -0
package/.github/agents/speckit.specify.agent.md +258 -0
package/.github/agents/speckit.tasks.agent.md +137 -0
package/.github/agents/speckit.taskstoissues.agent.md +30 -0
package/.github/copilot-instructions.md +257 -0
package/.github/prompts/speckit.analyze.prompt.md +3 -0
package/.github/prompts/speckit.checklist.prompt.md +3 -0
package/.github/prompts/speckit.clarify.prompt.md +3 -0
package/.github/prompts/speckit.constitution.prompt.md +3 -0
package/.github/prompts/speckit.implement.prompt.md +3 -0
package/.github/prompts/speckit.plan.prompt.md +3 -0
package/.github/prompts/speckit.specify.prompt.md +3 -0
package/.github/prompts/speckit.tasks.prompt.md +3 -0
package/.github/prompts/speckit.taskstoissues.prompt.md +3 -0
package/.github/workflows/ci.yml +38 -0
package/.prettierrc +6 -0
package/.specify/memory/constitution.md +181 -0
package/.specify/scripts/bash/check-prerequisites.sh +166 -0
package/.specify/scripts/bash/common.sh +156 -0
package/.specify/scripts/bash/create-new-feature.sh +297 -0
package/.specify/scripts/bash/setup-plan.sh +61 -0
package/.specify/scripts/bash/update-agent-context.sh +810 -0
package/.specify/templates/agent-file-template.md +28 -0
package/.specify/templates/checklist-template.md +40 -0
package/.specify/templates/constitution-template.md +50 -0
package/.specify/templates/plan-template.md +113 -0
package/.specify/templates/spec-template.md +115 -0
package/.specify/templates/tasks-template.md +251 -0
package/.vscode/mcp.json +42 -0
package/.vscode/settings.json +19 -0
package/CODE_OF_CONDUCT.md +128 -0
package/LICENSE +21 -0
package/README.md +213 -0
package/dist/src/cli/developCommand.js +240 -0
package/dist/src/cli/directCommands.js +143 -0
package/dist/src/cli/envLoader.js +16 -0
package/dist/src/cli/exportCommand.js +53 -0
package/dist/src/cli/index.js +203 -0
package/dist/src/cli/ioContext.js +109 -0
package/dist/src/cli/preflight.js +57 -0
package/dist/src/cli/statusCommand.js +110 -0
package/dist/src/cli/workshopCommand.js +400 -0
package/dist/src/develop/checkpointState.js +86 -0
package/dist/src/develop/codeGenerator.js +319 -0
package/dist/src/develop/dynamicScaffolder.js +226 -0
package/dist/src/develop/githubMcpAdapter.js +122 -0
package/dist/src/develop/index.js +15 -0
package/dist/src/develop/mcpContextEnricher.js +195 -0
package/dist/src/develop/pocScaffolder.js +542 -0
package/dist/src/develop/ralphLoop.js +659 -0
package/dist/src/develop/templateRegistry.js +364 -0
package/dist/src/develop/testRunner.js +202 -0
package/dist/src/logging/logger.js +58 -0
package/dist/src/loop/conversationLoop.js +227 -0
package/dist/src/loop/phaseSummarizer.js +87 -0
package/dist/src/mcp/mcpManager.js +267 -0
package/dist/src/mcp/mcpTransport.js +391 -0
package/dist/src/mcp/retryPolicy.js +47 -0
package/dist/src/mcp/webSearch.js +254 -0
package/dist/src/phases/contextSummarizer.js +101 -0
package/dist/src/phases/discoveryEnricher.js +156 -0
package/dist/src/phases/phaseExtractors.js +222 -0
package/dist/src/phases/phaseHandlers.js +328 -0
package/dist/src/prompts/design.md +51 -0
package/dist/src/prompts/develop-boundary.md +51 -0
package/dist/src/prompts/develop.md +111 -0
package/dist/src/prompts/discover.md +58 -0
package/dist/src/prompts/ideate.md +56 -0
package/dist/src/prompts/plan.md +51 -0
package/dist/src/prompts/promptLoader.js +167 -0
package/dist/src/prompts/promptLoader.ts +198 -0
package/dist/src/prompts/select.md +47 -0
package/dist/src/prompts/summarize/README.md +8 -0
package/dist/src/prompts/summarize/design-summary.md +37 -0
package/dist/src/prompts/summarize/develop-summary.md +25 -0
package/dist/src/prompts/summarize/ideate-summary.md +27 -0
package/dist/src/prompts/summarize/plan-summary.md +27 -0
package/dist/src/prompts/summarize/select-summary.md +21 -0
package/dist/src/prompts/system.md +28 -0
package/dist/src/sessions/exportPaths.js +22 -0
package/dist/src/sessions/exportWriter.js +406 -0
package/dist/src/sessions/sessionManager.js +81 -0
package/dist/src/sessions/sessionStore.js +65 -0
package/dist/src/shared/activitySpinner.js +91 -0
package/dist/src/shared/copilotClient.js +129 -0
package/dist/src/shared/data/cards.json +1249 -0
package/dist/src/shared/data/cardsLoader.js +51 -0
package/dist/src/shared/errorClassifier.js +120 -0
package/dist/src/shared/events.js +28 -0
package/dist/src/shared/markdownRenderer.js +34 -0
package/dist/src/shared/schemas/session.js +265 -0
package/dist/src/shared/tableRenderer.js +20 -0
package/dist/src/vendor/chalk.js +2 -0
package/dist/src/vendor/cli-table3.js +3 -0
package/dist/src/vendor/commander.js +2 -0
package/dist/src/vendor/marked-terminal.js +3 -0
package/dist/src/vendor/marked.js +2 -0
package/dist/src/vendor/ora.js +2 -0
package/dist/src/vendor/pino.js +2 -0
package/dist/src/vendor/zod.js +2 -0
package/dist/tests/e2e/developE2e.spec.js +126 -0
package/dist/tests/e2e/developFailureE2e.spec.js +247 -0
package/dist/tests/e2e/developPty.spec.js +75 -0
package/dist/tests/e2e/discoveryWebSearchRelevance.spec.js +84 -0
package/dist/tests/e2e/harness.spec.js +83 -0
package/dist/tests/e2e/mcpLive.spec.js +120 -0
package/dist/tests/e2e/newSession.e2e.spec.js +177 -0
package/dist/tests/e2e/ralphLoopEnrichmentComparison.spec.js +62 -0
package/dist/tests/e2e/workiqEnrichment.spec.js +56 -0
package/dist/tests/e2e/zavaSimulation.spec.js +452 -0
package/dist/tests/fixtures/test-fixture-project/src/add.js +3 -0
package/dist/tests/fixtures/test-fixture-project/tests/failing.test.js +6 -0
package/dist/tests/fixtures/test-fixture-project/tests/hanging.test.js +8 -0
package/dist/tests/fixtures/test-fixture-project/tests/passing.test.js +10 -0
package/dist/tests/fixtures/test-fixture-project/vitest.config.js +6 -0
package/dist/tests/integration/autoStartConversation.spec.js +138 -0
package/dist/tests/integration/defaultCommand.spec.js +147 -0
package/dist/tests/integration/directCommandNonTty.spec.js +224 -0
package/dist/tests/integration/directCommandTty.spec.js +151 -0
package/dist/tests/integration/discoveryEnrichmentFlow.spec.js +175 -0
package/dist/tests/integration/exportArtifacts.spec.js +202 -0
package/dist/tests/integration/exportFallbackFlow.spec.js +99 -0
package/dist/tests/integration/mcpDegradationFlow.spec.js +190 -0
package/dist/tests/integration/mcpTransportFlow.spec.js +139 -0
package/dist/tests/integration/newSessionFlow.spec.js +343 -0
package/dist/tests/integration/pocGithubMcp.spec.js +186 -0
package/dist/tests/integration/pocLocalFallback.spec.js +171 -0
package/dist/tests/integration/pocScaffold.spec.js +163 -0
package/dist/tests/integration/ralphLoopFlow.spec.js +359 -0
package/dist/tests/integration/ralphLoopPartial.spec.js +368 -0
package/dist/tests/integration/resumeAndBacktrack.spec.js +247 -0
package/dist/tests/integration/spinnerLifecycle.spec.js +220 -0
package/dist/tests/integration/summarizationFlow.spec.js +115 -0
package/dist/tests/integration/testRunnerReal.spec.js +52 -0
package/dist/tests/integration/webSearchAgent.spec.js +128 -0
package/dist/tests/live/copilotSdkLive.spec.js +107 -0
package/dist/tests/live/zavaFullWorkshop.spec.js +392 -0
package/dist/tests/setup/loadEnv.js +3 -0
package/dist/tests/unit/cli/developCommand.spec.js +567 -0
package/dist/tests/unit/cli/directCommands.spec.js +279 -0
package/dist/tests/unit/cli/envLoader.spec.js +58 -0
package/dist/tests/unit/cli/ioContext.spec.js +119 -0
package/dist/tests/unit/cli/preflight.spec.js +108 -0
package/dist/tests/unit/cli/statusCommand.spec.js +111 -0
package/dist/tests/unit/cli/workshopClientFallback.spec.js +80 -0
package/dist/tests/unit/cli/workshopCommand.spec.js +329 -0
package/dist/tests/unit/config/vitestEnvSetup.spec.js +13 -0
package/dist/tests/unit/develop/checkpointState.spec.js +315 -0
package/dist/tests/unit/develop/codeGenerator.spec.js +355 -0
package/dist/tests/unit/develop/githubMcpAdapter.spec.js +231 -0
package/dist/tests/unit/develop/mcpContextEnricher.spec.js +433 -0
package/dist/tests/unit/develop/outputValidator.spec.js +119 -0
package/dist/tests/unit/develop/pocScaffolder.spec.js +353 -0
package/dist/tests/unit/develop/ralphLoop.spec.js +1248 -0
package/dist/tests/unit/develop/templateRegistry.spec.js +85 -0
package/dist/tests/unit/develop/testRunner.spec.js +249 -0
package/dist/tests/unit/infraBicep.spec.js +92 -0
package/dist/tests/unit/infraDeploy.spec.js +82 -0
package/dist/tests/unit/infraTeardown.spec.js +63 -0
package/dist/tests/unit/logging/logger.spec.js +43 -0
package/dist/tests/unit/loop/conversationLoop.spec.js +592 -0
package/dist/tests/unit/loop/phaseSummarizer.spec.js +141 -0
package/dist/tests/unit/loop/streamingMarkdown.spec.js +147 -0
package/dist/tests/unit/mcp/mcpManager.spec.js +279 -0
package/dist/tests/unit/mcp/mcpTransport.spec.js +529 -0
package/dist/tests/unit/mcp/retryPolicy.spec.js +218 -0
package/dist/tests/unit/mcp/timeoutValidation.spec.js +46 -0
package/dist/tests/unit/mcp/webSearch.spec.js +567 -0
package/dist/tests/unit/phases/contextSummarizer.spec.js +140 -0
package/dist/tests/unit/phases/discoveryEnricher.repeatCalls.spec.js +93 -0
package/dist/tests/unit/phases/discoveryEnricher.spec.js +411 -0
package/dist/tests/unit/phases/phaseExtractors.spec.js +352 -0
package/dist/tests/unit/phases/phaseHandlers.spec.js +425 -0
package/dist/tests/unit/prompts/promptLoader.spec.js +118 -0
package/dist/tests/unit/schemas/pocSchemas.spec.js +412 -0
package/dist/tests/unit/schemas/session.spec.js +257 -0
package/dist/tests/unit/sessions/exportPaths.spec.js +31 -0
package/dist/tests/unit/sessions/exportWriter.spec.js +655 -0
package/dist/tests/unit/sessions/sessionManager.spec.js +151 -0
package/dist/tests/unit/sessions/sessionStore.spec.js +116 -0
package/dist/tests/unit/shared/activitySpinner.spec.js +175 -0
package/dist/tests/unit/shared/cardsLoader.spec.js +76 -0
package/dist/tests/unit/shared/copilotClient.spec.js +155 -0
package/dist/tests/unit/shared/errorClassifier.spec.js +131 -0
package/dist/tests/unit/shared/events.spec.js +55 -0
package/dist/tests/unit/shared/markdownRenderer.spec.js +35 -0
package/dist/tests/unit/shared/markdownRendererChunks.spec.js +70 -0
package/dist/tests/unit/shared/tableRenderer.spec.js +34 -0
package/dist/vitest.config.js +14 -0
package/dist/vitest.live.config.js +18 -0
package/docs/README.md +35 -0
package/docs/architecture.md +169 -0
package/docs/cli-usage.md +207 -0
package/docs/environment.md +66 -0
package/docs/export-format.md +146 -0
package/docs/session-model.md +113 -0
package/eslint.config.js +35 -0
package/infra/deploy.sh +193 -0
package/infra/gather-env.sh +211 -0
package/infra/main.bicep +90 -0
package/infra/main.bicepparam +18 -0
package/infra/resources.bicep +134 -0
package/infra/teardown.sh +114 -0
package/package.json +63 -0
package/specs/001-cli-workshop-rebuild/checklists/requirements.md +35 -0
package/specs/001-cli-workshop-rebuild/contracts/cli.md +59 -0
package/specs/001-cli-workshop-rebuild/contracts/export-summary-json.md +23 -0
package/specs/001-cli-workshop-rebuild/contracts/session-json.md +30 -0
package/specs/001-cli-workshop-rebuild/data-model.md +210 -0
package/specs/001-cli-workshop-rebuild/plan.md +361 -0
package/specs/001-cli-workshop-rebuild/quickstart.md +83 -0
package/specs/001-cli-workshop-rebuild/research.md +116 -0
package/specs/001-cli-workshop-rebuild/spec.md +240 -0
package/specs/001-cli-workshop-rebuild/tasks.md +476 -0
package/specs/002-poc-generation/contracts/poc-output.md +172 -0
package/specs/002-poc-generation/contracts/ralph-loop.md +113 -0
package/specs/002-poc-generation/data-model.md +172 -0
package/specs/002-poc-generation/plan.md +109 -0
package/specs/002-poc-generation/quickstart.md +97 -0
package/specs/002-poc-generation/research.md +786 -0
package/specs/002-poc-generation/spec.md +81 -0
package/specs/002-poc-generation/tasks-fix.md +198 -0
package/specs/002-poc-generation/tasks.md +252 -0
package/specs/003-mcp-transport-integration/checklists/requirements.md +37 -0
package/specs/003-mcp-transport-integration/contracts/context-enricher.md +220 -0
package/specs/003-mcp-transport-integration/contracts/discovery-enricher.md +267 -0
package/specs/003-mcp-transport-integration/contracts/github-adapter.md +149 -0
package/specs/003-mcp-transport-integration/contracts/mcp-transport.md +288 -0
package/specs/003-mcp-transport-integration/data-model.md +326 -0
package/specs/003-mcp-transport-integration/plan.md +114 -0
package/specs/003-mcp-transport-integration/quickstart.md +311 -0
package/specs/003-mcp-transport-integration/research.md +395 -0
package/specs/003-mcp-transport-integration/spec.md +234 -0
package/specs/003-mcp-transport-integration/tasks.md +324 -0
package/specs/003-next-spec-gaps.md +150 -0
package/specs/004-dev-resume-hardening/checklists/requirements.md +37 -0
package/specs/004-dev-resume-hardening/contracts/cli.md +160 -0
package/specs/004-dev-resume-hardening/data-model.md +321 -0
package/specs/004-dev-resume-hardening/plan.md +107 -0
package/specs/004-dev-resume-hardening/quickstart.md +115 -0
package/specs/004-dev-resume-hardening/research.md +142 -0
package/specs/004-dev-resume-hardening/spec.md +221 -0
package/specs/004-dev-resume-hardening/tasks.md +333 -0
package/specs/005-ai-search-deploy/checklists/requirements.md +39 -0
package/specs/005-ai-search-deploy/contracts/web-search-tool.md +241 -0
package/specs/005-ai-search-deploy/data-model.md +130 -0
package/specs/005-ai-search-deploy/plan.md +93 -0
package/specs/005-ai-search-deploy/quickstart.md +96 -0
package/specs/005-ai-search-deploy/research.md +187 -0
package/specs/005-ai-search-deploy/spec.md +143 -0
package/specs/005-ai-search-deploy/tasks.md +284 -0
package/specs/006-workshop-extraction-fixes/checklists/requirements.md +61 -0
package/specs/006-workshop-extraction-fixes/contracts/summarization-and-export.md +131 -0
package/specs/006-workshop-extraction-fixes/data-model.md +149 -0
package/specs/006-workshop-extraction-fixes/plan.md +123 -0
package/specs/006-workshop-extraction-fixes/quickstart.md +101 -0
package/specs/006-workshop-extraction-fixes/research.md +143 -0
package/specs/006-workshop-extraction-fixes/spec.md +210 -0
package/specs/006-workshop-extraction-fixes/tasks.md +316 -0
package/src/cli/developCommand.ts +308 -0
package/src/cli/directCommands.ts +195 -0
package/src/cli/envLoader.ts +17 -0
package/src/cli/exportCommand.ts +65 -0
package/src/cli/index.ts +249 -0
package/src/cli/ioContext.ts +139 -0
package/src/cli/preflight.ts +86 -0
package/src/cli/statusCommand.ts +118 -0
package/src/cli/workshopCommand.ts +496 -0
package/src/develop/checkpointState.ts +121 -0
package/src/develop/codeGenerator.ts +402 -0
package/src/develop/dynamicScaffolder.ts +284 -0
package/src/develop/githubMcpAdapter.ts +199 -0
package/src/develop/index.ts +34 -0
package/src/develop/mcpContextEnricher.ts +279 -0
package/src/develop/pocScaffolder.ts +646 -0
package/src/develop/ralphLoop.ts +1044 -0
package/src/develop/templateRegistry.ts +427 -0
package/src/develop/testRunner.ts +276 -0
package/src/logging/logger.ts +73 -0
package/src/loop/conversationLoop.ts +355 -0
package/src/loop/phaseSummarizer.ts +114 -0
package/src/mcp/mcpManager.ts +365 -0
package/src/mcp/mcpTransport.ts +562 -0
package/src/mcp/retryPolicy.ts +87 -0
package/src/mcp/webSearch.ts +388 -0
package/src/originalPrompts/design_thinking.md +178 -0
package/src/originalPrompts/design_thinking_persona.md +76 -0
package/src/originalPrompts/document_generator_example.md +77 -0
package/src/originalPrompts/document_generator_persona.md +47 -0
package/src/originalPrompts/facilitator_persona.md +125 -0
package/src/originalPrompts/guardrails.md +47 -0
package/src/phases/contextSummarizer.ts +154 -0
package/src/phases/discoveryEnricher.ts +223 -0
package/src/phases/phaseExtractors.ts +247 -0
package/src/phases/phaseHandlers.ts +450 -0
package/src/prompts/design.md +51 -0
package/src/prompts/develop-boundary.md +51 -0
package/src/prompts/develop.md +111 -0
package/src/prompts/discover.md +58 -0
package/src/prompts/ideate.md +56 -0
package/src/prompts/plan.md +51 -0
package/src/prompts/promptLoader.ts +198 -0
package/src/prompts/select.md +47 -0
package/src/prompts/summarize/README.md +8 -0
package/src/prompts/summarize/design-summary.md +37 -0
package/src/prompts/summarize/develop-summary.md +25 -0
package/src/prompts/summarize/ideate-summary.md +27 -0
package/src/prompts/summarize/plan-summary.md +27 -0
package/src/prompts/summarize/select-summary.md +21 -0
package/src/prompts/system.md +28 -0
package/src/sessions/exportPaths.ts +28 -0
package/src/sessions/exportWriter.ts +490 -0
package/src/sessions/sessionManager.ts +119 -0
package/src/sessions/sessionStore.ts +69 -0
package/src/shared/activitySpinner.ts +108 -0
package/src/shared/copilotClient.ts +291 -0
package/src/shared/data/cards.json +1249 -0
package/src/shared/data/cardsLoader.ts +70 -0
package/src/shared/errorClassifier.ts +160 -0
package/src/shared/events.ts +103 -0
package/src/shared/markdownRenderer.ts +44 -0
package/src/shared/schemas/session.ts +346 -0
package/src/shared/tableRenderer.ts +28 -0
package/src/types/marked-terminal.d.ts +5 -0
package/src/vendor/chalk.ts +2 -0
package/src/vendor/cli-table3.ts +3 -0
package/src/vendor/commander.ts +2 -0
package/src/vendor/marked-terminal.ts +3 -0
package/src/vendor/marked.ts +2 -0
package/src/vendor/ora.ts +2 -0
package/src/vendor/pino.ts +3 -0
package/src/vendor/zod.ts +3 -0
package/tests/e2e/developE2e.spec.ts +152 -0
package/tests/e2e/developFailureE2e.spec.ts +289 -0
package/tests/e2e/developPty.spec.ts +86 -0
package/tests/e2e/discoveryWebSearchRelevance.spec.ts +103 -0
package/tests/e2e/harness.spec.ts +104 -0
package/tests/e2e/mcpLive.spec.ts +149 -0
package/tests/e2e/newSession.e2e.spec.ts +245 -0
package/tests/e2e/ralphLoopEnrichmentComparison.spec.ts +70 -0
package/tests/e2e/workiqEnrichment.spec.ts +72 -0
package/tests/e2e/zava-assessment/agent-interaction-script.md +258 -0
package/tests/e2e/zava-assessment/company-profile.md +98 -0
package/tests/e2e/zava-assessment/expected-results-checklist.md +454 -0
package/tests/e2e/zavaSimulation.spec.ts +511 -0
package/tests/fixtures/completedSession.json +141 -0
package/tests/fixtures/test-fixture-project/package-lock.json +1585 -0
package/tests/fixtures/test-fixture-project/package.json +12 -0
package/tests/fixtures/test-fixture-project/src/add.ts +3 -0
package/tests/fixtures/test-fixture-project/tests/failing.test.ts +7 -0
package/tests/fixtures/test-fixture-project/tests/hanging.test.ts +9 -0
package/tests/fixtures/test-fixture-project/tests/passing.test.ts +13 -0
package/tests/fixtures/test-fixture-project/vitest.config.ts +7 -0
package/tests/integration/autoStartConversation.spec.ts +168 -0
package/tests/integration/defaultCommand.spec.ts +179 -0
package/tests/integration/directCommandNonTty.spec.ts +260 -0
package/tests/integration/directCommandTty.spec.ts +185 -0
package/tests/integration/discoveryEnrichmentFlow.spec.ts +209 -0
package/tests/integration/exportArtifacts.spec.ts +232 -0
package/tests/integration/exportFallbackFlow.spec.ts +115 -0
package/tests/integration/mcpDegradationFlow.spec.ts +231 -0
package/tests/integration/mcpTransportFlow.spec.ts +178 -0
package/tests/integration/newSessionFlow.spec.ts +406 -0
package/tests/integration/pocGithubMcp.spec.ts +224 -0
package/tests/integration/pocLocalFallback.spec.ts +205 -0
package/tests/integration/pocScaffold.spec.ts +220 -0
package/tests/integration/ralphLoopFlow.spec.ts +430 -0
package/tests/integration/ralphLoopPartial.spec.ts +416 -0
package/tests/integration/resumeAndBacktrack.spec.ts +278 -0
package/tests/integration/spinnerLifecycle.spec.ts +270 -0
package/tests/integration/summarizationFlow.spec.ts +135 -0
package/tests/integration/testRunnerReal.spec.ts +63 -0
package/tests/integration/webSearchAgent.spec.ts +155 -0
package/tests/live/copilotSdkLive.spec.ts +149 -0
package/tests/live/zavaFullWorkshop.spec.ts +515 -0
package/tests/setup/loadEnv.ts +5 -0
package/tests/unit/cli/developCommand.spec.ts +679 -0
package/tests/unit/cli/directCommands.spec.ts +325 -0
package/tests/unit/cli/envLoader.spec.ts +73 -0
package/tests/unit/cli/ioContext.spec.ts +148 -0
package/tests/unit/cli/preflight.spec.ts +125 -0
package/tests/unit/cli/statusCommand.spec.ts +134 -0
package/tests/unit/cli/workshopClientFallback.spec.ts +100 -0
package/tests/unit/cli/workshopCommand.spec.ts +378 -0
package/tests/unit/config/vitestEnvSetup.spec.ts +24 -0
package/tests/unit/develop/checkpointState.spec.ts +378 -0
package/tests/unit/develop/codeGenerator.spec.ts +447 -0
package/tests/unit/develop/githubMcpAdapter.spec.ts +283 -0
package/tests/unit/develop/mcpContextEnricher.spec.ts +564 -0
package/tests/unit/develop/outputValidator.spec.ts +134 -0
package/tests/unit/develop/pocScaffolder.spec.ts +451 -0
package/tests/unit/develop/ralphLoop.spec.ts +1439 -0
package/tests/unit/develop/templateRegistry.spec.ts +106 -0
package/tests/unit/develop/testRunner.spec.ts +294 -0
package/tests/unit/infraBicep.spec.ts +116 -0
package/tests/unit/infraDeploy.spec.ts +102 -0
package/tests/unit/infraTeardown.spec.ts +77 -0
package/tests/unit/logging/logger.spec.ts +50 -0
package/tests/unit/loop/conversationLoop.spec.ts +719 -0
package/tests/unit/loop/phaseSummarizer.spec.ts +169 -0
package/tests/unit/loop/streamingMarkdown.spec.ts +180 -0
package/tests/unit/mcp/mcpManager.spec.ts +336 -0
package/tests/unit/mcp/mcpTransport.spec.ts +689 -0
package/tests/unit/mcp/retryPolicy.spec.ts +278 -0
package/tests/unit/mcp/timeoutValidation.spec.ts +55 -0
package/tests/unit/mcp/webSearch.spec.ts +718 -0
package/tests/unit/phases/contextSummarizer.spec.ts +158 -0
package/tests/unit/phases/discoveryEnricher.repeatCalls.spec.ts +125 -0
package/tests/unit/phases/discoveryEnricher.spec.ts +512 -0
package/tests/unit/phases/phaseExtractors.spec.ts +406 -0
package/tests/unit/phases/phaseHandlers.spec.ts +483 -0
package/tests/unit/prompts/promptLoader.spec.ts +144 -0
package/tests/unit/schemas/pocSchemas.spec.ts +457 -0
package/tests/unit/schemas/session.spec.ts +328 -0
package/tests/unit/sessions/exportPaths.spec.ts +38 -0
package/tests/unit/sessions/exportWriter.spec.ts +737 -0
package/tests/unit/sessions/sessionManager.spec.ts +174 -0
package/tests/unit/sessions/sessionStore.spec.ts +136 -0
package/tests/unit/shared/activitySpinner.spec.ts +211 -0
package/tests/unit/shared/cardsLoader.spec.ts +89 -0
package/tests/unit/shared/copilotClient.spec.ts +185 -0
package/tests/unit/shared/errorClassifier.spec.ts +152 -0
package/tests/unit/shared/events.spec.ts +71 -0
package/tests/unit/shared/markdownRenderer.spec.ts +42 -0
package/tests/unit/shared/markdownRendererChunks.spec.ts +83 -0
package/tests/unit/shared/tableRenderer.spec.ts +38 -0
package/tsconfig.json +20 -0
package/vitest.config.ts +15 -0
package/vitest.live.config.ts +19 -0

package/specs/004-dev-resume-hardening/tasks.md ADDED

@@ -0,0 +1,333 @@
+# Tasks: Dev Resume & Hardening
+**Input**: Design documents from `/specs/004-dev-resume-hardening/`
+**Prerequisites**: plan.md (required), spec.md (required for user stories), research.md, data-model.md, contracts/
+**Tests**: Tests are REQUIRED for new behavior in this repository (Red → Green → Review). Include test tasks for each user story and write them first.
+**Organization**: Tasks are grouped by user story to enable independent implementation and testing of each story.
+## Format: `[ID] [P?] [Story] Description`
+- **[P]**: Can run in parallel (different files, no dependencies)
+- **[Story]**: Which user story this task belongs to (e.g., US1, US2, US3)
+- Include exact file paths in descriptions
+## Phase 1: Setup (Shared Infrastructure)
+**Purpose**: Project initialization, shared types, and test infrastructure used by multiple stories
+- [x] T001 Create `CheckpointState` interface and `deriveCheckpointState()` function in `src/develop/checkpointState.ts` per data-model.md derivation logic
+- [x] T002 [P] Create `TemplateEntry` interface and `TemplateRegistry` type in `src/develop/templateRegistry.ts` (types only — no template content yet)
+- [x] T003 [P] Create test fixture project in `tests/fixtures/test-fixture-project/` with `package.json`, `vitest.config.ts`, `src/add.ts`, `tests/passing.test.ts`, `tests/failing.test.ts`, and `tests/hanging.test.ts` per data-model.md TestFixtureProject spec
+- [x] T004 Run `npm install` in `tests/fixtures/test-fixture-project/` and add `tests/fixtures/test-fixture-project/node_modules` to `.gitignore`
+---
+## Phase 2: Foundational (Blocking Prerequisites)
+**Purpose**: Core changes shared across multiple user stories — MUST complete before story work
+**⚠️ CRITICAL**: No user story work can begin until this phase is complete
+- [x] T005 Make `extractJson()` and `buildErrorResult()` methods `protected` in `src/develop/testRunner.ts` (like `parseOutput` already is) to enable subclass testing
+- [x] T006 Add `testCommand` optional parameter to `TestRunnerOptions` interface in `src/develop/testRunner.ts` and use it in `spawnTests()` instead of hardcoded `npm test -- --reporter=json`
+- [x] T007 Extract `NODE_TS_VITEST_TEMPLATE` from `src/develop/pocScaffolder.ts` into `src/develop/templateRegistry.ts` as the first `TemplateEntry`, including `techStack`, `installCommand`, `testCommand`, and `matchPatterns`
+- [x] T008 Add `selectTemplate()` function to `src/develop/templateRegistry.ts` implementing first-match-wins logic per contracts/cli.md Template Selection rules
+- [x] T009 Add `PYTHON_PYTEST_TEMPLATE` entry to `src/develop/templateRegistry.ts` with files (`.gitignore`, `requirements.txt`, `pytest.ini`, `README.md`, `src/__init__.py`, `src/main.py`, `tests/test_main.py`, `.sofia-metadata.json`), `techStack`, `installCommand`, `testCommand`, and `matchPatterns` per data-model.md TemplateEntry table
+- [x] T010 Update `PocScaffolder.buildContext()` in `src/develop/pocScaffolder.ts` to accept an optional `TemplateEntry` parameter and use its `techStack` instead of the hardcoded default
+- [x] T011 Update `PocScaffolder` constructor in `src/develop/pocScaffolder.ts` to accept `TemplateEntry` (using `entry.files`) instead of raw `TemplateFile[]`, preserving backward compatibility
+**Checkpoint**: Foundation ready — shared types, test fixtures, and registry exist. User story implementation can begin.
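Reviewer note: the first-match-wins rule that T008 assigns to `selectTemplate()` can be sketched as below. This is a minimal illustration, not the package's code. Only the field names `files`, `techStack`, `installCommand`, `testCommand`, and `matchPatterns` come from the task list. The `id` field, the use of `RegExp` for patterns, the command strings, the explicit fallback argument, and the helper name `pickTemplate` are all assumptions.

```typescript
// Hypothetical shapes; the real interfaces live in src/develop/templateRegistry.ts
// and are not shown in this diff.
interface TemplateFile {
  path: string;
  content: string;
}

interface TemplateEntry {
  id: string; // assumed identifier
  files: TemplateFile[];
  techStack: string[]; // assumed to be a list of labels
  installCommand: string; // placeholder values below
  testCommand: string; // placeholder values below
  matchPatterns: RegExp[]; // assumed to be regular expressions
}

const pythonPytest: TemplateEntry = {
  id: "python-pytest",
  files: [],
  techStack: ["python", "pytest"],
  installCommand: "pip install -r requirements.txt",
  testCommand: "pytest",
  matchPatterns: [/python/i, /pytest/i],
};

const nodeTsVitest: TemplateEntry = {
  id: "node-ts-vitest",
  files: [],
  techStack: ["typescript", "vitest"],
  installCommand: "npm install",
  testCommand: "npm test -- --reporter=json",
  matchPatterns: [/typescript/i, /node/i],
};

// Registry order matters: the first entry with any matching pattern wins.
const registry: TemplateEntry[] = [pythonPytest, nodeTsVitest];

// Returns the first entry whose patterns match the notes, else the fallback.
function pickTemplate(
  architectureNotes: string,
  entries: TemplateEntry[],
  fallback: TemplateEntry,
): TemplateEntry {
  for (const entry of entries) {
    if (entry.matchPatterns.some((pattern) => pattern.test(architectureNotes))) {
      return entry;
    }
  }
  return fallback;
}
```

With this ordering, notes that mention both Python and TypeScript resolve to the Python entry, and notes that match nothing fall back to whatever default the caller passes in.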
+---
+## Phase 3: User Story 1 — Resume an Interrupted PoC Session (Priority: P1) 🎯 MVP
+**Goal**: Running `sofia dev --session X` on an interrupted session resumes from the last completed iteration, skipping the scaffold step but still re-running `npm install`.
+**Independent Test**: Interrupt after 2 iterations, re-run, verify iteration 3 starts without re-scaffolding.
+**FRs covered**: FR-001, FR-001a, FR-002, FR-003, FR-004, FR-005, FR-006, FR-007, FR-007a
+### Tests for User Story 1 (REQUIRED) ⚠️
+> **NOTE: Write these tests FIRST, ensure they FAIL before implementation**
+- [x] T012 [P] [US1] Unit test: `deriveCheckpointState` returns correct state for no-poc, completed, partial, interrupted sessions in `tests/unit/develop/checkpointState.spec.ts`
+- [x] T013 [P] [US1] Unit test: `RalphLoop.run()` seeds `iterations` from `session.poc.iterations` and starts from correct `iterNum` in `tests/unit/develop/ralphLoop.spec.ts` (add describe block "resume iteration seeding")
+- [x] T014 [P] [US1] Unit test: `RalphLoop.run()` skips scaffold when checkpoint says `canSkipScaffold=true` in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T015 [P] [US1] Unit test: `RalphLoop.run()` pops incomplete last iteration (no testResults) and re-runs it per FR-001a in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T016 [P] [US1] Unit test: `developCommand` exits with completion message when `poc.finalStatus === 'success'` per FR-005 in `tests/unit/cli/developCommand.spec.ts`
+- [x] T017 [P] [US1] Unit test: `developCommand` defaults to resume when `poc.finalStatus === 'failed'|'partial'` per FR-006 in `tests/unit/cli/developCommand.spec.ts`
+- [x] T018 [P] [US1] Unit test: resume re-scaffolds when output directory is missing but iterations exist per FR-007 in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T019 [US1] Integration test: full resume flow — create session with 2 completed iterations, run `RalphLoop`, verify starts at iteration 3 in `tests/integration/ralphLoopPartial.spec.ts` (add describe block "resume from interrupted session")
+- [x] T065 [P] [US1] Unit test: resume ALWAYS re-runs dependency install step even when scaffolding is skipped per FR-003 in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T066 [P] [US1] Unit test: resume includes prior iteration history in LLM prompt context (test results + applied changes summary) per FR-004 in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T067 [P] [US1] Unit test: resume decision logging emits info-level messages for iteration number, skip scaffold, incomplete-iteration rerun, and re-run install per FR-007a in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T068 [P] [US1] Unit test: corrupted/invalid `poc.iterations` causes safe fallback to fresh run (and warning log) per Edge Cases in `spec.md` in `tests/unit/develop/checkpointState.spec.ts`
+- [x] T069 [P] [US1] Unit test: output directory present but `.sofia-metadata.json` integrity mismatch triggers warning and forces re-scaffold (do not skip scaffold) per Edge Cases in `spec.md` in `tests/unit/develop/checkpointState.spec.ts`
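Reviewer note: the scaffold-skip decision that T069 and T071 describe (output directory present, `.sofia-metadata.json` present, and its `sessionId` matching the session) might look roughly like this. It is a sketch under assumptions: the input shape, the helper name `canSkipScaffoldSketch`, and the warning text are hypothetical, and the real `deriveCheckpointState()` may check more fields.

```typescript
// Hypothetical inputs; the real function presumably reads these from disk
// and from the session object.
interface ScaffoldCheckInputs {
  sessionId: string;
  outputDirExists: boolean;
  // Parsed .sofia-metadata.json, or undefined when missing or unreadable.
  metadata?: { sessionId?: string } | undefined;
}

// Returns true only when it looks safe to reuse the existing scaffold.
function canSkipScaffoldSketch(
  input: ScaffoldCheckInputs,
  warn: (message: string) => void,
): boolean {
  if (!input.outputDirExists) {
    return false; // nothing on disk to reuse
  }
  if (input.metadata === undefined) {
    return false; // directory exists but was not produced by a scaffold run
  }
  if (input.metadata.sessionId !== input.sessionId) {
    // Integrity mismatch: warn and force a re-scaffold.
    warn("metadata sessionId mismatch; re-scaffolding");
    return false;
  }
  return true;
}
```

Treating every doubtful case as "do not skip" keeps the failure mode cheap: the worst outcome is an unnecessary re-scaffold rather than resuming on top of another session's files.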
+### Implementation for User Story 1
+- [x] T020 [US1] Implement `deriveCheckpointState()` logic in `src/develop/checkpointState.ts` per data-model.md derivation rules
+- [x] T021 [US1] Update `developCommand()` in `src/cli/developCommand.ts` to call `deriveCheckpointState()` before creating RalphLoop and handle FR-005 (success exit) and FR-006 (failed/partial default resume)
+- [x] T022 [US1] Modify `RalphLoop.run()` in `src/develop/ralphLoop.ts` to seed `iterations` from `session.poc.iterations`, derive `iterNum = iterations.length + 1`, and pop incomplete last iteration per FR-001/FR-001a
+- [x] T023 [US1] Modify `RalphLoop.run()` in `src/develop/ralphLoop.ts` to skip scaffold when output dir + `.sofia-metadata.json` exist per FR-002, and always re-run install per FR-003
+- [x] T024 [US1] Modify `RalphLoop.run()` in `src/develop/ralphLoop.ts` to include prior iteration history in LLM prompt context per FR-004 (include prior test results and a concise summary of applied changes across prior iterations; not just last failing tests)
+- [x] T025 [US1] Modify `RalphLoop.run()` in `src/develop/ralphLoop.ts` to re-scaffold when output dir is missing but iterations exist per FR-007
+- [x] T026 [US1] Add info-level resume decision logging in `src/develop/ralphLoop.ts` and `src/cli/developCommand.ts` per FR-007a (iteration number, skip scaffold, re-run install, incomplete iteration re-run)
+- [x] T070 [US1] Harden `deriveCheckpointState()` in `src/develop/checkpointState.ts` to validate iteration entries (missing/invalid shapes) and safely fall back to fresh run + warning log per Edge Cases in `spec.md`
+- [x] T071 [US1] Extend `deriveCheckpointState()` in `src/develop/checkpointState.ts` to validate `.sofia-metadata.json` integrity (at minimum: sessionId match; if Phase 9 adds `templateId`, validate that too) and disable `canSkipScaffold` + warn if mismatch per Edge Cases in `spec.md`
+**Checkpoint**: Resume works end-to-end. `sofia dev --session X` resumes from correct iteration after interruption. All resume decisions are logged at info level.
+---
+## Phase 4: User Story 2 — Force-Restart a PoC Session (Priority: P1)
+**Goal**: `sofia dev --session X --force` deletes output directory AND resets `session.poc` state, starting completely fresh.
+**Independent Test**: Create output via `sofia dev`, rerun with `--force`, and verify that both the output directory and `poc.iterations` are reset.
+**FRs covered**: FR-008, FR-009, FR-010
+### Tests for User Story 2 (REQUIRED) ⚠️
+- [x] T027 [P] [US2] Unit test: `developCommand` with `--force` clears `session.poc` and calls `store.save()` before creating RalphLoop per FR-008 in `tests/unit/cli/developCommand.spec.ts`
+- [x] T028 [P] [US2] Unit test: `developCommand` with `--force` on a `poc.finalStatus === 'success'` session clears status and starts fresh per FR-010 in `tests/unit/cli/developCommand.spec.ts`
+- [x] T029 [P] [US2] Unit test: `developCommand` with `--force` on a session with no prior poc state behaves identically to first run in `tests/unit/cli/developCommand.spec.ts`
+- [x] T030 [US2] Integration test: force-restart flow — create session with iterations, run with `--force`, verify empty iterations and fresh scaffold in `tests/integration/ralphLoopFlow.spec.ts` (add describe block "force restart")
+### Implementation for User Story 2
+- [x] T031 [US2] Update `developCommand()` in `src/cli/developCommand.ts` to clear `session.poc = undefined` and call `store.save(session)` when `--force` is set per FR-008, before creating RalphLoop
+- [x] T032 [US2] Ensure `--force` path logs info-level message "Cleared existing output directory and session state (--force)" in `src/cli/developCommand.ts`
+**Checkpoint**: `--force` resets both output directory and session state. Works on any `finalStatus` value including `'success'`.
+---
+## Phase 5: User Story 3 — PoC Template Selection Based on Plan (Priority: P2)
+**Goal**: Scaffolder auto-selects template based on plan's `architectureNotes` — Python plan gets `python-pytest`, TypeScript plan gets `node-ts-vitest`.
+**Independent Test**: Session with Python/FastAPI plan generates Python project structure.
+**FRs covered**: FR-011, FR-012, FR-013, FR-014, FR-015
+### Tests for User Story 3 (REQUIRED) ⚠️
+- [x] T033 [P] [US3] Unit test: `selectTemplate()` returns `python-pytest` for plans mentioning "Python" or "FastAPI" in `tests/unit/develop/templateRegistry.spec.ts`
+- [x] T034 [P] [US3] Unit test: `selectTemplate()` returns `node-ts-vitest` for plans mentioning "TypeScript" or with no architecture notes in `tests/unit/develop/templateRegistry.spec.ts`
+- [x] T035 [P] [US3] Unit test: `selectTemplate()` returns default `node-ts-vitest` for ambiguous plans in `tests/unit/develop/templateRegistry.spec.ts`
+- [x] T036 [P] [US3] Unit test: `PocScaffolder` uses `TemplateEntry.files` when constructed with a template entry in `tests/unit/develop/pocScaffolder.spec.ts`
+- [x] T037 [US3] Integration test: scaffold with `python-pytest` template generates expected file structure (`requirements.txt`, `src/main.py`, `tests/test_main.py`) in `tests/integration/pocScaffold.spec.ts` (add describe block "python-pytest template")
+### Implementation for User Story 3
+- [x] T038 [US3] Wire template selection into `developCommand.ts`: call `selectTemplate(registry, plan.architectureNotes, plan.dependencies)` and pass result to `PocScaffolder` and `RalphLoop`
+- [x] T039 [US3] Update `RalphLoop` to use `TemplateEntry.installCommand` for dependency installation instead of hardcoded `npm install` in `src/develop/ralphLoop.ts`
+- [x] T040 [US3] Update `RalphLoop` to pass `TemplateEntry.testCommand` to `TestRunner` constructor in `src/develop/ralphLoop.ts`
+- [x] T041 [US3] Add info-level log "Selected template: {id} (matched '{pattern}' in architecture notes)" in `src/cli/developCommand.ts`
+**Checkpoint**: Python plans produce Python scaffold. TypeScript plans preserve current behavior. Adding a new template requires only a registry entry.
+---
+## Phase 6: User Story 4 — TestRunner Coverage Hardening (Priority: P2)
+**Goal**: `testRunner.ts` test coverage increases from 45% to 80%+ via real fixture-based integration tests.
+**Independent Test**: Run fixture-based tests covering spawn, parse, timeout, and malformed output.
+**FRs covered**: FR-016, FR-017, FR-018, FR-019
+### Tests for User Story 4 (REQUIRED) ⚠️
+> **NOTE**: These are the deliverable for this story — the tests themselves ARE the feature
+- [x] T042 [P] [US4] Integration test: `testRunner.run()` against fixture project with passing tests, verify correct pass/fail/skip counts in `tests/integration/testRunnerReal.spec.ts`
+- [x] T043 [P] [US4] Integration test: `testRunner.run()` against fixture project with failing tests, verify failure details parsed correctly in `tests/integration/testRunnerReal.spec.ts`
+- [x] T044 [US4] Integration test: `testRunner.run()` with short timeout against hanging test fixture, verify SIGTERM→SIGKILL and timeout error result per FR-016/FR-018 in `tests/integration/testRunnerReal.spec.ts`
+- [x] T045 [US4] Unit test: `extractJson()` fallback path (first-`{`-to-last-`}`) with mixed console+JSON output per FR-017 in `tests/unit/develop/testRunner.spec.ts` (use `TestableTestRunner` subclass)
+- [x] T046 [US4] Unit test: `extractJson()` returns null for output with no valid JSON per FR-017 in `tests/unit/develop/testRunner.spec.ts`
+- [x] T047 [US4] Unit test: `buildErrorResult()` produces correct zero-count result with error message per FR-018 in `tests/unit/develop/testRunner.spec.ts`
+### Implementation for User Story 4
+- [x] T048 [US4] If any coverage gaps remain after writing the above tests, add targeted unit tests to reach 80%+ coverage for `src/develop/testRunner.ts` (run `npm test -- --coverage` to verify)
+**Checkpoint**: `testRunner.ts` coverage is at or above 80%. All critical code paths (spawn, parse, timeout, fallback) have automated tests using real fixtures.
+---
+## Phase 7: User Story 5 — PTY-Based Interactive E2E Tests (Priority: P3)
+**Goal**: PTY-based E2E tests validate Ctrl+C handling, progress output, and clean exit behavior for `sofia dev`.
+**Independent Test**: Spawn `sofia dev` in PTY, send Ctrl+C, verify recovery message and exit code.
+**FRs covered**: (implicit quality requirement from spec)
+### Tests for User Story 5 (REQUIRED) ⚠️
+> **NOTE**: The tests ARE the deliverable — this story is test-only
+- [x] T049 [P] [US5] E2E test: PTY-spawn `sofia dev`, send Ctrl+C during iteration, verify exit code 0 and recovery message in `tests/e2e/developPty.spec.ts`
+- [x] T050 [P] [US5] E2E test: PTY-spawn `sofia dev`, verify iteration progress lines ("Iteration N/M") appear in PTY output buffer in `tests/e2e/developPty.spec.ts`
+- [x] T051 [US5] Add PTY availability guard to `tests/e2e/developPty.spec.ts` — skip gracefully if `node-pty` allocation fails (e.g., CI without TTY)
+**Checkpoint**: Interactive behaviors (Ctrl+C, progress output) are validated in CI via PTY simulation.
+---
+## Phase 8: User Story 6 — Workshop-to-Dev Transition Clarity (Priority: P3)
+**Goal**: Workshop displays actionable `sofia dev --session <id>` command after Plan phase completes.
+**Independent Test**: Complete Plan phase, verify output contains exact `sofia dev` command with session ID.
+**FRs covered**: FR-020, FR-021
+### Tests for User Story 6 (REQUIRED) ⚠️
+- [x] T052 [P] [US6] Unit test: workshop command displays "sofia dev --session {id}" after Plan phase completes per FR-020 in `tests/unit/cli/workshopCommand.spec.ts`
+- [x] T053 [P] [US6] Unit test: workshop command offers auto-transition prompt in interactive mode per FR-021 in `tests/unit/cli/workshopCommand.spec.ts`
+### Implementation for User Story 6
+- [x] T054 [US6] Add transition guidance message in `src/cli/workshopCommand.ts` when `getNextPhase(phase) === 'Develop'` — display exact `sofia dev --session ${session.sessionId}` command per contracts/cli.md
+- [x] T055 [US6] Add interactive mode offer ("Would you like to start PoC development now?") in `src/cli/workshopCommand.ts` per FR-021 (SHOULD — use `@inquirer/prompts` confirm)
+**Checkpoint**: Workshop users see clear next-step guidance including the exact command to run after Plan phase.
+---
+## Phase 9: Polish & Cross-Cutting Concerns
+**Purpose**: Improvements that affect multiple user stories
+**TDD note**: Complete T072–T074 (tests) before implementing T056–T058 (FR-022) to satisfy the constitution's Red → Green requirement.
+- [x] T072 [P] Unit test: scaffold TODO marker scanning records `totalInitial`, `remaining`, and `markers` in `.sofia-metadata.json` per FR-022 in `tests/unit/develop/pocScaffolder.spec.ts`
+- [x] T073 [P] Unit test: TODO marker rescan after an iteration updates `.sofia-metadata.json.todos.remaining` per FR-022 in `tests/unit/develop/ralphLoop.spec.ts`
+- [x] T074 [P] Integration test: TODO tracking writes and updates `.sofia-metadata.json` in a real scaffold output directory per FR-022 in `tests/integration/ralphLoopFlow.spec.ts` (new describe block "todo tracking")
+- [x] T075 Validation task: compare fresh vs resumed run PoC quality (test pass counts) on the same plan/session to satisfy SC-004-005; capture results in test output or quickstart notes
+- [x] T076 [P] Benchmark/validation task: measure resume detection overhead (derive checkpoint + metadata checks) and ensure <500ms per SC-004-007 (can be a small integration test with timing guard or a quickstart step)
+- [x] T056 [P] Extend `.sofia-metadata.json` schema in `src/develop/pocScaffolder.ts` to include `templateId` and `todos` fields per FR-022 and contracts/cli.md extended schema
+- [x] T057 [P] Add TODO marker scanning logic to `src/develop/pocScaffolder.ts` — scan scaffold files at scaffold time for `TODO:` markers, record in `.sofia-metadata.json`
+- [x] T058 Add TODO marker rescan after each iteration in `src/develop/ralphLoop.ts` — update `.sofia-metadata.json` with remaining TODO count per FR-022
+- [x] T059 [P] Update `src/develop/index.ts` barrel export to include `checkpointState.ts` and `templateRegistry.ts`
+- [x] T060 Run `npm run typecheck` and fix any type errors across all modified files
+- [x] T061 Run `npm run lint` and fix any lint warnings (especially `import/order`) across all modified files
+- [x] T062 Run full test suite `npm test` and verify all tests pass (no regressions)
+- [x] T063 Run `npm test -- --coverage` on `src/develop/testRunner.ts` and verify coverage ≥ 80% per SC-004-004
+- [x] T064 Run quickstart.md validation — execute the quick verification steps from `specs/004-dev-resume-hardening/quickstart.md`
+---
+## Dependencies & Execution Order
+### Phase Dependencies
+- **Setup (Phase 1)**: No dependencies — can start immediately
+- **Foundational (Phase 2)**: Depends on Setup completion — BLOCKS all user stories
+- **User Story 1 (Phase 3)**: Depends on Foundational (Phase 2) — P1, MVP
+- **User Story 2 (Phase 4)**: Depends on Foundational (Phase 2) — P1, can run in parallel with US1 (US2 works mainly in `developCommand.ts`, US1 mainly in `ralphLoop.ts`; coordinate the shared `developCommand.ts` edits)
+- **User Story 3 (Phase 5)**: Depends on Foundational (Phase 2) — P2, uses `templateRegistry.ts` from Phase 2
+- **User Story 4 (Phase 6)**: Depends on Phase 2 T005 only (protected methods + test fixture) — P2, independent of all other stories
+- **User Story 5 (Phase 7)**: Depends on US1 completion (resume behavior must work for Ctrl+C test) — P3
+- **User Story 6 (Phase 8)**: Depends on Foundational only — P3, independent of other stories
+- **Polish (Phase 9)**: Depends on all desired user stories being complete
+### User Story Dependencies
+- **User Story 1 (P1)**: Can start after Phase 2. No dependencies on other stories. 🎯 **MVP target**
+- **User Story 2 (P1)**: Can start after Phase 2. Shares `developCommand.ts` with US1 — coordinate edits but independently testable
+- **User Story 3 (P2)**: Can start after Phase 2. Uses registry from Phase 2. Independent of US1/US2
+- **User Story 4 (P2)**: Can start after T005 (protected methods). Fully independent — test-only story
+- **User Story 5 (P3)**: Needs US1 resume working. Tests resume+Ctrl+C interaction
+- **User Story 6 (P3)**: After Phase 2 only. Fully independent — workshop command changes
+### Within Each User Story
+- Tests MUST be written and FAIL before implementation
+- Implementation follows test order (entity → service → endpoint)
+- Story complete before moving to next priority
+### Parallel Opportunities
+- T001, T002, T003 can run in parallel (different files)
+- T005, T006, T007, T008, T009, T010, T011: some can run in parallel (T005 is in a different file from T007-T011)
+- T012-T018 (US1 tests) can all run in parallel (test file additions)
+- T027-T029 (US2 tests) can run in parallel
+- T033-T036 (US3 tests) can run in parallel
+- T042-T047 (US4 tests) can run in parallel (different test files)
+- US4 (Phase 6) and US6 (Phase 8) can run in parallel with US1/US2/US3 after Phase 2
+---
+## Parallel Example: User Story 1
+```text
+# Launch all tests for US1 together (different test files/blocks):
+T012: "Unit test: deriveCheckpointState in tests/unit/develop/checkpointState.spec.ts"
+T013: "Unit test: RalphLoop resume seeding in tests/unit/develop/ralphLoop.spec.ts"
+T014: "Unit test: RalphLoop skip scaffold in tests/unit/develop/ralphLoop.spec.ts"
+T015: "Unit test: RalphLoop pop incomplete in tests/unit/develop/ralphLoop.spec.ts"
+T016: "Unit test: developCommand success exit in tests/unit/cli/developCommand.spec.ts"
+T017: "Unit test: developCommand resume default in tests/unit/cli/developCommand.spec.ts"
+T018: "Unit test: resuming re-scaffolds on missing dir in tests/unit/develop/ralphLoop.spec.ts"
+# Then sequential implementation (shared files):
+T020 → T021 → T022 → T023 → T024 → T025 → T026
+```
+---
+## Implementation Strategy
+### MVP First (User Story 1 Only)
+1. Complete Phase 1: Setup (T001-T004)
+2. Complete Phase 2: Foundational (T005-T011)
+3. Complete Phase 3: User Story 1 — Resume (T012-T026)
+4. **STOP and VALIDATE**: Test resume independently
+5. Confirm that all 583+ existing tests still pass and the new resume tests are green
+### Incremental Delivery
+1. Setup + Foundational → Foundation ready
+2. ✅ User Story 1 (Resume) → Test independently → **MVP!** (core usability fix)
+3. ✅ User Story 2 (Force) → Test independently → Resume + Force both work
+4. ✅ User Story 3 (Templates) → Test independently → Multi-language scaffold
+5. ✅ User Story 4 (TestRunner) → Coverage verified → Quality gate met
+6. ✅ User Story 5 (PTY E2E) → Interactive validation in CI
+7. ✅ User Story 6 (Transition) → Full workshop→dev UX
+8. Polish → Ship
+### Parallel Team Strategy
+With multiple developers:
+1. Team completes Setup + Foundational together
+2. Once Foundational is done:
+   - Developer A: US1 (Resume) + US2 (Force) — related, same area
+   - Developer B: US3 (Templates) — independent area
+   - Developer C: US4 (TestRunner coverage) — fully independent
+   - Developer D: US6 (Workshop transition) — independent area
+3. After US1: Developer A picks up US5 (PTY E2E, needs resume)
+---
+## Notes
+- [P] tasks = different files, no dependencies
+- [Story] label maps task to specific user story for traceability
+- Each user story should be independently completable and testable
+- Verify tests fail before implementing
+- Commit after each task or logical group
+- Stop at any checkpoint to validate story independently
+- `maxIterations` counts total iterations, not additional iterations after a resume (e.g., with a maximum of 10 and 3 already done, the resumed run executes iterations 4-10)
+- Existing session schema supports resume as-is — no migration needed
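
The resume arithmetic described in the tasks and notes above (seed prior iterations, drop an incomplete trailing iteration so it is re-run, continue at `iterations.length + 1`, and treat `maxIterations` as a total) can be sketched as follows. This is a minimal illustration under stated assumptions, not the package's implementation: `IterationRecord`, its `completed` flag, `ResumePlan`, and `planResume` are hypothetical names introduced here.

```typescript
// Hypothetical shape: the real iteration record in sofia-cli may differ.
interface IterationRecord {
  iteration: number;
  // Assumed marker for an iteration that was interrupted before finishing.
  completed: boolean;
}

// Hypothetical summary of what a resumed run would do.
interface ResumePlan {
  seeded: IterationRecord[]; // prior iterations kept as history
  firstIteration: number; // number of the next iteration to run
  remaining: number; // how many iterations may still run
}

function planResume(prior: IterationRecord[], maxIterations: number): ResumePlan {
  const seeded = [...prior];
  // Drop an incomplete trailing iteration so that it is re-run.
  const last = seeded[seeded.length - 1];
  if (last !== undefined && !last.completed) {
    seeded.pop();
  }
  // maxIterations is a total, so completed iterations count against it.
  const firstIteration = seeded.length + 1;
  const remaining = Math.max(0, maxIterations - seeded.length);
  return { seeded, firstIteration, remaining };
}

// The example from the notes: 10 max with 3 done resumes at iteration 4.
const plan = planResume(
  [
    { iteration: 1, completed: true },
    { iteration: 2, completed: true },
    { iteration: 3, completed: true },
  ],
  10,
);
console.log(plan.firstIteration, plan.remaining); // 4 7
```

Returning a plain plan object rather than mutating loop state keeps the decision easy to unit test and to log at info level.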

package/specs/005-ai-search-deploy/checklists/requirements.md ADDED

@@ -0,0 +1,39 @@
+# Specification Quality Checklist: AI Foundry Search Service Deployment
+**Purpose**: Validate specification completeness and quality before proceeding to planning
+**Created**: 2026-03-01
+**Feature**: [spec.md](../spec.md)
+## Content Quality
+- [x] No implementation details (languages, frameworks, APIs)
+- [x] Focused on user value and business needs
+- [x] Written for non-technical stakeholders
+- [x] All mandatory sections completed
+## Requirement Completeness
+- [x] No [NEEDS CLARIFICATION] markers remain
+- [x] Requirements are testable and unambiguous
+- [x] Success criteria are measurable
+- [x] Success criteria are technology-agnostic (no implementation details)
+- [x] All acceptance scenarios are defined
+- [x] Edge cases are identified
+- [x] Scope is clearly bounded
+- [x] Dependencies and assumptions identified
+## Feature Readiness
+- [x] All functional requirements have clear acceptance criteria
+- [x] User scenarios cover primary flows
+- [x] Feature meets measurable outcomes defined in Success Criteria
+- [x] No implementation details leak into specification
+## Notes
+- All items pass validation. Spec is ready for `/speckit.clarify` or `/speckit.plan`.
+- The Assumptions section documents that basic agent setup is chosen over standard setup, scoping the feature to workshop/PoC complexity levels.
+- FR-009 references the `web_search_preview` tool type name as defined in the Azure AI Foundry Agent Service documentation — this is a capability name, not an implementation detail.
+- Key Entities mention "GPT-4o" as an example model; the actual model choice is parameterized per FR-004.
+- Authentication uses Azure Identity (the user's `az login` credentials) rather than a separate API key, per the Foundry Agent Service SDK pattern documented at https://learn.microsoft.com/en-us/azure/foundry/agents/how-to/tools/web-search?pivots=typescript.
+- Environment variables align with Foundry conventions: `FOUNDRY_PROJECT_ENDPOINT` and `FOUNDRY_MODEL_DEPLOYMENT_NAME` (replacing the previous `SOFIA_FOUNDRY_AGENT_ENDPOINT` / `SOFIA_FOUNDRY_AGENT_KEY` pattern).
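
The environment variable convention in the last note above can be sketched as a small check that reports which Foundry variables are missing and whether any of the replaced `SOFIA_FOUNDRY_AGENT_*` variables are still set. This is only an illustration: `checkFoundryEnv`, `EnvCheck`, and the `Env` type are hypothetical names, the endpoint value is a placeholder, and the environment is passed in as a plain object so the sketch does not depend on any runtime API.

```typescript
// Hypothetical helper types for this sketch.
type Env = Record<string, string | undefined>;

interface EnvCheck {
  configured: boolean; // true when both Foundry variables are set
  missing: string[]; // required variables that are unset or blank
  legacy: string[]; // replaced variables that are still set
}

// Variable names are taken from the checklist notes.
const REQUIRED_VARS = ['FOUNDRY_PROJECT_ENDPOINT', 'FOUNDRY_MODEL_DEPLOYMENT_NAME'];
const LEGACY_VARS = ['SOFIA_FOUNDRY_AGENT_ENDPOINT', 'SOFIA_FOUNDRY_AGENT_KEY'];

function checkFoundryEnv(env: Env): EnvCheck {
  // Treat blank or whitespace-only values as unset.
  const isSet = (name: string): boolean => (env[name] ?? '').trim() !== '';
  const missing = REQUIRED_VARS.filter((name) => !isSet(name));
  const legacy = LEGACY_VARS.filter((name) => isSet(name));
  return { configured: missing.length === 0, missing, legacy };
}

// Placeholder values for illustration only.
const report = checkFoundryEnv({
  FOUNDRY_PROJECT_ENDPOINT: 'https://example.invalid/api/projects/demo',
  SOFIA_FOUNDRY_AGENT_KEY: 'old-key',
});
console.log(report);
```

In a Node.js program the caller would typically pass `process.env` as the argument.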

package/specs/005-ai-search-deploy/contracts/web-search-tool.md ADDED

@@ -0,0 +1,241 @@
+# Contract: `web.search` Copilot SDK Tool
+**Feature**: 005-ai-search-deploy
+**Date**: 2026-03-01
+**Interface type**: Copilot SDK custom tool (registered via `ToolDefinition`)
+## Overview
+The `web.search` tool is exposed to the LLM through the GitHub Copilot SDK's tool registration system. When invoked, it queries the Azure AI Foundry Agent Service (with `web_search_preview` enabled) and returns structured results with URL citations.
+## Tool Definition
+```typescript
+const WEB_SEARCH_TOOL_DEFINITION: ToolDefinition = {
+  name: 'web.search',
+  description:
+    'Search the web for information about companies, industries, technologies, and trends. ' +
+    'Returns structured results with title, URL, and snippet.',
+  parameters: {
+    type: 'object',
+    properties: {
+      query: {
+        type: 'string',
+        description: 'The search query string.',
+      },
+    },
+    required: ['query'],
+  },
+};
+```
+**Contract stability**: The tool name (`web.search`), parameter schema, and return format are stable contracts referenced by multiple prompts (`discover.md`, `develop.md`) and the `mcpContextEnricher.ts` module. Changes require updating all consumers.
+## Input
+| Parameter | Type   | Required | Description                                                         |
+| --------- | ------ | -------- | ------------------------------------------------------------------- |
+| `query`   | string | yes      | Web search query (e.g., "Contoso Ltd competitors in healthcare AI") |
+## Output
+### Success response
+```typescript
+interface WebSearchResult {
+  results: WebSearchResultItem[];
+  sources?: string[]; // Deduplicated citation URLs
+  degraded?: false;
+}
+interface WebSearchResultItem {
+  title: string; // Page title from citation
+  url: string; // Source URL (from url_citation annotation)
+  snippet: string; // Relevant text excerpt
+}
+```
+**Example**:
+```json
+{
+  "results": [
+    {
+      "title": "Contoso Ltd - Healthcare AI Solutions",
+      "url": "https://contoso.com/about",
+      "snippet": "Contoso Ltd is a leading provider of AI-powered healthcare solutions..."
+    },
+    {
+      "title": "Top Healthcare AI Companies 2026 - TechReview",
+      "url": "https://techreview.com/healthcare-ai-2026",
+      "snippet": "The healthcare AI market is dominated by... Contoso ranks #3..."
+    }
+  ],
+  "sources": ["https://contoso.com/about", "https://techreview.com/healthcare-ai-2026"]
+}
+```
+### Degraded response
+Returned when the Foundry Agent Service is unavailable, misconfigured, or returns an error. The workshop continues without web search capabilities.
+```json
+{
+  "results": [],
+  "degraded": true,
+  "error": "Foundry agent returned 401 Unauthorized — run `az login` to refresh credentials"
+}
+```
+### Degradation scenarios
+| Condition                          | `degraded` | `error` message                                                                              |
+| ---------------------------------- | ---------- | -------------------------------------------------------------------------------------------- |
+| `FOUNDRY_PROJECT_ENDPOINT` not set | `true`     | "Web search not configured — set FOUNDRY_PROJECT_ENDPOINT and FOUNDRY_MODEL_DEPLOYMENT_NAME" |
+| `DefaultAzureCredential` fails     | `true`     | "Azure authentication failed — run `az login`"                                               |
+| Agent creation fails               | `true`     | "Failed to create web search agent: {details}"                                               |
+| Query returns error                | `true`     | "Web search query failed: {status} {message}"                                                |
+| Network error                      | `true`     | "Network error: {message}"                                                                   |
+| Rate limited (429)                 | `true`     | "Web search rate limited — retry in {seconds}s"                                              |
+## Integration Points
+### Consuming prompts
+- [src/prompts/discover.md](../../src/prompts/discover.md): `**web.search**: Research the user's industry, competitors, and trends`
+- [src/prompts/develop.md](../../src/prompts/develop.md): `**web.search** — Use when stuck on an implementation pattern`
+### Consuming code
+- `src/develop/mcpContextEnricher.ts`: Calls `isWebSearchConfigured()` to conditionally query web search when stuck for 2+ iterations
+- `src/cli/preflight.ts` (new): Legacy env var detection check (FR-016)
+## Configuration Contract
+### Required environment variables
+| Variable                        | Example                                                                         | Description                              |
+| ------------------------------- | ------------------------------------------------------------------------------- | ---------------------------------------- |
+| `FOUNDRY_PROJECT_ENDPOINT`      | `https://sofia-foundry-abc123.services.ai.azure.com/api/projects/sofia-project` | Foundry project endpoint URL             |
+| `FOUNDRY_MODEL_DEPLOYMENT_NAME` | `gpt-4.1-mini`                                                                  | Model deployment name for agent creation |
+### Authentication
+Uses `DefaultAzureCredential` — no API key environment variables. User must be logged in via `az login` (local development) or have a managed identity (Azure-hosted).
+### Legacy env var rejection (FR-016)
+If either `SOFIA_FOUNDRY_AGENT_ENDPOINT` or `SOFIA_FOUNDRY_AGENT_KEY` is set:
+- Preflight check fails with `required: true`
+- Error message: `"Legacy web search env vars detected. Migrate: replace SOFIA_FOUNDRY_AGENT_ENDPOINT with FOUNDRY_PROJECT_ENDPOINT and remove SOFIA_FOUNDRY_AGENT_KEY (API key auth is no longer used). See docs/environment.md"`
+## Lifecycle Contract
+### Initialization (lazy)
+```
+Session starts → web.search NOT invoked → no agent created (zero overhead)
+Session starts → web.search invoked → agent + conversation created → reused for session
+```
+### Cleanup
+```
+Session ends → destroyWebSearchSession() called → conversation deleted → agent version deleted
+Process exit → process.beforeExit handler → same cleanup
+Cleanup fails → warning logged → no throw (stale agent cleaned manually)
+```
+### Public API
+```typescript
+// Check if web search can be used (env vars present)
+function isWebSearchConfigured(): boolean;
+// Create the tool handler function
+function createWebSearchTool(config: WebSearchConfig): (query: string) => Promise<WebSearchResult>;
+// Explicitly clean up the ephemeral agent and conversation
+function destroyWebSearchSession(): Promise<void>;
+```
+---
+# Contract: Deployment Script CLI
+**Interface type**: Shell script (Bash)
+## `deploy.sh`
+### Usage
+```bash
+./infra/deploy.sh \
+  --resource-group <resource-group-name> \
+  [--subscription <subscription-id>] \
+  [--location <azure-region>] \
+  [--account-name <foundry-account-name>] \
+  [--model <model-deployment-name>]
+```
+### Parameters
+| Flag                     | Required | Default                     | Description                              |
+| ------------------------ | -------- | --------------------------- | ---------------------------------------- |
+| `--resource-group`, `-g` | yes      | —                           | Resource group name (created if missing) |
+| `--subscription`, `-s`   | no       | current az CLI subscription | Azure subscription ID                    |
+| `--location`, `-l`       | no       | `swedencentral`             | Azure region                             |
+| `--account-name`, `-n`   | no       | `sofia-foundry`             | Foundry account name                     |
+| `--model`, `-m`          | no       | `gpt-4.1-mini`              | Model deployment name                    |
+### Exit codes
+| Code | Meaning                                                 |
+| ---- | ------------------------------------------------------- |
+| 0    | Deployment succeeded                                    |
+| 1    | Prerequisite check failed (az CLI, login, subscription) |
+| 2    | Deployment failed (Bicep error)                         |
+### Output (stdout on success)
+The script writes `FOUNDRY_PROJECT_ENDPOINT` and `FOUNDRY_MODEL_DEPLOYMENT_NAME` to a `.env` file in the workspace root (creating or updating it), then prints:
+```
+✅ Deployment complete!
+Environment variables written to /path/to/workspace/.env:
+  FOUNDRY_PROJECT_ENDPOINT="https://sofia-foundry-abc123.services.ai.azure.com/api/projects/sofia-project"
+  FOUNDRY_MODEL_DEPLOYMENT_NAME="gpt-4.1-mini"
+To tear down: ./infra/teardown.sh --resource-group <resource-group-name>
+```
+## `teardown.sh`
+### Usage
+```bash
+./infra/teardown.sh --resource-group <resource-group-name>
+```
+### Parameters
+| Flag                     | Required | Default | Description              |
+| ------------------------ | -------- | ------- | ------------------------ |
+| `--resource-group`, `-g` | yes      | —       | Resource group to delete |
+### Exit codes
+| Code | Meaning                                            |
+| ---- | -------------------------------------------------- |
+| 0    | Teardown succeeded or resource group doesn't exist |
+| 1    | Prerequisite check failed                          |
+| 2    | Deletion failed                                    |
+### Behavior
+- If resource group doesn't exist: prints informational message, exits 0
+- Prompts for confirmation before deletion (unless the `--yes` flag is passed)
+- Uses `az group delete --yes --no-wait` for non-blocking deletion
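
The degraded-response contract for `web.search` earlier in this file (empty `results`, `degraded: true`, and an `error` string per the degradation table) can be sketched as a mapping from failure kinds to responses. This is a sketch under assumptions, not the package's code: `DegradedResult`, `Failure`, and `toDegradedResult` are hypothetical names, and only three of the table's rows are covered. The message templates are copied from the table.

```typescript
interface WebSearchResultItem {
  title: string;
  url: string;
  snippet: string;
}

// Assumed type: the contract only declares an interface for the success case.
interface DegradedResult {
  results: WebSearchResultItem[]; // always empty when degraded
  degraded: true;
  error: string;
}

// Hypothetical failure descriptions for three rows of the degradation table.
type Failure =
  | { kind: 'agent-creation'; details: string }
  | { kind: 'query'; status: number; message: string }
  | { kind: 'network'; message: string };

function toDegradedResult(failure: Failure): DegradedResult {
  const degraded = (error: string): DegradedResult => ({ results: [], degraded: true, error });
  switch (failure.kind) {
    case 'agent-creation':
      return degraded(`Failed to create web search agent: ${failure.details}`);
    case 'query':
      return degraded(`Web search query failed: ${failure.status} ${failure.message}`);
    case 'network':
      return degraded(`Network error: ${failure.message}`);
  }
}

console.log(toDegradedResult({ kind: 'query', status: 401, message: 'Unauthorized' }));
```

Modeling failures as a discriminated union lets the compiler flag any failure kind that lacks a message, which helps keep the error strings aligned with the table.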