npm - retestkit - Versions diffs - 1.4.1 → 1.5.0 - Mend

retestkit 1.4.1 → 1.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (238) hide show

package/README.md +59 -40
package/dist/config.js +8 -8
package/dist/config.js.map +1 -1
package/dist/logger.js +1 -1
package/dist/logger.js.map +1 -1
package/dist/prompts/index.d.ts +1 -1
package/dist/prompts/index.d.ts.map +1 -1
package/dist/prompts/index.js +21 -21
package/dist/prompts/index.js.map +1 -1
package/dist/prompts/templates/mcp/retest-crawl.md +7 -0
package/{src/prompts/templates/mcp/webtest-discover-flows.md → dist/prompts/templates/mcp/retest-discover-flows.md} +1 -1
package/{src/prompts/templates/mcp/webtest-discover.md → dist/prompts/templates/mcp/retest-discover.md} +2 -2
package/dist/prompts/templates/mcp/retest-full-workflow.md +12 -0
package/{src/prompts/templates/mcp/webtest-generate-tests.md → dist/prompts/templates/mcp/retest-generate-tests.md} +1 -1
package/{src/prompts/templates/mcp/webtest-run-test.md → dist/prompts/templates/mcp/retest-run-test.md} +1 -1
package/{src/prompts/templates/mcp/webtest-start.md → dist/prompts/templates/mcp/retest-start.md} +1 -1
package/{src → dist}/prompts/templates/sampling/system-prefix.md +1 -1
package/dist/resources/index.js +7 -7
package/dist/resources/index.js.map +1 -1
package/dist/schemas/config.js +2 -2
package/dist/schemas/config.js.map +1 -1
package/dist/security/index.js +1 -1
package/dist/security/index.js.map +1 -1
package/dist/server.js +3 -3
package/dist/server.js.map +1 -1
package/dist/test-utils/mock-context.js +22 -22
package/dist/test-utils/mock-context.js.map +1 -1
package/dist/tools/index.d.ts +1 -1
package/dist/tools/index.d.ts.map +1 -1
package/dist/tools/index.js +5 -5
package/dist/tools/index.js.map +1 -1
package/dist/tools/retest/crawl.d.ts.map +1 -0
package/dist/tools/{webtest → retest}/crawl.js +7 -7
package/dist/tools/retest/crawl.js.map +1 -0
package/dist/tools/retest/discover-features.d.ts.map +1 -0
package/dist/tools/{webtest → retest}/discover-features.js +6 -6
package/dist/tools/retest/discover-features.js.map +1 -0
package/dist/tools/retest/discover-flows.d.ts.map +1 -0
package/dist/tools/{webtest → retest}/discover-flows.js +6 -6
package/dist/tools/retest/discover-flows.js.map +1 -0
package/dist/tools/retest/generate-tests.d.ts.map +1 -0
package/dist/tools/{webtest → retest}/generate-tests.js +5 -5
package/dist/tools/retest/generate-tests.js.map +1 -0
package/dist/tools/retest/index.d.ts.map +1 -0
package/dist/tools/retest/index.js.map +1 -0
package/dist/tools/retest/run-test-case.d.ts.map +1 -0
package/dist/tools/{webtest → retest}/run-test-case.js +3 -3
package/dist/tools/retest/run-test-case.js.map +1 -0
package/dist/tools/retest/schemas.d.ts.map +1 -0
package/dist/tools/retest/schemas.js.map +1 -0
package/dist/tools/retest/start-analysis.d.ts.map +1 -0
package/dist/tools/{webtest → retest}/start-analysis.js +5 -5
package/dist/tools/retest/start-analysis.js.map +1 -0
package/dist/workspace/index.js +8 -8
package/dist/workspace/index.js.map +1 -1
package/dist/workspace/types.d.ts +2 -2
package/dist/workspace/types.d.ts.map +1 -1
package/package.json +6 -2
package/.claude/commands/openspec/apply.md +0 -23
package/.claude/commands/openspec/archive.md +0 -27
package/.claude/commands/openspec/proposal.md +0 -28
package/.gemini/commands/openspec/apply.toml +0 -21
package/.gemini/commands/openspec/archive.toml +0 -25
package/.gemini/commands/openspec/proposal.toml +0 -26
package/.github/prompts/openspec-apply.prompt.md +0 -22
package/.github/prompts/openspec-archive.prompt.md +0 -26
package/.github/prompts/openspec-proposal.prompt.md +0 -27
package/.github/workflows/release.yml +0 -33
package/.kilocode/workflows/openspec-apply.md +0 -17
package/.kilocode/workflows/openspec-archive.md +0 -21
package/.kilocode/workflows/openspec-proposal.md +0 -22
package/.mcp.json +0 -23
package/.opencode/command/openspec-apply.md +0 -25
package/.opencode/command/openspec-archive.md +0 -28
package/.opencode/command/openspec-proposal.md +0 -30
package/.roo/commands/openspec-apply.md +0 -20
package/.roo/commands/openspec-archive.md +0 -24
package/.roo/commands/openspec-proposal.md +0 -25
package/.vscode/mcp.json +0 -23
package/AGENTS.md +0 -18
package/CLAUDE.md +0 -18
package/dist/tools/webtest/crawl.d.ts.map +0 -1
package/dist/tools/webtest/crawl.js.map +0 -1
package/dist/tools/webtest/discover-features.d.ts.map +0 -1
package/dist/tools/webtest/discover-features.js.map +0 -1
package/dist/tools/webtest/discover-flows.d.ts.map +0 -1
package/dist/tools/webtest/discover-flows.js.map +0 -1
package/dist/tools/webtest/generate-tests.d.ts.map +0 -1
package/dist/tools/webtest/generate-tests.js.map +0 -1
package/dist/tools/webtest/index.d.ts.map +0 -1
package/dist/tools/webtest/index.js.map +0 -1
package/dist/tools/webtest/run-test-case.d.ts.map +0 -1
package/dist/tools/webtest/run-test-case.js.map +0 -1
package/dist/tools/webtest/schemas.d.ts.map +0 -1
package/dist/tools/webtest/schemas.js.map +0 -1
package/dist/tools/webtest/start-analysis.d.ts.map +0 -1
package/dist/tools/webtest/start-analysis.js.map +0 -1
package/openspec/AGENTS.md +0 -456
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/proposal.md +0 -33
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/specs/webtest-resources/spec.md +0 -27
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/specs/webtest-tools/spec.md +0 -304
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/tasks.md +0 -43
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/design.md +0 -209
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/proposal.md +0 -41
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/specs/mcp-server-core/spec.md +0 -183
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/tasks.md +0 -112
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/design.md +0 -333
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/proposal.md +0 -66
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/mcp-server-core/spec.md +0 -129
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-lifecycle/spec.md +0 -138
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-logging/spec.md +0 -211
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-prompts/spec.md +0 -157
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-resources/spec.md +0 -213
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-sampling/spec.md +0 -257
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-tools/spec.md +0 -501
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/tasks.md +0 -264
package/openspec/changes/archive/2025-12-18-allow-analysis-of-incomplete-crawls/proposal.md +0 -24
package/openspec/changes/archive/2025-12-18-allow-analysis-of-incomplete-crawls/specs/webtest-tools/spec.md +0 -80
package/openspec/changes/archive/2025-12-18-allow-analysis-of-incomplete-crawls/tasks.md +0 -8
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/design.md +0 -90
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/proposal.md +0 -28
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/specs/webtest-sampling/spec.md +0 -90
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/tasks.md +0 -33
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/design.md +0 -558
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/proposal.md +0 -119
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/specs/webtest-resources/spec.md +0 -109
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/specs/webtest-tools/spec.md +0 -121
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/tasks.md +0 -133
package/openspec/changes/extract-prompts-to-markdown/design.md +0 -86
package/openspec/changes/extract-prompts-to-markdown/proposal.md +0 -50
package/openspec/changes/extract-prompts-to-markdown/specs/webtest-prompts/spec.md +0 -74
package/openspec/changes/extract-prompts-to-markdown/tasks.md +0 -40
package/openspec/changes/refactor-webtest-naming/design.md +0 -95
package/openspec/changes/refactor-webtest-naming/proposal.md +0 -66
package/openspec/changes/refactor-webtest-naming/specs/webtest-prompts/spec.md +0 -79
package/openspec/changes/refactor-webtest-naming/specs/webtest-resources/spec.md +0 -80
package/openspec/changes/refactor-webtest-naming/specs/webtest-sampling/spec.md +0 -122
package/openspec/changes/refactor-webtest-naming/specs/webtest-tools/spec.md +0 -113
package/openspec/changes/refactor-webtest-naming/tasks.md +0 -119
package/openspec/changes/rename-package-to-retest/proposal.md +0 -52
package/openspec/changes/rename-package-to-retest/specs/mcp-server-core/spec.md +0 -53
package/openspec/changes/rename-package-to-retest/specs/retest-lifecycle/spec.md +0 -68
package/openspec/changes/rename-package-to-retest/specs/retest-logging/spec.md +0 -35
package/openspec/changes/rename-package-to-retest/specs/retest-prompts/spec.md +0 -159
package/openspec/changes/rename-package-to-retest/specs/retest-resources/spec.md +0 -251
package/openspec/changes/rename-package-to-retest/specs/retest-sampling/spec.md +0 -99
package/openspec/changes/rename-package-to-retest/specs/retest-tools/spec.md +0 -295
package/openspec/changes/rename-package-to-retest/tasks.md +0 -71
package/openspec/project.md +0 -31
package/openspec/specs/mcp-server-core/spec.md +0 -178
package/openspec/specs/webtest-lifecycle/spec.md +0 -136
package/openspec/specs/webtest-logging/spec.md +0 -209
package/openspec/specs/webtest-prompts/spec.md +0 -155
package/openspec/specs/webtest-resources/spec.md +0 -248
package/openspec/specs/webtest-sampling/spec.md +0 -344
package/openspec/specs/webtest-tools/spec.md +0 -282
package/release.config.js +0 -9
package/src/config.test.ts +0 -96
package/src/config.ts +0 -32
package/src/elicitation/index.test.ts +0 -399
package/src/elicitation/index.ts +0 -171
package/src/elicitation/types.ts +0 -68
package/src/index.ts +0 -83
package/src/lifecycle/index.test.ts +0 -260
package/src/lifecycle/index.ts +0 -101
package/src/logger.redaction.test.ts +0 -322
package/src/logger.test.ts +0 -123
package/src/logger.ts +0 -229
package/src/playwright-client/index.ts +0 -392
package/src/playwright-client/types.ts +0 -99
package/src/progress/index.test.ts +0 -327
package/src/progress/index.ts +0 -170
package/src/progress/types.ts +0 -25
package/src/prompts/index.test.ts +0 -451
package/src/prompts/index.ts +0 -246
package/src/prompts/loader.test.ts +0 -100
package/src/prompts/loader.ts +0 -59
package/src/prompts/templates/mcp/webtest-crawl.md +0 -7
package/src/prompts/templates/mcp/webtest-full-workflow.md +0 -12
package/src/resources/index.ts +0 -250
package/src/resources/subscriptions.ts +0 -37
package/src/sampling/index.test.ts +0 -414
package/src/sampling/index.ts +0 -286
package/src/sampling/prompts.ts +0 -194
package/src/sampling/types.ts +0 -60
package/src/schemas/config.ts +0 -39
package/src/security/index.test.ts +0 -441
package/src/security/index.ts +0 -361
package/src/security/security-scenarios.test.ts +0 -468
package/src/server.ts +0 -211
package/src/test-utils/index.ts +0 -6
package/src/test-utils/mock-context.ts +0 -426
package/src/test-utils/mock-playwright-client.ts +0 -422
package/src/tools/index.ts +0 -11
package/src/tools/webtest/crawl.test.ts +0 -834
package/src/tools/webtest/crawl.ts +0 -901
package/src/tools/webtest/discover-features.ts +0 -412
package/src/tools/webtest/discover-flows.ts +0 -408
package/src/tools/webtest/generate-tests.test.ts +0 -532
package/src/tools/webtest/generate-tests.ts +0 -425
package/src/tools/webtest/index.ts +0 -7
package/src/tools/webtest/integration.test.ts +0 -536
package/src/tools/webtest/run-test-case.test.ts +0 -659
package/src/tools/webtest/run-test-case.ts +0 -508
package/src/tools/webtest/schemas.ts +0 -201
package/src/tools/webtest/start-analysis.test.ts +0 -151
package/src/tools/webtest/start-analysis.ts +0 -158
package/src/transports/http.ts +0 -19
package/src/transports/index.ts +0 -30
package/src/transports/stdio.ts +0 -7
package/src/types/capabilities.test.ts +0 -193
package/src/types/capabilities.ts +0 -50
package/src/types/context.ts +0 -21
package/src/types/tool.ts +0 -11
package/src/workspace/index.ts +0 -945
package/src/workspace/markdown.ts +0 -272
package/src/workspace/types.ts +0 -186
package/tests/integration/server.test.ts +0 -89
package/tests/integration/tools.test.ts +0 -99
package/tsconfig.json +0 -20
package/vitest.config.ts +0 -9
package/vitest.integration.config.ts +0 -10
/package/{src → dist}/prompts/templates/sampling/crawl-action.md +0 -0
/package/{src → dist}/prompts/templates/sampling/feature-discovery.md +0 -0
/package/{src → dist}/prompts/templates/sampling/flow-discovery.md +0 -0
/package/{src → dist}/prompts/templates/sampling/page-content-wrapper.md +0 -0
/package/{src → dist}/prompts/templates/sampling/test-evaluation.md +0 -0
/package/{src → dist}/prompts/templates/sampling/test-generation.md +0 -0
/package/dist/tools/{webtest → retest}/crawl.d.ts +0 -0
/package/dist/tools/{webtest → retest}/discover-features.d.ts +0 -0
/package/dist/tools/{webtest → retest}/discover-flows.d.ts +0 -0
/package/dist/tools/{webtest → retest}/generate-tests.d.ts +0 -0
/package/dist/tools/{webtest → retest}/index.d.ts +0 -0
/package/dist/tools/{webtest → retest}/index.js +0 -0
/package/dist/tools/{webtest → retest}/run-test-case.d.ts +0 -0
/package/dist/tools/{webtest → retest}/schemas.d.ts +0 -0
/package/dist/tools/{webtest → retest}/schemas.js +0 -0
/package/dist/tools/{webtest → retest}/start-analysis.d.ts +0 -0

package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/specs/webtest-tools/spec.md DELETED Viewed

@@ -1,304 +0,0 @@
-## MODIFIED Requirements
-### Requirement: webtest_init Tool
-The system SHALL provide a `webtest_init` tool that initializes an analysis workspace for a target URL and focus.
-#### Scenario: Start analysis with valid URL
-- **GIVEN** the tool is called with a valid URL and focus
-- **WHEN** execution completes
-- **THEN** it SHALL generate a unique `analysisId`
-- **AND** create workspace directories
-- **AND** write initial `index.json` metadata
-- **AND** return `{ analysisId, workspaceRootPath, workspaceRootUri, statusUri }`
-#### Scenario: Start analysis validates URL
-- **GIVEN** the tool is called with an invalid URL
-- **WHEN** validation occurs
-- **THEN** it SHALL return an error with message "Invalid URL format"
-#### Scenario: Start analysis normalizes domain for allowlist
-- **GIVEN** the tool is called with URL "https://example.com/path"
-- **WHEN** workspace is created
-- **THEN** the default `allowedDomains` SHALL include "example.com"
-- **AND** this SHALL be stored in workspace metadata
-#### Scenario: Start analysis accepts custom limits
-- **GIVEN** the tool is called with `limits: { maxSteps: 50, maxPages: 10, maxMinutes: 5 }`
-- **WHEN** workspace is created
-- **THEN** limits SHALL be stored in workspace metadata
-- **AND** subsequent crawls SHALL respect these limits
-### Requirement: webtest_crawl_app Tool
-The system SHALL provide a `webtest_crawl_app` tool that dynamically explores a web application to achieve a goal.
-#### Scenario: Crawl navigates to starting URL
-- **GIVEN** the tool is called with a valid analysisId
-- **WHEN** crawl begins
-- **THEN** it SHALL launch Playwright MCP browser
-- **AND** navigate to the URL from analysis metadata
-#### Scenario: Crawl captures artifacts at each checkpoint
-- **GIVEN** a crawl iteration completes an action
-- **WHEN** state is captured
-- **THEN** it SHALL call Playwright MCP `browser_snapshot` for accessibility tree
-- **AND** call `browser_take_screenshot` for visual evidence
-- **AND** optionally extract HTML DOM
-- **AND** store artifacts in workspace with unique page IDs
-#### Scenario: Crawl uses sampling for next action
-- **GIVEN** crawl has captured current state
-- **WHEN** next action is needed
-- **THEN** it SHALL construct a sampling prompt with goal, history, current state
-- **AND** request next action via `sampling/createMessage`
-- **AND** validate and execute returned Playwright actions
-#### Scenario: Crawl terminates when goal satisfied
-- **GIVEN** crawl sampling returns `goalSatisfied: true`
-- **WHEN** this is detected
-- **THEN** crawl SHALL finalize with success status
-- **AND** write crawl summary to workspace
-#### Scenario: Crawl terminates when limits reached
-- **GIVEN** crawl has executed `maxSteps` actions
-- **WHEN** limit is checked
-- **THEN** crawl SHALL finalize with "limits_reached" status
-- **AND** preserve all collected artifacts
-#### Scenario: Crawl handles navigation loops
-- **GIVEN** crawl detects same page state 3 times consecutively
-- **WHEN** loop is detected
-- **THEN** it SHALL log a warning
-- **AND** request alternative action from sampling with loop context
-#### Scenario: Crawl triggers elicitation for cookie consent
-- **GIVEN** crawl detects a cookie consent dialog
-- **WHEN** elicitation is supported
-- **THEN** it SHALL call elicitation with options: "Accept", "Reject", "Dismiss"
-- **AND** execute the chosen action
-#### Scenario: Crawl triggers elicitation for blocking modal
-- **GIVEN** crawl detects a modal blocking navigation
-- **WHEN** elicitation is supported
-- **THEN** it SHALL call elicitation with options: "Close modal", "Interact with modal content"
-#### Scenario: Crawl triggers elicitation for ambiguous navigation
-- **GIVEN** sampling returns `needsElicitation: { type: "ambiguous", options: [...] }`
-- **WHEN** elicitation is supported
-- **THEN** it SHALL present the options to the user
-- **AND** use the selection to continue crawl
-#### Scenario: Crawl stops on authentication required
-- **GIVEN** crawl detects login form or auth wall
-- **WHEN** elicitation is supported
-- **THEN** it SHALL call elicitation with options: "Stop analysis", "Continue unauthenticated"
-- **AND** never request credentials
-#### Scenario: Crawl emits progress notifications
-- **GIVEN** crawl is running
-- **WHEN** each iteration completes
-- **THEN** it SHALL emit progress notification with step count, pages discovered, current intent
-#### Scenario: Crawl responds to cancellation
-- **GIVEN** crawl receives `notifications/cancelled`
-- **WHEN** cancellation is detected
-- **THEN** it SHALL stop the crawl loop promptly
-- **AND** finalize with "cancelled" status
-- **AND** preserve collected artifacts
-#### Scenario: Crawl returns fallback when sampling unavailable
-- **GIVEN** client does not support sampling
-- **WHEN** crawl needs next action
-- **THEN** it SHALL return `{ needsManualInput: true, promptUri, currentState }`
-- **AND** accept `manualNextActions` input to continue
-#### Scenario: Crawl outputs complete results
-- **GIVEN** crawl has finalized
-- **WHEN** output is returned
-- **THEN** it SHALL include `crawlId`, `crawlIndexFilePath`, `crawlIndexUri`, `pages[]`, `summaryUri`
-### Requirement: webtest_analyze_app Tool
-The system SHALL provide a `webtest_analyze_app` tool that reverse-engineers application structure from crawl data.
-#### Scenario: Analyze app loads crawl data
-- **GIVEN** the tool is called with valid analysisId and crawlId
-- **WHEN** execution begins
-- **THEN** it SHALL load crawl index and artifact references
-- **AND** load page snapshots for key pages
-#### Scenario: Analyze app uses sampling for analysis
-- **GIVEN** crawl data is loaded
-- **WHEN** analysis is performed
-- **THEN** it SHALL construct sampling prompt with crawl summary and snapshots
-- **AND** request structured analysis via `sampling/createMessage`
-#### Scenario: Analyze app extracts application purpose
-- **GIVEN** analysis sampling completes
-- **WHEN** results are processed
-- **THEN** output SHALL include identified app purpose
-- **AND** key entities (users, products, orders, etc.)
-#### Scenario: Analyze app identifies user flows
-- **GIVEN** analysis sampling completes
-- **WHEN** results are processed
-- **THEN** output SHALL include discovered user flows
-- **AND** each flow SHALL have id, name, description, steps
-#### Scenario: Analyze app suggests assertions
-- **GIVEN** analysis sampling completes
-- **WHEN** results are processed
-- **THEN** output SHALL include suggested assertions for testing
-- **AND** potential risks or edge cases
-#### Scenario: Analyze app writes markdown report
-- **GIVEN** analysis is complete
-- **WHEN** output is generated
-- **THEN** it SHALL write `app-analysis.md` resource to workspace
-#### Scenario: Analyze app outputs file paths and URIs
-- **GIVEN** analysis is complete
-- **WHEN** tool returns
-- **THEN** it SHALL include `appAnalysisFilePath`, `appAnalysisUri`, `flowsFilePath`, and `flowsUri`
-### Requirement: webtest_generate_tests Tool
-The system SHALL provide a `webtest_generate_tests` tool that produces test cases from application analysis.
-#### Scenario: Generate tests loads analysis
-- **GIVEN** the tool is called with valid analysisId and appAnalysisUri
-- **WHEN** execution begins
-- **THEN** it SHALL load app analysis and flows from workspace
-#### Scenario: Generate tests uses sampling
-- **GIVEN** analysis is loaded
-- **WHEN** test generation is performed
-- **THEN** it SHALL construct sampling prompt with analysis, flows, strategy
-- **AND** request test cases via `sampling/createMessage`
-#### Scenario: Generate tests applies strategy
-- **GIVEN** tool is called with `testStrategy: { count: 5, types: ["smoke", "negative"] }`
-- **WHEN** sampling prompt is built
-- **THEN** it SHALL instruct model to generate 5 tests covering smoke and negative scenarios
-#### Scenario: Generate tests outputs structured format
-- **GIVEN** test generation completes
-- **WHEN** results are written
-- **THEN** it SHALL produce `tests.md` with human-readable format
-- **AND** `tests.json` with structured test definitions
-#### Scenario: Test case structure is complete
-- **GIVEN** tests.json is generated
-- **WHEN** a test case is examined
-- **THEN** it SHALL include: id, name, purpose, preconditions, steps[], expected results, priority
-#### Scenario: Generate tests outputs file paths and URIs
-- **GIVEN** generation is complete
-- **WHEN** tool returns
-- **THEN** it SHALL include `testsFilePath`, `testsUri`, and `testIndexUri`
-### Requirement: webtest_run_tests Tool
-The system SHALL provide a `webtest_run_tests` tool that executes a test case with evidence capture.
-#### Scenario: Run test case loads test definition
-- **GIVEN** the tool is called with valid analysisId and testCaseId
-- **WHEN** execution begins
-- **THEN** it SHALL load test case from tests index
-- **AND** validate test case exists
-#### Scenario: Run test case executes steps sequentially
-- **GIVEN** test case has multiple steps
-- **WHEN** execution runs
-- **THEN** it SHALL execute each step in order
-- **AND** capture state before and after each step
-#### Scenario: Run test case uses sampling for step translation
-- **GIVEN** a test step needs execution
-- **WHEN** translation is needed
-- **THEN** it SHALL use sampling to convert step description to Playwright actions
-- **AND** validate actions before execution
-#### Scenario: Run test case captures evidence
-- **GIVEN** a step is executed
-- **WHEN** evidence is captured
-- **THEN** it SHALL take screenshot after action
-- **AND** capture accessibility snapshot
-- **AND** store with step identifier
-#### Scenario: Run test case evaluates pass/fail
-- **GIVEN** a step has executed
-- **WHEN** evaluation occurs
-- **THEN** it SHALL use sampling to compare expected vs actual
-- **AND** record pass or fail with reason
-#### Scenario: Run test case continues on step failure
-- **GIVEN** a step fails
-- **WHEN** failure is recorded
-- **THEN** execution SHALL continue to next step (unless critical)
-- **AND** overall test status SHALL be "failed"
-#### Scenario: Run test case emits progress
-- **GIVEN** test is running
-- **WHEN** each step completes
-- **THEN** it SHALL emit progress notification with step number, status
-#### Scenario: Run test case responds to cancellation
-- **GIVEN** test receives `notifications/cancelled`
-- **WHEN** cancellation is detected
-- **THEN** it SHALL stop after current step
-- **AND** finalize with "cancelled" status and partial results
-#### Scenario: Run test case outputs report
-- **GIVEN** test execution completes
-- **WHEN** output is generated
-- **THEN** it SHALL write `report.md` with pass/fail summary, step details, evidence links
-- **AND** `artifacts.json` with structured run data
-#### Scenario: Run test case returns file paths and URIs
-- **GIVEN** execution is complete
-- **WHEN** tool returns
-- **THEN** it SHALL include `testRunId`, `reportFilePath`, `reportUri`, `runArtifactsIndexUri`
-- **AND** each step result SHALL include `evidenceFilePath` and `evidenceUri` for captured artifacts

package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/tasks.md DELETED Viewed

@@ -1,43 +0,0 @@
-## 1. Workspace Manager Updates
-- [x] 1.1 Add `workspaceDir` getter to expose base directory path
-- [x] 1.2 Update `saveAnalysis` to return `{ appAnalysisFilePath, appAnalysisUri, flowsFilePath, flowsUri }`
-- [x] 1.3 Update `saveTests` to return `{ testsFilePath, testsUri, testsMarkdownFilePath }`
-- [x] 1.4 Update `savePage` to return `{ snapshotFilePath, screenshotFilePath, domFilePath }` alongside URIs
-- [x] 1.5 Update `saveTestStepEvidence` to return file paths alongside URIs
-- [x] 1.6 Update `createWorkspace` to return `{ workspacePath, workspaceUri }`
-- [x] 1.7 Update `createCrawl` to return `{ crawlPath, crawlIndexUri }`
-## 2. Type Definitions
-- [x] 2.1 Update workspace types to include FilePath variants in return types
-- [x] 2.2 Add TypeScript interfaces for hybrid artifact results
-## 3. Tool Updates
-- [x] 3.1 Update `webtest_init` to include `workspaceRootPath` in result
-- [x] 3.2 Update `webtest_crawl_app` to include `crawlIndexFilePath` in result
-- [x] 3.3 Update `webtest_analyze_app` to include `appAnalysisFilePath` and `flowsFilePath` in result
-- [x] 3.4 Update `webtest_generate_tests` to include `testsFilePath` in result
-- [x] 3.5 Update `webtest_run_tests` to include `reportFilePath` and evidence file paths in result
-## 4. Tests
-- [x] 4.1 Update workspace manager tests to verify file path return values
-- [x] 4.2 Update tool tests to verify file path fields in results
-- [x] 4.3 Verify file paths are absolute and valid
-## 5. Workspace Index Persistence (Fix for handover issue)
-- [x] 5.1 Add `appAnalysisFilePath` and `flowsFilePath` to `AnalysisReference` type
-- [x] 5.2 Add `testsFilePath` and `testsMarkdownFilePath` to `TestsReference` type
-- [x] 5.3 Update `saveAnalysis` to persist file paths in workspace index
-- [x] 5.4 Update `saveTests` to persist file paths in workspace index
-## 6. Manual Input Support (Fix for fallback mode when sampling unavailable)
-- [x] 6.1 Add `manualAnalysis` parameter to `webtest_analyze_app` input schema
-- [x] 6.2 Update `webtest_analyze_app` handler to use manual input when provided
-- [x] 6.3 Update fallback instructions to guide calling tool again with manual input
-- [x] 6.4 Add `manualTests` parameter to `webtest_generate_tests` input schema
-- [x] 6.5 Update `webtest_generate_tests` handler to use manual input when provided

package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/design.md DELETED Viewed

@@ -1,209 +0,0 @@
-## Context
-This MCP Web Testing Server needs a solid foundation that follows MCP protocol best practices while being structured for extensibility. The server will eventually provide tools for automated web application testing, so the initial architecture must support easy addition of testing-specific tools (browser automation, HTTP requests, assertions, etc.).
-**Stakeholders**: Developers building testing workflows, MCP client integrations (Claude Code, ChatGPT desktop, etc.)
-**Constraints**:
-- Must follow MCP specification (2025-06-18)
-- Must support both stdio and HTTP transports from day one
-- Proprietary codebase - no external contributions expected
-## Goals / Non-Goals
-**Goals:**
-- Establish clean, extensible project structure for adding testing tools
-- Follow official MCP TypeScript SDK patterns
-- Provide pluggable transport layer (stdio + Streamable HTTP)
-- Create self-describing tool registry pattern
-- Enable rapid development iteration with hot-reloading
-- Include comprehensive test infrastructure (unit + integration)
-- Production-ready logging and configuration
-**Non-Goals:**
-- Authentication/authorization (deferred to future proposal)
-- Resource or prompt handlers (tools-focused initially)
-- Actual testing functionality (that comes in subsequent proposals)
-## Decisions
-### Decision 1: TypeScript with Official MCP SDK
-**What**: Use TypeScript with `@modelcontextprotocol/sdk` and Zod for schemas.
-**Why**:
-- Official SDK maintained by protocol authors
-- TypeScript provides type safety and IDE support
-- Zod is required peer dependency and excellent for runtime validation
-- Largest ecosystem of MCP reference implementations
-**Alternatives considered**:
-- Python SDK: Good option but TypeScript has better MCP ecosystem maturity
-- Go SDK: Less mature, fewer reference implementations to learn from
-### Decision 2: ESM-Only with Node.js >= 22.18.0
-**What**: Use ECMAScript modules exclusively, recommend Node.js 22.18.0+.
-**Why**:
-- Node.js 22.18.0+ has built-in type stripping enabled by default (instant reloads without build step)
-- Earlier 22.x versions require `--experimental-strip-types` flag
-- ESM is the modern standard and SDK uses ESM imports
-- Simplifies tooling configuration
-**Alternatives considered**:
-- CommonJS: Legacy, more tooling complexity
-- Dual ESM/CJS: Unnecessary complexity for internal project
-### Decision 3: Pluggable Transport Architecture
-**What**: Abstract transport selection via environment variables with implementations in `src/transports/`.
-```
-TRANSPORT=stdio         # Default for local dev
-TRANSPORT=http PORT=3000  # For remote deployment
-```
-**Why**:
-- Aligns with how the TS SDK frames transports
-- stdio for local MCP client integration (Claude Code, etc.)
-- Streamable HTTP ready for remote/cloud deployment scenarios
-- Easy to add new transports (WebSocket, etc.) following the pattern
-**Structure**:
-```
-src/transports/
-├── index.ts      # Transport factory based on env config
-├── stdio.ts      # StdioServerTransport wrapper
-└── http.ts       # StreamableHTTPServerTransport wrapper
-```
-### Decision 4: Project Structure
-**What**: Organize code into modular directories by concern.
-```
-src/
-├── index.ts           # Entry point, bootstrap
-├── server.ts          # MCP server factory
-├── config.ts          # Zod-validated configuration
-├── logger.ts          # Structured logging with redaction
-├── transports/        # Transport implementations
-│   ├── index.ts       # Factory
-│   ├── stdio.ts
-│   └── http.ts
-├── tools/             # Tool definitions
-│   ├── index.ts       # Registry
-│   └── hello.ts       # Demo tool
-├── schemas/           # Shared Zod schemas
-└── types/             # TypeScript type definitions
-```
-**Why**:
-- Follows patterns from official MCP reference servers
-- Tools directory with registry enables easy addition of new tools
-- Transports directory encapsulates transport-specific logic
-- Separation of concerns for maintainability
-### Decision 5: Self-Describing Tool Registry
-**What**: Each tool exports a standard interface; registry auto-discovers and registers tools.
-```typescript
-// Tool interface
-export interface McpTool<TInput> {
-  name: string;
-  description: string;
-  inputSchema: z.ZodType<TInput>;
-  handler: (input: TInput) => Promise<ToolResult>;
-}
-// tools/index.ts exports all tools
-export const tools: McpTool<unknown>[] = [helloTool, /* future tools */];
-```
-**Why**:
-- Single source of truth: Zod schema generates both TS types and JSON Schema for MCP
-- Consistent pattern for adding new tools
-- Easy to test tools in isolation
-- Self-documenting tool capabilities
-### Decision 6: Structured Logging
-**What**: Use structured JSON logging with automatic secret redaction.
-**Why**:
-- Machine-parseable logs for production observability
-- Secret redaction prevents accidental credential exposure
-- Configurable log levels via environment
-**Pattern**:
-```typescript
-// Redact known sensitive fields
-const REDACT_KEYS = ['password', 'token', 'secret', 'apiKey', 'authorization'];
-```
-### Decision 7: Vitest for Testing
-**What**: Use Vitest for unit and integration tests.
-**Why**:
-- Native ESM support
-- Fast execution with built-in watch mode
-- TypeScript support out of box
-- Compatible with Node.js test patterns
-**Test Strategy**:
-- **Unit tests**: Test tool handlers in isolation with mocked inputs
-- **Integration tests**: Spawn actual server, connect via StdioServerTransport, execute tools end-to-end
-**Alternatives considered**:
-- Jest: Requires more ESM configuration
-- Node.js built-in test runner: Less mature feature set
-### Decision 8: tsx for Development
-**What**: Use `tsx` for development with watch mode.
-**Why**:
-- Instant TypeScript execution without separate compile step
-- Excellent watch mode for hot-reloading during development
-- Works seamlessly with ESM
-### Decision 9: Package.json Hygiene
-**What**: Proper ESM package configuration with exports map and bin entry.
-```json
-{
-  "type": "module",
-  "exports": {
-    ".": "./dist/index.js"
-  },
-  "bin": {
-    "testing-mcp": "./dist/index.js"
-  }
-}
-```
-**Why**:
-- Enables `npx testing-mcp` execution
-- Proper exports for potential future library consumption
-- Clear module resolution
-## Risks / Trade-offs
-| Risk | Impact | Mitigation |
-|------|--------|------------|
-| SDK breaking changes | Medium | Pin SDK version, update intentionally |
-| Node.js 22.18.0+ requirement | Low | Recent LTS, document flag for earlier 22.x |
-| HTTP transport without auth | Medium | Document as internal-only; auth in future proposal |
-| Structured logging overhead | Low | Negligible for typical tool execution volumes |
-## Migration Plan
-Not applicable - this is a greenfield implementation.
-## Open Questions
-None - ready for implementation approval.

package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/proposal.md DELETED Viewed

@@ -1,41 +0,0 @@
-# Change: Add MCP Server Foundation
-## Why
-This project needs a foundational MCP server implementation to support web application testing workflows. Starting with a well-structured foundation enables incremental addition of testing-specific tools while maintaining clean architecture and consistent patterns from the start.
-## What Changes
-### Core Infrastructure
-- Initialize TypeScript project with modern tooling (Node.js >= 22.18.0 recommended, TypeScript 5.x, ESM modules)
-- Implement MCP server using the official `@modelcontextprotocol/sdk` with Zod schema validation
-- Create project structure optimized for web testing tool development
-### Pluggable Transport Layer
-- Abstract transport selection behind environment config (`TRANSPORT=stdio|http`, `PORT=...`)
-- Implement stdio transport for local dev/integration
-- Implement Streamable HTTP transport ready for remote deployment
-- Transport modules in `src/transports/` following SDK patterns
-### Self-Describing Tool Registry
-- Create tool registry pattern in `src/tools/index.ts`
-- Each tool exports `{ name, description, inputSchema, handler }`
-- Generate MCP tool input JSON schema from Zod (single source of truth)
-- Add "hello" demonstration tool following the registry pattern
-### Production-Ready Basics
-- Structured logging with secret redaction
-- Graceful shutdown handling (SIGINT/SIGTERM)
-- Config validation via Zod at startup
-- Package.json hygiene: `"type":"module"`, exports map, bin entry for CLI usage
-### Testing Infrastructure
-- Unit tests with Vitest
-- Integration tests that spawn server and speak MCP protocol end-to-end
-- Test tool execution and error handling via StdioServerTransport
-## Impact
-- Affected specs: `mcp-server-core` (new capability)
-- Affected code: New project structure in `src/`, configuration files at root
-- Dependencies: `@modelcontextprotocol/sdk`, `zod`, TypeScript toolchain