npm - retestkit - Versions diffs - 1.4.1 - Mend

retestkit 1.4.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (327) hide show

package/.claude/commands/openspec/apply.md +23 -0
package/.claude/commands/openspec/archive.md +27 -0
package/.claude/commands/openspec/proposal.md +28 -0
package/.gemini/commands/openspec/apply.toml +21 -0
package/.gemini/commands/openspec/archive.toml +25 -0
package/.gemini/commands/openspec/proposal.toml +26 -0
package/.github/prompts/openspec-apply.prompt.md +22 -0
package/.github/prompts/openspec-archive.prompt.md +26 -0
package/.github/prompts/openspec-proposal.prompt.md +27 -0
package/.github/workflows/release.yml +33 -0
package/.kilocode/workflows/openspec-apply.md +17 -0
package/.kilocode/workflows/openspec-archive.md +21 -0
package/.kilocode/workflows/openspec-proposal.md +22 -0
package/.mcp.json +23 -0
package/.opencode/command/openspec-apply.md +25 -0
package/.opencode/command/openspec-archive.md +28 -0
package/.opencode/command/openspec-proposal.md +30 -0
package/.roo/commands/openspec-apply.md +20 -0
package/.roo/commands/openspec-archive.md +24 -0
package/.roo/commands/openspec-proposal.md +25 -0
package/.vscode/mcp.json +23 -0
package/AGENTS.md +18 -0
package/CLAUDE.md +18 -0
package/LICENSE +65 -0
package/README.md +303 -0
package/dist/config.d.ts +4 -0
package/dist/config.d.ts.map +1 -0
package/dist/config.js +27 -0
package/dist/config.js.map +1 -0
package/dist/elicitation/index.d.ts +17 -0
package/dist/elicitation/index.d.ts.map +1 -0
package/dist/elicitation/index.js +118 -0
package/dist/elicitation/index.js.map +1 -0
package/dist/elicitation/types.d.ts +35 -0
package/dist/elicitation/types.d.ts.map +1 -0
package/dist/elicitation/types.js +39 -0
package/dist/elicitation/types.js.map +1 -0
package/dist/index.d.ts +3 -0
package/dist/index.d.ts.map +1 -0
package/dist/index.js +76 -0
package/dist/index.js.map +1 -0
package/dist/lifecycle/index.d.ts +31 -0
package/dist/lifecycle/index.d.ts.map +1 -0
package/dist/lifecycle/index.js +61 -0
package/dist/lifecycle/index.js.map +1 -0
package/dist/logger.d.ts +21 -0
package/dist/logger.d.ts.map +1 -0
package/dist/logger.js +182 -0
package/dist/logger.js.map +1 -0
package/dist/playwright-client/index.d.ts +29 -0
package/dist/playwright-client/index.d.ts.map +1 -0
package/dist/playwright-client/index.js +288 -0
package/dist/playwright-client/index.js.map +1 -0
package/dist/playwright-client/types.d.ts +44 -0
package/dist/playwright-client/types.d.ts.map +1 -0
package/dist/playwright-client/types.js +49 -0
package/dist/playwright-client/types.js.map +1 -0
package/dist/progress/index.d.ts +39 -0
package/dist/progress/index.d.ts.map +1 -0
package/dist/progress/index.js +106 -0
package/dist/progress/index.js.map +1 -0
package/dist/progress/types.d.ts +24 -0
package/dist/progress/types.d.ts.map +1 -0
package/dist/progress/types.js +2 -0
package/dist/progress/types.js.map +1 -0
package/dist/prompts/index.d.ts +19 -0
package/dist/prompts/index.d.ts.map +1 -0
package/dist/prompts/index.js +207 -0
package/dist/prompts/index.js.map +1 -0
package/dist/prompts/loader.d.ts +20 -0
package/dist/prompts/loader.d.ts.map +1 -0
package/dist/prompts/loader.js +47 -0
package/dist/prompts/loader.js.map +1 -0
package/dist/resources/index.d.ts +27 -0
package/dist/resources/index.d.ts.map +1 -0
package/dist/resources/index.js +186 -0
package/dist/resources/index.js.map +1 -0
package/dist/resources/subscriptions.d.ts +10 -0
package/dist/resources/subscriptions.d.ts.map +1 -0
package/dist/resources/subscriptions.js +23 -0
package/dist/resources/subscriptions.js.map +1 -0
package/dist/sampling/index.d.ts +11 -0
package/dist/sampling/index.d.ts.map +1 -0
package/dist/sampling/index.js +201 -0
package/dist/sampling/index.js.map +1 -0
package/dist/sampling/prompts.d.ts +56 -0
package/dist/sampling/prompts.d.ts.map +1 -0
package/dist/sampling/prompts.js +124 -0
package/dist/sampling/prompts.js.map +1 -0
package/dist/sampling/types.d.ts +57 -0
package/dist/sampling/types.d.ts.map +1 -0
package/dist/sampling/types.js +2 -0
package/dist/sampling/types.js.map +1 -0
package/dist/schemas/config.d.ts +40 -0
package/dist/schemas/config.d.ts.map +1 -0
package/dist/schemas/config.js +30 -0
package/dist/schemas/config.js.map +1 -0
package/dist/security/index.d.ts +38 -0
package/dist/security/index.d.ts.map +1 -0
package/dist/security/index.js +281 -0
package/dist/security/index.js.map +1 -0
package/dist/server.d.ts +9 -0
package/dist/server.d.ts.map +1 -0
package/dist/server.js +142 -0
package/dist/server.js.map +1 -0
package/dist/test-utils/index.d.ts +6 -0
package/dist/test-utils/index.d.ts.map +1 -0
package/dist/test-utils/index.js +6 -0
package/dist/test-utils/index.js.map +1 -0
package/dist/test-utils/mock-context.d.ts +64 -0
package/dist/test-utils/mock-context.d.ts.map +1 -0
package/dist/test-utils/mock-context.js +347 -0
package/dist/test-utils/mock-context.js.map +1 -0
package/dist/test-utils/mock-playwright-client.d.ts +62 -0
package/dist/test-utils/mock-playwright-client.d.ts.map +1 -0
package/dist/test-utils/mock-playwright-client.js +315 -0
package/dist/test-utils/mock-playwright-client.js.map +1 -0
package/dist/tools/index.d.ts +4 -0
package/dist/tools/index.d.ts.map +1 -0
package/dist/tools/index.js +8 -0
package/dist/tools/index.js.map +1 -0
package/dist/tools/webtest/crawl.d.ts +46 -0
package/dist/tools/webtest/crawl.d.ts.map +1 -0
package/dist/tools/webtest/crawl.js +678 -0
package/dist/tools/webtest/crawl.js.map +1 -0
package/dist/tools/webtest/discover-features.d.ts +30 -0
package/dist/tools/webtest/discover-features.d.ts.map +1 -0
package/dist/tools/webtest/discover-features.js +343 -0
package/dist/tools/webtest/discover-features.js.map +1 -0
package/dist/tools/webtest/discover-flows.d.ts +29 -0
package/dist/tools/webtest/discover-flows.d.ts.map +1 -0
package/dist/tools/webtest/discover-flows.js +341 -0
package/dist/tools/webtest/discover-flows.js.map +1 -0
package/dist/tools/webtest/generate-tests.d.ts +54 -0
package/dist/tools/webtest/generate-tests.d.ts.map +1 -0
package/dist/tools/webtest/generate-tests.js +364 -0
package/dist/tools/webtest/generate-tests.js.map +1 -0
package/dist/tools/webtest/index.d.ts +8 -0
package/dist/tools/webtest/index.d.ts.map +1 -0
package/dist/tools/webtest/index.js +8 -0
package/dist/tools/webtest/index.js.map +1 -0
package/dist/tools/webtest/run-test-case.d.ts +28 -0
package/dist/tools/webtest/run-test-case.d.ts.map +1 -0
package/dist/tools/webtest/run-test-case.js +420 -0
package/dist/tools/webtest/run-test-case.js.map +1 -0
package/dist/tools/webtest/schemas.d.ts +175 -0
package/dist/tools/webtest/schemas.d.ts.map +1 -0
package/dist/tools/webtest/schemas.js +156 -0
package/dist/tools/webtest/schemas.js.map +1 -0
package/dist/tools/webtest/start-analysis.d.ts +16 -0
package/dist/tools/webtest/start-analysis.d.ts.map +1 -0
package/dist/tools/webtest/start-analysis.js +137 -0
package/dist/tools/webtest/start-analysis.js.map +1 -0
package/dist/transports/http.d.ts +8 -0
package/dist/transports/http.d.ts.map +1 -0
package/dist/transports/http.js +9 -0
package/dist/transports/http.js.map +1 -0
package/dist/transports/index.d.ts +14 -0
package/dist/transports/index.d.ts.map +1 -0
package/dist/transports/index.js +20 -0
package/dist/transports/index.js.map +1 -0
package/dist/transports/stdio.d.ts +4 -0
package/dist/transports/stdio.d.ts.map +1 -0
package/dist/transports/stdio.js +6 -0
package/dist/transports/stdio.js.map +1 -0
package/dist/types/capabilities.d.ts +18 -0
package/dist/types/capabilities.d.ts.map +1 -0
package/dist/types/capabilities.js +35 -0
package/dist/types/capabilities.js.map +1 -0
package/dist/types/context.d.ts +20 -0
package/dist/types/context.d.ts.map +1 -0
package/dist/types/context.js +2 -0
package/dist/types/context.js.map +1 -0
package/dist/types/tool.d.ts +10 -0
package/dist/types/tool.d.ts.map +1 -0
package/dist/types/tool.js +2 -0
package/dist/types/tool.js.map +1 -0
package/dist/workspace/index.d.ts +99 -0
package/dist/workspace/index.d.ts.map +1 -0
package/dist/workspace/index.js +648 -0
package/dist/workspace/index.js.map +1 -0
package/dist/workspace/markdown.d.ts +50 -0
package/dist/workspace/markdown.d.ts.map +1 -0
package/dist/workspace/markdown.js +210 -0
package/dist/workspace/markdown.js.map +1 -0
package/dist/workspace/types.d.ts +173 -0
package/dist/workspace/types.d.ts.map +1 -0
package/dist/workspace/types.js +2 -0
package/dist/workspace/types.js.map +1 -0
package/openspec/AGENTS.md +456 -0
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/proposal.md +33 -0
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/specs/webtest-resources/spec.md +27 -0
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/specs/webtest-tools/spec.md +304 -0
package/openspec/changes/archive/2025-12-18-add-hybrid-artifact-paths/tasks.md +43 -0
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/design.md +209 -0
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/proposal.md +41 -0
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/specs/mcp-server-core/spec.md +183 -0
package/openspec/changes/archive/2025-12-18-add-mcp-server-foundation/tasks.md +112 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/design.md +333 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/proposal.md +66 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/mcp-server-core/spec.md +129 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-lifecycle/spec.md +138 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-logging/spec.md +211 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-prompts/spec.md +157 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-resources/spec.md +213 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-sampling/spec.md +257 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-tools/spec.md +501 -0
package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/tasks.md +264 -0
package/openspec/changes/archive/2025-12-18-allow-analysis-of-incomplete-crawls/proposal.md +24 -0
package/openspec/changes/archive/2025-12-18-allow-analysis-of-incomplete-crawls/specs/webtest-tools/spec.md +80 -0
package/openspec/changes/archive/2025-12-18-allow-analysis-of-incomplete-crawls/tasks.md +8 -0
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/design.md +90 -0
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/proposal.md +28 -0
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/specs/webtest-sampling/spec.md +90 -0
package/openspec/changes/archive/2025-12-18-fix-crawl-loop-stability/tasks.md +33 -0
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/design.md +558 -0
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/proposal.md +119 -0
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/specs/webtest-resources/spec.md +109 -0
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/specs/webtest-tools/spec.md +121 -0
package/openspec/changes/archive/2025-12-18-use-markdown-artifacts/tasks.md +133 -0
package/openspec/changes/extract-prompts-to-markdown/design.md +86 -0
package/openspec/changes/extract-prompts-to-markdown/proposal.md +50 -0
package/openspec/changes/extract-prompts-to-markdown/specs/webtest-prompts/spec.md +74 -0
package/openspec/changes/extract-prompts-to-markdown/tasks.md +40 -0
package/openspec/changes/refactor-webtest-naming/design.md +95 -0
package/openspec/changes/refactor-webtest-naming/proposal.md +66 -0
package/openspec/changes/refactor-webtest-naming/specs/webtest-prompts/spec.md +79 -0
package/openspec/changes/refactor-webtest-naming/specs/webtest-resources/spec.md +80 -0
package/openspec/changes/refactor-webtest-naming/specs/webtest-sampling/spec.md +122 -0
package/openspec/changes/refactor-webtest-naming/specs/webtest-tools/spec.md +113 -0
package/openspec/changes/refactor-webtest-naming/tasks.md +119 -0
package/openspec/changes/rename-package-to-retest/proposal.md +52 -0
package/openspec/changes/rename-package-to-retest/specs/mcp-server-core/spec.md +53 -0
package/openspec/changes/rename-package-to-retest/specs/retest-lifecycle/spec.md +68 -0
package/openspec/changes/rename-package-to-retest/specs/retest-logging/spec.md +35 -0
package/openspec/changes/rename-package-to-retest/specs/retest-prompts/spec.md +159 -0
package/openspec/changes/rename-package-to-retest/specs/retest-resources/spec.md +251 -0
package/openspec/changes/rename-package-to-retest/specs/retest-sampling/spec.md +99 -0
package/openspec/changes/rename-package-to-retest/specs/retest-tools/spec.md +295 -0
package/openspec/changes/rename-package-to-retest/tasks.md +71 -0
package/openspec/project.md +31 -0
package/openspec/specs/mcp-server-core/spec.md +178 -0
package/openspec/specs/webtest-lifecycle/spec.md +136 -0
package/openspec/specs/webtest-logging/spec.md +209 -0
package/openspec/specs/webtest-prompts/spec.md +155 -0
package/openspec/specs/webtest-resources/spec.md +248 -0
package/openspec/specs/webtest-sampling/spec.md +344 -0
package/openspec/specs/webtest-tools/spec.md +282 -0
package/package.json +54 -0
package/release.config.js +9 -0
package/src/config.test.ts +96 -0
package/src/config.ts +32 -0
package/src/elicitation/index.test.ts +399 -0
package/src/elicitation/index.ts +171 -0
package/src/elicitation/types.ts +68 -0
package/src/index.ts +83 -0
package/src/lifecycle/index.test.ts +260 -0
package/src/lifecycle/index.ts +101 -0
package/src/logger.redaction.test.ts +322 -0
package/src/logger.test.ts +123 -0
package/src/logger.ts +229 -0
package/src/playwright-client/index.ts +392 -0
package/src/playwright-client/types.ts +99 -0
package/src/progress/index.test.ts +327 -0
package/src/progress/index.ts +170 -0
package/src/progress/types.ts +25 -0
package/src/prompts/index.test.ts +451 -0
package/src/prompts/index.ts +246 -0
package/src/prompts/loader.test.ts +100 -0
package/src/prompts/loader.ts +59 -0
package/src/prompts/templates/mcp/webtest-crawl.md +7 -0
package/src/prompts/templates/mcp/webtest-discover-flows.md +11 -0
package/src/prompts/templates/mcp/webtest-discover.md +12 -0
package/src/prompts/templates/mcp/webtest-full-workflow.md +12 -0
package/src/prompts/templates/mcp/webtest-generate-tests.md +11 -0
package/src/prompts/templates/mcp/webtest-run-test.md +11 -0
package/src/prompts/templates/mcp/webtest-start.md +8 -0
package/src/prompts/templates/sampling/crawl-action.md +35 -0
package/src/prompts/templates/sampling/feature-discovery.md +27 -0
package/src/prompts/templates/sampling/flow-discovery.md +29 -0
package/src/prompts/templates/sampling/page-content-wrapper.md +5 -0
package/src/prompts/templates/sampling/system-prefix.md +12 -0
package/src/prompts/templates/sampling/test-evaluation.md +17 -0
package/src/prompts/templates/sampling/test-generation.md +31 -0
package/src/resources/index.ts +250 -0
package/src/resources/subscriptions.ts +37 -0
package/src/sampling/index.test.ts +414 -0
package/src/sampling/index.ts +286 -0
package/src/sampling/prompts.ts +194 -0
package/src/sampling/types.ts +60 -0
package/src/schemas/config.ts +39 -0
package/src/security/index.test.ts +441 -0
package/src/security/index.ts +361 -0
package/src/security/security-scenarios.test.ts +468 -0
package/src/server.ts +211 -0
package/src/test-utils/index.ts +6 -0
package/src/test-utils/mock-context.ts +426 -0
package/src/test-utils/mock-playwright-client.ts +422 -0
package/src/tools/index.ts +11 -0
package/src/tools/webtest/crawl.test.ts +834 -0
package/src/tools/webtest/crawl.ts +901 -0
package/src/tools/webtest/discover-features.ts +412 -0
package/src/tools/webtest/discover-flows.ts +408 -0
package/src/tools/webtest/generate-tests.test.ts +532 -0
package/src/tools/webtest/generate-tests.ts +425 -0
package/src/tools/webtest/index.ts +7 -0
package/src/tools/webtest/integration.test.ts +536 -0
package/src/tools/webtest/run-test-case.test.ts +659 -0
package/src/tools/webtest/run-test-case.ts +508 -0
package/src/tools/webtest/schemas.ts +201 -0
package/src/tools/webtest/start-analysis.test.ts +151 -0
package/src/tools/webtest/start-analysis.ts +158 -0
package/src/transports/http.ts +19 -0
package/src/transports/index.ts +30 -0
package/src/transports/stdio.ts +7 -0
package/src/types/capabilities.test.ts +193 -0
package/src/types/capabilities.ts +50 -0
package/src/types/context.ts +21 -0
package/src/types/tool.ts +11 -0
package/src/workspace/index.ts +945 -0
package/src/workspace/markdown.ts +272 -0
package/src/workspace/types.ts +186 -0
package/tests/integration/server.test.ts +89 -0
package/tests/integration/tools.test.ts +99 -0
package/tsconfig.json +20 -0
package/vitest.config.ts +9 -0
package/vitest.integration.config.ts +10 -0

package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/proposal.md ADDED Viewed

@@ -0,0 +1,66 @@
+# Change: Add Dynamic Web Testing Orchestrator
+## Why
+The current MCP server provides only a basic "hello" demonstration tool. To fulfill its purpose as a web testing server, it needs to orchestrate dynamic web application exploration, analysis, test generation, and test execution. By leveraging the full MCP protocol (sampling, elicitation, resources, prompts, progress, cancellation) and integrating with Playwright MCP for browser automation, the server can provide AI-powered exploratory testing capabilities where the LLM reasoning happens client-side via MCP Sampling.
+## What Changes
+### New Capabilities
+- **MCP Lifecycle & Capability Negotiation**: Proper initialize/operate/shutdown lifecycle with runtime capability detection (sampling, elicitation, logging, progress, resources.listChanged, resources.subscribe)
+- **Protocol Version Requirements**: Requires MCP protocol revision 2025-06-18+ for elicitation; graceful degradation for older clients
+- **Webtest Tools**: Five orchestration tools for the testing workflow:
+  1. `webtest_init` - Initialize analysis workspace
+  2. `webtest_crawl_app` - Dynamic goal-directed exploration with checkpointing and loop detection
+  3. `webtest_analyze_app` - Reverse-engineer app structure and flows
+  4. `webtest_generate_tests` - Produce test cases from analysis
+  5. `webtest_run_tests` - Execute tests with evidence capture
+- **Webtest Resources**: Stable `webtest://` URI-based artifacts with **listChanged/subscribe** support for live artifact surfacing during long operations
+- **Webtest Prompts**: Prompt templates for smooth client UX
+- **MCP Sampling Integration**: Client-controlled LLM reasoning for all AI decisions with **fallback mode** when sampling unavailable
+- **Elicitation Support**: Interactive user decisions during crawl with **fallback mode** when elicitation unavailable
+- **Progress & Cancellation**: Long-running operations report progress (with budget status) and respond to cancellation
+- **Playwright MCP Integration**: Orchestration with **dynamic tool discovery and capability adapter** (version/implementation resilient)
+- **Structured Logging**: MCP logging notifications with correlation IDs, log level control, and sensitive data redaction
+### **BREAKING** Changes
+- Removes `hello` tool (demonstration no longer needed)
+- Server now requires Playwright MCP server as external dependency
+### Security Additions
+- Domain allowlist enforcement with subdomain support
+- **Comprehensive prompt injection hardening** with defense-in-depth:
+  - Untrusted page content demarcation
+  - Protected system instruction prefix
+  - Scope expansion detection and blocking
+  - Data exfiltration pattern blocking
+  - Audit logging of all sampling I/O
+- **Injection test suite** validating resistance to direct/indirect injection, goal hijacking, credential phishing
+- Sensitive data redaction in logs (URLs, cookies, passwords)
+- Never requests credentials via elicitation
+### Robustness Additions
+- **Crawl checkpointing** every N steps with resume support
+- **Loop detection and prevention**: DOM signature tracking, URL cycle detection, action repeat blocking
+- **Budget enforcement**: maxSteps, maxMinutes, maxPages limits with graceful partial output
+## Key Features Summary
+| Feature | Description |
+|---------|-------------|
+| Resources listChanged/subscribe | Surface new artifacts live during crawl/test execution |
+| Runtime fallbacks | Graceful degradation when Sampling/Elicitation unsupported |
+| Playwright MCP adapter | Dynamic tool discovery; version/implementation resilient |
+| Sampling injection hardening | Defense-in-depth with audit logging and test suite |
+| Crawl checkpointing | Resume interrupted crawls; partial results on timeout |
+| Loop prevention | DOM signatures, URL cycles, action repeats detected |
+## Impact
+- Affected specs: `mcp-server-core` (lifecycle changes), plus new specs for `webtest-tools`, `webtest-resources`, `webtest-prompts`, `webtest-sampling`, `webtest-lifecycle`, `webtest-logging`
+- Affected code: `src/server.ts`, `src/tools/`, new directories for resources/prompts/sampling/lifecycle/logging
+- External dependencies: Playwright MCP server (Microsoft or compatible implementation)

package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/mcp-server-core/spec.md ADDED Viewed

@@ -0,0 +1,129 @@
+# mcp-server-core Spec Delta
+## REMOVED Requirements
+### Requirement: Hello Tool Implementation
+**Reason**: The demonstration "hello" tool is no longer needed now that real webtest tools are implemented.
+**Migration**: Remove `src/tools/hello.ts` and `src/tools/hello.test.ts`. Update tool registry to exclude hello tool.
+## MODIFIED Requirements
+### Requirement: MCP Server Initialization
+The system SHALL provide an MCP server that initializes with proper identification, negotiates client capabilities, and connects to the configured transport.
+#### Scenario: Server starts with stdio transport
+- **GIVEN** the environment variable `TRANSPORT` is set to `stdio` or not set
+- **WHEN** the server entry point is executed
+- **THEN** it SHALL identify itself with name "testing-mcp" and version from package.json
+- **AND** it SHALL connect to stdio transport for communication
+#### Scenario: Server starts with HTTP transport
+- **GIVEN** the environment variable `TRANSPORT` is set to `http`
+- **AND** the environment variable `PORT` is set to a valid port number
+- **WHEN** the server entry point is executed
+- **THEN** it SHALL start a Streamable HTTP server on the specified port
+- **AND** it SHALL accept MCP protocol connections over HTTP
+#### Scenario: Server handles graceful shutdown
+- **GIVEN** the server is running
+- **WHEN** the process receives SIGINT or SIGTERM
+- **THEN** the server SHALL disconnect gracefully
+- **AND** any active Playwright MCP subprocess SHALL be terminated
+- **AND** the process SHALL exit with code 0
+#### Scenario: Server negotiates client capabilities
+- **GIVEN** a client connects to the server
+- **WHEN** the initialize handshake completes
+- **THEN** the server SHALL record client capabilities for sampling, elicitation, logging, and progress
+- **AND** the server SHALL adapt runtime behavior based on available capabilities
+## MODIFIED Requirements
+### Requirement: Configuration Validation
+The system SHALL validate configuration at startup using Zod schemas and fail fast on invalid configuration, including webtest-specific settings.
+#### Scenario: Valid configuration starts server
+- **GIVEN** all required environment variables are valid
+- **WHEN** the server starts
+- **THEN** configuration SHALL be parsed and validated
+- **AND** the server SHALL proceed with initialization
+#### Scenario: Invalid configuration fails fast
+- **GIVEN** an environment variable has an invalid value (e.g., `PORT=invalid`)
+- **WHEN** the server attempts to start
+- **THEN** it SHALL log a descriptive error message
+- **AND** the process SHALL exit with a non-zero code
+#### Scenario: Webtest workspace configuration is validated
+- **GIVEN** the environment variable `WEBTEST_WORKSPACE_DIR` is set
+- **WHEN** the server starts
+- **THEN** it SHALL validate the path is writable
+- **AND** create the directory if it does not exist
+## MODIFIED Requirements
+### Requirement: Self-Describing Tool Registry
+The system SHALL maintain a tool registry where each tool exports a standard interface including name, description, Zod input schema, and async handler function, supporting the webtest tool namespace.
+#### Scenario: Tool is registered and discoverable
+- **GIVEN** a tool is added to the registry
+- **WHEN** an MCP client requests the tool list
+- **THEN** the tool SHALL appear in the list with its name and description
+- **AND** the input JSON Schema SHALL be generated from the Zod schema
+#### Scenario: New tool follows registry pattern
+- **GIVEN** a developer creates a new tool
+- **WHEN** the tool exports `{ name, description, inputSchema, handler }`
+- **AND** the tool is added to the registry index
+- **THEN** it SHALL be automatically registered with the MCP server
+#### Scenario: Webtest tools use namespaced naming
+- **GIVEN** the webtest tools are registered
+- **WHEN** an MCP client requests the tool list
+- **THEN** tools SHALL appear with `webtest_` prefix (e.g., `webtest_init`, `webtest_crawl_app`)
+## MODIFIED Requirements
+### Requirement: Structured Logging
+The system SHALL provide structured JSON logging with configurable log levels, automatic redaction of sensitive fields, and optional emission as MCP logging notifications.
+#### Scenario: Log output is structured JSON
+- **GIVEN** the server is running
+- **WHEN** a log event occurs
+- **THEN** it SHALL be output as a JSON object with timestamp, level, and message fields
+#### Scenario: Sensitive fields are redacted
+- **GIVEN** a log message contains a field matching a sensitive key pattern (password, token, secret, apiKey, authorization)
+- **WHEN** the log is written
+- **THEN** the sensitive field value SHALL be replaced with "[REDACTED]"
+#### Scenario: Log level is configurable
+- **GIVEN** the environment variable `LOG_LEVEL` is set to a valid level (debug, info, warn, error)
+- **WHEN** the server starts
+- **THEN** only log messages at or above that level SHALL be output
+#### Scenario: Logs are emitted as MCP notifications when supported
+- **GIVEN** the client supports MCP logging notifications
+- **WHEN** a log event occurs
+- **THEN** it SHALL be emitted as a `notifications/message` to the client
+- **AND** the log level SHALL map to MCP log levels (debug, info, warning, error)

package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-lifecycle/spec.md ADDED Viewed

@@ -0,0 +1,138 @@
+# webtest-lifecycle Specification
+## Purpose
+Defines the MCP lifecycle management and capability negotiation for the web testing server.
+## ADDED Requirements
+### Requirement: MCP Protocol Version Requirements
+The system SHALL require MCP protocol revision 2025-06-18 or later to ensure elicitation support is available.
+#### Scenario: Server declares required protocol version
+- **GIVEN** the server starts
+- **WHEN** it responds to initialize request
+- **THEN** it SHALL declare `protocolVersion: "2025-06-18"` or later
+- **AND** include elicitation in server capabilities
+#### Scenario: Client with older protocol version
+- **GIVEN** a client connects with protocol version older than 2025-06-18
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that elicitation is NOT available
+- **AND** log a warning about degraded functionality
+#### Scenario: Protocol version mismatch handling
+- **GIVEN** a client requests a protocol version the server cannot satisfy
+- **WHEN** version negotiation occurs
+- **THEN** the server SHALL negotiate to the highest mutually supported version
+- **AND** adjust available features accordingly
+### Requirement: MCP Lifecycle Management
+The system SHALL implement proper MCP lifecycle phases (initialize, operate, shutdown) and maintain lifecycle state.
+#### Scenario: Server transitions through lifecycle phases
+- **GIVEN** a client connects to the server
+- **WHEN** the connection is established
+- **THEN** the server SHALL be in "initializing" state
+- **AND** after successful initialize handshake, transition to "operating" state
+- **AND** on shutdown signal, transition to "shutdown" state
+#### Scenario: Server rejects operations before initialization
+- **GIVEN** the server is in "initializing" state
+- **WHEN** a client sends a tool call request
+- **THEN** the server SHALL return an error indicating initialization not complete
+#### Scenario: Server rejects new operations during shutdown
+- **GIVEN** the server is in "shutdown" state
+- **WHEN** a client sends a new tool call request
+- **THEN** the server SHALL return an error indicating server is shutting down
+### Requirement: Client Capability Negotiation
+The system SHALL query and record client capabilities during initialization and adapt behavior accordingly.
+#### Scenario: Server records sampling capability
+- **GIVEN** a client connects with `capabilities.sampling` present
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that sampling is available
+- **AND** webtest tools SHALL use `sampling/createMessage` for LLM reasoning
+#### Scenario: Server records elicitation capability
+- **GIVEN** a client connects with `capabilities.elicitation` present
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that elicitation is available
+- **AND** webtest tools SHALL use `elicitation/create` for user decisions
+#### Scenario: Server records logging capability
+- **GIVEN** a client connects with `capabilities.logging` present
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that logging notifications are supported
+- **AND** the logger SHALL emit `notifications/message` to the client
+#### Scenario: Server records progress capability
+- **GIVEN** a client connects with MCP progress support
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that progress notifications are supported
+- **AND** long-running tools SHALL emit `notifications/progress`
+#### Scenario: Server records resources listChanged capability
+- **GIVEN** a client connects with `capabilities.resources.listChanged` present
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that resource list change notifications are supported
+- **AND** resource creation SHALL emit `notifications/resources/list_changed`
+#### Scenario: Server records resources subscribe capability
+- **GIVEN** a client connects with `capabilities.resources.subscribe` present
+- **WHEN** initialization completes
+- **THEN** the server SHALL record that resource subscriptions are supported
+- **AND** resource updates SHALL emit `notifications/resources/updated` to subscribers
+#### Scenario: Fallback when sampling not supported
+- **GIVEN** a client connects without `capabilities.sampling`
+- **WHEN** a webtest tool requires LLM reasoning
+- **THEN** the tool SHALL return a prompt resource for manual execution
+- **AND** the tool output SHALL include `needsManualInput: true`
+#### Scenario: Fallback when elicitation not supported
+- **GIVEN** a client connects without `capabilities.elicitation`
+- **WHEN** a webtest tool needs user decision
+- **THEN** the tool SHALL include questions in its output
+- **AND** the tool output SHALL include `needsInput: true` with question details
+### Requirement: Capability Query API
+The system SHALL provide internal APIs for tools to query client capabilities at runtime.
+#### Scenario: Tool queries sampling availability
+- **GIVEN** a tool handler is executing
+- **WHEN** it calls `capabilities.hasSampling()`
+- **THEN** it SHALL receive a boolean indicating sampling support
+#### Scenario: Tool queries elicitation availability
+- **GIVEN** a tool handler is executing
+- **WHEN** it calls `capabilities.hasElicitation()`
+- **THEN** it SHALL receive a boolean indicating elicitation support
+#### Scenario: Tool queries all capabilities
+- **GIVEN** a tool handler is executing
+- **WHEN** it calls `capabilities.getAll()`
+- **THEN** it SHALL receive an object with all recorded capabilities

package/openspec/changes/archive/2025-12-18-add-webtest-orchestrator/specs/webtest-logging/spec.md ADDED Viewed

@@ -0,0 +1,211 @@
+# webtest-logging Specification
+## Purpose
+Defines structured logging with MCP logging notifications, correlation IDs, sensitive data redaction, and log level control.
+## ADDED Requirements
+### Requirement: MCP Logging Notifications
+The system SHALL emit structured logs as MCP logging notifications when the client supports it.
+#### Scenario: Log emitted as MCP notification
+- **GIVEN** the client supports MCP logging (`capabilities.logging` present)
+- **WHEN** a log event occurs
+- **THEN** it SHALL emit `notifications/message` with:
+  - `level`: one of "debug", "info", "warning", "error"
+  - `logger`: "webtest"
+  - `data`: structured log payload
+#### Scenario: Fallback to stderr when logging unsupported
+- **GIVEN** the client does not support MCP logging
+- **WHEN** a log event occurs
+- **THEN** it SHALL write to stderr as JSON
+- **AND** not attempt MCP notification
+#### Scenario: Log level is respected
+- **GIVEN** client log level is set to "warning"
+- **WHEN** an "info" level log is generated
+- **THEN** it SHALL NOT be emitted
+- **AND** "warning" and "error" logs SHALL be emitted
+### Requirement: Logging Level Control
+The system SHALL support dynamic log level configuration via MCP and environment.
+#### Scenario: Log level set via environment
+- **GIVEN** environment variable `LOG_LEVEL` is set to "debug"
+- **WHEN** the server starts
+- **THEN** all log levels (debug, info, warning, error) SHALL be emitted
+#### Scenario: Client sets log level via logging/setLevel
+- **GIVEN** client supports `logging/setLevel`
+- **WHEN** client sends `logging/setLevel` with level "error"
+- **THEN** only "error" level logs SHALL be emitted thereafter
+- **AND** this SHALL override the environment setting
+#### Scenario: Default log level
+- **GIVEN** no log level is configured
+- **WHEN** the server starts
+- **THEN** the default log level SHALL be "info"
+### Requirement: Correlation IDs
+The system SHALL include correlation IDs in all log messages to enable tracing across operations.
+#### Scenario: Analysis ID is included in logs
+- **GIVEN** a tool is executing within an analysis context
+- **WHEN** a log is emitted
+- **THEN** it SHALL include `analysisId` in the log data
+#### Scenario: Crawl ID is included in crawl logs
+- **GIVEN** a crawl is in progress
+- **WHEN** a log is emitted during the crawl
+- **THEN** it SHALL include `crawlId` in addition to `analysisId`
+#### Scenario: Test run ID is included in test logs
+- **GIVEN** a test case is being executed
+- **WHEN** a log is emitted during test execution
+- **THEN** it SHALL include `testRunId` in addition to `analysisId`
+#### Scenario: Iteration number is included in loop logs
+- **GIVEN** a crawl or test loop is executing
+- **WHEN** a log is emitted during an iteration
+- **THEN** it SHALL include `iteration` number
+#### Scenario: Request ID is included when available
+- **GIVEN** a tool is handling an MCP request with `_meta.requestId`
+- **WHEN** logs are emitted during that request
+- **THEN** they SHALL include `requestId` for correlation with client logs
+### Requirement: Structured Log Format
+The system SHALL emit logs in a consistent structured format.
+#### Scenario: Log structure is consistent
+- **GIVEN** any log event occurs
+- **WHEN** it is emitted
+- **THEN** it SHALL include:
+  - `timestamp`: ISO 8601 format
+  - `level`: log level
+  - `message`: human-readable message
+  - `context`: object with correlation IDs
+  - `data`: optional additional structured data
+#### Scenario: Playwright MCP tool calls are logged
+- **GIVEN** a Playwright MCP tool is called
+- **WHEN** the call completes
+- **THEN** it SHALL log:
+  - `message`: "Playwright tool executed"
+  - `data.tool`: tool name
+  - `data.duration`: execution time in ms
+  - `data.success`: boolean
+  - `data.error`: error message if failed (sensitive data redacted)
+#### Scenario: Sampling calls are logged
+- **GIVEN** a sampling request is made
+- **WHEN** the request completes
+- **THEN** it SHALL log:
+  - `message`: "Sampling completed"
+  - `data.promptTokens`: approximate prompt size
+  - `data.responseTokens`: approximate response size
+  - `data.duration`: execution time in ms
+  - `data.validationPassed`: boolean
+### Requirement: Sensitive Data Redaction
+The system SHALL redact sensitive data from logs to prevent credential exposure.
+#### Scenario: URL query parameters are redacted
+- **GIVEN** a log includes a URL
+- **WHEN** the URL contains query parameters matching sensitive patterns (token, key, password, secret, auth, session)
+- **THEN** parameter values SHALL be replaced with "[REDACTED]"
+#### Scenario: Cookie values are redacted
+- **GIVEN** a log includes cookie data
+- **WHEN** cookies are serialized
+- **THEN** cookie values SHALL be replaced with "[REDACTED]"
+- **AND** only cookie names SHALL be visible
+#### Scenario: Form input values are redacted
+- **GIVEN** a log includes form interaction (type action)
+- **WHEN** the input is to a password or sensitive field
+- **THEN** the typed value SHALL be replaced with "[REDACTED]"
+#### Scenario: HTML content is truncated
+- **GIVEN** a log includes HTML content
+- **WHEN** the HTML exceeds 500 characters
+- **THEN** it SHALL be truncated with "...[truncated]"
+- **AND** sensitive elements (script, style) SHALL be removed
+#### Scenario: Known sensitive field patterns are redacted
+- **GIVEN** a log data object is being serialized
+- **WHEN** it contains keys matching sensitive patterns:
+  - password, passwd, pwd
+  - token, apiKey, api_key
+  - secret, credential
+  - authorization, auth
+  - session, cookie
+- **THEN** those values SHALL be replaced with "[REDACTED]"
+### Requirement: Operation Step Logging
+The system SHALL log detailed step information for debugging and audit.
+#### Scenario: Crawl step is logged
+- **GIVEN** a crawl iteration completes
+- **WHEN** step logging occurs
+- **THEN** it SHALL log at "debug" level:
+  - Current URL
+  - Action taken (tool + args with sensitive data redacted)
+  - Result summary
+  - Goal progress assessment
+#### Scenario: Test step is logged
+- **GIVEN** a test step executes
+- **WHEN** step logging occurs
+- **THEN** it SHALL log at "info" level:
+  - Step number and description
+  - Actions executed
+  - Pass/fail result
+  - Evidence URIs
+#### Scenario: Elicitation event is logged
+- **GIVEN** elicitation is triggered
+- **WHEN** user response is received
+- **THEN** it SHALL log at "info" level:
+  - Elicitation type (cookie, modal, ambiguous, auth)
+  - Options presented
+  - User selection
+#### Scenario: Security event is logged
+- **GIVEN** a security check fails (domain validation, injection detection)
+- **WHEN** the violation is detected
+- **THEN** it SHALL log at "warning" level:
+  - Violation type
+  - Attempted action (redacted as needed)
+  - Remediation taken