npm - @hustle-together/api-dev-tools - Versions diffs - 3.10.1 → 3.11.1 - Mend

@hustle-together/api-dev-tools 3.10.1 → 3.11.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (96) hide show

package/.claude/api-dev-state.json +159 -0
package/.claude/commands/README.md +185 -0
package/.claude/commands/add-command.md +209 -0
package/.claude/commands/api-create.md +499 -0
package/.claude/commands/api-env.md +50 -0
package/.claude/commands/api-interview.md +331 -0
package/.claude/commands/api-research.md +331 -0
package/.claude/commands/api-status.md +259 -0
package/.claude/commands/api-verify.md +231 -0
package/.claude/commands/beepboop.md +97 -0
package/.claude/commands/busycommit.md +112 -0
package/.claude/commands/commit.md +83 -0
package/.claude/commands/cycle.md +142 -0
package/.claude/commands/gap.md +86 -0
package/.claude/commands/green.md +142 -0
package/.claude/commands/issue.md +192 -0
package/.claude/commands/plan.md +168 -0
package/.claude/commands/pr.md +122 -0
package/.claude/commands/red.md +142 -0
package/.claude/commands/refactor.md +142 -0
package/.claude/commands/spike.md +142 -0
package/.claude/commands/summarize.md +94 -0
package/.claude/commands/tdd.md +144 -0
package/.claude/commands/worktree-add.md +315 -0
package/.claude/commands/worktree-cleanup.md +281 -0
package/.claude/hooks/api-workflow-check.py +227 -0
package/.claude/hooks/enforce-deep-research.py +185 -0
package/.claude/hooks/enforce-disambiguation.py +155 -0
package/.claude/hooks/enforce-documentation.py +192 -0
package/.claude/hooks/enforce-environment.py +253 -0
package/.claude/hooks/enforce-external-research.py +328 -0
package/.claude/hooks/enforce-interview.py +421 -0
package/.claude/hooks/enforce-refactor.py +189 -0
package/.claude/hooks/enforce-research.py +159 -0
package/.claude/hooks/enforce-schema.py +186 -0
package/.claude/hooks/enforce-scope.py +160 -0
package/.claude/hooks/enforce-tdd-red.py +250 -0
package/.claude/hooks/enforce-verify.py +186 -0
package/.claude/hooks/periodic-reground.py +154 -0
package/.claude/hooks/session-startup.py +151 -0
package/.claude/hooks/track-tool-use.py +626 -0
package/.claude/hooks/verify-after-green.py +282 -0
package/.claude/hooks/verify-implementation.py +225 -0
package/.claude/research/index.json +6 -0
package/.claude/settings.json +93 -0
package/.claude/settings.local.json +11 -0
package/.claude-plugin/marketplace.json +112 -0
package/.skills/README.md +291 -0
package/.skills/_shared/convert-commands.py +192 -0
package/.skills/_shared/hooks/api-workflow-check.py +227 -0
package/.skills/_shared/hooks/enforce-deep-research.py +185 -0
package/.skills/_shared/hooks/enforce-disambiguation.py +155 -0
package/.skills/_shared/hooks/enforce-documentation.py +192 -0
package/.skills/_shared/hooks/enforce-environment.py +253 -0
package/.skills/_shared/hooks/enforce-external-research.py +328 -0
package/.skills/_shared/hooks/enforce-interview.py +421 -0
package/.skills/_shared/hooks/enforce-refactor.py +189 -0
package/.skills/_shared/hooks/enforce-research.py +159 -0
package/.skills/_shared/hooks/enforce-schema.py +186 -0
package/.skills/_shared/hooks/enforce-scope.py +160 -0
package/.skills/_shared/hooks/enforce-tdd-red.py +250 -0
package/.skills/_shared/hooks/enforce-verify.py +186 -0
package/.skills/_shared/hooks/periodic-reground.py +154 -0
package/.skills/_shared/hooks/session-startup.py +151 -0
package/.skills/_shared/hooks/track-tool-use.py +626 -0
package/.skills/_shared/hooks/verify-after-green.py +282 -0
package/.skills/_shared/hooks/verify-implementation.py +225 -0
package/.skills/_shared/install.sh +114 -0
package/.skills/_shared/settings.json +93 -0
package/.skills/add-command/SKILL.md +222 -0
package/.skills/api-create/SKILL.md +512 -0
package/.skills/api-env/SKILL.md +63 -0
package/.skills/api-interview/SKILL.md +344 -0
package/.skills/api-research/SKILL.md +344 -0
package/.skills/api-status/SKILL.md +272 -0
package/.skills/api-verify/SKILL.md +244 -0
package/.skills/beepboop/SKILL.md +110 -0
package/.skills/busycommit/SKILL.md +125 -0
package/.skills/commit/SKILL.md +96 -0
package/.skills/cycle/SKILL.md +155 -0
package/.skills/gap/SKILL.md +99 -0
package/.skills/green/SKILL.md +155 -0
package/.skills/issue/SKILL.md +205 -0
package/.skills/plan/SKILL.md +181 -0
package/.skills/pr/SKILL.md +135 -0
package/.skills/red/SKILL.md +155 -0
package/.skills/refactor/SKILL.md +155 -0
package/.skills/spike/SKILL.md +155 -0
package/.skills/summarize/SKILL.md +107 -0
package/.skills/tdd/SKILL.md +157 -0
package/.skills/update-todos/SKILL.md +228 -0
package/.skills/worktree-add/SKILL.md +328 -0
package/.skills/worktree-cleanup/SKILL.md +294 -0
package/CHANGELOG.md +97 -0
package/README.md +58 -17
package/package.json +22 -11

package/.skills/refactor/SKILL.md ADDED Viewed

@@ -0,0 +1,155 @@
+---
+name: refactor
+description: TDD Refactor Phase - improve code structure, readability, and performance while keeping all tests green. Use after tests pass to clean up code. Keywords: tdd, refactoring, cleanup, optimization, quality
+license: MIT
+compatibility: Requires Claude Code with MCP servers (Context7, GitHub), Python 3.9+ for hooks, pnpm 10.11.0+
+metadata:
+  version: "3.0.0"
+  category: "testing"
+  tags: ['tdd', 'refactoring', 'cleanup', 'optimization']
+  author: "Hustle Together"
+allowed-tools: WebSearch WebFetch mcp__context7 mcp__github AskUserQuestion Read Write Edit Bash TodoWrite
+---
+---
+description: Execute TDD Refactor Phase - improve code structure while keeping tests green
+argument-hint: <refactoring description>
+---
+Apply this document (specifically the Refactor phase), to the info given by user input here: $ARGUMENTS
+## General Guidelines
+### Output Style
+- **Never explicitly mention TDD** in code, comments, commits, PRs, or issues
+- Write natural, descriptive code without meta-commentary about the development process
+- The code should speak for itself - TDD is the process, not the product
+(If there was no info above, fallback to the context of the conversation)
+## TDD Fundamentals
+### The TDD Cycle
+The foundation of TDD is the Red-Green-Refactor cycle:
+1. **Red Phase**: Write ONE failing test that describes desired behavior
+   - The test must fail for the RIGHT reason (not syntax/import errors)
+   - Only one test at a time - this is critical for TDD discipline
+     - Exception: For browser-level tests or expensive setup (e.g., Storybook `*.stories.tsx`), group multiple assertions within a single test block to avoid redundant setup - but only when adding assertions to an existing interaction flow. If new user interactions are required, still create a new test. Split files by category if they exceed ~1000 lines.
+   - **Adding a single test to a test file is ALWAYS allowed** - no prior test output needed
+   - Starting TDD for a new feature is always valid, even if test output shows unrelated work
+   - For DOM-based tests, use `data-testid` attributes to select elements rather than CSS classes, tag names, or text content
+   - Avoid hard-coded timeouts both in form of sleep() or timeout: 5000 etc; use proper async patterns (`waitFor`, `findBy*`, event-based sync) instead and rely on global test configs for timeout settings
+2. **Green Phase**: Write MINIMAL code to make the test pass
+   - Implement only what's needed for the current failing test
+   - No anticipatory coding or extra features
+   - Address the specific failure message
+3. **Refactor Phase**: Improve code structure while keeping tests green
+   - Only allowed when relevant tests are passing
+   - Requires proof that tests have been run and are green
+   - Applies to BOTH implementation and test code
+   - No refactoring with failing tests - fix them first
+### Core Violations
+1. **Multiple Test Addition**
+   - Adding more than one new test at once
+   - Exception: Initial test file setup or extracting shared test utilities
+2. **Over-Implementation**
+   - Code that exceeds what's needed to pass the current failing test
+   - Adding untested features, methods, or error handling
+   - Implementing multiple methods when test only requires one
+3. **Premature Implementation**
+   - Adding implementation before a test exists and fails properly
+   - Adding implementation without running the test first
+   - Refactoring when tests haven't been run or are failing
+### Critical Principle: Incremental Development
+Each step in TDD should address ONE specific issue:
+- Test fails "not defined" → Create empty stub/class only
+- Test fails "not a function" → Add method stub only
+- Test fails with assertion → Implement minimal logic only
+### Optional Pre-Phase: Spike Phase
+In rare cases where the problem space, interface, or expected behavior is unclear, a **Spike Phase** may be used **before the Red Phase**.
+This phase is **not part of the regular TDD workflow** and must only be applied under exceptional circumstances.
+- The goal of a Spike is **exploration and learning**, not implementation.
+- The code written during a Spike is **disposable** and **must not** be merged or reused directly.
+- Once sufficient understanding is achieved, all spike code is discarded, and normal TDD resumes starting from the **Red Phase**.
+- A Spike is justified only when it is impossible to define a meaningful failing test due to technical uncertainty or unknown system behavior.
+### General Information
+- Sometimes the test output shows as no tests have been run when a new test is failing due to a missing import or constructor. In such cases, allow the agent to create simple stubs. Ask them if they forgot to create a stub if they are stuck.
+- It is never allowed to introduce new logic without evidence of relevant failing tests. However, stubs and simple implementation to make imports and test infrastructure work is fine.
+- In the refactor phase, it is perfectly fine to refactor both test and implementation code. That said, completely new functionality is not allowed. Types, clean up, abstractions, and helpers are allowed as long as they do not introduce new behavior.
+- Adding types, interfaces, or a constant in order to replace magic values is perfectly fine during refactoring.
+- Provide the agent with helpful directions so that they do not get stuck when blocking them.
+1. **Consistency check** - Look for inconsistent patterns, naming conventions, or structure across the codebase
+## 🛡 Project Rules (Injected into every command)
+1. **NO BROKEN BUILDS:**
+   - Run `pnpm test` before every `/commit`
+   - Ensure all tests pass
+   - Fix any type errors immediately
+2. **API DEVELOPMENT:**
+   - All new APIs MUST have Zod request/response schemas
+   - All APIs MUST be documented in both:
+     - OpenAPI spec ([src/lib/openapi/](src/lib/openapi/))
+     - API test manifest ([src/app/api-test/api-tests-manifest.json](src/app/api-test/api-tests-manifest.json))
+   - Test ALL parameters and edge cases
+   - Include code examples and real-world outputs
+3. **TDD WORKFLOW:**
+   - ALWAYS use /red → /green → /refactor cycle
+   - NEVER write implementation without failing test first
+   - Use /cycle for feature development
+   - Use characterization tests for refactoring
+4. **API KEY MANAGEMENT:**
+   - Support three loading methods:
+     - Server environment variables
+     - NEXT_PUBLIC_ variables (client-side)
+     - Custom headers (X-OpenAI-Key, X-Anthropic-Key, etc.)
+   - Never hardcode API keys
+   - Always validate key availability before use
+5. **COMPREHENSIVE TESTING:**
+   - When researching APIs, read actual implementation code
+   - Discover ALL possible parameters (not just documented ones)
+   - Test with various parameter combinations
+   - Document custom headers, query params, request/response schemas
+   - Include validation rules and testing notes
+6. **NO UI BLOAT:**
+   - This is an API project with minimal frontend
+   - Only keep necessary test/documentation interfaces
+   - Delete unused components immediately
+   - No unnecessary UI libraries or features
+7. **DOCUMENTATION:**
+   - If you change an API, you MUST update:
+     - OpenAPI spec
+     - api-tests-manifest.json
+     - Code examples
+     - Testing notes
+   - Document expected behavior and edge cases
+   - Include real-world output examples

package/.skills/spike/SKILL.md ADDED Viewed

@@ -0,0 +1,155 @@
+---
+name: spike
+description: Execute TDD Spike Phase - exploratory coding to understand problem space before formal TDD. Use when requirements are unclear or architecture needs exploration. Keywords: spike, exploration, prototyping, learning, architecture
+license: MIT
+compatibility: Requires Claude Code with MCP servers (Context7, GitHub), Python 3.9+ for hooks, pnpm 10.11.0+
+metadata:
+  version: "3.0.0"
+  category: "workflow"
+  tags: ['spike', 'exploration', 'prototyping', 'learning']
+  author: "Hustle Together"
+allowed-tools: WebSearch WebFetch mcp__context7 mcp__github AskUserQuestion Read Write Edit Bash TodoWrite
+---
+---
+description: Execute TDD Spike Phase - exploratory coding to understand problem space before TDD
+argument-hint: <exploration description>
+---
+SPIKE PHASE! Apply the below to the info given by user input here:
+$ARGUMENTS
+## General Guidelines
+### Output Style
+- **Never explicitly mention TDD** in code, comments, commits, PRs, or issues
+- Write natural, descriptive code without meta-commentary about the development process
+- The code should speak for itself - TDD is the process, not the product
+(If there was no info above, fallback to the context of the conversation)
+## TDD Fundamentals
+### The TDD Cycle
+The foundation of TDD is the Red-Green-Refactor cycle:
+1. **Red Phase**: Write ONE failing test that describes desired behavior
+   - The test must fail for the RIGHT reason (not syntax/import errors)
+   - Only one test at a time - this is critical for TDD discipline
+     - Exception: For browser-level tests or expensive setup (e.g., Storybook `*.stories.tsx`), group multiple assertions within a single test block to avoid redundant setup - but only when adding assertions to an existing interaction flow. If new user interactions are required, still create a new test. Split files by category if they exceed ~1000 lines.
+   - **Adding a single test to a test file is ALWAYS allowed** - no prior test output needed
+   - Starting TDD for a new feature is always valid, even if test output shows unrelated work
+   - For DOM-based tests, use `data-testid` attributes to select elements rather than CSS classes, tag names, or text content
+   - Avoid hard-coded timeouts both in form of sleep() or timeout: 5000 etc; use proper async patterns (`waitFor`, `findBy*`, event-based sync) instead and rely on global test configs for timeout settings
+2. **Green Phase**: Write MINIMAL code to make the test pass
+   - Implement only what's needed for the current failing test
+   - No anticipatory coding or extra features
+   - Address the specific failure message
+3. **Refactor Phase**: Improve code structure while keeping tests green
+   - Only allowed when relevant tests are passing
+   - Requires proof that tests have been run and are green
+   - Applies to BOTH implementation and test code
+   - No refactoring with failing tests - fix them first
+### Core Violations
+1. **Multiple Test Addition**
+   - Adding more than one new test at once
+   - Exception: Initial test file setup or extracting shared test utilities
+2. **Over-Implementation**
+   - Code that exceeds what's needed to pass the current failing test
+   - Adding untested features, methods, or error handling
+   - Implementing multiple methods when test only requires one
+3. **Premature Implementation**
+   - Adding implementation before a test exists and fails properly
+   - Adding implementation without running the test first
+   - Refactoring when tests haven't been run or are failing
+### Critical Principle: Incremental Development
+Each step in TDD should address ONE specific issue:
+- Test fails "not defined" → Create empty stub/class only
+- Test fails "not a function" → Add method stub only
+- Test fails with assertion → Implement minimal logic only
+### Optional Pre-Phase: Spike Phase
+In rare cases where the problem space, interface, or expected behavior is unclear, a **Spike Phase** may be used **before the Red Phase**.
+This phase is **not part of the regular TDD workflow** and must only be applied under exceptional circumstances.
+- The goal of a Spike is **exploration and learning**, not implementation.
+- The code written during a Spike is **disposable** and **must not** be merged or reused directly.
+- Once sufficient understanding is achieved, all spike code is discarded, and normal TDD resumes starting from the **Red Phase**.
+- A Spike is justified only when it is impossible to define a meaningful failing test due to technical uncertainty or unknown system behavior.
+### General Information
+- Sometimes the test output shows as no tests have been run when a new test is failing due to a missing import or constructor. In such cases, allow the agent to create simple stubs. Ask them if they forgot to create a stub if they are stuck.
+- It is never allowed to introduce new logic without evidence of relevant failing tests. However, stubs and simple implementation to make imports and test infrastructure work is fine.
+- In the refactor phase, it is perfectly fine to refactor both test and implementation code. That said, completely new functionality is not allowed. Types, clean up, abstractions, and helpers are allowed as long as they do not introduce new behavior.
+- Adding types, interfaces, or a constant in order to replace magic values is perfectly fine during refactoring.
+- Provide the agent with helpful directions so that they do not get stuck when blocking them.
+## 🛡 Project Rules (Injected into every command)
+1. **NO BROKEN BUILDS:**
+   - Run `pnpm test` before every `/commit`
+   - Ensure all tests pass
+   - Fix any type errors immediately
+2. **API DEVELOPMENT:**
+   - All new APIs MUST have Zod request/response schemas
+   - All APIs MUST be documented in both:
+     - OpenAPI spec ([src/lib/openapi/](src/lib/openapi/))
+     - API test manifest ([src/app/api-test/api-tests-manifest.json](src/app/api-test/api-tests-manifest.json))
+   - Test ALL parameters and edge cases
+   - Include code examples and real-world outputs
+3. **TDD WORKFLOW:**
+   - ALWAYS use /red → /green → /refactor cycle
+   - NEVER write implementation without failing test first
+   - Use /cycle for feature development
+   - Use characterization tests for refactoring
+4. **API KEY MANAGEMENT:**
+   - Support three loading methods:
+     - Server environment variables
+     - NEXT_PUBLIC_ variables (client-side)
+     - Custom headers (X-OpenAI-Key, X-Anthropic-Key, etc.)
+   - Never hardcode API keys
+   - Always validate key availability before use
+5. **COMPREHENSIVE TESTING:**
+   - When researching APIs, read actual implementation code
+   - Discover ALL possible parameters (not just documented ones)
+   - Test with various parameter combinations
+   - Document custom headers, query params, request/response schemas
+   - Include validation rules and testing notes
+6. **NO UI BLOAT:**
+   - This is an API project with minimal frontend
+   - Only keep necessary test/documentation interfaces
+   - Delete unused components immediately
+   - No unnecessary UI libraries or features
+7. **DOCUMENTATION:**
+   - If you change an API, you MUST update:
+     - OpenAPI spec
+     - api-tests-manifest.json
+     - Code examples
+     - Testing notes
+   - Document expected behavior and edge cases
+   - Include real-world output examples

package/.skills/summarize/SKILL.md ADDED Viewed

@@ -0,0 +1,107 @@
+---
+name: summarize
+description: Summarize conversation progress and next steps. Provides overview of completed work and remaining tasks. Use at end of sessions or before breaks. Keywords: summary, progress, overview, tracking, communication
+license: MIT
+compatibility: Requires Claude Code with MCP servers (Context7, GitHub), Python 3.9+ for hooks, pnpm 10.11.0+
+metadata:
+  version: "3.0.0"
+  category: "workflow"
+  tags: ['summary', 'progress', 'overview', 'tracking']
+  author: "Hustle Together"
+allowed-tools: WebSearch WebFetch mcp__context7 mcp__github AskUserQuestion Read Write Edit Bash TodoWrite
+---
+---
+description: Summarize conversation progress and next steps
+argument-hint: [optional additional info]
+---
+## General Guidelines
+### Output Style
+- **Never explicitly mention TDD** in code, comments, commits, PRs, or issues
+- Write natural, descriptive code without meta-commentary about the development process
+- The code should speak for itself - TDD is the process, not the product
+Create a concise summary of the current conversation suitable for transferring context to a new conversation.
+Additional info: $ARGUMENTS
+## Summary Structure
+Provide a summary with these sections:
+### What We Did
+- Key accomplishments and changes made
+- Important decisions or discoveries
+- Files created, modified, or analyzed
+### What We're Doing Next
+- Immediate next steps
+- Pending tasks or work in progress
+- Goals or objectives to continue
+### Blockers & User Input Needed
+- Any issues requiring user intervention
+- Decisions that need to be made
+- Missing information or clarifications needed
+## Output Format
+Keep the summary concise and actionable - suitable for pasting into a new conversation to quickly restore context without needing the full conversation history.
+## 🛡 Project Rules (Injected into every command)
+1. **NO BROKEN BUILDS:**
+   - Run `pnpm test` before every `/commit`
+   - Ensure all tests pass
+   - Fix any type errors immediately
+2. **API DEVELOPMENT:**
+   - All new APIs MUST have Zod request/response schemas
+   - All APIs MUST be documented in both:
+     - OpenAPI spec ([src/lib/openapi/](src/lib/openapi/))
+     - API test manifest ([src/app/api-test/api-tests-manifest.json](src/app/api-test/api-tests-manifest.json))
+   - Test ALL parameters and edge cases
+   - Include code examples and real-world outputs
+3. **TDD WORKFLOW:**
+   - ALWAYS use /red → /green → /refactor cycle
+   - NEVER write implementation without failing test first
+   - Use /cycle for feature development
+   - Use characterization tests for refactoring
+4. **API KEY MANAGEMENT:**
+   - Support three loading methods:
+     - Server environment variables
+     - NEXT_PUBLIC_ variables (client-side)
+     - Custom headers (X-OpenAI-Key, X-Anthropic-Key, etc.)
+   - Never hardcode API keys
+   - Always validate key availability before use
+5. **COMPREHENSIVE TESTING:**
+   - When researching APIs, read actual implementation code
+   - Discover ALL possible parameters (not just documented ones)
+   - Test with various parameter combinations
+   - Document custom headers, query params, request/response schemas
+   - Include validation rules and testing notes
+6. **NO UI BLOAT:**
+   - This is an API project with minimal frontend
+   - Only keep necessary test/documentation interfaces
+   - Delete unused components immediately
+   - No unnecessary UI libraries or features
+7. **DOCUMENTATION:**
+   - If you change an API, you MUST update:
+     - OpenAPI spec
+     - api-tests-manifest.json
+     - Code examples
+     - Testing notes
+   - Document expected behavior and edge cases
+   - Include real-world output examples

package/.skills/tdd/SKILL.md ADDED Viewed

@@ -0,0 +1,157 @@
+---
+name: tdd
+description: Remind agent about TDD approach and continue conversation. Reinforces test-first methodology. Use when agent deviates from TDD practices. Keywords: tdd, reminder, methodology, testing, practices
+license: MIT
+compatibility: Requires Claude Code with MCP servers (Context7, GitHub), Python 3.9+ for hooks, pnpm 10.11.0+
+metadata:
+  version: "3.0.0"
+  category: "workflow"
+  tags: ['tdd', 'reminder', 'methodology', 'testing']
+  author: "Hustle Together"
+allowed-tools: WebSearch WebFetch mcp__context7 mcp__github AskUserQuestion Read Write Edit Bash TodoWrite
+---
+---
+description: Remind agent about TDD approach and continue conversation
+argument-hint: [optional-response-to-last-message]
+---
+# TDD Reminder
+## General Guidelines
+### Output Style
+- **Never explicitly mention TDD** in code, comments, commits, PRs, or issues
+- Write natural, descriptive code without meta-commentary about the development process
+- The code should speak for itself - TDD is the process, not the product
+## TDD Fundamentals
+### The TDD Cycle
+The foundation of TDD is the Red-Green-Refactor cycle:
+1. **Red Phase**: Write ONE failing test that describes desired behavior
+   - The test must fail for the RIGHT reason (not syntax/import errors)
+   - Only one test at a time - this is critical for TDD discipline
+     - Exception: For browser-level tests or expensive setup (e.g., Storybook `*.stories.tsx`), group multiple assertions within a single test block to avoid redundant setup - but only when adding assertions to an existing interaction flow. If new user interactions are required, still create a new test. Split files by category if they exceed ~1000 lines.
+   - **Adding a single test to a test file is ALWAYS allowed** - no prior test output needed
+   - Starting TDD for a new feature is always valid, even if test output shows unrelated work
+   - For DOM-based tests, use `data-testid` attributes to select elements rather than CSS classes, tag names, or text content
+   - Avoid hard-coded timeouts both in form of sleep() or timeout: 5000 etc; use proper async patterns (`waitFor`, `findBy*`, event-based sync) instead and rely on global test configs for timeout settings
+2. **Green Phase**: Write MINIMAL code to make the test pass
+   - Implement only what's needed for the current failing test
+   - No anticipatory coding or extra features
+   - Address the specific failure message
+3. **Refactor Phase**: Improve code structure while keeping tests green
+   - Only allowed when relevant tests are passing
+   - Requires proof that tests have been run and are green
+   - Applies to BOTH implementation and test code
+   - No refactoring with failing tests - fix them first
+### Core Violations
+1. **Multiple Test Addition**
+   - Adding more than one new test at once
+   - Exception: Initial test file setup or extracting shared test utilities
+2. **Over-Implementation**
+   - Code that exceeds what's needed to pass the current failing test
+   - Adding untested features, methods, or error handling
+   - Implementing multiple methods when test only requires one
+3. **Premature Implementation**
+   - Adding implementation before a test exists and fails properly
+   - Adding implementation without running the test first
+   - Refactoring when tests haven't been run or are failing
+### Critical Principle: Incremental Development
+Each step in TDD should address ONE specific issue:
+- Test fails "not defined" → Create empty stub/class only
+- Test fails "not a function" → Add method stub only
+- Test fails with assertion → Implement minimal logic only
+### Optional Pre-Phase: Spike Phase
+In rare cases where the problem space, interface, or expected behavior is unclear, a **Spike Phase** may be used **before the Red Phase**.
+This phase is **not part of the regular TDD workflow** and must only be applied under exceptional circumstances.
+- The goal of a Spike is **exploration and learning**, not implementation.
+- The code written during a Spike is **disposable** and **must not** be merged or reused directly.
+- Once sufficient understanding is achieved, all spike code is discarded, and normal TDD resumes starting from the **Red Phase**.
+- A Spike is justified only when it is impossible to define a meaningful failing test due to technical uncertainty or unknown system behavior.
+### General Information
+- Sometimes the test output shows as no tests have been run when a new test is failing due to a missing import or constructor. In such cases, allow the agent to create simple stubs. Ask them if they forgot to create a stub if they are stuck.
+- It is never allowed to introduce new logic without evidence of relevant failing tests. However, stubs and simple implementation to make imports and test infrastructure work is fine.
+- In the refactor phase, it is perfectly fine to refactor both test and implementation code. That said, completely new functionality is not allowed. Types, clean up, abstractions, and helpers are allowed as long as they do not introduce new behavior.
+- Adding types, interfaces, or a constant in order to replace magic values is perfectly fine during refactoring.
+- Provide the agent with helpful directions so that they do not get stuck when blocking them.
+## Continue Conversation
+User response to the last message: $ARGUMENTS
+Please continue with TDD approach based on the above response.
+## 🛡 Project Rules (Injected into every command)
+1. **NO BROKEN BUILDS:**
+   - Run `pnpm test` before every `/commit`
+   - Ensure all tests pass
+   - Fix any type errors immediately
+2. **API DEVELOPMENT:**
+   - All new APIs MUST have Zod request/response schemas
+   - All APIs MUST be documented in both:
+     - OpenAPI spec ([src/lib/openapi/](src/lib/openapi/))
+     - API test manifest ([src/app/api-test/api-tests-manifest.json](src/app/api-test/api-tests-manifest.json))
+   - Test ALL parameters and edge cases
+   - Include code examples and real-world outputs
+3. **TDD WORKFLOW:**
+   - ALWAYS use /red → /green → /refactor cycle
+   - NEVER write implementation without failing test first
+   - Use /cycle for feature development
+   - Use characterization tests for refactoring
+4. **API KEY MANAGEMENT:**
+   - Support three loading methods:
+     - Server environment variables
+     - NEXT_PUBLIC_ variables (client-side)
+     - Custom headers (X-OpenAI-Key, X-Anthropic-Key, etc.)
+   - Never hardcode API keys
+   - Always validate key availability before use
+5. **COMPREHENSIVE TESTING:**
+   - When researching APIs, read actual implementation code
+   - Discover ALL possible parameters (not just documented ones)
+   - Test with various parameter combinations
+   - Document custom headers, query params, request/response schemas
+   - Include validation rules and testing notes
+6. **NO UI BLOAT:**
+   - This is an API project with minimal frontend
+   - Only keep necessary test/documentation interfaces
+   - Delete unused components immediately
+   - No unnecessary UI libraries or features
+7. **DOCUMENTATION:**
+   - If you change an API, you MUST update:
+     - OpenAPI spec
+     - api-tests-manifest.json
+     - Code examples
+     - Testing notes
+   - Document expected behavior and edge cases
+   - Include real-world output examples