npm - @j0hanz/prompt-tuner-mcp-server - Versions diffs - 1.0.4 → 1.0.6 - Mend

@j0hanz/prompt-tuner-mcp-server 1.0.4 → 1.0.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (229) hide show

package/AGENTS.md +20 -23
package/CONFIGURATION.md +150 -159
package/README.md +136 -226
package/dist/config/constants.d.ts +4 -3
package/dist/config/constants.d.ts.map +1 -1
package/dist/config/constants.js +4 -3
package/dist/config/constants.js.map +1 -1
package/dist/config/env.d.ts +0 -1
package/dist/config/env.d.ts.map +1 -1
package/dist/config/env.js +0 -2
package/dist/config/env.js.map +1 -1
package/dist/config/instructions.d.ts +1 -1
package/dist/config/instructions.d.ts.map +1 -1
package/dist/config/instructions.js +21 -43
package/dist/config/instructions.js.map +1 -1
package/dist/config/types.d.ts +30 -57
package/dist/config/types.d.ts.map +1 -1
package/dist/config/types.js +0 -7
package/dist/config/types.js.map +1 -1
package/dist/config/typos.d.ts.map +1 -1
package/dist/config/typos.js +1 -4
package/dist/config/typos.js.map +1 -1
package/dist/index.js +93 -9
package/dist/index.js.map +1 -1
package/dist/lib/abort-signals.d.ts +2 -0
package/dist/lib/abort-signals.d.ts.map +1 -0
package/dist/lib/abort-signals.js +5 -0
package/dist/lib/abort-signals.js.map +1 -0
package/dist/lib/cache.d.ts +0 -1
package/dist/lib/cache.d.ts.map +1 -1
package/dist/lib/cache.js +0 -4
package/dist/lib/cache.js.map +1 -1
package/dist/lib/errors.d.ts +3 -7
package/dist/lib/errors.d.ts.map +1 -1
package/dist/lib/errors.js +52 -51
package/dist/lib/errors.js.map +1 -1
package/dist/lib/llm-client.d.ts +5 -1
package/dist/lib/llm-client.d.ts.map +1 -1
package/dist/lib/llm-client.js +4 -0
package/dist/lib/llm-client.js.map +1 -1
package/dist/lib/llm-json.d.ts.map +1 -1
package/dist/lib/llm-json.js +67 -1
package/dist/lib/llm-json.js.map +1 -1
package/dist/lib/llm-providers/helpers.d.ts +14 -0
package/dist/lib/llm-providers/helpers.d.ts.map +1 -0
package/dist/lib/llm-providers/helpers.js +45 -0
package/dist/lib/llm-providers/helpers.js.map +1 -0
package/dist/lib/llm-providers.d.ts +2 -0
package/dist/lib/llm-providers.d.ts.map +1 -1
package/dist/lib/llm-providers.js +28 -67
package/dist/lib/llm-providers.js.map +1 -1
package/dist/lib/llm-runtime.d.ts +1 -1
package/dist/lib/llm-runtime.d.ts.map +1 -1
package/dist/lib/llm-runtime.js +8 -6
package/dist/lib/llm-runtime.js.map +1 -1
package/dist/lib/llm.d.ts +1 -1
package/dist/lib/llm.d.ts.map +1 -1
package/dist/lib/llm.js +4 -11
package/dist/lib/llm.js.map +1 -1
package/dist/lib/output-normalization.d.ts +7 -0
package/dist/lib/output-normalization.d.ts.map +1 -0
package/dist/lib/output-normalization.js +65 -0
package/dist/lib/output-normalization.js.map +1 -0
package/dist/lib/output-validation.d.ts +11 -0
package/dist/lib/output-validation.d.ts.map +1 -0
package/dist/lib/output-validation.js +128 -0
package/dist/lib/output-validation.js.map +1 -0
package/dist/lib/prompt-analysis/scoring.d.ts.map +1 -1
package/dist/lib/prompt-analysis/scoring.js +7 -3
package/dist/lib/prompt-analysis/scoring.js.map +1 -1
package/dist/lib/prompt-analysis.d.ts +0 -2
package/dist/lib/prompt-analysis.d.ts.map +1 -1
package/dist/lib/prompt-analysis.js +0 -2
package/dist/lib/prompt-analysis.js.map +1 -1
package/dist/lib/prompt-policy.d.ts +3 -0
package/dist/lib/prompt-policy.d.ts.map +1 -0
package/dist/lib/prompt-policy.js +16 -0
package/dist/lib/prompt-policy.js.map +1 -0
package/dist/lib/retry.d.ts +1 -1
package/dist/lib/retry.d.ts.map +1 -1
package/dist/lib/retry.js +35 -13
package/dist/lib/retry.js.map +1 -1
package/dist/lib/technique-templates/format-instructions.d.ts +3 -0
package/dist/lib/technique-templates/format-instructions.d.ts.map +1 -0
package/dist/lib/technique-templates/format-instructions.js +42 -0
package/dist/lib/technique-templates/format-instructions.js.map +1 -0
package/dist/lib/technique-templates/templates-advanced.d.ts +5 -0
package/dist/lib/technique-templates/templates-advanced.d.ts.map +1 -0
package/dist/lib/technique-templates/templates-advanced.js +139 -0
package/dist/lib/technique-templates/templates-advanced.js.map +1 -0
package/dist/lib/technique-templates/templates-basic.d.ts +5 -0
package/dist/lib/technique-templates/templates-basic.d.ts.map +1 -0
package/dist/lib/technique-templates/templates-basic.js +129 -0
package/dist/lib/technique-templates/templates-basic.js.map +1 -0
package/dist/lib/technique-templates.d.ts +1 -1
package/dist/lib/technique-templates.d.ts.map +1 -1
package/dist/lib/technique-templates.js +15 -318
package/dist/lib/technique-templates.js.map +1 -1
package/dist/lib/tool-formatters.d.ts +13 -0
package/dist/lib/tool-formatters.d.ts.map +1 -0
package/dist/lib/tool-formatters.js +26 -0
package/dist/lib/tool-formatters.js.map +1 -0
package/dist/lib/tool-helpers.d.ts +8 -1
package/dist/lib/tool-helpers.d.ts.map +1 -1
package/dist/lib/tool-helpers.js +32 -7
package/dist/lib/tool-helpers.js.map +1 -1
package/dist/lib/tool-resources.d.ts +3 -0
package/dist/lib/tool-resources.d.ts.map +1 -0
package/dist/lib/tool-resources.js +23 -0
package/dist/lib/tool-resources.js.map +1 -0
package/dist/lib/validation.d.ts +0 -2
package/dist/lib/validation.d.ts.map +1 -1
package/dist/lib/validation.js +0 -13
package/dist/lib/validation.js.map +1 -1
package/dist/prompts/quick-workflows.d.ts.map +1 -1
package/dist/prompts/quick-workflows.js +127 -219
package/dist/prompts/quick-workflows.js.map +1 -1
package/dist/resources/index.d.ts +1 -2
package/dist/resources/index.d.ts.map +1 -1
package/dist/resources/index.js +2 -3
package/dist/resources/index.js.map +1 -1
package/dist/resources/prompt-templates.d.ts.map +1 -1
package/dist/resources/prompt-templates.js +3 -12
package/dist/resources/prompt-templates.js.map +1 -1
package/dist/schemas/index.d.ts +2 -3
package/dist/schemas/index.d.ts.map +1 -1
package/dist/schemas/index.js +2 -3
package/dist/schemas/index.js.map +1 -1
package/dist/schemas/inputs.d.ts +4 -27
package/dist/schemas/inputs.d.ts.map +1 -1
package/dist/schemas/inputs.js +0 -17
package/dist/schemas/inputs.js.map +1 -1
package/dist/schemas/llm-responses.d.ts +11 -186
package/dist/schemas/llm-responses.d.ts.map +1 -1
package/dist/schemas/llm-responses.js +8 -19
package/dist/schemas/llm-responses.js.map +1 -1
package/dist/schemas/outputs.d.ts +149 -350
package/dist/schemas/outputs.d.ts.map +1 -1
package/dist/schemas/outputs.js +47 -74
package/dist/schemas/outputs.js.map +1 -1
package/dist/server.d.ts.map +1 -1
package/dist/server.js +6 -5
package/dist/server.js.map +1 -1
package/dist/tools/analyze-prompt.d.ts.map +1 -1
package/dist/tools/analyze-prompt.js +121 -168
package/dist/tools/analyze-prompt.js.map +1 -1
package/dist/tools/compare-prompts.d.ts.map +1 -1
package/dist/tools/compare-prompts.js +48 -31
package/dist/tools/compare-prompts.js.map +1 -1
package/dist/tools/detect-format.d.ts.map +1 -1
package/dist/tools/detect-format.js +49 -39
package/dist/tools/detect-format.js.map +1 -1
package/dist/tools/index.d.ts.map +1 -1
package/dist/tools/index.js +0 -4
package/dist/tools/index.js.map +1 -1
package/dist/tools/optimize-prompt/formatters.d.ts +2 -0
package/dist/tools/optimize-prompt/formatters.d.ts.map +1 -0
package/dist/tools/optimize-prompt/formatters.js +57 -0
package/dist/tools/optimize-prompt/formatters.js.map +1 -0
package/dist/tools/optimize-prompt.d.ts.map +1 -1
package/dist/tools/optimize-prompt.js +156 -147
package/dist/tools/optimize-prompt.js.map +1 -1
package/dist/tools/refine-prompt.d.ts.map +1 -1
package/dist/tools/refine-prompt.js +66 -38
package/dist/tools/refine-prompt.js.map +1 -1
package/dist/tools/validate-prompt/prompt.d.ts +2 -0
package/dist/tools/validate-prompt/prompt.d.ts.map +1 -0
package/dist/tools/validate-prompt/prompt.js +40 -0
package/dist/tools/validate-prompt/prompt.js.map +1 -0
package/dist/tools/validate-prompt.d.ts.map +1 -1
package/dist/tools/validate-prompt.js +113 -143
package/dist/tools/validate-prompt.js.map +1 -1
package/package.json +5 -4
package/src/config/constants.ts +4 -3
package/src/config/env.ts +0 -3
package/src/config/instructions.ts +21 -43
package/src/config/types.ts +36 -66
package/src/index.ts +112 -10
package/src/lib/abort-signals.ts +7 -0
package/src/lib/errors.ts +90 -85
package/src/lib/llm-client.ts +9 -1
package/src/lib/llm-json.ts +85 -1
package/src/lib/llm-providers/helpers.ts +78 -0
package/src/lib/llm-providers.ts +59 -95
package/src/lib/llm-runtime.ts +28 -21
package/src/lib/llm.ts +6 -13
package/src/lib/output-normalization.ts +91 -0
package/src/lib/output-validation.ts +164 -0
package/src/lib/prompt-analysis.ts +0 -5
package/src/lib/prompt-policy.ts +18 -0
package/src/lib/retry.ts +51 -13
package/src/lib/technique-templates/format-instructions.ts +45 -0
package/src/lib/technique-templates/templates-advanced.ts +147 -0
package/src/lib/technique-templates/templates-basic.ts +137 -0
package/src/lib/technique-templates.ts +16 -326
package/src/lib/tool-formatters.ts +46 -0
package/src/lib/tool-helpers.ts +50 -12
package/src/lib/tool-resources.ts +31 -0
package/src/lib/validation.ts +0 -15
package/src/prompts/quick-workflows.ts +128 -230
package/src/schemas/index.ts +0 -12
package/src/schemas/inputs.ts +0 -19
package/src/schemas/llm-responses.ts +8 -32
package/src/schemas/outputs.ts +53 -79
package/src/server.ts +8 -6
package/src/tools/analyze-prompt.ts +158 -188
package/src/tools/index.ts +0 -4
package/src/tools/optimize-prompt/formatters.ts +70 -0
package/src/tools/optimize-prompt.ts +258 -174
package/src/tools/refine-prompt.ts +141 -60
package/src/tools/validate-prompt/prompt.ts +40 -0
package/src/tools/validate-prompt.ts +185 -167
package/src/types/regexp-escape.d.ts +3 -0
package/tests/llm-json.test.ts +17 -0
package/tests/quick-workflows.test.ts +1 -34
package/tsconfig.json +1 -1
package/src/config/typos.ts +0 -121
package/src/lib/cache.ts +0 -57
package/src/lib/prompt-analysis/scoring.ts +0 -235
package/src/lib/prompt-analysis/suggestions.ts +0 -115
package/src/resources/index.ts +0 -7
package/src/resources/prompt-templates/analysis.ts +0 -156
package/src/resources/prompt-templates/coding.ts +0 -302
package/src/resources/prompt-templates/data-extraction.ts +0 -122
package/src/resources/prompt-templates/system-prompts.ts +0 -81
package/src/resources/prompt-templates/writing.ts +0 -176
package/src/resources/prompt-templates.ts +0 -203
package/src/tools/compare-prompts.ts +0 -301
package/src/tools/detect-format.ts +0 -172

package/AGENTS.md CHANGED Viewed

@@ -2,41 +2,38 @@
 ## Project Structure & Module Organization
-- `src/` contains the TypeScript source: server setup, tools, resources, prompts, schemas, and shared utilities.
-- `tests/` holds Vitest specs (for example `tests/server.test.ts`).
-- `docs/` contains documentation assets (logo, guides). `CONFIGURATION.md` documents environment variables.
-- `dist/` is the compiled output (do not edit by hand).
+- `src/` holds TypeScript source. `src/index.ts` is the entry point and `src/server.ts` wires the MCP server. Subfolders include `config/`, `lib/`, `tools/`, `resources/`, `prompts/`, `schemas/`, and `types/`.
+- `tests/` contains Vitest suites; test files use the `*.test.ts` naming pattern.
+- `dist/` is generated build output (do not edit by hand).
+- `docs/` stores static assets. `CONFIGURATION.md` documents runtime environment variables.
 ## Build, Test, and Development Commands
-Run commands from the repo root.
-- `npm install` installs dependencies.
-- `npm run build` compiles TypeScript and sets the executable bit on `dist/index.js`.
-- `npm run dev` runs the server in watch mode; `npm run dev:http` enables HTTP mode.
-- `npm start` runs the built server; `npm run start:http` enables HTTP mode.
-- `npm test` runs Vitest once; `npm run test:watch` watches tests.
-- `npm run lint` runs ESLint; `npm run format` runs Prettier; `npm run type-check` runs `tsc --noEmit`.
-- `npm run inspector` starts the MCP Inspector (use `inspector:http` for HTTP).
+- `npm run dev` / `npm run dev:http`: run from source with tsx watch (HTTP variant adds `--http`).
+- `npm run build`: compile TypeScript into `dist/` and set executable permissions.
+- `npm run start` / `npm run start:http`: run the compiled server from `dist/`.
+- `npm run test` / `npm run test:watch`: run Vitest once or in watch mode.
+- `npm run lint` and `npm run format`: ESLint checks and Prettier formatting.
+- `npm run type-check`: `tsc --noEmit` for strict type validation.
 ## Coding Style & Naming Conventions
-- TypeScript ESM (`"type": "module"`). Prefer explicit return types and `type` imports.
-- Prettier enforces 2-space indentation, single quotes, semicolons, 80-char width, LF, and sorted imports.
-- ESLint is strict: no `any`, no unused imports, prefer `const`, and avoid floating promises.
-- Naming: `camelCase` for variables/functions, `PascalCase` for types/classes, `UPPER_CASE` for constants. Leading `_` is allowed for intentionally unused params.
+- TypeScript, ES modules, Node >= 20.
+- Prettier rules: 2-space indentation, single quotes, trailing commas, 80-char line width, sorted imports.
+- ESLint is strict; avoid `any`, unused imports, and floating promises; prefer `type` imports.
+- Naming: `camelCase` for variables/functions, `PascalCase` for types, `UPPER_CASE` for constants; leading `_` is allowed for unused args.
 ## Testing Guidelines
-- Framework: Vitest (node environment). Test files live in `tests/**/*.test.ts`.
-- Add or update tests when changing tools, scoring logic, or server behavior.
+- Use Vitest in the Node environment; keep tests in `tests/` and name `*.test.ts`.
+- Favor deterministic tests and keep individual tests under the 15s timeout.
 ## Commit & Pull Request Guidelines
-- Commit subjects in this repo are short, imperative, and sentence case (for example "Refactor cache handling"). Release commits may be version-only (for example "1.0.2").
-- PRs should include a clear description, reasoning, and the tests run. Update `README.md` or `CONFIGURATION.md` when changing tools, prompts, or environment variables.
+- History favors short, imperative summaries; common pattern is `refactor: ...`, plus plain `Add ...` and version bumps like `1.0.5`.
+- PRs should include a brief summary, tests run (for example, `npm run test`), and note any config or environment changes. Link related issues when applicable.
 ## Security & Configuration Tips
-- Configure via environment variables only; do not commit secrets. Set exactly one provider API key for the chosen `LLM_PROVIDER`.
-- Be cautious with `INCLUDE_ERROR_CONTEXT=true` since it can include prompt excerpts in errors.
+- Runtime behavior is driven by environment variables; see `CONFIGURATION.md` for required keys and limits.
+- Never commit API keys. Be cautious with `INCLUDE_ERROR_CONTEXT=true` in production.

package/CONFIGURATION.md CHANGED Viewed

@@ -1,227 +1,218 @@
 # PromptTuner MCP Configuration Guide
-## Environment Variables
+PromptTuner MCP is configured entirely via environment variables. Set them in your MCP client configuration (for example `mcp.json`, `claude_desktop_config.json`) or a `.env` file.
-All configuration is done through environment variables. Set them in your MCP client configuration (e.g., `mcp.json` for VS Code) or in a `.env` file.
+## Required configuration
-### Required Configuration
+You must pick a provider and supply its API key.
-| Variable            | Description             | Default           | Example                                                                              |
-| ------------------- | ----------------------- | ----------------- | ------------------------------------------------------------------------------------ |
-| `LLM_PROVIDER`      | LLM provider to use     | `openai`          | `openai`, `anthropic`, `google`                                                      |
-| `OPENAI_API_KEY`    | OpenAI API key          | -                 | `sk-...`                                                                             |
-| `ANTHROPIC_API_KEY` | Anthropic API key       | -                 | `sk-ant-...`                                                                         |
-| `GOOGLE_API_KEY`    | Google Gemini API key   | -                 | `AIzaSy...`                                                                          |
-| `LLM_MODEL`         | Model to use (optional) | Provider-specific | `gpt-4o`, `claude-3-5-sonnet-20241022`, `gemini-2.0-flash-exp`, `gemini-2.5-pro-exp` |
+| Variable            | Default  | Description                             |
+| ------------------- | -------- | --------------------------------------- |
+| `LLM_PROVIDER`      | `openai` | `openai`, `anthropic`, or `google`.     |
+| `OPENAI_API_KEY`    | -        | Required when `LLM_PROVIDER=openai`.    |
+| `ANTHROPIC_API_KEY` | -        | Required when `LLM_PROVIDER=anthropic`. |
+| `GOOGLE_API_KEY`    | -        | Required when `LLM_PROVIDER=google`.    |
-**Note**: Only provide the API key for your chosen provider.
+PromptTuner checks that the correct API key environment variable is set at startup. The provider will reject invalid keys at request time.
-### Performance & Limits (Optional)
+## Provider defaults
-| Variable            | Description               | Default         | Recommended Range       |
-| ------------------- | ------------------------- | --------------- | ----------------------- |
-| `LLM_TIMEOUT_MS`    | LLM request timeout (ms)  | `60000` (1 min) | 30000-120000            |
-| `LLM_MAX_TOKENS`    | Max tokens per response   | `8000`          | 2000-16000 (pro: 8000+) |
-| `MAX_PROMPT_LENGTH` | Max prompt length (chars) | `10000`         | 5000-50000              |
-| `CACHE_MAX_SIZE`    | Max cached refinements    | `1000`          | 500-5000                |
+| Provider    | Default model                | API key env         |
+| ----------- | ---------------------------- | ------------------- |
+| `openai`    | `gpt-4o`                     | `OPENAI_API_KEY`    |
+| `anthropic` | `claude-3-5-sonnet-20241022` | `ANTHROPIC_API_KEY` |
+| `google`    | `gemini-2.0-flash-exp`       | `GOOGLE_API_KEY`    |
-### Retry Configuration (Optional)
+Set `LLM_MODEL` to override the default model for the chosen provider.
-| Variable                 | Description                    | Default          | Recommended Range |
-| ------------------------ | ------------------------------ | ---------------- | ----------------- |
-| `RETRY_MAX_ATTEMPTS`     | Max retry attempts             | `3`              | 1-5               |
-| `RETRY_BASE_DELAY_MS`    | Initial retry delay (ms)       | `1000`           | 500-2000          |
-| `RETRY_MAX_DELAY_MS`     | Max retry delay (ms)           | `10000`          | 5000-30000        |
-| `RETRY_TOTAL_TIMEOUT_MS` | Total timeout for retries (ms) | `180000` (3 min) | 60000-300000      |
+## Limits and timeouts (optional)
-### Logging & Debugging (Optional)
+| Variable            | Default | Description                          |
+| ------------------- | ------- | ------------------------------------ |
+| `MAX_PROMPT_LENGTH` | `10000` | Max trimmed prompt length (chars).   |
+| `LLM_MAX_TOKENS`    | `8000`  | Upper bound for model output tokens. |
+| `LLM_TIMEOUT_MS`    | `60000` | Per-request timeout (ms).            |
-| Variable                | Description               | Default | Options         |
-| ----------------------- | ------------------------- | ------- | --------------- |
-| `LOG_FORMAT`            | Log output format         | `text`  | `text`, `json`  |
-| `DEBUG`                 | Enable debug logging      | `false` | `true`, `false` |
-| `INCLUDE_ERROR_CONTEXT` | Include context in errors | `false` | `true`, `false` |
+### Prompt length enforcement
-**Security Note**: When `INCLUDE_ERROR_CONTEXT=true`, error responses may include up to 500 characters of the prompt that caused the error. Only enable this in development.
+- Input is trimmed before validation.
+- If raw input exceeds `MAX_PROMPT_LENGTH * 2`, it is rejected as excessive whitespace.
+- If trimmed input exceeds `MAX_PROMPT_LENGTH`, it is rejected.
-### Provider-Specific (Optional)
+### Tool token caps
-| Variable                 | Description                   | Default | Options         |
-| ------------------------ | ----------------------------- | ------- | --------------- |
-| `GOOGLE_SAFETY_DISABLED` | Disable Gemini safety filters | `false` | `true`, `false` |
+Tool max tokens are derived from `LLM_MAX_TOKENS`:
-## Example Configurations
+| Tool              | Max tokens                  |
+| ----------------- | --------------------------- |
+| `analyze_prompt`  | `min(LLM_MAX_TOKENS, 4000)` |
+| `refine_prompt`   | `min(LLM_MAX_TOKENS, 2000)` |
+| `optimize_prompt` | `min(LLM_MAX_TOKENS, 3000)` |
+| `validate_prompt` | `min(LLM_MAX_TOKENS, 1000)` |
-### Minimal (Production)
+## Retry behavior (optional)
-```json
-{
-  "prompttuner": {
-    "command": "node",
-    "args": ["/path/to/prompttuner-mcp/dist/index.js"],
-    "env": {
-      "LLM_PROVIDER": "openai",
-      "OPENAI_API_KEY": "${input:openai-api-key}"
-    }
-  }
-}
-```
+| Variable                 | Default  | Description                                    |
+| ------------------------ | -------- | ---------------------------------------------- |
+| `RETRY_MAX_ATTEMPTS`     | `3`      | Max retry attempts (total attempts = max + 1). |
+| `RETRY_BASE_DELAY_MS`    | `1000`   | Base delay for exponential backoff.            |
+| `RETRY_MAX_DELAY_MS`     | `10000`  | Max delay between retries.                     |
+| `RETRY_TOTAL_TIMEOUT_MS` | `180000` | Total time allowed across retries.             |
+Retries use exponential backoff with jitter and stop when the total timeout is exceeded.
+## Logging and error context (optional)
+| Variable                | Default | Description                                                    |
+| ----------------------- | ------- | -------------------------------------------------------------- |
+| `DEBUG`                 | `false` | Enables debug logging. Logs are written to stderr.             |
+| `LOG_FORMAT`            | `text`  | Parsed but currently unused (logging output is JSON via pino). |
+| `INCLUDE_ERROR_CONTEXT` | `false` | Adds a sanitized prompt snippet (up to 200 chars) to errors.   |
+## Provider-specific settings
+| Variable                 | Default | Description                                |
+| ------------------------ | ------- | ------------------------------------------ |
+| `GOOGLE_SAFETY_DISABLED` | `false` | When true, disables Gemini safety filters. |
+## validate_prompt token limits
-### Performance Tuned
+`validate_prompt` uses fixed limits when calculating `tokenUtilization`:
+| targetModel | Token limit |
+| ----------- | ----------- |
+| `claude`    | `200000`    |
+| `gpt`       | `128000`    |
+| `gemini`    | `1000000`   |
+| `generic`   | `8000`      |
+## Example configurations
+### Minimal (npx)
 ```json
 {
-  "prompttuner": {
-    "command": "node",
-    "args": ["/path/to/prompttuner-mcp/dist/index.js"],
-    "env": {
-      "LLM_PROVIDER": "anthropic",
-      "ANTHROPIC_API_KEY": "${input:anthropic-api-key}",
-      "LLM_MODEL": "claude-3-5-sonnet-20241022",
-      "LLM_TIMEOUT_MS": "90000",
-      "LLM_MAX_TOKENS": "8000",
-      "CACHE_MAX_SIZE": "2000",
-      "RETRY_MAX_ATTEMPTS": "5"
+  "mcpServers": {
+    "prompttuner": {
+      "command": "npx",
+      "args": ["-y", "@j0hanz/prompt-tuner-mcp-server@latest"],
+      "env": {
+        "LLM_PROVIDER": "openai",
+        "OPENAI_API_KEY": "${input:openai-api-key}"
+      }
     }
   }
 }
 ```
-### Pro Models (Gemini 2.5 Pro, GPT-4o)
+### From source (dist build)
 ```json
 {
-  "prompttuner": {
-    "command": "node",
-    "args": ["/path/to/prompttuner-mcp/dist/index.js"],
-    "env": {
-      "LLM_PROVIDER": "google",
-      "GOOGLE_API_KEY": "${input:google-api-key}",
-      "LLM_MODEL": "gemini-2.5-pro-exp",
-      "LLM_TIMEOUT_MS": "120000",
-      "LLM_MAX_TOKENS": "16000",
-      "CACHE_MAX_SIZE": "3000"
+  "mcpServers": {
+    "prompttuner": {
+      "command": "node",
+      "args": ["/path/to/prompttuner-mcp/dist/index.js"],
+      "env": {
+        "LLM_PROVIDER": "anthropic",
+        "ANTHROPIC_API_KEY": "${input:anthropic-api-key}"
+      }
     }
   }
 }
 ```
-### Development/Debug
+### Performance tuned
 ```json
 {
-  "prompttuner": {
-    "command": "node",
-    "args": ["/path/to/prompttuner-mcp/dist/index.js"],
-    "env": {
-      "LLM_PROVIDER": "google",
-      "GOOGLE_API_KEY": "${input:google-api-key}",
-      "LLM_MODEL": "gemini-2.0-flash-exp",
-      "LOG_FORMAT": "json",
-      "DEBUG": "true",
-      "INCLUDE_ERROR_CONTEXT": "true"
+  "mcpServers": {
+    "prompttuner": {
+      "command": "node",
+      "args": ["/path/to/prompttuner-mcp/dist/index.js"],
+      "env": {
+        "LLM_PROVIDER": "anthropic",
+        "ANTHROPIC_API_KEY": "${input:anthropic-api-key}",
+        "LLM_MODEL": "claude-3-5-sonnet-20241022",
+        "LLM_TIMEOUT_MS": "90000",
+        "LLM_MAX_TOKENS": "8000",
+        "RETRY_MAX_ATTEMPTS": "5"
+      }
     }
   }
 }
 ```
-### High Volume / Low Latency
+### High volume / low latency
 ```json
 {
-  "prompttuner": {
-    "command": "node",
-    "args": ["/path/to/prompttuner-mcp/dist/index.js"],
-    "env": {
-      "LLM_PROVIDER": "openai",
-      "OPENAI_API_KEY": "${input:openai-api-key}",
-      "LLM_MODEL": "gpt-4o-mini",
-      "LLM_TIMEOUT_MS": "30000",
-      "LLM_MAX_TOKENS": "1500",
-      "CACHE_MAX_SIZE": "5000",
-      "RETRY_MAX_ATTEMPTS": "2",
-      "RETRY_BASE_DELAY_MS": "500"
+  "mcpServers": {
+    "prompttuner": {
+      "command": "node",
+      "args": ["/path/to/prompttuner-mcp/dist/index.js"],
+      "env": {
+        "LLM_PROVIDER": "openai",
+        "OPENAI_API_KEY": "${input:openai-api-key}",
+        "LLM_MODEL": "gpt-4o-mini",
+        "LLM_TIMEOUT_MS": "30000",
+        "LLM_MAX_TOKENS": "1500",
+        "RETRY_MAX_ATTEMPTS": "2",
+        "RETRY_BASE_DELAY_MS": "500"
+      }
     }
   }
 }
 ```
-## What's NOT Configurable (and Why)
-The following are intentionally hardcoded for stability and optimal performance:
-### Scoring Algorithm
-- **Scoring dimension weights** (clarity: 0.25, specificity: 0.25, etc.)
-- **Reason**: Carefully tuned based on prompt engineering research
+## What is not configurable
-### Analysis Constants
+The following behaviors are hardcoded for stability:
-- **Pattern matching regex** for detecting prompt characteristics
-- **Reason**: Complex regex patterns that work across all use cases
+- Scoring weights: clarity 0.25, specificity 0.25, completeness 0.2, structure 0.15, effectiveness 0.15.
+- Prompt format detection patterns and scoring heuristics.
+- OpenAI temperature (0.7). Other providers use SDK defaults.
+- LLM response length cap (500000 chars) and JSON parsing safeguards.
+- Error context truncation length (200 chars when enabled).
-### LLM Behavior
+## Migration notes (older configs)
-- **Temperature** (0.7 for refinement tasks)
-- **Reason**: Optimal balance between creativity and consistency for prompt refinement
+If you have an old `.env` file, remove unused settings:
-### Internal Limits
-- **Analysis max tokens** (1500)
-- **Analysis timeout** (60000ms)
-- **Max LLM response length** (500,000 chars)
-- **Error context truncation** (500 chars)
-- **Reason**: Safety constraints to prevent resource exhaustion
-## Migration from Old Configuration
-If you have an old `.env` file with these variables, **remove them** (they are not used):
-- ❌ `PORT` / `HOST` - HTTP mode not fully implemented in stdio version
-- ❌ `API_KEY` - No API authentication in current version
-- ❌ `CORS_ORIGIN` - No HTTP CORS in stdio version
-- ❌ `LOG_LEVEL` - Use `DEBUG=true/false` instead
-- ❌ `RATE_LIMIT` / `RATE_WINDOW_MS` - No rate limiting in current version
-- ❌ `REDIS_URL` / `CACHE_TTL` - In-memory cache only
-- ❌ `CIRCUIT_BREAKER_*` - Not implemented
-- ❌ `NODE_ENV` - Not used for configuration
-- ❌ `SESSION_TIMEOUT_MS` - No session management
+- `PORT`, `HOST`, `CORS_ORIGIN` (stdio transport only; `--http` is reserved).
+- `API_KEY` (no server-level auth).
+- `LOG_LEVEL` (use `DEBUG=true` or false).
+- `RATE_LIMIT`, `RATE_WINDOW_MS` (no server-side rate limiting).
+- `REDIS_URL`, `CACHE_TTL` (no caching).
+- `CIRCUIT_BREAKER_*` (not implemented).
+- `NODE_ENV` (not used for configuration).
+- `SESSION_TIMEOUT_MS` (no session management).
 ## Troubleshooting
-### High Memory Usage
-- Reduce `CACHE_MAX_SIZE` (e.g., `500`)
-- Reduce `MAX_PROMPT_LENGTH` (e.g., `5000`)
-### Timeout Errors
+### Prompt rejected
-- Increase `LLM_TIMEOUT_MS` (e.g., `90000`)
-- Increase `RETRY_TOTAL_TIMEOUT_MS` (e.g., `300000`)
-- Reduce `LLM_MAX_TOKENS` (e.g., `1500`)
+- Reduce `MAX_PROMPT_LENGTH` or trim the input to remove excessive whitespace.
-### Rate Limit Errors
+### Timeout errors
-- Increase `RETRY_BASE_DELAY_MS` (e.g., `2000`)
-- Increase `RETRY_MAX_ATTEMPTS` (e.g., `5`)
+- Increase `LLM_TIMEOUT_MS` or `RETRY_TOTAL_TIMEOUT_MS`.
+- Reduce `LLM_MAX_TOKENS`.
-### Slow Performance
+### Rate limit errors
-- Increase `CACHE_MAX_SIZE` (e.g., `2000`)
-- Use faster model (e.g., `gpt-4o-mini` or `gemini-2.0-flash-exp`)
-- Reduce `LLM_MAX_TOKENS` (e.g., `1500`)
+- Increase `RETRY_BASE_DELAY_MS` or `RETRY_MAX_ATTEMPTS`.
+- Reduce request frequency.
-### Debug Logging Not Showing
+### Slow performance
-- Set `DEBUG=true`
-- Check logs are going to stderr (where MCP logs are captured)
+- Use a faster model (for example `gpt-4o-mini` or `gemini-2.0-flash-exp`).
+- Reduce `LLM_MAX_TOKENS`.
-## Best Practices
+## Best practices
-1. **Always set only one API key** - Only configure the key for your chosen provider
-2. **Use input variables for secrets** - In mcp.json: `"OPENAI_API_KEY": "${input:openai-api-key}"`
-3. **Start with defaults** - Only override what you need
-4. **Enable debug logging temporarily** - `DEBUG=true` for troubleshooting only
-5. **Monitor cache hit rate** - Check logs for "Cache hit for refinement" messages
-6. **Test timeout settings** - Start conservative, increase if seeing timeout errors
-7. **Use JSON logging in production** - `LOG_FORMAT=json` for easier parsing
+1. Configure only the API key for your chosen provider.
+2. Use input variables for secrets (for example `"OPENAI_API_KEY": "${input:openai-api-key}"`).
+3. Start with defaults and tune only when needed.
+4. Enable `DEBUG=true` temporarily for troubleshooting.
+5. Prefer JSON logging in production (current output is JSON via pino).