@j0hanz/prompt-tuner-mcp-server 1.0.5 → 1.0.7

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (171)
  1. package/AGENTS.md +20 -20
  2. package/CONFIGURATION.md +150 -159
  3. package/README.md +136 -188
  4. package/dist/config/constants.d.ts +4 -2
  5. package/dist/config/constants.d.ts.map +1 -1
  6. package/dist/config/constants.js +4 -2
  7. package/dist/config/constants.js.map +1 -1
  8. package/dist/config/env.d.ts +0 -1
  9. package/dist/config/env.d.ts.map +1 -1
  10. package/dist/config/env.js +0 -2
  11. package/dist/config/env.js.map +1 -1
  12. package/dist/config/types.d.ts +30 -34
  13. package/dist/config/types.d.ts.map +1 -1
  14. package/dist/config/types.js.map +1 -1
  15. package/dist/index.js +33 -18
  16. package/dist/index.js.map +1 -1
  17. package/dist/lib/abort-signals.d.ts +2 -0
  18. package/dist/lib/abort-signals.d.ts.map +1 -0
  19. package/dist/lib/abort-signals.js +5 -0
  20. package/dist/lib/abort-signals.js.map +1 -0
  21. package/dist/lib/errors.d.ts +2 -5
  22. package/dist/lib/errors.d.ts.map +1 -1
  23. package/dist/lib/errors.js +36 -64
  24. package/dist/lib/errors.js.map +1 -1
  25. package/dist/lib/llm-client.d.ts +5 -1
  26. package/dist/lib/llm-client.d.ts.map +1 -1
  27. package/dist/lib/llm-client.js +4 -0
  28. package/dist/lib/llm-client.js.map +1 -1
  29. package/dist/lib/llm-json.d.ts.map +1 -1
  30. package/dist/lib/llm-json.js +75 -4
  31. package/dist/lib/llm-json.js.map +1 -1
  32. package/dist/lib/llm-providers/helpers.d.ts +14 -0
  33. package/dist/lib/llm-providers/helpers.d.ts.map +1 -0
  34. package/dist/lib/llm-providers/helpers.js +45 -0
  35. package/dist/lib/llm-providers/helpers.js.map +1 -0
  36. package/dist/lib/llm-providers.d.ts +3 -0
  37. package/dist/lib/llm-providers.d.ts.map +1 -1
  38. package/dist/lib/llm-providers.js +35 -65
  39. package/dist/lib/llm-providers.js.map +1 -1
  40. package/dist/lib/llm.d.ts +1 -1
  41. package/dist/lib/llm.d.ts.map +1 -1
  42. package/dist/lib/llm.js +4 -11
  43. package/dist/lib/llm.js.map +1 -1
  44. package/dist/lib/output-normalization.d.ts +7 -0
  45. package/dist/lib/output-normalization.d.ts.map +1 -0
  46. package/dist/lib/output-normalization.js +74 -0
  47. package/dist/lib/output-normalization.js.map +1 -0
  48. package/dist/lib/output-validation.d.ts +11 -0
  49. package/dist/lib/output-validation.d.ts.map +1 -0
  50. package/dist/lib/output-validation.js +144 -0
  51. package/dist/lib/output-validation.js.map +1 -0
  52. package/dist/lib/prompt-policy.d.ts +0 -2
  53. package/dist/lib/prompt-policy.d.ts.map +1 -1
  54. package/dist/lib/prompt-policy.js +6 -2
  55. package/dist/lib/prompt-policy.js.map +1 -1
  56. package/dist/lib/retry.d.ts.map +1 -1
  57. package/dist/lib/retry.js +17 -10
  58. package/dist/lib/retry.js.map +1 -1
  59. package/dist/lib/technique-templates/format-instructions.d.ts +3 -0
  60. package/dist/lib/technique-templates/format-instructions.d.ts.map +1 -0
  61. package/dist/lib/technique-templates/format-instructions.js +42 -0
  62. package/dist/lib/technique-templates/format-instructions.js.map +1 -0
  63. package/dist/lib/technique-templates/templates-advanced.d.ts +5 -0
  64. package/dist/lib/technique-templates/templates-advanced.d.ts.map +1 -0
  65. package/dist/lib/technique-templates/templates-advanced.js +139 -0
  66. package/dist/lib/technique-templates/templates-advanced.js.map +1 -0
  67. package/dist/lib/technique-templates/templates-basic.d.ts +5 -0
  68. package/dist/lib/technique-templates/templates-basic.d.ts.map +1 -0
  69. package/dist/lib/technique-templates/templates-basic.js +129 -0
  70. package/dist/lib/technique-templates/templates-basic.js.map +1 -0
  71. package/dist/lib/technique-templates.d.ts +1 -1
  72. package/dist/lib/technique-templates.d.ts.map +1 -1
  73. package/dist/lib/technique-templates.js +12 -342
  74. package/dist/lib/technique-templates.js.map +1 -1
  75. package/dist/lib/tool-formatters.d.ts +13 -0
  76. package/dist/lib/tool-formatters.d.ts.map +1 -0
  77. package/dist/lib/tool-formatters.js +26 -0
  78. package/dist/lib/tool-formatters.js.map +1 -0
  79. package/dist/lib/tool-helpers.d.ts +8 -1
  80. package/dist/lib/tool-helpers.d.ts.map +1 -1
  81. package/dist/lib/tool-helpers.js +32 -7
  82. package/dist/lib/tool-helpers.js.map +1 -1
  83. package/dist/lib/tool-resources.d.ts +3 -0
  84. package/dist/lib/tool-resources.d.ts.map +1 -0
  85. package/dist/lib/tool-resources.js +23 -0
  86. package/dist/lib/tool-resources.js.map +1 -0
  87. package/dist/lib/validation.d.ts +0 -1
  88. package/dist/lib/validation.d.ts.map +1 -1
  89. package/dist/lib/validation.js +0 -6
  90. package/dist/lib/validation.js.map +1 -1
  91. package/dist/prompts/quick-workflows.d.ts.map +1 -1
  92. package/dist/prompts/quick-workflows.js +13 -7
  93. package/dist/prompts/quick-workflows.js.map +1 -1
  94. package/dist/schemas/index.d.ts +0 -1
  95. package/dist/schemas/index.d.ts.map +1 -1
  96. package/dist/schemas/index.js +0 -1
  97. package/dist/schemas/index.js.map +1 -1
  98. package/dist/schemas/inputs.d.ts +4 -4
  99. package/dist/schemas/inputs.d.ts.map +1 -1
  100. package/dist/schemas/inputs.js +33 -2
  101. package/dist/schemas/inputs.js.map +1 -1
  102. package/dist/schemas/llm-responses.d.ts +8 -78
  103. package/dist/schemas/llm-responses.d.ts.map +1 -1
  104. package/dist/schemas/llm-responses.js +3 -3
  105. package/dist/schemas/llm-responses.js.map +1 -1
  106. package/dist/schemas/outputs.d.ts +150 -102
  107. package/dist/schemas/outputs.d.ts.map +1 -1
  108. package/dist/schemas/outputs.js +44 -13
  109. package/dist/schemas/outputs.js.map +1 -1
  110. package/dist/server.d.ts.map +1 -1
  111. package/dist/server.js +6 -2
  112. package/dist/server.js.map +1 -1
  113. package/dist/tools/analyze-prompt.d.ts.map +1 -1
  114. package/dist/tools/analyze-prompt.js +105 -161
  115. package/dist/tools/analyze-prompt.js.map +1 -1
  116. package/dist/tools/optimize-prompt/formatters.d.ts +2 -0
  117. package/dist/tools/optimize-prompt/formatters.d.ts.map +1 -0
  118. package/dist/tools/optimize-prompt/formatters.js +57 -0
  119. package/dist/tools/optimize-prompt/formatters.js.map +1 -0
  120. package/dist/tools/optimize-prompt.d.ts.map +1 -1
  121. package/dist/tools/optimize-prompt.js +306 -155
  122. package/dist/tools/optimize-prompt.js.map +1 -1
  123. package/dist/tools/refine-prompt.d.ts.map +1 -1
  124. package/dist/tools/refine-prompt.js +73 -36
  125. package/dist/tools/refine-prompt.js.map +1 -1
  126. package/dist/tools/validate-prompt/prompt.d.ts +2 -0
  127. package/dist/tools/validate-prompt/prompt.d.ts.map +1 -0
  128. package/dist/tools/validate-prompt/prompt.js +40 -0
  129. package/dist/tools/validate-prompt/prompt.js.map +1 -0
  130. package/dist/tools/validate-prompt.d.ts.map +1 -1
  131. package/dist/tools/validate-prompt.js +108 -152
  132. package/dist/tools/validate-prompt.js.map +1 -1
  133. package/package.json +5 -2
  134. package/src/config/constants.ts +4 -2
  135. package/src/config/env.ts +0 -3
  136. package/src/config/types.ts +38 -34
  137. package/src/index.ts +43 -23
  138. package/src/lib/abort-signals.ts +7 -0
  139. package/src/lib/errors.ts +60 -103
  140. package/src/lib/llm-client.ts +9 -1
  141. package/src/lib/llm-json.ts +92 -3
  142. package/src/lib/llm-providers/helpers.ts +78 -0
  143. package/src/lib/llm-providers.ts +55 -83
  144. package/src/lib/llm.ts +6 -13
  145. package/src/lib/output-normalization.ts +100 -0
  146. package/src/lib/output-validation.ts +183 -0
  147. package/src/lib/prompt-policy.ts +9 -2
  148. package/src/lib/retry.ts +27 -9
  149. package/src/lib/technique-templates/format-instructions.ts +45 -0
  150. package/src/lib/technique-templates/templates-advanced.ts +147 -0
  151. package/src/lib/technique-templates/templates-basic.ts +137 -0
  152. package/src/lib/technique-templates.ts +13 -350
  153. package/src/lib/tool-formatters.ts +46 -0
  154. package/src/lib/tool-helpers.ts +50 -12
  155. package/src/lib/tool-resources.ts +31 -0
  156. package/src/lib/validation.ts +0 -7
  157. package/src/prompts/quick-workflows.ts +12 -7
  158. package/src/schemas/index.ts +0 -7
  159. package/src/schemas/inputs.ts +36 -5
  160. package/src/schemas/llm-responses.ts +3 -13
  161. package/src/schemas/outputs.ts +50 -13
  162. package/src/server.ts +8 -3
  163. package/src/tools/analyze-prompt.ts +135 -179
  164. package/src/tools/optimize-prompt/formatters.ts +70 -0
  165. package/src/tools/optimize-prompt.ts +495 -179
  166. package/src/tools/refine-prompt.ts +150 -55
  167. package/src/tools/validate-prompt/prompt.ts +40 -0
  168. package/src/tools/validate-prompt.ts +172 -177
  169. package/tests/llm-json.test.ts +17 -0
  170. package/src/lib/cache.ts +0 -52
  171. package/src/resources/index.ts +0 -3
package/AGENTS.md CHANGED
@@ -2,38 +2,38 @@
 
  ## Project Structure & Module Organization
 
- - `src/` contains TypeScript source. Key entry points: `src/index.ts` and `src/server.ts`. Feature areas include `src/tools/`, `src/resources/`, `src/prompts/`, `src/schemas/`, `src/config/`, and `src/lib/`.
- - `tests/` holds Vitest suites (for example, `tests/integration.test.ts`).
- - `docs/` contains documentation and assets such as `docs/logo.png`.
- - `dist/` is generated build output; do not edit directly.
+ - `src/` holds TypeScript source. `src/index.ts` is the entry point and `src/server.ts` wires the MCP server. Subfolders include `config/`, `lib/`, `tools/`, `resources/`, `prompts/`, `schemas/`, and `types/`.
+ - `tests/` contains Vitest suites; test files use the `*.test.ts` naming pattern.
+ - `dist/` is generated build output (do not edit by hand).
+ - `docs/` stores static assets. `CONFIGURATION.md` documents runtime environment variables.
 
  ## Build, Test, and Development Commands
 
- - `npm run build`: compile TypeScript to `dist/` and mark `dist/index.js` executable.
- - `npm run dev` / `npm run dev:http`: run the server from `src/` in watch mode (stdio or HTTP).
+ - `npm run dev` / `npm run dev:http`: run from source with tsx watch (HTTP variant adds `--http`).
+ - `npm run build`: compile TypeScript into `dist/` and set executable permissions.
  - `npm run start` / `npm run start:http`: run the compiled server from `dist/`.
- - `npm run test`: run Vitest once; `npm run test:watch` for watch mode.
- - `npm run lint`: ESLint checks; `npm run format`: Prettier formatting; `npm run type-check`: TypeScript without emit.
- - `npm run inspector`: launch the MCP inspector against `dist/`.
+ - `npm run test` / `npm run test:watch`: run Vitest once or in watch mode.
+ - `npm run lint` and `npm run format`: ESLint checks and Prettier formatting.
+ - `npm run type-check`: `tsc --noEmit` for strict type validation.
 
  ## Coding Style & Naming Conventions
 
- - Formatting is enforced by Prettier: 2-space indentation, single quotes, semicolons, 80-char print width.
- - ESLint with typescript-eslint is strict: no unused imports, no `any`, prefer type-only imports, explicit return types.
- - Naming: camelCase for variables/functions, PascalCase for types/classes, UPPER_CASE for constants. Leading underscores are allowed for intentionally unused args.
+ - TypeScript, ES modules, Node >= 20.
+ - Prettier rules: 2-space indentation, single quotes, trailing commas, 80-char line width, sorted imports.
+ - ESLint is strict; avoid `any`, unused imports, and floating promises; prefer `type` imports.
+ - Naming: `camelCase` for variables/functions, `PascalCase` for types, `UPPER_CASE` for constants; leading `_` is allowed for unused args.
 
  ## Testing Guidelines
 
- - Tests use Vitest and live in `tests/` using `*.test.ts` filenames.
- - Integration tests require an API key (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, or `GOOGLE_API_KEY`) and a built `dist/` (`npm run build`). If no key is set, integration tests are skipped.
- - Keep unit tests fast; add integration coverage for new tools/resources when behavior depends on the running server.
+ - Use Vitest in the Node environment; keep tests in `tests/` and name `*.test.ts`.
+ - Favor deterministic tests and keep individual tests under the 15s timeout.
 
  ## Commit & Pull Request Guidelines
 
- - Commit subjects in this repo are short, descriptive, imperative sentences (for example, "Refactor cache configuration..."). Version bumps may be bare tags like `1.0.4`.
- - Before opening a PR, run `npm run lint`, `npm run type-check`, and `npm run test`.
- - PRs should describe the change, include test commands run, and update `README.md` or `CONFIGURATION.md` when behavior or env vars change.
+ - History favors short, imperative summaries; common pattern is `refactor: ...`, plus plain `Add ...` and version bumps like `1.0.5`.
+ - PRs should include a brief summary, tests run (for example, `npm run test`), and note any config or environment changes. Link related issues when applicable.
 
- ## Security & Configuration
+ ## Security & Configuration Tips
 
- - Runtime configuration is via environment variables documented in `CONFIGURATION.md`. Never commit API keys; prefer MCP client config or a local `.env` file.
+ - Runtime behavior is driven by environment variables; see `CONFIGURATION.md` for required keys and limits.
+ - Never commit API keys. Be cautious with `INCLUDE_ERROR_CONTEXT=true` in production.
package/CONFIGURATION.md CHANGED
@@ -1,227 +1,218 @@
  # PromptTuner MCP Configuration Guide
 
- ## Environment Variables
+ PromptTuner MCP is configured entirely via environment variables. Set them in your MCP client configuration (for example `mcp.json`, `claude_desktop_config.json`) or a `.env` file.
 
- All configuration is done through environment variables. Set them in your MCP client configuration (e.g., `mcp.json` for VS Code) or in a `.env` file.
+ ## Required configuration
 
- ### Required Configuration
+ You must pick a provider and supply its API key.
 
- | Variable | Description | Default | Example |
- | ------------------- | ----------------------- | ----------------- | ------------------------------------------------------------------------------------ |
- | `LLM_PROVIDER` | LLM provider to use | `openai` | `openai`, `anthropic`, `google` |
- | `OPENAI_API_KEY` | OpenAI API key | - | `sk-...` |
- | `ANTHROPIC_API_KEY` | Anthropic API key | - | `sk-ant-...` |
- | `GOOGLE_API_KEY` | Google Gemini API key | - | `AIzaSy...` |
- | `LLM_MODEL` | Model to use (optional) | Provider-specific | `gpt-4o`, `claude-3-5-sonnet-20241022`, `gemini-2.0-flash-exp`, `gemini-2.5-pro-exp` |
+ | Variable | Default | Description |
+ | ------------------- | -------- | --------------------------------------- |
+ | `LLM_PROVIDER` | `openai` | `openai`, `anthropic`, or `google`. |
+ | `OPENAI_API_KEY` | - | Required when `LLM_PROVIDER=openai`. |
+ | `ANTHROPIC_API_KEY` | - | Required when `LLM_PROVIDER=anthropic`. |
+ | `GOOGLE_API_KEY` | - | Required when `LLM_PROVIDER=google`. |
 
- **Note**: Only provide the API key for your chosen provider.
+ PromptTuner checks that the correct API key environment variable is set at startup. The provider will reject invalid keys at request time.
 
- ### Performance & Limits (Optional)
+ ## Provider defaults
 
- | Variable | Description | Default | Recommended Range |
- | ------------------- | ------------------------- | --------------- | ----------------------- |
- | `LLM_TIMEOUT_MS` | LLM request timeout (ms) | `60000` (1 min) | 30000-120000 |
- | `LLM_MAX_TOKENS` | Max tokens per response | `8000` | 2000-16000 (pro: 8000+) |
- | `MAX_PROMPT_LENGTH` | Max prompt length (chars) | `10000` | 5000-50000 |
- | `CACHE_MAX_SIZE` | Max cached refinements | `1000` | 500-5000 |
+ | Provider | Default model | API key env |
+ | ----------- | ---------------------------- | ------------------- |
+ | `openai` | `gpt-4o` | `OPENAI_API_KEY` |
+ | `anthropic` | `claude-3-5-sonnet-20241022` | `ANTHROPIC_API_KEY` |
+ | `google` | `gemini-2.0-flash-exp` | `GOOGLE_API_KEY` |
 
- ### Retry Configuration (Optional)
+ Set `LLM_MODEL` to override the default model for the chosen provider.
 
- | Variable | Description | Default | Recommended Range |
- | ------------------------ | ------------------------------ | ---------------- | ----------------- |
- | `RETRY_MAX_ATTEMPTS` | Max retry attempts | `3` | 1-5 |
- | `RETRY_BASE_DELAY_MS` | Initial retry delay (ms) | `1000` | 500-2000 |
- | `RETRY_MAX_DELAY_MS` | Max retry delay (ms) | `10000` | 5000-30000 |
- | `RETRY_TOTAL_TIMEOUT_MS` | Total timeout for retries (ms) | `180000` (3 min) | 60000-300000 |
+ ## Limits and timeouts (optional)
 
- ### Logging & Debugging (Optional)
+ | Variable | Default | Description |
+ | ------------------- | ------- | ------------------------------------ |
+ | `MAX_PROMPT_LENGTH` | `10000` | Max trimmed prompt length (chars). |
+ | `LLM_MAX_TOKENS` | `8000` | Upper bound for model output tokens. |
+ | `LLM_TIMEOUT_MS` | `60000` | Per-request timeout (ms). |
 
- | Variable | Description | Default | Options |
- | ----------------------- | ------------------------- | ------- | --------------- |
- | `LOG_FORMAT` | Log output format | `text` | `text`, `json` |
- | `DEBUG` | Enable debug logging | `false` | `true`, `false` |
- | `INCLUDE_ERROR_CONTEXT` | Include context in errors | `false` | `true`, `false` |
+ ### Prompt length enforcement
 
- **Security Note**: When `INCLUDE_ERROR_CONTEXT=true`, error responses may include up to 500 characters of the prompt that caused the error. Only enable this in development.
+ - Input is trimmed before validation.
+ - If raw input exceeds `MAX_PROMPT_LENGTH * 2`, it is rejected as excessive whitespace.
+ - If trimmed input exceeds `MAX_PROMPT_LENGTH`, it is rejected.
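Editor's note: the enforcement order described in these added bullets amounts to a raw-length check followed by a trimmed-length check. A minimal sketch of that logic, assuming the documented defaults; the function name and error messages below are hypothetical, not the package's actual API:

```typescript
// Hypothetical sketch of the documented length checks; not the package's actual code.
const MAX_PROMPT_LENGTH = Number(process.env.MAX_PROMPT_LENGTH ?? 10000);

function checkPromptLength(raw: string): string {
  // Raw input far beyond the limit is treated as excessive whitespace.
  if (raw.length > MAX_PROMPT_LENGTH * 2) {
    throw new Error('Prompt rejected: excessive whitespace');
  }
  const trimmed = raw.trim();
  // The trimmed prompt must still fit within MAX_PROMPT_LENGTH.
  if (trimmed.length > MAX_PROMPT_LENGTH) {
    throw new Error(`Prompt exceeds MAX_PROMPT_LENGTH (${MAX_PROMPT_LENGTH})`);
  }
  return trimmed;
}
```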
 
- ### Provider-Specific (Optional)
+ ### Tool token caps
 
- | Variable | Description | Default | Options |
- | ------------------------ | ----------------------------- | ------- | --------------- |
- | `GOOGLE_SAFETY_DISABLED` | Disable Gemini safety filters | `false` | `true`, `false` |
+ Tool max tokens are derived from `LLM_MAX_TOKENS`:
 
- ## Example Configurations
+ | Tool | Max tokens |
+ | ----------------- | --------------------------- |
+ | `analyze_prompt` | `min(LLM_MAX_TOKENS, 4000)` |
+ | `refine_prompt` | `min(LLM_MAX_TOKENS, 2000)` |
+ | `optimize_prompt` | `min(LLM_MAX_TOKENS, 3000)` |
+ | `validate_prompt` | `min(LLM_MAX_TOKENS, 1000)` |
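Editor's note: the cap derivation in the table above is just a clamp of a fixed per-tool ceiling against `LLM_MAX_TOKENS`. A rough sketch under that assumption; the constant and helper names are illustrative, not taken from the package source:

```typescript
// Illustrative only: per-tool ceilings from the table above, clamped by LLM_MAX_TOKENS.
const LLM_MAX_TOKENS = Number(process.env.LLM_MAX_TOKENS ?? 8000);

const TOOL_TOKEN_CEILINGS = {
  analyze_prompt: 4000,
  refine_prompt: 2000,
  optimize_prompt: 3000,
  validate_prompt: 1000,
} as const;

function maxTokensFor(tool: keyof typeof TOOL_TOKEN_CEILINGS): number {
  return Math.min(LLM_MAX_TOKENS, TOOL_TOKEN_CEILINGS[tool]);
}
```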
 
- ### Minimal (Production)
+ ## Retry behavior (optional)
 
- ```json
- {
- "prompttuner": {
- "command": "node",
- "args": ["/path/to/prompttuner-mcp/dist/index.js"],
- "env": {
- "LLM_PROVIDER": "openai",
- "OPENAI_API_KEY": "${input:openai-api-key}"
- }
- }
- }
- ```
+ | Variable | Default | Description |
+ | ------------------------ | -------- | ---------------------------------------------- |
+ | `RETRY_MAX_ATTEMPTS` | `3` | Max retry attempts (total attempts = max + 1). |
+ | `RETRY_BASE_DELAY_MS` | `1000` | Base delay for exponential backoff. |
+ | `RETRY_MAX_DELAY_MS` | `10000` | Max delay between retries. |
+ | `RETRY_TOTAL_TIMEOUT_MS` | `180000` | Total time allowed across retries. |
+
+ Retries use exponential backoff with jitter and stop when the total timeout is exceeded.
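Editor's note: for intuition about these retry variables, exponential backoff with jitter might be computed roughly as below. This is a sketch only; the package's `src/lib/retry.ts` may differ in details such as the jitter distribution:

```typescript
// Sketch of exponential backoff with full jitter, bounded by RETRY_MAX_DELAY_MS.
const base = Number(process.env.RETRY_BASE_DELAY_MS ?? 1000);
const maxDelay = Number(process.env.RETRY_MAX_DELAY_MS ?? 10000);

function retryDelayMs(attempt: number): number {
  // attempt: 0 for the first retry, 1 for the second, and so on.
  const exponential = Math.min(maxDelay, base * 2 ** attempt);
  return Math.random() * exponential; // full jitter
}
```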
+
+ ## Logging and error context (optional)
+
+ | Variable | Default | Description |
+ | ----------------------- | ------- | -------------------------------------------------------------- |
+ | `DEBUG` | `false` | Enables debug logging. Logs are written to stderr. |
+ | `LOG_FORMAT` | `text` | Parsed but currently unused (logging output is JSON via pino). |
+ | `INCLUDE_ERROR_CONTEXT` | `false` | Adds a sanitized prompt snippet (up to 200 chars) to errors. |
+
+ ## Provider-specific settings
+
+ | Variable | Default | Description |
+ | ------------------------ | ------- | ------------------------------------------ |
+ | `GOOGLE_SAFETY_DISABLED` | `false` | When true, disables Gemini safety filters. |
+
+ ## validate_prompt token limits
 
- ### Performance Tuned
+ `validate_prompt` uses fixed limits when calculating `tokenUtilization`:
+
+ | targetModel | Token limit |
+ | ----------- | ----------- |
+ | `claude` | `200000` |
+ | `gpt` | `128000` |
+ | `gemini` | `1000000` |
+ | `generic` | `8000` |
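Editor's note: `tokenUtilization` is presumably the prompt's estimated token count divided by the fixed limit for the target model. A sketch assuming a simple chars/4 token estimate; both the estimator and the helper name are assumptions, not documented behavior of the package:

```typescript
// Assumption-laden sketch: rough token estimate (~4 chars per token) against fixed limits.
const MODEL_TOKEN_LIMITS = {
  claude: 200_000,
  gpt: 128_000,
  gemini: 1_000_000,
  generic: 8_000,
} as const;

function tokenUtilization(
  prompt: string,
  targetModel: keyof typeof MODEL_TOKEN_LIMITS,
): number {
  const estimatedTokens = Math.ceil(prompt.length / 4);
  return estimatedTokens / MODEL_TOKEN_LIMITS[targetModel];
}
```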
+
+ ## Example configurations
+
+ ### Minimal (npx)
 
  ```json
  {
- "prompttuner": {
- "command": "node",
- "args": ["/path/to/prompttuner-mcp/dist/index.js"],
- "env": {
- "LLM_PROVIDER": "anthropic",
- "ANTHROPIC_API_KEY": "${input:anthropic-api-key}",
- "LLM_MODEL": "claude-3-5-sonnet-20241022",
- "LLM_TIMEOUT_MS": "90000",
- "LLM_MAX_TOKENS": "8000",
- "CACHE_MAX_SIZE": "2000",
- "RETRY_MAX_ATTEMPTS": "5"
+ "mcpServers": {
+ "prompttuner": {
+ "command": "npx",
+ "args": ["-y", "@j0hanz/prompt-tuner-mcp-server@latest"],
+ "env": {
+ "LLM_PROVIDER": "openai",
+ "OPENAI_API_KEY": "${input:openai-api-key}"
+ }
  }
  }
  }
  ```
 
- ### Pro Models (Gemini 2.5 Pro, GPT-4o)
+ ### From source (dist build)
 
  ```json
  {
- "prompttuner": {
- "command": "node",
- "args": ["/path/to/prompttuner-mcp/dist/index.js"],
- "env": {
- "LLM_PROVIDER": "google",
- "GOOGLE_API_KEY": "${input:google-api-key}",
- "LLM_MODEL": "gemini-2.5-pro-exp",
- "LLM_TIMEOUT_MS": "120000",
- "LLM_MAX_TOKENS": "16000",
- "CACHE_MAX_SIZE": "3000"
+ "mcpServers": {
+ "prompttuner": {
+ "command": "node",
+ "args": ["/path/to/prompttuner-mcp/dist/index.js"],
+ "env": {
+ "LLM_PROVIDER": "anthropic",
+ "ANTHROPIC_API_KEY": "${input:anthropic-api-key}"
+ }
  }
  }
  }
  ```
 
- ### Development/Debug
+ ### Performance tuned
 
  ```json
  {
- "prompttuner": {
- "command": "node",
- "args": ["/path/to/prompttuner-mcp/dist/index.js"],
- "env": {
- "LLM_PROVIDER": "google",
- "GOOGLE_API_KEY": "${input:google-api-key}",
- "LLM_MODEL": "gemini-2.0-flash-exp",
- "LOG_FORMAT": "json",
- "DEBUG": "true",
- "INCLUDE_ERROR_CONTEXT": "true"
+ "mcpServers": {
+ "prompttuner": {
+ "command": "node",
+ "args": ["/path/to/prompttuner-mcp/dist/index.js"],
+ "env": {
+ "LLM_PROVIDER": "anthropic",
+ "ANTHROPIC_API_KEY": "${input:anthropic-api-key}",
+ "LLM_MODEL": "claude-3-5-sonnet-20241022",
+ "LLM_TIMEOUT_MS": "90000",
+ "LLM_MAX_TOKENS": "8000",
+ "RETRY_MAX_ATTEMPTS": "5"
+ }
  }
  }
  }
  ```
 
- ### High Volume / Low Latency
+ ### High volume / low latency
 
  ```json
  {
- "prompttuner": {
- "command": "node",
- "args": ["/path/to/prompttuner-mcp/dist/index.js"],
- "env": {
- "LLM_PROVIDER": "openai",
- "OPENAI_API_KEY": "${input:openai-api-key}",
- "LLM_MODEL": "gpt-4o-mini",
- "LLM_TIMEOUT_MS": "30000",
- "LLM_MAX_TOKENS": "1500",
- "CACHE_MAX_SIZE": "5000",
- "RETRY_MAX_ATTEMPTS": "2",
- "RETRY_BASE_DELAY_MS": "500"
+ "mcpServers": {
+ "prompttuner": {
+ "command": "node",
+ "args": ["/path/to/prompttuner-mcp/dist/index.js"],
+ "env": {
+ "LLM_PROVIDER": "openai",
+ "OPENAI_API_KEY": "${input:openai-api-key}",
+ "LLM_MODEL": "gpt-4o-mini",
+ "LLM_TIMEOUT_MS": "30000",
+ "LLM_MAX_TOKENS": "1500",
+ "RETRY_MAX_ATTEMPTS": "2",
+ "RETRY_BASE_DELAY_MS": "500"
+ }
  }
  }
  }
  ```
 
- ## What's NOT Configurable (and Why)
-
- The following are intentionally hardcoded for stability and optimal performance:
-
- ### Scoring Algorithm
-
- - **Scoring dimension weights** (clarity: 0.25, specificity: 0.25, etc.)
- - **Reason**: Carefully tuned based on prompt engineering research
+ ## What is not configurable
 
- ### Analysis Constants
+ The following behaviors are hardcoded for stability:
 
- - **Pattern matching regex** for detecting prompt characteristics
- - **Reason**: Complex regex patterns that work across all use cases
+ - Scoring weights: clarity 0.25, specificity 0.25, completeness 0.2, structure 0.15, effectiveness 0.15.
+ - Prompt format detection patterns and scoring heuristics.
+ - OpenAI temperature (0.7). Other providers use SDK defaults.
+ - LLM response length cap (500000 chars) and JSON parsing safeguards.
+ - Error context truncation length (200 chars when enabled).
 
- ### LLM Behavior
+ ## Migration notes (older configs)
 
- - **Temperature** (0.7 for refinement tasks)
- - **Reason**: Optimal balance between creativity and consistency for prompt refinement
+ If you have an old `.env` file, remove unused settings:
 
- ### Internal Limits
-
- - **Analysis max tokens** (1500)
- - **Analysis timeout** (60000ms)
- - **Max LLM response length** (500,000 chars)
- - **Error context truncation** (500 chars)
- - **Reason**: Safety constraints to prevent resource exhaustion
-
- ## Migration from Old Configuration
-
- If you have an old `.env` file with these variables, **remove them** (they are not used):
-
- - ❌ `PORT` / `HOST` - HTTP mode not fully implemented in stdio version
- - ❌ `API_KEY` - No API authentication in current version
- - ❌ `CORS_ORIGIN` - No HTTP CORS in stdio version
- - ❌ `LOG_LEVEL` - Use `DEBUG=true/false` instead
- - ❌ `RATE_LIMIT` / `RATE_WINDOW_MS` - No rate limiting in current version
- - ❌ `REDIS_URL` / `CACHE_TTL` - In-memory cache only
- - ❌ `CIRCUIT_BREAKER_*` - Not implemented
- - ❌ `NODE_ENV` - Not used for configuration
- - ❌ `SESSION_TIMEOUT_MS` - No session management
+ - `PORT`, `HOST`, `CORS_ORIGIN` (stdio transport only; `--http` is reserved).
+ - `API_KEY` (no server-level auth).
+ - `LOG_LEVEL` (use `DEBUG=true` or false).
+ - `RATE_LIMIT`, `RATE_WINDOW_MS` (no server-side rate limiting).
+ - `REDIS_URL`, `CACHE_TTL` (no caching).
+ - `CIRCUIT_BREAKER_*` (not implemented).
+ - `NODE_ENV` (not used for configuration).
+ - `SESSION_TIMEOUT_MS` (no session management).
 
  ## Troubleshooting
 
- ### High Memory Usage
-
- - Reduce `CACHE_MAX_SIZE` (e.g., `500`)
- - Reduce `MAX_PROMPT_LENGTH` (e.g., `5000`)
-
- ### Timeout Errors
+ ### Prompt rejected
 
- - Increase `LLM_TIMEOUT_MS` (e.g., `90000`)
- - Increase `RETRY_TOTAL_TIMEOUT_MS` (e.g., `300000`)
- - Reduce `LLM_MAX_TOKENS` (e.g., `1500`)
+ - Reduce `MAX_PROMPT_LENGTH` or trim the input to remove excessive whitespace.
 
- ### Rate Limit Errors
+ ### Timeout errors
 
- - Increase `RETRY_BASE_DELAY_MS` (e.g., `2000`)
- - Increase `RETRY_MAX_ATTEMPTS` (e.g., `5`)
+ - Increase `LLM_TIMEOUT_MS` or `RETRY_TOTAL_TIMEOUT_MS`.
+ - Reduce `LLM_MAX_TOKENS`.
 
- ### Slow Performance
+ ### Rate limit errors
 
- - Increase `CACHE_MAX_SIZE` (e.g., `2000`)
- - Use faster model (e.g., `gpt-4o-mini` or `gemini-2.0-flash-exp`)
- - Reduce `LLM_MAX_TOKENS` (e.g., `1500`)
+ - Increase `RETRY_BASE_DELAY_MS` or `RETRY_MAX_ATTEMPTS`.
+ - Reduce request frequency.
 
- ### Debug Logging Not Showing
+ ### Slow performance
 
- - Set `DEBUG=true`
- - Check logs are going to stderr (where MCP logs are captured)
+ - Use a faster model (for example `gpt-4o-mini` or `gemini-2.0-flash-exp`).
+ - Reduce `LLM_MAX_TOKENS`.
 
- ## Best Practices
+ ## Best practices
 
- 1. **Always set only one API key** - Only configure the key for your chosen provider
- 2. **Use input variables for secrets** - In mcp.json: `"OPENAI_API_KEY": "${input:openai-api-key}"`
- 3. **Start with defaults** - Only override what you need
- 4. **Enable debug logging temporarily** - `DEBUG=true` for troubleshooting only
- 5. **Monitor cache hit rate** - Check logs for "Cache hit for refinement" messages
- 6. **Test timeout settings** - Start conservative, increase if seeing timeout errors
- 7. **Use JSON logging in production** - `LOG_FORMAT=json` for easier parsing
+ 1. Configure only the API key for your chosen provider.
+ 2. Use input variables for secrets (for example `"OPENAI_API_KEY": "${input:openai-api-key}"`).
+ 3. Start with defaults and tune only when needed.
+ 4. Enable `DEBUG=true` temporarily for troubleshooting.
+ 5. Prefer JSON logging in production (current output is JSON via pino).