npm - @kairos-sdk/core - Versions diffs - 0.3.2 → 0.4.5 - Mend

@kairos-sdk/core 0.3.2 → 0.4.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (35) hide show

package/README.md +46 -11
package/dist/{chunk-KQSNT3HZ.js → chunk-4TS6GW6O.js} +148 -368
package/dist/chunk-4TS6GW6O.js.map +1 -0
package/dist/chunk-6CLI43FI.js +315 -0
package/dist/chunk-6CLI43FI.js.map +1 -0
package/dist/chunk-6FOFWVMG.js +1 -0
package/dist/chunk-6FOFWVMG.js.map +1 -0
package/dist/{chunk-RYGYNOR6.js → chunk-6IXW3WCC.js} +936 -412
package/dist/chunk-6IXW3WCC.js.map +1 -0
package/dist/chunk-CR2NHLOH.js +523 -0
package/dist/chunk-CR2NHLOH.js.map +1 -0
package/dist/cli.cjs +1402 -170
package/dist/cli.cjs.map +1 -1
package/dist/cli.js +140 -10
package/dist/cli.js.map +1 -1
package/dist/index.cjs +1262 -156
package/dist/index.cjs.map +1 -1
package/dist/index.d.cts +5 -537
package/dist/index.d.ts +5 -537
package/dist/index.js +8 -4
package/dist/mcp-server.cjs +1259 -129
package/dist/mcp-server.cjs.map +1 -1
package/dist/mcp-server.js +113 -8
package/dist/mcp-server.js.map +1 -1
package/dist/reader-CpUcHhKW.d.cts +566 -0
package/dist/reader-CpUcHhKW.d.ts +566 -0
package/dist/standalone.cjs +2460 -0
package/dist/standalone.cjs.map +1 -0
package/dist/standalone.d.cts +105 -0
package/dist/standalone.d.ts +105 -0
package/dist/standalone.js +58 -0
package/dist/standalone.js.map +1 -0
package/package.json +6 -1
package/dist/chunk-KQSNT3HZ.js.map +0 -1
package/dist/chunk-RYGYNOR6.js.map +0 -1

package/README.md CHANGED Viewed

@@ -8,7 +8,7 @@
 ![Kairos SDK Demo](demo.gif)
-Kairos turns plain-English workflow descriptions into validated, deployable n8n workflow JSON. Use it as an **MCP server** (connect to Claude Code, Claude Desktop, or any MCP host — your LLM generates, Kairos validates and deploys, no Anthropic API key needed) or as a **TypeScript SDK** for programmatic control (calls Claude internally with a specialized prompt). Either way, workflows pass through a **23-rule structural validator** with automatic correction, and a local workflow library with **hybrid retrieval** (TF-IDF + node fingerprinting + outcome history + cluster reranking) injects past failure patterns into future generations. With a seeded template library, Kairos achieves **100% first-try structural validation pass rate** across 20 benchmark prompts (meaning the generated JSON is structurally valid on the first attempt — runtime behavior depends on your credentials and node configuration).
+Kairos turns plain-English workflow descriptions into validated, deployable n8n workflow JSON. Use it as an **MCP server** (connect to Claude Code, Claude Desktop, or any MCP host — your LLM generates, Kairos validates and deploys, no Anthropic API key needed) or as a **TypeScript SDK** for programmatic control (calls Claude internally with a specialized prompt). Either way, workflows pass through a **26-rule structural validator** with automatic correction, and a local workflow library with **hybrid retrieval** (TF-IDF + node fingerprinting + outcome history + cluster reranking) injects past failure patterns into future generations. With a seeded template library, Kairos achieves **100% first-try structural validation pass rate** across 20 benchmark prompts (meaning the generated JSON is structurally valid on the first attempt — runtime behavior depends on your credentials and node configuration).
 ```ts
 import { Kairos } from '@kairos-sdk/core'
@@ -27,11 +27,21 @@ console.log(result.workflowId)      // deployed workflow ID
 console.log(result.credentialsNeeded) // what the user still needs to configure
 ```
+### What Kairos does and does not do
+| Kairos does | Kairos does not guarantee (yet) |
+|---|---|
+| Generates valid n8n workflow JSON | Perfect business logic |
+| Validates structure before deploy (23 rules) | Correct credentials |
+| Syncs node types from your live instance | Runtime success for every API |
+| Learns from prior successful builds | That every workflow matches intent perfectly |
+| Works through MCP, SDK, or CLI | Full replacement for human review |
 ---
 ## Use as MCP Server (no code required)
-Connect Kairos to any MCP-compatible host — Claude Code, Claude Desktop, ChatGPT, Cursor, or any agent that supports the Model Context Protocol. Your host LLM generates the workflow using Kairos's specialized context, then Kairos validates and deploys it. No Anthropic API key needed — no double-LLM calls, no wasted tokens. Kairos auto-syncs your n8n instance's node types so the catalog always matches your exact setup.
+Connect Kairos to any MCP-compatible host — Claude Code, Claude Desktop, Cursor, or any agent that supports the Model Context Protocol. Your host LLM generates the workflow using Kairos's specialized context, then Kairos validates and deploys it. No Anthropic API key needed — no double-LLM calls, no wasted tokens. Kairos auto-syncs your n8n instance's node types so the catalog always matches your exact setup.
 ### Setup
@@ -85,7 +95,7 @@ The MCP server does **not** call an LLM internally. Instead, it gives your host
 1. **Host LLM calls `kairos_prompt`** — gets the n8n system prompt, node catalog, library matches, and failure patterns
 2. **Host LLM generates the workflow JSON** using that context (no separate API call)
-3. **Host LLM calls `kairos_validate`** — checks the JSON against 23 structural rules
+3. **Host LLM calls `kairos_validate`** — checks the JSON against 26 structural rules
 4. If invalid, the host LLM fixes the issues and validates again
 5. **Host LLM calls `kairos_deploy`** — sends the validated workflow to n8n
@@ -98,7 +108,7 @@ This means Kairos works with **any LLM** — Claude, GPT, Gemini, Llama, or anyt
 | Tool | Description |
 |------|-------------|
 | `kairos_prompt` | Returns the specialized system prompt, node catalog, library matches, and failure patterns for a given description |
-| `kairos_validate` | Validates workflow JSON against 23 structural rules — returns errors and warnings |
+| `kairos_validate` | Validates workflow JSON against 26 structural rules — returns errors and warnings |
 | `kairos_search` | Searches the local workflow library for similar past builds |
 | `kairos_sync` | Manually refresh the node catalog from your n8n instance (auto-runs on first `kairos_prompt` call) |
@@ -170,7 +180,7 @@ console.log(deployed.workflowId) // now live in n8n
 ## Benchmark Results
-Tested against 20 workflow prompts of varying complexity (simple triggers, multi-step conditional logic, AI agents with memory). Results measure **structural validation pass rate** — whether the generated workflow passes all 23 validator rules, not end-to-end execution correctness.
+Tested against 20 workflow prompts of varying complexity (simple triggers, multi-step conditional logic, AI agents with memory). Results measure **structural validation pass rate** — whether the generated workflow passes all 26 validator rules, not end-to-end execution correctness.
 ### Before vs After: Template-Seeded Library
@@ -182,7 +192,7 @@ Tested against 20 workflow prompts of varying complexity (simple triggers, multi
 | Avg generation time | 30.6s | **20.7s** | -32% |
 | Failures | 0 | 0 | — |
-The baseline run used Claude with the 22-rule validator and correction loop but no library. The seeded run used the same validator plus a library of 105 workflows (16 organic + 89 ingested from the n8n community). Template seeding eliminated the correction loop entirely and cut generation time by a third.
+The baseline run used Claude with the 26-rule validator and correction loop but no library. The seeded run used the same validator plus a library of 105 workflows (16 organic + 89 ingested from the n8n community). The broader local development library now contains 286+ generated/ingested workflows. Template seeding eliminated the correction loop entirely and cut generation time by a third.
 > **Note:** These results confirm that generated workflows are structurally valid and deployable to n8n. They do not verify runtime execution correctness, credential configuration, or whether the workflow output matches user intent.
@@ -195,7 +205,7 @@ The baseline run used Claude with the 22-rule validator and correction loop but
 1. **Search** — Kairos searches its local workflow library for similar past builds. Matching workflows and their failure patterns are pulled into context.
 2. **Warn** — Known failure patterns (from library matches and global telemetry rates) are injected into the system prompt so Claude avoids repeating known mistakes.
 3. **Generate** — Your description is sent to Claude with a detailed system prompt, forcing a `generate_workflow` tool call that produces structured n8n workflow JSON.
-4. **Validate** — The workflow is checked against **23 structural rules** covering node IDs, types, versions, names, positions, connections, forbidden fields, trigger presence, AI connection direction, cycle detection, webhook pairing, and required parameters.
+4. **Validate** — The workflow is checked against **26 structural rules** covering node IDs, types, versions, names, positions, connections, forbidden fields, trigger presence, AI connection direction, cycle detection, webhook pairing, and required parameters.
 5. **Correct** — If validation fails, the specific rule violations are sent back to Claude for correction (up to 3 attempts, with tighter temperature on the final try).
 6. **Strip** — Forbidden server-assigned fields (`id`, `createdAt`, `updatedAt`, etc.) are stripped before deployment.
 7. **Deploy** — The validated workflow is posted to your n8n instance via REST API.
@@ -205,7 +215,7 @@ The baseline run used Claude with the 22-rule validator and correction loop but
 1. **Prompt** — Your LLM calls `kairos_prompt`, which searches the library and returns the specialized system prompt, node catalog, library matches, and failure patterns.
 2. **Generate** — Your LLM generates the workflow JSON itself using that context. No separate API call.
-3. **Validate** — Your LLM calls `kairos_validate`, which checks the JSON against the same 23 structural rules.
+3. **Validate** — Your LLM calls `kairos_validate`, which checks the JSON against the same 26 structural rules.
 4. **Correct** — If validation fails, your LLM fixes the issues and calls `kairos_validate` again.
 5. **Deploy** — Your LLM calls `kairos_deploy`, which strips forbidden fields and posts the workflow to n8n.
 6. **Record** — The deployed workflow is saved to the local library for future retrieval.
@@ -214,7 +224,7 @@ The baseline run used Claude with the 22-rule validator and correction loop but
 ## Validator Rules
-The 22-rule validator is the core of what makes Kairos reliable. In baseline testing (no library), Claude needed the correction loop 45% of the time. Each rule targets a specific class of error:
+The 26-rule validator is the core of what makes Kairos reliable. In baseline testing (no library), Claude needed the correction loop 45% of the time. Each rule targets a specific class of error:
 | Rule | Severity | What it checks |
 |------|----------|----------------|
@@ -241,6 +251,9 @@ The 22-rule validator is the core of what makes Kairos reliable. In baseline tes
 | 21 | warn | Webhook with responseMode="responseNode" has respondToWebhook |
 | 22 | warn | Required parameters present for known node types |
 | 23 | warn | Node type is recognized in the registry (unknown types may not exist in n8n) |
+| 24 | warn | No deprecated `$node["..."]` accessor syntax in expressions |
+| 25 | warn | No `$json.items[n]` array access (n8n flattens items automatically) |
+| 26 | warn | Node references use `.first()` or `.all()` (bare `$('Node').json` throws at runtime) |
 Errors block deployment. Warnings are recorded and fed back into the prompt for future builds.
@@ -359,6 +372,9 @@ try {
     for (const issue of err.issues) {
       console.error(`[Rule ${issue.rule}] ${issue.message}`)
     }
+    // Attempt metadata and warned rules are also available
+    console.log(err.attemptMetadata)  // per-attempt timing, tokens, issues
+    console.log(err.warnedRules)      // which pattern rules were warned about
   } else if (err instanceof GenerationError) {
     // Anthropic API call failed (auth, quota, timeout)
     console.error(err.message, err.cause)
@@ -376,7 +392,7 @@ try {
 |---|---|
 | `GenerationError` | Anthropic API call failed |
 | `ResponseParseError` | Claude responded but produced no usable tool call |
-| `ValidationError` | Workflow failed 22-rule validation after max retries |
+| `ValidationError` | Workflow failed 26-rule validation after max retries (carries `.attemptMetadata` and `.warnedRules`) |
 | `ProviderError` | Network/auth failure talking to n8n |
 | `ApiError` | n8n returned a 4xx or 5xx (carries `.statusCode`) |
 | `GuardError` | Input validation failed (empty description) or `delete()` called without `{ confirm: true }` |
@@ -397,6 +413,10 @@ kairos build "Monitor a webhook and log payloads" --dry-run
 # Seed library with n8n community templates
 kairos sync-templates --max 200
+# View pattern analysis
+kairos patterns
+kairos patterns --days 60 --json
 # Manage workflows
 kairos list
 kairos get <workflow-id>
@@ -438,7 +458,22 @@ telemetry: '/path/to/telemetry/dir'
 Each event includes timestamp, session ID, token counts, validation issues, and duration — useful for benchmarking and analyzing the correction loop.
-Kairos also reads telemetry data to compute **per-rule failure rates** across all builds. Rules that fail frequently (>= 15% of builds) are automatically surfaced as warnings in the generation prompt, helping Claude avoid systemic issues. Failure rates use distinct session counting to avoid inflation from retry loops, and results are cached for 5 minutes.
+### Pattern Learning
+When telemetry is enabled, Kairos runs a **pattern analyzer** that learns from every build — successes and failures. The analyzer produces `patterns.json` which is fed back into future generations:
+- **Composite scoring** — patterns are scored using `rawConfidence × impact × recency × (1 + stickinessBoost)`, so frequent, recent, sticky failures rank highest
+- **Stickiness detection** — rules that persist across consecutive failed retry attempts (the LLM can't self-correct) get a scoring boost
+- **State lifecycle** — patterns progress through `draft → confirmed → resolved`, with per-rule resolved thresholds (5 clean builds) and 90-day TTL on resolved patterns
+- **Regression detection** — if a resolved rule starts failing again, it's flagged as regressed and prioritized in the prompt
+- **Warning effectiveness** — tracks whether warning the LLM about a rule actually prevented the failure, with per-rule pass/fail rates
+- **Schema migration** — pattern data auto-migrates across versions (currently v2) so no accumulated knowledge is lost on upgrades
+- **Rule co-occurrence** — identifies pairs of rules that commonly fail together (e.g., rules 5+17 always break at the same time)
+- **Session depth analysis** — tracks how many attempts each session needed (e.g., 80% are 1-attempt, 15% need 2, 5% need all 3)
+- **Warning cap** — max 10 patterns in the LLM prompt, prioritized: regressed > confirmed > drafts
+- **Analysis history** — each analysis run appends a summary to `pattern-history.jsonl` for trend tracking over time
+Run `kairos patterns` to view the current analysis, or `kairos patterns --json` for raw output.
 For CLI usage, set `KAIROS_TELEMETRY=true` in your environment.