thumbgate 1.14.1 → 1.16.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (150) hide show
  1. package/.claude-plugin/marketplace.json +6 -6
  2. package/.claude-plugin/plugin.json +3 -3
  3. package/.well-known/llms.txt +5 -5
  4. package/.well-known/mcp/server-card.json +1 -1
  5. package/README.md +60 -35
  6. package/adapters/chatgpt/openapi.yaml +118 -2
  7. package/adapters/claude/.mcp.json +2 -2
  8. package/adapters/mcp/server-stdio.js +217 -84
  9. package/adapters/opencode/opencode.json +1 -1
  10. package/bench/prompt-eval-suite.json +5 -1
  11. package/bin/cli.js +211 -8
  12. package/config/enforcement.json +59 -7
  13. package/config/evals/agent-safety-eval.json +338 -22
  14. package/config/gates/default.json +33 -0
  15. package/config/gates/routine.json +43 -0
  16. package/config/github-about.json +3 -3
  17. package/config/mcp-allowlists.json +4 -0
  18. package/config/merge-quality-checks.json +2 -1
  19. package/config/model-candidates.json +131 -0
  20. package/openapi/openapi.yaml +118 -2
  21. package/package.json +70 -51
  22. package/public/blog.html +7 -7
  23. package/public/codex-plugin.html +13 -7
  24. package/public/compare.html +29 -23
  25. package/public/dashboard.html +105 -12
  26. package/public/guide.html +28 -28
  27. package/public/index.html +233 -97
  28. package/public/learn.html +87 -20
  29. package/public/lessons.html +26 -2
  30. package/public/numbers.html +271 -0
  31. package/public/pro.html +89 -19
  32. package/scripts/agent-audit-trace.js +55 -0
  33. package/scripts/agent-memory-lifecycle.js +96 -0
  34. package/scripts/agent-readiness-plan.js +118 -0
  35. package/scripts/agentic-data-pipeline.js +21 -1
  36. package/scripts/agents-sdk-sandbox-plan.js +57 -0
  37. package/scripts/ai-org-governance.js +98 -0
  38. package/scripts/ai-search-distribution.js +43 -0
  39. package/scripts/artifact-agent-plan.js +81 -0
  40. package/scripts/billing.js +27 -8
  41. package/scripts/cli-feedback.js +2 -1
  42. package/scripts/cli-schema.js +60 -5
  43. package/scripts/code-mode-mcp-plan.js +71 -0
  44. package/scripts/commercial-offer.js +1 -1
  45. package/scripts/context-engine.js +1 -2
  46. package/scripts/context-manager.js +4 -1
  47. package/scripts/contextfs.js +214 -32
  48. package/scripts/dashboard-render-spec.js +1 -1
  49. package/scripts/dashboard.js +275 -9
  50. package/scripts/decision-journal.js +13 -3
  51. package/scripts/document-workflow-governance.js +62 -0
  52. package/scripts/enterprise-agent-rollout.js +34 -0
  53. package/scripts/experience-replay-governance.js +69 -0
  54. package/scripts/export-hf-dataset.js +1 -1
  55. package/scripts/feedback-loop.js +141 -9
  56. package/scripts/feedback-to-rules.js +17 -23
  57. package/scripts/gates-engine.js +4 -6
  58. package/scripts/growth-campaigns.js +49 -0
  59. package/scripts/harness-selector.js +145 -1
  60. package/scripts/hybrid-supervisor-agent.js +64 -0
  61. package/scripts/inference-cache-policy.js +72 -0
  62. package/scripts/inference-economics.js +53 -0
  63. package/scripts/internal-agent-bootstrap.js +12 -2
  64. package/scripts/knowledge-layer-plan.js +108 -0
  65. package/scripts/lesson-canonical.js +181 -0
  66. package/scripts/lesson-db.js +71 -10
  67. package/scripts/lesson-inference.js +183 -44
  68. package/scripts/lesson-search.js +4 -1
  69. package/scripts/lesson-synthesis.js +23 -2
  70. package/scripts/llm-client.js +157 -26
  71. package/scripts/mailer/resend-mailer.js +112 -1
  72. package/scripts/mcp-transport-strategy.js +66 -0
  73. package/scripts/memory-store-governance.js +60 -0
  74. package/scripts/meta-agent-loop.js +7 -13
  75. package/scripts/model-access-eligibility.js +38 -0
  76. package/scripts/model-migration-readiness.js +55 -0
  77. package/scripts/native-messaging-audit.js +514 -0
  78. package/scripts/operational-integrity.js +96 -3
  79. package/scripts/otel-declarative-config.js +56 -0
  80. package/scripts/perplexity-client.js +1 -1
  81. package/scripts/post-training-governance.js +34 -0
  82. package/scripts/pr-manager.js +47 -7
  83. package/scripts/private-core-boundary.js +72 -0
  84. package/scripts/production-agent-readiness.js +40 -0
  85. package/scripts/profile-router.js +16 -1
  86. package/scripts/prompt-eval.js +564 -32
  87. package/scripts/prompt-programs.js +93 -0
  88. package/scripts/provider-action-normalizer.js +585 -0
  89. package/scripts/rule-validator.js +285 -0
  90. package/scripts/scaling-law-claims.js +60 -0
  91. package/scripts/security-scanner.js +1 -1
  92. package/scripts/self-distill-agent.js +7 -32
  93. package/scripts/seo-gsd.js +400 -43
  94. package/scripts/skill-rag-router.js +53 -0
  95. package/scripts/spec-gate.js +1 -1
  96. package/scripts/student-consistent-training.js +73 -0
  97. package/scripts/synthetic-data-provenance.js +98 -0
  98. package/scripts/task-context-result.js +81 -0
  99. package/scripts/telemetry-analytics.js +149 -0
  100. package/scripts/thompson-sampling.js +2 -2
  101. package/scripts/token-savings.js +7 -6
  102. package/scripts/token-tco.js +46 -0
  103. package/scripts/tool-registry.js +75 -3
  104. package/scripts/verification-loop.js +10 -1
  105. package/scripts/verifier-scoring.js +71 -0
  106. package/scripts/workflow-sentinel.js +284 -28
  107. package/scripts/workspace-agent-routines.js +118 -0
  108. package/skills/thumbgate/SKILL.md +1 -1
  109. package/src/api/server.js +434 -120
  110. package/.claude-plugin/README.md +0 -170
  111. package/adapters/README.md +0 -12
  112. package/scripts/analytics-report.js +0 -328
  113. package/scripts/autonomous-workflow.js +0 -377
  114. package/scripts/billing-setup.js +0 -109
  115. package/scripts/creator-campaigns.js +0 -239
  116. package/scripts/cross-encoder-reranker.js +0 -235
  117. package/scripts/daemon-manager.js +0 -108
  118. package/scripts/decision-trace.js +0 -354
  119. package/scripts/delegation-runtime.js +0 -896
  120. package/scripts/dispatch-brief.js +0 -159
  121. package/scripts/distribution-surfaces.js +0 -110
  122. package/scripts/feedback-history-distiller.js +0 -382
  123. package/scripts/funnel-analytics.js +0 -35
  124. package/scripts/history-distiller.js +0 -200
  125. package/scripts/hosted-job-launcher.js +0 -256
  126. package/scripts/intent-router.js +0 -392
  127. package/scripts/lesson-reranker.js +0 -263
  128. package/scripts/lesson-retrieval.js +0 -148
  129. package/scripts/managed-lesson-agent.js +0 -183
  130. package/scripts/operational-dashboard.js +0 -103
  131. package/scripts/operational-summary.js +0 -129
  132. package/scripts/operator-artifacts.js +0 -608
  133. package/scripts/optimize-context.js +0 -17
  134. package/scripts/org-dashboard.js +0 -206
  135. package/scripts/partner-orchestration.js +0 -146
  136. package/scripts/predictive-insights.js +0 -356
  137. package/scripts/pulse.js +0 -80
  138. package/scripts/reflector-agent.js +0 -221
  139. package/scripts/sales-pipeline.js +0 -681
  140. package/scripts/session-episode-store.js +0 -329
  141. package/scripts/session-health-sensor.js +0 -242
  142. package/scripts/session-report.js +0 -120
  143. package/scripts/swarm-coordinator.js +0 -81
  144. package/scripts/tool-kpi-tracker.js +0 -12
  145. package/scripts/webhook-delivery.js +0 -62
  146. package/scripts/workflow-sprint-intake.js +0 -475
  147. package/skills/agent-memory/SKILL.md +0 -97
  148. package/skills/solve-architecture-autonomy/SKILL.md +0 -17
  149. package/skills/solve-architecture-autonomy/tool.js +0 -33
  150. package/skills/thumbgate-feedback/SKILL.md +0 -49
@@ -1,97 +0,0 @@
1
- ---
2
- name: Agent Memory
3
- description: Recall past mistakes and capture feedback so your agent stops repeating errors. Works locally via MCP server — no API key needed.
4
- ---
5
-
6
- # Agent Memory
7
-
8
- Give your agent persistent memory across sessions. Before starting any task, recall what went wrong last time. After completing work, capture whether it succeeded or failed. Prevention rules are auto-generated from repeated mistakes.
9
-
10
- ## Setup
11
-
12
- Add the MCP server (one-time):
13
-
14
- ```bash
15
- claude mcp add thumbgate -- npx -y thumbgate serve
16
- ```
17
-
18
- No API key needed. All data stays local.
19
-
20
- ## When to Use
21
-
22
- - Starting a new task or session — recall past context first
23
- - After completing work that succeeded or failed — capture feedback
24
- - When the agent keeps making the same mistake — check prevention rules
25
-
26
- ## Workflow
27
-
28
- ### Step 1: Recall past context (do this FIRST on every task)
29
-
30
- Call the `recall` MCP tool with a description of your current task. The tool returns:
31
-
32
- - Past feedback relevant to this task (vector similarity search)
33
- - Active prevention rules (auto-generated from repeated failures)
34
- - Recent feedback summary with approval rate
35
-
36
- Read the prevention rules carefully. These are patterns that failed before — follow them.
37
-
38
- ### Step 2: Do your work
39
-
40
- Complete the task as normal. Keep track of what you did and whether it worked.
41
-
42
- ### Step 3: Capture feedback
43
-
44
- Call the `capture_feedback` MCP tool:
45
-
46
- **If succeeded:**
47
- - signal: `up`
48
- - context: What worked and why
49
- - tags: Category labels
50
-
51
- **If failed:**
52
- - signal: `down`
53
- - context: What you were trying to do
54
- - whatWentWrong: Specific failure description
55
- - whatToChange: How to avoid this next time
56
- - tags: Category labels
57
-
58
- Vague feedback like "it failed" will be rejected. Be specific.
59
-
60
- ### Step 4: Check improvement (optional)
61
-
62
- Call the `feedback_stats` MCP tool to see approval rate, top failure domains, and whether the agent is trending better or worse.
63
-
64
- ## Available MCP Tools
65
-
66
- | Tool | What it does |
67
- |------|-------------|
68
- | `recall` | Search past feedback and prevention rules for current task |
69
- | `capture_feedback` | Record what worked or failed with structured context |
70
- | `prevention_rules` | View auto-generated rules from repeated mistakes |
71
- | `feedback_stats` | Approval rate, trend analysis, top failure domains |
72
- | `feedback_summary` | Human-readable summary of recent signals |
73
-
74
- ## MCP Profiles
75
-
76
- | Profile | Tools | Use case |
77
- |---------|-------|----------|
78
- | `essential` | 5 core tools | Default — start here |
79
- | `commerce` | 6 tools + commerce_recall | Agentic commerce agents |
80
- | `default` | 12 tools | Full pipeline including DPO export |
81
-
82
- Set profile: `THUMBGATE_MCP_PROFILE=essential npx thumbgate serve`
83
-
84
- ## How Prevention Rules Work
85
-
86
- 1. Agent makes mistake A → you capture `down` feedback
87
- 2. Agent makes mistake A again → you capture `down` feedback again
88
- 3. System detects pattern → auto-generates prevention rule: "NEVER do A"
89
- 4. Next session → `recall` returns the rule → agent follows it
90
-
91
- This is the core value. The agent doesn't learn — but it reads the rules and follows them.
92
-
93
- ## Links
94
-
95
- - [GitHub](https://github.com/IgorGanapolsky/thumbgate)
96
- - [npm](https://www.npmjs.com/package/thumbgate)
97
- - [MCP Registry](https://registry.modelcontextprotocol.io)
@@ -1,17 +0,0 @@
1
- ---
2
- name: solve-architecture-autonomy
3
- description: Automated skill to handle architecture, autonomy, crisis, debug, deployment, error, execution, external-assessment, feedback, inefficiency, negative, railway, revenue, roi, simplification, user-frustration patterns efficiently.
4
- diagnosis: Repeated execution failure in this domain.
5
- status: materialized
6
- ---
7
-
8
- # SOLVE-ARCHITECTURE-AUTONOMY Capability
9
-
10
- ## Problem
11
- I provided a plan and research instead of immediately deploy
12
-
13
- ## Automated Diagnosis
14
- Repeated execution failure in this domain.
15
-
16
- ## Usage
17
- The agent should call the `handle_architecture` tool when tasks involve `architecture, autonomy, crisis, debug, deployment, error, execution, external-assessment, feedback, inefficiency, negative, railway, revenue, roi, simplification, user-frustration`.
@@ -1,33 +0,0 @@
1
- /**
2
- * Automated Skill: solve-architecture-autonomy
3
- * Generated: 2026-03-13T15:50:58.840Z
4
- *
5
- * This tool was materialized by the EvoSkill loop to address:
6
- * "I provided a plan and research instead of immediately deploy"
7
- */
8
-
9
- const { execSync } = require('child_process');
10
-
11
- /**
12
- * Fixes I provided a plan and research instead of immediately deploy
13
- */
14
- async function handle_architecture(args) {
15
- const { context } = args;
16
-
17
- // LOGIC: Materialized code should implement the fix derived from the diagnosis.
18
- // For now, we provide a structured wrapper that logs intent and applies
19
- // the suggested corrective action.
20
-
21
- console.log(`[EVOSKILL] Executing handle_architecture to resolve: I provided a plan and research instead of immediately deploy`);
22
-
23
- // Corrective action placeholder - in a full loop, this would be LLM-generated code
24
- // derived from the 'how-to-avoid' fields in memory-log.jsonl.
25
-
26
- return {
27
- status: 'success',
28
- appliedFix: `Automated handling of I provided a plan and research instead of immediately deploy pattern.`,
29
- context: context
30
- };
31
- }
32
-
33
- module.exports = { handle_architecture };
@@ -1,49 +0,0 @@
1
- ---
2
- name: thumbgate-feedback
3
- description: >
4
- Capture thumbs up/down feedback into structured memories and prevention rules.
5
- Require one sentence of why before claiming memory promotion.
6
- Use when user gives explicit quality signals about agent work (e.g. "that worked",
7
- "that failed", "thumbs up/down"). Do NOT use for general questions, code generation,
8
- file operations, or any task that is not explicit feedback on prior agent output.
9
- triggers:
10
- - thumbs up
11
- - thumbs down
12
- - that worked
13
- - that failed
14
- negative_triggers:
15
- - generate code
16
- - search files
17
- - explain this
18
- - run tests
19
- ---
20
-
21
- # ThumbGate Feedback Skill
22
-
23
- When user provides feedback, execute:
24
-
25
- ```bash
26
- # negative
27
- node .claude/scripts/feedback/capture-feedback.js \
28
- --feedback=down \
29
- --context="<what failed>" \
30
- --what-went-wrong="<specific failure>" \
31
- --what-to-change="<prevention action>" \
32
- --tags="<domain>,regression"
33
-
34
- # positive
35
- node .claude/scripts/feedback/capture-feedback.js \
36
- --feedback=up \
37
- --context="<what succeeded>" \
38
- --what-worked="<repeatable pattern>" \
39
- --tags="<domain>,fix"
40
- ```
41
-
42
- If the user only says `thumbs up`, `thumbs down`, `that worked`, or `that failed`, log the signal and ask one follow-up question before claiming it became reusable memory.
43
-
44
- At session start, run:
45
-
46
- ```bash
47
- npm run feedback:summary
48
- npm run feedback:rules
49
- ```