thumbgate 1.14.1 → 1.16.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude-plugin/marketplace.json +6 -6
- package/.claude-plugin/plugin.json +3 -3
- package/.well-known/llms.txt +5 -5
- package/.well-known/mcp/server-card.json +1 -1
- package/README.md +60 -35
- package/adapters/chatgpt/openapi.yaml +118 -2
- package/adapters/claude/.mcp.json +2 -2
- package/adapters/mcp/server-stdio.js +217 -84
- package/adapters/opencode/opencode.json +1 -1
- package/bench/prompt-eval-suite.json +5 -1
- package/bin/cli.js +211 -8
- package/config/enforcement.json +59 -7
- package/config/evals/agent-safety-eval.json +338 -22
- package/config/gates/default.json +33 -0
- package/config/gates/routine.json +43 -0
- package/config/github-about.json +3 -3
- package/config/mcp-allowlists.json +4 -0
- package/config/merge-quality-checks.json +2 -1
- package/config/model-candidates.json +131 -0
- package/openapi/openapi.yaml +118 -2
- package/package.json +70 -51
- package/public/blog.html +7 -7
- package/public/codex-plugin.html +13 -7
- package/public/compare.html +29 -23
- package/public/dashboard.html +105 -12
- package/public/guide.html +28 -28
- package/public/index.html +233 -97
- package/public/learn.html +87 -20
- package/public/lessons.html +26 -2
- package/public/numbers.html +271 -0
- package/public/pro.html +89 -19
- package/scripts/agent-audit-trace.js +55 -0
- package/scripts/agent-memory-lifecycle.js +96 -0
- package/scripts/agent-readiness-plan.js +118 -0
- package/scripts/agentic-data-pipeline.js +21 -1
- package/scripts/agents-sdk-sandbox-plan.js +57 -0
- package/scripts/ai-org-governance.js +98 -0
- package/scripts/ai-search-distribution.js +43 -0
- package/scripts/artifact-agent-plan.js +81 -0
- package/scripts/billing.js +27 -8
- package/scripts/cli-feedback.js +2 -1
- package/scripts/cli-schema.js +60 -5
- package/scripts/code-mode-mcp-plan.js +71 -0
- package/scripts/commercial-offer.js +1 -1
- package/scripts/context-engine.js +1 -2
- package/scripts/context-manager.js +4 -1
- package/scripts/contextfs.js +214 -32
- package/scripts/dashboard-render-spec.js +1 -1
- package/scripts/dashboard.js +275 -9
- package/scripts/decision-journal.js +13 -3
- package/scripts/document-workflow-governance.js +62 -0
- package/scripts/enterprise-agent-rollout.js +34 -0
- package/scripts/experience-replay-governance.js +69 -0
- package/scripts/export-hf-dataset.js +1 -1
- package/scripts/feedback-loop.js +141 -9
- package/scripts/feedback-to-rules.js +17 -23
- package/scripts/gates-engine.js +4 -6
- package/scripts/growth-campaigns.js +49 -0
- package/scripts/harness-selector.js +145 -1
- package/scripts/hybrid-supervisor-agent.js +64 -0
- package/scripts/inference-cache-policy.js +72 -0
- package/scripts/inference-economics.js +53 -0
- package/scripts/internal-agent-bootstrap.js +12 -2
- package/scripts/knowledge-layer-plan.js +108 -0
- package/scripts/lesson-canonical.js +181 -0
- package/scripts/lesson-db.js +71 -10
- package/scripts/lesson-inference.js +183 -44
- package/scripts/lesson-search.js +4 -1
- package/scripts/lesson-synthesis.js +23 -2
- package/scripts/llm-client.js +157 -26
- package/scripts/mailer/resend-mailer.js +112 -1
- package/scripts/mcp-transport-strategy.js +66 -0
- package/scripts/memory-store-governance.js +60 -0
- package/scripts/meta-agent-loop.js +7 -13
- package/scripts/model-access-eligibility.js +38 -0
- package/scripts/model-migration-readiness.js +55 -0
- package/scripts/native-messaging-audit.js +514 -0
- package/scripts/operational-integrity.js +96 -3
- package/scripts/otel-declarative-config.js +56 -0
- package/scripts/perplexity-client.js +1 -1
- package/scripts/post-training-governance.js +34 -0
- package/scripts/pr-manager.js +47 -7
- package/scripts/private-core-boundary.js +72 -0
- package/scripts/production-agent-readiness.js +40 -0
- package/scripts/profile-router.js +16 -1
- package/scripts/prompt-eval.js +564 -32
- package/scripts/prompt-programs.js +93 -0
- package/scripts/provider-action-normalizer.js +585 -0
- package/scripts/rule-validator.js +285 -0
- package/scripts/scaling-law-claims.js +60 -0
- package/scripts/security-scanner.js +1 -1
- package/scripts/self-distill-agent.js +7 -32
- package/scripts/seo-gsd.js +400 -43
- package/scripts/skill-rag-router.js +53 -0
- package/scripts/spec-gate.js +1 -1
- package/scripts/student-consistent-training.js +73 -0
- package/scripts/synthetic-data-provenance.js +98 -0
- package/scripts/task-context-result.js +81 -0
- package/scripts/telemetry-analytics.js +149 -0
- package/scripts/thompson-sampling.js +2 -2
- package/scripts/token-savings.js +7 -6
- package/scripts/token-tco.js +46 -0
- package/scripts/tool-registry.js +75 -3
- package/scripts/verification-loop.js +10 -1
- package/scripts/verifier-scoring.js +71 -0
- package/scripts/workflow-sentinel.js +284 -28
- package/scripts/workspace-agent-routines.js +118 -0
- package/skills/thumbgate/SKILL.md +1 -1
- package/src/api/server.js +434 -120
- package/.claude-plugin/README.md +0 -170
- package/adapters/README.md +0 -12
- package/scripts/analytics-report.js +0 -328
- package/scripts/autonomous-workflow.js +0 -377
- package/scripts/billing-setup.js +0 -109
- package/scripts/creator-campaigns.js +0 -239
- package/scripts/cross-encoder-reranker.js +0 -235
- package/scripts/daemon-manager.js +0 -108
- package/scripts/decision-trace.js +0 -354
- package/scripts/delegation-runtime.js +0 -896
- package/scripts/dispatch-brief.js +0 -159
- package/scripts/distribution-surfaces.js +0 -110
- package/scripts/feedback-history-distiller.js +0 -382
- package/scripts/funnel-analytics.js +0 -35
- package/scripts/history-distiller.js +0 -200
- package/scripts/hosted-job-launcher.js +0 -256
- package/scripts/intent-router.js +0 -392
- package/scripts/lesson-reranker.js +0 -263
- package/scripts/lesson-retrieval.js +0 -148
- package/scripts/managed-lesson-agent.js +0 -183
- package/scripts/operational-dashboard.js +0 -103
- package/scripts/operational-summary.js +0 -129
- package/scripts/operator-artifacts.js +0 -608
- package/scripts/optimize-context.js +0 -17
- package/scripts/org-dashboard.js +0 -206
- package/scripts/partner-orchestration.js +0 -146
- package/scripts/predictive-insights.js +0 -356
- package/scripts/pulse.js +0 -80
- package/scripts/reflector-agent.js +0 -221
- package/scripts/sales-pipeline.js +0 -681
- package/scripts/session-episode-store.js +0 -329
- package/scripts/session-health-sensor.js +0 -242
- package/scripts/session-report.js +0 -120
- package/scripts/swarm-coordinator.js +0 -81
- package/scripts/tool-kpi-tracker.js +0 -12
- package/scripts/webhook-delivery.js +0 -62
- package/scripts/workflow-sprint-intake.js +0 -475
- package/skills/agent-memory/SKILL.md +0 -97
- package/skills/solve-architecture-autonomy/SKILL.md +0 -17
- package/skills/solve-architecture-autonomy/tool.js +0 -33
- package/skills/thumbgate-feedback/SKILL.md +0 -49
|
@@ -1,97 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: Agent Memory
|
|
3
|
-
description: Recall past mistakes and capture feedback so your agent stops repeating errors. Works locally via MCP server — no API key needed.
|
|
4
|
-
---
|
|
5
|
-
|
|
6
|
-
# Agent Memory
|
|
7
|
-
|
|
8
|
-
Give your agent persistent memory across sessions. Before starting any task, recall what went wrong last time. After completing work, capture whether it succeeded or failed. Prevention rules are auto-generated from repeated mistakes.
|
|
9
|
-
|
|
10
|
-
## Setup
|
|
11
|
-
|
|
12
|
-
Add the MCP server (one-time):
|
|
13
|
-
|
|
14
|
-
```bash
|
|
15
|
-
claude mcp add thumbgate -- npx -y thumbgate serve
|
|
16
|
-
```
|
|
17
|
-
|
|
18
|
-
No API key needed. All data stays local.
|
|
19
|
-
|
|
20
|
-
## When to Use
|
|
21
|
-
|
|
22
|
-
- Starting a new task or session — recall past context first
|
|
23
|
-
- After completing work that succeeded or failed — capture feedback
|
|
24
|
-
- When the agent keeps making the same mistake — check prevention rules
|
|
25
|
-
|
|
26
|
-
## Workflow
|
|
27
|
-
|
|
28
|
-
### Step 1: Recall past context (do this FIRST on every task)
|
|
29
|
-
|
|
30
|
-
Call the `recall` MCP tool with a description of your current task. The tool returns:
|
|
31
|
-
|
|
32
|
-
- Past feedback relevant to this task (vector similarity search)
|
|
33
|
-
- Active prevention rules (auto-generated from repeated failures)
|
|
34
|
-
- Recent feedback summary with approval rate
|
|
35
|
-
|
|
36
|
-
Read the prevention rules carefully. These are patterns that failed before — follow them.
|
|
37
|
-
|
|
38
|
-
### Step 2: Do your work
|
|
39
|
-
|
|
40
|
-
Complete the task as normal. Keep track of what you did and whether it worked.
|
|
41
|
-
|
|
42
|
-
### Step 3: Capture feedback
|
|
43
|
-
|
|
44
|
-
Call the `capture_feedback` MCP tool:
|
|
45
|
-
|
|
46
|
-
**If succeeded:**
|
|
47
|
-
- signal: `up`
|
|
48
|
-
- context: What worked and why
|
|
49
|
-
- tags: Category labels
|
|
50
|
-
|
|
51
|
-
**If failed:**
|
|
52
|
-
- signal: `down`
|
|
53
|
-
- context: What you were trying to do
|
|
54
|
-
- whatWentWrong: Specific failure description
|
|
55
|
-
- whatToChange: How to avoid this next time
|
|
56
|
-
- tags: Category labels
|
|
57
|
-
|
|
58
|
-
Vague feedback like "it failed" will be rejected. Be specific.
|
|
59
|
-
|
|
60
|
-
### Step 4: Check improvement (optional)
|
|
61
|
-
|
|
62
|
-
Call the `feedback_stats` MCP tool to see approval rate, top failure domains, and whether the agent is trending better or worse.
|
|
63
|
-
|
|
64
|
-
## Available MCP Tools
|
|
65
|
-
|
|
66
|
-
| Tool | What it does |
|
|
67
|
-
|------|-------------|
|
|
68
|
-
| `recall` | Search past feedback and prevention rules for current task |
|
|
69
|
-
| `capture_feedback` | Record what worked or failed with structured context |
|
|
70
|
-
| `prevention_rules` | View auto-generated rules from repeated mistakes |
|
|
71
|
-
| `feedback_stats` | Approval rate, trend analysis, top failure domains |
|
|
72
|
-
| `feedback_summary` | Human-readable summary of recent signals |
|
|
73
|
-
|
|
74
|
-
## MCP Profiles
|
|
75
|
-
|
|
76
|
-
| Profile | Tools | Use case |
|
|
77
|
-
|---------|-------|----------|
|
|
78
|
-
| `essential` | 5 core tools | Default — start here |
|
|
79
|
-
| `commerce` | 6 tools + commerce_recall | Agentic commerce agents |
|
|
80
|
-
| `default` | 12 tools | Full pipeline including DPO export |
|
|
81
|
-
|
|
82
|
-
Set profile: `THUMBGATE_MCP_PROFILE=essential npx thumbgate serve`
|
|
83
|
-
|
|
84
|
-
## How Prevention Rules Work
|
|
85
|
-
|
|
86
|
-
1. Agent makes mistake A → you capture `down` feedback
|
|
87
|
-
2. Agent makes mistake A again → you capture `down` feedback again
|
|
88
|
-
3. System detects pattern → auto-generates prevention rule: "NEVER do A"
|
|
89
|
-
4. Next session → `recall` returns the rule → agent follows it
|
|
90
|
-
|
|
91
|
-
This is the core value. The agent doesn't learn — but it reads the rules and follows them.
|
|
92
|
-
|
|
93
|
-
## Links
|
|
94
|
-
|
|
95
|
-
- [GitHub](https://github.com/IgorGanapolsky/thumbgate)
|
|
96
|
-
- [npm](https://www.npmjs.com/package/thumbgate)
|
|
97
|
-
- [MCP Registry](https://registry.modelcontextprotocol.io)
|
|
@@ -1,17 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: solve-architecture-autonomy
|
|
3
|
-
description: Automated skill to handle architecture, autonomy, crisis, debug, deployment, error, execution, external-assessment, feedback, inefficiency, negative, railway, revenue, roi, simplification, user-frustration patterns efficiently.
|
|
4
|
-
diagnosis: Repeated execution failure in this domain.
|
|
5
|
-
status: materialized
|
|
6
|
-
---
|
|
7
|
-
|
|
8
|
-
# SOLVE-ARCHITECTURE-AUTONOMY Capability
|
|
9
|
-
|
|
10
|
-
## Problem
|
|
11
|
-
I provided a plan and research instead of immediately deploy
|
|
12
|
-
|
|
13
|
-
## Automated Diagnosis
|
|
14
|
-
Repeated execution failure in this domain.
|
|
15
|
-
|
|
16
|
-
## Usage
|
|
17
|
-
The agent should call the `handle_architecture` tool when tasks involve `architecture, autonomy, crisis, debug, deployment, error, execution, external-assessment, feedback, inefficiency, negative, railway, revenue, roi, simplification, user-frustration`.
|
|
@@ -1,33 +0,0 @@
|
|
|
1
|
-
/**
|
|
2
|
-
* Automated Skill: solve-architecture-autonomy
|
|
3
|
-
* Generated: 2026-03-13T15:50:58.840Z
|
|
4
|
-
*
|
|
5
|
-
* This tool was materialized by the EvoSkill loop to address:
|
|
6
|
-
* "I provided a plan and research instead of immediately deploy"
|
|
7
|
-
*/
|
|
8
|
-
|
|
9
|
-
const { execSync } = require('child_process');
|
|
10
|
-
|
|
11
|
-
/**
|
|
12
|
-
* Fixes I provided a plan and research instead of immediately deploy
|
|
13
|
-
*/
|
|
14
|
-
async function handle_architecture(args) {
|
|
15
|
-
const { context } = args;
|
|
16
|
-
|
|
17
|
-
// LOGIC: Materialized code should implement the fix derived from the diagnosis.
|
|
18
|
-
// For now, we provide a structured wrapper that logs intent and applies
|
|
19
|
-
// the suggested corrective action.
|
|
20
|
-
|
|
21
|
-
console.log(`[EVOSKILL] Executing handle_architecture to resolve: I provided a plan and research instead of immediately deploy`);
|
|
22
|
-
|
|
23
|
-
// Corrective action placeholder - in a full loop, this would be LLM-generated code
|
|
24
|
-
// derived from the 'how-to-avoid' fields in memory-log.jsonl.
|
|
25
|
-
|
|
26
|
-
return {
|
|
27
|
-
status: 'success',
|
|
28
|
-
appliedFix: `Automated handling of I provided a plan and research instead of immediately deploy pattern.`,
|
|
29
|
-
context: context
|
|
30
|
-
};
|
|
31
|
-
}
|
|
32
|
-
|
|
33
|
-
module.exports = { handle_architecture };
|
|
@@ -1,49 +0,0 @@
|
|
|
1
|
-
---
|
|
2
|
-
name: thumbgate-feedback
|
|
3
|
-
description: >
|
|
4
|
-
Capture thumbs up/down feedback into structured memories and prevention rules.
|
|
5
|
-
Require one sentence of why before claiming memory promotion.
|
|
6
|
-
Use when user gives explicit quality signals about agent work (e.g. "that worked",
|
|
7
|
-
"that failed", "thumbs up/down"). Do NOT use for general questions, code generation,
|
|
8
|
-
file operations, or any task that is not explicit feedback on prior agent output.
|
|
9
|
-
triggers:
|
|
10
|
-
- thumbs up
|
|
11
|
-
- thumbs down
|
|
12
|
-
- that worked
|
|
13
|
-
- that failed
|
|
14
|
-
negative_triggers:
|
|
15
|
-
- generate code
|
|
16
|
-
- search files
|
|
17
|
-
- explain this
|
|
18
|
-
- run tests
|
|
19
|
-
---
|
|
20
|
-
|
|
21
|
-
# ThumbGate Feedback Skill
|
|
22
|
-
|
|
23
|
-
When user provides feedback, execute:
|
|
24
|
-
|
|
25
|
-
```bash
|
|
26
|
-
# negative
|
|
27
|
-
node .claude/scripts/feedback/capture-feedback.js \
|
|
28
|
-
--feedback=down \
|
|
29
|
-
--context="<what failed>" \
|
|
30
|
-
--what-went-wrong="<specific failure>" \
|
|
31
|
-
--what-to-change="<prevention action>" \
|
|
32
|
-
--tags="<domain>,regression"
|
|
33
|
-
|
|
34
|
-
# positive
|
|
35
|
-
node .claude/scripts/feedback/capture-feedback.js \
|
|
36
|
-
--feedback=up \
|
|
37
|
-
--context="<what succeeded>" \
|
|
38
|
-
--what-worked="<repeatable pattern>" \
|
|
39
|
-
--tags="<domain>,fix"
|
|
40
|
-
```
|
|
41
|
-
|
|
42
|
-
If the user only says `thumbs up`, `thumbs down`, `that worked`, or `that failed`, log the signal and ask one follow-up question before claiming it became reusable memory.
|
|
43
|
-
|
|
44
|
-
At session start, run:
|
|
45
|
-
|
|
46
|
-
```bash
|
|
47
|
-
npm run feedback:summary
|
|
48
|
-
npm run feedback:rules
|
|
49
|
-
```
|