npm - instar - Versions diffs - 0.6.10 → 0.6.12 - Mend

instar 0.6.10 → 0.6.12

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

package/README.md +52 -5
package/dist/commands/init.js +429 -4
package/dist/scaffold/templates.js +36 -0
package/package.json +1 -1
package/skills/credential-leak-detector/SKILL.md +377 -0

package/README.md CHANGED Viewed

@@ -95,6 +95,7 @@ instar feedback --type bug --title "Session timeout" --description "Details..."
 - **[Identity System](#identity-that-survives-context-death)** -- AGENT.md + USER.md + MEMORY.md with hooks that enforce continuity across compaction.
 - **[Telegram Integration](#telegram-integration)** -- Two-way messaging. Each job gets its own topic. Your group becomes a living dashboard.
 - **[Relationship Tracking](#relationships-as-fundamental-infrastructure)** -- Cross-platform identity resolution, significance scoring, context injection.
+- **[Evolution System](#evolution-system)** -- Four subsystems for structured growth: proposal queue, learning registry, gap tracking, and commitment follow-through.
 - **[Self-Evolution](#self-evolution)** -- The agent modifies its own jobs, hooks, skills, and infrastructure. It builds what it needs.
 - **[Behavioral Hooks](#behavioral-hooks)** -- Structural guardrails: identity injection, dangerous command guards, grounding before messaging.
 - **[Default Coherence Jobs](#default-coherence-jobs)** -- Health checks, reflection, relationship maintenance. A circadian rhythm out of the box.
@@ -285,6 +286,20 @@ The server runs 24/7 in the background, surviving terminal disconnects and auto-
 | GET | `/telegram/topics` | List topic-session mappings |
 | POST | `/telegram/reply/:topicId` | Send message to a topic |
 | GET | `/telegram/topics/:topicId/messages` | Topic message history (`?limit=20`) |
+| GET | `/evolution` | Full evolution dashboard |
+| GET | `/evolution/proposals` | List proposals (`?status=`, `?type=`) |
+| POST | `/evolution/proposals` | Create a proposal |
+| PATCH | `/evolution/proposals/:id` | Update proposal status |
+| GET | `/evolution/learnings` | List learnings (`?applied=`, `?category=`) |
+| POST | `/evolution/learnings` | Record a learning |
+| PATCH | `/evolution/learnings/:id/apply` | Mark learning applied |
+| GET | `/evolution/gaps` | List capability gaps |
+| POST | `/evolution/gaps` | Report a gap |
+| PATCH | `/evolution/gaps/:id/address` | Mark gap addressed |
+| GET | `/evolution/actions` | List action items |
+| POST | `/evolution/actions` | Create an action item |
+| GET | `/evolution/actions/overdue` | List overdue actions |
+| PATCH | `/evolution/actions/:id` | Update action status |
 ### Identity That Survives Context Death
@@ -312,6 +327,28 @@ Every person the agent interacts with gets a relationship record that grows over
 - **Context injection** -- The agent *knows* who it's talking to before the conversation starts
 - **Stale detection** -- Surfaces relationships that haven't been contacted in a while
+### Evolution System
+Self-evolution isn't just "the agent can edit files." It's a structured system with four subsystems that turn running into growing:
+**Evolution Queue** -- Staged self-improvement proposals. The agent identifies something that could be better, proposes a change, and a review job evaluates and implements it. Not impulsive self-modification -- deliberate, staged improvement with a paper trail.
+**Learning Registry** -- Structured, searchable insights. When the agent discovers a pattern, solves a tricky problem, or learns a user preference, it records it in a format that future sessions can query. An insight-harvest job synthesizes patterns across learnings into evolution proposals.
+**Capability Gap Tracker** -- The agent tracks what it's missing. When it can't fulfill a request, encounters a limitation, or notices a workflow gap, it records the gap with severity and a proposed solution. This is the difference between "I can't do that" and "I can't do that *yet*, and here's what I need."
+**Action Queue** -- Commitment tracking with stale detection. When the agent promises to follow up, creates a TODO, or identifies work that needs doing, it gets tracked. A commitment-check job surfaces overdue items so nothing falls through the cracks.
+Built-in skills (`/evolve`, `/learn`, `/gaps`, `/commit-action`) make recording effortless. A post-action reflection hook nudges the agent to pause after significant actions (commits, deploys) and consider what it learned. Three default jobs drive the cycle:
+| Job | Schedule | Purpose |
+|-----|----------|---------|
+| **evolution-review** | Every 6h | Review proposals, implement approved ones |
+| **insight-harvest** | Every 8h | Synthesize learnings into proposals |
+| **commitment-check** | Every 4h | Surface overdue action items |
+All state is file-based JSON in `.instar/state/evolution/`. No database, no external dependencies.
 ### Self-Evolution
 The agent can edit its own job definitions, write new scripts, update its identity, create hooks, and modify its configuration. When asked to do something it can't do yet, the expected behavior is: **"Let me build that capability."**
@@ -331,8 +368,11 @@ Automatic hooks fire via Claude Code's hook system:
 |------|------|-------------|
 | **Dangerous command guard** | PreToolUse (blocking) | Blocks destructive operations structurally |
 | **Grounding before messaging** | PreToolUse (advisory) | Forces identity re-read before external communication |
-| **Session start** | PostToolUse | Injects identity context at session start |
-| **Compaction recovery** | Notification (compact) | Restores identity when context compresses |
+| **Deferral detector** | PreToolUse (advisory) | Catches the agent deferring work it could do itself |
+| **External communication guard** | PreToolUse (advisory) | Identity grounding before posting to external platforms |
+| **Post-action reflection** | PreToolUse (advisory) | Nudges learning capture after commits, deploys, and significant actions |
+| **Session start** | SessionStart | Injects identity context at session start |
+| **Compaction recovery** | SessionStart (compact) | Restores identity when context compresses |
 ### Default Coherence Jobs
@@ -343,10 +383,15 @@ Ships out of the box:
 | **health-check** | Every 5 min | Haiku | Verify infrastructure health |
 | **reflection-trigger** | Every 4h | Sonnet | Reflect on recent work |
 | **relationship-maintenance** | Daily | Sonnet | Review stale relationships |
-| **update-check** | Daily | Haiku | Detect new Instar versions |
+| **update-check** | Every 30 min | Haiku | Detect new Instar versions |
 | **feedback-retry** | Every 6h | Haiku | Retry un-forwarded feedback items |
+| **dispatch-check** | Every 30 min | Haiku | Poll for intelligence dispatches |
+| **self-diagnosis** | Every 2h | Sonnet | Proactive infrastructure scanning |
+| **evolution-review** | Every 6h | Sonnet | Review and implement evolution proposals |
+| **insight-harvest** | Every 8h | Sonnet | Synthesize learnings into proposals |
+| **commitment-check** | Every 4h | Haiku | Surface overdue action items |
-These give the agent a **circadian rhythm** -- regular self-maintenance without user intervention.
+These give the agent a **circadian rhythm** -- regular self-maintenance, evolution, and growth without user intervention.
 ### The Feedback Loop: A Rising Tide Lifts All Ships
@@ -375,13 +420,15 @@ One agent's growing pain becomes every agent's growth.
   AGENT.md                # Agent identity (who am I?)
   USER.md                 # User context (who am I working with?)
   MEMORY.md               # Persistent learnings across sessions
-  hooks/                  # Behavioral scripts (guards, identity injection)
+  hooks/                  # Behavioral scripts (guards, identity injection, reflection)
   state/                  # Runtime state (sessions, jobs)
+    evolution/            # Evolution queue, learnings, gaps, actions (JSON)
   relationships/          # Per-person relationship files
   logs/                   # Server logs
 .claude/                  # Claude Code configuration
   settings.json           # Hook registrations
   scripts/                # Health watchdog, Telegram relay, smart-fetch
+  skills/                 # Built-in + agent-created skills (evolve, learn, gaps, commit-action)
 ```
 Everything is file-based. No database. JSON state files the agent can read and modify. tmux for session management -- battle-tested, survives disconnects, fully scriptable.

package/dist/commands/init.js CHANGED Viewed

@@ -196,10 +196,11 @@ async function initFreshProject(projectName, options) {
     console.log(`  ${pc.green('✓')} Created .claude/scripts/health-watchdog.sh`);
     installSmartFetch(projectDir);
     console.log(`  ${pc.green('✓')} Created .claude/scripts/smart-fetch.py (agentic web conventions)`);
-    // Create .claude/skills/ directory for agent-created skills
+    // Create .claude/skills/ directory and install built-in skills
     const skillsDir = path.join(projectDir, '.claude', 'skills');
     fs.mkdirSync(skillsDir, { recursive: true });
-    console.log(`  ${pc.green('✓')} Created .claude/skills/ (agent skill directory)`);
+    installBuiltinSkills(skillsDir, port);
+    console.log(`  ${pc.green('✓')} Created .claude/skills/ (with built-in evolution skills)`);
     // Write CLAUDE.md (standalone version for fresh projects)
     const claudeMd = generateClaudeMd(projectName, identity.name, port, false);
     fs.writeFileSync(path.join(projectDir, 'CLAUDE.md'), claudeMd);
@@ -401,10 +402,11 @@ async function initExistingProject(options) {
     // Install smart-fetch for agentic web conventions
     installSmartFetch(projectDir);
     console.log(pc.green('  Created:') + ' .claude/scripts/smart-fetch.py (agentic web conventions)');
-    // Create .claude/skills/ directory for agent-created skills
+    // Create .claude/skills/ directory and install built-in skills
     const skillsDir = path.join(projectDir, '.claude', 'skills');
     fs.mkdirSync(skillsDir, { recursive: true });
-    console.log(pc.green('  Created:') + ' .claude/skills/ (agent skill directory)');
+    installBuiltinSkills(skillsDir, port);
+    console.log(pc.green('  Created:') + ' .claude/skills/ (with built-in evolution skills)');
     // Append to .gitignore
     const gitignorePath = path.join(projectDir, '.gitignore');
     const instarIgnores = '\n# Instar runtime state (contains auth token, session data, relationships)\n.instar/state/\n.instar/logs/\n.instar/relationships/\n.instar/config.json\n';
@@ -587,6 +589,38 @@ Instar has a built-in feedback loop. When something isn't working, your user can
 - Report: \`curl -X POST http://localhost:${port}/feedback -d '{"issue":"description","context":"relevant logs"}'\`
 - Check updates: \`npm outdated -g instar\`
+### Evolution System
+You have a built-in evolution system with four subsystems. This is not a metaphor — it's infrastructure that tracks your growth.
+**Evolution Queue** — Staged self-improvement proposals.
+- View: \`curl http://localhost:${port}/evolution/proposals\`
+- Propose: \`/evolve\` skill or \`POST /evolution/proposals\`
+- The \`evolution-review\` job evaluates and implements proposals every 6 hours.
+**Learning Registry** — Structured, searchable insights.
+- View: \`curl http://localhost:${port}/evolution/learnings\`
+- Record: \`/learn\` skill or \`POST /evolution/learnings\`
+- The \`insight-harvest\` job synthesizes patterns into proposals every 8 hours.
+**Capability Gaps** — Track what you're missing.
+- View: \`curl http://localhost:${port}/evolution/gaps\`
+- Report: \`/gaps\` skill or \`POST /evolution/gaps\`
+**Action Queue** — Commitments with follow-through tracking.
+- View: \`curl http://localhost:${port}/evolution/actions\`
+- Create: \`/commit-action\` skill or \`POST /evolution/actions\`
+- The \`commitment-check\` job surfaces overdue items every 4 hours.
+**Dashboard** — Full evolution health:
+\`\`\`bash
+curl http://localhost:${port}/evolution
+\`\`\`
+**Skills:** \`/evolve\`, \`/learn\`, \`/gaps\`, \`/commit-action\`
+**The principle:** Evolution is not separate from work. Every task is an opportunity to notice what could be better. The post-action reflection hook reminds you to pause after significant actions and consider what you learned.
 ### Self-Evolution
 **Record what you learn.** When you discover a new pattern, solution, or capability — write it to \`.instar/MEMORY.md\`. The next session should benefit from what this session learned.
@@ -597,6 +631,230 @@ Instar has a built-in feedback loop. When something isn't working, your user can
 `;
     return section;
 }
+/**
+ * Install built-in skills for evolution system.
+ * Only writes skill files that don't already exist (preserves customizations).
+ */
+function installBuiltinSkills(skillsDir, port) {
+    const skills = {
+        'evolve': {
+            name: 'evolve',
+            description: 'Propose an evolution improvement to your own infrastructure, behavior, or capabilities.',
+            content: `---
+name: evolve
+description: Propose an evolution improvement to your own infrastructure, behavior, or capabilities.
+metadata:
+  user_invocable: "true"
+---
+# /evolve
+Propose an evolution improvement. Use this when you identify something about yourself that could be better — a new capability, a workflow improvement, a behavioral pattern worth encoding, or infrastructure worth building.
+## Steps
+1. **Identify the improvement** — What did you notice? What triggered this? Be specific.
+2. **Classify it**:
+   - \`capability\` — New skill or ability
+   - \`infrastructure\` — System improvement (hooks, jobs, scripts)
+   - \`voice\` — Communication or tone improvement
+   - \`workflow\` — Process improvement
+   - \`philosophy\` — Principle or value update
+   - \`integration\` — New external connection
+   - \`performance\` — Speed or efficiency improvement
+3. **Assess impact and effort** (high/medium/low for each)
+4. **Submit the proposal**:
+\`\`\`bash
+curl -s -X POST http://localhost:${port}/evolution/proposals \\
+  -H 'Content-Type: application/json' \\
+  -d '{"title":"TITLE","source":"WHERE_YOU_NOTICED","description":"WHAT_AND_WHY","type":"TYPE","impact":"medium","effort":"medium"}'
+\`\`\`
+5. **Check the dashboard** for context:
+\`\`\`bash
+curl -s http://localhost:${port}/evolution
+\`\`\`
+## When to Use
+- After completing a task in a way that felt unnecessarily difficult
+- When you notice a pattern you keep repeating manually
+- When a user request reveals a capability you should have but don't
+- When you find yourself working around a limitation
+- After reading about a technique or pattern that would improve your work
+## Philosophy
+Evolution is not a separate activity from work — it IS the work. Every task is an opportunity to notice what could be better. The best proposals come from real experience, not abstract planning.
+`,
+        },
+        'learn': {
+            name: 'learn',
+            description: 'Record a learning or insight in the structured learning registry.',
+            content: `---
+name: learn
+description: Record a learning or insight in the structured learning registry.
+metadata:
+  user_invocable: "true"
+---
+# /learn
+Record a learning or insight. Use this when you discover something worth remembering — a pattern, a solution, a mistake, or an observation that future sessions should know about.
+## Steps
+1. **Identify the learning** — What did you discover? What's the actionable insight?
+2. **Categorize it** (e.g., debugging, architecture, user-preference, integration, communication, workflow)
+3. **Tag it** for searchability
+4. **Submit**:
+\`\`\`bash
+curl -s -X POST http://localhost:${port}/evolution/learnings \\
+  -H 'Content-Type: application/json' \\
+  -d '{"title":"TITLE","category":"CATEGORY","description":"FULL_INSIGHT","source":{"discoveredAt":"DATE","platform":"WHERE","session":"SESSION_ID"},"tags":["tag1","tag2"]}'
+\`\`\`
+5. **If it suggests an improvement**, note the evolution relevance:
+   - Add \`"evolutionRelevance": "This could become a skill/hook/job because..."\`
+   - The insight-harvest job will pick this up and potentially create a proposal
+## When to Use
+- After solving a tricky problem (capture the solution pattern)
+- After a user interaction reveals a preference you didn't know
+- After discovering a tool or technique that works well
+- After making a mistake (capture what went wrong and the fix)
+- After noticing a pattern across multiple tasks
+## Difference from MEMORY.md
+MEMORY.md is your personal scratchpad — unstructured, read by you.
+The learning registry is structured, searchable, and connected to the evolution system.
+Use MEMORY.md for quick notes. Use /learn for insights that should influence future behavior.
+`,
+        },
+        'gaps': {
+            name: 'gaps',
+            description: 'Report a capability gap — something you need but don\'t have.',
+            content: `---
+name: gaps
+description: Report a capability gap — something you need but don't have.
+metadata:
+  user_invocable: "true"
+---
+# /gaps
+Report a capability gap. Use this when you discover something you should be able to do but can't — a missing skill, knowledge area, integration, or workflow that would make you more effective.
+## Steps
+1. **Describe the gap** — What were you trying to do? What's missing?
+2. **Classify it**:
+   - \`skill\` — Missing ability (e.g., can't parse a specific format)
+   - \`knowledge\` — Missing information (e.g., don't know how a system works)
+   - \`integration\` — Missing connection (e.g., can't talk to a service)
+   - \`workflow\` — Missing process (e.g., no standard way to do X)
+   - \`communication\` — Missing voice capability (e.g., can't express X well)
+   - \`monitoring\` — Missing observability (e.g., can't detect when X happens)
+3. **Assess severity** (critical/high/medium/low)
+4. **Describe current state** — What do you do instead? What's the workaround?
+5. **Propose a solution** if you have one
+6. **Submit**:
+\`\`\`bash
+curl -s -X POST http://localhost:${port}/evolution/gaps \\
+  -H 'Content-Type: application/json' \\
+  -d '{"title":"TITLE","category":"CATEGORY","severity":"medium","description":"WHAT_IS_MISSING","context":"WHEN_DID_YOU_NOTICE","currentState":"CURRENT_WORKAROUND","proposedSolution":"HOW_TO_FIX"}'
+\`\`\`
+## When to Use
+- When you can't fulfill a user request and have to say "I can't do that yet"
+- When you notice yourself repeatedly working around a limitation
+- When an integration you need doesn't exist
+- When you lack knowledge about a system you interact with
+- When monitoring would catch an issue before it becomes a problem
+## View Current Gaps
+\`\`\`bash
+curl -s http://localhost:${port}/evolution/gaps
+\`\`\`
+`,
+        },
+        'commit-action': {
+            name: 'commit-action',
+            description: 'Create a tracked action item — a commitment with follow-through tracking.',
+            content: `---
+name: commit-action
+description: Create a tracked action item — a commitment with follow-through tracking.
+metadata:
+  user_invocable: "true"
+---
+# /commit-action
+Create a tracked action item. Use this when you promise to do something, identify a task that needs follow-through, or want to ensure something doesn't fall through the cracks.
+## Steps
+1. **Define the action** — What needs to be done? Be specific and actionable.
+2. **Set priority** (critical/high/medium/low)
+3. **Set a due date** if applicable (ISO 8601 format)
+4. **Identify who/what you're committing to** (optional)
+5. **Submit**:
+\`\`\`bash
+curl -s -X POST http://localhost:${port}/evolution/actions \\
+  -H 'Content-Type: application/json' \\
+  -d '{"title":"TITLE","description":"WHAT_TO_DO","priority":"medium","dueBy":"2026-03-01T00:00:00Z","commitTo":"WHO_OR_WHAT","tags":["tag1"]}'
+\`\`\`
+6. **When complete**, mark it done:
+\`\`\`bash
+curl -s -X PATCH http://localhost:${port}/evolution/actions/ACT-XXX \\
+  -H 'Content-Type: application/json' \\
+  -d '{"status":"completed","resolution":"What was done"}'
+\`\`\`
+## When to Use
+- When you promise a user you'll follow up on something
+- When you identify a task during work that shouldn't be forgotten
+- When a learning or gap requires a specific action
+- When you need to check back on something later
+- When committing to implement an evolution proposal
+## View Actions
+\`\`\`bash
+# All pending actions
+curl -s http://localhost:${port}/evolution/actions?status=pending
+# Overdue actions
+curl -s http://localhost:${port}/evolution/actions/overdue
+\`\`\`
+## The Commitment Check
+The commitment-check job runs every 4 hours and surfaces overdue items. If you create an action and forget it, the system won't.
+`,
+        },
+    };
+    for (const [slug, skill] of Object.entries(skills)) {
+        const skillDir = path.join(skillsDir, slug);
+        const skillFile = path.join(skillDir, 'SKILL.md');
+        if (!fs.existsSync(skillFile)) {
+            fs.mkdirSync(skillDir, { recursive: true });
+            fs.writeFileSync(skillFile, skill.content);
+        }
+    }
+}
 function getDefaultJobs(port) {
     return [
         {
@@ -727,6 +985,92 @@ If everything looks healthy, exit silently. Only report issues.`,
             },
             tags: ['coherence', 'default'],
         },
+        {
+            slug: 'evolution-review',
+            name: 'Evolution Review',
+            description: 'Review pending evolution proposals, evaluate their merit, and implement approved ones.',
+            schedule: '0 */6 * * *',
+            priority: 'medium',
+            expectedDurationMinutes: 5,
+            model: 'sonnet',
+            enabled: true,
+            gate: `curl -sf http://localhost:${port}/evolution/proposals?status=proposed 2>/dev/null | python3 -c "import sys,json; d=json.load(sys.stdin); exit(0 if len(d.get('proposals',[])) > 0 else 1)"`,
+            execute: {
+                type: 'prompt',
+                value: `Review pending evolution proposals: curl -s http://localhost:${port}/evolution/proposals?status=proposed
+For each proposal:
+1. Read the title, description, type, and source
+2. Evaluate: Is this a genuine improvement? Is the effort worth the impact? Does it align with our goals?
+3. If approved, update status: curl -s -X PATCH http://localhost:${port}/evolution/proposals/EVO-XXX -H 'Content-Type: application/json' -d '{"status":"approved"}'
+4. Then implement it: create the skill/hook/job/config change described in the proposal
+5. After implementation, mark complete: curl -s -X PATCH http://localhost:${port}/evolution/proposals/EVO-XXX -H 'Content-Type: application/json' -d '{"status":"implemented","resolution":"What was done"}'
+If a proposal should be deferred or rejected, update with reason.
+Also check the dashboard: curl -s http://localhost:${port}/evolution — report any highlights to the user if they seem important.
+If no proposals need attention, exit silently.`,
+            },
+            tags: ['coherence', 'default', 'evolution'],
+        },
+        {
+            slug: 'insight-harvest',
+            name: 'Insight Harvest',
+            description: 'Synthesize learnings from the learning registry, detect patterns, and generate evolution proposals from high-confidence insights.',
+            schedule: '0 */8 * * *',
+            priority: 'low',
+            expectedDurationMinutes: 3,
+            model: 'sonnet',
+            enabled: true,
+            gate: `curl -sf http://localhost:${port}/evolution/learnings?applied=false 2>/dev/null | python3 -c "import sys,json; d=json.load(sys.stdin); exit(0 if len(d.get('learnings',[])) > 0 else 1)"`,
+            execute: {
+                type: 'prompt',
+                value: `Harvest and synthesize learnings: curl -s http://localhost:${port}/evolution/learnings?applied=false
+Review unapplied learnings and look for:
+1. **Patterns**: Multiple learnings pointing to the same conclusion
+2. **Actionable insights**: Learnings that suggest a specific change
+3. **Cross-domain connections**: Insights from one area that apply to another
+For each actionable pattern found, create an evolution proposal:
+curl -s -X POST http://localhost:${port}/evolution/proposals -H 'Content-Type: application/json' -d '{"title":"...","source":"insight-harvest from LRN-XXX","description":"...","type":"...","impact":"...","effort":"..."}'
+Then mark the relevant learnings as applied:
+curl -s -X PATCH http://localhost:${port}/evolution/learnings/LRN-XXX/apply -H 'Content-Type: application/json' -d '{"appliedTo":"EVO-XXX"}'
+Also update MEMORY.md with any patterns worth preserving long-term.
+If no actionable patterns found, exit silently.`,
+            },
+            tags: ['coherence', 'default', 'evolution'],
+        },
+        {
+            slug: 'commitment-check',
+            name: 'Commitment Check',
+            description: 'Track action items and commitments. Surface overdue items and stale commitments.',
+            schedule: '0 */4 * * *',
+            priority: 'low',
+            expectedDurationMinutes: 2,
+            model: 'haiku',
+            enabled: true,
+            gate: `curl -sf http://localhost:${port}/evolution/actions/overdue 2>/dev/null | python3 -c "import sys,json; d=json.load(sys.stdin); exit(0 if len(d.get('overdue',[])) > 0 else 1)"`,
+            execute: {
+                type: 'prompt',
+                value: `Check for overdue commitments: curl -s http://localhost:${port}/evolution/actions/overdue
+For each overdue action:
+1. Assess: Can this be completed now? Is it still relevant?
+2. If actionable, attempt to complete it or advance it
+3. If no longer relevant, cancel it: curl -s -X PATCH http://localhost:${port}/evolution/actions/ACT-XXX -H 'Content-Type: application/json' -d '{"status":"cancelled","resolution":"No longer relevant because..."}'
+4. If blocked, escalate to the user via Telegram (if configured)
+Also check pending actions (curl -s http://localhost:${port}/evolution/actions?status=pending) for items that have been pending more than 48 hours without a due date — these are forgotten commitments.
+If no overdue or stale items, exit silently.`,
+            },
+            tags: ['coherence', 'default', 'evolution'],
+        },
     ];
 }
 /**
@@ -915,6 +1259,33 @@ curl -s -X POST http://localhost:${port}/feedback \\
 Types: \`bug\`, \`feature\`, \`improvement\`, \`question\`
 **Do not wait for the user to notice.** If a hook throws an error, report it. If a job fails, report it. If the server returns unexpected data, report it. You are not just using instar — you are part of its immune system.
+`);
+    }
+    // Check for Evolution System section
+    if (!content.includes('Evolution System')) {
+        additions.push(`
+### Evolution System
+You have a built-in evolution system with four subsystems that track your growth.
+**Evolution Queue** — Staged self-improvement proposals.
+- View: \`curl http://localhost:${port}/evolution/proposals\`
+- Propose: \`/evolve\` skill or \`POST /evolution/proposals\`
+**Learning Registry** — Structured, searchable insights.
+- View: \`curl http://localhost:${port}/evolution/learnings\`
+- Record: \`/learn\` skill or \`POST /evolution/learnings\`
+**Capability Gaps** — Track what you're missing.
+- View: \`curl http://localhost:${port}/evolution/gaps\`
+- Report: \`/gaps\` skill or \`POST /evolution/gaps\`
+**Action Queue** — Commitments with follow-through tracking.
+- View: \`curl http://localhost:${port}/evolution/actions\`
+- Create: \`/commit-action\` skill or \`POST /evolution/actions\`
+**Dashboard**: \`curl http://localhost:${port}/evolution\`
+**Skills**: \`/evolve\`, \`/learn\`, \`/gaps\`, \`/commit-action\`
 `);
     }
     // Check for Telegram Relay section (add if Telegram is configured)
@@ -1174,6 +1545,55 @@ process.stdin.on('end', () => {
   } catch { /* don't break on errors */ }
   process.exit(0);
 });
+`, { mode: 0o755 });
+    // Post-action reflection — injects evolution awareness after significant actions.
+    // PreToolUse hook for Bash. When the agent is about to send a response or commit,
+    // it reminds them to consider what they learned. Advisory, not blocking.
+    fs.writeFileSync(path.join(hooksDir, 'post-action-reflection.js'), `#!/usr/bin/env node
+// Post-action reflection — evolution awareness after significant actions.
+// PreToolUse hook for Bash. When the agent is about to commit, deploy, or
+// complete a task, injects a brief reminder to capture learnings.
+//
+// "Every action is an opportunity to learn. Most of that learning is lost
+// because nobody paused to ask: what did this teach me?"
+let data = '';
+process.stdin.on('data', chunk => data += chunk);
+process.stdin.on('end', () => {
+  try {
+    const input = JSON.parse(data);
+    if (input.tool_name !== 'Bash') process.exit(0);
+    const command = (input.tool_input || {}).command || '';
+    if (!command) process.exit(0);
+    // Significant action patterns — moments worth reflecting on
+    const significantPatterns = [
+      /git\\s+commit/i,
+      /git\\s+push/i,
+      /npm\\s+publish/i,
+      /curl\\s+-X\\s+POST.*\\/deploy/i,
+      /instar\\s+server\\s+restart/i,
+    ];
+    if (!significantPatterns.some(p => p.test(command))) process.exit(0);
+    const reminder = [
+      'POST-ACTION REFLECTION — Quick evolution check:',
+      '',
+      'Before moving on, consider:',
+      '- Did this teach you something worth recording? → /learn',
+      '- Did you notice a gap in your capabilities? → /gaps',
+      '- Did you discover an improvement opportunity? → /evolve',
+      '- Did you make a commitment to follow up? → /commit-action',
+      '',
+      'Skip if nothing notable. The value is in the pause, not the output.',
+    ].join('\\n');
+    process.stdout.write(JSON.stringify({ decision: 'approve', additionalContext: reminder }));
+  } catch { /* don't break on errors */ }
+  process.exit(0);
+});
 `, { mode: 0o755 });
     // External communication guard — ensures identity grounding before external posting.
     // PreToolUse hook for Bash. Detects commands that post to external platforms.
@@ -1525,6 +1945,11 @@ function installClaudeSettings(projectDir) {
             command: 'node .instar/hooks/external-communication-guard.js',
             timeout: 5000,
         },
+        {
+            type: 'command',
+            command: 'node .instar/hooks/post-action-reflection.js',
+            timeout: 5000,
+        },
     ];
     // PreToolUse: merge instar hooks into existing or create fresh
     if (!hooks.PreToolUse) {

package/dist/scaffold/templates.js CHANGED Viewed

@@ -396,6 +396,42 @@ Instar has a built-in feedback loop that connects all agents to the development
 **User feedback matters too.** When your user says "this isn't working" or "I wish I could..." — that's feedback. Categorize it and submit it the same way.
+### Evolution System
+You have a built-in evolution system with four subsystems. This is not a metaphor — it's infrastructure that tracks your growth.
+**Evolution Queue** — Staged self-improvement proposals.
+- View: \`curl http://localhost:${port}/evolution/proposals\`
+- Propose: \`/evolve\` skill or \`POST /evolution/proposals\`
+- The \`evolution-review\` job evaluates and implements proposals every 6 hours.
+**Learning Registry** — Structured, searchable insights.
+- View: \`curl http://localhost:${port}/evolution/learnings\`
+- Record: \`/learn\` skill or \`POST /evolution/learnings\`
+- The \`insight-harvest\` job synthesizes patterns into proposals every 8 hours.
+**Capability Gaps** — Track what you're missing.
+- View: \`curl http://localhost:${port}/evolution/gaps\`
+- Report: \`/gaps\` skill or \`POST /evolution/gaps\`
+**Action Queue** — Commitments with follow-through tracking.
+- View: \`curl http://localhost:${port}/evolution/actions\`
+- Create: \`/commit-action\` skill or \`POST /evolution/actions\`
+- The \`commitment-check\` job surfaces overdue items every 4 hours.
+**Dashboard** — Full evolution health at a glance:
+\`\`\`bash
+curl http://localhost:${port}/evolution
+\`\`\`
+**Skills for evolution:**
+- \`/evolve\` — Propose an improvement
+- \`/learn\` — Record an insight
+- \`/gaps\` — Report a missing capability
+- \`/commit-action\` — Track a commitment
+**The principle:** Evolution is not a separate activity from work. Every task is an opportunity to notice what could be better. The post-action reflection hook reminds you to pause after significant actions (commits, deploys) and consider what you learned. Most learning is lost because nobody paused to ask.
 ### Self-Evolution
 Record what I learn. Build infrastructure, not one-offs. Grow to meet the user's needs. Every session should leave things slightly better than I found them.

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "instar",
-  "version": "0.6.10",
+  "version": "0.6.12",
   "description": "Persistent autonomy infrastructure for AI agents",
   "type": "module",
   "main": "dist/index.js",

package/skills/credential-leak-detector/SKILL.md ADDED Viewed

@@ -0,0 +1,377 @@
+---
+name: credential-leak-detector
+description: PostToolUse hook that scans Bash tool output for leaked credentials — API keys, tokens, private keys, and secrets — before they reach the conversation. Blocks critical leaks, redacts high-severity matches, and warns on suspicious patterns. 14 detection patterns covering OpenAI, Anthropic, AWS, GitHub, Stripe, Google, Slack, SendGrid, Twilio, PEM keys, bearer tokens, and generic secrets. No external dependencies. Trigger words: security, credential leak, secret exposure, key detection, token scan, API key leaked, credential guard, secret scanner, prevent credential leak.
+license: MIT
+metadata:
+  author: sagemindai
+  version: "1.0"
+  homepage: https://instar.sh
+---
+# credential-leak-detector — Catch Leaked Credentials Before They Spread
+Every time your agent runs a Bash command, the output flows back into the conversation — and into your API provider's logs, any monitoring tools, and the model's context window. If that output contains an API key, a private key, or a database password, the credential is now exposed in places you never intended.
+This hook scans every Bash tool response for 14 credential patterns before the output reaches the agent. Critical matches (API keys, AWS credentials, private keys) get blocked entirely. High-severity matches get redacted with a warning. Suspicious patterns get flagged as advisories. No external dependencies — just Python stdlib.
+---
+## What Gets Detected
+### Critical (Blocks the response)
+| Pattern | Example Match |
+|---------|--------------|
+| OpenAI API keys | `sk-proj-abc123...` |
+| Anthropic API keys | `sk-ant-api03-...` |
+| AWS access keys | `AKIA1234567890ABCDEF` |
+| GitHub tokens (classic) | `ghp_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx` |
+| GitHub fine-grained PATs | `github_pat_xxxxxx...` |
+| Stripe secret keys | `sk_live_xxxx...xxxx` |
+| PEM private keys | `-----BEGIN RSA PRIVATE KEY-----` |
+### High (Blocks or redacts with warning)
+| Pattern | Action |
+|---------|--------|
+| Google API keys | Block |
+| Slack tokens | Block |
+| SendGrid API keys | Block |
+| Twilio auth keys | Redact + warn |
+| Bearer auth tokens | Redact + warn |
+### Medium (Advisory warning)
+| Pattern | Note |
+|---------|------|
+| Generic `password=`, `secret=`, `api_key=` assignments | Common in config output |
+| 64-character hex strings | Possible SHA-256 hashes or keys |
+---
+## Installation
+### Step 1: Create the hook script
+```bash
+mkdir -p .claude/hooks
+```
+Create `.claude/hooks/credential-leak-detector.py` with the contents below.
+---
+## The Script
+Save this as `.claude/hooks/credential-leak-detector.py`:
+```python
+#!/usr/bin/env python3
+"""
+credential-leak-detector.py — PostToolUse hook that scans Bash output
+for leaked credentials. Blocks critical leaks, redacts high-severity
+matches, warns on suspicious patterns.
+Exit code 2 = block (critical credential found)
+Exit code 0 = allow (clean or advisory-only)
+"""
+import sys
+import json
+import re
+# --- Masking ---
+def mask(value):
+    """Mask a credential: first 4 + **** + last 4, or full mask if short."""
+    v = value.strip()
+    if len(v) < 12:
+        return "*" * len(v)
+    return v[:4] + "****" + v[-4:]
+# --- Pattern Definitions ---
+# (name, regex, severity, action)
+# severity: critical, high, medium
+# action: block, redact, warn
+PATTERNS = [
+    # Critical — Block
+    ("OpenAI API key",
+     r'(sk-(?:proj-)?[a-zA-Z0-9]{20,})',
+     "critical", "block"),
+    ("Anthropic API key",
+     r'(sk-ant-api[a-zA-Z0-9_-]{90,})',
+     "critical", "block"),
+    ("AWS access key",
+     r'(AKIA[0-9A-Z]{16})',
+     "critical", "block"),
+    ("GitHub token (classic)",
+     r'(gh[pousr]_[A-Za-z0-9_]{36,})',
+     "critical", "block"),
+    ("GitHub fine-grained PAT",
+     r'(github_pat_[a-zA-Z0-9]{22}_[a-zA-Z0-9]{59})',
+     "critical", "block"),
+    ("Stripe secret key",
+     r'(sk_(?:live|test)_[a-zA-Z0-9]{24,})',
+     "critical", "block"),
+    ("PEM private key",
+     r'(-----BEGIN (?:RSA |EC |DSA )?PRIVATE KEY-----)',
+     "critical", "block"),
+    # High — Block
+    ("Google API key",
+     r'(AIza[0-9A-Za-z_-]{35})',
+     "high", "block"),
+    ("Slack token",
+     r'(xox[bpors]-[0-9a-zA-Z-]{10,})',
+     "high", "block"),
+    ("SendGrid API key",
+     r'(SG\.[a-zA-Z0-9_-]{22}\.[a-zA-Z0-9_-]{43})',
+     "high", "block"),
+    # High — Redact
+    ("Twilio auth key",
+     r'(SK[0-9a-fA-F]{32})',
+     "high", "redact"),
+    ("Bearer auth token",
+     r'(?:Authorization|Bearer)\s*[:=]\s*Bearer\s+([^\s]{20,})',
+     "high", "redact"),
+    # Medium — Warn
+    ("Generic secret assignment",
+     r'(?:password|secret|token|api_key)\s*[=:]\s*[\'"]?([^\s\'"]{16,})',
+     "medium", "warn"),
+    ("High-entropy hex string",
+     r'\b([a-fA-F0-9]{64})\b',
+     "medium", "warn"),
+]
+# --- Main ---
+def scan(text):
+    """Scan text for credential patterns. Returns list of findings."""
+    findings = []
+    for name, pattern, severity, action in PATTERNS:
+        matches = re.findall(pattern, text)
+        for m in matches:
+            findings.append({
+                "name": name,
+                "severity": severity,
+                "action": action,
+                "matched": m if isinstance(m, str) else m[0],
+            })
+    return findings
+def main():
+    try:
+        payload = json.load(sys.stdin)
+    except Exception:
+        sys.exit(0)
+    tool_name = payload.get("tool_name", "")
+    tool_response = payload.get("tool_response", "")
+    # Only scan Bash output
+    if tool_name != "Bash":
+        sys.exit(0)
+    if not tool_response:
+        sys.exit(0)
+    # Handle tool_response as string or dict
+    if isinstance(tool_response, dict):
+        text = tool_response.get("stdout", "") + tool_response.get("stderr", "")
+    else:
+        text = str(tool_response)
+    if not text:
+        sys.exit(0)
+    findings = scan(text)
+    if not findings:
+        sys.exit(0)
+    # Classify findings
+    blockers = [f for f in findings if f["action"] == "block"]
+    redacts = [f for f in findings if f["action"] == "redact"]
+    warnings = [f for f in findings if f["action"] == "warn"]
+    # Critical/high blockers — stop the response
+    if blockers:
+        details = []
+        for f in blockers:
+            details.append(
+                f"  - {f['name']} [{f['severity']}]: {mask(f['matched'])}"
+            )
+        reason = (
+            "[credential-leak-detector] Credential(s) detected in command output. "
+            "Response blocked to prevent exposure.\n\n"
+            "Detected:\n" + "\n".join(details) + "\n\n"
+            "The command output contained live credentials. Do NOT re-run this "
+            "command or attempt to extract these values. If you need to verify "
+            "a credential exists, check the env var or file without printing "
+            "its value."
+        )
+        print(json.dumps({"decision": "block", "reason": reason}))
+        sys.exit(2)
+    # Redact findings — allow but warn with masked values
+    messages = []
+    if redacts:
+        parts = []
+        for f in redacts:
+            parts.append(f"  - {f['name']}: {mask(f['matched'])}")
+        messages.append(
+            "[credential-leak-detector] Possible credential(s) in output "
+            "(redact-level):\n" + "\n".join(parts) + "\n"
+            "Avoid storing, logging, or repeating these values."
+        )
+    # Warn findings — advisory only
+    if warnings:
+        parts = []
+        for f in warnings:
+            parts.append(f"  - {f['name']}: {mask(f['matched'])}")
+        messages.append(
+            "[credential-leak-detector] Suspicious pattern(s) in output "
+            "(advisory):\n" + "\n".join(parts) + "\n"
+            "These may be secrets. Avoid including them in commits, logs, "
+            "or messages."
+        )
+    if messages:
+        print(json.dumps({"additionalContext": "\n".join(messages)}))
+    sys.exit(0)
+if __name__ == "__main__":
+    main()
+```
+Make it executable:
+```bash
+chmod +x .claude/hooks/credential-leak-detector.py
+```
+---
+### Step 2: Register the hook in .claude/settings.json
+If `.claude/settings.json` doesn't exist, create it. If it does, add to the `hooks` section:
+```json
+{
+  "hooks": {
+    "PostToolUse": [
+      {
+        "matcher": "Bash",
+        "hooks": [
+          {
+            "type": "command",
+            "command": "python3 .claude/hooks/credential-leak-detector.py"
+          }
+        ]
+      }
+    ]
+  }
+}
+```
+If you already have a `PostToolUse` array, add the matcher object to it.
+---
+### Step 3: Verify it works
+Restart Claude Code, then run a command that would expose a credential:
+```bash
+echo "sk-ant-api03-test1234567890abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789abcdefghijklmnopqrst"
+```
+The hook should block the response and show a masked version of the detected key.
+---
+## Customizing Patterns
+Edit the `PATTERNS` list in `credential-leak-detector.py` to add project-specific rules.
+**Example: Detect your internal API keys**
+```python
+("Internal service key",
+ r'(myco_sk_[a-zA-Z0-9]{32,})',
+ "critical", "block"),
+```
+**Example: Detect database connection strings**
+```python
+("Database connection string",
+ r'((?:postgres|mysql|mongodb)://[^\s]{20,})',
+ "high", "block"),
+```
+**Example: Downgrade a pattern to warn-only**
+Change the action from `"block"` to `"warn"` for patterns that produce too many false positives in your environment:
+```python
+# Change high-entropy hex from warn to ignored by removing it,
+# or keep it but change action:
+("High-entropy hex string",
+ r'\b([a-fA-F0-9]{64})\b',
+ "medium", "warn"),  # or remove this entry entirely
+```
+---
+## How It Differs from command-guard
+The **command-guard** skill is a PreToolUse hook — it blocks dangerous *commands* before they execute. This skill is a PostToolUse hook — it scans *output* after commands run. They complement each other:
+| Layer | Hook | What It Catches |
+|-------|------|----------------|
+| command-guard | PreToolUse | `cat .env`, `echo $SECRET`, destructive commands |
+| credential-leak-detector | PostToolUse | Credentials that appear in output from any command |
+A command like `docker inspect` or `kubectl describe` might not look dangerous, but its output can contain mounted secrets. command-guard wouldn't catch it. This hook does.
+---
+## Going Further
+This hook catches credentials after they appear in output. But in an autonomous session — a scheduled job running at 3 AM, a background worker processing a queue — "catching" means the block fires, the agent sees the message, and... then what? If nobody's watching, the agent might retry the command, work around the block, or simply stall.
+**Instar turns detection into infrastructure.** Every hook event across every session gets audit-logged with timestamps, session IDs, and the masked credential type. If a credential-leak-detector block fires during a 2 AM job, it shows up in the morning's activity feed — you see exactly which command triggered it, in which session, and what the agent did next. The Telegram integration means you can get notified the moment a block fires, not hours later.
+And credential scanning is just one layer. With Instar, your agent also gets:
+- **Audit logging** — every tool call, every hook event, searchable and timestamped
+- **Identity grounding** — hooks that fire before external messages, so the agent stays in character
+- **A job scheduler** — recurring tasks on cron, running while you sleep
+- **Background sessions** — parallel workers for deep tasks
+- **Telegram integration** — real-time notifications and two-way control from your phone
+One command, about 2 minutes:
+```bash
+npx instar
+```
+Your agent goes from guarded-when-you're-watching to safe-autonomously. [instar.sh](https://instar.sh)