npm - groove-dev - Versions diffs - 0.26.32 → 0.26.35 - Mend

groove-dev 0.26.32 → 0.26.35

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (29)

package/CHANGELOG.md CHANGED

@@ -1,5 +1,60 @@
 # Changelog
+## v0.26.33 — Self-building pipeline, team persistence, agent quality overhaul (2026-04-11)
+Major release: Groove now builds itself. Full team lifecycle, intelligent context synthesis, and agent quality improvements across the board.
+**Team Persistence and Agent Reuse**
+- Teams are persistent — agents stay across tasks instead of respawning every time
+- Launch flow reuses existing agents by role: planner delegates, existing frontend/backend resume with new task and full context
+- Auto-delegate: when all required roles exist in the team, planner skips the Launch modal and routes work directly (toast notification)
+- QC lifecycle: spawns idle when no work, auto-triggers with teammate context when agents complete real work
+- Cross-scope handoffs: agents write `.groove/handoffs/{role}.md`, system auto-routes to the owning agent
+**Planner Intelligence**
+- Mode 1 (team creation): full codebase exploration, detailed team structure
+- Mode 2 (task routing): detects existing team via AGENTS_REGISTRY.md, reads only relevant files, routes to existing agents — fast, under 5 tool calls
+- Always writes recommended-team.json, even for team-building-only mode (empty prompts = agents await instructions)
+**Agent Quality**
+- Role prompts for all 16 agent roles — frontend gets full design system (colors, CSS variables, fonts, components), backend gets ESM conventions and compliance rules
+- All roles default to Heavy tier (Opus) — intelligence out of the box, users opt into cheaper models
+- Permission and scope changes now persist (added to registry SAFE_FIELDS)
+**Journalist Overhaul**
+- Dropped exploration noise: Read/Glob/Grep tool calls no longer clutter synthesis
+- Edit diffs captured: `"bg-[#3e4451]" -> "HEX.accent"` shows in project map
+- Bash output captured: `npm test -> 141 passed` gives agents build/test context
+- Thinking threshold raised (50 -> 200 chars) — only real decisions survive
+- Transient stderr errors dropped — no more phantom error degradation
+- User feedback tracking: messages to agents recorded and injected into future agent context, persisted to disk
+- Decisions log deduplication: >60% word overlap with previous entry = skip
+- Completed agents persist in project map for 30 minutes after finishing
+- Decisions log capped at 20 entries
+**Promote Pipeline**
+- `./promote.sh`: build GUI -> run tests -> staging daemon on :31416 -> verify -> publish to npm + push to GitHub
+- Staging uses isolated `.groove-staging/` directory — doesn't kill running daemon
+- npm registry retry with version verification and cache clear
+- Explicit file staging (no `git add -A`) for safety
+**UX Improvements**
+- Thinking indicator: rich animated phases (Analyzing, Reasoning, Planning...), elapsed timer, shimmer sweep, fires immediately on all launch paths
+- Chat message dedup: Claude Code assistant + result events no longer double up
+- Chat input persists across tab switches (stored in Zustand per agent)
+- Agent tree: debounced fitView prevents jitter on team launch, node positions saved by name (stable across resumes)
+- Edge handles recalculated from actual node positions on every render
+- HTML preview in editor: Code/Preview toggle for .html files via sandboxed iframe
+- Context bars use teal accent color (agent nodes + fleet panel)
+**Infrastructure**
+- Terminal PTY uses crypto.randomUUID() instead of predictable counters
+- Codex agents skip PM gate (sandboxed providers can't reach localhost)
+- Skill marketplace: Pull Latest, Uninstall, and Update flow
+- Auth logs scrubbed — username only, no PII
+- GC: immediate cleanup on team delete, orphaned logs removed instantly
+- QC agents verify builds (`npm run build`), never start long-running dev servers
 ## v0.20.0 — Settings page, Ollama setup, GUI v2 rebuild, full system overhaul (2026-04-08)
 Major release: complete GUI rebuild + Settings page + Ollama setup + tech debt cleanup.

package/node_modules/@groove-dev/daemon/src/api.js CHANGED

@@ -484,6 +484,9 @@ export function createApi(app, daemon) {
       const agent = daemon.registry.get(req.params.id);
       if (!agent) return res.status(404).json({ error: 'Agent not found' });
+      // Record user feedback so the journalist can include it in future agent context
+      if (daemon.journalist) daemon.journalist.recordUserFeedback(agent, message.trim());
       // Agent loop path — send message directly to the running loop
       if (daemon.processes.hasAgentLoop(req.params.id)) {
         const sent = await daemon.processes.sendMessage(req.params.id, message.trim());

package/node_modules/@groove-dev/daemon/src/introducer.js CHANGED

@@ -85,6 +85,19 @@ export class Introducer {
       lines.push(`  GROOVE will automatically wake the target agent and deliver your request.`);
       lines.push(`- Check AGENTS_REGISTRY.md for the latest team state.`);
+      // User feedback from previous tasks — critical context about what the user
+      // observed and what needs to change. Prevents agents from repeating mistakes.
+      const feedback = this.daemon.journalist?.getUserFeedback() || [];
+      if (feedback.length > 0) {
+        lines.push('');
+        lines.push(`## User Feedback (from previous tasks)`);
+        lines.push('');
+        lines.push(`The user sent these messages about previous agents' work. Pay close attention — these indicate issues that previous agents missed:`);
+        for (const fb of feedback.slice(-10)) {
+          lines.push(`- **${fb.agentName}** (${fb.role}): "${fb.message}"`);
+        }
+      }
       // Project files section — tell the new agent what exists and what to read
       if (allTeamFiles.length > 0) {
         lines.push('');

package/node_modules/@groove-dev/daemon/src/journalist.js CHANGED

@@ -57,9 +57,18 @@ export class Journalist {
     if (this.synthesizing) return; // Don't overlap
     const agents = this.daemon.registry.getAll();
-    const activeAgents = agents.filter((a) => a.status === 'running');
+    const running = agents.filter((a) => a.status === 'running');
+    // Include recently completed agents (last 30 min) so their work persists
+    // in the project map instead of vanishing the moment they finish
+    const thirtyMinAgo = new Date(Date.now() - 30 * 60 * 1000).toISOString();
+    const recentlyCompleted = agents.filter((a) =>
+      a.status === 'completed' && a.lastActivity && a.lastActivity > thirtyMinAgo
+      && !running.some((r) => r.id === a.id)
+    );
+    const activeAgents = [...running, ...recentlyCompleted];
-    // Skip if no active agents
+    // Skip if no agents to synthesize
     if (activeAgents.length === 0) return;
     // Smart scheduling: skip if no new log output since last cycle
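
The selection logic in the hunk above can be sketched in isolation as follows. This is a minimal illustration, not the package's code: `selectAgentsForSynthesis` and its `now` / `windowMs` parameters are hypothetical names, and it assumes `lastActivity` is an ISO-8601 UTC string of the kind produced by `toISOString()`, which is what makes the plain string comparison valid.

```javascript
// Hypothetical helper mirroring the agent selection in the hunk above.
// Assumption: agent.lastActivity is an ISO-8601 UTC string (toISOString() format).
function selectAgentsForSynthesis(agents, now = Date.now(), windowMs = 30 * 60 * 1000) {
  const cutoff = new Date(now - windowMs).toISOString();
  const running = agents.filter((a) => a.status === 'running');
  // Fixed-width ISO-8601 UTC strings sort lexicographically in time order,
  // so comparing the strings directly is equivalent to comparing the dates.
  const recentlyCompleted = agents.filter(
    (a) => a.status === 'completed' && a.lastActivity && a.lastActivity > cutoff
  );
  return [...running, ...recentlyCompleted];
}

// Usage: only the running agent and the one that finished 15 minutes ago are kept.
const now = Date.parse('2026-04-11T12:00:00.000Z');
const selected = selectAgentsForSynthesis([
  { id: 'a', status: 'running' },
  { id: 'b', status: 'completed', lastActivity: '2026-04-11T11:45:00.000Z' },
  { id: 'c', status: 'completed', lastActivity: '2026-04-11T11:00:00.000Z' },
  { id: 'd', status: 'completed' },
], now);
console.log(selected.map((a) => a.id));
```

The sketch drops the `!running.some(...)` guard from the hunk, since an agent cannot match both `status === 'running'` and `status === 'completed'` at once.
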
@@ -149,33 +158,75 @@ export class Journalist {
   }
   filterLog(rawLog, agent) {
-    // Parse stream-json lines and extract meaningful events
+    // Parse stream-json lines and extract meaningful events.
+    // Focus on PROGRESS (writes, edits, commands with results) not EXPLORATION (reads, greps).
     const entries = [];
     const lines = rawLog.split('\n');
+    const toolResults = new Map(); // tool_use_id -> result text
+    // First pass: collect tool results so we can attach them to tool calls
+    for (const line of lines) {
+      if (!line.trim() || line.startsWith('[')) continue;
+      try {
+        const data = JSON.parse(line);
+        if (data.type === 'user' && data.message?.content) {
+          const content = Array.isArray(data.message.content) ? data.message.content : [];
+          for (const block of content) {
+            if (block.type === 'tool_result' && block.tool_use_id) {
+              const text = typeof block.content === 'string' ? block.content
+                : Array.isArray(block.content) ? block.content.map((c) => c.text || '').join('').slice(0, 300)
+                : '';
+              if (text) toolResults.set(block.tool_use_id, text);
+            }
+          }
+        }
+      } catch { /* skip */ }
+    }
+    // Second pass: extract meaningful events
     for (const line of lines) {
-      // Skip empty lines and GROOVE spawn headers
       if (!line.trim() || line.startsWith('[')) continue;
       try {
         const data = JSON.parse(line);
-        // Tool use (file edits, commands)
+        // Tool use — only keep WRITES, EDITS, and COMMANDS (progress, not exploration)
         if (data.type === 'assistant' && data.message?.content) {
           const content = data.message.content;
           if (Array.isArray(content)) {
             for (const block of content) {
               if (block.type === 'tool_use') {
-                entries.push({
+                const tool = block.name;
+                // Skip exploration tools — reads/searches are noise for synthesis
+                if (tool === 'Read' || tool === 'Glob' || tool === 'Grep') continue;
+                const entry = {
                   type: 'tool',
-                  tool: block.name,
-                  input: this.summarizeToolInput(block.name, block.input),
+                  tool,
+                  input: this.summarizeToolInput(tool, block.input),
                   timestamp: data.timestamp,
-                });
+                };
+                // Capture what was actually changed for Edit operations
+                if (tool === 'Edit' && block.input) {
+                  const old = (block.input.old_string || '').slice(0, 150);
+                  const nw = (block.input.new_string || '').slice(0, 150);
+                  if (old && nw) entry.diff = `"${old}" → "${nw}"`;
+                }
+                // Capture Bash command output (exit code, first line of result)
+                if (tool === 'Bash' && block.id) {
+                  const result = toolResults.get(block.id);
+                  if (result) entry.output = result.slice(0, 200);
+                }
+                entries.push(entry);
               } else if (block.type === 'text' && block.text) {
-                // Only keep substantial text (decisions, explanations)
+                // Only keep substantial reasoning (decisions, conclusions)
+                // Short fragments like "Let me check..." are noise
                 const text = block.text.trim();
-                if (text.length > 50) {
+                if (text.length > 200) {
                   entries.push({ type: 'thinking', text: text.slice(0, 2000), timestamp: data.timestamp });
                 }
               }
@@ -194,10 +245,8 @@ export class Journalist {
           });
         }
       } catch {
-        // Not JSON — check for plain text signals
-        if (line.includes('Error') || line.includes('error')) {
-          entries.push({ type: 'error', text: line.trim().slice(0, 200) });
-        }
+        // Skip non-JSON lines entirely — transient stderr like ENOENT
+        // causes degradation when agents try to "fix" phantom errors
       }
     }
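
The two-pass structure of `filterLog` shown in the hunks above exists because a `tool_result` block appears in the log after the `tool_use` block it answers, so results have to be indexed before calls are walked. A simplified sketch of that idea, under stated assumptions: `extractProgress` is a hypothetical name, only string-valued `tool_result` content is handled, and text blocks, Edit diffs, and result events are left out.

```javascript
// Hypothetical, simplified version of the two-pass filter.
// Assumption: each log line is either a "[...]" header, a stream-json object,
// or arbitrary non-JSON text (which is dropped).
function extractProgress(rawLog) {
  const parsed = [];
  for (const line of rawLog.split('\n')) {
    if (!line.trim() || line.startsWith('[')) continue;
    try { parsed.push(JSON.parse(line)); } catch { /* skip non-JSON lines */ }
  }

  // Pass 1: index tool results by the id of the call they answer.
  const results = new Map();
  for (const data of parsed) {
    if (data.type !== 'user' || !Array.isArray(data.message?.content)) continue;
    for (const block of data.message.content) {
      if (block.type === 'tool_result' && block.tool_use_id && typeof block.content === 'string') {
        results.set(block.tool_use_id, block.content);
      }
    }
  }

  // Pass 2: keep progress tools only, attaching output to Bash calls.
  const EXPLORATION = new Set(['Read', 'Glob', 'Grep']);
  const entries = [];
  for (const data of parsed) {
    if (data.type !== 'assistant' || !Array.isArray(data.message?.content)) continue;
    for (const block of data.message.content) {
      if (block.type !== 'tool_use' || EXPLORATION.has(block.name)) continue;
      const entry = { type: 'tool', tool: block.name };
      if (block.name === 'Bash') {
        entry.input = (block.input?.command || '').slice(0, 150);
        const output = results.get(block.id);
        if (output) entry.output = output.slice(0, 200);
      }
      entries.push(entry);
    }
  }
  return entries;
}
```

Fed a log containing a `Read` call, a `Bash` call with a later matching `tool_result`, and a stray `Error: ...` line, this returns a single `Bash` entry carrying its output. The read and the non-JSON line are both dropped.
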
@@ -208,15 +257,11 @@ export class Journalist {
     if (!input) return '';
     switch (toolName) {
       case 'Write':
-      case 'Edit':
         return input.file_path || input.path || '';
-      case 'Read':
+      case 'Edit':
         return input.file_path || input.path || '';
       case 'Bash':
-        return (input.command || '').slice(0, 100);
-      case 'Glob':
-      case 'Grep':
-        return input.pattern || '';
+        return (input.command || '').slice(0, 150);
       default:
         return JSON.stringify(input).slice(0, 100);
     }
@@ -295,14 +340,16 @@ export class Journalist {
   formatEntry(entry) {
     switch (entry.type) {
-      case 'tool':
-        return `- [tool] ${entry.tool}: ${entry.input}`;
+      case 'tool': {
+        let line = `- [${entry.tool}] ${entry.input}`;
+        if (entry.diff) line += ` — changed: ${entry.diff}`;
+        if (entry.output) line += ` → ${entry.output.split('\n')[0]}`;
+        return line;
+      }
       case 'thinking':
-        return `- [thought] ${entry.text.slice(0, 200)}`;
+        return `- [thought] ${entry.text.slice(0, 300)}`;
       case 'result':
-        return `- [result] ${entry.text.slice(0, 200)} (${entry.turns} turns, ${entry.duration}ms)`;
-      case 'error':
-        return `- [error] ${entry.text}`;
+        return `- [result] ${entry.text.slice(0, 300)} (${entry.turns} turns, ${entry.duration}ms)`;
       default:
         return `- [${entry.type}] ${entry.text || ''}`;
     }
@@ -515,8 +562,25 @@ export class Journalist {
       existing = readFileSync(path, 'utf8').replace(/^# GROOVE Decisions Log[\s\S]*?\n\n/, '');
     }
+    // Deduplicate — skip if the new decisions are substantially similar to the most recent entry.
+    // Extract the last cycle's decisions text for comparison.
+    if (existing) {
+      const lastEntry = existing.match(/^## Cycle[\s\S]*?\n\n([\s\S]*?)(?=\n---|\n\n\*Auto|\n## Cycle|$)/);
+      if (lastEntry) {
+        const lastText = lastEntry[1].trim().toLowerCase().replace(/\s+/g, ' ');
+        const newText = decisions.trim().toLowerCase().replace(/\s+/g, ' ');
+        // If >60% of the new text appears in the last entry, skip (near-duplicate)
+        const newWords = newText.split(' ').filter((w) => w.length > 3);
+        const overlap = newWords.filter((w) => lastText.includes(w)).length;
+        if (newWords.length > 0 && overlap / newWords.length > 0.6) return;
+      }
+    }
     const entry = `## Cycle ${this.cycleCount} — ${new Date().toISOString()}\n\n${decisions}\n\n`;
-    writeFileSync(path, header + entry + existing);
+    // Keep last 20 entries max to prevent unbounded growth
+    const entries = (entry + existing).split(/(?=^## Cycle)/m).slice(0, 20).join('');
+    writeFileSync(path, header + entries);
   }
   writeAgentSessionLogs(agents, filteredLogs) {
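
The deduplication and capping steps in the hunk above can be pulled out into two small pure functions. This is a sketch for illustration only: `isNearDuplicate` and `capEntries` are hypothetical names, while the 0.6 threshold, the "words longer than 3 characters" filter, and the 20-entry cap come from the diff.

```javascript
// Hypothetical helper: true when more than `threshold` of the significant
// words in newText also occur somewhere in lastText.
function isNearDuplicate(newText, lastText, threshold = 0.6) {
  const normalize = (s) => s.trim().toLowerCase().replace(/\s+/g, ' ');
  const last = normalize(lastText);
  const words = normalize(newText).split(' ').filter((w) => w.length > 3);
  if (words.length === 0) return false;
  // Note: includes() is a substring test, so "auth" also matches "author".
  const overlap = words.filter((w) => last.includes(w)).length;
  return overlap / words.length > threshold;
}

// Hypothetical helper: keep only the first `max` "## Cycle" sections.
// The lookahead splits before each heading without consuming it.
function capEntries(log, max = 20) {
  return log.split(/(?=^## Cycle)/m).slice(0, max).join('');
}

console.log(isNearDuplicate(
  'Switch the auth module to JWT tokens',
  'switch the auth module to jwt tokens today'
)); // true: all four significant words appear in the previous entry
```

Because new entries are prepended to the log, `slice(0, max)` keeps the most recent cycles and discards the oldest. The substring-based overlap check is cheap but can over-count short words that occur inside longer ones.
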
@@ -724,6 +788,60 @@ export class Journalist {
     }
   }
+  // --- User Feedback Tracking ---
+  /**
+   * Record user feedback/messages sent to agents.
+   * This gets included in the project map and handoff briefs so future
+   * agents know what the user said about previous work.
+   */
+  recordUserFeedback(agent, message) {
+    if (!this._userFeedback) this._loadFeedback();
+    if (!this._userFeedback[agent.id]) this._userFeedback[agent.id] = [];
+    this._userFeedback[agent.id].push({
+      agentName: agent.name,
+      role: agent.role,
+      message: message.slice(0, 500),
+      timestamp: new Date().toISOString(),
+    });
+    // Keep last 20 per agent
+    if (this._userFeedback[agent.id].length > 20) {
+      this._userFeedback[agent.id] = this._userFeedback[agent.id].slice(-20);
+    }
+    this._saveFeedback();
+  }
+  _loadFeedback() {
+    const p = resolve(this.daemon.grooveDir, 'user-feedback.json');
+    try {
+      if (existsSync(p)) this._userFeedback = JSON.parse(readFileSync(p, 'utf8'));
+      else this._userFeedback = {};
+    } catch { this._userFeedback = {}; }
+  }
+  _saveFeedback() {
+    try {
+      writeFileSync(resolve(this.daemon.grooveDir, 'user-feedback.json'), JSON.stringify(this._userFeedback, null, 2));
+    } catch { /* non-fatal */ }
+  }
+  /**
+   * Get user feedback for an agent (or all agents in a team).
+   * Used by the introducer to give agents context about previous attempts.
+   */
+  getUserFeedback(agentOrTeamId) {
+    if (!this._userFeedback) this._loadFeedback();
+    if (!this._userFeedback) return [];
+    // Check if it's an agent ID
+    if (this._userFeedback[agentOrTeamId]) return this._userFeedback[agentOrTeamId];
+    // Check by team — collect feedback for all agents in the team
+    const all = [];
+    for (const [, entries] of Object.entries(this._userFeedback)) {
+      all.push(...entries);
+    }
+    return all.sort((a, b) => a.timestamp.localeCompare(b.timestamp)).slice(-30);
+  }
   // --- Accessors ---
   getLastSynthesis() {
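
The retention rules of the feedback tracker added above (500-character messages, last 20 per agent, last 30 overall) can be exercised with an in-memory stand-in. `FeedbackStore` is a hypothetical class written for this sketch. It omits the `user-feedback.json` persistence and is not the package's API.

```javascript
// Hypothetical in-memory stand-in for the journalist's feedback tracking.
class FeedbackStore {
  constructor(perAgentCap = 20, overallCap = 30) {
    this.perAgentCap = perAgentCap;
    this.overallCap = overallCap;
    this.byAgent = {};
  }

  record(agent, message) {
    const list = (this.byAgent[agent.id] ??= []);
    list.push({
      agentName: agent.name,
      role: agent.role,
      message: message.slice(0, 500),
      timestamp: new Date().toISOString(),
    });
    // Keep only the most recent entries for this agent.
    if (list.length > this.perAgentCap) {
      this.byAgent[agent.id] = list.slice(-this.perAgentCap);
    }
  }

  // With a known agent id: that agent's entries.
  // Otherwise (no id, or an unknown id): the newest entries across all agents.
  get(agentId) {
    if (agentId !== undefined && this.byAgent[agentId]) return this.byAgent[agentId];
    return Object.values(this.byAgent)
      .flat()
      .sort((a, b) => a.timestamp.localeCompare(b.timestamp))
      .slice(-this.overallCap);
  }
}

const store = new FeedbackStore();
store.record({ id: 'a1', name: 'frontend-1', role: 'frontend' }, 'The header still overlaps on mobile');
console.log(store.get('a1').length); // 1
```

As in the diff, any lookup key that is not a recorded agent id falls through to the all-agents branch. That is why the introducer's argument-less `getUserFeedback()` call receives feedback from every agent.
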

package/node_modules/@groove-dev/daemon/src/providers/claude-code.js CHANGED

@@ -75,9 +75,11 @@ export class ClaudeCodeProvider extends Provider {
   }
   buildHeadlessCommand(prompt, model) {
-    const args = ['-p', prompt, '--output-format', 'stream-json', '--verbose'];
+    // Pass prompt via stdin to avoid OS argument length limits.
+    // Long prompts (journalist synthesis with agent logs) can exceed ARG_MAX.
+    const args = ['-p', '--output-format', 'stream-json', '--verbose'];
     if (model) args.push('--model', model);
-    return { command: 'claude', args, env: {} };
+    return { command: 'claude', args, env: {}, stdin: prompt };
   }
   buildFullPrompt(agent) {
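
The diff does not show how the new `stdin` field returned by `buildHeadlessCommand` is consumed. One plausible shape for the caller, sketched with Node's `child_process.spawn`, is below. `runHeadless` is a hypothetical name, and the real daemon's process manager may differ.

```javascript
// Hypothetical runner for a { command, args, env, stdin } descriptor.
// Writing the prompt to the child's stdin avoids the OS limit on argument length.
async function runHeadless({ command, args, env = {}, stdin }) {
  // Dynamic import keeps the sketch loadable as either CommonJS or ESM.
  const { spawn } = await import('node:child_process');
  return new Promise((resolve, reject) => {
    const child = spawn(command, args, {
      env: { ...process.env, ...env },
      stdio: ['pipe', 'pipe', 'inherit'],
    });
    let out = '';
    child.stdout.setEncoding('utf8');
    child.stdout.on('data', (chunk) => { out += chunk; });
    child.on('error', reject);
    child.on('close', (code) => resolve({ code, out }));
    // Ignore EPIPE if the child exits before reading all of its input.
    child.stdin.on('error', () => {});
    // end() writes the prompt (if any) and then closes stdin so the child sees EOF.
    child.stdin.end(stdin ?? '');
  });
}
```

A call would look like `await runHeadless(provider.buildHeadlessCommand(prompt, model))`, where `provider` is assumed to be a `ClaudeCodeProvider` instance. Closing stdin after the write matters: a child that reads until EOF would otherwise wait indefinitely.
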

package/node_modules/@groove-dev/daemon/test/journalist.test.js CHANGED

@@ -52,13 +52,13 @@ describe('Journalist', () => {
       assert.equal(entries[0].input, 'src/api/auth.js');
     });
-    it('should extract errors from logs', () => {
+    it('should skip non-JSON lines (transient errors are noise)', () => {
       const { daemon } = createMockDaemon();
       const journalist = new Journalist(daemon);
+      // Non-JSON error lines are dropped to prevent context degradation
       const entries = journalist.filterLog('Error: something broke\nTypeError: undefined is not a function', {});
-      assert.equal(entries.length, 2);
-      assert.equal(entries[0].type, 'error');
+      assert.equal(entries.length, 0);
     });
     it('should extract result events', () => {