npm - gencode-ai - Versions diffs - 0.3.0 → 0.4.0 - Mend

gencode-ai 0.3.0 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (116) hide show

package/RELEASE_NOTES_v0.4.0.md +140 -0
package/dist/agent/agent.d.ts +17 -2
package/dist/agent/agent.d.ts.map +1 -1
package/dist/agent/agent.js +279 -49
package/dist/agent/agent.js.map +1 -1
package/dist/agent/types.d.ts +15 -1
package/dist/agent/types.d.ts.map +1 -1
package/dist/checkpointing/checkpoint-manager.d.ts +24 -0
package/dist/checkpointing/checkpoint-manager.d.ts.map +1 -1
package/dist/checkpointing/checkpoint-manager.js +28 -0
package/dist/checkpointing/checkpoint-manager.js.map +1 -1
package/dist/cli/components/App.d.ts +8 -0
package/dist/cli/components/App.d.ts.map +1 -1
package/dist/cli/components/App.js +478 -36
package/dist/cli/components/App.js.map +1 -1
package/dist/cli/components/CommandSuggestions.d.ts.map +1 -1
package/dist/cli/components/CommandSuggestions.js +2 -0
package/dist/cli/components/CommandSuggestions.js.map +1 -1
package/dist/cli/components/Header.d.ts +6 -1
package/dist/cli/components/Header.d.ts.map +1 -1
package/dist/cli/components/Header.js +3 -3
package/dist/cli/components/Header.js.map +1 -1
package/dist/cli/components/Messages.d.ts.map +1 -1
package/dist/cli/components/Messages.js +7 -9
package/dist/cli/components/Messages.js.map +1 -1
package/dist/cli/index.js +3 -2
package/dist/cli/index.js.map +1 -1
package/dist/config/types.d.ts +20 -1
package/dist/config/types.d.ts.map +1 -1
package/dist/config/types.js.map +1 -1
package/dist/index.d.ts +2 -2
package/dist/index.js +2 -2
package/dist/input/history-manager.d.ts +78 -0
package/dist/input/history-manager.d.ts.map +1 -0
package/dist/input/history-manager.js +224 -0
package/dist/input/history-manager.js.map +1 -0
package/dist/input/index.d.ts +6 -0
package/dist/input/index.d.ts.map +1 -0
package/dist/input/index.js +5 -0
package/dist/input/index.js.map +1 -0
package/dist/prompts/index.js +3 -3
package/dist/prompts/index.js.map +1 -1
package/dist/providers/gemini.d.ts.map +1 -1
package/dist/providers/gemini.js +33 -2
package/dist/providers/gemini.js.map +1 -1
package/dist/providers/google.d.ts +22 -0
package/dist/providers/google.d.ts.map +1 -0
package/dist/providers/google.js +297 -0
package/dist/providers/google.js.map +1 -0
package/dist/providers/index.d.ts +4 -4
package/dist/providers/index.js +11 -11
package/dist/providers/index.js.map +1 -1
package/dist/providers/openai.d.ts.map +1 -1
package/dist/providers/openai.js +6 -0
package/dist/providers/openai.js.map +1 -1
package/dist/providers/registry.js +3 -3
package/dist/providers/registry.js.map +1 -1
package/dist/providers/types.d.ts +30 -4
package/dist/providers/types.d.ts.map +1 -1
package/dist/session/compression/engine.d.ts +109 -0
package/dist/session/compression/engine.d.ts.map +1 -0
package/dist/session/compression/engine.js +311 -0
package/dist/session/compression/engine.js.map +1 -0
package/dist/session/compression/index.d.ts +12 -0
package/dist/session/compression/index.d.ts.map +1 -0
package/dist/session/compression/index.js +11 -0
package/dist/session/compression/index.js.map +1 -0
package/dist/session/compression/types.d.ts +90 -0
package/dist/session/compression/types.d.ts.map +1 -0
package/dist/session/compression/types.js +17 -0
package/dist/session/compression/types.js.map +1 -0
package/dist/session/manager.d.ts +64 -3
package/dist/session/manager.d.ts.map +1 -1
package/dist/session/manager.js +254 -2
package/dist/session/manager.js.map +1 -1
package/dist/session/types.d.ts +16 -0
package/dist/session/types.d.ts.map +1 -1
package/dist/session/types.js.map +1 -1
package/docs/README.md +1 -0
package/docs/diagrams/compression-decision.mmd +30 -0
package/docs/diagrams/compression-workflow.mmd +54 -0
package/docs/diagrams/layer1-pruning.mmd +45 -0
package/docs/diagrams/layer2-compaction.mmd +42 -0
package/docs/proposals/0007-context-management.md +252 -2
package/docs/proposals/README.md +4 -3
package/docs/providers.md +3 -3
package/docs/session-compression.md +695 -0
package/examples/agent-demo.ts +23 -1
package/examples/basic.ts +3 -3
package/package.json +3 -4
package/src/agent/agent.ts +314 -52
package/src/agent/types.ts +19 -1
package/src/checkpointing/checkpoint-manager.ts +48 -0
package/src/cli/components/App.tsx +553 -34
package/src/cli/components/CommandSuggestions.tsx +2 -0
package/src/cli/components/Header.tsx +16 -1
package/src/cli/components/Messages.tsx +20 -14
package/src/cli/index.tsx +3 -2
package/src/config/types.ts +26 -1
package/src/index.ts +3 -3
package/src/input/history-manager.ts +289 -0
package/src/input/index.ts +6 -0
package/src/prompts/index.test.ts +2 -1
package/src/prompts/index.ts +3 -3
package/src/providers/{gemini.ts → google.ts} +69 -18
package/src/providers/index.ts +14 -14
package/src/providers/openai.ts +7 -0
package/src/providers/registry.ts +3 -3
package/src/providers/types.ts +33 -3
package/src/session/compression/engine.ts +406 -0
package/src/session/compression/index.ts +18 -0
package/src/session/compression/types.ts +102 -0
package/src/session/manager.ts +326 -3
package/src/session/types.ts +21 -0
package/tests/input-history-manager.test.ts +335 -0
package/tests/session-checkpoint-persistence.test.ts +198 -0

package/docs/diagrams/layer1-pruning.mmd ADDED Viewed

@@ -0,0 +1,45 @@
+%% Layer 1: Tool Output Pruning
+%% Detailed pruneToolOutputs flow
+flowchart TD
+    Start([pruneToolOutputs]) --> CheckMin{total tokens > 20k?}
+    CheckMin -->|No| Return0[Return: pruned=false<br/>count=0, saved=0]
+    CheckMin -->|Yes| InitVars[Initialize:<br/>protectedTokens = 0<br/>protectedIndices = Set]
+    InitVars --> LoopBackward[Iterate backward<br/>i = length-1 to 0]
+    LoopBackward --> CheckMsg{Message contains<br/>tool_result?}
+    CheckMsg -->|No| NextMsg1[Continue to next]
+    CheckMsg -->|Yes| CalcMsgTokens[Calculate message tokens]
+    CalcMsgTokens --> CheckProtect{protectedTokens<br/>< 40k?}
+    CheckProtect -->|Yes| AddProtected[protectedTokens += msgTokens<br/>protectedIndices.add i]
+    CheckProtect -->|No| StopLoop[Stop loop]
+    AddProtected --> NextMsg1
+    NextMsg1 --> MoreMsg1{More messages?}
+    MoreMsg1 -->|Yes| LoopBackward
+    MoreMsg1 -->|No| LoopForward
+    StopLoop --> LoopForward[Iterate forward<br/>i = 0 to length-1]
+    LoopForward --> CheckProtected{i in<br/>protectedIndices?}
+    CheckProtected -->|Yes| NextMsg2[Continue to next]
+    CheckProtected -->|No| HasToolRes{Contains<br/>tool_result?}
+    HasToolRes -->|No| NextMsg2
+    HasToolRes -->|Yes| CalcBefore[Record tokens before clear]
+    CalcBefore --> ClearContent[Clear tool result:<br/>content = Old tool result cleared<br/>pruned = true<br/>prunedAt = ISO timestamp]
+    ClearContent --> CalcAfter[Record tokens after clear]
+    CalcAfter --> UpdateStats[savedTokens += before - after<br/>prunedCount++]
+    UpdateStats --> NextMsg2
+    NextMsg2 --> MoreMsg2{More messages?}
+    MoreMsg2 -->|Yes| LoopForward
+    MoreMsg2 -->|No| ReturnStats[Return: pruned, count, saved]
+    style CheckMin fill:#fff3e0
+    style CheckProtect fill:#fff3e0
+    style ClearContent fill:#ffd93d,stroke:#333,stroke-width:2px
+    style ReturnStats fill:#51cf66

package/docs/diagrams/layer2-compaction.mmd ADDED Viewed

@@ -0,0 +1,42 @@
+%% Layer 2: Compaction (Summarization)
+%% Detailed compact flow
+flowchart TD
+    Start([compact messages, range]) --> Slice[Extract messages in range<br/>messages.slice start, end+1]
+    Slice --> Par1[Extract info in parallel]
+    Par1 --> ExtFiles[extractFilesModified<br/>Iterate tool_use blocks<br/>Collect Write/Edit file_path]
+    Par1 --> ExtTools[extractToolUsage<br/>Count each tool usage<br/>Record top 3 notable uses]
+    Par1 --> ExtDecisions[extractKeyDecisions<br/>Find sentences with decision keywords<br/>decided/chose/will use/going with]
+    ExtFiles --> BuildPrompt[Build continuation prompt]
+    ExtTools --> BuildPrompt
+    ExtDecisions --> BuildPrompt
+    BuildPrompt --> FormatConv[Format conversation history:<br/>role idx: content 500 chars]
+    FormatConv --> CreatePrompt[Prompt template:<br/>Provide detailed prompt for continuing...<br/>Focus on:<br/>1. What we accomplished<br/>2. Current work<br/>3. Files modified + changes<br/>4. Next steps<br/>5. Important context/decisions]
+    CreatePrompt --> CallLLM[provider.complete<br/>model: config.model<br/>max_tokens: 1500]
+    CallLLM --> ExtractText[Extract text content<br/>from response.content]
+    ExtractText --> CreateSum[Create ConversationSummary]
+    CreateSum --> SetID[id: sum-timestamp-random]
+    CreateSum --> SetType[type: compaction]
+    CreateSum --> SetRange[coveringMessages: start, end]
+    CreateSum --> SetContent[content: continuation prompt]
+    CreateSum --> SetMeta[metadata: decisions, files, tools]
+    CreateSum --> EstTokens[estimatedTokens: content.length / 4]
+    CreateSum --> SetTime[generatedAt: ISO timestamp]
+    SetID --> Return[Return ConversationSummary]
+    SetType --> Return
+    SetRange --> Return
+    SetContent --> Return
+    SetMeta --> Return
+    EstTokens --> Return
+    SetTime --> Return
+    style CallLLM fill:#ff6b6b,stroke:#333,stroke-width:2px
+    style CreatePrompt fill:#74c0fc
+    style Return fill:#51cf66

package/docs/proposals/0007-context-management.md CHANGED Viewed

@@ -2,9 +2,9 @@
 - **Proposal ID**: 0007
 - **Author**: mycode team
-- **Status**: Draft
+- **Status**: Implemented - Pending Verification
 - **Created**: 2025-01-15
-- **Updated**: 2025-01-15
+- **Updated**: 2026-01-18
 ## Summary
@@ -427,3 +427,253 @@ Existing sessions will work without context stats; stats begin tracking on first
 - [Claude Code Context Management](https://code.claude.com/docs/en/context)
 - [OpenAI Tokenizer (tiktoken)](https://github.com/openai/tiktoken)
 - [Anthropic Token Counting](https://docs.anthropic.com/en/docs/tokens)
+## Implementation Status
+### ✅ Implemented (Phase 1-3)
+**Session Compression System**:
+- ✅ `CompressionEngine` class with Layer 1 (Pruning) and Layer 2 (Compaction)
+  - Message deduplication and quality scoring
+  - Context-aware summarization
+  - Intelligent message selection (recent, high-value, tool results)
+  - Configurable thresholds and parameters
+- ✅ Integration with `SessionManager`
+  - Automatic compression when approaching context limits
+  - Compression statistics tracking
+  - Persistent compression metadata in session files
+**CLI Commands**:
+- ✅ `/compact` - Manual conversation compaction
+  - Triggers compression immediately
+  - Shows statistics (active/total messages, summaries, saved %)
+  - Visual ASCII box display with progress bars
+- ✅ `/context` - Context usage statistics
+  - Shows active vs total message counts
+  - Displays compression status (Compressed/Uncompressed)
+  - Progress bar visualization
+  - ASCII box display with colored status
+**UI Rendering Fixes** (2026-01-18):
+- ✅ Fixed info icon "ℹ" appearing on separate line before box output
+  - Added box content detection in `renderHistoryItem()` (App.tsx:1389-1396)
+  - Box content now renders directly without InfoMessage wrapper
+- ✅ Fixed right border alignment for `/context` and `/compact` commands
+  - Corrected padding calculation from `-2` to `-3` (App.tsx:904, 965)
+  - All border characters (`+`, `|`) now perfectly aligned
+  - Consistent 50-character width across all lines
+**Visual Output** (After Fixes):
+```
++------------------------------------------------+
+| Context Usage Statistics                       |
++------------------------------------------------+
+| Active Messages      12                        |
+| Total Messages       45                        |
+| Summaries             2                        |
+|                                                |
+| Usage  [#####...............]  27%             |
+|                                                |
+| Status: Compressed                             |
++------------------------------------------------+
+```
+### ✅ Newly Implemented (2026-01-18 - Pending Verification)
+**Token Counting & Tracking**:
+- ✅ **Cumulative token tracking** from API responses
+  - `SessionManager.cumulativeTokens` tracks input/output/total
+  - `calculateCumulativeTokens()` sums from session metadata
+  - `updateTokenUsageFromLatestCompletion()` for incremental updates
+  - `getTokenUsage()` public getter for current usage
+  - Persisted to `session.metadata.tokenUsage` on save
+- ✅ **Actual API token usage** instead of 4:1 estimates
+  - Token usage passed to `CompressionEngine.needsCompression()`
+  - Uses provider-returned `inputTokens` and `outputTokens`
+  - Falls back to 4:1 estimate if API doesn't return usage
+**Auto-Compaction with Thresholds**:
+- ✅ **Threshold-based compression triggering**
+  - 80% warning threshold: emits `context-warning` event
+  - 90% auto-compact threshold: triggers compression automatically
+  - Returns `usagePercent` and `shouldWarn` flags from `needsCompression()`
+- ✅ **Event-driven architecture** with EventEmitter
+  - `context-warning` - Emitted at 80% usage
+  - `auto-compacting` - Emitted before compression at 90%
+  - `compaction-complete` - Emitted after compression finishes
+- ✅ **User feedback in UI**
+  - ⚠️ "Context usage at 82% - Consider using /compact"
+  - 📦 "Auto-compacting (91% usage, strategy: prune)..."
+  - ✓ "Compaction complete (prune)"
+  - Smart warning deduplication (shows once per session)
+**Status Display**:
+- ✅ **Context usage in header**
+  - Format: `Context: 45/120 msgs (37%)`
+  - Real-time updates after each completion
+  - Only shown when activeMessages > 0
+  - Calculates percentage from actual token usage vs context window
+- ✅ **Real-time token tracking**
+  - Header refreshes on every render
+  - Pulls from `SessionManager.getTokenUsage()` and `getCompressionStats()`
+**Implementation Details**:
+- ✅ **Files Modified**:
+  - `src/session/manager.ts` - Token tracking, event emission
+  - `src/session/compression/engine.ts` - Threshold logic
+  - `src/cli/components/Header.tsx` - Context stats display
+  - `src/cli/components/App.tsx` - Event listeners, header updates
+- ✅ **Backward Compatible**: Works with existing sessions
+- ✅ **Build Status**: TypeScript compilation successful
+### ❌ Not Implemented (Deferred - Low Priority)
+**Provider-Specific Tokenizers**:
+- ❌ Client-side tokenizer implementations
+  - No OpenAITokenizer, AnthropicTokenizer, or GeminiTokenizer classes
+  - Not needed: Using actual API token counts instead
+  - Could be added later for pre-submission estimates
+  - **Decision**: Deferred - API usage is more accurate
+**Memory Tool**:
+- ❌ Claude Code-style Memory Tool implementation
+  - No persistent storage across context resets
+  - Memory system exists but uses different approach (GEN.md files)
+  - **Decision**: Out of scope for this proposal
+### 📋 Verification & Testing Required
+**Core functionality implemented - needs real-world testing:**
+1. **Verification Tasks** (High Priority):
+   - ✅ Build successful - TypeScript compilation passed
+   - ⏳ **Test 80% warning trigger** - Start long conversation and verify warning appears
+   - ⏳ **Test 90% auto-compact** - Continue until auto-compaction triggers
+   - ⏳ **Verify token accuracy** - Compare displayed tokens vs API actual usage
+   - ⏳ **Test header display** - Confirm context stats update in real-time
+   - ⏳ **Test session persistence** - Reload session and verify token counts preserved
+   - ⏳ **Test event deduplication** - Verify warning only shows once per session
+2. **Edge Cases to Test**:
+   - Session load with no token usage data (backward compatibility)
+   - Session fork inherits correct token counts
+   - Compression resets warning flag after compaction
+   - Multiple rapid completions don't spam warnings
+   - Very short sessions (< 10 messages) display correctly
+3. **Future Optimizations** (Low Priority - Post-Verification):
+   - Advanced compaction strategies
+   - Better summarization quality
+   - Provider-specific tokenizers for pre-submission estimates
+   - Memory tool integration
+### 📁 Implementation Files
+| File | Status | Notes |
+|------|--------|-------|
+| `src/session/compression/engine.ts` | ✅ Complete | Layer 1 & 2 compression + threshold logic |
+| `src/session/compression/types.ts` | ✅ Complete | All compression types |
+| `src/session/compression/index.ts` | ✅ Complete | Module exports |
+| `src/session/manager.ts` | ✅ Modified | Token tracking + EventEmitter + compression |
+| `src/session/types.ts` | ✅ Modified | Token usage in metadata |
+| `src/cli/components/App.tsx` | ✅ Modified | Event listeners + header stats |
+| `src/cli/components/Header.tsx` | ✅ Modified | Context stats display |
+| `src/context/tokenizer.ts` | ⏸️ Deferred | Using API token counts instead |
+| `src/context/context-manager.ts` | ⏸️ Deferred | Context tracking in SessionManager |
+### 🐛 Bug Fixes
+**UI Rendering Issues** (Fixed 2026-01-18):
+**Problem 1**: Info icon "ℹ" appearing on separate line before box output
+- **Root Cause**: `InfoMessage` component always prepended icon, causing it to appear on separate line
+- **Solution**: Added box content detection (`content.trim().startsWith('+---')`) in `renderHistoryItem()`
+- **Files Changed**: `src/cli/components/App.tsx` (lines 1389-1396)
+**Problem 2**: Right border `|` not aligned properly
+- **Root Cause**: Padding calculation was off by 1 character
+- **Before**: `w - text.length - 2` and `w - visible - 2`
+- **After**: `w - text.length - 3` and `w - visible - 3`
+- **Explanation**: Border line `'| ' + pad(text) + '|'` = 2 + pad + 1 = w, so pad = w - 3
+- **Files Changed**: `src/cli/components/App.tsx` (lines 904, 965)
+**Test Results**:
+```
+✅ All lines same length: true
+✅ Expected: 50, Actual: 50
+✅ /compact box: Passed
+✅ /context box: Passed
+✅ No info icon in output
+✅ Perfect border alignment
+```
+---
+## 📦 Latest Implementation (2026-01-18)
+### Summary
+Completed all high-priority features from the "Remaining Work" section:
+- ✅ Accurate token tracking from API responses
+- ✅ 80% warning threshold + 90% auto-compaction
+- ✅ Real-time context display in header
+- ✅ Event-driven architecture for extensibility
+### Implementation Phases
+**Phase 1: Token Usage Tracking** (~30 min)
+- Added cumulative token tracking to SessionManager
+- Implemented token calculation from session metadata
+- Updated compression to use actual API token counts
+- Added public `getTokenUsage()` getter
+**Phase 2: Threshold Warnings** (~45 min)
+- Modified `needsCompression()` to return usage % and warning flags
+- Extended SessionManager with EventEmitter
+- Implemented 3 events: `context-warning`, `auto-compacting`, `compaction-complete`
+- Added UI event listeners with smart deduplication
+**Phase 3: Context Display** (~30 min)
+- Updated Header component with optional context stats
+- Real-time header updates showing "Context: X/Y msgs (Z%)"
+- Only displays when activeMessages > 0
+### Code Changes
+**Total**: ~105 lines across 4 files
+| File | Changes | Lines |
+|------|---------|-------|
+| `src/session/manager.ts` | Token tracking, events, getters | +65 |
+| `src/session/compression/engine.ts` | Threshold logic | +15 |
+| `src/cli/components/Header.tsx` | Context stats display | +15 |
+| `src/cli/components/App.tsx` | Event listeners, header stats | +40 |
+### Key Design Decisions
+1. **API Token Counts over Tokenizers**
+   - Using actual usage from API responses instead of client-side estimation
+   - More accurate, no external dependencies (tiktoken, etc.)
+   - Falls back to 4:1 estimate if API doesn't provide usage
+2. **Event-Driven Architecture**
+   - SessionManager extends EventEmitter
+   - Loosely coupled: compression engine doesn't need UI knowledge
+   - Easy to add more listeners (logging, analytics, etc.)
+3. **Smart Warning Deduplication**
+   - Warning only shown once per session using `contextWarningShownRef`
+   - Resets after compaction completes
+   - Prevents spam during long conversations
+### Testing Required
+See "📋 Verification & Testing Required" section above for:
+- Functional tests (80% warning, 90% auto-compact)
+- Edge cases (session load, fork, persistence)
+- Real-world usage validation
+### References
+Implementation plan: `STREAMING_IMPLEMENTATION_SUMMARY.md` (Phase 4 context management)
+Related proposal: `0007-context-management.md` (this document)

package/docs/proposals/README.md CHANGED Viewed

@@ -8,7 +8,8 @@ This directory contains enhancement proposals for the gencode project. Each prop
 2. **Under Review**: Community and maintainer review
 3. **Accepted**: Approved for implementation
 4. **Implemented**: Feature has been implemented
-5. **Rejected**: Proposal was rejected with explanation
+5. **Implemented ⚠️**: Implemented but not fully verified/tested
+6. **Rejected**: Proposal was rejected with explanation
 ## Proposal Index
@@ -22,8 +23,8 @@ This directory contains enhancement proposals for the gencode project. Each prop
 | [0004](./0004-plan-mode.md) | Plan Mode | Implemented |
 | [0005](./0005-todo-system.md) | Todo System | Implemented |
 | [0006](./0006-memory-system.md) | Memory System (MYCODE.md) | Implemented |
-| [0007](./0007-context-management.md) | Context Management | Draft |
-| [0008](./0008-checkpointing.md) | Checkpointing | Partially Implemented |
+| [0007](./0007-context-management.md) | Context Management | Implemented ⚠️ |
+| [0008](./0008-checkpointing.md) | Checkpointing | Implemented ⚠️ |
 | [0009](./0009-hooks-system.md) | Hooks System | Draft |
 | [0010](./0010-mcp-integration.md) | MCP Integration | Draft |
 | [0021](./0021-skills-system.md) | Skills System | Draft |

package/docs/providers.md CHANGED Viewed

@@ -22,9 +22,9 @@ GPT models from OpenAI:
 |-------------------|----------------------|-------------|
 | API Key | `OPENAI_API_KEY` | Direct API access |
-### Google Gemini
+### Google
-Gemini models from Google:
+Google Generative AI (Gemini models):
 | Connection Method | Environment Variables | Description |
 |-------------------|----------------------|-------------|
@@ -182,7 +182,7 @@ gcloud services enable aiplatform.googleapis.com
 GenCode uses a two-layer provider architecture:
-- **Layer 1: Provider** (Semantic layer) - `anthropic` | `openai` | `gemini`
+- **Layer 1: Provider** (Semantic layer) - `anthropic` | `openai` | `google`
 - **Layer 2: AuthMethod** (Implementation layer) - `api_key` | `vertex` | `bedrock` | `azure`
 Each provider can support multiple authentication methods. For example, Anthropic supports: