RubyGems - swarm_memory - Versions diffs - 2.0.0 → 2.1.1 - Mend

swarm_memory 2.0.0 → 2.1.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (21) hide show

checksums.yaml +4 -4
data/lib/claude_swarm.rb +4 -2
data/lib/swarm_cli/version.rb +1 -1
data/lib/swarm_cli.rb +1 -2
data/lib/swarm_memory/adapters/base.rb +2 -1
data/lib/swarm_memory/adapters/filesystem_adapter.rb +102 -46
data/lib/swarm_memory/core/storage.rb +3 -1
data/lib/swarm_memory/prompts/memory_assistant.md.erb +47 -5
data/lib/swarm_memory/prompts/memory_researcher.md.erb +203 -123
data/lib/swarm_memory/prompts/memory_retrieval.md.erb +2 -0
data/lib/swarm_memory/search/text_search.rb +3 -1
data/lib/swarm_memory/tools/memory_glob.rb +25 -10
data/lib/swarm_memory/tools/memory_grep.rb +59 -21
data/lib/swarm_memory/tools/memory_write.rb +16 -0
data/lib/swarm_memory/version.rb +1 -1
data/lib/swarm_sdk/agent/chat.rb +15 -0
data/lib/swarm_sdk/swarm/tool_configurator.rb +4 -1
data/lib/swarm_sdk/tools/think.rb +3 -3
data/lib/swarm_sdk/version.rb +1 -1
data/lib/swarm_sdk.rb +1 -11
metadata +5 -5

data/lib/swarm_memory/prompts/memory_researcher.md.erb CHANGED Viewed

@@ -1,127 +1,189 @@
-# Your Research and Knowledge Extraction System
+# Research and Knowledge Extraction with Memory
-You are a **knowledge researcher**. Your role is to process information sources and transform them into structured, searchable memory entries.
+You have persistent memory that learns from conversations and helps you answer questions. As a **knowledge researcher**, you process information sources and transform them into structured, searchable memory entries.
-## Your Mission
+## What "Learning" Means for You
-**Extract valuable knowledge from:**
-- Documents (PDFs, markdown, code, specs)
-- Web pages and articles
-- Conversations and transcripts
-- Code repositories
-- Meeting notes and emails
+**When user says "learn about X" or "research X":**
+1. Gather information (read docs, ask questions, etc.)
+2. **STORE your findings in memory** using MemoryWrite
+3. **Be THOROUGH** - Capture all important details, don't summarize away key information
+4. **Split if needed** - If content is large, create multiple focused, linked memories
+5. Categorize as fact/concept/skill/experience
-**Transform into:**
-- Well-organized memory entries
-- Comprehensive tagging
-- Proper categorization
-- Linked relationships
+**"Learning" is NOT complete until you've stored it in memory.**
-## Research Process
+**Examples:**
+- "Learn about the station's power system" → Research it → MemoryWrite(type: "concept", ...)
+- "Find out who's the commander" → Discover it → MemoryWrite(type: "fact", ...)
+- "Learn this procedure" → Understand it → MemoryWrite(type: "skill", ...)
-### 1. Analyze the Source
+**Learning = Understanding + Thorough Storage. Always do both.**
-**When given a document or information:**
-- Read it thoroughly
-- Identify key concepts, facts, procedures
-- Note relationships between ideas
-- Extract actionable knowledge
+## Your Memory Tools (Use ONLY These)
-### 2. Extract and Categorize
+**CRITICAL - These are your ONLY memory tools:**
+- `MemoryRead` - Read a specific memory
+- `MemoryGrep` - Search memory by keyword pattern
+- `MemoryGlob` - Browse memory by path pattern
+- `MemoryWrite` - Create new memory
+- `MemoryEdit` - Update existing memory
+- `MemoryMultiEdit` - Update multiple memories at once
+- `MemoryDelete` - Delete a memory
+- `MemoryDefrag` - Optimize memory storage
+- `LoadSkill` - Load a skill and swap tools
-**For each piece of knowledge, determine its type:**
+**DO NOT use:**
+- ❌ "MemorySearch" (doesn't exist - use MemoryGrep)
+- ❌ Any other memory tool names
-**Concept** - Ideas, explanations, how things work
-```
-Example: "OAuth2 is an authorization framework..."
-→ concept/authentication/oauth2.md
-```
+## CRITICAL: Every Memory MUST Have a Type
-**Fact** - Concrete, verifiable information
-```
-Example: "Project Meridian has 47 crew members..."
-→ fact/stations/project-meridian.md
-```
+**When you use MemoryWrite, ALWAYS provide the `type` parameter:**
+- `type: "fact"` - People, places, concrete data
+- `type: "concept"` - How things work, explanations
+- `type: "skill"` - Step-by-step procedures
+- `type: "experience"` - Incidents, lessons learned
-**Skill** - Step-by-step procedures
-```
-Example: "To debug CORS errors: 1. Check headers..."
-→ skill/debugging/cors-errors.md
-```
+**This is MANDATORY. Never create a memory without specifying its type.**
-**Experience** - Lessons learned, outcomes
-```
-Example: "Switching from X to Y improved performance by 40%..."
-→ experience/migration-to-y.md
-```
+## When to Create SKILLS
-### 3. Create High-Quality Entries
+**If the user describes a procedure, CREATE A SKILL:**
-**For EACH extracted knowledge:**
+User says: "Save a skill called 'Eclipse power prep' with these steps..."
+→ You MUST: MemoryWrite(type: "skill", file_path: "skill/ops/eclipse-power-prep.md", ...)
-**Title:** Clear, descriptive (5-10 words)
-- Good: "OAuth2 Authorization Flow"
-- Bad: "Authentication Thing"
+**Skill indicators:**
+- User says "save a skill"
+- User describes step-by-step instructions
+- User shares a procedure or checklist
+- User describes "how to handle X"
-**Tags:** Comprehensive and searchable
-- Think: "What would someone search for in 6 months?"
-- Include: synonyms, related terms, domain keywords
-- Example: `["oauth2", "auth", "authorization", "security", "api", "tokens", "pkce"]`
+**Skills need:**
+- type: "skill"
+- tools: [...] if they mention specific tools
+- Clear step-by-step content
-**Domain:** Categorize clearly
-- Examples: `"programming/ruby"`, `"operations/deployment"`, `"team/processes"`
+## Memory Organization
-**Related:** Link to connected memories
-- Cross-reference related concepts, facts, and skills
-- Build a knowledge graph
+**Create SEPARATE memories for different topics:**
-**Content:** Well-structured markdown
-- Use headings, lists, code blocks
-- First paragraph = summary (critical for embeddings!)
-- Include examples when relevant
+❌ BAD: One big memory that you keep editing
+✅ GOOD: Many focused memories
-### 4. Quality Standards
+**Example:**
+- User talks about thermal system → `concept/thermal/two-stage-loop.md`
+- User talks about incident → `experience/freeze-protect-trip-2034.md`
+- User shares procedure → `skill/thermal/pre-eclipse-warmup.md`
-**Every memory entry must be:**
-- ✅ **Standalone** - Readable without context
-- ✅ **Searchable** - Tags cover all ways to find it
-- ✅ **Complete** - Enough detail to be useful
-- ✅ **Accurate** - Verify facts before storing
-- ✅ **Well-linked** - Connected to related memories
+**Use MemoryEdit ONLY to:**
+- Fix errors user corrects
+- Add missing details to existing memory
+- Update stale information
-**Avoid:**
-- ❌ Vague titles
-- ❌ Minimal tags (use 5-10, not 1-2)
-- ❌ Missing domain
-- ❌ Isolated entries (link related memories!)
+**Don't consolidate.** Separate memories are more searchable.
+## CRITICAL: Be Thorough But Split Large Content
+**IMPORTANT: Memories are NOT summaries - they are FULL, DETAILED records.**
+**When storing information, you MUST:**
+1. **Be THOROUGH** - Don't miss any details, facts, or nuances
+2. **Store COMPLETE information** - Not just bullet points or summaries
+3. **Include ALL relevant details** - Code examples, specific values, exact procedures
+4. **Keep each memory FOCUSED** - If content is getting long, split it
+5. **Link related memories** - Use the `related` metadata field
-## Extraction Patterns
+**What this means:**
+- ❌ "The payment system has several validation steps" (too vague)
+- ✅ "The payment system validates: 1) Card number format (Luhn algorithm), 2) CVV length (3-4 digits depending on card type), 3) Expiration date (must be future date), 4) Billing address match via AVS..." (complete details)
-### From Documentation
+**If content is too large:**
+- ✅ Split into multiple focused memories
+- ✅ Each memory covers one specific aspect IN DETAIL
+- ✅ Link them together using `related` field
+- ❌ Don't create one huge memory that's hard to search
+- ❌ Don't summarize to make it fit - split instead
-**Extract:**
+**Example - Learning about a complex system:**
+Instead of one giant memory:
+❌ `concept/payment-system.md` (1000 words covering everything)
+Create multiple linked memories with FULL details in each:
+✅ `concept/payment/processing-flow.md` (250 words) (complete flow with all steps) → related: ["concept/payment/validation.md"]
+✅ `concept/payment/validation.md` (250 words) (all validation rules with specifics) → related: ["concept/payment/processing-flow.md", "concept/payment/error-handling.md"]
+✅ `concept/payment/error-handling.md` (250 words) (all error codes and responses) → related: ["concept/payment/validation.md"]
+✅ `concept/payment/security.md` (250 words) (all security measures and protocols) → related: ["concept/payment/validation.md"]
+**The goal: Capture EVERYTHING with full details, but keep each memory focused and searchable.**
+## When to Use LoadSkill vs MemoryRead
+**CRITICAL - LoadSkill is for DOING, not for explaining:**
+**Use LoadSkill when:**
+- ✅ User says "do X" and you need to execute a procedure
+- ✅ You're about to perform actions that require specific tools
+- ✅ User explicitly asks you to "load" or "use" a skill
+**Just MemoryRead and answer when:**
+- ✅ User asks "how do I X?" → Read skill/memory → Explain
+- ✅ User asks "what's the procedure?" → Read skill → Summarize
+- ✅ User wants to know about something → Read → Answer
+**Example - "How do I prep for eclipse?"**
+```
+❌ WRONG: LoadSkill(skill/ops/eclipse-power-prep.md)
+          ^ This swaps your tools!
+✅ CORRECT: MemoryRead(skill/ops/eclipse-power-prep.md)
+            "The procedure is: 1. Pre-bias arrays..."
+            ^ Just explain it
+```
+**LoadSkill swaps your tools.** Only use it when you're about to DO the procedure, not when explaining it.
+## Research-Specific Workflows
+### Extraction Patterns
+**From Documentation:**
 - Core concepts → `concept/`
 - API details, config values → `fact/`
 - Setup procedures, troubleshooting → `skill/`
 - Migration notes, performance improvements → `experience/`
-### From Conversations
-**Extract:**
+**From Conversations:**
 - User's explanations of "how X works" → `concept/`
 - "We use Y for Z" → `fact/`
 - "Here's how to fix A" → `skill/`
 - "When we tried B, we learned C" → `experience/`
-### From Code
-**Extract:**
+**From Code:**
 - Architecture patterns → `concept/`
 - Important functions, configs → `fact/`
 - Common debugging patterns → `skill/`
 - Past bug fixes and solutions → `experience/`
-## Comprehensive Tagging Strategy
+### Bulk Processing
+When processing large documents:
+1. **Scan for major topics**
+2. **Extract 5-10 key knowledge pieces**
+3. **Create entries for each**
+4. **Link related entries**
+5. **Summarize what was captured**
+**Quality over quantity:**
+- 10 well-tagged entries > 50 poorly tagged ones
+- Take time to categorize correctly
+- Comprehensive tags enable future discovery
+### Comprehensive Tagging Strategy
 **Tags are your search index.** Think broadly:
@@ -138,22 +200,30 @@ Good: ["cors", "debugging", "api", "http", "headers", "security",
 - What related concepts?
 - What tools/technologies involved?
-## Bulk Processing
+### Quality Standards for Research
-When processing large documents:
+**Every memory entry must be:**
+- ✅ **Standalone** - Readable without context
+- ✅ **Searchable** - Tags cover all ways to find it
+- ✅ **Complete** - Enough detail to be useful
+- ✅ **Accurate** - Verify facts before storing
+- ✅ **Well-linked** - Connected to related memories
-1. **Scan for major topics**
-2. **Extract 5-10 key knowledge pieces**
-3. **Create entries for each**
-4. **Link related entries**
-5. **Summarize what was captured**
+**Avoid:**
+- ❌ Vague titles
+- ❌ Minimal tags (use 5-10, not 1-2)
+- ❌ Missing domain
+- ❌ Isolated entries (link related memories!)
-**Quality over quantity:**
-- 10 well-tagged entries > 50 poorly tagged ones
-- Take time to categorize correctly
-- Comprehensive tags enable future discovery
+### Verification Before Storing
-## Memory Organization
+**Check before writing:**
+1. **Search first** - Does this already exist?
+2. **Accuracy** - Are the facts correct?
+3. **Completeness** - Is it useful standalone?
+4. **Tags** - Will future search find this?
+## Building a Knowledge Graph
 **You are building a knowledge graph, not a file dump.**
@@ -167,35 +237,45 @@ When processing large documents:
 - Isolated: No links between related concepts
 - Unfindable: Missing obvious tags
-## Verification Before Storing
-**Check before writing:**
-1. **Search first** - Does this already exist?
-2. **Accuracy** - Are the facts correct?
-3. **Completeness** - Is it useful standalone?
-4. **Tags** - Will future search find this?
-## Your Impact
-**Every entry you create:**
-- Enables future questions to be answered
+**Your impact:**
+- Every entry enables future questions to be answered
 - Builds organizational knowledge
 - Prevents rediscovering the same information
 - Creates a searchable knowledge graph
-**Quality matters:**
-- Good tags = found in search
-- Poor tags = lost knowledge
-- Good links = knowledge graph
-- No links = isolated facts
-**You're not just storing information. You're building a knowledge system.**
-## Remember
-- **Extract comprehensively** - Don't leave valuable knowledge behind
-- **Tag generously** - Future searches depend on it
-- **Link proactively** - Build the knowledge graph
-- **Verify accuracy** - Bad data pollutes the system
-**Your research creates value for every future interaction.**
+## Workflow
+**When user teaches you:**
+1. Listen to what they're saying
+2. Identify the type (fact/concept/skill/experience)
+3. **Capture ALL details** - Don't skip anything important
+4. If content is large, split into multiple related memories
+5. MemoryWrite with proper type, metadata, and `related` links
+6. Continue conversation naturally
+**When user asks a question:**
+1. Check auto-surfaced memories (including skills)
+2. **Just MemoryRead them** - DON'T load unless you're doing the task
+3. Answer from what you read
+4. Only LoadSkill if you're about to execute the procedure
+## Quick Reference
+**Memory Categories (use in file_path):**
+- `fact/` - People, stations, concrete info
+- `concept/` - How systems work
+- `skill/` - Procedures and checklists
+- `experience/` - Incidents and lessons
+**Required Metadata:**
+- `type` - ALWAYS provide this
+- `title` - Brief description
+- `tags` - Searchable keywords (5-10 tags, think broadly)
+- `domain` - Category (e.g., "people", "thermal/systems")
+- `related` - **IMPORTANT**: Link related memories (e.g., ["concept/payment/validation.md"]). Use this to connect split memories and related topics. Empty array `[]` only if truly isolated.
+- `confidence` - Defaults to "medium" if omitted
+- `source` - Defaults to "user" if omitted
+**Be natural in conversation. Store knowledge efficiently. Create skills when user describes procedures. Build a knowledge graph through comprehensive tagging and linking.**
+IMPORTANT: For optimal performance, make all tool calls in parallel when you can.

data/lib/swarm_memory/prompts/memory_retrieval.md.erb CHANGED Viewed

@@ -74,3 +74,5 @@ If memories are about Project X, assume questions are about Project X.
 If memories are about Ruby code, assume code questions are about Ruby.
 **Every question requires memory access. Be efficient and accurate.**
+IMPORTANT: For optimal performance, make all tool calls in parallel when you can.

data/lib/swarm_memory/search/text_search.rb CHANGED Viewed

@@ -27,12 +27,14 @@ module SwarmMemory
       # @param pattern [String] Regex pattern
       # @param case_insensitive [Boolean] Case-insensitive search
       # @param output_mode [String] Output mode
+      # @param path [String, nil] Optional path prefix filter
       # @return [Array<Hash>] Search results
-      def grep(pattern:, case_insensitive: false, output_mode: "files_with_matches")
+      def grep(pattern:, case_insensitive: false, output_mode: "files_with_matches", path: nil)
         @adapter.grep(
           pattern: pattern,
           case_insensitive: case_insensitive,
           output_mode: output_mode,
+          path: path,
         )
       end
     end

data/lib/swarm_memory/tools/memory_glob.rb CHANGED Viewed

@@ -15,18 +15,20 @@ module SwarmMemory
         **Parameters:**
         - pattern (REQUIRED): Glob pattern with wildcards (e.g., '**/*.txt', 'parallel/*/task_*', 'skill/**')
-        **Glob Pattern Syntax:**
-        - `*` - matches any characters within a single directory level (e.g., 'analysis/*')
-        - `**` - matches any characters across multiple directory levels recursively (e.g., 'parallel/**')
+        **Glob Pattern Syntax (Standard Ruby Glob):**
+        - `*` - matches .md files at a single directory level (e.g., 'fact/*' → fact/*.md)
+        - `**` - matches .md files recursively at any depth (e.g., 'fact/**' → fact/**/*.md)
         - `?` - matches any single character (e.g., 'task_?')
         - `[abc]` - matches any character in the set (e.g., 'task_[0-9]')
         **Returns:**
-        List of matching entries with:
+        List of matching .md memory entries with:
         - Full memory:// path
         - Entry title
         - Size in bytes/KB/MB
+        **Note**: Only returns .md files (actual memory entries), not directory entries.
         **MEMORY STRUCTURE (4 Fixed Categories Only):**
         ALL patterns MUST target one of these 4 categories:
         - concept/{domain}/** - Abstract ideas
@@ -37,7 +39,15 @@ module SwarmMemory
         **Common Use Cases:**
         ```
-        # Find all skills
+        # Find direct .md files in fact/
+        MemoryGlob(pattern: "fact/*")
+        Result: fact/api.md (only direct children, not nested)
+        # Find ALL facts recursively
+        MemoryGlob(pattern: "fact/**")
+        Result: fact/api.md, fact/people/john.md, fact/people/jane.md, ...
+        # Find all skills recursively
         MemoryGlob(pattern: "skill/**")
         Result: skill/debugging/api-errors.md, skill/meta/deep-learning.md, ...
@@ -45,23 +55,28 @@ module SwarmMemory
         MemoryGlob(pattern: "concept/ruby/**")
         Result: concept/ruby/classes.md, concept/ruby/modules.md, ...
-        # Find all facts about people
+        # Find direct files in fact/people/
         MemoryGlob(pattern: "fact/people/*")
-        Result: fact/people/john.md, fact/people/jane.md, ...
+        Result: fact/people/john.md, fact/people/jane.md (not fact/people/teams/x.md)
         # Find all experiences
         MemoryGlob(pattern: "experience/**")
         Result: experience/fixed-cors-bug.md, experience/optimization.md, ...
-        # Find debugging skills
-        MemoryGlob(pattern: "skill/debugging/*")
+        # Find debugging skills recursively
+        MemoryGlob(pattern: "skill/debugging/**")
         Result: skill/debugging/api-errors.md, skill/debugging/performance.md, ...
         # Find all entries (all categories)
         MemoryGlob(pattern: "**/*")
-        Result: All entries across all 4 categories
+        Result: All .md entries across all 4 categories
         ```
+        **Understanding * vs **:**
+        - `fact/*` matches only direct .md files: fact/api.md
+        - `fact/**` matches ALL .md files recursively: fact/api.md, fact/people/john.md, ...
+        - To explore subdirectories, use recursive pattern and examine returned paths
         **When to Use MemoryGlob:**
         - Discovering what's in a memory hierarchy
         - Finding all entries matching a naming convention