RubyGems - htm - Versions diffs - 0.0.1 - Mend

htm 0.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (155) hide show

checksums.yaml +7 -0
data/.architecture/decisions/adrs/001-use-postgresql-timescaledb-storage.md +227 -0
data/.architecture/decisions/adrs/002-two-tier-memory-architecture.md +322 -0
data/.architecture/decisions/adrs/003-ollama-default-embedding-provider.md +339 -0
data/.architecture/decisions/adrs/004-multi-robot-shared-memory-hive-mind.md +374 -0
data/.architecture/decisions/adrs/005-rag-based-retrieval-with-hybrid-search.md +443 -0
data/.architecture/decisions/adrs/006-context-assembly-strategies.md +444 -0
data/.architecture/decisions/adrs/007-working-memory-eviction-strategy.md +461 -0
data/.architecture/decisions/adrs/008-robot-identification-system.md +550 -0
data/.architecture/decisions/adrs/009-never-forget-explicit-deletion-only.md +570 -0
data/.architecture/decisions/adrs/010-redis-working-memory-rejected.md +323 -0
data/.architecture/decisions/adrs/011-database-side-embedding-generation-with-pgai.md +585 -0
data/.architecture/decisions/adrs/012-llm-driven-ontology-topic-extraction.md +583 -0
data/.architecture/decisions/adrs/013-activerecord-orm-and-many-to-many-tagging.md +299 -0
data/.architecture/decisions/adrs/014-client-side-embedding-generation-workflow.md +569 -0
data/.architecture/decisions/adrs/015-hierarchical-tag-ontology-and-llm-extraction.md +701 -0
data/.architecture/decisions/adrs/016-async-embedding-and-tag-generation.md +694 -0
data/.architecture/members.yml +144 -0
data/.architecture/reviews/2025-10-29-llm-configuration-and-async-processing-review.md +1137 -0
data/.architecture/reviews/initial-system-analysis.md +330 -0
data/.envrc +32 -0
data/.irbrc +145 -0
data/CHANGELOG.md +150 -0
data/COMMITS.md +196 -0
data/LICENSE +21 -0
data/README.md +1347 -0
data/Rakefile +51 -0
data/SETUP.md +268 -0
data/config/database.yml +67 -0
data/db/migrate/20250101000001_enable_extensions.rb +14 -0
data/db/migrate/20250101000002_create_robots.rb +14 -0
data/db/migrate/20250101000003_create_nodes.rb +42 -0
data/db/migrate/20250101000005_create_tags.rb +38 -0
data/db/migrate/20250101000007_add_node_vector_indexes.rb +30 -0
data/db/schema.sql +473 -0
data/db/seed_data/README.md +100 -0
data/db/seed_data/presidents.md +136 -0
data/db/seed_data/states.md +151 -0
data/db/seeds.rb +208 -0
data/dbdoc/README.md +173 -0
data/dbdoc/public.node_stats.md +48 -0
data/dbdoc/public.node_stats.svg +41 -0
data/dbdoc/public.node_tags.md +40 -0
data/dbdoc/public.node_tags.svg +112 -0
data/dbdoc/public.nodes.md +54 -0
data/dbdoc/public.nodes.svg +118 -0
data/dbdoc/public.nodes_tags.md +39 -0
data/dbdoc/public.nodes_tags.svg +112 -0
data/dbdoc/public.ontology_structure.md +48 -0
data/dbdoc/public.ontology_structure.svg +38 -0
data/dbdoc/public.operations_log.md +42 -0
data/dbdoc/public.operations_log.svg +130 -0
data/dbdoc/public.relationships.md +39 -0
data/dbdoc/public.relationships.svg +41 -0
data/dbdoc/public.robot_activity.md +46 -0
data/dbdoc/public.robot_activity.svg +35 -0
data/dbdoc/public.robots.md +35 -0
data/dbdoc/public.robots.svg +90 -0
data/dbdoc/public.schema_migrations.md +29 -0
data/dbdoc/public.schema_migrations.svg +26 -0
data/dbdoc/public.tags.md +35 -0
data/dbdoc/public.tags.svg +60 -0
data/dbdoc/public.topic_relationships.md +45 -0
data/dbdoc/public.topic_relationships.svg +32 -0
data/dbdoc/schema.json +1437 -0
data/dbdoc/schema.svg +154 -0
data/docs/api/database.md +806 -0
data/docs/api/embedding-service.md +532 -0
data/docs/api/htm.md +797 -0
data/docs/api/index.md +259 -0
data/docs/api/long-term-memory.md +1096 -0
data/docs/api/working-memory.md +665 -0
data/docs/architecture/adrs/001-postgresql-timescaledb.md +314 -0
data/docs/architecture/adrs/002-two-tier-memory.md +411 -0
data/docs/architecture/adrs/003-ollama-embeddings.md +421 -0
data/docs/architecture/adrs/004-hive-mind.md +437 -0
data/docs/architecture/adrs/005-rag-retrieval.md +531 -0
data/docs/architecture/adrs/006-context-assembly.md +496 -0
data/docs/architecture/adrs/007-eviction-strategy.md +645 -0
data/docs/architecture/adrs/008-robot-identification.md +625 -0
data/docs/architecture/adrs/009-never-forget.md +648 -0
data/docs/architecture/adrs/010-redis-working-memory-rejected.md +323 -0
data/docs/architecture/adrs/011-pgai-integration.md +494 -0
data/docs/architecture/adrs/index.md +215 -0
data/docs/architecture/hive-mind.md +736 -0
data/docs/architecture/index.md +351 -0
data/docs/architecture/overview.md +538 -0
data/docs/architecture/two-tier-memory.md +873 -0
data/docs/assets/css/custom.css +83 -0
data/docs/assets/images/htm-core-components.svg +63 -0
data/docs/assets/images/htm-database-schema.svg +93 -0
data/docs/assets/images/htm-hive-mind-architecture.svg +125 -0
data/docs/assets/images/htm-importance-scoring-framework.svg +83 -0
data/docs/assets/images/htm-layered-architecture.svg +71 -0
data/docs/assets/images/htm-long-term-memory-architecture.svg +115 -0
data/docs/assets/images/htm-working-memory-architecture.svg +120 -0
data/docs/assets/images/htm.jpg +0 -0
data/docs/assets/images/htm_demo.gif +0 -0
data/docs/assets/js/mathjax.js +18 -0
data/docs/assets/videos/htm_video.mp4 +0 -0
data/docs/database_rake_tasks.md +322 -0
data/docs/development/contributing.md +787 -0
data/docs/development/index.md +336 -0
data/docs/development/schema.md +596 -0
data/docs/development/setup.md +719 -0
data/docs/development/testing.md +819 -0
data/docs/guides/adding-memories.md +824 -0
data/docs/guides/context-assembly.md +1009 -0
data/docs/guides/getting-started.md +577 -0
data/docs/guides/index.md +118 -0
data/docs/guides/long-term-memory.md +941 -0
data/docs/guides/multi-robot.md +866 -0
data/docs/guides/recalling-memories.md +927 -0
data/docs/guides/search-strategies.md +953 -0
data/docs/guides/working-memory.md +717 -0
data/docs/index.md +214 -0
data/docs/installation.md +477 -0
data/docs/multi_framework_support.md +519 -0
data/docs/quick-start.md +655 -0
data/docs/setup_local_database.md +302 -0
data/docs/using_rake_tasks_in_your_app.md +383 -0
data/examples/basic_usage.rb +93 -0
data/examples/cli_app/README.md +317 -0
data/examples/cli_app/htm_cli.rb +270 -0
data/examples/custom_llm_configuration.rb +183 -0
data/examples/example_app/Rakefile +71 -0
data/examples/example_app/app.rb +206 -0
data/examples/sinatra_app/Gemfile +21 -0
data/examples/sinatra_app/app.rb +335 -0
data/lib/htm/active_record_config.rb +113 -0
data/lib/htm/configuration.rb +342 -0
data/lib/htm/database.rb +594 -0
data/lib/htm/embedding_service.rb +115 -0
data/lib/htm/errors.rb +34 -0
data/lib/htm/job_adapter.rb +154 -0
data/lib/htm/jobs/generate_embedding_job.rb +65 -0
data/lib/htm/jobs/generate_tags_job.rb +82 -0
data/lib/htm/long_term_memory.rb +965 -0
data/lib/htm/models/node.rb +109 -0
data/lib/htm/models/node_tag.rb +33 -0
data/lib/htm/models/robot.rb +52 -0
data/lib/htm/models/tag.rb +76 -0
data/lib/htm/railtie.rb +76 -0
data/lib/htm/sinatra.rb +157 -0
data/lib/htm/tag_service.rb +135 -0
data/lib/htm/tasks.rb +38 -0
data/lib/htm/version.rb +5 -0
data/lib/htm/working_memory.rb +182 -0
data/lib/htm.rb +400 -0
data/lib/tasks/db.rake +19 -0
data/lib/tasks/htm.rake +147 -0
data/lib/tasks/jobs.rake +312 -0
data/mkdocs.yml +190 -0
data/scripts/install_local_database.sh +309 -0
metadata +341 -0

data/docs/architecture/adrs/010-redis-working-memory-rejected.md ADDED Viewed

@@ -0,0 +1,323 @@
+# ADR-010: Redis-Based Working Memory (Rejected)
+**Status**: Rejected
+**Date**: 2025-10-25
+**Decision Makers**: Dewayne VanHoozer, Claude (Anthropic)
+---
+## Quick Summary
+**Proposal**: Add Redis as a persistent storage layer for working memory, creating a three-tier architecture (Working Memory in Redis, Long-term Memory in PostgreSQL, with in-process caching).
+**Decision**: **REJECTED** - Keep the current two-tier architecture with in-memory working memory.
+**Why Rejected**: Redis adds complexity, cost, and failure modes without solving a proven problem. PostgreSQL already provides durability, and working memory's ephemeral nature is a feature, not a bug.
+**Impact**: Avoiding unnecessary complexity while maintaining simplicity, performance, and reliability.
+---
+## Context
+### Motivation for Consideration
+During architectural review, we identified that working memory is currently volatile (in-process Ruby hash) and loses state on process restart. This raised the question:
+> "Should working memory persist across restarts using Redis?"
+### Current Architecture (Two-Tier)
+```
+┌─────────────────┐
+│   HTM Instance  │
+│                 │
+│  ┌───────────┐  │     ┌──────────────┐
+│  │ Working   │  │────>│  PostgreSQL  │
+│  │ Memory    │  │     │ (Long-Term)  │
+│  │ (Hash)    │  │     │              │
+│  └───────────┘  │     └──────────────┘
+│   volatile      │       persistent
+└─────────────────┘
+```
+**How it works**:
+1. `add_node()` saves **immediately** to PostgreSQL
+2. Node is **also** added to working memory (cache)
+3. Working memory evicts old nodes when full
+4. Eviction **only removes from cache** - data remains in PostgreSQL
+**Key insight**: Working memory is a **write-through cache**, not the source of truth.
+### Proposed Architecture (Three-Tier)
+```
+┌─────────────────┐
+│   HTM Instance  │
+│                 │
+│      ││          │
+│      ││          │
+│      ▼▼          │
+│  ┌───────────┐  │     ┌──────────────┐
+│  │   Redis   │  │────>│  PostgreSQL  │
+│  │ (Working) │  │     │ (Long-Term)  │
+│  │           │  │     │              │
+│  └───────────┘  │     └──────────────┘
+│   persistent    │       persistent
+└─────────────────┘
+```
+**Proposed changes**:
+- Store working memory in Redis (shared across processes)
+- Persist working memory state across restarts
+- Allow multi-process working memory sharing
+- Optional flush strategies (on-demand, auto-exit, periodic)
+---
+## Analysis
+### Perceived Benefits (Why We Considered It)
+1. **Persistence Across Restarts**
+   - Working memory survives process crashes
+   - Can resume conversations exactly where left off
+2. **Multi-Process Sharing**
+   - Multiple HTM instances can share hot context
+   - "Hive mind" working memory across robots
+3. **Larger Capacity**
+   - Not limited by process memory (~2GB)
+   - Could scale to 10s-100s of GB in Redis
+4. **External Observability**
+   - Inspect working memory via `redis-cli`
+   - Monitor access patterns externally
+### Actual Drawbacks (Why We Rejected It)
+#### 1. **Adds Complexity Without Clear Benefit**
+| Aspect | Current | With Redis |
+|--------|---------|------------|
+| Dependencies | PostgreSQL only | PostgreSQL + Redis |
+| Failure Modes | 1 database | 2 databases |
+| Deployment | Single service | Multiple services |
+| Configuration | Simple | Complex (URLs, pools, namespaces) |
+| Debugging | Straightforward | More moving parts |
+#### 2. **PostgreSQL Already Solves the Problem**
+**Restart recovery is trivial**:
+```ruby
+# On restart, rebuild working memory from PostgreSQL
+htm = HTM.new(robot_name: "Assistant")
+recent_memories = htm.recall(
+  timeframe: "last 10 minutes",
+  topic: "",
+  limit: 50
+)
+# ↑ Automatically added to working memory
+```
+**Multi-process sharing already works**:
+```ruby
+# Process A
+htm_a.add_node("decision", "Use PostgreSQL")
+# → Saved to PostgreSQL
+# Process B (different process)
+memories = htm_b.recall(timeframe: "last minute", topic: "PostgreSQL")
+# → Retrieved from PostgreSQL, added to Process B's working memory
+```
+The "hive mind" already exists via shared PostgreSQL!
+#### 3. **Performance Penalty**
+| Operation | In-Memory | Redis (Local) | Redis (Network) |
+|-----------|-----------|---------------|-----------------|
+| `add()` | ~0.001ms | ~0.5ms | ~5ms |
+| `get()` | ~0.001ms | ~0.5ms | ~5ms |
+| Network overhead | None | TCP localhost | TCP network |
+**100-500x slower** for working memory operations, even locally.
+#### 4. **Working Memory is *Supposed* to be Ephemeral**
+The whole design philosophy:
+- **Token-limited** (128k) for LLM context windows
+- **Fast access** for immediate context
+- **Disposable** - it's a performance optimization
+Making it persistent contradicts its purpose!
+#### 5. **Operational Burden**
+**Additional costs**:
+- Redis server hosting/management
+- Memory allocation for Redis
+- Monitoring Redis health
+- Backup/recovery for Redis
+- Network configuration
+- Connection pool tuning
+**Additional failure scenarios**:
+- Redis connection failures
+- Redis out of memory
+- Redis network partitions
+- Redis data corruption
+- Synchronization issues between Redis and PostgreSQL
+#### 6. **YAGNI (You Aren't Gonna Need It)**
+No proven requirement for:
+- Sub-millisecond working memory access across processes
+- Exact working memory state preservation across crashes
+- Real-time synchronization of working memory between instances
+This is **premature optimization** solving a hypothetical problem.
+---
+## Decision
+**We will NOT implement Redis-based working memory.**
+We will **maintain the current two-tier architecture**:
+- **Working Memory**: In-memory Ruby hash (volatile)
+- **Long-term Memory**: PostgreSQL (durable)
+---
+## Rationale
+### Why the Current Design is Sufficient
+1. **Data is Already Safe**
+   - All nodes are immediately persisted to PostgreSQL
+   - Working memory is just a cache
+   - Nothing is lost on restart except cache state
+2. **Restart Recovery is Fast**
+   - Rebuild working memory via `recall()`
+   - Takes milliseconds to query recent context
+   - No need for persistent cache state
+3. **Multi-Process Works Today**
+   - Processes share via PostgreSQL
+   - No real-time synchronization needed
+   - Each process maintains its own hot cache
+4. **Simplicity Wins**
+   - One database (PostgreSQL)
+   - One failure mode
+   - Easy to understand and debug
+   - Lower operational cost
+5. **Performance is Excellent**
+   - In-memory hash: <1ms operations
+   - PostgreSQL: 10-50ms queries (acceptable)
+   - No need for Redis middle layer
+### When Redis *Might* Make Sense (Future)
+We'll reconsider if we encounter:
+- **Proven requirement** for cross-process hot memory sharing
+- **Measured performance problem** with PostgreSQL recall
+- **Specific use case** needing persistent working memory state
+- **User demand** for this feature
+Until then: **YAGNI**.
+---
+## Consequences
+### Positive
+✅ **Simplicity maintained**
+- Single database dependency
+- Straightforward architecture
+- Easy to understand and debug
+✅ **Lower operational cost**
+- No Redis hosting
+- No Redis management
+- Fewer failure modes
+✅ **Better performance**
+- In-memory working memory is fastest possible
+- No network overhead
+✅ **Sufficient for use cases**
+- All data persisted in PostgreSQL
+- Multi-process sharing via PostgreSQL
+- Fast restart recovery
+### Negative (Accepted Trade-offs)
+❌ **Working memory lost on crash**
+- **Mitigation**: Rebuild via `recall()` in <1 second
+- **Impact**: Minimal - data is safe in PostgreSQL
+❌ **No real-time cross-process working memory**
+- **Mitigation**: Processes share via PostgreSQL
+- **Impact**: Acceptable - no proven requirement
+❌ **Limited by process memory**
+- **Mitigation**: 128k token limit is sufficient for LLM context
+- **Impact**: None - this is by design
+---
+## Alternatives Considered
+### Alternative 1: Hybrid L1/L2 Caching
+- L1: In-memory (hot data)
+- L2: Redis (warm data)
+- **Rejected**: Even more complexity for minimal gain
+### Alternative 2: PostgreSQL UNLOGGED Tables
+- Use unlogged PostgreSQL tables for working memory
+- Faster writes, but not crash-safe
+- **Rejected**: Still slower than in-memory, adds DB complexity
+### Alternative 3: Shared Memory (IPC)
+- Use OS shared memory for cross-process working memory
+- **Rejected**: Platform-specific, complex, limited use case
+---
+## References
+- **Discussion**: `/tmp/redis_working_memory_architecture.md`
+- **Related ADR**: ADR-002 (Two-Tier Memory Architecture)
+- **Architecture Review**: `ARCHITECTURE_REVIEW.md`
+- **GitHub Issues**: #1-#10 (focus on proven improvements)
+---
+## Lessons Learned
+1. **Question assumptions**: "Working memory is volatile" seemed like a problem, but it's actually by design
+2. **PostgreSQL is powerful**: Already provides durability, querying, and sharing
+3. **Simplicity has value**: Adding Redis would double complexity for minimal real benefit
+4. **YAGNI applies**: Solve proven problems, not hypothetical ones
+5. **Architecture reviews are valuable**: Thoroughly analyzing alternatives leads to better decisions (even when the decision is "no")
+---
+## Future Review
+This decision should be revisited if:
+- User requests for persistent working memory
+- Measured performance problems with PostgreSQL recall
+- Multi-process real-time sharing becomes a requirement
+- Benchmarks show significant benefit to Redis caching
+Until then, this decision stands: **Keep it simple. Trust PostgreSQL.**