RubyGems - htm - Versions diffs - 0.0.1 → 0.0.10 - Mend

htm 0.0.1 → 0.0.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (184) hide show

checksums.yaml +4 -4
data/.aigcm_msg +1 -0
data/.architecture/reviews/comprehensive-codebase-review.md +577 -0
data/.claude/settings.local.json +92 -0
data/.envrc +1 -0
data/.irbrc +283 -80
data/.tbls.yml +31 -0
data/CHANGELOG.md +314 -16
data/CLAUDE.md +603 -0
data/README.md +76 -5
data/Rakefile +5 -0
data/SETUP.md +132 -101
data/db/migrate/{20250101000001_enable_extensions.rb → 00001_enable_extensions.rb} +0 -1
data/db/migrate/00002_create_robots.rb +11 -0
data/db/migrate/00003_create_file_sources.rb +20 -0
data/db/migrate/00004_create_nodes.rb +65 -0
data/db/migrate/00005_create_tags.rb +13 -0
data/db/migrate/00006_create_node_tags.rb +18 -0
data/db/migrate/00007_create_robot_nodes.rb +26 -0
data/db/migrate/00009_add_working_memory_to_robot_nodes.rb +12 -0
data/db/schema.sql +390 -36
data/docs/api/database.md +19 -232
data/docs/api/embedding-service.md +1 -7
data/docs/api/htm.md +305 -364
data/docs/api/index.md +1 -7
data/docs/api/long-term-memory.md +342 -590
data/docs/api/yard/HTM/ActiveRecordConfig.md +23 -0
data/docs/api/yard/HTM/AuthorizationError.md +11 -0
data/docs/api/yard/HTM/CircuitBreaker.md +92 -0
data/docs/api/yard/HTM/CircuitBreakerOpenError.md +34 -0
data/docs/api/yard/HTM/Configuration.md +175 -0
data/docs/api/yard/HTM/Database.md +99 -0
data/docs/api/yard/HTM/DatabaseError.md +14 -0
data/docs/api/yard/HTM/EmbeddingError.md +18 -0
data/docs/api/yard/HTM/EmbeddingService.md +58 -0
data/docs/api/yard/HTM/Error.md +11 -0
data/docs/api/yard/HTM/JobAdapter.md +39 -0
data/docs/api/yard/HTM/LongTermMemory.md +342 -0
data/docs/api/yard/HTM/NotFoundError.md +17 -0
data/docs/api/yard/HTM/Observability.md +107 -0
data/docs/api/yard/HTM/QueryTimeoutError.md +19 -0
data/docs/api/yard/HTM/Railtie.md +27 -0
data/docs/api/yard/HTM/ResourceExhaustedError.md +13 -0
data/docs/api/yard/HTM/TagError.md +18 -0
data/docs/api/yard/HTM/TagService.md +67 -0
data/docs/api/yard/HTM/Timeframe/Result.md +24 -0
data/docs/api/yard/HTM/Timeframe.md +40 -0
data/docs/api/yard/HTM/TimeframeExtractor/Result.md +24 -0
data/docs/api/yard/HTM/TimeframeExtractor.md +45 -0
data/docs/api/yard/HTM/ValidationError.md +20 -0
data/docs/api/yard/HTM/WorkingMemory.md +131 -0
data/docs/api/yard/HTM.md +80 -0
data/docs/api/yard/index.csv +179 -0
data/docs/api/yard-reference.md +51 -0
data/docs/architecture/adrs/001-postgresql-timescaledb.md +1 -1
data/docs/architecture/adrs/003-ollama-embeddings.md +1 -1
data/docs/architecture/adrs/010-redis-working-memory-rejected.md +2 -27
data/docs/architecture/adrs/index.md +2 -13
data/docs/architecture/hive-mind.md +165 -166
data/docs/architecture/index.md +2 -2
data/docs/architecture/overview.md +5 -171
data/docs/architecture/two-tier-memory.md +1 -35
data/docs/assets/images/adr-010-current-architecture.svg +37 -0
data/docs/assets/images/adr-010-proposed-architecture.svg +48 -0
data/docs/assets/images/adr-dependency-tree.svg +93 -0
data/docs/assets/images/class-hierarchy.svg +55 -0
data/docs/assets/images/exception-hierarchy.svg +45 -0
data/docs/assets/images/htm-architecture-overview.svg +83 -0
data/docs/assets/images/htm-complete-memory-flow.svg +160 -0
data/docs/assets/images/htm-context-assembly-flow.svg +148 -0
data/docs/assets/images/htm-eviction-process.svg +141 -0
data/docs/assets/images/htm-memory-addition-flow.svg +138 -0
data/docs/assets/images/htm-memory-recall-flow.svg +152 -0
data/docs/assets/images/htm-node-states.svg +123 -0
data/docs/assets/images/project-structure.svg +78 -0
data/docs/assets/images/test-directory-structure.svg +38 -0
data/{dbdoc → docs/database}/README.md +127 -125
data/docs/database/public.file_sources.md +42 -0
data/docs/database/public.file_sources.svg +211 -0
data/{dbdoc → docs/database}/public.node_tags.md +7 -8
data/docs/database/public.node_tags.svg +239 -0
data/{dbdoc → docs/database}/public.nodes.md +22 -17
data/docs/database/public.nodes.svg +271 -0
data/docs/database/public.robot_nodes.md +46 -0
data/docs/database/public.robot_nodes.svg +243 -0
data/{dbdoc → docs/database}/public.robots.md +2 -3
data/docs/database/public.robots.svg +161 -0
data/docs/database/public.tags.svg +139 -0
data/{dbdoc → docs/database}/schema.json +941 -630
data/docs/database/schema.svg +282 -0
data/docs/development/index.md +1 -29
data/docs/development/schema.md +134 -309
data/docs/development/testing.md +1 -9
data/docs/getting-started/index.md +47 -0
data/docs/{installation.md → getting-started/installation.md} +2 -2
data/docs/{quick-start.md → getting-started/quick-start.md} +5 -5
data/docs/guides/adding-memories.md +295 -643
data/docs/guides/recalling-memories.md +36 -1
data/docs/guides/search-strategies.md +85 -51
data/docs/images/htm-er-diagram.svg +156 -0
data/docs/index.md +16 -31
data/docs/multi_framework_support.md +4 -4
data/examples/README.md +280 -0
data/examples/basic_usage.rb +18 -16
data/examples/cli_app/htm_cli.rb +146 -8
data/examples/cli_app/temp.log +93 -0
data/examples/custom_llm_configuration.rb +1 -2
data/examples/example_app/app.rb +11 -14
data/examples/file_loader_usage.rb +177 -0
data/examples/robot_groups/lib/robot_group.rb +419 -0
data/examples/robot_groups/lib/working_memory_channel.rb +140 -0
data/examples/robot_groups/multi_process.rb +286 -0
data/examples/robot_groups/robot_worker.rb +136 -0
data/examples/robot_groups/same_process.rb +229 -0
data/examples/sinatra_app/Gemfile +1 -0
data/examples/sinatra_app/Gemfile.lock +166 -0
data/examples/sinatra_app/app.rb +219 -24
data/examples/timeframe_demo.rb +276 -0
data/lib/htm/active_record_config.rb +10 -3
data/lib/htm/circuit_breaker.rb +202 -0
data/lib/htm/configuration.rb +313 -80
data/lib/htm/database.rb +67 -36
data/lib/htm/embedding_service.rb +39 -2
data/lib/htm/errors.rb +131 -11
data/lib/htm/{sinatra.rb → integrations/sinatra.rb} +87 -12
data/lib/htm/job_adapter.rb +10 -3
data/lib/htm/jobs/generate_embedding_job.rb +5 -4
data/lib/htm/jobs/generate_tags_job.rb +4 -0
data/lib/htm/loaders/markdown_loader.rb +263 -0
data/lib/htm/loaders/paragraph_chunker.rb +112 -0
data/lib/htm/long_term_memory.rb +601 -321
data/lib/htm/models/file_source.rb +99 -0
data/lib/htm/models/node.rb +116 -12
data/lib/htm/models/robot.rb +53 -4
data/lib/htm/models/robot_node.rb +51 -0
data/lib/htm/models/tag.rb +302 -0
data/lib/htm/observability.rb +395 -0
data/lib/htm/tag_service.rb +60 -3
data/lib/htm/tasks.rb +29 -0
data/lib/htm/timeframe.rb +194 -0
data/lib/htm/timeframe_extractor.rb +307 -0
data/lib/htm/version.rb +1 -1
data/lib/htm/working_memory.rb +165 -70
data/lib/htm.rb +352 -133
data/lib/tasks/doc.rake +300 -0
data/lib/tasks/files.rake +299 -0
data/lib/tasks/htm.rake +188 -2
data/lib/tasks/jobs.rake +10 -12
data/lib/tasks/tags.rake +194 -0
data/mkdocs.yml +91 -9
data/notes/ARCHITECTURE_REVIEW.md +1167 -0
data/notes/IMPLEMENTATION_SUMMARY.md +606 -0
data/notes/MULTI_FRAMEWORK_IMPLEMENTATION.md +451 -0
data/notes/next_steps.md +100 -0
data/notes/plan.md +627 -0
data/notes/tag_ontology_enhancement_ideas.md +222 -0
data/notes/timescaledb_removal_summary.md +200 -0
metadata +177 -37
data/db/migrate/20250101000002_create_robots.rb +0 -14
data/db/migrate/20250101000003_create_nodes.rb +0 -42
data/db/migrate/20250101000005_create_tags.rb +0 -38
data/db/migrate/20250101000007_add_node_vector_indexes.rb +0 -30
data/dbdoc/public.node_tags.svg +0 -112
data/dbdoc/public.nodes.svg +0 -118
data/dbdoc/public.robots.svg +0 -90
data/dbdoc/public.tags.svg +0 -60
data/dbdoc/schema.svg +0 -154
data/{dbdoc → docs/database}/public.node_stats.md +0 -0
data/{dbdoc → docs/database}/public.node_stats.svg +0 -0
data/{dbdoc → docs/database}/public.nodes_tags.md +0 -0
data/{dbdoc → docs/database}/public.nodes_tags.svg +0 -0
data/{dbdoc → docs/database}/public.ontology_structure.md +0 -0
data/{dbdoc → docs/database}/public.ontology_structure.svg +0 -0
data/{dbdoc → docs/database}/public.operations_log.md +0 -0
data/{dbdoc → docs/database}/public.operations_log.svg +0 -0
data/{dbdoc → docs/database}/public.relationships.md +0 -0
data/{dbdoc → docs/database}/public.relationships.svg +0 -0
data/{dbdoc → docs/database}/public.robot_activity.md +0 -0
data/{dbdoc → docs/database}/public.robot_activity.svg +0 -0
data/{dbdoc → docs/database}/public.schema_migrations.md +0 -0
data/{dbdoc → docs/database}/public.schema_migrations.svg +0 -0
data/{dbdoc → docs/database}/public.tags.md +3 -3
/data/{dbdoc → docs/database}/public.topic_relationships.md +0 -0
/data/{dbdoc → docs/database}/public.topic_relationships.svg +0 -0

data/docs/api/htm.md CHANGED Viewed

@@ -7,15 +7,15 @@ The main interface for HTM's intelligent memory management system.
 `HTM` is the primary class that orchestrates the two-tier memory system:
 - **Working Memory**: Token-limited active context for immediate LLM use
-- **Long-term Memory**: Durable PostgreSQL storage
+- **Long-term Memory**: Durable PostgreSQL storage with vector embeddings
 Key features:
 - Never forgets unless explicitly told (`forget`)
 - RAG-based retrieval (temporal + semantic search)
-- Multi-robot "hive mind" - all robots share global memory
-- Relationship graphs for knowledge connections
-- Time-series optimized with TimescaleDB
+- Multi-robot "hive mind" - all robots share global memory via content deduplication
+- Hierarchical tagging for knowledge organization
+- Tag-enhanced hybrid search for improved relevance
 ## Class Definition
@@ -34,11 +34,12 @@ Create a new HTM instance.
 ```ruby
 HTM.new(
   working_memory_size: 128_000,
-  robot_id: nil,
   robot_name: nil,
   db_config: nil,
-  embedding_service: :ollama,
-  embedding_model: 'gpt-oss'
+  db_pool_size: 5,
+  db_query_timeout: 30_000,
+  db_cache_size: 1000,
+  db_cache_ttl: 300
 )
 ```
@@ -47,11 +48,12 @@ HTM.new(
 | Parameter | Type | Default | Description |
 |-----------|------|---------|-------------|
 | `working_memory_size` | Integer | `128_000` | Maximum tokens for working memory |
-| `robot_id` | String, nil | Auto-generated UUID | Unique identifier for this robot |
-| `robot_name` | String, nil | `"robot_#{id[0..7]}"` | Human-readable name |
+| `robot_name` | String, nil | `"robot_#{uuid[0..7]}"` | Human-readable name |
 | `db_config` | Hash, nil | From `ENV['HTM_DBURL']` | Database configuration |
-| `embedding_service` | Symbol | `:ollama` | Embedding provider (`:ollama`, `:openai`, `:cohere`, `:local`) |
-| `embedding_model` | String | `'gpt-oss'` | Model name for embeddings |
+| `db_pool_size` | Integer | `5` | Database connection pool size |
+| `db_query_timeout` | Integer | `30_000` | Query timeout in milliseconds |
+| `db_cache_size` | Integer | `1000` | Query cache size (0 to disable) |
+| `db_cache_ttl` | Integer | `300` | Cache TTL in seconds |
 #### Returns
@@ -69,14 +71,7 @@ htm = HTM.new(
   working_memory_size: 256_000
 )
-# OpenAI embeddings
-htm = HTM.new(
-  robot_name: "Research Bot",
-  embedding_service: :openai,
-  embedding_model: 'text-embedding-3-small'
-)
-# Custom database
+# Custom database configuration
 htm = HTM.new(
   db_config: {
     host: 'localhost',
@@ -86,6 +81,12 @@ htm = HTM.new(
     password: 'secret'
   }
 )
+# With caching disabled
+htm = HTM.new(
+  robot_name: "No Cache Bot",
+  db_cache_size: 0
+)
 ```
 ---
@@ -94,13 +95,13 @@ htm = HTM.new(
 ### `robot_id` {: #robot_id }
-Unique identifier for this robot instance.
+Unique integer identifier for this robot instance.
-- **Type**: String (UUID format)
+- **Type**: Integer
 - **Read-only**: Yes
 ```ruby
-htm.robot_id  # => "a1b2c3d4-e5f6-..."
+htm.robot_id  # => 42
 ```
 ### `robot_name` {: #robot_name }
@@ -140,106 +141,102 @@ htm.long_term_memory.stats  # => {...}
 ## Public Methods
-### `add_node(key, value, **options)` {: #add_node }
+### `remember(content, tags:, metadata:)` {: #remember }
-Add a new memory node to both working and long-term memory.
+Remember new information by storing it in long-term memory.
 ```ruby
-add_node(key, value,
-  type: nil,
-  category: nil,
-  importance: 1.0,
-  related_to: [],
-  tags: []
-)
+remember(content, tags: [], metadata: {})
 ```
 #### Parameters
 | Parameter | Type | Default | Description |
 |-----------|------|---------|-------------|
-| `key` | String | *required* | Unique identifier for this node |
-| `value` | String | *required* | Content of the memory |
-| `type` | Symbol, nil | `nil` | Memory type (`:fact`, `:context`, `:code`, `:preference`, `:decision`, `:question`) |
-| `category` | String, nil | `nil` | Optional category for organization |
-| `importance` | Float | `1.0` | Importance score (0.0-10.0) |
-| `related_to` | Array\<String\> | `[]` | Keys of related nodes |
-| `tags` | Array\<String\> | `[]` | Tags for categorization |
+| `content` | String | *required* | The information to remember |
+| `tags` | Array\<String\> | `[]` | Manual tags to assign (in addition to auto-extracted tags) |
+| `metadata` | Hash | `{}` | Arbitrary key-value metadata stored as JSONB. Keys must be strings or symbols. |
 #### Returns
-- `Integer` - Database ID of the created node
+- `Integer` - Database ID of the memory node
 #### Side Effects
-- Stores node in PostgreSQL with vector embedding
+- Stores node in PostgreSQL with content deduplication (via SHA-256 hash)
+- Creates/updates `robot_nodes` association for this robot
 - Adds node to working memory (evicts if needed)
-- Creates relationships to `related_to` nodes
-- Adds tags to the node
-- Logs operation to `operations_log` table
+- Enqueues background job for embedding generation (new nodes only)
+- Enqueues background job for tag extraction (new nodes only)
 - Updates robot activity timestamp
+#### Content Deduplication
+When `remember()` is called:
+1. A SHA-256 hash of the content is computed
+2. If a node with the same hash exists, the existing node is reused
+3. A new `robot_nodes` association is created (or `remember_count` is incremented)
+4. This ensures identical memories are stored once but can be "remembered" by multiple robots
 #### Examples
 ```ruby
-# Simple fact
-htm.add_node("db_choice", "We chose PostgreSQL for its reliability")
-# Architectural decision
-htm.add_node(
-  "api_gateway_decision",
-  "Decided to use Kong as API gateway for rate limiting and auth",
-  type: :decision,
-  importance: 9.0,
-  tags: ["architecture", "api", "gateway"],
-  related_to: ["microservices_architecture"]
+# Basic usage
+node_id = htm.remember("PostgreSQL supports vector similarity search via pgvector")
+# With manual tags
+node_id = htm.remember(
+  "Time-series data works great with hypertables",
+  tags: ["database:timescaledb", "performance"]
 )
-# Code snippet
-code = <<~RUBY
-  def calculate_total(items)
-    items.sum(&:price)
-  end
-RUBY
-htm.add_node(
-  "total_calculation_v1",
-  code,
-  type: :code,
-  category: "helpers",
-  importance: 5.0,
-  tags: ["ruby", "calculation"]
+# With metadata
+node_id = htm.remember(
+  "User prefers dark mode for all interfaces",
+  metadata: { category: "preference", priority: "high", source_app: "settings" }
 )
-# User preference
-htm.add_node(
-  "user_123_timezone",
-  "User prefers UTC timezone for all timestamps",
-  type: :preference,
-  category: "user_settings",
-  importance: 6.0
+# With both tags and metadata
+node_id = htm.remember(
+  "API rate limit is 1000 requests per minute",
+  tags: ["api:rate-limiting", "infrastructure"],
+  metadata: { environment: "production", version: 2 }
 )
+# Multiple robots remembering the same content
+robot1 = HTM.new(robot_name: "assistant_1")
+robot2 = HTM.new(robot_name: "assistant_2")
+# Both robots remember the same fact - stored once, linked to both
+robot1.remember("Ruby 3.3 was released in December 2023")
+robot2.remember("Ruby 3.3 was released in December 2023")
+# Same node_id returned, remember_count incremented for robot2
 ```
 #### Notes
-- The `key` must be unique across all nodes
-- Embeddings are generated automatically
-- Token count is calculated automatically
-- If working memory is full, less important nodes are evicted
+- Embeddings and hierarchical tags are generated asynchronously via background jobs
+- Empty content returns the ID of the most recent node without creating a duplicate
+- Token count is calculated automatically using the configured token counter
+- Metadata is stored in a JSONB column with a GIN index for efficient queries
 ---
-### `recall(timeframe:, topic:, **options)` {: #recall }
+### `recall(topic, **options)` {: #recall }
-Recall memories from a timeframe and topic using RAG-based retrieval.
+Recall memories from long-term storage using RAG-based retrieval.
 ```ruby
 recall(
-  timeframe:,
-  topic:,
+  topic,
+  timeframe: "last 7 days",
   limit: 20,
-  strategy: :vector
+  strategy: :vector,
+  with_relevance: false,
+  query_tags: [],
+  metadata: {},
+  raw: false
 )
 ```
@@ -247,10 +244,14 @@ recall(
 | Parameter | Type | Default | Description |
 |-----------|------|---------|-------------|
-| `timeframe` | String, Range | *required* | Time range (e.g., `"last week"`, `7.days.ago..Time.now`) |
 | `topic` | String | *required* | Topic to search for |
+| `timeframe` | String, Range | `"last 7 days"` | Time range |
 | `limit` | Integer | `20` | Maximum number of nodes to retrieve |
 | `strategy` | Symbol | `:vector` | Search strategy (`:vector`, `:fulltext`, `:hybrid`) |
+| `with_relevance` | Boolean | `false` | Include dynamic relevance scores |
+| `query_tags` | Array\<String\> | `[]` | Tags to boost relevance |
+| `metadata` | Hash | `{}` | Filter results by metadata (uses JSONB `@>` containment) |
+| `raw` | Boolean | `false` | Return full node hashes instead of content strings |
 #### Timeframe Formats
@@ -274,27 +275,36 @@ Range format:
 |----------|-------------|----------|
 | `:vector` | Semantic similarity using embeddings | Find conceptually related content |
 | `:fulltext` | PostgreSQL full-text search | Find exact terms and phrases |
-| `:hybrid` | Fulltext prefilter + vector ranking | Best accuracy + semantic understanding |
+| `:hybrid` | Vector + fulltext + tag matching | Best accuracy with tag boosting |
+#### Tag-Enhanced Hybrid Search
+When using `:hybrid` strategy, the search automatically:
+1. Finds tags matching query terms (words 3+ chars)
+2. Includes nodes with matching tags in the candidate pool
+3. Calculates combined score: `(similarity × 0.7) + (tag_boost × 0.3)`
+4. Returns results sorted by combined score
 #### Returns
-- `Array<Hash>` - Retrieved memory nodes
+- `Array<String>` - Content strings (when `raw: false`, default)
+- `Array<Hash>` - Full node hashes (when `raw: true`)
-Each hash contains:
+When `raw: true`, each hash contains:
 ```ruby
 {
   "id" => 123,                    # Database ID
-  "key" => "node_key",            # Node identifier
-  "value" => "content...",        # Node content
-  "type" => "fact",               # Node type
-  "category" => "architecture",   # Category
-  "importance" => 8.0,            # Importance score
+  "content" => "...",             # Node content
+  "content_hash" => "abc123...",  # SHA-256 hash
+  "access_count" => 5,            # Times accessed
   "created_at" => "2025-01-15...", # Creation timestamp
-  "robot_id" => "abc123...",      # Robot that created it
   "token_count" => 125,           # Token count
-  "similarity" => 0.87            # Similarity score (vector/hybrid only)
-  # or "rank" => 0.456            # Rank score (fulltext only)
+  "metadata" => { "category" => "preference", "priority" => "high" },  # JSONB metadata
+  "similarity" => 0.87,           # Similarity score (hybrid/vector)
+  "tag_boost" => 0.3,             # Tag boost score (hybrid only)
+  "combined_score" => 0.79        # Combined score (hybrid only)
 }
 ```
@@ -302,118 +312,96 @@ Each hash contains:
 - Adds recalled nodes to working memory
 - Evicts existing nodes if working memory is full
-- Logs operation to `operations_log` table
 - Updates robot activity timestamp
 #### Examples
 ```ruby
+# Basic usage (returns content strings)
+memories = htm.recall("PostgreSQL")
+# => ["PostgreSQL supports vector search...", "PostgreSQL with pgvector..."]
+# Get full node hashes
+nodes = htm.recall("PostgreSQL", raw: true)
+# => [{"id" => 1, "content" => "...", "similarity" => 0.92, ...}, ...]
 # Vector semantic search
 memories = htm.recall(
+  "database performance optimization",
   timeframe: "last week",
-  topic: "database performance optimization"
+  strategy: :vector
 )
 # Fulltext search for exact phrases
 memories = htm.recall(
+  "PostgreSQL connection pooling",
   timeframe: "last 30 days",
-  topic: "PostgreSQL connection pooling",
   strategy: :fulltext,
   limit: 10
 )
-# Hybrid search (best of both)
+# Hybrid search with tag boosting (recommended)
 memories = htm.recall(
+  "API rate limiting implementation",
   timeframe: "this month",
-  topic: "API rate limiting implementation",
   strategy: :hybrid,
-  limit: 15
+  limit: 15,
+  raw: true
 )
+# Check matching tags for a query
+matching_tags = htm.long_term_memory.find_query_matching_tags("PostgreSQL")
+# => ["database:postgresql", "database:postgresql:extensions"]
 # Custom time range
 start_time = Time.new(2025, 1, 1)
 end_time = Time.now
 memories = htm.recall(
+  "security vulnerabilities",
   timeframe: start_time..end_time,
-  topic: "security vulnerabilities",
   limit: 50
 )
-# Process results
-memories.each do |memory|
-  puts "#{memory['created_at']}: #{memory['value']}"
-  puts "  Similarity: #{memory['similarity']}" if memory['similarity']
-  puts "  Robot: #{memory['robot_id']}"
-end
+# Filter by metadata
+memories = htm.recall(
+  "user preferences",
+  metadata: { category: "preference" }
+)
+# => Returns only nodes with metadata containing { category: "preference" }
+# Combine metadata with other filters
+memories = htm.recall(
+  "API configuration",
+  timeframe: "last month",
+  strategy: :hybrid,
+  metadata: { environment: "production", version: 2 },
+  raw: true
+)
+# => Returns production configs with version 2, sorted by relevance
 ```
 #### Performance Notes
 - Vector search: Best for semantic understanding, requires embedding generation
 - Fulltext search: Fastest for exact matches, no embedding overhead
-- Hybrid search: Slower but most accurate, combines both approaches
+- Hybrid search: Most accurate, combines vector + fulltext + tags with weighted scoring
 ---
-### `retrieve(key)` {: #retrieve }
-Retrieve a specific memory node by its key.
-```ruby
-retrieve(key)
-```
-#### Parameters
-| Parameter | Type | Description |
-|-----------|------|-------------|
-| `key` | String | Key of the node to retrieve |
-#### Returns
-- `Hash` - Node data if found
-- `nil` - If node doesn't exist
-#### Side Effects
-- Updates `last_accessed` timestamp for the node
-- Logs operation to `operations_log` table
-#### Examples
-```ruby
-# Retrieve a node
-node = htm.retrieve("api_decision_001")
-if node
-  puts node['value']
-  puts "Created: #{node['created_at']}"
-  puts "Importance: #{node['importance']}"
-else
-  puts "Node not found"
-end
-# Use retrieved data
-config = htm.retrieve("database_config")
-db_url = JSON.parse(config['value'])['url'] if config
-```
----
-### `forget(key, confirm:)` {: #forget }
+### `forget(node_id, confirm:)` {: #forget }
 Explicitly delete a memory node. Requires confirmation to prevent accidental deletion.
 ```ruby
-forget(key, confirm: :confirmed)
+forget(node_id, confirm: :confirmed)
 ```
 #### Parameters
 | Parameter | Type | Description |
 |-----------|------|-------------|
-| `key` | String | Key of the node to delete |
+| `node_id` | Integer | ID of the node to delete |
 | `confirm` | Symbol | Must be `:confirmed` to proceed |
 #### Returns
@@ -423,27 +411,28 @@ forget(key, confirm: :confirmed)
 #### Raises
 - `ArgumentError` - If `confirm` is not `:confirmed`
+- `ArgumentError` - If `node_id` is nil
+- `HTM::NotFoundError` - If node doesn't exist
 #### Side Effects
 - Deletes node from PostgreSQL
 - Removes node from working memory
-- Logs operation before deletion
 - Updates robot activity timestamp
 #### Examples
 ```ruby
 # Correct usage
-htm.forget("temp_note_123", confirm: :confirmed)
+htm.forget(123, confirm: :confirmed)
 # This will raise ArgumentError
-htm.forget("temp_note_123")  # Missing confirm parameter
+htm.forget(123)  # Missing confirm parameter
 # Safe deletion with verification
-if htm.retrieve("old_data")
-  htm.forget("old_data", confirm: :confirmed)
-  puts "Deleted old_data"
+if htm.long_term_memory.exists?(node_id)
+  htm.forget(node_id, confirm: :confirmed)
+  puts "Deleted node #{node_id}"
 end
 ```
@@ -451,248 +440,166 @@ end
 - This is the **only** way to delete data from HTM
 - Deletion is permanent and cannot be undone
-- Related relationships and tags are also deleted (CASCADE)
+- Related robot_nodes, node_tags are also deleted (CASCADE)
+- Other robots' associations to this node are also removed
 ---
-### `create_context(strategy:, max_tokens:)` {: #create_context }
+### `load_file(path, force: false)` {: #load_file }
-Create a context string from working memory for LLM consumption.
+Load a markdown file into long-term memory with automatic chunking and source tracking.
 ```ruby
-create_context(strategy: :balanced, max_tokens: nil)
+load_file(path, force: false)
 ```
 #### Parameters
 | Parameter | Type | Default | Description |
 |-----------|------|---------|-------------|
-| `strategy` | Symbol | `:balanced` | Assembly strategy |
-| `max_tokens` | Integer, nil | Working memory max | Optional token limit |
+| `path` | String | *required* | Path to the markdown file to load |
+| `force` | Boolean | `false` | Force re-sync even if file hasn't changed |
-#### Assembly Strategies
+#### Returns
-| Strategy | Behavior | Use Case |
-|----------|----------|----------|
-| `:recent` | Most recently accessed first | Prioritize latest information |
-| `:important` | Highest importance scores first | Focus on critical information |
-| `:balanced` | Weighted by importance × recency | Best general-purpose strategy |
+- `Hash` with keys:
+  - `file_source_id` - ID of the FileSource record
+  - `chunks_created` - Number of new nodes created
+  - `chunks_updated` - Number of existing nodes updated
+  - `chunks_deleted` - Number of nodes soft-deleted
-#### Returns
+#### Side Effects
-- `String` - Assembled context with nodes separated by `"\n\n"`
+- Creates or updates a FileSource record for tracking
+- Parses YAML frontmatter and stores as metadata
+- Chunks content by paragraph, preserving code blocks
+- Creates nodes for each chunk with `source_id` linking to file
+- Triggers async embedding and tag extraction for new nodes
 #### Examples
 ```ruby
-# Balanced context (default)
-context = htm.create_context(strategy: :balanced)
-# Recent context with token limit
-context = htm.create_context(
-  strategy: :recent,
-  max_tokens: 50_000
-)
-# Important context only
-context = htm.create_context(strategy: :important)
+# Load a file
+result = htm.load_file("docs/guide.md")
+# => { file_source_id: 1, chunks_created: 5, chunks_updated: 0, chunks_deleted: 0 }
-# Use in LLM prompt
-prompt = <<~PROMPT
-  You are a helpful assistant.
+# Force reload even if unchanged
+result = htm.load_file("docs/guide.md", force: true)
-  Context from memory:
-  #{context}
-  User question: #{user_input}
-PROMPT
+# File with frontmatter
+# ---
+# title: User Guide
+# tags: [documentation, tutorial]
+# ---
+# Content here...
+result = htm.load_file("docs/guide.md")
+# Frontmatter stored in FileSource.frontmatter
 ```
-#### Notes
-- Nodes are concatenated with double newlines
-- Token limits are respected (stops adding when limit reached)
-- Empty string if working memory is empty
 ---
-### `memory_stats()` {: #memory_stats }
+### `load_directory(path, pattern: '**/*.md', force: false)` {: #load_directory }
-Get comprehensive statistics about memory usage.
+Load all matching files in a directory into long-term memory.
 ```ruby
-memory_stats()
+load_directory(path, pattern: '**/*.md', force: false)
 ```
-#### Returns
+#### Parameters
-- `Hash` - Statistics hash
+| Parameter | Type | Default | Description |
+|-----------|------|---------|-------------|
+| `path` | String | *required* | Directory path to scan |
+| `pattern` | String | `'**/*.md'` | Glob pattern for matching files |
+| `force` | Boolean | `false` | Force re-sync all files |
-Structure:
+#### Returns
-```ruby
-{
-  robot_id: "abc123...",
-  robot_name: "Assistant",
-  # Long-term memory stats
-  total_nodes: 1234,
-  nodes_by_robot: {
-    "robot-1" => 500,
-    "robot-2" => 734
-  },
-  nodes_by_type: [
-    {"type" => "fact", "count" => 400},
-    {"type" => "decision", "count" => 200},
-    ...
-  ],
-  total_relationships: 567,
-  total_tags: 890,
-  oldest_memory: "2025-01-01 12:00:00",
-  newest_memory: "2025-01-15 14:30:00",
-  active_robots: 3,
-  robot_activity: [...],
-  database_size: 12345678,
-  # Working memory stats
-  working_memory: {
-    current_tokens: 45234,
-    max_tokens: 128000,
-    utilization: 35.34,
-    node_count: 23
-  }
-}
-```
+- `Array<Hash>` - Results for each file loaded, each containing:
+  - `file_path` - Path of the loaded file
+  - `file_source_id` - ID of the FileSource record
+  - `chunks_created` - Number of new nodes created
+  - `chunks_updated` - Number of existing nodes updated
+  - `chunks_deleted` - Number of nodes soft-deleted
 #### Examples
 ```ruby
-stats = htm.memory_stats
-puts "Total memories: #{stats[:total_nodes]}"
-puts "Working memory: #{stats[:working_memory][:utilization]}% full"
-puts "Active robots: #{stats[:active_robots]}"
+# Load all markdown files
+results = htm.load_directory("docs/")
-# Check if working memory is getting full
-if stats[:working_memory][:utilization] > 80
-  puts "Warning: Working memory is #{stats[:working_memory][:utilization]}% full"
-end
+# Load with custom pattern
+results = htm.load_directory("content/", pattern: "**/*.md")
-# Display by robot
-stats[:nodes_by_robot].each do |robot_id, count|
-  puts "#{robot_id}: #{count} nodes"
-end
+# Force reload all
+results = htm.load_directory("docs/", force: true)
 ```
 ---
-### `which_robot_said(topic, limit:)` {: #which_robot_said }
+### `nodes_from_file(file_path)` {: #nodes_from_file }
-Find which robots have discussed a specific topic.
+Get all nodes loaded from a specific file.
 ```ruby
-which_robot_said(topic, limit: 100)
+nodes_from_file(file_path)
 ```
 #### Parameters
-| Parameter | Type | Default | Description |
-|-----------|------|---------|-------------|
-| `topic` | String | *required* | Topic to search for |
-| `limit` | Integer | `100` | Maximum results to consider |
+| Parameter | Type | Description |
+|-----------|------|-------------|
+| `file_path` | String | Path of the source file |
 #### Returns
-- `Hash` - Robot IDs mapped to mention counts
-```ruby
-{
-  "robot-abc123" => 15,
-  "robot-def456" => 8,
-  "robot-ghi789" => 3
-}
-```
+- `Array<HTM::Models::Node>` - Nodes from the file, ordered by chunk position
 #### Examples
 ```ruby
-# Find who discussed deployment
-robots = htm.which_robot_said("deployment")
-# => {"robot-1" => 12, "robot-2" => 5}
-# Top contributor
-top_robot, count = robots.max_by { |robot, count| count }
-puts "#{top_robot} mentioned it #{count} times"
-# Check if specific robot discussed it
-if robots.key?("robot-123")
-  puts "Robot-123 discussed deployment #{robots['robot-123']} times"
+nodes = htm.nodes_from_file("docs/guide.md")
+nodes.each do |node|
+  puts "Chunk #{node.chunk_position}: #{node.content[0..50]}..."
 end
 ```
 ---
-### `conversation_timeline(topic, limit:)` {: #conversation_timeline }
+### `unload_file(file_path)` {: #unload_file }
-Get a chronological timeline of conversation about a topic.
+Remove a file from memory by soft-deleting all its chunks and the file source.
 ```ruby
-conversation_timeline(topic, limit: 50)
+unload_file(file_path)
 ```
 #### Parameters
-| Parameter | Type | Default | Description |
-|-----------|------|---------|-------------|
-| `topic` | String | *required* | Topic to search for |
-| `limit` | Integer | `50` | Maximum results |
+| Parameter | Type | Description |
+|-----------|------|-------------|
+| `file_path` | String | Path of the source file to unload |
 #### Returns
-- `Array<Hash>` - Timeline entries sorted by timestamp
+- `true` if file was found and unloaded
+- `false` if file was not found
-Structure:
+#### Side Effects
-```ruby
-[
-  {
-    timestamp: "2025-01-15 10:30:00",
-    robot: "robot-abc123",
-    content: "We should consider PostgreSQL...",
-    type: "decision"
-  },
-  {
-    timestamp: "2025-01-15 11:45:00",
-    robot: "robot-def456",
-    content: "Agreed, PostgreSQL has better...",
-    type: "fact"
-  },
-  ...
-]
-```
+- Soft-deletes all nodes from the file (sets `deleted_at`)
+- Destroys the FileSource record
 #### Examples
 ```ruby
-# Get timeline
-timeline = htm.conversation_timeline("API design", limit: 20)
-# Display timeline
-timeline.each do |entry|
-  puts "[#{entry[:timestamp]}] #{entry[:robot]}:"
-  puts "  #{entry[:content]}"
-  puts "  (#{entry[:type]})"
-  puts
-end
+# Unload a file
+htm.unload_file("docs/guide.md")
-# Find first mention
-first = timeline.first
-puts "First discussed by #{first[:robot]} at #{first[:timestamp]}"
-# Group by robot
-by_robot = timeline.group_by { |e| e[:robot] }
-by_robot.each do |robot, entries|
-  puts "#{robot}: #{entries.size} contributions"
+# Check if file is loaded
+if htm.nodes_from_file("docs/guide.md").empty?
+  puts "File not loaded"
 end
 ```
@@ -704,12 +611,24 @@ end
 ```ruby
 # Invalid confirm parameter
-htm.forget("key")
+htm.forget(123)
 # => ArgumentError: Must pass confirm: :confirmed to delete
+# Nil node_id
+htm.forget(nil, confirm: :confirmed)
+# => ArgumentError: node_id cannot be nil
 # Invalid timeframe
-htm.recall(timeframe: nil, topic: "test")
-# => ArgumentError: Invalid timeframe: nil
+htm.recall("test", timeframe: 123)
+# => ValidationError: Timeframe must be a Range or String
+```
+### HTM::NotFoundError
+```ruby
+# Node doesn't exist
+htm.forget(999999, confirm: :confirmed)
+# => HTM::NotFoundError: Node not found: 999999
 ```
 ### PG::Error
@@ -718,73 +637,95 @@ htm.recall(timeframe: nil, topic: "test")
 # Database connection issues
 htm = HTM.new(db_config: { host: 'invalid' })
 # => PG::ConnectionBad: could not translate host name...
-# Duplicate key
-htm.add_node("existing_key", "value")
-# => PG::UniqueViolation: duplicate key value...
 ```
 ## Best Practices
-### Memory Organization
+### Content Organization
 ```ruby
-# Use consistent key naming
-htm.add_node("decision_20250115_api_gateway", ...)
-htm.add_node("fact_20250115_database_choice", ...)
-# Use importance strategically
-htm.add_node(key, value, importance: 9.0)  # Critical
-htm.add_node(key, value, importance: 5.0)  # Normal
-htm.add_node(key, value, importance: 2.0)  # Low priority
-# Build knowledge graphs
-htm.add_node(
-  "api_v2_implementation",
-  "...",
-  related_to: ["api_v1_design", "authentication_decision"]
+# Use meaningful content that stands alone
+htm.remember("PostgreSQL was chosen for its reliability and pgvector support")
+# Add hierarchical tags for organization
+htm.remember(
+  "Rate limiting implemented using Redis sliding window algorithm",
+  tags: ["architecture:api:rate-limiting", "database:redis"]
 )
+# Let the system extract tags automatically for most content
+htm.remember("The authentication system uses JWT tokens with 1-hour expiry")
+# Auto-extracted tags might include: security:authentication, technology:jwt
 ```
 ### Search Strategies
 ```ruby
+# Use hybrid for best results (recommended)
+memories = htm.recall(
+  "security vulnerability",
+  strategy: :hybrid  # Combines vector + fulltext + tags
+)
 # Use vector for semantic understanding
 memories = htm.recall(
-  timeframe: "last month",
-  topic: "performance issues",
+  "performance issues",
   strategy: :vector  # Finds "slow queries", "optimization", etc.
 )
 # Use fulltext for exact terms
 memories = htm.recall(
-  timeframe: "this week",
-  topic: "PostgreSQL EXPLAIN ANALYZE",
+  "PostgreSQL EXPLAIN ANALYZE",
   strategy: :fulltext  # Exact match
 )
+```
-# Use hybrid for best results
-memories = htm.recall(
-  timeframe: "last week",
-  topic: "security vulnerability",
-  strategy: :hybrid  # Accurate + semantic
-)
+### Leveraging Tag-Enhanced Search
+```ruby
+# Check what tags exist for a topic
+tags = htm.long_term_memory.find_query_matching_tags("database")
+# => ["database:postgresql", "database:redis", "database:timescaledb"]
+# Hybrid search automatically boosts nodes with matching tags
+memories = htm.recall("database optimization", strategy: :hybrid, raw: true)
+memories.each do |m|
+  puts "Score: #{m['combined_score']} (sim: #{m['similarity']}, tag: #{m['tag_boost']})"
+end
+```
+### Multi-Robot Memory Sharing
+```ruby
+# Content is deduplicated across robots
+assistant = HTM.new(robot_name: "assistant")
+researcher = HTM.new(robot_name: "researcher")
+# Both robots remember the same fact
+assistant.remember("Ruby 3.3 supports YJIT by default")
+researcher.remember("Ruby 3.3 supports YJIT by default")
+# Node stored once, linked to both robots
+# Any robot can recall shared memories
+memories = assistant.recall("Ruby YJIT")
+# Returns the shared memory
 ```
 ### Resource Management
 ```ruby
 # Check working memory before large operations
-stats = htm.memory_stats
-if stats[:working_memory][:utilization] > 90
-  # Maybe explicitly recall less
+stats = htm.working_memory.stats
+if stats[:utilization] > 90
+  # Consider clearing working memory or using smaller limits
 end
 # Use appropriate limits
-htm.recall(topic: "common_topic", limit: 10)  # Not 1000
+htm.recall("common_topic", limit: 10)  # Not 1000
-# Monitor database size
-if stats[:database_size] > 1_000_000_000  # 1GB
+# Monitor node counts
+node_count = HTM::Models::Node.count
+if node_count > 1_000_000
   # Consider archival strategy
 end
 ```