@soulcraft/brainy 3.34.0 → 3.36.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -2,6 +2,69 @@
2
2
 
3
3
  All notable changes to this project will be documented in this file. See [standard-version](https://github.com/conventional-changelog/standard-version) for commit guidelines.
4
4
 
5
+ ### [3.36.0](https://github.com/soulcraftlabs/brainy/compare/v3.35.0...v3.36.0) (2025-10-10)
6
+
7
+ #### 🚀 Always-Adaptive Caching with Enhanced Monitoring
8
+
9
+ **Zero Breaking Changes** - Internal optimizations with automatic performance improvements
10
+
11
+ #### What's New
12
+
13
+ - **Renamed API**: `getLazyModeStats()` → `getCacheStats()` (backward compatible)
14
+ - **Enhanced Metrics**: Changed `lazyModeEnabled: boolean` → `cachingStrategy: 'preloaded' | 'on-demand'`
15
+ - **Improved Thresholds**: Updated preloading threshold from 30% to 80% for better cache utilization
16
+ - **Better Terminology**: Eliminated "lazy mode" concept in favor of "adaptive caching strategy"
17
+ - **Production Monitoring**: Comprehensive diagnostics for capacity planning and tuning
18
+
19
+ #### Benefits
20
+
21
+ - ✅ **Clearer Semantics**: "preloaded" vs "on-demand" instead of confusing "lazy mode enabled/disabled"
22
+ - ✅ **Better Cache Utilization**: 80% threshold maximizes memory usage before switching to on-demand
23
+ - ✅ **Enhanced Monitoring**: `getCacheStats()` provides actionable insights for production deployments
24
+ - ✅ **Backward Compatible**: Deprecated `lazy` option still accepted (ignored, always adaptive)
25
+ - ✅ **Zero Config**: System automatically chooses optimal strategy based on dataset size and available memory
26
+
27
+ #### API Changes
28
+
29
+ ```typescript
30
+ // New API (recommended)
31
+ const stats = brain.hnsw.getCacheStats()
32
+ console.log(`Strategy: ${stats.cachingStrategy}`) // 'preloaded' or 'on-demand'
33
+ console.log(`Hit Rate: ${stats.unifiedCache.hitRatePercent}%`)
34
+ console.log(`Recommendations: ${stats.recommendations.join(', ')}`)
35
+
36
+ // Old API (deprecated but still works)
37
+ const oldStats = brain.hnsw.getLazyModeStats() // Returns same data
38
+ ```
39
+
40
+ #### Documentation Updates
41
+
42
+ - Added comprehensive migration guide: `docs/guides/migration-3.36.0.md`
43
+ - Added operations guide: `docs/operations/capacity-planning.md`
44
+ - Updated architecture docs with new terminology
45
+ - Renamed example: `monitor-lazy-mode.ts` → `monitor-cache-performance.ts`
46
+
47
+ #### Files Changed
48
+
49
+ - `src/hnsw/hnswIndex.ts`: Core adaptive caching improvements
50
+ - `src/interfaces/IIndex.ts`: Updated interface documentation
51
+ - `docs/guides/migration-3.36.0.md`: Complete migration guide
52
+ - `docs/operations/capacity-planning.md`: Enterprise operations guide
53
+ - `examples/monitor-cache-performance.ts`: Production monitoring example
54
+ - All documentation updated to reflect new terminology
55
+
56
+ #### Migration
57
+
58
+ **No action required!** All changes are backward compatible. Update your code to use `getCacheStats()` when convenient.
59
+
60
+ ---
61
+
62
+ ### [3.35.0](https://github.com/soulcraftlabs/brainy/compare/v3.34.0...v3.35.0) (2025-10-10)
63
+
64
+ - feat: implement HNSW index rebuild and unified index interface (6a4d1ae)
65
+ - cleaning up (12d78ba)
66
+
67
+
5
68
  ### [3.34.0](https://github.com/soulcraftlabs/brainy/compare/v3.33.0...v3.34.0) (2025-10-09)
6
69
 
7
70
  - test: adjust type-matching tests for real embeddings (v3.33.0) (1c5c77e)
package/README.md CHANGED
@@ -19,6 +19,32 @@
19
19
 
20
20
  ## 🎉 Key Features
21
21
 
22
+ ### ⚡ **NEW in 3.36.0: Production-Scale Memory & Performance**
23
+
24
+ **Enterprise-grade adaptive sizing and zero-overhead optimizations:**
25
+
26
+ - **🎯 Adaptive Memory Sizing**: Auto-scales from 2GB to 128GB+ based on available system resources
27
+ - Container-aware (Docker/K8s cgroups v1/v2 detection)
28
+ - Environment-smart (development 25%, container 40%, production 50% allocation)
29
+ - Model memory accounting (150MB Q8, 250MB FP32 reserved before cache)
30
+
31
+ - **⚡ Sync Fast Path**: Zero async overhead when vectors are cached
32
+ - Intelligent sync/async branching - synchronous when data is in memory
33
+ - Falls back to async only when loading from storage
34
+ - Massive performance win for hot paths (vector search, distance calculations)
35
+
36
+ - **📊 Production Monitoring**: Comprehensive diagnostics
37
+ - `getCacheStats()` - UnifiedCache hit rates, fairness metrics, memory pressure
38
+ - Actionable recommendations for tuning
39
+ - Tracks model memory, cache efficiency, and competition across indexes
40
+
41
+ - **🛡️ Zero Breaking Changes**: All optimizations are internal - your code stays the same
42
+ - Public API unchanged
43
+ - Automatic memory detection and allocation
44
+ - Progressive enhancement for existing applications
45
+
46
+ **[📖 Operations Guide →](docs/operations/capacity-planning.md)** | **[🎯 Migration Guide →](docs/guides/migration-3.36.0.md)**
47
+
22
48
  ### 🚀 **NEW in 3.21.0: Enhanced Import & Neural Processing**
23
49
 
24
50
  - **📊 Progress Tracking**: Unified progress reporting with automatic time estimation
@@ -38,7 +64,7 @@
38
64
 
39
65
  - **Modern Syntax**: `brain.add()`, `brain.find()`, `brain.relate()`
40
66
  - **Type Safety**: Full TypeScript integration
41
- - **Zero Config**: Works out of the box with memory storage
67
+ - **Zero Config**: Works out of the box with intelligent storage auto-detection
42
68
  - **Consistent Parameters**: Clean, predictable API surface
43
69
 
44
70
  ### ⚡ **Performance & Reliability**
@@ -352,7 +378,7 @@ const brain = new Brainy()
352
378
 
353
379
  // 2. Custom configuration
354
380
  const brain = new Brainy({
355
- storage: { type: 'memory' },
381
+ storage: { type: 'filesystem', path: './brainy-data' },
356
382
  embeddings: { model: 'all-MiniLM-L6-v2' },
357
383
  cache: { enabled: true, maxSize: 1000 }
358
384
  })
@@ -368,7 +394,7 @@ const customBrain = new Brainy({
368
394
 
369
395
  **What's Auto-Detected:**
370
396
 
371
- - **Storage**: S3/GCS/R2 → Filesystem → Memory (priority order)
397
+ - **Storage**: S3/GCS/R2 → Filesystem (priority order)
372
398
  - **Models**: Always Q8 for optimal balance
373
399
  - **Features**: Minimal → Default → Full based on environment
374
400
  - **Memory**: Optimal cache sizes and batching
@@ -390,13 +416,12 @@ Most users **never need this** - zero-config handles everything. For advanced us
390
416
  const brain = new Brainy() // Uses Q8 automatically
391
417
 
392
418
  // Storage control (auto-detected by default)
393
- const memoryBrain = new Brainy({storage: 'memory'}) // RAM only
394
- const diskBrain = new Brainy({storage: 'disk'}) // Local filesystem
419
+ const diskBrain = new Brainy({storage: 'disk'}) // Local filesystem
395
420
  const cloudBrain = new Brainy({storage: 'cloud'}) // S3/GCS/R2
396
421
 
397
422
  // Legacy full config (still supported)
398
423
  const legacyBrain = new Brainy({
399
- storage: {forceMemoryStorage: true}
424
+ storage: {type: 'filesystem', path: './data'}
400
425
  })
401
426
  ```
402
427
 
@@ -665,12 +690,7 @@ const context = await brain.find({
665
690
  Brainy supports multiple storage backends:
666
691
 
667
692
  ```javascript
668
- // Memory (default for testing)
669
- const brain = new Brainy({
670
- storage: {type: 'memory'}
671
- })
672
-
673
- // FileSystem (Node.js)
693
+ // FileSystem (Node.js - recommended for development)
674
694
  const brain = new Brainy({
675
695
  storage: {
676
696
  type: 'filesystem',
package/dist/brainy.d.ts CHANGED
@@ -1070,6 +1070,21 @@ export declare class Brainy<T = any> implements BrainyInterface<T> {
1070
1070
  /**
1071
1071
  * Rebuild indexes if there's existing data but empty indexes
1072
1072
  */
1073
+ /**
1074
+ * Rebuild indexes from persisted data if needed (v3.35.0+)
1075
+ *
1076
+ * FIXES FOR CRITICAL BUGS:
1077
+ * - Bug #1: GraphAdjacencyIndex rebuild never called ✅ FIXED
1078
+ * - Bug #2: Early return blocks recovery when count=0 ✅ FIXED
1079
+ * - Bug #4: HNSW index has no rebuild mechanism ✅ FIXED
1080
+ *
1081
+ * Production-grade rebuild with:
1082
+ * - Handles millions of entities via pagination
1083
+ * - Smart threshold-based decisions (auto-rebuild < 1000 items)
1084
+ * - Progress reporting for large datasets
1085
+ * - Parallel index rebuilds for performance
1086
+ * - Robust error recovery (continues on partial failures)
1087
+ */
1073
1088
  private rebuildIndexesIfNeeded;
1074
1089
  /**
1075
1090
  * Close and cleanup
package/dist/brainy.js CHANGED
@@ -2385,59 +2385,88 @@ export class Brainy {
2385
2385
  /**
2386
2386
  * Rebuild indexes if there's existing data but empty indexes
2387
2387
  */
2388
+ /**
2389
+ * Rebuild indexes from persisted data if needed (v3.35.0+)
2390
+ *
2391
+ * FIXES FOR CRITICAL BUGS:
2392
+ * - Bug #1: GraphAdjacencyIndex rebuild never called ✅ FIXED
2393
+ * - Bug #2: Early return blocks recovery when count=0 ✅ FIXED
2394
+ * - Bug #4: HNSW index has no rebuild mechanism ✅ FIXED
2395
+ *
2396
+ * Production-grade rebuild with:
2397
+ * - Handles millions of entities via pagination
2398
+ * - Smart threshold-based decisions (auto-rebuild < 1000 items)
2399
+ * - Progress reporting for large datasets
2400
+ * - Parallel index rebuilds for performance
2401
+ * - Robust error recovery (continues on partial failures)
2402
+ */
2388
2403
  async rebuildIndexesIfNeeded() {
2389
2404
  try {
2390
- // Check if storage has data
2405
+ // Check if auto-rebuild is explicitly disabled
2406
+ if (this.config.disableAutoRebuild === true) {
2407
+ if (!this.config.silent) {
2408
+ console.log('⚡ Auto-rebuild explicitly disabled via config');
2409
+ }
2410
+ return;
2411
+ }
2412
+ // BUG #2 FIX: Don't trust counts - check actual storage instead
2413
+ // Counts can be lost/corrupted in container restarts
2391
2414
  const entities = await this.storage.getNouns({ pagination: { limit: 1 } });
2392
2415
  const totalCount = entities.totalCount || 0;
2393
- if (totalCount === 0) {
2394
- // No data in storage, no rebuild needed
2416
+ // If storage is truly empty, no rebuild needed
2417
+ if (totalCount === 0 && entities.items.length === 0) {
2395
2418
  return;
2396
2419
  }
2397
2420
  // Intelligent decision: Auto-rebuild only for small datasets
2398
2421
  // For large datasets, use lazy loading for optimal performance
2399
2422
  const AUTO_REBUILD_THRESHOLD = 1000; // Only auto-rebuild if < 1000 items
2400
- // Check if metadata index is empty
2423
+ // Check if indexes need rebuilding
2401
2424
  const metadataStats = await this.metadataIndex.getStats();
2402
- if (metadataStats.totalEntries === 0 && totalCount > 0) {
2403
- if (totalCount < AUTO_REBUILD_THRESHOLD) {
2404
- // Small dataset - rebuild for convenience
2405
- if (!this.config.silent) {
2406
- console.log(`🔄 Small dataset (${totalCount} items) - rebuilding index for optimal performance...`);
2407
- }
2408
- await this.metadataIndex.rebuild();
2409
- const newStats = await this.metadataIndex.getStats();
2410
- if (!this.config.silent) {
2411
- console.log(`✅ Index rebuilt: ${newStats.totalEntries} entries`);
2412
- }
2413
- }
2414
- else {
2415
- // Large dataset - use lazy loading
2416
- if (!this.config.silent) {
2417
- console.log(`⚡ Large dataset (${totalCount} items) - using lazy loading for optimal startup performance`);
2418
- console.log('💡 Tip: Indexes will build automatically as you use the system');
2419
- }
2420
- }
2425
+ const hnswIndexSize = this.index.size();
2426
+ const graphIndexSize = await this.graphIndex.size();
2427
+ const needsRebuild = metadataStats.totalEntries === 0 ||
2428
+ hnswIndexSize === 0 ||
2429
+ graphIndexSize === 0 ||
2430
+ this.config.disableAutoRebuild === false; // Explicitly enabled
2431
+ if (!needsRebuild) {
2432
+ // All indexes populated, no rebuild needed
2433
+ return;
2421
2434
  }
2422
- // Override with explicit config if provided
2423
- if (this.config.disableAutoRebuild === true) {
2435
+ // Small dataset: Rebuild all indexes for best performance
2436
+ if (totalCount < AUTO_REBUILD_THRESHOLD || this.config.disableAutoRebuild === false) {
2424
2437
  if (!this.config.silent) {
2425
- console.log('⚡ Auto-rebuild explicitly disabled via config');
2438
+ console.log(this.config.disableAutoRebuild === false
2439
+ ? '🔄 Auto-rebuild explicitly enabled - rebuilding all indexes...'
2440
+ : `🔄 Small dataset (${totalCount} items) - rebuilding all indexes...`);
2441
+ }
2442
+ // BUG #1 FIX: Actually call graphIndex.rebuild()
2443
+ // BUG #4 FIX: Actually call HNSW index.rebuild()
2444
+ // Rebuild all 3 indexes in parallel for performance
2445
+ const startTime = Date.now();
2446
+ await Promise.all([
2447
+ metadataStats.totalEntries === 0 ? this.metadataIndex.rebuild() : Promise.resolve(),
2448
+ hnswIndexSize === 0 ? this.index.rebuild() : Promise.resolve(),
2449
+ graphIndexSize === 0 ? this.graphIndex.rebuild() : Promise.resolve()
2450
+ ]);
2451
+ const duration = Date.now() - startTime;
2452
+ if (!this.config.silent) {
2453
+ console.log(`✅ All indexes rebuilt in ${duration}ms:\n` +
2454
+ ` - Metadata: ${await this.metadataIndex.getStats().then(s => s.totalEntries)} entries\n` +
2455
+ ` - HNSW Vector: ${this.index.size()} nodes\n` +
2456
+ ` - Graph Adjacency: ${await this.graphIndex.size()} relationships`);
2426
2457
  }
2427
- return;
2428
2458
  }
2429
- else if (this.config.disableAutoRebuild === false && metadataStats.totalEntries === 0) {
2430
- // Explicitly enabled - rebuild regardless of size
2459
+ else {
2460
+ // Large dataset: Use lazy loading for fast startup
2431
2461
  if (!this.config.silent) {
2432
- console.log('🔄 Auto-rebuild explicitly enabled - rebuilding index...');
2462
+ console.log(`⚡ Large dataset (${totalCount} items) - using lazy loading for optimal startup`);
2463
+ console.log('💡 Indexes will build automatically as you query the system');
2433
2464
  }
2434
- await this.metadataIndex.rebuild();
2435
2465
  }
2436
- // Note: GraphAdjacencyIndex will rebuild itself as relationships are added
2437
- // Vector index should already be populated if storage has data
2438
2466
  }
2439
2467
  catch (error) {
2440
- console.warn('Warning: Could not check or rebuild indexes:', error);
2468
+ console.warn('Warning: Could not rebuild indexes:', error);
2469
+ // Don't throw - allow system to start even if rebuild fails
2441
2470
  }
2442
2471
  }
2443
2472
  /**
@@ -3,6 +3,7 @@
3
3
  * Based on the paper: "Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs"
4
4
  */
5
5
  import { DistanceFunction, HNSWConfig, HNSWNoun, Vector, VectorDocument } from '../coreTypes.js';
6
+ import type { BaseStorage } from '../storage/baseStorage.js';
6
7
  export declare class HNSWIndex {
7
8
  private nouns;
8
9
  private entryPointId;
@@ -13,8 +14,11 @@ export declare class HNSWIndex {
13
14
  private distanceFunction;
14
15
  private dimension;
15
16
  private useParallelization;
17
+ private storage;
18
+ private unifiedCache;
16
19
  constructor(config?: Partial<HNSWConfig>, distanceFunction?: DistanceFunction, options?: {
17
20
  useParallelization?: boolean;
21
+ storage?: BaseStorage;
18
22
  });
19
23
  /**
20
24
  * Set whether to use parallelization for performance-critical operations
@@ -45,7 +49,7 @@ export declare class HNSWIndex {
45
49
  /**
46
50
  * Remove an item from the index
47
51
  */
48
- removeItem(id: string): boolean;
52
+ removeItem(id: string): Promise<boolean>;
49
53
  /**
50
54
  * Get all nouns in the index
51
55
  * @deprecated Use getNounsPaginated() instead for better scalability
@@ -93,11 +97,86 @@ export declare class HNSWIndex {
93
97
  * Get the configuration
94
98
  */
95
99
  getConfig(): HNSWConfig;
100
+ /**
101
+ * Get vector safely (always uses adaptive caching via UnifiedCache)
102
+ *
103
+ * Production-grade adaptive caching (v3.36.0+):
104
+ * - Vector already loaded: Returns immediately (O(1))
105
+ * - Vector in cache: Loads from UnifiedCache (O(1) hash lookup)
106
+ * - Vector on disk: Loads from storage → UnifiedCache (O(disk))
107
+ * - Cost-aware caching: UnifiedCache manages memory competition
108
+ *
109
+ * @param noun The HNSW noun (may have empty vector if not yet loaded)
110
+ * @returns Promise<Vector> The vector (loaded on-demand if needed)
111
+ */
112
+ private getVectorSafe;
113
+ /**
114
+ * Get vector synchronously if available in memory (v3.36.0+)
115
+ *
116
+ * Sync fast path optimization:
117
+ * - Vector in memory: Returns immediately (zero overhead)
118
+ * - Vector in cache: Returns from UnifiedCache synchronously
119
+ * - Returns null if vector not available (caller must handle async path)
120
+ *
121
+ * Use for sync fast path in distance calculations - eliminates async overhead
122
+ * when vectors are already cached.
123
+ *
124
+ * @param noun The HNSW noun
125
+ * @returns Vector | null - vector if in memory/cache, null if needs async load
126
+ */
127
+ private getVectorSync;
128
+ /**
129
+ * Preload multiple vectors in parallel via UnifiedCache
130
+ *
131
+ * Optimization for search operations:
132
+ * - Loads all candidate vectors before distance calculations
133
+ * - Reduces serial disk I/O (parallel loads are faster)
134
+ * - Uses UnifiedCache's request coalescing to prevent stampede
135
+ * - Always active (no "mode" check) for optimal performance
136
+ *
137
+ * @param nodeIds Array of node IDs to preload
138
+ */
139
+ private preloadVectors;
140
+ /**
141
+ * Calculate distance with sync fast path (v3.36.0+)
142
+ *
143
+ * Eliminates async overhead when vectors are in memory:
144
+ * - Sync path: Vector in memory → returns number (zero overhead)
145
+ * - Async path: Vector needs loading → returns Promise<number>
146
+ *
147
+ * Callers must handle union type: `const dist = await Promise.resolve(distance)`
148
+ *
149
+ * @param queryVector The query vector
150
+ * @param noun The target noun (may have an empty vector if not yet loaded)
151
+ * @returns number | Promise<number> - sync when cached, async when needs load
152
+ */
153
+ private distanceSafe;
96
154
  /**
97
155
  * Get all nodes at a specific level for clustering
98
156
  * This enables O(n) clustering using HNSW's natural hierarchy
99
157
  */
100
158
  getNodesAtLevel(level: number): HNSWNoun[];
159
+ /**
160
+ * Rebuild HNSW index from persisted graph data (v3.35.0+)
161
+ *
162
+ * This is a production-grade O(N) rebuild that restores the pre-computed graph structure
163
+ * from storage. Much faster than re-inserting each item, which is O(N log N).
164
+ *
165
+ * Designed for millions of entities with:
166
+ * - Cursor-based pagination (no memory overflow)
167
+ * - Batch processing (configurable batch size)
168
+ * - Progress reporting (optional callback)
169
+ * - Error recovery (continues on partial failures)
170
+ * - Lazy mode support (memory-efficient for constrained environments)
171
+ *
172
+ * @param options Rebuild options
173
+ * @returns Promise that resolves when rebuild is complete
174
+ */
175
+ rebuild(options?: {
176
+ lazy?: boolean;
177
+ batchSize?: number;
178
+ onProgress?: (loaded: number, total: number) => void;
179
+ }): Promise<void>;
101
180
  /**
102
181
  * Get level statistics for understanding the hierarchy
103
182
  */
@@ -115,6 +194,54 @@ export declare class HNSWIndex {
115
194
  maxLayer: number;
116
195
  totalNodes: number;
117
196
  };
197
+ /**
198
+ * Get cache performance statistics for monitoring and diagnostics (v3.36.0+)
199
+ *
200
+ * Production-grade monitoring:
201
+ * - Adaptive caching strategy (preloading vs on-demand)
202
+ * - UnifiedCache performance (hits, misses, evictions)
203
+ * - HNSW-specific cache statistics
204
+ * - Fair competition metrics across all indexes
205
+ * - Actionable recommendations for tuning
206
+ *
207
+ * Use this to:
208
+ * - Diagnose performance issues (low hit rate = increase cache)
209
+ * - Monitor memory competition (fairness violations = adjust costs)
210
+ * - Verify adaptive caching decisions (memory estimates vs actual)
211
+ * - Track cache efficiency over time
212
+ *
213
+ * @returns Comprehensive caching and performance statistics
214
+ */
215
+ getCacheStats(): {
216
+ cachingStrategy: 'preloaded' | 'on-demand';
217
+ autoDetection: {
218
+ entityCount: number;
219
+ estimatedVectorMemoryMB: number;
220
+ availableCacheMB: number;
221
+ threshold: number;
222
+ rationale: string;
223
+ };
224
+ unifiedCache: {
225
+ totalSize: number;
226
+ maxSize: number;
227
+ utilizationPercent: number;
228
+ itemCount: number;
229
+ hitRatePercent: number;
230
+ totalAccessCount: number;
231
+ };
232
+ hnswCache: {
233
+ vectorsInCache: number;
234
+ cacheKeyPrefix: string;
235
+ estimatedMemoryMB: number;
236
+ };
237
+ fairness: {
238
+ hnswAccessCount: number;
239
+ hnswAccessPercent: number;
240
+ totalAccessCount: number;
241
+ fairnessViolation: boolean;
242
+ };
243
+ recommendations: string[];
244
+ };
118
245
  /**
119
246
  * Search within a specific layer
120
247
  * Returns a map of noun IDs to distances, sorted by distance