npm - @soulcraft/brainy - Versions diffs - 3.35.0 → 3.36.1 - Mend

@soulcraft/brainy 3.35.0 → 3.36.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (14) hide show

package/CHANGELOG.md +62 -0
package/README.md +32 -12
package/dist/hnsw/hnswIndex.d.ts +104 -1
package/dist/hnsw/hnswIndex.js +282 -25
package/dist/hnsw/hnswIndexOptimized.d.ts +1 -2
package/dist/hnsw/hnswIndexOptimized.js +3 -5
package/dist/hnsw/partitionedHNSWIndex.js +1 -1
package/dist/interfaces/IIndex.d.ts +14 -5
package/dist/storage/adapters/gcsStorage.js +15 -5
package/dist/utils/memoryDetection.d.ts +119 -0
package/dist/utils/memoryDetection.js +321 -0
package/dist/utils/unifiedCache.d.ts +75 -1
package/dist/utils/unifiedCache.js +123 -4
package/package.json +1 -1

package/CHANGELOG.md CHANGED Viewed

@@ -2,6 +2,68 @@
 All notable changes to this project will be documented in this file. See [standard-version](https://github.com/conventional-changelog/standard-version) for commit guidelines.
+### [3.36.1](https://github.com/soulcraftlabs/brainy/compare/v3.36.0...v3.36.1) (2025-10-10)
+- fix: resolve critical GCS storage bugs preventing production use (3cd0b9a)
+### [3.36.0](https://github.com/soulcraftlabs/brainy/compare/v3.35.0...v3.36.0) (2025-10-10)
+#### 🚀 Always-Adaptive Caching with Enhanced Monitoring
+**Zero Breaking Changes** - Internal optimizations with automatic performance improvements
+#### What's New
+- **Renamed API**: `getLazyModeStats()` → `getCacheStats()` (backward compatible)
+- **Enhanced Metrics**: Changed `lazyModeEnabled: boolean` → `cachingStrategy: 'preloaded' | 'on-demand'`
+- **Improved Thresholds**: Updated preloading threshold from 30% to 80% for better cache utilization
+- **Better Terminology**: Eliminated "lazy mode" concept in favor of "adaptive caching strategy"
+- **Production Monitoring**: Comprehensive diagnostics for capacity planning and tuning
+#### Benefits
+- ✅ **Clearer Semantics**: "preloaded" vs "on-demand" instead of confusing "lazy mode enabled/disabled"
+- ✅ **Better Cache Utilization**: 80% threshold maximizes memory usage before switching to on-demand
+- ✅ **Enhanced Monitoring**: `getCacheStats()` provides actionable insights for production deployments
+- ✅ **Backward Compatible**: Deprecated `lazy` option still accepted (ignored, always adaptive)
+- ✅ **Zero Config**: System automatically chooses optimal strategy based on dataset size and available memory
+#### API Changes
+```typescript
+// New API (recommended)
+const stats = brain.hnsw.getCacheStats()
+console.log(`Strategy: ${stats.cachingStrategy}`) // 'preloaded' or 'on-demand'
+console.log(`Hit Rate: ${stats.unifiedCache.hitRatePercent}%`)
+console.log(`Recommendations: ${stats.recommendations.join(', ')}`)
+// Old API (deprecated but still works)
+const oldStats = brain.hnsw.getLazyModeStats() // Returns same data
+```
+#### Documentation Updates
+- Added comprehensive migration guide: `docs/guides/migration-3.36.0.md`
+- Added operations guide: `docs/operations/capacity-planning.md`
+- Updated architecture docs with new terminology
+- Renamed example: `monitor-lazy-mode.ts` → `monitor-cache-performance.ts`
+#### Files Changed
+- `src/hnsw/hnswIndex.ts`: Core adaptive caching improvements
+- `src/interfaces/IIndex.ts`: Updated interface documentation
+- `docs/guides/migration-3.36.0.md`: Complete migration guide
+- `docs/operations/capacity-planning.md`: Enterprise operations guide
+- `examples/monitor-cache-performance.ts`: Production monitoring example
+- All documentation updated to reflect new terminology
+#### Migration
+**No action required!** All changes are backward compatible. Update your code to use `getCacheStats()` when convenient.
+---
 ### [3.35.0](https://github.com/soulcraftlabs/brainy/compare/v3.34.0...v3.35.0) (2025-10-10)
 - feat: implement HNSW index rebuild and unified index interface (6a4d1ae)

package/README.md CHANGED Viewed

@@ -19,6 +19,32 @@
 ## 🎉 Key Features
+### ⚡ **NEW in 3.36.0: Production-Scale Memory & Performance**
+**Enterprise-grade adaptive sizing and zero-overhead optimizations:**
+- **🎯 Adaptive Memory Sizing**: Auto-scales from 2GB to 128GB+ based on available system resources
+  - Container-aware (Docker/K8s cgroups v1/v2 detection)
+  - Environment-smart (development 25%, container 40%, production 50% allocation)
+  - Model memory accounting (150MB Q8, 250MB FP32 reserved before cache)
+- **⚡ Sync Fast Path**: Zero async overhead when vectors are cached
+  - Intelligent sync/async branching - synchronous when data is in memory
+  - Falls back to async only when loading from storage
+  - Massive performance win for hot paths (vector search, distance calculations)
+- **📊 Production Monitoring**: Comprehensive diagnostics
+  - `getCacheStats()` - UnifiedCache hit rates, fairness metrics, memory pressure
+  - Actionable recommendations for tuning
+  - Tracks model memory, cache efficiency, and competition across indexes
+- **🛡️ Zero Breaking Changes**: All optimizations are internal - your code stays the same
+  - Public API unchanged
+  - Automatic memory detection and allocation
+  - Progressive enhancement for existing applications
+**[📖 Operations Guide →](docs/operations/capacity-planning.md)** | **[🎯 Migration Guide →](docs/guides/migration-3.36.0.md)**
 ### 🚀 **NEW in 3.21.0: Enhanced Import & Neural Processing**
 - **📊 Progress Tracking**: Unified progress reporting with automatic time estimation
@@ -38,7 +64,7 @@
 - **Modern Syntax**: `brain.add()`, `brain.find()`, `brain.relate()`
 - **Type Safety**: Full TypeScript integration
-- **Zero Config**: Works out of the box with memory storage
+- **Zero Config**: Works out of the box with intelligent storage auto-detection
 - **Consistent Parameters**: Clean, predictable API surface
 ### ⚡ **Performance & Reliability**
@@ -352,7 +378,7 @@ const brain = new Brainy()
 // 2. Custom configuration
 const brain = new Brainy({
-  storage: { type: 'memory' },
+  storage: { type: 'filesystem', path: './brainy-data' },
   embeddings: { model: 'all-MiniLM-L6-v2' },
   cache: { enabled: true, maxSize: 1000 }
 })
@@ -368,7 +394,7 @@ const customBrain = new Brainy({
 **What's Auto-Detected:**
-- **Storage**: S3/GCS/R2 → Filesystem → Memory (priority order)
+- **Storage**: S3/GCS/R2 → Filesystem (priority order)
 - **Models**: Always Q8 for optimal balance
 - **Features**: Minimal → Default → Full based on environment
 - **Memory**: Optimal cache sizes and batching
@@ -390,13 +416,12 @@ Most users **never need this** - zero-config handles everything. For advanced us
 const brain = new Brainy()  // Uses Q8 automatically
 // Storage control (auto-detected by default)
-const memoryBrain = new Brainy({storage: 'memory'})     // RAM only
-const diskBrain = new Brainy({storage: 'disk'})         // Local filesystem
+const diskBrain = new Brainy({storage: 'disk'})         // Local filesystem
 const cloudBrain = new Brainy({storage: 'cloud'})       // S3/GCS/R2
 // Legacy full config (still supported)
 const legacyBrain = new Brainy({
-    storage: {forceMemoryStorage: true}
+    storage: {type: 'filesystem', path: './data'}
 })
 ```
@@ -665,12 +690,7 @@ const context = await brain.find({
 Brainy supports multiple storage backends:
 ```javascript
-// Memory (default for testing)
-const brain = new Brainy({
-    storage: {type: 'memory'}
-})
-// FileSystem (Node.js)
+// FileSystem (Node.js - recommended for development)
 const brain = new Brainy({
     storage: {
         type: 'filesystem',

package/dist/hnsw/hnswIndex.d.ts CHANGED Viewed

@@ -15,6 +15,7 @@ export declare class HNSWIndex {
     private dimension;
     private useParallelization;
     private storage;
+    private unifiedCache;
     constructor(config?: Partial<HNSWConfig>, distanceFunction?: DistanceFunction, options?: {
         useParallelization?: boolean;
         storage?: BaseStorage;
@@ -48,7 +49,7 @@ export declare class HNSWIndex {
     /**
      * Remove an item from the index
      */
-    removeItem(id: string): boolean;
+    removeItem(id: string): Promise<boolean>;
     /**
      * Get all nouns in the index
      * @deprecated Use getNounsPaginated() instead for better scalability
@@ -96,6 +97,60 @@ export declare class HNSWIndex {
      * Get the configuration
      */
     getConfig(): HNSWConfig;
+    /**
+     * Get vector safely (always uses adaptive caching via UnifiedCache)
+     *
+     * Production-grade adaptive caching (v3.36.0+):
+     * - Vector already loaded: Returns immediately (O(1))
+     * - Vector in cache: Loads from UnifiedCache (O(1) hash lookup)
+     * - Vector on disk: Loads from storage → UnifiedCache (O(disk))
+     * - Cost-aware caching: UnifiedCache manages memory competition
+     *
+     * @param noun The HNSW noun (may have empty vector if not yet loaded)
+     * @returns Promise<Vector> The vector (loaded on-demand if needed)
+     */
+    private getVectorSafe;
+    /**
+     * Get vector synchronously if available in memory (v3.36.0+)
+     *
+     * Sync fast path optimization:
+     * - Vector in memory: Returns immediately (zero overhead)
+     * - Vector in cache: Returns from UnifiedCache synchronously
+     * - Returns null if vector not available (caller must handle async path)
+     *
+     * Use for sync fast path in distance calculations - eliminates async overhead
+     * when vectors are already cached.
+     *
+     * @param noun The HNSW noun
+     * @returns Vector | null - vector if in memory/cache, null if needs async load
+     */
+    private getVectorSync;
+    /**
+     * Preload multiple vectors in parallel via UnifiedCache
+     *
+     * Optimization for search operations:
+     * - Loads all candidate vectors before distance calculations
+     * - Reduces serial disk I/O (parallel loads are faster)
+     * - Uses UnifiedCache's request coalescing to prevent stampede
+     * - Always active (no "mode" check) for optimal performance
+     *
+     * @param nodeIds Array of node IDs to preload
+     */
+    private preloadVectors;
+    /**
+     * Calculate distance with sync fast path (v3.36.0+)
+     *
+     * Eliminates async overhead when vectors are in memory:
+     * - Sync path: Vector in memory → returns number (zero overhead)
+     * - Async path: Vector needs loading → returns Promise<number>
+     *
+     * Callers must handle union type: `const dist = await Promise.resolve(distance)`
+     *
+     * @param queryVector The query vector
+     * @param noun The target noun (may have empty vector in lazy mode)
+     * @returns number | Promise<number> - sync when cached, async when needs load
+     */
+    private distanceSafe;
     /**
      * Get all nodes at a specific level for clustering
      * This enables O(n) clustering using HNSW's natural hierarchy
@@ -139,6 +194,54 @@ export declare class HNSWIndex {
         maxLayer: number;
         totalNodes: number;
     };
+    /**
+     * Get cache performance statistics for monitoring and diagnostics (v3.36.0+)
+     *
+     * Production-grade monitoring:
+     * - Adaptive caching strategy (preloading vs on-demand)
+     * - UnifiedCache performance (hits, misses, evictions)
+     * - HNSW-specific cache statistics
+     * - Fair competition metrics across all indexes
+     * - Actionable recommendations for tuning
+     *
+     * Use this to:
+     * - Diagnose performance issues (low hit rate = increase cache)
+     * - Monitor memory competition (fairness violations = adjust costs)
+     * - Verify adaptive caching decisions (memory estimates vs actual)
+     * - Track cache efficiency over time
+     *
+     * @returns Comprehensive caching and performance statistics
+     */
+    getCacheStats(): {
+        cachingStrategy: 'preloaded' | 'on-demand';
+        autoDetection: {
+            entityCount: number;
+            estimatedVectorMemoryMB: number;
+            availableCacheMB: number;
+            threshold: number;
+            rationale: string;
+        };
+        unifiedCache: {
+            totalSize: number;
+            maxSize: number;
+            utilizationPercent: number;
+            itemCount: number;
+            hitRatePercent: number;
+            totalAccessCount: number;
+        };
+        hnswCache: {
+            vectorsInCache: number;
+            cacheKeyPrefix: string;
+            estimatedMemoryMB: number;
+        };
+        fairness: {
+            hnswAccessCount: number;
+            hnswAccessPercent: number;
+            totalAccessCount: number;
+            fairnessViolation: boolean;
+        };
+        recommendations: string[];
+    };
     /**
      * Search within a specific layer
      * Returns a map of noun IDs to distances, sorted by distance