npm - @soulcraft/brainy - Versions diffs - 2.0.1 → 2.1.0 - Mend

@soulcraft/brainy 2.0.1 → 2.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12) hide show

package/README.md +141 -28
package/dist/augmentations/neuralImport.d.ts +12 -3
package/dist/augmentations/neuralImport.js +195 -55
package/dist/augmentations/typeMatching/intelligentTypeMatcher.d.ts +83 -0
package/dist/augmentations/typeMatching/intelligentTypeMatcher.js +425 -0
package/dist/brainyData.d.ts +4 -4
package/dist/brainyData.js +34 -28
package/dist/importManager.d.ts +78 -0
package/dist/importManager.js +258 -0
package/dist/neural/embeddedPatterns.d.ts +1 -1
package/dist/neural/embeddedPatterns.js +1 -1
package/package.json +5 -3

package/README.md CHANGED Viewed

@@ -9,18 +9,24 @@
 [![MIT License](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)
 [![TypeScript](https://img.shields.io/badge/%3C%2F%3E-TypeScript-%230074c1.svg)](https://www.typescriptlang.org/)
-**🧠 Brainy 2.0 - Zero-Configuration AI Database with Triple Intelligence™**
+**🧠 Brainy 2.0 - The Universal Knowledge Protocol™**
-The industry's first truly zero-configuration AI database that combines vector similarity, metadata filtering, and graph relationships with O(log n) performance. Production-ready with 3ms search latency, 220 pre-computed NLP patterns, and only 24MB memory footprint.
+**World's first Triple Intelligence™ database**—unifying vector similarity, graph relationships, and document filtering in one magical API. Model ANY data from ANY domain using 31 standardized noun types × 40 verb types.
+**Why Brainy Leads**: We're the first to solve the impossible—combining three different database paradigms (vector, graph, document) into one unified query interface. This breakthrough enables us to be the Universal Knowledge Protocol where all tools, augmentations, and AI models speak the same language.
+**Build once, integrate everywhere.** O(log n) performance, 3ms search latency, 24MB memory footprint.
 ## 🎉 What's New in 2.0
-- **Triple Intelligence™**: Unified Vector + Metadata + Graph queries in one API
+- **World's First Triple Intelligence™**: Unified vector + graph + document in ONE query
+- **Universal Knowledge Protocol**: 31 nouns × 40 verbs standardize all knowledge
+- **Infinite Expressiveness**: Model ANY data with unlimited metadata
 - **API Consolidation**: 15+ methods → 2 clean APIs (`search()` and `find()`)
 - **Natural Language**: Ask questions in plain English
 - **Zero Configuration**: Works instantly, no setup required
 - **O(log n) Performance**: Binary search on sorted indices
-- **220+ NLP Patterns**: Pre-computed for instant understanding
+- **Perfect Interoperability**: All tools and AI models speak the same language
 - **Universal Compatibility**: Node.js, Browser, Edge, Workers
 ## ⚡ Quick Start
@@ -35,29 +41,53 @@ import { BrainyData } from 'brainy'
 const brain = new BrainyData()
 await brain.init()
-// Add data with automatic embedding
-await brain.addNoun("JavaScript is a programming language", {
+// Add entities (nouns) with automatic embedding
+const jsId = await brain.addNoun("JavaScript is a programming language", {
   type: "language",
-  year: 1995
+  year: 1995,
+  paradigm: "multi-paradigm"
 })
-// Natural language search
-const results = await brain.find("programming languages from the 90s")
+const nodeId = await brain.addNoun("Node.js runtime environment", {
+  type: "runtime",
+  year: 2009,
+  platform: "server-side"
+})
-// Vector similarity with metadata filtering
-const filtered = await brain.search("JavaScript", {
-  metadata: { type: "language" },
-  limit: 5
+// Create relationships (verbs) between entities
+await brain.addVerb(nodeId, jsId, "executes", {
+  since: 2009,
+  performance: "high"
+})
+// Natural language search with graph relationships
+const results = await brain.find("programming languages used by server runtimes")
+// Triple Intelligence: vector + metadata + relationships
+const filtered = await brain.find({
+  like: "JavaScript",                    // Vector similarity
+  where: { type: "language" },           // Metadata filtering
+  connected: { from: nodeId, depth: 1 }  // Graph relationships
 })
 ```
 ## 🚀 Key Features
-### Triple Intelligence Engine
-Combines three search paradigms in one unified API:
+### World's First Triple Intelligence™ Engine
+**The breakthrough that enables the Universal Knowledge Protocol:**
 - **Vector Search**: Semantic similarity with HNSW indexing
-- **Metadata Filtering**: O(log n) field lookups with binary search
-- **Graph Relationships**: Navigate connected knowledge
+- **Graph Relationships**: Navigate connected knowledge like Neo4j
+- **Document Filtering**: MongoDB-style queries with O(log n) performance
+- **Unified in ONE API**: No separate queries, no complex joins
+- **First to solve this**: Others do vector OR graph OR document—we do ALL
+### Universal Knowledge Protocol with Infinite Expressiveness
+**Enabled by Triple Intelligence, standardized for everyone:**
+- **24 Noun Types × 40 Verb Types**: 960 base combinations
+- **∞ Expressiveness**: Unlimited metadata = model ANY data
+- **One Language**: All tools, augmentations, AI models speak the same types
+- **Perfect Interoperability**: Move data between any Brainy instance
+- **No Schema Lock-in**: Evolve without migrations
 ### Natural Language Understanding
 ```javascript
@@ -108,17 +138,26 @@ const results = await brain.find({
 ### CRUD Operations
 ```javascript
-// Create
+// Create entities (nouns)
 const id = await brain.addNoun(data, metadata)
+// Create relationships (verbs)
+const verbId = await brain.addVerb(sourceId, targetId, "relationType", {
+  strength: 0.9,
+  bidirectional: false
+})
 // Read
 const item = await brain.getNoun(id)
+const verb = await brain.getVerb(verbId)
 // Update
 await brain.updateNoun(id, newData, newMetadata)
+await brain.updateVerb(verbId, newMetadata)
 // Delete
 await brain.deleteNoun(id)
+await brain.deleteVerb(verbId)
 // Bulk operations
 await brain.import(arrayOfData)
@@ -127,16 +166,35 @@ const exported = await brain.export({ format: 'json' })
 ## 🎯 Use Cases
-### Knowledge Management
+### Knowledge Management with Relationships
 ```javascript
-// Store and search documentation
-await brain.addNoun(documentContent, {
+// Store documentation with rich relationships
+const apiGuide = await brain.addNoun("REST API Guide", {
   title: "API Guide",
   category: "documentation",
   version: "2.0"
 })
-const docs = await brain.find("API documentation for version 2")
+const author = await brain.addNoun("Jane Developer", {
+  type: "person",
+  role: "tech-lead"
+})
+const project = await brain.addNoun("E-commerce Platform", {
+  type: "project",
+  status: "active"
+})
+// Create knowledge graph
+await brain.addVerb(author, apiGuide, "authored", {
+  date: "2024-03-15"
+})
+await brain.addVerb(apiGuide, project, "documents", {
+  coverage: "complete"
+})
+// Query the knowledge graph naturally
+const docs = await brain.find("documentation authored by tech leads for active projects")
 ```
 ### Semantic Search
@@ -148,17 +206,35 @@ const similar = await brain.search(existingContent, {
 })
 ```
-### AI Memory Layer
+### AI Memory Layer with Context
 ```javascript
-// Store conversation context
-await brain.addNoun(userMessage, {
-  userId: "123",
+// Store conversation with relationships
+const userId = await brain.addNoun("User 123", {
+  type: "user",
+  tier: "premium"
+})
+const messageId = await brain.addNoun(userMessage, {
+  type: "message",
   timestamp: Date.now(),
   session: "abc"
 })
-// Retrieve relevant context
-const context = await brain.find(`previous conversations with user 123`)
+const topicId = await brain.addNoun("Product Support", {
+  type: "topic",
+  category: "support"
+})
+// Link conversation elements
+await brain.addVerb(userId, messageId, "sent")
+await brain.addVerb(messageId, topicId, "about")
+// Retrieve context with relationships
+const context = await brain.find({
+  where: { type: "message" },
+  connected: { from: userId, type: "sent" },
+  like: "previous product issues"
+})
 ```
 ## 💾 Storage Options
@@ -272,6 +348,42 @@ Key changes:
 We welcome contributions! See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
+## 🧠 The Universal Knowledge Protocol Explained
+### How We Achieved The Impossible
+**Triple Intelligence™** makes us the **world's first** to unify three database paradigms:
+1. **Vector databases** (Pinecone, Weaviate) - semantic similarity
+2. **Graph databases** (Neo4j, ArangoDB) - relationships
+3. **Document databases** (MongoDB, Elasticsearch) - metadata filtering
+**One API to rule them all.** Others make you choose. We unified them.
+### The Math of Infinite Expressiveness
+```
+24 Nouns × 40 Verbs × ∞ Metadata × Triple Intelligence = Universal Protocol
+```
+- **960 base combinations** from standardized types
+- **∞ domain specificity** via unlimited metadata
+- **∞ relationship depth** via graph traversal
+- **= Model ANYTHING**: From quantum physics to social networks
+### Why This Changes Everything
+**Like HTTP for the web, Brainy for knowledge:**
+- All augmentations compose perfectly - same noun-verb language
+- All AI models share knowledge - GPT, Claude, Llama all understand
+- All tools integrate seamlessly - no translation layers
+- All data flows freely - perfect portability
+**The Vision**: One protocol. All knowledge. Every tool. Any AI.
+**Proven across industries**: Healthcare, Finance, Manufacturing, Education, Legal, Retail, Government, and beyond.
+[→ See the Mathematical Proof & Full Taxonomy](docs/architecture/noun-verb-taxonomy.md)
 ## 📖 Documentation
 - [Getting Started Guide](docs/guides/getting-started.md)
@@ -279,6 +391,7 @@ We welcome contributions! See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
 - [Architecture Overview](docs/architecture/overview.md)
 - [Natural Language Guide](docs/guides/natural-language.md)
 - [Triple Intelligence](docs/architecture/triple-intelligence.md)
+- [Noun-Verb Taxonomy](docs/architecture/noun-verb-taxonomy.md)
 ## 🏢 Enterprise & Cloud

package/dist/augmentations/neuralImport.d.ts CHANGED Viewed

@@ -59,6 +59,7 @@ export declare class NeuralImportAugmentation extends BaseAugmentation {
     readonly priority = 80;
     private config;
     private analysisCache;
+    private typeMatcher;
     constructor(config?: Partial<NeuralImportConfig>);
     protected onInitialize(): Promise<void>;
     protected onShutdown(): Promise<void>;
@@ -79,15 +80,23 @@ export declare class NeuralImportAugmentation extends BaseAugmentation {
      */
     private parseRawData;
     /**
-     * Parse CSV data
+     * Parse CSV data - handles quoted values, escaped quotes, and edge cases
      */
     private parseCSV;
+    /**
+     * Parse YAML data
+     */
+    private parseYAML;
+    /**
+     * Parse a YAML value (handle strings, numbers, booleans, null)
+     */
+    private parseYAMLValue;
     /**
      * Perform neural analysis on parsed data
      */
     private performNeuralAnalysis;
     /**
-     * Infer noun type from object structure
+     * Infer noun type from object structure using intelligent type matching
      */
     private inferNounType;
     /**
@@ -95,7 +104,7 @@ export declare class NeuralImportAugmentation extends BaseAugmentation {
      */
     private detectRelationships;
     /**
-     * Infer verb type from field name
+     * Infer verb type from field name using intelligent type matching
      */
     private inferVerbType;
     /**

package/dist/augmentations/neuralImport.js CHANGED Viewed

@@ -8,6 +8,8 @@
  */
 import { BaseAugmentation } from './brainyAugmentation.js';
 import * as path from '../universal/path.js';
+import { getTypeMatcher } from './typeMatching/intelligentTypeMatcher.js';
+import { prodLog } from '../utils/logger.js';
 /**
  * Neural Import Augmentation - Unified Implementation
  * Processes data with AI before storage operations
@@ -20,6 +22,7 @@ export class NeuralImportAugmentation extends BaseAugmentation {
         this.operations = ['add', 'addNoun', 'addVerb', 'all']; // Use 'all' to catch batch operations
         this.priority = 80; // High priority for data processing
         this.analysisCache = new Map();
+        this.typeMatcher = null;
         this.config = {
             confidenceThreshold: 0.7,
             enableWeights: true,
@@ -29,7 +32,13 @@ export class NeuralImportAugmentation extends BaseAugmentation {
         };
     }
     async onInitialize() {
-        this.log('🧠 Neural Import augmentation initialized');
+        try {
+            this.typeMatcher = await getTypeMatcher();
+            this.log('🧠 Neural Import augmentation initialized with intelligent type matching');
+        }
+        catch (error) {
+            this.log('⚠️ Failed to initialize type matcher, falling back to heuristics', 'warn');
+        }
     }
     async onShutdown() {
         this.analysisCache.clear();
@@ -128,13 +137,7 @@ export class NeuralImportAugmentation extends BaseAugmentation {
                 return this.parseCSV(content);
             case 'yaml':
             case 'yml':
-                // For now, basic YAML support - in full implementation would use yaml parser
-                try {
-                    return JSON.parse(content); // Placeholder
-                }
-                catch {
-                    return [{ text: content }];
-                }
+                return this.parseYAML(content);
             case 'txt':
             case 'text':
                 // Split text into sentences/paragraphs for analysis
@@ -145,24 +148,174 @@ export class NeuralImportAugmentation extends BaseAugmentation {
         }
     }
     /**
-     * Parse CSV data
+     * Parse CSV data - handles quoted values, escaped quotes, and edge cases
      */
     parseCSV(content) {
-        const lines = content.split('\n').filter(line => line.trim());
+        const lines = content.split('\n');
         if (lines.length === 0)
             return [];
-        const headers = lines[0].split(',').map(h => h.trim());
+        // Parse a CSV line handling quotes
+        const parseLine = (line) => {
+            const result = [];
+            let current = '';
+            let inQuotes = false;
+            let i = 0;
+            while (i < line.length) {
+                const char = line[i];
+                const nextChar = line[i + 1];
+                if (char === '"') {
+                    if (inQuotes && nextChar === '"') {
+                        // Escaped quote
+                        current += '"';
+                        i += 2;
+                    }
+                    else {
+                        // Toggle quote mode
+                        inQuotes = !inQuotes;
+                        i++;
+                    }
+                }
+                else if (char === ',' && !inQuotes) {
+                    // Field separator
+                    result.push(current.trim());
+                    current = '';
+                    i++;
+                }
+                else {
+                    current += char;
+                    i++;
+                }
+            }
+            // Add last field
+            result.push(current.trim());
+            return result;
+        };
+        // Parse headers
+        const headers = parseLine(lines[0]);
         const data = [];
+        // Parse data rows
         for (let i = 1; i < lines.length; i++) {
-            const values = lines[i].split(',').map(v => v.trim());
+            const line = lines[i].trim();
+            if (!line)
+                continue; // Skip empty lines
+            const values = parseLine(line);
             const row = {};
             headers.forEach((header, index) => {
-                row[header] = values[index] || '';
+                const value = values[index] || '';
+                // Try to parse numbers
+                const num = Number(value);
+                row[header] = !isNaN(num) && value !== '' ? num : value;
             });
             data.push(row);
         }
         return data;
     }
+    /**
+     * Parse YAML data
+     */
+    parseYAML(content) {
+        try {
+            // Simple YAML parser for basic structures
+            // For full YAML support, we'd use js-yaml library
+            const lines = content.split('\n');
+            const result = [];
+            let currentObject = null;
+            let currentIndent = 0;
+            for (const line of lines) {
+                const trimmed = line.trim();
+                if (!trimmed || trimmed.startsWith('#'))
+                    continue; // Skip empty lines and comments
+                // Calculate indentation
+                const indent = line.length - line.trimStart().length;
+                // Check for array item
+                if (trimmed.startsWith('- ')) {
+                    const value = trimmed.substring(2).trim();
+                    if (indent === 0) {
+                        // Top-level array item
+                        if (value.includes(':')) {
+                            // Object in array
+                            currentObject = {};
+                            result.push(currentObject);
+                            const [key, val] = value.split(':').map(s => s.trim());
+                            currentObject[key] = this.parseYAMLValue(val);
+                        }
+                        else {
+                            result.push(this.parseYAMLValue(value));
+                        }
+                    }
+                    else if (currentObject) {
+                        // Nested array
+                        const lastKey = Object.keys(currentObject).pop();
+                        if (lastKey) {
+                            if (!Array.isArray(currentObject[lastKey])) {
+                                currentObject[lastKey] = [];
+                            }
+                            currentObject[lastKey].push(this.parseYAMLValue(value));
+                        }
+                    }
+                }
+                else if (trimmed.includes(':')) {
+                    // Key-value pair
+                    const colonIndex = trimmed.indexOf(':');
+                    const key = trimmed.substring(0, colonIndex).trim();
+                    const value = trimmed.substring(colonIndex + 1).trim();
+                    if (indent === 0) {
+                        // Top-level object
+                        if (!currentObject) {
+                            currentObject = {};
+                            result.push(currentObject);
+                        }
+                        currentObject[key] = this.parseYAMLValue(value);
+                        currentIndent = 0;
+                    }
+                    else if (currentObject) {
+                        // Nested object
+                        if (indent > currentIndent && !value) {
+                            // Start of nested object
+                            const lastKey = Object.keys(currentObject).pop();
+                            if (lastKey) {
+                                currentObject[lastKey] = { [key]: '' };
+                            }
+                        }
+                        else {
+                            currentObject[key] = this.parseYAMLValue(value);
+                        }
+                        currentIndent = indent;
+                    }
+                }
+            }
+            // If we built a single object and not an array, wrap it
+            if (result.length === 0 && currentObject) {
+                result.push(currentObject);
+            }
+            return result.length > 0 ? result : [{ text: content }];
+        }
+        catch (error) {
+            prodLog.warn('YAML parsing failed, treating as text:', error);
+            return [{ text: content }];
+        }
+    }
+    /**
+     * Parse a YAML value (handle strings, numbers, booleans, null)
+     */
+    parseYAMLValue(value) {
+        if (!value || value === '~' || value === 'null')
+            return null;
+        if (value === 'true')
+            return true;
+        if (value === 'false')
+            return false;
+        // Remove quotes if present
+        if ((value.startsWith('"') && value.endsWith('"')) ||
+            (value.startsWith("'") && value.endsWith("'"))) {
+            return value.slice(1, -1);
+        }
+        // Try to parse as number
+        const num = Number(value);
+        if (!isNaN(num) && value !== '')
+            return num;
+        return value;
+    }
     /**
      * Perform neural analysis on parsed data
      */
@@ -177,14 +330,14 @@ export class NeuralImportAugmentation extends BaseAugmentation {
                 const entityId = item.id || item.name || item.title || `entity_${Date.now()}_${Math.random()}`;
                 detectedEntities.push({
                     originalData: item,
-                    nounType: this.inferNounType(item),
+                    nounType: await this.inferNounType(item),
                     confidence: 0.85,
                     suggestedId: String(entityId),
                     reasoning: 'Detected from structured data',
                     alternativeTypes: []
                 });
                 // Detect relationships from references
-                this.detectRelationships(item, entityId, detectedRelationships);
+                await this.detectRelationships(item, entityId, detectedRelationships);
             }
         }
         // Generate insights
@@ -216,36 +369,31 @@ export class NeuralImportAugmentation extends BaseAugmentation {
         };
     }
     /**
-     * Infer noun type from object structure
+     * Infer noun type from object structure using intelligent type matching
      */
-    inferNounType(obj) {
-        // Simple heuristics for type detection
-        if (obj.email || obj.username)
-            return 'Person';
-        if (obj.title && obj.content)
-            return 'Document';
-        if (obj.price || obj.product)
-            return 'Product';
-        if (obj.date || obj.timestamp)
-            return 'Event';
-        if (obj.url || obj.link)
-            return 'Resource';
-        if (obj.lat || obj.longitude)
-            return 'Location';
-        // Default fallback
-        return 'Entity';
+    async inferNounType(obj) {
+        if (!this.typeMatcher) {
+            // Initialize type matcher if not available
+            this.typeMatcher = await getTypeMatcher();
+        }
+        const result = await this.typeMatcher.matchNounType(obj);
+        // Log if confidence is low for debugging
+        if (result.confidence < 0.5) {
+            this.log(`Low confidence (${result.confidence.toFixed(2)}) for noun type: ${result.type}`, 'warn');
+        }
+        return result.type;
     }
     /**
      * Detect relationships from object references
      */
-    detectRelationships(obj, sourceId, relationships) {
+    async detectRelationships(obj, sourceId, relationships) {
         // Look for reference patterns
         for (const [key, value] of Object.entries(obj)) {
             if (key.endsWith('Id') || key.endsWith('_id') || key === 'parentId' || key === 'userId') {
                 relationships.push({
                     sourceId,
                     targetId: String(value),
-                    verbType: this.inferVerbType(key),
+                    verbType: await this.inferVerbType(key, obj, { id: value }),
                     confidence: 0.75,
                     weight: 1,
                     reasoning: `Reference detected in field: ${key}`,
@@ -259,7 +407,7 @@ export class NeuralImportAugmentation extends BaseAugmentation {
                         relationships.push({
                             sourceId,
                             targetId: String(targetId),
-                            verbType: this.inferVerbType(key),
+                            verbType: await this.inferVerbType(key, obj, { id: targetId }),
                             confidence: 0.7,
                             weight: 1,
                             reasoning: `Array reference in field: ${key}`,
@@ -271,27 +419,19 @@ export class NeuralImportAugmentation extends BaseAugmentation {
         }
     }
     /**
-     * Infer verb type from field name
+     * Infer verb type from field name using intelligent type matching
      */
-    inferVerbType(fieldName) {
-        const normalized = fieldName.toLowerCase();
-        if (normalized.includes('parent'))
-            return 'childOf';
-        if (normalized.includes('user'))
-            return 'belongsTo';
-        if (normalized.includes('author'))
-            return 'authoredBy';
-        if (normalized.includes('owner'))
-            return 'ownedBy';
-        if (normalized.includes('creator'))
-            return 'createdBy';
-        if (normalized.includes('member'))
-            return 'memberOf';
-        if (normalized.includes('tag'))
-            return 'taggedWith';
-        if (normalized.includes('category'))
-            return 'categorizedAs';
-        return 'relatedTo';
+    async inferVerbType(fieldName, sourceObj, targetObj) {
+        if (!this.typeMatcher) {
+            // Initialize type matcher if not available
+            this.typeMatcher = await getTypeMatcher();
+        }
+        const result = await this.typeMatcher.matchVerbType(sourceObj, targetObj, fieldName);
+        // Log if confidence is low for debugging
+        if (result.confidence < 0.5) {
+            this.log(`Low confidence (${result.confidence.toFixed(2)}) for verb type: ${result.type}`, 'warn');
+        }
+        return result.type;
     }
     /**
      * Group entities by type