npm - lynkr - Versions diffs - 8.0.0 → 8.0.1 - Mend

lynkr 8.0.0 → 8.0.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (102)

package/README.md +1 -1
package/package.json +1 -1
package/src/api/openai-router.js +34 -2
package/src/clients/standard-tools.js +23 -0
package/src/config/index.js +20 -0
package/src/orchestrator/index.js +2 -2
package/src/server.js +2 -12
package/src/tools/index.js +4 -0
package/src/tools/lazy-loader.js +7 -0
package/src/tools/tinyfish.js +358 -0
package/src/tools/truncate.js +1 -0
package/.github/FUNDING.yml +0 -15
package/.github/workflows/README.md +0 -215
package/.github/workflows/ci.yml +0 -69
package/.github/workflows/index.yml +0 -62
package/.github/workflows/web-tools-tests.yml +0 -56
package/CITATIONS.bib +0 -6
package/DEPLOYMENT.md +0 -1001
package/LYNKR-TUI-PLAN.md +0 -984
package/PERFORMANCE-REPORT.md +0 -866
package/PLAN-per-client-model-routing.md +0 -252
package/docs/42642f749da6234f41b6b425c3bb07c9.txt +0 -1
package/docs/BingSiteAuth.xml +0 -4
package/docs/docs-style.css +0 -478
package/docs/docs.html +0 -198
package/docs/google5be250e608e6da39.html +0 -1
package/docs/index.html +0 -577
package/docs/index.md +0 -584
package/docs/robots.txt +0 -4
package/docs/sitemap.xml +0 -44
package/docs/style.css +0 -1223
package/docs/toon-integration-spec.md +0 -130
package/documentation/README.md +0 -101
package/documentation/api.md +0 -806
package/documentation/claude-code-cli.md +0 -679
package/documentation/codex-cli.md +0 -397
package/documentation/contributing.md +0 -571
package/documentation/cursor-integration.md +0 -734
package/documentation/docker.md +0 -874
package/documentation/embeddings.md +0 -762
package/documentation/faq.md +0 -713
package/documentation/features.md +0 -403
package/documentation/headroom.md +0 -519
package/documentation/installation.md +0 -758
package/documentation/memory-system.md +0 -476
package/documentation/production.md +0 -636
package/documentation/providers.md +0 -1009
package/documentation/routing.md +0 -476
package/documentation/testing.md +0 -629
package/documentation/token-optimization.md +0 -325
package/documentation/tools.md +0 -697
package/documentation/troubleshooting.md +0 -969
package/final-test.js +0 -33
package/headroom-sidecar/config.py +0 -93
package/headroom-sidecar/requirements.txt +0 -14
package/headroom-sidecar/server.py +0 -451
package/monitor-agents.sh +0 -31
package/scripts/audit-log-reader.js +0 -399
package/scripts/compact-dictionary.js +0 -204
package/scripts/test-deduplication.js +0 -448
package/src/db/database.sqlite +0 -0
package/te +0 -11622
package/test/README.md +0 -212
package/test/azure-openai-config.test.js +0 -213
package/test/azure-openai-error-resilience.test.js +0 -238
package/test/azure-openai-format-conversion.test.js +0 -354
package/test/azure-openai-integration.test.js +0 -287
package/test/azure-openai-routing.test.js +0 -175
package/test/azure-openai-streaming.test.js +0 -171
package/test/bedrock-integration.test.js +0 -457
package/test/comprehensive-test-suite.js +0 -928
package/test/config-validation.test.js +0 -207
package/test/cursor-integration.test.js +0 -484
package/test/format-conversion.test.js +0 -578
package/test/hybrid-routing-integration.test.js +0 -269
package/test/hybrid-routing-performance.test.js +0 -428
package/test/llamacpp-integration.test.js +0 -882
package/test/lmstudio-integration.test.js +0 -347
package/test/memory/extractor.test.js +0 -398
package/test/memory/retriever.test.js +0 -613
package/test/memory/retriever.test.js.bak +0 -585
package/test/memory/search.test.js +0 -537
package/test/memory/search.test.js.bak +0 -389
package/test/memory/store.test.js +0 -344
package/test/memory/store.test.js.bak +0 -312
package/test/memory/surprise.test.js +0 -300
package/test/memory-performance.test.js +0 -472
package/test/openai-integration.test.js +0 -683
package/test/openrouter-error-resilience.test.js +0 -418
package/test/passthrough-mode.test.js +0 -385
package/test/performance-benchmark.js +0 -351
package/test/performance-tests.js +0 -528
package/test/routing.test.js +0 -225
package/test/toon-compression.test.js +0 -131
package/test/web-tools.test.js +0 -329
package/test-agents-simple.js +0 -43
package/test-cli-connection.sh +0 -33
package/test-learning-unit.js +0 -126
package/test-learning.js +0 -112
package/test-parallel-agents.sh +0 -124
package/test-parallel-direct.js +0 -155
package/test-subagents.sh +0 -117

package/documentation/features.md DELETED

@@ -1,403 +0,0 @@
-# Core Features & Architecture
-Complete guide to Lynkr's architecture, request flow, and core capabilities.
----
-## Architecture Overview
-```
-┌─────────────────┐
-│ Claude Code CLI │  or  Cursor IDE
-└────────┬────────┘
-         │ Anthropic/OpenAI Format
-         ↓
-┌─────────────────┐
-│  Lynkr Proxy    │
-│  Port: 8081     │
-│                 │
-│ • Format Conv.  │
-│ • Token Optim.  │
-│ • Provider Route│
-│ • Tool Calling  │
-│ • Caching       │
-└────────┬────────┘
-         │
-         ├──→ Databricks (Claude 4.5)
-         ├──→ AWS Bedrock (100+ models)
-         ├──→ OpenRouter (100+ models)
-         ├──→ Moonshot AI (Kimi K2)
-         ├──→ Ollama (local, free)
-         ├──→ llama.cpp (local, free)
-         ├──→ Azure OpenAI (GPT-4o, o1)
-         ├──→ OpenAI (GPT-4o, o3)
-         └──→ Azure Anthropic (Claude)
-```
----
-## Request Flow
-### 1. Request Reception
-**Entry Points:**
-- `/v1/messages` - Anthropic format (Claude Code CLI)
-- `/v1/chat/completions` - OpenAI format (Cursor IDE)
-**Middleware Stack:**
-1. Load shedding (reject if overloaded)
-2. Request logging (with correlation ID)
-3. Validation (schema check)
-4. Metrics collection
-5. Route to orchestrator
-### 2. Provider Routing
-**4-Tier Intelligent Routing:**
-Lynkr uses a multi-phase complexity analysis to route each request to the optimal model tier:
-| Tier (Score) | Typical Requests | Routes To |
-|--------------|------------------|-----------|
-| SIMPLE (0-25) | Greetings, simple Q&A | Cheap/local models (Ollama, llama.cpp) |
-| MEDIUM (26-50) | Code reading, simple edits | Mid-range models (GPT-4o, Claude Sonnet) |
-| COMPLEX (51-75) | Multi-file changes, debugging | Capable models (o1-mini, Claude Sonnet) |
-| REASONING (76-100) | Security audits, architecture | Best models (o1, Claude Opus) |
-Includes agentic workflow detection, 15-dimension weighted scoring, and cost optimization.
-See **[Routing & Model Tiering](routing.md)** for full details.
-**Automatic Fallback:**
-- If primary provider fails → Use FALLBACK_PROVIDER
-- Transparent to client
-- No request failures due to provider issues
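The score bands in the table and the fallback behavior described above can be sketched roughly as follows. This is an illustration only, not Lynkr's routing code: `tierForScore` and `invokeWithFallback` are hypothetical names, and the sketch assumes the complexity score has already been computed as a number from 0 to 100.

```javascript
// Hypothetical tier table mirroring the score bands in the deleted doc.
const TIERS = [
  { name: "SIMPLE", max: 25 },
  { name: "MEDIUM", max: 50 },
  { name: "COMPLEX", max: 75 },
  { name: "REASONING", max: 100 },
];

// Map a 0-100 complexity score to the first tier whose upper bound covers it.
function tierForScore(score) {
  if (!Number.isFinite(score) || score < 0 || score > 100) {
    throw new RangeError(`score must be in 0..100, got ${score}`);
  }
  return TIERS.find((tier) => score <= tier.max).name;
}

// Assumed fallback shape: try the primary provider and, on any error,
// retry the same request once against the fallback provider.
async function invokeWithFallback(primary, fallback, request) {
  try {
    return await primary(request);
  } catch (err) {
    return fallback(request);
  }
}
```

For example, `tierForScore(26)` returns `"MEDIUM"`, since 26 is above the SIMPLE band's upper bound of 25.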
-### 3. Format Conversion
-**Anthropic → Provider:**
-```javascript
-{
-  model: "claude-3-5-sonnet",
-  messages: [...],
-  tools: [...]
-}
-↓
-Provider-specific format
-(Databricks, Bedrock, OpenRouter, etc.)
-```
-**Provider → Anthropic:**
-```javascript
-Provider response
-↓
-{
-  id: "msg_...",
-  type: "message",
-  role: "assistant",
-  content: [{type: "text", text: "..."}],
-  usage: {input_tokens: 123, output_tokens: 456}
-}
-```
-### 4. Token Optimization
-**6 Phases Applied:**
-1. Smart tool selection
-2. Prompt caching
-3. Memory deduplication
-4. Tool response truncation
-5. Dynamic system prompts
-6. Conversation compression
-**Result:** 60-80% token reduction
-### 5. Tool Execution
-**Server Mode (default):**
-- Tools execute on Lynkr server
-- Access server filesystem
-- Server-side command execution
-**Client Mode (passthrough):**
-- Tools execute on CLI side
-- Access client filesystem
-- Client-side command execution
-### 6. Response Streaming
-**Token-by-Token Streaming:**
-```javascript
-// SSE format
-event: message
-data: {"type":"content_block_delta","delta":{"type":"text_delta","text":"Hello"}}
-event: message
-data: {"type":"content_block_delta","delta":{"type":"text_delta","text":" world"}}
-event: done
-data: {}
-```
-**Benefits:**
-- Real-time user feedback
-- Lower perceived latency
-- Better UX for long responses
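A minimal sketch of how frames like the ones in the SSE example above could be serialized. The helper names `formatSseEvent` and `textDelta` are hypothetical and not taken from the Lynkr source. The blank line that terminates each event is part of the SSE wire format.

```javascript
// Serialize one Server-Sent Events frame: an event name, a JSON data line,
// and the blank line that marks the end of the event.
function formatSseEvent(event, data) {
  return `event: ${event}\ndata: ${JSON.stringify(data)}\n\n`;
}

// Build a text delta frame shaped like the example payloads above.
function textDelta(text) {
  return formatSseEvent("message", {
    type: "content_block_delta",
    delta: { type: "text_delta", text },
  });
}
```

A server would write each returned string to the response stream as tokens arrive, then finish with `formatSseEvent("done", {})`.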
----
-## Core Components
-### API Layer (`src/api/`)
-**router.js** - Main routes
-- `/v1/messages` - Anthropic format
-- `/v1/chat/completions` - OpenAI format
-- `/v1/models` - List models
-- `/v1/embeddings` - Generate embeddings
-- `/health/*` - Health checks
-- `/metrics` - Prometheus metrics
-**Middleware:**
-- `load-shedding.js` - Overload protection
-- `request-logging.js` - Structured logging
-- `metrics.js` - Metrics collection
-- `validation.js` - Input validation
-- `error-handling.js` - Error formatting
-### Provider Clients (`src/clients/`)
-**databricks.js** - Main invocation function
-- `invokeModel()` - Route to provider
-- `invokeDatabricks()` - Databricks API
-- `invokeAzureAnthropic()` - Azure Anthropic
-- `invokeOpenRouter()` - OpenRouter
-- `invokeOllama()` - Ollama local
-- `invokeLlamaCpp()` - llama.cpp
-- `invokeBedrock()` - AWS Bedrock
-- `invokeMoonshot()` - Moonshot AI (Kimi)
-- `invokeZai()` - Z.AI (Zhipu AI)
-**Format converters:**
-- `openrouter-utils.js` - OpenAI format conversion
-- `bedrock-utils.js` - Bedrock format conversion
-**Reliability:**
-- `circuit-breaker.js` - Circuit breaker pattern
-- `retry.js` - Exponential backoff with jitter
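The doc names exponential backoff with jitter but does not show it. One common variant ("full jitter") is sketched below. The function names, defaults, and option names are assumptions for illustration and do not reflect the actual contents of `retry.js`.

```javascript
// Delay before retry number `attempt` (0-based): uniform in [0, ceiling),
// where the ceiling doubles each attempt up to maxMs.
function backoffDelayMs(attempt, { baseMs = 100, maxMs = 5000, random = Math.random } = {}) {
  const ceiling = Math.min(maxMs, baseMs * 2 ** attempt);
  return Math.floor(random() * ceiling);
}

// Call fn, retrying up to `retries` times with a jittered delay between attempts.
async function withRetry(fn, { retries = 3, ...delayOptions } = {}) {
  for (let attempt = 0; ; attempt += 1) {
    try {
      return await fn();
    } catch (err) {
      if (attempt >= retries) throw err; // out of retries: surface the last error
      const delay = backoffDelayMs(attempt, delayOptions);
      await new Promise((resolve) => setTimeout(resolve, delay));
    }
  }
}
```

Randomizing the delay spreads retries from many clients over time, so they do not all hit a recovering provider at the same moment.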
-### Orchestrator (`src/orchestrator/`)
-**Agent Loop:**
-1. Receive request
-2. Inject memories
-3. Call provider
-4. Execute tools (if requested)
-5. Return to provider
-6. Repeat until done (max 8 steps)
-7. Extract memories
-8. Return final response
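The loop above can be sketched as follows, under stated assumptions: `callProvider` and `executeTool` are hypothetical injected functions, the message and reply shapes are invented for illustration, and the memory injection and extraction steps (2 and 7) are omitted. The doc does not say what happens when the 8-step limit is reached, so throwing an error here is a choice made for the sketch.

```javascript
const MAX_STEPS = 8; // step limit stated in the doc

// Call the provider repeatedly, running any requested tools between calls,
// until the provider replies without tool calls or the step limit is hit.
async function runAgentLoop(request, { callProvider, executeTool }) {
  const messages = [...request.messages];
  for (let step = 0; step < MAX_STEPS; step += 1) {
    const reply = await callProvider(messages);
    messages.push({ role: "assistant", content: reply.content });
    if (!reply.toolCalls || reply.toolCalls.length === 0) {
      return { reply, steps: step + 1 }; // no tools requested: done
    }
    for (const call of reply.toolCalls) {
      const result = await executeTool(call);
      messages.push({ role: "tool", name: call.name, content: result });
    }
  }
  throw new Error(`agent loop exceeded ${MAX_STEPS} steps`);
}
```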
-**Features:**
-- Tool execution modes (server/client)
-- Policy enforcement
-- Memory injection/extraction
-- Token optimization
-### Tools (`src/tools/`)
-**Standard Tools:**
-- `workspace.js` - Read, Write, Edit files
-- `git.js` - Git operations
-- `bash.js` - Shell command execution
-- `test.js` - Test harness
-- `task.js` - Task tracking
-- `memory.js` - Memory management
-**MCP Tools:**
-- Dynamic tool registration
-- JSON-RPC 2.0 communication
-- Sandbox isolation (optional)
-### Caching (`src/cache/`)
-**Prompt Cache:**
-- LRU cache with TTL
-- SHA-256 keying
-- Hit rate tracking
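To make the three properties above concrete, here is a minimal sketch of an LRU cache with TTL, SHA-256 keys, and hit-rate tracking. The class name, option names, and defaults are assumptions, not Lynkr's implementation. The key is derived from `JSON.stringify`, so two payloads with the same fields in a different order would hash differently.

```javascript
const { createHash } = require("node:crypto");

class PromptCache {
  constructor({ maxEntries = 64, ttlMs = 300000, now = Date.now } = {}) {
    this.maxEntries = maxEntries;
    this.ttlMs = ttlMs;
    this.now = now;
    this.entries = new Map(); // Map keeps insertion order, used here as recency order
    this.hits = 0;
    this.misses = 0;
  }

  // SHA-256 hex digest of the serialized payload.
  static keyFor(payload) {
    return createHash("sha256").update(JSON.stringify(payload)).digest("hex");
  }

  get(payload) {
    const key = PromptCache.keyFor(payload);
    const entry = this.entries.get(key);
    if (!entry || entry.expiresAt <= this.now()) {
      this.entries.delete(key); // drop expired entry, if any
      this.misses += 1;
      return undefined;
    }
    this.entries.delete(key); // re-insert to mark as most recently used
    this.entries.set(key, entry);
    this.hits += 1;
    return entry.value;
  }

  set(payload, value) {
    const key = PromptCache.keyFor(payload);
    this.entries.delete(key);
    this.entries.set(key, { value, expiresAt: this.now() + this.ttlMs });
    if (this.entries.size > this.maxEntries) {
      this.entries.delete(this.entries.keys().next().value); // evict least recently used
    }
  }

  hitRate() {
    const total = this.hits + this.misses;
    return total === 0 ? 0 : this.hits / total;
  }
}
```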
-**Memory Cache:**
-- In-memory storage
-- TTL-based eviction
-- Automatic cleanup
-### Database (`src/db/`)
-**SQLite Databases:**
-- `memories.db` - Long-term memories
-- `sessions.db` - Conversation history
-- `workspace-index.db` - Workspace metadata
-**Operations:**
-- Memory CRUD
-- Session tracking
-- FTS5 search
-### Observability (`src/observability/`)
-**Metrics:**
-- Request rate, latency, errors
-- Token usage, cache hits
-- Circuit breaker state
-- System resources
-**Logging:**
-- Structured JSON logs (pino)
-- Request ID correlation
-- Error tracking
-- Performance profiling
-### Configuration (`src/config/`)
-**Environment Variables:**
-- Provider configuration
-- Feature flags
-- Policy settings
-- Performance tuning
-**Validation:**
-- Required field checks
-- Type validation
-- Value constraints
-- Provider-specific validation
----
-## Key Features
-### 1. Multi-Provider Support
-**12+ Providers:**
-- Cloud: Databricks, Bedrock, OpenRouter, Azure, OpenAI, Moonshot AI, Z.AI, Vertex AI
-- Local: Ollama, llama.cpp, LM Studio
-**Hybrid Routing:**
-- [4-tier intelligent routing](routing.md) with complexity scoring
-- Automatic provider selection and transparent failover
-- Agentic workflow detection with tier upgrades
-- Cost optimization with multi-source pricing
-### 2. Token Optimization
-**60-80% Cost Reduction:**
-- 6 optimization phases
-- $77k-$115k annual savings
-- Automatic optimization
-### 3. Long-Term Memory
-**Titans-Inspired:**
-- Surprise-based storage
-- Semantic search (FTS5)
-- Multi-signal retrieval
-- Automatic extraction
-### 4. Production Hardening
-**14 Features:**
-- Circuit breakers
-- Load shedding
-- Graceful shutdown
-- Prometheus metrics
-- Health checks
-- Error resilience
-### 5. MCP Integration
-**Model Context Protocol:**
-- Automatic discovery
-- JSON-RPC 2.0 client
-- Dynamic tool registration
-- Sandbox isolation
-### 6. IDE Compatibility
-**Works With:**
-- Claude Code CLI (native)
-- Cursor IDE (OpenAI format)
-- Continue.dev (OpenAI format)
-- Any OpenAI-compatible client
----
-## Performance
-### Benchmarks
-**Request Throughput:**
-- **140,000 requests/second** capacity
-- **~7μs overhead** per request
-- Minimal performance impact
-**Latency:**
-- Local providers: 100-500ms
-- Cloud providers: 500ms-2s
-- Caching: <1ms (cache hits)
-**Memory Usage:**
-- Base: ~100MB
-- Per connection: ~1MB
-- Caching: ~50MB
-**Token Optimization:**
-- Average reduction: 60-80%
-- Cache hit rate: 70-90%
-- Dedup effectiveness: 85%
----
-## Scaling
-### Horizontal Scaling
-```bash
-# Run multiple instances
-PM2_INSTANCES=4 pm2 start lynkr
-# Behind load balancer (nginx, HAProxy)
-# Shared database for memories
-```
-### Vertical Scaling
-```bash
-# Increase cache size
-PROMPT_CACHE_MAX_ENTRIES=256
-# Increase connection pool
-# (provider-specific)
-```
-### Database Optimization
-```bash
-# Enable WAL mode (better concurrency)
-# Automatic vacuum
-# Index optimization
-```
----
-## Next Steps
-- **[Routing & Model Tiering](routing.md)** - Intelligent routing and scoring algorithm
-- **[Memory System](memory-system.md)** - Long-term memory details
-- **[Token Optimization](token-optimization.md)** - Cost reduction strategies
-- **[Production Guide](production.md)** - Deploy to production
-- **[Tools Guide](tools.md)** - Tool execution modes
----
-## Getting Help
-- **[GitHub Discussions](https://github.com/vishalveerareddy123/Lynkr/discussions)** - Ask questions
-- **[GitHub Issues](https://github.com/vishalveerareddy123/Lynkr/issues)** - Report issues