npm - loki-mode - Versions diffs - 5.17.0 → 5.19.0 - Mend

loki-mode 5.17.0 → 5.19.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (38) hide show

package/README.md +62 -10
package/SKILL.md +2 -2
package/VERSION +1 -1
package/api/middleware/error.ts +29 -1
package/api/middleware/timing.ts +135 -0
package/api/openapi.yaml +590 -0
package/api/routes/health.ts +133 -1
package/api/routes/learning.ts +482 -0
package/api/routes/learning_test.ts +456 -0
package/api/routes/memory.ts +526 -16
package/api/routes/sessions.ts +56 -0
package/api/server.ts +62 -0
package/api/server_test.ts +93 -0
package/api/services/cli-bridge.ts +27 -19
package/api/services/learning-collector.ts +792 -0
package/api/services/learning-collector_test.ts +468 -0
package/api/services/state-notifications.ts +421 -0
package/api/services/state-watcher.ts +68 -52
package/api/types/memory.ts +62 -0
package/autonomy/loki +503 -0
package/autonomy/run.sh +239 -18
package/docs/SYNERGY-TASKS.md +138 -0
package/docs/loki-mode-presentation.pptx +0 -0
package/memory/__init__.py +61 -1
package/memory/embeddings.py +1065 -145
package/memory/namespace.py +552 -0
package/memory/retrieval.py +517 -7
package/memory/schemas.py +80 -1
package/memory/storage.py +505 -5
package/memory/test_importance.py +352 -0
package/memory/tests/test_namespace.py +538 -0
package/memory/token_economics.py +156 -0
package/memory/unified_access.py +591 -0
package/package.json +13 -3
package/references/agent-types.md +2 -2
package/references/agents.md +1 -1
package/skills/00-index.md +2 -2
package/skills/agents.md +2 -2

package/README.md CHANGED Viewed

@@ -3,7 +3,7 @@
 **The First Truly Autonomous Multi-Agent Startup System**
 [![Claude Code](https://img.shields.io/badge/Claude-Code-orange)](https://claude.ai)
-[![Agent Types](https://img.shields.io/badge/Agent%20Types-37-blue)]()
+[![Agent Types](https://img.shields.io/badge/Agent%20Types-41-blue)]()
 [![Loki Mode](https://img.shields.io/badge/Loki%20Mode-98.78%25%20Pass%401-blueviolet)](benchmarks/results/)
 [![HumanEval](https://img.shields.io/badge/HumanEval-98.17%25%20Pass%401-brightgreen)](benchmarks/results/)
 [![SWE-bench](https://img.shields.io/badge/SWE--bench-99.67%25%20Patch%20Gen-brightgreen)](benchmarks/results/)
@@ -29,7 +29,7 @@
 ![Loki Mode Presentation](docs/loki-mode-presentation.gif)
-*9 slides: Problem, Solution, 37 Agents, RARV Cycle, Benchmarks, Multi-Provider, Full Lifecycle*
+*9 slides: Problem, Solution, 41 Agents, RARV Cycle, Benchmarks, Multi-Provider, Full Lifecycle*
 **[Download PPTX](docs/loki-mode-presentation.pptx)** for offline viewing
@@ -172,7 +172,7 @@ See [benchmarks/results/](benchmarks/results/) for full methodology and solution
 ## What is Loki Mode?
-Loki Mode is a multi-provider AI skill that orchestrates **37 specialized AI agent types** across **6 swarms** to autonomously build, test, deploy, and scale complete startups. Works with **Claude Code**, **OpenAI Codex CLI**, and **Google Gemini CLI**. It dynamically spawns only the agents you need—**5-10 for simple projects, 100+ for complex startups**—working in parallel with continuous self-verification.
+Loki Mode is a multi-provider AI skill that orchestrates **41 specialized AI agent types** across **7 swarms** to autonomously build, test, deploy, and scale complete startups. Works with **Claude Code**, **OpenAI Codex CLI**, and **Google Gemini CLI**. It dynamically spawns only the agents you need—**5-10 for simple projects, 100+ for complex startups**—working in parallel with continuous self-verification.
 ```
 PRD → Research → Architecture → Development → Testing → Deployment → Marketing → Revenue
@@ -190,7 +190,7 @@ PRD → Research → Architecture → Development → Testing → Deployment →
 |----------------|---------------------|
 | **Single agent** writes code linearly | **100+ agents** work in parallel across engineering, ops, business, data, product, and growth |
 | **Manual deployment** required | **Autonomous deployment** to AWS, GCP, Azure, Vercel, Railway with blue-green and canary strategies |
-| **No testing** or basic unit tests | **14 automated quality gates**: security scans, load tests, accessibility audits, code reviews |
+| **No testing** or basic unit tests | **7 automated quality gates**: input/output guardrails, static analysis, blind review, anti-sycophancy, severity blocking, test coverage |
 | **Code only** - you handle the rest | **Full business operations**: marketing, sales, legal, HR, finance, investor relations |
 | **Stops on errors** | **Self-healing**: circuit breakers, dead letter queues, exponential backoff, automatic recovery |
 | **No visibility** into progress | **Real-time dashboard** with agent monitoring, task queues, and live status updates |
@@ -216,9 +216,9 @@ PRD → Research → Architecture → Development → Testing → Deployment →
 | **CLI (v4.1.0)** | `loki` command for start/stop/pause/status | [CLI Commands](#cli-commands-v410) |
 | **Config Files** | YAML configuration support | [autonomy/config.example.yaml](autonomy/config.example.yaml) |
 | **Dashboard** | Realtime Kanban board, agent monitoring | [Dashboard Guide](docs/dashboard-guide.md) |
-| **37 Agent Types** | Engineering, Ops, Business, Data, Product, Growth | [Agent Definitions](references/agent-types.md) |
+| **41 Agent Types** | Engineering, Ops, Business, Data, Product, Growth, Orchestration | [Agent Definitions](references/agent-types.md) |
 | **RARV Cycle** | Reason-Act-Reflect-Verify workflow | [Core Workflow](references/core-workflow.md) |
-| **Quality Gates** | 7-gate review system with anti-sycophancy | [Quality Control](references/quality-control.md) |
+| **Quality Gates** | 7-gate system: guardrails, static analysis, blind review, anti-sycophancy, severity blocking, test coverage | [Quality Control](references/quality-control.md) |
 | **Memory System (v5.15.0)** | Complete 3-tier memory with progressive disclosure | [Memory Architecture](references/memory-system.md) |
 | **Parallel Workflows** | Git worktree-based parallelism | [Parallel Workflows](skills/parallel-workflows.md) |
 | **GitHub Integration** | Issue import, PR creation, status sync | [GitHub Integration](skills/github-integration.md) |
@@ -477,9 +477,9 @@ Config search order: `.loki/config.yaml` (project) -> `~/.config/loki-mode/confi
 ---
-## Agent Swarms (37 Types)
+## Agent Swarms (41 Types)
-Loki Mode has **37 predefined agent types** organized into **6 specialized swarms**. The orchestrator spawns only what you need—simple projects use 5-10 agents, complex startups spawn 100+.
+Loki Mode has **41 predefined agent types** organized into **7 specialized swarms**. The orchestrator spawns only what you need—simple projects use 5-10 agents, complex startups spawn 100+.
 <img width="5309" height="979" alt="Agent Swarms Visualization" src="https://github.com/user-attachments/assets/7d18635d-a606-401f-8d9f-430e6e4ee689" />
@@ -504,7 +504,59 @@ Loki Mode has **37 predefined agent types** organized into **6 specialized swarm
 ### **Review (3 types)**
 `review-code` `review-business` `review-security`
-See [references/agents.md](references/agents.md) for complete agent type definitions.
+### **Orchestration (4 types)**
+`orch-planner` `orch-sub-planner` `orch-judge` `orch-coordinator`
+<details>
+<summary><strong>View All 41 Agent Types with Capabilities</strong></summary>
+| Swarm | Agent | Capabilities |
+|-------|-------|--------------|
+| **Engineering** | `eng-frontend` | React/Vue/Svelte, TypeScript, Tailwind, accessibility, responsive design |
+| | `eng-backend` | Node/Python/Go, REST/GraphQL, auth, business logic, middleware |
+| | `eng-database` | PostgreSQL/MySQL/MongoDB, migrations, query optimization, indexing |
+| | `eng-mobile` | React Native/Flutter/Swift/Kotlin, offline-first, push notifications |
+| | `eng-api` | OpenAPI specs, SDK generation, versioning, webhooks, rate limiting |
+| | `eng-qa` | Unit/integration/E2E tests, coverage, automation, test data |
+| | `eng-perf` | Profiling, benchmarking, optimization, caching, load testing |
+| | `eng-infra` | Docker, K8s manifests, IaC, networking, security hardening |
+| **Operations** | `ops-devops` | CI/CD pipelines, GitHub Actions, GitLab CI, Jenkins |
+| | `ops-sre` | Reliability, SLOs/SLIs, capacity planning, runbooks |
+| | `ops-security` | SAST/DAST, pen testing, vulnerability management |
+| | `ops-monitor` | Observability, Datadog/Grafana, alerting, dashboards |
+| | `ops-incident` | Incident response, RCA, post-mortems, communication |
+| | `ops-release` | Versioning, changelogs, blue-green, canary, rollbacks |
+| | `ops-cost` | Cloud cost optimization, right-sizing, FinOps |
+| | `ops-compliance` | SOC2, GDPR, HIPAA, PCI-DSS, audit preparation |
+| **Business** | `biz-marketing` | Landing pages, SEO, content, email campaigns, social media |
+| | `biz-sales` | CRM setup, outreach, demos, proposals, pipeline |
+| | `biz-finance` | Billing (Stripe), invoicing, metrics, runway, pricing |
+| | `biz-legal` | ToS, privacy policy, contracts, IP protection |
+| | `biz-support` | Help docs, FAQs, ticket system, chatbot, knowledge base |
+| | `biz-hr` | Job posts, recruiting, onboarding, culture docs |
+| | `biz-investor` | Pitch decks, investor updates, data room, cap table |
+| | `biz-partnerships` | BD outreach, integrations, co-marketing, API partnerships |
+| **Data** | `data-ml` | Model training, MLOps, feature engineering, inference |
+| | `data-eng` | ETL pipelines, data warehousing, dbt, Airflow |
+| | `data-analytics` | Product analytics, A/B tests, dashboards, insights |
+| **Product** | `prod-pm` | Backlog grooming, prioritization, roadmap, specs |
+| | `prod-design` | Design system, Figma, UX patterns, prototypes |
+| | `prod-techwriter` | API docs, guides, tutorials, release notes |
+| **Growth** | `growth-hacker` | Growth experiments, viral loops, referral programs |
+| | `growth-community` | Community building, Discord/Slack, ambassador programs |
+| | `growth-success` | Customer success, health scoring, churn prevention |
+| | `growth-lifecycle` | Email lifecycle, in-app messaging, re-engagement |
+| **Review** | `review-code` | Code quality, design patterns, SOLID, maintainability |
+| | `review-business` | Requirements alignment, business logic, edge cases |
+| | `review-security` | Vulnerabilities, auth/authz, OWASP Top 10 |
+| **Orchestration** | `orch-planner` | Task decomposition, dependency analysis, work distribution |
+| | `orch-sub-planner` | Domain-specific planning, recursive task breakdown |
+| | `orch-judge` | Cycle continuation decisions, goal assessment, escalation |
+| | `orch-coordinator` | Cross-stream coordination, merge decisions, conflict resolution |
+</details>
+See [references/agent-types.md](references/agent-types.md) for complete agent type definitions.
 ---
@@ -543,7 +595,7 @@ references/                    # Deep documentation (23KB+ files)
 | **2. Architecture** | Tech stack selection with self-reflection |
 | **3. Infrastructure** | Provision cloud, CI/CD, monitoring |
 | **4. Development** | Implement with TDD, parallel code review |
-| **5. QA** | 14 quality gates, security audit, load testing |
+| **5. QA** | 7 quality gates, security audit, load testing |
 | **6. Deployment** | Blue-green deploy, auto-rollback on errors |
 | **7. Business** | Marketing, sales, legal, support setup |
 | **8. Growth** | Continuous optimization, A/B testing, feedback loops |

package/SKILL.md CHANGED Viewed

@@ -3,7 +3,7 @@ name: loki-mode
 description: Multi-agent autonomous startup system. Triggers on "Loki Mode". Takes PRD to deployed product with zero human intervention. Requires --dangerously-skip-permissions flag.
 ---
-# Loki Mode v5.17.0
+# Loki Mode v5.19.0
 **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
@@ -253,4 +253,4 @@ Auto-detected or force with `LOKI_COMPLEXITY`:
 ---
-**v5.17.0 | Unified Event Bus, MCP Integration, Complete Memory System | ~250 lines core**
+**v5.19.0 | Complete Synergy, Learning System, Swarm Intelligence | ~250 lines core**

package/VERSION CHANGED Viewed

	@@ -1 +1 @@
1	- 5.17.0
1	+ 5.19.0

package/api/middleware/error.ts CHANGED Viewed

@@ -2,9 +2,11 @@
  * Error Handling Middleware
  *
  * Provides consistent error responses and logging.
+ * Emits ErrorPatternSignal for learning from API errors.
  */
 import type { ApiError } from "../types/api.ts";
+import { learningCollector } from "../services/learning-collector.ts";
 // Error codes
 export const ErrorCodes = {
@@ -105,7 +107,8 @@ export function errorMiddleware(
 }
 /**
- * Handle an error and return appropriate response
+ * Handle an error and return appropriate response.
+ * Also emits an ErrorPatternSignal for learning.
  */
 export function handleError(err: unknown, req?: Request): Response {
   // Log error for debugging
@@ -115,6 +118,31 @@ export function handleError(err: unknown, req?: Request): Response {
   console.error(`Error handling ${requestInfo}:`, err);
+  // Emit error pattern signal for learning
+  const errorType = err instanceof LokiApiError
+    ? err.code
+    : err instanceof Error
+    ? err.name
+    : "UnknownError";
+  const errorMessage = err instanceof Error
+    ? err.message
+    : "An unexpected error occurred";
+  learningCollector.emitErrorPattern(
+    requestInfo,
+    errorType,
+    errorMessage,
+    {
+      stackTrace: err instanceof Error ? err.stack : undefined,
+      context: {
+        method: req?.method,
+        path: req ? new URL(req.url).pathname : undefined,
+        errorCode: err instanceof LokiApiError ? err.code : undefined,
+        statusCode: err instanceof LokiApiError ? err.status : 500,
+      },
+    }
+  );
   // Handle known API errors
   if (err instanceof LokiApiError) {
     return err.toResponse();

package/api/middleware/timing.ts ADDED Viewed

@@ -0,0 +1,135 @@
+/**
+ * Timing Middleware
+ *
+ * Tracks API request timing and emits learning signals for efficiency tracking.
+ */
+import { learningCollector } from "../services/learning-collector.ts";
+/**
+ * Request timing context
+ */
+export interface TimingContext {
+  startTime: number;
+  method: string;
+  path: string;
+}
+/**
+ * Create timing middleware that wraps handlers to track response times.
+ *
+ * Emits ToolEfficiencySignal for each request with timing data.
+ */
+export function timingMiddleware(
+  handler: (req: Request) => Promise<Response> | Response
+): (req: Request) => Promise<Response> {
+  return async (req: Request): Promise<Response> => {
+    const startTime = Date.now();
+    const url = new URL(req.url);
+    const method = req.method;
+    const path = url.pathname;
+    try {
+      const response = await handler(req);
+      const duration = Date.now() - startTime;
+      // Emit success signal for non-error responses
+      if (response.status < 400) {
+        learningCollector.emitApiRequest(path, method, startTime, true, {
+          statusCode: response.status,
+          context: {
+            durationMs: duration,
+          },
+        });
+      } else {
+        // Clone response to read body for error details
+        const clonedResponse = response.clone();
+        let errorMessage = "Request failed";
+        try {
+          const body = await clonedResponse.json();
+          errorMessage = body.error || body.message || errorMessage;
+        } catch {
+          // Body might not be JSON
+        }
+        learningCollector.emitApiRequest(path, method, startTime, false, {
+          statusCode: response.status,
+          errorMessage,
+          context: {
+            durationMs: duration,
+          },
+        });
+      }
+      // Add timing header to response
+      const headers = new Headers(response.headers);
+      headers.set("X-Response-Time", `${duration}ms`);
+      return new Response(response.body, {
+        status: response.status,
+        statusText: response.statusText,
+        headers,
+      });
+    } catch (error) {
+      // Emit error signal
+      const errorMessage =
+        error instanceof Error ? error.message : "Unknown error";
+      learningCollector.emitApiRequest(path, method, startTime, false, {
+        statusCode: 500,
+        errorMessage,
+        context: {
+          errorType: error instanceof Error ? error.name : "UnknownError",
+        },
+      });
+      throw error;
+    }
+  };
+}
+/**
+ * Create a timing context for manual timing in route handlers.
+ *
+ * Usage:
+ *   const timing = startTiming(req);
+ *   // ... do work ...
+ *   endTiming(timing, true);
+ */
+export function startTiming(req: Request): TimingContext {
+  const url = new URL(req.url);
+  return {
+    startTime: Date.now(),
+    method: req.method,
+    path: url.pathname,
+  };
+}
+/**
+ * End timing and emit a learning signal.
+ */
+export function endTiming(
+  context: TimingContext,
+  success: boolean,
+  options: {
+    statusCode?: number;
+    errorMessage?: string;
+    metadata?: Record<string, unknown>;
+  } = {}
+): number {
+  const duration = Date.now() - context.startTime;
+  learningCollector.emitApiRequest(
+    context.path,
+    context.method,
+    context.startTime,
+    success,
+    {
+      statusCode: options.statusCode,
+      errorMessage: options.errorMessage,
+      context: options.metadata,
+    }
+  );
+  return duration;
+}