loki-mode (npm) 5.23.0 → 5.26.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (52)

package/README.md +28 -160
package/SKILL.md +8 -8
package/VERSION +1 -1
package/autonomy/completion-council.sh +760 -0
package/autonomy/hooks/quality-gate.sh +51 -0
package/autonomy/hooks/session-init.sh +54 -0
package/autonomy/hooks/store-episode.sh +54 -0
package/autonomy/hooks/track-metrics.sh +16 -0
package/autonomy/hooks/validate-bash.sh +53 -0
package/autonomy/loki +247 -40
package/autonomy/run.sh +153 -40
package/bin/postinstall.js +27 -3
package/dashboard/__init__.py +1 -1
package/dashboard/server.py +211 -7
package/dashboard/static/index.html +4253 -0
package/docs/COMPARISON.md +21 -18
package/docs/COMPETITIVE-ANALYSIS.md +39 -15
package/docs/INSTALLATION.md +52 -12
package/docs/SYNERGY-ROADMAP.md +4 -4
package/docs/SYNERGY-TASKS.md +9 -4
package/docs/TOOL-INTEGRATION.md +4 -4
package/docs/architecture/DASHBOARD_V2_ARCHITECTURE.md +7 -7
package/docs/auto-claude-comparison.md +10 -10
package/docs/cursor-comparison.md +3 -3
package/docs/dashboard-guide.md +82 -23
package/docs/thick2thin.md +1 -1
package/events/bus.py +8 -4
package/memory/storage.py +26 -5
package/package.json +7 -16
package/providers/claude.sh +7 -7
package/providers/codex.sh +16 -19
package/providers/gemini.sh +8 -8
package/references/competitive-analysis.md +2 -2
package/references/cursor-learnings.md +2 -2
package/references/multi-provider.md +13 -13
package/skills/00-index.md +1 -1
package/skills/agents.md +1 -1
package/skills/github-integration.md +3 -3
package/skills/model-selection.md +26 -1
package/skills/parallel-workflows.md +1 -1
package/skills/providers.md +14 -11
package/skills/testing.md +1 -1
package/Dockerfile +0 -86
package/Dockerfile.sandbox +0 -254
package/autonomy/.loki/dashboard/index.html +0 -2768
package/dashboard/Dockerfile +0 -79
package/dashboard/docker-compose.yml +0 -47
package/docker-compose.yml +0 -37
package/docs/loki-mode-presentation.gif +0 -0
package/docs/loki-mode-presentation.pptx +0 -0
package/docs/screenshots/dashboard-agents.png +0 -0
package/docs/screenshots/dashboard-tasks.png +0 -0
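
Each summary line above follows the shape `path +added -removed`. The following is a minimal sketch, not part of the package or of the diff tool, of how such lines could be parsed and totalled. The helper names `parse_stat` and `total_changes` are hypothetical, and the sample covers only four of the 52 entries.

```python
import re

# Matches one summary line, e.g. "package/VERSION +1 -1".
# Assumes paths contain no spaces, as in the list above.
STAT_LINE = re.compile(r"^(?P<path>\S+) \+(?P<added>\d+) -(?P<removed>\d+)$")


def parse_stat(line):
    """Split a summary line into (path, lines_added, lines_removed)."""
    match = STAT_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a diffstat line: {line!r}")
    return match["path"], int(match["added"]), int(match["removed"])


def total_changes(lines):
    """Sum added and removed line counts across all summary lines."""
    added = removed = 0
    for line in lines:
        _, a, r = parse_stat(line)
        added += a
        removed += r
    return added, removed


# Four entries copied from the list above (illustrative subset only).
sample = [
    "package/README.md +28 -160",
    "package/autonomy/completion-council.sh +760 -0",
    "package/Dockerfile +0 -86",
    "package/docs/loki-mode-presentation.gif +0 -0",
]

print(total_changes(sample))
```

Entries with `+0 -0`, such as the `.gif` and `.png` files, carry no line counts, so they contribute nothing to either total.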

package/README.md CHANGED

@@ -2,12 +2,15 @@
 **The First Truly Autonomous Multi-Agent Startup System**
+[![npm version](https://img.shields.io/npm/v/loki-mode)](https://www.npmjs.com/package/loki-mode)
+[![npm downloads](https://img.shields.io/npm/dw/loki-mode)](https://www.npmjs.com/package/loki-mode)
+[![GitHub stars](https://img.shields.io/github/stars/asklokesh/loki-mode)](https://github.com/asklokesh/loki-mode)
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 [![Claude Code](https://img.shields.io/badge/Claude-Code-orange)](https://claude.ai)
 [![Agent Types](https://img.shields.io/badge/Agent%20Types-41-blue)]()
 [![Loki Mode](https://img.shields.io/badge/Loki%20Mode-98.78%25%20Pass%401-blueviolet)](benchmarks/results/)
 [![HumanEval](https://img.shields.io/badge/HumanEval-98.17%25%20Pass%401-brightgreen)](benchmarks/results/)
 [![SWE-bench](https://img.shields.io/badge/SWE--bench-99.67%25%20Patch%20Gen-brightgreen)](benchmarks/results/)
-[![License](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
 **[Documentation Website](https://asklokesh.github.io/loki-mode/)** | **[Architecture](https://asklokesh.github.io/loki-mode/blog/#architecture)** | **[Research](https://asklokesh.github.io/loki-mode/blog/#research)** | **[Comparisons](https://asklokesh.github.io/loki-mode/blog/#comparisons)**
@@ -37,81 +40,22 @@
 ## Usage
-### Option 1: Claude Code Skill (Recommended)
-```bash
-# Install
-git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
-# Run
-claude --dangerously-skip-permissions
-# Then say:
-Loki Mode with PRD at ./my-prd.md
-```
-### Option 2: Shell Script
-```bash
-# Clone repo
-git clone https://github.com/asklokesh/loki-mode.git
-cd loki-mode
-# Run directly
-./autonomy/run.sh ./my-prd.md
-```
-### Option 3: npm
+### Option 1: npm (Recommended)
 ```bash
 npm install -g loki-mode
 loki start ./my-prd.md
 ```
-### Option 4: Homebrew (macOS/Linux)
-```bash
-brew install asklokesh/tap/loki-mode
-loki start ./my-prd.md
-```
-### Option 5: Docker
-```bash
-docker run -v $(pwd):/workspace asklokesh/loki-mode:5.1.1 ./my-prd.md
-```
-### Option 6: VS Code Extension
-Install directly from the VS Code Marketplace for a visual interface:
-```bash
-# From VS Code
-1. Open Extensions (Cmd+Shift+X / Ctrl+Shift+X)
-2. Search "loki-mode"
-3. Click Install
-# Or via command line
-code --install-extension asklokesh.loki-mode
-```
+### Option 2: Claude Code Skill
-**Important:** Start the Loki Mode server before using the extension:
 ```bash
-loki start              # If using CLI
-# or
-./autonomy/run.sh       # If running from source
+git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
+claude --dangerously-skip-permissions
+# Then say: Loki Mode with PRD at ./my-prd.md
 ```
-**Extension Features:**
-- Start/Stop/Pause/Resume sessions from the activity bar
-- Real-time task progress in the sidebar
-- Provider selection (Claude, Codex, Gemini)
-- Status bar showing current phase and progress
-- Quick actions menu (Cmd+Shift+L / Ctrl+Shift+L)
-[View on Marketplace](https://marketplace.visualstudio.com/items?itemName=asklokesh.loki-mode)
-See [Installation Guide](docs/INSTALLATION.md) for more details.
+Also available via **Homebrew**, **Docker**, **VS Code Extension**, and **direct shell script**. See the [Installation Guide](docs/INSTALLATION.md) for all 6 installation methods and detailed instructions.
 ### Multi-Provider Support (v5.0.0)
@@ -366,26 +310,7 @@ There is **NEVER** a "finished" state. After completing the PRD, Loki Mode:
 ## Quick Start
-### **1. Install**
-```bash
-# Option A: npm (recommended)
-npm install -g loki-mode
-# Option B: Homebrew (macOS/Linux)
-brew tap asklokesh/tap && brew install loki-mode
-loki-mode-install-skill  # Set up Claude Code integration
-# Option C: Docker
-docker pull asklokesh/loki-mode:5.0.0
-# Option D: Git clone
-git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
-```
-See [Installation Guide](docs/INSTALLATION.md) for detailed instructions.
-### **2. Create a PRD**
+### **1. Write a PRD**
 ```markdown
 # Product: AI-Powered Todo App
@@ -409,38 +334,20 @@ Build a todo app with AI-powered task suggestions and deadline predictions.
 Save as `my-prd.md`.
-### **3. Run Loki Mode**
+### **2. Run It**
 ```bash
-# Using the CLI (v4.1.0)
 loki start ./my-prd.md
-# Or using run.sh directly
-./autonomy/run.sh ./my-prd.md
-# Or manual mode in Claude Code
-claude --dangerously-skip-permissions
-> Loki Mode with PRD at ./my-prd.md
 ```
-### **4. Monitor Progress**
+### **3. Monitor and Walk Away**
 ```bash
-# Check status
-loki status
-# Open dashboard in browser
-loki dashboard
-# Or watch terminal output
-watch -n 2 cat .loki/STATUS.txt
+loki status              # Check progress
+loki dashboard           # Open web dashboard
 ```
-### **5. Walk Away**
-Seriously. Go get coffee. It'll be deployed when you get back.
-**That's it.** No configuration. No manual steps. No intervention.
+Go get coffee. It'll be deployed when you get back.
 ---
@@ -507,56 +414,7 @@ Loki Mode has **41 predefined agent types** organized into **7 specialized swarm
 ### **Orchestration (4 types)**
 `orch-planner` `orch-sub-planner` `orch-judge` `orch-coordinator`
-<details>
-<summary><strong>View All 41 Agent Types with Capabilities</strong></summary>
-| Swarm | Agent | Capabilities |
-|-------|-------|--------------|
-| **Engineering** | `eng-frontend` | React/Vue/Svelte, TypeScript, Tailwind, accessibility, responsive design |
-| | `eng-backend` | Node/Python/Go, REST/GraphQL, auth, business logic, middleware |
-| | `eng-database` | PostgreSQL/MySQL/MongoDB, migrations, query optimization, indexing |
-| | `eng-mobile` | React Native/Flutter/Swift/Kotlin, offline-first, push notifications |
-| | `eng-api` | OpenAPI specs, SDK generation, versioning, webhooks, rate limiting |
-| | `eng-qa` | Unit/integration/E2E tests, coverage, automation, test data |
-| | `eng-perf` | Profiling, benchmarking, optimization, caching, load testing |
-| | `eng-infra` | Docker, K8s manifests, IaC, networking, security hardening |
-| **Operations** | `ops-devops` | CI/CD pipelines, GitHub Actions, GitLab CI, Jenkins |
-| | `ops-sre` | Reliability, SLOs/SLIs, capacity planning, runbooks |
-| | `ops-security` | SAST/DAST, pen testing, vulnerability management |
-| | `ops-monitor` | Observability, Datadog/Grafana, alerting, dashboards |
-| | `ops-incident` | Incident response, RCA, post-mortems, communication |
-| | `ops-release` | Versioning, changelogs, blue-green, canary, rollbacks |
-| | `ops-cost` | Cloud cost optimization, right-sizing, FinOps |
-| | `ops-compliance` | SOC2, GDPR, HIPAA, PCI-DSS, audit preparation |
-| **Business** | `biz-marketing` | Landing pages, SEO, content, email campaigns, social media |
-| | `biz-sales` | CRM setup, outreach, demos, proposals, pipeline |
-| | `biz-finance` | Billing (Stripe), invoicing, metrics, runway, pricing |
-| | `biz-legal` | ToS, privacy policy, contracts, IP protection |
-| | `biz-support` | Help docs, FAQs, ticket system, chatbot, knowledge base |
-| | `biz-hr` | Job posts, recruiting, onboarding, culture docs |
-| | `biz-investor` | Pitch decks, investor updates, data room, cap table |
-| | `biz-partnerships` | BD outreach, integrations, co-marketing, API partnerships |
-| **Data** | `data-ml` | Model training, MLOps, feature engineering, inference |
-| | `data-eng` | ETL pipelines, data warehousing, dbt, Airflow |
-| | `data-analytics` | Product analytics, A/B tests, dashboards, insights |
-| **Product** | `prod-pm` | Backlog grooming, prioritization, roadmap, specs |
-| | `prod-design` | Design system, Figma, UX patterns, prototypes |
-| | `prod-techwriter` | API docs, guides, tutorials, release notes |
-| **Growth** | `growth-hacker` | Growth experiments, viral loops, referral programs |
-| | `growth-community` | Community building, Discord/Slack, ambassador programs |
-| | `growth-success` | Customer success, health scoring, churn prevention |
-| | `growth-lifecycle` | Email lifecycle, in-app messaging, re-engagement |
-| **Review** | `review-code` | Code quality, design patterns, SOLID, maintainability |
-| | `review-business` | Requirements alignment, business logic, edge cases |
-| | `review-security` | Vulnerabilities, auth/authz, OWASP Top 10 |
-| **Orchestration** | `orch-planner` | Task decomposition, dependency analysis, work distribution |
-| | `orch-sub-planner` | Domain-specific planning, recursive task breakdown |
-| | `orch-judge` | Cycle continuation decisions, goal assessment, escalation |
-| | `orch-coordinator` | Cross-stream coordination, merge decisions, conflict resolution |
-</details>
-See [references/agent-types.md](references/agent-types.md) for complete agent type definitions.
+See [Agent Types](references/agent-types.md) for the full list of 41 specialized agents with detailed capabilities.
 ---
@@ -809,10 +667,20 @@ Run the comprehensive test suite:
 Contributions welcome! Please:
 1. Read [SKILL.md](SKILL.md) to understand the core architecture
 2. Review [skills/00-index.md](skills/00-index.md) for module organization (v3.0+)
-3. Check [references/agents.md](references/agents.md) for agent definitions
+3. Check [references/agent-types.md](references/agent-types.md) for agent definitions
 4. Open an issue for bugs or feature requests
 5. Submit PRs with clear descriptions and tests
+**Dev setup:**
+```bash
+git clone https://github.com/asklokesh/loki-mode.git && cd loki-mode
+npm install              # Install dependencies
+bash -n autonomy/run.sh  # Validate shell scripts
+cd dashboard-ui && npm ci && npm run build:all  # Build dashboard
+```
+See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed development guidelines.
 ---
 ## License

package/SKILL.md CHANGED

@@ -3,7 +3,7 @@ name: loki-mode
 description: Multi-agent autonomous startup system. Triggers on "Loki Mode". Takes PRD to deployed product with zero human intervention. Requires --dangerously-skip-permissions flag.
 ---
-# Loki Mode v5.23.0
+# Loki Mode v5.26.2
 **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
@@ -96,8 +96,8 @@ These rules are ABSOLUTE. Violating them is a critical failure.
 **Default (v5.3.0):** Haiku disabled for quality. Use `--allow-haiku` or `LOKI_ALLOW_HAIKU=true` to enable.
-| Task Type | Tier | Claude (default) | Claude (--allow-haiku) | Codex | Gemini |
-|-----------|------|------------------|------------------------|-------|--------|
+| Task Type | Tier | Claude (default) | Claude (--allow-haiku) | Codex (GPT-5.3) | Gemini |
+|-----------|------|------------------|------------------------|------------------|--------|
 | PRD analysis, architecture, system design | **planning** | opus | opus | effort=xhigh | thinking=high |
 | Feature implementation, complex bugs | **development** | opus | sonnet | effort=high | thinking=medium |
 | Code review (always 3 parallel reviewers) | **development** | opus | sonnet | effort=high | thinking=medium |
@@ -106,7 +106,7 @@ These rules are ABSOLUTE. Violating them is a critical failure.
 **Parallelization rule (Claude only):** Launch up to 10 agents simultaneously for independent tasks.
-**Degraded mode (Codex/Gemini):** No parallel agents or Task tool. Runs RARV cycle sequentially. See `skills/model-selection.md`.
+**Degraded mode (Codex/Gemini):** No parallel agents or Task tool. Codex has MCP support. Runs RARV cycle sequentially. See `skills/model-selection.md`.
 **Git worktree parallelism:** For true parallel feature development, use `--parallel` flag with run.sh. See `skills/parallel-workflows.md`.
@@ -193,7 +193,7 @@ claude --dangerously-skip-permissions
 # With provider selection (supports .md and .json PRDs)
 ./autonomy/run.sh --provider claude ./prd.md   # Default, full features
-./autonomy/run.sh --provider codex ./prd.json  # GPT-5.2 Codex, degraded mode
+./autonomy/run.sh --provider codex ./prd.json  # GPT-5.3 Codex, degraded mode
 ./autonomy/run.sh --provider gemini ./prd.md   # Gemini 3 Pro, degraded mode
 # Or via CLI wrapper
@@ -204,8 +204,8 @@ loki start --provider codex ./prd.md
 ```
 **Provider capabilities:**
-- **Claude**: Full features (Task tool, parallel agents, MCP, 200K context)
-- **Codex**: Degraded mode (sequential only, no Task tool, 128K context)
+- **Claude**: Opus 4.6, 1M context (beta), 128K output, adaptive thinking, agent teams, full features (Task tool, parallel agents, MCP)
+- **Codex**: GPT-5.3, 400K context, 128K output, MCP support, --full-auto mode, degraded (sequential only, no Task tool)
 - **Gemini**: Degraded mode (sequential only, no Task tool, 1M context)
 ---
@@ -260,4 +260,4 @@ Auto-detected or force with `LOKI_COMPLEXITY`:
 ---
-**v5.23.0 | Dashboard File-Based API + All Components Functional | ~270 lines core**
+**v5.26.2 | Dashboard/shell/UX fixes, security hardening | ~270 lines core**
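
The model-selection table in the SKILL.md hunks above maps a task tier and a provider to a model or effort setting. The following is a rough sketch of that lookup, assuming only the two tiers visible in this diff (`planning` and `development`). The names `MODEL_TABLE` and `select_model` are hypothetical and do not come from the package; rows outside the hunk are deliberately left out.

```python
# Values copied from the table rows visible in the hunk; rows that the
# diff does not show are not reconstructed here.
MODEL_TABLE = {
    "planning": {
        "claude": "opus",
        "claude-allow-haiku": "opus",
        "codex": "effort=xhigh",
        "gemini": "thinking=high",
    },
    "development": {
        "claude": "opus",
        "claude-allow-haiku": "sonnet",
        "codex": "effort=high",
        "gemini": "thinking=medium",
    },
}


def select_model(tier, provider, allow_haiku=False):
    """Return the table entry for a tier and provider.

    allow_haiku mirrors the --allow-haiku column and only affects Claude.
    """
    if tier not in MODEL_TABLE:
        raise KeyError(f"tier not covered by this sketch: {tier}")
    column = provider
    if provider == "claude" and allow_haiku:
        column = "claude-allow-haiku"
    if column not in MODEL_TABLE[tier]:
        raise KeyError(f"unknown provider: {provider}")
    return MODEL_TABLE[tier][column]


print(select_model("development", "claude", allow_haiku=True))
print(select_model("planning", "codex"))
```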

package/VERSION CHANGED

@@ -1 +1 @@
-5.23.0
+5.26.2
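
The VERSION change is the only place the bump is recorded as a bare string. As a minimal sketch, assuming plain `MAJOR.MINOR.PATCH` strings with no pre-release or build suffix, the kind of bump can be classified as shown here. The function names are hypothetical and are not part of the package.

```python
def parse_version(text):
    """Turn 'MAJOR.MINOR.PATCH' into a tuple of three ints."""
    parts = text.strip().split(".")
    if len(parts) != 3 or not all(p.isdigit() for p in parts):
        raise ValueError(f"unsupported version string: {text!r}")
    return tuple(int(p) for p in parts)


def bump_kind(old, new):
    """Name the highest-order component that increased from old to new."""
    o, n = parse_version(old), parse_version(new)
    if n <= o:
        return "none-or-downgrade"
    if n[0] != o[0]:
        return "major"
    if n[1] != o[1]:
        return "minor"
    return "patch"


print(bump_kind("5.23.0", "5.26.2"))
```

Comparing tuples of ints rather than raw strings avoids ordering mistakes, for example treating `5.9.0` as newer than `5.23.0`.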