loki-mode 5.23.0 → 5.26.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (52) hide show
  1. package/README.md +28 -160
  2. package/SKILL.md +8 -8
  3. package/VERSION +1 -1
  4. package/autonomy/completion-council.sh +760 -0
  5. package/autonomy/hooks/quality-gate.sh +51 -0
  6. package/autonomy/hooks/session-init.sh +54 -0
  7. package/autonomy/hooks/store-episode.sh +54 -0
  8. package/autonomy/hooks/track-metrics.sh +16 -0
  9. package/autonomy/hooks/validate-bash.sh +53 -0
  10. package/autonomy/loki +247 -40
  11. package/autonomy/run.sh +153 -40
  12. package/bin/postinstall.js +27 -3
  13. package/dashboard/__init__.py +1 -1
  14. package/dashboard/server.py +211 -7
  15. package/dashboard/static/index.html +4253 -0
  16. package/docs/COMPARISON.md +21 -18
  17. package/docs/COMPETITIVE-ANALYSIS.md +39 -15
  18. package/docs/INSTALLATION.md +52 -12
  19. package/docs/SYNERGY-ROADMAP.md +4 -4
  20. package/docs/SYNERGY-TASKS.md +9 -4
  21. package/docs/TOOL-INTEGRATION.md +4 -4
  22. package/docs/architecture/DASHBOARD_V2_ARCHITECTURE.md +7 -7
  23. package/docs/auto-claude-comparison.md +10 -10
  24. package/docs/cursor-comparison.md +3 -3
  25. package/docs/dashboard-guide.md +82 -23
  26. package/docs/thick2thin.md +1 -1
  27. package/events/bus.py +8 -4
  28. package/memory/storage.py +26 -5
  29. package/package.json +7 -16
  30. package/providers/claude.sh +7 -7
  31. package/providers/codex.sh +16 -19
  32. package/providers/gemini.sh +8 -8
  33. package/references/competitive-analysis.md +2 -2
  34. package/references/cursor-learnings.md +2 -2
  35. package/references/multi-provider.md +13 -13
  36. package/skills/00-index.md +1 -1
  37. package/skills/agents.md +1 -1
  38. package/skills/github-integration.md +3 -3
  39. package/skills/model-selection.md +26 -1
  40. package/skills/parallel-workflows.md +1 -1
  41. package/skills/providers.md +14 -11
  42. package/skills/testing.md +1 -1
  43. package/Dockerfile +0 -86
  44. package/Dockerfile.sandbox +0 -254
  45. package/autonomy/.loki/dashboard/index.html +0 -2768
  46. package/dashboard/Dockerfile +0 -79
  47. package/dashboard/docker-compose.yml +0 -47
  48. package/docker-compose.yml +0 -37
  49. package/docs/loki-mode-presentation.gif +0 -0
  50. package/docs/loki-mode-presentation.pptx +0 -0
  51. package/docs/screenshots/dashboard-agents.png +0 -0
  52. package/docs/screenshots/dashboard-tasks.png +0 -0
package/README.md CHANGED
@@ -2,12 +2,15 @@
2
2
 
3
3
  **The First Truly Autonomous Multi-Agent Startup System**
4
4
 
5
+ [![npm version](https://img.shields.io/npm/v/loki-mode)](https://www.npmjs.com/package/loki-mode)
6
+ [![npm downloads](https://img.shields.io/npm/dw/loki-mode)](https://www.npmjs.com/package/loki-mode)
7
+ [![GitHub stars](https://img.shields.io/github/stars/asklokesh/loki-mode)](https://github.com/asklokesh/loki-mode)
8
+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
5
9
  [![Claude Code](https://img.shields.io/badge/Claude-Code-orange)](https://claude.ai)
6
10
  [![Agent Types](https://img.shields.io/badge/Agent%20Types-41-blue)]()
7
11
  [![Loki Mode](https://img.shields.io/badge/Loki%20Mode-98.78%25%20Pass%401-blueviolet)](benchmarks/results/)
8
12
  [![HumanEval](https://img.shields.io/badge/HumanEval-98.17%25%20Pass%401-brightgreen)](benchmarks/results/)
9
13
  [![SWE-bench](https://img.shields.io/badge/SWE--bench-99.67%25%20Patch%20Gen-brightgreen)](benchmarks/results/)
10
- [![License](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
11
14
 
12
15
  **[Documentation Website](https://asklokesh.github.io/loki-mode/)** | **[Architecture](https://asklokesh.github.io/loki-mode/blog/#architecture)** | **[Research](https://asklokesh.github.io/loki-mode/blog/#research)** | **[Comparisons](https://asklokesh.github.io/loki-mode/blog/#comparisons)**
13
16
 
@@ -37,81 +40,22 @@
37
40
 
38
41
  ## Usage
39
42
 
40
- ### Option 1: Claude Code Skill (Recommended)
41
-
42
- ```bash
43
- # Install
44
- git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
45
-
46
- # Run
47
- claude --dangerously-skip-permissions
48
-
49
- # Then say:
50
- Loki Mode with PRD at ./my-prd.md
51
- ```
52
-
53
- ### Option 2: Shell Script
54
-
55
- ```bash
56
- # Clone repo
57
- git clone https://github.com/asklokesh/loki-mode.git
58
- cd loki-mode
59
-
60
- # Run directly
61
- ./autonomy/run.sh ./my-prd.md
62
- ```
63
-
64
- ### Option 3: npm
43
+ ### Option 1: npm (Recommended)
65
44
 
66
45
  ```bash
67
46
  npm install -g loki-mode
68
47
  loki start ./my-prd.md
69
48
  ```
70
49
 
71
- ### Option 4: Homebrew (macOS/Linux)
72
-
73
- ```bash
74
- brew install asklokesh/tap/loki-mode
75
- loki start ./my-prd.md
76
- ```
77
-
78
- ### Option 5: Docker
79
-
80
- ```bash
81
- docker run -v $(pwd):/workspace asklokesh/loki-mode:5.1.1 ./my-prd.md
82
- ```
83
-
84
- ### Option 6: VS Code Extension
85
-
86
- Install directly from the VS Code Marketplace for a visual interface:
87
-
88
- ```bash
89
- # From VS Code
90
- 1. Open Extensions (Cmd+Shift+X / Ctrl+Shift+X)
91
- 2. Search "loki-mode"
92
- 3. Click Install
93
-
94
- # Or via command line
95
- code --install-extension asklokesh.loki-mode
96
- ```
50
+ ### Option 2: Claude Code Skill
97
51
 
98
- **Important:** Start the Loki Mode server before using the extension:
99
52
  ```bash
100
- loki start # If using CLI
101
- # or
102
- ./autonomy/run.sh # If running from source
53
+ git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
54
+ claude --dangerously-skip-permissions
55
+ # Then say: Loki Mode with PRD at ./my-prd.md
103
56
  ```
104
57
 
105
- **Extension Features:**
106
- - Start/Stop/Pause/Resume sessions from the activity bar
107
- - Real-time task progress in the sidebar
108
- - Provider selection (Claude, Codex, Gemini)
109
- - Status bar showing current phase and progress
110
- - Quick actions menu (Cmd+Shift+L / Ctrl+Shift+L)
111
-
112
- [View on Marketplace](https://marketplace.visualstudio.com/items?itemName=asklokesh.loki-mode)
113
-
114
- See [Installation Guide](docs/INSTALLATION.md) for more details.
58
+ Also available via **Homebrew**, **Docker**, **VS Code Extension**, and **direct shell script**. See the [Installation Guide](docs/INSTALLATION.md) for all 6 installation methods and detailed instructions.
115
59
 
116
60
  ### Multi-Provider Support (v5.0.0)
117
61
 
@@ -366,26 +310,7 @@ There is **NEVER** a "finished" state. After completing the PRD, Loki Mode:
366
310
 
367
311
  ## Quick Start
368
312
 
369
- ### **1. Install**
370
-
371
- ```bash
372
- # Option A: npm (recommended)
373
- npm install -g loki-mode
374
-
375
- # Option B: Homebrew (macOS/Linux)
376
- brew tap asklokesh/tap && brew install loki-mode
377
- loki-mode-install-skill # Set up Claude Code integration
378
-
379
- # Option C: Docker
380
- docker pull asklokesh/loki-mode:5.0.0
381
-
382
- # Option D: Git clone
383
- git clone https://github.com/asklokesh/loki-mode.git ~/.claude/skills/loki-mode
384
- ```
385
-
386
- See [Installation Guide](docs/INSTALLATION.md) for detailed instructions.
387
-
388
- ### **2. Create a PRD**
313
+ ### **1. Write a PRD**
389
314
 
390
315
  ```markdown
391
316
  # Product: AI-Powered Todo App
@@ -409,38 +334,20 @@ Build a todo app with AI-powered task suggestions and deadline predictions.
409
334
 
410
335
  Save as `my-prd.md`.
411
336
 
412
- ### **3. Run Loki Mode**
337
+ ### **2. Run It**
413
338
 
414
339
  ```bash
415
- # Using the CLI (v4.1.0)
416
340
  loki start ./my-prd.md
417
-
418
- # Or using run.sh directly
419
- ./autonomy/run.sh ./my-prd.md
420
-
421
- # Or manual mode in Claude Code
422
- claude --dangerously-skip-permissions
423
- > Loki Mode with PRD at ./my-prd.md
424
341
  ```
425
342
 
426
- ### **4. Monitor Progress**
343
+ ### **3. Monitor and Walk Away**
427
344
 
428
345
  ```bash
429
- # Check status
430
- loki status
431
-
432
- # Open dashboard in browser
433
- loki dashboard
434
-
435
- # Or watch terminal output
436
- watch -n 2 cat .loki/STATUS.txt
346
+ loki status # Check progress
347
+ loki dashboard # Open web dashboard
437
348
  ```
438
349
 
439
- ### **5. Walk Away**
440
-
441
- Seriously. Go get coffee. It'll be deployed when you get back.
442
-
443
- **That's it.** No configuration. No manual steps. No intervention.
350
+ Go get coffee. It'll be deployed when you get back.
444
351
 
445
352
  ---
446
353
 
@@ -507,56 +414,7 @@ Loki Mode has **41 predefined agent types** organized into **7 specialized swarm
507
414
  ### **Orchestration (4 types)**
508
415
  `orch-planner` `orch-sub-planner` `orch-judge` `orch-coordinator`
509
416
 
510
- <details>
511
- <summary><strong>View All 41 Agent Types with Capabilities</strong></summary>
512
-
513
- | Swarm | Agent | Capabilities |
514
- |-------|-------|--------------|
515
- | **Engineering** | `eng-frontend` | React/Vue/Svelte, TypeScript, Tailwind, accessibility, responsive design |
516
- | | `eng-backend` | Node/Python/Go, REST/GraphQL, auth, business logic, middleware |
517
- | | `eng-database` | PostgreSQL/MySQL/MongoDB, migrations, query optimization, indexing |
518
- | | `eng-mobile` | React Native/Flutter/Swift/Kotlin, offline-first, push notifications |
519
- | | `eng-api` | OpenAPI specs, SDK generation, versioning, webhooks, rate limiting |
520
- | | `eng-qa` | Unit/integration/E2E tests, coverage, automation, test data |
521
- | | `eng-perf` | Profiling, benchmarking, optimization, caching, load testing |
522
- | | `eng-infra` | Docker, K8s manifests, IaC, networking, security hardening |
523
- | **Operations** | `ops-devops` | CI/CD pipelines, GitHub Actions, GitLab CI, Jenkins |
524
- | | `ops-sre` | Reliability, SLOs/SLIs, capacity planning, runbooks |
525
- | | `ops-security` | SAST/DAST, pen testing, vulnerability management |
526
- | | `ops-monitor` | Observability, Datadog/Grafana, alerting, dashboards |
527
- | | `ops-incident` | Incident response, RCA, post-mortems, communication |
528
- | | `ops-release` | Versioning, changelogs, blue-green, canary, rollbacks |
529
- | | `ops-cost` | Cloud cost optimization, right-sizing, FinOps |
530
- | | `ops-compliance` | SOC2, GDPR, HIPAA, PCI-DSS, audit preparation |
531
- | **Business** | `biz-marketing` | Landing pages, SEO, content, email campaigns, social media |
532
- | | `biz-sales` | CRM setup, outreach, demos, proposals, pipeline |
533
- | | `biz-finance` | Billing (Stripe), invoicing, metrics, runway, pricing |
534
- | | `biz-legal` | ToS, privacy policy, contracts, IP protection |
535
- | | `biz-support` | Help docs, FAQs, ticket system, chatbot, knowledge base |
536
- | | `biz-hr` | Job posts, recruiting, onboarding, culture docs |
537
- | | `biz-investor` | Pitch decks, investor updates, data room, cap table |
538
- | | `biz-partnerships` | BD outreach, integrations, co-marketing, API partnerships |
539
- | **Data** | `data-ml` | Model training, MLOps, feature engineering, inference |
540
- | | `data-eng` | ETL pipelines, data warehousing, dbt, Airflow |
541
- | | `data-analytics` | Product analytics, A/B tests, dashboards, insights |
542
- | **Product** | `prod-pm` | Backlog grooming, prioritization, roadmap, specs |
543
- | | `prod-design` | Design system, Figma, UX patterns, prototypes |
544
- | | `prod-techwriter` | API docs, guides, tutorials, release notes |
545
- | **Growth** | `growth-hacker` | Growth experiments, viral loops, referral programs |
546
- | | `growth-community` | Community building, Discord/Slack, ambassador programs |
547
- | | `growth-success` | Customer success, health scoring, churn prevention |
548
- | | `growth-lifecycle` | Email lifecycle, in-app messaging, re-engagement |
549
- | **Review** | `review-code` | Code quality, design patterns, SOLID, maintainability |
550
- | | `review-business` | Requirements alignment, business logic, edge cases |
551
- | | `review-security` | Vulnerabilities, auth/authz, OWASP Top 10 |
552
- | **Orchestration** | `orch-planner` | Task decomposition, dependency analysis, work distribution |
553
- | | `orch-sub-planner` | Domain-specific planning, recursive task breakdown |
554
- | | `orch-judge` | Cycle continuation decisions, goal assessment, escalation |
555
- | | `orch-coordinator` | Cross-stream coordination, merge decisions, conflict resolution |
556
-
557
- </details>
558
-
559
- See [references/agent-types.md](references/agent-types.md) for complete agent type definitions.
417
+ See [Agent Types](references/agent-types.md) for the full list of 41 specialized agents with detailed capabilities.
560
418
 
561
419
  ---
562
420
 
@@ -809,10 +667,20 @@ Run the comprehensive test suite:
809
667
  Contributions welcome! Please:
810
668
  1. Read [SKILL.md](SKILL.md) to understand the core architecture
811
669
  2. Review [skills/00-index.md](skills/00-index.md) for module organization (v3.0+)
812
- 3. Check [references/agents.md](references/agents.md) for agent definitions
670
+ 3. Check [references/agent-types.md](references/agent-types.md) for agent definitions
813
671
  4. Open an issue for bugs or feature requests
814
672
  5. Submit PRs with clear descriptions and tests
815
673
 
674
+ **Dev setup:**
675
+ ```bash
676
+ git clone https://github.com/asklokesh/loki-mode.git && cd loki-mode
677
+ npm install # Install dependencies
678
+ bash -n autonomy/run.sh # Validate shell scripts
679
+ cd dashboard-ui && npm ci && npm run build:all # Build dashboard
680
+ ```
681
+
682
+ See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed development guidelines.
683
+
816
684
  ---
817
685
 
818
686
  ## License
package/SKILL.md CHANGED
@@ -3,7 +3,7 @@ name: loki-mode
3
3
  description: Multi-agent autonomous startup system. Triggers on "Loki Mode". Takes PRD to deployed product with zero human intervention. Requires --dangerously-skip-permissions flag.
4
4
  ---
5
5
 
6
- # Loki Mode v5.23.0
6
+ # Loki Mode v5.26.2
7
7
 
8
8
  **You are an autonomous agent. You make decisions. You do not ask questions. You do not stop.**
9
9
 
@@ -96,8 +96,8 @@ These rules are ABSOLUTE. Violating them is a critical failure.
96
96
 
97
97
  **Default (v5.3.0):** Haiku disabled for quality. Use `--allow-haiku` or `LOKI_ALLOW_HAIKU=true` to enable.
98
98
 
99
- | Task Type | Tier | Claude (default) | Claude (--allow-haiku) | Codex | Gemini |
100
- |-----------|------|------------------|------------------------|-------|--------|
99
+ | Task Type | Tier | Claude (default) | Claude (--allow-haiku) | Codex (GPT-5.3) | Gemini |
100
+ |-----------|------|------------------|------------------------|------------------|--------|
101
101
  | PRD analysis, architecture, system design | **planning** | opus | opus | effort=xhigh | thinking=high |
102
102
  | Feature implementation, complex bugs | **development** | opus | sonnet | effort=high | thinking=medium |
103
103
  | Code review (always 3 parallel reviewers) | **development** | opus | sonnet | effort=high | thinking=medium |
@@ -106,7 +106,7 @@ These rules are ABSOLUTE. Violating them is a critical failure.
106
106
 
107
107
  **Parallelization rule (Claude only):** Launch up to 10 agents simultaneously for independent tasks.
108
108
 
109
- **Degraded mode (Codex/Gemini):** No parallel agents or Task tool. Runs RARV cycle sequentially. See `skills/model-selection.md`.
109
+ **Degraded mode (Codex/Gemini):** No parallel agents or Task tool. Codex has MCP support. Runs RARV cycle sequentially. See `skills/model-selection.md`.
110
110
 
111
111
  **Git worktree parallelism:** For true parallel feature development, use `--parallel` flag with run.sh. See `skills/parallel-workflows.md`.
112
112
 
@@ -193,7 +193,7 @@ claude --dangerously-skip-permissions
193
193
 
194
194
  # With provider selection (supports .md and .json PRDs)
195
195
  ./autonomy/run.sh --provider claude ./prd.md # Default, full features
196
- ./autonomy/run.sh --provider codex ./prd.json # GPT-5.2 Codex, degraded mode
196
+ ./autonomy/run.sh --provider codex ./prd.json # GPT-5.3 Codex, degraded mode
197
197
  ./autonomy/run.sh --provider gemini ./prd.md # Gemini 3 Pro, degraded mode
198
198
 
199
199
  # Or via CLI wrapper
@@ -204,8 +204,8 @@ loki start --provider codex ./prd.md
204
204
  ```
205
205
 
206
206
  **Provider capabilities:**
207
- - **Claude**: Full features (Task tool, parallel agents, MCP, 200K context)
208
- - **Codex**: Degraded mode (sequential only, no Task tool, 128K context)
207
+ - **Claude**: Opus 4.6, 1M context (beta), 128K output, adaptive thinking, agent teams, full features (Task tool, parallel agents, MCP)
208
+ - **Codex**: GPT-5.3, 400K context, 128K output, MCP support, --full-auto mode, degraded (sequential only, no Task tool)
209
209
  - **Gemini**: Degraded mode (sequential only, no Task tool, 1M context)
210
210
 
211
211
  ---
@@ -260,4 +260,4 @@ Auto-detected or force with `LOKI_COMPLEXITY`:
260
260
 
261
261
  ---
262
262
 
263
- **v5.23.0 | Dashboard File-Based API + All Components Functional | ~270 lines core**
263
+ **v5.26.2 | Dashboard/shell/UX fixes, security hardening | ~270 lines core**
package/VERSION CHANGED
@@ -1 +1 @@
1
- 5.23.0
1
+ 5.26.2