loki-mode 6.71.1 → 6.72.0

Files changed (91)
  1. package/README.md +9 -1
  2. package/SKILL.md +2 -2
  3. package/VERSION +1 -1
  4. package/autonomy/hooks/migration-hooks.sh +26 -0
  5. package/autonomy/loki +429 -92
  6. package/autonomy/run.sh +219 -38
  7. package/dashboard/__init__.py +1 -1
  8. package/dashboard/server.py +101 -19
  9. package/docs/INSTALLATION.md +20 -11
  10. package/docs/bug-fixes/agent-01-cli-fixes.md +101 -0
  11. package/docs/bug-fixes/agent-02-purplelab-fixes.md +88 -0
  12. package/docs/bug-fixes/agent-03-dashboard-fixes.md +119 -0
  13. package/docs/bug-fixes/agent-04-memory-fixes.md +105 -0
  14. package/docs/bug-fixes/agent-05-provider-fixes.md +86 -0
  15. package/docs/bug-fixes/agent-06-integration-fixes.md +101 -0
  16. package/docs/bug-fixes/agent-07-dash-run-fixes.md +101 -0
  17. package/docs/bug-fixes/agent-08-docker-fixes.md +164 -0
  18. package/docs/bug-fixes/agent-09-e2e-build-fixes.md +69 -0
  19. package/docs/bug-fixes/agent-10-e2e-fullstack-fixes.md +102 -0
  20. package/docs/bug-fixes/agent-11-e2e-session-fixes.md +70 -0
  21. package/docs/bug-fixes/agent-12-scenario-fixes.md +120 -0
  22. package/docs/bug-fixes/agent-13-enterprise-fixes.md +143 -0
  23. package/docs/bug-fixes/agent-14-uat-newuser-fixes.md +88 -0
  24. package/docs/bug-fixes/agent-15-uat-poweruser-fixes.md +132 -0
  25. package/docs/bug-fixes/agent-19-code-review.md +316 -0
  26. package/docs/bug-fixes/agent-20-architecture-review.md +331 -0
  27. package/docs/competitive/bolt-new-analysis.md +579 -0
  28. package/docs/competitive/emergence-others-analysis.md +605 -0
  29. package/docs/competitive/replit-lovable-analysis.md +622 -0
  30. package/docs/test-scenarios/edge-cases.md +813 -0
  31. package/docs/test-scenarios/enterprise-scenarios.md +732 -0
  32. package/mcp/__init__.py +1 -1
  33. package/mcp/server.py +49 -5
  34. package/memory/consolidation.py +33 -0
  35. package/memory/embeddings.py +10 -1
  36. package/memory/engine.py +83 -38
  37. package/memory/retrieval.py +36 -0
  38. package/memory/storage.py +56 -4
  39. package/memory/token_economics.py +14 -2
  40. package/memory/vector_index.py +36 -7
  41. package/package.json +1 -1
  42. package/providers/gemini.sh +89 -2
  43. package/templates/README.md +1 -1
  44. package/templates/cli-tool.md +30 -0
  45. package/templates/dashboard.md +4 -0
  46. package/templates/data-pipeline.md +4 -0
  47. package/templates/discord-bot.md +47 -0
  48. package/templates/game.md +4 -0
  49. package/templates/microservice.md +4 -0
  50. package/templates/npm-library.md +4 -0
  51. package/templates/rest-api-auth.md +50 -20
  52. package/templates/rest-api.md +15 -0
  53. package/templates/saas-starter.md +1 -1
  54. package/templates/slack-bot.md +36 -0
  55. package/templates/static-landing-page.md +9 -1
  56. package/templates/web-scraper.md +4 -0
  57. package/web-app/dist/assets/Badge-CeBkFjo6.js +1 -0
  58. package/web-app/dist/assets/Button-yuhqo8Fq.js +1 -0
  59. package/web-app/dist/assets/{Card-B1bV4syB.js → Card-BG17vsX0.js} +1 -1
  60. package/web-app/dist/assets/{HomePage-CZTV6Nea.js → HomePage-BMSQ7Apj.js} +3 -3
  61. package/web-app/dist/assets/{LoginPage-D4UdURJc.js → LoginPage-aH_6iolg.js} +1 -1
  62. package/web-app/dist/assets/{NotFoundPage-CCLSeL6j.js → NotFoundPage-Di8cNtB1.js} +1 -1
  63. package/web-app/dist/assets/ProjectPage-BtRssmw9.js +285 -0
  64. package/web-app/dist/assets/ProjectsPage-B-FTFagc.js +6 -0
  65. package/web-app/dist/assets/{SettingsPage-Xuv8EfAg.js → SettingsPage-DIJPBla4.js} +1 -1
  66. package/web-app/dist/assets/TeamsPage--19fNX7w.js +36 -0
  67. package/web-app/dist/assets/TemplatesPage-ChUQNOOv.js +11 -0
  68. package/web-app/dist/assets/TerminalOutput-Dwrzecyl.js +31 -0
  69. package/web-app/dist/assets/activity-BNRWeu9N.js +6 -0
  70. package/web-app/dist/assets/{arrow-left-CaGtolHc.js → arrow-left-Ce6g1_YE.js} +1 -1
  71. package/web-app/dist/assets/circle-alert-LIndawHL.js +11 -0
  72. package/web-app/dist/assets/clock-Bpj4VPlP.js +6 -0
  73. package/web-app/dist/assets/{external-link-CazyUyav.js → external-link-BhhdF0iQ.js} +1 -1
  74. package/web-app/dist/assets/folder-open-CM2LgfxI.js +11 -0
  75. package/web-app/dist/assets/index-8-KpWWq7.css +1 -0
  76. package/web-app/dist/assets/index-kPDW4e_b.js +236 -0
  77. package/web-app/dist/assets/lock-sAk3Xe54.js +16 -0
  78. package/web-app/dist/assets/search-CR-2i9by.js +6 -0
  79. package/web-app/dist/assets/server-DuFh4ymA.js +26 -0
  80. package/web-app/dist/assets/trash-2-BmkkT8V_.js +11 -0
  81. package/web-app/dist/index.html +2 -2
  82. package/web-app/server.py +1321 -53
  83. package/web-app/dist/assets/Badge-CBUx2PjL.js +0 -6
  84. package/web-app/dist/assets/Button-DsRiznlh.js +0 -21
  85. package/web-app/dist/assets/ProjectPage-D0w_X9tG.js +0 -237
  86. package/web-app/dist/assets/ProjectsPage-ByYxDlKC.js +0 -16
  87. package/web-app/dist/assets/TemplatesPage-BKWN07mc.js +0 -1
  88. package/web-app/dist/assets/TerminalOutput-Dj98V8Z-.js +0 -51
  89. package/web-app/dist/assets/clock-C_CDmobx.js +0 -11
  90. package/web-app/dist/assets/index-D452pFGl.css +0 -1
  91. package/web-app/dist/assets/index-Df4_kgLY.js +0 -196
package/docs/competitive/emergence-others-analysis.md
@@ -0,0 +1,605 @@
1
+ # Competitive Analysis: Emergence AI, Rork, Claude Code CLI, Codex CLI (March 2026)
2
+
3
+ **Agent 18 -- Loki Mode Competitive Intelligence**
4
+ **Version:** 1.3 (3 feedback loops completed) | **Date:** 2026-03-24 | **Loki Mode:** v6.71.1
5
+
6
+ ---
7
+
8
+ ## Table of Contents
9
+
10
+ 1. [Executive Summary](#executive-summary)
11
+ 2. [Platform Deep Dives](#platform-deep-dives)
12
+    - [Emergence AI (Agent-E)](#1-emergence-ai-agent-e)
13
+    - [Rork](#2-rork)
14
+    - [Anthropic Claude Code CLI](#3-anthropic-claude-code-cli)
15
+    - [OpenAI Codex CLI](#4-openai-codex-cli)
16
+ 3. [Feature Comparison Matrix](#feature-comparison-matrix)
17
+ 4. [Market Map](#market-map)
18
+ 5. [Gaps and Opportunities](#gaps-and-opportunities)
19
+ 6. [Differentiation Strategy for Loki Mode](#differentiation-strategy-for-loki-mode)
20
+ 7. [Strategic Recommendations](#strategic-recommendations)
21
+ 8. [Key Risk: Claude Code Agent SDK](#key-risk-claude-code-agent-sdk)
22
+ 9. [Sources](#sources)
23
+
24
+ ---
25
+
26
+ ## Executive Summary
27
+
28
+ The AI coding agent market has fragmented into four distinct segments: enterprise workflow automation (Emergence AI), no-code mobile app generation (Rork), developer-centric coding assistants (Claude Code CLI, Codex CLI), and autonomous PRD-to-deployment systems (Loki Mode). While Claude Code and Codex CLI dominate the developer coding assistant segment with strong model capabilities and broad distribution, none of the analyzed platforms deliver the full pipeline that Loki Mode targets -- taking a PRD through planning, implementation, testing, code review, and deployment with multi-provider flexibility and self-hosted operation.
29
+
30
+ **Key finding:** The market has strong tools for individual coding tasks but lacks integrated autonomous systems that handle the complete software development lifecycle. This is Loki Mode's primary differentiation opportunity.
31
+
32
+ ---
33
+
34
+ ## Platform Deep Dives
35
+
36
+ ### 1. Emergence AI (Agent-E)
37
+
38
+ **Website:** https://www.emergence.ai
39
+ **GitHub:** https://github.com/EmergenceAI/Agent-E (~1,200 stars)
40
+ **License:** Open source (Agent-E); proprietary (Emergence Platform)
41
+
42
+ #### What It Does
43
+
44
+ Emergence AI operates in two modes:
45
+
46
+ **Agent-E (Open Source):** A web automation agent built on the AG2 framework (formerly AutoGen). It translates natural language commands into browser actions using a skills-based architecture. Core capabilities:
47
+ - Web form automation (not PDF forms)
48
+ - E-commerce navigation and product search
49
+ - Content location across websites
50
+ - Media interaction and playback control
51
+ - JIRA task automation
52
+ - Web search and information gathering
53
+
54
+ **Emergence Platform (Enterprise):** An enterprise agentic platform with broader ambitions:
55
+ - Goal-to-Agent Translation (ACA) -- automatically creates agents from business goals
56
+ - Multi-agent orchestration engine with async execution, retries, and dependency management
57
+ - Persistent shared memory and state across tasks
58
+ - Skills and integrations with automatic connector code generation
59
+ - Policy-aware governance with enterprise access controls
60
+ - VPC and on-premises deployment options
61
+
62
+ #### Architecture
63
+
64
+ Agent-E uses a two-agent model:
65
+ - **User Proxy Agent:** Executes skills on behalf of the user
66
+ - **Browser Navigation Agent:** Contains all web interaction skills
67
+
68
+ Skills are divided into:
69
+ - **Sensing Skills:** `geturl`, `get_dom_with_content_type` -- understand webpage state
70
+ - **Action Skills:** `click`, `enter text`, `open url` -- interact with the web
71
+
72
+ The system uses configured skills that map to human browser interactions rather than allowing LLMs to write arbitrary code. This approach prioritizes safety and predictability. On the WebVoyager benchmark (643 tasks across 15 websites), Agent-E achieves a 73.2% task completion rate, significantly outperforming WILBUR (52.6%) and the multimodal Web Voyager Agent (57.1%).
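The skills-based design described above can be illustrated with a minimal sketch. This is not Agent-E's actual code or API: `SkillRegistry`, the in-memory `page` dictionary, and the `open_url` spelling are hypothetical names chosen for illustration. The point it shows is that the model may only invoke named skills from a fixed registry, so a request for anything else is refused rather than executed.

```python
from typing import Callable, Dict

# Toy stand-in for a browser session (assumption: a real agent drives a browser here).
page = {"url": "about:blank", "clicked": []}


class SkillRegistry:
    """Holds the fixed set of skills the model may invoke by name."""

    def __init__(self) -> None:
        self._skills: Dict[str, Callable[..., str]] = {}
        self.kinds: Dict[str, str] = {}

    def skill(self, name: str, kind: str):
        """Decorator that registers a function as a 'sensing' or 'action' skill."""
        def decorator(fn: Callable[..., str]) -> Callable[..., str]:
            self._skills[name] = fn
            self.kinds[name] = kind
            return fn
        return decorator

    def invoke(self, name: str, **kwargs) -> str:
        # Anything outside the registry is refused rather than executed.
        if name not in self._skills:
            raise PermissionError(f"unknown skill: {name}")
        return self._skills[name](**kwargs)


registry = SkillRegistry()


@registry.skill("geturl", kind="sensing")
def get_url() -> str:
    """Sensing skill: report the current page state without changing it."""
    return page["url"]


@registry.skill("open_url", kind="action")
def open_url(url: str) -> str:
    """Action skill: navigate the (toy) browser to a new URL."""
    page["url"] = url
    return f"opened {url}"


@registry.skill("click", kind="action")
def click(selector: str) -> str:
    """Action skill: record a click on the given element selector."""
    page["clicked"].append(selector)
    return f"clicked {selector}"
```

A planner would call `registry.invoke("open_url", url=...)` and then `registry.invoke("geturl")` to confirm the result. A model-proposed name that was never registered raises `PermissionError`.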
73
+
74
+ The Emergence Platform adds an orchestration layer (the "Emergence Orchestrator") that acts as an autonomous meta-agent, planning, executing, verifying, and iterating workflows by routing tasks to optimal AI models.
75
+
76
+ #### Pricing
77
+
78
+ - **Agent-E:** Free (open source, bring your own LLM API keys)
79
+ - **Emergence Platform:** Custom enterprise pricing (annual contracts based on solution complexity, number of solutions, and usage volume). Typically starts in the thousands of dollars per month.
80
+
81
+ #### Limitations
82
+
83
+ - Agent-E is specifically a **web automation** tool, not a general-purpose coding agent
84
+ - Local LLM support exists but is not thoroughly tested
85
+ - Requires active LLM API connection for cloud models
86
+ - Constrained by AG2 framework limitations
87
+ - No code generation, refactoring, or software development capabilities in the open-source Agent-E
88
+ - Enterprise platform pricing is opaque and requires sales engagement
89
+ - Not designed for software development lifecycle management
90
+
91
+ #### Target User
92
+
93
+ Enterprise teams needing web workflow automation and data pipeline orchestration. Not developers building software.
94
+
95
+ ---
96
+
97
+ ### 2. Rork
98
+
99
+ **Website:** https://rork.com
100
+ **License:** Proprietary (closed-source platform; generated code is user-owned)
101
+
102
+ #### What It Does
103
+
104
+ Rork is an AI-powered no-code platform specifically for building mobile applications. Users describe app ideas in natural language, and Rork generates production-ready mobile apps using React Native and Expo.
105
+
106
+ Core capabilities:
107
+ - Natural language to mobile app generation (iOS + Android + Web)
108
+ - Live preview via Expo Go with QR code scanning
109
+ - Full code export (React Native source code)
110
+ - App Store and Play Store deployment support
111
+ - Over-the-air updates via Expo Application Services
112
+ - **Rork Max** (Feb 2026): Native Swift code generation for the entire Apple ecosystem (iPhone, iPad, Apple Watch, Apple TV, Vision Pro, iMessage), unlocking AR/LiDAR, Metal graphics, Home Screen widgets, Dynamic Island, Live Activities, Siri Intents, HealthKit, and Core ML
113
+
114
+ #### Architecture
115
+
116
+ - **AI Engine:** Multi-provider setup using GeminiProvider and ClaudeProvider with system prompts for Expo SDK 54. A RorkAgent runs an 11-tool agentic loop.
117
+ - **Frontend Infrastructure:** Built with Next.js, Zustand state management, Supabase backend
118
+ - **Live Preview:** Serverless esbuild API compiles apps to React Native Web in a phone-sized iframe, updating in real-time
119
+ - **Deployment:** Expo Application Services handles build, distribution, and OTA updates
120
+
121
+ #### Pricing
122
+
123
+ Credit-based subscription model (credits reset monthly, no rollover):
124
+ | Plan | Price | Credits/Month | Key Features |
125
+ |------|-------|--------------|--------------|
126
+ | Free | $0 | 35 (5/day) | Basic prototyping |
127
+ | Junior | $25/mo | ~250 | Idea validation, demos |
128
+ | Senior | $100/mo | ~750 | Full MVP building and iteration |
129
+
130
+ Each AI interaction consumes one credit. Complex or iterative builds can consume credits rapidly.
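The effective price of one interaction follows from the table by simple division. The sketch below is illustrative only: it treats the approximate "~250" and "~750" credit counts as exact, and the function names are made up for this example.

```python
# Plan figures copied from the pricing table above; the "~" credit counts
# are treated as exact for illustration.
PLANS = {
    "Junior": {"price_usd": 25.0, "credits": 250},
    "Senior": {"price_usd": 100.0, "credits": 750},
}


def cost_per_credit(price_usd: float, credits: int) -> float:
    """Effective dollar price of one AI interaction (one credit)."""
    return price_usd / credits


def cost_of_build(plan: str, interactions: int) -> float:
    """Dollar cost of a build that takes `interactions` prompts on `plan`."""
    p = PLANS[plan]
    return interactions * cost_per_credit(p["price_usd"], p["credits"])


for name, p in PLANS.items():
    per_credit = cost_per_credit(p["price_usd"], p["credits"])
    print(f"{name}: ${per_credit:.3f} per credit")
```

Under these assumptions the Junior plan works out to $0.10 per credit and the Senior plan to roughly $0.13, so the larger plan buys more volume but not a lower unit price.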
131
+
132
+ #### Limitations
133
+
134
+ - **Stability issues:** Users report crashes, black screens, persistent errors, slow loading. Trustpilot average: 2.9/5
135
+ - **Publishing difficulties:** Many users report the "Publish" button failing, apps stuck in deployment limbo
136
+ - **Iteration fragility:** Generated React Native codebases accumulate state and navigation logic; each change risks breaking upstream code
137
+ - **Credit burn:** Fixing small UI issues consumes credits at the same rate as major features
138
+ - **Customer support:** Described as nearly non-existent by multiple reviewers
139
+ - **Mobile-only scope:** Cannot build web apps, backends, APIs, CLI tools, or anything outside mobile
140
+ - **No SDLC integration:** No code review, no testing framework, no CI/CD pipeline
141
+
142
+ #### Target User
143
+
144
+ Non-technical entrepreneurs and product managers who want to prototype or build mobile apps without coding. Not developer-centric.
145
+
146
+ ---
147
+
148
+ ### 3. Anthropic Claude Code CLI
149
+
150
+ **Website:** https://code.claude.com
151
+ **GitHub:** https://github.com/anthropics/claude-code
152
+ **License:** Proprietary (source-available, 51 contributors)
153
+
154
+ #### What It Does
155
+
156
+ Claude Code is Anthropic's agentic coding tool -- the most comprehensive developer coding assistant in the market as of March 2026. It operates across terminal, VS Code, JetBrains, a desktop app, and the web.
157
+
158
+ Core capabilities:
159
+ - Codebase understanding and multi-file editing
160
+ - Command execution and shell integration
161
+ - Git workflows (commits, branches, PRs)
162
+ - Multi-agent teams with coordinated sub-agents in isolated git worktrees
163
+ - Code review (standalone feature launched March 9, 2026)
164
+ - MCP (Model Context Protocol) for external tool integration
165
+ - CLAUDE.md project memory files
166
+ - Auto-memory that persists learnings across sessions
167
+ - Custom skills/commands (e.g., `/review-pr`, `/deploy-staging`)
168
+ - Hooks for pre/post action automation
169
+ - Scheduled/recurring tasks (cloud and local) via `/loop` and `/schedule`
170
+ - Third-party provider support (Amazon Bedrock, Google Vertex AI, Microsoft Foundry) -- still Claude models only
171
+ - GitHub Actions and GitLab CI/CD integration
172
+ - Slack integration (bug reports to PRs)
173
+ - Chrome extension for debugging live web apps
174
+ - Remote Control (phone/browser access to sessions)
175
+ - Agent SDK for building custom agents
176
+
177
+ #### Architecture
178
+
179
+ - **Runtime:** Built with Bun for compilation, CommanderJS for CLI structure, React Ink for terminal rendering (ANSI escape codes)
180
+ - **Models:** Claude Opus 4.5 (80.9% SWE-bench), Claude Sonnet 4.5, Claude Haiku 4.5
181
+ - **Context:** 200K token context window
182
+ - **Multi-agent:** Coordinated sub-agents with shared task lists, dependency tracking, and direct agent-to-agent communication via git worktrees
183
+ - **Optimization:** 92% prefix caching reuse rate for cost reduction
184
+ - **Token usage:** Consumes 3-4x more tokens than Codex CLI for equivalent tasks but produces more thorough output
185
+
186
+ #### Pricing
187
+
188
+ | Plan | Price | Usage Limit | Notes |
189
+ |------|-------|-------------|-------|
190
+ | Free | $0 | Basic | Limited |
191
+ | Pro | $20/mo | 5x Free | 5-hour rolling window + 7-day cap |
192
+ | Max 5x | $100/mo | 5x Pro | For moderate power users |
193
+ | Max 20x | $200/mo | 20x Pro | For heavy usage |
194
+ | API (BYOK) | Pay-per-token | Unlimited | Haiku: $1/$5, Sonnet: $3/$15, Opus: $5/$25 per 1M tokens |
195
+
196
+ Prompt caching cuts the cost of cached input tokens by up to 90%. The Batch API offers a 50% discount for asynchronous processing.
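A rough way to turn the API (BYOK) prices into a per-run estimate is sketched below. It is an approximation under stated assumptions: the 90% reduction is applied only to the fraction of input tokens served from cache, any extra charge for writing to the cache is ignored, and `estimate_cost` is a hypothetical helper, not part of any SDK.

```python
# USD per 1M tokens as (input, output), copied from the API (BYOK) row above.
PRICES = {"haiku": (1.0, 5.0), "sonnet": (3.0, 15.0), "opus": (5.0, 25.0)}

# Assumption: cached input tokens cost 10% of the normal input price.
CACHED_INPUT_DISCOUNT = 0.90


def estimate_cost(model: str, input_tokens: int, output_tokens: int,
                  cached_fraction: float = 0.0) -> float:
    """Estimate the dollar cost of one run, ignoring cache-write surcharges."""
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    in_price, out_price = PRICES[model]
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    # Cached tokens are billed at the discounted rate, fresh tokens at full rate.
    billable_input = fresh + cached * (1.0 - CACHED_INPUT_DISCOUNT)
    return (billable_input * in_price + output_tokens * out_price) / 1_000_000


uncached = estimate_cost("opus", 1_000_000, 100_000)
cached = estimate_cost("opus", 1_000_000, 100_000, cached_fraction=0.92)
print(f"Opus, no cache: ${uncached:.2f}; 92% cached input: ${cached:.2f}")
```

With 1M input and 100K output tokens on Opus, the estimate is $7.50 uncached and about $3.36 at the 92% cache reuse rate quoted in the Architecture section. Output tokens dominate once input is mostly cached.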
197
+
198
+ #### Limitations
199
+
200
+ - **Rate limits are the primary complaint:** Users report hitting caps within 10-15 minutes of heavy Opus usage. Rolling windows (5-hour and 7-day) create unpredictable availability.
201
+ - **Cost at scale:** Heavy users can face $200+/month or unpredictable API costs with BYOK
202
+ - **Proprietary:** Not fully open source (source-available but not Apache-2.0)
203
+ - **Token-hungry:** 3-4x more tokens than Codex for equivalent tasks
204
+ - **Model family lock-in:** Supports third-party hosting (Amazon Bedrock, Google Vertex AI, Microsoft Foundry) but only runs Claude models -- no GPT, Gemini, or open-source model support
205
+ - **No PRD-to-deployment pipeline:** Excels at individual coding tasks but lacks orchestrated SDLC workflow
206
+ - **Semi-autonomous at best:** `/loop` and `/schedule` provide repeating task execution, but lack PRD parsing, SDLC phasing, completion council, or quality gate orchestration. These are task-level automation, not end-to-end SDLC autonomy.
207
+ - **OAuth token controversy:** Anthropic shut down OAuth token extraction in January 2026, frustrating power users
208
+
209
+ #### Target User
210
+
211
+ Professional developers and engineering teams who want an AI pair programmer for daily coding work. The most popular choice for complex codebase work.
212
+
213
+ ---
214
+
215
+ ### 4. OpenAI Codex CLI
216
+
217
+ **Website:** https://developers.openai.com/codex/cli
218
+ **GitHub:** https://github.com/openai/codex (Apache-2.0, 365 contributors)
219
+ **License:** Apache-2.0 (fully open source)
220
+
221
+ #### What It Does
222
+
223
+ Codex CLI is OpenAI's open-source coding agent that runs locally in the terminal. Built in Rust for performance, it provides an interactive coding experience with strong safety controls.
224
+
225
+ Core capabilities:
226
+ - Interactive terminal UI with real-time action review
227
+ - Non-interactive `exec` mode for scripting and automation
228
+ - Conversation resumption (persistent sessions)
229
+ - Web search (enabled by default, cached results)
230
+ - Code review (dedicated `/review` preset)
231
+ - MCP support for third-party tool integration
232
+ - Image input (screenshots, design specs)
233
+ - Multi-agent support (experimental, isolated git worktrees)
234
+ - Codex Cloud for remote task execution
235
+ - GitHub Action for CI/CD integration
236
+ - Shell completions (bash, zsh, fish)
237
+ - Slash command framework for custom workflows
238
+
239
+ #### Architecture
240
+
241
+ - **Runtime:** Built in Rust for speed and efficiency
242
+ - **Models:** GPT-5.4 (default), GPT-5.3-Codex, codex-mini
243
+ - **Multi-agent:** Isolated cloud sandboxes per task, independent threads, no inter-agent messaging
244
+ - **Speed:** 240+ tokens/second (2.5x faster than Opus)
245
+ - **Token efficiency:** 3-4x fewer tokens than Claude Code for equivalent tasks
246
+ - **Safety:** Three approval modes -- Auto (default), Read-only, Full Access
247
+ - **Benchmarks:** SWE-bench Pro 56.8%, Terminal-Bench 2.0 77.3%
248
+
249
+ #### Pricing
250
+
251
+ | Access Method | Price | Notes |
252
+ |--------------|-------|-------|
253
+ | ChatGPT Go | $8/mo | Budget option, basic Codex access |
254
+ | ChatGPT Plus | $20/mo | 30-150 messages per 5-hour window |
255
+ | ChatGPT Pro | $200/mo | Highest limits |
256
+ | API (BYOK) | $1.50/$6 per 1M tokens | codex-mini-latest, 75% cache discount |
257
+ | Open Source (free) | $0 + API costs | Self-host, bring your own keys |
258
+
259
+ Open-source maintainers (1,000+ star projects) qualify for 6 months of free ChatGPT Pro access.
260
+
261
+ #### Limitations
262
+
263
+ - **Model lock-in:** Only works with OpenAI models (GPT family)
264
+ - **Multi-agent is experimental:** No inter-agent communication, isolated sandboxes only
265
+ - **Windows support is experimental:** WSL recommended
266
+ - **No autonomous SDLC pipeline:** Like Claude Code, it handles individual tasks, not end-to-end workflows
267
+ - **MCP overhead:** Each added MCP tool increases context consumption and reduces message limits
268
+ - **Less thorough than Claude Code:** Faster but produces less complete solutions on complex problems
269
+ - **No built-in code review quality gates:** Single-pass review only
270
+ - **No memory system beyond session resumption:** No episodic/semantic memory, no learning across projects
271
+
272
+ #### Target User
273
+
274
+ Developers who value open-source tooling, speed, and terminal-native workflows. Particularly strong for rapid prototyping and high-volume edits.
275
+
276
+ ---
277
+
278
+ ## Feature Comparison Matrix
279
+
280
+ | Feature | Emergence AI (Agent-E) | Rork | Claude Code CLI | Codex CLI | Loki Mode |
281
+ |---------|:---------------------:|:----:|:--------------:|:---------:|:---------:|
282
+ | **Primary Focus** | Web automation | Mobile apps | Coding assistant | Coding assistant | PRD-to-deploy |
283
+ | **Open Source** | Partial (Agent-E only) | No | Source-available | Yes (Apache-2.0) | Yes |
284
+ | **Multi-Provider** | Yes (OpenAI, Azure, Ollama) | Yes (Gemini, Claude) | Partial (Claude models via Bedrock/Vertex/Foundry) | No (GPT only) | Yes (5 providers, 3+ model families) |
285
+ | **Multi-Agent** | Yes (2-agent model) | No | Yes (coordinated teams) | Yes (experimental) | Yes (41 agent types) |
286
+ | **Autonomous Iteration** | No (task-level) | No | Partial (/loop, /schedule) | No (requires prompting) | Yes (RARV loop + completion council) |
287
+ | **SDLC Pipeline** | No | No | No | No | Yes (9 phases) |
288
+ | **Code Review** | No | No | Yes (single-pass) | Yes (single-pass) | Yes (3-reviewer blind) |
289
+ | **Quality Gates** | No | No | No | No | Yes (10 gates) |
290
+ | **Anti-Sycophancy** | No | No | No | No | Yes (devil's advocate) |
291
+ | **Memory System** | Enterprise only | No | CLAUDE.md + auto-memory | Session resumption | Episodic/semantic/procedural |
292
+ | **Self-Hosted** | Partial (Agent-E) | No | Partial (CLI local, but subscription or API required) | Yes (with API key) | Yes (fully, any provider API key) |
293
+ | **CI/CD Integration** | No | No | Yes (GH Actions, GitLab) | Yes (GH Action) | Yes (built-in) |
294
+ | **Complexity Detection** | No | No | No | No | Yes (auto-tier) |
295
+ | **Budget Controls** | No | Credit system | Rate limits | Rate limits | Yes (circuit breaker) |
296
+ | **Legacy System Healing** | No | No | No | No | Yes |
297
+ | **Benchmark (SWE-bench)** | N/A | N/A | 80.9% (Opus 4.5) | 56.8% (SWE-bench Pro, not directly comparable) | Configurable (uses any model) |
298
+ | **CLI Interface** | No (Python API) | No (web UI) | Yes | Yes | Yes |
299
+ | **IDE Integration** | No | No | Yes (VS Code, JetBrains) | No | Yes (VS Code extension) |
300
+ | **MCP Support** | No | No | Yes | Yes | Yes (15 tools) |
301
+ | **Cost (Heavy Use)** | Enterprise contract | $100/mo | $200/mo or API | $20-200/mo or API | $0 + API costs |
302
+ | **Context Window** | Model-dependent | N/A | 200K tokens | Model-dependent | Model-dependent |
303
+
304
+ ---
305
+
306
+ ## Market Map
307
+
308
+ ```
309
+   AUTONOMOUS <----------------------------> ASSISTED
310
+                  |                                       |
311
+ E Loki Mode      |                                       |
312
+ N (PRD-to-deploy,|                                       |
313
+ T multi-provider,|                                       |
314
+ E self-hosted)   |                                       |
315
+ R                |                                       |
316
+ P Emergence AI   |                                       |
317
+ R Platform       |                                       |
318
+ I (enterprise    |                                       |
319
+ S workflow       |                                       |
320
+ E automation)    |                                       |
321
+ -----------------+---------------------------------------+
322
+ P                |  Claude Code CLI                      |
323
+ R                |  (complex codebase                    |
324
+ O                |  editing, reviews)                    |
325
+ S                |                                       |
326
+ U                |  Codex CLI                            |
327
+ M                |  (fast prototyping,                   |
328
+ E                |  open source)                         |
329
+ R                |                                       |
330
+                  |  Rork                                 |
331
+ -----------------+-------(mobile app------gen)-----------+
332
+                  |                                       |
333
+    AUTONOMOUS                    ASSISTED
334
+ ```
335
+
336
+ ### Segment Breakdown
337
+
338
+ | Segment | Players | Target User | Price Range |
339
+ |---------|---------|-------------|-------------|
340
+ | **Autonomous SDLC** | Loki Mode | Engineering teams, solo developers, startups | $0 + API costs |
341
+ | **Enterprise Automation** | Emergence AI Platform | Enterprise data/ops teams | $$$$ (custom) |
342
+ | **Developer Coding Assistant** | Claude Code CLI, Codex CLI | Professional developers | $20-200/mo |
343
+ | **No-Code App Builder** | Rork | Non-technical founders, PMs | $0-100/mo |
344
+ | **Web Automation** | Agent-E (open source) | Automation engineers | $0 + API costs |
345
+
346
+ ### Who Targets Whom
347
+
348
+ - **Claude Code CLI** and **Codex CLI** compete head-to-head for professional developers. Claude Code leads on reasoning depth; Codex leads on speed and open-source ethos.
349
+ - **Rork** targets a completely different audience (non-coders wanting mobile apps) and does not compete with developer tools.
350
+ - **Emergence AI** targets enterprise workflow automation, not software development.
351
+ - **Loki Mode** sits in an underserved category -- autonomous end-to-end software development -- that none of the others fully address.
352
+
353
+ ---
354
+
355
+ ## Gaps and Opportunities
356
+
357
+ ### Gap 1: No Competitor Offers End-to-End Autonomous SDLC
358
+
359
+ Claude Code and Codex CLI are powerful coding assistants, but they operate at the task level. Claude Code's `/loop` and `/schedule` commands provide task-level automation (repeat a prompt, run periodic checks), and its Agent Teams can coordinate sub-agents -- but these are building blocks, not a complete SDLC pipeline. No competitor provides:
360
+ - Autonomous PRD parsing and task decomposition into phased work
361
+ - Multi-phase SDLC execution (plan, implement, test, review, deploy) with phase gates
362
+ - Continuous iteration loops with RARV cycle and model tier selection
363
+ - Completion detection via council voting (not just "did the task finish")
364
+ - Budget circuit breakers, stagnation detection, and complexity auto-tiering
365
+
366
+ **Opportunity:** Loki Mode is the only tool that takes a PRD and autonomously produces a deployed, tested, reviewed product. Claude Code could theoretically be scripted to do this externally, but Loki Mode has it built in as a first-class capability. This is a category-defining advantage, though the window may narrow as Claude Code's Agent SDK matures.
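To make "budget circuit breakers" and "stagnation detection" concrete, here is a minimal sketch of how such a guard could wrap an iteration loop. It is not Loki Mode's implementation: `BudgetCircuitBreaker`, `run_loop`, and both thresholds are hypothetical, and each iteration is reduced to a `(cost_usd, made_progress)` pair.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass
class BudgetCircuitBreaker:
    """Stops an autonomous loop on overspend or repeated lack of progress."""
    max_spend_usd: float
    max_stalled_iterations: int
    spent_usd: float = 0.0
    stalled: int = 0

    def record(self, cost_usd: float, made_progress: bool) -> None:
        """Account for one finished iteration."""
        self.spent_usd += cost_usd
        # Any progress resets the stagnation counter.
        self.stalled = 0 if made_progress else self.stalled + 1

    def trip_reason(self) -> Optional[str]:
        """Return why the loop must stop, or None if it may continue."""
        if self.spent_usd >= self.max_spend_usd:
            return "budget exhausted"
        if self.stalled >= self.max_stalled_iterations:
            return "stagnation detected"
        return None


def run_loop(iterations: Iterable[Tuple[float, bool]],
             breaker: BudgetCircuitBreaker) -> Tuple[int, str]:
    """Run iterations until they end or the breaker trips.

    Returns (iterations completed, reason for stopping).
    """
    completed = 0
    for cost_usd, made_progress in iterations:
        reason = breaker.trip_reason()
        if reason is not None:
            return completed, reason
        breaker.record(cost_usd, made_progress)
        completed += 1
    return completed, breaker.trip_reason() or "finished"


print(run_loop([(0.5, True)] * 10, BudgetCircuitBreaker(2.0, 3)))
```

The breaker is checked before each iteration starts, so the loop never begins work it cannot pay for. A completion council, as described in the list above, would be a separate check on whether the work is actually done.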
367
+
368
+ ### Gap 2: No True Multi-Model-Family Flexibility
369
+
370
+ The two coding assistants are each locked to a single model family (even if hostable on multiple clouds), and the other platforms offer limited or no user-selectable model choice:
371
+ - Claude Code: Claude models only (hostable on Bedrock, Vertex AI, Foundry -- but still Claude)
372
+ - Codex CLI: OpenAI models only
373
+ - Emergence AI: Primarily OpenAI, some alternatives via LiteLLM
374
+ - Rork: Internal (Gemini + Claude, not user-selectable)
375
+
376
+ **Nuance:** Claude Code's third-party provider support (Bedrock, Vertex AI) gives enterprises deployment flexibility, but not model diversity. You cannot run GPT or Gemini models through Claude Code.
377
+
378
+ **Opportunity:** Loki Mode supports 5 providers spanning 3+ model families (Claude, GPT, Gemini) with automatic degraded-mode handling. This enables cost optimization (use cheaper models for simple tasks), redundancy (failover between providers), and leveraging provider-specific strengths (Claude for reasoning, GPT for speed).
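The redundancy argument (failover between providers) can be sketched in a few lines. This is an illustration only, not Loki Mode's degraded-mode logic: `complete_with_failover` and the two stub providers are hypothetical, and real provider clients would replace the stubs.

```python
from typing import Callable, List, Tuple

# A provider is a (name, callable) pair; the callable maps a prompt to a reply.
Provider = Tuple[str, Callable[[str], str]]


def complete_with_failover(prompt: str, providers: List[Provider]) -> Tuple[str, str]:
    """Try providers in priority order and return (provider name, reply)."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # broad on purpose: any failure moves to the next provider
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


def flaky_primary(prompt: str) -> str:
    """Stub provider that always fails, standing in for an outage or rate limit."""
    raise TimeoutError("rate limited")


def steady_fallback(prompt: str) -> str:
    """Stub provider that always succeeds."""
    return f"plan for: {prompt}"


providers = [("primary", flaky_primary), ("fallback", steady_fallback)]
print(complete_with_failover("build a CLI", providers))
```

Ordering the list by price instead of reliability gives the cost-optimization variant: try the cheapest model first and escalate only when it fails.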
379
+
380
+ ### Gap 3: No Structured Quality Assurance Pipeline
381
+
382
+ Claude Code and Codex CLI offer single-pass code review. Neither provides:
383
+ - 3-reviewer blind parallel review
384
+ - Anti-sycophancy checks (devil's advocate on unanimous approval)
385
+ - Severity-based blocking gates
386
+ - Test coverage enforcement (>80% unit-test coverage, 100% test pass rate)
387
+ - Static analysis integration (CodeQL, ESLint)
388
+
389
+ **Opportunity:** Loki Mode's 10-gate quality system provides enterprise-grade assurance that no competitor matches.
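A blind multi-reviewer gate with a devil's-advocate pass can be sketched as follows. This is an assumption-laden illustration, not Loki Mode's gate logic: the `Finding` and `Verdict` types, the severity names, and the pass rule (no blocking findings) are all hypothetical, and the reviewers run sequentially here rather than in parallel.

```python
from dataclasses import dataclass
from typing import Callable, List

# Assumption: findings at these severities block the gate.
BLOCKING_SEVERITIES = {"critical", "high"}


@dataclass
class Finding:
    severity: str
    message: str


@dataclass
class Verdict:
    approve: bool
    findings: List[Finding]


Reviewer = Callable[[str], Verdict]


def review_gate(diff: str, reviewers: List[Reviewer], devils_advocate: Reviewer) -> bool:
    """Return True if the change may proceed past the review gate."""
    # Blind review: each reviewer sees only the diff, never another verdict.
    verdicts = [reviewer(diff) for reviewer in reviewers]
    if all(v.approve for v in verdicts):
        # Anti-sycophancy: unanimous approval triggers an adversarial extra pass.
        verdicts.append(devils_advocate(diff))
    blocking = [f for v in verdicts for f in v.findings
                if f.severity in BLOCKING_SEVERITIES]
    return not blocking


def approve_all(diff: str) -> Verdict:
    """Stub reviewer that approves everything (the failure mode being guarded against)."""
    return Verdict(True, [])


def nitpicker(diff: str) -> Verdict:
    """Stub reviewer that approves but always leaves a low-severity note."""
    return Verdict(True, [Finding("low", "naming could be clearer")])


def skeptic(diff: str) -> Verdict:
    """Stub devil's advocate that flags use of eval as high severity."""
    if "eval(" in diff:
        return Verdict(False, [Finding("high", "eval on untrusted input")])
    return Verdict(True, [])


print(review_gate("result = eval(user_input)", [approve_all] * 3, skeptic))
```

In this sketch three rubber-stamp approvals are not enough to pass a risky change, because unanimity itself is what summons the adversarial reviewer. Low-severity notes are recorded but do not block.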
390
+
391
+ ### Gap 4: No Persistent Cross-Project Learning
392
+
393
+ - Claude Code has CLAUDE.md and auto-memory (persists across sessions, but scoped to a single project)
394
+ - Codex CLI has session resumption only
395
+ - Neither has episodic-to-semantic memory consolidation
396
+ - Neither learns patterns across projects
397
+
398
+ **Opportunity:** Loki Mode's memory system (episodic, semantic, procedural) with consolidation pipeline and vector search enables compound learning. Solutions discovered in one project are available in the next.
399
+
400
+ ### Gap 5: No Legacy System Healing
401
+
402
+ No competitor addresses the challenge of modernizing legacy codebases:
403
+ - No friction-as-semantics analysis
404
+ - No failure-first learning
405
+ - No institutional knowledge preservation
406
+ - No incremental healing phases
407
+
408
+ **Opportunity:** Loki Mode's healing capability (inspired by Amazon AGI Lab research) is unique in the market.
409
+
410
+ ### Gap 6: Self-Hosted Operation Without Vendor Lock-in
411
+
412
+ - Claude Code requires Anthropic subscription or API
413
+ - Codex CLI requires OpenAI subscription or API
414
+ - Rork is fully cloud-hosted (no self-hosting)
415
+ - Emergence Platform requires enterprise contract
416
+
417
+ **Opportunity:** Loki Mode is fully self-hosted, works with any supported provider's API key, and has no cloud dependency. This matters for regulated industries, air-gapped environments, and cost-conscious teams.
418
+
419
+ ### Gap 7: No Mobile-to-Enterprise Bridging
420
+
421
+ Rork generates mobile apps but cannot handle backends, APIs, or infrastructure. Claude Code and Codex CLI cannot generate mobile apps end-to-end from a description. No tool bridges the full stack autonomously.
422
+
423
+ **Opportunity:** Loki Mode's PRD templates (13 types including SaaS, CLI, Discord bot) could expand to include mobile app generation, combining Rork's accessibility with developer-grade quality gates.
424
+
425
+ ---
426
+
427
+ ## Differentiation Strategy for Loki Mode
428
+
429
+ ### Primary Positioning
430
+
431
+ **"The only autonomous system that takes a PRD to a deployed, tested, reviewed product -- using any AI provider you choose."**
432
+
433
+ This positioning highlights three unique capabilities no competitor offers together:
434
+ 1. **Autonomous SDLC** (not just coding assistance)
435
+ 2. **Multi-provider** (not locked to one vendor)
436
+ 3. **Quality-assured** (10-gate system, 3-reviewer blind review)
437
+
438
+ ### Differentiation by Competitor
439
+
440
+ #### vs. Claude Code CLI
441
+ | Dimension | Claude Code | Loki Mode |
442
+ |-----------|-------------|-----------|
443
+ | Autonomy | Semi-autonomous (/loop, /schedule, but no SDLC orchestration) | Fully autonomous (RARV loop + completion council) |
444
+ | Scope | Individual coding tasks, PR reviews | Full SDLC pipeline (9 phases) |
445
+ | Providers | Claude models only (multi-cloud hosting) | 5 providers, 3+ model families |
446
+ | Quality | Single-pass review, GitHub Action | 10-gate, 3-reviewer blind system, anti-sycophancy |
447
+ | Memory | CLAUDE.md + auto-memory (per-project) | Episodic/semantic/procedural (cross-project) |
448
+ | Cost model | Subscription with rate limits or API | Self-hosted, pay-per-token, any provider |
449
+ | IDE/surface | Terminal, VS Code, JetBrains, Desktop, Web | Terminal, VS Code (via extension) |
450
+ | **Loki Mode advantage:** | End-to-end SDLC autonomy, multi-model flexibility, structured quality |
451
+ | **Claude Code advantage:** | Broader surface coverage (IDE, desktop, web, mobile), stronger single-task capabilities |
452
+
453
+ #### vs. Codex CLI
454
+ | Dimension | Codex CLI | Loki Mode |
455
+ |-----------|-----------|-----------|
456
+ | Autonomy | Assisted (human prompts each task) | Fully autonomous |
457
+ | Open source | Yes (Apache-2.0) | Yes |
458
+ | Speed | 240+ tokens/sec | Depends on provider |
459
+ | Providers | OpenAI only | 5 providers |
460
+ | Multi-agent | Experimental (isolated) | 41 agent types, 8 swarms |
461
+ | Quality | Single-pass review | 10-gate system |
462
+ | **Loki Mode advantage:** | Autonomous pipeline, multi-provider, mature multi-agent |
463
+
464
+ #### vs. Emergence AI
465
+ | Dimension | Emergence AI | Loki Mode |
466
+ |-----------|-------------|-----------|
467
+ | Focus | Web/workflow automation | Software development |
468
+ | Pricing | Enterprise contracts | Free + API costs |
469
+ | Self-hosted | VPC option | Fully self-hosted |
470
+ | Open source | Partial | Yes |
471
+ | **Loki Mode advantage:** | Purpose-built for software, open source, accessible pricing |
472
+
473
+ #### vs. Rork
474
+ | Dimension | Rork | Loki Mode |
475
+ |-----------|------|-----------|
476
+ | Focus | Mobile apps (no-code) | Full-stack software |
477
+ | Target user | Non-technical | Developers + technical teams |
478
+ | Quality | No testing/review | 10-gate quality system |
479
+ | Output | Mobile app only | Any software type |
480
+ | **Loki Mode advantage:** | Developer-grade, full-stack, quality-assured |
481
+
482
+ ### Messaging Framework
483
+
484
+ **For developers currently using Claude Code or Codex CLI:**
485
+ "You already use AI for coding. Loki Mode makes it autonomous -- give it a PRD, and it handles planning, implementation, testing, code review, and deployment. Keep using Claude or Codex under the hood."
486
+
487
+ **For engineering leaders evaluating AI tooling:**
488
+ "Loki Mode is the only open-source system with enterprise-grade quality gates (10 gates, 3-reviewer blind review, anti-sycophancy checks) that runs autonomously on any AI provider. Self-hosted, no vendor lock-in."
489
+
490
+ **For startups and solo developers:**
491
+ "Go from idea to deployed product overnight. Write a PRD, invoke Loki Mode, and let it build, test, and deploy while you sleep. Works with your existing Claude or OpenAI API key."
492
+
493
+ ---
494
+
495
+ ## Strategic Recommendations
496
+
497
+ ### 1. Position Against Claude Code and Codex CLI as the "Orchestration Layer"
498
+
499
+ Rather than competing as another coding assistant, position Loki Mode as the layer above Claude Code and Codex CLI. Frame it as: "Claude Code and Codex are excellent coding agents. Loki Mode orchestrates them into an autonomous development pipeline."
500
+
501
+ This avoids direct feature-by-feature competition on model quality (where Anthropic and OpenAI, as the model vendors, hold the advantage) and instead competes on workflow orchestration (where Loki Mode is strongest).
502
+
503
+ ### 2. Publish Autonomous Benchmark Results
504
+
505
+ Claude Code publishes SWE-bench scores. Codex CLI publishes Terminal-Bench scores. Loki Mode should publish "PRD-to-Deploy" benchmark results:
506
+ - Time from PRD to working deployed product
507
+ - Lines of code generated per PRD
508
+ - Test coverage achieved autonomously
509
+ - Code review gate pass rates
510
+ - Comparison of multi-provider results (same PRD, different providers)
511
+
512
+ This creates a new benchmark category that competitors cannot match because they do not offer end-to-end SDLC.
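The metrics listed above could be captured in a small record type so that runs of the same PRD on different providers are directly comparable. The following is a minimal sketch only: `PrdToDeployResult`, `compare_providers`, every field name, and the sample values are hypothetical placeholders, not an existing Loki Mode schema or real measurements.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PrdToDeployResult:
    """One benchmark run: a single PRD built end to end by one provider.

    All field names are illustrative assumptions, not a Loki Mode schema.
    """
    prd_name: str
    provider: str
    seconds_to_deploy: float   # time from PRD to working deployed product
    lines_of_code: int         # lines of code generated for this PRD
    test_coverage_pct: float   # test coverage achieved autonomously
    gates_passed: int          # code review gates passed
    gates_total: int           # code review gates attempted

    @property
    def gate_pass_rate(self) -> float:
        """Fraction of gates passed; 0.0 when no gates were run."""
        if self.gates_total == 0:
            return 0.0
        return self.gates_passed / self.gates_total


def compare_providers(results):
    """Group runs by PRD so the same PRD can be compared across providers."""
    by_prd = {}
    for r in results:
        by_prd.setdefault(r.prd_name, []).append(r)
    # Within each PRD, list the fastest run first.
    return {
        prd: sorted(runs, key=lambda r: r.seconds_to_deploy)
        for prd, runs in by_prd.items()
    }


# Placeholder values for illustration only, not benchmark data.
runs = [
    PrdToDeployResult("todo-app", "provider-a", 5400.0, 3200, 82.5, 9, 10),
    PrdToDeployResult("todo-app", "provider-b", 4800.0, 2900, 78.0, 10, 10),
]
ranked = compare_providers(runs)
```

Keeping the PRD name on every record is what makes the last metric (same PRD, different providers) a simple grouping step rather than a separate report.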
513
+
514
+ ### 3. Build a "Bring Your Own Agent" Ecosystem
515
+
516
+ Loki Mode already supports 5 providers. Extend this to support any CLI agent as a provider through a plugin interface. This turns every new coding agent (Aider, Cline, future entrants) into a Loki Mode capability rather than a competitor.
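One way such a plugin interface might look is sketched below. This is an assumption-laden illustration, not Loki Mode's actual provider API: `AgentProvider`, `CliAgentProvider`, `AgentResult`, `REGISTRY`, and `register` are hypothetical names, and the "agent" is a stand-in Python one-liner rather than a real CLI such as Aider or Cline, whose flags are deliberately not assumed here.

```python
import subprocess
import sys
import tempfile
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class AgentResult:
    exit_code: int
    output: str


class AgentProvider(Protocol):
    """Hypothetical plugin contract: anything with a name and a run() method."""
    name: str

    def run(self, prompt: str, workdir: str) -> AgentResult: ...


@dataclass
class CliAgentProvider:
    """Adapts an arbitrary CLI agent by appending the prompt as the last argument."""
    name: str
    argv: list

    def run(self, prompt: str, workdir: str) -> AgentResult:
        proc = subprocess.run(
            self.argv + [prompt], cwd=workdir, capture_output=True, text=True
        )
        return AgentResult(proc.returncode, proc.stdout)


REGISTRY = {}


def register(provider: AgentProvider) -> None:
    """Make a provider available to the orchestrator under its name."""
    REGISTRY[provider.name] = provider


# Stand-in "agent": a Python one-liner that upper-cases its prompt.
register(
    CliAgentProvider(
        "echo-agent",
        [sys.executable, "-c", "import sys; print(sys.argv[1].upper())"],
    )
)

with tempfile.TemporaryDirectory() as workdir:
    result = REGISTRY["echo-agent"].run("add a login page", workdir)
```

Because the contract is structural (a `Protocol`), a new agent only needs a `name` and a `run` method to be registered; a real adapter would also have to handle each CLI's own argument format, authentication, and timeouts.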
517
+
518
+ ### 4. Target the "AI-Native Startup" Segment
519
+
520
+ Solo founders and small teams using AI to build products overnight represent a growing segment that no competitor specifically targets:
521
+ - Claude Code is for professional developers doing daily work
522
+ - Codex CLI is for terminal-native developers
523
+ - Rork is for non-technical mobile app builders
524
+ - Emergence AI is for enterprise operations
525
+
526
+ Loki Mode can own the "AI-native startup" segment with PRD templates, automated deployment, and cost-efficient multi-provider operation.
527
+
528
+ ### 5. Strengthen the Mobile/Frontend Story
529
+
530
+ Rork demonstrates demand for AI-generated mobile apps, but with serious quality problems (2.9/5 Trustpilot). Loki Mode could add a mobile PRD template that generates React Native/Expo apps through the same quality-gated pipeline, offering Rork's convenience with developer-grade reliability.
531
+
532
+ ### 6. Enterprise Self-Hosted Narrative
533
+
534
+ Emergence AI charges enterprise contract rates for VPC deployment. Loki Mode is already fully self-hosted and free. For regulated industries (finance, healthcare, government) that cannot use cloud-hosted AI tools, Loki Mode with bring-your-own-key (BYOK) API access is compelling:
535
+ - Nothing leaves your infrastructure except the API calls to the provider you choose
536
+ - Audit trail via `.loki/` state files
537
+ - Configurable security controls (LOKI_SANDBOX_MODE, LOKI_BLOCKED_COMMANDS)
538
+
539
+ ### 7. Watch List -- Emerging Threats
540
+
541
+ | Threat | Severity | Timeline | Mitigation |
542
+ |--------|----------|----------|------------|
543
+ | Claude Code Agent SDK enables custom SDLC pipelines | HIGH | 3-6 months | Deepen multi-provider advantage (Agent SDK is Claude-only); publish PRD-to-Deploy benchmarks that prove the integrated pipeline |
544
+ | Claude Code adds autonomous mode natively | HIGH | 6-12 months | Quality gates, memory system, and multi-provider flexibility are structural advantages that cannot be replicated by adding a single feature |
545
+ | Codex CLI adds orchestration layer | MEDIUM | 6-12 months | Codex is OpenAI-only; emphasize 41 agent types, cross-project memory, healing |
546
+ | New entrant builds "Rork for full-stack" | MEDIUM | 6-12 months | Add mobile PRD template; Loki Mode's quality gates differentiate from naive generation |
547
+ | Open-source Auto-Claude gains traction | LOW | Ongoing | Already adopted key patterns (v3.4.0); maintain feature lead |
548
+ | Enterprise CI/CD platforms (GitHub, GitLab) add native AI SDLC | HIGH | 12-18 months | Self-hosted, provider-agnostic positioning; these will be vendor-locked |
549
+
550
+ ### 8. Key Risk: Claude Code Agent SDK
551
+
552
+ The most significant near-term competitive threat is Anthropic's Agent SDK (https://platform.claude.com/docs/en/agent-sdk/overview). It allows developers to build custom agents with Claude Code's tools and capabilities, with full control over orchestration, tool access, and permissions. This means:
553
+
554
+ - A motivated team could build a Loki-Mode-like SDLC pipeline on top of Claude Code's Agent SDK
555
+ - It would have native access to Claude Code's broader surface area (VS Code, JetBrains, Desktop, Web)
556
+ - It would benefit from Anthropic's ongoing model improvements
557
+
558
+ **However, Loki Mode's structural advantages remain:**
559
+ 1. **Multi-provider:** Agent SDK is Claude-only. Loki Mode works with any provider.
560
+ 2. **Battle-tested pipeline:** 10 quality gates, completion council, healing, memory -- these took months to build and validate. A new Agent SDK project starts from zero.
561
+ 3. **Open source and self-hosted:** No dependency on Anthropic's platform decisions.
562
+ 4. **Research foundation:** Built on patterns from OpenAI, DeepMind, Anthropic, and academic research. It goes beyond engineering to apply AI safety research (Constitutional AI, anti-sycophancy, alignment faking detection).
563
+
564
+ ---
565
+
566
+ ## Sources
567
+
568
+ ### Emergence AI
569
+ - [Emergence AI Platform](https://www.emergence.ai/platform)
570
+ - [Agent-E GitHub Repository](https://github.com/EmergenceAI/Agent-E)
571
+ - [Emergence AI SOTA Results on WebVoyager Benchmark](https://www.emergence.ai/blog/agent-e-sota)
572
+ - [Emergence AI Multi-Agent Orchestrator](https://www.emergence.ai/blog/introducing-the-emergence-multi-agent-orchestrator)
573
+
574
+ ### Rork
575
+ - [Rork Official Site](https://rork.com)
576
+ - [Rork AI Review 2026 -- Rapid Dev](https://www.rapidevelopers.com/blog/rork-ai-review)
577
+ - [Rork AI Review 2026: Pricing, Features & Honest Verdict -- No Code MBA](https://www.nocode.mba/articles/rork-ai-review-2026)
578
+ - [Rork Review -- Medium](https://medium.com/@e2larsen/rork-com-review-can-this-no-code-platform-really-build-your-mobile-app-d17f32bd2870)
579
+ - [Rork on Product Hunt](https://www.producthunt.com/products/rork-app-for-ios)
580
+
581
+ ### Claude Code CLI
582
+ - [Claude Code Overview](https://code.claude.com/docs/en/overview)
583
+ - [Claude Code GitHub](https://github.com/anthropics/claude-code)
584
+ - [Claude Code on Amazon Bedrock](https://code.claude.com/docs/en/amazon-bedrock)
585
+ - [Claude Code Scheduled Tasks](https://code.claude.com/docs/en/scheduled-tasks)
586
+ - [Claude Code Agent SDK](https://platform.claude.com/docs/en/agent-sdk/overview)
587
+ - [Claude Code Pricing -- ClaudeLog](https://claudelog.com/claude-code-pricing/)
588
+ - [Claude Code Rate Limits -- The Register](https://www.theregister.com/2026/01/05/claude_devs_usage_limits/)
589
+ - [Claude Code vs Codex -- Builder.io](https://www.builder.io/blog/codex-vs-claude-code)
590
+ - [Claude Pricing Plans](https://claude.com/pricing)
591
+ - [Claude Code Loops and Scheduling -- Medium](https://medium.com/@richardhightower/put-claude-on-autopilot-scheduled-tasks-with-loop-and-schedule-built-in-skills-43f3be5ac1ec)
592
+
593
+ ### Codex CLI
594
+ - [Codex CLI Documentation](https://developers.openai.com/codex/cli)
595
+ - [Codex CLI Features](https://developers.openai.com/codex/cli/features)
596
+ - [Codex CLI GitHub](https://github.com/openai/codex)
597
+ - [Codex Pricing](https://developers.openai.com/codex/pricing)
598
+ - [Codex for Open Source](https://developers.openai.com/codex/open-source)
599
+
600
+ ### Comparative Analysis
601
+ - [Codex vs Claude Code Benchmarks -- MorphLLM](https://www.morphllm.com/comparisons/codex-vs-claude-code)
602
+ - [State of AI Coding Agents 2026 -- Medium](https://medium.com/@dave-patten/the-state-of-ai-coding-agents-2026-from-pair-programming-to-autonomous-ai-teams-b11f2b39232a)
603
+ - [Best AI Coding Agents 2026 -- Codegen](https://codegen.com/blog/best-ai-coding-agents/)
604
+ - [15 AI Coding Agents Tested -- MorphLLM](https://www.morphllm.com/ai-coding-agent)
605
+ - [Agentic CLI Tools Compared -- AiMultiple](https://aimultiple.com/agentic-cli)