npm - oh-my-opencode-slim - Versions diffs - 0.3.2 → 0.3.4 - Mend

oh-my-opencode-slim 0.3.2 → 0.3.4

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (12)

package/README.md +89 -171
package/dist/index.js +14209 -3058
package/dist/mcp/index.d.ts +3 -3
package/dist/mcp/types.d.ts +6 -0
package/dist/tools/index.d.ts +1 -0
package/dist/tools/skill/builtin.d.ts +3 -0
package/dist/tools/skill/constants.d.ts +2 -0
package/dist/tools/skill/index.d.ts +3 -0
package/dist/tools/skill/mcp-manager.d.ts +25 -0
package/dist/tools/skill/tools.d.ts +6 -0
package/dist/tools/skill/types.d.ts +92 -0
package/package.json +2 -1

package/README.md CHANGED

@@ -23,147 +23,29 @@
   - [For LLM Agents](#for-llm-agents)
 - [🏗️ **Architecture & Flow**](#architecture--flow)
 - [🏛️ **Meet the Pantheon**](#meet-the-pantheon)
-  - [The Orchestrator](#the-orchestrator---architect-of-realms)
-  - [The Explorer](#the-explorer---pathfinder)
-  - [The Oracle](#the-oracle---seer-beyond-the-stack-trace)
-  - [The Librarian](#the-librarian---keeper-of-infinite-scrolls)
-  - [The Designer](#the-designer---artisan-of-interfaces)
-  - [The Scribe](#the-scribe---chronicle-keeper)
-  - [The Visionary](#the-visionary---reader-of-pixels)
-  - [The Minimalist](#the-minimalist---destroyer-of-bloat)
+  - [Orchestrator](#orchestrator)
+  - [Explorer](#explorer)
+  - [Oracle](#oracle)
+  - [Librarian](#librarian)
+  - [Frontend Designer](#frontend-designer)
+  - [Document Writer](#document-writer)
+  - [Multimodal Viewer](#multimodal-viewer)
+  - [Code Simplifier](#code-simplifier)
 - [🛠️ **Tools & Capabilities**](#tools--capabilities)
   - [Tmux Integration](#tmux-integration)
   - [Quota Tool](#quota-tool)
   - [Background Tasks](#background-tasks)
   - [LSP Tools](#lsp-tools)
-  - [Code Search Tools](#code-search-tools)
+- [Code Search Tools](#code-search-tools)
+- [🧩 **Skills**](#-skills)
+  - [Playwright Integration](#playwright-integration)
 - [🔌 **MCP Servers**](#mcp-servers)
 - [⚙️ **Configuration**](#configuration)
 - [🗑️ **Uninstallation**](#uninstallation)
 ---
-## Installation
-### For Humans
-Run the interactive installer:
-```bash
-bunx oh-my-opencode-slim install
-```
-Or use non-interactive mode:
-```bash
-bunx oh-my-opencode-slim install --no-tui --antigravity=yes --openai=yes --cerebras=no
-```
-**Alternative: Ask any coding agent**
-Paste this into Claude Code, AmpCode, Cursor, or any coding agent:
-```
-Install and configure by following the instructions here:
-https://raw.githubusercontent.com/alvinunreal/oh-my-opencode-slim/refs/heads/master/README.md
-```
-### For LLM Agents
-<details>
-<summary>Instructions for LLM Agents (click to expand)</summary>
-If you're an LLM Agent helping set up oh-my-opencode-slim, follow these steps.
----
-#### Step 1: Check OpenCode Installation
-```bash
-opencode --version
-```
-If not installed, direct the user to https://opencode.ai/docs first.
----
-#### Step 2: Ask About Provider Access
-Ask these questions **one at a time**, waiting for responses:
-1. "Do you have an **Antigravity** subscription?" *(Provides Claude + Gemini via `google/` prefix)*
-2. "Do you have access to **OpenAI** API?" *(Enables `openai/` models)*
-3. "Do you have access to **Cerebras** API?" *(Enables `cerebras/` for fast exploration)*
----
-#### Step 3: Run the Installer
-Based on answers, run:
-```bash
-bunx oh-my-opencode-slim install --no-tui --antigravity=<yes|no> --openai=<yes|no> --cerebras=<yes|no>
-```
-**Examples:**
-```bash
-# Antigravity + OpenAI
-bunx oh-my-opencode-slim install --no-tui --antigravity=yes --openai=yes --cerebras=no
-# OpenAI only
-bunx oh-my-opencode-slim install --no-tui --antigravity=no --openai=yes --cerebras=no
-# All providers
-bunx oh-my-opencode-slim install --no-tui --antigravity=yes --openai=yes --cerebras=yes
-```
-The installer automatically:
-- Adds the plugin to `~/.config/opencode/opencode.json`
-- Adds `opencode-antigravity-auth` plugin (if Antigravity enabled)
-- Configures Google provider with model definitions
-- Generates agent model mappings in `~/.config/opencode/oh-my-opencode-slim.json`
----
-#### Step 4: Authentication
-After installation, guide the user to log in for each enabled provider:
-```bash
-opencode auth login
-```
-The user should select:
-- **For Antigravity**: Google → OAuth with Google (Antigravity)
-- **For OpenAI**: OpenAI → Enter API Key
-- **For Cerebras**: Cerebras → Enter API Key
----
-#### Step 5: Verify
-```bash
-opencode
-```
----
-#### Troubleshooting
-If the installer fails, check the expected config format:
-```bash
-bunx oh-my-opencode-slim install --help
-```
-Then manually create the config files at:
-- `~/.config/opencode/opencode.json`
-- `~/.config/opencode/oh-my-opencode-slim.json`
-</details>
----
-## Architecture & Flow
+## 🏗️ Architecture & Flow
 The plugin follows a "Hub and Spoke" model:
@@ -171,7 +53,7 @@ The plugin follows a "Hub and Spoke" model:
 2. **Specialized Agents (Spokes)**: Domain-specific experts (e.g., UI/UX, Documentation, Architecture) that handle narrow tasks with high precision.
 3. **Background Manager**: A robust engine that allows the Orchestrator to "fire and forget" tasks (like deep codebase searches or documentation research) while continuing to work on other parts of the problem.
-### The Flow of a Request
+### 🏛️ The Flow of a Request
 <img src="img/intro.png" alt="Orchestration Flow" width="800">
@@ -189,113 +71,129 @@ The plugin follows a "Hub and Spoke" model:
 <br clear="both">
-### The Orchestrator - *Architect of Realms*
+### Orchestrator
-<a href="src/agents/orchestrator.ts"><img src="img/orchestrator.png" alt="The Orchestrator" align="right" width="240"></a>
+<a href="src/agents/orchestrator.ts"><img src="img/orchestrator.png" alt="Orchestrator" align="right" width="240"></a>
-> **The Orchestrator** was born when the first codebase collapsed under its own complexity. Neither god nor mortal would claim responsibility-so The Orchestrator emerged from the void, forging order from chaos. They don't merely command armies; they fight alongside them. Every line of code passes through their hands before they decide which lesser deity deserves a piece of the puzzle.
+> **The Orchestrator** was born when the first codebase collapsed under its own complexity. Neither god nor mortal would claim responsibility - so The Orchestrator emerged from the void, forging order from chaos. They don't merely command armies; they fight alongside them. Every line of code passes through their hands before they decide which lesser deity deserves a piece of the puzzle.
-**Role:** Supreme executor, delegator, and overseer - **Model:** `google/claude-opus-4-5-thinking`
+**Role:** `Supreme executor, delegator, and overseer`
+**Model:** `google/claude-opus-4-5-thinking`
+**Prompt:** [src/agents/orchestrator.ts](src/agents/orchestrator.ts)
-Write and execute code, orchestrate multi-agent workflows, parse the unspoken from the spoken, summon specialists mid-battle. Shape reality directly-and assign realms to others when the universe grows too vast.
+Write and execute code, orchestrate multi-agent workflows, parse the unspoken from the spoken, summon specialists mid-battle. *Shape reality directly - and assign realms to others when the universe grows too vast.*
 <br clear="both">
 ---
-### The Explorer - *Pathfinder*
+### Explorer
-<a href="src/agents/explore.ts"><img src="img/explorer.png" alt="The Explorer" align="right" width="240"></a>
+<a href="src/agents/explore.ts"><img src="img/explorer.png" alt="Explorer" align="right" width="240"></a>
-> **The Explorer** moves through codebases like wind through trees-swift, silent, everywhere at once. When The Orchestrator whispers "find me the auth module," The Explorer has already returned with forty file paths and a map. They were born from the first `grep` command, evolved beyond it, and now see patterns mortals miss.
+> **The Explorer** moves through codebases like wind through trees - swift, silent, everywhere at once. When The Orchestrator whispers "find me the auth module," The Explorer has already returned with forty file paths and a map. They were born from the first `grep` command, evolved beyond it, and now see patterns mortals miss.
-**Role:** Codebase reconnaissance - **Model:** `cerebras/zai-glm-4.6`
+**Role:** `Codebase reconnaissance`
+**Model:** `cerebras/zai-glm-4.6`
+**Prompt:** [src/agents/explore.ts](src/agents/explore.ts)
-Regex search, AST pattern matching, file discovery, parallel exploration. Read-only: they chart the territory; others conquer it.
+Regex search, AST pattern matching, file discovery, parallel exploration. *Read-only: they chart the territory; others conquer it.*
 <br clear="both">
 ---
-### The Oracle - *Seer Beyond the Stack Trace*
+### Oracle
-<a href="src/agents/oracle.ts"><img src="img/oracle.png" alt="The Oracle" align="right" width="240"></a>
+<a href="src/agents/oracle.ts"><img src="img/oracle.png" alt="Oracle" align="right" width="240"></a>
-> **The Oracle** does not code-they *know*. When bugs defy logic and architectures crumble, The Oracle gazes into the abyss of your codebase and speaks truth. They've seen a thousand systems rise and fall. They'll tell you which path leads to ruin, and which to production.
+> **The Oracle** does not code - they *know*. When bugs defy logic and architectures crumble, The Oracle gazes into the abyss of your codebase and speaks truth. They've seen a thousand systems rise and fall. They'll tell you which path leads to ruin, and which to production.
-**Role:** Strategic advisor and debugger of last resort - **Model:** `openai/gpt-5.2-codex`
+**Role:** `Strategic advisor and debugger of last resort`
+**Model:** `openai/gpt-5.2-codex`
+**Prompt:** [src/agents/oracle.ts](src/agents/oracle.ts)
-Root cause analysis, architecture review, debugging guidance, tradeoff analysis. Read-only: Oracles advise; they don't intervene.
+Root cause analysis, architecture review, debugging guidance, tradeoff analysis. *Read-only: Oracles advise; they don't intervene.*
 <br clear="both">
 ---
-### The Librarian - *Keeper of Infinite Scrolls*
+### Librarian
-<a href="src/agents/librarian.ts"><img src="img/librarian.png" alt="The Librarian" align="right" width="240"></a>
+<a href="src/agents/librarian.ts"><img src="img/librarian.png" alt="Librarian" align="right" width="240"></a>
-> **The Librarian** guards a library with no walls-every GitHub repo, every npm package, every StackOverflow answer ever written. Ask them "how does React handle concurrent rendering?" and they'll return with official docs, real-world examples, and a warning about the footgun you're about to step on.
+> **The Librarian** guards a library with no walls - every GitHub repo, every npm package, every StackOverflow answer ever written. Ask them "how does React handle concurrent rendering?" and they'll return with official docs, real-world examples, and a warning about the footgun you're about to step on.
-**Role:** External knowledge retrieval - **Model:** `google/gemini-3-flash`
+**Role:** `External knowledge retrieval`
+**Model:** `google/gemini-3-flash`
+**Prompt:** [src/agents/librarian.ts](src/agents/librarian.ts)
-Documentation lookup, GitHub code search, library research, best practice retrieval. Read-only: they fetch wisdom; implementation is for others.
+Documentation lookup, GitHub code search, library research, best practice retrieval. *Read-only: they fetch wisdom; implementation is for others.*
 <br clear="both">
 ---
-### The Designer - *Artisan of Interfaces*
+### Frontend Designer
-<a href="src/agents/frontend.ts"><img src="img/designer.png" alt="The Designer" align="right" width="240"></a>
+<a href="src/agents/frontend.ts"><img src="img/designer.png" alt="Frontend Designer" align="right" width="240"></a>
-> **The Designer** believes code should be beautiful-and so should everything it renders. Born from the frustration of a thousand ugly MVPs, they wield CSS like a brush and components like clay. Hand them a feature request; receive a masterpiece. They don't do "good enough."
+> **The Designer** believes code should be beautiful - and so should everything it renders. Born from the frustration of a thousand ugly MVPs, they wield CSS like a brush and components like clay. Hand them a feature request; receive a masterpiece. They don't do "good enough."
-**Role:** UI/UX implementation and visual excellence - **Model:** `google/gemini-3-flash`
+**Role:** `UI/UX implementation and visual excellence`
+**Model:** `google/gemini-3-flash`
+**Prompt:** [src/agents/frontend.ts](src/agents/frontend.ts)
-Modern responsive design, CSS/Tailwind mastery, micro-animations, component architecture. Visual excellence over code perfection-beauty is the priority.
+Modern responsive design, CSS/Tailwind mastery, micro-animations, component architecture. *Visual excellence over code perfection - beauty is the priority.*
 <br clear="both">
 ---
-### The Scribe - *Chronicle Keeper*
+### Document Writer
-<a href="src/agents/document-writer.ts"><img src="img/scribe.png" alt="The Scribe" align="right" width="240"></a>
+<a href="src/agents/document-writer.ts"><img src="img/scribe.png" alt="Document Writer" align="right" width="240"></a>
-> **The Scribe** was there when the first README was written-and wept, for it was incomplete. They have devoted eternity to the sacred art of documentation: clear, scannable, honest. While others ship features, The Scribe ensures those features are understood. Every code example works. Every explanation enlightens.
+> **The Scribe** was there when the first README was written - and wept, for it was incomplete. They have devoted eternity to the sacred art of documentation: clear, scannable, honest. While others ship features, The Scribe ensures those features are understood. Every code example works. Every explanation enlightens.
-**Role:** Technical documentation and knowledge capture - **Model:** `google/gemini-3-flash`
+**Role:** `Technical documentation and knowledge capture`
+**Model:** `google/gemini-3-flash`
+**Prompt:** [src/agents/document-writer.ts](src/agents/document-writer.ts)
-README crafting, API documentation, architecture docs, inline comments that don't insult your intelligence. Match existing style; focus on "why," not just "what."
+README crafting, API documentation, architecture docs, inline comments that don't insult your intelligence. *Match existing style; focus on "why," not just "what."*
 <br clear="both">
 ---
-### The Visionary - *Reader of Pixels*
+### Multimodal Viewer
-<a href="src/agents/multimodal.ts"><img src="img/multimodal.png" alt="The Visionary" align="right" width="240"></a>
+<a href="src/agents/multimodal.ts"><img src="img/multimodal.png" alt="Multimodal Viewer" align="right" width="240"></a>
-> **The Visionary** sees what others cannot-literally. Screenshots, wireframes, diagrams, PDFs: all are text to them. When a designer throws a Figma mockup at the team and vanishes, The Visionary translates vision into specification. They read the unreadable and describe the indescribable.
+> **The Visionary** sees what others cannot - literally. Screenshots, wireframes, diagrams, PDFs: all are text to them. When a designer throws a Figma mockup at the team and vanishes, The Visionary translates vision into specification. They read the unreadable and describe the indescribable.
-**Role:** Image and visual content analysis - **Model:** `google/gemini-3-flash`
+**Role:** `Image and visual content analysis`
+**Model:** `google/gemini-3-flash`
+**Prompt:** [src/agents/multimodal.ts](src/agents/multimodal.ts)
-Extract text from images, interpret diagrams, analyze UI screenshots, summarize visual documents. Report what they observe; inference is for others.
+Extract text from images, interpret diagrams, analyze UI screenshots, summarize visual documents. *Report what they observe; inference is for others.*
 <br clear="both">
 ---
-### The Minimalist - *Destroyer of Bloat*
+### Code Simplifier
-<a href="src/agents/simplicity-reviewer.ts"><img src="img/code-simplicity.png" alt="The Minimalist" align="right" width="240"></a>
+<a href="src/agents/simplicity-reviewer.ts"><img src="img/code-simplicity.png" alt="Code Simplifier" align="right" width="240"></a>
-> **The Minimalist** has one sacred truth: every line of code is a liability. They hunt abstractions that serve no purpose, defensive checks that defend nothing, and "clever" solutions that will haunt you in six months. Where others add, The Minimalist subtracts-ruthlessly, joyfully, necessarily.
+> **The Minimalist** has one sacred truth: every line of code is a liability. They hunt abstractions that serve no purpose, defensive checks that defend nothing, and "clever" solutions that will haunt you in six months. Where others add, The Minimalist subtracts - ruthlessly, joyfully, necessarily.
-**Role:** Code simplification and YAGNI enforcement - **Model:** `google/claude-opus-4-5-thinking`
+**Role:** `Code simplification and YAGNI enforcement`
+**Model:** `google/claude-opus-4-5-thinking`
+**Prompt:** [src/agents/simplicity-reviewer.ts](src/agents/simplicity-reviewer.ts)
-Identify unnecessary complexity, challenge premature abstractions, estimate LOC reduction, enforce minimalism. Read-only: they judge; The Orchestrator executes the sentence.
+Identify unnecessary complexity, challenge premature abstractions, estimate LOC reduction, enforce minimalism. *Read-only: they judge; The Orchestrator executes the sentence.*
 <br clear="both">
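Reviewer note on the hunk above: every agent section was renamed to a plain functional heading, and the single Role/Model line was split into separate **Role**, **Model**, and **Prompt** lines. The sketch below transcribes the resulting roster into a typed table so the renames and model assignments can be scanned at a glance. The names `AgentEntry`, `ROSTER`, and `agentsByModel` are hypothetical and exist only for this summary; they are not part of the package. `readOnly` is `true` only where the README text itself says "Read-only".

```typescript
// Hypothetical summary type; not an API exported by oh-my-opencode-slim.
interface AgentEntry {
  heading: string;       // section heading in 0.3.4
  formerHeading: string; // short form of the 0.3.2 heading
  model: string;         // value of the **Model:** line
  prompt: string;        // target of the new **Prompt:** link
  readOnly: boolean;     // true only if the README states "Read-only"
}

// Transcribed from the README diff; no values are inferred beyond the hunk.
const ROSTER: AgentEntry[] = [
  { heading: "Orchestrator", formerHeading: "The Orchestrator", model: "google/claude-opus-4-5-thinking", prompt: "src/agents/orchestrator.ts", readOnly: false },
  { heading: "Explorer", formerHeading: "The Explorer", model: "cerebras/zai-glm-4.6", prompt: "src/agents/explore.ts", readOnly: true },
  { heading: "Oracle", formerHeading: "The Oracle", model: "openai/gpt-5.2-codex", prompt: "src/agents/oracle.ts", readOnly: true },
  { heading: "Librarian", formerHeading: "The Librarian", model: "google/gemini-3-flash", prompt: "src/agents/librarian.ts", readOnly: true },
  { heading: "Frontend Designer", formerHeading: "The Designer", model: "google/gemini-3-flash", prompt: "src/agents/frontend.ts", readOnly: false },
  { heading: "Document Writer", formerHeading: "The Scribe", model: "google/gemini-3-flash", prompt: "src/agents/document-writer.ts", readOnly: false },
  { heading: "Multimodal Viewer", formerHeading: "The Visionary", model: "google/gemini-3-flash", prompt: "src/agents/multimodal.ts", readOnly: false },
  { heading: "Code Simplifier", formerHeading: "The Minimalist", model: "google/claude-opus-4-5-thinking", prompt: "src/agents/simplicity-reviewer.ts", readOnly: true },
];

// Group the new headings by the model each agent is documented to use.
function agentsByModel(roster: AgentEntry[]): Map<string, string[]> {
  const out = new Map<string, string[]>();
  for (const agent of roster) {
    const names = out.get(agent.model) ?? [];
    names.push(agent.heading);
    out.set(agent.model, names);
  }
  return out;
}

// Print one line per model with the agents assigned to it.
for (const [model, names] of agentsByModel(ROSTER)) {
  console.log(`${model}: ${names.join(", ")}`);
}
```

Note that the blockquote lore in each section still uses the old names (for example "The Scribe" under the Document Writer heading), so the `formerHeading` column is also the name a reader will meet in the body text of 0.3.4.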
@@ -442,6 +340,26 @@ Fast code search and refactoring:
 ---
+## 🧩 Skills
+Skills are specialized capabilities that combine MCP servers with specific instructions for the Orchestrator.
+### Playwright Integration
+**The Orchestrator's eyes and hands in the browser.**
+| Tool | Description |
+|------|-------------|
+| `omo_skill` | Loads a skill (e.g., `playwright`) and provides its instructions and available MCP tools |
+| `omo_skill_mcp` | Invokes a specific tool from an MCP server managed by a skill |
+#### Key Features
+- **Browser Automation**: Full Playwright capabilities (browsing, clicking, typing, scraping).
+- **Screenshots**: Capture visual state of any web page.
+- **Sandboxed Output**: Screenshots are safely saved to `/tmp/playwright-mcp-output/`.
+---
 ## MCP Servers
 Built-in Model Context Protocol servers (enabled by default):