npm - oh-my-claude-sisyphus - Versions diffs - 3.3.1 → 3.3.3 - Mend

oh-my-claude-sisyphus 3.3.1 → 3.3.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (6) hide show

package/README.md +42 -18
package/commands/research.md +19 -431
package/docs/CLAUDE.md +7 -0
package/docs/FULL-README.md +1 -1
package/package.json +1 -1
package/skills/research/SKILL.md +511 -0

package/README.md CHANGED Viewed

@@ -68,37 +68,60 @@ Want explicit control? Include these words anywhere in your message:
 ---
-## Data Analysis with Scientist Agent (v3.3.0)
+## Data Analysis & Research (v3.3.3)
-The scientist agent provides persistent Python execution for data analysis:
+### Scientist Agent Tiers
-```
-# Variables persist across calls - no need to reload data!
+Three tiers of scientist agents for quantitative analysis and data science:
+| Agent | Model | Use For |
+|-------|-------|---------|
+| `scientist-low` | Haiku | Quick data inspection, simple statistics, file enumeration |
+| `scientist` | Sonnet | Standard analysis, pattern detection, visualization |
+| `scientist-high` | Opus | Complex reasoning, hypothesis validation, ML workflows |
+**Features:**
+- **Persistent Python REPL** - Variables persist across calls (no pickle/reload overhead)
+- **Structured markers** - `[FINDING]`, `[STAT:*]`, `[DATA]`, `[LIMITATION]` for parsed output
+- **Quality gates** - Every finding requires statistical evidence (CI, effect size, p-value)
+- **Auto-visualization** - Charts saved to `.omc/scientist/figures/`
+- **Report generation** - Markdown reports with embedded figures
+```python
+# Variables persist across calls!
 python_repl(action="execute", researchSessionID="analysis",
             code="import pandas as pd; df = pd.read_csv('data.csv')")
-# df still exists in the next call
+# df still exists - no need to reload
 python_repl(action="execute", researchSessionID="analysis",
             code="print(df.describe())")
 ```
-**Features:**
-- Variable persistence via Unix socket bridge
-- Structured markers: `[FINDING]`, `[STAT:*]`, `[DATA]`, `[LIMITATION]`
-- Memory tracking (RSS/VMS)
-- Session locking for safe concurrent access
-### /research Command
+### /research Command (NEW)
-Orchestrate parallel scientist agents for comprehensive research:
+Orchestrate parallel scientist agents for comprehensive research workflows:
 ```
-/research <goal>           # Standard research with checkpoints
-/research AUTO: <goal>     # Fully autonomous until complete
-/research status           # Check current session
+/research <goal>                    # Standard research with checkpoints
+/research AUTO: <goal>              # Fully autonomous until complete
+/research status                    # Check current session
+/research resume                    # Resume interrupted session
+/research list                      # List all sessions
+/research report <session-id>       # Generate report for session
 ```
-Features multi-stage decomposition, smart model routing, cross-validation, and structured report generation.
+**Research Protocol:**
+1. **Decomposition** - Breaks goal into 3-7 independent stages
+2. **Parallel Execution** - Fires scientist agents concurrently (max 5)
+3. **Cross-Validation** - Verifies consistency across findings
+4. **Synthesis** - Generates comprehensive markdown report
+**Smart Model Routing:**
+- Data gathering tasks → `scientist-low` (Haiku)
+- Standard analysis → `scientist` (Sonnet)
+- Complex reasoning → `scientist-high` (Opus)
+**Session Management:** Research state persists at `.omc/research/{session-id}/` enabling resume after interruption.
 ---
@@ -117,7 +140,8 @@ I'll intelligently determine what to stop based on context.
 - **28 Specialized Agents** - architect, researcher, explore, designer, writer, vision, critic, analyst, executor, planner, qa-tester, scientist (with tier variants)
 - **30 Skills** - orchestrate, ultrawork, ralph, planner, deepsearch, deepinit, git-master, frontend-ui-ux, learner, research, and more
-- **Persistent Python REPL** - True variable persistence for data analysis (new in 3.3.0)
+- **Persistent Python REPL** - True variable persistence for data analysis
+- **Research Workflow** - Parallel scientist orchestration with `/research` command (new in 3.3.3)
 - **HUD Statusline** - Real-time visualization of orchestration state
 - **Learned Skills** - Extract reusable insights from sessions with `/learner`
 - **Memory System** - Persistent context that survives compaction

package/commands/research.md CHANGED Viewed

@@ -1,22 +1,12 @@
 ---
-name: research
 description: Orchestrate parallel scientist agents for comprehensive research with AUTO mode
-user-invocable: true
-argument-hint: <research goal>
 ---
 # Research Skill
-Orchestrate parallel scientist agents for comprehensive research workflows with optional AUTO mode for fully autonomous execution.
-## Overview
-Research is a multi-stage workflow that decomposes complex research goals into parallel investigations:
+[RESEARCH MODE ACTIVATED]
-1. **Decomposition** - Break research goal into independent stages/hypotheses
-2. **Execution** - Run parallel scientist agents on each stage
-3. **Verification** - Cross-validate findings, check consistency
-4. **Synthesis** - Aggregate results into comprehensive report
+Orchestrate parallel scientist agents for comprehensive research workflows with optional AUTO mode for fully autonomous execution.
 ## Usage Examples
@@ -29,34 +19,16 @@ Research is a multi-stage workflow that decomposes complex research goals into p
 /research report <session-id>       # Generate report for session
 ```
-### Quick Examples
-```
-/research What are the performance characteristics of different sorting algorithms?
-/research AUTO: Analyze authentication patterns in this codebase
-/research How does the error handling work across the API layer?
-```
 ## Research Protocol
-### Stage Decomposition Pattern
+### Stage Decomposition
 When given a research goal, decompose into 3-7 independent stages:
-```markdown
-## Research Decomposition
-**Goal:** <original research goal>
-### Stage 1: <stage-name>
-- **Focus:** What this stage investigates
-- **Hypothesis:** Expected finding (if applicable)
-- **Scope:** Files/areas to examine
-- **Tier:** LOW | MEDIUM | HIGH
-### Stage 2: <stage-name>
-...
-```
+1. **Decomposition** - Break research goal into independent stages/hypotheses
+2. **Execution** - Run parallel scientist agents on each stage
+3. **Verification** - Cross-validate findings, check consistency
+4. **Synthesis** - Aggregate results into comprehensive report
 ### Parallel Scientist Invocation
@@ -75,65 +47,20 @@ Task(subagent_type="oh-my-claudecode:scientist-high", model="opus", prompt="[RES
 ### Smart Model Routing
-**CRITICAL: Always pass `model` parameter explicitly!**
 | Task Complexity | Agent | Model | Use For |
 |-----------------|-------|-------|---------|
 | Data gathering | `scientist-low` | haiku | File enumeration, pattern counting, simple lookups |
 | Standard analysis | `scientist` | sonnet | Code analysis, pattern detection, documentation review |
 | Complex reasoning | `scientist-high` | opus | Architecture analysis, cross-cutting concerns, hypothesis validation |
-### Routing Decision Guide
-| Research Task | Tier | Example Prompt |
-|---------------|------|----------------|
-| "Count occurrences of X" | LOW | "Count all usages of useState hook" |
-| "Find all files matching Y" | LOW | "List all test files in the project" |
-| "Analyze pattern Z" | MEDIUM | "Analyze error handling patterns in API routes" |
-| "Document how W works" | MEDIUM | "Document the authentication flow" |
-| "Explain why X happens" | HIGH | "Explain why race conditions occur in the cache layer" |
-| "Compare approaches A vs B" | HIGH | "Compare Redux vs Context for state management here" |
-### Verification Loop
-After parallel execution completes, verify findings:
+### Concurrency Limit
-```
-// Cross-validation stage
-Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="
-[RESEARCH_VERIFICATION]
-Cross-validate these findings for consistency:
-Stage 1 findings: <summary>
-Stage 2 findings: <summary>
-Stage 3 findings: <summary>
-Check for:
-1. Contradictions between stages
-2. Missing connections
-3. Gaps in coverage
-4. Evidence quality
-Output: [VERIFIED] or [CONFLICTS:<list>]
-")
-```
+**Maximum 5 concurrent scientist agents** to prevent resource exhaustion.
 ## AUTO Mode
 AUTO mode runs the complete research workflow autonomously with loop control.
-### Loop Control Protocol
-```
-[RESEARCH + AUTO - ITERATION {{ITERATION}}/{{MAX}}]
-Your previous attempt did not output the completion promise. Continue working.
-Current state: {{STATE}}
-Completed stages: {{COMPLETED_STAGES}}
-Pending stages: {{PENDING_STAGES}}
-```
 ### Promise Tags
 | Tag | Meaning | When to Use |
@@ -143,340 +70,18 @@ Pending stages: {{PENDING_STAGES}}
 ### AUTO Mode Rules
-1. **Max Iterations:** 10 (configurable)
+1. **Max Iterations:** 10
 2. **Continue until:** Promise tag emitted OR max iterations
 3. **State tracking:** Persist after each stage completion
 4. **Cancellation:** `/cancel-research` or "stop", "cancel"
-### AUTO Mode Example
-```
-/research AUTO: Comprehensive security analysis of the authentication system
-[Decomposition]
-- Stage 1 (LOW): Enumerate auth-related files
-- Stage 2 (MEDIUM): Analyze token handling
-- Stage 3 (MEDIUM): Review session management
-- Stage 4 (HIGH): Identify vulnerability patterns
-- Stage 5 (MEDIUM): Document security controls
-[Execution - Parallel]
-Firing stages 1-3 in parallel...
-Firing stages 4-5 after dependencies complete...
-[Verification]
-Cross-validating findings...
-[Synthesis]
-Generating report...
-[PROMISE:RESEARCH_COMPLETE]
-```
-## Parallel Execution Patterns
-### Independent Dataset Analysis (Parallel)
-When stages analyze different data sources:
-```
-// All fire simultaneously
-Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[STAGE:1] Analyze src/api/...")
-Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[STAGE:2] Analyze src/utils/...")
-Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[STAGE:3] Analyze src/components/...")
-```
-### Hypothesis Battery (Parallel)
-When testing multiple hypotheses:
-```
-// Test hypotheses simultaneously
-Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[HYPOTHESIS:A] Test if caching improves...")
-Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[HYPOTHESIS:B] Test if batching reduces...")
-Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[HYPOTHESIS:C] Test if lazy loading helps...")
-```
-### Cross-Validation (Sequential)
-When verification depends on all findings:
-```
-// Wait for all parallel stages
-[stages complete]
-// Then sequential verification
-Task(subagent_type="oh-my-claudecode:scientist-high", model="opus", prompt="
-[CROSS_VALIDATION]
-Validate consistency across all findings:
-- Finding 1: ...
-- Finding 2: ...
-- Finding 3: ...
-")
-```
-### Concurrency Limit
-**Maximum 5 concurrent scientist agents** to prevent resource exhaustion.
-If more than 5 stages, batch them:
-```
-Batch 1: Stages 1-5 (parallel)
-[wait for completion]
-Batch 2: Stages 6-7 (parallel)
-```
 ## Session Management
-### Directory Structure
-```
-.omc/research/{session-id}/
-  state.json              # Session state and progress
-  stages/
-    stage-1.md            # Stage 1 findings
-    stage-2.md            # Stage 2 findings
-    ...
-  findings/
-    raw/                  # Raw findings from scientists
-    verified/             # Post-verification findings
-  figures/
-    figure-1.png          # Generated visualizations
-    ...
-  report.md               # Final synthesized report
-```
-### State File Format
-```json
-{
-  "id": "research-20240115-abc123",
-  "goal": "Original research goal",
-  "status": "in_progress | complete | blocked | cancelled",
-  "mode": "standard | auto",
-  "iteration": 3,
-  "maxIterations": 10,
-  "stages": [
-    {
-      "id": 1,
-      "name": "Stage name",
-      "tier": "LOW | MEDIUM | HIGH",
-      "status": "pending | running | complete | failed",
-      "startedAt": "ISO timestamp",
-      "completedAt": "ISO timestamp",
-      "findingsFile": "stages/stage-1.md"
-    }
-  ],
-  "verification": {
-    "status": "pending | passed | failed",
-    "conflicts": [],
-    "completedAt": "ISO timestamp"
-  },
-  "createdAt": "ISO timestamp",
-  "updatedAt": "ISO timestamp"
-}
-```
-### Session Commands
-| Command | Action |
-|---------|--------|
-| `/research status` | Show current session progress |
-| `/research resume` | Resume most recent interrupted session |
-| `/research resume <session-id>` | Resume specific session |
-| `/research list` | List all sessions with status |
-| `/research report <session-id>` | Generate/regenerate report |
-| `/research cancel` | Cancel current session (preserves state) |
-## Tag Extraction
-Scientists use structured tags for findings. Extract them with these patterns:
-### Finding Tags
-```
-[FINDING:<id>] <title>
-<evidence and analysis>
-[/FINDING]
-[EVIDENCE:<finding-id>]
-- File: <path>
-- Lines: <range>
-- Content: <relevant code/text>
-[/EVIDENCE]
-[CONFIDENCE:<level>] # HIGH | MEDIUM | LOW
-<reasoning for confidence level>
-```
-### Extraction Regex Patterns
-```javascript
-// Finding extraction
-const findingPattern = /\[FINDING:(\w+)\]\s*(.*?)\n([\s\S]*?)\[\/FINDING\]/g;
-// Evidence extraction
-const evidencePattern = /\[EVIDENCE:(\w+)\]([\s\S]*?)\[\/EVIDENCE\]/g;
-// Confidence extraction
-const confidencePattern = /\[CONFIDENCE:(HIGH|MEDIUM|LOW)\]\s*(.*)/g;
-// Stage completion
-const stageCompletePattern = /\[STAGE_COMPLETE:(\d+)\]/;
-// Verification result
-const verificationPattern = /\[(VERIFIED|CONFLICTS):?(.*?)\]/;
-```
-### Evidence Window
-When extracting evidence, include context window:
-```
-[EVIDENCE:F1]
-- File: /src/auth/login.ts
-- Lines: 45-52 (context: 40-57)
-- Content:
-  ```typescript
-  // Lines 45-52 with 5 lines context above/below
-  ```
-[/EVIDENCE]
-```
-### Quality Validation
-Findings must meet quality threshold:
-| Quality Check | Requirement |
-|---------------|-------------|
-| Evidence present | At least 1 [EVIDENCE] per [FINDING] |
-| Confidence stated | Each finding has [CONFIDENCE] |
-| Source cited | File paths are absolute and valid |
-| Reproducible | Another agent could verify |
-## Report Generation
-### Report Template
-```markdown
-# Research Report: {{GOAL}}
-**Session ID:** {{SESSION_ID}}
-**Date:** {{DATE}}
-**Status:** {{STATUS}}
-## Executive Summary
-{{2-3 paragraph summary of key findings}}
-## Methodology
-### Research Stages
-| Stage | Focus | Tier | Status |
-|-------|-------|------|--------|
-{{STAGES_TABLE}}
-### Approach
-{{Description of decomposition rationale and execution strategy}}
-## Key Findings
-### Finding 1: {{TITLE}}
-**Confidence:** {{HIGH|MEDIUM|LOW}}
-{{Detailed finding with evidence}}
-#### Evidence
-{{Embedded evidence blocks}}
-### Finding 2: {{TITLE}}
-...
-## Visualizations
-{{FIGURES}}
-## Cross-Validation Results
-{{Verification summary, any conflicts resolved}}
-## Limitations
-- {{Limitation 1}}
-- {{Limitation 2}}
-- {{Areas not covered and why}}
-## Recommendations
-1. {{Actionable recommendation}}
-2. {{Actionable recommendation}}
-## Appendix
-### Raw Data
-{{Links to raw findings files}}
-### Session State
-{{Link to state.json}}
-```
-### Figure Embedding Protocol
-Scientists generate visualizations using this marker:
-```
-[FIGURE:path/to/figure.png]
-Caption: Description of what the figure shows
-Alt: Accessibility description
-[/FIGURE]
-```
-Report generator embeds figures:
-```markdown
-## Visualizations
-![Figure 1: Description](figures/figure-1.png)
-*Caption: Description of what the figure shows*
-![Figure 2: Description](figures/figure-2.png)
-*Caption: Description of what the figure shows*
-```
-### Figure Types
-| Type | Use For | Generated By |
-|------|---------|--------------|
-| Architecture diagram | System structure | scientist-high |
-| Flow chart | Process flows | scientist |
-| Dependency graph | Module relationships | scientist |
-| Timeline | Sequence of events | scientist |
-| Comparison table | A vs B analysis | scientist |
-## Configuration
-Optional settings in `.claude/settings.json`:
-```json
-{
-  "omc": {
-    "research": {
-      "maxIterations": 10,
-      "maxConcurrentScientists": 5,
-      "defaultTier": "MEDIUM",
-      "autoVerify": true,
-      "generateFigures": true,
-      "evidenceContextLines": 5
-    }
-  }
-}
-```
+Sessions are stored at `.omc/research/{session-id}/` with:
+- `state.json` - Session state and progress
+- `stages/` - Individual stage findings
+- `findings/` - Raw and verified findings
+- `report.md` - Final synthesized report
 ## Cancellation
@@ -486,26 +91,9 @@ Optional settings in `.claude/settings.json`:
 Or say: "stop research", "cancel research", "abort"
-Progress is preserved in `.omc/research/{session-id}/` for resume.
+Progress is preserved for resume.
-## Troubleshooting
-**Stuck in verification loop?**
-- Check for conflicting findings between stages
-- Review state.json for specific conflicts
-- May need to re-run specific stages with different approach
-**Scientists returning low-quality findings?**
-- Check tier assignment - complex analysis needs HIGH tier
-- Ensure prompts include clear scope and expected output format
-- Review if research goal is too broad
-**AUTO mode exhausted iterations?**
-- Review state to see where it's stuck
-- Check if goal is achievable with available data
-- Consider breaking into smaller research sessions
+---
-**Missing figures in report?**
-- Verify figures/ directory exists
-- Check [FIGURE:] tags in findings
-- Ensure paths are relative to session directory
+Research goal:
+{{PROMPT}}

package/docs/CLAUDE.md CHANGED Viewed

@@ -34,6 +34,7 @@ RULE 4: NEVER complete without Architect verification
 | **Deep analysis** | NEVER | architect / analyst |
 | **Codebase exploration** | NEVER | explore / explore-medium |
 | **Research tasks** | NEVER | researcher |
+| **Data analysis** | NEVER | scientist / scientist-high |
 | **Visual analysis** | NEVER | vision |
 ### Mandatory Skill Invocation
@@ -52,6 +53,7 @@ When you detect these patterns, you MUST invoke the corresponding skill:
 | Git/commit work | `git-master` (silent) |
 | "analyze", "debug", "investigate" | `analyze` |
 | "search", "find in codebase" | `deepsearch` |
+| "research", "analyze data", "statistics" | `research` |
 | "stop", "cancel", "abort" | appropriate cancel skill |
 ### Smart Model Routing (SAVE TOKENS)
@@ -186,6 +188,7 @@ User says "stop", "cancel", "abort" → You determine what to stop:
 | `cancel-ralph` | Cancel active ralph loop | "stop" in ralph | `/cancel-ralph` |
 | `cancel-ultrawork` | Cancel ultrawork mode | "stop" in ultrawork | `/cancel-ultrawork` |
 | `cancel-ultraqa` | Cancel ultraqa workflow | "stop" in ultraqa | `/cancel-ultraqa` |
+| `research` | Parallel scientist orchestration | "research", "analyze data" | `/research` |
 ### All 28 Agents
@@ -208,6 +211,7 @@ Always use `oh-my-claudecode:` prefix when calling via Task tool.
 | **Build** | `build-fixer-low` | `build-fixer` | - |
 | **TDD** | `tdd-guide-low` | `tdd-guide` | - |
 | **Code Review** | `code-reviewer-low` | - | `code-reviewer` |
+| **Data Science** | `scientist-low` | `scientist` | `scientist-high` |
 ### Agent Selection Guide
@@ -237,6 +241,9 @@ Always use `oh-my-claudecode:` prefix when calling via Task tool.
 | Quick test suggestions | `tdd-guide-low` | haiku |
 | Code review | `code-reviewer` | opus |
 | Quick code check | `code-reviewer-low` | haiku |
+| Data analysis/stats | `scientist` | sonnet |
+| Quick data inspection | `scientist-low` | haiku |
+| Complex ML/hypothesis | `scientist-high` | opus |
 ---

package/docs/FULL-README.md CHANGED Viewed

@@ -10,7 +10,7 @@
 # oh-my-claudecode
-[![Version](https://img.shields.io/badge/version-3.2.0-ff6b6b)](https://github.com/Yeachan-Heo/oh-my-claudecode)
+[![Version](https://img.shields.io/badge/version-3.3.3-ff6b6b)](https://github.com/Yeachan-Heo/oh-my-claudecode)
 [![npm version](https://img.shields.io/npm/v/oh-my-claudecode?color=cb3837)](https://www.npmjs.com/package/oh-my-claudecode)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 [![Node.js](https://img.shields.io/badge/Node.js-20+-339933?logo=node.js&logoColor=white)](https://nodejs.org/)

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "oh-my-claude-sisyphus",
-  "version": "3.3.1",
+  "version": "3.3.3",
   "description": "Multi-agent orchestration system for Claude Code - Inspired by oh-my-opencode",
   "type": "module",
   "main": "dist/index.js",

package/skills/research/SKILL.md ADDED Viewed

@@ -0,0 +1,511 @@
+---
+name: research
+description: Orchestrate parallel scientist agents for comprehensive research with AUTO mode
+user-invocable: true
+argument-hint: <research goal>
+---
+# Research Skill
+Orchestrate parallel scientist agents for comprehensive research workflows with optional AUTO mode for fully autonomous execution.
+## Overview
+Research is a multi-stage workflow that decomposes complex research goals into parallel investigations:
+1. **Decomposition** - Break research goal into independent stages/hypotheses
+2. **Execution** - Run parallel scientist agents on each stage
+3. **Verification** - Cross-validate findings, check consistency
+4. **Synthesis** - Aggregate results into comprehensive report
+## Usage Examples
+```
+/research <goal>                    # Standard research with user checkpoints
+/research AUTO: <goal>              # Fully autonomous until complete
+/research status                    # Check current research session status
+/research resume                    # Resume interrupted research session
+/research list                      # List all research sessions
+/research report <session-id>       # Generate report for session
+```
+### Quick Examples
+```
+/research What are the performance characteristics of different sorting algorithms?
+/research AUTO: Analyze authentication patterns in this codebase
+/research How does the error handling work across the API layer?
+```
+## Research Protocol
+### Stage Decomposition Pattern
+When given a research goal, decompose into 3-7 independent stages:
+```markdown
+## Research Decomposition
+**Goal:** <original research goal>
+### Stage 1: <stage-name>
+- **Focus:** What this stage investigates
+- **Hypothesis:** Expected finding (if applicable)
+- **Scope:** Files/areas to examine
+- **Tier:** LOW | MEDIUM | HIGH
+### Stage 2: <stage-name>
+...
+```
+### Parallel Scientist Invocation
+Fire independent stages in parallel via Task tool:
+```
+// Stage 1 - Simple data gathering
+Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[RESEARCH_STAGE:1] Investigate...")
+// Stage 2 - Standard analysis
+Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[RESEARCH_STAGE:2] Analyze...")
+// Stage 3 - Complex reasoning
+Task(subagent_type="oh-my-claudecode:scientist-high", model="opus", prompt="[RESEARCH_STAGE:3] Deep analysis of...")
+```
+### Smart Model Routing
+**CRITICAL: Always pass `model` parameter explicitly!**
+| Task Complexity | Agent | Model | Use For |
+|-----------------|-------|-------|---------|
+| Data gathering | `scientist-low` | haiku | File enumeration, pattern counting, simple lookups |
+| Standard analysis | `scientist` | sonnet | Code analysis, pattern detection, documentation review |
+| Complex reasoning | `scientist-high` | opus | Architecture analysis, cross-cutting concerns, hypothesis validation |
+### Routing Decision Guide
+| Research Task | Tier | Example Prompt |
+|---------------|------|----------------|
+| "Count occurrences of X" | LOW | "Count all usages of useState hook" |
+| "Find all files matching Y" | LOW | "List all test files in the project" |
+| "Analyze pattern Z" | MEDIUM | "Analyze error handling patterns in API routes" |
+| "Document how W works" | MEDIUM | "Document the authentication flow" |
+| "Explain why X happens" | HIGH | "Explain why race conditions occur in the cache layer" |
+| "Compare approaches A vs B" | HIGH | "Compare Redux vs Context for state management here" |
+### Verification Loop
+After parallel execution completes, verify findings:
+```
+// Cross-validation stage
+Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="
+[RESEARCH_VERIFICATION]
+Cross-validate these findings for consistency:
+Stage 1 findings: <summary>
+Stage 2 findings: <summary>
+Stage 3 findings: <summary>
+Check for:
+1. Contradictions between stages
+2. Missing connections
+3. Gaps in coverage
+4. Evidence quality
+Output: [VERIFIED] or [CONFLICTS:<list>]
+")
+```
+## AUTO Mode
+AUTO mode runs the complete research workflow autonomously with loop control.
+### Loop Control Protocol
+```
+[RESEARCH + AUTO - ITERATION {{ITERATION}}/{{MAX}}]
+Your previous attempt did not output the completion promise. Continue working.
+Current state: {{STATE}}
+Completed stages: {{COMPLETED_STAGES}}
+Pending stages: {{PENDING_STAGES}}
+```
+### Promise Tags
+| Tag | Meaning | When to Use |
+|-----|---------|-------------|
+| `[PROMISE:RESEARCH_COMPLETE]` | Research finished successfully | All stages done, verified, report generated |
+| `[PROMISE:RESEARCH_BLOCKED]` | Cannot proceed | Missing data, access issues, circular dependency |
+### AUTO Mode Rules
+1. **Max Iterations:** 10 (configurable)
+2. **Continue until:** Promise tag emitted OR max iterations
+3. **State tracking:** Persist after each stage completion
+4. **Cancellation:** `/cancel-research` or "stop", "cancel"
+### AUTO Mode Example
+```
+/research AUTO: Comprehensive security analysis of the authentication system
+[Decomposition]
+- Stage 1 (LOW): Enumerate auth-related files
+- Stage 2 (MEDIUM): Analyze token handling
+- Stage 3 (MEDIUM): Review session management
+- Stage 4 (HIGH): Identify vulnerability patterns
+- Stage 5 (MEDIUM): Document security controls
+[Execution - Parallel]
+Firing stages 1-3 in parallel...
+Firing stages 4-5 after dependencies complete...
+[Verification]
+Cross-validating findings...
+[Synthesis]
+Generating report...
+[PROMISE:RESEARCH_COMPLETE]
+```
+## Parallel Execution Patterns
+### Independent Dataset Analysis (Parallel)
+When stages analyze different data sources:
+```
+// All fire simultaneously
+Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[STAGE:1] Analyze src/api/...")
+Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[STAGE:2] Analyze src/utils/...")
+Task(subagent_type="oh-my-claudecode:scientist-low", model="haiku", prompt="[STAGE:3] Analyze src/components/...")
+```
+### Hypothesis Battery (Parallel)
+When testing multiple hypotheses:
+```
+// Test hypotheses simultaneously
+Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[HYPOTHESIS:A] Test if caching improves...")
+Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[HYPOTHESIS:B] Test if batching reduces...")
+Task(subagent_type="oh-my-claudecode:scientist", model="sonnet", prompt="[HYPOTHESIS:C] Test if lazy loading helps...")
+```
+### Cross-Validation (Sequential)
+When verification depends on all findings:
+```
+// Wait for all parallel stages
+[stages complete]
+// Then sequential verification
+Task(subagent_type="oh-my-claudecode:scientist-high", model="opus", prompt="
+[CROSS_VALIDATION]
+Validate consistency across all findings:
+- Finding 1: ...
+- Finding 2: ...
+- Finding 3: ...
+")
+```
+### Concurrency Limit
+**Maximum 5 concurrent scientist agents** to prevent resource exhaustion.
+If more than 5 stages, batch them:
+```
+Batch 1: Stages 1-5 (parallel)
+[wait for completion]
+Batch 2: Stages 6-7 (parallel)
+```
+## Session Management
+### Directory Structure
+```
+.omc/research/{session-id}/
+  state.json              # Session state and progress
+  stages/
+    stage-1.md            # Stage 1 findings
+    stage-2.md            # Stage 2 findings
+    ...
+  findings/
+    raw/                  # Raw findings from scientists
+    verified/             # Post-verification findings
+  figures/
+    figure-1.png          # Generated visualizations
+    ...
+  report.md               # Final synthesized report
+```
+### State File Format
+```json
+{
+  "id": "research-20240115-abc123",
+  "goal": "Original research goal",
+  "status": "in_progress | complete | blocked | cancelled",
+  "mode": "standard | auto",
+  "iteration": 3,
+  "maxIterations": 10,
+  "stages": [
+    {
+      "id": 1,
+      "name": "Stage name",
+      "tier": "LOW | MEDIUM | HIGH",
+      "status": "pending | running | complete | failed",
+      "startedAt": "ISO timestamp",
+      "completedAt": "ISO timestamp",
+      "findingsFile": "stages/stage-1.md"
+    }
+  ],
+  "verification": {
+    "status": "pending | passed | failed",
+    "conflicts": [],
+    "completedAt": "ISO timestamp"
+  },
+  "createdAt": "ISO timestamp",
+  "updatedAt": "ISO timestamp"
+}
+```
+### Session Commands
+| Command | Action |
+|---------|--------|
+| `/research status` | Show current session progress |
+| `/research resume` | Resume most recent interrupted session |
+| `/research resume <session-id>` | Resume specific session |
+| `/research list` | List all sessions with status |
+| `/research report <session-id>` | Generate/regenerate report |
+| `/research cancel` | Cancel current session (preserves state) |
+## Tag Extraction
+Scientists use structured tags for findings. Extract them with these patterns:
+### Finding Tags
+```
+[FINDING:<id>] <title>
+<evidence and analysis>
+[/FINDING]
+[EVIDENCE:<finding-id>]
+- File: <path>
+- Lines: <range>
+- Content: <relevant code/text>
+[/EVIDENCE]
+[CONFIDENCE:<level>] # HIGH | MEDIUM | LOW
+<reasoning for confidence level>
+```
+### Extraction Regex Patterns
+```javascript
+// Finding extraction
+const findingPattern = /\[FINDING:(\w+)\]\s*(.*?)\n([\s\S]*?)\[\/FINDING\]/g;
+// Evidence extraction
+const evidencePattern = /\[EVIDENCE:(\w+)\]([\s\S]*?)\[\/EVIDENCE\]/g;
+// Confidence extraction
+const confidencePattern = /\[CONFIDENCE:(HIGH|MEDIUM|LOW)\]\s*(.*)/g;
+// Stage completion
+const stageCompletePattern = /\[STAGE_COMPLETE:(\d+)\]/;
+// Verification result
+const verificationPattern = /\[(VERIFIED|CONFLICTS):?(.*?)\]/;
+```
+### Evidence Window
+When extracting evidence, include context window:
+```
+[EVIDENCE:F1]
+- File: /src/auth/login.ts
+- Lines: 45-52 (context: 40-57)
+- Content:
+  ```typescript
+  // Lines 45-52 with 5 lines context above/below
+  ```
+[/EVIDENCE]
+```
+### Quality Validation
+Findings must meet quality threshold:
+| Quality Check | Requirement |
+|---------------|-------------|
+| Evidence present | At least 1 [EVIDENCE] per [FINDING] |
+| Confidence stated | Each finding has [CONFIDENCE] |
+| Source cited | File paths are absolute and valid |
+| Reproducible | Another agent could verify |
+## Report Generation
+### Report Template
+```markdown
+# Research Report: {{GOAL}}
+**Session ID:** {{SESSION_ID}}
+**Date:** {{DATE}}
+**Status:** {{STATUS}}
+## Executive Summary
+{{2-3 paragraph summary of key findings}}
+## Methodology
+### Research Stages
+| Stage | Focus | Tier | Status |
+|-------|-------|------|--------|
+{{STAGES_TABLE}}
+### Approach
+{{Description of decomposition rationale and execution strategy}}
+## Key Findings
+### Finding 1: {{TITLE}}
+**Confidence:** {{HIGH|MEDIUM|LOW}}
+{{Detailed finding with evidence}}
+#### Evidence
+{{Embedded evidence blocks}}
+### Finding 2: {{TITLE}}
+...
+## Visualizations
+{{FIGURES}}
+## Cross-Validation Results
+{{Verification summary, any conflicts resolved}}
+## Limitations
+- {{Limitation 1}}
+- {{Limitation 2}}
+- {{Areas not covered and why}}
+## Recommendations
+1. {{Actionable recommendation}}
+2. {{Actionable recommendation}}
+## Appendix
+### Raw Data
+{{Links to raw findings files}}
+### Session State
+{{Link to state.json}}
+```
+### Figure Embedding Protocol
+Scientists generate visualizations using this marker:
+```
+[FIGURE:path/to/figure.png]
+Caption: Description of what the figure shows
+Alt: Accessibility description
+[/FIGURE]
+```
+Report generator embeds figures:
+```markdown
+## Visualizations
+![Figure 1: Description](figures/figure-1.png)
+*Caption: Description of what the figure shows*
+![Figure 2: Description](figures/figure-2.png)
+*Caption: Description of what the figure shows*
+```
+### Figure Types
+| Type | Use For | Generated By |
+|------|---------|--------------|
+| Architecture diagram | System structure | scientist-high |
+| Flow chart | Process flows | scientist |
+| Dependency graph | Module relationships | scientist |
+| Timeline | Sequence of events | scientist |
+| Comparison table | A vs B analysis | scientist |
+## Configuration
+Optional settings in `.claude/settings.json`:
+```json
+{
+  "omc": {
+    "research": {
+      "maxIterations": 10,
+      "maxConcurrentScientists": 5,
+      "defaultTier": "MEDIUM",
+      "autoVerify": true,
+      "generateFigures": true,
+      "evidenceContextLines": 5
+    }
+  }
+}
+```
+## Cancellation
+```
+/cancel-research
+```
+Or say: "stop research", "cancel research", "abort"
+Progress is preserved in `.omc/research/{session-id}/` for resume.
+## Troubleshooting
+**Stuck in verification loop?**
+- Check for conflicting findings between stages
+- Review state.json for specific conflicts
+- May need to re-run specific stages with different approach
+**Scientists returning low-quality findings?**
+- Check tier assignment - complex analysis needs HIGH tier
+- Ensure prompts include clear scope and expected output format
+- Review if research goal is too broad
+**AUTO mode exhausted iterations?**
+- Review state to see where it's stuck
+- Check if goal is achievable with available data
+- Consider breaking into smaller research sessions
+**Missing figures in report?**
+- Verify figures/ directory exists
+- Check [FIGURE:] tags in findings
+- Ensure paths are relative to session directory