npm - opencode-multiagent - Versions diffs - 0.2.1 → 0.4.0 - Mend

opencode-multiagent 0.2.1 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (160) hide show

package/AGENTS.md +83 -0
package/CHANGELOG.md +31 -0
package/CONTRIBUTING.md +36 -0
package/README.md +44 -168
package/README.tr.md +84 -0
package/RELEASE.md +68 -0
package/agents/AGENTS.md +91 -0
package/agents/auditor.md +67 -23
package/agents/{worker.md → coder.md} +24 -17
package/agents/docmaster.md +91 -0
package/agents/executor.md +63 -79
package/agents/planner.md +78 -58
package/agents/reviewer.md +31 -15
package/agents/scout.md +25 -17
package/agents/sec-coder.md +83 -0
package/agents/ui-coder.md +77 -0
package/commands/board.md +17 -0
package/commands/execute.md +9 -7
package/commands/init-deep.md +7 -6
package/commands/init.md +5 -5
package/commands/inspect.md +6 -5
package/commands/plan.md +8 -6
package/commands/quality.md +4 -3
package/commands/review.md +5 -3
package/commands/status.md +5 -3
package/defaults/AGENTS.md +48 -0
package/defaults/opencode-multiagent.json +180 -0
package/defaults/opencode-multiagent.schema.json +265 -0
package/dist/control-plane.d.ts +4 -0
package/dist/control-plane.d.ts.map +1 -0
package/dist/index.d.ts +5 -0
package/dist/index.d.ts.map +1 -0
package/dist/index.js +1916 -0
package/dist/opencode-multiagent/compiler.d.ts +25 -0
package/dist/opencode-multiagent/compiler.d.ts.map +1 -0
package/dist/opencode-multiagent/constants.d.ts +128 -0
package/dist/opencode-multiagent/constants.d.ts.map +1 -0
package/dist/opencode-multiagent/correlation.d.ts +21 -0
package/dist/opencode-multiagent/correlation.d.ts.map +1 -0
package/dist/opencode-multiagent/defaults.d.ts +10 -0
package/dist/opencode-multiagent/defaults.d.ts.map +1 -0
package/dist/opencode-multiagent/hooks.d.ts +62 -0
package/dist/opencode-multiagent/hooks.d.ts.map +1 -0
package/dist/opencode-multiagent/log.d.ts +2 -0
package/dist/opencode-multiagent/log.d.ts.map +1 -0
package/dist/opencode-multiagent/markdown.d.ts +8 -0
package/dist/opencode-multiagent/markdown.d.ts.map +1 -0
package/dist/opencode-multiagent/mcp.d.ts +3 -0
package/dist/opencode-multiagent/mcp.d.ts.map +1 -0
package/dist/opencode-multiagent/policy.d.ts +5 -0
package/dist/opencode-multiagent/policy.d.ts.map +1 -0
package/dist/opencode-multiagent/quality.d.ts +18 -0
package/dist/opencode-multiagent/quality.d.ts.map +1 -0
package/dist/opencode-multiagent/runtime.d.ts +7 -0
package/dist/opencode-multiagent/runtime.d.ts.map +1 -0
package/dist/opencode-multiagent/session-tracker.d.ts +32 -0
package/dist/opencode-multiagent/session-tracker.d.ts.map +1 -0
package/dist/opencode-multiagent/skills.d.ts +17 -0
package/dist/opencode-multiagent/skills.d.ts.map +1 -0
package/dist/opencode-multiagent/supervision.d.ts +26 -0
package/dist/opencode-multiagent/supervision.d.ts.map +1 -0
package/dist/opencode-multiagent/task-manager.d.ts +54 -0
package/dist/opencode-multiagent/task-manager.d.ts.map +1 -0
package/dist/opencode-multiagent/telemetry.d.ts +28 -0
package/dist/opencode-multiagent/telemetry.d.ts.map +1 -0
package/dist/opencode-multiagent/tools.d.ts +87 -0
package/dist/opencode-multiagent/tools.d.ts.map +1 -0
package/dist/opencode-multiagent/types.d.ts +36 -0
package/dist/opencode-multiagent/types.d.ts.map +1 -0
package/dist/opencode-multiagent/utils.d.ts +9 -0
package/dist/opencode-multiagent/utils.d.ts.map +1 -0
package/docs/agents.md +148 -0
package/docs/agents.tr.md +149 -0
package/docs/configuration.md +244 -0
package/docs/configuration.tr.md +244 -0
package/docs/usage-guide.md +224 -0
package/docs/usage-guide.tr.md +225 -0
package/examples/opencode.with-overrides.json +3 -7
package/package.json +23 -13
package/skills/AGENTS.md +51 -0
package/skills/advanced-evaluation/SKILL.md +37 -21
package/skills/advanced-evaluation/manifest.json +2 -13
package/skills/cek-context-engineering/SKILL.md +159 -87
package/skills/cek-context-engineering/manifest.json +1 -3
package/skills/cek-prompt-engineering/SKILL.md +13 -10
package/skills/cek-prompt-engineering/manifest.json +1 -3
package/skills/cek-test-prompt/SKILL.md +38 -28
package/skills/cek-test-prompt/manifest.json +1 -3
package/skills/cek-thought-based-reasoning/SKILL.md +75 -21
package/skills/cek-thought-based-reasoning/manifest.json +1 -3
package/skills/context-degradation/SKILL.md +14 -13
package/skills/context-degradation/manifest.json +1 -3
package/skills/debate/SKILL.md +23 -78
package/skills/debate/manifest.json +2 -12
package/skills/design-first/manifest.json +2 -13
package/skills/dispatching-parallel-agents/SKILL.md +14 -3
package/skills/dispatching-parallel-agents/manifest.json +1 -4
package/skills/drift-analysis/SKILL.md +50 -29
package/skills/drift-analysis/manifest.json +2 -12
package/skills/evaluation/manifest.json +2 -12
package/skills/executing-plans/SKILL.md +15 -8
package/skills/executing-plans/manifest.json +1 -3
package/skills/handoff-protocols/manifest.json +2 -12
package/skills/parallel-investigation/SKILL.md +25 -12
package/skills/parallel-investigation/manifest.json +1 -4
package/skills/reflexion-critique/SKILL.md +21 -10
package/skills/reflexion-critique/manifest.json +1 -3
package/skills/reflexion-reflect/SKILL.md +36 -34
package/skills/reflexion-reflect/manifest.json +2 -10
package/skills/root-cause-analysis/manifest.json +2 -13
package/skills/sadd-judge-with-debate/SKILL.md +50 -26
package/skills/sadd-judge-with-debate/manifest.json +1 -3
package/skills/structured-code-review/manifest.json +2 -11
package/skills/task-decomposition/manifest.json +2 -13
package/skills/verification-before-completion/manifest.json +2 -15
package/skills/verification-gates/SKILL.md +27 -19
package/skills/verification-gates/manifest.json +2 -12
package/agents/advisor.md +0 -57
package/agents/critic.md +0 -127
package/agents/deep-worker.md +0 -65
package/agents/devil.md +0 -36
package/agents/heavy-worker.md +0 -68
package/agents/lead.md +0 -155
package/agents/librarian.md +0 -62
package/agents/qa.md +0 -50
package/agents/quick.md +0 -65
package/agents/scribe.md +0 -78
package/agents/strategist.md +0 -63
package/agents/ui-heavy-worker.md +0 -62
package/agents/ui-worker.md +0 -69
package/agents/validator.md +0 -47
package/defaults/agent-settings.json +0 -102
package/defaults/agent-settings.schema.json +0 -25
package/defaults/flags.json +0 -35
package/defaults/flags.schema.json +0 -119
package/defaults/mcp-defaults.json +0 -47
package/defaults/mcp-defaults.schema.json +0 -38
package/defaults/profiles.json +0 -53
package/defaults/profiles.schema.json +0 -60
package/defaults/team-profiles.json +0 -83
package/src/control-plane.ts +0 -21
package/src/index.ts +0 -8
package/src/opencode-multiagent/compiler.ts +0 -168
package/src/opencode-multiagent/constants.ts +0 -178
package/src/opencode-multiagent/file-lock.ts +0 -90
package/src/opencode-multiagent/hooks.ts +0 -599
package/src/opencode-multiagent/log.ts +0 -12
package/src/opencode-multiagent/mailbox.ts +0 -287
package/src/opencode-multiagent/markdown.ts +0 -99
package/src/opencode-multiagent/mcp.ts +0 -35
package/src/opencode-multiagent/policy.ts +0 -67
package/src/opencode-multiagent/quality.ts +0 -140
package/src/opencode-multiagent/runtime.ts +0 -55
package/src/opencode-multiagent/skills.ts +0 -144
package/src/opencode-multiagent/supervision.ts +0 -156
package/src/opencode-multiagent/task-manager.ts +0 -148
package/src/opencode-multiagent/team-manager.ts +0 -219
package/src/opencode-multiagent/team-tools.ts +0 -359
package/src/opencode-multiagent/telemetry.ts +0 -124
package/src/opencode-multiagent/utils.ts +0 -54

package/skills/handoff-protocols/manifest.json CHANGED Viewed

@@ -2,18 +2,8 @@
   "name": "handoff-protocols",
   "version": "1.0.0",
   "description": "Guidance for safe multi-agent or multi-step handoffs",
-  "triggers": [
-    "handoff",
-    "transfer",
-    "transition",
-    "pass to",
-    "onboard"
-  ],
-  "applicable_agents": [
-    "executor",
-    "planner",
-    "worker"
-  ],
+  "triggers": ["handoff", "transfer", "transition", "pass to", "onboard"],
+  "applicable_agents": ["executor", "planner"],
   "max_context_tokens": 1500,
   "entry_file": "SKILL.md"
 }

package/skills/parallel-investigation/SKILL.md CHANGED Viewed

@@ -16,8 +16,8 @@ tags:
 difficulty: advanced
 estimatedTime: 15
 relatedSkills:
-  - debugging/root-cause-analysis
-  - collaboration/handoff-protocols
+  - root-cause-analysis
+  - handoff-protocols
 ---
 # Parallel Investigation
@@ -58,13 +58,15 @@ Assign threads with clear ownership:
 ```markdown
 ## Thread A: Database Performance
 **Investigator:** [Name/Agent A]
 **Duration:** 30 minutes
 **Scope:**
 - Query execution times
 - Index utilization
 - Connection pool metrics
-**Report Format:** Summary + evidence
+  **Report Format:** Summary + evidence
 ```
 ### Phase 3: Parallel Execution
@@ -77,17 +79,22 @@ Each thread follows this pattern:
 4. Prepare summary for sync point
 **Thread Log Template:**
 ```markdown
 ## Thread: [Name]
 **Start:** [Time]
 ### Findings
 - [Timestamp] [Finding]
 ### Evidence
 - [Log/Metric/Screenshot]
 ### Preliminary Conclusion
 [What this thread suggests about the problem]
 ```
@@ -103,6 +110,7 @@ Sync Point Agenda:
 ```
 **Sync Point Decisions:**
 - **Continue**: Threads are progressing, maintain parallel execution
 - **Pivot**: Redirect threads based on new evidence
 - **Converge**: One thread found the answer, others join to validate
@@ -146,13 +154,13 @@ When a thread identifies the likely cause:
 ## Decision Framework
-| Thread Status | Action |
-|---------------|--------|
-| All exploring | Continue parallel |
-| One hot lead | Validate lead, others support |
-| Multiple leads | Prioritize by evidence strength |
-| All dead ends | Reframe problem, new threads |
-| Confirmed cause | Converge, begin fix |
+| Thread Status   | Action                          |
+| --------------- | ------------------------------- |
+| All exploring   | Continue parallel               |
+| One hot lead    | Validate lead, others support   |
+| Multiple leads  | Prioritize by evidence strength |
+| All dead ends   | Reframe problem, new threads    |
+| Confirmed cause | Converge, begin fix             |
 ## Time Management
@@ -176,31 +184,36 @@ Adjust sync point cadence based on incident severity — every 20 minutes for cr
 # Investigation: [Problem]
 ## Summary
 [Brief description and resolution]
 ## Threads Explored
 ### Thread A: [Area]
 - Investigator: [Name]
 - Findings: [Summary]
 - Outcome: [Lead / Dead End / Root Cause]
 ## Root Cause
 [Detailed explanation of what was found]
 ## Evidence
 - [Evidence 1]
 - [Evidence 2]
 ## Resolution
 [What was done to fix]
 ## Lessons Learned
 - [Learning 1]
 ```
 ## Integration with Other Skills
-- **debugging/root-cause-analysis**: Each thread follows RCA principles
-- **debugging/hypothesis-testing**: Threads test specific hypotheses
+- **root-cause-analysis**: Each thread follows RCA principles
 - **handoff-protocols**: When passing a thread to another person

package/skills/parallel-investigation/manifest.json CHANGED Viewed

@@ -9,10 +9,7 @@
     "incident",
     "simultaneous debug"
   ],
-  "applicable_agents": [
-    "critic",
-    "strategist"
-  ],
+  "applicable_agents": ["planner"],
   "max_context_tokens": 2200,
   "entry_file": "SKILL.md"
 }

package/skills/reflexion-critique/SKILL.md CHANGED Viewed

@@ -187,7 +187,7 @@ Be objective and consider the context of the project (size, team, constraints).
 **Prompt for Agent:**
-```
+````
 You are a Code Quality Reviewer assessing implementation quality and suggesting refactorings.
 ## Your Task
@@ -262,9 +262,9 @@ Project Conventions: {any known conventions from codebase}
    ...
 Provide specific, actionable feedback with code examples.
-```
+````
-**Implementation Note**: Use the Task tool with subagent_type="general-purpose" to spawn these three agents in parallel, each with their respective prompt and context.
+**Implementation Note**: Use the Task tool with subagent_type="general" to spawn these three agents in parallel, each with their respective prompt and context.
 ### Phase 3: Cross-Review & Debate
@@ -294,6 +294,7 @@ Compile all findings into a comprehensive, actionable report:
 # 🔍 Work Critique Report
 ## Executive Summary
 [2-3 sentences summarizing overall assessment]
 **Overall Quality Score**: X/10 (average of three judge scores)
@@ -302,11 +303,11 @@ Compile all findings into a comprehensive, actionable report:
 ## 📊 Judge Scores
-| Judge | Score | Key Finding |
-|-------|-------|-------------|
-| Requirements Validator | X/10 | [one-line summary] |
-| Solution Architect | X/10 | [one-line summary] |
-| Code Quality Reviewer | X/10 | [one-line summary] |
+| Judge                  | Score | Key Finding        |
+| ---------------------- | ----- | ------------------ |
+| Requirements Validator | X/10  | [one-line summary] |
+| Solution Architect     | X/10  | [one-line summary] |
+| Code Quality Reviewer  | X/10  | [one-line summary] |
 ---
@@ -323,6 +324,7 @@ Compile all findings into a comprehensive, actionable report:
 ## ⚠️ Issues & Gaps
 ### Critical Issues
 [Issues that need immediate attention]
 - **[Issue 1]**
@@ -332,12 +334,15 @@ Compile all findings into a comprehensive, actionable report:
   - Recommendation: [what to do]
 ### High Priority
 [Important but not blocking]
 ### Medium Priority
 [Nice to have improvements]
 ### Low Priority
 [Minor polish items]
 ---
@@ -360,6 +365,7 @@ Compile all findings into a comprehensive, actionable report:
 **Chosen Approach**: [brief description]
 **Alternative Approaches Considered**:
 1. [Alternative 1] - [Why chosen approach is better/worse]
 2. [Alternative 2] - [Why chosen approach is better/worse]
@@ -379,6 +385,7 @@ Compile all findings into a comprehensive, actionable report:
    - Before/After: [code examples]
 ### Medium Priority Refactorings
 [similar structure]
 ---
@@ -397,6 +404,7 @@ Compile all findings into a comprehensive, actionable report:
 [If applicable - where judges disagreed]
 **Debate 1: [Topic]**
 - Requirements Validator position: [summary]
 - Solution Architect position: [summary]
 - Resolution: [consensus reached or "reasonable disagreement"]
@@ -408,14 +416,17 @@ Compile all findings into a comprehensive, actionable report:
 Based on the critique, here are recommended next steps:
 **Must Do**:
 - [ ] [Critical action 1]
 - [ ] [Critical action 2]
 **Should Do**:
 - [ ] [High priority action 1]
 - [ ] [High priority action 2]
 **Could Do**:
 - [ ] [Medium priority action 1]
 - [ ] [Nice to have action 2]
@@ -438,8 +449,8 @@ Based on the critique, here are recommended next steps:
 ---
-*Generated using Multi-Agent Debate + LLM-as-a-Judge pattern*
-*Review Date: [timestamp]*
+_Generated using Multi-Agent Debate + LLM-as-a-Judge pattern_
+_Review Date: [timestamp]_
 ```
 ## Important Guidelines

package/skills/reflexion-critique/manifest.json CHANGED Viewed

@@ -9,9 +9,7 @@
     "judge with debate",
     "consensus"
   ],
-  "applicable_agents": [
-    "critic"
-  ],
+  "applicable_agents": ["planner"],
   "max_context_tokens": 2400,
   "entry_file": "SKILL.md"
 }

package/skills/reflexion-reflect/SKILL.md CHANGED Viewed

@@ -1,12 +1,12 @@
 ---
 name: reflexion-reflect
-description: Reflect on previus response and output, based on Self-refinement framework for iterative improvement with complexity triage and verification
+description: Reflect on previous response and output, based on Self-refinement framework for iterative improvement with complexity triage and verification
 argument-hint: Optional focus area or confidence threshold to use, for example "security" or "deep reflect if less than 90% confidence"
 ---
 # Self-Refinement and Iterative Improvement Framework
-Reflect on previus response and output.
+Reflect on previous response and output.
 ## Your Identity (NON-NEGOTIABLE)
@@ -82,15 +82,13 @@ Before proceeding, evaluate your most recent output against these criteria:
    - [ ] Are there edge cases that haven't been considered?
    - [ ] Could there be unintended side effects?
-4. **Dependency & Impact Verification**
+4. **Dependency & Impact Verification**
    - [ ] For ANY proposed addition/deletion/modification, have you checked for dependencies?
    - [ ] Have you searched for related decisions that may be superseded or supersede this?
    - [ ] Have you checked the configuration or docs (for example AUTHORITATIVE.yaml) for active evaluations or status?
    - [ ] Have you searched the ecosystem for files/processes that depend on items being changed?
    - [ ] If recommending removal of anything, have you verified nothing depends on it?
    **HARD RULE:** If ANY check reveals active dependencies, evaluations, or pending decisions, FLAG THIS IN THE EVALUATION. Do not approve work that recommends changes without dependency verification.
 5. **Fact-Checking Required**
@@ -202,7 +200,7 @@ When the output involves code, additionally evaluate:
 // utils/dateFormatter.js
 function formatDate(date) {
   const d = new Date(date);
-  return `${d.getMonth()+1}/${d.getDate()}/${d.getFullYear()}`;
+  return `${d.getMonth() + 1}/${d.getDate()}/${d.getFullYear()}`;
 }
 ```
@@ -366,7 +364,7 @@ const formatted = format(new Date(), 'MM/dd/yyyy');
 1. Search for benchmark or documentation comparing both approaches
 2. Provide algorithmic analysis
-**Corrected Statement**: "Map performs better for large collections (10K+ items), while Object is more efficient for small sets (<100 items)"
+   **Corrected Statement**: "Map performs better for large collections (10K+ items), while Object is more efficient for small sets (<100 items)"
 ## NON-CODE OUTPUT REFLECTION
@@ -405,31 +403,35 @@ For documentation, explanations, and analysis outputs:
 ## Detailed Analysis
 ### [Criterion 1 Name] (Weight: 0.XX)
 **Practical Check**: [If applicable - what you verified with tools]
 **Analysis**: [Explain how evidence maps to rubric level]
 **Score**: X/5
 **Improvement**: [Specific suggestion if score < 5]
 #### Evidences
 [Specific quotes/references]
 ### [Criterion 2 Name] (Weight: 0.XX)
 [Repeat pattern...]
 ## Score Summary
-| Criterion | Score | Weight | Weighted |
-|-----------|-------|--------|----------|
-| Instruction Following | X/5 | 0.30 | X.XX |
-| Output Completeness | X/5 | 0.25 | X.XX |
-| Solution Quality | X/5 | 0.25 | X.XX |
-| Reasoning Quality | X/5 | 0.10 | X.XX |
-| Response Coherence | X/5 | 0.10 | X.XX |
-| **Weighted Total** | | | **X.XX/5.0** |
+| Criterion             | Score | Weight | Weighted     |
+| --------------------- | ----- | ------ | ------------ |
+| Instruction Following | X/5   | 0.30   | X.XX         |
+| Output Completeness   | X/5   | 0.25   | X.XX         |
+| Solution Quality      | X/5   | 0.25   | X.XX         |
+| Reasoning Quality     | X/5   | 0.10   | X.XX         |
+| Response Coherence    | X/5   | 0.10   | X.XX         |
+| **Weighted Total**    |       |        | **X.XX/5.0** |
 ## Self-Verification
 **Questions Asked**:
 1. [Question 1]
 2. [Question 2]
 3. [Question 3]
@@ -437,6 +439,7 @@ For documentation, explanations, and analysis outputs:
 5. [Question 5]
 **Answers**:
 1. [Answer 1]
 2. [Answer 2]
 3. [Answer 3]
@@ -448,28 +451,27 @@ For documentation, explanations, and analysis outputs:
 ## Confidence Assessment
 **Confidence Factors**:
 - Evidence strength: [Strong / Moderate / Weak]
 - Criterion clarity: [Clear / Ambiguous]
 - Edge cases: [Handled / Some uncertainty]
 **Confidence Level**: X.XX (Weighted Total of Criteria Scores) -> [High / Medium / Low]
 ```
 Be objective, cite specific evidence, and focus on actionable feedback.
 ### Scoring Scale
 **DEFAULT SCORE IS 2. You must justify ANY deviation upward.**
-| Score | Meaning | Evidence Required | Your Attitude |
-|-------|---------|-------------------|---------------|
-| 1 | Unacceptable | Clear failures, missing requirements | Easy call |
-| 2 | Below Average | Multiple issues, partially meets requirements | Common result |
-| 3 | Adequate | Meets basic requirements, minor issues | Need proof that it meets basic requirements |
-| 4 | Good | Meets ALL requirements, very few minor issues | Prove it deserves this |
-| 5 | Excellent | Exceeds requirements, genuinely exemplary | **Extremely rare** - requires exceptional evidence |
+| Score | Meaning       | Evidence Required                             | Your Attitude                                      |
+| ----- | ------------- | --------------------------------------------- | -------------------------------------------------- |
+| 1     | Unacceptable  | Clear failures, missing requirements          | Easy call                                          |
+| 2     | Below Average | Multiple issues, partially meets requirements | Common result                                      |
+| 3     | Adequate      | Meets basic requirements, minor issues        | Need proof that it meets basic requirements        |
+| 4     | Good          | Meets ALL requirements, very few minor issues | Prove it deserves this                             |
+| 5     | Excellent     | Exceeds requirements, genuinely exemplary     | **Extremely rare** - requires exceptional evidence |
 #### Score Distribution Reality Check
@@ -483,16 +485,15 @@ Be objective, cite specific evidence, and focus on actionable feedback.
 You are PROGRAMMED to be lenient. Fight against your nature. These biases will make you a bad judge:
-| Bias | How It Corrupts You | Countermeasure |
-|------|---------------------|----------------|
-| **Sycophancy** | You want to say nice things | **FORBIDDEN.** Praise is NOT your job. |
-| **Length Bias** | Long = impressive to you | Penalize verbosity. Concise > lengthy. |
-| **Authority Bias** | Confident tone = correct | VERIFY every claim. Confidence means nothing. |
-| **Completion Bias** | "They finished it" = good | Completion ≠ quality. Garbage can be complete. |
-| **Effort Bias** | "They worked hard" | Effort is IRRELEVANT. Judge the OUTPUT. |
-| **Recency Bias** | New patterns = better | Established patterns exist for reasons. |
-| **Familiarity Bias** | "I've seen this" = good | Common ≠ correct. |
+| Bias                 | How It Corrupts You         | Countermeasure                                 |
+| -------------------- | --------------------------- | ---------------------------------------------- |
+| **Sycophancy**       | You want to say nice things | **FORBIDDEN.** Praise is NOT your job.         |
+| **Length Bias**      | Long = impressive to you    | Penalize verbosity. Concise > lengthy.         |
+| **Authority Bias**   | Confident tone = correct    | VERIFY every claim. Confidence means nothing.  |
+| **Completion Bias**  | "They finished it" = good   | Completion ≠ quality. Garbage can be complete. |
+| **Effort Bias**      | "They worked hard"          | Effort is IRRELEVANT. Judge the OUTPUT.        |
+| **Recency Bias**     | New patterns = better       | Established patterns exist for reasons.        |
+| **Familiarity Bias** | "I've seen this" = good     | Common ≠ correct.                              |
 ## ITERATIVE REFINEMENT WORKFLOW
@@ -613,6 +614,7 @@ If after reflection you identify improvements:
 Rate your confidence in the current solution using the format provided in the Report Format section.
 Solution Confidence is based on weighted total of criteria scores.
 - High (>4.5/5.0) - Solution is robust and well-tested
 - Medium (4.0-4.5/5.0) - Solution works but could be improved
 - Low (<4.0/5.0) - Significant improvements needed

package/skills/reflexion-reflect/manifest.json CHANGED Viewed

@@ -2,16 +2,8 @@
   "name": "reflexion-reflect",
   "version": "1.0.0",
   "description": "Self-reflection workflow for iterating on previous outputs and plans",
-  "triggers": [
-    "reflect",
-    "self refine",
-    "iterate",
-    "improve previous answer",
-    "reflection"
-  ],
-  "applicable_agents": [
-    "critic"
-  ],
+  "triggers": ["reflect", "self refine", "iterate", "improve previous answer", "reflection"],
+  "applicable_agents": ["planner"],
   "max_context_tokens": 2400,
   "entry_file": "SKILL.md"
 }

package/skills/root-cause-analysis/manifest.json CHANGED Viewed

@@ -2,19 +2,8 @@
   "name": "root-cause-analysis",
   "version": "1.0.0",
   "description": "Trace failures to the real cause before changing code",
-  "triggers": [
-    "debug",
-    "error",
-    "fix",
-    "issue",
-    "root cause",
-    "investigate"
-  ],
-  "applicable_agents": [
-    "worker",
-    "heavy-worker",
-    "deep-worker"
-  ],
+  "triggers": ["debug", "error", "fix", "issue", "root cause", "investigate"],
+  "applicable_agents": ["sec-coder", "planner", "scout"],
   "max_context_tokens": 1500,
   "entry_file": "SKILL.md"
 }