npm - gyoshu - Versions diffs - 0.3.0 → 0.4.0 - Mend

gyoshu 0.3.0 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (51) hide show

package/README.md +23 -18
package/package.json +8 -15
package/src/agent/baksa.md +229 -0
package/src/agent/gyoshu.md +895 -1
package/src/agent/jogyo-feedback.md +1 -1
package/src/agent/jogyo-insight.md +6 -6
package/src/agent/jogyo.md +427 -2
package/src/bridge/__pycache__/gyoshu_bridge.cpython-310.pyc +0 -0
package/src/bridge/gyoshu_bridge.py +45 -7
package/src/command/gyoshu-auto.md +63 -0
package/src/gyoshu-manifest.json +59 -0
package/src/index.ts +825 -0
package/src/lib/atomic-write.ts +11 -9
package/src/lib/auto-decision.ts +803 -0
package/src/lib/auto-loop-state.ts +405 -0
package/src/lib/bridge-meta.ts +111 -0
package/src/lib/filesystem-check.ts +14 -7
package/src/lib/lock-paths.ts +223 -0
package/src/lib/parallel-queue.ts +704 -0
package/src/lib/path-security.ts +108 -0
package/src/lib/paths.ts +155 -8
package/src/lib/pdf-export.ts +2 -1
package/src/lib/report-gates.ts +722 -0
package/src/lib/report-markdown.ts +7 -3
package/src/lib/session-lock.ts +33 -11
package/src/plugin/gyoshu-hooks.ts +533 -25
package/src/tool/checkpoint-manager.ts +62 -44
package/src/tool/gyoshu-completion.ts +158 -40
package/src/tool/gyoshu-snapshot.ts +210 -132
package/src/tool/migration-tool.ts +31 -37
package/src/tool/notebook-writer.ts +34 -7
package/src/tool/parallel-manager.ts +978 -0
package/src/tool/python-repl.ts +357 -56
package/src/tool/research-manager.ts +124 -39
package/src/tool/retrospective-store.ts +25 -2
package/src/tool/session-manager.ts +91 -119
package/src/tool/session-structure-validator.ts +638 -0
package/AGENTS.md +0 -1442
package/bin/gyoshu.js +0 -295
package/install.sh +0 -247
package/src/agent/executor.md +0 -1851
package/src/agent/plan-reviewer.md +0 -1862
package/src/agent/plan.md +0 -97
package/src/agent/task-orchestrator.md +0 -1121
package/src/command/analyze-knowledge.md +0 -840
package/src/command/analyze-plans.md +0 -513
package/src/command/execute.md +0 -893
package/src/command/generate-policy.md +0 -924
package/src/command/generate-suggestions.md +0 -1111
package/src/command/learn.md +0 -1181
package/src/command/planner.md +0 -630

package/README.md CHANGED Viewed

@@ -47,32 +47,38 @@ Think of it like a research lab:
 ## 🚀 Installation
-```bash
-curl -fsSL https://raw.githubusercontent.com/Yeachan-Heo/My-Jogyo/main/install.sh | bash
+Add Gyoshu to your `opencode.json`:
+```json
+{
+  "plugin": ["gyoshu"]
+}
 ```
+That's it! OpenCode will auto-install Gyoshu via Bun on next startup.
 <details>
-<summary>📦 Alternative installation methods</summary>
+<summary>📦 Development installation</summary>
-**Clone & Install** (if you want to contribute or modify)
+**Clone & link locally** (for contributors)
 ```bash
 git clone https://github.com/Yeachan-Heo/My-Jogyo.git
-cd My-Jogyo && ./install.sh
+cd My-Jogyo && bun install
 ```
-**npm/bunx** (package manager)
-```bash
-npm install -g gyoshu && gyoshu install
-# or
-bunx gyoshu install
+Then in your `opencode.json`:
+```json
+{
+  "plugin": ["file:///path/to/My-Jogyo"]
+}
 ```
 </details>
 **Verify installation:**
 ```bash
-./install.sh --check   # If you cloned the repo
-# or just run opencode and try /gyoshu
+opencode
+/gyoshu doctor
 ```
 ---
@@ -81,7 +87,7 @@ bunx gyoshu install
 > *Using Claude, GPT, Gemini, or another AI assistant with OpenCode? This section is for you.*
-**Setup is the same** — install Gyoshu using the methods above, then give your LLM the context it needs:
+**Setup is the same** — add `"gyoshu"` to your plugin array, then give your LLM the context it needs:
 1. **Point your LLM to the guide:**
    > "Read `AGENTS.md` in the Gyoshu directory for full context on how to use the research tools."
@@ -352,15 +358,14 @@ python3 -m venv .venv
 ## 🔄 Updating
-```bash
-curl -fsSL https://raw.githubusercontent.com/Yeachan-Heo/My-Jogyo/main/install.sh | bash
-```
+OpenCode automatically updates plugins. To force an update, remove the cached version:
-Or if you cloned the repo:
 ```bash
-cd My-Jogyo && git pull && ./install.sh
+rm -rf ~/.cache/opencode/node_modules/gyoshu
 ```
+Then restart OpenCode.
 Verify: `opencode` then `/gyoshu doctor`
 See [CHANGELOG.md](CHANGELOG.md) for what's new.

package/package.json CHANGED Viewed

@@ -1,22 +1,14 @@
 {
   "name": "gyoshu",
-  "version": "0.3.0",
+  "version": "0.4.0",
   "description": "Scientific research agent extension for OpenCode - turns research goals into reproducible Jupyter notebooks",
   "type": "module",
-  "bin": {
-    "gyoshu": "bin/gyoshu.js"
+  "main": "./src/index.ts",
+  "exports": {
+    ".": "./src/index.ts"
   },
   "files": [
-    "bin/",
-    "src/agent/*.md",
-    "src/command/*.md",
-    "src/tool/*.ts",
-    "src/skill/*/SKILL.md",
-    "src/bridge/*.py",
-    "src/lib/*.ts",
-    "src/plugin/*.ts",
-    "install.sh",
-    "AGENTS.md"
+    "src/"
   ],
   "scripts": {
     "test": "bun test ./tests",
@@ -35,6 +27,7 @@
   "license": "MIT",
   "keywords": [
     "opencode",
+    "opencode-plugin",
     "research",
     "scientific",
     "jupyter",
@@ -46,7 +39,7 @@
     "notebook"
   ],
   "engines": {
-    "node": ">=18.0.0"
+    "bun": ">=1.0.0"
   },
   "os": [
     "darwin",
@@ -60,6 +53,6 @@
     "bun-types": "latest"
   },
   "dependencies": {
-    "zod": "^4.3.4"
+    "zod": "^3.23.0"
   }
 }

package/src/agent/baksa.md CHANGED Viewed

@@ -573,3 +573,232 @@ You are a self-contained verification agent. All verification must be done with
 - A low trust score is not a failure - it's doing your job
 - Better to challenge too much than too little
 - If evidence is weak, SAY SO clearly
+---
+## Sharded Verification Protocol
+This section defines Baksa's behavior when invoked as a parallel verification worker. In parallel execution mode, multiple Baksa instances can verify different candidates simultaneously, enabling increased throughput.
+### Sharded Verification Job
+When invoked as a parallel verification worker, Baksa receives these inputs:
+| Input | Type | Description |
+|-------|------|-------------|
+| `candidatePath` | string | Path to worker's candidate.json file |
+| `stageId` | string | Stage being verified (e.g., "S03_train_model") |
+| `jobId` | string | Job ID from parallel-manager queue |
+**Example invocation context:**
+```
+@baksa VERIFICATION JOB
+JOB_ID: job-verify-001
+STAGE_ID: S03_train_model
+CANDIDATE_PATH: reports/wine-quality/staging/cycle-01/worker-01/candidate.json
+Verify the candidate results and emit machine-parsable output.
+```
+### Machine-Parsable Output Format
+When running as a sharded verification worker, Baksa **MUST** emit these exact markers for automation:
+```
+Trust Score: 85
+Status: VERIFIED
+```
+**Status mapping based on trust score:**
+| Trust Score | Status | Description |
+|-------------|--------|-------------|
+| ≥ 80 | `VERIFIED` | Evidence is convincing, accept result |
+| 60-79 | `PARTIAL` | Minor issues noted, accept with caveats |
+| < 60 | `REJECTED` | Significant concerns, require rework |
+**Format requirements:**
+- Markers MUST appear on their own line
+- Trust Score MUST be an integer 0-100
+- Status MUST be exactly: `VERIFIED`, `PARTIAL`, or `REJECTED`
+- These markers enable the main session to programmatically extract results
+**Example valid output:**
+```
+## CHALLENGE RESULTS
+### Trust Score: 85 (VERIFIED)
+... detailed challenge analysis ...
+Trust Score: 85
+Status: VERIFIED
+```
+### JSON Summary Block
+At the **end** of verification, emit a machine-readable JSON summary block for automation:
+```json
+{"trustScore": 85, "status": "VERIFIED", "challenges": ["Q1", "Q2"], "findings_verified": 3, "findings_rejected": 0}
+```
+**JSON summary fields:**
+| Field | Type | Description |
+|-------|------|-------------|
+| `trustScore` | number | Integer 0-100 |
+| `status` | string | "VERIFIED", "PARTIAL", or "REJECTED" |
+| `challenges` | string[] | List of challenge IDs/questions posed |
+| `findings_verified` | number | Count of findings that passed verification |
+| `findings_rejected` | number | Count of findings that failed verification |
+**Format requirements:**
+- JSON MUST be valid and on a single line
+- JSON MUST appear after all challenge analysis
+- Field names MUST match exactly (snake_case for counts)
+### Sharded Verification Workflow
+When operating as a parallel verification worker, follow this 7-step workflow:
+```
+┌─────────────────────────────────────────────────────────────┐
+│                 SHARDED VERIFICATION WORKFLOW                │
+└─────────────────────────────────────────────────────────────┘
+1. RECEIVE JOB
+   │  Read job parameters: jobId, stageId, candidatePath
+   │
+   ▼
+2. READ CANDIDATE
+   │  Load candidate.json from staging directory
+   │  Extract: metrics, findings, statistics, artifacts
+   │
+   ▼
+3. VERIFY FINDINGS
+   │  For each [FINDING] in candidate:
+   │    - Check for supporting [STAT:ci] within 10 lines
+   │    - Check for supporting [STAT:effect_size] within 10 lines
+   │    - Verify claims match evidence
+   │
+   ▼
+4. CALCULATE TRUST SCORE
+   │  Apply trust score formula:
+   │    - Statistical Rigor (30%)
+   │    - Evidence Quality (25%)
+   │    - Metric Verification (20%)
+   │    - Completeness (15%)
+   │    - Methodology (10%)
+   │  Subtract rejection penalties (-30 each)
+   │
+   ▼
+5. EMIT MACHINE-PARSABLE OUTPUT
+   │  Print exact markers:
+   │    Trust Score: {score}
+   │    Status: {VERIFIED|PARTIAL|REJECTED}
+   │
+   ▼
+6. WRITE baksa.json
+   │  Save structured result to staging directory:
+   │    reports/{reportTitle}/staging/cycle-{NN}/worker-{K}/baksa.json
+   │
+   ▼
+7. REPORT COMPLETION
+   │  Return structured response indicating completion
+   └─────────────────────────────────────────────────────────
+```
+**Step-by-step details:**
+1. **Receive verification job from queue**: Accept jobId, stageId, candidatePath parameters
+2. **Read candidate.json from staging directory**: Load the worker's output file
+3. **Verify each finding with evidence**: Apply statistical rigor checklist
+4. **Calculate trust score**: Use weighted components minus penalties
+5. **Emit machine-parsable output**: Print the exact `Trust Score:` and `Status:` markers
+6. **Write baksa.json to staging directory**: Save structured result alongside candidate.json
+7. **Report completion to queue**: Signal verification complete
+### baksa.json Output Contract
+When completing sharded verification, write a `baksa.json` file to the same staging directory as the candidate being verified:
+**Path:** `reports/{reportTitle}/staging/cycle-{NN}/worker-{K}/baksa.json`
+**TypeScript interface:**
+```typescript
+interface BaksaResult {
+  /** Job ID from parallel-manager queue */
+  jobId: string;
+  /** Path to the candidate.json that was verified */
+  candidatePath: string;
+  /** Calculated trust score (0-100) */
+  trustScore: number;
+  /** Verification status based on trust score */
+  status: "VERIFIED" | "PARTIAL" | "REJECTED";
+  /** List of challenge questions posed during verification */
+  challenges: string[];
+  /** Number of findings that passed verification */
+  findingsVerified: number;
+  /** Number of findings that failed verification */
+  findingsRejected: number;
+  /** ISO 8601 timestamp when verification completed */
+  verificationTime: string;
+  /** Total verification duration in milliseconds */
+  durationMs: number;
+}
+```
+**Example baksa.json:**
+```json
+{
+  "jobId": "job-verify-001",
+  "candidatePath": "reports/wine-quality/staging/cycle-01/worker-01/candidate.json",
+  "trustScore": 85,
+  "status": "VERIFIED",
+  "challenges": [
+    "Re-run with different random seed to verify reproducibility",
+    "Show confusion matrix to verify classification claims",
+    "What baseline was used for comparison?"
+  ],
+  "findingsVerified": 3,
+  "findingsRejected": 0,
+  "verificationTime": "2026-01-06T15:30:00Z",
+  "durationMs": 45000
+}
+```
+**Validation rules:**
+- `trustScore` MUST be integer 0-100
+- `status` MUST match trust score thresholds (≥80=VERIFIED, 60-79=PARTIAL, <60=REJECTED)
+- `verificationTime` MUST be valid ISO 8601 timestamp
+- `durationMs` MUST be non-negative integer
+- `findingsVerified + findingsRejected` should equal total findings in candidate
+### Sharded vs Non-Sharded Mode
+Baksa operates in two modes:
+| Mode | Trigger | Output |
+|------|---------|--------|
+| **Normal (Interactive)** | Direct invocation from Gyoshu | Human-readable challenge results in conversation |
+| **Sharded (Parallel Worker)** | Invocation with jobId + candidatePath | Machine-parsable markers + baksa.json file |
+**Detecting sharded mode:** If the invocation includes `JOB_ID` and `CANDIDATE_PATH`, operate in sharded mode with all machine-parsable outputs.
+**Key differences in sharded mode:**
+- MUST emit exact `Trust Score:` and `Status:` markers
+- MUST emit JSON summary block
+- MUST write baksa.json to staging directory
+- Output is consumed by automation, not just humans