npm - @yeyuan98/opencode-bioresearcher-plugin - Versions diffs - 1.3.1-alpha.1 → 1.4.0 - Mend

@yeyuan98/opencode-bioresearcher-plugin 1.3.1-alpha.1 → 1.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (36) hide show

package/README.md +14 -0
package/dist/index.js +4 -1
package/dist/misc-tools/index.d.ts +3 -0
package/dist/misc-tools/index.js +3 -0
package/dist/misc-tools/json-extract.d.ts +13 -0
package/dist/misc-tools/json-extract.js +394 -0
package/dist/misc-tools/json-infer.d.ts +13 -0
package/dist/misc-tools/json-infer.js +199 -0
package/dist/misc-tools/json-tools.d.ts +33 -0
package/dist/misc-tools/json-tools.js +187 -0
package/dist/misc-tools/json-validate.d.ts +13 -0
package/dist/misc-tools/json-validate.js +228 -0
package/dist/skills/bioresearcher-core/README.md +210 -0
package/dist/skills/bioresearcher-core/SKILL.md +128 -0
package/dist/skills/bioresearcher-core/examples/contexts.json +29 -0
package/dist/skills/bioresearcher-core/examples/data-exchange-example.md +303 -0
package/dist/skills/bioresearcher-core/examples/template.md +49 -0
package/dist/skills/bioresearcher-core/patterns/calculator.md +215 -0
package/dist/skills/bioresearcher-core/patterns/data-exchange.md +406 -0
package/dist/skills/bioresearcher-core/patterns/json-tools.md +263 -0
package/dist/skills/bioresearcher-core/patterns/progress.md +127 -0
package/dist/skills/bioresearcher-core/patterns/retry.md +110 -0
package/dist/skills/bioresearcher-core/patterns/shell-commands.md +79 -0
package/dist/skills/bioresearcher-core/patterns/subagent-waves.md +186 -0
package/dist/skills/bioresearcher-core/patterns/table-tools.md +260 -0
package/dist/skills/bioresearcher-core/patterns/user-confirmation.md +187 -0
package/dist/skills/bioresearcher-core/python/template.md +273 -0
package/dist/skills/bioresearcher-core/python/template.py +323 -0
package/dist/skills/long-table-summary/SKILL.md +437 -0
package/dist/skills/long-table-summary/combine_outputs.py +336 -0
package/dist/skills/long-table-summary/generate_prompts.py +211 -0
package/dist/skills/long-table-summary/pyproject.toml +8 -0
package/dist/skills/pubmed-weekly/SKILL.md +329 -329
package/dist/skills/pubmed-weekly/pubmed_weekly.py +411 -411
package/dist/skills/pubmed-weekly/pyproject.toml +8 -8
package/package.json +7 -2

package/dist/skills/bioresearcher-core/patterns/calculator.md ADDED Viewed

@@ -0,0 +1,215 @@
+# Calculator Pattern
+In-workflow calculations using the calculator tool.
+## Overview
+Use the calculator tool for arithmetic operations in workflows. It's more reliable than manual calculations and provides consistent precision.
+## Tool: calculator
+```
+calculator(formula: string, precision: number = 3)
+```
+### Parameters
+- `formula`: Mathematical expression (string)
+- `precision`: Decimal places for result (0-15, default 3)
+### Return Format
+```json
+{
+  "formula": "(45 / 100) * 100",
+  "result": 45
+}
+```
+### Supported Operations
+| Operation | Symbol | Example |
+|-----------|--------|---------|
+| Addition | + | `2 + 3` |
+| Subtraction | - | `5 - 2` |
+| Multiplication | * | `3 * 4` |
+| Division | / | `10 / 2` |
+| Power | ^ | `2 ^ 3` |
+| Brackets | () | `(2 + 3) * 4` |
+| Scientific | e/E | `1e5`, `1.5e-3` |
+### Important Rules
+1. **MUST use explicit * for multiplication**: `2*(3)` NOT `2(3)`
+2. **Maximum precision**: 15 decimal places
+3. **Default precision**: 3 decimal places
+4. **No functions**: ceil, floor, sqrt not supported
+## Common Use Cases
+### Batch Calculations
+```
+# Calculate number of batches needed
+calculator(formula="ceil(100 / 30)", precision=0)
+# Note: ceil not supported, use workaround:
+calculator(formula="(100 + 30 - 1) / 30", precision=0)
+# Result: 4.333 -> Use ceiling logic in agent
+```
+### Progress Percentages
+```
+# Calculate completion percentage
+calculator(formula="(45 / 100) * 100", precision=1)
+# Result: 45
+# With variables in workflow
+completed = 67
+total = 120
+calculator(formula="({completed} / {total}) * 100", precision=1)
+# Result: 55.8
+```
+### Time Estimates
+```
+# Estimate remaining time
+remaining_items = 50
+items_per_minute = 10
+calculator(formula="50 / 10", precision=0)
+# Result: 5 minutes
+```
+### Data Size Calculations
+```
+# Calculate total rows across batches
+batch_size = 30
+num_batches = 4
+calculator(formula="30 * 4", precision=0)
+# Result: 120
+```
+## Ceiling/Floor Workarounds
+Since ceil/floor are not supported:
+### Ceiling
+```
+# ceil(a / b) = (a + b - 1) / b (for positive integers)
+# ceil(100 / 30)
+calculator(formula="(100 + 30 - 1) / 30", precision=0)
+# Result: 4.333 -> Agent interprets as 5
+```
+### Floor
+```
+# floor(a / b) = a / b (truncate decimal)
+# floor(100 / 30)
+calculator(formula="100 / 30", precision=0)
+# Result: 3.333 -> Agent interprets as 3
+```
+## Example Workflow
+### Calculate Batch Configuration
+```
+# Given
+total_rows = 250
+batch_size = 30
+# Calculate batches needed
+# ceil(250 / 30) = 9 batches
+batches_needed = calculator(
+    formula="(250 + 30 - 1) / 30",
+    precision=0
+)
+# Result: 9.3 -> Agent rounds up to 10
+# Calculate rows in last batch
+# 250 - (9 * 30) = 250 - 270 = -20 -> use 30
+# Actually: 250 - (8 * 30) = 250 - 240 = 10
+last_batch_rows = calculator(
+    formula="250 - (8 * 30)",
+    precision=0
+)
+# Result: 10
+```
+### Progress Tracking
+```
+# During batch processing
+completed = 0
+total = 10
+for batch in batches:
+    process(batch)
+    completed += 1
+    # Calculate and report progress
+    percent = calculator(
+        formula="({completed} / {total}) * 100",
+        precision=0
+    )
+    report(f"Progress: {completed}/{total} ({percent}%)")
+```
+### Wave Timing
+```
+# Calculate expected completion time
+waves_remaining = 3
+seconds_per_wave = 45
+estimated_seconds = calculator(
+    formula="3 * 45",
+    precision=0
+)
+# Result: 135 seconds
+# Convert to minutes
+estimated_minutes = calculator(
+    formula="135 / 60",
+    precision=1
+)
+# Result: 2.3 minutes
+```
+## Error Handling
+### Invalid Formula
+```
+# Missing explicit multiplication
+calculator(formula="2(3)")
+# Error: "CALCULATOR ERROR: Invalid syntax: parentheses-less multiplication not allowed"
+```
+### Division by Zero
+```
+calculator(formula="10 / 0")
+# Error: "CALCULATOR ERROR: Division by zero"
+```
+### Invalid Characters
+```
+calculator(formula="2 + abc")
+# Error: "CALCULATOR ERROR: Formula contains invalid characters: a b c"
+```
+## Integration with Other Patterns
+| Pattern | Calculator Usage |
+|---------|-----------------|
+| `progress.md` | Calculate percentages |
+| `retry.md` | Calculate backoff delays |
+| `subagent-waves.md` | Calculate wave counts |
+| `table-tools.md` | Calculate row counts |
+## Best Practices
+1. **Use precision=0 for counts**: Integer results
+2. **Use precision=1 for percentages**: One decimal place
+3. **Always use explicit ***: Never implicit multiplication
+4. **Check for division by zero**: Validate divisors
+5. **Document formulas**: Explain what calculation does

package/dist/skills/bioresearcher-core/patterns/data-exchange.md ADDED Viewed

@@ -0,0 +1,406 @@
+# Data Exchange Pattern
+Standardized protocol for data exchange between main agent and subagents.
+## Overview
+This pattern ensures reliable communication between main agent and subagents using:
+- File-based prompts with embedded schemas
+- JSON output files with validation
+- Schema-first design for type safety
+## Data Exchange Protocol
+```
+Main Agent                          Subagent
+    |                                   |
+    |--- Write prompt file ----------->|
+    |   (with embedded schema)          |
+    |                                   |
+    |                                   |--- Read prompt
+    |                                   |--- Process data
+    |                                   |--- Write output JSON
+    |                                   |
+    |<-- Write output file -------------|
+    |   (JSON matching schema)          |
+    |                                   |
+    |--- jsonExtract ------------------>|
+    |--- jsonValidate ----------------->|
+    |--- Process validated data         |
+```
+## Main Agent Responsibilities
+### 1. Create Prompt File with Embedded Schema
+Include the output schema directly in the prompt:
+```markdown
+# Task Description
+Process the data and output results.
+## Output Format
+Your output must be valid JSON matching this schema:
+```json
+{
+  "batch_number": <integer>,
+  "row_count": <integer>,
+  "summaries": [
+    {
+      "row_number": <integer>,
+      "field1": "<string>",
+      "field2": "<string>"
+    }
+  ]
+}
+```
+Write your output to: {output_file}
+**CRITICAL:** Write ONLY the JSON object, no additional text.
+```
+### 2. Launch Subagent with File Reference
+```
+task(
+  subagent_type="general",
+  description="Process batch 001",
+  prompt="Read your prompt from ./prompts/batch001.md and perform the task exactly as written."
+)
+```
+### 3. Validate Subagent Output
+```python
+# Extract JSON from output file
+result = jsonExtract(file_path="./outputs/batch001.md")
+if not result.success:
+    log_error(f"Failed to extract JSON")
+    handle_failure()
+# Validate against expected schema
+validation = jsonValidate(
+    data=json.dumps(result.data),
+    schema=expected_schema
+)
+if not validation.valid:
+    log_error(f"Validation failed: {validation.errors}")
+    handle_failure()
+# Process validated data
+process(result.data)
+```
+## Subagent Responsibilities
+### 1. Read Prompt File
+The subagent reads the prompt file to understand:
+- Task description
+- Input data location
+- Output format (schema)
+- Output file path
+### 2. Process and Generate Output
+The subagent:
+- Reads input data using available tools
+- Processes according to instructions
+- Generates output matching the schema exactly
+### 3. Write Output File
+Write ONLY valid JSON to the specified output file:
+```json
+{
+  "batch_number": 1,
+  "row_count": 30,
+  "summaries": [
+    {"row_number": 2, "field1": "value1", "field2": "value2"},
+    {"row_number": 3, "field1": "value3", "field2": "value4"}
+  ]
+}
+```
+## Schema Definition Guidelines
+### Basic Schema Example
+```json
+{
+  "batch_number": <integer>,
+  "row_count": <integer>,
+  "summaries": [
+    {
+      "row_number": <integer>,
+      "field_name": "<type_description>"
+    }
+  ]
+}
+```
+### Type Annotations in Schema
+Use clear type annotations in markdown:
+| Annotation | Type |
+|------------|------|
+| `<integer>` | Integer number |
+| `<number>` | Any number |
+| `<string>` | Text string |
+| `<boolean>` | true or false |
+| `<array>` | JSON array |
+| `<object>` | JSON object |
+### Enum Values
+Specify allowed values:
+```json
+{
+  "status": "<one of: active/inactive/pending>"
+}
+```
+### Optional Fields
+Mark optional fields clearly:
+```json
+{
+  "required_field": "<string>",
+  "optional_field?": "<string or null>"
+}
+```
+## Validation Flow
+### Step 1: Infer Schema from First Output
+```python
+# Get first output
+first_result = jsonExtract(file_path="./outputs/batch001.md")
+# Infer schema
+schema_result = jsonInfer(
+    data=json.dumps(first_result.data),
+    strict=true
+)
+# Store schema for validation
+expected_schema = json.dumps(schema_result.data)
+```
+### Step 2: Validate All Outputs
+```python
+for file_path in output_files:
+    # Extract
+    result = jsonExtract(file_path=file_path)
+    if not result.success:
+        log_error(f"Extraction failed: {file_path}")
+        continue
+    # Validate
+    validation = jsonValidate(
+        data=json.dumps(result.data),
+        schema=expected_schema
+    )
+    if not validation.valid:
+        log_error(f"Validation failed: {file_path}")
+        log_error(validation.errors)
+        continue
+    # Collect valid data
+    valid_outputs.append(result.data)
+```
+## Error Handling
+### Error Structures
+#### jsonExtract Error Response
+```json
+{
+  "success": false,
+  "data": null,
+  "metadata": {
+    "error": {
+      "code": "NO_JSON_FOUND",
+      "message": "No valid JSON found in file"
+    }
+  }
+}
+```
+#### Error Codes
+| Code | Description |
+|------|-------------|
+| `FILE_NOT_FOUND` | File does not exist |
+| `FILE_TOO_LARGE` | File exceeds 200MB limit |
+| `BINARY_FILE` | File is binary format |
+| `EMPTY_FILE` | File has no content |
+| `NO_JSON_FOUND` | No valid JSON found |
+#### jsonValidate Error Response
+```json
+{
+  "success": true,
+  "valid": false,
+  "errors": [
+    {
+      "path": "summaries.0.row_number",
+      "message": "Expected number, received string",
+      "code": "invalid_type",
+      "expected": "number",
+      "received": "string"
+    }
+  ]
+}
+```
+### Extraction Failures
+```python
+if not result.success:
+    error_code = result.metadata.get("error", {}).get("code", "UNKNOWN")
+    if error_code == "NO_JSON_FOUND":
+        log_error("Subagent did not output valid JSON")
+        log_error("Check subagent output for errors")
+    retry_or_skip()
+```
+### Validation Failures
+```python
+if not validation.valid:
+    for error in validation.errors:
+        log_error(f"Field {error['path']}: {error['message']}")
+        if error['code'] == "invalid_type":
+            log_error(f"  Expected: {error['expected']}")
+            log_error(f"  Received: {error['received']}")
+```
+### Subagent Execution Failures
+Beyond output validation, subagents may fail during execution:
+| Failure Type | Detection | Recovery |
+|--------------|-----------|----------|
+| Timeout | Task exceeds time limit | Retry with smaller batch |
+| Crash | No output file created | Retry or skip |
+| Partial output | Incomplete JSON | Retry or use partial data |
+| Wrong format | JSON doesn't match schema | Re-prompt with clearer instructions |
+### Failure Handling Pattern
+```python
+# After launching subagent wave
+failed_batches = []
+for batch_file in expected_outputs:
+    if not file_exists(batch_file):
+        log_error(f"Subagent failed to create output: {batch_file}")
+        failed_batches.append(batch_file)
+        continue
+    result = jsonExtract(file_path=batch_file)
+    if not result.success:
+        log_error(f"Failed to extract JSON: {batch_file}")
+        failed_batches.append(batch_file)
+        continue
+    validation = jsonValidate(data=json.dumps(result.data), schema=expected_schema)
+    if not validation.valid:
+        log_error(f"Validation failed: {batch_file}")
+        failed_batches.append(batch_file)
+        continue
+    valid_outputs.append(result.data)
+# Retry failed batches using retry.md pattern
+if failed_batches:
+    for batch in failed_batches:
+        retry_subagent(batch, max_attempts=3, delay=5)
+```
+> **Note:** Reference `patterns/retry.md` for implementing retry logic with exponential backoff.
+## Complete Example
+### Main Agent: Create Prompt
+```markdown
+# Gene Classification Task
+## Input
+- File: ./data/genes.xlsx
+- Sheet: Sheet1
+- Rows: 2-31
+## Instructions
+For each row, classify the gene by species and function.
+## Output Format
+Write JSON to: ./outputs/batch001.md
+```json
+{
+  "batch_number": 1,
+  "row_count": 30,
+  "summaries": [
+    {
+      "row_number": <integer>,
+      "gene_name": "<string>",
+      "species": "<one of: human/mouse/other>",
+      "function": "<string>"
+    }
+  ]
+}
+```
+```
+### Subagent: Write Output
+```json
+{
+  "batch_number": 1,
+  "row_count": 30,
+  "summaries": [
+    {"row_number": 2, "gene_name": "BRAF", "species": "human", "function": "Kinase"},
+    {"row_number": 3, "gene_name": "TP53", "species": "human", "function": "Tumor suppressor"}
+  ]
+}
+```
+### Main Agent: Validate
+```python
+# Extract
+result = jsonExtract(file_path="./outputs/batch001.md")
+# Validate
+validation = jsonValidate(
+    data=json.dumps(result.data),
+    schema='{"type":"object","properties":{"batch_number":{"type":"integer"},"summaries":{"type":"array"}}}'
+)
+if validation.valid:
+    process(result.data)
+```
+## Best Practices
+1. **Embed schema in prompt**: Don't rely on external schema files
+2. **Use strict typing**: Specify exact types and allowed values
+3. **Validate every output**: Never skip validation
+4. **Handle errors gracefully**: Log and continue with other outputs
+5. **Keep schemas simple**: Avoid complex nested structures