scientify 1.7.1 → 1.7.3
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +312 -189
- package/README.zh.md +312 -189
- package/dist/src/hooks/research-mode.d.ts +4 -6
- package/dist/src/hooks/research-mode.d.ts.map +1 -1
- package/dist/src/hooks/research-mode.js +12 -129
- package/dist/src/hooks/research-mode.js.map +1 -1
- package/package.json +1 -1
- package/skills/install-scientify/SKILL.md +31 -19
- package/skills/research-pipeline/SKILL.md +7 -1
package/README.md
CHANGED
|
@@ -2,253 +2,381 @@
|
|
|
2
2
|
|
|
3
3
|
**AI-powered research workflow automation for OpenClaw.**
|
|
4
4
|
|
|
5
|
+
Scientify is an [OpenClaw](https://github.com/openclaw/openclaw) plugin that automates the full academic research pipeline — from literature survey to experiment execution — using LLM-driven sub-agents.
|
|
6
|
+
|
|
5
7
|
[中文文档](./README.zh.md)
|
|
6
8
|
|
|
7
9
|
---
|
|
8
10
|
|
|
9
|
-
##
|
|
11
|
+
## What It Does
|
|
10
12
|
|
|
11
|
-
|
|
13
|
+
Scientify turns a single research prompt into a complete automated pipeline. Each phase runs as an independent sub-agent — the orchestrator verifies outputs between steps and passes context forward.
|
|
12
14
|
|
|
13
|
-
|
|
14
|
-
|-------|-------------|
|
|
15
|
-
| **research-pipeline** | Orchestrator for end-to-end ML research. Spawns sub-agents for each phase, verifies outputs between steps. |
|
|
16
|
-
| **research-survey** | Deep analysis of downloaded papers: extract formulas, map to code, produce method comparison table. |
|
|
17
|
-
| **research-plan** | Create structured 4-part implementation plan (Dataset/Model/Training/Testing) from survey results. |
|
|
18
|
-
| **research-implement** | Implement ML code from plan, run 2-epoch validation with `uv` venv isolation, verify real results. |
|
|
19
|
-
| **research-review** | Review implementation against plan and survey. Iterates fix-rerun-review up to 3 times. |
|
|
20
|
-
| **research-experiment** | Full training run + ablation experiments + result analysis. Requires review PASS. |
|
|
21
|
-
| **literature-survey** | Comprehensive literature survey: search → filter → download → cluster → report. |
|
|
22
|
-
| **idea-generation** | Generate innovative research ideas from a topic. Searches arXiv/GitHub, downloads papers, outputs 5 ideas. |
|
|
15
|
+
### Scenario 1 — End-to-End Research Pipeline
|
|
23
16
|
|
|
24
|
-
|
|
17
|
+
> *"Research scaling laws for classical ML classifiers on Fashion-MNIST"*
|
|
25
18
|
|
|
26
|
-
|
|
27
|
-
|---------|-------------|
|
|
28
|
-
| `/research-status` | Show workspace status |
|
|
29
|
-
| `/papers` | List downloaded papers |
|
|
30
|
-
| `/ideas` | List generated ideas |
|
|
31
|
-
| `/projects` | List all projects |
|
|
32
|
-
| `/project-switch <id>` | Switch project |
|
|
33
|
-
| `/project-delete <id>` | Delete project |
|
|
19
|
+
The **research-pipeline** orchestrator runs all 6 phases in sequence, spawning a dedicated sub-agent for each:
|
|
34
20
|
|
|
35
|
-
|
|
21
|
+
```mermaid
|
|
22
|
+
flowchart LR
|
|
23
|
+
A["Literature\nSurvey"] --> B["Deep\nAnalysis"] --> C["Implementation\nPlan"] --> D["Code\nImplementation"] --> E["Automated\nReview"] --> F["Full\nExperiment"]
|
|
24
|
+
```
|
|
36
25
|
|
|
37
|
-
|
|
38
|
-
|
|
39
|
-
|
|
40
|
-
|
|
|
41
|
-
|
|
26
|
+
<details>
|
|
27
|
+
<summary><b>What each phase produces</b></summary>
|
|
28
|
+
|
|
29
|
+
| Phase | What Happens | Output File |
|
|
30
|
+
|:------|:-------------|:------------|
|
|
31
|
+
| **1. Literature Survey** | Search arXiv + OpenAlex, filter, download .tex sources, cluster by direction | `survey/report.md` |
|
|
32
|
+
| **2. Deep Analysis** | Extract formulas, map methods to code, build cross-comparison | `survey_res.md` |
|
|
33
|
+
| **3. Implementation Plan** | Design 4-part plan — Dataset / Model / Training / Testing | `plan_res.md` |
|
|
34
|
+
| **4. Code Implementation** | Write ML code in `uv`-isolated venv, validate with 2-epoch run | `project/run.py` |
|
|
35
|
+
| **5. Automated Review** | Review code → fix issues → rerun → re-review (up to 3 rounds) | `iterations/judge_v*.md` |
|
|
36
|
+
| **6. Full Experiment** | Complete training + ablation studies with final analysis | `experiment_res.md` |
|
|
37
|
+
|
|
38
|
+
</details>
|
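The orchestrator's verify-then-advance loop can be sketched in a few lines. This is an illustrative stand-in, not Scientify's internals: phase names and output files mirror the table above, and `run_phase` is a placeholder for spawning a sub-agent.

```python
from pathlib import Path

# Phase order and expected artifacts, as in the table above.
PHASES = [
    ("literature-survey", "survey/report.md"),
    ("research-survey", "survey_res.md"),
    ("research-plan", "plan_res.md"),
    ("research-implement", "project/run.py"),
    ("research-review", "iterations/judge_v1.md"),
    ("research-experiment", "experiment_res.md"),
]

def run_phase(root: Path, skill: str, output: str) -> None:
    # Placeholder for a sub-agent run; here we just create the artifact.
    path = root / output
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(f"output of {skill}\n")

def run_pipeline(root: Path) -> None:
    for skill, output in PHASES:
        run_phase(root, skill, output)
        # Verify the expected artifact exists before advancing.
        if not (root / output).exists():
            raise RuntimeError(f"{skill} did not produce {output}")
```

The key property is the check between phases: a missing artifact stops the pipeline instead of silently feeding an empty context to the next sub-agent.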
|
42
39
|
|
|
43
40
|
---
|
|
44
41
|
|
|
45
|
-
|
|
42
|
+
### Scenario 2 — Idea Generation
|
|
46
43
|
|
|
47
|
-
|
|
48
|
-
# Install the plugin
|
|
49
|
-
openclaw plugins install scientify
|
|
44
|
+
> *"Explore recent advances in protein folding and generate innovative research ideas"*
|
|
50
45
|
|
|
51
|
-
|
|
52
|
-
|
|
53
|
-
|
|
46
|
+
The **idea-generation** skill surveys the field, then:
|
|
47
|
+
|
|
48
|
+
1. Generates **5 diverse research ideas** grounded in real papers
|
|
49
|
+
2. Scores each on novelty, feasibility, and impact
|
|
50
|
+
3. Selects the best and produces an **enhanced proposal** with detailed methodology
|
|
51
|
+
|
|
52
|
+
> [!TIP]
|
|
53
|
+
> **Output:** `ideas/selected_idea.md` — a ready-to-develop research proposal.
|
|
54
54
|
|
|
55
55
|
---
|
|
56
56
|
|
|
57
|
-
|
|
57
|
+
### Scenario 3 — Standalone Literature Survey
|
|
58
58
|
|
|
59
|
-
|
|
60
|
-
|
|
61
|
-
|
|
59
|
+
> *"Survey the latest papers on vision-language models for medical imaging"*
|
|
60
|
+
|
|
61
|
+
Run just the survey phase when you need a structured reading list without running the full pipeline:
|
|
62
62
|
|
|
63
|
-
|
|
63
|
+
- Searches **arXiv** (CS/ML) and **OpenAlex** (cross-disciplinary, broader coverage)
|
|
64
|
+
- Downloads `.tex` source files; retrieves open-access PDFs via **Unpaywall**
|
|
65
|
+
- Clusters papers by sub-topic and extracts key findings
|
|
66
|
+
- Generates a structured survey report
|
|
64
67
|
|
|
65
|
-
|
|
68
|
+
> [!TIP]
|
|
69
|
+
> **Output:** `survey/report.md` + raw papers in `papers/_downloads/`
|
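The clustering step can be approximated without an LLM. The sketch below groups papers by the first matching sub-topic keyword in the title; the paper titles, sub-topic names, and keyword matching are illustrative assumptions, deliberately cruder than the skill's actual LLM-based clustering.

```python
from collections import defaultdict

def cluster_by_subtopic(papers, subtopics):
    # Assign each paper to the first sub-topic whose keyword appears
    # in its title; unmatched papers fall into "other".
    clusters = defaultdict(list)
    for title in papers:
        lowered = title.lower()
        for name, keyword in subtopics.items():
            if keyword in lowered:
                clusters[name].append(title)
                break
        else:
            clusters["other"].append(title)
    return dict(clusters)

papers = ["CLIP for Chest X-rays", "Segment Anything in CT",
          "A Survey of Diffusion Models"]
subtopics = {"vlm-medical": "x-ray", "segmentation": "ct"}
print(cluster_by_subtopic(papers, subtopics))
```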
|
66
70
|
|
|
67
71
|
---
|
|
68
72
|
|
|
69
|
-
|
|
73
|
+
### Scenario 4 — Review Paper Drafting
|
|
70
74
|
|
|
71
|
-
|
|
75
|
+
> *"Write a survey paper based on my project's research outputs"*
|
|
72
76
|
|
|
73
|
-
|
|
74
|
-
You: Research "long document summarization" and generate some innovative ideas
|
|
75
|
-
|
|
76
|
-
Agent: [Auto-executes]
|
|
77
|
-
1. Search arXiv papers
|
|
78
|
-
2. Search GitHub repositories
|
|
79
|
-
3. Download and analyze .tex sources
|
|
80
|
-
4. Generate 5 innovative ideas
|
|
81
|
-
5. Select and enhance the best idea
|
|
82
|
-
6. Map to code implementations
|
|
83
|
-
```
|
|
77
|
+
After completing a research pipeline (or just a literature survey + deep analysis), the **write-review-paper** skill assembles a draft:
|
|
84
78
|
|
|
85
|
-
|
|
79
|
+
- Synthesizes survey reports, analysis notes, and comparison tables
|
|
80
|
+
- Structures the paper with Introduction, Related Work, Methods, and Discussion
|
|
81
|
+
- Produces a publication-ready draft in Markdown
|
|
86
82
|
|
|
87
|
-
|
|
88
|
-
|
|
89
|
-
"transformer efficiency", summarize relevant ones and send to Feishu
|
|
90
|
-
|
|
91
|
-
Agent: Setting up:
|
|
92
|
-
1. Create scheduled Hook (cron: "0 9 * * *")
|
|
93
|
-
2. Daily arxiv search for papers from last 24h
|
|
94
|
-
3. Compare against your idea (selected_idea.md)
|
|
95
|
-
4. Filter relevant papers, generate summary
|
|
96
|
-
5. Push via Feishu webhook
|
|
97
|
-
|
|
98
|
-
[Example push]
|
|
99
|
-
📚 Today's Relevant Papers (3)
|
|
100
|
-
• "FlashAttention-3: Fast Attention with ..." - Highly relevant to your idea
|
|
101
|
-
• "Efficient Long-Context Transformers" - Medium relevance
|
|
102
|
-
• "..."
|
|
103
|
-
```
|
|
83
|
+
> [!TIP]
|
|
84
|
+
> **Output:** a survey/review paper draft based on all accumulated project artifacts.
|
|
104
85
|
|
|
105
|
-
|
|
86
|
+
---
|
|
106
87
|
|
|
107
|
-
|
|
108
|
-
|
|
88
|
+
### Advanced Scenarios — Combining OpenClaw Platform Capabilities
|
|
89
|
+
|
|
90
|
+
As an OpenClaw plugin, Scientify can leverage the platform's MCP servers, browser automation, multi-session concurrency, and more to build powerful composite workflows.
|
|
91
|
+
|
|
92
|
+
---
|
|
109
93
|
|
|
110
|
-
|
|
111
|
-
|
|
112
|
-
|
|
113
|
-
|
|
114
|
-
|
|
94
|
+
### Scenario 5 — Literature Monitoring Bot
|
|
95
|
+
|
|
96
|
+
> *"Automatically search for new diffusion model papers every day and push a digest to our Slack channel"*
|
|
97
|
+
|
|
98
|
+
Combine OpenClaw's **MCP integration** (Slack / Feishu / Email) with **scheduled triggers** to build automated literature monitoring:
|
|
99
|
+
|
|
100
|
+
```mermaid
|
|
101
|
+
flowchart LR
|
|
102
|
+
A["Scheduled Trigger\n(cron / webhook)"] --> B["arxiv_search\n+ openalex_search"]
|
|
103
|
+
B --> C["LLM Filtering\n+ Summary"]
|
|
104
|
+
C --> D["Push to\nSlack / Feishu / Email"]
|
|
115
105
|
```
|
|
116
106
|
|
|
117
|
-
|
|
107
|
+
1. An external cron job or an OpenClaw webhook triggers a session periodically
|
|
108
|
+
2. Scientify's `arxiv_search` + `openalex_search` fetch the latest papers
|
|
109
|
+
3. LLM scores and filters by your research interests, generates concise summaries
|
|
110
|
+
4. MCP tools push the digest to Slack, Feishu, or email
|
|
111
|
+
|
|
112
|
+
> [!NOTE]
|
|
113
|
+
> **Requires:** A configured MCP server (e.g., `slack-mcp`, `feishu-mcp`). OpenClaw supports declaring MCP servers in `openclaw.json`.
|
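Step 3 (LLM filtering + summary) reduces to scoring each fetched paper against your interests and keeping the relevant ones. A minimal sketch, with the paper dicts, field names, and threshold as illustrative assumptions rather than Scientify's real schema:

```python
def build_digest(papers, interests, threshold=1):
    # Score each paper by keyword hits in title + abstract,
    # keep those at or above the threshold, best first.
    scored = []
    for p in papers:
        text = (p["title"] + " " + p["abstract"]).lower()
        score = sum(text.count(kw.lower()) for kw in interests)
        if score >= threshold:
            scored.append((score, p))
    scored.sort(key=lambda item: -item[0])
    lines = [f"- {p['title']} (score {s})" for s, p in scored]
    return "Today's Relevant Papers\n" + "\n".join(lines)

papers = [
    {"title": "Fast Diffusion Sampling", "abstract": "We speed up diffusion models."},
    {"title": "Graph Pooling Revisited", "abstract": "A study of GNN pooling."},
]
print(build_digest(papers, ["diffusion"]))
```

In the real workflow the scoring is done by the LLM and the resulting digest string is handed to an MCP push tool.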
|
114
|
+
|
|
115
|
+
---
|
|
116
|
+
|
|
117
|
+
### Scenario 6 — Download Paywalled Papers via Browser
|
|
118
|
+
|
|
119
|
+
> *"Download these 5 IEEE papers using my university VPN"*
|
|
118
120
|
|
|
121
|
+
Scientify's built-in `arxiv_download` and `unpaywall_download` only handle open-access papers. For paywalled content, combine with OpenClaw's **browser automation** (Playwright MCP):
|
|
122
|
+
|
|
123
|
+
```mermaid
|
|
124
|
+
flowchart LR
|
|
125
|
+
A["Scientify\nprovides paper URLs"] --> B["Playwright MCP\nopens browser"]
|
|
126
|
+
B --> C["Institutional Proxy\nauto-authenticate"]
|
|
127
|
+
C --> D["Navigate to Publisher\ndownload PDF"]
|
|
119
128
|
```
|
|
120
|
-
You: Monitor new papers from "Yann LeCun" and "Meta AI"
|
|
121
129
|
|
|
122
|
-
|
|
123
|
-
|
|
124
|
-
|
|
125
|
-
|
|
130
|
+
- OpenClaw launches a controlled browser via Playwright MCP server
|
|
131
|
+
- The browser accesses publisher sites through your institutional proxy / VPN
|
|
132
|
+
- Automatically navigates to the paper page and downloads the PDF to `papers/_downloads/`
|
|
133
|
+
- Works with IEEE, Springer, Elsevier, ACM, and other subscription-based publishers
|
|
134
|
+
|
|
135
|
+
> [!NOTE]
|
|
136
|
+
> **Requires:** Playwright MCP server configured, and institutional network access to the papers.
|
|
137
|
+
|
|
138
|
+
---
|
|
139
|
+
|
|
140
|
+
### Scenario 7 — Multi-Topic Parallel Research
|
|
141
|
+
|
|
142
|
+
> *"Research 3 directions simultaneously: LoRA fine-tuning, MoE architectures, KV-Cache optimization"*
|
|
143
|
+
|
|
144
|
+
Leverage OpenClaw's **multi-session concurrency** (`sessions_spawn`) to run multiple research pipelines in parallel:
|
|
145
|
+
|
|
146
|
+
```mermaid
|
|
147
|
+
flowchart TD
|
|
148
|
+
O["Main Agent\n(Orchestrator)"] --> A["Sub-session 1\nLoRA Fine-tuning"]
|
|
149
|
+
O --> B["Sub-session 2\nMoE Architectures"]
|
|
150
|
+
O --> C["Sub-session 3\nKV-Cache Optimization"]
|
|
151
|
+
A --> D["Independent project dirs\nisolated from each other"]
|
|
152
|
+
B --> D
|
|
153
|
+
C --> D
|
|
126
154
|
```
|
|
127
155
|
|
|
128
|
-
|
|
156
|
+
- Each sub-topic runs a full pipeline with its own project directory
|
|
157
|
+
- The main agent collects results and produces a cross-topic comparative analysis
|
|
158
|
+
- Ideal for quickly scouting multiple directions during the topic-selection phase of a survey paper
|
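The fan-out/collect pattern itself is independent of OpenClaw's session API. A conceptual sketch with Python's standard library, where `run_pipeline` is a stand-in for a full Scientify pipeline run and the topic names are just the example above:

```python
from concurrent.futures import ThreadPoolExecutor

def run_pipeline(topic: str) -> dict:
    # Placeholder: a real run would execute the full pipeline and
    # write into that topic's own project directory.
    return {"topic": topic, "report": f"projects/{topic}/survey/report.md"}

topics = ["lora-fine-tuning", "moe-architectures", "kv-cache-optimization"]
with ThreadPoolExecutor(max_workers=len(topics)) as pool:
    results = list(pool.map(run_pipeline, topics))

# The orchestrator compares results once all sub-runs finish.
for r in results:
    print(r["topic"], "->", r["report"])
```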
|
159
|
+
|
|
160
|
+
---
|
|
161
|
+
|
|
162
|
+
### Scenario 8 — Interactive Paper Reading Assistant
|
|
163
|
+
|
|
164
|
+
> *"Walk me through 'Attention Is All You Need' section by section, explain every formula"*
|
|
165
|
+
|
|
166
|
+
Combine OpenClaw's conversational interface with Scientify's `paper_browser` tool for interactive deep reading:
|
|
167
|
+
|
|
168
|
+
- `paper_browser` loads papers page-by-page, avoiding context overflow
|
|
169
|
+
- Discuss section by section: LLM explains derivations, compares with related work, highlights contributions
|
|
170
|
+
- Follow up on implementation details — LLM uses `github_search` to find corresponding open-source code
|
|
171
|
+
- All analysis notes are saved to `notes/paper_{id}.md`
|
|
129
172
|
|
|
173
|
+
---
|
|
174
|
+
|
|
175
|
+
### Scenario 9 — Paper-to-Reproducible-Experiment
|
|
176
|
+
|
|
177
|
+
> *"Reproduce the results from Table 2 of this paper"*
|
|
178
|
+
|
|
179
|
+
End-to-end automation: understand paper → implement code → run experiment → compare results:
|
|
180
|
+
|
|
181
|
+
```mermaid
|
|
182
|
+
flowchart LR
|
|
183
|
+
A["paper_browser\nDeep read paper"] --> B["research-plan\nExtract experiment design"]
|
|
184
|
+
B --> C["research-implement\nWrite code"]
|
|
185
|
+
C --> D["research-experiment\nRun experiment"]
|
|
186
|
+
D --> E["Compare with\npaper's Table 2"]
|
|
130
187
|
```
|
|
131
|
-
You: Read papers/2401.12345/ and compare its method with my idea
|
|
132
188
|
|
|
133
|
-
|
|
189
|
+
1. `paper_browser` reads the method and experiment sections in detail
|
|
190
|
+
2. `research-plan` extracts experiment config (hyperparameters, datasets, metrics)
|
|
191
|
+
3. `research-implement` generates code and validates in a `uv`-isolated environment
|
|
192
|
+
4. `research-experiment` runs the full experiment
|
|
193
|
+
5. LLM automatically compares your results against the paper's reported numbers
|
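Step 5's comparison amounts to checking each reproduced metric against the paper's reported number within a tolerance. A minimal sketch — the metric names, example values, and 5% tolerance are illustrative:

```python
def compare_results(reported: dict, reproduced: dict, rel_tol: float = 0.05):
    # Flag metrics whose relative deviation from the paper exceeds rel_tol.
    report = {}
    for name, ref in reported.items():
        got = reproduced.get(name)
        if got is None:
            report[name] = "missing"
            continue
        dev = abs(got - ref) / abs(ref)
        report[name] = "ok" if dev <= rel_tol else f"off by {dev:.1%}"
    return report

# e.g. numbers a paper's Table 2 might report vs. a rerun
print(compare_results({"accuracy": 0.912, "f1": 0.884},
                      {"accuracy": 0.905, "f1": 0.790}))
```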
|
134
194
|
|
|
135
|
-
|
|
195
|
+
---
|
|
136
196
|
|
|
137
|
-
|
|
197
|
+
## Prerequisites
|
|
138
198
|
|
|
139
|
-
|
|
140
|
-
|
|
141
|
-
|
|
142
|
-
| Complexity | O(n√n) | O(n log n) |
|
|
143
|
-
| Advantage | Simple implementation | Preserves more info |
|
|
199
|
+
- **Node.js** >= 18
|
|
200
|
+
- **Python 3** + **uv** (for ML code execution)
|
|
201
|
+
- **git**
|
|
144
202
|
|
|
145
|
-
|
|
146
|
-
|
|
203
|
+
---
|
|
204
|
+
|
|
205
|
+
## Install OpenClaw
|
|
206
|
+
|
|
207
|
+
```bash
|
|
208
|
+
# Install OpenClaw globally
|
|
209
|
+
pnpm add -g openclaw # or: npm install -g openclaw
|
|
210
|
+
|
|
211
|
+
# Run onboarding wizard (configures model provider, API key, workspace)
|
|
212
|
+
openclaw onboard
|
|
213
|
+
|
|
214
|
+
# Start the gateway (runs the WebUI server)
|
|
215
|
+
openclaw gateway
|
|
147
216
|
```
|
|
148
217
|
|
|
149
|
-
|
|
218
|
+
After running `openclaw gateway`, the WebUI is available at **http://127.0.0.1:18789/** (default port).
|
|
219
|
+
|
|
220
|
+
> **Proxy users:** If `http_proxy` is set in your shell, bypass it for localhost (e.g. `curl --noproxy 127.0.0.1 http://127.0.0.1:18789/`), or exclude `127.0.0.1` in your browser's proxy settings.
|
|
221
|
+
|
|
222
|
+
---
|
|
150
223
|
|
|
224
|
+
## Install Scientify
|
|
225
|
+
|
|
226
|
+
### From npm (recommended)
|
|
227
|
+
|
|
228
|
+
```bash
|
|
229
|
+
openclaw plugins install scientify
|
|
151
230
|
```
|
|
152
|
-
You: Write Related Work section based on papers in my project
|
|
153
231
|
|
|
154
|
-
|
|
232
|
+
The plugin installs to `~/.openclaw/extensions/scientify/` and is automatically enabled.
|
|
155
233
|
|
|
156
|
-
|
|
234
|
+
### From source (development)
|
|
157
235
|
|
|
158
|
-
|
|
236
|
+
```bash
|
|
237
|
+
git clone https://github.com/user/scientify.git
|
|
238
|
+
cd scientify && pnpm install && pnpm build
|
|
239
|
+
|
|
240
|
+
# Link as dev plugin
|
|
241
|
+
openclaw plugins install -l ./
|
|
242
|
+
```
|
|
159
243
|
|
|
160
|
-
|
|
161
|
-
Recent works have explored various approaches...
|
|
162
|
-
\cite{paper1} proposed...
|
|
163
|
-
\cite{paper2} extended this by...
|
|
244
|
+
### Verify installation
|
|
164
245
|
|
|
165
|
-
|
|
166
|
-
|
|
246
|
+
```bash
|
|
247
|
+
openclaw plugins list
|
|
248
|
+
# Should show: scientify (enabled)
|
|
167
249
|
```
|
|
168
250
|
|
|
169
|
-
|
|
251
|
+
After installation, **restart the gateway** to load the plugin:
|
|
170
252
|
|
|
253
|
+
```bash
|
|
254
|
+
# Stop the running gateway (Ctrl+C), then:
|
|
255
|
+
openclaw gateway
|
|
171
256
|
```
|
|
172
|
-
You: Combine "reinforcement learning" and "text summarization" to spark new ideas
|
|
173
257
|
|
|
174
|
-
|
|
258
|
+
---
|
|
175
259
|
|
|
176
|
-
|
|
260
|
+
## Usage via WebUI
|
|
177
261
|
|
|
178
|
-
|
|
179
|
-
Use RL to optimize readability and information coverage
|
|
180
|
-
References: [2301.xxx], [2302.xxx]
|
|
262
|
+
### 1. Open the WebUI
|
|
181
263
|
|
|
182
|
-
|
|
183
|
-
User feedback as reward signal for iterative optimization
|
|
264
|
+
Navigate to **http://127.0.0.1:18789/** in your browser.
|
|
184
265
|
|
|
185
|
-
|
|
266
|
+
### 2. Start a research task
|
|
267
|
+
|
|
268
|
+
Type a research prompt in the chat. Scientify skills are auto-matched by the LLM:
|
|
269
|
+
|
|
270
|
+
```
|
|
271
|
+
Research "transformer efficiency" and generate some innovative ideas
|
|
186
272
|
```
|
|
187
273
|
|
|
188
|
-
|
|
274
|
+
Or invoke a specific skill directly with a slash command:
|
|
189
275
|
|
|
190
276
|
```
|
|
191
|
-
|
|
277
|
+
/research-pipeline
|
|
278
|
+
/literature-survey
|
|
279
|
+
/idea-generation
|
|
280
|
+
```
|
|
192
281
|
|
|
193
|
-
|
|
282
|
+
### 3. Monitor sub-agent progress
|
|
194
283
|
|
|
195
|
-
|
|
284
|
+
When the orchestrator spawns sub-agents, you'll see:
|
|
285
|
+
- **Spawn notification** — "Phase 1: Literature Survey started"
|
|
286
|
+
- **Completion announcement** — automatic message when the sub-agent finishes
|
|
287
|
+
- **Progress updates** — the orchestrator verifies outputs and advances to the next phase
|
|
196
288
|
|
|
197
|
-
|
|
198
|
-
1. CNN/DailyMail - Standard news summarization (287k samples)
|
|
199
|
-
2. arXiv - Long scientific papers (215k samples)
|
|
200
|
-
3. ...
|
|
289
|
+
You can also check status anytime with:
|
|
201
290
|
|
|
202
|
-
|
|
203
|
-
|
|
204
|
-
|
|
291
|
+
```
|
|
292
|
+
/research-status
|
|
293
|
+
```
|
|
205
294
|
|
|
206
|
-
|
|
207
|
-
- ROUGE-1/2/L
|
|
208
|
-
- BERTScore
|
|
209
|
-
- Human evaluation: fluency, information coverage
|
|
295
|
+
### 4. Manage projects
|
|
210
296
|
|
|
211
|
-
### Ablation Studies
|
|
212
|
-
1. Remove xxx module
|
|
213
|
-
2. ...
|
|
214
297
|
```
|
|
298
|
+
/projects # List all projects
|
|
299
|
+
/project-switch <id> # Switch to a different project
|
|
300
|
+
/papers # List downloaded papers
|
|
301
|
+
/ideas # List generated ideas
|
|
302
|
+
```
|
|
303
|
+
|
|
304
|
+
---
|
|
305
|
+
|
|
306
|
+
## Skills
|
|
307
|
+
|
|
308
|
+
### Pipeline Skills (LLM-powered)
|
|
309
|
+
|
|
310
|
+
| Skill | Slash Command | Description |
|
|
311
|
+
|-------|---------------|-------------|
|
|
312
|
+
| **research-pipeline** | `/research-pipeline` | Orchestrator. Spawns sub-agents for each phase, verifies outputs between steps. |
|
|
313
|
+
| **literature-survey** | `/literature-survey` | Search arXiv → filter → download .tex sources → cluster → generate survey report. |
|
|
314
|
+
| **research-survey** | `/research-survey` | Deep analysis of papers: extract formulas, map to code, produce method comparison table. |
|
|
315
|
+
| **research-plan** | `/research-plan` | Create 4-part implementation plan (Dataset/Model/Training/Testing) from survey results. |
|
|
316
|
+
| **research-implement** | `/research-implement` | Implement ML code from plan, run 2-epoch validation with `uv` venv isolation. |
|
|
317
|
+
| **research-review** | `/research-review` | Review implementation. Iterates fix → rerun → review up to 3 times. |
|
|
318
|
+
| **research-experiment** | `/research-experiment` | Full training + ablation experiments. Requires review PASS. |
|
|
319
|
+
| **idea-generation** | `/idea-generation` | Generate 5 innovative research ideas from a topic, select and enhance the best one. |
|
|
320
|
+
|
|
321
|
+
### Standalone Skills
|
|
322
|
+
|
|
323
|
+
| Skill | Description |
|
|
324
|
+
|-------|-------------|
|
|
325
|
+
| **write-review-paper** | Draft a review/survey paper from project research outputs. |
|
|
326
|
+
|
|
327
|
+
### Tools (available to LLM)
|
|
328
|
+
|
|
329
|
+
| Tool | Description |
|
|
330
|
+
|------|-------------|
|
|
331
|
+
| `arxiv_search` | Search arXiv papers. Returns metadata (title, authors, abstract, ID). Does not download files. Supports sorting by relevance/date and date filtering. |
|
|
332
|
+
| `arxiv_download` | Batch download papers by arXiv ID. Prefers .tex source files (PDF fallback). Requires absolute output directory path. |
|
|
333
|
+
| `openalex_search` | Search cross-disciplinary academic papers via OpenAlex API. Returns DOI, authors, citation count, OA status. Broader coverage than arXiv. |
|
|
334
|
+
| `unpaywall_download` | Download open access PDFs by DOI via Unpaywall API. Non-OA papers are silently skipped (no failure). |
|
|
335
|
+
| `github_search` | Search GitHub repositories. Returns repo name, description, stars, URL. Supports language filtering and sorting. |
|
|
336
|
+
| `paper_browser` | Paginated browsing of large paper files (.tex/.md) to avoid loading thousands of lines into context. Returns specified line range with navigation info. |
|
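The idea behind `paper_browser` is a windowed file reader: return one page of lines plus navigation info, never the whole file. A sketch of the pattern — the function name, parameters, and return shape are illustrative, not the tool's real interface:

```python
def read_page(path: str, start: int, page_size: int = 200) -> dict:
    # Return one window of lines plus navigation info, so the caller
    # never loads the whole paper into context at once.
    with open(path, encoding="utf-8", errors="replace") as f:
        lines = f.readlines()
    end = min(start + page_size, len(lines))
    return {
        "lines": lines[start:end],
        "range": (start, end),
        "total": len(lines),
        "has_more": end < len(lines),
    }
```

The caller pages forward by passing the previous `range` end as the next `start` until `has_more` is false.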
|
337
|
+
|
|
338
|
+
### Commands (direct, no LLM)
|
|
339
|
+
|
|
340
|
+
| Command | Description |
|
|
341
|
+
|---------|-------------|
|
|
342
|
+
| `/research-status` | Show workspace status and active project |
|
|
343
|
+
| `/papers` | List downloaded papers with metadata |
|
|
344
|
+
| `/ideas` | List generated ideas |
|
|
345
|
+
| `/projects` | List all projects |
|
|
346
|
+
| `/project-switch <id>` | Switch active project |
|
|
347
|
+
| `/project-delete <id>` | Delete a project |
|
|
215
348
|
|
|
216
349
|
---
|
|
217
350
|
|
|
218
351
|
## Workspace Structure
|
|
219
352
|
|
|
353
|
+
All research data is organized under `~/.openclaw/workspace/projects/`:
|
|
354
|
+
|
|
220
355
|
```
|
|
221
|
-
|
|
222
|
-
├── .active
|
|
223
|
-
├──
|
|
224
|
-
│ ├── project.json
|
|
225
|
-
│ ├── task.json
|
|
226
|
-
│ ├── survey/
|
|
227
|
-
│ │ ├── search_terms.json # Search terms used
|
|
228
|
-
│ │ └── report.md # Final survey report
|
|
356
|
+
projects/
|
|
357
|
+
├── .active # Current project ID
|
|
358
|
+
├── scaling-law-fashion-mnist/ # Example project
|
|
359
|
+
│ ├── project.json # Metadata
|
|
360
|
+
│ ├── task.json # Task definition
|
|
229
361
|
│ ├── papers/
|
|
230
|
-
│ │ ├──
|
|
231
|
-
│ │
|
|
232
|
-
│
|
|
233
|
-
│ │ └──
|
|
234
|
-
│ ├──
|
|
235
|
-
│ ├── notes/ # /research-survey: per-paper analysis
|
|
362
|
+
│ │ ├── _meta/ # Paper metadata (*.json)
|
|
363
|
+
│ │ └── _downloads/ # Raw .tex/.pdf files
|
|
364
|
+
│ ├── survey/
|
|
365
|
+
│ │ └── report.md # Literature survey report
|
|
366
|
+
│ ├── notes/ # Per-paper deep analysis
|
|
236
367
|
│ │ └── paper_{arxiv_id}.md
|
|
237
|
-
│ ├── survey_res.md
|
|
238
|
-
│ ├── plan_res.md
|
|
239
|
-
│ ├── project/
|
|
240
|
-
│ │ ├── model/
|
|
241
|
-
│ │ ├── data/
|
|
368
|
+
│ ├── survey_res.md # Method comparison table
|
|
369
|
+
│ ├── plan_res.md # Implementation plan
|
|
370
|
+
│ ├── project/ # ML code
|
|
242
371
|
│ │ ├── run.py
|
|
243
372
|
│ │ └── requirements.txt
|
|
244
|
-
│ ├── ml_res.md
|
|
245
|
-
│ ├── iterations/
|
|
373
|
+
│ ├── ml_res.md # Implementation results
|
|
374
|
+
│ ├── iterations/ # Review iterations
|
|
246
375
|
│ │ └── judge_v*.md
|
|
247
|
-
│ ├── experiment_res.md
|
|
248
|
-
│ └── ideas/
|
|
249
|
-
│ ├──
|
|
250
|
-
│
|
|
251
|
-
│ └── selected_idea.md # Best idea
|
|
376
|
+
│ ├── experiment_res.md # Final experiment results
|
|
377
|
+
│ └── ideas/ # Generated ideas
|
|
378
|
+
│ ├── idea_*.md
|
|
379
|
+
│ └── selected_idea.md
|
|
252
380
|
└── another-project/
|
|
253
381
|
```
|
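Since `.active` is just a file holding the current project ID, project switching can be sketched in a few lines. This mirrors what `/project-switch` conceptually does; it is an illustration under that assumption, not the plugin's actual implementation:

```python
from pathlib import Path

def switch_project(projects_root: Path, project_id: str) -> None:
    # Validate the project directory exists, then record its ID
    # in `.active` (illustrative sketch only).
    if not (projects_root / project_id).is_dir():
        raise FileNotFoundError(f"no such project: {project_id}")
    (projects_root / ".active").write_text(project_id + "\n")

def active_project(projects_root: Path) -> str:
    return (projects_root / ".active").read_text().strip()
```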
|
254
382
|
|
|
@@ -256,58 +384,53 @@ Agent: [Reading selected_idea.md and related papers]
|
|
|
256
384
|
|
|
257
385
|
## Configuration
|
|
258
386
|
|
|
259
|
-
|
|
387
|
+
Plugin settings in `~/.openclaw/openclaw.json`:
|
|
260
388
|
|
|
261
389
|
```json
|
|
262
390
|
{
|
|
263
391
|
"plugins": {
|
|
264
392
|
"entries": {
|
|
265
393
|
"scientify": {
|
|
266
|
-
"enabled": true
|
|
267
|
-
"workspaceRoot": "~/my-research",
|
|
268
|
-
"defaultMaxPapers": 15
|
|
394
|
+
"enabled": true
|
|
269
395
|
}
|
|
270
396
|
}
|
|
271
397
|
}
|
|
272
398
|
}
|
|
273
399
|
```
|
|
274
400
|
|
|
275
|
-
### Plugin
|
|
401
|
+
### Plugin management
|
|
276
402
|
|
|
277
403
|
```bash
|
|
278
|
-
# List installed plugins
|
|
279
|
-
openclaw plugins
|
|
280
|
-
|
|
281
|
-
#
|
|
282
|
-
openclaw plugins
|
|
283
|
-
|
|
284
|
-
# Enable plugin
|
|
285
|
-
openclaw plugins enable scientify
|
|
286
|
-
|
|
287
|
-
# Update to latest version
|
|
288
|
-
openclaw plugins update scientify
|
|
404
|
+
openclaw plugins list # List installed plugins
|
|
405
|
+
openclaw plugins enable scientify # Enable
|
|
406
|
+
openclaw plugins disable scientify # Disable
|
|
407
|
+
openclaw plugins update scientify # Update to latest
|
|
408
|
+
openclaw plugins doctor # Diagnose issues
|
|
289
409
|
```
|
|
290
410
|
|
|
291
411
|
---
|
|
292
412
|
|
|
293
413
|
## Known Limitations
|
|
294
414
|
|
|
295
|
-
|
|
296
|
-
|
|
297
|
-
|
|
298
|
-
|
|
299
|
-
- If `sandbox.mode: "off"` (default for CLI), commands run directly on host
|
|
300
|
-
- Current sandbox does NOT support GPU (`--gpus`) or custom shared memory (`--shm-size`)
|
|
301
|
-
|
|
302
|
-
For GPU-accelerated ML training, consider:
|
|
303
|
-
1. Running outside sandbox (configure agent with `sandbox.mode: "off"`)
|
|
304
|
-
2. Using a dedicated cloud GPU instance
|
|
305
|
-
3. Waiting for OpenClaw GPU support
|
|
415
|
+
- **Sub-agent timeout**: Each sub-agent has a 30-minute timeout (`runTimeoutSeconds: 1800`). Complex literature surveys with many papers may need longer.
|
|
416
|
+
- **GPU/Sandbox**: Code execution runs on host by default. OpenClaw sandbox does not support GPU passthrough yet.
|
|
417
|
+
- **Model dependency**: Research quality depends heavily on the LLM model used. Claude Opus 4.5+ or GPT-5+ recommended.
|
|
306
418
|
|
|
307
419
|
---
|
|
308
420
|
|
|
309
421
|
## Development
|
|
310
422
|
|
|
423
|
+
```bash
|
|
424
|
+
git clone https://github.com/user/scientify.git
|
|
425
|
+
cd scientify
|
|
426
|
+
pnpm install
|
|
427
|
+
pnpm build # Build TypeScript
|
|
428
|
+
pnpm dev # Watch mode
|
|
429
|
+
|
|
430
|
+
# Link to OpenClaw for testing
|
|
431
|
+
openclaw plugins install -l ./
|
|
432
|
+
```
|
|
433
|
+
|
|
311
434
|
See [CLAUDE.md](./CLAUDE.md) for version update SOP and contribution guide.
|
|
312
435
|
|
|
313
436
|
---
|