npm - scientify - Versions diffs - 1.13.6 → 2.0.0 - Mend

scientify 1.13.6 → 2.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (100) hide show

package/README.en.md +350 -0
package/README.md +148 -358
package/dist/index.d.ts +8 -2
package/dist/index.d.ts.map +1 -1
package/dist/index.js +131 -122
package/dist/index.js.map +1 -1
package/dist/src/cli/research.d.ts +1 -6
package/dist/src/cli/research.d.ts.map +1 -1
package/dist/src/cli/research.js +227 -123
package/dist/src/cli/research.js.map +1 -1
package/dist/src/commands/metabolism-status.d.ts +3 -3
package/dist/src/commands/metabolism-status.d.ts.map +1 -1
package/dist/src/commands/metabolism-status.js +72 -75
package/dist/src/commands/metabolism-status.js.map +1 -1
package/dist/src/commands.d.ts +1 -1
package/dist/src/commands.d.ts.map +1 -1
package/dist/src/commands.js +0 -55
package/dist/src/commands.js.map +1 -1
package/dist/src/hooks/cron-skill-inject.d.ts +6 -7
package/dist/src/hooks/cron-skill-inject.d.ts.map +1 -1
package/dist/src/hooks/cron-skill-inject.js +6 -15
package/dist/src/hooks/cron-skill-inject.js.map +1 -1
package/dist/src/hooks/research-mode.d.ts +1 -1
package/dist/src/hooks/research-mode.d.ts.map +1 -1
package/dist/src/hooks/research-mode.js +24 -101
package/dist/src/hooks/research-mode.js.map +1 -1
package/dist/src/hooks/scientify-signature.d.ts +1 -1
package/dist/src/hooks/scientify-signature.d.ts.map +1 -1
package/dist/src/hooks/scientify-signature.js +2 -5
package/dist/src/hooks/scientify-signature.js.map +1 -1
package/dist/src/knowledge-state/render.d.ts +1 -9
package/dist/src/knowledge-state/render.d.ts.map +1 -1
package/dist/src/knowledge-state/render.js +33 -187
package/dist/src/knowledge-state/render.js.map +1 -1
package/dist/src/knowledge-state/store.d.ts.map +1 -1
package/dist/src/knowledge-state/store.js +65 -1100
package/dist/src/knowledge-state/store.js.map +1 -1
package/dist/src/knowledge-state/types.d.ts +0 -76
package/dist/src/knowledge-state/types.d.ts.map +1 -1
package/dist/src/literature/subscription-state.d.ts +0 -2
package/dist/src/literature/subscription-state.d.ts.map +1 -1
package/dist/src/literature/subscription-state.js +7 -1375
package/dist/src/literature/subscription-state.js.map +1 -1
package/dist/src/research-subscriptions/constants.d.ts +1 -1
package/dist/src/research-subscriptions/constants.js +1 -1
package/dist/src/research-subscriptions/cron-client.d.ts +1 -1
package/dist/src/research-subscriptions/cron-client.d.ts.map +1 -1
package/dist/src/research-subscriptions/delivery.d.ts +1 -1
package/dist/src/research-subscriptions/delivery.d.ts.map +1 -1
package/dist/src/research-subscriptions/handlers.d.ts +1 -1
package/dist/src/research-subscriptions/handlers.d.ts.map +1 -1
package/dist/src/research-subscriptions/handlers.js +10 -20
package/dist/src/research-subscriptions/handlers.js.map +1 -1
package/dist/src/research-subscriptions/parse.d.ts.map +1 -1
package/dist/src/research-subscriptions/parse.js +0 -25
package/dist/src/research-subscriptions/parse.js.map +1 -1
package/dist/src/research-subscriptions/prompt.d.ts +1 -1
package/dist/src/research-subscriptions/prompt.d.ts.map +1 -1
package/dist/src/research-subscriptions/prompt.js +195 -244
package/dist/src/research-subscriptions/prompt.js.map +1 -1
package/dist/src/research-subscriptions/types.d.ts +1 -3
package/dist/src/research-subscriptions/types.d.ts.map +1 -1
package/dist/src/templates/bootstrap.d.ts.map +1 -1
package/dist/src/templates/bootstrap.js +32 -19
package/dist/src/templates/bootstrap.js.map +1 -1
package/dist/src/tools/arxiv-download.d.ts +1 -2
package/dist/src/tools/arxiv-download.d.ts.map +1 -1
package/dist/src/tools/arxiv-search.d.ts +1 -2
package/dist/src/tools/arxiv-search.d.ts.map +1 -1
package/dist/src/tools/github-search-tool.d.ts +1 -2
package/dist/src/tools/github-search-tool.d.ts.map +1 -1
package/dist/src/tools/openalex-search.d.ts +1 -2
package/dist/src/tools/openalex-search.d.ts.map +1 -1
package/dist/src/tools/openreview-lookup.d.ts +1 -2
package/dist/src/tools/openreview-lookup.d.ts.map +1 -1
package/dist/src/tools/paper-browser.d.ts +1 -2
package/dist/src/tools/paper-browser.d.ts.map +1 -1
package/dist/src/tools/result.d.ts +3 -5
package/dist/src/tools/result.d.ts.map +1 -1
package/dist/src/tools/result.js +5 -7
package/dist/src/tools/result.js.map +1 -1
package/dist/src/tools/scientify-cron.d.ts +4 -11
package/dist/src/tools/scientify-cron.d.ts.map +1 -1
package/dist/src/tools/scientify-cron.js +19 -524
package/dist/src/tools/scientify-cron.js.map +1 -1
package/dist/src/tools/scientify-literature-state.d.ts +1 -76
package/dist/src/tools/scientify-literature-state.d.ts.map +1 -1
package/dist/src/tools/scientify-literature-state.js +46 -363
package/dist/src/tools/scientify-literature-state.js.map +1 -1
package/dist/src/tools/unpaywall-download.d.ts +1 -2
package/dist/src/tools/unpaywall-download.d.ts.map +1 -1
package/dist/src/types.d.ts +16 -0
package/dist/src/types.d.ts.map +1 -0
package/dist/src/types.js +2 -0
package/dist/src/types.js.map +1 -0
package/openclaw.plugin.json +4 -2
package/package.json +1 -1
package/skills/metabolism/SKILL.md +2 -0
package/skills/research-subscription/SKILL.md +1 -29
package/README.zh.md +0 -494

package/README.en.md ADDED Viewed

@@ -0,0 +1,350 @@
+<h1 align="center">Scientify</h1>
+<p align="center">
+  <em>Continuous Knowledge Metabolism for AI Research</em>
+</p>
+<p align="center">
+  <a href="https://www.npmjs.com/package/scientify"><img src="https://img.shields.io/npm/v/scientify?style=for-the-badge&logo=npm&logoColor=white" alt="npm version"></a>
+  <a href="https://github.com/tsingyuai/scientify"><img src="https://img.shields.io/github/stars/tsingyuai/scientify?style=for-the-badge&logo=github" alt="GitHub stars"></a>
+  <a href="LICENSE"><img src="https://img.shields.io/badge/License-MIT-blue.svg?style=for-the-badge" alt="MIT License"></a>
+  <a href="https://github.com/openclaw/openclaw"><img src="https://img.shields.io/badge/OpenClaw-plugin-00FF9F?style=for-the-badge" alt="OpenClaw plugin"></a>
+</p>
+<p align="center">
+  <a href="https://scientify.tech">Website</a> · <a href="./README.md">中文</a> · <a href="https://github.com/tsingyuai/scientify/issues">Issues</a>
+</p>
+---
+## What It Does
+> [!IMPORTANT]
+> Scientify is not another "ask once, answer once" AI tool. It acts like a real research partner — **continuously thinking, accumulating, and delivering**.
+### 1. Metabolism: Continuous Thinking, Not One-Shot Answers
+Existing AI research tools work in **batch mode** — give it a question, run a pipeline, get a report, done. Ask the same direction next time, it starts from scratch. Running 10 times is no different from running once.
+But human researchers don't work this way. You read, run experiments, and think every day. Yesterday's failure changes today's reading; last week's conversation changes this week's experiment design.
+Scientify adopts a **metabolism mode** — continuously ingesting, digesting, accumulating, and re-ingesting:
+- **Continuous ingestion**: Automatically tracks frontier papers daily, no manual trigger needed
+- **Digestion & accumulation**: Links new knowledge with existing accumulation, writes to persistent knowledge base
+- **Hypothesis evolution**: Discards ineffective hypotheses, evolves effective paths — every failure feeds the next round
+- **Proactive delivery**: Automatically verifies noteworthy findings and pushes results to you
+The longer you use it, the deeper its research goes.
+<p align="center">
+  <img src="docs/assets/showcase/3.png" width="50%" alt="Scientify proactively pushes research progress via Feishu">
+  <br>
+  <sub>Scientify proactively pushes latest findings to researchers via Feishu, with insights drawn from its knowledge base</sub>
+</p>
+### 2. End-to-End Autonomous Research: Achieving SOTA-Level Results
+Give it a research topic, and it completes the entire study — producing new algorithms that outperform existing literature.
+Driven by multi-agent iteration: the orchestrator holds hypotheses and all accumulated knowledge, only dispatching — never writing code itself; each round spawns independent sub-agents for implementation, review, and experimentation; every failure is distilled into experience for the next round, refining hypotheses until a superior method is discovered.
+### Showcase: Autonomously Discovered the KV2 Algorithm with Field-Leading Performance
+> **Objective**: For long-context LLM inference, design a strategy that simultaneously reduces time-to-first-token latency and per-request communication volume.
+>
+> Scientify autonomously completed literature survey, hypothesis generation, code implementation, and experimental validation, proposing the **KV2 algorithm**. Compared to existing research, both TTFT p95 and bytes/request were reduced to varying degrees, achieving SOTA-level performance.
+<p align="center">
+  <img src="docs/assets/showcase/1.png" width="80%" alt="KV2 algorithm experimental results">
+  <br>
+  <sub>KV2 algorithm experimental results on first-token latency and communication volume</sub>
+</p>
+<p align="center">
+  <img src="docs/assets/showcase/2.png" width="80%" alt="KV2 comparison with existing methods">
+  <br>
+  <sub>KV2 SOTA comparison with existing methods</sub>
+</p>
+---
+## Architecture
+```
+┌─────────────────────────────────────────────────────────────┐
+│  Researcher                                                  │
+│  Conversation · Feed materials · Judge hypotheses            │
+└──────────────┬──────────────────────────────┬───────────────┘
+               ↓                              ↓
+┌──────────────────────────┐   ┌──────────────────────────────┐
+│  Agent Layer              │   │  Knowledge Base (persistent)  │
+│                          │   │                              │
+│  Heartbeat  Daily wakeup │←→│  _index.md                   │
+│  Reflection Cross-domain │   │  topic-*.md                  │
+│  Pipeline   Hypothesis   │   │  hypotheses/                 │
+│             verification │   │  experiments/                │
+└──────────┬───────────────┘   │  conversations/              │
+           ↓                   │                              │
+┌──────────────────────────┐   │  Markdown files · Git-managed│
+│  Tool Layer               │   │  Fully auditable · You can   │
+│                          │──→│  edit them too               │
+│  arxiv_search            │   └──────────────────────────────┘
+│  openalex_search         │
+│  github_search           │
+│  paper_browser           │
+│  code_executor           │
+└──────────────────────────┘
+```
+Four components, each with a clear role:
+### Researcher
+You are part of the system. Inject judgment through conversation, feed materials, confirm or reject hypotheses. Your participation makes the metabolism direction more accurate and research hypotheses more precise.
+### Agent Layer
+Three loops drive the metabolism:
+| Agent | What it does | Trigger |
+|-------|-------------|---------|
+| **Heartbeat** | Tracks frontier papers daily; when it finds relevant work, autonomously verifies and pushes results to you | Scheduled auto-wakeup |
+| **Reflection** | Cross-domain exploration — connects knowledge across different topics, discovers unexpected links | Heartbeat / Researcher |
+| **Pipeline** | End-to-end research execution — literature survey → deep analysis → implementation → review → experiment | Researcher / Reflection |
+Pipeline internally uses multi-agent iteration: the orchestrator holds hypotheses and spawns sub-agents for implementation (`implement`), review (`review`), and experimentation (`experiment`). Each failure is distilled into experience; hypotheses are refined with each round.
+### Tool Layer
+The agents' hands and eyes:
+| Tool | Capability |
+|------|-----------|
+| `arxiv_search` / `openalex_search` | Search academic papers (arXiv + cross-disciplinary) |
+| `github_search` | Search open-source code implementations |
+| `paper_browser` | Paginated paper reading, avoids context overflow |
+| `code_executor` | Execute experiment code in `uv`-isolated environment |
+> Scientify runs on [OpenClaw](https://github.com/openclaw/openclaw), natively leveraging the platform's MCP servers (Slack / Feishu push), browser automation (paywalled paper downloads), multi-session concurrency (parallel multi-direction research), and more.
+### Knowledge Base
+All accumulation is persisted as Markdown files, Git-managed, every change is traceable. You and the agents read and write the same files:
+```
+knowledge_state/
+├── _index.md              # Global research index
+├── topic-*.md             # Knowledge organized by topic
+├── hypotheses/            # Hypothesis evolution records
+├── experiments/           # Experiment results & analysis
+├── paper_notes/           # Per-paper deep reading records
+└── logs/                  # Metabolism run logs
+```
+---
+## Prerequisites
+- **Node.js** >= 18
+- **Python 3** + **uv** (for ML code execution)
+- **git**
+---
+## Install OpenClaw
+```bash
+# Install OpenClaw globally
+pnpm add -g openclaw    # or: npm install -g openclaw
+# Run onboarding wizard (configures model provider, API key, workspace)
+openclaw onboard
+# Start the gateway (runs the WebUI server)
+openclaw gateway
+```
+After `openclaw gateway`, the WebUI is available at **http://127.0.0.1:18789/** (default port).
+> **Proxy users:** If you have `http_proxy` set, access the WebUI with `--noproxy 127.0.0.1` or configure your browser accordingly.
+---
+## Install Scientify
+### From npm (recommended)
+```bash
+openclaw plugins install scientify
+```
+The plugin installs to `~/.openclaw/extensions/scientify/` and is automatically enabled.
+### From source (development)
+```bash
+git clone https://github.com/tsingyuai/scientify.git
+cd scientify && pnpm install && pnpm build
+# Link as dev plugin
+openclaw plugins install -l ./
+```
+### Verify installation
+```bash
+openclaw plugins list
+# Should show: scientify (enabled)
+```
+After installation, **restart the gateway** to load the plugin:
+```bash
+# Stop the running gateway (Ctrl+C), then:
+openclaw gateway
+```
+---
+## Usage via WebUI
+### 1. Open the WebUI
+Navigate to **http://127.0.0.1:18789/** in your browser.
+### 2. Start a research task
+Type a research prompt in the chat. Scientify skills are auto-matched by the LLM:
+```
+Research "transformer efficiency" and generate some innovative ideas
+```
+Or invoke a specific skill directly with a slash command:
+```
+/research-pipeline
+/research-collect
+/idea-generation
+```
+### 3. Monitor sub-agent progress
+When the orchestrator spawns sub-agents, you'll see:
+- **Spawn notification** — "Phase 1: Literature Survey started"
+- **Completion announcement** — automatic message when the sub-agent finishes
+- **Progress updates** — the orchestrator verifies outputs and advances to the next phase
+Check status anytime:
+```
+/research-status
+```
+### 4. Manage projects
+```
+/projects              # List all projects
+/project-switch <id>   # Switch to a different project
+/papers                # List downloaded papers
+/ideas                 # List generated ideas
+```
+---
+## Skills & Tools
+<details>
+<summary><b>Pipeline Skills (LLM-powered)</b></summary>
+| Skill | Slash Command | Description |
+|-------|---------------|-------------|
+| **research-pipeline** | `/research-pipeline` | Orchestrator. Spawns sub-agents for each phase, verifies outputs between steps. |
+| **research-collect** | `/research-collect` | Search arXiv → filter → download .tex sources → cluster → generate survey report. |
+| **research-survey** | `/research-survey` | Deep analysis of papers: extract formulas, map to code, produce method comparison table. |
+| **research-plan** | `/research-plan` | Create 4-part implementation plan (Dataset/Model/Training/Testing) from survey results. |
+| **research-implement** | `/research-implement` | Implement ML code from plan, run 2-epoch validation with `uv` venv isolation. |
+| **research-review** | `/research-review` | Review implementation. Iterates fix → rerun → review up to 3 times. |
+| **research-experiment** | `/research-experiment` | Full training + ablation experiments. Requires review PASS. |
+| **idea-generation** | `/idea-generation` | Generate 5 innovative research ideas from a topic, select and enhance the best one. |
+</details>
+<details>
+<summary><b>Standalone Skills</b></summary>
+| Skill | Description |
+|-------|-------------|
+| **write-review-paper** | Draft a review/survey paper from project research outputs. |
+| **research-subscription** | Create/list/remove scheduled Scientify jobs via `scientify_cron_job` (research digests or plain reminders). |
+</details>
+<details>
+<summary><b>Tools (available to LLM)</b></summary>
+| Tool | Description |
+|------|-------------|
+| `arxiv_search` | Search arXiv papers. Returns metadata (title, authors, abstract, ID). Supports sorting by relevance/date and date filtering. |
+| `arxiv_download` | Batch download papers by arXiv ID. Prefers .tex source files (PDF fallback). |
+| `openalex_search` | Search cross-disciplinary academic papers via OpenAlex API. Returns DOI, authors, citation count, OA status. |
+| `openreview_lookup` | Lookup OpenReview evidence by title/ID/forum. Returns decision, review rating/confidence aggregates, and review summaries. |
+| `unpaywall_download` | Download open access PDFs by DOI via Unpaywall API. Non-OA papers are silently skipped. |
+| `github_search` | Search GitHub repositories. Returns repo name, description, stars, URL. Supports language filtering and sorting. |
+| `paper_browser` | Paginated browsing of large paper files (.tex/.md) to avoid context overflow. |
+| `scientify_cron_job` | Manage scheduled Scientify jobs (`upsert`/`list`/`remove`). |
+| `scientify_literature_state` | Persistent incremental state for subscriptions: dedupe, record, feedback, and status inspection. |
+</details>
+<details>
+<summary><b>Commands (direct, no LLM)</b></summary>
+| Command | Description |
+|---------|-------------|
+| `/research-status` | Show workspace status and active project |
+| `/papers` | List downloaded papers with metadata |
+| `/ideas` | List generated ideas |
+| `/projects` | List all projects |
+| `/project-switch <id>` | Switch active project |
+| `/project-delete <id>` | Delete a project |
+| `/research-subscribe ...` | Create/update scheduled Scientify jobs |
+| `/research-subscriptions` | Show your scheduled Scientify jobs |
+| `/research-unsubscribe [job-id]` | Remove your scheduled Scientify jobs |
+</details>
+---
+## Known Limitations
+- **Sub-agent timeout**: Each sub-agent has a 30-minute timeout. Complex literature surveys may need longer.
+- **GPU/Sandbox**: Code execution runs on host by default. OpenClaw sandbox does not support GPU passthrough yet.
+- **Model dependency**: Research quality depends heavily on the LLM model used. Claude Opus 4.5+ or GPT-5+ recommended.
+---
+## Development
+```bash
+git clone https://github.com/tsingyuai/scientify.git
+cd scientify
+pnpm install
+pnpm build          # Build TypeScript
+pnpm dev            # Watch mode
+# Link to OpenClaw for testing
+openclaw plugins install -l ./
+```
+See [CLAUDE.md](./CLAUDE.md) for version update SOP and contribution guide.
+---
+## License
+MIT
+## Author
+tsingyuai