scientify 2.1.0 → 3.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (186) hide show
  1. package/README.en.md +21 -1
  2. package/README.md +27 -0
  3. package/dist/index.d.ts.map +1 -1
  4. package/dist/index.js +2 -77
  5. package/dist/index.js.map +1 -1
  6. package/dist/src/cli/research.d.ts.map +1 -1
  7. package/dist/src/cli/research.js +47 -23
  8. package/dist/src/cli/research.js.map +1 -1
  9. package/dist/src/commands/metabolism-status.d.ts.map +1 -1
  10. package/dist/src/commands/metabolism-status.js +5 -25
  11. package/dist/src/commands/metabolism-status.js.map +1 -1
  12. package/dist/src/commands.d.ts +8 -8
  13. package/dist/src/commands.d.ts.map +1 -1
  14. package/dist/src/commands.js +230 -243
  15. package/dist/src/commands.js.map +1 -1
  16. package/dist/src/release-gate.d.ts +14 -0
  17. package/dist/src/release-gate.d.ts.map +1 -0
  18. package/dist/src/release-gate.js +124 -0
  19. package/dist/src/release-gate.js.map +1 -0
  20. package/dist/src/templates/bootstrap.d.ts.map +1 -1
  21. package/dist/src/templates/bootstrap.js +157 -94
  22. package/dist/src/templates/bootstrap.js.map +1 -1
  23. package/dist/src/types.d.ts +2 -10
  24. package/dist/src/types.d.ts.map +1 -1
  25. package/openclaw.plugin.json +11 -17
  26. package/package.json +2 -3
  27. package/skills/algorithm-selection/SKILL.md +103 -0
  28. package/skills/algorithm-selection/references/candidate-template.md +13 -0
  29. package/skills/algorithm-selection/references/selection-template.md +39 -0
  30. package/skills/artifact-review/SKILL.md +146 -0
  31. package/skills/artifact-review/references/release-gate-template.md +40 -0
  32. package/skills/artifact-review/references/review-checklist.md +45 -0
  33. package/skills/artifact-review/references/style-review-checklist.md +30 -0
  34. package/skills/baseline-runner/SKILL.md +103 -0
  35. package/skills/baseline-runner/references/baseline-matrix-template.md +9 -0
  36. package/skills/baseline-runner/references/baseline-report-template.md +25 -0
  37. package/skills/dataset-validate/SKILL.md +104 -0
  38. package/skills/dataset-validate/references/data-validation-template.md +38 -0
  39. package/skills/figure-standardize/SKILL.md +110 -0
  40. package/skills/figure-standardize/references/caption-template.md +12 -0
  41. package/skills/figure-standardize/references/figure-placement-template.md +30 -0
  42. package/skills/figure-standardize/references/figure-style-guide.md +36 -0
  43. package/skills/idea-generation/SKILL.md +20 -44
  44. package/skills/idea-generation/references/code-mapping.md +3 -3
  45. package/skills/idea-generation/references/idea-template.md +1 -1
  46. package/skills/idea-generation/references/reading-long-papers.md +3 -3
  47. package/skills/metabolism/SKILL.md +80 -36
  48. package/skills/paper-download/SKILL.md +61 -0
  49. package/skills/release-layout/SKILL.md +73 -0
  50. package/skills/release-layout/references/page-structure.md +14 -0
  51. package/skills/research-collect/SKILL.md +41 -111
  52. package/skills/research-experiment/SKILL.md +20 -12
  53. package/skills/research-implement/SKILL.md +10 -11
  54. package/skills/research-pipeline/SKILL.md +23 -31
  55. package/skills/research-plan/SKILL.md +7 -11
  56. package/skills/research-review/SKILL.md +21 -22
  57. package/skills/research-survey/SKILL.md +28 -25
  58. package/skills/write-paper/SKILL.md +252 -0
  59. package/skills/write-paper/references/boundary-notes-template.md +34 -0
  60. package/skills/write-paper/references/claim-inventory-template.md +32 -0
  61. package/skills/write-paper/references/evidence-contract.md +57 -0
  62. package/skills/write-paper/references/figure-callout-template.md +38 -0
  63. package/skills/write-paper/references/figures-manifest-template.md +44 -0
  64. package/skills/write-paper/references/latex/README.md +22 -0
  65. package/skills/write-paper/references/latex/build_paper.sh +41 -0
  66. package/skills/write-paper/references/latex/manuscript.tex +39 -0
  67. package/skills/write-paper/references/latex/references.bib +10 -0
  68. package/skills/write-paper/references/latex/sections/ablations.tex +3 -0
  69. package/skills/write-paper/references/latex/sections/abstract.tex +3 -0
  70. package/skills/write-paper/references/latex/sections/conclusion.tex +3 -0
  71. package/skills/write-paper/references/latex/sections/discussion_scope.tex +7 -0
  72. package/skills/write-paper/references/latex/sections/experimental_protocol.tex +3 -0
  73. package/skills/write-paper/references/latex/sections/introduction.tex +3 -0
  74. package/skills/write-paper/references/latex/sections/main_results.tex +9 -0
  75. package/skills/write-paper/references/latex/sections/method_system.tex +3 -0
  76. package/skills/write-paper/references/latex/sections/problem_setup.tex +3 -0
  77. package/skills/write-paper/references/latex/sections/related_work.tex +3 -0
  78. package/skills/write-paper/references/paper-template.md +155 -0
  79. package/skills/write-paper/references/paragraph-contract.md +139 -0
  80. package/skills/write-paper/references/paragraph-examples.md +171 -0
  81. package/skills/write-paper/references/style-banlist.md +81 -0
  82. package/skills/write-review-paper/SKILL.md +22 -16
  83. package/skills/write-review-paper/references/note-template.md +1 -1
  84. package/skills/write-review-paper/references/survey-template.md +1 -1
  85. package/dist/src/hooks/research-mode.d.ts +0 -22
  86. package/dist/src/hooks/research-mode.d.ts.map +0 -1
  87. package/dist/src/hooks/research-mode.js +0 -35
  88. package/dist/src/hooks/research-mode.js.map +0 -1
  89. package/dist/src/hooks/scientify-cron-autofill.d.ts +0 -15
  90. package/dist/src/hooks/scientify-cron-autofill.d.ts.map +0 -1
  91. package/dist/src/hooks/scientify-cron-autofill.js +0 -156
  92. package/dist/src/hooks/scientify-cron-autofill.js.map +0 -1
  93. package/dist/src/hooks/scientify-signature.d.ts +0 -21
  94. package/dist/src/hooks/scientify-signature.d.ts.map +0 -1
  95. package/dist/src/hooks/scientify-signature.js +0 -150
  96. package/dist/src/hooks/scientify-signature.js.map +0 -1
  97. package/dist/src/knowledge-state/project.d.ts +0 -13
  98. package/dist/src/knowledge-state/project.d.ts.map +0 -1
  99. package/dist/src/knowledge-state/project.js +0 -88
  100. package/dist/src/knowledge-state/project.js.map +0 -1
  101. package/dist/src/knowledge-state/render.d.ts +0 -63
  102. package/dist/src/knowledge-state/render.d.ts.map +0 -1
  103. package/dist/src/knowledge-state/render.js +0 -368
  104. package/dist/src/knowledge-state/render.js.map +0 -1
  105. package/dist/src/knowledge-state/store.d.ts +0 -19
  106. package/dist/src/knowledge-state/store.d.ts.map +0 -1
  107. package/dist/src/knowledge-state/store.js +0 -978
  108. package/dist/src/knowledge-state/store.js.map +0 -1
  109. package/dist/src/knowledge-state/types.d.ts +0 -182
  110. package/dist/src/knowledge-state/types.d.ts.map +0 -1
  111. package/dist/src/knowledge-state/types.js +0 -2
  112. package/dist/src/knowledge-state/types.js.map +0 -1
  113. package/dist/src/literature/subscription-state.d.ts +0 -112
  114. package/dist/src/literature/subscription-state.d.ts.map +0 -1
  115. package/dist/src/literature/subscription-state.js +0 -696
  116. package/dist/src/literature/subscription-state.js.map +0 -1
  117. package/dist/src/research-subscriptions/constants.d.ts +0 -16
  118. package/dist/src/research-subscriptions/constants.d.ts.map +0 -1
  119. package/dist/src/research-subscriptions/constants.js +0 -59
  120. package/dist/src/research-subscriptions/constants.js.map +0 -1
  121. package/dist/src/research-subscriptions/cron-client.d.ts +0 -8
  122. package/dist/src/research-subscriptions/cron-client.d.ts.map +0 -1
  123. package/dist/src/research-subscriptions/cron-client.js +0 -81
  124. package/dist/src/research-subscriptions/cron-client.js.map +0 -1
  125. package/dist/src/research-subscriptions/delivery.d.ts +0 -10
  126. package/dist/src/research-subscriptions/delivery.d.ts.map +0 -1
  127. package/dist/src/research-subscriptions/delivery.js +0 -82
  128. package/dist/src/research-subscriptions/delivery.js.map +0 -1
  129. package/dist/src/research-subscriptions/handlers.d.ts +0 -6
  130. package/dist/src/research-subscriptions/handlers.d.ts.map +0 -1
  131. package/dist/src/research-subscriptions/handlers.js +0 -204
  132. package/dist/src/research-subscriptions/handlers.js.map +0 -1
  133. package/dist/src/research-subscriptions/parse.d.ts +0 -11
  134. package/dist/src/research-subscriptions/parse.d.ts.map +0 -1
  135. package/dist/src/research-subscriptions/parse.js +0 -492
  136. package/dist/src/research-subscriptions/parse.js.map +0 -1
  137. package/dist/src/research-subscriptions/prompt.d.ts +0 -5
  138. package/dist/src/research-subscriptions/prompt.d.ts.map +0 -1
  139. package/dist/src/research-subscriptions/prompt.js +0 -347
  140. package/dist/src/research-subscriptions/prompt.js.map +0 -1
  141. package/dist/src/research-subscriptions/types.d.ts +0 -66
  142. package/dist/src/research-subscriptions/types.d.ts.map +0 -1
  143. package/dist/src/research-subscriptions/types.js +0 -2
  144. package/dist/src/research-subscriptions/types.js.map +0 -1
  145. package/dist/src/research-subscriptions.d.ts +0 -2
  146. package/dist/src/research-subscriptions.d.ts.map +0 -1
  147. package/dist/src/research-subscriptions.js +0 -2
  148. package/dist/src/research-subscriptions.js.map +0 -1
  149. package/dist/src/services/auto-updater.d.ts +0 -15
  150. package/dist/src/services/auto-updater.d.ts.map +0 -1
  151. package/dist/src/services/auto-updater.js +0 -188
  152. package/dist/src/services/auto-updater.js.map +0 -1
  153. package/dist/src/tools/arxiv-download.d.ts +0 -24
  154. package/dist/src/tools/arxiv-download.d.ts.map +0 -1
  155. package/dist/src/tools/arxiv-download.js +0 -177
  156. package/dist/src/tools/arxiv-download.js.map +0 -1
  157. package/dist/src/tools/github-search-tool.d.ts +0 -25
  158. package/dist/src/tools/github-search-tool.d.ts.map +0 -1
  159. package/dist/src/tools/github-search-tool.js +0 -114
  160. package/dist/src/tools/github-search-tool.js.map +0 -1
  161. package/dist/src/tools/openreview-lookup.d.ts +0 -31
  162. package/dist/src/tools/openreview-lookup.d.ts.map +0 -1
  163. package/dist/src/tools/openreview-lookup.js +0 -414
  164. package/dist/src/tools/openreview-lookup.js.map +0 -1
  165. package/dist/src/tools/paper-browser.d.ts +0 -23
  166. package/dist/src/tools/paper-browser.d.ts.map +0 -1
  167. package/dist/src/tools/paper-browser.js +0 -121
  168. package/dist/src/tools/paper-browser.js.map +0 -1
  169. package/dist/src/tools/scientify-cron.d.ts +0 -63
  170. package/dist/src/tools/scientify-cron.d.ts.map +0 -1
  171. package/dist/src/tools/scientify-cron.js +0 -265
  172. package/dist/src/tools/scientify-cron.js.map +0 -1
  173. package/dist/src/tools/scientify-literature-state.d.ts +0 -303
  174. package/dist/src/tools/scientify-literature-state.d.ts.map +0 -1
  175. package/dist/src/tools/scientify-literature-state.js +0 -957
  176. package/dist/src/tools/scientify-literature-state.js.map +0 -1
  177. package/dist/src/tools/unpaywall-download.d.ts +0 -21
  178. package/dist/src/tools/unpaywall-download.d.ts.map +0 -1
  179. package/dist/src/tools/unpaywall-download.js +0 -169
  180. package/dist/src/tools/unpaywall-download.js.map +0 -1
  181. package/dist/src/tools/workspace.d.ts +0 -32
  182. package/dist/src/tools/workspace.d.ts.map +0 -1
  183. package/dist/src/tools/workspace.js +0 -69
  184. package/dist/src/tools/workspace.js.map +0 -1
  185. package/skills/metabolism-init/SKILL.md +0 -80
  186. package/skills/research-subscription/SKILL.md +0 -119
@@ -0,0 +1,73 @@
1
+ ---
2
+ name: release-layout
3
+ description: "Use this when the user wants to improve README, docs pages, or microsites so a new reader can understand what the project is, how to use it, what artifacts exist, and what the scope boundaries are within one screen."
4
+ metadata:
5
+ {
6
+ "openclaw":
7
+ {
8
+ "emoji": "🪄",
9
+ },
10
+ }
11
+ ---
12
+
13
+ # Release Layout
14
+
15
+ **Don't ask permission. Just do it.**
16
+
17
+ Use this skill for outward-facing packaging surfaces such as:
18
+
19
+ - `README.md`
20
+ - `docs/index.html`
21
+ - release page generator scripts
22
+
23
+ This skill improves structure and legibility. It does **not** upgrade the scientific claim on its own.
24
+
25
+ ## Core Goal
26
+
27
+ A first-time reader should understand, within one screen:
28
+
29
+ 1. what this is
30
+ 2. how to use it
31
+ 3. what artifacts it produces
32
+ 4. what the scope boundary is
33
+
34
+ ## Workflow
35
+
36
+ ### Step 1: Detect the Real Edit Target
37
+
38
+ If a page is generated by a script, prefer editing the generator rather than the built HTML.
39
+
40
+ If `review/release_gate.json` exists, read it before polishing release-facing copy.
41
+
42
+ ### Step 2: Audit the First Screen
43
+
44
+ Check whether the hero / opening section answers the four core questions above.
45
+
46
+ ### Step 3: Reshape the Page
47
+
48
+ Prefer this order:
49
+
50
+ 1. hero / product definition
51
+ 2. quick-start or usage path
52
+ 3. artifact map
53
+ 4. evidence / results block
54
+ 5. scope note
55
+ 6. FAQ or next steps
56
+
57
+ Use `references/page-structure.md`.
58
+
59
+ ### Step 4: Clean the Reading Path
60
+
61
+ Reduce:
62
+
63
+ - duplicated claims
64
+ - buried usage instructions
65
+ - unexplained metrics
66
+ - isolated figures without framing text
67
+
68
+ ## Safety Rules
69
+
70
+ 1. Do not hide limitations for the sake of visual polish.
71
+ 2. Do not introduce stronger language than the underlying artifacts support.
72
+ 3. If the result is simulator-only, say that near the top instead of burying it below the fold.
73
+ 4. If the release gate is `HOLD`, stale, or missing for a share-ready artifact set, do not present the project as fully ready to share.
@@ -0,0 +1,14 @@
1
+ # Page Structure
2
+
3
+ Recommended first-screen order:
4
+
5
+ 1. one-line definition
6
+ 2. quick-start
7
+ 3. artifact outputs
8
+ 4. evidence boundary
9
+
10
+ Avoid:
11
+
12
+ - leading with large result claims before the project is defined
13
+ - hiding usage instructions below the fold
14
+ - showing figures without telling the reader what they mean
@@ -14,24 +14,15 @@ metadata:
14
14
 
15
15
  **Don't ask permission. Just do it.**
16
16
 
17
- **Workspace:** `$W` = working directory provided in task parameter.
18
-
19
17
  ## Output Structure
20
18
 
21
19
  ```
22
- $W/
23
- ├── survey/
24
- │ ├── search_terms.json # 检索词列表
25
- │ └── report.md # 最终报告
26
20
  ├── papers/
27
- │ ├── _downloads/ # 原始下载
28
- │ ├── _meta/ # 每篇论文的元数据
29
- └── {arxiv_id}.json
30
- │ └── {direction}/ # 整理后的分类
31
- ├── repos/ # 参考代码仓库(Phase 3)
32
- │ ├── {repo_name_1}/
33
- │ └── {repo_name_2}/
34
- └── prepare_res.md # 仓库选择报告(Phase 3)
21
+ │ ├── {arxiv_id}/ # arXiv 论文源文件
22
+ │ ├── {doi_slug}.pdf # DOI 论文 PDF
23
+ │ └── {direction}/ # 整理后的分类目录
24
+ ├── repos/ # 参考代码仓库(Phase 3)
25
+ └── survey_report.md # 调研报告
35
26
  ```
36
27
 
37
28
  ---
@@ -40,13 +31,11 @@ $W/
40
31
 
41
32
  ### Phase 1: 准备
42
33
 
43
- 确保工作目录结构存在:
44
-
45
34
  ```bash
46
- mkdir -p "$W/survey" "$W/papers/_downloads" "$W/papers/_meta"
35
+ mkdir -p "papers"
47
36
  ```
48
37
 
49
- 生成 4-8 个检索词,保存到 `$W/survey/search_terms.json`。
38
+ 生成 4-8 个检索词。
50
39
 
51
40
  ---
52
41
 
@@ -58,40 +47,21 @@ mkdir -p "$W/survey" "$W/papers/_downloads" "$W/papers/_meta"
58
47
 
59
48
  ```
60
49
  arxiv_search({ query: "<term>", max_results: 30 })
50
+ openalex_search({ query: "<term>", max_results: 20 })
61
51
  ```
62
52
 
63
- #### 2.2 即时筛选
64
-
65
- 对返回的论文**立即**评分(1-5),只保留 ≥4 分的。
53
+ 合并两个来源的结果,按 arXiv ID / DOI 去重。
66
54
 
67
- 评分标准:
68
- - 5分:核心论文,直接研究该主题
69
- - 4分:相关方法或应用
70
- - 3分及以下:跳过
55
+ #### 2.2 筛选
71
56
 
72
- #### 2.3 下载有用论文
57
+ 只看**相关性**——这篇论文是否和研究主题直接相关?
73
58
 
74
- ```
75
- arxiv_download({
76
- arxiv_ids: ["<有用的论文ID>"],
77
- output_dir: "papers/_downloads"
78
- })
79
- ```
59
+ - **相关**:直接研究该主题,或提出了可借鉴的方法 → 保留
60
+ - **不相关**:主题偏离,仅在关键词上有交集 → 跳过
80
61
 
81
- #### 2.4 写入元数据
62
+ #### 2.3 下载论文
82
63
 
83
- 为每篇下载的论文创建元数据文件 `$W/papers/_meta/{arxiv_id}.json`:
84
-
85
- ```json
86
- {
87
- "arxiv_id": "2401.12345",
88
- "title": "...",
89
- "abstract": "...",
90
- "score": 5,
91
- "source_term": "battery RUL prediction",
92
- "downloaded_at": "2024-01-15T10:00:00Z"
93
- }
94
- ```
64
+ /paper-download 的方式下载论文到 `papers/`。
95
65
 
96
66
  **完成一个检索词后,再进行下一个。** 这样避免上下文被大量搜索结果污染。
97
67
 
@@ -101,9 +71,9 @@ arxiv_download({
101
71
 
102
72
  **目标**:为下游 skill(research-survey、research-plan、research-implement)提供可参考的开源实现。
103
73
 
104
- #### 3.1 选择高分论文
74
+ #### 3.1 选择论文
105
75
 
106
- 读取 `$W/papers/_meta/` 下得分 ≥4 的论文,选出 **Top 5** 最相关论文。
76
+ `papers/` 中选出 **Top 5** 最相关论文。
107
77
 
108
78
  #### 3.2 搜索参考仓库
109
79
 
@@ -112,87 +82,47 @@ arxiv_download({
112
82
  - 核心方法名 + 作者名
113
83
  - 论文中提到的数据集名 + 任务名
114
84
 
115
- 使用 `github_search` 工具:
116
- ```javascript
117
- github_search({
118
- query: "{paper_title} implementation",
119
- max_results: 10,
120
- sort: "stars",
121
- language: "python"
122
- })
85
+ ```bash
86
+ gh search repos "{paper_title} implementation" --limit 10 --sort stars --language python
123
87
  ```
124
88
 
125
89
  #### 3.3 筛选与 clone
126
90
 
127
- 对搜索到的仓库,评估:
128
- - Star 数(建议 >100)
129
- - 代码质量(有 README、有 requirements.txt、代码结构清晰)
130
- - 与论文的匹配度
131
-
132
- 选择 **3-5 个**最相关的仓库,clone 到 `$W/repos/`:
91
+ 选择 **3-5 个**最相关的仓库:
133
92
 
134
93
  ```bash
135
- mkdir -p "$W/repos"
136
- cd "$W/repos"
137
- git clone --depth 1 <repo_url>
94
+ mkdir -p "repos"
95
+ git clone --depth 1 <repo_url> "repos/{name}"
138
96
  ```
139
97
 
140
- #### 3.4 写入选择报告
141
-
142
- 创建 `$W/prepare_res.md`:
143
-
144
- ```markdown
145
- # 参考仓库选择
146
-
147
- | 仓库 | 对应论文 | Stars | 选择理由 |
148
- |------|----------|-------|----------|
149
- | repos/{repo_name} | {paper_title} (arxiv:{id}) | {N} | {理由} |
150
-
151
- ## 各仓库关键文件
152
-
153
- ### {repo_name}
154
- - **模型实现**: `model/` 或 `models/`
155
- - **训练脚本**: `train.py` 或 `main.py`
156
- - **数据加载**: `data/` 或 `dataset.py`
157
- - **核心文件**: `{关键文件路径}` — {描述}
158
- ```
159
-
160
- **如果搜不到相关仓库**,在 `prepare_res.md` 中注明"无可用参考仓库",后续 skill 将不依赖代码映射。
98
+ **如果搜不到相关仓库**,跳过本阶段。
161
99
 
162
100
  ---
163
101
 
164
102
  ### Phase 4: 分类整理
165
103
 
166
- 所有检索词和代码搜索完毕后:
167
-
168
- #### 4.1 读取所有元数据
169
-
170
- ```bash
171
- ls $W/papers/_meta/
172
- ```
173
-
174
- 读取所有 `.json` 文件,汇总论文列表。
104
+ 所有检索词完毕后:
175
105
 
176
- #### 4.2 聚类分析
106
+ #### 4.1 聚类分析
177
107
 
178
- 根据论文的标题、摘要、来源检索词,识别 3-6 个研究方向。
108
+ 根据已下载论文的标题和摘要,识别 3-6 个研究方向。
179
109
 
180
- #### 4.3 创建文件夹并移动
110
+ #### 4.2 创建分类目录
181
111
 
182
112
  ```bash
183
- mkdir -p "$W/papers/data-driven"
184
- mv "$W/papers/_downloads/2401.12345" "$W/papers/data-driven/"
113
+ mkdir -p "papers/{direction}"
114
+ mv "papers/2401.12345" "papers/data-driven/"
185
115
  ```
186
116
 
187
117
  ---
188
118
 
189
119
  ### Phase 5: 生成报告
190
120
 
191
- 创建 `$W/survey/report.md`:
121
+ 创建 `survey_report.md`:
192
122
  - 调研概要(检索词数、论文数、方向数)
193
123
  - 各研究方向概述
194
- - Top 10 论文
195
- - **参考仓库摘要**(引用 prepare_res.md)
124
+ - Top 10 论文(标题 + ID + 一句话价值)
125
+ - 参考仓库摘要(如有)
196
126
  - 建议阅读顺序
197
127
 
198
128
  ---
@@ -201,14 +131,14 @@ mv "$W/papers/_downloads/2401.12345" "$W/papers/data-driven/"
201
131
 
202
132
  | 原则 | 说明 |
203
133
  |------|------|
204
- | **增量处理** | 每个检索词独立完成搜索→筛选→下载→写元数据,避免上下文膨胀 |
205
- | **元数据驱动** | 分类基于 `_meta/*.json`,不依赖内存中的大列表 |
206
- | **文件夹即分类** | 聚类结果通过 `papers/{direction}/` 体现,无需额外 JSON |
134
+ | **增量处理** | 每个检索词独立完成搜索→筛选→下载,避免上下文膨胀 |
135
+ | **文件夹即分类** | 聚类结果通过 `papers/{direction}/` 体现 |
207
136
 
208
- ## Tools
137
+ ## Tools / Commands
209
138
 
210
- | Tool | Purpose |
211
- |------|---------|
212
- | `arxiv_search` | 搜索论文(无副作用) |
213
- | `arxiv_download` | 下载 .tex/.pdf(需绝对路径) |
214
- | `github_search` | 搜索参考仓库 |
139
+ | Tool / Command | Purpose |
140
+ |----------------|---------|
141
+ | `arxiv_search` | 搜索 arXiv 论文 |
142
+ | `openalex_search` | 搜索跨学科论文(覆盖更广) |
143
+ | /paper-download | 下载论文(arXiv .tex/PDF、DOI via Unpaywall) |
144
+ | `gh search repos "query"` | 搜索 GitHub 仓库 |
@@ -15,15 +15,14 @@ metadata:
15
15
 
16
16
  **Don't ask permission. Just do it.**
17
17
 
18
- **Workspace:** `$W` = working directory provided in task parameter.
19
18
 
20
19
  ## Prerequisites
21
20
 
22
21
  | File | Source |
23
22
  |------|--------|
24
- | `$W/project/` | /research-implement |
25
- | `$W/plan_res.md` | /research-plan |
26
- | `$W/iterations/judge_v*.md` | /research-review(最后一份 verdict 必须是 PASS) |
23
+ | `project/` | /research-implement |
24
+ | `plan_res.md` | /research-plan |
25
+ | `iterations/judge_v*.md` | /research-review(最后一份 verdict 必须是 PASS) |
27
26
 
28
27
  **验证 PASS:** 读取最新的 `judge_v*.md`,确认 `verdict: PASS`。如果不是,STOP。
29
28
 
@@ -31,8 +30,8 @@ metadata:
31
30
 
32
31
  | File | Content |
33
32
  |------|---------|
34
- | `$W/experiment_res.md` | 完整实验报告(含 full training + 消融 + 补充实验) |
35
- | `$W/experiment_analysis/analysis_{N}.md` | 每轮实验分析报告(迭代过程中产生) |
33
+ | `experiment_res.md` | Full experiment report (full training, ablations, supplementary experiments) with explicit headline metrics, baselines, guardrails, and figure anchors |
34
+ | `experiment_analysis/analysis_{N}.md` | 每轮实验分析报告(迭代过程中产生) |
36
35
 
37
36
  ---
38
37
 
@@ -43,7 +42,7 @@ metadata:
43
42
  修改 epoch 数为 plan_res.md 中指定的正式值。**不要改代码逻辑,只改 epoch。**
44
43
 
45
44
  ```bash
46
- cd $W/project && source .venv/bin/activate
45
+ cd project && source .venv/bin/activate
47
46
  python3 run.py # full epochs
48
47
  ```
49
48
 
@@ -78,7 +77,7 @@ python3 run.py --epochs 2 --ablation no_attention
78
77
 
79
78
  #### 4.1 分析当前结果
80
79
 
81
- 读取当前所有实验结果(full training + 消融),写入分析报告 `$W/experiment_analysis/analysis_{N}.md`:
80
+ 读取当前所有实验结果(full training + 消融),写入分析报告 `experiment_analysis/analysis_{N}.md`:
82
81
 
83
82
  ```markdown
84
83
  # Experiment Analysis Round {N}
@@ -108,7 +107,7 @@ python3 run.py --epochs 2 --ablation no_attention
108
107
  根据分析报告中的计划,修改代码并执行补充实验。**只改实验相关参数/配置,不改核心算法逻辑。**
109
108
 
110
109
  ```bash
111
- cd $W/project && source .venv/bin/activate
110
+ cd project && source .venv/bin/activate
112
111
  python3 run.py --experiment {exp_name}
113
112
  ```
114
113
 
@@ -118,7 +117,7 @@ python3 run.py --experiment {exp_name}
118
117
 
119
118
  ### Step 5: 写入最终实验报告
120
119
 
121
- 汇总所有实验结果(full training + 消融 + 2 轮补充实验),写入 `$W/experiment_res.md`:
120
+ 汇总所有实验结果(full training + 消融 + 2 轮补充实验),写入 `experiment_res.md`:
122
121
 
123
122
  ```markdown
124
123
  # Experiment Report
@@ -129,6 +128,9 @@ python3 run.py --experiment {exp_name}
129
128
  - [RESULT] val_metric={value}
130
129
  - [RESULT] elapsed={value}
131
130
  - [RESULT] device={device}
131
+ - [METRIC] name={headline_metric} value={value} unit={unit} baseline={baseline}
132
+ - [GUARD] name={guard_name} value={value} threshold={threshold} pass={true/false}
133
+ - [FIGURE] file={figure path}
132
134
 
133
135
  > 以上数值来自真实执行输出。
134
136
 
@@ -157,9 +159,14 @@ python3 run.py --experiment {exp_name}
157
159
  | Ours | {value} | — |
158
160
  | {Baseline} | {value} | ... |
159
161
 
162
+ ## Scope / Evidence Boundary
163
+ - baseline: {which baseline is used}
164
+ - protocol / guardrail: {evaluation rule}
165
+ - evidence_type: {simulator / local_runtime / full_runtime}
166
+
160
167
  ### Visualizations
161
- - 训练曲线: `$W/project/figures/training_curve.png`
162
- - {其他可视化}: `$W/project/figures/{name}.png`
168
+ - 训练曲线: `project/figures/training_curve.png`
169
+ - {其他可视化}: `project/figures/{name}.png`
163
170
 
164
171
  ## Conclusions
165
172
  - {key findings from all experiments}
@@ -178,3 +185,4 @@ python3 run.py --experiment {exp_name}
178
185
  4. 如果 full training 失败(OOM 等),调整 batch_size 后重试,不要跳过
179
186
  5. **补充实验迭代必须做 2 轮(Novix Exp Analyzer 机制)** — 第 1 轮针对初始结果,第 2 轮针对补充实验结果
180
187
  6. 补充实验不改核心算法,只改实验配置/参数/可视化代码
188
+ 7. Every headline metric must include a baseline, and every main conclusion must point back to real outputs or figure files
@@ -15,15 +15,14 @@ metadata:
15
15
 
16
16
  **Don't ask permission. Just do it.**
17
17
 
18
- **Workspace:** `$W` = working directory provided in task parameter.
19
18
 
20
19
  ## Prerequisites
21
20
 
22
21
  | File | Source |
23
22
  |------|--------|
24
- | `$W/plan_res.md` | /research-plan |
25
- | `$W/survey_res.md` | /research-survey |
26
- | `$W/repos/` (optional) | reference code |
23
+ | `plan_res.md` | /research-plan |
24
+ | `survey_res.md` | /research-survey |
25
+ | `repos/` (optional) | reference code |
27
26
 
28
27
  **If `plan_res.md` is missing, STOP:** "需要先运行 /research-plan 完成实现计划"
29
28
 
@@ -31,8 +30,8 @@ metadata:
31
30
 
32
31
  | File | Content |
33
32
  |------|---------|
34
- | `$W/project/` | 完整可运行代码 |
35
- | `$W/ml_res.md` | 实现报告(含真实执行结果) |
33
+ | `project/` | 完整可运行代码 |
34
+ | `ml_res.md` | 实现报告(含真实执行结果) |
36
35
 
37
36
  ---
38
37
 
@@ -40,7 +39,7 @@ metadata:
40
39
 
41
40
  ### Step 1: 读取计划
42
41
 
43
- 读取 `$W/plan_res.md`,提取:
42
+ 读取 `plan_res.md`,提取:
44
43
  - 所有组件列表
45
44
  - 数据集信息
46
45
  - 训练参数
@@ -48,7 +47,7 @@ metadata:
48
47
  ### Step 2: 创建项目结构
49
48
 
50
49
  ```
51
- $W/project/
50
+ project/
52
51
  model/ # 模型组件(每个组件一个文件)
53
52
  data/ # 数据加载
54
53
  training/ # 训练循环 + loss
@@ -66,7 +65,7 @@ $W/project/
66
65
 
67
66
  **3b. 数据管道**
68
67
  ```bash
69
- cd $W/project && uv venv .venv && source .venv/bin/activate
68
+ cd project && uv venv .venv && source .venv/bin/activate
70
69
  uv pip install -r requirements.txt
71
70
  python3 -c "from data.dataset import *; print('data OK')"
72
71
  ```
@@ -93,7 +92,7 @@ print(f"[RESULT] device={device}")
93
92
  ### Step 4: 环境搭建 + 执行
94
93
 
95
94
  ```bash
96
- cd $W/project
95
+ cd project
97
96
  uv venv .venv
98
97
  source .venv/bin/activate
99
98
 
@@ -125,7 +124,7 @@ python3 run.py --epochs 2
125
124
 
126
125
  ### Step 6: 写入报告
127
126
 
128
- 写入 `$W/ml_res.md`:
127
+ 写入 `ml_res.md`:
129
128
 
130
129
  ```markdown
131
130
  # Implementation Report
@@ -92,19 +92,11 @@ task 必须以 `/skill-name` 开头(触发 slash command 解析),后续行
92
92
 
93
93
  ---
94
94
 
95
- ## Workspace
96
-
97
- `$W` = agent workspace root (see AGENTS.md for layout).
98
-
99
- ---
100
-
101
95
  ## Step 0: 初始化
102
96
 
103
- `$W` 即当前 agent 的工作目录(AGENTS.md 中定义)。
104
-
105
- 检查 `$W/SOUL.md` 是否包含研究方向信息。如果没有(BOOTSTRAP 未完成),提示用户先完成 BOOTSTRAP 配置。
97
+ 检查 `SOUL.md` 是否包含研究方向信息。如果没有(BOOTSTRAP 未完成),提示用户先完成 BOOTSTRAP 配置。
106
98
 
107
- 确保 `$W` 下存在必要的子目录(如 `survey/`, `papers/` 等)。
99
+ 确保 `papers/`、`knowledge/`、`ideas/`、`experiments/` 目录存在。
108
100
 
109
101
  ---
110
102
 
@@ -114,65 +106,65 @@ task 必须以 `/skill-name` 开头(触发 slash command 解析),后续行
114
106
 
115
107
  ### Phase 1: Literature Survey
116
108
 
117
- **检查:** `$W/papers/_meta/` 目录存在且有 `.json` 文件?
109
+ **检查:** `papers/` 目录存在且有论文文件?
118
110
 
119
111
  **如果缺失,调用 sessions_spawn 工具(然后停止,等待完成通知):**
120
- - task: `"/research-collect\n工作目录: {$W绝对路径}\n研究主题: {从task.json提取}\n请搜索、筛选、下载论文到工作目录的 papers/ 下。"`
112
+ - task: `"/research-collect\n研究主题: {从SOUL.md提取}\n请搜索、筛选、下载论文到工作目录的 papers/ 下。"`
121
113
  - label: `"Research Collect"`
122
114
  - runTimeoutSeconds: `1800`
123
115
 
124
- **验证:** `ls $W/papers/_meta/*.json` 至少有 3 个文件
116
+ **验证:** `ls papers/` 至少有 3 篇论文
125
117
 
126
118
  ---
127
119
 
128
120
  ### Phase 2: Deep Survey
129
121
 
130
- **检查:** `$W/survey_res.md` 存在?
122
+ **检查:** `survey_res.md` 存在?
131
123
 
132
124
  **如果缺失,先读取 Phase 1 摘要(论文数量、方向),然后调用 sessions_spawn 工具(然后停止,等待完成通知):**
133
- - task: `"/research-survey\n工作目录: {$W绝对路径}\n上下文: 已下载 {N} 篇论文,方向包括 {directions}。\n重点论文: {top 3 arxiv_id 和标题}\n请深度分析论文、提取公式,写入 survey_res.md。"`
125
+ - task: `"/research-survey\n上下文: 已下载 {N} 篇论文,方向包括 {directions}。\n重点论文: {top 3 arxiv_id 和标题}\n请深度分析论文、提取公式,写入 survey_res.md。"`
134
126
  - label: `"Deep Survey"`
135
127
  - runTimeoutSeconds: `1800`
136
128
 
137
- **验证:** `$W/survey_res.md` 存在且包含"核心方法对比"表格
129
+ **验证:** `survey_res.md` 存在且包含"核心方法对比"表格
138
130
 
139
131
  ---
140
132
 
141
133
  ### Phase 3: Implementation Plan
142
134
 
143
- **检查:** `$W/plan_res.md` 存在?
135
+ **检查:** `plan_res.md` 存在?
144
136
 
145
137
  **如果缺失,读取 survey_res.md 摘要,然后调用 sessions_spawn 工具(然后停止,等待完成通知):**
146
- - task: `"/research-plan\n工作目录: {$W绝对路径}\n上下文: 调研发现核心方法是 {method},推荐技术路线 {route}。\n关键公式: {1-2个公式}\n请制定实现计划到 plan_res.md。"`
138
+ - task: `"/research-plan\n上下文: 调研发现核心方法是 {method},推荐技术路线 {route}。\n关键公式: {1-2个公式}\n请制定实现计划到 plan_res.md。"`
147
139
  - label: `"Research Plan"`
148
140
  - runTimeoutSeconds: `1800`
149
141
 
150
- **验证:** `$W/plan_res.md` 存在且包含 4 个 section(Dataset/Model/Training/Testing)
142
+ **验证:** `plan_res.md` 存在且包含 4 个 section(Dataset/Model/Training/Testing)
151
143
 
152
144
  ---
153
145
 
154
146
  ### Phase 4: Implementation
155
147
 
156
- **检查:** `$W/ml_res.md` 存在?
148
+ **检查:** `ml_res.md` 存在?
157
149
 
158
150
  **如果缺失,读取 plan_res.md 要点,然后调用 sessions_spawn 工具(然后停止,等待完成通知):**
159
- - task: `"/research-implement\n工作目录: {$W绝对路径}\n上下文:\n- 计划包含 {N} 个组件: {list}\n- 数据集: {dataset}\n- 框架: PyTorch\n请实现代码到 project/,运行 2 epoch 验证,写入 ml_res.md。"`
151
+ - task: `"/research-implement\n上下文:\n- 计划包含 {N} 个组件: {list}\n- 数据集: {dataset}\n- 框架: PyTorch\n请实现代码到 project/,运行 2 epoch 验证,写入 ml_res.md。"`
160
152
  - label: `"Research Implement"`
161
153
  - runTimeoutSeconds: `1800`
162
154
 
163
155
  **验证:**
164
- - `$W/project/run.py` 存在
165
- - `$W/ml_res.md` 包含 `[RESULT]` 行
156
+ - `project/run.py` 存在
157
+ - `ml_res.md` 包含 `[RESULT]` 行
166
158
  - loss 值非 NaN/Inf
167
159
 
168
160
  ---
169
161
 
170
162
  ### Phase 5: Review
171
163
 
172
- **检查:** `$W/iterations/` 下最新 `judge_v*.md` 的 verdict 是否为 PASS?
164
+ **检查:** `iterations/` 下最新 `judge_v*.md` 的 verdict 是否为 PASS?
173
165
 
174
166
  **如果没有 PASS,调用 sessions_spawn 工具(然后停止,等待完成通知):**
175
- - task: `"/research-review\n工作目录: {$W绝对路径}\n上下文:\n- ml_res.md 显示 train_loss={value}\n- 计划在 plan_res.md\n请审查代码,如需修改则迭代修复(最多 3 轮)。"`
167
+ - task: `"/research-review\n上下文:\n- ml_res.md 显示 train_loss={value}\n- 计划在 plan_res.md\n请审查代码,如需修改则迭代修复(最多 3 轮)。"`
176
168
  - label: `"Research Review"`
177
169
  - runTimeoutSeconds: `1800`
178
170
 
@@ -184,14 +176,14 @@ task 必须以 `/skill-name` 开头(触发 slash command 解析),后续行
184
176
 
185
177
  ### Phase 6: Full Experiment
186
178
 
187
- **检查:** `$W/experiment_res.md` 存在?
179
+ **检查:** `experiment_res.md` 存在?
188
180
 
189
181
  **如果缺失,调用 sessions_spawn 工具(然后停止,等待完成通知):**
190
- - task: `"/research-experiment\n工作目录: {$W绝对路径}\n上下文:\n- Review PASS,代码已验证\n- plan_res.md 中指定 full epochs\n请执行完整训练 + 消融实验,写入 experiment_res.md。"`
182
+ - task: `"/research-experiment\n上下文:\n- Review PASS,代码已验证\n- plan_res.md 中指定 full epochs\n请执行完整训练 + 消融实验,写入 experiment_res.md。"`
191
183
  - label: `"Research Experiment"`
192
184
  - runTimeoutSeconds: `1800`
193
185
 
194
- **验证:** `$W/experiment_res.md` 包含 `[RESULT]` 行和消融表格
186
+ **验证:** `experiment_res.md` 包含 `[RESULT]` 行和消融表格
195
187
 
196
188
  ---
197
189
 
@@ -202,9 +194,9 @@ task 必须以 `/skill-name` 开头(触发 slash command 解析),后续行
202
194
  ```
203
195
  研究流程完成!
204
196
  - 论文: {N} 篇分析
205
- - 代码: $W/project/
206
- - 结果: $W/experiment_res.md
207
- - 审查: $W/iterations/ ({N} 轮)
197
+ - 代码: project/
198
+ - 结果: experiment_res.md
199
+ - 审查: iterations/ ({N} 轮)
208
200
  ```
209
201
 
210
202
  ---
@@ -14,17 +14,14 @@ metadata:
14
14
 
15
15
  **Don't ask permission. Just do it.**
16
16
 
17
- **Workspace:** `$W` = working directory provided in task parameter.
18
17
 
19
18
  ## Prerequisites
20
19
 
21
20
  | File | Source |
22
21
  |------|--------|
23
- | `$W/task.json` | /research-pipeline or user |
24
- | `$W/survey_res.md` | /research-survey |
25
- | `$W/notes/paper_*.md` | /research-survey |
26
- | `$W/repos/` | /research-collect Phase 3 |
27
- | `$W/prepare_res.md` | /research-collect Phase 3 |
22
+ | `SOUL.md` | 研究方向和目标 |
23
+ | `survey_res.md` | /research-survey |
24
+ | `knowledge/paper_*.md` | /research-survey |
28
25
 
29
26
  **If `survey_res.md` is missing, STOP:** "需要先运行 /research-survey 完成深度分析"
30
27
 
@@ -32,7 +29,7 @@ metadata:
32
29
 
33
30
  | File | Content |
34
31
  |------|---------|
35
- | `$W/plan_res.md` | 四部分实现计划 |
32
+ | `plan_res.md` | 四部分实现计划 |
36
33
 
37
34
  ---
38
35
 
@@ -41,9 +38,8 @@ metadata:
41
38
  ### Step 1: 读取上下文
42
39
 
43
40
  读取以下文件,理解研究目标和技术方案:
44
- - `$W/task.json` — 研究目标
45
- - `$W/survey_res.md` — 技术路线建议、核心公式、**公式→代码映射表**、参考代码架构摘要
46
- - `$W/prepare_res.md` — 参考仓库列表及关键文件说明
41
+ - `SOUL.md` — 研究方向和目标
42
+ - `survey_res.md` — 技术路线建议、核心公式、方法对比
47
43
 
48
44
  ### Step 2: 参考代码深度分析
49
45
 
@@ -59,7 +55,7 @@ metadata:
59
55
 
60
56
  ### Step 3: 制定四部分计划
61
57
 
62
- 写入 `$W/plan_res.md`:
58
+ 写入 `plan_res.md`:
63
59
 
64
60
  ```markdown
65
61
  # Implementation Plan