academic-army 0.1.1 → 0.1.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -18,11 +18,11 @@ Parts that require fine-grained research should mainly be handled by skills that
18
18
 
19
19
  First use three planning skills to produce three AI-facing Markdown artifacts:
20
20
 
21
- | Step | Artifact | Role |
22
- |---|---|---|
23
- | `academic-army-architect` | `paper_blueprint.md` | The strategic paper blueprint that fixes the paper identity, target venue posture, claims, contribution boundary, candidate method space, evidence needs, and downstream constraints. |
24
- | `academic-army-experiment-plan` | `experiment_plan.md` | The experiment strategy that maps paper claims to evidence, datasets or workloads, metrics, baselines, ablations, robustness checks, and reviewer-facing validation needs. |
25
- | `academic-army-coding-plan` | `coding_plan.md` | The implementation contract that turns the blueprint and experiment plan into logical module boundaries, interface and entrypoint semantics, harnesses, testing categories, raw-result artifact schemas, and method-freeze rules. |
21
+ | Step | Artifact | Role |
22
+ | ------------------------------- | -------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
23
+ | `academic-army-architect` | `paper_blueprint.md` | The strategic paper blueprint that fixes the paper identity, target venue posture, claims, contribution boundary, candidate method space, evidence needs, and downstream constraints. |
24
+ | `academic-army-experiment-plan` | `experiment_plan.md` | The experiment strategy that maps paper claims to evidence, datasets or workloads, metrics, baselines, ablations, robustness checks, and reviewer-facing validation needs. |
25
+ | `academic-army-coding-plan` | `coding_plan.md` | The implementation contract that turns the blueprint and experiment plan into logical module boundaries, interface and entrypoint semantics, harnesses, testing categories, raw-result artifact schemas, and method-freeze rules. |
26
26
 
27
27
  Each planning skill also writes a Chinese `*.explain.md` companion for human review, but the development runner consumes the three English Markdown files above.
28
28
 
@@ -119,16 +119,16 @@ Find the closest papers to this research idea, compare their methods, and return
119
119
 
120
120
  ## Project Structure
121
121
 
122
- | Path | Purpose |
123
- |---|---|
124
- | `agent-forge.yaml` | Agent and team wiring. |
125
- | `install_mcp.py` | Installs the project MCP server into Codex for direct skill runs. |
126
- | `mcp-server/` | Local stdio MCP implementation that exposes `deepresearch`. |
127
- | `skills/` | Prepared AcademicArmy skills. |
128
- | `metaskills/` | Matching metaskill design/evolution files. |
129
- | `runs/` | Convenience wrappers around TypeScript pipelines. |
130
- | `src/` | TypeScript pipeline structure and implementation notes. |
131
- | `output/` | Generated planning artifacts, codebase output, and archives. |
122
+ | Path | Purpose |
123
+ | ------------------ | ----------------------------------------------------------------- |
124
+ | `agent-forge.yaml` | Agent and team wiring. |
125
+ | `install_mcp.py` | Installs the project MCP server into Codex for direct skill runs. |
126
+ | `mcp-server/` | Local stdio MCP implementation that exposes `deepresearch`. |
127
+ | `skills/` | Prepared AcademicArmy skills. |
128
+ | `metaskills/` | Matching metaskill design/evolution files. |
129
+ | `runs/` | Convenience wrappers around TypeScript pipelines. |
130
+ | `src/` | TypeScript pipeline structure and implementation notes. |
131
+ | `output/` | Generated planning artifacts, codebase output, and archives. |
132
132
 
133
133
  Agent and team wiring lives in [`agent-forge.yaml`](agent-forge.yaml). The current TypeScript agents are implemented under [`src/developing/agents`](src/developing/agents) and [`src/evolve-skill/agents`](src/evolve-skill/agents).
134
134
 
@@ -136,11 +136,11 @@ Prepared AcademicArmy skills live under [`skills/`](skills/), and their matching
136
136
 
137
137
  ## Configuration Reference
138
138
 
139
- | File or variable | Required for | Notes |
140
- |---|---|---|
141
- | `.env` / `OPENAI_API_KEY` | DeepResearch MCP | Read by the MCP server and by `install_mcp.py`. |
142
- | `agent-forge.yaml` | Project pipelines | Launches `academic_army_mcp_tools` as `python -m mcp-server` with `PYTHONPATH=.` and `cwd=.`. |
143
- | `secret.yaml` | Prepared shell scripts | Local ignored config overlay used by the prepared wrappers. It may contain passwords, API keys, runtime credentials, or other private values that must not be committed or uploaded to GitHub. |
139
+ | File or variable | Required for | Notes |
140
+ | ------------------------- | ---------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
141
+ | `.env` / `OPENAI_API_KEY` | DeepResearch MCP | Read by the MCP server and by `install_mcp.py`. |
142
+ | `agent-forge.yaml` | Project pipelines | Launches `academic_army_mcp_tools` as `python -m mcp-server` with `PYTHONPATH=.` and `cwd=.`. |
143
+ | `secret.yaml` | Prepared shell scripts | Local ignored config overlay used by the prepared wrappers. It may contain passwords, API keys, runtime credentials, or other private values that must not be committed or uploaded to GitHub. |
144
144
 
145
145
  To override or add environment variables directly when installing MCP into Codex, repeat `-e/--env NAME=VALUE`:
146
146
 
@@ -152,11 +152,11 @@ Running the installer refreshes the Codex `academic_army_mcp_tools` entry, regis
152
152
 
153
153
  ## Troubleshooting
154
154
 
155
- | Problem | Likely cause | Fix |
156
- |---|---|---|
157
- | `OPENAI_API_KEY` is missing | `.env` is not present or was not forwarded to Codex MCP. | Create `.env`; when running skills directly in Codex, rerun `python install_mcp.py`. |
155
+ | Problem | Likely cause | Fix |
156
+ | ----------------------------------- | ------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------- |
157
+ | `OPENAI_API_KEY` is missing | `.env` is not present or was not forwarded to Codex MCP. | Create `.env`; when running skills directly in Codex, rerun `python install_mcp.py`. |
158
158
  | A wrapper cannot find `secret.yaml` | The prepared shell scripts pass a local config overlay for private values such as passwords, API keys, and runtime credentials. | Create local `secret.yaml` or adjust the script to use your config files. Do not commit or upload this file to GitHub. |
159
- | Development output is drifting | The planning artifacts are not specific enough. | Revise `paper_blueprint.md`, `experiment_plan.md`, and `coding_plan.md` before continuing development. |
159
+ | Development output is drifting | The planning artifacts are not specific enough. | Revise `paper_blueprint.md`, `experiment_plan.md`, and `coding_plan.md` before continuing development. |
160
160
 
161
161
  ## Development
162
162
 
package/README.zh-CN.md CHANGED
@@ -18,11 +18,11 @@ AcademicArmy 的主体核心可以概括为一句话:按图施工。
18
18
 
19
19
  先使用三个规划类 skill 交互生成三份面向 AI 执行的 Markdown 产物:
20
20
 
21
- | 步骤 | Artifact | 作用 |
22
- |---|---|---|
23
- | `academic-army-architect` | `paper_blueprint.md` | 论文战略蓝图,用来固定论文身份、目标 venue 姿态、核心 claims、贡献边界、候选方法空间、证据需求和下游约束。 |
24
- | `academic-army-experiment-plan` | `experiment_plan.md` | 实验策略,把论文 claims 映射到证据链、数据集或 workload、指标、baselines、消融、鲁棒性检查和审稿人关心的验证点。 |
25
- | `academic-army-coding-plan` | `coding_plan.md` | 代码实现契约,把论文蓝图和实验方案转成逻辑模块边界、接口与 entrypoint 语义、实验 harness、测试类别、raw result artifact schema 和 method freeze 规则。 |
21
+ | 步骤 | Artifact | 作用 |
22
+ | ------------------------------- | -------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------ |
23
+ | `academic-army-architect` | `paper_blueprint.md` | 论文战略蓝图,用来固定论文身份、目标 venue 姿态、核心 claims、贡献边界、候选方法空间、证据需求和下游约束。 |
24
+ | `academic-army-experiment-plan` | `experiment_plan.md` | 实验策略,把论文 claims 映射到证据链、数据集或 workload、指标、baselines、消融、鲁棒性检查和审稿人关心的验证点。 |
25
+ | `academic-army-coding-plan` | `coding_plan.md` | 代码实现契约,把论文蓝图和实验方案转成逻辑模块边界、接口与 entrypoint 语义、实验 harness、测试类别、raw result artifact schema 和 method freeze 规则。 |
26
26
 
27
27
  每个规划类 skill 还会同时生成一份中文 `*.explain.md` 解释文件,方便用户审阅;但后续开发 runner 读取的是上面三份英文 Markdown。
28
28
 
@@ -119,16 +119,16 @@ Find the closest papers to this research idea, compare their methods, and return
119
119
 
120
120
  ## 项目结构
121
121
 
122
- | 路径 | 用途 |
123
- |---|---|
124
- | `agent-forge.yaml` | Agent 和团队 wiring。 |
125
- | `install_mcp.py` | 把项目 MCP server 安装到 Codex,供直接运行 skill 时使用。 |
126
- | `mcp-server/` | 本地 stdio MCP 实现,暴露 `deepresearch`。 |
127
- | `skills/` | 已准备的 AcademicArmy skills。 |
128
- | `metaskills/` | 对应的 metaskill 设计与 evolution 文件。 |
129
- | `runs/` | TypeScript pipelines 的便捷 wrappers。 |
130
- | `src/` | TypeScript pipeline 的目录结构和实现说明。 |
131
- | `output/` | 生成的规划产物、代码库输出和归档。 |
122
+ | 路径 | 用途 |
123
+ | ------------------ | --------------------------------------------------------- |
124
+ | `agent-forge.yaml` | Agent 和团队 wiring。 |
125
+ | `install_mcp.py` | 把项目 MCP server 安装到 Codex,供直接运行 skill 时使用。 |
126
+ | `mcp-server/` | 本地 stdio MCP 实现,暴露 `deepresearch`。 |
127
+ | `skills/` | 已准备的 AcademicArmy skills。 |
128
+ | `metaskills/` | 对应的 metaskill 设计与 evolution 文件。 |
129
+ | `runs/` | TypeScript pipelines 的便捷 wrappers。 |
130
+ | `src/` | TypeScript pipeline 的目录结构和实现说明。 |
131
+ | `output/` | 生成的规划产物、代码库输出和归档。 |
132
132
 
133
133
  Agent 和团队 wiring 位于 [`agent-forge.yaml`](agent-forge.yaml)。当前 TypeScript agents 分别实现于 [`src/developing/agents`](src/developing/agents) 和 [`src/evolve-skill/agents`](src/evolve-skill/agents)。
134
134
 
@@ -136,11 +136,11 @@ Agent 和团队 wiring 位于 [`agent-forge.yaml`](agent-forge.yaml)。当前 Ty
136
136
 
137
137
  ## 配置参考
138
138
 
139
- | 文件或变量 | 用于 | 说明 |
140
- |---|---|---|
141
- | `.env` / `OPENAI_API_KEY` | DeepResearch MCP | MCP server 和 `install_mcp.py` 会读取。 |
142
- | `agent-forge.yaml` | 项目 pipelines | 以 `PYTHONPATH=.` 和 `cwd=.` 运行 `python -m mcp-server`。 |
143
- | `secret.yaml` | 预设 shell scripts | 预设 wrappers 使用的本地忽略 config overlay。它可以包含密码、API key、runtime 凭据等不能提交或上传到 GitHub 的隐私内容。 |
139
+ | 文件或变量 | 用于 | 说明 |
140
+ | ------------------------- | ------------------ | ------------------------------------------------------------------------------------------------------------------------ |
141
+ | `.env` / `OPENAI_API_KEY` | DeepResearch MCP | MCP server 和 `install_mcp.py` 会读取。 |
142
+ | `agent-forge.yaml` | 项目 pipelines | 以 `PYTHONPATH=.` 和 `cwd=.` 运行 `python -m mcp-server`。 |
143
+ | `secret.yaml` | 预设 shell scripts | 预设 wrappers 使用的本地忽略 config overlay。它可以包含密码、API key、runtime 凭据等不能提交或上传到 GitHub 的隐私内容。 |
144
144
 
145
145
  如果需要覆盖或补充环境变量,可以重复使用 `-e/--env NAME=VALUE`:
146
146
 
@@ -152,11 +152,11 @@ python install_mcp.py -e OPENAI_API_KEY=your_api_key_here
152
152
 
153
153
  ## 常见问题
154
154
 
155
- | 问题 | 常见原因 | 解决办法 |
156
- |---|---|---|
157
- | 缺少 `OPENAI_API_KEY` | 没有 `.env`,或没有把变量转发给 Codex MCP。 | 创建 `.env`;如果直接在 Codex 中跑 skill,再执行 `python install_mcp.py`。 |
155
+ | 问题 | 常见原因 | 解决办法 |
156
+ | ---------------------------- | -------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------- |
157
+ | 缺少 `OPENAI_API_KEY` | 没有 `.env`,或没有把变量转发给 Codex MCP。 | 创建 `.env`;如果直接在 Codex 中跑 skill,再执行 `python install_mcp.py`。 |
158
158
  | Wrapper 找不到 `secret.yaml` | 预设脚本传入了本地 config overlay,用来放密码、API key、runtime 凭据等隐私内容。 | 创建本地 `secret.yaml`,或调整脚本使用你的 config 文件。不要把这个文件提交或上传到 GitHub。 |
159
- | 开发输出偏离规划 | 三份规划产物还不够具体。 | 先修订 `paper_blueprint.md`、`experiment_plan.md` 和 `coding_plan.md`,再继续开发。 |
159
+ | 开发输出偏离规划 | 三份规划产物还不够具体。 | 先修订 `paper_blueprint.md`、`experiment_plan.md` 和 `coding_plan.md`,再继续开发。 |
160
160
 
161
161
  ## 开发
162
162
 
package/dist/cli.d.ts CHANGED
@@ -1,2 +1,3 @@
1
+ #!/usr/bin/env node
1
2
  export {};
2
3
  //# sourceMappingURL=cli.d.ts.map
package/dist/cli.js CHANGED
@@ -1,3 +1,4 @@
1
+ #!/usr/bin/env node
1
2
  import { runPipelineCli } from "./pipeline.js";
2
3
  import { developingPipeline, developingSkillPipeline } from "./developing/index.js";
3
4
  import { evolveSkillPipeline } from "./evolve-skill/index.js";
package/dist/cli.js.map CHANGED
@@ -1 +1 @@
1
- {"version":3,"file":"cli.js","sourceRoot":"","sources":["../src/cli.ts"],"names":[],"mappings":"AACA,OAAO,EAA2B,cAAc,EAAE,MAAM,eAAe,CAAC;AACxE,OAAO,EAAE,kBAAkB,EAAE,uBAAuB,EAAE,MAAM,uBAAuB,CAAC;AACpF,OAAO,EAAE,mBAAmB,EAAE,MAAM,yBAAyB,CAAC;AAE9D,SAAS,SAAS,CAAwD,KAIzE;IACC,OAAO;QACL,IAAI,EAAE,KAAK,CAAC,IAAI;QAChB,WAAW,EAAE,KAAK,CAAC,WAAW;QAC9B,GAAG,EAAE,CAAC,IAAuB,EAAE,EAAE,CAAC,cAAc,CAAC,KAAK,CAAC,UAAU,EAAE,IAAI,CAAC;KACzE,CAAC;AACJ,CAAC;AAED,MAAM,cAAc,GAAG;IACrB,SAAS,CAAC;QACR,IAAI,EAAE,YAAY;QAClB,WAAW,EAAE,gCAAgC;QAC7C,UAAU,EAAE,kBAAkB;KAC/B,CAAC;IACF,SAAS,CAAC;QACR,IAAI,EAAE,kBAAkB;QACxB,WAAW,EAAE,qDAAqD;QAClE,UAAU,EAAE,uBAAuB;KACpC,CAAC;IACF,SAAS,CAAC;QACR,IAAI,EAAE,cAAc;QACpB,WAAW,EAAE,+BAA+B;QAC5C,UAAU,EAAE,mBAAmB;KAChC,CAAC;CACM,CAAC;AAEX,SAAS,SAAS;IAChB,MAAM,YAAY,GAAG,cAAc;SAChC,GAAG,CAAC,CAAC,QAAQ,EAAE,EAAE,CAAC,KAAK,QAAQ,CAAC,IAAI,CAAC,MAAM,CAAC,EAAE,CAAC,IAAI,QAAQ,CAAC,WAAW,EAAE,CAAC;SAC1E,IAAI,CAAC,IAAI,CAAC,CAAC;IAEd,OAAO;;;EAGP,YAAY,EAAE,CAAC;AACjB,CAAC;AAED,MAAM,CAAC,YAAY,EAAE,GAAG,YAAY,CAAC,GAAG,OAAO,CAAC,IAAI,CAAC,KAAK,CAAC,CAAC,CAAC,CAAC;AAC9D,MAAM,kBAAkB,GAAG,cAAc,CAAC,IAAI,CAAC,CAAC,QAAQ,EAAE,EAAE,CAAC,QAAQ,CAAC,IAAI,KAAK,YAAY,CAAC,CAAC;AAE7F,IAAI,kBAAkB,KAAK,SAAS,EAAE,CAAC;IACrC,OAAO,CAAC,GAAG,CAAC,SAAS,EAAE,CAAC,CAAC;IACzB,OAAO,CAAC,QAAQ,GAAG,CAAC,CAAC;AACvB,CAAC;KAAM,CAAC;IACN,MAAM,kBAAkB,CAAC,GAAG,CAAC,YAAY,CAAC,CAAC;AAC7C,CAAC"}
1
+ {"version":3,"file":"cli.js","sourceRoot":"","sources":["../src/cli.ts"],"names":[],"mappings":";AAEA,OAAO,EAA2B,cAAc,EAAE,MAAM,eAAe,CAAC;AACxE,OAAO,EAAE,kBAAkB,EAAE,uBAAuB,EAAE,MAAM,uBAAuB,CAAC;AACpF,OAAO,EAAE,mBAAmB,EAAE,MAAM,yBAAyB,CAAC;AAE9D,SAAS,SAAS,CAAwD,KAIzE;IACC,OAAO;QACL,IAAI,EAAE,KAAK,CAAC,IAAI;QAChB,WAAW,EAAE,KAAK,CAAC,WAAW;QAC9B,GAAG,EAAE,CAAC,IAAuB,EAAE,EAAE,CAAC,cAAc,CAAC,KAAK,CAAC,UAAU,EAAE,IAAI,CAAC;KACzE,CAAC;AACJ,CAAC;AAED,MAAM,cAAc,GAAG;IACrB,SAAS,CAAC;QACR,IAAI,EAAE,YAAY;QAClB,WAAW,EAAE,gCAAgC;QAC7C,UAAU,EAAE,kBAAkB;KAC/B,CAAC;IACF,SAAS,CAAC;QACR,IAAI,EAAE,kBAAkB;QACxB,WAAW,EAAE,qDAAqD;QAClE,UAAU,EAAE,uBAAuB;KACpC,CAAC;IACF,SAAS,CAAC;QACR,IAAI,EAAE,cAAc;QACpB,WAAW,EAAE,+BAA+B;QAC5C,UAAU,EAAE,mBAAmB;KAChC,CAAC;CACM,CAAC;AAEX,SAAS,SAAS;IAChB,MAAM,YAAY,GAAG,cAAc;SAChC,GAAG,CAAC,CAAC,QAAQ,EAAE,EAAE,CAAC,KAAK,QAAQ,CAAC,IAAI,CAAC,MAAM,CAAC,EAAE,CAAC,IAAI,QAAQ,CAAC,WAAW,EAAE,CAAC;SAC1E,IAAI,CAAC,IAAI,CAAC,CAAC;IAEd,OAAO;;;EAGP,YAAY,EAAE,CAAC;AACjB,CAAC;AAED,MAAM,CAAC,YAAY,EAAE,GAAG,YAAY,CAAC,GAAG,OAAO,CAAC,IAAI,CAAC,KAAK,CAAC,CAAC,CAAC,CAAC;AAC9D,MAAM,kBAAkB,GAAG,cAAc,CAAC,IAAI,CAAC,CAAC,QAAQ,EAAE,EAAE,CAAC,QAAQ,CAAC,IAAI,KAAK,YAAY,CAAC,CAAC;AAE7F,IAAI,kBAAkB,KAAK,SAAS,EAAE,CAAC;IACrC,OAAO,CAAC,GAAG,CAAC,SAAS,EAAE,CAAC,CAAC;IACzB,OAAO,CAAC,QAAQ,GAAG,CAAC,CAAC;AACvB,CAAC;KAAM,CAAC;IACN,MAAM,kBAAkB,CAAC,GAAG,CAAC,YAAY,CAAC,CAAC;AAC7C,CAAC"}
@@ -22,13 +22,13 @@ This is especially useful for the three planning skills used at the start of the
22
22
 
23
23
  Prepared AcademicArmy skill metaskills:
24
24
 
25
- | Skill | Edit this file | Run this script |
26
- |---|---|---|
27
- | `academic-army-architect` | [`academic-army-architect/METASKILL.md`](academic-army-architect/METASKILL.md) | [`academic-army-architect/envolve.sh`](academic-army-architect/envolve.sh) with `bash` |
25
+ | Skill | Edit this file | Run this script |
26
+ | ------------------------------- | ------------------------------------------------------------------------------------------ | -------------------------------------------------------------------------------------------------- |
27
+ | `academic-army-architect` | [`academic-army-architect/METASKILL.md`](academic-army-architect/METASKILL.md) | [`academic-army-architect/envolve.sh`](academic-army-architect/envolve.sh) with `bash` |
28
28
  | `academic-army-experiment-plan` | [`academic-army-experiment-plan/METASKILL.md`](academic-army-experiment-plan/METASKILL.md) | [`academic-army-experiment-plan/envolve.sh`](academic-army-experiment-plan/envolve.sh) with `bash` |
29
- | `academic-army-coding-plan` | [`academic-army-coding-plan/METASKILL.md`](academic-army-coding-plan/METASKILL.md) | [`academic-army-coding-plan/envolve.sh`](academic-army-coding-plan/envolve.sh) with `bash` |
30
- | `academic-army-repo-scaffold` | [`academic-army-repo-scaffold/METASKILL.md`](academic-army-repo-scaffold/METASKILL.md) | [`academic-army-repo-scaffold/envolve.sh`](academic-army-repo-scaffold/envolve.sh) with `bash` |
31
- | `academic-army-coding-style` | [`academic-army-coding-style/METASKILL.md`](academic-army-coding-style/METASKILL.md) | [`../runs/develop-skill.sh`](../runs/develop-skill.sh) with `bash` |
29
+ | `academic-army-coding-plan` | [`academic-army-coding-plan/METASKILL.md`](academic-army-coding-plan/METASKILL.md) | [`academic-army-coding-plan/envolve.sh`](academic-army-coding-plan/envolve.sh) with `bash` |
30
+ | `academic-army-repo-scaffold` | [`academic-army-repo-scaffold/METASKILL.md`](academic-army-repo-scaffold/METASKILL.md) | [`academic-army-repo-scaffold/envolve.sh`](academic-army-repo-scaffold/envolve.sh) with `bash` |
31
+ | `academic-army-coding-style` | [`academic-army-coding-style/METASKILL.md`](academic-army-coding-style/METASKILL.md) | [`../runs/develop-skill.sh`](../runs/develop-skill.sh) with `bash` |
32
32
 
33
33
  Before calling `evolve-skill` for `academic-army-architect`, create or confirm [`academic-army-architect/ENVOLVETASK.md`](academic-army-architect/ENVOLVETASK.md). This fixed task is what the runner uses to test the architect skill across evolution rounds.
34
34
 
@@ -124,8 +124,8 @@ Copy an existing [`envolve.sh`](academic-army-architect/envolve.sh) and update t
124
124
 
125
125
  ## Troubleshooting
126
126
 
127
- | Problem | Likely cause | Fix |
128
- |---|---|---|
127
+ | Problem | Likely cause | Fix |
128
+ | ------------------------------------------- | -------------------------------------------------------- | ------------------------------------------------------------------------- |
129
129
  | The skill output is still not stable enough | The metaskill guidance is still too vague or incomplete. | Add concrete tips about the failure mode and repeat the evolution script. |
130
- | Different versions are hard to compare | The fixed evolution task is not stable enough. | Rewrite `ENVOLVETASK.md` as a representative fixed task. |
131
- | The script fails before the loop starts | Dependencies or pipeline config are missing. | Run `npm install` and check the TypeScript entry points and config paths. |
130
+ | Different versions are hard to compare | The fixed evolution task is not stable enough. | Rewrite `ENVOLVETASK.md` as a representative fixed task. |
131
+ | The script fails before the loop starts | Dependencies or pipeline config are missing. | Run `npm install` and check the TypeScript entry points and config paths. |
@@ -22,13 +22,13 @@
22
22
 
23
23
  当前已准备的 AcademicArmy skill metaskill 如下:
24
24
 
25
- | Skill | 修改这个文件 | 运行这个脚本 |
26
- |---|---|---|
27
- | `academic-army-architect` | [`academic-army-architect/METASKILL.md`](academic-army-architect/METASKILL.md) | 用 `bash` 运行 [`academic-army-architect/envolve.sh`](academic-army-architect/envolve.sh) |
25
+ | Skill | 修改这个文件 | 运行这个脚本 |
26
+ | ------------------------------- | ------------------------------------------------------------------------------------------ | ----------------------------------------------------------------------------------------------------- |
27
+ | `academic-army-architect` | [`academic-army-architect/METASKILL.md`](academic-army-architect/METASKILL.md) | 用 `bash` 运行 [`academic-army-architect/envolve.sh`](academic-army-architect/envolve.sh) |
28
28
  | `academic-army-experiment-plan` | [`academic-army-experiment-plan/METASKILL.md`](academic-army-experiment-plan/METASKILL.md) | 用 `bash` 运行 [`academic-army-experiment-plan/envolve.sh`](academic-army-experiment-plan/envolve.sh) |
29
- | `academic-army-coding-plan` | [`academic-army-coding-plan/METASKILL.md`](academic-army-coding-plan/METASKILL.md) | 用 `bash` 运行 [`academic-army-coding-plan/envolve.sh`](academic-army-coding-plan/envolve.sh) |
30
- | `academic-army-repo-scaffold` | [`academic-army-repo-scaffold/METASKILL.md`](academic-army-repo-scaffold/METASKILL.md) | 用 `bash` 运行 [`academic-army-repo-scaffold/envolve.sh`](academic-army-repo-scaffold/envolve.sh) |
31
- | `academic-army-coding-style` | [`academic-army-coding-style/METASKILL.md`](academic-army-coding-style/METASKILL.md) | 用 `bash` 运行 [`../runs/develop-skill.sh`](../runs/develop-skill.sh) |
29
+ | `academic-army-coding-plan` | [`academic-army-coding-plan/METASKILL.md`](academic-army-coding-plan/METASKILL.md) | 用 `bash` 运行 [`academic-army-coding-plan/envolve.sh`](academic-army-coding-plan/envolve.sh) |
30
+ | `academic-army-repo-scaffold` | [`academic-army-repo-scaffold/METASKILL.md`](academic-army-repo-scaffold/METASKILL.md) | 用 `bash` 运行 [`academic-army-repo-scaffold/envolve.sh`](academic-army-repo-scaffold/envolve.sh) |
31
+ | `academic-army-coding-style` | [`academic-army-coding-style/METASKILL.md`](academic-army-coding-style/METASKILL.md) | 用 `bash` 运行 [`../runs/develop-skill.sh`](../runs/develop-skill.sh) |
32
32
 
33
33
  对 `academic-army-architect` 调用 `evolve-skill` 前,先创建或确认 [`academic-army-architect/ENVOLVETASK.md`](academic-army-architect/ENVOLVETASK.md)(Windows 路径:`metaskills\academic-army-architect\ENVOLVETASK.md`)。这个固定任务是 runner 在 evolution 轮次中测试 architect skill 的输入。
34
34
 
@@ -124,8 +124,8 @@ envolve.sh
124
124
 
125
125
  ## 常见问题
126
126
 
127
- | 问题 | 常见原因 | 解决办法 |
128
- |---|---|---|
129
- | Skill 输出仍然不稳定 | Metaskill guidance 仍然太模糊或不完整。 | 继续补充具体 failure mode 和 tips,然后再次运行 evolution 脚本。 |
130
- | 不同版本难以比较 | 固定 evolution task 不够稳定。 | 把 `ENVOLVETASK.md` 改成更有代表性的固定任务。 |
131
- | 脚本在 loop 开始前失败 | 依赖或 pipeline config 缺失。 | 运行 `npm install`,并检查 TypeScript 入口和 config 路径。 |
127
+ | 问题 | 常见原因 | 解决办法 |
128
+ | ---------------------- | --------------------------------------- | ---------------------------------------------------------------- |
129
+ | Skill 输出仍然不稳定 | Metaskill guidance 仍然太模糊或不完整。 | 继续补充具体 failure mode 和 tips,然后再次运行 evolution 脚本。 |
130
+ | 不同版本难以比较 | 固定 evolution task 不够稳定。 | 把 `ENVOLVETASK.md` 改成更有代表性的固定任务。 |
131
+ | 脚本在 loop 开始前失败 | 依赖或 pipeline config 缺失。 | 运行 `npm install`,并检查 TypeScript 入口和 config 路径。 |
@@ -1 +1 @@
1
- Create a coding plan based on the paper blueprint in output/paper_blueprint.md and the experiment plan in output/experiment_plan.md.
1
+ Create a coding plan based on the paper blueprint in output/paper_blueprint.md and the experiment plan in output/experiment_plan.md.
@@ -1 +1 @@
1
- Create an experiment plan based on the paper blueprint in output/paper_blueprint.md.
1
+ Create an experiment plan based on the paper blueprint in output/paper_blueprint.md.
@@ -44,7 +44,6 @@ experiment plan主要是给AI看的,不是主要给人看的;它只需要客
44
44
  experiment plan中不应出现`Assumptions to validate`、`Artifact cautions`、`Do not assume reviewers will run code`这类面向用户的部分。
45
45
  如果确实存在不确定信息,应将其表达为实验方案中的开放变量、依赖条件或后续规划需要继承的信息,而不是写成用户提醒。
46
46
 
47
-
48
47
  `experiment_plan.md`和`experiment_plan.explain.md`都应为Markdown格式。
49
48
  `experiment_plan.md`固定只放experiment plan本身,且使用英文输出。
50
49
  `experiment_plan.explain.md`固定只放experiment plan解释和思考流程,且使用中文输出。
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "academic-army",
3
- "version": "0.1.1",
3
+ "version": "0.1.2",
4
4
  "description": "Agent workflows and skills for AcademicArmy.",
5
5
  "license": "MIT",
6
6
  "repository": {
@@ -64,22 +64,22 @@ artifact access, or runtime environment.
64
64
  Operate at paper-strategy levels 0-2 and route tactical work to downstream
65
65
  skills.
66
66
 
67
- | Level | This skill owns |
68
- |---|---|
69
- | 0: paper identity | idea, field context, target readers, target venue, paper type |
70
- | 1: thesis and claims | problem pressure, insight, reader promise, goals, claim hierarchy |
67
+ | Level | This skill owns |
68
+ | ------------------------ | --------------------------------------------------------------------------------------------------------------------------------------------------- |
69
+ | 0: paper identity | idea, field context, target readers, target venue, paper type |
70
+ | 1: thesis and claims | problem pressure, insight, reader promise, goals, claim hierarchy |
71
71
  | 2: strategic constraints | prior-work boundary, contribution roles, high-level design logic, candidate method space, evidence principles, downstream contracts, open variables |
72
72
 
73
73
  Translate tactical inputs into strategic contracts:
74
74
 
75
- | User input type | Blueprint-level representation |
76
- |---|---|
77
- | algorithm, proof, or optimization idea | required method property, selected strategic direction, or candidate method route |
78
- | module or method combination | candidate route with purpose, evidence needed, demotion condition, and fair-comparison principle |
79
- | dataset, trace, benchmark, device, or baseline | evidence dimension, fairness obligation, or user-specified evaluation constraint |
80
- | metric formula | outcome family and claim-evidence requirement |
81
- | figure or section idea | visual or narrative argument that downstream planning must preserve |
82
- | implementation detail | inherited capability, deployment boundary, or downstream engineering constraint |
75
+ | User input type | Blueprint-level representation |
76
+ | ---------------------------------------------- | ------------------------------------------------------------------------------------------------ |
77
+ | algorithm, proof, or optimization idea | required method property, selected strategic direction, or candidate method route |
78
+ | module or method combination | candidate route with purpose, evidence needed, demotion condition, and fair-comparison principle |
79
+ | dataset, trace, benchmark, device, or baseline | evidence dimension, fairness obligation, or user-specified evaluation constraint |
80
+ | metric formula | outcome family and claim-evidence requirement |
81
+ | figure or section idea | visual or narrative argument that downstream planning must preserve |
82
+ | implementation detail | inherited capability, deployment boundary, or downstream engineering constraint |
83
83
 
84
84
  ## Strategy Ledger
85
85
 
@@ -14,16 +14,27 @@ Use this default structure:
14
14
  # Paper Blueprint: <Working Title>
15
15
 
16
16
  ## Paper Identity
17
+
17
18
  ## Core Thesis and Reader Promise
19
+
18
20
  ## Target Venue Fit
21
+
19
22
  ## Problem Context and Prior-Work Gap
23
+
20
24
  ## Paper Goals
25
+
21
26
  ## Contribution Strategy
27
+
22
28
  ## Claim-Evidence Architecture
29
+
23
30
  ## High-Level Design Logic and Scope
31
+
24
32
  ## Candidate Method Space
33
+
25
34
  ## Evidence Strategy
35
+
26
36
  ## Downstream Planning Constraints
37
+
27
38
  ## Open Strategic Variables
28
39
  ```
29
40
 
@@ -182,9 +193,11 @@ Add contracts for relevant planning areas:
182
193
  ### <Planning Area> Contract
183
194
 
184
195
  Preserve:
196
+
185
197
  - <strategic invariant>
186
198
 
187
199
  Delegate:
200
+
188
201
  - <tactical design space>
189
202
  ```
190
203
 
@@ -243,19 +256,33 @@ Use this default structure:
243
256
  # 论文蓝图说明:<Working Title>
244
257
 
245
258
  ## 当前论文方案概括
259
+
246
260
  ## 用户已经明确的内容
261
+
247
262
  ### 已确认约束
263
+
248
264
  ### 偏好但未锁定
265
+
249
266
  ## 当前工作假设
267
+
250
268
  ## 使用的研究信号
269
+
251
270
  ### 跨论文模式
271
+
252
272
  ### 承重信号
273
+
253
274
  ### 额外背景信号
275
+
254
276
  ## 核心出发点
277
+
255
278
  ## 按蓝图顺序解释论文方案
279
+
256
280
  ## 候选方法空间说明
281
+
257
282
  ## 开放验证项
283
+
258
284
  ## 本轮已收缩的开放项
285
+
259
286
  ## 证据变化时如何校准 Claim
260
287
  ```
261
288
 
@@ -45,18 +45,20 @@ For `output/evolve-*` outputs, or whenever artifact-access feedback is active or
45
45
 
46
46
  Use five-backtick fences for full-file handoffs so embedded command fences remain readable:
47
47
 
48
- ````markdown
48
+ ``````markdown
49
49
  ## output/evolve-.../coding_plan.md
50
50
 
51
- `````markdown
51
+ ```markdown
52
52
  <full coding_plan.md content>
53
- `````
53
+ ```
54
+ ``````
54
55
 
55
56
  ## output/evolve-.../coding_plan.explain.md
56
57
 
57
- `````markdown
58
+ ```markdown
58
59
  <full coding_plan.explain.md content>
59
- `````
60
+ ```
61
+
60
62
  ````
61
63
 
62
64
  When files are long, read each file with a complete read method or bounded chunks before composing the handoff. Paste the read-back contents, not a regenerated approximation. If read-back fails after writing, try another local read mechanism. If read-back remains impossible, report the read-back failure clearly and mark delivery blocked rather than presenting unverified contents.
@@ -489,3 +491,4 @@ After writing and validating the files, summarize:
489
491
  - validation performed, including read-back result
490
492
 
491
493
  For `output/evolve-*` outputs or when artifact-access feedback requests pasted contents, add a `Review Handoff` heading immediately after the concise validation sentence and paste the complete read-back contents of both files using the five-backtick handoff format. A path-only response is incomplete for access-limited review.
494
+ ````
@@ -328,7 +328,7 @@ objectives.
328
328
  ## 2. Claim-to-Evidence Map
329
329
 
330
330
  | Claim | Reviewer Concern | Evidence Objective | Story Placement | Expected Evidence Output |
331
- |---|---|---|---|---|
331
+ | ----- | ---------------- | ------------------ | --------------- | ------------------------ |
332
332
 
333
333
  ## 3. Workload and Dataset Registry
334
334
 
@@ -414,7 +414,7 @@ is clearer.
414
414
  ## 实时调研如何影响实验取舍
415
415
 
416
416
  | 来源 | 日期 | venue_status | 影响到的规划决定 |
417
- |---|---:|---|---|
417
+ | ---- | ---: | ------------ | ---------------- |
418
418
 
419
419
  ## 实验故事线
420
420
 
@@ -699,7 +699,7 @@ Treat any of these as validation failures:
699
699
  - README, REFERENCES, harness explanations, test notes, or retained docs use
700
700
  internal artifact-management wording such as `External Task Inputs`,
701
701
  `external inputs`, whole-word `bucket`, vague `consulted`, `operators
702
- execute`, or similar process language,
702
+ execute`, or similar process language,
703
703
  - REFERENCES includes sources that were only searched or rejected and have no
704
704
  dependency, attribution, license, benchmark, harness, or implementation value,
705
705
  - retained template files still contain default template project names, template