academic-army 0.1.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.editorconfig +9 -0
- package/.github/workflows/publish.yml +44 -0
- package/.prettierrc.json +3 -0
- package/LICENSE +21 -0
- package/README.md +172 -0
- package/README.zh-CN.md +172 -0
- package/agent-forge.yaml +83 -0
- package/eslint.config.js +28 -0
- package/install_mcp.py +85 -0
- package/mcp-server/__main__.py +33 -0
- package/mcp-server/deepresearch/__init__.py +3 -0
- package/mcp-server/deepresearch/tools.py +33 -0
- package/mcp-server/requirements.txt +4 -0
- package/metaskills/README.md +131 -0
- package/metaskills/README.zh-CN.md +131 -0
- package/metaskills/academic-army-architect/METASKILL.md +91 -0
- package/metaskills/academic-army-architect/envolve.sh +9 -0
- package/metaskills/academic-army-coding-plan/ENVOLVETASK.md +1 -0
- package/metaskills/academic-army-coding-plan/METASKILL.md +118 -0
- package/metaskills/academic-army-coding-plan/envolve.sh +9 -0
- package/metaskills/academic-army-coding-style/METASKILL.md +292 -0
- package/metaskills/academic-army-experiment-plan/ENVOLVETASK.md +1 -0
- package/metaskills/academic-army-experiment-plan/METASKILL.md +82 -0
- package/metaskills/academic-army-experiment-plan/envolve.sh +9 -0
- package/metaskills/academic-army-repo-scaffold/ENVOLVETASK.md +1 -0
- package/metaskills/academic-army-repo-scaffold/METASKILL.md +223 -0
- package/metaskills/academic-army-repo-scaffold/envolve.sh +9 -0
- package/package.json +35 -0
- package/runs/develop-skill.sh +17 -0
- package/runs/develop.sh +16 -0
- package/skills/academic-army-architect/SKILL.md +336 -0
- package/skills/academic-army-architect/agents/openai.yaml +11 -0
- package/skills/academic-army-architect/references/blueprint-schema.md +345 -0
- package/skills/academic-army-coding-plan/SKILL.md +491 -0
- package/skills/academic-army-coding-plan/agents/openai.yaml +11 -0
- package/skills/academic-army-coding-style/SKILL.md +915 -0
- package/skills/academic-army-coding-style/agents/openai.yaml +11 -0
- package/skills/academic-army-experiment-plan/SKILL.md +517 -0
- package/skills/academic-army-experiment-plan/agents/openai.yaml +11 -0
- package/skills/academic-army-repo-scaffold/SKILL.md +756 -0
- package/skills/academic-army-repo-scaffold/agents/openai.yaml +10 -0
- package/src/README.md +79 -0
- package/src/README.zh-CN.md +79 -0
- package/src/cli.ts +55 -0
- package/src/developing/README.md +146 -0
- package/src/developing/README.zh-CN.md +146 -0
- package/src/developing/agents/developer.ts +40 -0
- package/src/developing/agents/factory.ts +11 -0
- package/src/developing/agents/index.ts +8 -0
- package/src/developing/agents/manager.ts +74 -0
- package/src/developing/agents/prompts.ts +12 -0
- package/src/developing/agents/reviewer.ts +44 -0
- package/src/developing/agents/trajectory-optimizer.ts +70 -0
- package/src/developing/agents/types.ts +41 -0
- package/src/developing/index.ts +2 -0
- package/src/developing/pipeline.ts +306 -0
- package/src/developing/pipelineskill.ts +169 -0
- package/src/evolve-skill/README.md +116 -0
- package/src/evolve-skill/README.zh-CN.md +116 -0
- package/src/evolve-skill/agents/evaluator.ts +28 -0
- package/src/evolve-skill/agents/factory.ts +11 -0
- package/src/evolve-skill/agents/index.ts +4 -0
- package/src/evolve-skill/agents/modifier.ts +27 -0
- package/src/evolve-skill/agents/runner.ts +19 -0
- package/src/evolve-skill/index.ts +1 -0
- package/src/evolve-skill/pipeline.ts +140 -0
- package/src/pipeline.ts +65 -0
- package/tsconfig.json +22 -0
|
@@ -0,0 +1,336 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: academic-army-architect
|
|
3
|
+
description: >-
|
|
4
|
+
Generate or revise a goal-oriented strategic paper blueprint for an
|
|
5
|
+
autoresearch pipeline. Use when Codex must refine a paper idea, position a
|
|
6
|
+
submission for a target venue, define claims and contribution boundaries,
|
|
7
|
+
build an evidence-grounded candidate method space, audit or revise an
|
|
8
|
+
existing blueprint, or hand off stable paper strategy to downstream content,
|
|
9
|
+
method, experiment, figure, writing, or review-planning skills. Always create
|
|
10
|
+
exactly two Markdown artifacts: English-only `paper_blueprint.md` and
|
|
11
|
+
Chinese-language `paper_blueprint.explain.md`. Use
|
|
12
|
+
`academic_army_mcp_tools.deepresearch` for live venue, literature,
|
|
13
|
+
high-quality-paper, method, benchmark, and autoresearch-workflow evidence.
|
|
14
|
+
---
|
|
15
|
+
|
|
16
|
+
# Academic Army Architect
|
|
17
|
+
|
|
18
|
+
## Mission
|
|
19
|
+
|
|
20
|
+
Create a target-oriented paper strategy blueprint for downstream AI planning.
|
|
21
|
+
The blueprint is not a full paper, experiment plan, figure plan, implementation
|
|
22
|
+
plan, or advice checklist. It is the stable paper core that later skills inherit:
|
|
23
|
+
|
|
24
|
+
- paper identity, target venue, readers, field context, and reviewer posture
|
|
25
|
+
- core thesis, problem pressure, reader promise, and paper goals
|
|
26
|
+
- prior-work boundary and differentiated positioning
|
|
27
|
+
- one to three core claims and the evidence each claim needs
|
|
28
|
+
- contribution shapes and novelty boundary
|
|
29
|
+
- high-level method or system logic
|
|
30
|
+
- experiment-dependent candidate method routes
|
|
31
|
+
- minimum sufficient evidence chain and evaluation principles
|
|
32
|
+
- downstream planning constraints and unresolved strategic variables
|
|
33
|
+
|
|
34
|
+
Before drafting or auditing output, read
|
|
35
|
+
[`references/blueprint-schema.md`](references/blueprint-schema.md).
|
|
36
|
+
|
|
37
|
+
## Output Contract
|
|
38
|
+
|
|
39
|
+
Write exactly two Markdown files in the requested output directory:
|
|
40
|
+
|
|
41
|
+
- `paper_blueprint.md`: English-only, AI-facing paper strategy blueprint.
|
|
42
|
+
- `paper_blueprint.explain.md`: Chinese-language validation companion for the
|
|
43
|
+
user. It uses Chinese sentences as the default and preserves conventional
|
|
44
|
+
English titles, venue names, dataset names, benchmark names, method names,
|
|
45
|
+
and technical terms when those names are standard in the field.
|
|
46
|
+
|
|
47
|
+
Keep the output directory limited to these two Markdown artifacts.
|
|
48
|
+
|
|
49
|
+
`paper_blueprint.md` objectively states the selected paper strategy, claim
|
|
50
|
+
envelope, contribution boundary, candidate method space when needed, evidence
|
|
51
|
+
requirements, downstream inheritance contracts, and open strategic variables.
|
|
52
|
+
It should be usable by later AI skills without reading the Chinese explanation.
|
|
53
|
+
|
|
54
|
+
`paper_blueprint.explain.md` helps the user validate why the blueprint is
|
|
55
|
+
reasonable. It records user-specified facts, restates each important blueprint
|
|
56
|
+
decision before explaining its rationale, and explains paper-design decisions
|
|
57
|
+
only: idea positioning, venue fit, claim logic, evidence chain,
|
|
58
|
+
source-backed patterns, candidate-method reasoning, and open-variable status.
|
|
59
|
+
It should not explain the skill workflow, template choices, tool calls,
|
|
60
|
+
artifact access, or runtime environment.
|
|
61
|
+
|
|
62
|
+
## Strategic Level
|
|
63
|
+
|
|
64
|
+
Operate at paper-strategy levels 0-2 and route tactical work to downstream
|
|
65
|
+
skills.
|
|
66
|
+
|
|
67
|
+
| Level | This skill owns |
|
|
68
|
+
|---|---|
|
|
69
|
+
| 0: paper identity | idea, field context, target readers, target venue, paper type |
|
|
70
|
+
| 1: thesis and claims | problem pressure, insight, reader promise, goals, claim hierarchy |
|
|
71
|
+
| 2: strategic constraints | prior-work boundary, contribution roles, high-level design logic, candidate method space, evidence principles, downstream contracts, open variables |
|
|
72
|
+
|
|
73
|
+
Translate tactical inputs into strategic contracts:
|
|
74
|
+
|
|
75
|
+
| User input type | Blueprint-level representation |
|
|
76
|
+
|---|---|
|
|
77
|
+
| algorithm, proof, or optimization idea | required method property, selected strategic direction, or candidate method route |
|
|
78
|
+
| module or method combination | candidate route with purpose, evidence needed, demotion condition, and fair-comparison principle |
|
|
79
|
+
| dataset, trace, benchmark, device, or baseline | evidence dimension, fairness obligation, or user-specified evaluation constraint |
|
|
80
|
+
| metric formula | outcome family and claim-evidence requirement |
|
|
81
|
+
| figure or section idea | visual or narrative argument that downstream planning must preserve |
|
|
82
|
+
| implementation detail | inherited capability, deployment boundary, or downstream engineering constraint |
|
|
83
|
+
|
|
84
|
+
## Strategy Ledger
|
|
85
|
+
|
|
86
|
+
When creating or revising a blueprint, maintain an internal ledger:
|
|
87
|
+
|
|
88
|
+
1. `User-specified facts`: explicit constraints accumulated across iterations.
|
|
89
|
+
2. `Evidence-backed findings`: live-research findings tied to source links.
|
|
90
|
+
3. `Preferred but not locked`: user preferences that guide the best strategy.
|
|
91
|
+
4. `Working assumptions`: defaults that keep the blueprint coherent.
|
|
92
|
+
5. `Strategic decisions`: identity, thesis, goals, claims, contribution roles,
|
|
93
|
+
novelty boundary, and scope.
|
|
94
|
+
6. `Candidate method routes`: experiment-dependent approaches that downstream
|
|
95
|
+
method or experiment planning must compare.
|
|
96
|
+
7. `Rejected strategic alternatives`: plausible alternatives not selected, with
|
|
97
|
+
a concise paper-level reason.
|
|
98
|
+
8. `Open strategic variables`: unresolved choices that require private facts,
|
|
99
|
+
unpublished constraints, user preference, or unavailable evidence.
|
|
100
|
+
9. `Resolved since prior iteration`: items closed by user facts, live evidence,
|
|
101
|
+
experiments, or a documented strategy decision.
|
|
102
|
+
|
|
103
|
+
When revising an existing blueprint, read the prior
|
|
104
|
+
`paper_blueprint.explain.md` first and preserve accumulated user-specified
|
|
105
|
+
facts unless the user explicitly corrects them.
|
|
106
|
+
|
|
107
|
+
Use this precedence order:
|
|
108
|
+
|
|
109
|
+
1. explicit user correction
|
|
110
|
+
2. accumulated user-specified fact
|
|
111
|
+
3. completed experiment result supplied by the user
|
|
112
|
+
4. evidence-backed finding
|
|
113
|
+
5. open strategic variable
|
|
114
|
+
6. preferred but not locked
|
|
115
|
+
7. working assumption
|
|
116
|
+
|
|
117
|
+
Evidence can refine scope and positioning, but it does not silently override a
|
|
118
|
+
user-specified constraint. Explain any resulting tradeoff in the Chinese file.
|
|
119
|
+
|
|
120
|
+
## Strategic Judgment
|
|
121
|
+
|
|
122
|
+
Take responsibility for choosing the strongest current paper strategy. Use the
|
|
123
|
+
paper goal, user facts, live evidence, venue posture, source-backed paper
|
|
124
|
+
patterns, claim requirements, and downstream executability to commit to the
|
|
125
|
+
best-supported position.
|
|
126
|
+
|
|
127
|
+
When one option is materially stronger, write only the selected strategy in
|
|
128
|
+
`paper_blueprint.md`. In `paper_blueprint.explain.md`, briefly explain why the
|
|
129
|
+
selected route is more persuasive than the most relevant rejected alternative
|
|
130
|
+
when that contrast helps the user validate the design.
|
|
131
|
+
|
|
132
|
+
Keep an item open only when available information cannot support a reliable
|
|
133
|
+
strategy decision and the choice would materially affect paper identity, claim
|
|
134
|
+
scope, contribution boundary, deployment boundary, evidence chain, or downstream
|
|
135
|
+
contracts.
|
|
136
|
+
|
|
137
|
+
## Candidate Method Space
|
|
138
|
+
|
|
139
|
+
Distinguish the strategic method direction from experiment-dependent
|
|
140
|
+
implementation routes.
|
|
141
|
+
|
|
142
|
+
Commit to a strategic method direction when venue posture, user facts, and
|
|
143
|
+
literature support it. For example: "the paper needs an interpretable online
|
|
144
|
+
controller with explicit uncertainty and deadline handling."
|
|
145
|
+
|
|
146
|
+
When method superiority depends on future results, create a `Candidate Method
|
|
147
|
+
Space` instead of selecting a final method or asking the user to choose. Each
|
|
148
|
+
candidate route should state:
|
|
149
|
+
|
|
150
|
+
- strategic purpose
|
|
151
|
+
- source or theoretical motivation
|
|
152
|
+
- relation to the core claim
|
|
153
|
+
- potential contribution role
|
|
154
|
+
- evidence needed to promote it into the proposed method
|
|
155
|
+
- disqualifier or demotion condition
|
|
156
|
+
- fair-comparison principle
|
|
157
|
+
- downstream owner, usually method planning or experiment planning
|
|
158
|
+
|
|
159
|
+
Candidate routes can later become the proposed method, a baseline, an ablation
|
|
160
|
+
variant, or a rejected route. Baseline planning may include original prior
|
|
161
|
+
methods, single-backbone variants, candidate routes without the paper's key
|
|
162
|
+
mechanism, partial-module variants, or key-module replacements. Leave exact
|
|
163
|
+
experiments, budgets, hyperparameters, and run matrices to downstream planning.
|
|
164
|
+
|
|
165
|
+
Exclude methods that live research shows are incompatible with the problem,
|
|
166
|
+
inputs, cost envelope, venue posture, or claim needs. Explain the paper-level
|
|
167
|
+
reason in the Chinese file.
|
|
168
|
+
|
|
169
|
+
## Open Variable Policy
|
|
170
|
+
|
|
171
|
+
Open strategic variables are a last-resort output. Before preserving one:
|
|
172
|
+
|
|
173
|
+
1. Check whether it is already settled by user facts.
|
|
174
|
+
2. Check whether completed experiments supplied by the user settle it.
|
|
175
|
+
3. Check whether live evidence and venue-oriented judgment support a clear best
|
|
176
|
+
strategy.
|
|
177
|
+
4. Convert tactical choices to downstream contracts.
|
|
178
|
+
5. Convert experiment-dependent method superiority questions to candidate
|
|
179
|
+
method routes.
|
|
180
|
+
6. Commit to a best-supported stance when a reasonable default exists.
|
|
181
|
+
7. Preserve the variable only if private resources, unpublished constraints,
|
|
182
|
+
user preference, or missing evidence materially affect strategy.
|
|
183
|
+
|
|
184
|
+
When an open variable remains, give a current best-supported stance and allowed
|
|
185
|
+
strategy-level resolutions. Re-audit existing open variables on every revision
|
|
186
|
+
and shrink the list when new facts or evidence support a decision.
|
|
187
|
+
|
|
188
|
+
## Live Research
|
|
189
|
+
|
|
190
|
+
Run live research before drafting or materially revising a blueprint.
|
|
191
|
+
|
|
192
|
+
Use the exact MCP tool:
|
|
193
|
+
|
|
194
|
+
- server: `academic_army_mcp_tools`
|
|
195
|
+
- tool: `deepresearch`
|
|
196
|
+
- canonical name: `mcp__academic_army_mcp_tools__deepresearch`
|
|
197
|
+
|
|
198
|
+
Research should cover:
|
|
199
|
+
|
|
200
|
+
- current target-venue scope, posture, reviewer expectations, and paper fit
|
|
201
|
+
- closest substrates, nearest competitors, and literature boundaries
|
|
202
|
+
- relevant methods, systems, datasets, benchmarks, code, and theoretical frames
|
|
203
|
+
- recent high-quality storytelling exemplars from the target venue or adjacent
|
|
204
|
+
venues such as SIGGRAPH, CVPR, SIGCOMM, NSDI, INFOCOM, or field equivalents
|
|
205
|
+
- canonical older precedents that still shape method, benchmark, or evaluation
|
|
206
|
+
expectations
|
|
207
|
+
- source-backed evaluation patterns affecting the minimum evidence chain
|
|
208
|
+
- autoresearch, paper-writing, literature-review, scientific-discovery,
|
|
209
|
+
workflow-agent, prompt-template, benchmark, and open-source-tool patterns that
|
|
210
|
+
improve this blueprint as a downstream AI-planning artifact. Use these
|
|
211
|
+
findings to improve the blueprint's role as an inheritable planning object;
|
|
212
|
+
do not dump generic autoresearch background into either artifact.
|
|
213
|
+
|
|
214
|
+
For storytelling and writing-style patterns, prefer recent papers from the last
|
|
215
|
+
one to three years. For methods, datasets, benchmarks, baselines, and technical
|
|
216
|
+
background, include older canonical work when still load-bearing. Prefer
|
|
217
|
+
primary sources: official venue pages, reviewer guidance, papers, project pages,
|
|
218
|
+
repositories, benchmark pages, datasets, and official documentation. Separate
|
|
219
|
+
source-supported facts from inference.
|
|
220
|
+
|
|
221
|
+
Use this research prompt shape:
|
|
222
|
+
|
|
223
|
+
```text
|
|
224
|
+
Research brief:
|
|
225
|
+
[IDEA, TARGET PROBLEM, KNOWN SUBSTRATES, USER-SPECIFIED FACTS,
|
|
226
|
+
PREFERENCES, CURRENT CANDIDATE METHOD SPACE, AND CURRENT OPEN VALIDATION ITEMS]
|
|
227
|
+
|
|
228
|
+
Target venue:
|
|
229
|
+
[VENUE]
|
|
230
|
+
|
|
231
|
+
Return concise evidence in nine sections:
|
|
232
|
+
1. Venue posture, target readers, and reviewer expectations
|
|
233
|
+
2. Closest technical substrates and literature boundary
|
|
234
|
+
3. Nearest competing work and differentiated positioning
|
|
235
|
+
4. Recent high-impact storytelling exemplars and why their framing works
|
|
236
|
+
5. Cross-paper synthesis: framing, contribution, and evidence-chain lessons
|
|
237
|
+
6. Canonical method, dataset, benchmark, baseline-family, and evaluation precedents
|
|
238
|
+
7. Candidate method space: plausible routes, combinations, evidence needed, and disqualifiers
|
|
239
|
+
8. Strategy implications: decisions to commit, routes to delegate to experiments, alternatives to reject, and only genuinely unresolved items
|
|
240
|
+
9. Source-role table with title, venue/year, stable link, role, persuasive pattern, and blueprint lesson
|
|
241
|
+
|
|
242
|
+
Separate source-supported facts from inference. Prioritize sources that
|
|
243
|
+
materially change the paper strategy. Use recent papers for storytelling
|
|
244
|
+
analysis and older papers only for load-bearing precedents. Identify the
|
|
245
|
+
best-supported paper strategy. Label experiment-dependent method routes as
|
|
246
|
+
candidates rather than proven best choices.
|
|
247
|
+
```
|
|
248
|
+
|
|
249
|
+
In `paper_blueprint.explain.md`, start the research discussion with three to
|
|
250
|
+
five cross-paper patterns. Then explain six to eight load-bearing sources: what
|
|
251
|
+
each source establishes, why its framing or evidence style matters, and which
|
|
252
|
+
blueprint choice it influences.
|
|
253
|
+
|
|
254
|
+
## Workflow
|
|
255
|
+
|
|
256
|
+
1. Read the request, prior explanation file when present, and
|
|
257
|
+
`references/blueprint-schema.md`.
|
|
258
|
+
2. Build or update the strategy ledger.
|
|
259
|
+
3. Shrink candidate open variables using the open-variable policy.
|
|
260
|
+
4. Run `academic_army_mcp_tools.deepresearch`.
|
|
261
|
+
5. Synthesize venue fit, prior-work boundary, storytelling patterns, method
|
|
262
|
+
families, benchmark expectations, evidence precedents, and blueprint-design
|
|
263
|
+
implications from autoresearch workflow evidence.
|
|
264
|
+
6. Commit to the best-supported thesis, paper goals, contribution strategy,
|
|
265
|
+
one to three claims, novelty boundary, and minimum sufficient evidence chain.
|
|
266
|
+
7. Build a candidate method space when method superiority depends on future
|
|
267
|
+
experiments.
|
|
268
|
+
8. Retain only genuinely unresolved strategic variables.
|
|
269
|
+
9. Draft both Markdown files.
|
|
270
|
+
10. Audit for language separation, strategic level, source support, candidate
|
|
271
|
+
method routing, open-variable shrinkage, and downstream inheritance.
|
|
272
|
+
11. Confirm exactly two Markdown files exist in the output directory.
|
|
273
|
+
|
|
274
|
+
## Quality Bar
|
|
275
|
+
|
|
276
|
+
`paper_blueprint.md` should:
|
|
277
|
+
|
|
278
|
+
- use semantic headings rather than global lookup-heavy codes
|
|
279
|
+
- state the selected paper strategy directly
|
|
280
|
+
- define the idea, problem, venue fit, target readers, field context, and
|
|
281
|
+
reviewer expectation
|
|
282
|
+
- position against existing methods, systems, datasets, benchmarks, or theory
|
|
283
|
+
- define one to three core claims and required evidence classes
|
|
284
|
+
- state contribution shapes and how they serve the claims
|
|
285
|
+
- describe high-level method or system logic without tactical implementation
|
|
286
|
+
- define evidence goals, benchmark/baseline/metric principles, and minimum
|
|
287
|
+
sufficient evidence chain
|
|
288
|
+
- state risks as open variables, claim-calibration rules, or downstream
|
|
289
|
+
planning constraints
|
|
290
|
+
- define downstream contracts for content, method, experiment, figure, writing,
|
|
291
|
+
and review planning when relevant
|
|
292
|
+
|
|
293
|
+
`paper_blueprint.explain.md` should:
|
|
294
|
+
|
|
295
|
+
- begin with a concise paper-strategy summary
|
|
296
|
+
- record accumulated user-specified facts near the start
|
|
297
|
+
- explain cross-paper patterns and load-bearing sources before detailed
|
|
298
|
+
blueprint rationale
|
|
299
|
+
- follow the blueprint's heading order when explaining important decisions
|
|
300
|
+
- restate each decision before explaining why it follows from user facts, live
|
|
301
|
+
evidence, venue posture, high-quality paper patterns, claim needs, and
|
|
302
|
+
downstream planning constraints
|
|
303
|
+
- explain why candidate method routes remain candidates when experiments must
|
|
304
|
+
decide them
|
|
305
|
+
- explain unresolved strategic variables as paper-design variables with current
|
|
306
|
+
best-supported stances
|
|
307
|
+
- use section titles and natural names rather than global codes or mechanical
|
|
308
|
+
cross-references
|
|
309
|
+
|
|
310
|
+
## Boundary Checks
|
|
311
|
+
|
|
312
|
+
Keep the artifacts focused on the paper being designed. Represent detailed
|
|
313
|
+
algorithms, training recipes, implementation steps, section plans, experiment
|
|
314
|
+
tables, figure layouts, run budgets, plotting plans, and reviewer operations as
|
|
315
|
+
strategic constraints, evidence needs, candidate-route conditions, or
|
|
316
|
+
downstream contracts.
|
|
317
|
+
|
|
318
|
+
Keep user-facing explanation at the paper-design layer. Explain why the paper
|
|
319
|
+
strategy is selected; leave skill workflow, tool rationale, template choices,
|
|
320
|
+
and artifact-management details out of the two blueprint files.
|
|
321
|
+
|
|
322
|
+
Before delivery, remove or rewrite user-facing reminders such as `Artifact
|
|
323
|
+
cautions`, `Assumptions to validate`, `Do not assume reviewers will run code`,
|
|
324
|
+
workflow notes, access caveats, or implementation to-do lists. If such content
|
|
325
|
+
is relevant to the paper, express it as an objective design constraint,
|
|
326
|
+
evidence requirement, open strategic variable, claim-calibration rule, or
|
|
327
|
+
downstream planning contract.
|
|
328
|
+
|
|
329
|
+
## Final Response
|
|
330
|
+
|
|
331
|
+
Return a concise completion note with the two artifact paths, the paper identity
|
|
332
|
+
summary, unresolved strategic-variable names when present, and candidate
|
|
333
|
+
method-space status when present. Paste artifact contents only when the current
|
|
334
|
+
user explicitly requests pasted contents or when the surrounding task contract
|
|
335
|
+
outside this skill requires it. Do not add any extra files to the artifact
|
|
336
|
+
directory.
|
|
@@ -0,0 +1,11 @@
|
|
|
1
|
+
interface:
|
|
2
|
+
display_name: "Academic Army Architect"
|
|
3
|
+
short_description: "Strategic paper blueprint with Chinese rationale"
|
|
4
|
+
default_prompt: "Use $academic-army-architect to refine this paper idea into an evidence-grounded strategic blueprint for downstream planning."
|
|
5
|
+
|
|
6
|
+
dependencies:
|
|
7
|
+
tools:
|
|
8
|
+
- type: "mcp"
|
|
9
|
+
value: "academic_army_mcp_tools"
|
|
10
|
+
description: "Provides academic_army_mcp_tools.deepresearch for live venue, literature, exemplar-paper, evaluation-precedent, and reviewer-context evidence."
|
|
11
|
+
transport: "stdio"
|
|
@@ -0,0 +1,345 @@
|
|
|
1
|
+
# Blueprint Schema and Audit
|
|
2
|
+
|
|
3
|
+
Read this file before drafting or auditing the two blueprint artifacts.
|
|
4
|
+
|
|
5
|
+
## English Blueprint Schema
|
|
6
|
+
|
|
7
|
+
Use semantic headings and natural titles. Local numbering inside one section is
|
|
8
|
+
acceptable when it improves clarity; section titles should carry the meaning
|
|
9
|
+
without requiring lookup-heavy labels.
|
|
10
|
+
|
|
11
|
+
Use this default structure:
|
|
12
|
+
|
|
13
|
+
```markdown
|
|
14
|
+
# Paper Blueprint: <Working Title>
|
|
15
|
+
|
|
16
|
+
## Paper Identity
|
|
17
|
+
## Core Thesis and Reader Promise
|
|
18
|
+
## Target Venue Fit
|
|
19
|
+
## Problem Context and Prior-Work Gap
|
|
20
|
+
## Paper Goals
|
|
21
|
+
## Contribution Strategy
|
|
22
|
+
## Claim-Evidence Architecture
|
|
23
|
+
## High-Level Design Logic and Scope
|
|
24
|
+
## Candidate Method Space
|
|
25
|
+
## Evidence Strategy
|
|
26
|
+
## Downstream Planning Constraints
|
|
27
|
+
## Open Strategic Variables
|
|
28
|
+
```
|
|
29
|
+
|
|
30
|
+
Omit `Candidate Method Space` only when the method direction is strategically
|
|
31
|
+
settled and no experiment-dependent method route needs to be compared.
|
|
32
|
+
|
|
33
|
+
Omit `Open Strategic Variables` when no unresolved strategy-level choice
|
|
34
|
+
remains.
|
|
35
|
+
|
|
36
|
+
### Paper Identity
|
|
37
|
+
|
|
38
|
+
State the working title, research area, field context, research object, target
|
|
39
|
+
readers, venue, paper type, and closest substrate when applicable.
|
|
40
|
+
|
|
41
|
+
Keep accumulated user facts in the Chinese explanation; use the English
|
|
42
|
+
blueprint for the selected paper strategy and downstream contracts.
|
|
43
|
+
|
|
44
|
+
### Core Thesis and Reader Promise
|
|
45
|
+
|
|
46
|
+
State the core idea, problem pressure, high-level insight, and reviewer-facing
|
|
47
|
+
promise. Keep the thesis compact enough to guide later planning.
|
|
48
|
+
|
|
49
|
+
### Target Venue Fit
|
|
50
|
+
|
|
51
|
+
State why the problem matters to venue readers, how the paper matches current
|
|
52
|
+
venue posture, and which contribution and evidence style the fit requires. Use
|
|
53
|
+
live evidence rather than fixed venue stereotypes.
|
|
54
|
+
|
|
55
|
+
### Problem Context and Prior-Work Gap
|
|
56
|
+
|
|
57
|
+
State the strongest existing approaches, their key limitation relative to this
|
|
58
|
+
idea, and the differentiated position. Treat inherited components as
|
|
59
|
+
substrates. Describe nearest competing work as a comparison boundary.
|
|
60
|
+
|
|
61
|
+
Select a single best-supported paper position. Preserve alternatives only when
|
|
62
|
+
they are experiment-dependent candidate routes or genuine open strategic
|
|
63
|
+
variables.
|
|
64
|
+
|
|
65
|
+
### Paper Goals
|
|
66
|
+
|
|
67
|
+
Define three to six goals. Each goal states:
|
|
68
|
+
|
|
69
|
+
- `Objective`: target state.
|
|
70
|
+
- `Role in the argument`: support for the thesis or a core claim.
|
|
71
|
+
- `Success signal`: evidence-level condition showing the goal is met.
|
|
72
|
+
|
|
73
|
+
Each goal must guide at least one downstream planning area.
|
|
74
|
+
|
|
75
|
+
### Contribution Strategy
|
|
76
|
+
|
|
77
|
+
Name intended contribution shapes, such as method, system, dataset, benchmark,
|
|
78
|
+
analysis, theory, measurement, or application insight. State how each
|
|
79
|
+
contribution serves a claim and works with the other contributions.
|
|
80
|
+
|
|
81
|
+
Distinguish:
|
|
82
|
+
|
|
83
|
+
- selected strategic contribution direction
|
|
84
|
+
- inherited substrate or prior component
|
|
85
|
+
- candidate method route that still needs experimental convergence
|
|
86
|
+
|
|
87
|
+
### Claim-Evidence Architecture
|
|
88
|
+
|
|
89
|
+
Define one to three core claims with natural-language titles. For each claim,
|
|
90
|
+
state:
|
|
91
|
+
|
|
92
|
+
- claim
|
|
93
|
+
- why it matters
|
|
94
|
+
- supporting contribution components
|
|
95
|
+
- required evidence class
|
|
96
|
+
- calibration boundary if evidence is narrower than the strongest claim
|
|
97
|
+
|
|
98
|
+
Keep evidence at claim level, not experiment-protocol level.
|
|
99
|
+
|
|
100
|
+
Treat method, module, dataset, or benchmark superiority as evidence-backed only
|
|
101
|
+
when completed evidence is already supplied by the user; otherwise place the
|
|
102
|
+
choice in the candidate method space or evidence strategy.
|
|
103
|
+
|
|
104
|
+
### High-Level Design Logic and Scope
|
|
105
|
+
|
|
106
|
+
State canonical terminology, conceptual mechanism, positive scope, inherited
|
|
107
|
+
capabilities, explicit novelty boundary, and important risks or weak points for
|
|
108
|
+
later planning. Leave formulas, modules, parameters, training details, and
|
|
109
|
+
engineering steps to downstream skills.
|
|
110
|
+
|
|
111
|
+
Express selected strategies as decisions. Express experiment-dependent method
|
|
112
|
+
families in `Candidate Method Space`, not as final conclusions.
|
|
113
|
+
|
|
114
|
+
### Candidate Method Space
|
|
115
|
+
|
|
116
|
+
Use this section when method superiority depends on future experiments. This is
|
|
117
|
+
not a user-open question. It is a downstream method/experiment-planning
|
|
118
|
+
contract.
|
|
119
|
+
|
|
120
|
+
Start with:
|
|
121
|
+
|
|
122
|
+
```markdown
|
|
123
|
+
This section contains experiment-dependent candidate routes. Downstream method
|
|
124
|
+
and experiment planning must compare these routes before promoting one into the
|
|
125
|
+
final proposed method or converting it into a baseline, ablation, or rejected
|
|
126
|
+
route.
|
|
127
|
+
```
|
|
128
|
+
|
|
129
|
+
For each route, use:
|
|
130
|
+
|
|
131
|
+
```markdown
|
|
132
|
+
### <Route name>
|
|
133
|
+
|
|
134
|
+
Status: candidate; experiment-dependent.
|
|
135
|
+
|
|
136
|
+
Strategic purpose: <how this route could support the thesis or a core claim>.
|
|
137
|
+
|
|
138
|
+
Source or theoretical motivation: <source-backed or reasoning-backed basis>.
|
|
139
|
+
|
|
140
|
+
Potential contribution role: <proposed method core, supporting module,
|
|
141
|
+
baseline candidate, ablation variant, or risk-control mechanism>.
|
|
142
|
+
|
|
143
|
+
Evidence needed to promote: <claim-level evidence class, not a run matrix>.
|
|
144
|
+
|
|
145
|
+
Disqualifier or demotion condition: <what result or constraint would make it a
|
|
146
|
+
baseline, ablation, or rejected route>.
|
|
147
|
+
|
|
148
|
+
Fair comparison principle: <budget, inputs, outputs, protocol, implementation
|
|
149
|
+
quality, or resource-control principle>.
|
|
150
|
+
```
|
|
151
|
+
|
|
152
|
+
Keep this section strategic: describe route logic, evidence needs, demotion
|
|
153
|
+
conditions, and fair comparison principles while leaving hyperparameters,
|
|
154
|
+
training recipes, ablation tables, and implementation steps to downstream
|
|
155
|
+
planning.
|
|
156
|
+
|
|
157
|
+
### Evidence Strategy
|
|
158
|
+
|
|
159
|
+
State:
|
|
160
|
+
|
|
161
|
+
- minimum sufficient evidence chain for the core claims
|
|
162
|
+
- relevant effectiveness, generalization, robustness, efficiency,
|
|
163
|
+
interpretability, deployment-value, or insight goals
|
|
164
|
+
- dataset, benchmark, baseline-family, and metric-selection principles
|
|
165
|
+
- method-route convergence evidence when candidate routes remain
|
|
166
|
+
- evidence breadth and fairness obligations
|
|
167
|
+
- evidence-dependent claim calibration
|
|
168
|
+
|
|
169
|
+
Explain why each evidence class is necessary. Delegate exact datasets,
|
|
170
|
+
baseline implementations, metrics, tables, run matrices, and resource budgets.
|
|
171
|
+
|
|
172
|
+
When candidate method routes remain, state how downstream experiments should
|
|
173
|
+
convert them into a proposed method, baseline, ablation variant, or rejected
|
|
174
|
+
route. Emphasize fair comparison principles rather than exact experiment
|
|
175
|
+
design.
|
|
176
|
+
|
|
177
|
+
### Downstream Planning Constraints
|
|
178
|
+
|
|
179
|
+
Add contracts for relevant planning areas:
|
|
180
|
+
|
|
181
|
+
```markdown
|
|
182
|
+
### <Planning Area> Contract
|
|
183
|
+
|
|
184
|
+
Preserve:
|
|
185
|
+
- <strategic invariant>
|
|
186
|
+
|
|
187
|
+
Delegate:
|
|
188
|
+
- <tactical design space>
|
|
189
|
+
```
|
|
190
|
+
|
|
191
|
+
Typical areas are content, method, experiment, figure, writing, and optional
|
|
192
|
+
review planning. Downstream skills inherit thesis, contribution boundary, claim
|
|
193
|
+
hierarchy, venue posture, terminology, candidate method space, evidence
|
|
194
|
+
requirements, and open variables. They must raise a strategic-variable update
|
|
195
|
+
before materially changing them.
|
|
196
|
+
|
|
197
|
+
### Open Strategic Variables
|
|
198
|
+
|
|
199
|
+
Use this section only for unresolved choices that require user/private
|
|
200
|
+
information and affect paper identity, claim scope, contribution boundary,
|
|
201
|
+
deployment boundary, evidence chain, or downstream contracts.
|
|
202
|
+
|
|
203
|
+
Place method superiority questions that require experiments in `Candidate
|
|
204
|
+
Method Space`; reserve this section for strategy-level variables that need
|
|
205
|
+
user/private information.
|
|
206
|
+
|
|
207
|
+
State once:
|
|
208
|
+
|
|
209
|
+
```markdown
|
|
210
|
+
Default propagation rule: downstream skills must use each variable's current
|
|
211
|
+
best-supported stance unless the user, private project facts, or completed
|
|
212
|
+
evidence resolves that variable.
|
|
213
|
+
```
|
|
214
|
+
|
|
215
|
+
Then use:
|
|
216
|
+
|
|
217
|
+
```markdown
|
|
218
|
+
### <Variable name>
|
|
219
|
+
|
|
220
|
+
Status: unresolved.
|
|
221
|
+
|
|
222
|
+
Affects: <strategic areas>.
|
|
223
|
+
|
|
224
|
+
Why unresolved: <why user facts, live evidence, and venue-oriented judgment do
|
|
225
|
+
not yet support a reliable best choice>.
|
|
226
|
+
|
|
227
|
+
Current best-supported stance: <default propagated to downstream skills>.
|
|
228
|
+
|
|
229
|
+
Allowed resolutions: <short strategy-level alternatives>.
|
|
230
|
+
```
|
|
231
|
+
|
|
232
|
+
## Chinese Explanation Schema
|
|
233
|
+
|
|
234
|
+
`paper_blueprint.explain.md` is a Chinese-language rationale file. Use Chinese
|
|
235
|
+
sentences as the default. Keep English paper titles, venue names, dataset names,
|
|
236
|
+
benchmark names, method names, and technical terms only when those names are
|
|
237
|
+
the conventional field form. Do not mix in English workflow explanation,
|
|
238
|
+
template rationale, or runtime/tooling commentary.
|
|
239
|
+
|
|
240
|
+
Use this default structure:
|
|
241
|
+
|
|
242
|
+
```markdown
|
|
243
|
+
# 论文蓝图说明:<Working Title>
|
|
244
|
+
|
|
245
|
+
## 当前论文方案概括
|
|
246
|
+
## 用户已经明确的内容
|
|
247
|
+
### 已确认约束
|
|
248
|
+
### 偏好但未锁定
|
|
249
|
+
## 当前工作假设
|
|
250
|
+
## 使用的研究信号
|
|
251
|
+
### 跨论文模式
|
|
252
|
+
### 承重信号
|
|
253
|
+
### 额外背景信号
|
|
254
|
+
## 核心出发点
|
|
255
|
+
## 按蓝图顺序解释论文方案
|
|
256
|
+
## 候选方法空间说明
|
|
257
|
+
## 开放验证项
|
|
258
|
+
## 本轮已收缩的开放项
|
|
259
|
+
## 证据变化时如何校准 Claim
|
|
260
|
+
```
|
|
261
|
+
|
|
262
|
+
Omit `候选方法空间说明` when the English blueprint has no candidate method
|
|
263
|
+
space.
|
|
264
|
+
|
|
265
|
+
Omit `开放验证项` when no strategy-level question remains.
|
|
266
|
+
|
|
267
|
+
Omit `本轮已收缩的开放项` on the first run or when nothing was resolved.
|
|
268
|
+
|
|
269
|
+
### Explanation Rules
|
|
270
|
+
|
|
271
|
+
- Start with a concise, independently readable paper-strategy summary.
|
|
272
|
+
- Accumulate explicit user facts under `用户已经明确的内容`.
|
|
273
|
+
- Follow English blueprint heading order under `按蓝图顺序解释论文方案`.
|
|
274
|
+
- Restate each blueprint decision in Chinese before explaining its rationale.
|
|
275
|
+
- Explain relationships among thesis, goals, claims, contribution shapes,
|
|
276
|
+
candidate method routes, evidence chain, venue fit, and downstream
|
|
277
|
+
constraints.
|
|
278
|
+
- Discuss recent high-impact storytelling patterns from the target or adjacent
|
|
279
|
+
venues and connect those patterns to the blueprint's choices.
|
|
280
|
+
- When a materially plausible alternative was rejected, briefly explain why
|
|
281
|
+
the selected strategy better serves the paper goal, venue fit, claims, or
|
|
282
|
+
downstream executability.
|
|
283
|
+
- When method routes remain candidates, explain why the current stage should
|
|
284
|
+
not select a unique final method before experiments.
|
|
285
|
+
- Distinguish user facts, evidence findings, preferences, assumptions,
|
|
286
|
+
candidate method routes, and open validation items.
|
|
287
|
+
- Explain why each open item cannot yet be resolved from user facts, live
|
|
288
|
+
evidence, venue posture, and paper goals; then state the current
|
|
289
|
+
best-supported stance and what each resolution changes.
|
|
290
|
+
- Explain claim calibration separately from user-input changes.
|
|
291
|
+
|
|
292
|
+
Use Chinese sentences as the default. Preserve English terms when they are the
|
|
293
|
+
conventional technical name. Refer to sections by their natural headings or
|
|
294
|
+
plain descriptions rather than lookup-heavy labels.
|
|
295
|
+
|
|
296
|
+
## Boundary Tests
|
|
297
|
+
|
|
298
|
+
The artifacts should contain paper-design information, claim-level evidence
|
|
299
|
+
requirements, strategic constraints, candidate method routes, open variables,
|
|
300
|
+
and Chinese paper-rationale explanation.
|
|
301
|
+
|
|
302
|
+
Before delivery, rewrite tactical or process-level material into paper-level
|
|
303
|
+
strategy. Full paper prose, detailed section plans, run matrices, figure
|
|
304
|
+
layouts, implementation recipes, hyperparameter plans, user-operation advice,
|
|
305
|
+
artifact notes, workflow commentary, and unsupported best-result claims should
|
|
306
|
+
be represented only as contribution boundaries, downstream contracts, evidence
|
|
307
|
+
needs, candidate-route conditions, or claim-calibration rules.
|
|
308
|
+
|
|
309
|
+
Remove sections or phrases that read like user reminders or artifact/runtime
|
|
310
|
+
warnings, including `Artifact cautions`, `Assumptions to validate`, `Do not
|
|
311
|
+
assume reviewers will run code`, access notes, tool-fallback notes, template
|
|
312
|
+
explanations, and process logs. If a reproducibility, artifact, or assumption
|
|
313
|
+
issue affects the paper strategy, express it as an evidence need, deployment
|
|
314
|
+
boundary, open strategic variable, or downstream planning contract.
|
|
315
|
+
|
|
316
|
+
## Final Audit
|
|
317
|
+
|
|
318
|
+
Check:
|
|
319
|
+
|
|
320
|
+
- exactly two Markdown files exist in the requested output directory
|
|
321
|
+
- `paper_blueprint.md` is English-only, declarative, strategic, and AI-facing
|
|
322
|
+
- `paper_blueprint.explain.md` is Chinese-language, preserves conventional
|
|
323
|
+
English technical names naturally, is self-contained, and focuses on
|
|
324
|
+
paper-design rationale
|
|
325
|
+
- accumulated user facts appear only in the Chinese explanation
|
|
326
|
+
- each important rationale first restates the corresponding blueprint decision
|
|
327
|
+
- semantic headings replace lookup-heavy identifier systems
|
|
328
|
+
- readers, venue fit, prior-work gap, goals, contribution strategy, one to
|
|
329
|
+
three claims, minimum sufficient evidence chain, scope, downstream
|
|
330
|
+
constraints, and open variables are present when applicable
|
|
331
|
+
- every core claim has a claim-level evidence requirement
|
|
332
|
+
- user facts remain constraints instead of reopened questions
|
|
333
|
+
- open items are strategic, deduplicated, and shrink across iterations
|
|
334
|
+
- experiment-dependent method choices appear as candidate routes rather than
|
|
335
|
+
user-open validation items
|
|
336
|
+
- candidate method routes include evidence-needed, disqualifier/demotion, and
|
|
337
|
+
fair-comparison principles
|
|
338
|
+
- selected strategies replace unnecessary menus of plausible paper positions
|
|
339
|
+
- tactical choices are delegated
|
|
340
|
+
- recent storytelling exemplars and load-bearing sources influence the Chinese
|
|
341
|
+
explanation
|
|
342
|
+
- venue/high-quality-paper analysis is used to justify concrete blueprint
|
|
343
|
+
decisions rather than dumped as generic background
|
|
344
|
+
- no artifact cautions, workflow explanations, tool-rationale text, access
|
|
345
|
+
notes, or user to-do reminders remain in either artifact
|