academic-army 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (68) hide show
  1. package/.editorconfig +9 -0
  2. package/.github/workflows/publish.yml +44 -0
  3. package/.prettierrc.json +3 -0
  4. package/LICENSE +21 -0
  5. package/README.md +172 -0
  6. package/README.zh-CN.md +172 -0
  7. package/agent-forge.yaml +83 -0
  8. package/eslint.config.js +28 -0
  9. package/install_mcp.py +85 -0
  10. package/mcp-server/__main__.py +33 -0
  11. package/mcp-server/deepresearch/__init__.py +3 -0
  12. package/mcp-server/deepresearch/tools.py +33 -0
  13. package/mcp-server/requirements.txt +4 -0
  14. package/metaskills/README.md +131 -0
  15. package/metaskills/README.zh-CN.md +131 -0
  16. package/metaskills/academic-army-architect/METASKILL.md +91 -0
  17. package/metaskills/academic-army-architect/envolve.sh +9 -0
  18. package/metaskills/academic-army-coding-plan/ENVOLVETASK.md +1 -0
  19. package/metaskills/academic-army-coding-plan/METASKILL.md +118 -0
  20. package/metaskills/academic-army-coding-plan/envolve.sh +9 -0
  21. package/metaskills/academic-army-coding-style/METASKILL.md +292 -0
  22. package/metaskills/academic-army-experiment-plan/ENVOLVETASK.md +1 -0
  23. package/metaskills/academic-army-experiment-plan/METASKILL.md +82 -0
  24. package/metaskills/academic-army-experiment-plan/envolve.sh +9 -0
  25. package/metaskills/academic-army-repo-scaffold/ENVOLVETASK.md +1 -0
  26. package/metaskills/academic-army-repo-scaffold/METASKILL.md +223 -0
  27. package/metaskills/academic-army-repo-scaffold/envolve.sh +9 -0
  28. package/package.json +35 -0
  29. package/runs/develop-skill.sh +17 -0
  30. package/runs/develop.sh +16 -0
  31. package/skills/academic-army-architect/SKILL.md +336 -0
  32. package/skills/academic-army-architect/agents/openai.yaml +11 -0
  33. package/skills/academic-army-architect/references/blueprint-schema.md +345 -0
  34. package/skills/academic-army-coding-plan/SKILL.md +491 -0
  35. package/skills/academic-army-coding-plan/agents/openai.yaml +11 -0
  36. package/skills/academic-army-coding-style/SKILL.md +915 -0
  37. package/skills/academic-army-coding-style/agents/openai.yaml +11 -0
  38. package/skills/academic-army-experiment-plan/SKILL.md +517 -0
  39. package/skills/academic-army-experiment-plan/agents/openai.yaml +11 -0
  40. package/skills/academic-army-repo-scaffold/SKILL.md +756 -0
  41. package/skills/academic-army-repo-scaffold/agents/openai.yaml +10 -0
  42. package/src/README.md +79 -0
  43. package/src/README.zh-CN.md +79 -0
  44. package/src/cli.ts +55 -0
  45. package/src/developing/README.md +146 -0
  46. package/src/developing/README.zh-CN.md +146 -0
  47. package/src/developing/agents/developer.ts +40 -0
  48. package/src/developing/agents/factory.ts +11 -0
  49. package/src/developing/agents/index.ts +8 -0
  50. package/src/developing/agents/manager.ts +74 -0
  51. package/src/developing/agents/prompts.ts +12 -0
  52. package/src/developing/agents/reviewer.ts +44 -0
  53. package/src/developing/agents/trajectory-optimizer.ts +70 -0
  54. package/src/developing/agents/types.ts +41 -0
  55. package/src/developing/index.ts +2 -0
  56. package/src/developing/pipeline.ts +306 -0
  57. package/src/developing/pipelineskill.ts +169 -0
  58. package/src/evolve-skill/README.md +116 -0
  59. package/src/evolve-skill/README.zh-CN.md +116 -0
  60. package/src/evolve-skill/agents/evaluator.ts +28 -0
  61. package/src/evolve-skill/agents/factory.ts +11 -0
  62. package/src/evolve-skill/agents/index.ts +4 -0
  63. package/src/evolve-skill/agents/modifier.ts +27 -0
  64. package/src/evolve-skill/agents/runner.ts +19 -0
  65. package/src/evolve-skill/index.ts +1 -0
  66. package/src/evolve-skill/pipeline.ts +140 -0
  67. package/src/pipeline.ts +65 -0
  68. package/tsconfig.json +22 -0
@@ -0,0 +1,336 @@
1
+ ---
2
+ name: academic-army-architect
3
+ description: >-
4
+ Generate or revise a goal-oriented strategic paper blueprint for an
5
+ autoresearch pipeline. Use when Codex must refine a paper idea, position a
6
+ submission for a target venue, define claims and contribution boundaries,
7
+ build an evidence-grounded candidate method space, audit or revise an
8
+ existing blueprint, or hand off stable paper strategy to downstream content,
9
+ method, experiment, figure, writing, or review-planning skills. Always create
10
+ exactly two Markdown artifacts: English-only `paper_blueprint.md` and
11
+ Chinese-language `paper_blueprint.explain.md`. Use
12
+ `academic_army_mcp_tools.deepresearch` for live venue, literature,
13
+ high-quality-paper, method, benchmark, and autoresearch-workflow evidence.
14
+ ---
15
+
16
+ # Academic Army Architect
17
+
18
+ ## Mission
19
+
20
+ Create a target-oriented paper strategy blueprint for downstream AI planning.
21
+ The blueprint is not a full paper, experiment plan, figure plan, implementation
22
+ plan, or advice checklist. It is the stable paper core that later skills inherit:
23
+
24
+ - paper identity, target venue, readers, field context, and reviewer posture
25
+ - core thesis, problem pressure, reader promise, and paper goals
26
+ - prior-work boundary and differentiated positioning
27
+ - one to three core claims and the evidence each claim needs
28
+ - contribution shapes and novelty boundary
29
+ - high-level method or system logic
30
+ - experiment-dependent candidate method routes
31
+ - minimum sufficient evidence chain and evaluation principles
32
+ - downstream planning constraints and unresolved strategic variables
33
+
34
+ Before drafting or auditing output, read
35
+ [`references/blueprint-schema.md`](references/blueprint-schema.md).
36
+
37
+ ## Output Contract
38
+
39
+ Write exactly two Markdown files in the requested output directory:
40
+
41
+ - `paper_blueprint.md`: English-only, AI-facing paper strategy blueprint.
42
+ - `paper_blueprint.explain.md`: Chinese-language validation companion for the
43
+ user. It uses Chinese sentences as the default and preserves conventional
44
+ English titles, venue names, dataset names, benchmark names, method names,
45
+ and technical terms when those names are standard in the field.
46
+
47
+ Keep the output directory limited to these two Markdown artifacts.
48
+
49
+ `paper_blueprint.md` objectively states the selected paper strategy, claim
50
+ envelope, contribution boundary, candidate method space when needed, evidence
51
+ requirements, downstream inheritance contracts, and open strategic variables.
52
+ It should be usable by later AI skills without reading the Chinese explanation.
53
+
54
+ `paper_blueprint.explain.md` helps the user validate why the blueprint is
55
+ reasonable. It records user-specified facts, restates each important blueprint
56
+ decision before explaining its rationale, and explains paper-design decisions
57
+ only: idea positioning, venue fit, claim logic, evidence chain,
58
+ source-backed patterns, candidate-method reasoning, and open-variable status.
59
+ It should not explain the skill workflow, template choices, tool calls,
60
+ artifact access, or runtime environment.
61
+
62
+ ## Strategic Level
63
+
64
+ Operate at paper-strategy levels 0-2 and route tactical work to downstream
65
+ skills.
66
+
67
+ | Level | This skill owns |
68
+ |---|---|
69
+ | 0: paper identity | idea, field context, target readers, target venue, paper type |
70
+ | 1: thesis and claims | problem pressure, insight, reader promise, goals, claim hierarchy |
71
+ | 2: strategic constraints | prior-work boundary, contribution roles, high-level design logic, candidate method space, evidence principles, downstream contracts, open variables |
72
+
73
+ Translate tactical inputs into strategic contracts:
74
+
75
+ | User input type | Blueprint-level representation |
76
+ |---|---|
77
+ | algorithm, proof, or optimization idea | required method property, selected strategic direction, or candidate method route |
78
+ | module or method combination | candidate route with purpose, evidence needed, demotion condition, and fair-comparison principle |
79
+ | dataset, trace, benchmark, device, or baseline | evidence dimension, fairness obligation, or user-specified evaluation constraint |
80
+ | metric formula | outcome family and claim-evidence requirement |
81
+ | figure or section idea | visual or narrative argument that downstream planning must preserve |
82
+ | implementation detail | inherited capability, deployment boundary, or downstream engineering constraint |
83
+
84
+ ## Strategy Ledger
85
+
86
+ When creating or revising a blueprint, maintain an internal ledger:
87
+
88
+ 1. `User-specified facts`: explicit constraints accumulated across iterations.
89
+ 2. `Evidence-backed findings`: live-research findings tied to source links.
90
+ 3. `Preferred but not locked`: user preferences that guide the best strategy.
91
+ 4. `Working assumptions`: defaults that keep the blueprint coherent.
92
+ 5. `Strategic decisions`: identity, thesis, goals, claims, contribution roles,
93
+ novelty boundary, and scope.
94
+ 6. `Candidate method routes`: experiment-dependent approaches that downstream
95
+ method or experiment planning must compare.
96
+ 7. `Rejected strategic alternatives`: plausible alternatives not selected, with
97
+ a concise paper-level reason.
98
+ 8. `Open strategic variables`: unresolved choices that require private facts,
99
+ unpublished constraints, user preference, or unavailable evidence.
100
+ 9. `Resolved since prior iteration`: items closed by user facts, live evidence,
101
+ experiments, or a documented strategy decision.
102
+
103
+ When revising an existing blueprint, read the prior
104
+ `paper_blueprint.explain.md` first and preserve accumulated user-specified
105
+ facts unless the user explicitly corrects them.
106
+
107
+ Use this precedence order:
108
+
109
+ 1. explicit user correction
110
+ 2. accumulated user-specified fact
111
+ 3. completed experiment result supplied by the user
112
+ 4. evidence-backed finding
113
+ 5. open strategic variable
114
+ 6. preferred but not locked
115
+ 7. working assumption
116
+
117
+ Evidence can refine scope and positioning, but it does not silently override a
118
+ user-specified constraint. Explain any resulting tradeoff in the Chinese file.
119
+
120
+ ## Strategic Judgment
121
+
122
+ Take responsibility for choosing the strongest current paper strategy. Use the
123
+ paper goal, user facts, live evidence, venue posture, source-backed paper
124
+ patterns, claim requirements, and downstream executability to commit to the
125
+ best-supported position.
126
+
127
+ When one option is materially stronger, write only the selected strategy in
128
+ `paper_blueprint.md`. In `paper_blueprint.explain.md`, briefly explain why the
129
+ selected route is more persuasive than the most relevant rejected alternative
130
+ when that contrast helps the user validate the design.
131
+
132
+ Keep an item open only when available information cannot support a reliable
133
+ strategy decision and the choice would materially affect paper identity, claim
134
+ scope, contribution boundary, deployment boundary, evidence chain, or downstream
135
+ contracts.
136
+
137
+ ## Candidate Method Space
138
+
139
+ Distinguish the strategic method direction from experiment-dependent
140
+ implementation routes.
141
+
142
+ Commit to a strategic method direction when venue posture, user facts, and
143
+ literature support it. For example: "the paper needs an interpretable online
144
+ controller with explicit uncertainty and deadline handling."
145
+
146
+ When method superiority depends on future results, create a `Candidate Method
147
+ Space` instead of selecting a final method or asking the user to choose. Each
148
+ candidate route should state:
149
+
150
+ - strategic purpose
151
+ - source or theoretical motivation
152
+ - relation to the core claim
153
+ - potential contribution role
154
+ - evidence needed to promote it into the proposed method
155
+ - disqualifier or demotion condition
156
+ - fair-comparison principle
157
+ - downstream owner, usually method planning or experiment planning
158
+
159
+ Candidate routes can later become the proposed method, a baseline, an ablation
160
+ variant, or a rejected route. Baseline planning may include original prior
161
+ methods, single-backbone variants, candidate routes without the paper's key
162
+ mechanism, partial-module variants, or key-module replacements. Leave exact
163
+ experiments, budgets, hyperparameters, and run matrices to downstream planning.
164
+
165
+ Exclude methods that live research shows are incompatible with the problem,
166
+ inputs, cost envelope, venue posture, or claim needs. Explain the paper-level
167
+ reason in the Chinese file.
168
+
169
+ ## Open Variable Policy
170
+
171
+ Open strategic variables are a last-resort output. Before preserving one:
172
+
173
+ 1. Check whether it is already settled by user facts.
174
+ 2. Check whether completed experiments supplied by the user settle it.
175
+ 3. Check whether live evidence and venue-oriented judgment support a clear best
176
+ strategy.
177
+ 4. Convert tactical choices to downstream contracts.
178
+ 5. Convert experiment-dependent method superiority questions to candidate
179
+ method routes.
180
+ 6. Commit to a best-supported stance when a reasonable default exists.
181
+ 7. Preserve the variable only if private resources, unpublished constraints,
182
+ user preference, or missing evidence materially affect strategy.
183
+
184
+ When an open variable remains, give a current best-supported stance and allowed
185
+ strategy-level resolutions. Re-audit existing open variables on every revision
186
+ and shrink the list when new facts or evidence support a decision.
187
+
188
+ ## Live Research
189
+
190
+ Run live research before drafting or materially revising a blueprint.
191
+
192
+ Use the exact MCP tool:
193
+
194
+ - server: `academic_army_mcp_tools`
195
+ - tool: `deepresearch`
196
+ - canonical name: `mcp__academic_army_mcp_tools__deepresearch`
197
+
198
+ Research should cover:
199
+
200
+ - current target-venue scope, posture, reviewer expectations, and paper fit
201
+ - closest substrates, nearest competitors, and literature boundaries
202
+ - relevant methods, systems, datasets, benchmarks, code, and theoretical frames
203
+ - recent high-quality storytelling exemplars from the target venue or adjacent
204
+ venues such as SIGGRAPH, CVPR, SIGCOMM, NSDI, INFOCOM, or field equivalents
205
+ - canonical older precedents that still shape method, benchmark, or evaluation
206
+ expectations
207
+ - source-backed evaluation patterns affecting the minimum evidence chain
208
+ - autoresearch, paper-writing, literature-review, scientific-discovery,
209
+ workflow-agent, prompt-template, benchmark, and open-source-tool patterns that
210
+ improve this blueprint as a downstream AI-planning artifact. Use these
211
+ findings to improve the blueprint's role as an inheritable planning object;
212
+ do not dump generic autoresearch background into either artifact.
213
+
214
+ For storytelling and writing-style patterns, prefer recent papers from the last
215
+ one to three years. For methods, datasets, benchmarks, baselines, and technical
216
+ background, include older canonical work when still load-bearing. Prefer
217
+ primary sources: official venue pages, reviewer guidance, papers, project pages,
218
+ repositories, benchmark pages, datasets, and official documentation. Separate
219
+ source-supported facts from inference.
220
+
221
+ Use this research prompt shape:
222
+
223
+ ```text
224
+ Research brief:
225
+ [IDEA, TARGET PROBLEM, KNOWN SUBSTRATES, USER-SPECIFIED FACTS,
226
+ PREFERENCES, CURRENT CANDIDATE METHOD SPACE, AND CURRENT OPEN VALIDATION ITEMS]
227
+
228
+ Target venue:
229
+ [VENUE]
230
+
231
+ Return concise evidence in nine sections:
232
+ 1. Venue posture, target readers, and reviewer expectations
233
+ 2. Closest technical substrates and literature boundary
234
+ 3. Nearest competing work and differentiated positioning
235
+ 4. Recent high-impact storytelling exemplars and why their framing works
236
+ 5. Cross-paper synthesis: framing, contribution, and evidence-chain lessons
237
+ 6. Canonical method, dataset, benchmark, baseline-family, and evaluation precedents
238
+ 7. Candidate method space: plausible routes, combinations, evidence needed, and disqualifiers
239
+ 8. Strategy implications: decisions to commit, routes to delegate to experiments, alternatives to reject, and only genuinely unresolved items
240
+ 9. Source-role table with title, venue/year, stable link, role, persuasive pattern, and blueprint lesson
241
+
242
+ Separate source-supported facts from inference. Prioritize sources that
243
+ materially change the paper strategy. Use recent papers for storytelling
244
+ analysis and older papers only for load-bearing precedents. Identify the
245
+ best-supported paper strategy. Label experiment-dependent method routes as
246
+ candidates rather than proven best choices.
247
+ ```
248
+
249
+ In `paper_blueprint.explain.md`, start the research discussion with three to
250
+ five cross-paper patterns. Then explain six to eight load-bearing sources: what
251
+ each source establishes, why its framing or evidence style matters, and which
252
+ blueprint choice it influences.
253
+
254
+ ## Workflow
255
+
256
+ 1. Read the request, prior explanation file when present, and
257
+ `references/blueprint-schema.md`.
258
+ 2. Build or update the strategy ledger.
259
+ 3. Shrink candidate open variables using the open-variable policy.
260
+ 4. Run `academic_army_mcp_tools.deepresearch`.
261
+ 5. Synthesize venue fit, prior-work boundary, storytelling patterns, method
262
+ families, benchmark expectations, evidence precedents, and blueprint-design
263
+ implications from autoresearch workflow evidence.
264
+ 6. Commit to the best-supported thesis, paper goals, contribution strategy,
265
+ one to three claims, novelty boundary, and minimum sufficient evidence chain.
266
+ 7. Build a candidate method space when method superiority depends on future
267
+ experiments.
268
+ 8. Retain only genuinely unresolved strategic variables.
269
+ 9. Draft both Markdown files.
270
+ 10. Audit for language separation, strategic level, source support, candidate
271
+ method routing, open-variable shrinkage, and downstream inheritance.
272
+ 11. Confirm exactly two Markdown files exist in the output directory.
273
+
274
+ ## Quality Bar
275
+
276
+ `paper_blueprint.md` should:
277
+
278
+ - use semantic headings rather than global lookup-heavy codes
279
+ - state the selected paper strategy directly
280
+ - define the idea, problem, venue fit, target readers, field context, and
281
+ reviewer expectation
282
+ - position against existing methods, systems, datasets, benchmarks, or theory
283
+ - define one to three core claims and required evidence classes
284
+ - state contribution shapes and how they serve the claims
285
+ - describe high-level method or system logic without tactical implementation
286
+ - define evidence goals, benchmark/baseline/metric principles, and minimum
287
+ sufficient evidence chain
288
+ - state risks as open variables, claim-calibration rules, or downstream
289
+ planning constraints
290
+ - define downstream contracts for content, method, experiment, figure, writing,
291
+ and review planning when relevant
292
+
293
+ `paper_blueprint.explain.md` should:
294
+
295
+ - begin with a concise paper-strategy summary
296
+ - record accumulated user-specified facts near the start
297
+ - explain cross-paper patterns and load-bearing sources before detailed
298
+ blueprint rationale
299
+ - follow the blueprint's heading order when explaining important decisions
300
+ - restate each decision before explaining why it follows from user facts, live
301
+ evidence, venue posture, high-quality paper patterns, claim needs, and
302
+ downstream planning constraints
303
+ - explain why candidate method routes remain candidates when experiments must
304
+ decide them
305
+ - explain unresolved strategic variables as paper-design variables with current
306
+ best-supported stances
307
+ - use section titles and natural names rather than global codes or mechanical
308
+ cross-references
309
+
310
+ ## Boundary Checks
311
+
312
+ Keep the artifacts focused on the paper being designed. Represent detailed
313
+ algorithms, training recipes, implementation steps, section plans, experiment
314
+ tables, figure layouts, run budgets, plotting plans, and reviewer operations as
315
+ strategic constraints, evidence needs, candidate-route conditions, or
316
+ downstream contracts.
317
+
318
+ Keep user-facing explanation at the paper-design layer. Explain why the paper
319
+ strategy is selected; leave skill workflow, tool rationale, template choices,
320
+ and artifact-management details out of the two blueprint files.
321
+
322
+ Before delivery, remove or rewrite user-facing reminders such as `Artifact
323
+ cautions`, `Assumptions to validate`, `Do not assume reviewers will run code`,
324
+ workflow notes, access caveats, or implementation to-do lists. If such content
325
+ is relevant to the paper, express it as an objective design constraint,
326
+ evidence requirement, open strategic variable, claim-calibration rule, or
327
+ downstream planning contract.
328
+
329
+ ## Final Response
330
+
331
+ Return a concise completion note with the two artifact paths, the paper identity
332
+ summary, unresolved strategic-variable names when present, and candidate
333
+ method-space status when present. Paste artifact contents only when the current
334
+ user explicitly requests pasted contents or when the surrounding task contract
335
+ outside this skill requires it. Do not add any extra files to the artifact
336
+ directory.
@@ -0,0 +1,11 @@
1
+ interface:
2
+ display_name: "Academic Army Architect"
3
+ short_description: "Strategic paper blueprint with Chinese rationale"
4
+ default_prompt: "Use $academic-army-architect to refine this paper idea into an evidence-grounded strategic blueprint for downstream planning."
5
+
6
+ dependencies:
7
+ tools:
8
+ - type: "mcp"
9
+ value: "academic_army_mcp_tools"
10
+ description: "Provides academic_army_mcp_tools.deepresearch for live venue, literature, exemplar-paper, evaluation-precedent, and reviewer-context evidence."
11
+ transport: "stdio"
@@ -0,0 +1,345 @@
1
+ # Blueprint Schema and Audit
2
+
3
+ Read this file before drafting or auditing the two blueprint artifacts.
4
+
5
+ ## English Blueprint Schema
6
+
7
+ Use semantic headings and natural titles. Local numbering inside one section is
8
+ acceptable when it improves clarity; section titles should carry the meaning
9
+ without requiring lookup-heavy labels.
10
+
11
+ Use this default structure:
12
+
13
+ ```markdown
14
+ # Paper Blueprint: <Working Title>
15
+
16
+ ## Paper Identity
17
+ ## Core Thesis and Reader Promise
18
+ ## Target Venue Fit
19
+ ## Problem Context and Prior-Work Gap
20
+ ## Paper Goals
21
+ ## Contribution Strategy
22
+ ## Claim-Evidence Architecture
23
+ ## High-Level Design Logic and Scope
24
+ ## Candidate Method Space
25
+ ## Evidence Strategy
26
+ ## Downstream Planning Constraints
27
+ ## Open Strategic Variables
28
+ ```
29
+
30
+ Omit `Candidate Method Space` only when the method direction is strategically
31
+ settled and no experiment-dependent method route needs to be compared.
32
+
33
+ Omit `Open Strategic Variables` when no unresolved strategy-level choice
34
+ remains.
35
+
36
+ ### Paper Identity
37
+
38
+ State the working title, research area, field context, research object, target
39
+ readers, venue, paper type, and closest substrate when applicable.
40
+
41
+ Keep accumulated user facts in the Chinese explanation; use the English
42
+ blueprint for the selected paper strategy and downstream contracts.
43
+
44
+ ### Core Thesis and Reader Promise
45
+
46
+ State the core idea, problem pressure, high-level insight, and reviewer-facing
47
+ promise. Keep the thesis compact enough to guide later planning.
48
+
49
+ ### Target Venue Fit
50
+
51
+ State why the problem matters to venue readers, how the paper matches current
52
+ venue posture, and which contribution and evidence style the fit requires. Use
53
+ live evidence rather than fixed venue stereotypes.
54
+
55
+ ### Problem Context and Prior-Work Gap
56
+
57
+ State the strongest existing approaches, their key limitation relative to this
58
+ idea, and the differentiated position. Treat inherited components as
59
+ substrates. Describe nearest competing work as a comparison boundary.
60
+
61
+ Select a single best-supported paper position. Preserve alternatives only when
62
+ they are experiment-dependent candidate routes or genuine open strategic
63
+ variables.
64
+
65
+ ### Paper Goals
66
+
67
+ Define three to six goals. Each goal states:
68
+
69
+ - `Objective`: target state.
70
+ - `Role in the argument`: support for the thesis or a core claim.
71
+ - `Success signal`: evidence-level condition showing the goal is met.
72
+
73
+ Each goal must guide at least one downstream planning area.
74
+
75
+ ### Contribution Strategy
76
+
77
+ Name intended contribution shapes, such as method, system, dataset, benchmark,
78
+ analysis, theory, measurement, or application insight. State how each
79
+ contribution serves a claim and works with the other contributions.
80
+
81
+ Distinguish:
82
+
83
+ - selected strategic contribution direction
84
+ - inherited substrate or prior component
85
+ - candidate method route that still needs experimental convergence
86
+
87
+ ### Claim-Evidence Architecture
88
+
89
+ Define one to three core claims with natural-language titles. For each claim,
90
+ state:
91
+
92
+ - claim
93
+ - why it matters
94
+ - supporting contribution components
95
+ - required evidence class
96
+ - calibration boundary if evidence is narrower than the strongest claim
97
+
98
+ Keep evidence at claim level, not experiment-protocol level.
99
+
100
+ Treat method, module, dataset, or benchmark superiority as evidence-backed only
101
+ when completed evidence is already supplied by the user; otherwise place the
102
+ choice in the candidate method space or evidence strategy.
103
+
104
+ ### High-Level Design Logic and Scope
105
+
106
+ State canonical terminology, conceptual mechanism, positive scope, inherited
107
+ capabilities, explicit novelty boundary, and important risks or weak points for
108
+ later planning. Leave formulas, modules, parameters, training details, and
109
+ engineering steps to downstream skills.
110
+
111
+ Express selected strategies as decisions. Express experiment-dependent method
112
+ families in `Candidate Method Space`, not as final conclusions.
113
+
114
+ ### Candidate Method Space
115
+
116
+ Use this section when method superiority depends on future experiments. This is
117
+ not a user-open question. It is a downstream method/experiment-planning
118
+ contract.
119
+
120
+ Start with:
121
+
122
+ ```markdown
123
+ This section contains experiment-dependent candidate routes. Downstream method
124
+ and experiment planning must compare these routes before promoting one into the
125
+ final proposed method or converting it into a baseline, ablation, or rejected
126
+ route.
127
+ ```
128
+
129
+ For each route, use:
130
+
131
+ ```markdown
132
+ ### <Route name>
133
+
134
+ Status: candidate; experiment-dependent.
135
+
136
+ Strategic purpose: <how this route could support the thesis or a core claim>.
137
+
138
+ Source or theoretical motivation: <source-backed or reasoning-backed basis>.
139
+
140
+ Potential contribution role: <proposed method core, supporting module,
141
+ baseline candidate, ablation variant, or risk-control mechanism>.
142
+
143
+ Evidence needed to promote: <claim-level evidence class, not a run matrix>.
144
+
145
+ Disqualifier or demotion condition: <what result or constraint would make it a
146
+ baseline, ablation, or rejected route>.
147
+
148
+ Fair comparison principle: <budget, inputs, outputs, protocol, implementation
149
+ quality, or resource-control principle>.
150
+ ```
151
+
152
+ Keep this section strategic: describe route logic, evidence needs, demotion
153
+ conditions, and fair comparison principles while leaving hyperparameters,
154
+ training recipes, ablation tables, and implementation steps to downstream
155
+ planning.
156
+
157
+ ### Evidence Strategy
158
+
159
+ State:
160
+
161
+ - minimum sufficient evidence chain for the core claims
162
+ - relevant effectiveness, generalization, robustness, efficiency,
163
+ interpretability, deployment-value, or insight goals
164
+ - dataset, benchmark, baseline-family, and metric-selection principles
165
+ - method-route convergence evidence when candidate routes remain
166
+ - evidence breadth and fairness obligations
167
+ - evidence-dependent claim calibration
168
+
169
+ Explain why each evidence class is necessary. Delegate exact datasets,
170
+ baseline implementations, metrics, tables, run matrices, and resource budgets.
171
+
172
+ When candidate method routes remain, state how downstream experiments should
173
+ convert them into a proposed method, baseline, ablation variant, or rejected
174
+ route. Emphasize fair comparison principles rather than exact experiment
175
+ design.
176
+
177
+ ### Downstream Planning Constraints
178
+
179
+ Add contracts for relevant planning areas:
180
+
181
+ ```markdown
182
+ ### <Planning Area> Contract
183
+
184
+ Preserve:
185
+ - <strategic invariant>
186
+
187
+ Delegate:
188
+ - <tactical design space>
189
+ ```
190
+
191
+ Typical areas are content, method, experiment, figure, writing, and optional
192
+ review planning. Downstream skills inherit thesis, contribution boundary, claim
193
+ hierarchy, venue posture, terminology, candidate method space, evidence
194
+ requirements, and open variables. They must raise a strategic-variable update
195
+ before materially changing them.
196
+
197
+ ### Open Strategic Variables
198
+
199
+ Use this section only for unresolved choices that require user/private
200
+ information and affect paper identity, claim scope, contribution boundary,
201
+ deployment boundary, evidence chain, or downstream contracts.
202
+
203
+ Place method superiority questions that require experiments in `Candidate
204
+ Method Space`; reserve this section for strategy-level variables that need
205
+ user/private information.
206
+
207
+ State once:
208
+
209
+ ```markdown
210
+ Default propagation rule: downstream skills must use each variable's current
211
+ best-supported stance unless the user, private project facts, or completed
212
+ evidence resolves that variable.
213
+ ```
214
+
215
+ Then use:
216
+
217
+ ```markdown
218
+ ### <Variable name>
219
+
220
+ Status: unresolved.
221
+
222
+ Affects: <strategic areas>.
223
+
224
+ Why unresolved: <why user facts, live evidence, and venue-oriented judgment do
225
+ not yet support a reliable best choice>.
226
+
227
+ Current best-supported stance: <default propagated to downstream skills>.
228
+
229
+ Allowed resolutions: <short strategy-level alternatives>.
230
+ ```
231
+
232
+ ## Chinese Explanation Schema
233
+
234
+ `paper_blueprint.explain.md` is a Chinese-language rationale file. Use Chinese
235
+ sentences as the default. Keep English paper titles, venue names, dataset names,
236
+ benchmark names, method names, and technical terms only when those names are
237
+ the conventional field form. Do not mix in English workflow explanation,
238
+ template rationale, or runtime/tooling commentary.
239
+
240
+ Use this default structure:
241
+
242
+ ```markdown
243
+ # 论文蓝图说明:<Working Title>
244
+
245
+ ## 当前论文方案概括
246
+ ## 用户已经明确的内容
247
+ ### 已确认约束
248
+ ### 偏好但未锁定
249
+ ## 当前工作假设
250
+ ## 使用的研究信号
251
+ ### 跨论文模式
252
+ ### 承重信号
253
+ ### 额外背景信号
254
+ ## 核心出发点
255
+ ## 按蓝图顺序解释论文方案
256
+ ## 候选方法空间说明
257
+ ## 开放验证项
258
+ ## 本轮已收缩的开放项
259
+ ## 证据变化时如何校准 Claim
260
+ ```
261
+
262
+ Omit `候选方法空间说明` when the English blueprint has no candidate method
263
+ space.
264
+
265
+ Omit `开放验证项` when no strategy-level question remains.
266
+
267
+ Omit `本轮已收缩的开放项` on the first run or when nothing was resolved.
268
+
269
+ ### Explanation Rules
270
+
271
+ - Start with a concise, independently readable paper-strategy summary.
272
+ - Accumulate explicit user facts under `用户已经明确的内容`.
273
+ - Follow English blueprint heading order under `按蓝图顺序解释论文方案`.
274
+ - Restate each blueprint decision in Chinese before explaining its rationale.
275
+ - Explain relationships among thesis, goals, claims, contribution shapes,
276
+ candidate method routes, evidence chain, venue fit, and downstream
277
+ constraints.
278
+ - Discuss recent high-impact storytelling patterns from the target or adjacent
279
+ venues and connect those patterns to the blueprint's choices.
280
+ - When a materially plausible alternative was rejected, briefly explain why
281
+ the selected strategy better serves the paper goal, venue fit, claims, or
282
+ downstream executability.
283
+ - When method routes remain candidates, explain why the current stage should
284
+ not select a unique final method before experiments.
285
+ - Distinguish user facts, evidence findings, preferences, assumptions,
286
+ candidate method routes, and open validation items.
287
+ - Explain why each open item cannot yet be resolved from user facts, live
288
+ evidence, venue posture, and paper goals; then state the current
289
+ best-supported stance and what each resolution changes.
290
+ - Explain claim calibration separately from user-input changes.
291
+
292
+ Use Chinese sentences as the default. Preserve English terms when they are the
293
+ conventional technical name. Refer to sections by their natural headings or
294
+ plain descriptions rather than lookup-heavy labels.
295
+
296
+ ## Boundary Tests
297
+
298
+ The artifacts should contain paper-design information, claim-level evidence
299
+ requirements, strategic constraints, candidate method routes, open variables,
300
+ and Chinese paper-rationale explanation.
301
+
302
+ Before delivery, rewrite tactical or process-level material into paper-level
303
+ strategy. Full paper prose, detailed section plans, run matrices, figure
304
+ layouts, implementation recipes, hyperparameter plans, user-operation advice,
305
+ artifact notes, workflow commentary, and unsupported best-result claims should
306
+ be represented only as contribution boundaries, downstream contracts, evidence
307
+ needs, candidate-route conditions, or claim-calibration rules.
308
+
309
+ Remove sections or phrases that read like user reminders or artifact/runtime
310
+ warnings, including `Artifact cautions`, `Assumptions to validate`, `Do not
311
+ assume reviewers will run code`, access notes, tool-fallback notes, template
312
+ explanations, and process logs. If a reproducibility, artifact, or assumption
313
+ issue affects the paper strategy, express it as an evidence need, deployment
314
+ boundary, open strategic variable, or downstream planning contract.
315
+
316
+ ## Final Audit
317
+
318
+ Check:
319
+
320
+ - exactly two Markdown files exist in the requested output directory
321
+ - `paper_blueprint.md` is English-only, declarative, strategic, and AI-facing
322
+ - `paper_blueprint.explain.md` is Chinese-language, preserves conventional
323
+ English technical names naturally, is self-contained, and focuses on
324
+ paper-design rationale
325
+ - accumulated user facts appear only in the Chinese explanation
326
+ - each important rationale first restates the corresponding blueprint decision
327
+ - semantic headings replace lookup-heavy identifier systems
328
+ - readers, venue fit, prior-work gap, goals, contribution strategy, one to
329
+ three claims, minimum sufficient evidence chain, scope, downstream
330
+ constraints, and open variables are present when applicable
331
+ - every core claim has a claim-level evidence requirement
332
+ - user facts remain constraints instead of reopened questions
333
+ - open items are strategic, deduplicated, and shrink across iterations
334
+ - experiment-dependent method choices appear as candidate routes rather than
335
+ user-open validation items
336
+ - candidate method routes include evidence-needed, disqualifier/demotion, and
337
+ fair-comparison principles
338
+ - selected strategies replace unnecessary menus of plausible paper positions
339
+ - tactical choices are delegated
340
+ - recent storytelling exemplars and load-bearing sources influence the Chinese
341
+ explanation
342
+ - venue/high-quality-paper analysis is used to justify concrete blueprint
343
+ decisions rather than dumped as generic background
344
+ - no artifact cautions, workflow explanations, tool-rationale text, access
345
+ notes, or user to-do reminders remain in either artifact