superlab 0.1.78 → 0.1.79
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/package-assets/shared/lab/.managed/templates/rebuttal-panel.md +18 -6
- package/package-assets/shared/skills/lab/references/paper-writing/argument-stress-test.md +34 -0
- package/package-assets/shared/skills/lab/references/paper-writing/section-question-bank.md +59 -0
- package/package-assets/shared/skills/lab/references/paper-writing/writing-anti-patterns.md +33 -0
- package/package-assets/shared/skills/lab/references/rebuttal-mode.md +22 -0
- package/package-assets/shared/skills/lab/stages/review.md +8 -4
- package/package-assets/shared/skills/lab/stages/write.md +6 -3
- package/package.json +1 -1
|
@@ -22,9 +22,15 @@
|
|
|
22
22
|
|
|
23
23
|
## External Rebuttal Intake
|
|
24
24
|
|
|
25
|
-
| Source | Raw criticism summary | Affected unit | Reviewer axis | Severity | Route | Acceptance check |
|
|
26
|
-
| --- | --- | --- | --- | --- | --- | --- |
|
|
27
|
-
| | | | | | | |
|
|
25
|
+
| Source | Raw criticism summary | Affected unit | Reviewer axis | Severity | Why it matters | Route | Acceptance check |
|
|
26
|
+
| --- | --- | --- | --- | --- | --- | --- | --- |
|
|
27
|
+
| | | | | | | | |
|
|
28
|
+
|
|
29
|
+
## Revision Traceability
|
|
30
|
+
|
|
31
|
+
| Author claim or revised statement | Claimed artifact location | Independently verified | Remaining gap |
|
|
32
|
+
| --- | --- | --- | --- |
|
|
33
|
+
| | | | |
|
|
28
34
|
|
|
29
35
|
## Reviewer Panel Findings
|
|
30
36
|
|
|
@@ -70,9 +76,15 @@
|
|
|
70
76
|
|
|
71
77
|
## Actionable Issue Register
|
|
72
78
|
|
|
73
|
-
| ID | Axis | Severity | Affected artifact | Finding | Required fix | Route | Acceptance check | Core mutation required |
|
|
74
|
-
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
|
|
75
|
-
| | | | | | | | | |
|
|
79
|
+
| ID | Axis | Severity | Affected artifact | Finding | Why it matters | Required fix | Route | Acceptance check | Core mutation required |
|
|
80
|
+
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
|
|
81
|
+
| | | | | | | | | | |
|
|
82
|
+
|
|
83
|
+
## Consensus / Disagreement / Resolution
|
|
84
|
+
|
|
85
|
+
- Consensus:
|
|
86
|
+
- Disagreement:
|
|
87
|
+
- Resolution:
|
|
76
88
|
|
|
77
89
|
## Core Mutation Check
|
|
78
90
|
|
|
@@ -0,0 +1,34 @@
|
|
|
1
|
+
# Argument Stress Test
|
|
2
|
+
|
|
3
|
+
Run this after the logic pass and before language polish. The goal is to break weak argument chains early.
|
|
4
|
+
|
|
5
|
+
## Required Checks
|
|
6
|
+
|
|
7
|
+
1. **Weakest-link test**
|
|
8
|
+
Which paragraph carries the most unsupported inference? Rewrite or narrow it first.
|
|
9
|
+
|
|
10
|
+
2. **Reverse-claim test**
|
|
11
|
+
If the main claim were false, which sentence in this section would fail first? That sentence usually reveals the overclaim.
|
|
12
|
+
|
|
13
|
+
3. **Alternative-explanation test**
|
|
14
|
+
What simpler explanation could produce the same pattern? State it explicitly and check whether the section already weakens it.
|
|
15
|
+
|
|
16
|
+
4. **Boundary test**
|
|
17
|
+
What does the current evidence still not justify? Add that limit before polishing.
|
|
18
|
+
|
|
19
|
+
5. **Evidence-trace test**
|
|
20
|
+
Can every major claim be traced to a figure, table, protocol statement, citation, or definition already present?
|
|
21
|
+
|
|
22
|
+
## Section-Specific Triggers
|
|
23
|
+
|
|
24
|
+
- **Introduction**: would the paper still look necessary if the claimed challenge were softened?
|
|
25
|
+
- **Method**: could a reviewer say the module is an arbitrary stack choice?
|
|
26
|
+
- **Experiments**: does the prose explain what the result teaches, not only what number is larger?
|
|
27
|
+
- **Conclusion**: does any sentence exceed what the experiments actually support?
|
|
28
|
+
|
|
29
|
+
## Repair Moves
|
|
30
|
+
|
|
31
|
+
1. Replace vague claims with a narrower mechanism claim.
|
|
32
|
+
2. Add one explicit bridge sentence from evidence to interpretation.
|
|
33
|
+
3. Move unsupported motivation out of results and back into hypothesis language.
|
|
34
|
+
4. State the remaining alternative explanation if it is not fully ruled out.
|
|
@@ -0,0 +1,59 @@
|
|
|
1
|
+
# Section Question Bank
|
|
2
|
+
|
|
3
|
+
Use this file before drafting or revising a paper section. The goal is to force one pass on section purpose before wording.
|
|
4
|
+
|
|
5
|
+
## Global Questions
|
|
6
|
+
|
|
7
|
+
1. What single job must this section do for the paper?
|
|
8
|
+
2. What must the reader understand after this section that they did not understand before?
|
|
9
|
+
3. Which claim, protocol, metric, or boundary does this section depend on?
|
|
10
|
+
4. Which paragraph is most likely to confuse a skeptical reviewer?
|
|
11
|
+
5. Which sentence would become false if one supporting result disappeared?
|
|
12
|
+
|
|
13
|
+
## Abstract
|
|
14
|
+
|
|
15
|
+
1. Is the task clear in one pass?
|
|
16
|
+
2. Is the gap tied to prior work rather than hype?
|
|
17
|
+
3. Is the mechanism-level insight visible?
|
|
18
|
+
4. Is the main result bounded by scope?
|
|
19
|
+
5. Is the main boundary stated?
|
|
20
|
+
|
|
21
|
+
## Introduction
|
|
22
|
+
|
|
23
|
+
1. What common assumption or prior explanation is being challenged?
|
|
24
|
+
2. What failure, anomaly, or unmet need forces the paper to exist?
|
|
25
|
+
3. What is the core insight anchor?
|
|
26
|
+
4. Why does the proposed direction follow from that insight?
|
|
27
|
+
5. Which contribution is truly central, and which are supporting?
|
|
28
|
+
|
|
29
|
+
## Related Work
|
|
30
|
+
|
|
31
|
+
1. What are the 2-4 actual research clusters readers need?
|
|
32
|
+
2. Which paper is the closest prior work?
|
|
33
|
+
3. What exact capability remains unresolved in each cluster?
|
|
34
|
+
4. Does each cluster end with a gap statement?
|
|
35
|
+
5. Did any paragraph become a citation list instead of a comparison?
|
|
36
|
+
|
|
37
|
+
## Method
|
|
38
|
+
|
|
39
|
+
1. Which design choice would a reviewer call arbitrary?
|
|
40
|
+
2. What problem forces each module, loss, or representation?
|
|
41
|
+
3. Is the information flow described in executable order?
|
|
42
|
+
4. What technical effect is expected from each major design?
|
|
43
|
+
5. Which term or label still lacks a first-mention explanation?
|
|
44
|
+
|
|
45
|
+
## Experiments
|
|
46
|
+
|
|
47
|
+
1. Are dataset scope, split protocol, and baseline setup separately visible?
|
|
48
|
+
2. Is each metric defined once in reader-facing language?
|
|
49
|
+
3. Which table answers the main effectiveness question?
|
|
50
|
+
4. Which ablation rules out the strongest simpler explanation?
|
|
51
|
+
5. What boundary or failure mode remains after the main result?
|
|
52
|
+
|
|
53
|
+
## Conclusion
|
|
54
|
+
|
|
55
|
+
1. What is the strongest supported takeaway?
|
|
56
|
+
2. What broader principle follows from that takeaway?
|
|
57
|
+
3. What boundary prevents overclaiming?
|
|
58
|
+
4. What next step follows from the boundary instead of from habit?
|
|
59
|
+
5. Did the section introduce any new evidence or claim?
|
|
@@ -0,0 +1,33 @@
|
|
|
1
|
+
# Writing Anti-Patterns
|
|
2
|
+
|
|
3
|
+
Run this only after logic and theory blockers are cleared. The goal is to remove common low-quality prose habits without inventing new content.
|
|
4
|
+
|
|
5
|
+
## Common Anti-Patterns
|
|
6
|
+
|
|
7
|
+
1. **Throat-clearing openings**
|
|
8
|
+
Paragraphs that spend one or two sentences warming up before making the point. Move the point to the first sentence.
|
|
9
|
+
|
|
10
|
+
2. **Uniform paragraph length**
|
|
11
|
+
Every paragraph having the same size often signals template writing rather than argument structure. Split or merge by idea, not by rhythm.
|
|
12
|
+
|
|
13
|
+
3. **Synonym cycling**
|
|
14
|
+
Switching among multiple names for the same concept creates false variety and weakens terminology control. Keep one canonical paper-facing name.
|
|
15
|
+
|
|
16
|
+
4. **Generic evaluative vocabulary**
|
|
17
|
+
Words such as `important`, `significant`, `effective`, `clear`, or `promising` without concrete content. Replace with the specific consequence, metric, or boundary.
|
|
18
|
+
|
|
19
|
+
5. **List-shaped filler**
|
|
20
|
+
Sentences that pile up three or more loosely related claims to sound complete. Split them into one claim per sentence when the logic differs.
|
|
21
|
+
|
|
22
|
+
6. **Overconnected transitions**
|
|
23
|
+
Repeated `therefore`, `moreover`, `furthermore`, or similar connectors can hide weak logic. Keep the connective only when the relation is explicit.
|
|
24
|
+
|
|
25
|
+
7. **Meta-writer commentary**
|
|
26
|
+
Phrases about what the paper is doing rather than the research itself, unless the sentence is a legitimate roadmap or scope statement.
|
|
27
|
+
|
|
28
|
+
## Quick Repair Questions
|
|
29
|
+
|
|
30
|
+
1. Can the first sentence of each paragraph state the paragraph's job directly?
|
|
31
|
+
2. Did any concept get renamed just to avoid repetition?
|
|
32
|
+
3. Can a vague adjective be replaced by a metric, mechanism, or constraint?
|
|
33
|
+
4. Did any sentence become longer because two different ideas were forced together?
|
|
@@ -56,11 +56,23 @@ For each external comment, record:
|
|
|
56
56
|
- affected paper unit: claim, section, table, figure, protocol, metric, threat model, experiment, or wording
|
|
57
57
|
- reviewer axis: R1, R2, R3, R4, or R5
|
|
58
58
|
- severity: fatal, major, minor, or clarification
|
|
59
|
+
- why it matters: what acceptance risk or scientific risk it creates
|
|
59
60
|
- route: `write`, `iterate`, `report`, `framing`, `data`, `spec`, or `ask-user`
|
|
60
61
|
- acceptance check: concrete evidence or manuscript condition that resolves the issue
|
|
61
62
|
|
|
62
63
|
Do not answer external criticism with prose-only reassurance. If the issue is valid, it must become a repair task. If it is invalid, state the evidence that rules it out.
|
|
63
64
|
|
|
65
|
+
## Revision Traceability
|
|
66
|
+
|
|
67
|
+
When the paper already contains a claimed repair, rebuttal response, or revised section, record:
|
|
68
|
+
|
|
69
|
+
- author claim: what the current draft or response says was fixed
|
|
70
|
+
- claimed artifact location: where the claimed fix lives
|
|
71
|
+
- independently verified: yes / no / partial
|
|
72
|
+
- remaining gap: what still fails the acceptance check
|
|
73
|
+
|
|
74
|
+
Do not mark an issue resolved only because the manuscript says it was resolved. The pass must independently verify the fix or keep the remaining gap open.
|
|
75
|
+
|
|
64
76
|
## Reviewer Panel
|
|
65
77
|
|
|
66
78
|
Run five independent review lenses. Each lens must produce actionable issues, not vague advice.
|
|
@@ -112,6 +124,16 @@ Every issue must include:
|
|
|
112
124
|
|
|
113
125
|
Prioritize fatal and major issues before language polish. Minor presentation fixes may be batched.
|
|
114
126
|
|
|
127
|
+
## Consensus / Disagreement / Resolution
|
|
128
|
+
|
|
129
|
+
At the end of the panel, summarize:
|
|
130
|
+
|
|
131
|
+
- consensus: what multiple review lenses agree is the main blocking problem
|
|
132
|
+
- disagreement: what remains contested across lenses or between criticism and evidence
|
|
133
|
+
- resolution: what concrete route, evidence, or rewrite will settle the disagreement
|
|
134
|
+
|
|
135
|
+
Use this summary to keep rebuttal mode actionable instead of devolving into parallel opinions.
|
|
136
|
+
|
|
115
137
|
## Core Mutation Policy
|
|
116
138
|
|
|
117
139
|
Core mutation means changing any of:
|
|
@@ -5,9 +5,11 @@
|
|
|
5
5
|
1. Give a concise summary of the artifact or result under review.
|
|
6
6
|
2. State the top review question or risk focus.
|
|
7
7
|
3. Audit in reviewer mode.
|
|
8
|
-
4.
|
|
9
|
-
5.
|
|
10
|
-
6.
|
|
8
|
+
4. Convert findings into structured issues: problem, why it matters, required fix, severity, route, and acceptance check.
|
|
9
|
+
5. When the target already has claimed fixes or rebuttal responses, add revision traceability: author claim, claimed artifact location, independently verified or not, and remaining gap.
|
|
10
|
+
6. Output fatal flaws first when present.
|
|
11
|
+
7. Rank the fix priority.
|
|
12
|
+
8. End with consensus, disagreement, resolution, residual risks, and a clear next action.
|
|
11
13
|
|
|
12
14
|
## Context Read Set
|
|
13
15
|
|
|
@@ -48,7 +50,8 @@
|
|
|
48
50
|
- For quick prompts such as "rebuttal一下看有什么缺点", start with the rebuttal Light Read Set only: active LaTeX, result summaries, managed indices, and supplied criticism. Do not run a whole-repository scan unless the panel records a specific escalation reason.
|
|
49
51
|
- External rebuttal, AC, meta-review, colleague, or user criticism must be converted into internal actionable issues before any rewrite or response draft.
|
|
50
52
|
- The Reviewer Panel must classify issues across R1 Significance / Originality / Insight, R2 Soundness / Technical Quality, R3 Evaluation / Analysis, R4 Results / Tables / Numeric Evidence, and R5 Presentation / Clarity.
|
|
51
|
-
- Each issue must include severity, affected artifact, required fix, route, acceptance check, and whether core mutation is required.
|
|
53
|
+
- Each issue must include problem, why it matters, severity, affected artifact, required fix, route, acceptance check, and whether core mutation is required.
|
|
54
|
+
- For re-review or rebuttal-repair passes, record author claim, claimed location, independently verified status, and remaining gap before marking an issue resolved.
|
|
52
55
|
- In L1/L2, core mutation remains an approval boundary unless explicitly authorized. In L3, route core mutation through the shared ledger policy instead of treating it as a reviewer-stage blocker.
|
|
53
56
|
|
|
54
57
|
## Output Style
|
|
@@ -57,6 +60,7 @@
|
|
|
57
60
|
- findings first
|
|
58
61
|
- fatal flaws called out explicitly
|
|
59
62
|
- fix priority stated clearly
|
|
63
|
+
- consensus / disagreement / resolution stated clearly
|
|
60
64
|
- evidence-linked critique
|
|
61
65
|
- explicit residual risks
|
|
62
66
|
- explicit alternative explanations and boundary risks
|
|
@@ -70,6 +70,9 @@ Run these on every round:
|
|
|
70
70
|
- section flow check -> `skills/lab/references/paper-writing/does-my-writing-flow-source.md`
|
|
71
71
|
- reviewer pass -> `skills/lab/references/paper-writing/paper-review.md`
|
|
72
72
|
- section-specific style policy -> `skills/lab/references/paper-writing/section-style-policies.md` (load the block matching the current section)
|
|
73
|
+
- section question bank -> `skills/lab/references/paper-writing/section-question-bank.md`
|
|
74
|
+
- argument stress test -> `skills/lab/references/paper-writing/argument-stress-test.md`
|
|
75
|
+
- language anti-pattern sweep -> `skills/lab/references/paper-writing/writing-anti-patterns.md`
|
|
73
76
|
|
|
74
77
|
## Rebuttal Mode
|
|
75
78
|
|
|
@@ -167,9 +170,9 @@ Do not enter prose polish until the current section has passed the reference-con
|
|
|
167
170
|
- In Conclusion, state the broader principle or action implication implied by the evidence, then state the boundary. Do not introduce a new insight there.
|
|
168
171
|
- Avoid paper-facing headings such as `Our Insights` or `核心洞见`; if a heading is needed, use normal section roles such as motivation, analysis, ablation, or discussion and let the insight appear in the prose.
|
|
169
172
|
- Nontrivial section work must use three separated revision passes instead of one all-purpose rewrite:
|
|
170
|
-
- Logic pass: check the paragraph role, claim chain, premise-to-conclusion transition, evidence dependency, and whether the section naturally follows from adjacent sections. Do not polish wording in this pass.
|
|
171
|
-
- Theory / field pass: after the logic pass is clean, check concept use, field terminology, metric definitions, citation anchors, and whether the chosen framework actually fits the claim. Do not treat fluent language as proof that the theory is right.
|
|
172
|
-
- Language pass: only after logic and theory blockers are resolved, revise academic tone, sentence rhythm, transitions, concision, and local readability.
|
|
173
|
+
- Logic pass: check the paragraph role, claim chain, premise-to-conclusion transition, evidence dependency, and whether the section naturally follows from adjacent sections. Use `section-question-bank.md` to force explicit answers about section purpose. Do not polish wording in this pass.
|
|
174
|
+
- Theory / field pass: after the logic pass is clean, check concept use, field terminology, metric definitions, citation anchors, and whether the chosen framework actually fits the claim. Run `argument-stress-test.md` here, including the weakest-link test, reverse-claim test, and strongest alternative explanation check. Do not treat fluent language as proof that the theory is right.
|
|
175
|
+
- Language pass: only after logic and theory blockers are resolved, revise academic tone, sentence rhythm, transitions, concision, and local readability. Run `writing-anti-patterns.md` here and remove throat-clearing openings, uniform paragraph patterns, synonym cycling, and generic evaluative filler before accepting the pass.
|
|
173
176
|
- Do not continue into language polish when the logic pass or theory / field pass still has an unresolved blocker; repair the blocker first or route back to `review`, `iterate`, or `report` if the blocker is evidentiary.
|
|
174
177
|
- Default automation should not require the user to approve every pass. Record the three pass outcomes in the write-iteration artifact and stop for one user question only when a failed pass would change paper-level framing, claims, protocol, or downstream section structure.
|
|
175
178
|
- If the user explicitly asks for interactive or human-in-the-loop rewriting, show the result of each pass and wait before moving from logic -> theory -> language.
|