superlab 0.1.9 → 0.1.10
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +35 -7
- package/README.zh-CN.md +35 -7
- package/bin/superlab.cjs +422 -4
- package/lib/context.cjs +223 -1
- package/lib/i18n.cjs +450 -199
- package/lib/install.cjs +27 -0
- package/package-assets/claude/commands/lab/data.md +11 -0
- package/package-assets/claude/commands/lab.md +4 -0
- package/package-assets/codex/prompts/lab-data.md +9 -0
- package/package-assets/codex/prompts/lab.md +4 -0
- package/package-assets/shared/lab/.managed/templates/data.md +168 -0
- package/package-assets/shared/lab/config/workflow.json +4 -1
- package/package-assets/shared/lab/context/data-decisions.md +59 -0
- package/package-assets/shared/lab/system/core.md +11 -1
- package/package-assets/shared/skills/lab/SKILL.md +29 -8
- package/package-assets/shared/skills/lab/references/workflow.md +3 -0
- package/package-assets/shared/skills/lab/stages/data.md +69 -0
- package/package-assets/shared/skills/lab/stages/iterate.md +9 -0
- package/package-assets/shared/skills/lab/stages/report.md +1 -0
- package/package-assets/shared/skills/lab/stages/review.md +3 -0
- package/package-assets/shared/skills/lab/stages/run.md +5 -0
- package/package-assets/shared/skills/lab/stages/spec.md +7 -0
- package/package-assets/shared/skills/lab/stages/write.md +5 -0
- package/package.json +1 -1
|
@@ -14,6 +14,7 @@
|
|
|
14
14
|
- `.lab/context/mission.md`
|
|
15
15
|
- `.lab/context/decisions.md`
|
|
16
16
|
- `.lab/context/evidence-index.md`
|
|
17
|
+
- `.lab/context/data-decisions.md`
|
|
17
18
|
- `.lab/context/terminology-lock.md`
|
|
18
19
|
|
|
19
20
|
## Context Write Set
|
|
@@ -24,6 +25,8 @@
|
|
|
24
25
|
## Reviewer Priorities
|
|
25
26
|
|
|
26
27
|
- unfair or weak baselines
|
|
28
|
+
- missing canonical baselines, strong historical baselines, recent strong public methods, or closest prior work without a justified omission
|
|
29
|
+
- unrepresentative benchmark mix or missing classic-public versus recent-strong-public coverage
|
|
27
30
|
- leakage risk
|
|
28
31
|
- metric mismatch
|
|
29
32
|
- unsupported causal or statistical claims
|
|
@@ -11,6 +11,8 @@
|
|
|
11
11
|
|
|
12
12
|
- `.lab/context/mission.md`
|
|
13
13
|
- `.lab/context/state.md`
|
|
14
|
+
- `.lab/context/data-decisions.md`
|
|
15
|
+
- `.lab/config/workflow.json`
|
|
14
16
|
|
|
15
17
|
## Context Write Set
|
|
16
18
|
|
|
@@ -22,6 +24,9 @@
|
|
|
22
24
|
- Prefer the smallest experiment that exercises the full pipeline.
|
|
23
25
|
- Fail fast on data, environment, or metric wiring problems.
|
|
24
26
|
- Record the exact launch command and output location.
|
|
27
|
+
- Write durable run outputs, logs, and checkpoints under `results_root`.
|
|
28
|
+
- Write figures or plots under `figures_root`.
|
|
29
|
+
- Keep change-local files under `.lab/changes/<change-id>/` limited to manifests, tiny fixtures, and one-off harnesses, not canonical results.
|
|
25
30
|
|
|
26
31
|
## Minimum Procedure
|
|
27
32
|
|
|
@@ -17,6 +17,7 @@
|
|
|
17
17
|
- `.lab/context/mission.md`
|
|
18
18
|
- `.lab/context/decisions.md`
|
|
19
19
|
- `.lab/context/state.md`
|
|
20
|
+
- `.lab/context/data-decisions.md`
|
|
20
21
|
|
|
21
22
|
## Context Write Set
|
|
22
23
|
|
|
@@ -33,6 +34,7 @@
|
|
|
33
34
|
## Conversion Rules
|
|
34
35
|
|
|
35
36
|
- Convert the chosen idea into explicit requirements.
|
|
37
|
+
- Carry the approved dataset package, source choices, and benchmark mix into the change.
|
|
36
38
|
- Preserve evaluation boundaries from the idea stage.
|
|
37
39
|
- Translate risks into concrete tasks when possible.
|
|
38
40
|
- Make task granularity small enough that `/lab:run` and `/lab:iterate` can execute predictably.
|
|
@@ -48,6 +50,11 @@
|
|
|
48
50
|
## Minimum Task Coverage
|
|
49
51
|
|
|
50
52
|
- change setup
|
|
53
|
+
- dataset acquisition
|
|
54
|
+
- source verification
|
|
55
|
+
- split verification
|
|
56
|
+
- preprocessing
|
|
57
|
+
- benchmark protocol
|
|
51
58
|
- artifact creation
|
|
52
59
|
- validation run
|
|
53
60
|
- evaluation normalization
|
|
@@ -12,12 +12,14 @@
|
|
|
12
12
|
## Config Read Set
|
|
13
13
|
|
|
14
14
|
- `.lab/config/workflow.json`
|
|
15
|
+
- `paper_template_root` from `.lab/config/workflow.json`
|
|
15
16
|
|
|
16
17
|
## Context Read Set
|
|
17
18
|
|
|
18
19
|
- `.lab/context/mission.md`
|
|
19
20
|
- `.lab/context/decisions.md`
|
|
20
21
|
- `.lab/context/evidence-index.md`
|
|
22
|
+
- `.lab/context/data-decisions.md`
|
|
21
23
|
- `.lab/context/terminology-lock.md`
|
|
22
24
|
|
|
23
25
|
## Context Write Set
|
|
@@ -45,6 +47,9 @@ Run these on every round:
|
|
|
45
47
|
|
|
46
48
|
- Change one section or one clearly bounded subsection per round.
|
|
47
49
|
- LaTeX is the required manuscript output format.
|
|
50
|
+
- If `paper_template_root` is configured, inspect that template directory before drafting and align the manuscript structure to it.
|
|
51
|
+
- Treat attached template directories as user-owned and potentially modified. Do not rewrite template files unless the user explicitly asks.
|
|
52
|
+
- If no paper template is configured, use the default LaTeX scaffold under the deliverable paper directory.
|
|
48
53
|
- Load only the current section guide. Do not load every section guide at once.
|
|
49
54
|
- Build a compact mini-outline before prose.
|
|
50
55
|
- For each subsection, explicitly include motivation, design, and technical advantage when applicable.
|