superlab 0.1.9 → 0.1.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -14,6 +14,7 @@
14
14
  - `.lab/context/mission.md`
15
15
  - `.lab/context/decisions.md`
16
16
  - `.lab/context/evidence-index.md`
17
+ - `.lab/context/data-decisions.md`
17
18
  - `.lab/context/terminology-lock.md`
18
19
 
19
20
  ## Context Write Set
@@ -24,6 +25,8 @@
24
25
  ## Reviewer Priorities
25
26
 
26
27
  - unfair or weak baselines
28
+ - missing canonical baselines, strong historical baselines, recent strong public methods, or closest prior work without a justified omission
29
+ - unrepresentative benchmark mix or missing classic-public versus recent-strong-public coverage
27
30
  - leakage risk
28
31
  - metric mismatch
29
32
  - unsupported causal or statistical claims
@@ -11,6 +11,8 @@
11
11
 
12
12
  - `.lab/context/mission.md`
13
13
  - `.lab/context/state.md`
14
+ - `.lab/context/data-decisions.md`
15
+ - `.lab/config/workflow.json`
14
16
 
15
17
  ## Context Write Set
16
18
 
@@ -22,6 +24,9 @@
22
24
  - Prefer the smallest experiment that exercises the full pipeline.
23
25
  - Fail fast on data, environment, or metric wiring problems.
24
26
  - Record the exact launch command and output location.
27
+ - Write durable run outputs, logs, and checkpoints under `results_root`.
28
+ - Write figures or plots under `figures_root`.
29
+ - Keep change-local files under `.lab/changes/<change-id>/` limited to manifests, tiny fixtures, and one-off harnesses, not canonical results.
25
30
 
26
31
  ## Minimum Procedure
27
32
 
@@ -17,6 +17,7 @@
17
17
  - `.lab/context/mission.md`
18
18
  - `.lab/context/decisions.md`
19
19
  - `.lab/context/state.md`
20
+ - `.lab/context/data-decisions.md`
20
21
 
21
22
  ## Context Write Set
22
23
 
@@ -33,6 +34,7 @@
33
34
  ## Conversion Rules
34
35
 
35
36
  - Convert the chosen idea into explicit requirements.
37
+ - Carry the approved dataset package, source choices, and benchmark mix into the change.
36
38
  - Preserve evaluation boundaries from the idea stage.
37
39
  - Translate risks into concrete tasks when possible.
38
40
  - Make task granularity small enough that `/lab:run` and `/lab:iterate` can execute predictably.
@@ -48,6 +50,11 @@
48
50
  ## Minimum Task Coverage
49
51
 
50
52
  - change setup
53
+ - dataset acquisition
54
+ - source verification
55
+ - split verification
56
+ - preprocessing
57
+ - benchmark protocol
51
58
  - artifact creation
52
59
  - validation run
53
60
  - evaluation normalization
@@ -12,12 +12,14 @@
12
12
  ## Config Read Set
13
13
 
14
14
  - `.lab/config/workflow.json`
15
+ - `paper_template_root` from `.lab/config/workflow.json`
15
16
 
16
17
  ## Context Read Set
17
18
 
18
19
  - `.lab/context/mission.md`
19
20
  - `.lab/context/decisions.md`
20
21
  - `.lab/context/evidence-index.md`
22
+ - `.lab/context/data-decisions.md`
21
23
  - `.lab/context/terminology-lock.md`
22
24
 
23
25
  ## Context Write Set
@@ -45,6 +47,9 @@ Run these on every round:
45
47
 
46
48
  - Change one section or one clearly bounded subsection per round.
47
49
  - LaTeX is the required manuscript output format.
50
+ - If `paper_template_root` is configured, inspect that template directory before drafting and align the manuscript structure to it.
51
+ - Treat attached template directories as user-owned and potentially modified. Do not rewrite template files unless the user explicitly asks.
52
+ - If no paper template is configured, use the default LaTeX scaffold under the deliverable paper directory.
48
53
  - Load only the current section guide. Do not load every section guide at once.
49
54
  - Build a compact mini-outline before prose.
50
55
  - For each subsection, explicitly include motivation, design, and technical advantage when applicable.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "superlab",
3
- "version": "0.1.9",
3
+ "version": "0.1.10",
4
4
  "description": "Strict /lab research workflow installer for Codex and Claude",
5
5
  "keywords": [
6
6
  "codex",