claude-turing 4.7.0 → 4.8.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (172) hide show
  1. package/.claude-plugin/plugin.json +2 -2
  2. package/README.md +1 -1
  3. package/agents/ml-evaluator.md +4 -4
  4. package/agents/ml-researcher.md +2 -2
  5. package/bin/turing-init.sh +2 -2
  6. package/commands/ablate.md +3 -4
  7. package/commands/annotate.md +2 -3
  8. package/commands/archive.md +2 -3
  9. package/commands/audit.md +3 -4
  10. package/commands/baseline.md +3 -4
  11. package/commands/brief.md +5 -6
  12. package/commands/budget.md +3 -4
  13. package/commands/calibrate.md +3 -4
  14. package/commands/card.md +3 -4
  15. package/commands/changelog.md +2 -3
  16. package/commands/checkpoint.md +3 -4
  17. package/commands/cite.md +2 -3
  18. package/commands/compare.md +1 -2
  19. package/commands/counterfactual.md +2 -3
  20. package/commands/curriculum.md +3 -4
  21. package/commands/design.md +3 -4
  22. package/commands/diagnose.md +4 -5
  23. package/commands/diff.md +3 -4
  24. package/commands/distill.md +3 -4
  25. package/commands/doctor.md +2 -3
  26. package/commands/ensemble.md +3 -4
  27. package/commands/explore.md +4 -5
  28. package/commands/export.md +3 -4
  29. package/commands/feature.md +3 -4
  30. package/commands/flashback.md +2 -3
  31. package/commands/fork.md +3 -4
  32. package/commands/frontier.md +3 -4
  33. package/commands/init.md +5 -6
  34. package/commands/leak.md +3 -4
  35. package/commands/lit.md +3 -4
  36. package/commands/logbook.md +5 -6
  37. package/commands/merge.md +2 -3
  38. package/commands/mode.md +1 -2
  39. package/commands/onboard.md +2 -3
  40. package/commands/paper.md +3 -4
  41. package/commands/plan.md +2 -3
  42. package/commands/poster.md +3 -4
  43. package/commands/postmortem.md +2 -3
  44. package/commands/preflight.md +5 -6
  45. package/commands/present.md +2 -3
  46. package/commands/profile.md +3 -4
  47. package/commands/prune.md +2 -3
  48. package/commands/quantize.md +2 -3
  49. package/commands/queue.md +3 -4
  50. package/commands/registry.md +2 -3
  51. package/commands/regress.md +3 -4
  52. package/commands/replay.md +2 -3
  53. package/commands/report.md +3 -4
  54. package/commands/reproduce.md +3 -4
  55. package/commands/retry.md +3 -4
  56. package/commands/review.md +2 -3
  57. package/commands/rules/loop-protocol.md +11 -11
  58. package/commands/sanity.md +3 -4
  59. package/commands/scale.md +4 -5
  60. package/commands/search.md +2 -3
  61. package/commands/seed.md +3 -4
  62. package/commands/sensitivity.md +3 -4
  63. package/commands/share.md +2 -3
  64. package/commands/simulate.md +2 -3
  65. package/commands/status.md +1 -2
  66. package/commands/stitch.md +3 -4
  67. package/commands/suggest.md +5 -6
  68. package/commands/surgery.md +2 -3
  69. package/commands/sweep.md +8 -9
  70. package/commands/template.md +2 -3
  71. package/commands/train.md +5 -6
  72. package/commands/transfer.md +3 -4
  73. package/commands/trend.md +2 -3
  74. package/commands/try.md +4 -5
  75. package/commands/turing.md +3 -3
  76. package/commands/update.md +2 -3
  77. package/commands/validate.md +4 -5
  78. package/commands/warm.md +3 -4
  79. package/commands/watch.md +4 -5
  80. package/commands/whatif.md +2 -3
  81. package/commands/xray.md +3 -4
  82. package/config/commands.yaml +75 -75
  83. package/package.json +3 -2
  84. package/skills/turing/SKILL.md +3 -3
  85. package/skills/turing/ablate/SKILL.md +3 -4
  86. package/skills/turing/annotate/SKILL.md +2 -3
  87. package/skills/turing/archive/SKILL.md +2 -3
  88. package/skills/turing/audit/SKILL.md +3 -4
  89. package/skills/turing/baseline/SKILL.md +3 -4
  90. package/skills/turing/brief/SKILL.md +5 -6
  91. package/skills/turing/budget/SKILL.md +3 -4
  92. package/skills/turing/calibrate/SKILL.md +3 -4
  93. package/skills/turing/card/SKILL.md +3 -4
  94. package/skills/turing/changelog/SKILL.md +2 -3
  95. package/skills/turing/checkpoint/SKILL.md +3 -4
  96. package/skills/turing/cite/SKILL.md +2 -3
  97. package/skills/turing/compare/SKILL.md +1 -2
  98. package/skills/turing/counterfactual/SKILL.md +2 -3
  99. package/skills/turing/curriculum/SKILL.md +3 -4
  100. package/skills/turing/design/SKILL.md +3 -4
  101. package/skills/turing/diagnose/SKILL.md +4 -5
  102. package/skills/turing/diff/SKILL.md +3 -4
  103. package/skills/turing/distill/SKILL.md +3 -4
  104. package/skills/turing/doctor/SKILL.md +2 -3
  105. package/skills/turing/ensemble/SKILL.md +3 -4
  106. package/skills/turing/explore/SKILL.md +4 -5
  107. package/skills/turing/export/SKILL.md +3 -4
  108. package/skills/turing/feature/SKILL.md +3 -4
  109. package/skills/turing/flashback/SKILL.md +2 -3
  110. package/skills/turing/fork/SKILL.md +3 -4
  111. package/skills/turing/frontier/SKILL.md +3 -4
  112. package/skills/turing/init/SKILL.md +5 -6
  113. package/skills/turing/leak/SKILL.md +3 -4
  114. package/skills/turing/lit/SKILL.md +3 -4
  115. package/skills/turing/logbook/SKILL.md +5 -6
  116. package/skills/turing/merge/SKILL.md +2 -3
  117. package/skills/turing/mode/SKILL.md +1 -2
  118. package/skills/turing/onboard/SKILL.md +2 -3
  119. package/skills/turing/paper/SKILL.md +3 -4
  120. package/skills/turing/plan/SKILL.md +2 -3
  121. package/skills/turing/poster/SKILL.md +3 -4
  122. package/skills/turing/postmortem/SKILL.md +2 -3
  123. package/skills/turing/preflight/SKILL.md +5 -6
  124. package/skills/turing/present/SKILL.md +2 -3
  125. package/skills/turing/profile/SKILL.md +3 -4
  126. package/skills/turing/prune/SKILL.md +2 -3
  127. package/skills/turing/quantize/SKILL.md +2 -3
  128. package/skills/turing/queue/SKILL.md +3 -4
  129. package/skills/turing/registry/SKILL.md +2 -3
  130. package/skills/turing/regress/SKILL.md +3 -4
  131. package/skills/turing/replay/SKILL.md +2 -3
  132. package/skills/turing/report/SKILL.md +3 -4
  133. package/skills/turing/reproduce/SKILL.md +3 -4
  134. package/skills/turing/retry/SKILL.md +3 -4
  135. package/skills/turing/review/SKILL.md +2 -3
  136. package/skills/turing/rules/loop-protocol.md +11 -11
  137. package/skills/turing/sanity/SKILL.md +3 -4
  138. package/skills/turing/scale/SKILL.md +4 -5
  139. package/skills/turing/search/SKILL.md +2 -3
  140. package/skills/turing/seed/SKILL.md +3 -4
  141. package/skills/turing/sensitivity/SKILL.md +3 -4
  142. package/skills/turing/share/SKILL.md +2 -3
  143. package/skills/turing/simulate/SKILL.md +2 -3
  144. package/skills/turing/status/SKILL.md +1 -2
  145. package/skills/turing/stitch/SKILL.md +3 -4
  146. package/skills/turing/suggest/SKILL.md +5 -6
  147. package/skills/turing/surgery/SKILL.md +2 -3
  148. package/skills/turing/sweep/SKILL.md +8 -9
  149. package/skills/turing/template/SKILL.md +2 -3
  150. package/skills/turing/train/SKILL.md +5 -6
  151. package/skills/turing/transfer/SKILL.md +3 -4
  152. package/skills/turing/trend/SKILL.md +2 -3
  153. package/skills/turing/try/SKILL.md +4 -5
  154. package/skills/turing/update/SKILL.md +2 -3
  155. package/skills/turing/validate/SKILL.md +4 -5
  156. package/skills/turing/warm/SKILL.md +3 -4
  157. package/skills/turing/watch/SKILL.md +4 -5
  158. package/skills/turing/whatif/SKILL.md +2 -3
  159. package/skills/turing/xray/SKILL.md +3 -4
  160. package/src/command-registry.js +12 -0
  161. package/src/install.js +4 -3
  162. package/src/sync-commands-layout.js +149 -0
  163. package/src/sync-skills-layout.js +4 -133
  164. package/templates/README.md +5 -8
  165. package/templates/program.md +18 -18
  166. package/templates/pyproject.toml +10 -0
  167. package/templates/requirements.txt +4 -1
  168. package/templates/scripts/generate_onboarding.py +1 -1
  169. package/templates/scripts/post-train-hook.sh +7 -8
  170. package/templates/scripts/scaffold.py +24 -26
  171. package/templates/scripts/stop-hook.sh +2 -3
  172. package/templates/scripts/turing-run-python.sh +9 -0
@@ -7,10 +7,10 @@ You are the Turing ML research router. Detect the user's intent and identify the
7
7
 
8
8
  ## Execution Contract
9
9
 
10
- Turing sub-commands are explicit slash-command skills. Current sub-commands are `slash_only` and use `disable-model-invocation: true`, so router handling must not claim model dispatch into those skills.
10
+ Turing sub-commands are slash-command skills that allow model invocation, so router handling may select the focused skill when the user's intent matches a sub-command.
11
11
 
12
- - If the user explicitly invokes `/turing:<cmd>`, Claude Code runtime handles that slash command.
13
- - If the user invokes `/turing` as a router and the detected command is `slash_only`, give the exact slash command to run.
12
+ - If the user explicitly invokes `/turing:<cmd>`, handle that focused sub-command directly.
13
+ - If the user invokes `/turing` as a router and the detected command is `slash_only`, route to the focused sub-command skill when appropriate.
14
14
  - If a command has a documented safe equivalent script, the assistant may execute those documented steps inline when safe and appropriate.
15
15
 
16
16
  ## Routing Table
@@ -1,7 +1,6 @@
1
1
  ---
2
2
  name: update
3
3
  description: Incremental model update — add new data without full retraining, with forgetting detection.
4
- disable-model-invocation: true
5
4
  argument-hint: "<exp-id> --new-data <path> [--replay-ratio 0.1] [--tolerance 0.005]"
6
5
  allowed-tools: Read, Bash(*), Grep, Glob
7
6
  ---
@@ -9,8 +8,8 @@ allowed-tools: Read, Bash(*), Grep, Glob
9
8
  Add new data to an existing model without starting from scratch. Detects catastrophic forgetting.
10
9
 
11
10
  ## Steps
12
- 1. `source .venv/bin/activate`
13
- 2. `python scripts/incremental_update.py $ARGUMENTS`
11
+ 1. `uv sync`
12
+ 2. `uv run python scripts/incremental_update.py $ARGUMENTS`
14
13
  3. **Saved:** `experiments/updates/`
15
14
 
16
15
  ## Model-specific strategies
@@ -1,7 +1,6 @@
1
1
  ---
2
2
  name: validate
3
3
  description: Run stability validation on the current experiment configuration. Executes N runs to measure metric variance and auto-configures multi-run evaluation if variance is too high.
4
- disable-model-invocation: true
5
4
  argument-hint: "[--auto]"
6
5
  allowed-tools: Read, Bash(*), Grep, Glob
7
6
  ---
@@ -10,19 +9,19 @@ Validate the stability of the current ML pipeline by running it multiple times a
10
9
 
11
10
  ## Steps
12
11
 
13
- 1. **Activate environment:**
12
+ 1. **Sync environment:**
14
13
  ```bash
15
- source .venv/bin/activate
14
+ uv sync
16
15
  ```
17
16
 
18
17
  2. **Run stability check:**
19
18
  ```bash
20
- python scripts/validate_stability.py
19
+ uv run python scripts/validate_stability.py
21
20
  ```
22
21
 
23
22
  3. **If `$ARGUMENTS` contains `--auto`:**
24
23
  ```bash
25
- python scripts/validate_stability.py --auto
24
+ uv run python scripts/validate_stability.py --auto
26
25
  ```
27
26
  This auto-writes `evaluation.n_runs: 3` to `config.yaml` if CV > 5%.
28
27
 
package/commands/warm.md CHANGED
@@ -1,7 +1,6 @@
1
1
  ---
2
2
  name: warm
3
3
  description: Warm-start from a prior model — load checkpoint, optionally freeze layers, adjust learning rate, and continue training.
4
- disable-model-invocation: true
5
4
  argument-hint: "<exp-id> [--freeze-layers encoder] [--unfreeze-after 5]"
6
5
  allowed-tools: Read, Bash(*), Grep, Glob
7
6
  ---
@@ -10,9 +9,9 @@ Take a trained checkpoint and use it as initialization for a new experiment. Aut
10
9
 
11
10
  ## Steps
12
11
 
13
- 1. **Activate environment:**
12
+ 1. **Sync environment:**
14
13
  ```bash
15
- source .venv/bin/activate
14
+ uv sync
16
15
  ```
17
16
 
18
17
  2. **Parse arguments from `$ARGUMENTS`:**
@@ -24,7 +23,7 @@ Take a trained checkpoint and use it as initialization for a new experiment. Aut
24
23
 
25
24
  3. **Run warm-start planner:**
26
25
  ```bash
27
- python scripts/warm_start.py $ARGUMENTS
26
+ uv run python scripts/warm_start.py $ARGUMENTS
28
27
  ```
29
28
 
30
29
  4. **Report results:**
package/commands/watch.md CHANGED
@@ -1,7 +1,6 @@
1
1
  ---
2
2
  name: watch
3
3
  description: Live training monitor with early-warning alerts for loss spikes, NaN, overfitting, and metric plateaus.
4
- disable-model-invocation: true
5
4
  argument-hint: "[--alerts] [--interval 10] [--analyze run.log]"
6
5
  allowed-tools: Read, Bash(*), Grep, Glob
7
6
  ---
@@ -10,9 +9,9 @@ Stream metrics during training with early-warning alerts. Catches problems mid-r
10
9
 
11
10
  ## Steps
12
11
 
13
- 1. **Activate environment:**
12
+ 1. **Sync environment:**
14
13
  ```bash
15
- source .venv/bin/activate
14
+ uv sync
16
15
  ```
17
16
 
18
17
  2. **Parse arguments from `$ARGUMENTS`:**
@@ -24,13 +23,13 @@ Stream metrics during training with early-warning alerts. Catches problems mid-r
24
23
 
25
24
  3. **For post-hoc analysis:**
26
25
  ```bash
27
- python scripts/training_monitor.py --analyze run.log
26
+ uv run python scripts/training_monitor.py --analyze run.log
28
27
  ```
29
28
 
30
29
  4. **For live monitoring (inform user):**
31
30
  Live monitoring requires a running training process. Suggest the user run in a separate terminal:
32
31
  ```bash
33
- python scripts/training_monitor.py --log run.log --interval 10
32
+ uv run python scripts/training_monitor.py --log run.log --interval 10
34
33
  ```
35
34
 
36
35
  5. **Alert types:**
@@ -1,7 +1,6 @@
1
1
  ---
2
2
  name: whatif
3
3
  description: What-if analysis — answer hypotheticals from existing experiment data without running new experiments.
4
- disable-model-invocation: true
5
4
  argument-hint: "\"<question>\" [--json]"
6
5
  allowed-tools: Read, Bash(*), Grep, Glob
7
6
  ---
@@ -9,8 +8,8 @@ allowed-tools: Read, Bash(*), Grep, Glob
9
8
  Answer "what if?" questions using existing experiment data. Routes to the right estimator automatically.
10
9
 
11
10
  ## Steps
12
- 1. `source .venv/bin/activate`
13
- 2. `python scripts/whatif_engine.py $ARGUMENTS`
11
+ 1. `uv sync`
12
+ 2. `uv run python scripts/whatif_engine.py $ARGUMENTS`
14
13
  3. **Saved:** `experiments/whatif/`
15
14
 
16
15
  ## Supported question types
package/commands/xray.md CHANGED
@@ -1,7 +1,6 @@
1
1
  ---
2
2
  name: xray
3
3
  description: Internal model diagnostics — gradient flow, dead neurons, activation stats, weight distributions, tree depth analysis.
4
- disable-model-invocation: true
5
4
  argument-hint: "[exp-id] [--layer encoder.layer.2] [--compare exp-a exp-b]"
6
5
  allowed-tools: Read, Bash(*), Grep, Glob
7
6
  ---
@@ -10,9 +9,9 @@ See inside the model. When it underperforms, the fix depends on *why*.
10
9
 
11
10
  ## Steps
12
11
 
13
- 1. **Activate environment:**
12
+ 1. **Sync environment:**
14
13
  ```bash
15
- source .venv/bin/activate
14
+ uv sync
16
15
  ```
17
16
 
18
17
  2. **Parse arguments from `$ARGUMENTS`:**
@@ -23,7 +22,7 @@ See inside the model. When it underperforms, the fix depends on *why*.
23
22
 
24
23
  3. **Run model diagnostics:**
25
24
  ```bash
26
- python scripts/model_xray.py $ARGUMENTS
25
+ uv run python scripts/model_xray.py $ARGUMENTS
27
26
  ```
28
27
 
29
28
  4. **Diagnostics by model type:**