@researai/deepscientist 1.5.12 → 1.5.14

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (99) hide show
  1. package/bin/ds.js +20 -3
  2. package/docs/en/00_QUICK_START.md +24 -5
  3. package/docs/en/01_SETTINGS_REFERENCE.md +4 -0
  4. package/docs/en/05_TUI_GUIDE.md +466 -96
  5. package/docs/en/09_DOCTOR.md +24 -5
  6. package/docs/en/15_CODEX_PROVIDER_SETUP.md +113 -15
  7. package/docs/en/README.md +2 -0
  8. package/docs/zh/00_QUICK_START.md +24 -5
  9. package/docs/zh/01_SETTINGS_REFERENCE.md +4 -0
  10. package/docs/zh/05_TUI_GUIDE.md +465 -82
  11. package/docs/zh/09_DOCTOR.md +24 -5
  12. package/docs/zh/15_CODEX_PROVIDER_SETUP.md +113 -15
  13. package/docs/zh/README.md +2 -0
  14. package/package.json +2 -1
  15. package/pyproject.toml +1 -1
  16. package/src/deepscientist/__init__.py +1 -1
  17. package/src/deepscientist/artifact/service.py +125 -2
  18. package/src/deepscientist/cli.py +3 -0
  19. package/src/deepscientist/codex_cli_compat.py +117 -0
  20. package/src/deepscientist/config/service.py +53 -6
  21. package/src/deepscientist/connector/lingzhu_support.py +23 -4
  22. package/src/deepscientist/daemon/app.py +111 -30
  23. package/src/deepscientist/mcp/server.py +161 -19
  24. package/src/deepscientist/prompts/builder.py +13 -54
  25. package/src/deepscientist/quest/service.py +99 -0
  26. package/src/deepscientist/quest/stage_views.py +134 -29
  27. package/src/deepscientist/runners/codex.py +11 -2
  28. package/src/deepscientist/runners/runtime_overrides.py +3 -0
  29. package/src/deepscientist/shared.py +6 -1
  30. package/src/prompts/system.md +220 -2065
  31. package/src/skills/baseline/SKILL.md +265 -994
  32. package/src/skills/baseline/references/artifact-payload-examples.md +39 -0
  33. package/src/skills/baseline/references/baseline-checklist-template.md +21 -32
  34. package/src/skills/baseline/references/baseline-plan-template.md +41 -57
  35. package/src/tui/dist/app/AppContainer.js +1442 -52
  36. package/src/tui/dist/components/Composer.js +1 -1
  37. package/src/tui/dist/components/ConfigScreen.js +190 -36
  38. package/src/tui/dist/components/GradientStatusText.js +1 -20
  39. package/src/tui/dist/components/InputPrompt.js +41 -32
  40. package/src/tui/dist/components/LoadingIndicator.js +1 -1
  41. package/src/tui/dist/components/Logo.js +61 -38
  42. package/src/tui/dist/components/MainContent.js +10 -3
  43. package/src/tui/dist/components/WelcomePanel.js +4 -12
  44. package/src/tui/dist/components/messages/AssistantMessage.js +1 -1
  45. package/src/tui/dist/components/messages/BashExecOperationMessage.js +3 -3
  46. package/src/tui/dist/components/messages/OperationMessage.js +1 -1
  47. package/src/tui/dist/index.js +28 -1
  48. package/src/tui/dist/layouts/DefaultAppLayout.js +3 -3
  49. package/src/tui/dist/lib/api.js +17 -0
  50. package/src/tui/dist/lib/connectorConfig.js +90 -0
  51. package/src/tui/dist/lib/connectors.js +261 -0
  52. package/src/tui/dist/lib/qr.js +21 -0
  53. package/src/tui/dist/semantic-colors.js +29 -19
  54. package/src/tui/package.json +2 -1
  55. package/src/ui/dist/assets/{AiManusChatView-CnJcXynW.js → AiManusChatView-DaF9Nge_.js} +12 -12
  56. package/src/ui/dist/assets/{AnalysisPlugin-DeyzPEhV.js → AnalysisPlugin-BSVx6dXE.js} +1 -1
  57. package/src/ui/dist/assets/{CliPlugin-CB1YODQn.js → CliPlugin-C9gzJX41.js} +9 -9
  58. package/src/ui/dist/assets/{CodeEditorPlugin-B-xicq1e.js → CodeEditorPlugin-DU9G0Tox.js} +8 -8
  59. package/src/ui/dist/assets/{CodeViewerPlugin-DT54ysXa.js → CodeViewerPlugin-DoX_fI9l.js} +5 -5
  60. package/src/ui/dist/assets/{DocViewerPlugin-DQtKT-VD.js → DocViewerPlugin-C4FWIXuU.js} +3 -3
  61. package/src/ui/dist/assets/{GitDiffViewerPlugin-hqHbCfnv.js → GitDiffViewerPlugin-BgfFMgtf.js} +20 -20
  62. package/src/ui/dist/assets/{ImageViewerPlugin-OcVo33jV.js → ImageViewerPlugin-tcPkfY_x.js} +5 -5
  63. package/src/ui/dist/assets/{LabCopilotPanel-DdGwhEUV.js → LabCopilotPanel-_dKV60Bf.js} +11 -11
  64. package/src/ui/dist/assets/{LabPlugin-Ciz1gDaX.js → LabPlugin-Bje0ayoC.js} +2 -2
  65. package/src/ui/dist/assets/{LatexPlugin-BhmjNQRC.js → LatexPlugin-CVsBzAln.js} +7 -7
  66. package/src/ui/dist/assets/{MarkdownViewerPlugin-BzdVH9Bx.js → MarkdownViewerPlugin-xjmrqv_8.js} +4 -4
  67. package/src/ui/dist/assets/{MarketplacePlugin-DmyHspXt.js → MarketplacePlugin-mMM2A8wP.js} +3 -3
  68. package/src/ui/dist/assets/{NotebookEditor-BTVYRGkm.js → NotebookEditor-3kVDSOBo.js} +11 -11
  69. package/src/ui/dist/assets/{NotebookEditor-BMXKrDRk.js → NotebookEditor-SoJ8X-MO.js} +1 -1
  70. package/src/ui/dist/assets/{PdfLoader-CvcjJHXv.js → PdfLoader-DElVuHl9.js} +1 -1
  71. package/src/ui/dist/assets/{PdfMarkdownPlugin-DW2ej8Vk.js → PdfMarkdownPlugin-Bq88XT4G.js} +2 -2
  72. package/src/ui/dist/assets/{PdfViewerPlugin-CmlDxbhU.js → PdfViewerPlugin-CsCXMo9S.js} +10 -10
  73. package/src/ui/dist/assets/{SearchPlugin-DAjQZPSv.js → SearchPlugin-oUPvy19k.js} +1 -1
  74. package/src/ui/dist/assets/{TextViewerPlugin-C-nVAZb_.js → TextViewerPlugin-CRkT9yNy.js} +5 -5
  75. package/src/ui/dist/assets/{VNCViewer-D7-dIYon.js → VNCViewer-BgbuvWhR.js} +10 -10
  76. package/src/ui/dist/assets/{bot-C_G4WtNI.js → bot-v_RASACv.js} +1 -1
  77. package/src/ui/dist/assets/{code-Cd7WfiWq.js → code-5hC9d0VH.js} +1 -1
  78. package/src/ui/dist/assets/{file-content-B57zsL9y.js → file-content-D1PxfOrp.js} +1 -1
  79. package/src/ui/dist/assets/{file-diff-panel-DVoheLFq.js → file-diff-panel-DG1oT_Hj.js} +1 -1
  80. package/src/ui/dist/assets/{file-socket-B5kXFxZP.js → file-socket-BmdFYQlk.js} +1 -1
  81. package/src/ui/dist/assets/{image-LLOjkMHF.js → image-Dqe2X2tW.js} +1 -1
  82. package/src/ui/dist/assets/{index-Dxa2eYMY.js → index-DVsMKK_y.js} +1 -1
  83. package/src/ui/dist/assets/{index-C3r2iGrp.js → index-Duvz8Ip0.js} +12 -12
  84. package/src/ui/dist/assets/{index-CLQauncb.js → index-Nt9hS4ck.js} +470 -165
  85. package/src/ui/dist/assets/{index-hOUOWbW2.js → index-RDlNXXx1.js} +2 -2
  86. package/src/ui/dist/assets/{monaco-BGGAEii3.js → monaco-DIXge1CP.js} +1 -1
  87. package/src/ui/dist/assets/{pdf-effect-queue-DlEr1_y5.js → pdf-effect-queue-BBTTQaO-.js} +1 -1
  88. package/src/ui/dist/assets/{popover-CWJbJuYY.js → popover-BWlolyxo.js} +1 -1
  89. package/src/ui/dist/assets/{project-sync-CRJiucYO.js → project-sync-BM5PkFH4.js} +1 -1
  90. package/src/ui/dist/assets/{select-CoHB7pvH.js → select-D4dAtrA8.js} +2 -2
  91. package/src/ui/dist/assets/{sigma-D5aJWR8J.js → sigma-CKbE5jJT.js} +1 -1
  92. package/src/ui/dist/assets/{square-check-big-DUK_mnkS.js → square-check-big-CZNGMgiB.js} +1 -1
  93. package/src/ui/dist/assets/{trash-ChU3SEE3.js → trash-DaB37xAz.js} +1 -1
  94. package/src/ui/dist/assets/{useCliAccess-BrJBV3tY.js → useCliAccess-C2OmAcWe.js} +1 -1
  95. package/src/ui/dist/assets/{useFileDiffOverlay-C2OQaVWc.js → useFileDiffOverlay-Dowd1Ij4.js} +1 -1
  96. package/src/ui/dist/assets/{wrap-text-C7Qqh-om.js → wrap-text-BGjAhAUq.js} +1 -1
  97. package/src/ui/dist/assets/{zoom-out-rtX0FKya.js → zoom-out-dMZQMXzc.js} +1 -1
  98. package/src/ui/dist/index.html +1 -1
  99. package/uv.lock +1 -1
@@ -0,0 +1,39 @@
1
+ # Artifact Payload Examples
2
+
3
+ Use this reference when the `baseline` stage needs a stable payload shape without re-expanding the main skill body.
4
+
5
+ ## Route or blocked decision
6
+
7
+ Keep these fields when route choice or blocking status matters:
8
+
9
+ - `kind`
10
+ - `action`
11
+ - `reason`
12
+ - `baseline_id`
13
+ - `baseline_variant_id` when relevant
14
+ - `evidence_paths`
15
+ - `next_direction`
16
+
17
+ ## Accepted baseline
18
+
19
+ Keep these fields when writing the accepted baseline artifact:
20
+
21
+ - `kind`
22
+ - `baseline_id`
23
+ - `baseline_kind`
24
+ - `path`
25
+ - `task`
26
+ - `dataset`
27
+ - `primary_metric`
28
+ - `metrics_summary`
29
+ - `default_variant_id` when relevant
30
+ - `baseline_variants` when relevant
31
+ - `environment`
32
+ - `source`
33
+ - `summary`
34
+
35
+ ## Rules
36
+
37
+ - keep payloads compact but audit-friendly
38
+ - do not omit the trusted comparison surface just because one headline metric exists
39
+ - do not publish a blocked or verification-incomplete baseline payload as if it were accepted
@@ -1,7 +1,7 @@
1
1
  # Baseline Checklist Template
2
2
 
3
3
  Use this as a living checklist.
4
- Update it during reading, setup, smoke testing, real execution, verification, and route changes.
4
+ Keep it short by default. For a fast path, complete the core checklist first and expand only if the route becomes complex or unstable.
5
5
 
6
6
  ## Identity
7
7
 
@@ -9,49 +9,38 @@ Update it during reading, setup, smoke testing, real execution, verification, an
9
9
  - route:
10
10
  - owner stage:
11
11
 
12
- ## Analysis
12
+ ## Core
13
+
14
+ - [ ] baseline object and route are explicit
15
+ - [ ] dataset / split and metric contract are explicit enough to judge comparability
16
+ - [ ] `PLAN.md` captures the command path, expected outputs, acceptance condition, and fallback
17
+ - [ ] smoke decision is explicit:
18
+ - skipped for a justified reason, or run once with outputs checked
19
+ - [ ] real validation/run decision is explicit:
20
+ - skipped for a justified reason, or launched/read with durable evidence
21
+ - [ ] expected result files and required metrics are checked
22
+ - [ ] baseline is accepted, blocked, or waived with a durable note
23
+
24
+ ## Closeout
25
+
26
+ - [ ] concise `1-2` sentence baseline summary written
27
+ - [ ] next stage named explicitly
28
+
29
+ ## Optional Expansion
30
+
31
+ Fill this only when the route becomes full-audit, repair-heavy, or publication-oriented.
13
32
 
14
33
  - [ ] paper source identified
15
34
  - [ ] repo source identified
16
35
  - [ ] paper read enough to restate the core method faithfully
17
36
  - [ ] repo read enough to identify the real entrypoints
18
- - [ ] dataset / split contract confirmed
19
- - [ ] metric contract confirmed
20
37
  - [ ] main files to inspect or modify listed
21
- - [ ] risks and fallbacks written into `PLAN.md`
22
-
23
- ## Setup
24
-
25
38
  - [ ] working directory confirmed
26
39
  - [ ] environment route chosen
27
40
  - [ ] key dependencies checked
28
41
  - [ ] model / data download path confirmed
29
42
  - [ ] fallback source recorded for critical downloads
30
-
31
- ## Smoke Test
32
-
33
- - [ ] smoke command written in `PLAN.md`
34
- - [ ] smoke command executed
35
- - [ ] smoke outputs verified
36
- - [ ] smoke failure handled or route revised
37
-
38
- ## Main Run
39
-
40
- - [ ] real command written in `PLAN.md`
41
- - [ ] real run launched with durable logging
42
43
  - [ ] monitoring cadence started
43
44
  - [ ] health signals confirmed
44
45
  - [ ] any execution deviation reflected back into `PLAN.md`
45
-
46
- ## Verification
47
-
48
- - [ ] expected result files exist
49
- - [ ] metric keys are complete
50
- - [ ] baseline is comparable to the intended contract
51
46
  - [ ] verification note written
52
- - [ ] baseline accepted or explicitly blocked / waived
53
-
54
- ## Closeout
55
-
56
- - [ ] concise `1-2` sentence baseline summary written
57
- - [ ] next stage named explicitly
@@ -1,9 +1,10 @@
1
1
  # Baseline Plan Template
2
2
 
3
3
  Use this when the `baseline` stage becomes concrete enough to act.
4
- Keep it short when the route is simple, but do not skip the sections that affect reproducibility, code touchpoints, or fallback handling.
4
+ Keep it short when the route is simple. For fast-path attach/import/prebound validation, a one-screen plan is enough if it preserves the route, command path, outputs, acceptance condition, and fallback.
5
+ Expand the optional sections only when the route is ambiguous, code-touching, broken, multi-variant, or intended for reuse beyond the current quest.
5
6
 
6
- ## 1. Objective
7
+ ## 1. Core Contract
7
8
 
8
9
  - quest goal:
9
10
  - user's core requirements:
@@ -12,80 +13,63 @@ Keep it short when the route is simple, but do not skip the sections that affect
12
13
  - attach / import / reproduce / repair
13
14
  - baseline id:
14
15
  - variant id:
15
-
16
- ## 2. Source Package
17
-
18
16
  - source paper:
19
17
  - source repo:
20
- - fallback repo or mirror:
21
18
  - source commit / version / tag:
22
19
  - task:
23
20
  - dataset / split:
24
21
  - metric contract:
22
+ - expected command path:
23
+ - expected outputs:
24
+ - acceptance condition:
25
+ - cheapest fallback:
25
26
 
26
- ## 3. Paper And Repo Reading Notes
27
-
28
- - paper summary in `1-3` bullets:
29
- - repo summary in `1-3` bullets:
30
- - what the baseline actually does:
31
- - what the likely bottlenecks or brittle points are:
32
- - what still needs verification:
33
-
34
- ## 4. Code Touchpoints
35
-
36
- List the main files or modules that matter before you change anything substantial.
37
-
38
- | Path | Role | Why it matters now | Expected action | Notes |
39
- |---|---|---|---|---|
40
- | | | | inspect / modify / leave alone | |
41
-
42
- ## 5. Environment And Asset Plan
27
+ ## 2. Execution Path
43
28
 
44
29
  - working directory:
45
30
  - environment plan:
46
31
  - required downloads:
47
- - checkpoints / models:
48
32
  - hardware assumptions:
49
- - likely external blockers:
50
-
51
- Fallbacks and contingency options:
52
-
53
- - if Hugging Face is slow, blocked, or rate-limited:
54
- - try ModelScope, official mirrors, quest-local caches, or manually staged files
55
- - if the official repo is unavailable:
56
- - use a verified mirror and record the exact provenance
57
- - if the full run is too expensive:
58
- - define the smoke-test path and the cheapest comparable reduced pilot
33
+ - smoke test needed:
34
+ - yes / no
35
+ - smoke command:
36
+ - main validation or run command:
37
+ - expected runtime / budget:
38
+ - durable log path:
39
+ - verification targets:
40
+ - fastest failure signal:
59
41
 
60
- ## 6. Execution Strategy
42
+ ## 3. Risks And Revision
61
43
 
62
- ### Smoke Test
44
+ - main risks:
45
+ - when to escalate from fast path to full audit:
46
+ - revision note:
63
47
 
64
- - command:
65
- - purpose:
66
- - expected outputs:
67
- - fastest failure signal:
48
+ ## 4. Optional Expansion
68
49
 
69
- ### Main Run
50
+ Fill this only when the route is no longer simple.
70
51
 
71
- - command:
72
- - expected outputs:
73
- - expected runtime / budget:
74
- - durable log path:
52
+ - fallback repo or mirror:
53
+ - checkpoints / models:
54
+ - likely external blockers:
75
55
  - safe efficiency levers to try first:
76
-
77
- ### Monitoring And Sleep Rules
78
-
79
- - first checks:
80
- - `60s`
81
- - `120s`
82
- - `300s`
83
- - `600s`
84
- - `1800s`
85
56
  - health signals that justify continued monitoring rather than intervention:
86
57
  - conditions that require plan revision or kill-and-relaunch:
58
+ - paper summary in `1-3` bullets:
59
+ - repo summary in `1-3` bullets:
60
+ - what the baseline actually does:
61
+ - what the likely bottlenecks or brittle points are:
62
+ - what still needs verification:
63
+
64
+ ## 5. Optional Code Touchpoints
65
+
66
+ List the main files or modules only when you expect real inspection or edits.
67
+
68
+ | Path | Role | Why it matters now | Expected action | Notes |
69
+ |---|---|---|---|---|
70
+ | | | | inspect / modify / leave alone | |
87
71
 
88
- ## 7. Verification Plan
72
+ ## 6. Optional Verification Plan
89
73
 
90
74
  - required result files:
91
75
  - required metric keys:
@@ -93,12 +77,12 @@ Fallbacks and contingency options:
93
77
  - acceptance condition:
94
78
  - downgrade / blocked condition:
95
79
 
96
- ## 8. Checklist Link
80
+ ## 7. Checklist Link
97
81
 
98
82
  - checklist path:
99
83
  - which item should move next:
100
84
 
101
- ## 9. Revision Log
85
+ ## 8. Revision Log
102
86
 
103
87
  | Time | What changed | Why it changed | Impact on execution |
104
88
  |---|---|---|---|