@researai/deepscientist 1.5.12 → 1.5.14
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/bin/ds.js +20 -3
- package/docs/en/00_QUICK_START.md +24 -5
- package/docs/en/01_SETTINGS_REFERENCE.md +4 -0
- package/docs/en/05_TUI_GUIDE.md +466 -96
- package/docs/en/09_DOCTOR.md +24 -5
- package/docs/en/15_CODEX_PROVIDER_SETUP.md +113 -15
- package/docs/en/README.md +2 -0
- package/docs/zh/00_QUICK_START.md +24 -5
- package/docs/zh/01_SETTINGS_REFERENCE.md +4 -0
- package/docs/zh/05_TUI_GUIDE.md +465 -82
- package/docs/zh/09_DOCTOR.md +24 -5
- package/docs/zh/15_CODEX_PROVIDER_SETUP.md +113 -15
- package/docs/zh/README.md +2 -0
- package/package.json +2 -1
- package/pyproject.toml +1 -1
- package/src/deepscientist/__init__.py +1 -1
- package/src/deepscientist/artifact/service.py +125 -2
- package/src/deepscientist/cli.py +3 -0
- package/src/deepscientist/codex_cli_compat.py +117 -0
- package/src/deepscientist/config/service.py +53 -6
- package/src/deepscientist/connector/lingzhu_support.py +23 -4
- package/src/deepscientist/daemon/app.py +111 -30
- package/src/deepscientist/mcp/server.py +161 -19
- package/src/deepscientist/prompts/builder.py +13 -54
- package/src/deepscientist/quest/service.py +99 -0
- package/src/deepscientist/quest/stage_views.py +134 -29
- package/src/deepscientist/runners/codex.py +11 -2
- package/src/deepscientist/runners/runtime_overrides.py +3 -0
- package/src/deepscientist/shared.py +6 -1
- package/src/prompts/system.md +220 -2065
- package/src/skills/baseline/SKILL.md +265 -994
- package/src/skills/baseline/references/artifact-payload-examples.md +39 -0
- package/src/skills/baseline/references/baseline-checklist-template.md +21 -32
- package/src/skills/baseline/references/baseline-plan-template.md +41 -57
- package/src/tui/dist/app/AppContainer.js +1442 -52
- package/src/tui/dist/components/Composer.js +1 -1
- package/src/tui/dist/components/ConfigScreen.js +190 -36
- package/src/tui/dist/components/GradientStatusText.js +1 -20
- package/src/tui/dist/components/InputPrompt.js +41 -32
- package/src/tui/dist/components/LoadingIndicator.js +1 -1
- package/src/tui/dist/components/Logo.js +61 -38
- package/src/tui/dist/components/MainContent.js +10 -3
- package/src/tui/dist/components/WelcomePanel.js +4 -12
- package/src/tui/dist/components/messages/AssistantMessage.js +1 -1
- package/src/tui/dist/components/messages/BashExecOperationMessage.js +3 -3
- package/src/tui/dist/components/messages/OperationMessage.js +1 -1
- package/src/tui/dist/index.js +28 -1
- package/src/tui/dist/layouts/DefaultAppLayout.js +3 -3
- package/src/tui/dist/lib/api.js +17 -0
- package/src/tui/dist/lib/connectorConfig.js +90 -0
- package/src/tui/dist/lib/connectors.js +261 -0
- package/src/tui/dist/lib/qr.js +21 -0
- package/src/tui/dist/semantic-colors.js +29 -19
- package/src/tui/package.json +2 -1
- package/src/ui/dist/assets/{AiManusChatView-CnJcXynW.js → AiManusChatView-DaF9Nge_.js} +12 -12
- package/src/ui/dist/assets/{AnalysisPlugin-DeyzPEhV.js → AnalysisPlugin-BSVx6dXE.js} +1 -1
- package/src/ui/dist/assets/{CliPlugin-CB1YODQn.js → CliPlugin-C9gzJX41.js} +9 -9
- package/src/ui/dist/assets/{CodeEditorPlugin-B-xicq1e.js → CodeEditorPlugin-DU9G0Tox.js} +8 -8
- package/src/ui/dist/assets/{CodeViewerPlugin-DT54ysXa.js → CodeViewerPlugin-DoX_fI9l.js} +5 -5
- package/src/ui/dist/assets/{DocViewerPlugin-DQtKT-VD.js → DocViewerPlugin-C4FWIXuU.js} +3 -3
- package/src/ui/dist/assets/{GitDiffViewerPlugin-hqHbCfnv.js → GitDiffViewerPlugin-BgfFMgtf.js} +20 -20
- package/src/ui/dist/assets/{ImageViewerPlugin-OcVo33jV.js → ImageViewerPlugin-tcPkfY_x.js} +5 -5
- package/src/ui/dist/assets/{LabCopilotPanel-DdGwhEUV.js → LabCopilotPanel-_dKV60Bf.js} +11 -11
- package/src/ui/dist/assets/{LabPlugin-Ciz1gDaX.js → LabPlugin-Bje0ayoC.js} +2 -2
- package/src/ui/dist/assets/{LatexPlugin-BhmjNQRC.js → LatexPlugin-CVsBzAln.js} +7 -7
- package/src/ui/dist/assets/{MarkdownViewerPlugin-BzdVH9Bx.js → MarkdownViewerPlugin-xjmrqv_8.js} +4 -4
- package/src/ui/dist/assets/{MarketplacePlugin-DmyHspXt.js → MarketplacePlugin-mMM2A8wP.js} +3 -3
- package/src/ui/dist/assets/{NotebookEditor-BTVYRGkm.js → NotebookEditor-3kVDSOBo.js} +11 -11
- package/src/ui/dist/assets/{NotebookEditor-BMXKrDRk.js → NotebookEditor-SoJ8X-MO.js} +1 -1
- package/src/ui/dist/assets/{PdfLoader-CvcjJHXv.js → PdfLoader-DElVuHl9.js} +1 -1
- package/src/ui/dist/assets/{PdfMarkdownPlugin-DW2ej8Vk.js → PdfMarkdownPlugin-Bq88XT4G.js} +2 -2
- package/src/ui/dist/assets/{PdfViewerPlugin-CmlDxbhU.js → PdfViewerPlugin-CsCXMo9S.js} +10 -10
- package/src/ui/dist/assets/{SearchPlugin-DAjQZPSv.js → SearchPlugin-oUPvy19k.js} +1 -1
- package/src/ui/dist/assets/{TextViewerPlugin-C-nVAZb_.js → TextViewerPlugin-CRkT9yNy.js} +5 -5
- package/src/ui/dist/assets/{VNCViewer-D7-dIYon.js → VNCViewer-BgbuvWhR.js} +10 -10
- package/src/ui/dist/assets/{bot-C_G4WtNI.js → bot-v_RASACv.js} +1 -1
- package/src/ui/dist/assets/{code-Cd7WfiWq.js → code-5hC9d0VH.js} +1 -1
- package/src/ui/dist/assets/{file-content-B57zsL9y.js → file-content-D1PxfOrp.js} +1 -1
- package/src/ui/dist/assets/{file-diff-panel-DVoheLFq.js → file-diff-panel-DG1oT_Hj.js} +1 -1
- package/src/ui/dist/assets/{file-socket-B5kXFxZP.js → file-socket-BmdFYQlk.js} +1 -1
- package/src/ui/dist/assets/{image-LLOjkMHF.js → image-Dqe2X2tW.js} +1 -1
- package/src/ui/dist/assets/{index-Dxa2eYMY.js → index-DVsMKK_y.js} +1 -1
- package/src/ui/dist/assets/{index-C3r2iGrp.js → index-Duvz8Ip0.js} +12 -12
- package/src/ui/dist/assets/{index-CLQauncb.js → index-Nt9hS4ck.js} +470 -165
- package/src/ui/dist/assets/{index-hOUOWbW2.js → index-RDlNXXx1.js} +2 -2
- package/src/ui/dist/assets/{monaco-BGGAEii3.js → monaco-DIXge1CP.js} +1 -1
- package/src/ui/dist/assets/{pdf-effect-queue-DlEr1_y5.js → pdf-effect-queue-BBTTQaO-.js} +1 -1
- package/src/ui/dist/assets/{popover-CWJbJuYY.js → popover-BWlolyxo.js} +1 -1
- package/src/ui/dist/assets/{project-sync-CRJiucYO.js → project-sync-BM5PkFH4.js} +1 -1
- package/src/ui/dist/assets/{select-CoHB7pvH.js → select-D4dAtrA8.js} +2 -2
- package/src/ui/dist/assets/{sigma-D5aJWR8J.js → sigma-CKbE5jJT.js} +1 -1
- package/src/ui/dist/assets/{square-check-big-DUK_mnkS.js → square-check-big-CZNGMgiB.js} +1 -1
- package/src/ui/dist/assets/{trash-ChU3SEE3.js → trash-DaB37xAz.js} +1 -1
- package/src/ui/dist/assets/{useCliAccess-BrJBV3tY.js → useCliAccess-C2OmAcWe.js} +1 -1
- package/src/ui/dist/assets/{useFileDiffOverlay-C2OQaVWc.js → useFileDiffOverlay-Dowd1Ij4.js} +1 -1
- package/src/ui/dist/assets/{wrap-text-C7Qqh-om.js → wrap-text-BGjAhAUq.js} +1 -1
- package/src/ui/dist/assets/{zoom-out-rtX0FKya.js → zoom-out-dMZQMXzc.js} +1 -1
- package/src/ui/dist/index.html +1 -1
- package/uv.lock +1 -1
|
@@ -0,0 +1,39 @@
|
|
|
1
|
+
# Artifact Payload Examples
|
|
2
|
+
|
|
3
|
+
Use this reference when the `baseline` stage needs a stable payload shape without re-expanding the main skill body.
|
|
4
|
+
|
|
5
|
+
## Route or blocked decision
|
|
6
|
+
|
|
7
|
+
Keep these fields when route choice or blocking status matters:
|
|
8
|
+
|
|
9
|
+
- `kind`
|
|
10
|
+
- `action`
|
|
11
|
+
- `reason`
|
|
12
|
+
- `baseline_id`
|
|
13
|
+
- `baseline_variant_id` when relevant
|
|
14
|
+
- `evidence_paths`
|
|
15
|
+
- `next_direction`
|
|
16
|
+
|
|
17
|
+
## Accepted baseline
|
|
18
|
+
|
|
19
|
+
Keep these fields when writing the accepted baseline artifact:
|
|
20
|
+
|
|
21
|
+
- `kind`
|
|
22
|
+
- `baseline_id`
|
|
23
|
+
- `baseline_kind`
|
|
24
|
+
- `path`
|
|
25
|
+
- `task`
|
|
26
|
+
- `dataset`
|
|
27
|
+
- `primary_metric`
|
|
28
|
+
- `metrics_summary`
|
|
29
|
+
- `default_variant_id` when relevant
|
|
30
|
+
- `baseline_variants` when relevant
|
|
31
|
+
- `environment`
|
|
32
|
+
- `source`
|
|
33
|
+
- `summary`
|
|
34
|
+
|
|
35
|
+
## Rules
|
|
36
|
+
|
|
37
|
+
- keep payloads compact but audit-friendly
|
|
38
|
+
- do not omit the trusted comparison surface just because one headline metric exists
|
|
39
|
+
- do not publish a blocked or verification-incomplete baseline payload as if it were accepted
|
|
@@ -1,7 +1,7 @@
|
|
|
1
1
|
# Baseline Checklist Template
|
|
2
2
|
|
|
3
3
|
Use this as a living checklist.
|
|
4
|
-
|
|
4
|
+
Keep it short by default. For a fast path, complete the core checklist first and expand only if the route becomes complex or unstable.
|
|
5
5
|
|
|
6
6
|
## Identity
|
|
7
7
|
|
|
@@ -9,49 +9,38 @@ Update it during reading, setup, smoke testing, real execution, verification, an
|
|
|
9
9
|
- route:
|
|
10
10
|
- owner stage:
|
|
11
11
|
|
|
12
|
-
##
|
|
12
|
+
## Core
|
|
13
|
+
|
|
14
|
+
- [ ] baseline object and route are explicit
|
|
15
|
+
- [ ] dataset / split and metric contract are explicit enough to judge comparability
|
|
16
|
+
- [ ] `PLAN.md` captures the command path, expected outputs, acceptance condition, and fallback
|
|
17
|
+
- [ ] smoke decision is explicit:
|
|
18
|
+
- skipped for a justified reason, or run once with outputs checked
|
|
19
|
+
- [ ] real validation/run decision is explicit:
|
|
20
|
+
- skipped for a justified reason, or launched/read with durable evidence
|
|
21
|
+
- [ ] expected result files and required metrics are checked
|
|
22
|
+
- [ ] baseline is accepted, blocked, or waived with a durable note
|
|
23
|
+
|
|
24
|
+
## Closeout
|
|
25
|
+
|
|
26
|
+
- [ ] concise `1-2` sentence baseline summary written
|
|
27
|
+
- [ ] next stage named explicitly
|
|
28
|
+
|
|
29
|
+
## Optional Expansion
|
|
30
|
+
|
|
31
|
+
Fill this only when the route becomes full-audit, repair-heavy, or publication-oriented.
|
|
13
32
|
|
|
14
33
|
- [ ] paper source identified
|
|
15
34
|
- [ ] repo source identified
|
|
16
35
|
- [ ] paper read enough to restate the core method faithfully
|
|
17
36
|
- [ ] repo read enough to identify the real entrypoints
|
|
18
|
-
- [ ] dataset / split contract confirmed
|
|
19
|
-
- [ ] metric contract confirmed
|
|
20
37
|
- [ ] main files to inspect or modify listed
|
|
21
|
-
- [ ] risks and fallbacks written into `PLAN.md`
|
|
22
|
-
|
|
23
|
-
## Setup
|
|
24
|
-
|
|
25
38
|
- [ ] working directory confirmed
|
|
26
39
|
- [ ] environment route chosen
|
|
27
40
|
- [ ] key dependencies checked
|
|
28
41
|
- [ ] model / data download path confirmed
|
|
29
42
|
- [ ] fallback source recorded for critical downloads
|
|
30
|
-
|
|
31
|
-
## Smoke Test
|
|
32
|
-
|
|
33
|
-
- [ ] smoke command written in `PLAN.md`
|
|
34
|
-
- [ ] smoke command executed
|
|
35
|
-
- [ ] smoke outputs verified
|
|
36
|
-
- [ ] smoke failure handled or route revised
|
|
37
|
-
|
|
38
|
-
## Main Run
|
|
39
|
-
|
|
40
|
-
- [ ] real command written in `PLAN.md`
|
|
41
|
-
- [ ] real run launched with durable logging
|
|
42
43
|
- [ ] monitoring cadence started
|
|
43
44
|
- [ ] health signals confirmed
|
|
44
45
|
- [ ] any execution deviation reflected back into `PLAN.md`
|
|
45
|
-
|
|
46
|
-
## Verification
|
|
47
|
-
|
|
48
|
-
- [ ] expected result files exist
|
|
49
|
-
- [ ] metric keys are complete
|
|
50
|
-
- [ ] baseline is comparable to the intended contract
|
|
51
46
|
- [ ] verification note written
|
|
52
|
-
- [ ] baseline accepted or explicitly blocked / waived
|
|
53
|
-
|
|
54
|
-
## Closeout
|
|
55
|
-
|
|
56
|
-
- [ ] concise `1-2` sentence baseline summary written
|
|
57
|
-
- [ ] next stage named explicitly
|
|
@@ -1,9 +1,10 @@
|
|
|
1
1
|
# Baseline Plan Template
|
|
2
2
|
|
|
3
3
|
Use this when the `baseline` stage becomes concrete enough to act.
|
|
4
|
-
Keep it short when the route is simple,
|
|
4
|
+
Keep it short when the route is simple. For fast-path attach/import/prebound validation, a one-screen plan is enough if it preserves the route, command path, outputs, acceptance condition, and fallback.
|
|
5
|
+
Expand the optional sections only when the route is ambiguous, code-touching, broken, multi-variant, or intended for reuse beyond the current quest.
|
|
5
6
|
|
|
6
|
-
## 1.
|
|
7
|
+
## 1. Core Contract
|
|
7
8
|
|
|
8
9
|
- quest goal:
|
|
9
10
|
- user's core requirements:
|
|
@@ -12,80 +13,63 @@ Keep it short when the route is simple, but do not skip the sections that affect
|
|
|
12
13
|
- attach / import / reproduce / repair
|
|
13
14
|
- baseline id:
|
|
14
15
|
- variant id:
|
|
15
|
-
|
|
16
|
-
## 2. Source Package
|
|
17
|
-
|
|
18
16
|
- source paper:
|
|
19
17
|
- source repo:
|
|
20
|
-
- fallback repo or mirror:
|
|
21
18
|
- source commit / version / tag:
|
|
22
19
|
- task:
|
|
23
20
|
- dataset / split:
|
|
24
21
|
- metric contract:
|
|
22
|
+
- expected command path:
|
|
23
|
+
- expected outputs:
|
|
24
|
+
- acceptance condition:
|
|
25
|
+
- cheapest fallback:
|
|
25
26
|
|
|
26
|
-
##
|
|
27
|
-
|
|
28
|
-
- paper summary in `1-3` bullets:
|
|
29
|
-
- repo summary in `1-3` bullets:
|
|
30
|
-
- what the baseline actually does:
|
|
31
|
-
- what the likely bottlenecks or brittle points are:
|
|
32
|
-
- what still needs verification:
|
|
33
|
-
|
|
34
|
-
## 4. Code Touchpoints
|
|
35
|
-
|
|
36
|
-
List the main files or modules that matter before you change anything substantial.
|
|
37
|
-
|
|
38
|
-
| Path | Role | Why it matters now | Expected action | Notes |
|
|
39
|
-
|---|---|---|---|---|
|
|
40
|
-
| | | | inspect / modify / leave alone | |
|
|
41
|
-
|
|
42
|
-
## 5. Environment And Asset Plan
|
|
27
|
+
## 2. Execution Path
|
|
43
28
|
|
|
44
29
|
- working directory:
|
|
45
30
|
- environment plan:
|
|
46
31
|
- required downloads:
|
|
47
|
-
- checkpoints / models:
|
|
48
32
|
- hardware assumptions:
|
|
49
|
-
-
|
|
50
|
-
|
|
51
|
-
|
|
52
|
-
|
|
53
|
-
-
|
|
54
|
-
|
|
55
|
-
-
|
|
56
|
-
|
|
57
|
-
- if the full run is too expensive:
|
|
58
|
-
- define the smoke-test path and the cheapest comparable reduced pilot
|
|
33
|
+
- smoke test needed:
|
|
34
|
+
- yes / no
|
|
35
|
+
- smoke command:
|
|
36
|
+
- main validation or run command:
|
|
37
|
+
- expected runtime / budget:
|
|
38
|
+
- durable log path:
|
|
39
|
+
- verification targets:
|
|
40
|
+
- fastest failure signal:
|
|
59
41
|
|
|
60
|
-
##
|
|
42
|
+
## 3. Risks And Revision
|
|
61
43
|
|
|
62
|
-
|
|
44
|
+
- main risks:
|
|
45
|
+
- when to escalate from fast path to full audit:
|
|
46
|
+
- revision note:
|
|
63
47
|
|
|
64
|
-
|
|
65
|
-
- purpose:
|
|
66
|
-
- expected outputs:
|
|
67
|
-
- fastest failure signal:
|
|
48
|
+
## 4. Optional Expansion
|
|
68
49
|
|
|
69
|
-
|
|
50
|
+
Fill this only when the route is no longer simple.
|
|
70
51
|
|
|
71
|
-
-
|
|
72
|
-
-
|
|
73
|
-
-
|
|
74
|
-
- durable log path:
|
|
52
|
+
- fallback repo or mirror:
|
|
53
|
+
- checkpoints / models:
|
|
54
|
+
- likely external blockers:
|
|
75
55
|
- safe efficiency levers to try first:
|
|
76
|
-
|
|
77
|
-
### Monitoring And Sleep Rules
|
|
78
|
-
|
|
79
|
-
- first checks:
|
|
80
|
-
- `60s`
|
|
81
|
-
- `120s`
|
|
82
|
-
- `300s`
|
|
83
|
-
- `600s`
|
|
84
|
-
- `1800s`
|
|
85
56
|
- health signals that justify continued monitoring rather than intervention:
|
|
86
57
|
- conditions that require plan revision or kill-and-relaunch:
|
|
58
|
+
- paper summary in `1-3` bullets:
|
|
59
|
+
- repo summary in `1-3` bullets:
|
|
60
|
+
- what the baseline actually does:
|
|
61
|
+
- what the likely bottlenecks or brittle points are:
|
|
62
|
+
- what still needs verification:
|
|
63
|
+
|
|
64
|
+
## 5. Optional Code Touchpoints
|
|
65
|
+
|
|
66
|
+
List the main files or modules only when you expect real inspection or edits.
|
|
67
|
+
|
|
68
|
+
| Path | Role | Why it matters now | Expected action | Notes |
|
|
69
|
+
|---|---|---|---|---|
|
|
70
|
+
| | | | inspect / modify / leave alone | |
|
|
87
71
|
|
|
88
|
-
##
|
|
72
|
+
## 6. Optional Verification Plan
|
|
89
73
|
|
|
90
74
|
- required result files:
|
|
91
75
|
- required metric keys:
|
|
@@ -93,12 +77,12 @@ Fallbacks and contingency options:
|
|
|
93
77
|
- acceptance condition:
|
|
94
78
|
- downgrade / blocked condition:
|
|
95
79
|
|
|
96
|
-
##
|
|
80
|
+
## 7. Checklist Link
|
|
97
81
|
|
|
98
82
|
- checklist path:
|
|
99
83
|
- which item should move next:
|
|
100
84
|
|
|
101
|
-
##
|
|
85
|
+
## 8. Revision Log
|
|
102
86
|
|
|
103
87
|
| Time | What changed | Why it changed | Impact on execution |
|
|
104
88
|
|---|---|---|---|
|