selftune 0.2.23 → 0.2.25
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/CHANGELOG.md +6 -0
- package/README.md +93 -15
- package/apps/local-dashboard/dist/assets/index-DgY2KGP-.css +1 -0
- package/apps/local-dashboard/dist/assets/index-Dhgv5BQO.js +15 -0
- package/apps/local-dashboard/dist/assets/vendor-react-C5oyHiV1.js +11 -0
- package/apps/local-dashboard/dist/assets/{vendor-table-BIiI3YhS.js → vendor-table-Bc_bbKd8.js} +1 -1
- package/apps/local-dashboard/dist/assets/vendor-ui-B3BPIYy7.js +1 -0
- package/apps/local-dashboard/dist/index.html +5 -5
- package/cli/selftune/adapters/codex/install.ts +310 -78
- package/cli/selftune/adapters/opencode/install.ts +3 -4
- package/cli/selftune/alpha-upload/build-payloads.ts +3 -3
- package/cli/selftune/alpha-upload/stage-canonical.ts +17 -11
- package/cli/selftune/auto-update.ts +200 -8
- package/cli/selftune/canonical-export.ts +55 -25
- package/cli/selftune/command-surface.ts +397 -0
- package/cli/selftune/contribute/contribute.ts +64 -13
- package/cli/selftune/contribution-config.ts +57 -3
- package/cli/selftune/contribution-preferences.ts +117 -0
- package/cli/selftune/contribution-signals.ts +8 -4
- package/cli/selftune/contribution-staging.ts +13 -2
- package/cli/selftune/contributions.ts +55 -121
- package/cli/selftune/creator-contributions.ts +29 -10
- package/cli/selftune/cron/setup.ts +7 -3
- package/cli/selftune/dashboard-contract.ts +73 -0
- package/cli/selftune/dashboard-server.ts +168 -17
- package/cli/selftune/dashboard.ts +350 -17
- package/cli/selftune/eval/baseline.ts +21 -5
- package/cli/selftune/eval/execution-eval.ts +170 -0
- package/cli/selftune/eval/family-overlap.ts +2 -2
- package/cli/selftune/eval/hooks-to-evals.ts +228 -82
- package/cli/selftune/eval/import-skillsbench.ts +2 -2
- package/cli/selftune/eval/invocation-classifier.ts +56 -0
- package/cli/selftune/eval/synthetic-evals.ts +5 -3
- package/cli/selftune/eval/unit-test-cli.ts +7 -4
- package/cli/selftune/evolution/apply-proposal.ts +295 -0
- package/cli/selftune/evolution/engines/replay-engine.ts +79 -57
- package/cli/selftune/evolution/evolve-body.ts +100 -39
- package/cli/selftune/evolution/evolve.ts +244 -52
- package/cli/selftune/evolution/rollback.ts +0 -1
- package/cli/selftune/evolution/validate-body.ts +68 -42
- package/cli/selftune/evolution/validate-host-replay.ts +510 -60
- package/cli/selftune/evolution/validate-proposal.ts +11 -150
- package/cli/selftune/evolution/validate-routing.ts +43 -41
- package/cli/selftune/evolution/validation-contract.ts +91 -0
- package/cli/selftune/grading/auto-grade.ts +11 -7
- package/cli/selftune/grading/grade-session.ts +10 -16
- package/cli/selftune/index.ts +35 -10
- package/cli/selftune/ingestors/claude-replay.ts +15 -10
- package/cli/selftune/ingestors/codex-wrapper.ts +3 -3
- package/cli/selftune/ingestors/opencode-ingest.ts +2 -2
- package/cli/selftune/ingestors/pi-ingest.ts +3 -2
- package/cli/selftune/init.ts +27 -3
- package/cli/selftune/localdb/direct-write.ts +35 -1
- package/cli/selftune/localdb/queries/cron.ts +34 -0
- package/cli/selftune/localdb/queries/dashboard.ts +834 -0
- package/cli/selftune/localdb/queries/evolution.ts +158 -0
- package/cli/selftune/localdb/queries/execution.ts +133 -0
- package/cli/selftune/localdb/queries/json.ts +18 -0
- package/cli/selftune/localdb/queries/monitoring.ts +263 -0
- package/cli/selftune/localdb/queries/raw.ts +95 -0
- package/cli/selftune/localdb/queries/staging.ts +270 -0
- package/cli/selftune/localdb/queries/trust.ts +392 -0
- package/cli/selftune/localdb/queries.ts +60 -2288
- package/cli/selftune/localdb/schema.ts +21 -0
- package/cli/selftune/monitoring/watch.ts +96 -29
- package/cli/selftune/normalization.ts +3 -0
- package/cli/selftune/observability.ts +4 -2
- package/cli/selftune/orchestrate/cli.ts +161 -0
- package/cli/selftune/orchestrate/execute.ts +295 -0
- package/cli/selftune/orchestrate/finalize.ts +157 -0
- package/cli/selftune/orchestrate/locks.ts +40 -0
- package/cli/selftune/orchestrate/plan.ts +131 -0
- package/cli/selftune/orchestrate/post-run.ts +59 -0
- package/cli/selftune/orchestrate/prepare.ts +334 -0
- package/cli/selftune/orchestrate/report.ts +182 -0
- package/cli/selftune/orchestrate/runtime.ts +120 -0
- package/cli/selftune/orchestrate/signals.ts +48 -0
- package/cli/selftune/orchestrate.ts +150 -1173
- package/cli/selftune/repair/skill-usage.ts +5 -2
- package/cli/selftune/routes/overview.ts +5 -2
- package/cli/selftune/routes/skill-report.ts +15 -2
- package/cli/selftune/schedule.ts +5 -5
- package/cli/selftune/status.ts +39 -2
- package/cli/selftune/testing-readiness.ts +597 -0
- package/cli/selftune/types.ts +44 -4
- package/cli/selftune/uninstall.ts +2 -1
- package/cli/selftune/utils/canonical-log.ts +1 -9
- package/cli/selftune/utils/cli-error.ts +9 -0
- package/cli/selftune/utils/llm-call.ts +126 -6
- package/cli/selftune/utils/skill-discovery.ts +2 -0
- package/cli/selftune/workflows/proposals.ts +184 -0
- package/cli/selftune/workflows/skill-scaffold.ts +241 -0
- package/cli/selftune/workflows/workflows.ts +100 -26
- package/node_modules/@selftune/telemetry-contract/fixtures/complete-push.ts +1 -1
- package/node_modules/@selftune/telemetry-contract/fixtures/evidence-only-push.ts +1 -1
- package/node_modules/@selftune/telemetry-contract/fixtures/partial-push-no-sessions.ts +1 -1
- package/node_modules/@selftune/telemetry-contract/fixtures/partial-push-unresolved-parents.ts +1 -1
- package/node_modules/@selftune/telemetry-contract/src/schemas.ts +41 -1
- package/node_modules/@selftune/telemetry-contract/src/types.ts +103 -2
- package/package.json +25 -9
- package/packages/dashboard-core/AGENTS.md +18 -0
- package/packages/dashboard-core/README.md +30 -0
- package/packages/dashboard-core/index.ts +3 -0
- package/packages/dashboard-core/package.json +39 -0
- package/packages/dashboard-core/src/chrome/DashboardChrome.tsx +74 -0
- package/packages/dashboard-core/src/chrome/DashboardHeader.tsx +200 -0
- package/packages/dashboard-core/src/chrome/DashboardSidebar.tsx +219 -0
- package/packages/dashboard-core/src/chrome/RuntimeBadge.tsx +46 -0
- package/packages/dashboard-core/src/chrome/index.ts +14 -0
- package/packages/dashboard-core/src/chrome/types.ts +81 -0
- package/packages/dashboard-core/src/chrome/utils.ts +23 -0
- package/packages/dashboard-core/src/gates/FeatureGate.tsx +11 -0
- package/packages/dashboard-core/src/gates/LockedRoute.tsx +29 -0
- package/packages/dashboard-core/src/gates/UpgradeCard.tsx +89 -0
- package/packages/dashboard-core/src/gates/index.ts +3 -0
- package/packages/dashboard-core/src/host/DashboardHostProvider.tsx +62 -0
- package/packages/dashboard-core/src/host/adapter.ts +47 -0
- package/packages/dashboard-core/src/host/capabilities.ts +55 -0
- package/packages/dashboard-core/src/host/index.ts +3 -0
- package/packages/dashboard-core/src/models/analytics.ts +39 -0
- package/packages/dashboard-core/src/models/index.ts +4 -0
- package/packages/dashboard-core/src/models/overview.ts +98 -0
- package/packages/dashboard-core/src/models/runtime.ts +7 -0
- package/packages/dashboard-core/src/models/skills.ts +34 -0
- package/packages/dashboard-core/src/routes/index.ts +2 -0
- package/packages/dashboard-core/src/routes/manifest.test.ts +70 -0
- package/packages/dashboard-core/src/routes/manifest.ts +451 -0
- package/packages/dashboard-core/src/routes/types.ts +39 -0
- package/packages/dashboard-core/src/screens/analytics/AnalyticsScreen.tsx +278 -0
- package/packages/dashboard-core/src/screens/analytics/index.ts +1 -0
- package/packages/dashboard-core/src/screens/index.ts +37 -0
- package/packages/dashboard-core/src/screens/overview/OverviewComparisonSurface.test.ts +101 -0
- package/packages/dashboard-core/src/screens/overview/OverviewComparisonSurface.tsx +393 -0
- package/packages/dashboard-core/src/screens/overview/OverviewCompositionSurface.test.tsx +113 -0
- package/packages/dashboard-core/src/screens/overview/OverviewCompositionSurface.tsx +72 -0
- package/packages/dashboard-core/src/screens/overview/OverviewCoreSurface.tsx +71 -0
- package/packages/dashboard-core/src/screens/overview/OverviewOnboardingBanner.tsx +90 -0
- package/packages/dashboard-core/src/screens/overview/OverviewRunSummary.tsx +40 -0
- package/packages/dashboard-core/src/screens/overview/index.ts +16 -0
- package/packages/dashboard-core/src/screens/overview/types.ts +13 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportDailyBreakdownSection.tsx +99 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportDataQualityTabContent.tsx +35 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportEvidenceRail.tsx +71 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportEvidenceSection.tsx +63 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportEvidenceTabContent.tsx +25 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportInvocationsSection.tsx +24 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportMissedQueriesSection.tsx +79 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportScaffold.tsx +150 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportSections.test.tsx +224 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportTabs.test.tsx +76 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportTabs.tsx +88 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportTrendSection.tsx +33 -0
- package/packages/dashboard-core/src/screens/skill-report/SkillReportTrustBadge.tsx +67 -0
- package/packages/dashboard-core/src/screens/skill-report/index.ts +45 -0
- package/packages/dashboard-core/src/screens/skills/SkillsLibraryScreen.tsx +162 -0
- package/packages/dashboard-core/src/screens/skills/index.ts +6 -0
- package/packages/telemetry-contract/fixtures/complete-push.ts +1 -1
- package/packages/telemetry-contract/fixtures/evidence-only-push.ts +1 -1
- package/packages/telemetry-contract/fixtures/partial-push-no-sessions.ts +1 -1
- package/packages/telemetry-contract/fixtures/partial-push-unresolved-parents.ts +1 -1
- package/packages/telemetry-contract/src/schemas.ts +41 -1
- package/packages/telemetry-contract/src/types.ts +103 -2
- package/packages/ui/src/components/EvidenceViewer.tsx +80 -25
- package/packages/ui/src/components/OverviewPanels.tsx +67 -26
- package/packages/ui/src/primitives/tabs.tsx +7 -6
- package/packages/ui/src/types.ts +10 -0
- package/skill/SKILL.md +130 -332
- package/skill/agents/diagnosis-analyst.md +3 -3
- package/skill/agents/evolution-reviewer.md +3 -3
- package/skill/agents/integration-guide.md +3 -3
- package/skill/agents/pattern-analyst.md +2 -2
- package/skill/references/cli-quick-reference.md +89 -0
- package/skill/references/creator-playbook.md +131 -0
- package/skill/references/examples.md +48 -0
- package/skill/references/troubleshooting.md +47 -0
- package/skill/references/version-history.md +1 -1
- package/skill/selftune.contribute.json +11 -0
- package/skill/{Workflows → workflows}/Baseline.md +20 -1
- package/skill/{Workflows → workflows}/Contribute.md +23 -10
- package/skill/{Workflows → workflows}/Contributions.md +13 -5
- package/skill/workflows/CreateTestDeploy.md +170 -0
- package/skill/{Workflows → workflows}/CreatorContributions.md +18 -6
- package/skill/{Workflows → workflows}/Cron.md +1 -1
- package/skill/{Workflows → workflows}/Dashboard.md +20 -0
- package/skill/{Workflows → workflows}/Doctor.md +1 -1
- package/skill/{Workflows → workflows}/Evals.md +67 -2
- package/skill/{Workflows → workflows}/Evolve.md +119 -30
- package/skill/{Workflows → workflows}/EvolveBody.md +41 -1
- package/skill/{Workflows → workflows}/Grade.md +1 -1
- package/skill/{Workflows → workflows}/Initialize.md +8 -4
- package/skill/{Workflows → workflows}/Orchestrate.md +13 -3
- package/skill/{Workflows → workflows}/Schedule.md +3 -3
- package/skill/workflows/SignalsDashboard.md +87 -0
- package/skill/{Workflows → workflows}/UnitTest.md +19 -0
- package/skill/{Workflows → workflows}/Watch.md +42 -2
- package/skill/{Workflows → workflows}/Workflows.md +39 -2
- package/apps/local-dashboard/dist/assets/index-CwOtTrUS.css +0 -1
- package/apps/local-dashboard/dist/assets/index-f1HQpbeH.js +0 -59
- package/apps/local-dashboard/dist/assets/vendor-react-CKkiCskZ.js +0 -11
- package/apps/local-dashboard/dist/assets/vendor-ui-jVSaIZey.js +0 -12
- /package/skill/{Workflows → workflows}/AlphaUpload.md +0 -0
- /package/skill/{Workflows → workflows}/AutoActivation.md +0 -0
- /package/skill/{Workflows → workflows}/Badge.md +0 -0
- /package/skill/{Workflows → workflows}/Composability.md +0 -0
- /package/skill/{Workflows → workflows}/EvolutionMemory.md +0 -0
- /package/skill/{Workflows → workflows}/ExportCanonical.md +0 -0
- /package/skill/{Workflows → workflows}/Hook.md +0 -0
- /package/skill/{Workflows → workflows}/ImportSkillsBench.md +0 -0
- /package/skill/{Workflows → workflows}/Ingest.md +0 -0
- /package/skill/{Workflows → workflows}/PlatformHooks.md +0 -0
- /package/skill/{Workflows → workflows}/Quickstart.md +0 -0
- /package/skill/{Workflows → workflows}/Recover.md +0 -0
- /package/skill/{Workflows → workflows}/Registry.md +0 -0
- /package/skill/{Workflows → workflows}/RepairSkillUsage.md +0 -0
- /package/skill/{Workflows → workflows}/Replay.md +0 -0
- /package/skill/{Workflows → workflows}/Rollback.md +0 -0
- /package/skill/{Workflows → workflows}/Sync.md +0 -0
- /package/skill/{Workflows → workflows}/Telemetry.md +0 -0
- /package/skill/{Workflows → workflows}/Uninstall.md +0 -0
package/skill/SKILL.md
CHANGED
|
@@ -4,50 +4,33 @@ description: >
|
|
|
4
4
|
Self-improving skills toolkit that watches real agent sessions, detects missed
|
|
5
5
|
triggers, grades execution quality, and evolves skill descriptions to match how
|
|
6
6
|
users actually talk. Use when grading sessions, generating evals, evolving skill
|
|
7
|
-
descriptions or routing tables,
|
|
8
|
-
|
|
7
|
+
descriptions or routing tables, discovering reusable workflows, scaffolding new
|
|
8
|
+
workflow skills, checking skill health, viewing the dashboard, ingesting sessions
|
|
9
|
+
from other platforms, or running autonomous improvement loops.
|
|
9
10
|
Make sure to use this skill whenever the user mentions skill improvement, skill
|
|
10
11
|
performance, skill triggers, skill evolution, skill health, undertriggering,
|
|
11
12
|
overtriggering, session grading, or wants to know how their skills are doing —
|
|
12
13
|
even if they don't say "selftune" explicitly.
|
|
13
14
|
metadata:
|
|
14
15
|
author: selftune-dev
|
|
15
|
-
version: 0.2.
|
|
16
|
+
version: 0.2.25
|
|
16
17
|
category: developer-tools
|
|
17
18
|
---
|
|
18
19
|
|
|
19
20
|
# selftune
|
|
20
21
|
|
|
21
22
|
Observe real agent sessions, detect missed triggers, grade execution quality,
|
|
22
|
-
|
|
23
|
+
evolve skill descriptions toward the language real users actually use, and
|
|
24
|
+
scaffold workflow skills from repeated telemetry patterns.
|
|
23
25
|
|
|
24
26
|
**You are the operator.** The user installed this skill so YOU can manage their
|
|
25
27
|
skill health autonomously. They will say things like "set up selftune",
|
|
26
28
|
"improve my skills", or "how are my skills doing?" — and you route to the
|
|
27
29
|
correct workflow below. The user does not run CLI commands directly; you do.
|
|
28
30
|
|
|
29
|
-
## Why this matters
|
|
30
|
-
|
|
31
|
-
Skills are only useful when they trigger at the right time with the right
|
|
32
|
-
instructions. But user language drifts — the phrases people use to ask for help
|
|
33
|
-
rarely match the trigger keywords a skill author imagined. selftune closes this
|
|
34
|
-
gap by observing real sessions, finding where skills fail to activate or
|
|
35
|
-
execute poorly, and rewriting descriptions to match actual usage patterns. The
|
|
36
|
-
result: skills that get better over time without manual tuning.
|
|
37
|
-
|
|
38
|
-
## Communicating with the user
|
|
39
|
-
|
|
40
|
-
Users range from experienced developers who'll say "evolve the pptx description
|
|
41
|
-
using the latest eval set" to non-technical users who'll say "make my skills
|
|
42
|
-
better". Pay attention to context cues:
|
|
43
|
-
|
|
44
|
-
- If they use terms like "eval set", "routing table", "JSONL" — match their precision
|
|
45
|
-
- If they say "improve my skills" or "how's it going" — explain what you're doing in plain language, summarize results, and suggest next steps
|
|
46
|
-
- When in doubt, briefly explain what a command does before running it
|
|
47
|
-
|
|
48
31
|
## Bootstrap
|
|
49
32
|
|
|
50
|
-
If `~/.selftune/config.json` does not exist, read `
|
|
33
|
+
If `~/.selftune/config.json` does not exist, read `workflows/Initialize.md`
|
|
51
34
|
first. The CLI must be installed (`selftune` on PATH) before other commands
|
|
52
35
|
will work. Do not proceed with other commands until initialization is complete.
|
|
53
36
|
|
|
@@ -57,143 +40,82 @@ will work. Do not proceed with other commands until initialization is complete.
|
|
|
57
40
|
selftune <command> [options]
|
|
58
41
|
```
|
|
59
42
|
|
|
60
|
-
Commands vary in output format
|
|
61
|
-
`selftune evolve --dry-run` emit structured JSON on stdout. `selftune status`,
|
|
62
|
-
`selftune last`, and `selftune doctor` print human-readable text or structured
|
|
63
|
-
JSON depending on the command. For alpha/bootstrap and health remediation, prefer
|
|
64
|
-
machine-readable `guidance.next_command` or top-level `next_command` when present
|
|
65
|
-
instead of inferring the next step from prose. `selftune dashboard` starts a
|
|
66
|
-
local SPA server — it does not emit data.
|
|
43
|
+
Commands vary in output format:
|
|
67
44
|
|
|
68
|
-
|
|
45
|
+
- **JSON by default:** `selftune doctor` and `selftune watch` emit structured JSON on stdout.
|
|
46
|
+
- **Text by default:** `selftune status`, `selftune last`, `selftune orchestrate`, and `selftune evolve` print human-readable text.
|
|
47
|
+
- **JSON opt-in:** `selftune sync --json` enables structured JSON output.
|
|
48
|
+
- **Server:** `selftune dashboard` starts a local SPA server — it does not emit data.
|
|
69
49
|
|
|
70
|
-
|
|
71
|
-
|
|
72
|
-
|
|
73
|
-
|
|
74
|
-
selftune
|
|
75
|
-
|
|
76
|
-
|
|
77
|
-
|
|
78
|
-
|
|
79
|
-
|
|
80
|
-
|
|
81
|
-
|
|
82
|
-
|
|
83
|
-
|
|
84
|
-
selftune evolve
|
|
85
|
-
selftune
|
|
86
|
-
selftune evolve
|
|
87
|
-
|
|
88
|
-
|
|
89
|
-
|
|
90
|
-
|
|
91
|
-
|
|
92
|
-
|
|
93
|
-
selftune eval family-overlap --prefix <family-> | --skills <a,b,c> [--parent-skill <name>] [--min-overlap 0.3] [--min-shared 2]
|
|
94
|
-
|
|
95
|
-
# Other commands
|
|
96
|
-
selftune watch --skill <name> --skill-path <path> [--auto-rollback]
|
|
97
|
-
selftune status
|
|
98
|
-
selftune last
|
|
99
|
-
selftune doctor
|
|
100
|
-
selftune dashboard [--port <port>] [--no-open]
|
|
101
|
-
selftune contributions [status|preview <skill>|upload [--dry-run]|approve <skill>|revoke <skill>|default <ask|always|never>|reset]
|
|
102
|
-
selftune creator-contributions [status|enable --skill <name>|enable --all [--prefix <value>]|disable --skill <name>]
|
|
103
|
-
selftune contribute [--skill NAME] [--preview] [--sanitize LEVEL] [--submit]
|
|
104
|
-
selftune cron setup [--dry-run] # auto-detect platform (cron/launchd/systemd)
|
|
105
|
-
selftune cron setup --platform openclaw [--dry-run] [--tz <timezone>] # OpenClaw-specific
|
|
106
|
-
selftune cron list
|
|
107
|
-
selftune cron remove [--dry-run]
|
|
108
|
-
selftune telemetry [status|enable|disable]
|
|
109
|
-
selftune export [TABLE...] [--output/-o DIR] [--since DATE]
|
|
110
|
-
|
|
111
|
-
# Autonomous loop
|
|
112
|
-
selftune orchestrate [--dry-run] [--review-required] [--auto-approve] [--skill NAME] [--max-skills N] [--recent-window HOURS] [--sync-force] [--max-auto-grade N] [--loop] [--loop-interval SECS]
|
|
113
|
-
selftune sync [--since DATE] [--dry-run] [--force] [--no-claude] [--no-codex] [--no-opencode] [--no-openclaw] [--no-pi] [--no-repair] [--json]
|
|
114
|
-
|
|
115
|
-
# Discovery + badges
|
|
116
|
-
selftune workflows [--skill NAME] [--skill-path PATH] [--min-occurrences N] [--window N] [--json] [save --skill NAME --skill-path PATH]
|
|
117
|
-
selftune badge --skill <name> [--format svg|markdown|url] [--output PATH]
|
|
118
|
-
|
|
119
|
-
# Maintenance
|
|
120
|
-
selftune quickstart
|
|
121
|
-
selftune repair-skill-usage [--since DATE] [--dry-run]
|
|
122
|
-
selftune recover [--full] [--force] [--since DATE]
|
|
123
|
-
selftune export-canonical [--out FILE] [--platform NAME] [--record-kind KIND] [--pretty] [--push-payload]
|
|
124
|
-
selftune uninstall [--dry-run] [--keep-logs] [--npm-uninstall]
|
|
125
|
-
|
|
126
|
-
# Hook dispatch (for debugging/manual invocation)
|
|
127
|
-
selftune hook <name> # prompt-log | session-stop | skill-eval | auto-activate | skill-change-guard | evolution-guard
|
|
128
|
-
|
|
129
|
-
# Platform hooks (non-Claude-Code agents)
|
|
130
|
-
selftune codex hook
|
|
131
|
-
selftune codex install [--dry-run] [--uninstall]
|
|
132
|
-
selftune opencode hook
|
|
133
|
-
selftune opencode install [--dry-run] [--uninstall]
|
|
134
|
-
selftune cline hook
|
|
135
|
-
selftune cline install [--dry-run] [--uninstall]
|
|
136
|
-
selftune pi hook
|
|
137
|
-
selftune pi install [--dry-run] [--uninstall]
|
|
138
|
-
|
|
139
|
-
# Registry (team skill distribution)
|
|
140
|
-
selftune registry push [name] [--version=<semver>] [--summary=<text>]
|
|
141
|
-
selftune registry install <name> [--global]
|
|
142
|
-
selftune registry sync
|
|
143
|
-
selftune registry status
|
|
144
|
-
selftune registry rollback <name> [--to=<version>] [--reason=<text>]
|
|
145
|
-
selftune registry history <name>
|
|
146
|
-
selftune registry list
|
|
147
|
-
|
|
148
|
-
# Alpha enrollment (device-code flow — browser opens automatically)
|
|
149
|
-
selftune init --alpha --alpha-email <email>
|
|
150
|
-
selftune alpha upload [--dry-run]
|
|
151
|
-
selftune alpha relink
|
|
152
|
-
selftune status # shows cloud link state + upload readiness
|
|
153
|
-
```
|
|
50
|
+
For health remediation, prefer machine-readable `guidance.next_command` or
|
|
51
|
+
top-level `next_command` from `selftune doctor` output instead of inferring the
|
|
52
|
+
next step from prose.
|
|
53
|
+
|
|
54
|
+
Run `selftune <command> --help` for exact flags. Read
|
|
55
|
+
`references/cli-quick-reference.md` when you need the full flag reference.
|
|
56
|
+
|
|
57
|
+
## Creator Trust Loop
|
|
58
|
+
|
|
59
|
+
When the user wants to improve a skill, default to this creator loop before
|
|
60
|
+
jumping straight to mutation:
|
|
61
|
+
|
|
62
|
+
1. `selftune eval generate --skill <name> --skill-path <path>`
|
|
63
|
+
2. `selftune eval unit-test --skill <name> --generate --skill-path <path>`
|
|
64
|
+
3. `selftune evolve --skill <name> --skill-path <path> --dry-run --validation-mode replay`
|
|
65
|
+
4. `selftune grade baseline --skill <name> --skill-path <path>`
|
|
66
|
+
5. `selftune evolve --skill <name> --skill-path <path> --with-baseline`
|
|
67
|
+
6. then `selftune watch --skill <name>`
|
|
68
|
+
|
|
69
|
+
If the user asks "how do I know this skill works?" or "can I trust this skill
|
|
70
|
+
yet?", start with this loop, then use `selftune status`, the dashboard, or the
|
|
71
|
+
skill report to explain what is still missing, whether the skill is ready to
|
|
72
|
+
deploy, or whether it is already being watched live.
|
|
154
73
|
|
|
155
74
|
## Workflow Routing
|
|
156
75
|
|
|
157
|
-
| Trigger keywords
|
|
158
|
-
|
|
|
159
|
-
|
|
|
160
|
-
|
|
|
161
|
-
|
|
|
162
|
-
| evolve
|
|
163
|
-
| evolve
|
|
164
|
-
|
|
|
165
|
-
|
|
|
166
|
-
|
|
|
167
|
-
|
|
|
168
|
-
|
|
|
169
|
-
|
|
|
170
|
-
|
|
|
171
|
-
|
|
|
172
|
-
|
|
|
173
|
-
|
|
|
174
|
-
|
|
|
175
|
-
|
|
|
176
|
-
|
|
|
177
|
-
|
|
|
178
|
-
|
|
|
179
|
-
|
|
|
180
|
-
|
|
|
181
|
-
|
|
|
182
|
-
|
|
|
183
|
-
|
|
|
184
|
-
|
|
|
185
|
-
|
|
|
186
|
-
|
|
|
187
|
-
|
|
|
188
|
-
|
|
|
189
|
-
|
|
|
190
|
-
|
|
|
191
|
-
|
|
|
192
|
-
|
|
|
193
|
-
|
|
|
194
|
-
|
|
|
195
|
-
|
|
|
196
|
-
|
|
|
76
|
+
| Trigger keywords | Workflow | File |
|
|
77
|
+
| --- | --- | --- |
|
|
78
|
+
| create test deploy, creator loop, ship skill, ready to deploy, can I trust this skill, how do I know this skill works | CreateTestDeploy | workflows/CreateTestDeploy.md |
|
|
79
|
+
| grade, score, evaluate, assess session, auto-grade | Grade | workflows/Grade.md |
|
|
80
|
+
| evals, eval set, undertriggering, skill stats, eval generate | Evals | workflows/Evals.md |
|
|
81
|
+
| evolve, improve, optimize skills, make skills better, triggers, catch more queries, apply proposal, apply contributor proposal | Evolve | workflows/Evolve.md |
|
|
82
|
+
| evolve body, evolve routing, full body evolution, rewrite skill, teacher student | EvolveBody | workflows/EvolveBody.md |
|
|
83
|
+
| evolve rollback, undo, restore, revert evolution, go back, undo last change | Rollback | workflows/Rollback.md |
|
|
84
|
+
| watch, monitor, regression, post-deploy, keep an eye on | Watch | workflows/Watch.md |
|
|
85
|
+
| doctor, health, hooks, broken, diagnose, not working, something wrong | Doctor | workflows/Doctor.md |
|
|
86
|
+
| ingest, import, codex logs, opencode, openclaw, pi, wrap codex | Ingest | workflows/Ingest.md |
|
|
87
|
+
| replay, backfill, claude transcripts, historical sessions | Replay | workflows/Replay.md |
|
|
88
|
+
| contributions, sharing preferences, opt in/out creator sharing, approve/revoke contributions | Contributions | workflows/Contributions.md |
|
|
89
|
+
| creator contributions, selftune.contribute.json, enable/disable creator contribution | CreatorContributions | workflows/CreatorContributions.md |
|
|
90
|
+
| signals dashboard, contributor signals, signals page, community dashboard, community data, contributor stats, signal health, how are signals, how is community | SignalsDashboard | workflows/SignalsDashboard.md |
|
|
91
|
+
| contribute, share, export bundle, export data, anonymized, give back | Contribute | workflows/Contribute.md |
|
|
92
|
+
| init, setup, set up, bootstrap, first time, install, configure selftune, alpha, enroll | Initialize | workflows/Initialize.md |
|
|
93
|
+
| cron, schedule, automate evolution, run automatically | Cron | workflows/Cron.md |
|
|
94
|
+
| schedule, selftune schedule, launchd, systemd, crontab, automation setup | Schedule | workflows/Schedule.md |
|
|
95
|
+
| auto-activate, suggestions, activation rules, nag, why suggest | AutoActivation | workflows/AutoActivation.md |
|
|
96
|
+
| dashboard, visual, open dashboard, show dashboard, serve dashboard | Dashboard | workflows/Dashboard.md |
|
|
97
|
+
| evolution memory, session continuity, what happened last | EvolutionMemory | workflows/EvolutionMemory.md |
|
|
98
|
+
| grade baseline, baseline lift, adds value, skill value, no-skill comparison | Baseline | workflows/Baseline.md |
|
|
99
|
+
| eval unit-test, skill test, test skill, generate tests, run tests | UnitTest | workflows/UnitTest.md |
|
|
100
|
+
| eval composability, co-occurrence, skill conflicts, family overlap, sibling confusion | Composability | workflows/Composability.md |
|
|
101
|
+
| eval import, skillsbench, external evals, benchmark tasks | ImportSkillsBench | workflows/ImportSkillsBench.md |
|
|
102
|
+
| telemetry, analytics, disable analytics, opt out, tracking, privacy | Telemetry | workflows/Telemetry.md |
|
|
103
|
+
| orchestrate, autonomous, full loop, improve all skills, run selftune loop | Orchestrate | workflows/Orchestrate.md |
|
|
104
|
+
| sync, refresh, source truth, rescan sessions | Sync | workflows/Sync.md |
|
|
105
|
+
| badge, readme badge, skill badge, health badge | Badge | workflows/Badge.md |
|
|
106
|
+
| workflows, discover workflows, scaffold workflow skill, build skill from logs | Workflows | workflows/Workflows.md |
|
|
107
|
+
| alpha upload, upload data, send alpha data, manual upload | AlphaUpload | workflows/AlphaUpload.md |
|
|
108
|
+
| recover, rebuild sqlite, recover db, legacy backfill | Recover | workflows/Recover.md |
|
|
109
|
+
| quickstart, getting started, onboard, first time setup, new user | Quickstart | workflows/Quickstart.md |
|
|
110
|
+
| uninstall, remove selftune, clean up, teardown | Uninstall | workflows/Uninstall.md |
|
|
111
|
+
| repair, rebuild usage, fix skill usage, trustworthy usage | RepairSkillUsage | workflows/RepairSkillUsage.md |
|
|
112
|
+
| export canonical, canonical export, canonical telemetry, push payload | ExportCanonical | workflows/ExportCanonical.md |
|
|
113
|
+
| hook, run hook, invoke hook, manual hook, debug hook | Hook | workflows/Hook.md |
|
|
114
|
+
| codex/opencode/cline/pi hooks, platform hooks, non-claude hooks, multi-agent | PlatformHooks | workflows/PlatformHooks.md |
|
|
115
|
+
| registry, distribute, push/install/sync/rollback skill, team skills | Registry | workflows/Registry.md |
|
|
116
|
+
| export, dump, jsonl, export sqlite, debug export | Export | _(direct: `selftune export`)_ |
|
|
117
|
+
| status, health summary, skill health, how are skills, run selftune | Status | _(direct: `selftune status`)_ |
|
|
118
|
+
| last, last session, recent session, what happened | Last | _(direct: `selftune last`)_ |
|
|
197
119
|
|
|
198
120
|
Workflows Grade, Evolve, Watch, and Ingest also run autonomously via `selftune orchestrate`.
|
|
199
121
|
|
|
@@ -203,188 +125,64 @@ Before running mutating workflows (evolve, evolve-body, evals, baseline), consul
|
|
|
203
125
|
`references/interactive-config.md` for the pre-flight configuration pattern, model
|
|
204
126
|
tier reference, and quick-path rules.
|
|
205
127
|
|
|
206
|
-
## The Feedback Loop
|
|
207
|
-
|
|
208
|
-
The core idea: observe how users actually talk, find where skills miss, propose
|
|
209
|
-
better descriptions, validate them, and deploy — with automatic rollback if things
|
|
210
|
-
get worse. Every step produces evidence so you can explain _why_ a change was made.
|
|
211
|
-
|
|
212
|
-
```text
|
|
213
|
-
Observe --> Detect --> Diagnose --> Propose --> Validate --> Audit --> Deploy --> Watch --> Rollback
|
|
214
|
-
| |
|
|
215
|
-
+--------------------------------------------------------------------+
|
|
216
|
-
```
|
|
217
|
-
|
|
218
|
-
1. **Observe** — Hooks capture every session (queries, triggers, metrics)
|
|
219
|
-
2. **Detect** — `selftune eval generate` extracts missed-trigger patterns
|
|
220
|
-
3. **Diagnose** — `selftune grade` evaluates session quality with evidence
|
|
221
|
-
4. **Propose** — `selftune evolve` generates description improvements
|
|
222
|
-
5. **Validate** — Evolution is tested against the eval set before deploying
|
|
223
|
-
6. **Audit** — Persist proposal, evidence, and decision metadata for traceability
|
|
224
|
-
7. **Deploy** — Updated description replaces the original (backup kept)
|
|
225
|
-
8. **Watch** — `selftune watch` monitors for regressions post-deploy
|
|
226
|
-
9. **Rollback** — `selftune evolve rollback` restores previous version if needed
|
|
227
|
-
|
|
228
128
|
## Specialized Agents
|
|
229
129
|
|
|
230
|
-
selftune bundles focused agents in `agents/`.
|
|
231
|
-
|
|
232
|
-
by spawning a subagent with those instructions as its prompt.
|
|
233
|
-
|
|
234
|
-
On Claude Code, `selftune init` also syncs compatibility copies into
|
|
235
|
-
`~/.claude/agents/` so native `--agent <name>` calls keep matching these
|
|
236
|
-
bundled definitions.
|
|
237
|
-
|
|
238
|
-
Treat these as worker-style subagents:
|
|
239
|
-
|
|
240
|
-
- pass the required inputs from the parent agent
|
|
241
|
-
- expect a structured report back
|
|
242
|
-
- do not have them question the user directly unless you explicitly want that
|
|
243
|
-
|
|
244
|
-
| Trigger keywords | Agent file | When to use |
|
|
245
|
-
| ---------------------------------------------------------- | ------------------------------ | ---------------------------------------------------------------------------------------------------------- |
|
|
246
|
-
| diagnose, root cause, why failing, debug performance | `agents/diagnosis-analyst.md` | When one skill has recurring low grades, regressions, or unclear failures after basic doctor/status review |
|
|
247
|
-
| patterns, conflicts, cross-skill, overlap, optimize skills | `agents/pattern-analyst.md` | When multiple skills may overlap, misroute, or interfere, especially after composability flags conflict |
|
|
248
|
-
| review evolution, check proposal, safe to deploy | `agents/evolution-reviewer.md` | Before deploying a dry-run or pending proposal, especially for high-stakes skills or marginal improvements |
|
|
249
|
-
| set up selftune, integrate, configure project | `agents/integration-guide.md` | For complex setup and verification work in monorepos, multi-skill repos, or mixed-platform environments |
|
|
250
|
-
|
|
251
|
-
## Examples
|
|
252
|
-
|
|
253
|
-
### Scenario 1: First-time setup
|
|
254
|
-
|
|
255
|
-
User says: "Set up selftune" or "Install selftune"
|
|
256
|
-
|
|
257
|
-
Actions:
|
|
258
|
-
|
|
259
|
-
1. Read `Workflows/Initialize.md`
|
|
260
|
-
2. Run `selftune init` to bootstrap config (hooks are installed automatically)
|
|
261
|
-
3. Run `selftune doctor` to verify
|
|
262
|
-
|
|
263
|
-
Result: Config at `~/.selftune/config.json`, hooks active, ready for session capture.
|
|
264
|
-
|
|
265
|
-
### Scenario 2: Improve a skill
|
|
266
|
-
|
|
267
|
-
User says: "Make the pptx skill catch more queries" or "Evolve the Research skill"
|
|
268
|
-
|
|
269
|
-
Actions:
|
|
270
|
-
|
|
271
|
-
1. `selftune eval generate --skill pptx` to find missed triggers
|
|
272
|
-
2. `selftune evolve --skill pptx --skill-path <path>` to propose changes
|
|
273
|
-
3. `selftune watch --skill pptx --skill-path <path>` to monitor post-deploy
|
|
274
|
-
|
|
275
|
-
Result: Skill description updated to match real user language, with rollback available.
|
|
276
|
-
|
|
277
|
-
### Scenario 3: Check skill health
|
|
278
|
-
|
|
279
|
-
User says: "How are my skills doing?" or "Run selftune"
|
|
280
|
-
|
|
281
|
-
Actions:
|
|
282
|
-
|
|
283
|
-
1. `selftune status` for overall health summary
|
|
284
|
-
2. `selftune last` for most recent session insight
|
|
285
|
-
3. `selftune doctor` if issues detected
|
|
286
|
-
|
|
287
|
-
Result: Pass rates, trend data, and actionable recommendations.
|
|
288
|
-
|
|
289
|
-
### Scenario 4: Autonomous operation
|
|
290
|
-
|
|
291
|
-
User says: "Set up cron jobs" or "Run selftune automatically"
|
|
292
|
-
|
|
293
|
-
Actions:
|
|
294
|
-
|
|
295
|
-
1. `selftune cron setup` to install OS-level scheduling
|
|
296
|
-
2. Orchestrate loop runs: ingest → grade → evolve → watch
|
|
297
|
-
|
|
298
|
-
Result: Skills improve continuously without manual intervention.
|
|
299
|
-
|
|
300
|
-
## Troubleshooting
|
|
301
|
-
|
|
302
|
-
### CLI not found
|
|
303
|
-
|
|
304
|
-
Error: `command not found: selftune`
|
|
305
|
-
|
|
306
|
-
Cause: CLI not installed or not on PATH.
|
|
307
|
-
|
|
308
|
-
Solution:
|
|
309
|
-
|
|
310
|
-
1. Run `npm install -g selftune` or check `bin/selftune.cjs` exists
|
|
311
|
-
2. Verify with `which selftune`
|
|
312
|
-
3. If using bun: `bun link` in the repo root
|
|
313
|
-
|
|
314
|
-
### No sessions to grade
|
|
315
|
-
|
|
316
|
-
Error: `selftune grade` returns empty results.
|
|
317
|
-
|
|
318
|
-
Cause: Hooks not capturing sessions, or no sessions since last ingest.
|
|
319
|
-
|
|
320
|
-
Solution:
|
|
321
|
-
|
|
322
|
-
1. Run `selftune doctor` to verify hook installation
|
|
323
|
-
2. Run `selftune ingest claude --force` to re-ingest
|
|
324
|
-
3. Check `~/.claude/` for telemetry JSONL files
|
|
325
|
-
|
|
326
|
-
### Evolution proposes no changes
|
|
327
|
-
|
|
328
|
-
Cause: Eval set too small or skill already well-tuned.
|
|
329
|
-
|
|
330
|
-
Solution:
|
|
331
|
-
|
|
332
|
-
1. Run `selftune eval generate --skill <name> --max 50` for a larger eval set
|
|
333
|
-
2. Check `selftune status` — if pass rate is >90%, evolution may not be needed
|
|
334
|
-
3. Try `selftune evolve body` for deeper structural changes
|
|
335
|
-
|
|
336
|
-
### Dashboard won't serve
|
|
337
|
-
|
|
338
|
-
Error: Port already in use or blank page.
|
|
339
|
-
|
|
340
|
-
Solution:
|
|
130
|
+
selftune bundles focused agents in `agents/`. Read the relevant agent file and
|
|
131
|
+
follow its instructions — either inline or by spawning a subagent.
|
|
341
132
|
|
|
342
|
-
|
|
343
|
-
|
|
344
|
-
|
|
133
|
+
| Trigger keywords | Agent file | When to use |
|
|
134
|
+
| --- | --- | --- |
|
|
135
|
+
| diagnose, root cause, why failing, debug performance | `agents/diagnosis-analyst.md` | Recurring low grades or unclear failures after doctor/status |
|
|
136
|
+
| patterns, conflicts, cross-skill, overlap | `agents/pattern-analyst.md` | Skills overlap, misroute, or interfere |
|
|
137
|
+
| review evolution, check proposal, safe to deploy | `agents/evolution-reviewer.md` | Before deploying high-stakes or marginal evolutions |
|
|
138
|
+
| set up selftune, integrate, configure project | `agents/integration-guide.md` | Complex setup: monorepos, multi-skill, mixed-platform |
|
|
345
139
|
|
|
346
140
|
## Negative Examples
|
|
347
141
|
|
|
348
|
-
These should NOT trigger selftune —
|
|
349
|
-
share keywords but need different solutions:
|
|
142
|
+
These should NOT trigger selftune — near-misses that share keywords:
|
|
350
143
|
|
|
351
144
|
- "Fix this React hydration bug" — general debugging, not skill improvement
|
|
352
|
-
- "Create a PowerPoint about Q3 results" —
|
|
353
|
-
- "Run my unit tests" — project tests, not skill eval tests
|
|
354
|
-
- "How do I use the Research skill?" — skill _usage_, not
|
|
145
|
+
- "Create a PowerPoint about Q3 results" — pptx skill, not selftune
|
|
146
|
+
- "Run my unit tests" — project tests, not skill eval tests
|
|
147
|
+
- "How do I use the Research skill?" — skill _usage_, not _improvement_
|
|
355
148
|
- "Generate a report from this data" — content generation, not skill evolution
|
|
356
|
-
- "My build is failing" — project issue, not selftune health
|
|
357
|
-
- "Evaluate this code for security issues" —
|
|
358
|
-
- "Improve this function's performance" — code optimization, not skill optimization
|
|
359
|
-
|
|
360
|
-
The key distinction: selftune
|
|
361
|
-
|
|
362
|
-
|
|
363
|
-
|
|
364
|
-
|
|
365
|
-
|
|
366
|
-
|
|
367
|
-
|
|
368
|
-
|
|
369
|
-
|
|
370
|
-
|
|
371
|
-
|
|
372
|
-
|
|
373
|
-
|
|
374
|
-
|
|
375
|
-
|
|
376
|
-
|
|
377
|
-
|
|
378
|
-
|
|
379
|
-
|
|
380
|
-
|
|
381
|
-
|
|
382
|
-
|
|
383
|
-
|
|
|
384
|
-
|
|
|
385
|
-
| `references/
|
|
386
|
-
| `references/
|
|
387
|
-
| `references/
|
|
388
|
-
| `
|
|
389
|
-
| `
|
|
390
|
-
| `
|
|
149
|
+
- "My build is failing" — project issue, not selftune health
|
|
150
|
+
- "Evaluate this code for security issues" — code review, not session grading
|
|
151
|
+
- "Improve this function's performance" — code optimization, not skill optimization
|
|
152
|
+
|
|
153
|
+
The key distinction: selftune improves _skills themselves_ (descriptions, triggers,
|
|
154
|
+
execution quality). If the user wants to _use_ a skill, route to that skill instead.
|
|
155
|
+
|
|
156
|
+
## Creator-Directed Contribution
|
|
157
|
+
|
|
158
|
+
This skill ships a `selftune.contribute.json` config that enables creator-directed
|
|
159
|
+
signal relay. When users install selftune and approve contribution, anonymous
|
|
160
|
+
usage signals (trigger, grade, miss_category) are relayed to the selftune
|
|
161
|
+
creator to improve the skill for everyone.
|
|
162
|
+
|
|
163
|
+
- The `creator_id` in the bundled config is the selftune creator's cloud user UUID.
|
|
164
|
+
- Users opt in via `selftune contributions approve selftune`.
|
|
165
|
+
- No raw session content is ever shared -- only privacy-safe aggregate signals.
|
|
166
|
+
- See `workflows/CreatorContributions.md` for creator-side setup.
|
|
167
|
+
- See `workflows/Contributions.md` for end-user opt-in/opt-out.
|
|
168
|
+
|
|
169
|
+
Routing keywords: creator contribution, selftune signals, dogfood relay,
|
|
170
|
+
community contribution, signal sharing, opt in creator, creator UUID.
|
|
171
|
+
|
|
172
|
+
## Additional References
|
|
173
|
+
|
|
174
|
+
Load these on demand — do not read unless needed for the current task:
|
|
175
|
+
|
|
176
|
+
| Reference | When to read |
|
|
177
|
+
| --- | --- |
|
|
178
|
+
| `references/cli-quick-reference.md` | Need exact CLI flags beyond `--help` |
|
|
179
|
+
| `references/troubleshooting.md` | Diagnosing common errors |
|
|
180
|
+
| `references/examples.md` | Need step-by-step scenario walkthroughs |
|
|
181
|
+
| `references/creator-playbook.md` | Publishing skills others install; before-ship vs after-ship creator loop |
|
|
182
|
+
| `references/interactive-config.md` | Before mutating workflows |
|
|
183
|
+
| `references/grading-methodology.md` | Grading sessions or interpreting grades |
|
|
184
|
+
| `references/invocation-taxonomy.md` | Analyzing trigger coverage |
|
|
185
|
+
| `references/logs.md` | Parsing or debugging log files |
|
|
186
|
+
| `references/setup-patterns.md` | Complex platform-specific setup |
|
|
187
|
+
| `references/version-history.md` | Checking what changed between versions |
|
|
188
|
+
| `settings_snippet.json` | During initialization |
|
|
@@ -50,9 +50,9 @@ parent. Do not ask the user directly unless the parent explicitly told you to.
|
|
|
50
50
|
- The target skill's `SKILL.md`
|
|
51
51
|
- Session transcripts referenced from telemetry or grading evidence
|
|
52
52
|
- Relevant workflow docs:
|
|
53
|
-
- `skill/
|
|
54
|
-
- `skill/
|
|
55
|
-
- `skill/
|
|
53
|
+
- `skill/workflows/Doctor.md`
|
|
54
|
+
- `skill/workflows/Evals.md`
|
|
55
|
+
- `skill/workflows/Evolve.md`
|
|
56
56
|
- `skill/references/grading-methodology.md`
|
|
57
57
|
- `skill/references/invocation-taxonomy.md`
|
|
58
58
|
|
|
@@ -45,9 +45,9 @@ parent. Do not ask the user directly unless the parent explicitly told you to.
|
|
|
45
45
|
- The current `SKILL.md`
|
|
46
46
|
- Existing backup files if present
|
|
47
47
|
- Eval set used for validation
|
|
48
|
-
- `skill/
|
|
49
|
-
- `skill/
|
|
50
|
-
- `skill/
|
|
48
|
+
- `skill/workflows/Evolve.md`
|
|
49
|
+
- `skill/workflows/EvolveBody.md`
|
|
50
|
+
- `skill/workflows/Watch.md`
|
|
51
51
|
- `skill/references/invocation-taxonomy.md`
|
|
52
52
|
|
|
53
53
|
## Review Workflow
|
|
@@ -33,9 +33,9 @@ parent. Do not ask the user directly unless the parent explicitly told you to.
|
|
|
33
33
|
- `selftune doctor` returns structured health data. Use it after each material
|
|
34
34
|
setup change.
|
|
35
35
|
- Use current workflow docs, especially:
|
|
36
|
-
- `skill/
|
|
37
|
-
- `skill/
|
|
38
|
-
- `skill/
|
|
36
|
+
- `skill/workflows/Initialize.md`
|
|
37
|
+
- `skill/workflows/Doctor.md`
|
|
38
|
+
- `skill/workflows/Ingest.md`
|
|
39
39
|
- `skill/references/setup-patterns.md`
|
|
40
40
|
- Respect platform boundaries:
|
|
41
41
|
- Claude Code prefers hooks installed by `selftune init`
|
|
@@ -46,8 +46,8 @@ you to.
|
|
|
46
46
|
- `~/.claude/session_telemetry_log.jsonl`
|
|
47
47
|
- `~/.claude/evolution_audit_log.jsonl`
|
|
48
48
|
- Relevant `SKILL.md` files in the workspace
|
|
49
|
-
- `skill/
|
|
50
|
-
- `skill/
|
|
49
|
+
- `skill/workflows/Composability.md`
|
|
50
|
+
- `skill/workflows/Evals.md`
|
|
51
51
|
- `skill/references/invocation-taxonomy.md`
|
|
52
52
|
|
|
53
53
|
## Analysis Workflow
|