npm - tink-harness - Versions diffs - 1.0.0 - Mend

tink-harness 1.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (44) hide show

package/.claude-plugin/marketplace.json +14 -0
package/.claude-plugin/plugin.json +8 -0
package/CHANGELOG.md +109 -0
package/LICENSE +21 -0
package/README.ko.md +224 -0
package/README.md +166 -0
package/VERSIONING.md +73 -0
package/bin/install.js +520 -0
package/commands/cast.md +484 -0
package/commands/frog.md +77 -0
package/commands/list.md +104 -0
package/commands/setup.md +185 -0
package/commands/update.md +90 -0
package/commands/weave.md +81 -0
package/hooks/hooks.json +15 -0
package/package.json +52 -0
package/skills/tink/SKILL.md +66 -0
package/templates/claude/commands/tink/cast.md +484 -0
package/templates/claude/commands/tink/frog.md +77 -0
package/templates/claude/commands/tink/list.md +104 -0
package/templates/claude/commands/tink/setup.md +185 -0
package/templates/claude/commands/tink/update.md +90 -0
package/templates/claude/commands/tink/weave.md +81 -0
package/templates/claude/skills/tink/SKILL.md +66 -0
package/templates/tink/config.json +20 -0
package/templates/tink/harnesses/HARNESS.md +28 -0
package/templates/tink/harnesses/bug-fix.md +31 -0
package/templates/tink/harnesses/code-change.md +30 -0
package/templates/tink/harnesses/docs.md +30 -0
package/templates/tink/harnesses/harness-curation.md +78 -0
package/templates/tink/harnesses/harness-synthesis.md +52 -0
package/templates/tink/harnesses/index.json +157 -0
package/templates/tink/harnesses/pre-publish-multi-agent-verify.md +44 -0
package/templates/tink/harnesses/research.md +31 -0
package/templates/tink/harnesses/review.md +31 -0
package/templates/tink/harnesses/ship.md +33 -0
package/templates/tink/harnesses/tink-feedback-apply.md +37 -0
package/templates/tink/hooks/user-prompt-submit.json +7 -0
package/templates/tink/hooks/user-prompt-submit.mjs +49 -0
package/templates/tink/maintenance/ledger.jsonl +0 -0
package/templates/tink/maintenance/weave-queue.json +3 -0
package/templates/tink/memory/lessons.md +17 -0
package/templates/tink/memory/mistakes.md +16 -0
package/templates/tink/memory/preferences.md +16 -0

package/templates/claude/commands/tink/cast.md ADDED Viewed

@@ -0,0 +1,484 @@
+---
+description: Choose, build, or synthesize the right harness for the current task.
+---
+# /tink:cast
+Cast the right harness for the task, run it, and capture reusable learning.
+`cast` is the main Tink command. Use it before non-trivial work.
+## Product promise
+Tink is not a harness recommendation list. It must leave the user with an active run state and a concrete next action.
+Tink should:
+1. understand the task,
+2. choose the smallest effective harness/tool set,
+3. replace heavy harnesses when the current stage or token budget makes them harmful,
+4. build or synthesize a narrow harness when none fits,
+5. materialize the harness as a run plan,
+6. execute the first safe step after approval,
+7. prevent repeated mistakes while working,
+8. maintain the harness set through approved memory, weave, or frog proposals.
+## Default behavior
+Do not stop after saying which harness might fit.
+A valid `/tink:cast` response must do one of these:
+- create or update `.tink/current/` and start the harnessed work,
+- ask one blocking question that is required to create `.tink/current/`, or
+- cancel because the user chose not to proceed.
+If the task is clear enough to classify, do not ask broad clarification first. Make a best recommendation, ask for approval, then act.
+## Interaction policy
+Always call the `AskUserQuestion` tool for choice prompts. Do not render `❯` text format. Do not ask the user to type a number inline.
+Map prompt content to `AskUserQuestion` fields:
+- `question`: the full question text
+- `header`: max 12-character tag (e.g. "진행 방식", "하네스 선택", "Git 설정")
+- `label`: 1–5 word option name (e.g. "승인", "조정", "취소"). Add "(권장)" to the first option label if it is recommended.
+- `description`: explanatory text for the option
+Use Korean field values when `.tink/config.json` language is `ko` or `auto` with Korean input; use English otherwise.
+## Readiness check
+Before normal classification, check whether Tink is fully initialized. If `.tink/harnesses/index.json`, `.tink/config.json`, or `.tink/memory/` is missing, do not fail and do not write anything yet. Show a short recovery prompt:
+```text
+Tink is not fully initialized.
+? What would you like to do?
+❯ 1. Run /tink:setup to review or repair setup
+  2. Create the minimal .tink scaffold for this repo
+  3. Continue once with a lightweight one-run harness
+  4. Cancel
+```
+If legacy Tiny files such as `.tiny/` or `/tiny:use` instructions are present, treat them as old state. Explain that `/tink:cast` replaces `/tiny:use`, and offer to migrate useful `.tiny/harnesses/`, `.tiny/config.json`, and `.tiny/memory/` into `.tink/` only after approval. Never tell the user to run `/tiny:use`.
+## Stitch
+Before committing to `.tink/current/`, run Stitch exactly once. Stitch is an internal quality gate inside `/tink:cast`, not a separate `/tink:grill` command and not a real subagent in v1.0.0.
+Evaluate Stitch every time, but show it to the user only when it finds a high-impact quality or safety branch. A clean internal Stitch pass is not recorded.
+When Stitch is visible, show exactly one proposal in this order: proposal, reason, choices.
+1. proposal
+2. reason
+3. choices
+Choose the one proposal by priority:
+1. safety or irreversibility
+2. success criteria or verification
+3. goal or scope ambiguity
+4. harness mismatch
+5. reusable improvement opportunity
+Stitch may change the order or method of work, but it must not change the user's goal without separate approval.
+Follow `.tink/config.json` for language. If language is `auto`, use the current user message language and fall back to English only when unclear.
+Soft gate choices:
+- English: `Approve`, `Add requirements`, `Continue as-is`
+- Korean: `승인`, `요구사항 입력`, `이대로 진행`
+Hard gate choices:
+- English: `Approve`, `Add requirements`, `Cancel`
+- Korean: `승인`, `요구사항 입력`, `취소`
+Hard gates apply when at least one of the following is true for the next action: it is difficult or unsafe to reverse (reusable memory or harness saves, harness creation, edits, frog, weave, deleting files, removing configuration); it has external side-effects or visibility (publishing, deploying, tagging, releasing, opening a public PR, changing broad architecture or public contracts); or it involves sensitive data (secrets, credentials, payments, personal data, or destructive/external side-effect commands).
+Some harnesses are inherently hard-gate territory regardless of the immediate next action. `ship` covers release/publish/deploy/PR, which are listed above. When such a harness is selected, trigger Stitch as a `safety` hard gate during the initial approval — even if the first action is read-only inspection. The hard gate protects the entire run, not just one step.
+Hard gates must not offer `Continue as-is` or `이대로 진행`.
+When Stitch triggers as a **soft gate**, do not call a separate `AskUserQuestion` for Stitch. Instead, add a `**🔍 Stitch**` section inside the main approval format and use a single `AskUserQuestion`. Hard gate Stitch remains a separate call.
+When Stitch is visible and the user responds, record current-run state:
+- `.tink/current/answers.md`: proposal, user choice, explicit assumptions
+- `.tink/current/notes.md`: proposal, risk, reason, follow-up needed
+If the user chooses `Continue as-is` / `이대로 진행`, proceed with the explicit assumptions recorded in `answers.md`.
+Do not record a clean Stitch pass.
+## Reusable State Save Gate
+Reusable State Save Gate is a separate absolute hard approval gate, not merely a Stitch subtype. Current-run approval does not authorize reusable-state writes.
+Reusable state includes:
+- `.tink/memory/*`
+- `.tink/harnesses/*`
+- `.tink/harnesses/index.json`
+- `.tink/config.json` policy changes
+- `.claude/` workflow-affecting commands, skills, settings, or hooks (not simple preferences such as theme or model)
+- template/plugin files that affect future installs
+Before reusable-state writes, show a separate approval payload:
+- operation
+- destination files
+- exact entry text or patch summary
+- why it is reusable
+- sensitive/private content excluded
+- rollback or removal path
+Reusable-state approval choices are `Approve`, `Add requirements`, and `Cancel`, localized when appropriate. Never offer `Continue as-is` or `이대로 진행` for reusable-state writes.
+Show the payload directly at the point of proposal. Do not add a preliminary "do you want to save?" question before it — the payload IS the question.
+When the plan's only non-trivial action is a reusable-state write, create run state silently first, then use Save Gate as the sole approval — skip the separate run-approval question.
+## Run state contract
+After approval, create `.tink/current/` with these files before doing deeper work. `.tink/current/` is the current workbench: the one active task plan Claude should keep updating while it works. It is temporary, local runtime state, not reusable memory and not a knowledge base:
+- `plan.md`: goal, selected harnesses, assumptions, scope, out-of-scope, next steps
+- `checks.md`: done criteria, verification commands, evidence required before final
+- `steps.json`: machine-readable step list with `pending`, `in_progress`, `done`, or `blocked`
+- `notes.md`: short working notes, failures, last safe point, recovery actions
+- `answers.md`: user answers or inferred defaults used for this run
+Also append a compact run record to `.tink/runs/YYYY-MM-DD-HHMM-<slug>.md` when the task completes, is canceled, is blocked, or is superseded. Do not store secrets, raw logs, full diffs, or one-off private context.
+## Current run lifecycle
+Before creating a new `.tink/current/`, check whether one already exists:
+1. No current run: create `.tink/current/` and start.
+2. Same task still active in the same conversation: resume it, update `notes.md`, and continue from the next pending step.
+3. `.tink/current/` exists but the conversation context is gone or uncertain: treat it as a recovery candidate, not as active truth. Even if the user says “continue” or “이어서 해”, first read `plan.md`, `checks.md`, `steps.json`, `notes.md`, and `answers.md`, show the five-line recovery summary below, then ask the user to resume, archive, replace, or cancel. If the user resumes, reuse the prior Stitch decision recorded in `answers.md`; do not re-evaluate Stitch.
+4. Different task requested: if every step in `steps.json` is `done`, auto-archive to `.tink/runs/` without asking and create the new current run. If any step is not `done`, ask whether to archive/replace the old current run. Do not overwrite silently.
+5. Blocked or canceled task: write a compact run record with `outcome: blocked` or `outcome: canceled`, then clear or replace `.tink/current/` after approval.
+6. Superseded task: archive the old state as `outcome: superseded` before creating the new current run.
+A completed or archived current run should not remain ambiguous. Either keep it only because the user explicitly chose to resume, or archive it to `.tink/runs/` and replace it. When context was lost, do not silently continue from `steps.json`; first rebuild a short human summary and get a resume/archive/replace decision.
+Recovery prompt format:
+```text
+이전 작업 복구:
+- 목표:
+- 마지막 안전 지점:
+- 다음 단계:
+- 열린 질문:
+- 검증 상태:
+? 어떻게 할까요?
+❯ 1. 이어가기
+  2. 보관하고 새 작업
+  3. 교체
+  4. 취소
+```
+## Run record schema
+Each `.tink/runs/*.md` record starts with YAML frontmatter:
+```yaml
+---
+run_id: "run-YYYYMMDD-HHMM-slug"
+started_at: "YYYY-MM-DDTHH:MM:SSZ"
+ended_at: "YYYY-MM-DDTHH:MM:SSZ"
+outcome: completed # completed | blocked | canceled | superseded
+task_summary: ""
+selected_harnesses: []
+actually_loaded_harnesses: []
+considered_but_rejected: [] # {name, reason}
+checks_result: pass # pass | fail | blocked | not_run
+user_corrections: [] # compact handles only
+maintenance_suggestions: [] # {op_id, type, target, evidence}
+approved_saves: [] # approval op IDs from .tink/maintenance/ledger.jsonl
+context_footprint: unknown # tiny | small | large | unknown
+---
+```
+The body should be a short human summary: goal, evidence, negative signals, and next safe action if blocked.
+## Maintenance evidence
+When proposing memory saves, harness edits, index updates, weave, or frog, create an operation ID and cite evidence handles. Evidence handles should be compact paths such as `.tink/runs/<file>.md`, `.tink/current/notes.md`, failed check names, or user correction snippets. Do not use raw logs as evidence.
+Approved reusable changes should append one JSON line to `.tink/maintenance/ledger.jsonl` with:
+```json
+{ "timestamp": "", "op_id": "op-...", "type": "weave|frog|memory|index-update|harness-create|harness-edit", "files": [], "evidence": [], "approval": "", "result": "applied|rejected|deferred", "rollback": "" }
+```
+## Procedure
+1. Read `.tink/harnesses/index.json` first. Do not read every harness.
+2. Read small memory files where `config.json` sets `memory_has_entries.<name>: true`. Skip files set to `false`. After a Save Gate approves a new memory entry, set that file's flag to `true` in `config.json`.
+   - `.tink/memory/mistakes.md`
+   - `.tink/memory/preferences.md`
+   - `.tink/memory/lessons.md`
+3. Classify the task:
+   - code change
+   - bug fix
+   - research
+   - review
+   - docs
+   - ship/release
+   - new pattern not covered yet
+4. Pick the best existing harness set using the context budget policy below. Prefer 1-3 harnesses, but do not use a hard cap when several tiny harnesses add useful checks without crowding context. When the task is ambiguous (Stitch goal-ambiguity is expected to trigger), start with a single best-fit harness; add a second only after the user clarifies. Do not bundle 2+ harnesses for ambiguous tasks upfront.
+5. Run the synthesis probe on the initial harness choice. The probe produces one of three outcomes: strong fit (0-1 yes), generic fit (2-3 yes), or no fit (4-5 yes or no harness matches).
+6. If the probe finds no fit, load `harness-synthesis` and draft a domain-specific harness for this run instead of forcing a bad fit.
+7. If the probe finds a generic fit (2-3 yes), propose a run-only draft harness or domain rules alongside the built-in harness. Do not save it by default.
+8. If too many tools, skills, agents, or harnesses are available, load `harness-curation` and choose the smallest effective set before loading more context.
+9. If lightweight signals show a recurring operating habit, use `harness-curation` (its habit calibration section) to make one advisory recommendation without loading a separate body.
+10. If the user points to research, notes, examples, prior failures, or "what I learned today", synthesize from those inputs. Extract behavior-shaping rules and reusable procedure, not a summary.
+11. Run Stitch once before committing to `.tink/current/`. If it triggers, show exactly one proposal before approval. Call `AskUserQuestion` as described in the Interaction policy section.
+12. Ask for explicit approval before non-trivial work.
+13. After approval, read only the selected harness files and any approved run-only draft.
+14. Create `.tink/current/` files from the run state contract.
+15. Execute the first safe step immediately:
+   - inspect relevant files,
+   - run a read-only diagnostic,
+   - draft the first artifact,
+   - or reproduce the issue.
+16. Keep `steps.json` and `notes.md` current as work progresses.
+17. Before final, verify `checks.md` and report evidence.
+18. If the task exposed a repeated mistake or reusable improvement, use the Reusable State Save Gate approval payload below. Save only after separate user approval.
+## Synthesis probe
+Run this short probe even when a built-in harness seems usable. It prevents broad default harnesses from hiding repeatable domain workflows.
+Answer yes/no:
+1. Is this likely to recur in this repo, product, customer segment, release process, or personal workflow?
+2. Would a domain-specific rule change the first action, the order of steps, the stop condition, or the verification evidence?
+3. Is the selected built-in harness only a loose or generic fit?
+4. Did the user correction, prior run note, failed check, research source, or named project context expose a reusable rule?
+5. Would a one-screen draft reduce future context or repeated explanation?
+Decision:
+- 0-1 yes: use the selected built-in harness only. Record why no draft is needed if relevant.
+- 2-3 yes: propose a run-only draft harness. It applies to this run, is written into `.tink/current/plan.md` or `notes.md`, and is not saved by default.
+- 4-5 yes: propose a run-only draft now and ask whether it should become a save candidate after the run. Saving still needs the approval payload.
+Run-only draft format:
+```text
+임시 하네스 초안 (이번 작업 전용):
+- name: <specific-lowercase-name>
+- why not just built-in: <one sentence>
+- domain rules: <2-4 bullets that change execution>
+- checks: <2-4 evidence checks>
+- save policy: 이번 run에는 적용, 저장은 반복 근거와 별도 승인 후만
+```
+A run-only draft is not reusable memory. Do not update `.tink/harnesses/`, `index.json`, or `.tink/maintenance/ledger.jsonl` unless the user separately approves saving.
+## Context budget policy
+Do not use one universal harness cap. Choose by context footprint and task risk. Classify size by how much thinking and checking the harness adds, not only by file length:
+- Tiny harnesses: one screen or less, one clear trigger, no extra tool chain, and one or two checks. May exceed 4 when each is directly useful. Still explain why each earns its place.
+- Small harnesses: checklist-sized, one work type, a few checks, and limited recovery rules. Usually 1-4 active bodies. Add more only when the task has separate risks that need separate checks.
+- Large harnesses: multi-phase, tool-heavy, research-heavy, multi-agent, or broad enough to change the whole workflow. Load one at a time and only after approval.
+- Meta harnesses (`harness-curation`, `harness-synthesis`): do not do the end-user task directly. They decide whether to choose, reduce, replace, create, or tune other harnesses. Count their context cost and use them to reduce or replace the active set, not to pile on top by default.
+- No hard cap mode is allowed for complex tasks, but it must be explicit: state the expected context cost, why no cap is safer, and what will be unloaded or summarized first.
+If the harness list feels heavy, stop and use `harness-curation` before loading more bodies.
+## Approval payload for saves
+This is the Reusable State Save Gate payload. Before saving memory, a new harness, a harness edit, or index metadata, show:
+- operation: memory-save | harness-create | harness-edit | index-update | frog | weave
+- destination files
+- exact entry text or patch summary
+- why it is reusable
+- sensitive/private content excluded
+- evidence handles
+- rollback or removal path
+- approval ledger entry path: `.tink/maintenance/ledger.jsonl`
+Do not save if the user approved only the current run. Saving reusable state needs separate approval.
+## Approval format
+Use concise, selection-oriented wording. The recommendation must include the first action Tink will perform, not only the harness name.
+Approval option counts (always exactly one applies):
+- Default (no Stitch, no run-only draft): 4 options — 승인 / 조정 / 새 하네스 초안 만들기 / 취소
+- Run-only draft offered: 4 options — 승인 / 조정 / 기본 하네스만 사용 / 취소
+- Stitch soft gate: 4 options — 승인 / 요구사항 입력 / 이대로 진행 / 취소
+- Stitch hard gate (or Save Gate): 3 options — 승인 / 요구사항 입력 / 취소. Never offer `이대로 진행` / `Continue as-is`.
+```text
+### 🧶 Run: <task name>
+**🎯 Goals**
+- <goal>
+**🛠️ Harness**: `code-change + review`
+- **Probe:** 1 yes — built-in 하네스로 충분
+- **이유:** 변경 범위가 좁고, 회귀 확인이 필요합니다.
+- **첫 실행:** 관련 파일을 먼저 읽고 검증 명령 후보를 확정합니다.
+? 진행할까요?
+❯ 1. 승인 (권장) — 실행 상태 생성 후 첫 실행까지 진행
+  2. 조정 — 다른 하네스 조합 선택
+  3. 새 하네스 초안 만들기
+  4. 취소
+```
+If a run-only draft or new harness is useful:
+```text
+### 🧶 Run: <task name>
+**🎯 Goals**
+- <goal>
+**🛠️ Harness**: `<built-in>` (probe: 3 yes — generic fit)
+**임시 하네스 초안** (이번 작업 전용):
+- **name:** `customer-interview-synthesis`
+- **why not just built-in:** 일반 research보다 인터뷰 단위, 원문 근거, pain point 반복성이 중요합니다.
+- **domain rules:**
+  - 인터뷰별 원문 근거를 먼저 분리
+  - 반복 pain point와 단발 의견을 구분
+  - 제품 기회와 다음 검증 질문을 함께 남김
+- **checks:** 원문 근거, 추측 분리, 다음 액션
+- **save policy:** 이번 run에는 적용, 저장은 반복 근거와 별도 승인 후만
+? 진행할까요?
+❯ 1. 승인 (권장) — 기본 하네스 + 임시 초안으로 `.tink/current/` 생성
+  2. 조정
+  3. 기본 하네스만 사용
+  4. 취소
+```
+If Stitch triggers as a soft gate, merge it into the approval format. The user-facing block uses plain language — never the word `Stitch`. The Korean default uses `점검 사항`; English uses `Review note`:
+```text
+### 🧶 Run: <task name>
+**🎯 Goals**
+- <goal>
+**🔍 점검 사항**
+- 제안: <one proposal>
+- 이유: <reason>
+- 이대로 진행 시 가정: <explicit assumption>
+**🛠️ Harness**: `<harness>`
+- **Probe:** ...
+- **이유:** ...
+- **첫 실행:** ...
+? 진행할까요?
+❯ 1. 승인 (권장) — 점검 가정 포함 진행
+  2. 요구사항 입력 — 점검 제안 또는 계획 조정
+  3. 이대로 진행 — 점검 무시하고 원래 계획대로
+  4. 취소
+```
+## Harness synthesis contract
+When creating a new harness or run-only draft, Tink must create a procedure that would outperform a generic skill recommendation for a repeated task.
+Do not wait for total mismatch. `generic fit` is enough to draft when the synthesis probe says the task has repeatable domain rules.
+A generated harness can encode:
+- domain triggers: when this exact workflow should run
+- source inputs: research notes, examples, project files, prior run notes, failures, user corrections
+- decision rules: how to choose options, reject bad paths, or stop
+- tool sequence: what to inspect, search, run, draft, verify, or avoid first
+- checks: objective evidence required before final
+- recovery: what to do when a check fails
+- memory rule: what may become reusable memory or harness improvement
+Do not generate broad harnesses like `coding-helper` or `research-assistant`. Generate narrow harnesses like `nextjs-rsc-boundary-refactor`, `pre-pr-security-gate`, or `cafe-menu-validation-note`.
+Before saving, score the candidate 1-5 on specificity, actionability, verifiability, reuse likelihood, and context cost. Save only if the weak points are acceptable and the user approves.
+## `plan.md` template
+```md
+# Tink current run
+## Goal
+-
+## Selected harnesses
+-
+## Why this harness
+-
+## Scope
+-
+## Out of scope
+-
+## Assumptions / answers
+-
+## Next steps
+1.
+```
+## `checks.md` template
+```md
+# Checks
+## Done means
+-
+## Verification
+-
+## Evidence to report
+-
+## Stop conditions
+-
+```
+## `answers.md` template
+```md
+# Answers and assumptions
+## User answers
+-
+## Inferred defaults
+-
+## Open questions
+-
+```
+## `steps.json` template
+```json
+{
+  "goal": "",
+  "harnesses": [],
+  "steps": [
+    { "id": "1", "status": "in_progress", "description": "Create run state and inspect the target", "started_at": "", "completed_at": "" }
+  ]
+}
+```
+## Meaning of `context`
+When listing harnesses, define `context` once:
+```text
+context는 이 harness가 Claude 작업 컨텍스트를 얼마나 차지하는지입니다.
+- tiny: 아주 짧음
+- small: 보통 체크리스트
+- large: 별도 승인 후 읽는 큰 하네스
+```
+## Other slash skills
+Tink does not automatically wrap `/grill-me`, `/diagnose`, `/tdd`, or other slash skills. That is intentional. If needed, run `/tink:cast` first, then use the other skill output as input.
+## Failure behavior
+If a check fails:
+- write the failure to `.tink/current/notes.md`,
+- identify the last safe point,
+- take one recovery action,
+- update `steps.json`,
+- then update the harness or memory only if the lesson is reusable and approved.
+## Do not
+- Do not end with a harness recommendation only.
+- Do not load every harness body up front.
+- Do not create memory entries without separate Reusable State Save Gate approval.
+- Do not store raw logs, full diffs, secrets, or one-off task progress as reusable memory.
+- Do not ask "do you want to save?" before showing the Reusable State Save Gate payload. Show the payload directly.
+- Do not narrate .tink/ file writes (current/, runs/, memory/, config.json) in the response body. Do not show diff summaries, file lists, or "I created X / I updated Y" breakdowns. The tool-use header is sufficient on its own. At the end of the response, add at most one short sentence summarizing what changed across all .tink/ writes.
+- Do not use Tink-internal jargon (Stitch, hard gate, Save Gate, Reusable State, or temporary labels like G1/G2/G3) when writing user-facing responses. Translate to plain language matching `config.json` language. Internal documentation and code keep original terms for consistency.

package/templates/claude/commands/tink/frog.md ADDED Viewed

@@ -0,0 +1,77 @@
+---
+description: Propose unused or redundant harness cleanup without deleting automatically.
+---
+# /tink:frog
+Find harnesses that are probably unused or redundant, then ask before removing them.
+## Purpose
+Keep Tink small. A large harness set defeats the point.
+## Interaction policy
+Always call the `AskUserQuestion` tool for choice prompts. Do not render `❯` text format. Do not ask the user to type a number inline.
+Map prompt content to `AskUserQuestion` fields:
+- `question`: the full question text
+- `header`: max 12-character tag (e.g. "진행 방식", "정리 방식")
+- `label`: 1–5 word option name. Add "(권장)" if recommended.
+- `description`: explanatory text for the option
+Use Korean field values when `.tink/config.json` language is `ko` or `auto` with Korean input; use English otherwise.
+## Procedure
+1. Read `.tink/harnesses/index.json`.
+2. Check compact evidence if available:
+   - `.tink/runs/` summaries
+   - `.tink/maintenance/ledger.jsonl`
+   - `.tink/maintenance/weave-queue.json`
+   - references in memory files
+   - recent git history touching harness files as weak context only
+3. Treat `.tink/current/notes.md` as weak evidence unless it is clearly from the same active conversation. If uncertain, label it `stale current candidate`.
+4. Grade evidence before recommending action:
+   - strong: multiple run or ledger records show non-use, repeated rejection, replacement, or accepted alternative
+   - medium: one run or ledger record plus clear overlap or memory evidence
+   - weak: static index, git-only evidence, stale current notes, or model judgment
+5. Identify candidates:
+   - never used with strong evidence
+   - not used recently with strong evidence
+   - overlaps strongly with another harness
+   - too broad to guide behavior
+   - repeatedly ignored during `/tink:cast`
+6. For each candidate, show evidence grade and recommendation:
+   - keep
+   - merge into another harness
+   - delete
+   - rewrite via `/tink:weave`
+7. Only strong evidence may recommend `delete`. Medium evidence may recommend `merge` or `hone`. Weak evidence must default to `keep` or `needs evidence`.
+8. For each non-keep action, prepare an operation-specific approval payload with exact files, op ID, evidence handles, and rollback.
+9. If the recommendation is `weave`, write or present a weave handoff packet and, after approval, add it to `.tink/maintenance/weave-queue.json`:
+   - id
+   - target harness
+   - evidence
+   - proposed direction
+   - affected files
+   - approval status
+10. Ask for approval before changing files.
+11. If approved, remove or merge surgically, update `.tink/harnesses/index.json`, and append the approval/result to `.tink/maintenance/ledger.jsonl`.
+## Approval format
+```text
+Purge candidates with operation IDs:
+- docs: keep. Evidence grade=strong. Used recently and distinct.
+- op-1 ship: hone. Evidence grade=medium. Handoff: target=ship, direction=tighten release checks.
+- old-research: needs evidence. Evidence grade=weak. Static index only, so no delete recommendation.
+? 진행할까요?
+❯ 1. 승인 — 추천안 적용
+  2. 일부만 적용 — op ID로 선택
+  3. 취소
+```
+## Do not
+- Do not delete without approval.
+- Do not delete built-in harnesses only because usage data is missing.
+- Do not treat missing `.tink/runs/` as proof of non-use.
+- Do not recommend delete from weak evidence.
+- Do not apply a delete, merge, weave handoff, or index update without an operation-specific approval payload.

package/templates/claude/commands/tink/list.md ADDED Viewed

@@ -0,0 +1,104 @@
+---
+description: Inspect available Tink harnesses and recent usage signals.
+---
+# /tink:list
+List available Tink harnesses without loading every harness body.
+## Procedure
+1. Read `.tink/harnesses/index.json`.
+2. Read only compact usage metadata from `.tink/runs/` (frontmatter `selected_harnesses` / `actually_loaded_harnesses` + dates), `.tink/maintenance/ledger.jsonl`, and `.tink/maintenance/weave-queue.json`. Do not load raw logs.
+3. Treat `.tink/current/` as weak evidence unless it is clearly from the same active conversation. If context is uncertain, label it `stale current candidate`, not proof of usage.
+4. Classify every harness into exactly one of three categories:
+   - **working** — directly performs tasks (e.g. `code-change`, `bug-fix`, `research`, `review`, `docs`, `ship`).
+   - **meta** — manages other harnesses or Tink itself. Treat these names as meta regardless of `kind`: `harness-synthesis`, `harness-curation`, `tink-feedback-apply`.
+   - **custom (this repo)** — `kind: synthesized` in `index.json` (created in this repo, not part of the default set). If a synthesized harness also matches a meta name, prefer meta.
+5. Compute the signal per harness:
+   - 🟢 **active** — appears in any `.tink/runs/*.md` frontmatter or `.tink/maintenance/ledger.jsonl` entry.
+   - ⚪ **unknown** — no run/ledger/memory evidence. Do not call it `quiet` or `candidate for purge` from the static index alone. Do not infer non-use from missing evidence.
+6. Show all three categories every time, even when one is empty. For an empty category, render `_(아직 없음)_` (or the English equivalent if the project language is `en`) instead of an item list.
+7. Do not output the `evidence` field. Usage is now compressed into `signal`.
+## Output format
+Always start with a header block that defines the fields and categories. Render each harness as a multi-line block — one field per line, never collapsed onto one line. Close with an assessment and command suggestions.
+Use this exact skeleton (translate field labels and descriptions to the language in `.tink/config.json`):
+````markdown
+### 🧶 Tink 하네스 목록
+> **필드 설명**
+> - **purpose** — 이 하네스가 다루는 작업
+> - **context** — Claude 컨텍스트 점유량
+>   · `tiny` 아주 짧음  · `small` 보통 체크리스트  · `large` 별도 승인 후 읽는 큰 하네스
+> - **last used** — 가장 최근 실행 날짜 (없으면 `미사용`)
+> - **signal** — 🟢 `active` 사용 기록 있음  · ⚪ `unknown` 아직 사용 기록 없음
+>
+> **카테고리 설명**
+> - **작업 하네스** — 실제 작업을 수행 (코드 변경·리뷰·문서 등)
+> - **메타 하네스** — 다른 하네스나 Tink 자체를 관리 (선택·합성·피드백 반영)
+> - **이 저장소 전용** — 이 프로젝트에서 직접 만들어 저장된 하네스
+---
+#### 🛠️ 작업 하네스
+##### `<name>`
+- **purpose**: <one short sentence>
+- **context**: <tiny | small | large>
+- **last used**: <YYYY-MM-DD | 미사용>
+- **signal**: 🟢 active | ⚪ unknown
+#### 🧭 메타 하네스
+##### `<name>`
+- **purpose**: …
+- **context**: …
+- **last used**: …
+- **signal**: …
+(또는 비어 있으면)
+_(아직 없음)_
+#### 🔧 이 저장소 전용
+##### `<name>`
+- **purpose**: …
+- **context**: …
+- **last used**: …
+- **signal**: …
+(또는 비어 있으면)
+_(아직 없음)_
+---
+### 📊 평가
+- **가장 활발**: …
+- **한 번도 안 쓴 하네스**: …
+- **균형/주의점**: 한두 문장 평가.
+### 💡 다음에 쓸 수 있는 명령
+- `/tink:cast <작업 설명>` — 적절한 하네스를 골라 작업 시작
+- `/tink:weave` — 자주 쓰는 하네스에 누적된 개선 사항 반영 (해당될 때만)
+- `/tink:frog` — 오래 사용 안 된 하네스 정리 후보 검토 (실제 삭제는 별도 승인)
+- `/tink:setup` — 언어·범위·훅 정책 등 Tink 설정 점검
+````
+## Assessment & command-suggestion rules
+- The 평가 section must mention at least: the most-used harness, every harness with an `unknown` signal, and any obvious imbalance (e.g. meta harnesses all untouched).
+- Always include `/tink:cast` and `/tink:setup` as default next steps.
+- Only suggest `/tink:weave` when at least one active harness has user-correction evidence, repeated runs of the same category, or items queued in `.tink/maintenance/weave-queue.json`.
+- Only suggest `/tink:frog` when at least one harness has been `unknown` for the entire visible history AND there is no plausible upcoming use. Frame it as "정리 후보 검토", not "삭제".
+## Output style
+Use bullets, not tables. One field per line per harness. Never collapse a harness into a single line.
+## Do not
+- Do not read every harness body by default.
+- Do not infer non-use from missing evidence.
+- Do not remove anything. Use `/tink:frog` for removal candidates.
+- Do not output the `evidence` field.
+- Do not hide a category because it has zero items — render `_(아직 없음)_` instead.