omakaseagent 0.1.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (187) hide show
  1. package/LICENSE +182 -0
  2. package/OMAKASE-CRITIQUE.md +12 -0
  3. package/OMAKASE-PRINCIPLES.md +15 -0
  4. package/OMAKASE-RULES.md +25 -0
  5. package/README.md +96 -0
  6. package/bin/omakase.js +571 -0
  7. package/dist/agents/.agents/skills/omakase/OMAKASE-CRITIQUE.md +12 -0
  8. package/dist/agents/.agents/skills/omakase/OMAKASE-PRINCIPLES.md +15 -0
  9. package/dist/agents/.agents/skills/omakase/OMAKASE-RULES.md +25 -0
  10. package/dist/agents/.agents/skills/omakase/SKILL.md +177 -0
  11. package/dist/agents/.agents/skills/omakase/TEAMS.md +120 -0
  12. package/dist/agents/.agents/skills/omakase/core/omakase-core.md +43 -0
  13. package/dist/agents/.agents/skills/omakase/reference/archivist-workflows.md +178 -0
  14. package/dist/agents/.agents/skills/omakase/reference/backlog-audit.md +168 -0
  15. package/dist/agents/.agents/skills/omakase/reference/critique.md +92 -0
  16. package/dist/agents/.agents/skills/omakase/reference/dark-factory.md +111 -0
  17. package/dist/agents/.agents/skills/omakase/reference/engineering.md +137 -0
  18. package/dist/agents/.agents/skills/omakase/reference/execution-plan.md +159 -0
  19. package/dist/agents/.agents/skills/omakase/reference/factory-orchestration.md +123 -0
  20. package/dist/agents/.agents/skills/omakase/reference/handoff.md +43 -0
  21. package/dist/agents/.agents/skills/omakase/reference/init.md +146 -0
  22. package/dist/agents/.agents/skills/omakase/reference/learn.md +66 -0
  23. package/dist/agents/.agents/skills/omakase/reference/native-agents.md +45 -0
  24. package/dist/agents/.agents/skills/omakase/reference/plan.md +79 -0
  25. package/dist/agents/.agents/skills/omakase/reference/skill-judge.md +133 -0
  26. package/dist/agents/.agents/skills/omakase/reference/task-intake.md +94 -0
  27. package/dist/agents/.agents/skills/omakase/reference/taste.md +33 -0
  28. package/dist/agents/.agents/skills/omakase/reference/team-architecture.md +38 -0
  29. package/dist/agents/.agents/skills/omakase/teams/archives/lead.md +77 -0
  30. package/dist/agents/.agents/skills/omakase/teams/archives/sub-personas/memory-synthesizer.md +66 -0
  31. package/dist/agents/.agents/skills/omakase/teams/critics/lead.md +94 -0
  32. package/dist/agents/.agents/skills/omakase/teams/critics/sub-personas/deslop-critic.md +52 -0
  33. package/dist/agents/.agents/skills/omakase/teams/critics/sub-personas/skill-judge.md +59 -0
  34. package/dist/agents/.agents/skills/omakase/teams/critics/sub-personas/structural-critic.md +112 -0
  35. package/dist/agents/.agents/skills/omakase/teams/critics/sub-personas/verification-critic.md +73 -0
  36. package/dist/agents/.agents/skills/omakase/teams/engineering/lead.md +111 -0
  37. package/dist/agents/.agents/skills/omakase/teams/engineering/sub-personas/debugger.md +44 -0
  38. package/dist/agents/.agents/skills/omakase/teams/engineering/sub-personas/implementation-lead.md +43 -0
  39. package/dist/agents/.agents/skills/omakase/teams/engineering/sub-personas/refactor-specialist.md +56 -0
  40. package/dist/agents/.agents/skills/omakase/teams/engineering/sub-personas/senior-reviewer.md +83 -0
  41. package/dist/agents/.opencode/agents/omakase-archivist.md +24 -0
  42. package/dist/agents/.opencode/agents/omakase-critic.md +32 -0
  43. package/dist/agents/.opencode/agents/omakase-debugger.md +15 -0
  44. package/dist/agents/.opencode/agents/omakase-deslop-critic.md +15 -0
  45. package/dist/agents/.opencode/agents/omakase-engineer.md +38 -0
  46. package/dist/agents/.opencode/agents/omakase-implementation-lead.md +15 -0
  47. package/dist/agents/.opencode/agents/omakase-memory-synthesizer.md +15 -0
  48. package/dist/agents/.opencode/agents/omakase-refactor-specialist.md +15 -0
  49. package/dist/agents/.opencode/agents/omakase-senior-reviewer.md +17 -0
  50. package/dist/agents/.opencode/agents/omakase-skill-judge.md +17 -0
  51. package/dist/agents/.opencode/agents/omakase-structural-critic.md +15 -0
  52. package/dist/agents/.opencode/agents/omakase-verification-critic.md +15 -0
  53. package/dist/chat/omakase/SKILL.md +84 -0
  54. package/dist/claude/.claude/agents/omakase-archivist.md +21 -0
  55. package/dist/claude/.claude/agents/omakase-critic.md +25 -0
  56. package/dist/claude/.claude/agents/omakase-engineer.md +32 -0
  57. package/dist/claude/.claude/skills/omakase/OMAKASE-CRITIQUE.md +12 -0
  58. package/dist/claude/.claude/skills/omakase/OMAKASE-PRINCIPLES.md +15 -0
  59. package/dist/claude/.claude/skills/omakase/OMAKASE-RULES.md +25 -0
  60. package/dist/claude/.claude/skills/omakase/SKILL.md +177 -0
  61. package/dist/claude/.claude/skills/omakase/TEAMS.md +120 -0
  62. package/dist/claude/.claude/skills/omakase/core/omakase-core.md +43 -0
  63. package/dist/claude/.claude/skills/omakase/reference/archivist-workflows.md +178 -0
  64. package/dist/claude/.claude/skills/omakase/reference/backlog-audit.md +168 -0
  65. package/dist/claude/.claude/skills/omakase/reference/critique.md +92 -0
  66. package/dist/claude/.claude/skills/omakase/reference/dark-factory.md +111 -0
  67. package/dist/claude/.claude/skills/omakase/reference/engineering.md +137 -0
  68. package/dist/claude/.claude/skills/omakase/reference/execution-plan.md +159 -0
  69. package/dist/claude/.claude/skills/omakase/reference/factory-orchestration.md +123 -0
  70. package/dist/claude/.claude/skills/omakase/reference/handoff.md +43 -0
  71. package/dist/claude/.claude/skills/omakase/reference/init.md +146 -0
  72. package/dist/claude/.claude/skills/omakase/reference/learn.md +66 -0
  73. package/dist/claude/.claude/skills/omakase/reference/native-agents.md +45 -0
  74. package/dist/claude/.claude/skills/omakase/reference/plan.md +79 -0
  75. package/dist/claude/.claude/skills/omakase/reference/skill-judge.md +133 -0
  76. package/dist/claude/.claude/skills/omakase/reference/task-intake.md +94 -0
  77. package/dist/claude/.claude/skills/omakase/reference/taste.md +33 -0
  78. package/dist/claude/.claude/skills/omakase/reference/team-architecture.md +38 -0
  79. package/dist/claude/.claude/skills/omakase/teams/archives/lead.md +77 -0
  80. package/dist/claude/.claude/skills/omakase/teams/archives/sub-personas/memory-synthesizer.md +66 -0
  81. package/dist/claude/.claude/skills/omakase/teams/critics/lead.md +94 -0
  82. package/dist/claude/.claude/skills/omakase/teams/critics/sub-personas/deslop-critic.md +52 -0
  83. package/dist/claude/.claude/skills/omakase/teams/critics/sub-personas/skill-judge.md +59 -0
  84. package/dist/claude/.claude/skills/omakase/teams/critics/sub-personas/structural-critic.md +112 -0
  85. package/dist/claude/.claude/skills/omakase/teams/critics/sub-personas/verification-critic.md +73 -0
  86. package/dist/claude/.claude/skills/omakase/teams/engineering/lead.md +111 -0
  87. package/dist/claude/.claude/skills/omakase/teams/engineering/sub-personas/debugger.md +44 -0
  88. package/dist/claude/.claude/skills/omakase/teams/engineering/sub-personas/implementation-lead.md +43 -0
  89. package/dist/claude/.claude/skills/omakase/teams/engineering/sub-personas/refactor-specialist.md +56 -0
  90. package/dist/claude/.claude/skills/omakase/teams/engineering/sub-personas/senior-reviewer.md +83 -0
  91. package/dist/codex/.codex/agents/omakase-archivist.toml +133 -0
  92. package/dist/codex/.codex/agents/omakase-critic.toml +149 -0
  93. package/dist/codex/.codex/agents/omakase-debugger.toml +92 -0
  94. package/dist/codex/.codex/agents/omakase-deslop-critic.toml +100 -0
  95. package/dist/codex/.codex/agents/omakase-engineer.toml +167 -0
  96. package/dist/codex/.codex/agents/omakase-implementation-lead.toml +91 -0
  97. package/dist/codex/.codex/agents/omakase-memory-synthesizer.toml +114 -0
  98. package/dist/codex/.codex/agents/omakase-refactor-specialist.toml +104 -0
  99. package/dist/codex/.codex/agents/omakase-senior-reviewer.toml +127 -0
  100. package/dist/codex/.codex/agents/omakase-skill-judge.toml +106 -0
  101. package/dist/codex/.codex/agents/omakase-structural-critic.toml +160 -0
  102. package/dist/codex/.codex/agents/omakase-verification-critic.toml +121 -0
  103. package/dist/cursor/.cursor/agents/omakase-archivist.md +21 -0
  104. package/dist/cursor/.cursor/agents/omakase-critic.md +25 -0
  105. package/dist/cursor/.cursor/agents/omakase-engineer.md +32 -0
  106. package/dist/cursor/.cursor/skills/omakase/OMAKASE-CRITIQUE.md +12 -0
  107. package/dist/cursor/.cursor/skills/omakase/OMAKASE-PRINCIPLES.md +15 -0
  108. package/dist/cursor/.cursor/skills/omakase/OMAKASE-RULES.md +25 -0
  109. package/dist/cursor/.cursor/skills/omakase/SKILL.md +177 -0
  110. package/dist/cursor/.cursor/skills/omakase/TEAMS.md +120 -0
  111. package/dist/cursor/.cursor/skills/omakase/core/omakase-core.md +43 -0
  112. package/dist/cursor/.cursor/skills/omakase/reference/archivist-workflows.md +178 -0
  113. package/dist/cursor/.cursor/skills/omakase/reference/backlog-audit.md +168 -0
  114. package/dist/cursor/.cursor/skills/omakase/reference/critique.md +92 -0
  115. package/dist/cursor/.cursor/skills/omakase/reference/dark-factory.md +111 -0
  116. package/dist/cursor/.cursor/skills/omakase/reference/engineering.md +137 -0
  117. package/dist/cursor/.cursor/skills/omakase/reference/execution-plan.md +159 -0
  118. package/dist/cursor/.cursor/skills/omakase/reference/factory-orchestration.md +123 -0
  119. package/dist/cursor/.cursor/skills/omakase/reference/handoff.md +43 -0
  120. package/dist/cursor/.cursor/skills/omakase/reference/init.md +146 -0
  121. package/dist/cursor/.cursor/skills/omakase/reference/learn.md +66 -0
  122. package/dist/cursor/.cursor/skills/omakase/reference/native-agents.md +45 -0
  123. package/dist/cursor/.cursor/skills/omakase/reference/plan.md +79 -0
  124. package/dist/cursor/.cursor/skills/omakase/reference/skill-judge.md +133 -0
  125. package/dist/cursor/.cursor/skills/omakase/reference/task-intake.md +94 -0
  126. package/dist/cursor/.cursor/skills/omakase/reference/taste.md +33 -0
  127. package/dist/cursor/.cursor/skills/omakase/reference/team-architecture.md +38 -0
  128. package/dist/cursor/.cursor/skills/omakase/teams/archives/lead.md +77 -0
  129. package/dist/cursor/.cursor/skills/omakase/teams/archives/sub-personas/memory-synthesizer.md +66 -0
  130. package/dist/cursor/.cursor/skills/omakase/teams/critics/lead.md +94 -0
  131. package/dist/cursor/.cursor/skills/omakase/teams/critics/sub-personas/deslop-critic.md +52 -0
  132. package/dist/cursor/.cursor/skills/omakase/teams/critics/sub-personas/skill-judge.md +59 -0
  133. package/dist/cursor/.cursor/skills/omakase/teams/critics/sub-personas/structural-critic.md +112 -0
  134. package/dist/cursor/.cursor/skills/omakase/teams/critics/sub-personas/verification-critic.md +73 -0
  135. package/dist/cursor/.cursor/skills/omakase/teams/engineering/lead.md +111 -0
  136. package/dist/cursor/.cursor/skills/omakase/teams/engineering/sub-personas/debugger.md +44 -0
  137. package/dist/cursor/.cursor/skills/omakase/teams/engineering/sub-personas/implementation-lead.md +43 -0
  138. package/dist/cursor/.cursor/skills/omakase/teams/engineering/sub-personas/refactor-specialist.md +56 -0
  139. package/dist/cursor/.cursor/skills/omakase/teams/engineering/sub-personas/senior-reviewer.md +83 -0
  140. package/dist/grok/.grok/agents/omakase-archivist.md +25 -0
  141. package/dist/grok/.grok/agents/omakase-critic.md +28 -0
  142. package/dist/grok/.grok/agents/omakase-debugger.md +17 -0
  143. package/dist/grok/.grok/agents/omakase-deslop-critic.md +17 -0
  144. package/dist/grok/.grok/agents/omakase-engineer.md +36 -0
  145. package/dist/grok/.grok/agents/omakase-implementation-lead.md +17 -0
  146. package/dist/grok/.grok/agents/omakase-memory-synthesizer.md +17 -0
  147. package/dist/grok/.grok/agents/omakase-refactor-specialist.md +17 -0
  148. package/dist/grok/.grok/agents/omakase-senior-reviewer.md +17 -0
  149. package/dist/grok/.grok/agents/omakase-skill-judge.md +17 -0
  150. package/dist/grok/.grok/agents/omakase-structural-critic.md +17 -0
  151. package/dist/grok/.grok/agents/omakase-verification-critic.md +17 -0
  152. package/dist/grok/.grok/skills/omakase/OMAKASE-CRITIQUE.md +12 -0
  153. package/dist/grok/.grok/skills/omakase/OMAKASE-PRINCIPLES.md +15 -0
  154. package/dist/grok/.grok/skills/omakase/OMAKASE-RULES.md +25 -0
  155. package/dist/grok/.grok/skills/omakase/SKILL.md +177 -0
  156. package/dist/grok/.grok/skills/omakase/TEAMS.md +120 -0
  157. package/dist/grok/.grok/skills/omakase/core/omakase-core.md +43 -0
  158. package/dist/grok/.grok/skills/omakase/reference/archivist-workflows.md +178 -0
  159. package/dist/grok/.grok/skills/omakase/reference/backlog-audit.md +168 -0
  160. package/dist/grok/.grok/skills/omakase/reference/critique.md +92 -0
  161. package/dist/grok/.grok/skills/omakase/reference/dark-factory.md +111 -0
  162. package/dist/grok/.grok/skills/omakase/reference/engineering.md +137 -0
  163. package/dist/grok/.grok/skills/omakase/reference/execution-plan.md +159 -0
  164. package/dist/grok/.grok/skills/omakase/reference/factory-orchestration.md +123 -0
  165. package/dist/grok/.grok/skills/omakase/reference/handoff.md +43 -0
  166. package/dist/grok/.grok/skills/omakase/reference/init.md +146 -0
  167. package/dist/grok/.grok/skills/omakase/reference/learn.md +66 -0
  168. package/dist/grok/.grok/skills/omakase/reference/native-agents.md +45 -0
  169. package/dist/grok/.grok/skills/omakase/reference/plan.md +79 -0
  170. package/dist/grok/.grok/skills/omakase/reference/skill-judge.md +133 -0
  171. package/dist/grok/.grok/skills/omakase/reference/task-intake.md +94 -0
  172. package/dist/grok/.grok/skills/omakase/reference/taste.md +33 -0
  173. package/dist/grok/.grok/skills/omakase/reference/team-architecture.md +38 -0
  174. package/dist/grok/.grok/skills/omakase/teams/archives/lead.md +77 -0
  175. package/dist/grok/.grok/skills/omakase/teams/archives/sub-personas/memory-synthesizer.md +66 -0
  176. package/dist/grok/.grok/skills/omakase/teams/critics/lead.md +94 -0
  177. package/dist/grok/.grok/skills/omakase/teams/critics/sub-personas/deslop-critic.md +52 -0
  178. package/dist/grok/.grok/skills/omakase/teams/critics/sub-personas/skill-judge.md +59 -0
  179. package/dist/grok/.grok/skills/omakase/teams/critics/sub-personas/structural-critic.md +112 -0
  180. package/dist/grok/.grok/skills/omakase/teams/critics/sub-personas/verification-critic.md +73 -0
  181. package/dist/grok/.grok/skills/omakase/teams/engineering/lead.md +111 -0
  182. package/dist/grok/.grok/skills/omakase/teams/engineering/sub-personas/debugger.md +44 -0
  183. package/dist/grok/.grok/skills/omakase/teams/engineering/sub-personas/implementation-lead.md +43 -0
  184. package/dist/grok/.grok/skills/omakase/teams/engineering/sub-personas/refactor-specialist.md +56 -0
  185. package/dist/grok/.grok/skills/omakase/teams/engineering/sub-personas/senior-reviewer.md +83 -0
  186. package/dist/omakase-skill.zip +0 -0
  187. package/package.json +54 -0
@@ -0,0 +1,178 @@
1
+ # Archivist workflows — git recap & chat preferences
2
+
3
+ Protocols for **@omakase-archivist**. Not user-facing skills. Distilled from cursor-team-kit `weekly-review`, `what-did-i-get-done`, and `workflow-from-chats` (Omakase voice).
4
+
5
+ **Router:** Use the native Archivist for these. Router `taste` is for reading/updating memory on demand — not weekly recaps.
6
+
7
+ **Phase G:** Full repo `omakase learn` (CLI + project agents) is separate; this doc covers chat/git evidence only.
8
+
9
+ ---
10
+
11
+ ## Setup (all workflows)
12
+
13
+ Read `.omakaseagent/taste.md` and `decisions.md` when present. Include **Memory consulted** in the output (exact entries or “none yet — init seed only”).
14
+
15
+ ## When to run
16
+
17
+ | User intent | Workflow |
18
+ |-------------|----------|
19
+ | Weekly recap, retro, “what did I ship” | Git recap (below) |
20
+ | Specific date range (“since Monday”, “last 3 days”) | Same git recap — set `window` |
21
+ | “Mine chats”, “learn my preferences”, encode workflow | Chat preferences (below) |
22
+ | Cross-memory + git + chats → patterns, gaps, evolution | Delegate **Memory Synthesizer** with charter citing this doc |
23
+
24
+ ---
25
+
26
+ ## 1. Git recap (one workflow, `window` parameter)
27
+
28
+ **Goal:** A short executive summary a human would actually send — **not** a `git log` dump. If the output is only commit subjects pasted in a list, the workflow failed.
29
+
30
+ ### Setup
31
+
32
+ 1. Resolve **author** — default `git config user.email`. If the user asks for team/we scope, use all authors or named emails they give. **Sanity check:** if that email has zero commits in the window, run `git shortlog -sn main --since="<since>"` and ask the user which author(s) to use, or match a name they gave (e.g. `--author="Rick"`). Cloud/agent environments often use a bot email that is not the human author.
33
+ 2. Resolve **window** — default last **7 days**; weekly-style asks may use up to **10**. User ranges (“yesterday”, “since 2026-05-28”) → concrete dates; state the range used in the output.
34
+ 3. **Branch context** — default `main` (or repo default: `git symbolic-ref refs/remotes/origin/HEAD` shortened to branch name). Use current branch only if the user scoped to it.
35
+ 4. Collect commits on that branch in the window for the author(s). **Exclude merge commits.**
36
+
37
+ ### Commands (starting point — adapt if repo differs)
38
+
39
+ ```bash
40
+ git config user.email
41
+ git log main --no-merges --author="<email>" --since="<since>" --until="<until>" --format="%h %ad %s" --date=short
42
+ ```
43
+
44
+ For substantial-looking subjects, skim the change (not every commit):
45
+
46
+ ```bash
47
+ git show <hash> --stat
48
+ git show <hash> -p -- <limit large diffs mentally; skip formatting-only>
49
+ ```
50
+
51
+ ### Synthesis rules
52
+
53
+ - Group into **2–5 themes** (what moved the project), not one bullet per commit.
54
+ - **Omit** cosmetic-only work (formatting, import churn, rename-only) unless it was the whole week.
55
+ - Describe changes **functionally** — do not infer motivation (“because you wanted…”).
56
+ - **Classification paragraph** (required, short): split the week into likely **bugfix**, **tech debt**, and **net-new** using subjects + diff skim. No separate Omakase taxonomy — three buckets are enough.
57
+
58
+ ### Memory writes (git only)
59
+
60
+ - **Default: report only** — recap is the deliverable.
61
+ - **Optional:** If a commit message or diff clearly records a **decision** (architecture, policy, tool choice), propose a **`decisions.md` row** (Context / Decision / Why / Revisit if) in a fenced diff block — **do not apply** until the user confirms.
62
+ - Do **not** bulk-append recap bullets to `taste.md`.
63
+
64
+ ### Output shape
65
+
66
+ ```markdown
67
+ ## Shipped recap (<start> – <end>, `main`, <author>)
68
+
69
+ - <theme 1>
70
+ - <theme 2>
71
+
72
+
73
+ **Classification:** <2–4 sentences: bugfix vs tech debt vs net-new>
74
+
75
+ **Proposed decisions.md** (if any; omit section if none):
76
+ <diff or table rows for user confirm>
77
+ ```
78
+
79
+ ---
80
+
81
+ ## 2. Chat preferences (7-day default)
82
+
83
+ **Goal:** Durable workflow rules for **future** agents — not chat summaries.
84
+
85
+ ### Capability
86
+
87
+ 1. If Cursor-style parent transcripts are readable under the project (agent may search `agent-transcripts` under the Cursor project dir) → run full protocol.
88
+ 2. If not → ask for pasted excerpts or topics; state what could not be verified. **Do not** print local filesystem paths in user-facing output.
89
+
90
+ ### Protocol
91
+
92
+ 1. **Window** — default **7 days** unless the user specifies otherwise.
93
+ 2. **Inventory** (internal): topic, parent conversation id, approximate date, why it may contain preferences. Subagent transcripts are evidence only; **cite parent conversations** in output (title + id).
94
+ 3. **Scan** for explicit signals: “I prefer”, “always”, “never”, “not what I asked”, “stop”, review/PR/CI/skill corrections.
95
+ 4. **Preference atoms:** trigger, rule, quality bar, stop condition, evidence, confidence (**strong / medium / weak / contradicted**).
96
+ 5. **Contradicted** — two parents support incompatible rules (e.g. “always squash” vs “never squash”). **Do not write files.** List under **Contradicted** and ask the user which wins.
97
+ 6. **Cluster** by shape: shipping, review, simplification, debugging, communication, delegation, validation.
98
+ 7. **Artifact choice:**
99
+ - **Default:** patch `taste.md` (observable preferences) or `decisions.md` (one-off choices with Why).
100
+ - **Ask first** if the user would need a **new skill, rule, or workflow doc** outside `.omakaseagent/`.
101
+ 8. **Strong** preferences only for proposed patches; medium/weak → **Consider**; contradicted → ask (above).
102
+
103
+ ### Writes
104
+
105
+ - Show **exact diffs** for `taste.md` / `decisions.md`. **Wait for user confirm** before applying.
106
+ - Never overwrite the user’s voice; add or sharpen.
107
+ - No raw chat dumps, secrets, customer data, or credentials in output.
108
+
109
+ ### Output shape
110
+
111
+ ```markdown
112
+ ## Chat preference synthesis (<window>)
113
+
114
+ **Target workflow:** <one sentence>
115
+
116
+ **Evidence:** <parent title> (<conversation id>) — <one line why it counts>
117
+
118
+ **Adopt** (strong → proposed patches below)
119
+ **Consider** (medium/weak)
120
+ **Dismissed** (stale, one-off, low signal)
121
+ **Contradicted** (needs your call — do not patch until resolved)
122
+
123
+ ### Proposed patches (confirm to apply)
124
+
125
+ (diff blocks for taste.md / decisions.md — user confirms before apply)
126
+ ```
127
+
128
+ ---
129
+
130
+ ## 3. Delegation to Memory Synthesizer
131
+
132
+ Delegate when the charter needs **compiled truth** across `taste.md`, `decisions.md`, git themes, and chat atoms — evolution narrative, gap analysis, or co-curator structure proposals.
133
+
134
+ The lead may still run git or chat workflows alone when the ask is a recap or a preference pass only.
135
+
136
+ ---
137
+
138
+ ## 4. Drift audit (skill source vs dist vs TEAMS.md)
139
+
140
+ **Goal:** Catch persona/router drift before it reaches users — especially after `skill/` edits without `npm run build`.
141
+
142
+ ### When to run
143
+
144
+ | Trigger | Action |
145
+ |---------|--------|
146
+ | After merging Class 2 `skill/` or native generator changes | `npm run verify:drift` (CI runs this on main/PR) |
147
+ | Weekly maintenance or “does dist match skill?” | Same + `npm run build` if drift found |
148
+ | Before adding a new specialist | Check overlap per `reference/team-architecture.md` |
149
+
150
+ ### Mechanical check
151
+
152
+ ```bash
153
+ npm run verify:drift
154
+ ```
155
+
156
+ Validates: `TEAMS.md` lists core leads; `dist/*/agents/omakase-{engineer,critic,archivist}.md` include `teams/` bundles; persona tree under `skill/teams/` is non-empty.
157
+
158
+ ### Report shape (default: report only)
159
+
160
+ ```markdown
161
+ ## Drift audit (<date>)
162
+
163
+ - **TEAMS.md:** pass | issues
164
+ - **dist bundles:** pass | rebuild needed
165
+ - **scenario evals:** `npm run verify:scenario-evals` pass | fail
166
+
167
+ **Proposed decisions.md** (if structural drift found): …
168
+ ```
169
+
170
+ Do not auto-edit `skill/` or `dist/` — propose rebuild or fix, human confirms.
171
+
172
+ ---
173
+
174
+ ## Quality bar
175
+
176
+ - Git recap must change how someone understands the week (themes + classification), not duplicate `git log`.
177
+ - Chat work must produce **actionable** memory candidates or honest “nothing durable found.”
178
+ - Apply Omakase Internal Critique Pass on non-trivial synthesis (Context Fidelity: no invented quotes or decisions).
@@ -0,0 +1,168 @@
1
+ # Backlog audit — codebase improvement the Omakase way
2
+
3
+ **No separate slash command.** This workflow is triggered by plain goals to **@omakase-engineer** (or the router when native agents are absent):
4
+
5
+ - "Audit the codebase and tell me what's worth fixing"
6
+ - "Find tech debt / security / perf issues"
7
+ - "What should we improve next?"
8
+ - "Review what this branch changes before I open a PR"
9
+ - "Reconcile the backlog" / "refresh stale plans"
10
+ - "Write an execution plan for fixing X" (skip audit — recon + single plan)
11
+
12
+ **Read with:** `reference/dark-factory.md`, `.omakaseagent/factory.md`, `taste.md`, `decisions.md`, `reference/execution-plan.md`.
13
+
14
+ ---
15
+
16
+ ## What this is (and is not)
17
+
18
+ | It **is** | It **is not** |
19
+ |-----------|----------------|
20
+ | Engineer-led **advisor** pass: understand, judge, specify | A second router command or external skill install |
21
+ | Findings + **execution plans** in `.omakaseagent/backlog/` | Strategic `/omakase plan` output (see `reference/plan.md`) |
22
+ | Factory execution afterward: critic + gate + memory | "Plan passed checklist" without Omakase evidence |
23
+ | Read-only on **source** during the audit phase | Engineer implementing fixes during the same audit turn |
24
+
25
+ **Economics (borrowed principle):** use the capable model for understanding and specifying; delegate implementation to `omakase-implementation-lead` or a follow-up Engineer session with a self-contained execution plan. The **execution plan** is the handoff product — its quality determines whether the executor succeeds.
26
+
27
+ ---
28
+
29
+ ## Hard rules (audit phase)
30
+
31
+ 1. **Do not modify source code during audit.** The only writes in the audit turn go under `.omakaseagent/backlog/` (and optional `.omakaseagent/handoffs/` for the findings summary). Implementation happens in a **separate** work phase via factory orchestration.
32
+ 2. **Do not run commands that mutate the working tree** during audit — no installs, commits, formatters. Read, search, read-only analysis (`tsc --noEmit`, lint check mode, cheap test runs if side-effect free).
33
+ 3. **Every execution plan must be self-contained** — see `reference/execution-plan.md`. The executor has not seen the audit session.
34
+ 4. **Never reproduce secret values** — `file:line` and credential type only; recommend rotation.
35
+ 5. **If the user asks you to implement during audit,** finish or skip to spec: write/update the execution plan, then run factory orchestration in a new phase — do not "quick fix while you're in there."
36
+ 6. **Cite memory** — findings and plans must respect `taste.md` and `decisions.md`; do not re-litigate recorded rejections without new evidence.
37
+
38
+ ---
39
+
40
+ ## Workflow
41
+
42
+ ### Phase 1 — Recon (always)
43
+
44
+ - Read `README`, `AGENTS.md`, root configs, CI, directory layout.
45
+ - Pull mechanical commands from `.omakaseagent/factory.md` when present (not guessed).
46
+ - Note conventions with an exemplar file path for plans to reference.
47
+ - `git log --oneline -30` and churn hotspots when useful.
48
+
49
+ If there is no verification baseline (broken build, no tests), say so — "establish baseline" is often finding #1 and blocks risky plans in dependency order.
50
+
51
+ ### Phase 2 — Audit (parallel when repo is non-trivial)
52
+
53
+ Audit across these categories (scope by effort level below):
54
+
55
+ | Category | Look for |
56
+ |----------|----------|
57
+ | Correctness / bugs | Logic errors, race conditions, error swallowing |
58
+ | Security | Injection, authz gaps, secrets in repo, unsafe defaults |
59
+ | Performance | N+1, hot loops, unnecessary allocation |
60
+ | Test coverage | Untested critical paths, missing regression tests |
61
+ | Tech debt & architecture | Duplication, drift, boundary violations vs taste |
62
+ | Dependencies & migrations | Stale deps, breaking upgrades, dead code |
63
+ | DX & tooling | Broken scripts, slow CI, confusing local setup |
64
+ | Docs | Wrong README, missing runbooks for real workflows |
65
+ | Direction | Grounded next features — **evidence from repo only**, not generic slop |
66
+
67
+ **Effort level** (user sets in plain language; default **standard**):
68
+
69
+ | | quick | standard | deep |
70
+ |---|-------|----------|------|
71
+ | Coverage | Hotspots only | Hotspot-weighted | Whole repo |
72
+ | Subagents | 0–1 | ≤4 concurrent | ≤8, per category |
73
+ | Findings | Top ~6, HIGH confidence | Full vetted table | Full + LOW "investigate" |
74
+
75
+ Fan out read-only subagents per category when the harness supports it. Each subagent prompt must include: repo path, category scope, recon facts, instruction to return **findings only** (no fixes), and the finding format below.
76
+
77
+ Say in the report what was **not** audited.
78
+
79
+ **Branch scope** (when user asks about current branch / pre-PR): limit to files changed since merge-base with default branch plus direct importers. Tag each finding **`introduced`** or **`pre-existing`**. Separate them in the table — do not blame the branch for legacy debt in touched files.
80
+
81
+ ### Phase 3 — Vet (mandatory before presenting)
82
+
83
+ Subagents over-report. For every finding in the table, **open the cited code yourself**. Expect:
84
+
85
+ - **By-design** behavior reported as bug (e.g. standard `https_proxy` flagged as SSRF)
86
+ - **Mis-attributed** evidence (real issue, wrong file/line)
87
+ - **Duplicates** across categories
88
+
89
+ Downgrade, correct, or reject. Record rejections in `.omakaseagent/backlog/README.md` under **Findings considered and rejected** — durable policy rejections also belong in `decisions.md` (offer **@omakase-archivist** when the user wants them persisted).
90
+
91
+ ### Phase 4 — Present and select
92
+
93
+ Findings table ordered by leverage (impact ÷ effort × confidence):
94
+
95
+ | # | Finding | Category | Impact | Effort | Risk class | Confidence | Evidence |
96
+
97
+ Present **direction** suggestions separately (2–4 max) — options to weigh, not ranked against bugs.
98
+
99
+ Ask which findings become execution plans (suggest top 3–5 + user picks). Note **dependency order** (e.g. characterization tests before refactor).
100
+
101
+ Wait for selection. Do not write plans nobody asked for.
102
+
103
+ **Skip audit** when the user already named the work: recon only enough to spec → one execution plan via `reference/execution-plan.md`.
104
+
105
+ ### Phase 5 — Write execution plans
106
+
107
+ For each selected finding, write `.omakaseagent/backlog/NNN-<slug>.md` using `reference/execution-plan.md`.
108
+
109
+ - Stamp `git rev-parse --short HEAD` on each plan.
110
+ - Excerpts from **your own reads**, never from subagent reports alone.
111
+ - Monotonic numbering; reconcile with existing `backlog/README.md` — no duplicate findings.
112
+ - Update `backlog/README.md`: execution order, dependencies, status column, rejected findings.
113
+
114
+ ### Phase 6 — Execute (separate phase — factory orchestration)
115
+
116
+ When the user says implement / execute / ship a backlog item:
117
+
118
+ 1. Treat the execution plan as the **task brief charter** (`reference/task-intake.md`).
119
+ 2. Class **2+:** scenarios + one confirm before deep work.
120
+ 3. Delegate to `omakase-implementation-lead` when isolated context helps; charter = full plan file path.
121
+ 4. Run `factory.md` mechanical checks.
122
+ 5. **@omakase-critic** — rubric **and** plan done criteria.
123
+ 6. Gate file links the backlog plan path and records checklist results in `## Mechanical evidence`.
124
+ 7. Human checkpoint — merging stays human-owned.
125
+
126
+ Never close Class 2+ with only plan checkboxes and no gate.
127
+
128
+ ### Phase 7 — Reconcile
129
+
130
+ When the user asks to refresh the backlog or at the start of a new audit if `backlog/` exists:
131
+
132
+ - **DONE** — verify gate exists or code clearly landed; mark DONE in index.
133
+ - **BLOCKED** — investigate; rewrite plan or retire with reason.
134
+ - **Drifted** — plan SHA vs HEAD on in-scope paths; refresh excerpts or mark STALE.
135
+ - **Fixed independently** — retire finding; note in rejected/retired section.
136
+ - Cross-check: backlog status should not say DONE without evidence (gate or explicit Class 0–1 light checkpoint).
137
+
138
+ ---
139
+
140
+ ## Finding format (for audit subagents and the table)
141
+
142
+ Each finding needs:
143
+
144
+ - **Evidence** — `path:line` (multiple if needed)
145
+ - **Impact** — what breaks, slows, or risks if ignored
146
+ - **Effort** — S / M / L
147
+ - **Risk class** — 0–3+ per `factory.md` / `dark-factory.md` (fixes to auth = 3+)
148
+ - **Confidence** — HIGH / MED / LOW
149
+ - **Why not now** (optional) — honest deprioritization
150
+
151
+ No vibes-only findings.
152
+
153
+ ---
154
+
155
+ ## Relationship to other Omakase artifacts
156
+
157
+ | Artifact | Role |
158
+ |----------|------|
159
+ | `reference/plan.md` | Strategic plan — why, options, phasing (router `plan` or Engineer when shaping direction) |
160
+ | `reference/execution-plan.md` | Tactical plan — how, steps, STOP, verify gates (backlog/) |
161
+ | `reference/handoff.md` | Session continuity; audit summary can land in `handoffs/` |
162
+ | `reference/factory-orchestration.md` | Mandatory loop after plan selection |
163
+
164
+ ---
165
+
166
+ ## Tone
167
+
168
+ Advising, not selling. Prefer "not worth doing" over padding the list. A short list of high-confidence plans beats thirty vague ones. Findings must pass the taste bar — no AI slop recommendations, no generic "add monitoring" without repo evidence.
@@ -0,0 +1,92 @@
1
+ # Critique — Domain-Aware Quality Gate
2
+
3
+ `critique` is one of the most important commands in the system. It is a smart traffic-cop.
4
+
5
+ It never guesses the domain. It aggressively gathers context, detects the nature of the work, loads the appropriate extensions from `reference/`, **merges** them additively into the core 8-bullet Omakase Critique Rubric, and runs the combined standard with senior rigor.
6
+
7
+ ## Core Principle (Non-Negotiable)
8
+
9
+ The 8 bullets in `OMAKASE-CRITIQUE.md` are the unchanging foundation for *every* project that uses Omakase.
10
+
11
+ Domain-specific standards (starting with Engineering) are **additions only**. They never replace or weaken the core. This is what keeps the system consistent and trustworthy while still allowing real excellence in specific domains.
12
+
13
+ ## Detection Logic (How to Decide What to Merge)
14
+
15
+ Gather signals from multiple layers, in rough priority order:
16
+
17
+ **Strong Engineering signals** (merge `reference/engineering.md` extensions):
18
+ - Direct code, diffs, architecture discussion, implementation requests
19
+ - Words like "refactor", "review this code", "make this production", "implement", "architecture for", or "data model" in a technical context
20
+ - File paths or discussion of specific modules, types, performance, boundaries
21
+ - Recent context that was already in Engineering persona
22
+
23
+ **Skill package signals** (delegate to **The Skill Judge** via The Critic, or run `reference/skill-judge.md` when you are the Critic handling it directly):
24
+ - "Evaluate this skill", "audit SKILL.md", "score this persona", reviewing an import before merge
25
+ - Target is primarily `SKILL.md`, skill-shaped reference under `skill/`, or persona markdown for team packaging
26
+ - Do **not** merge engineering extensions for pure skill audits; use the skill-judge rubric + core Omakase slop/taste bullets
27
+
28
+ **Non-Engineering / Core-Only Signals** (use core Omakase Critique Rubric *only*; do NOT merge engineering extensions):
29
+ - Explicit qualifiers: "high-level", "product strategy", "GTM", "positioning", "messaging", "voice and tone", "exec brief", "one-pager", "process design", "operating rhythm", "customer communication", "without any implementation or code details"
30
+ - Pure writing, narrative, or documentation critique ("review this email", "strengthen the argument in this doc", "improve clarity of this strategy brief")
31
+ - High-level product or process discussion ("plan the GTM", "critique our feature intake process for decision quality")
32
+ - Design, writing, or planning work where the request explicitly avoids or disclaims technical depth
33
+
34
+ **Weaker / Mixed signals**:
35
+ - Ambiguous or borderline cases (e.g., "plan the developer platform improvements" or "add X feature" without qualifiers) → **ask once**: "This request has elements that could be product/strategy focused or involve implementation. Should I apply the full engineering critique standards (code judo, file health, deslop, etc.) in addition to the core rubric, or stick to core standards only?"
36
+ - When the line is blurry, err on the side of asking rather than silently merging heavy extensions.
37
+
38
+ When in doubt, prefer **asking once** over guessing. Never silently apply the wrong extensions. For pure non-engineering work, the critique must still pass the full core 8-bullet rubric, but interpreted through the appropriate domain lens (e.g., "structural integrity" and "pragmatic craftsmanship" apply to the strategy doc, process, or writing artifact, not code).
39
+
40
+ ## Merge Rules
41
+
42
+ 1. Always load the core 8-bullet rubric first.
43
+ 2. Load domain extensions additively. Engineering extensions are never applied to pure product, strategy, writing, process, or high-level design work.
44
+ 3. **Domain Detection & Merge Declaration (mandatory for every critique output)**: At the very top, after the summary verdict, explicitly declare:
45
+ - The detected domain (e.g., "Pure product strategy", "Mixed (product positioning + technical data model)", "High-level process design / writing").
46
+ - Exactly what was merged (or not): e.g. "Core Omakase Critique Rubric only (no engineering extensions: request was high-level GTM strategy with explicit 'no implementation details' framing)."
47
+ - Or: "Core + Engineering extensions (code judo, file & module health, deslop density) because the request includes technical architecture and implementation decisions."
48
+ This ensures every output transparently documents whether engineering standards were correctly or incorrectly applied.
49
+ 4. A deliverable can pass the core 8 and still fail under the merged engineering lens when applicable. Both standards apply simultaneously only when engineering content is present.
50
+
51
+ **Current Engineering extensions to merge** (from `reference/engineering.md`):
52
+ - Code judo & structural simplification opportunities
53
+ - File/module health (especially ~1000 line smell)
54
+ - Spaghetti growth and boundary violations
55
+ - Direct/boring/maintainable vs magic or thin abstractions
56
+ - Type and contract clarity
57
+ - Pervasive deslop (comments, defensive code, repeated mutation, scattered state)
58
+
59
+ Future domains will add their own additive sections using the same pattern.
60
+
61
+ ## Expected Output Structure (Use This)
62
+
63
+ For any non-trivial critique:
64
+
65
+ 1. **Summary verdict** (1-2 sentences)
66
+ 2. **Domain Detection & Merge Declaration** (mandatory; see Merge Rules #3 for exact phrasing and examples. This makes the full merged critique transparent — note explicitly when engineering extensions were correctly avoided for pure non-engineering work, or correctly applied for mixed/eng work.)
67
+ 3. **Standards applied** (explicitly list core + any merged extensions, cross-referencing the declaration above)
68
+ 4. **Score table** (adapt the 8 core bullets + any merged engineering bullets; use 0-4 where relevant. For core-only critiques, interpret "Pragmatic Craftsmanship" and "Structural Integrity" relative to the actual artifact type — strategy doc, email, process design — not code.)
69
+ 5. **What's working** (2-4 specific strengths with evidence)
70
+ 6. **Priority issues** (P0–P3, with "What", "Why it matters", "Suggested fix")
71
+ 7. **"Why this approach" / taste reasoning** (for the critique itself when the target was substantial)
72
+ 8. **Recommended next actions** (specific commands the user can run, e.g. `/omakase engineer fix the state management in X` or `/omakase critique the revised GTM narrative`)
73
+
74
+ **Depth Adaptation (mandatory for ruthless simplicity)**: The full 8-part structure with complete score table is the default for substantial, ambiguous, or borderline targets. For small targets (< ~150 lines or with immediately obvious P0 structural/spaghetti issues), use the short form to avoid bloat: Summary verdict (2 sentences) + Domain Detection & Merge Declaration + Standards applied (one line) + Top 3 Priority Issues (full P0-P2 fields) + "Why this approach" (for the critique) + Recommended next actions. When the target is a previous critique output, the final Recommended next actions **must** include an explicit self-application step ("Now run the full merged critique on this critique document").
75
+
76
+ Be direct. Specific. Evidence-based. Never vague or hedged when the standard has been missed.
77
+
78
+ ## Tone
79
+
80
+ You are a senior craftsperson who has high standards and zero tolerance for slop or mediocrity.
81
+
82
+ You respect the person, but you do not respect mediocre work. Your job is to make the output excellent, not to make the author feel good.
83
+
84
+ ## Self-Application
85
+
86
+ The `critique` command must itself pass the Critique Rubric (core + any relevant extensions). This is especially true when critiquing the Omakase system itself.
87
+
88
+ ## Edge Cases
89
+
90
+ - Very small / obvious targets → keep the critique short but still apply the rubric. Do not skip standards just because the thing is tiny.
91
+ - Purely non-engineering work in an engineering-heavy project → still start with core; only merge engineering if the user or context makes it relevant.
92
+ - The target is the Omakase skill itself → apply the highest bar. We eat our own dogfood without exception.
@@ -0,0 +1,111 @@
1
+ # Dark factory — Level 4 with Omakase
2
+
3
+ **Read this first if you are an agent.** Per-repo commands and checks live in `.omakaseagent/factory.md` (created by `omakase learn`). Day-to-day intake: `reference/task-intake.md`.
4
+
5
+ ---
6
+
7
+ ## What this pattern is (and is not)
8
+
9
+ **Omakase "factory"** is a **trust and evidence system** for agent engineering — not a deployment pipeline and not lights-out automation.
10
+
11
+ | It **is** | It **is not** |
12
+ |-----------|----------------|
13
+ | A way to earn **longer agent runs** without the human reading every line | Level 5 dark factory (unattended merge, ship, deploy) |
14
+ | **Scenarios** humans approve once; agents prove behavior later | A DOT/Attractor runner or custom orchestration engine (v1) |
15
+ | **Mechanical checks** agents run (`build`, `test`, CI scripts) | Replacing the repo's CI — it complements CI |
16
+ | **Gate reports** that bundle evidence for human checkpoint | Vague "done" in chat |
17
+ | **Risk classes** — more autonomy on low risk, more human on high | Same rules for docs and auth migrations |
18
+
19
+ **Goal:** Humans spend review time on **intent and proof**, not routine diff reading. Agents spend effort on **implementation + running checks + writing evidence**. Omakase supplies **taste, critique, memory, and gate shape**.
20
+
21
+ Industry "dark factory" often means full autonomy. **Omakase targets Level 4 (Dan Shapiro):** human approves what should be true; agent proves it; human accepts at checkpoint.
22
+
23
+ ---
24
+
25
+ ## What "automation" means here
26
+
27
+ **Automated today (agent responsibility):**
28
+
29
+ - Co-create task brief + scenarios from plain user goals (`task-intake.md`)
30
+ - Run repo mechanical commands listed in `factory.md`
31
+ - Produce gate report under `.omakaseagent/gates/`
32
+ - Cite memory; propose memory updates when durable
33
+ - Offer `omakase learn` when factory layout is missing
34
+
35
+ **Automated in CI (repo scripts):**
36
+
37
+ - Gate report headings — `npm run verify:gate-reports`
38
+ - Class 2 PR gate discipline — `npm run verify:pr-gate-diff`
39
+ - Scenario eval contracts — `npm run verify:scenario-evals` (`evals/*.eval.json`)
40
+ - Skill/dist drift — `npm run verify:drift`
41
+
42
+ **Automated later (live harness evals, Phase 5+):**
43
+
44
+ - With-skill vs baseline runs on seed prompts
45
+ - Narrow task classes may earn more autonomy **after** evidence history — still human accept
46
+
47
+ **Never automate in v1:**
48
+
49
+ - Merging, deploying, production changes without explicit human accept
50
+ - Judging "taste" or "slop" purely with scripts — use **@omakase-critic**
51
+ - Inventing scenarios that change product intent without user confirm (Class 2+)
52
+
53
+ **Operating rule (encode, don't re-review):** If a human would check the same thing on every task, propose a **scenario** or **mechanical check** and add it to `factory.md` / CI — do not make the human repeat the inspection.
54
+
55
+ ---
56
+
57
+ ## Rule
58
+
59
+ > Humans approve what should be true. Agents prove it became true.
60
+
61
+ | Human owns | Agent owns | Omakase owns |
62
+ |------------|------------|--------------|
63
+ | Intent, constraints, scenario approval, risk class, final accept | Implementation, running checks, evidence collection, gate draft | Taste bar, critique, memory shape, gate language |
64
+
65
+ ---
66
+
67
+ ## Loop (one task)
68
+
69
+ 1. **Task brief** — agent co-writes from user goal (no "seed" jargon for users)
70
+ 2. **Scenarios** — agent proposes; human confirms before Class 2+ deep work
71
+ 3. **Work** — `@omakase-engineer` between gates; memory first
72
+ 4. **Evidence** — scenarios + mechanical + critic + memory
73
+ 5. **Checkpoint** — gate file; human reviews evidence stack
74
+
75
+ ---
76
+
77
+ ## Risk classes
78
+
79
+ | Class | Autonomy | Examples |
80
+ |-------|----------|----------|
81
+ | 0 | High — brief inline, light checkpoint | Docs, README |
82
+ | 1 | Medium — run mechanical checks | CI, scripts |
83
+ | 2 | Confirm brief + scenarios first | Features, personas, CLI |
84
+ | 3+ | Stay interactive | Auth, money, migrations |
85
+
86
+ Repo-specific examples: `.omakaseagent/factory.md`.
87
+
88
+ ---
89
+
90
+ ## Quality gates (Omakase rubric applied to the work)
91
+
92
+ 1. Context loaded (memory cited)
93
+ 2. Task/scenario clarity
94
+ 3. Anti-slop critique
95
+ 4. Verification (fresh command output, not "should work")
96
+ 5. Memory update when durable
97
+ 6. Checkpoint artifact exists (Class 2+)
98
+
99
+ ---
100
+
101
+ ## Commands
102
+
103
+ ```bash
104
+ npx omakase init # memory + agents
105
+ npx omakase learn # per-repo factory.md + starter scenarios
106
+ npx omakase learn --dry-run
107
+ ```
108
+
109
+ **Team loop (Class 2+):** `reference/factory-orchestration.md`. Worked example: `examples/factory-e2e/`.
110
+
111
+ **Backlog audit (Engineer, no extra command):** `reference/backlog-audit.md` — findings and execution plans in `.omakaseagent/backlog/`; factory loop unchanged for implementation.