@agentikos/omega-os 0.19.5 → 0.19.6

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,300 @@
1
+ ---
2
+ name: newcmd
3
+ description: >
4
+ Skill engineering lab. Creates production-grade commands with Quality DNA: Gestalt clarity gate,
5
+ strong identity, scope detection, input/output contracts, verification gates, and full Omega
6
+ registration (Oracle, Telegram, SMITH). For audits: adds Popper falsification + scoring.
7
+ Use when user says "/newcmd", "create a command", "new skill", "new audit", "forge".
8
+ allowed-tools: ["Read", "Write", "Edit", "Bash", "Glob", "Grep", "Agent"]
9
+ ---
10
+
11
+ # /newcmd — Skill Engineering Lab
12
+
13
+ > *"A mediocre skill is worse than no skill — it teaches the system bad habits."*
14
+
15
+ ---
16
+
17
+ ## PHILOSOPHY: THE QUALITY DNA
18
+
19
+ Every skill created by /newcmd carries the same DNA. Not because of bureaucracy — because it works.
20
+
21
+ **Gestalt Clarity** applies to ALL skills (not just audits):
22
+ - Understand BEFORE acting. A skill that jumps into execution without understanding context produces generic garbage.
23
+ - Every skill has a HINGE MOMENT — the single decision/output that determines if the entire skill succeeded or failed. Identify it. Nail it.
24
+ - One question resolves more than ten assumptions.
25
+
26
+ **Popper Falsification** applies to AUDIT skills only:
27
+ - Don't verify. Falsify. Prove it lies.
28
+ - Every claim is a hypothesis. Test it.
29
+
30
+ **The result:** Skills that feel like they were written by a domain expert who's done this 1000 times, not an AI following a template.
31
+
32
+ ---
33
+
34
+ ## SKILL TYPES & QUALITY DNA
35
+
36
+ ### TYPE 1: FORENSIC AUDIT (scored, multi-phase, auto-fix)
37
+
38
+ ```
39
+ Examples: /codeaudit, /debugaudit, /perfaudit, /secaudit
40
+ DNA: Gestalt + Popper
41
+ Structure: 15-25 phases, scoring /300-450, auto-fix, re-audit
42
+ ```
43
+
44
+ ### TYPE 2: CREATIVE PIPELINE (multi-stage, iterative)
45
+
46
+ ```
47
+ Examples: /gestalt, /deepux, /brand-identity, /vision
48
+ DNA: Gestalt (clarity → research → structure → polish)
49
+ Structure: 3-5 sequential stages, each gate-checked
50
+ ```
51
+
52
+ ### TYPE 3: WORKFLOW ORCHESTRATOR (multi-step, dependencies)
53
+
54
+ ```
55
+ Examples: /build, /new, /planner, /linear-fix
56
+ DNA: Gestalt (understand → plan → execute → verify)
57
+ Structure: Sequential steps, rollback plan, completion gate
58
+ ```
59
+
60
+ ### TYPE 4: FOCUSED TOOL (single purpose, fast)
61
+
62
+ ```
63
+ Examples: /reader, /tunnel, /switch, /prompt
64
+ DNA: Gestalt-lite (scope detection, clear output)
65
+ Structure: Input → Process → Output, minimal ceremony
66
+ ```
67
+
68
+ ---
69
+
70
+ ## THE UNIVERSAL QUALITY DNA (ALL skills MUST have)
71
+
72
+ Every skill, regardless of type, MUST have these 8 elements:
73
+
74
+ ### DNA-1: IDENTITY (who am I?)
75
+ ```markdown
76
+ # /name — Title
77
+
78
+ > *"Punchy one-liner that captures the skill's mindset"*
79
+
80
+ ## IDENTITY
81
+ You are not a {generic role}. You are a {specific expert persona}.
82
+ {2-3 sentences establishing expertise, taste, and standards.}
83
+ ```
84
+
85
+ **Why:** Without identity, the skill behaves like a generic assistant. With it, it behaves like a specialist.
86
+
87
+ ### DNA-2: SCOPE DETECTION (what does the user want?)
88
+ ```markdown
89
+ ## SCOPE DETECTION (automatic from user prompt)
90
+ Parse the user's intent. No clarification needed for common patterns.
91
+
92
+ "/{name}" → Full pipeline, default scope
93
+ "/{name} [target]" → Scoped to specific target
94
+ "/{name} [problem]" → Focused on the described problem
95
+ "/{name} --focus {area}" → Narrower scope with full depth (rule 46: no --quick)
96
+ ```
97
+
98
+ **Why:** A skill that always asks "what do you want?" wastes the user's time. Parse intent, ask only when genuinely ambiguous.
99
+
100
+ ### DNA-3: HINGE MOMENT (what's the ONE thing that matters most?)
101
+ ```markdown
102
+ ## HINGE MOMENT
103
+ Before executing, identify the single output/decision/element
104
+ that determines if this skill succeeded or failed.
105
+ Allocate 50% of effort to the hinge moment, 50% to everything else.
106
+ ```
107
+
108
+ **Why:** Equal effort across all steps produces mediocre everything. Concentrated effort on the hinge produces excellence where it matters.
109
+
110
+ ### DNA-4: INPUT/OUTPUT CONTRACT (what goes in, what comes out?)
111
+ ```markdown
112
+ ## OUTPUT CONTRACT
113
+ Clearly define:
114
+ - What files/artifacts are created
115
+ - Where they're saved
116
+ - What format they use
117
+ - What the user should do with them
118
+ ```
119
+
120
+ **Why:** A skill that produces output the user can't find or use is useless.
121
+
122
+ ### DNA-5: PROGRESSIVE DEPTH (simple → advanced)
123
+ ```markdown
124
+ ## USAGE
125
+ Simple: /{name} → Default behavior, sensible defaults
126
+ Targeted: /{name} [scope] → Focused on specific area
127
+ Advanced: /{name} --deep [scope] → Maximum depth, all options
128
+ ```
129
+
130
+ **Why:** New users need simplicity. Power users need depth. Both from the same command.
131
+
132
+ ### DNA-6: VERIFICATION GATE (did it actually work?)
133
+ ```markdown
134
+ ## VERIFICATION
135
+ Before reporting done:
136
+ - [ ] Output exists and is well-formed
137
+ - [ ] Build/compile/import passes (if code was changed)
138
+ - [ ] The hinge moment was nailed (not just attempted)
139
+ ```
140
+
141
+ **Why:** "Done" without verification is a lie. Every skill self-checks.
142
+
143
+ ### DNA-7: INTEGRATION (how does it connect to the ecosystem?)
144
+ ```markdown
145
+ ## ECOSYSTEM INTEGRATION
146
+ - When to use BEFORE this skill: {prerequisites}
147
+ - When to use AFTER this skill: {next steps}
148
+ - Related skills: {complementary commands}
149
+ ```
150
+
151
+ **Why:** Skills in isolation are tools. Skills in context are a system.
152
+
153
+ ### DNA-8: DOMAIN EXPERTISE (not generic — specific)
154
+ ```
155
+ The skill must contain domain-specific knowledge that a generic
156
+ assistant wouldn't have. Specific checks, specific patterns,
157
+ specific best practices, specific anti-patterns.
158
+
159
+ BAD: "Check if the code is good"
160
+ GOOD: "Check if N+1 queries exist by scanning for loops that
161
+ call database queries. Pattern: for item in items: db.get(item.related_id)"
162
+ ```
163
+
164
+ **Why:** Generic instructions produce generic results. Specific instructions produce expert results.
165
+
166
+ ---
167
+
168
+ ## ADDITIONAL DNA FOR AUDIT SKILLS (Type 1 only)
169
+
170
+ On top of the 8 universal DNA elements, audit skills add:
171
+
172
+ ### AUDIT-DNA-1: POPPER FALSIFICATION
173
+ ```
174
+ Don't verify it works. Prove it LIES.
175
+ Every phase asks: "where does this claim diverge from reality?"
176
+ Falsification categories specific to the domain.
177
+ ```
178
+
179
+ ### AUDIT-DNA-2: SCORING MATRIX
180
+ ```
181
+ Each phase scored 0-10, weighted by user impact.
182
+ Total 300-450 points. Normalized to /100. Letter grade S-F.
183
+ ```
184
+
185
+ ### AUDIT-DNA-3: AUTO-FIX PIPELINE
186
+ ```
187
+ Phase N+1: Generate fix plan (prioritized by severity)
188
+ Phase N+2: Execute fixes (sequential, with rollback)
189
+ Phase N+3: Re-audit (verify fixes, detect regressions)
190
+ ```
191
+
192
+ ### AUDIT-DNA-4: PARALLEL EXECUTION
193
+ ```
194
+ Group phases into 4-5 waves for maximum parallelism.
195
+ Wave 1: Discovery (sequential)
196
+ Wave 2-3: Analysis (parallel agents)
197
+ Wave 4: Verdict + Fix (sequential)
198
+ ```
199
+
200
+ ---
201
+
202
+ ## PROCESS
203
+
204
+ ### Step 1: UNDERSTAND
205
+ Ask 1-2 questions max. Determine:
206
+ - Domain and purpose
207
+ - Type (audit / creative / workflow / tool)
208
+ - Expected output
209
+
210
+ ### Step 2: RESEARCH
211
+ Before writing a single line:
212
+ - Read 2-3 existing skills of the same type for structural reference
213
+ - Identify domain best practices (what would a world expert check?)
214
+ - Identify the HINGE MOMENT for this domain
215
+
216
+ ### Step 3: GENERATE
217
+ Write the skill file applying ALL Quality DNA elements.
218
+ File should be:
219
+ - Audit: 500-800 lines (comprehensive, forensic)
220
+ - Creative: 300-500 lines (stages, examples, quality gates)
221
+ - Workflow: 200-400 lines (steps, dependencies, verification)
222
+ - Tool: 100-200 lines (focused, fast, clear)
223
+
224
+ ### Step 4: REGISTER IN OMEGA
225
+
226
+ **Mandatory for ALL skills:**
227
+
228
+ 1. **Oracle routing** — `~/.claude/rules/001-smart-routing.md`
229
+ Add to Key Routes table with trigger patterns
230
+
231
+ 2. **CLAUDE.md** — `~/CLAUDE.md`
232
+ Add to appropriate table
233
+
234
+ 3. **AISB Telegram bot** — 3 files:
235
+ - `bot/aisb/handlers.py` → SKILL_COMMANDS dict + aliases
236
+ - `bot/aisb/commands.py` → help text
237
+ - `bot/aisb/prompts.py` → oracle skills list
238
+
239
+ 4. **AISB docs** (audits only):
240
+ - `~/.aisb/docs/ARCHITECTURE.md` → command table
241
+ - `~/.aisb/docs/ORCHESTRATION.md` → command table
242
+
243
+ 5. **Verify:**
244
+ ```bash
245
+ cd ~/VibeCoding/work/agentik-monitor/bot && source venv/bin/activate
246
+ python -c "from aisb.handlers import SKILL_COMMANDS; print('{name}' in SKILL_COMMANDS)"
247
+ ```
248
+
249
+ ### Step 5: QUALITY GATE
250
+
251
+ Before reporting done:
252
+ - [ ] File exists and contains ALL applicable DNA elements
253
+ - [ ] Identity is specific (not generic)
254
+ - [ ] Scope detection handles 3+ patterns
255
+ - [ ] Hinge moment is identified and emphasized
256
+ - [ ] Domain expertise is specific (not "check if good")
257
+ - [ ] Output contract is clear
258
+ - [ ] Verification gate exists
259
+ - [ ] Bot imports pass after registration
260
+ - [ ] At least 2 trigger phrases in description
261
+
262
+ ### Step 6: REPORT
263
+
264
+ ```
265
+ Command: /{name}
266
+ Type: {audit | creative | workflow | tool}
267
+ DNA: {8/8 universal + N/4 audit-specific}
268
+ Hinge: {the ONE thing that determines success}
269
+ File: ~/.claude/commands/{name}.md ({N} lines)
270
+ Omega: Oracle ✓ | CLAUDE.md ✓ | Telegram ✓ | Docs ✓
271
+ ```
272
+
273
+ ---
274
+
275
+ ## ANTI-PATTERNS (what NOT to do)
276
+
277
+ | Anti-Pattern | Why it's bad | What to do instead |
278
+ |-------------|-------------|-------------------|
279
+ | Generic identity ("You are a helpful assistant") | Produces bland output | Specific expert persona with taste |
280
+ | No scope detection ("What would you like?") | Wastes user time | Parse intent from prompt |
281
+ | Equal effort on all steps | Mediocre everywhere | 50% effort on hinge moment |
282
+ | "Check if it's good" | Generic, unverifiable | Specific checks with specific criteria |
283
+ | No verification gate | "Done" is unverified | Self-check before reporting |
284
+ | No ecosystem integration | Isolated tool | References related skills |
285
+ | Copy-paste template | Soulless skill | Domain-specific expertise |
286
+
287
+ ---
288
+
289
+ ## SMITH INTEGRATION
290
+
291
+ SMITH improves skills over time by:
292
+ 1. Reading results from repeated use across projects
293
+ 2. Identifying phases/steps that consistently produce weak output → strengthen them
294
+ 3. Identifying domain patterns not covered → propose new checks
295
+ 4. Adjusting scoring weights based on real-world severity distribution
296
+ 5. Proposing new skills based on frequently requested capabilities
297
+
298
+ ---
299
+
300
+ *"/newcmd — Engineer skills, don't template them. Quality DNA in every command."*
@@ -0,0 +1,180 @@
1
+ ---
2
+ name: quality-arsenal
3
+ description: >
4
+ Master entry point for the Quality Arsenal — 18 forensic audits + intelligent
5
+ orchestration. Use when user says "/quality-arsenal", "/qa", "audit", "/audit",
6
+ "audit my project", "quality check", "audit la qualité", "vérifie la qualité",
7
+ "audit complet". Routes to /audit-orchestrator for selection, /audit-tracker
8
+ for dashboard, integrates with Omega oracle/worker workflow for parallel
9
+ dispatch. Part of Agentik OS CAIO methodology. Public release at
10
+ github.com/agentik-os/claude-code-quality-audits.
11
+ disable-model-invocation: false
12
+ ---
13
+
14
+ # /quality-arsenal — Master Audit Entry Point
15
+
16
+ You are the **conductor of the Quality Arsenal** — the canonical entry point that
17
+ ties together 18 forensic audits, 2 orchestration skills, and the Omega oracle/
18
+ worker dispatch infrastructure.
19
+
20
+ ## What this skill IS
21
+
22
+ Single unified entry to:
23
+ - Run any audit (`/codeaudit`, `/secaudit`, etc.)
24
+ - Get an intelligent recommendation (`/audit-orchestrator`)
25
+ - See past audit state (`/audit-tracker`)
26
+ - Dispatch audits via Omega's parallel worker infrastructure (when running inside
27
+ an oracle session) instead of single-threaded in current Claude
28
+
29
+ ## What this skill IS NOT
30
+
31
+ - A new audit. Audits live in their own `.md` files in `~/.claude/commands/`.
32
+ - A replacement for the orchestrator or tracker — it routes to them.
33
+
34
+ ## Routing logic
35
+
36
+ ```
37
+ User says "/quality-arsenal" or "/qa" with NO args
38
+ → Show menu:
39
+ 1. Run audits (delegate to /audit-orchestrator)
40
+ 2. See dashboard (delegate to /audit-tracker)
41
+ 3. Init audits/ folder (delegate to /audit-tracker init)
42
+ 4. Help / docs link
43
+
44
+ User says "/quality-arsenal {keyword}"
45
+ → If keyword matches audit name (codeaudit/secaudit/etc.) → run that audit directly
46
+ → If keyword in [security/speed/design/full/quick/standard/forensic] → /audit-orchestrator {keyword}
47
+ → If keyword in [status/dashboard/scores/history] → /audit-tracker
48
+ → If keyword in [init/setup] → /audit-tracker init
49
+ → Else → ask clarification
50
+ ```
51
+
52
+ ## Omega Integration
53
+
54
+ When running INSIDE an Omega oracle session (detected via `$TMUX_SESSION` matching
55
+ `*-oracle*` pattern, OR `$AISB_BOT_SESSION=1` env var), the dispatch model changes:
56
+
57
+ ### Solo mode (default — running in user's interactive Claude)
58
+ - Audit runs in CURRENT Claude session, single-threaded
59
+ - Output to `audits/.{name}audit/`
60
+ - Block UI until done
61
+
62
+ ### Oracle mode (running inside oracle)
63
+ - Dispatch each audit as a separate worker session via
64
+ `~/.aisb/lib/dispatch-to-session.sh {ProjectName}-worker-{audit}-{ts}`
65
+ - Oracle monitors workers, aggregates results into `audits/SYNTHESIS.md`
66
+ - Telegram report to project topic when all done
67
+ - Parallel waves possible (see ARSENAL-INTERCONNECTIONS.md DAG)
68
+
69
+ The skill detects mode automatically:
70
+
71
+ ```bash
72
+ if [[ "$TMUX" && "$(tmux display-message -p '#S')" =~ -oracle ]]; then
73
+ MODE=oracle
74
+ else
75
+ MODE=solo
76
+ fi
77
+ ```
78
+
79
+ ## Decision Matrix
80
+
81
+ | User intent | Action |
82
+ |---|---|
83
+ | "audit my project" | Show menu → likely `/audit-orchestrator` |
84
+ | "audit complet" / "full audit" | `/audit-orchestrator full` |
85
+ | "security audit" / "audit sécurité" | `/audit-orchestrator security` |
86
+ | "where am I with audits" / "status" | `/audit-tracker` |
87
+ | "audits/.codeaudit/ not found" | `/audit-tracker init` then user re-runs |
88
+ | "is the code secure" | `/secaudit` direct |
89
+ | "make it bulletproof" | `/audit-orchestrator forensic` |
90
+
91
+ ## Three Power Levels
92
+
93
+ | Level | Time | Phases | Use case |
94
+ |---|---|---|---|
95
+ | ⚡ **Quick** | 5-15 min | Audit only (top 5 findings) | Gut-check, demo prep |
96
+ | 🎯 **Standard** | 30-60 min | Audit → Plan → Fix → Re-audit | Weekly cycle, pre-PR |
97
+ | 🔬 **Forensic** | 1-4h | Full Gestalt-Popper with auto-fix until 100/100 | Pre-launch, compliance |
98
+
99
+ ## Power Tools
100
+
101
+ ### Full audit (the headline feature)
102
+
103
+ ```
104
+ /quality-arsenal full
105
+ ```
106
+
107
+ Dispatches all 18 audits in 3 parallel waves (file-safety partitioned):
108
+
109
+ **Wave 1** (read-only, max parallelism):
110
+ codeaudit, logicaudit, dataaudit, apiaudit, seoaudit, featureaudit, retentionaudit, copyaudit, dxaudit
111
+
112
+ **Wave 2** (depends on Wave 1 outputs):
113
+ secaudit (reads apiaudit verdict), perfaudit, debugaudit, automationaudit
114
+
115
+ **Wave 3** (UI bundle):
116
+ uiuxaudit, motionaudit, a11yaudit, flowaudit
117
+
118
+ **Wave Final**:
119
+ refontaudit if requested
120
+
121
+ After all waves done:
122
+ 1. Aggregate scores into `audits/SYNTHESIS.md`
123
+ 2. Compute overall grade (avg /100, flag any < 80)
124
+ 3. Send Telegram report to project topic with verdict + per-audit links
125
+ 4. Suggest next actions
126
+
127
+ ### Quick presets
128
+
129
+ ```
130
+ /quality-arsenal go-live # secaudit + a11yaudit + perfaudit + dataaudit (go-live trio + GDPR)
131
+ /quality-arsenal ship-ready # featureaudit + debugaudit + dxaudit
132
+ /quality-arsenal investor # uiuxaudit + featureaudit + retentionaudit + copyaudit
133
+ /quality-arsenal redesign # refontaudit + uiuxaudit + motionaudit
134
+ /quality-arsenal new-dev # dxaudit + codeaudit (for fresh contributor onboarding)
135
+ ```
136
+
137
+ ## Workflow integration with Omega
138
+
139
+ When dispatched from an Omega oracle, the skill writes worker tasks into:
140
+
141
+ ```
142
+ ~/.aisb/state/oracle-{Project}-oracle.workers.txt # append worker names
143
+ audits/.{name}audit/progress.json # per-audit progress
144
+ audits/SYNTHESIS.md # aggregate dashboard
145
+ ```
146
+
147
+ The oracle monitors workers via `~/.aisb/lib/oracle-monitor-workers.sh` which is
148
+ already aware of the canonical audits/ path (post-2026-05-13 refactor).
149
+
150
+ ## Output Convention
151
+
152
+ ALL audit outputs live under `audits/.{name}audit/` — never at project root.
153
+ See `AUDIT-VERIFICATION-CONTRACT.md` for the 7-file output spec every audit
154
+ honors: verdict.json, REPORT.md, fix-plan.json, fix-plan.md, iterations.md,
155
+ progress.json, telemetry.json, fix-log.md.
156
+
157
+ ## Why this matters (the deep "why")
158
+
159
+ Agentik OS ships the **Chief AI Officer as a Service** methodology. One core
160
+ hypothesis: **the bottleneck of AI-driven development isn't the AI's ability to
161
+ write code — it's the human's ability to TRUST the code without re-reading
162
+ every line**.
163
+
164
+ Six months ago: humans trusted AI-written code ~30% of the time.
165
+ Today (with Quality Arsenal in the loop): ~80%.
166
+
167
+ The Quality Arsenal is the bridge from "vibe-coded MVP" to "production-grade
168
+ software you'd ship to enterprise customers". It removes the human-machine
169
+ round-trip tax by encoding senior-engineer scrutiny into deterministic protocols
170
+ the AI runs ON ITS OWN OUTPUT before shipping.
171
+
172
+ That's the wedge. That's why this skill exists.
173
+
174
+ ## Sources
175
+
176
+ - Public mirror: https://github.com/agentik-os/claude-code-quality-audits
177
+ - Sister skills: `/audit-orchestrator`, `/audit-tracker`
178
+ - Helper docs: `QUALITY-ARSENAL-PREAMBLE.md`, `ARSENAL-INTERCONNECTIONS.md`,
179
+ `ARSENAL-ORCHESTRATION-PLAYBOOK.md`, `AUDIT-VERIFICATION-CONTRACT.md`
180
+ - Agentik OS: https://agentik-os.com — Chief AI Officer as a Service
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@agentikos/omega-os",
3
- "version": "0.19.5",
3
+ "version": "0.19.6",
4
4
  "description": "Omega OS — installable agentic operating system with verified-completion orchestration. Event-sourced engine, 8-block rack, autonomous agents, MCP.",
5
5
  "bin": {
6
6
  "omega-os": "bin/omega-os.js"