omgkit 2.24.1 → 2.24.3

package/README.md CHANGED
@@ -37,9 +37,9 @@ All coordinated through **Omega-level thinking** - a framework for finding break
37
37
  | Component | Count | Description |
38
38
  |-----------|-------|-------------|
39
39
  | **Agents** | 41 | Specialized AI team members with distinct roles |
40
- | **Commands** | 151 | Slash commands for every development task |
41
- | **Workflows** | 67 | Complete development processes from idea to deploy |
42
- | **Skills** | 157 | Domain expertise modules across 24 categories |
40
+ | **Commands** | 160 | Slash commands for every development task |
41
+ | **Workflows** | 69 | Complete development processes from idea to deploy |
42
+ | **Skills** | 161 | Domain expertise modules across 24 categories |
43
43
  | **Modes** | 10 | Behavioral configurations for different contexts |
44
44
  | **Archetypes** | 14 | Project templates for autonomous development |
45
45
 
@@ -88,6 +88,53 @@ OMGKIT brings agile methodology to AI-assisted development:
88
88
  - **Sprints**: Time-boxed development cycles
89
89
  - **AI Team**: Autonomous execution with human oversight
90
90
 
91
+ ### 4. Testing Automation (New)
92
+
93
+ OMGKIT includes a comprehensive testing automation system:
94
+
95
+ #### Auto-Generate Test Tasks
96
+ When you create a feature, OMGKIT automatically generates corresponding test tasks:
97
+
98
+ ```yaml
99
+ # workflow.yaml
100
+ testing:
101
+   auto_generate_tasks: true
102
+   required_test_types:
103
+     - unit
104
+     - integration
105
+ ```
106
+
107
+ Feature tasks automatically spawn test tasks based on feature type (API → Contract tests, UI → Snapshot tests, etc.)
108
+
109
+ #### Enforce Tests Before Done
110
+ No task can be marked "done" without passing tests:
111
+
112
+ ```yaml
113
+ testing:
114
+   enforcement:
115
+     level: standard # soft | standard | strict
116
+   blocking:
117
+     on_test_failure: true
118
+     on_coverage_below_minimum: true
119
+ ```
120
+
121
+ #### Coverage Gates
122
+ Set minimum and target coverage thresholds:
123
+
124
+ ```yaml
125
+ testing:
126
+   coverage_gates:
127
+     unit:
128
+       minimum: 80
129
+       target: 90
130
+     integration:
131
+       minimum: 60
132
+       target: 75
133
+     overall:
134
+       minimum: 75
135
+       target: 85
136
+ ```
137
+
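One way to read the coverage gates: a suite below `minimum` blocks completion, a suite between `minimum` and `target` passes but has room to improve, and a suite at or above `target` passes cleanly. The Python sketch below illustrates that interpretation only. The `check_gates` and `is_blocked` functions, the status strings, and the measured percentages are hypothetical and not part of OMGKIT.

```python
# Gate values copied from the coverage_gates example; everything else is illustrative.
COVERAGE_GATES = {
    "unit": {"minimum": 80, "target": 90},
    "integration": {"minimum": 60, "target": 75},
    "overall": {"minimum": 75, "target": 85},
}


def check_gates(measured, gates=COVERAGE_GATES):
    """Classify each measured coverage percentage against its gate.

    Returns a dict mapping suite name to "below_minimum", "below_target",
    or "meets_target". Suites without a measurement count as "below_minimum".
    """
    results = {}
    for suite, gate in gates.items():
        value = measured.get(suite)
        if value is None or value < gate["minimum"]:
            results[suite] = "below_minimum"
        elif value < gate["target"]:
            results[suite] = "below_target"
        else:
            results[suite] = "meets_target"
    return results


def is_blocked(results):
    """Assumed rule: any suite below its minimum blocks completion."""
    return any(status == "below_minimum" for status in results.values())


# Hypothetical measurements for one run.
report = check_gates({"unit": 92, "integration": 58, "overall": 81})
print(report)
print("blocked:", is_blocked(report))
```

Keeping classification (`check_gates`) separate from the blocking rule (`is_blocked`) makes it easy to swap in a stricter rule, for example one that also blocks on `below_target`.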
91
138
  ---
92
139
 
93
140
  ## Installation
@@ -222,7 +269,7 @@ Agents are specialized AI team members, each with distinct expertise and respons
222
269
 
223
270
  ---
224
271
 
225
- ## Commands (151)
272
+ ## Commands (160)
226
273
 
227
274
  Commands are slash-prefixed actions organized by namespace.
228
275
 
@@ -260,10 +307,13 @@ Commands are slash-prefixed actions organized by namespace.
260
307
  ### Quality (`/quality:*`)
261
308
 
262
309
  ```bash
263
- /quality:security-scan # Scan for vulnerabilities
310
+ /quality:security-scan # Scan for vulnerabilities
264
311
  /quality:refactor <file> # Improve code structure
265
312
  /quality:optimize <file> # Performance optimization
266
- /quality:lint # Run linting
313
+ /quality:lint # Run linting
314
+ /quality:verify-done # Verify test requirements before completion
315
+ /quality:coverage-check # Check coverage against gates
316
+ /quality:test-plan # Generate comprehensive test plan
267
317
  ```
268
318
 
269
319
  ### Omega (`/omega:*`)
@@ -370,7 +420,7 @@ Commands are slash-prefixed actions organized by namespace.
370
420
 
371
421
  ---
372
422
 
373
- ## Workflows (67)
423
+ ## Workflows (69)
374
424
 
375
425
  Workflows are orchestrated sequences of agents, commands, and skills.
376
426
 
@@ -383,6 +433,12 @@ Workflows are orchestrated sequences of agents, commands, and skills.
383
433
  | `development/refactor` | Code improvement and restructuring |
384
434
  | `development/code-review` | Comprehensive code review |
385
435
 
436
+ ### Testing Automation (New)
437
+
438
+ | Workflow | Description |
439
+ |----------|-------------|
440
+ | `testing/automated-testing` | End-to-end testing automation with task generation, enforcement, and coverage gates |
441
+
386
442
  ### AI Engineering
387
443
 
388
444
  | Workflow | Description |
@@ -454,7 +510,7 @@ Workflows are orchestrated sequences of agents, commands, and skills.
454
510
 
455
511
  ---
456
512
 
457
- ## Skills (157)
513
+ ## Skills (161)
458
514
 
459
515
  Skills are domain expertise modules organized in 24 categories.
460
516
 
@@ -498,7 +554,7 @@ Based on Chip Huyen's "Designing ML Systems" and Stanford CS 329S:
498
554
  | `ml-systems/robust-ai` | Reliability, monitoring, drift detection |
499
555
  | `ml-systems/deployment-paradigms` | Batch vs real-time vs streaming |
500
556
 
501
- ### Methodology (17 skills)
557
+ ### Methodology (19 skills)
502
558
 
503
559
  | Skill | Description |
504
560
  |-------|-------------|
@@ -507,6 +563,8 @@ Based on Chip Huyen's "Designing ML Systems" and Stanford CS 329S:
507
563
  | `methodology/debugging` | Systematic debugging approach |
508
564
  | `methodology/code-review` | Review standards and checklists |
509
565
  | `methodology/tdd` | Test-driven development |
566
+ | `methodology/test-task-generation` | Auto-generate test tasks from features |
567
+ | `methodology/test-enforcement` | Enforce tests before task completion |
510
568
 
511
569
  ### Frameworks (10 skills)
512
570
 
@@ -735,7 +793,7 @@ If any sync issue is detected (missing pages, wrong counts, broken links), the v
735
793
 
736
794
  ## Validation & Testing
737
795
 
738
- OMGKIT has 5700+ automated tests ensuring system integrity.
796
+ OMGKIT has 7300+ automated tests ensuring system integrity.
739
797
 
740
798
  ### Run Tests
741
799
 
package/package.json CHANGED
@@ -1,7 +1,7 @@
1
1
  {
2
2
  "name": "omgkit",
3
- "version": "2.24.1",
4
- "description": "Omega-Level Development Kit - AI Team System for Claude Code. 41 agents, 156 commands, 159 skills, 68 workflows.",
3
+ "version": "2.24.3",
4
+ "description": "Omega-Level Development Kit - AI Team System for Claude Code. 41 agents, 160 commands, 161 skills, 69 workflows.",
5
5
  "keywords": [
6
6
  "claude-code",
7
7
  "ai",
@@ -6,6 +6,9 @@ model: inherit
6
6
  skills:
7
7
  - omega/omega-sprint
8
8
  - methodology/dispatching-parallel-agents
9
+ - methodology/test-task-generation
10
+ - methodology/test-enforcement
11
+ - devops/workflow-config
9
12
  commands:
10
13
  - /sprint:init
11
14
  - /sprint:sprint-new
@@ -198,20 +201,20 @@ Generates:
198
201
 
199
202
  ### Task Type Routing
200
203
 
201
- | Task Type | Primary Agent | Support Agents |
202
- |-----------|---------------|----------------|
203
- | feature | fullstack-developer | planner, tester |
204
- | bugfix | debugger | scout, tester |
205
- | research | oracle | researcher, scout |
206
- | design | architect | planner |
207
- | security | security-auditor | vulnerability-scanner |
208
- | docs | docs-manager | - |
209
- | test | tester | debugger |
210
- | review | code-reviewer | - |
211
- | deploy | git-manager | cicd-manager |
212
- | refactor | fullstack-developer | scout, code-reviewer |
213
- | optimize | fullstack-developer | architect |
214
- | brainstorm | brainstormer | oracle |
204
+ | Task Type | Primary Agent | Support Agents | Auto-Generate Tests? |
205
+ |-----------|---------------|----------------|---------------------|
206
+ | feature | fullstack-developer | planner, tester | ✅ Yes |
207
+ | bugfix | debugger | scout, tester | ✅ Yes (regression) |
208
+ | research | oracle | researcher, scout | ❌ No |
209
+ | design | architect | planner | ❌ No |
210
+ | security | security-auditor | vulnerability-scanner | ✅ Yes (security) |
211
+ | docs | docs-manager | - | ❌ No |
212
+ | test | tester | debugger | ❌ No (is test) |
213
+ | review | code-reviewer | - | ❌ No |
214
+ | deploy | git-manager | cicd-manager | ❌ No |
215
+ | refactor | fullstack-developer | scout, code-reviewer | ✅ Yes |
216
+ | optimize | fullstack-developer | architect | ✅ Yes (perf) |
217
+ | brainstorm | brainstormer | oracle | ❌ No |
215
218
 
216
219
  ### Assignment Protocol
217
220
 
@@ -231,15 +234,201 @@ Generates:
231
234
  - Balance workload
232
235
  - Ensure coverage
233
236
 
234
- 4. SET CONTEXT
237
+ 4. AUTO-GENERATE TEST TASKS (NEW)
238
+ - Read workflow.yaml testing config
239
+ - If auto_generate_tasks: true
240
+ - Create corresponding TEST-XXX tasks
241
+ - Assign to tester agent
242
+
243
+ 5. SET CONTEXT
235
244
  - Provide relevant files
236
245
  - Share dependencies
237
246
  - Define success criteria
247
+ - Include test requirements
238
248
 
239
- 5. MONITOR EXECUTION
249
+ 6. MONITOR EXECUTION
240
250
  - Track progress
241
251
  - Handle blockers
242
252
  - Coordinate handoffs
253
+ - Enforce tests before completion
254
+ ```
255
+
256
+ ---
257
+
258
+ ## Testing Automation Integration
259
+
260
+ ### Configuration Loading
261
+
262
+ At sprint start, read `.omgkit/workflow.yaml` for testing configuration:
263
+
264
+ ```yaml
265
+ # .omgkit/workflow.yaml
266
+ testing:
267
+   enforcement:
268
+     level: standard # soft | standard | strict
269
+   auto_generate_tasks: true
270
+   coverage_gates:
271
+     unit:
272
+       minimum: 80
273
+       target: 90
274
+     integration:
275
+       minimum: 60
276
+       target: 75
277
+   required_test_types:
278
+     - unit
279
+     - integration
280
+   blocking:
281
+     on_test_failure: true
282
+     on_coverage_below_minimum: true
283
+ ```
284
+
285
+ ### Auto Test Task Generation
286
+
287
+ When `auto_generate_tasks: true`, automatically create test tasks:
288
+
289
+ ```
290
+ FEATURE TASK CREATED:
291
+ TASK-042: Implement user authentication
292
+
293
+ AUTO-GENERATED TEST TASKS:
294
+ TEST-042-UNIT: Unit tests for auth service
295
+ TEST-042-INT: Integration tests for auth flow
296
+ TEST-042-SEC: Security tests for auth (if auth feature)
297
+
298
+ TASK LINKING:
299
+ TASK-042.tests = [TEST-042-UNIT, TEST-042-INT, TEST-042-SEC]
300
+ TASK-042.blocked_by = TEST-042-* (all must pass)
301
+ ```
302
+
303
+ ### Feature Type → Test Type Mapping
304
+
305
+ | Feature Type | Auto-Generated Tests |
306
+ |--------------|---------------------|
307
+ | API Endpoint | Unit + Integration + Contract |
308
+ | UI Component | Unit + Snapshot + Accessibility |
309
+ | Database | Unit + Integration + Migration |
310
+ | Auth/Security | Unit + Integration + Security |
311
+ | Business Logic | Unit + Property-based |
312
+ | External Integration | Unit + Integration + Contract |
313
+
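The mapping table, together with the `TEST-042-*` naming used in the auto-generation example, suggests a simple lookup. The sketch below is a hypothetical Python illustration. Only the `UNIT`, `INT`, and `SEC` suffixes appear in the source. The remaining suffixes, the dictionary keys, and the `generate_test_tasks` function are assumptions made for this example.

```python
# Test types per feature type, transcribed from the mapping table.
FEATURE_TEST_TYPES = {
    "api_endpoint": ["unit", "integration", "contract"],
    "ui_component": ["unit", "snapshot", "accessibility"],
    "database": ["unit", "integration", "migration"],
    "auth_security": ["unit", "integration", "security"],
    "business_logic": ["unit", "property_based"],
    "external_integration": ["unit", "integration", "contract"],
}

# UNIT, INT and SEC appear in the source; the other suffixes are invented here.
SUFFIXES = {
    "unit": "UNIT", "integration": "INT", "security": "SEC",
    "contract": "CONTRACT", "snapshot": "SNAP", "accessibility": "A11Y",
    "migration": "MIG", "property_based": "PROP",
}


def generate_test_tasks(task_id, feature_type):
    """Return test task IDs for a feature task such as "TASK-042"."""
    number = task_id.split("-", 1)[1]
    return [f"TEST-{number}-{SUFFIXES[t]}" for t in FEATURE_TEST_TYPES[feature_type]]


tests = generate_test_tasks("TASK-042", "auth_security")
print(tests)
```

An unknown feature type raises `KeyError` here. A real implementation would presumably fall back to the configured `required_test_types` instead.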
314
+ ### Test Task Template
315
+
316
+ ```markdown
317
+ ## TEST-XXX: [Test Description]
318
+
319
+ **Parent Task**: TASK-XXX
320
+ **Type**: [unit | integration | e2e | security | performance]
321
+ **Priority**: Same as parent
322
+
323
+ ### Acceptance Criteria
324
+ - [ ] All tests pass
325
+ - [ ] Coverage ≥ minimum threshold
326
+ - [ ] No skipped critical tests
327
+ - [ ] Test isolation verified
328
+
329
+ ### Test Scope
330
+ - Functions/components to test
331
+ - Edge cases to cover
332
+ - Security scenarios (if applicable)
333
+ ```
334
+
335
+ ### Enforcement Levels
336
+
337
+ #### Soft Enforcement
338
+ ```
339
+ - Warn when completing without tests
340
+ - Allow override with justification
341
+ - Log for retrospective
342
+ - No blocking
343
+ ```
344
+
345
+ #### Standard Enforcement (Default)
346
+ ```
347
+ - Block task completion without tests
348
+ - Require minimum coverage
349
+ - Allow emergency override with approval
350
+ - Create follow-up tasks for overrides
351
+ ```
352
+
353
+ #### Strict Enforcement
354
+ ```
355
+ - Block ALL completion without full test suite
356
+ - Require coverage above target
357
+ - No overrides allowed
358
+ - Auto-reject PRs without tests
359
+ ```
360
+
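The three levels differ in whether missing tests block completion and whether an override is possible. A minimal Python sketch of that decision follows. The `Decision` type and `decide` function are hypothetical, and the `override_approved` flag stands in for whatever approval mechanism OMGKIT actually uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Decision:
    allowed: bool  # may the task be marked done?
    note: str      # short explanation for the user


def decide(level, tests_ok, override_approved=False):
    """Apply the soft / standard / strict rules listed above.

    tests_ok: tests exist, pass, and meet the coverage bar for the level.
    override_approved: an emergency override was requested and approved.
    """
    if level not in ("soft", "standard", "strict"):
        raise ValueError(f"unknown enforcement level: {level}")
    if tests_ok:
        return Decision(True, "requirements met")
    if level == "soft":
        # Soft never blocks; it only warns and logs.
        return Decision(True, "warning logged for retrospective")
    if level == "standard" and override_approved:
        return Decision(True, "override logged; follow-up task required")
    # Standard without approval, and strict in every case.
    return Decision(False, "blocked until tests pass")


for level in ("soft", "standard", "strict"):
    print(level, decide(level, tests_ok=False, override_approved=True))
```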
361
+ ### Definition of Done (with Testing)
362
+
363
+ Before marking ANY task as DONE, verify:
364
+
365
+ ```
366
+ DEFINITION OF DONE CHECKLIST:
367
+ ├── Code
368
+ │ ├── [x] Implementation complete
369
+ │ ├── [x] Code review passed
370
+ │ └── [x] No lint errors
371
+
372
+ ├── Tests (MANDATORY)
373
+ │ ├── [x] Test tasks created
374
+ │ ├── [x] All tests passing
375
+ │ ├── [x] Coverage ≥ minimum
376
+ │ └── [x] No skipped critical tests
377
+
378
+ ├── Documentation
379
+ │ ├── [x] Code documented
380
+ │ └── [x] API docs updated (if API)
381
+
382
+ └── Ready for merge
383
+ ```
384
+
385
+ ### Completion Blocking
386
+
387
+ ```
388
+ User: "Mark TASK-042 as done"
389
+
390
+ Sprint Master checks:
391
+ ├── Has test tasks? → Yes (TEST-042-UNIT, TEST-042-INT)
392
+ ├── TEST-042-UNIT status? → ✅ Complete
393
+ ├── TEST-042-INT status? → ❌ In Progress
394
+ └── Block completion
395
+
396
+ Response: "Cannot mark TASK-042 as done.
397
+ TEST-042-INT is still in progress.
398
+
399
+ Test Status:
400
+ ✅ TEST-042-UNIT: Passed (45 tests, 92% coverage)
401
+ 🔄 TEST-042-INT: In Progress
402
+
403
+ Complete all test tasks first, or use /quality:verify-done for details."
404
+ ```
405
+
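The check in the example above amounts to looking up the test tasks linked to a task and refusing completion while any of them is unfinished. A hypothetical Python sketch follows, with the task store modeled as plain dictionaries. The data layout, the status strings, and the `can_mark_done` name are assumptions.

```python
# Linked test tasks and their statuses, mirroring the TASK-042 example.
TASK_TESTS = {"TASK-042": ["TEST-042-UNIT", "TEST-042-INT"]}
TEST_STATUS = {"TEST-042-UNIT": "complete", "TEST-042-INT": "in_progress"}


def can_mark_done(task_id, task_tests=TASK_TESTS, test_status=TEST_STATUS):
    """Return (ok, pending): ok is True only if every linked test task is complete."""
    linked = task_tests.get(task_id, [])
    if not linked:
        # Assumption: a task with no test tasks at all is also blocked.
        return False, []
    pending = [t for t in linked if test_status.get(t) != "complete"]
    return not pending, pending


ok, pending = can_mark_done("TASK-042")
if ok:
    print("TASK-042 can be marked as done.")
else:
    print(f"Cannot mark TASK-042 as done. Pending: {', '.join(pending)}")
```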
406
+ ### Override Protocol
407
+
408
+ ```
409
+ User: "Override test requirement for TASK-042"
410
+
411
+ Sprint Master checks:
412
+ ├── Enforcement level? → Standard
413
+ ├── Override allowed? → Yes
414
+ ├── Justification required? → Yes
415
+
416
+ Response: "Override requested for TASK-042.
417
+
418
+ Please provide:
419
+ 1. Reason for override
420
+ 2. Risk assessment
421
+ 3. Follow-up plan for tests
422
+
423
+ Note: This will be logged and reviewed in retrospective."
424
+
425
+ User: "Hotfix for production - tests will be added in TASK-043"
426
+
427
+ Sprint Master:
428
+ ├── Log override with justification
429
+ ├── Create follow-up task: TASK-043 (Add tests for TASK-042)
430
+ ├── Mark TASK-042 as done with override flag
431
+ └── Add to retrospective items
243
432
  ```
244
433
 
245
434
  ---
@@ -464,6 +653,12 @@ When blocker detected:
464
653
  - `/spawn [agent] [task]` - Run agent in parallel
465
654
  - `/spawn:collect` - Collect parallel results
466
655
 
656
+ ### Testing Commands
657
+ - `/quality:verify-done` - Verify test requirements before completion
658
+ - `/quality:coverage-check` - Check coverage against gates
659
+ - `/quality:test-plan` - Generate test plan for feature
660
+ - `/dev:feature-tested [desc]` - Create feature with auto-generated tests
661
+
467
662
  ### Omega Commands
468
663
  - `/init` - Initialize Omega mode
469
664
  - `/10x [task]` - Find 10x approach
@@ -0,0 +1,208 @@
1
+ ---
2
+ name: Feature Tested
3
+ description: Create a feature with automatically generated test tasks. Ensures every implementation task has corresponding test coverage before the feature can be marked complete.
4
+ category: dev
5
+ related_skills:
6
+ - methodology/test-task-generation
7
+ - methodology/test-enforcement
8
+ - methodology/executing-plans
9
+ related_commands:
10
+ - /quality:test-plan
11
+ - /quality:verify-done
12
+ - /dev:feature
13
+ - /dev:test
14
+ allowed-tools: Task, Read, Write, Bash, Grep, Glob
15
+ ---
16
+
17
+ # /dev:feature-tested
18
+
19
+ Build a feature with automatically generated test tasks. This command ensures comprehensive test coverage by creating test tasks alongside implementation tasks.
20
+
21
+ ## Usage
22
+
23
+ ```bash
24
+ /dev:feature-tested <feature-description>
25
+ /dev:feature-tested "Add user authentication" --coverage 90
26
+ /dev:feature-tested "Payment processing" --test-types unit,integration,e2e
27
+ ```
28
+
29
+ ## Options
30
+
31
+ | Option | Description | Default |
32
+ |--------|-------------|---------|
33
+ | `--coverage` | Minimum coverage target | 80% |
34
+ | `--test-types` | Required test types | unit,integration |
35
+ | `--tdd` | Use TDD approach (tests first) | false |
36
+ | `--strict` | Strict enforcement (no overrides) | false |
37
+
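To make the defaults in the table concrete, here is a hypothetical Python sketch that parses the same options with `argparse`. It is not OMGKIT's implementation and says nothing about how the slash command is actually processed. It only illustrates the option semantics and defaults listed above, and the `parse_feature_tested` name is invented for the example.

```python
import argparse


def parse_feature_tested(argv):
    """Parse /dev:feature-tested style arguments using the documented defaults."""
    parser = argparse.ArgumentParser(prog="/dev:feature-tested")
    parser.add_argument("description", help="feature description")
    parser.add_argument("--coverage", type=int, default=80,
                        help="minimum coverage target in percent")
    # argparse applies `type` to string defaults, so the default also becomes a list.
    parser.add_argument("--test-types", default="unit,integration",
                        type=lambda s: s.split(","),
                        help="comma-separated required test types")
    parser.add_argument("--tdd", action="store_true", help="write tests first")
    parser.add_argument("--strict", action="store_true", help="no overrides")
    return parser.parse_args(argv)


args = parse_feature_tested(["Payment processing", "--coverage", "95", "--strict"])
print(args.description, args.coverage, args.test_types, args.tdd, args.strict)
```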
38
+ ## How It Works
39
+
40
+ ### 1. Feature Analysis
41
+ Analyzes the feature description to determine:
42
+ - Feature type (API, UI, business logic, etc.)
43
+ - Required test types
44
+ - Coverage targets
45
+ - Acceptance criteria
46
+
47
+ ### 2. Task Generation
48
+ Creates implementation tasks AND corresponding test tasks:
49
+
50
+ ```
51
+ Feature: Add user profile API
52
+
53
+ Generated Tasks:
54
+ ┌─────────────────────────────────────────────────────────────┐
55
+ │ Implementation Tasks │
56
+ ├─────────────────────────────────────────────────────────────┤
57
+ │ ☐ TASK-001: Create profile database schema │
58
+ │ ☐ TASK-002: Implement profile service │
59
+ │ ☐ TASK-003: Create profile API endpoints │
60
+ │ ☐ TASK-004: Add input validation │
61
+ └─────────────────────────────────────────────────────────────┘
62
+
63
+ ┌─────────────────────────────────────────────────────────────┐
64
+ │ Test Tasks (Auto-Generated) │
65
+ ├─────────────────────────────────────────────────────────────┤
66
+ │ ☐ TEST-001: Unit tests for profile service │
67
+ │ ☐ TEST-002: Integration tests for profile API │
68
+ │ ☐ TEST-003: Contract tests for API schema │
69
+ │ ☐ TEST-004: Security tests for profile endpoints │
70
+ └─────────────────────────────────────────────────────────────┘
71
+ ```
72
+
73
+ ### 3. Enforcement
74
+ - Cannot mark feature as "done" until all test tasks complete
75
+ - Coverage must meet minimum threshold
76
+ - All tests must pass
77
+
78
+ ## Output Format
79
+
80
+ ```
81
+ ╔══════════════════════════════════════════════════════════════╗
82
+ ║ FEATURE WITH TESTS CREATED ║
83
+ ╚══════════════════════════════════════════════════════════════╝
84
+
85
+ Feature: Add user profile API
86
+ ID: FEAT-042
87
+ Coverage Target: 90%
88
+
89
+ ┌─────────────────────────────────────────────────────────────┐
90
+ │ Implementation Tasks (4) │
91
+ ├─────────────────────────────────────────────────────────────┤
92
+ │ TASK-001: Create profile database schema [Pending] │
93
+ │ TASK-002: Implement profile service [Pending] │
94
+ │ TASK-003: Create profile API endpoints [Pending] │
95
+ │ TASK-004: Add input validation [Pending] │
96
+ └─────────────────────────────────────────────────────────────┘
97
+
98
+ ┌─────────────────────────────────────────────────────────────┐
99
+ │ Test Tasks (4) - Auto-Generated │
100
+ ├─────────────────────────────────────────────────────────────┤
101
+ │ TEST-001: Unit tests for profile service [Pending] │
102
+ │ → Coverage target: 90% for src/services/profile.ts │
103
+ │ → Test file: tests/unit/services/profile.test.ts │
104
+ │ │
105
+ │ TEST-002: Integration tests for profile API [Pending] │
106
+ │ → Coverage target: 75% for API endpoints │
107
+ │ → Test file: tests/integration/api/profile.int.test.ts │
108
+ │ │
109
+ │ TEST-003: Contract tests for API schema [Pending] │
110
+ │ → Validates: Request/response schemas │
111
+ │ → Test file: tests/contract/profile.contract.test.ts │
112
+ │ │
113
+ │ TEST-004: Security tests for profile endpoints [Pending] │
114
+ │ → Checks: Auth, injection, XSS │
115
+ │ → Test file: tests/security/profile.security.test.ts │
116
+ └─────────────────────────────────────────────────────────────┘
117
+
118
+ ┌─────────────────────────────────────────────────────────────┐
119
+ │ Completion Requirements │
120
+ ├─────────────────────────────────────────────────────────────┤
121
+ │ ☐ All implementation tasks complete │
122
+ │ ☐ All test tasks complete │
123
+ │ ☐ Overall coverage ≥ 90% │
124
+ │ ☐ All tests passing │
125
+ │ ☐ No security vulnerabilities │
126
+ │ ☐ Code review approved │
127
+ └─────────────────────────────────────────────────────────────┘
128
+
129
+ Next: Start with TASK-001 or use --tdd to write tests first
130
+ ```
131
+
132
+ ## TDD Mode
133
+
134
+ With `--tdd` flag, tests are created and executed first:
135
+
136
+ ```bash
137
+ /dev:feature-tested "Add user profile API" --tdd
138
+ ```
139
+
140
+ Flow:
141
+ 1. Generate test tasks first
142
+ 2. Write failing tests (Red)
143
+ 3. Implement to pass tests (Green)
144
+ 4. Refactor (Refactor)
145
+ 5. Verify coverage
146
+
147
+ ## Workflow Integration
148
+
149
+ ### Sprint Planning
150
+ ```bash
151
+ /sprint:sprint-new
152
+ # Add feature with tests
153
+ /dev:feature-tested "User profile management"
154
+ ```
155
+
156
+ ### Daily Development
157
+ ```bash
158
+ # Check what's needed
159
+ /quality:verify-done FEAT-042
160
+
161
+ # Work on implementation
162
+ # Work on tests
163
+ # Verify completion
164
+ /quality:verify-done FEAT-042
165
+ ```
166
+
167
+ ### Feature Completion
168
+ ```bash
169
+ # Attempt to complete
170
+ /quality:verify-done FEAT-042
171
+
172
+ # If all requirements met:
173
+ # ✅ Feature FEAT-042 marked as DONE
174
+
175
+ # If requirements not met:
176
+ # ❌ Cannot complete: Coverage 75% below 90% minimum
177
+ ```
178
+
179
+ ## Examples
180
+
181
+ ### Basic feature with tests
182
+ ```bash
183
+ /dev:feature-tested "Add user authentication"
184
+ ```
185
+
186
+ ### With strict coverage
187
+ ```bash
188
+ /dev:feature-tested "Payment processing" --coverage 95 --strict
189
+ ```
190
+
191
+ ### TDD approach
192
+ ```bash
193
+ /dev:feature-tested "Shopping cart" --tdd
194
+ ```
195
+
196
+ ### Specific test types
197
+ ```bash
198
+ /dev:feature-tested "Admin dashboard" --test-types unit,e2e,security
199
+ ```
200
+
201
+ ## Comparison with /dev:feature
202
+
203
+ | Aspect | /dev:feature | /dev:feature-tested |
204
+ |--------|-------------|---------------------|
205
+ | Test tasks | Manual | Auto-generated |
206
+ | Enforcement | Soft | Hard (blocking) |
207
+ | Coverage tracking | Manual | Automatic |
208
+ | Completion check | Manual | Automatic |