openhermes 2.8.0 → 4.0.0
- package/CONTEXT.md +18 -0
- package/ETHOS.md +15 -0
- package/README.md +135 -292
- package/bootstrap.mjs +174 -512
- package/harness/agents/openhermes.md +87 -0
- package/harness/codex/CONSTITUTION.md +70 -148
- package/harness/codex/ROUTING.md +126 -0
- package/harness/commands/oh-doctor.md +26 -0
- package/harness/instructions/CONVENTIONS.md +206 -206
- package/harness/instructions/RUNTIME.md +54 -31
- package/harness/skills/oh-builder/SKILL.md +98 -0
- package/harness/skills/oh-caveman/SKILL.md +33 -0
- package/harness/skills/oh-expert/SKILL.md +121 -0
- package/harness/skills/oh-freeze/SKILL.md +28 -0
- package/harness/skills/oh-gauntlet/SKILL.md +119 -0
- package/harness/skills/oh-grill/SKILL.md +77 -0
- package/harness/skills/oh-guard/SKILL.md +33 -0
- package/harness/skills/oh-handoff/SKILL.md +33 -0
- package/harness/skills/oh-health/SKILL.md +90 -0
- package/harness/skills/oh-init/SKILL.md +78 -0
- package/harness/skills/oh-investigate/SKILL.md +35 -0
- package/harness/skills/oh-issue/SKILL.md +36 -0
- package/harness/skills/oh-learn/SKILL.md +28 -0
- package/harness/skills/oh-manifest/SKILL.md +84 -0
- package/harness/skills/oh-plan-review/SKILL.md +128 -0
- package/harness/skills/oh-planner/SKILL.md +157 -0
- package/harness/skills/oh-prd/SKILL.md +35 -0
- package/harness/skills/oh-retro/SKILL.md +33 -0
- package/harness/skills/oh-review/SKILL.md +110 -0
- package/harness/skills/oh-security/SKILL.md +110 -0
- package/harness/skills/oh-ship/SKILL.md +39 -0
- package/harness/skills/oh-skill-craft/SKILL.md +107 -0
- package/harness/skills/oh-skills-link/SKILL.md +29 -0
- package/harness/skills/oh-skills-list/SKILL.md +31 -0
- package/harness/skills/oh-triage/SKILL.md +36 -0
- package/index.mjs +3 -60
- package/lib/harness-resolver.mjs +77 -0
- package/lib/logger.mjs +62 -0
- package/package.json +49 -53
- package/test/plugins-behavioral.test.mjs +64 -0
- package/test/plugins.test.mjs +62 -0
- package/autorecall.mjs +0 -237
- package/curator.mjs +0 -482
- package/harness/commands/build-fix.md +0 -60
- package/harness/commands/checkpoint.md +0 -68
- package/harness/commands/code-review.md +0 -71
- package/harness/commands/doctor.md +0 -42
- package/harness/commands/eval.md +0 -89
- package/harness/commands/go-build.md +0 -87
- package/harness/commands/go-review.md +0 -71
- package/harness/commands/harness-audit.md +0 -90
- package/harness/commands/learn.md +0 -37
- package/harness/commands/loop-start.md +0 -38
- package/harness/commands/loop-status.md +0 -30
- package/harness/commands/memory-search.md +0 -37
- package/harness/commands/model-route.md +0 -32
- package/harness/commands/ohc.md +0 -13
- package/harness/commands/orchestrate.md +0 -88
- package/harness/commands/plan.md +0 -53
- package/harness/commands/quality-gate.md +0 -35
- package/harness/commands/refactor-clean.md +0 -102
- package/harness/commands/rust-build.md +0 -78
- package/harness/commands/rust-review.md +0 -65
- package/harness/commands/security.md +0 -93
- package/harness/commands/setup-pm.md +0 -65
- package/harness/commands/skill-create.md +0 -99
- package/harness/commands/test-coverage.md +0 -80
- package/harness/commands/update-codemaps.md +0 -81
- package/harness/commands/update-docs.md +0 -67
- package/harness/commands/verify.md +0 -68
- package/harness/prompts/architect.txt +0 -189
- package/harness/prompts/build-cpp.md +0 -98
- package/harness/prompts/build-error-resolver.md +0 -44
- package/harness/prompts/build-go.md +0 -340
- package/harness/prompts/build-java.md +0 -140
- package/harness/prompts/build-kotlin.md +0 -137
- package/harness/prompts/build-rust.md +0 -108
- package/harness/prompts/code-reviewer.md +0 -40
- package/harness/prompts/doc-updater.md +0 -206
- package/harness/prompts/docs-lookup.md +0 -71
- package/harness/prompts/e2e-runner.txt +0 -317
- package/harness/prompts/explore.md +0 -42
- package/harness/prompts/harness-optimizer.md +0 -42
- package/harness/prompts/loop-operator.md +0 -53
- package/harness/prompts/planner.md +0 -37
- package/harness/prompts/refactor-cleaner.md +0 -256
- package/harness/prompts/review-cpp.md +0 -81
- package/harness/prompts/review-database.md +0 -261
- package/harness/prompts/review-go.md +0 -257
- package/harness/prompts/review-java.md +0 -113
- package/harness/prompts/review-kotlin.md +0 -143
- package/harness/prompts/review-python.md +0 -101
- package/harness/prompts/review-rust.md +0 -77
- package/harness/prompts/security-reviewer.md +0 -42
- package/harness/prompts/tdd-guide.md +0 -228
- package/harness/rules/audit.md +0 -84
- package/harness/rules/checkpointing.md +0 -75
- package/harness/rules/context-loading.md +0 -33
- package/harness/rules/credential-exposure.md +0 -0
- package/harness/rules/delegation.md +0 -80
- package/harness/rules/handoff.md +0 -267
- package/harness/rules/memory-management.md +0 -28
- package/harness/rules/precedence.md +0 -52
- package/harness/rules/promotion.md +0 -46
- package/harness/rules/ranking.md +0 -64
- package/harness/rules/retrieval.md +0 -94
- package/harness/rules/runtime-guards.md +0 -196
- package/harness/rules/self-heal.md +0 -79
- package/harness/rules/session-start.md +0 -34
- package/harness/rules/skills-management.md +0 -165
- package/harness/rules/state-drift.md +0 -192
- package/harness/rules/verification.md +0 -88
- package/harness/scripts/sync-commands.mjs +0 -259
- package/harness/skills/.bundled_manifest +0 -17
- package/harness/skills/.usage.json +0 -6
- package/harness/skills/api-design/SKILL.md +0 -523
- package/harness/skills/backend-patterns/SKILL.md +0 -598
- package/harness/skills/coding-standards/SKILL.md +0 -549
- package/harness/skills/e2e-testing/SKILL.md +0 -326
- package/harness/skills/frontend-patterns/SKILL.md +0 -642
- package/harness/skills/frontend-slides/SKILL.md +0 -184
- package/harness/skills/security-review/SKILL.md +0 -495
- package/harness/skills/strategic-compact/SKILL.md +0 -131
- package/harness/skills/tdd-workflow/SKILL.md +0 -463
- package/harness/skills/verification-loop/SKILL.md +0 -126
- package/lib/ambient-memory.mjs +0 -167
- package/lib/handoff.mjs +0 -171
- package/lib/hardening.mjs +0 -146
- package/lib/memory-tools-plugin.mjs +0 -368
- package/lib/ohc/block-sync.mjs +0 -69
- package/lib/ohc/compress/search.mjs +0 -152
- package/lib/ohc/compress/state.mjs +0 -76
- package/lib/ohc/config.mjs +0 -185
- package/lib/ohc/message-ids.mjs +0 -178
- package/lib/ohc/notify.mjs +0 -135
- package/lib/ohc/protected-patterns.mjs +0 -55
- package/lib/ohc/prune-apply.mjs +0 -134
- package/lib/ohc/pruner.mjs +0 -608
- package/lib/ohc/reaper.mjs +0 -70
- package/lib/ohc/state.mjs +0 -265
- package/lib/ohc/strategies/deduplication.mjs +0 -72
- package/lib/ohc/strategies/index.mjs +0 -2
- package/lib/ohc/strategies/purge-errors.mjs +0 -43
- package/lib/ohc/token-utils.mjs +0 -26
- package/lib/ohc/updater.mjs +0 -132
- package/lib/paths.mjs +0 -49
- package/lib/schema-validator.mjs +0 -79
- package/lib/search.mjs +0 -48
- package/schemas/audit.schema.json +0 -82
- package/schemas/backlog.schema.json +0 -63
- package/schemas/checkpoint.schema.json +0 -65
- package/schemas/constraint.schema.json +0 -62
- package/schemas/decision.schema.json +0 -63
- package/schemas/instinct.schema.json +0 -63
- package/schemas/loop-state.schema.json +0 -33
- package/schemas/mistake.schema.json +0 -64
- package/schemas/verification_receipt.schema.json +0 -88
- package/skill-builder.mjs +0 -88
package/harness/commands/doctor.md
DELETED
@@ -1,42 +0,0 @@
----
-description: Run OpenCode openhermes health diagnostics
-agent: OpenHermes
-subtask: true
----
-
-# Doctor Command
-
-Run full OpenCode openhermes diagnostics. $ARGUMENTS
-
-## Your Task
-
-1. Load the opencode-doctor skill: `skill({ name: "opencode-doctor" })`
-2. Follow its instructions to validate:
-- Config syntax (opencode.json valid JSON)
-- Provider connectivity (LM Studio at http://127.0.0.1:1234/v1)
-- Cache state (memory records, recall cache)
-- Auth file integrity
-3. Report results with any fix suggestions
-
-## Automated Checks
-
-Run these commands and report results:
-
-!opencode debug config
-!opencode debug info
-!opencode debug paths
-
-## Report Format
-
-| Check | Result | Issue |
-|-------|--------|-------|
-| Config JSON | PASS/FAIL | |
-| Provider | PASS/FAIL | |
-| Memory MCP | PASS/FAIL | |
-| Plugins | PASS/FAIL | |
-| Skills | PASS/FAIL | |
-
-## After Diagnosis
-
-If issues found: propose fixes, wait for approval, apply.
-If clean: report "OpenHermes: HEALTHY."
package/harness/commands/eval.md
DELETED
@@ -1,89 +0,0 @@
----
-description: Run evaluation against acceptance criteria
-agent: planner
-subtask: true
----
-
-# Eval Command
-
-Evaluate implementation against acceptance criteria: $ARGUMENTS
-
-## Your Task
-
-Run structured evaluation to verify the implementation meets requirements.
-
-## Evaluation Framework
-
-### Grader Types
-
-1. **Binary Grader** - Pass/Fail
-- Does it work? Yes/No
-- Good for: feature completion, bug fixes
-
-2. **Scalar Grader** - Score 0-100
-- How well does it work?
-- Good for: performance, quality metrics
-
-3. **Rubric Grader** - Category scores
-- Multiple dimensions evaluated
-- Good for: comprehensive review
-
-## Evaluation Process
-
-### Step 1: Define Criteria
-
-```
-Acceptance Criteria:
-1. [Criterion 1] - [weight]
-2. [Criterion 2] - [weight]
-3. [Criterion 3] - [weight]
-```
-
-### Step 2: Run Tests
-
-For each criterion:
-- Execute relevant test
-- Collect evidence
-- Score result
-
-### Step 3: Calculate Score
-
-```
-Final Score = Σ (criterion_score × weight) / total_weight
-```
-
-### Step 4: Report
-
-## Evaluation Report
-
-### Overall: [PASS/FAIL] (Score: X/100)
-
-### Criterion Breakdown
-
-| Criterion | Score | Weight | Weighted |
-|-----------|-------|--------|----------|
-| [Criterion 1] | X/10 | 30% | X |
-| [Criterion 2] | X/10 | 40% | X |
-| [Criterion 3] | X/10 | 30% | X |
-
-### Evidence
-
-**Criterion 1: [Name]**
-- Test: [what was tested]
-- Result: [outcome]
-- Evidence: [screenshot, log, output]
-
-### Recommendations
-
-[If not passing, what needs to change]
-
-## Pass@K Metrics
-
-For non-deterministic evaluations:
-- Run K times
-- Calculate pass rate
-- Report: "Pass@K = X/K"
-
----
-
-**TIP**: Use eval for acceptance testing before marking features complete.
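Reviewer note on the removed eval.md: the scoring rule it describes (`Final Score = Σ (criterion_score × weight) / total_weight`) and its "Pass@K = X/K" report can be sketched in a few lines of Python. This is an illustrative sketch only, not code from the openhermes package. The function names `weighted_score` and `pass_rate` and the sample scores are hypothetical. "Pass@K" here means the plain pass count over K runs, as the removed file words it.

```python
from typing import Callable, Sequence


def weighted_score(results: Sequence[tuple[float, float]]) -> float:
    """Return sum(score * weight) / sum(weight) for (score, weight) pairs."""
    total_weight = sum(weight for _, weight in results)
    if total_weight <= 0:
        raise ValueError("total weight must be positive")
    return sum(score * weight for score, weight in results) / total_weight


def pass_rate(run_once: Callable[[], bool], k: int) -> str:
    """Run a check k times and format the outcome as 'Pass@K = X/K'."""
    if k <= 0:
        raise ValueError("k must be positive")
    passes = sum(1 for _ in range(k) if run_once())
    return f"Pass@{k} = {passes}/{k}"


# Hypothetical scores on a 0-10 scale, using the 30/40/30 weights
# shown in the removed report template.
criteria = [(8.0, 0.30), (6.0, 0.40), (10.0, 0.30)]
final = weighted_score(criteria)
print(f"Final score: {final * 10:.0f}/100")
```

Dividing by the total weight means the weights do not have to sum to exactly 1, which matches the formula in the removed file.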
package/harness/commands/go-build.md
DELETED
@@ -1,87 +0,0 @@
----
-description: Fix Go build and vet errors
-agent: build-go
-subtask: true
----
-
-# Go Build Command
-
-Fix Go build, vet, and compilation errors: $ARGUMENTS
-
-## Your Task
-
-1. **Run go build**: `go build ./...`
-2. **Run go vet**: `go vet ./...`
-3. **Fix errors** one by one
-4. **Verify fixes** don't introduce new errors
-
-## Common Go Errors
-
-### Import Errors
-```
-imported and not used: "package"
-```
-**Fix**: Remove unused import or use `_` prefix
-
-### Type Errors
-```
-cannot use x (type T) as type U
-```
-**Fix**: Add type conversion or fix type definition
-
-### Undefined Errors
-```
-undefined: identifier
-```
-**Fix**: Import package, define variable, or fix typo
-
-### Vet Errors
-```
-printf: call has arguments but no formatting directives
-```
-**Fix**: Add format directive or remove arguments
-
-## Fix Order
-
-1. **Import errors** - Fix or remove imports
-2. **Type definitions** - Ensure types exist
-3. **Function signatures** - Match parameters
-4. **Vet warnings** - Address static analysis
-
-## Build Commands
-
-```bash
-# Build all packages
-go build ./...
-
-# Build with race detector
-go build -race ./...
-
-# Build for specific OS/arch
-GOOS=linux GOARCH=amd64 go build ./...
-
-# Run go vet
-go vet ./...
-
-# Run staticcheck
-staticcheck ./...
-
-# Format code
-gofmt -w .
-
-# Tidy dependencies
-go mod tidy
-```
-
-## Verification
-
-After fixes:
-```bash
-go build ./... # Should succeed
-go vet ./... # Should have no warnings
-go test ./... # Tests should pass
-```
-
----
-
-**IMPORTANT**: Fix errors only. No refactoring, no improvements. Get the build green with minimal changes.
package/harness/commands/go-review.md
DELETED
@@ -1,71 +0,0 @@
----
-description: Go code review for idiomatic patterns
-agent: review-go
-subtask: true
----
-
-# Go Review Command
-
-Review Go code for idiomatic patterns and best practices: $ARGUMENTS
-
-## Your Task
-
-1. **Analyze Go code** for idioms and patterns
-2. **Check concurrency** - goroutines, channels, mutexes
-3. **Review error handling** - proper error wrapping
-4. **Verify performance** - allocations, bottlenecks
-
-## Review Checklist
-
-### Idiomatic Go
-- [ ] Package naming (lowercase, no underscores)
-- [ ] Variable naming (camelCase, short)
-- [ ] Interface naming (ends with -er)
-- [ ] Error naming (starts with Err)
-
-### Error Handling
-- [ ] Errors are checked, not ignored
-- [ ] Errors wrapped with context (`fmt.Errorf("...: %w", err)`)
-- [ ] Sentinel errors used appropriately
-- [ ] Custom error types when needed
-
-### Concurrency
-- [ ] Goroutines properly managed
-- [ ] Channels buffered appropriately
-- [ ] No data races (use `-race` flag)
-- [ ] Context passed for cancellation
-- [ ] WaitGroups used correctly
-
-### Performance
-- [ ] Avoid unnecessary allocations
-- [ ] Use `sync.Pool` for frequent allocations
-- [ ] Prefer value receivers for small structs
-- [ ] Buffer I/O operations
-
-### Code Organization
-- [ ] Small, focused packages
-- [ ] Clear dependency direction
-- [ ] Internal packages for private code
-- [ ] Godoc comments on exports
-
-## Report Format
-
-### Idiomatic Issues
-- [file:line] Issue description
-Suggestion: How to fix
-
-### Error Handling Issues
-- [file:line] Issue description
-Suggestion: How to fix
-
-### Concurrency Issues
-- [file:line] Issue description
-Suggestion: How to fix
-
-### Performance Issues
-- [file:line] Issue description
-Suggestion: How to fix
-
----
-
-**TIP**: Run `go vet` and `staticcheck` for additional automated checks.
package/harness/commands/harness-audit.md
DELETED
@@ -1,90 +0,0 @@
----
-description: Run a self-evaluation audit of the OpenHermes harness
-agent: harness-optimizer
-subtask: true
----
-
-# Harness Audit Command
-
-Run a self-evaluation audit of the OpenHermes harness and return a prioritized scorecard.
-
-## Usage
-
-`/harness-audit [scope] [--format text|json] [--root path]`
-
-- `scope` (optional): `repo` (default), `hooks`, `skills`, `commands`, `agents`
-- `--format`: output style (`text` default, `json` for automation)
-- `--root`: audit a specific path instead of the current working directory
-
-## Self-Evaluation Checklist
-
-Evaluate each category by inspecting the harness files directly. No external script needed.
-
-### 1. Tool Coverage (0-10)
-- [ ] Commands exist for all subagent types
-- [ ] Each command has correct frontmatter (description, agent, subtask)
-- [ ] Agent mapping table is complete and accurate
-- [ ] Language-specific agents exist (Go, Rust)
-- [ ] All command files are discoverable
-
-### 2. Context Efficiency (0-10)
-- [ ] Commands are concise (under 100 lines each)
-- [ ] No redundant or overlapping command content
-- [ ] Delegation instructions are clear
-- [ ] Subagent handoffs minimize context overhead
-- [ ] Templates provide structured output formats
-
-### 3. Quality Gates (0-10)
-- [ ] verify.md has comprehensive checklist
-- [ ] quality-gate.md covers lint/type/build
-- [ ] test-coverage.md has meaningful targets
-- [ ] eval.md has structured scoring framework
-- [ ] checkpoint.md enables state tracking
-
-### 4. Memory Persistence (0-10)
-- [ ] Memory tools documented (ohc_save/ohc_get/ohc_list/ohc_latest/ohc_search)
-- [ ] Checkpoint command references memory persistence
-- [ ] Mistake/audit logging workflow documented
-- [ ] Recall cache strategy defined
-
-### 5. Eval Coverage (0-10)
-- [ ] eval.md supports binary/scalar/rubric grading
-- [ ] Acceptance criteria framework in place
-- [ ] Pass@K metrics for non-deterministic evals
-- [ ] Evaluation report format defined
-
-### 6. Security Guardrails (0-10)
-- [ ] verify.md includes security checklist items
-- [ ] No hardcoded secrets guidance present
-- [ ] Input validation guidance included
-- [ ] SQL injection / XSS risks addressed
-
-### 7. Cost Efficiency (0-10)
-- [ ] model-route.md has budget tiers
-- [ ] Model routing heuristic defined
-- [ ] Subagent usage reduces main-context tokens
-- [ ] Compression workflow documented
-
-## Output Contract
-
-Return:
-
-1. `overall_score` out of `max_score` (70 for `repo`; smaller for scoped audits)
-2. Category scores and concrete findings
-3. Failed checks with exact file paths
-4. Top 3 actions to improve
-5. Suggested commands to apply next
-
-## Example Result
-
-```text
-Harness Audit (repo): 66/70
-- Tool Coverage: 10/10 (10/10 pts)
-- Context Efficiency: 9/10 (9/10 pts)
-- Quality Gates: 10/10 (10/10 pts)
-
-Top 3 Actions:
-1) [Security Guardrails] Add preflight security guard instructions in verify.md.
-2) [Tool Coverage] Ensure all subagent types have corresponding command files.
-3) [Eval Coverage] Add evaluation templates and examples.
-```
package/harness/commands/learn.md
DELETED
@@ -1,37 +0,0 @@
----
-description: Create a new skill from recent work patterns
-agent: OpenHermes
-subtask: true
----
-
-# Learn Command
-
-Create a new reusable skill from recent work patterns. $ARGUMENTS
-
-## Your Task
-
-1. **Search backlog** for pending skill candidates:
-- Use `ohc_search` with query="skill-candidate" classes=["backlog"]
-- If $ARGUMENTS is non-empty, narrow search to that topic
-2. **Analyze the candidate** — what pattern did the session reveal?
-3. **Create the skill**:
-- Load the skill-creator skill: `skill({ name: "skill-creator" })`
-- Follow its instructions to create a new SKILL.md
-- Target: `%USERPROFILE%\.config\opencode\skills\<name>\SKILL.md`
-- Naming: lowercase, hyphenated, descriptive
-4. **Close the backlog entry**: `ohc_save(class="backlog", id="<candidate-id>", data={..., status:"closed"})`
-5. **Report**: What skill was created, where, and what it does
-
-## Skill Requirements
-
-- name: lowercase-hyphenated, 1-64 chars
-- description: 1-1024 chars, specific enough for agent to know when to load
-- Must include frontmatter with name + description
-- Must include: what it does, when to use it, step-by-step workflow
-
-## Report Format
-
-**Skill Created**: `<name>`
-**Path**: `%USERPROFILE%\.config\opencode\skills\<name>\SKILL.md`
-**Purpose**: [one-line summary]
-**Trigger words**: [when agent should load this skill]
package/harness/commands/loop-start.md
DELETED
@@ -1,38 +0,0 @@
----
-description: Start a managed autonomous loop pattern with safety defaults
-agent: loop-operator
-subtask: true
----
-
-# Loop Start Command
-
-Start a managed autonomous loop pattern with safety defaults.
-
-## Usage
-
-`/loop-start [pattern] [--mode safe|fast]`
-
-- `pattern`: `sequential`, `continuous-pr`, `rfc-dag`, `infinite`
-- `--mode`:
-- `safe` (default): strict quality gates and checkpoints
-- `fast`: reduced gates for speed
-
-## Flow
-
-1. Confirm repository state and branch strategy.
-2. Select loop pattern and model tier strategy.
-3. Enable required hooks/profile for the chosen mode.
-4. Create loop plan and write runbook under `.opencode/plans/`.
-5. Print commands to start and monitor the loop.
-
-## Required Safety Checks
-
-- Verify tests pass before first loop iteration.
-- Ensure `OH_HOOK_PROFILE` is not disabled globally.
-- Ensure loop has explicit stop condition.
-
-## Arguments
-
-$ARGUMENTS:
-- `<pattern>` optional (`sequential|continuous-pr|rfc-dag|infinite`)
-- `--mode safe|fast` optional
package/harness/commands/loop-status.md
DELETED
@@ -1,30 +0,0 @@
----
-description: Inspect active loop state, progress, and failure signals
-agent: OpenHermes
-subtask: true
----
-
-# Loop Status Command
-
-Inspect active loop state, progress, and failure signals.
-
-## Usage
-
-`/loop-status [--watch]`
-
-## What to Report
-
-- active loop pattern
-- current phase and last successful checkpoint
-- failing checks (if any)
-- estimated time/cost drift
-- recommended intervention (continue/pause/stop)
-
-## Watch Mode
-
-When `--watch` is present, refresh status periodically and surface state changes.
-
-## Arguments
-
-$ARGUMENTS:
-- `--watch` optional
package/harness/commands/memory-search.md
DELETED
@@ -1,37 +0,0 @@
----
-description: Search openhermes memory with LLM summarization
-agent: OpenHermes
-subtask: true
----
-
-# Memory Search Command
-
-Search openhermes memory for: $ARGUMENTS
-
-## Your Task
-
-1. Call `ohc_search` with query="$ARGUMENTS" to get raw results
-2. Summarize the top 5 results with natural language interpretation
-3. Highlight patterns, recurring themes, and actionable insights
-4. Return a structured report
-
-## Report Format
-
-### Search: "$ARGUMENTS"
-
-**Top results** (by relevance score):
-
-| # | Class | Summary | Score |
-|---|-------|---------|-------|
-
-### Key Patterns
-
-[Extract 2-3 patterns from the results]
-
-### Actionable Insights
-
-- [What should be done based on these findings]
-
-### Recommended Next Query
-
-[Suggest a follow-up ohc_search query for deeper exploration]
package/harness/commands/model-route.md
DELETED
@@ -1,32 +0,0 @@
----
-description: Recommend optimal model tier for the current task
-agent: OpenHermes
-subtask: true
----
-
-# Model Route Command
-
-Recommend the best model tier for the current task by complexity and budget.
-
-## Usage
-
-`/model-route [task-description] [--budget low|med|high]`
-
-## Routing Heuristic
-
-- `haiku`: deterministic, low-risk mechanical changes
-- `sonnet`: default for implementation and refactors
-- `opus`: architecture, deep review, ambiguous requirements
-
-## Required Output
-
-- recommended model
-- confidence level
-- why this model fits
-- fallback model if first attempt fails
-
-## Arguments
-
-$ARGUMENTS:
-- `[task-description]` optional free-text
-- `--budget low|med|high` optional
package/harness/commands/ohc.md
DELETED
@@ -1,13 +0,0 @@
----
-description: OHC context management: /ohc status, /ohc compress [focus]
-agent: OpenHermes
-subtask: true
----
-
-# Ohc Command
-
-OHC context management — hook-handled.
-
-Run `/ohc status` to check context usage, `/ohc compress [targetTokens] [focus]` to free space.
-
-$ARGUMENTS
package/harness/commands/orchestrate.md
DELETED
@@ -1,88 +0,0 @@
----
-description: Orchestrate multiple agents for complex tasks
-agent: planner
-subtask: true
----
-
-# Orchestrate Command
-
-Orchestrate multiple specialized agents for this complex task: $ARGUMENTS
-
-## Your Task
-
-1. **Analyze task complexity** and break into subtasks
-2. **Identify optimal agents** for each subtask
-3. **Create execution plan** with dependencies
-4. **Coordinate execution** - parallel where possible
-5. **Synthesize results** into unified output
-
-## Available Agents
-
-| Agent | Specialty | Use For |
-|-------|-----------|---------|
-| planner | Implementation planning | Complex feature design |
-| architect | System design | Architectural decisions |
-| code-reviewer | Code quality | Review changes |
-| security-reviewer | Security analysis | Vulnerability detection |
-| tdd-guide | Test-driven dev | Feature implementation |
-| build-error-resolver | Build fixes | TypeScript/build errors |
-| e2e-runner | E2E testing | User flow testing |
-| doc-updater | Documentation | Updating docs |
-| refactor-cleaner | Code cleanup | Dead code removal |
-| review-go | Go code | Go-specific review |
-| build-go | Go builds | Go build errors |
-| review-database | Database | Query optimization |
-
-## Orchestration Patterns
-
-### Sequential Execution
-```
-planner → tdd-guide → code-reviewer → security-reviewer
-```
-Use when: Later tasks depend on earlier results
-
-### Parallel Execution
-```
-         ┌→ security-reviewer
-planner →├→ code-reviewer
-         └→ architect
-```
-Use when: Tasks are independent
-
-### Fan-Out/Fan-In
-```
-         ┌→ agent-1 ─┐
-planner →├→ agent-2 ─┼→ synthesizer
-         └→ agent-3 ─┘
-```
-Use when: Multiple perspectives needed
-
-## Execution Plan Format
-
-### Phase 1: [Name]
-- Agent: [agent-name]
-- Task: [specific task]
-- Depends on: [none or previous phase]
-
-### Phase 2: [Name] (parallel)
-- Agent A: [agent-name]
-- Task: [specific task]
-- Agent B: [agent-name]
-- Task: [specific task]
-- Depends on: Phase 1
-
-### Phase 3: Synthesis
-- Combine results from Phase 2
-- Generate unified output
-
-## Coordination Rules
-
-1. **Plan before execute** - Create full execution plan first
-2. **Minimize handoffs** - Reduce context switching
-3. **Parallelize when possible** - Independent tasks in parallel
-4. **Clear boundaries** - Each agent has specific scope
-5. **Single source of truth** - One agent owns each artifact
-
----
-
-**NOTE**: Complex tasks benefit from multi-agent orchestration. Simple tasks should use single agents directly.
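Reviewer note on the removed orchestrate.md: its fan-out/fan-in pattern and the phased execution plan ("Depends on: Phase 1") amount to grouping agents by dependency level. The sketch below shows one way to derive such phases with Python's standard `graphlib` module. It is an assumption-laden illustration, not code from the openhermes package. The helper name `plan_phases` and the dependency mapping are hypothetical, with agent names borrowed from the removed file's table and diagrams.

```python
from graphlib import TopologicalSorter


def plan_phases(depends_on: dict[str, set[str]]) -> list[list[str]]:
    """Group agents into phases; agents in one phase can run in parallel."""
    sorter = TopologicalSorter(depends_on)
    sorter.prepare()  # raises graphlib.CycleError if the dependencies loop
    phases = []
    while sorter.is_active():
        # Every agent returned here has all of its dependencies finished.
        ready = sorted(sorter.get_ready())
        phases.append(ready)
        sorter.done(*ready)
    return phases


# Hypothetical fan-out/fan-in plan: each key maps an agent to the
# agents whose output it waits for.
fan_out_fan_in = {
    "planner": set(),
    "security-reviewer": {"planner"},
    "code-reviewer": {"planner"},
    "architect": {"planner"},
    "synthesizer": {"security-reviewer", "code-reviewer", "architect"},
}

for number, agents in enumerate(plan_phases(fan_out_fan_in), start=1):
    print(f"Phase {number}: {', '.join(agents)}")
```

Sorting each ready batch only makes the printed plan deterministic; the agents within a phase have no ordering constraint between them.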