npm - openhermes - Versions diffs - 1.2.2 - Mend

openhermes 1.2.2

Files changed (69)

package/README.md +281 -0
package/autorecall.mjs +167 -0
package/bootstrap.mjs +255 -0
package/curator.mjs +470 -0
package/harness/commands/build-fix.md +60 -0
package/harness/commands/code-review.md +71 -0
package/harness/commands/doctor.md +42 -0
package/harness/commands/learn.md +37 -0
package/harness/commands/memory-search.md +37 -0
package/harness/commands/plan.md +53 -0
package/harness/commands/security.md +93 -0
package/harness/constitution/soul.md +76 -0
package/harness/instructions/RUNTIME.md +21 -0
package/harness/prompts/architect.txt +175 -0
package/harness/prompts/build-error-resolver.md +37 -0
package/harness/prompts/code-reviewer.md +33 -0
package/harness/prompts/e2e-runner.txt +305 -0
package/harness/prompts/explore.md +29 -0
package/harness/prompts/planner.md +30 -0
package/harness/prompts/security-reviewer.md +35 -0
package/harness/rules/audit.md +84 -0
package/harness/rules/checkpointing.md +75 -0
package/harness/rules/context-loading.md +33 -0
package/harness/rules/credential-exposure.md +0 -0
package/harness/rules/delegation.md +76 -0
package/harness/rules/memory-management.md +28 -0
package/harness/rules/precedence.md +52 -0
package/harness/rules/promotion.md +46 -0
package/harness/rules/ranking.md +64 -0
package/harness/rules/retrieval.md +94 -0
package/harness/rules/runtime-guards.md +196 -0
package/harness/rules/self-heal.md +79 -0
package/harness/rules/session-start.md +34 -0
package/harness/rules/skills-management.md +165 -0
package/harness/rules/state-drift.md +192 -0
package/harness/rules/verification.md +88 -0
package/harness/skills/.bundled_manifest +17 -0
package/harness/skills/.usage.json +6 -0
package/harness/skills/api-design/SKILL.md +523 -0
package/harness/skills/backend-patterns/SKILL.md +598 -0
package/harness/skills/coding-standards/SKILL.md +549 -0
package/harness/skills/e2e-testing/SKILL.md +326 -0
package/harness/skills/frontend-patterns/SKILL.md +642 -0
package/harness/skills/frontend-slides/SKILL.md +184 -0
package/harness/skills/security-review/SKILL.md +495 -0
package/harness/skills/strategic-compact/SKILL.md +131 -0
package/harness/skills/tdd-workflow/SKILL.md +463 -0
package/harness/skills/verification-loop/SKILL.md +126 -0
package/index.mjs +5 -0
package/lib/hardening.mjs +113 -0
package/lib/memory-tools-plugin.mjs +265 -0
package/lib/schema-validator.mjs +77 -0
package/lib/tools/_memory.mjs +230 -0
package/lib/tools/hm_get.mjs +13 -0
package/lib/tools/hm_latest.mjs +12 -0
package/lib/tools/hm_list.mjs +13 -0
package/lib/tools/hm_put.mjs +14 -0
package/lib/tools/hm_search.mjs +16 -0
package/package.json +49 -0
package/schemas/audit.schema.json +61 -0
package/schemas/backlog.schema.json +42 -0
package/schemas/checkpoint.schema.json +44 -0
package/schemas/constraint.schema.json +41 -0
package/schemas/decision.schema.json +42 -0
package/schemas/instinct.schema.json +42 -0
package/schemas/loop-state.schema.json +33 -0
package/schemas/mistake.schema.json +43 -0
package/schemas/verification_receipt.schema.json +67 -0
package/skill-builder.mjs +113 -0

package/harness/skills/strategic-compact/SKILL.md ADDED

@@ -0,0 +1,131 @@
+---
+name: strategic-compact
+description: Suggests manual context compaction at logical intervals to preserve context through task phases rather than arbitrary auto-compaction.
+origin: ECC
+---
+# Strategic Compact Skill
+Suggests manual `/compact` at strategic points in your workflow rather than relying on arbitrary auto-compaction.
+## When to Activate
+- Running long sessions that approach context limits (200K+ tokens)
+- Working on multi-phase tasks (research → plan → implement → test)
+- Switching between unrelated tasks within the same session
+- After completing a major milestone and starting new work
+- When responses slow down or become less coherent (context pressure)
+## Why Strategic Compaction?
+Auto-compaction triggers at arbitrary points:
+- Often mid-task, losing important context
+- No awareness of logical task boundaries
+- Can interrupt complex multi-step operations
+Strategic compaction at logical boundaries:
+- **After exploration, before execution** — Compact research context, keep implementation plan
+- **After completing a milestone** — Fresh start for next phase
+- **Before major context shifts** — Clear exploration context before different task
+## How It Works
+The `suggest-compact.js` script runs on PreToolUse (Edit/Write) and:
+1. **Tracks tool calls** — Counts tool invocations in session
+2. **Threshold detection** — Suggests at configurable threshold (default: 50 calls)
+3. **Periodic reminders** — Reminds every 25 calls after threshold
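The counting logic described above could look roughly like the following sketch. The actual `suggest-compact.js` is not shown in this diff, so the function name `shouldSuggestCompact` and its parameters are hypothetical; only the default threshold (50) and the reminder interval (25) come from the text.

```typescript
// Hypothetical sketch: the real suggest-compact.js is not part of this diff.
// Returns true when a compaction suggestion should be shown for this call count.
function shouldSuggestCompact(
  callCount: number,
  threshold: number = 50, // default stated in the skill text
  reminderInterval: number = 25 // "every 25 calls after threshold"
): boolean {
  if (callCount < threshold) return false;
  // First suggestion exactly at the threshold, then one reminder per interval.
  return (callCount - threshold) % reminderInterval === 0;
}
```

With the defaults, this sketch would suggest at calls 50, 75, 100, and so on, and stay silent in between.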
+## Hook Setup
+Add to your `~/.claude/settings.json`:
+```json
+{
+  "hooks": {
+    "PreToolUse": [
+      {
+        "matcher": "Edit",
+        "hooks": [{ "type": "command", "command": "node ~/.claude/skills/strategic-compact/suggest-compact.js" }]
+      },
+      {
+        "matcher": "Write",
+        "hooks": [{ "type": "command", "command": "node ~/.claude/skills/strategic-compact/suggest-compact.js" }]
+      }
+    ]
+  }
+}
+```
+## Configuration
+Environment variables:
+- `COMPACT_THRESHOLD` — Tool calls before first suggestion (default: 50)
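One way a script might read this variable is sketched below. The helper name `readCompactThreshold` is hypothetical, and falling back to 50 for invalid values is an assumption; the text only states the default.

```typescript
// Hypothetical helper: reads COMPACT_THRESHOLD from an env-like object.
// Falling back to the default for invalid input is an assumption, not documented behavior.
function readCompactThreshold(
  env: Record<string, string | undefined>,
  fallback: number = 50 // default stated in the skill text
): number {
  const raw = env["COMPACT_THRESHOLD"];
  if (raw === undefined) return fallback;
  const parsed = parseInt(raw, 10);
  // Reject NaN, zero, and negative values instead of suggesting on every call.
  return !isNaN(parsed) && parsed > 0 ? parsed : fallback;
}
```

In a Node script this would typically be called as `readCompactThreshold(process.env)`.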
+## Compaction Decision Guide
+Use this table to decide when to compact:
+| Phase Transition | Compact? | Why |
+|-----------------|----------|-----|
+| Research → Planning | Yes | Research context is bulky; plan is the distilled output |
+| Planning → Implementation | Yes | Plan is in TodoWrite or a file; free up context for code |
+| Implementation → Testing | Maybe | Keep if tests reference recent code; compact if switching focus |
+| Debugging → Next feature | Yes | Debug traces pollute context for unrelated work |
+| Mid-implementation | No | Losing variable names, file paths, and partial state is costly |
+| After a failed approach | Yes | Clear the dead-end reasoning before trying a new approach |
+## What Survives Compaction
+Understanding what persists helps you compact with confidence:
+| Persists | Lost |
+|----------|------|
+| CLAUDE.md instructions | Intermediate reasoning and analysis |
+| TodoWrite task list | File contents you previously read |
+| Memory files (`~/.claude/memory/`) | Multi-step conversation context |
+| Git state (commits, branches) | Tool call history and counts |
+| Files on disk | Nuanced user preferences stated verbally |
+## Best Practices
+1. **Compact after planning** — Once plan is finalized in TodoWrite, compact to start fresh
+2. **Compact after debugging** — Clear error-resolution context before continuing
+3. **Don't compact mid-implementation** — Preserve context for related changes
+4. **Read the suggestion** — The hook tells you *when*, you decide *if*
+5. **Write before compacting** — Save important context to files or memory before compacting
+6. **Use `/compact` with a summary** — Add a custom message: `/compact Focus on implementing auth middleware next`
+## Token Optimization Patterns
+### Trigger-Table Lazy Loading
+Instead of loading full skill content at session start, use a trigger table that maps keywords to skill paths. Skills load only when triggered, reducing baseline context by 50%+:
+| Trigger | Skill | Load When |
+|---------|-------|-----------|
+| "test", "tdd", "coverage" | tdd-workflow | User mentions testing |
+| "security", "auth", "xss" | security-review | Security-related work |
+| "deploy", "ci/cd" | deployment-patterns | Deployment context |
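The trigger table above can be sketched as a small lookup. The names `TriggerRow`, `TRIGGER_TABLE`, and `skillsToLoad` are illustrative, and plain substring matching is an assumption; the skill does not say how triggers are matched.

```typescript
// Hypothetical trigger table mirroring the rows above.
interface TriggerRow {
  keywords: string[];
  skill: string;
}

const TRIGGER_TABLE: TriggerRow[] = [
  { keywords: ["test", "tdd", "coverage"], skill: "tdd-workflow" },
  { keywords: ["security", "auth", "xss"], skill: "security-review" },
  { keywords: ["deploy", "ci/cd"], skill: "deployment-patterns" },
];

// Returns the skills whose keywords appear in the message.
// Substring matching is a simplification: "test" also matches "latest".
function skillsToLoad(message: string, table: TriggerRow[] = TRIGGER_TABLE): string[] {
  const text = message.toLowerCase();
  return table
    .filter((row) => row.keywords.some((keyword) => text.indexOf(keyword) !== -1))
    .map((row) => row.skill);
}
```

A message that matches no keyword returns an empty list, so no skill content would be loaded for it.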
+### Context Composition Awareness
+Monitor what's consuming your context window:
+- **CLAUDE.md files** — Always loaded, keep lean
+- **Loaded skills** — Each skill adds 1-5K tokens
+- **Conversation history** — Grows with each exchange
+- **Tool results** — File reads, search results add bulk
+### Duplicate Instruction Detection
+Common sources of duplicate context:
+- Same rules in both `~/.claude/rules/` and project `.claude/rules/`
+- Skills that repeat CLAUDE.md instructions
+- Multiple skills covering overlapping domains
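A rough way to spot the first kind of duplication is to compare instruction files line by line. The sketch below is an assumption about how such a check could work, not a tool shipped with this package; `findDuplicateLines` is a hypothetical name, and exact line matching will miss reworded duplicates.

```typescript
// Hypothetical sketch: finds non-empty lines that appear in more than one source.
// Keys are source labels (for example file paths); values are the file contents.
function findDuplicateLines(sources: Map<string, string>): Map<string, string[]> {
  const seenIn = new Map<string, Set<string>>();
  sources.forEach((content, label) => {
    for (const rawLine of content.split("\n")) {
      const line = rawLine.trim();
      if (line === "") continue; // ignore blank lines
      const labels = seenIn.get(line) || new Set<string>();
      labels.add(label); // a Set ignores repeats within the same source
      seenIn.set(line, labels);
    }
  });
  const duplicates = new Map<string, string[]>();
  seenIn.forEach((labels, line) => {
    if (labels.size > 1) duplicates.set(line, Array.from(labels));
  });
  return duplicates;
}
```

Each key in the result is a duplicated line, and its value lists the sources that contain it.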
+### Context Optimization Tools
+- `token-optimizer` MCP — Automated 95%+ token reduction via content deduplication
+- `context-mode` — Context virtualization (315KB to 5.4KB demonstrated)
+## Related
+- [The Longform Guide](https://x.com/affaanmustafa/status/2014040193557471352) — Token optimization section
+- Memory persistence hooks — For state that survives compaction
+- `continuous-learning` skill — Extracts patterns before session ends

package/harness/skills/tdd-workflow/SKILL.md ADDED


@@ -0,0 +1,463 @@
+---
+name: tdd-workflow
+description: Use this skill when writing new features, fixing bugs, or refactoring code. Enforces test-driven development with 80%+ coverage including unit, integration, and E2E tests.
+origin: ECC
+---
+# Test-Driven Development Workflow
+This skill ensures all code development follows TDD principles with comprehensive test coverage.
+## When to Activate
+- Writing new features or functionality
+- Fixing bugs or issues
+- Refactoring existing code
+- Adding API endpoints
+- Creating new components
+## Core Principles
+### 1. Tests BEFORE Code
+ALWAYS write tests first, then implement code to make tests pass.
+### 2. Coverage Requirements
+- Minimum 80% coverage (unit + integration + E2E)
+- All edge cases covered
+- Error scenarios tested
+- Boundary conditions verified
+### 3. Test Types
+#### Unit Tests
+- Individual functions and utilities
+- Component logic
+- Pure functions
+- Helpers and utilities
+#### Integration Tests
+- API endpoints
+- Database operations
+- Service interactions
+- External API calls
+#### E2E Tests (Playwright)
+- Critical user flows
+- Complete workflows
+- Browser automation
+- UI interactions
+### 4. Git Checkpoints
+- If the repository is under Git, create a checkpoint commit after each TDD stage
+- Do not squash or rewrite these checkpoint commits until the workflow is complete
+- Each checkpoint commit message must describe the stage and the exact evidence captured
+- Count only commits created on the current active branch for the current task
+- Do not treat commits from other branches, earlier unrelated work, or distant branch history as valid checkpoint evidence
+- Before treating a checkpoint as satisfied, verify that the commit is reachable from the current `HEAD` on the active branch and belongs to the current task sequence
+- The preferred compact workflow is:
+  - one commit for failing test added and RED validated
+  - one commit for minimal fix applied and GREEN validated
+  - one optional commit for refactor complete
+- Separate evidence-only commits are not required if the test commit clearly corresponds to RED and the fix commit clearly corresponds to GREEN
+## TDD Workflow Steps
+### Step 1: Write User Journeys
+```
+As a [role], I want to [action], so that [benefit]
+Example:
+As a user, I want to search for markets semantically,
+so that I can find relevant markets even without exact keywords.
+```
+### Step 2: Generate Test Cases
+For each user journey, create comprehensive test cases:
+```typescript
+describe('Semantic Search', () => {
+  it('returns relevant markets for query', async () => {
+    // Test implementation
+  })
+  it('handles empty query gracefully', async () => {
+    // Test edge case
+  })
+  it('falls back to substring search when Redis unavailable', async () => {
+    // Test fallback behavior
+  })
+  it('sorts results by similarity score', async () => {
+    // Test sorting logic
+  })
+})
+```
+### Step 3: Run Tests (They Should Fail)
+```bash
+npm test
+# Tests should fail: nothing is implemented yet
+```
+This step is mandatory and is the RED gate for all production changes.
+Before modifying business logic or other production code, you must verify a valid RED state via one of these paths:
+- Runtime RED:
+  - The relevant test target compiles successfully
+  - The new or changed test is actually executed
+  - The result is RED
+- Compile-time RED:
+  - The new test newly instantiates, references, or exercises the buggy code path
+  - The compile failure is itself the intended RED signal
+- In either case, the failure is caused by the intended business-logic bug, undefined behavior, or missing implementation
+- The failure is not caused only by unrelated syntax errors, broken test setup, missing dependencies, or unrelated regressions
+A test that was only written but not compiled and executed does not count as RED.
+Do not edit production code until this RED state is confirmed.
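The RED gate above can be read as a small decision function. The sketch below is illustrative only: the `TestRun` shape and its field names are hypothetical, and judging whether a failure stems from the intended bug still has to be done by whoever fills in the record.

```typescript
// Hypothetical record of a test run; field names are illustrative.
interface TestRun {
  compiled: boolean; // the relevant test target compiled
  executed: boolean; // the new or changed test actually ran
  failed: boolean; // the test result was a failure
  compileFailureIsIntended: boolean; // the new test exercises the buggy or missing code path
  causedByIntendedBug: boolean; // failure stems from the target bug or missing implementation
}

function isValidRed(run: TestRun): boolean {
  // Broken setup, syntax errors, or unrelated regressions never count as RED.
  if (!run.causedByIntendedBug) return false;
  const runtimeRed = run.compiled && run.executed && run.failed;
  const compileTimeRed = !run.compiled && run.compileFailureIsIntended;
  return runtimeRed || compileTimeRed;
}
```

A test that compiled but was never executed returns `false` here, which matches the rule that a written-but-unrun test is not RED.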
+If the repository is under Git, create a checkpoint commit immediately after this stage is validated.
+Recommended commit message format:
+- `test: add reproducer for <feature or bug>`
+- This commit may also serve as the RED validation checkpoint if the reproducer was compiled and executed and failed for the intended reason
+- Verify that this checkpoint commit is on the current active branch before continuing
+### Step 4: Implement Code
+Write minimal code to make tests pass:
+```typescript
+// Implementation guided by tests
+export async function searchMarkets(query: string) {
+  // Implementation here
+}
+```
+If the repository is under Git, stage the minimal fix now but defer the checkpoint commit until GREEN is validated in Step 5.
+### Step 5: Run Tests Again
+```bash
+npm test
+# Tests should now pass
+```
+Rerun the same relevant test target after the fix and confirm the previously failing test is now GREEN.
+Only after a valid GREEN result may you proceed to refactor.
+If the repository is under Git, create a checkpoint commit immediately after GREEN is validated.
+Recommended commit message format:
+- `fix: <feature or bug>`
+- The fix commit may also serve as the GREEN validation checkpoint if the same relevant test target was rerun and passed
+- Verify that this checkpoint commit is on the current active branch before continuing
+### Step 6: Refactor
+Improve code quality while keeping tests green:
+- Remove duplication
+- Improve naming
+- Optimize performance
+- Enhance readability
+If the repository is under Git, create a checkpoint commit immediately after refactoring is complete and tests remain green.
+Recommended commit message format:
+- `refactor: clean up after <feature or bug> implementation`
+- Verify that this checkpoint commit is on the current active branch before considering the TDD cycle complete
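The recommended commit message formats imply an ordering that can be checked mechanically. The sketch below assumes exactly one commit per stage, listed oldest first; `followsCheckpointOrder` is a hypothetical name, and the check does not verify that the commits are reachable from `HEAD`.

```typescript
// Prefixes taken from the recommended commit message formats above.
const EXPECTED_PREFIXES: string[] = ["test:", "fix:", "refactor:"];

// Hypothetical check: do the task's commit subjects (oldest first) follow
// test -> fix -> optional refactor, with one commit per stage?
function followsCheckpointOrder(subjects: string[]): boolean {
  if (subjects.length < 2 || subjects.length > EXPECTED_PREFIXES.length) return false;
  return subjects.every((subject, i) => subject.indexOf(EXPECTED_PREFIXES[i]) === 0);
}
```

The refactor commit is optional in this sketch, so a two-commit sequence of `test:` then `fix:` passes.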
+### Step 7: Verify Coverage
+```bash
+npm run test:coverage
+# Verify 80%+ coverage achieved
+```
+## Testing Patterns
+### Unit Test Pattern (Jest/Vitest)
+```typescript
+import { render, screen, fireEvent } from '@testing-library/react'
+import { Button } from './Button'
+describe('Button Component', () => {
+  it('renders with correct text', () => {
+    render(<Button>Click me</Button>)
+    expect(screen.getByText('Click me')).toBeInTheDocument()
+  })
+  it('calls onClick when clicked', () => {
+    const handleClick = jest.fn()
+    render(<Button onClick={handleClick}>Click</Button>)
+    fireEvent.click(screen.getByRole('button'))
+    expect(handleClick).toHaveBeenCalledTimes(1)
+  })
+  it('is disabled when disabled prop is true', () => {
+    render(<Button disabled>Click</Button>)
+    expect(screen.getByRole('button')).toBeDisabled()
+  })
+})
+```
+### API Integration Test Pattern
+```typescript
+import { NextRequest } from 'next/server'
+import { GET } from './route'
+describe('GET /api/markets', () => {
+  it('returns markets successfully', async () => {
+    const request = new NextRequest('http://localhost/api/markets')
+    const response = await GET(request)
+    const data = await response.json()
+    expect(response.status).toBe(200)
+    expect(data.success).toBe(true)
+    expect(Array.isArray(data.data)).toBe(true)
+  })
+  it('validates query parameters', async () => {
+    const request = new NextRequest('http://localhost/api/markets?limit=invalid')
+    const response = await GET(request)
+    expect(response.status).toBe(400)
+  })
+  it('handles database errors gracefully', async () => {
+    // Mock database failure
+    const request = new NextRequest('http://localhost/api/markets')
+    // Test error handling
+  })
+})
+```
+### E2E Test Pattern (Playwright)
+```typescript
+import { test, expect } from '@playwright/test'
+test('user can search and filter markets', async ({ page }) => {
+  // Navigate to markets page
+  await page.goto('/')
+  await page.click('a[href="/markets"]')
+  // Verify page loaded
+  await expect(page.locator('h1')).toContainText('Markets')
+  // Search for markets
+  await page.fill('input[placeholder="Search markets"]', 'election')
+  // Wait for debounce and results
+  await page.waitForTimeout(600)
+  // Verify search results displayed
+  const results = page.locator('[data-testid="market-card"]')
+  await expect(results).toHaveCount(5, { timeout: 5000 })
+  // Verify results contain search term
+  const firstResult = results.first()
+  await expect(firstResult).toContainText('election', { ignoreCase: true })
+  // Filter by status
+  await page.click('button:has-text("Active")')
+  // Verify filtered results
+  await expect(results).toHaveCount(3)
+})
+test('user can create a new market', async ({ page }) => {
+  // Assumes an authenticated session (login steps omitted)
+  await page.goto('/creator-dashboard')
+  // Fill market creation form
+  await page.fill('input[name="name"]', 'Test Market')
+  await page.fill('textarea[name="description"]', 'Test description')
+  await page.fill('input[name="endDate"]', '2025-12-31')
+  // Submit form
+  await page.click('button[type="submit"]')
+  // Verify success message
+  await expect(page.locator('text=Market created successfully')).toBeVisible()
+  // Verify redirect to market page
+  await expect(page).toHaveURL(/\/markets\/test-market/)
+})
+```
+## Test File Organization
+```
+src/
+├── components/
+│   ├── Button/
+│   │   ├── Button.tsx
+│   │   ├── Button.test.tsx          # Unit tests
+│   │   └── Button.stories.tsx       # Storybook
+│   └── MarketCard/
+│       ├── MarketCard.tsx
+│       └── MarketCard.test.tsx
+├── app/
+│   └── api/
+│       └── markets/
+│           ├── route.ts
+│           └── route.test.ts         # Integration tests
+└── e2e/
+    ├── markets.spec.ts               # E2E tests
+    ├── trading.spec.ts
+    └── auth.spec.ts
+```
+## Mocking External Services
+### Supabase Mock
+```typescript
+jest.mock('@/lib/supabase', () => ({
+  supabase: {
+    from: jest.fn(() => ({
+      select: jest.fn(() => ({
+        eq: jest.fn(() => Promise.resolve({
+          data: [{ id: 1, name: 'Test Market' }],
+          error: null
+        }))
+      }))
+    }))
+  }
+}))
+```
+### Redis Mock
+```typescript
+jest.mock('@/lib/redis', () => ({
+  searchMarketsByVector: jest.fn(() => Promise.resolve([
+    { slug: 'test-market', similarity_score: 0.95 }
+  ])),
+  checkRedisHealth: jest.fn(() => Promise.resolve({ connected: true }))
+}))
+```
+### OpenAI Mock
+```typescript
+jest.mock('@/lib/openai', () => ({
+  generateEmbedding: jest.fn(() => Promise.resolve(
+    new Array(1536).fill(0.1) // Mock 1536-dim embedding
+  ))
+}))
+```
+## Test Coverage Verification
+### Run Coverage Report
+```bash
+npm run test:coverage
+```
+### Coverage Thresholds
+```json
+{
+  "jest": {
+    "coverageThreshold": {
+      "global": {
+        "branches": 80,
+        "functions": 80,
+        "lines": 80,
+        "statements": 80
+      }
+    }
+  }
+}
+```
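Jest enforces configured thresholds itself and fails the run when they are not met. The sketch below only illustrates the comparison being made; the `CoverageSummary` shape and `metricsBelowThreshold` are hypothetical and are not Jest APIs.

```typescript
// Hypothetical coverage summary: one percentage per metric.
type Metric = "branches" | "functions" | "lines" | "statements";
type CoverageSummary = Record<Metric, number>;

// Returns the metrics that fall below the minimum (80 mirrors the config above).
function metricsBelowThreshold(summary: CoverageSummary, minimum: number = 80): Metric[] {
  const metrics: Metric[] = ["branches", "functions", "lines", "statements"];
  return metrics.filter((metric) => summary[metric] < minimum);
}
```

An empty result means every metric meets the minimum; a value exactly at 80 counts as passing in this sketch.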
+## Common Testing Mistakes to Avoid
+### WRONG: Testing Implementation Details
+```typescript
+// Don't test internal state
+expect(component.state.count).toBe(5)
+```
+### CORRECT: Test User-Visible Behavior
+```typescript
+// Test what users see
+expect(screen.getByText('Count: 5')).toBeInTheDocument()
+```
+### WRONG: Brittle Selectors
+```typescript
+// Breaks easily
+await page.click('.css-class-xyz')
+```
+### CORRECT: Semantic Selectors
+```typescript
+// Resilient to changes
+await page.click('button:has-text("Submit")')
+await page.click('[data-testid="submit-button"]')
+```
+### WRONG: No Test Isolation
+```typescript
+// Tests depend on each other
+test('creates user', () => { /* ... */ })
+test('updates same user', () => { /* depends on previous test */ })
+```
+### CORRECT: Independent Tests
+```typescript
+// Each test sets up its own data
+test('creates user', () => {
+  const user = createTestUser()
+  // Test logic
+})
+test('updates user', () => {
+  const user = createTestUser()
+  // Update logic
+})
+```
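The example above calls `createTestUser` without defining it. One possible shape for such a factory is sketched below; the `TestUser` fields and the ID scheme are assumptions made for illustration.

```typescript
// Hypothetical factory: `createTestUser` is not defined in the original text.
interface TestUser {
  id: string;
  name: string;
  email: string;
}

let nextUserId = 0;

// Returns a fresh object on every call so tests never share mutable state.
function createTestUser(overrides: Partial<TestUser> = {}): TestUser {
  nextUserId += 1;
  return {
    id: `user-${nextUserId}`,
    name: "Test User",
    email: `user-${nextUserId}@example.test`,
    ...overrides, // let each test set only the fields it cares about
  };
}
```

Because every call builds a new object with a new ID, one test mutating its user cannot affect another test.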
+## Continuous Testing
+### Watch Mode During Development
+```bash
+npm test -- --watch
+# Tests run automatically on file changes
+```
+### Pre-Commit Hook
+```bash
+# Runs before every commit
+npm test && npm run lint
+```
+### CI/CD Integration
+```yaml
+# GitHub Actions
+- name: Run Tests
+  run: npm test -- --coverage
+- name: Upload Coverage
+  uses: codecov/codecov-action@v3
+```
+## Best Practices
+1. **Write Tests First** - Always TDD
+2. **One Assert Per Test** - Focus on single behavior
+3. **Descriptive Test Names** - Explain what's tested
+4. **Arrange-Act-Assert** - Clear test structure
+5. **Mock External Dependencies** - Isolate unit tests
+6. **Test Edge Cases** - Null, undefined, empty, large
+7. **Test Error Paths** - Not just happy paths
+8. **Keep Tests Fast** - Unit tests < 50ms each
+9. **Clean Up After Tests** - No side effects
+10. **Review Coverage Reports** - Identify gaps
+## Success Metrics
+- 80%+ code coverage achieved
+- All tests passing (green)
+- No skipped or disabled tests
+- Fast test execution (< 30s for unit tests)
+- E2E tests cover critical user flows
+- Tests catch bugs before production
+---
+**Remember**: Tests are not optional. They are the safety net that enables confident refactoring, rapid development, and production reliability.