npm - codex-genesis-harness - Versions diffs - 0.1.7 → 0.1.8 - Mend

codex-genesis-harness 0.1.7 → 0.1.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (93) hide show

package/.codebase/COMPRESSED_CONTEXT.md +80 -0
package/.codebase/CURRENT_STATE.md +37 -11
package/.codebase/DEPENDENCY_GRAPH.md +14 -1
package/.codebase/IMPLEMENTATION_HANDOFF.md +34 -336
package/.codebase/KNOWN_PROBLEMS.md +54 -3
package/.codebase/MODULE_INDEX.md +8 -0
package/.codebase/PIPELINE_FLOW.md +7 -5
package/.codebase/RECOVERY_POINTS.md +17 -78
package/.codebase/TECH_DEBT.md +6 -0
package/.codebase/TEST_MATRIX.md +4 -3
package/.codebase/VISUAL_GRAPH.md +127 -0
package/.codebase/context-policy.json +68 -0
package/.codebase/memories/lessons_learned.md +21 -0
package/.codebase/memories/preferences.md +17 -0
package/.codebase/state.json +45 -24
package/.codex/skills/genesis-architecture/SKILL.md +5 -0
package/.codex/skills/genesis-debug-guide/SKILL.md +10 -4
package/.codex/skills/genesis-docs-automation/SKILL.md +52 -973
package/.codex/skills/genesis-executing-plans/SKILL.md +54 -0
package/.codex/skills/genesis-executing-plans/agents/openai.yaml +6 -0
package/.codex/skills/genesis-executing-plans/checklists/.gitkeep +0 -0
package/.codex/skills/genesis-executing-plans/examples/.gitkeep +0 -0
package/.codex/skills/genesis-executing-plans/templates/.gitkeep +0 -0
package/.codex/skills/genesis-harness/SKILL.md +64 -1385
package/.codex/skills/genesis-harness/scripts/check-docs-sync.sh +3 -3
package/.codex/skills/genesis-harness/scripts/init-planning.sh +1 -1
package/.codex/skills/genesis-new-design/SKILL.md +4 -1
package/.codex/skills/genesis-new-design/agents/openai.yaml +2 -0
package/.codex/skills/genesis-observability-automation/SKILL.md +69 -303
package/.codex/skills/genesis-observability-automation/references/common-mistakes-and-recovery.md +84 -0
package/.codex/skills/genesis-observability-automation/references/workflow-phases.md +78 -0
package/.codex/skills/genesis-performance-profiling/SKILL.md +1 -22
package/.codex/skills/genesis-performance-profiling/agents/openai.yaml +1 -1
package/.codex/skills/genesis-planning/SKILL.md +6 -1
package/.codex/skills/genesis-release/SKILL.md +5 -0
package/.codex/skills/genesis-research-first/SKILL.md +6 -0
package/.codex/skills/genesis-spec-propagation/SKILL.md +52 -504
package/.codex/skills/genesis-test-driven-development/SKILL.md +55 -0
package/.codex/skills/genesis-test-driven-development/agents/openai.yaml +6 -0
package/.codex/skills/genesis-test-driven-development/checklists/.gitkeep +0 -0
package/.codex/skills/genesis-test-driven-development/examples/.gitkeep +0 -0
package/.codex/skills/genesis-test-driven-development/templates/.gitkeep +0 -0
package/.codex/skills/genesis-upgrade-design/SKILL.md +4 -2
package/.codex/skills/genesis-upgrade-design/agents/openai.yaml +2 -0
package/.codex/skills/genesis-using-git-worktrees/SKILL.md +54 -0
package/.codex/skills/genesis-using-git-worktrees/agents/openai.yaml +6 -0
package/.codex/skills/genesis-using-git-worktrees/checklists/.gitkeep +0 -0
package/.codex/skills/genesis-using-git-worktrees/examples/.gitkeep +0 -0
package/.codex/skills/genesis-using-git-worktrees/templates/.gitkeep +0 -0
package/.codex/skills/genesis-verification-before-completion/SKILL.md +53 -0
package/.codex/skills/genesis-verification-before-completion/agents/openai.yaml +6 -0
package/.codex/skills/genesis-verification-before-completion/checklists/.gitkeep +0 -0
package/.codex/skills/genesis-verification-before-completion/examples/.gitkeep +0 -0
package/.codex/skills/genesis-verification-before-completion/templates/.gitkeep +0 -0
package/.codex/skills/spec-impact-engine/SKILL.md +77 -500
package/.codex/skills/spec-impact-engine/checklists/checklist.md +10 -0
package/.codex-plugin/plugin.json +3 -4
package/CHANGELOG.md +4 -1
package/README.EN.md +32 -17
package/README.VI.md +35 -19
package/README.md +48 -10
package/VERSION +1 -1
package/bin/genesis-harness.js +735 -5
package/contracts/features/registry-schema.json +15 -0
package/contracts/observability/agent-run-schema.json +34 -0
package/contracts/observability/failure-schema.json +35 -0
package/contracts/ui/auth/login-screen-contract.json +43 -0
package/features/REGISTRY.md +63 -0
package/features/SCOPE-template.md +65 -0
package/fixtures/planning/MOCKUP_PROMPT_TEMPLATE.md +16 -0
package/observability/agent-runs/sample-run.json +13 -0
package/observability/decision-logs/sample-decision.md +43 -0
package/observability/failures/sample-failure.json +12 -0
package/package.json +9 -3
package/playwright/e2e/app-template.spec.js +37 -0
package/playwright/e2e/auth/login-screen.spec.js +65 -0
package/playwright/e2e/web-template.spec.js +28 -0
package/scripts/check-scope.sh +100 -0
package/scripts/cold-start-check.js +133 -0
package/scripts/install.sh +4 -0
package/scripts/prompt_sentinel.js +35 -4
package/scripts/run-evals.sh +119 -3
package/scripts/scratch_parser.js +49 -0
package/scripts/spec_visual_sync.js +1 -1
package/scripts/test_generator.js +2 -2
package/scripts/uninstall.sh +4 -0
package/scripts/verify.sh +16 -1
package/tests/integration/cli-smoke.test.js +103 -0
package/tests/unit/feature_registry.test.js +152 -0
package/tests/unit/prompt_sentinel.test.js +1 -1
package/tests/unit/spec_visual_sync.test.js +1 -1
package/tests/unit/test_generator.test.js +1 -1
package/playwright/e2e/e2e-template.md +0 -4

package/.codex/skills/genesis-observability-automation/references/workflow-phases.md ADDED Viewed

@@ -0,0 +1,78 @@
+# Observability — Phase-by-Phase Workflow
+Tài liệu chi tiết từng giai đoạn triển khai observability.
+Được gọi bởi `genesis-observability-automation/SKILL.md` → `## Workflow Detail: Phase-by-Phase Execution`.
+---
+## Phase 1: Observability Architecture Generation
+**Goal**: Thiết kế và tài liệu hoá toàn bộ topology observability trước khi viết bất kỳ config nào.
+### Architecture components
+| Pillar | Component | Purpose |
+|--------|-----------|---------|
+| Metrics | Prometheus / Datadog Agent | Scrape and store numeric time-series |
+| Metrics | Grafana / Datadog Dashboards | Visualize and alert on metrics |
+| Logs | Structured logging library | Produce machine-readable log events |
+| Logs | Log aggregator (Loki/ELK/CloudWatch) | Collect and index logs |
+| Logs | Kibana/Grafana/Datadog | Search and visualize logs |
+| Traces | OpenTelemetry SDK | Instrument service for tracing |
+| Traces | Jaeger/Zipkin/Datadog APM | Collect and visualize traces |
+### Service instrumentation by language
+- **Node.js**: `prom-client` (metrics), `winston`/`pino` (structured logs), `@opentelemetry/sdk-node` (traces).
+- **Python**: `prometheus_client` (metrics), `structlog`/`python-json-logger` (logs), `opentelemetry-sdk` (traces).
+- **Go**: `prometheus/client_golang` (metrics), `zap`/`logrus` (logs), `go.opentelemetry.io/otel` (traces).
+---
+## Phase 2: Dashboard Generation
+**Required panels (RED metrics)**:
+- **Rate**: Requests per second (total and per endpoint)
+- **Errors**: Error rate percentage (4xx and 5xx separately)
+- **Duration**: Response time as histogram with p50, p95, p99
+**SATURATION metrics**:
+- **CPU**: Process CPU utilization %
+- **Memory**: Heap and RSS memory
+- **Connection pool**: Active connections vs. pool limit
+- **Queue depth**: Job queue length (background workers)
+See `templates/monitoring-dashboard-template.md` for complete Grafana JSON scaffold.
+---
+## Phase 3: Alerting Policy Generation
+**SLO-based alert thresholds (99.9% availability = 43.8 min/month error budget)**:
+```
+Fast burn (1h):   error_rate > 2%   → P1 page immediately
+Medium burn (6h): error_rate > 0.5% → P2 business hours
+Slow burn (3d):   error_rate > 0.1% → Slack + ticket
+```
+See `templates/alerting-policy-template.md` for complete Prometheus alerting rules.
+---
+## Phase 4: Health Check Automation
+**Standard health endpoint specification**:
+- `GET /health`     → 200 always (load balancer basic routing)
+- `GET /readiness`  → 200 if dependencies healthy, 503 if not
+- `GET /liveness`   → 200 if process alive + event loop not stuck
+- `GET /metrics`    → Prometheus text format
+---
+## Phase 5: Incident Response Runbook Generation
+**Runbook structure requirements** (every runbook must have):
+Severity definition → Detection signals → Triage steps (with commands) → Escalation triggers → Resolution steps → Rollback procedure → Communication templates → Post-mortem checklist.
+See `playbooks/incident-triage-playbook.md` for complete P0/P1/P2/P3 runbooks.

package/.codex/skills/genesis-performance-profiling/SKILL.md CHANGED Viewed

@@ -486,25 +486,4 @@ Default thresholds:
 **Goal**: Produce a prioritized, actionable list of optimizations ranked by expected impact vs implementation effort.
-**Recommendation template:**
-```markdown
-### [BOTTLENECK-001] Slow database query on /api/users (N+1 pattern)
-**Evidence**: EXPLAIN ANALYZE shows sequential scan on `users` table (150,000 rows).
-DB query time = 145 ms (81% of total response time).
-Identified via: slow query log + pg_stat_statements.
-**Recommended fix**: Add composite index on (tenant_id, status, created_at).
-Fix N+1 ORM query pattern: use eager loading (`include: ['profile']`).
-**Estimated impact**: HIGH — Expected p95 improvement: 100–140 ms (55–78% reduction).
-**Implementation complexity**: EASY — Index creation: 1 migration file.
-ORM fix: 3 lines of code change.
-**Validation method**: Re-run baseline after migration. Confirm p95 ≤ 80 ms.
-Run regression-detection phase against new baseline.
-**Risk**: Index creation on large table requires `CREATE INDEX CONCURRENTLY` to avoid table lock.
-```
+Use `templates/performance-report-template.md` for recommendation shape and include evidence, fix, impact, complexity, validation, and risk for each bottleneck.

package/.codex/skills/genesis-performance-profiling/agents/openai.yaml CHANGED Viewed

@@ -3,4 +3,4 @@ interface:
   short_description: "Automate performance baseline, profiling, and load testing"
   default_prompt: "Use $genesis-performance-profiling to establish baseline and identify bottlenecks."
 policy:
-  allow_implicit_invocation: false
+  allow_implicit_invocation: true

package/.codex/skills/genesis-planning/SKILL.md CHANGED Viewed

@@ -44,6 +44,11 @@ Planning implementation before tests, omitting fixtures, and leaving verificatio
 ## Recovery workflow
 If a plan becomes invalid, update the failing test and fixture first, then revise phases.
+## MCP Automation Requirements
+To ensure plans are grounded in long-term project context and not just short-term conversation history, you **MUST** use the following MCP server:
+1. **`@modelcontextprotocol/server-memory`**: Use this MCP tool to automatically query the Knowledge Graph for past bugs, developer preferences, and architectural rules *before* drafting the implementation phases. Any new plan must respect the persistent memory graph.
 ---
 ## 🏗️ 5-Phase MVP Roadmap Standards
@@ -58,7 +63,7 @@ When decomposing large-scale features or initializing new project modules, you m
    - Auto-generate test skeletons in `tests/integration/` (RED state).
 3. **Phase 3: Logic Implementation & Self-Healing** (Template: `phase-3-features.md`)
    - Implement minimal code handlers, controllers, or service endpoints.
-   - Execute closed-loop Verify-Fix cycles autonomously until all tests turn GREEN.
+   - Execute closed-loop Verify-Fix cycles autonomously until all tests turn GREEN. **CRITICAL**: Do NOT just run `npm test`. You MUST run `npx genesis-harness heal "<test-command>"`. If the test fails, this command will output an `[AGENT_DIRECTIVE]`. You must read the directive and automatically fix the code until it passes without stopping.
 4. **Phase 4: Multi-Agent & External Integrations** (Template: `phase-4-integrations.md`)
    - Wire up external components, API sync triggers, or media providers.
 5. **Phase 5: E2E Visual QA & Release Readiness** (Template: `phase-5-readiness.md`)

package/.codex/skills/genesis-release/SKILL.md CHANGED Viewed

@@ -44,6 +44,11 @@ Publishing cache files, missing executable scripts, and omitting npm 2FA/token r
 ## Recovery workflow
 If publish fails, capture error, update known problems, do not change version until root cause is fixed.
+## MCP Automation Requirements
+To ensure zero human error in releases and tags, you **MUST** use the following MCP server:
+1. **`@modelcontextprotocol/server-github`**: Use this MCP tool to automatically retrieve the list of closed Pull Requests since the last release tag to draft the `CHANGELOG.md` and Release Notes. You must also use it to automatically create the Git Tag and GitHub Release via the API. Do NOT ask the user to do this manually in the browser.
 ---
 ## 🚀 Automated Release & Deployment Orchestration

package/.codex/skills/genesis-research-first/SKILL.md CHANGED Viewed

@@ -84,6 +84,12 @@ Format:
   - Next Steps: What to verify?
 ```
+## MCP Automation Requirements
+To prevent hallucinations and avoid manual terminal scraping, you **MUST** use the following MCP servers during research:
+1. **`@modelcontextprotocol/server-fetch`**: Use this MCP tool to natively fetch and read the contents of external documentation URLs or Stack Overflow threads. Do NOT guess the API structure.
+2. **`@modelcontextprotocol/server-github`**: Use this MCP tool to search for existing issues, pull requests, or trending repositories related to the task.
 ## Output
 Each research produces: