npm - @jaguilar87/gaia-ops - Versions diffs - 4.0.0 → 4.4.0-beta.2 - Mend

@jaguilar87/gaia-ops 4.0.0 → 4.4.0-beta.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (241) hide show

package/.claude-plugin/marketplace.json +32 -0
package/.claude-plugin/plugin.json +17 -0
package/ARCHITECTURE.md +320 -0
package/CHANGELOG.md +15 -0
package/CODE_OF_CONDUCT.md +11 -0
package/CONTRIBUTING.md +146 -0
package/INSTALL.md +33 -34
package/README.md +30 -12
package/SECURITY.md +47 -0
package/agents/cloud-troubleshooter.md +1 -1
package/agents/devops-developer.md +1 -2
package/agents/gaia.md +1 -2
package/agents/gitops-operator.md +1 -2
package/agents/speckit-planner.md +29 -16
package/agents/terraform-architect.md +1 -2
package/bin/README.md +6 -6
package/bin/gaia-cleanup.js +296 -2
package/bin/gaia-doctor.js +12 -12
package/bin/gaia-history.js +20 -24
package/bin/gaia-metrics.js +494 -63
package/bin/gaia-scan +14 -0
package/bin/gaia-scan.py +640 -0
package/bin/gaia-skills-diagnose.js +3 -3
package/bin/gaia-status.js +82 -40
package/bin/gaia-update.js +3 -3
package/bin/pre-publish-validate.js +112 -10
package/commands/README.md +11 -52
package/commands/scan-project.md +67 -0
package/commands/speckit.add-task.md +4 -4
package/commands/speckit.analyze-task.md +3 -3
package/commands/speckit.init.md +14 -14
package/commands/speckit.plan.md +8 -34
package/commands/speckit.tasks.md +14 -12
package/config/README.md +4 -0
package/config/context-contracts.aws.json +20 -9
package/config/context-contracts.gcp.json +17 -11
package/config/context-contracts.json +43 -26
package/config/surface-routing.json +189 -0
package/config/universal-rules.json +8 -32
package/hooks/README.md +11 -7
package/hooks/adapters/__init__.py +52 -0
package/hooks/adapters/base.py +168 -0
package/hooks/adapters/channel.py +42 -0
package/hooks/adapters/claude_code.py +500 -0
package/hooks/adapters/types.py +193 -0
package/hooks/adapters/utils.py +25 -0
package/hooks/hooks.json +52 -0
package/hooks/modules/README.md +60 -25
package/hooks/modules/agents/__init__.py +28 -4
package/hooks/modules/agents/contract_validator.py +304 -0
package/hooks/modules/agents/response_contract.py +457 -0
package/hooks/modules/agents/task_info_builder.py +74 -0
package/hooks/modules/agents/transcript_reader.py +165 -0
package/hooks/modules/audit/logger.py +2 -26
package/hooks/modules/audit/metrics.py +14 -8
package/hooks/modules/audit/workflow_auditor.py +258 -0
package/hooks/modules/audit/workflow_recorder.py +266 -0
package/hooks/modules/context/__init__.py +3 -0
package/hooks/modules/context/context_freshness.py +145 -0
package/hooks/modules/context/context_injector.py +370 -0
package/hooks/modules/context/context_writer.py +96 -6
package/hooks/modules/context/contracts_loader.py +164 -0
package/hooks/modules/core/__init__.py +9 -1
package/hooks/modules/core/hook_entry.py +77 -0
package/hooks/modules/core/state.py +10 -1
package/hooks/modules/core/stdin.py +24 -0
package/hooks/modules/memory/__init__.py +8 -0
package/hooks/modules/memory/episode_writer.py +228 -0
package/hooks/modules/scanning/__init__.py +8 -0
package/hooks/modules/scanning/scan_trigger.py +96 -0
package/hooks/modules/security/__init__.py +61 -15
package/hooks/modules/security/approval_cleanup.py +57 -0
package/hooks/modules/security/approval_constants.py +11 -14
package/hooks/modules/security/approval_grants.py +712 -0
package/hooks/modules/security/approval_messages.py +81 -0
package/hooks/modules/security/approval_scopes.py +153 -0
package/hooks/modules/security/blocked_commands.py +389 -95
package/hooks/modules/security/command_semantics.py +134 -0
package/hooks/modules/security/mutative_verbs.py +707 -0
package/hooks/modules/security/prompt_validator.py +40 -0
package/hooks/modules/security/tiers.py +57 -31
package/hooks/modules/session/__init__.py +10 -0
package/hooks/modules/session/session_context_writer.py +106 -0
package/hooks/modules/session/session_event_injector.py +174 -0
package/hooks/modules/session/session_manager.py +31 -0
package/hooks/modules/tools/bash_validator.py +158 -103
package/hooks/modules/tools/cloud_pipe_validator.py +16 -11
package/hooks/modules/tools/hook_response.py +51 -0
package/hooks/modules/tools/task_validator.py +119 -85
package/hooks/post_tool_use.py +64 -205
package/hooks/pre_tool_use.py +153 -554
package/hooks/session_start.py +105 -0
package/hooks/stop_hook.py +72 -0
package/hooks/subagent_start.py +88 -0
package/hooks/subagent_stop.py +202 -773
package/hooks/task_completed.py +71 -0
package/package.json +13 -5
package/plugins/gaia-ops/.claude-plugin/plugin.json +8 -0
package/plugins/gaia-security/.claude-plugin/plugin.json +6 -0
package/pyproject.toml +29 -0
package/skills/README.md +16 -15
package/skills/agent-protocol/SKILL.md +178 -15
package/skills/approval/SKILL.md +29 -18
package/skills/approval/examples.md +2 -0
package/skills/approval/reference.md +7 -0
package/skills/command-execution/SKILL.md +3 -1
package/skills/command-execution/reference.md +1 -0
package/skills/context-updater/SKILL.md +28 -11
package/skills/developer-patterns/SKILL.md +2 -1
package/skills/execution/SKILL.md +13 -13
package/skills/fast-queries/SKILL.md +2 -1
package/skills/gaia-patterns/SKILL.md +14 -6
package/skills/git-conventions/SKILL.md +3 -2
package/skills/gitops-patterns/SKILL.md +2 -1
package/skills/investigation/SKILL.md +61 -6
package/skills/orchestrator-approval/SKILL.md +84 -0
package/skills/output-format/SKILL.md +10 -1
package/skills/security-tiers/SKILL.md +19 -3
package/skills/security-tiers/destructive-commands-reference.md +623 -0
package/skills/security-tiers/reference.md +4 -0
package/skills/skill-creation/SKILL.md +2 -1
package/skills/specification/SKILL.md +177 -0
package/skills/speckit-workflow/SKILL.md +139 -49
package/skills/speckit-workflow/reference.md +73 -57
package/skills/terraform-patterns/SKILL.md +2 -1
package/speckit/README.md +97 -132
package/speckit/templates/plan-template.md +10 -16
package/speckit/templates/tasks-template.md +241 -375
package/templates/CLAUDE.template.md +48 -72
package/templates/README.md +10 -10
package/templates/governance.template.md +1 -1
package/templates/settings.template.json +89 -638
package/tools/context/README.md +46 -15
package/tools/context/__init__.py +11 -0
package/tools/context/_paths.py +20 -0
package/tools/context/context_provider.py +164 -30
package/tools/context/context_section_reader.py +34 -16
package/tools/context/pending_updates.py +1 -1
package/tools/context/surface_router.py +278 -0
package/tools/memory/episodic.py +26 -4
package/tools/replay/__init__.py +33 -0
package/tools/replay/cli.py +355 -0
package/tools/replay/extractor.py +457 -0
package/tools/replay/reporter.py +258 -0
package/tools/replay/routing_simulator.py +335 -0
package/tools/replay/runner.py +506 -0
package/tools/replay/skills_mapper.py +263 -0
package/tools/scan/__init__.py +21 -0
package/tools/scan/config.py +226 -0
package/tools/scan/merge.py +212 -0
package/tools/scan/orchestrator.py +392 -0
package/tools/scan/registry.py +127 -0
package/tools/scan/scanners/__init__.py +18 -0
package/tools/scan/scanners/base.py +129 -0
package/tools/scan/scanners/environment.py +324 -0
package/tools/scan/scanners/git.py +453 -0
package/tools/scan/scanners/infrastructure.py +414 -0
package/tools/scan/scanners/orchestration.py +560 -0
package/tools/scan/scanners/stack.py +786 -0
package/tools/scan/scanners/tools.py +213 -0
package/tools/scan/setup.py +703 -0
package/tools/scan/tests/__init__.py +1 -0
package/tools/scan/tests/conftest.py +796 -0
package/tools/scan/tests/test_environment.py +323 -0
package/tools/scan/tests/test_git.py +339 -0
package/tools/scan/tests/test_infrastructure.py +336 -0
package/tools/scan/tests/test_integration.py +920 -0
package/tools/scan/tests/test_merge.py +269 -0
package/tools/scan/tests/test_orchestration.py +304 -0
package/tools/scan/tests/test_stack.py +518 -0
package/tools/scan/tests/test_tools.py +409 -0
package/tools/scan/ui.py +368 -0
package/tools/scan/verify.py +284 -0
package/tools/validation/README.md +6 -11
package/bin/gaia-init.js +0 -1777
package/commands/speckit.implement.md +0 -96
package/commands/speckit.specify.md +0 -177
package/hooks/modules/security/safe_commands.py +0 -391
package/hooks/modules/workflow/__init__.py +0 -5
package/tests/README.md +0 -94
package/tests/conftest.py +0 -195
package/tests/fixtures/project-context.aws.json +0 -53
package/tests/fixtures/project-context.full.json +0 -165
package/tests/fixtures/project-context.gcp.json +0 -53
package/tests/hooks/__init__.py +0 -1
package/tests/hooks/modules/context/__init__.py +0 -0
package/tests/hooks/modules/context/test_context_writer.py +0 -594
package/tests/hooks/modules/core/__init__.py +0 -0
package/tests/hooks/modules/core/test_paths.py +0 -235
package/tests/hooks/modules/core/test_state.py +0 -332
package/tests/hooks/modules/security/__init__.py +0 -0
package/tests/hooks/modules/security/test_blocked_commands.py +0 -290
package/tests/hooks/modules/security/test_gitops_validator.py +0 -357
package/tests/hooks/modules/security/test_safe_commands.py +0 -383
package/tests/hooks/modules/security/test_tiers.py +0 -230
package/tests/hooks/modules/tools/__init__.py +0 -0
package/tests/hooks/modules/tools/test_bash_validator.py +0 -243
package/tests/hooks/modules/tools/test_shell_parser.py +0 -290
package/tests/hooks/modules/tools/test_task_validator.py +0 -363
package/tests/hooks/test_subagent_stop_discovery.py +0 -124
package/tests/integration/__init__.py +0 -0
package/tests/integration/test_context_enrichment.py +0 -647
package/tests/integration/test_subagent_lifecycle.py +0 -783
package/tests/integration/test_subagent_stop_e2e.py +0 -639
package/tests/layer1_prompt_regression/test_agent_frontmatter.py +0 -152
package/tests/layer1_prompt_regression/test_agent_prompt_content.py +0 -170
package/tests/layer1_prompt_regression/test_context_contracts.py +0 -139
package/tests/layer1_prompt_regression/test_routing_table.py +0 -106
package/tests/layer1_prompt_regression/test_security_tier_consistency.py +0 -117
package/tests/layer1_prompt_regression/test_skill_content_rules.py +0 -148
package/tests/layer1_prompt_regression/test_skills_cross_reference.py +0 -168
package/tests/layer2_llm_evaluation/conftest.py +0 -6
package/tests/layer2_llm_evaluation/helpers/promptfoo_runner.py +0 -132
package/tests/layer2_llm_evaluation/test_agent_behavior.py +0 -198
package/tests/layer3_e2e/conftest.py +0 -6
package/tests/layer3_e2e/helpers/claude_headless.py +0 -169
package/tests/layer3_e2e/test_hook_lifecycle.py +0 -160
package/tests/layer3_e2e/test_installation_smoke.py +0 -117
package/tests/performance/__init__.py +0 -1
package/tests/performance/test_context_performance.py +0 -855
package/tests/promptfoo.yaml +0 -126
package/tests/system/__init__.py +0 -0
package/tests/system/permissions_helpers.py +0 -318
package/tests/system/test_agent_definitions.py +0 -179
package/tests/system/test_configuration_files.py +0 -121
package/tests/system/test_directory_structure.py +0 -221
package/tests/system/test_permissions_system.py +0 -1059
package/tests/system/test_schema_compatibility.py +0 -106
package/tests/test_cross_layer_consistency.py +0 -459
package/tests/tools/__init__.py +0 -0
package/tests/tools/test_context_provider.py +0 -208
package/tests/tools/test_deep_merge.py +0 -146
package/tests/tools/test_episodic.py +0 -463
package/tests/tools/test_pending_updates.py +0 -549
package/tests/tools/test_review_engine.py +0 -203
package/tools/context/benchmark_context.py +0 -389
package/tools/context/context_compressor.py +0 -444
package/tools/context/context_lazy_loader.py +0 -402
package/tools/context/context_selector.py +0 -451
package/tools/validation/skills_report.md +0 -162
/package/{tests/hooks/modules/__init__.py → speckit/scripts/.gitkeep} +0 -0

package/.claude-plugin/marketplace.json ADDED Viewed

@@ -0,0 +1,32 @@
+{
+  "marketplace": {
+    "name": "gaia-ops-marketplace",
+    "description": "Security, governance, and multi-agent orchestration for AI coding",
+    "owner": {
+      "name": "jaguilar87",
+      "email": "jaguilar1897@gmail.com"
+    },
+    "plugins": [
+      {
+        "name": "gaia-security",
+        "description": "Security hooks, approval system, audit logging, metrics, and anomaly detection for Claude Code",
+        "version": "4.4.0-beta.2",
+        "source": "./plugins/gaia-security",
+        "category": "security",
+        "tags": ["security", "hooks", "audit", "metrics", "approval"],
+        "dependencies": [],
+        "includes": ["hooks/pre_tool_use.py", "hooks/post_tool_use.py", "hooks/subagent_stop.py", "hooks/stop_hook.py", "hooks/modules/security/", "hooks/modules/tools/", "hooks/adapters/", "config/"]
+      },
+      {
+        "name": "gaia-ops",
+        "description": "Complete DevOps orchestration system: agents, scanning, context injection, episodic memory, speckit planning, and CLI tools \u2014 includes gaia-security",
+        "version": "4.4.0-beta.2",
+        "source": "./plugins/gaia-ops",
+        "category": "devops",
+        "tags": ["devops", "agents", "scanning", "orchestration", "speckit", "ops"],
+        "dependencies": [],
+        "includes": ["."]
+      }
+    ]
+  }
+}

package/.claude-plugin/plugin.json ADDED Viewed

@@ -0,0 +1,17 @@
+{
+  "name": "gaia-ops",
+  "version": "4.4.0-beta.2",
+  "description": "Security-first orchestrator with specialized agents, hooks, and governance for AI coding",
+  "author": {
+    "name": "jaguilar87",
+    "url": "https://github.com/metraton/gaia-ops"
+  },
+  "repository": "https://github.com/metraton/gaia-ops",
+  "license": "MIT",
+  "keywords": ["security", "devops", "orchestrator", "governance", "terraform", "kubernetes", "gitops"],
+  "engines": { "claude-code": ">=2.1.0" },
+  "categories": ["devops", "security", "orchestration"],
+  "commands": "./commands/",
+  "agents": "./agents/",
+  "skills": "./skills/"
+}

package/ARCHITECTURE.md ADDED Viewed

@@ -0,0 +1,320 @@
+# Architecture
+## What is gaia-ops?
+gaia-ops is an orchestration system for Claude Code agents. It turns a single Claude Code session into a coordinated multi-agent system with security enforcement, context injection, surface-based routing, episodic memory, and deterministic response contracts.
+The package is published as `@jaguilar87/gaia-ops` on npm and installed into a project's `.claude/` directory via symlinks.
+## Core Concepts
+| Concept | Definition |
+|---------|-----------|
+| **Agent** | A Markdown file in `agents/` defining identity, scope, skills, and delegation rules |
+| **Skill** | Injected procedural knowledge (in `skills/`) -- the HOW for agents |
+| **Hook** | Python scripts that intercept tool calls before and after execution |
+| **Tool** | Python modules in `tools/` providing context assembly, memory, and validation |
+| **Config** | JSON files in `config/` defining contracts, rules, surface routing, and security |
+| **Orchestrator** | The root `CLAUDE.md` that routes user requests to the correct agent |
+## Runtime Flow
+```
+User request
+    |
+    v
+Orchestrator (CLAUDE.md)
+    |  Routes by surface classification
+    v
+pre_tool_use.py  (PreToolUse hook)
+    |  1. Inject project-context into agent prompt
+    |  2. Inject session events
+    |  3. Validate Bash commands (security gate)
+    |  4. Validate Task/Agent invocations
+    v
+Agent executes
+    |  Uses tools, follows skills, emits AGENT_STATUS
+    v
+subagent_stop.py  (SubagentStop hook)
+    |  1. Read transcript, extract task description
+    |  2. Capture workflow metrics
+    |  3. Validate response contract
+    |  4. Detect anomalies
+    |  5. Store episodic memory
+    |  6. Process CONTEXT_UPDATE blocks
+    v
+Orchestrator processes AGENT_STATUS
+    |  COMPLETE -> summarize to user
+    |  PENDING_APPROVAL -> get approval -> resume
+    |  NEEDS_INPUT -> ask user -> resume
+    |  BLOCKED -> report blocker
+```
+## Hook Pipeline: pre_tool_use.py
+Entry point for all Bash and Task/Agent tool validation. With `Bash(*)` in the settings.json allow list, the hook is the sole security gate.
+### Bash Command Validation (BashValidator)
+Order is short-circuit -- first match wins:
+```
+1. blocked_commands.py    --> permanently denied patterns (exit 2)
+2. Claude footer strip    --> auto-remove Co-Authored-By (transparent updatedInput)
+3. Commit message check   --> conventional commits format validation
+4. cloud_pipe_validator   --> block pipes/redirects/chains on cloud CLIs (exit 0, corrective)
+5. mutative_verbs.py      --> scan tokens 1-5 for MUTATIVE verbs
+   |                          If mutative + no active grant -> generate nonce, block
+   |                          If mutative + active grant -> allow (T3)
+   |                          If not mutative -> safe by elimination (T0)
+6. gitops_validator       --> GitOps policy for kubectl/helm/flux
+```
+### Task/Agent Validation
+```
+1. Response contract guard  --> if pending repair exists, block new tasks until resolved
+2. Context injection        --> context_provider.py assembles payload, injected into prompt
+3. Session events injection --> recent git commits, pushes, file mods added to prompt
+4. Resume validation        --> validate agent ID format, detect approval nonces
+5. TaskValidator            --> validate agent name, check available agents
+```
+## Agent Completion Pipeline: subagent_stop.py
+Fires after every agent tool completes:
+```
+1. Consume approval file    --> delete pending approval if matches agent
+2. Capture workflow metrics  --> duration, exit code, plan status -> metrics.jsonl
+3. Validate response contract
+   |  Parse AGENT_STATUS block (plan_status, agent_id, pending_steps, next_action)
+   |  Parse EVIDENCE_REPORT block (7 required fields)
+   |  Parse CONSOLIDATION_REPORT if multi-surface task
+   |  If invalid -> save pending-repair.json for pre_tool_use guard
+   |  If valid -> clear pending repair
+4. Detect anomalies          --> execution failures, consecutive failures
+   |  If anomalies found -> create needs_analysis.flag for Gaia
+5. Capture episodic memory   --> store episode via tools/memory/episodic.py
+6. Process context updates   --> apply CONTEXT_UPDATE blocks via context_writer.py
+```
+## Surface Routing: surface_router.py
+Classifies user tasks into surfaces using signal matching against `config/surface-routing.json`.
+| Surface | Primary Agent | Typical Signals |
+|---------|--------------|-----------------|
+| `live_runtime` | cloud-troubleshooter | pods, services, logs, kubectl, gcloud |
+| `gitops_desired_state` | gitops-operator | manifests, Flux, Helm, Kustomize |
+| `terraform_iac` | terraform-architect | Terraform, Terragrunt, IAM, modules |
+| `app_ci_tooling` | devops-developer | CI/CD, Docker, package tooling |
+| `planning_specs` | speckit-planner | specs, plans, task breakdowns |
+| `gaia_system` | gaia | hooks, skills, agents/, CLAUDE.md |
+**Classification algorithm:**
+1. Normalize task text
+2. Score each surface by keyword (1.0), command (1.5), and artifact (1.0) matches
+3. Keep surfaces with score >= 1.0 and >= 55% of top score
+4. If no match and current agent maps to a surface, use agent-fallback (score 0.2)
+5. If still no match, dispatch reconnaissance agent
+**Investigation brief** is generated per agent from routing results. It contains role assignment (primary/cross_check/adjacent), required evidence fields, stop conditions, and whether a CONSOLIDATION_REPORT is required.
+## Context Injection: context_provider.py
+Assembles the context payload injected into agent prompts by pre_tool_use.py.
+```
+context_provider.py <agent_name> <user_task>
+    |
+    +--> Load project-context.json
+    +--> Detect cloud provider (GCP/AWS)
+    +--> Load base contracts (config/context-contracts.json)
+    +--> Merge cloud overrides (config/cloud/{provider}.json)
+    +--> Extract contracted sections for this agent (read permissions)
+    +--> Load universal rules (config/universal-rules.json)
+    +--> Load relevant episodic memory (similarity match)
+    +--> Classify surfaces (surface_router.py)
+    +--> Build investigation brief (surface_router.py)
+    |
+    v
+    JSON payload:
+      contract:               {sections the agent may read}
+      context_update_contract: {readable/writable section lists}
+      rules:                  {universal + agent-specific rules}
+      surface_routing:        {active surfaces, dispatch mode, confidence}
+      investigation_brief:    {role, required checks, stop conditions}
+      historical_context:     {relevant episodes if any}
+      metadata:               {provider, version, counts}
+```
+## Approval Flow
+Nonce-based T3 approval lifecycle:
+```
+1. Agent attempts dangerous command (e.g., terraform apply)
+2. mutative_verbs.py detects MUTATIVE verb
+3. BashValidator generates 128-bit nonce via generate_nonce()
+4. write_pending_approval() saves pending-{nonce}.json to .claude/cache/approvals/
+5. Hook returns corrective deny (exit 0) with NONCE:{hex} in message
+6. Agent includes NONCE:{hex} in PENDING_APPROVAL status to orchestrator
+7. Orchestrator presents plan to user, asks for approval
+8. User approves -> orchestrator resumes agent with "APPROVE:{nonce}"
+9. pre_tool_use.py detects APPROVE: prefix, calls activate_pending_approval()
+10. Pending grant converted to active grant (TTL 10 min, verb-matched)
+11. Agent retries command -> check_approval_grant() finds active grant -> allowed
+```
+## Response Contract Validation
+Every agent response must end with an AGENT_STATUS block. The contract validator (`hooks/modules/agents/response_contract.py`) enforces:
+- **AGENT_STATUS**: PLAN_STATUS (from 8 valid states), PENDING_STEPS, NEXT_ACTION, AGENT_ID
+- **EVIDENCE_REPORT**: required for all states except APPROVED_EXECUTING. Seven fields: PATTERNS_CHECKED, FILES_CHECKED, COMMANDS_RUN, KEY_OUTPUTS, VERBATIM_OUTPUTS, CROSS_LAYER_IMPACTS, OPEN_GAPS
+- **CONSOLIDATION_REPORT**: required when multi-surface or cross-check. Fields: OWNERSHIP_ASSESSMENT (enum), CONFIRMED_FINDINGS, SUSPECTED_FINDINGS, CONFLICTS, OPEN_GAPS, NEXT_BEST_AGENT
+Invalid responses trigger a repair loop: save pending-repair.json, pre_tool_use guard blocks new tasks, orchestrator must resume the same agent for repair (max 2 attempts before escalation).
+## Adapter Layer
+The adapter layer decouples business logic from CLI-specific protocols. Located at `hooks/adapters/`.
+### Components
+- `types.py` -- Normalized dataclasses (HookEvent, ValidationRequest, ValidationResult, etc.)
+- `base.py` -- Abstract HookAdapter interface
+- `claude_code.py` -- Claude Code adapter (stdin JSON <-> normalized types)
+- `channel.py` -- Distribution channel detection (plugin vs npm)
+### Flow
+```
+Claude Code stdin JSON -> ClaudeCodeAdapter.parse_event() -> normalized HookEvent
+    -> Business logic (unchanged) ->
+ClaudeCodeAdapter.format_validation_response() -> Claude Code stdout JSON
+```
+### Plugin Distribution
+gaia-ops is distributable as a Claude Code plugin via `.claude-plugin/plugin.json`.
+The plugin is auto-discovered by Claude Code -- agents, skills, commands, and hooks
+are loaded from their respective directories.
+See `.claude-plugin/marketplace.json` for the self-hosted marketplace with sub-plugins.
+## Adapter Coupling Points
+The adapter layer connects Claude Code's hook protocol to gaia-ops business logic through 5 coupling points. Each coupling point is a thin entry point that delegates to the adapter for JSON parsing/formatting and to business logic modules for decisions.
+### CP-1: `hooks/pre_tool_use.py` -- Command Validation Entry Point
+| Attribute | Value |
+|-----------|-------|
+| **File** | `hooks/pre_tool_use.py` |
+| **Hook event** | PreToolUse |
+| **What it does** | Security gate for all Bash, Task, and Agent tool invocations. Validates commands (blocked patterns, mutative verbs, nonce-based approval), injects project-context into agent prompts, guards pending contract repairs. |
+| **Adapter methods called** | `ClaudeCodeAdapter.parse_event()`, `ClaudeCodeAdapter.parse_pre_tool_use()`, `ClaudeCodeAdapter.format_validation_response()` |
+| **Business logic modules** | `security/blocked_commands.py`, `security/mutative_verbs.py`, `security/approval_grants.py`, `tools/bash_validator.py`, `tools/task_validator.py`, `agents/response_contract.py`, `context/context_provider.py` |
+### CP-2: `hooks/post_tool_use.py` -- Audit Logging Entry Point
+| Attribute | Value |
+|-----------|-------|
+| **File** | `hooks/post_tool_use.py` |
+| **Hook event** | PostToolUse |
+| **What it does** | Records execution audit logs, detects critical events (git commits, pushes, file modifications), updates active session context. Reads pre-hook state for timing and tier classification. |
+| **Adapter methods called** | `ClaudeCodeAdapter.parse_event()`, `ClaudeCodeAdapter.parse_post_tool_use()` |
+| **Business logic modules** | `audit/logger.py` (`log_execution`), `audit/event_detector.py` (`detect_critical_event`), `core/state.py` (`get_hook_state`, `clear_hook_state`) |
+### CP-3: `hooks/subagent_stop.py` -- Contract Validation + Memory Entry Point
+| Attribute | Value |
+|-----------|-------|
+| **File** | `hooks/subagent_stop.py` |
+| **Hook event** | SubagentStop |
+| **What it does** | Fires after every agent completes. Consumes approval files, captures workflow metrics, validates the response contract (AGENT_STATUS, EVIDENCE_REPORT, CONSOLIDATION_REPORT), detects anomalies, stores episodic memory, and processes CONTEXT_UPDATE blocks. |
+| **Adapter methods called** | `ClaudeCodeAdapter.parse_event()`, `ClaudeCodeAdapter.parse_agent_completion()` |
+| **Business logic modules** | `agents/response_contract.py` (`validate_response_contract`, `save_pending_repair`, `clear_pending_repair`), `tools/memory/episodic.py` (`EpisodicMemory.store_episode`), `context/context_writer.py` (`process_agent_output`) |
+### CP-4: `hooks/modules/tools/hook_response.py` -- Response Formatting
+| Attribute | Value |
+|-----------|-------|
+| **File** | `hooks/modules/tools/hook_response.py` |
+| **Hook event** | (shared utility, used by PreToolUse callers) |
+| **What it does** | Provides `build_hook_permission_response()` -- a shared builder for hookSpecificOutput JSON. Delegates to the adapter's `format_validation_response()` so all permission responses share a single code path. |
+| **Adapter methods called** | `ClaudeCodeAdapter.format_validation_response()` |
+| **Business logic modules** | None (pure formatting bridge) |
+### CP-5: `templates/settings.template.json` / `hooks/hooks.json` -- Hook Configuration
+| Attribute | Value |
+|-----------|-------|
+| **File (npm channel)** | `templates/settings.template.json` -- paths use `.claude/hooks/` prefix |
+| **File (plugin channel)** | `hooks/hooks.json` -- paths use `${CLAUDE_PLUGIN_ROOT}/hooks/` prefix |
+| **What it does** | Maps Claude Code hook events to handler scripts. Defines which events fire which entry points, the tool matchers (Bash, Task, Agent, `*`), and permissions (allow/deny lists). |
+| **Events configured** | PreToolUse, PostToolUse, SubagentStop, SessionStart, Stop, TaskCompleted, SubagentStart (UserPromptSubmit is a static echo in settings.json only) |
+### HookAdapter ABC Contract
+The abstract interface in `hooks/adapters/base.py` defines the adapter contract. Each CLI backend provides a concrete implementation.
+| Method | Signature | Description |
+|--------|-----------|-------------|
+| `parse_event` | `(stdin_data: str) -> HookEvent` | Parse raw stdin JSON into a normalized, CLI-agnostic event |
+| `format_validation_response` | `(result: ValidationResult) -> HookResponse` | Format a validation result for the CLI's permission protocol |
+| `format_completion_response` | `(result: CompletionResult) -> HookResponse` | Format a completion result for SubagentStop |
+| `format_context_response` | `(result: ContextResult) -> HookResponse` | Format a context injection result |
+| `detect_channel` | `() -> DistributionChannel` | Detect whether gaia-ops is running as NPM or PLUGIN |
+Additional abstract methods for P1/P2 events: `adapt_session_start`, `format_bootstrap_response`, `adapt_stop`, `adapt_task_completed`, `adapt_subagent_start`, `format_quality_response`, `format_verification_response`.
+**Invariants:**
+1. Business logic modules NEVER see `HookResponse`. They produce `ValidationResult`, `CompletionResult`, etc.
+2. The adapter NEVER modifies business logic results -- it only translates format.
+3. Adding a new hook event requires ONLY a new adapter method. Zero changes to business logic modules.
+### Adding a New Hook Event
+To add support for a new Claude Code hook event (e.g., a future `PreCompact` event):
+1. **Add enum value** to `HookEventType` in `hooks/adapters/types.py` (already present for all 19 known events).
+2. **Add adapter method** to `ClaudeCodeAdapter` in `hooks/adapters/claude_code.py` -- implement `adapt_<event_name>(raw: dict) -> <ResultType>` and the corresponding `format_<result>_response()` if a new result type is needed.
+3. **Add extract/format methods** for the event type -- the extract method pulls typed data from the raw payload, the format method builds the CLI response JSON.
+4. **Create hook script entry point** -- a new `hooks/<event_name>.py` file that reads stdin, calls `adapter.parse_event()`, delegates to business logic, and writes the response to stdout.
+5. **Add entry to `hooks/hooks.json`** (plugin channel) and `templates/settings.template.json` (npm channel) mapping the event name to the new script.
+**Zero changes to business logic modules required.** The adapter is the only layer that touches CLI-specific JSON.
+### Adding a New CLI Backend
+To support a CLI other than Claude Code (e.g., a hypothetical Cursor or Windsurf integration):
+1. **Subclass `HookAdapter`** from `hooks/adapters/base.py`.
+2. **Implement `parse_event()`** and all `format_*()` methods to translate between the new CLI's JSON protocol and the normalized types in `hooks/adapters/types.py`.
+3. **No changes to business logic or adapter interface.** The same `ValidationResult`, `CompletionResult`, `ContextResult`, etc. flow through unchanged.
+**Business logic modules remain untouched.** They consume and produce normalized types; only the adapter layer changes.
+## Key Files Reference
+| File | Purpose |
+|------|---------|
+| `CLAUDE.md` | Orchestrator identity, routing table, tool restrictions |
+| `hooks/pre_tool_use.py` | PreToolUse hook entry point |
+| `hooks/subagent_stop.py` | SubagentStop hook entry point |
+| `hooks/modules/tools/bash_validator.py` | Bash command security gate |
+| `hooks/modules/tools/task_validator.py` | Task/Agent invocation validator |
+| `hooks/modules/security/blocked_commands.py` | Permanently denied command patterns |
+| `hooks/modules/security/mutative_verbs.py` | CLI-agnostic mutative verb detector |
+| `hooks/modules/security/approval_grants.py` | Nonce grant lifecycle management |
+| `hooks/modules/agents/response_contract.py` | Agent response contract validator |
+| `hooks/modules/context/context_writer.py` | Progressive context enrichment |
+| `tools/context/context_provider.py` | Context payload assembly |
+| `tools/context/surface_router.py` | Surface classification and investigation briefs |
+| `tools/memory/episodic.py` | Episodic memory storage |
+| `config/context-contracts.json` | Agent read/write section permissions |
+| `config/universal-rules.json` | Universal and agent-specific rules |
+| `config/surface-routing.json` | Surface signals and routing config |
+| `agents/*.md` | Agent identity definitions |
+| `skills/*/SKILL.md` | Injected procedural knowledge |
+| `bin/*.js` | CLI tools (gaia-scan, gaia-doctor, gaia-status, etc.) |

package/CHANGELOG.md CHANGED Viewed

@@ -5,6 +5,21 @@ All notable changes to the CLAUDE.md orchestrator instructions are documented in
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+## [Unreleased]
+### Added
+- Plugin distribution: `.claude-plugin/plugin.json` manifest for Claude Code native plugin system
+- Self-hosted marketplace: `.claude-plugin/marketplace.json` with 2 sub-plugin tiers (gaia-security, gaia-ops)
+- Adapter layer: `hooks/adapters/` with normalized types, abstract base, and Claude Code adapter
+- `hooks/hooks.json` for plugin-channel hook configuration
+- Distribution channel detection (`hooks/adapters/channel.py`)
+- Integration tests for adapter -> business logic -> response flow
+- Plugin manifest validation tests
+### Changed
+- Hook entry points (pre_tool_use.py, post_tool_use.py, subagent_stop.py) now use adapter layer for stdin/stdout
+- hook_response.py delegates to ClaudeCodeAdapter internally
 ## [4.0.0] - 2026-03-03
 ### Breaking: Contracts as Single Source of Truth

package/CODE_OF_CONDUCT.md ADDED Viewed

@@ -0,0 +1,11 @@
+# Code of Conduct
+This project follows the [Contributor Covenant Code of Conduct v2.1](https://www.contributor-covenant.org/version/2/1/code_of_conduct/).
+Please read the full text at the link above. All contributors, maintainers, and participants are expected to uphold these standards.
+## Reporting
+Report unacceptable behavior to jaguilar1897@gmail.com.
+Reports will be reviewed and investigated promptly and fairly.

package/CONTRIBUTING.md ADDED Viewed

@@ -0,0 +1,146 @@
+# Contributing to gaia-ops
+Thank you for your interest in contributing to gaia-ops. This guide covers how to set up your development environment, run tests, and submit changes.
+## Development Setup
+### Prerequisites
+- **Node.js** >= 18.0.0
+- **Python** >= 3.9
+- **Git** >= 2.30
+- **Claude Code** (latest version, for end-to-end testing)
+### Clone and Install
+```bash
+git clone https://github.com/metraton/gaia-ops.git
+cd gaia-ops
+npm install
+```
+Python test dependencies:
+```bash
+pip install pytest
+```
+## Running Tests
+The test suite is organized in layers:
+```bash
+# Layer 1 (fast, deterministic) - run these before every PR
+npm test
+# Equivalent:
+npm run test:layer1
+# Layer 2 (LLM evaluation) - requires Claude Code access
+npm run test:layer2
+# Layer 3 (end-to-end)
+npm run test:layer3
+# All layers
+npm run test:all
+# Run pytest directly with stop-on-first-failure
+python -m pytest tests/ -x
+# Linting
+npm run lint
+```
+Always ensure Layer 1 tests pass before submitting a PR.
+## Project Structure
+See [README.md](./README.md) for the full directory tree. Key areas for contributors:
+| Directory | What it contains |
+|-----------|-----------------|
+| `agents/` | Agent definition files (`.md`) - identity, scope, routing |
+| `skills/` | Skill modules (`SKILL.md` files) - injected procedural knowledge |
+| `hooks/` | Runtime validators (`pre_tool_use.py`, `post_tool_use.py`, `subagent_stop.py`) |
+| `hooks/modules/` | Modular hook components (blocked commands, safe commands, dangerous verbs) |
+| `tools/` | Orchestration tools (context provider, memory, validation) |
+| `config/` | Configuration files (contracts, git standards, rules) |
+| `tests/` | Test suite organized by layer |
+| `bin/` | CLI utilities (`gaia-scan`, `gaia-doctor`, etc.) |
+## Coding Standards
+### Python
+- Follow the existing code style in the repository.
+- Use [ruff](https://github.com/astral-sh/ruff) for linting and formatting.
+- Type hints are encouraged but not strictly required.
+- Keep functions focused and testable.
+### JavaScript / Node.js
+- ES modules (`import`/`export`), not CommonJS.
+- Follow the existing patterns in `bin/` and `index.js`.
+### Commit Messages
+All commits must follow [Conventional Commits](https://www.conventionalcommits.org/):
+```
+type(scope): short description
+```
+Allowed types: `feat`, `fix`, `refactor`, `docs`, `test`, `chore`, `ci`, `perf`, `style`, `build`
+Examples:
+- `feat(hooks): add timeout protection to bash validator`
+- `fix(skills): correct token budget in agent-protocol`
+- `docs(readme): update installation instructions`
+## PR Process
+1. **Fork** the repository and create a feature branch from `main`.
+2. **Make your changes** following the coding standards above.
+3. **Write tests** for new functionality. Changes to `hooks/` always need tests.
+4. **Run the test suite**: `npm test` must pass.
+5. **Commit** using Conventional Commits format.
+6. **Open a PR** against `main` with a clear description of what changed and why.
+PRs are reviewed for correctness, test coverage, and consistency with existing patterns.
+## Hooks Development
+The `hooks/` directory contains runtime validators that enforce security and workflow policies in Claude Code. These are critical-path code.
+- `pre_tool_use.py` - Main entry point; validates every tool call before execution.
+- `post_tool_use.py` - Audit and metrics after tool execution.
+- `hooks/modules/` - Individual validation modules (e.g., `blocked_commands.py`, `mutative_verbs.py`).
+**Key rules for hook changes:**
+- Every change to a hook module must have a corresponding test in `tests/`.
+- Hook modules must be deterministic -- no network calls, no randomness.
+- Test both the allow and deny paths for any new validation rule.
+## Skills Development
+Skills live in `skills/` as directories, each containing a `SKILL.md` file:
+```
+skills/
+  skill-name/
+    SKILL.md          # Main content (injected into agents)
+    reference.md      # Heavy reference material (read on-demand)
+    examples.md       # Concrete examples (optional)
+    scripts/          # Executable tools (optional)
+```
+- `SKILL.md` must stay under 100 lines (it is injected on every agent call).
+- Heavy content goes in `reference.md` (loaded on-demand).
+- Skills define process; agents define identity. Do not duplicate between them.
+For detailed guidance, see `skills/skill-creation/SKILL.md`.
+## Questions?
+Open an issue on [GitHub](https://github.com/metraton/gaia-ops/issues) or contact the maintainer at jaguilar1897@gmail.com.