@crewpilot/agent 2.0.0 → 3.0.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (27) hide show
  1. package/README.md +131 -131
  2. package/dist-npm/cli.js +5 -5
  3. package/dist-npm/index.js +100 -100
  4. package/package.json +69 -69
  5. package/prompts/agent.md +282 -282
  6. package/prompts/copilot-instructions.md +36 -36
  7. package/prompts/{catalyst.config.json → crewpilot.config.json} +72 -72
  8. package/prompts/skills/assure-code-quality/SKILL.md +112 -112
  9. package/prompts/skills/assure-pr-intelligence/SKILL.md +148 -148
  10. package/prompts/skills/assure-review-functional/SKILL.md +114 -114
  11. package/prompts/skills/assure-review-standards/SKILL.md +106 -106
  12. package/prompts/skills/assure-threat-model/SKILL.md +182 -182
  13. package/prompts/skills/assure-vulnerability-scan/SKILL.md +146 -146
  14. package/prompts/skills/autopilot-meeting/SKILL.md +434 -434
  15. package/prompts/skills/autopilot-worker/SKILL.md +737 -737
  16. package/prompts/skills/daily-digest/SKILL.md +188 -188
  17. package/prompts/skills/deliver-change-management/SKILL.md +132 -132
  18. package/prompts/skills/deliver-deploy-guard/SKILL.md +144 -144
  19. package/prompts/skills/deliver-doc-governance/SKILL.md +130 -130
  20. package/prompts/skills/engineer-feature-builder/SKILL.md +270 -270
  21. package/prompts/skills/engineer-root-cause-analysis/SKILL.md +150 -150
  22. package/prompts/skills/engineer-test-first/SKILL.md +148 -148
  23. package/prompts/skills/insights-knowledge-base/SKILL.md +202 -202
  24. package/prompts/skills/insights-pattern-detection/SKILL.md +142 -142
  25. package/prompts/skills/strategize-architecture-planner/SKILL.md +141 -141
  26. package/prompts/skills/strategize-solution-design/SKILL.md +118 -118
  27. package/scripts/postinstall.js +108 -108
@@ -1,36 +1,36 @@
1
- # Catalyst — Copilot Instructions
2
-
3
- This repository contains **Catalyst**, an AI Engineering Intelligence Platform built as a GitHub Copilot custom agent.
4
-
5
- ## Quick Reference
6
-
7
- - **Agent router**: `.github/agents/catalyst.md` — the single source of truth for skill routing, role matrix, and guardrails
8
- - **Skills**: `.github/skills/*/SKILL.md` — 16 structured methodology files across 5 pillars + 3 automation skills
9
- - **MCP Server**: `catalyst-engine/` (CrewPilot MCP server) — TypeScript MCP server with 55 tools across 8 modules + config
10
-
11
- ## How to Use
12
-
13
- Type `@catalyst` in GitHub Copilot Chat. The agent will ask for a session role (Build, Review, Plan, Design, or Just Ask), then route your requests to the appropriate skill.
14
-
15
- ## Key Conventions
16
-
17
- - **Conventional commits**: `type(scope): message`
18
- - **Human gates**: Autopilot always pauses for approval at critical points
19
- - **Branch protection**: Never commit directly to `main`/`master`/`release/*`
20
- - **Progressive disclosure**: Summaries first, details on request
21
- - **Confidence gating**: Findings below threshold (default 7/10) are suppressed
22
-
23
- ## Configuration
24
-
25
- `.github/catalyst.config.json` controls thresholds, pillar toggles, and per-skill overrides. See `catalyst_config_get` tool.
26
-
27
- ## Architecture
28
-
29
- ```
30
- User → @catalyst (router) → SKILL.md (methodology) → MCP tools (execution)
31
-
32
- SQLite (knowledge + workflows)
33
- gh CLI (GitHub Issues/PRs)
34
- ```
35
-
36
- For full details, see the [README](../README.md) or the [agent definition](agents/catalyst.md).
1
+ # CrewPilot — Copilot Instructions
2
+
3
+ This repository contains **CrewPilot**, an AI Engineering Intelligence Platform built as a GitHub Copilot custom agent.
4
+
5
+ ## Quick Reference
6
+
7
+ - **Agent router**: `.github/agents/crewpilot.md` — the single source of truth for skill routing, role matrix, and guardrails
8
+ - **Skills**: `.github/skills/*/SKILL.md` — 16 structured methodology files across 5 pillars + 3 automation skills
9
+ - **MCP Server**: `crewpilot-engine/` (CrewPilot MCP server) — TypeScript MCP server with 55 tools across 8 modules + config
10
+
11
+ ## How to Use
12
+
13
+ Type `@crewpilot` in GitHub Copilot Chat. The agent will ask for a session role (Build, Review, Plan, Design, or Just Ask), then route your requests to the appropriate skill.
14
+
15
+ ## Key Conventions
16
+
17
+ - **Conventional commits**: `type(scope): message`
18
+ - **Human gates**: Autopilot always pauses for approval at critical points
19
+ - **Branch protection**: Never commit directly to `main`/`master`/`release/*`
20
+ - **Progressive disclosure**: Summaries first, details on request
21
+ - **Confidence gating**: Findings below threshold (default 7/10) are suppressed
22
+
23
+ ## Configuration
24
+
25
+ `.github/crewpilot.config.json` controls thresholds, pillar toggles, and per-skill overrides. See `crewpilot_config_get` tool.
26
+
27
+ ## Architecture
28
+
29
+ ```
30
+ User → @crewpilot (router) → SKILL.md (methodology) → MCP tools (execution)
31
+
32
+ SQLite (knowledge + workflows)
33
+ gh CLI (GitHub Issues/PRs)
34
+ ```
35
+
36
+ For full details, see the [README](../README.md) or the [agent definition](agents/crewpilot.md).
@@ -1,72 +1,72 @@
1
- {
2
- "version": "1.3.0",
3
- "platform": "CrewPilot — Your AI Engineering Crew",
4
- "global": {
5
- "confidence_threshold": 7,
6
- "max_findings": 20,
7
- "output_style": "concise",
8
- "auto_chain": true
9
- },
10
- "board": {
11
- "provider": "github",
12
- "github": {
13
- "repo": "",
14
- "auto_detect": true
15
- },
16
- "azure": {
17
- "org": "performanceagent",
18
- "project": "Catalyst"
19
- },
20
- "sync_on_commit": true
21
- },
22
- "worker": {
23
- "enabled": true,
24
- "max_retries": 3,
25
- "human_gates": ["approve-plan", "approve-pr", "approve-merge"],
26
- "auto_assign_label": "digital-worker"
27
- },
28
- "notifications": {
29
- "channel": "email",
30
- "email_recipients": []
31
- },
32
- "pillars": {
33
- "strategize": {
34
- "enabled": true,
35
- "skills": {
36
- "solution-design": { "enabled": true, "max_options": 4 },
37
- "architecture-planner": { "enabled": true, "require_adr": true }
38
- }
39
- },
40
- "assure": {
41
- "enabled": true,
42
- "skills": {
43
- "code-quality": { "enabled": true, "severity_floor": "medium" },
44
- "vulnerability-scan": { "enabled": true, "owasp_top_10": true, "cwe_top_25": true },
45
- "pr-intelligence": { "enabled": true, "auto_summarize": true }
46
- }
47
- },
48
- "engineer": {
49
- "enabled": true,
50
- "skills": {
51
- "feature-builder": { "enabled": true, "scaffold_tests": true },
52
- "test-first": { "enabled": true, "enforcement": "strict", "min_coverage": 80 },
53
- "root-cause-analysis": { "enabled": true, "max_attempts": 5 }
54
- }
55
- },
56
- "deliver": {
57
- "enabled": true,
58
- "skills": {
59
- "change-management": { "enabled": true, "commit_format": "conventional" },
60
- "doc-governance": { "enabled": true, "drift_detection": true },
61
- "deploy-guard": { "enabled": true, "phase_gates": "strict" }
62
- }
63
- },
64
- "insights": {
65
- "enabled": true,
66
- "skills": {
67
- "pattern-detection": { "enabled": true, "scan_depth": "full" },
68
- "knowledge-base": { "enabled": true, "retention_days": 90 }
69
- }
70
- }
71
- }
72
- }
1
+ {
2
+ "version": "1.3.0",
3
+ "platform": "CrewPilot — Your AI Engineering Crew",
4
+ "global": {
5
+ "confidence_threshold": 7,
6
+ "max_findings": 20,
7
+ "output_style": "concise",
8
+ "auto_chain": true
9
+ },
10
+ "board": {
11
+ "provider": "github",
12
+ "github": {
13
+ "repo": "",
14
+ "auto_detect": true
15
+ },
16
+ "azure": {
17
+ "org": "performanceagent",
18
+ "project": "CrewPilot"
19
+ },
20
+ "sync_on_commit": true
21
+ },
22
+ "worker": {
23
+ "enabled": true,
24
+ "max_retries": 3,
25
+ "human_gates": ["approve-plan", "approve-pr", "approve-merge"],
26
+ "auto_assign_label": "digital-worker"
27
+ },
28
+ "notifications": {
29
+ "channel": "email",
30
+ "email_recipients": []
31
+ },
32
+ "pillars": {
33
+ "strategize": {
34
+ "enabled": true,
35
+ "skills": {
36
+ "solution-design": { "enabled": true, "max_options": 4 },
37
+ "architecture-planner": { "enabled": true, "require_adr": true }
38
+ }
39
+ },
40
+ "assure": {
41
+ "enabled": true,
42
+ "skills": {
43
+ "code-quality": { "enabled": true, "severity_floor": "medium" },
44
+ "vulnerability-scan": { "enabled": true, "owasp_top_10": true, "cwe_top_25": true },
45
+ "pr-intelligence": { "enabled": true, "auto_summarize": true }
46
+ }
47
+ },
48
+ "engineer": {
49
+ "enabled": true,
50
+ "skills": {
51
+ "feature-builder": { "enabled": true, "scaffold_tests": true },
52
+ "test-first": { "enabled": true, "enforcement": "strict", "min_coverage": 80 },
53
+ "root-cause-analysis": { "enabled": true, "max_attempts": 5 }
54
+ }
55
+ },
56
+ "deliver": {
57
+ "enabled": true,
58
+ "skills": {
59
+ "change-management": { "enabled": true, "commit_format": "conventional" },
60
+ "doc-governance": { "enabled": true, "drift_detection": true },
61
+ "deploy-guard": { "enabled": true, "phase_gates": "strict" }
62
+ }
63
+ },
64
+ "insights": {
65
+ "enabled": true,
66
+ "skills": {
67
+ "pattern-detection": { "enabled": true, "scan_depth": "full" },
68
+ "knowledge-base": { "enabled": true, "retention_days": 90 }
69
+ }
70
+ }
71
+ }
72
+ }
@@ -1,112 +1,112 @@
1
- # Code Quality
2
-
3
- > **Pillar**: Assure | **ID**: `assure-code-quality`
4
-
5
- ## Purpose
6
-
7
- Multi-pass code review that identifies quality issues across correctness, maintainability, performance, and readability. Goes beyond linting — analyzes design intent and structural health.
8
-
9
- ## Activation Triggers
10
-
11
- - "review this code", "code quality check", "refactor suggestions", "clean code"
12
- - "what's wrong with this", "improve this", "code smell"
13
- - When any file is shared for review
14
-
15
- ## Methodology
16
-
17
- ### Process Flow
18
-
19
- ```dot
20
- digraph code_quality {
21
- rankdir=LR;
22
- node [shape=box];
23
-
24
- correctness [label="Pass 1\nCorrectness"];
25
- maintainability [label="Pass 2\nMaintainability"];
26
- performance [label="Pass 3\nPerformance"];
27
- readability [label="Pass 4\nReadability"];
28
- synthesis [label="Synthesis\nRank & Filter", shape=doublecircle];
29
-
30
- correctness -> maintainability;
31
- maintainability -> performance;
32
- performance -> readability;
33
- readability -> synthesis;
34
- }
35
- ```
36
-
37
- ### Pass 1 — Correctness
38
- 1. Identify logic errors, off-by-one, null/undefined risks, race conditions
39
- 2. Check edge cases: empty inputs, boundary values, error paths
40
- 3. Verify resource cleanup (connections, file handles, subscriptions)
41
- 4. Confidence-gate: only report findings ≥ threshold
42
-
43
- ### Pass 2 — Maintainability
44
- 1. Check function/class length — flag functions > 30 lines, classes > 300 lines
45
- 2. Identify code duplication (exact and near-duplicate)
46
- 3. Evaluate naming clarity — do names reveal intent?
47
- 4. Check coupling: does this code depend on internal details of other modules?
48
- 5. Apply SOLID principles where applicable (but don't lecture)
49
-
50
- ### Pass 3 — Performance
51
- 1. Identify O(n²) or worse patterns in hot paths
52
- 2. Flag unnecessary allocations in loops
53
- 3. Check for N+1 query patterns (if data access is involved)
54
- 4. Look for missing caching opportunities on repeated computations
55
- 5. Identify blocking calls that could be async
56
-
57
- ### Pass 4 — Readability
58
- 1. Assess cognitive complexity (nesting depth, boolean chains)
59
- 2. Check for magic numbers/strings
60
- 3. Verify consistent style within the file
61
- 4. Evaluate comment quality — helpful vs. noise vs. missing
62
-
63
- ### Synthesis
64
- 1. Rank all findings by severity: `critical > high > medium > low`
65
- 2. Filter by `severity_floor` from config
66
- 3. Group by file/function
67
- 4. Provide specific fix suggestions with code snippets
68
-
69
- ## Tools Required
70
-
71
- - `codebase` — Read files and understand structure
72
- - `catalyst_metrics_complexity` — Get cyclomatic/cognitive complexity scores
73
- - `catalyst_metrics_coverage` — Check test coverage for reviewed code
74
-
75
- ## Output Format
76
-
77
- ```
78
- ## [Catalyst → Code Quality]
79
-
80
- ### Summary
81
- {N} findings across {files}: {critical} critical, {high} high, {medium} medium
82
-
83
- ### Findings
84
-
85
- #### [{severity}] {title}
86
- **File**: {path}:{line}
87
- **Issue**: {description}
88
- **Fix**:
89
- \`\`\`{lang}
90
- {suggested fix}
91
- \`\`\`
92
- **Confidence**: {N}/10
93
-
94
- ---
95
- (repeat per finding)
96
-
97
- ### Recommendations
98
- {top 3 structural improvements, if any}
99
- ```
100
-
101
- ## Chains To
102
-
103
- - `test-first` — Write tests for areas with correctness concerns
104
- - `vulnerability-scan` — If security-adjacent patterns are detected
105
- - `change-management` — When fixes are ready to commit
106
-
107
- ## Anti-Patterns
108
-
109
- - Do NOT report style preferences as quality issues
110
- - Do NOT suggest heroic refactoring for stable, working code
111
- - Do NOT report findings below the configured severity floor
112
- - Do NOT rewrite the user's code in a different paradigm unless asked
1
+ # Code Quality
2
+
3
+ > **Pillar**: Assure | **ID**: `assure-code-quality`
4
+
5
+ ## Purpose
6
+
7
+ Multi-pass code review that identifies quality issues across correctness, maintainability, performance, and readability. Goes beyond linting — analyzes design intent and structural health.
8
+
9
+ ## Activation Triggers
10
+
11
+ - "review this code", "code quality check", "refactor suggestions", "clean code"
12
+ - "what's wrong with this", "improve this", "code smell"
13
+ - When any file is shared for review
14
+
15
+ ## Methodology
16
+
17
+ ### Process Flow
18
+
19
+ ```dot
20
+ digraph code_quality {
21
+ rankdir=LR;
22
+ node [shape=box];
23
+
24
+ correctness [label="Pass 1\nCorrectness"];
25
+ maintainability [label="Pass 2\nMaintainability"];
26
+ performance [label="Pass 3\nPerformance"];
27
+ readability [label="Pass 4\nReadability"];
28
+ synthesis [label="Synthesis\nRank & Filter", shape=doublecircle];
29
+
30
+ correctness -> maintainability;
31
+ maintainability -> performance;
32
+ performance -> readability;
33
+ readability -> synthesis;
34
+ }
35
+ ```
36
+
37
+ ### Pass 1 — Correctness
38
+ 1. Identify logic errors, off-by-one, null/undefined risks, race conditions
39
+ 2. Check edge cases: empty inputs, boundary values, error paths
40
+ 3. Verify resource cleanup (connections, file handles, subscriptions)
41
+ 4. Confidence-gate: only report findings ≥ threshold
42
+
43
+ ### Pass 2 — Maintainability
44
+ 1. Check function/class length — flag functions > 30 lines, classes > 300 lines
45
+ 2. Identify code duplication (exact and near-duplicate)
46
+ 3. Evaluate naming clarity — do names reveal intent?
47
+ 4. Check coupling: does this code depend on internal details of other modules?
48
+ 5. Apply SOLID principles where applicable (but don't lecture)
49
+
50
+ ### Pass 3 — Performance
51
+ 1. Identify O(n²) or worse patterns in hot paths
52
+ 2. Flag unnecessary allocations in loops
53
+ 3. Check for N+1 query patterns (if data access is involved)
54
+ 4. Look for missing caching opportunities on repeated computations
55
+ 5. Identify blocking calls that could be async
56
+
57
+ ### Pass 4 — Readability
58
+ 1. Assess cognitive complexity (nesting depth, boolean chains)
59
+ 2. Check for magic numbers/strings
60
+ 3. Verify consistent style within the file
61
+ 4. Evaluate comment quality — helpful vs. noise vs. missing
62
+
63
+ ### Synthesis
64
+ 1. Rank all findings by severity: `critical > high > medium > low`
65
+ 2. Filter by `severity_floor` from config
66
+ 3. Group by file/function
67
+ 4. Provide specific fix suggestions with code snippets
68
+
69
+ ## Tools Required
70
+
71
+ - `codebase` — Read files and understand structure
72
+ - `crewpilot_metrics_complexity` — Get cyclomatic/cognitive complexity scores
73
+ - `crewpilot_metrics_coverage` — Check test coverage for reviewed code
74
+
75
+ ## Output Format
76
+
77
+ ```
78
+ ## [CrewPilot → Code Quality]
79
+
80
+ ### Summary
81
+ {N} findings across {files}: {critical} critical, {high} high, {medium} medium
82
+
83
+ ### Findings
84
+
85
+ #### [{severity}] {title}
86
+ **File**: {path}:{line}
87
+ **Issue**: {description}
88
+ **Fix**:
89
+ \`\`\`{lang}
90
+ {suggested fix}
91
+ \`\`\`
92
+ **Confidence**: {N}/10
93
+
94
+ ---
95
+ (repeat per finding)
96
+
97
+ ### Recommendations
98
+ {top 3 structural improvements, if any}
99
+ ```
100
+
101
+ ## Chains To
102
+
103
+ - `test-first` — Write tests for areas with correctness concerns
104
+ - `vulnerability-scan` — If security-adjacent patterns are detected
105
+ - `change-management` — When fixes are ready to commit
106
+
107
+ ## Anti-Patterns
108
+
109
+ - Do NOT report style preferences as quality issues
110
+ - Do NOT suggest heroic refactoring for stable, working code
111
+ - Do NOT report findings below the configured severity floor
112
+ - Do NOT rewrite the user's code in a different paradigm unless asked