agentsys 5.4.0 → 5.5.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,7 +1,7 @@
1
1
  {
2
2
  "name": "agentsys",
3
- "description": "15 specialized plugins for AI workflow automation - task orchestration, PR workflow, slop detection, code review, drift detection, enhancement analysis, documentation sync, repo mapping, perf investigations, topic research, agent config linting, cross-tool AI consultation, structured AI debate, and workflow pattern learning",
4
- "version": "5.4.0",
3
+ "description": "18 specialized plugins for AI workflow automation - task orchestration, PR workflow, slop detection, code review, drift detection, enhancement analysis, documentation sync, repo mapping, git intelligence, perf investigations, topic research, agent config linting, cross-tool AI consultation, structured AI debate, workflow pattern learning, codebase onboarding, and contributor guidance",
4
+ "version": "5.5.0",
5
5
  "owner": {
6
6
  "name": "Avi Fenesh",
7
7
  "url": "https://github.com/avifenesh"
@@ -138,7 +138,7 @@
138
138
  "source": "url",
139
139
  "url": "https://github.com/agent-sh/agnix.git"
140
140
  },
141
- "description": "Lint agent configuration files (SKILL.md, CLAUDE.md, hooks, MCP) against 155 rules across 10+ AI tools",
141
+ "description": "Lint agent configuration files (SKILL.md, CLAUDE.md, hooks, MCP) against 342 rules across 10+ AI tools",
142
142
  "version": "1.0.0",
143
143
  "category": "development",
144
144
  "homepage": "https://github.com/agent-sh/agnix"
@@ -186,6 +186,39 @@
186
186
  "version": "1.0.0",
187
187
  "category": "productivity",
188
188
  "homepage": "https://github.com/agent-sh/skillers"
189
+ },
190
+ {
191
+ "name": "git-map",
192
+ "source": {
193
+ "source": "url",
194
+ "url": "https://github.com/agent-sh/git-map.git"
195
+ },
196
+ "description": "Git history analysis: hotspots, coupling, ownership, bus factor, AI commit detection - backed by agent-analyzer Rust binary with lazy download",
197
+ "version": "1.0.0",
198
+ "category": "development",
199
+ "homepage": "https://github.com/agent-sh/git-map"
200
+ },
201
+ {
202
+ "name": "onboard",
203
+ "source": {
204
+ "source": "url",
205
+ "url": "https://github.com/agent-sh/onboard.git"
206
+ },
207
+ "description": "Codebase onboarding - automated data collection and interactive project orientation",
208
+ "version": "0.1.0",
209
+ "category": "productivity",
210
+ "homepage": "https://github.com/agent-sh/onboard"
211
+ },
212
+ {
213
+ "name": "can-i-help",
214
+ "source": {
215
+ "source": "url",
216
+ "url": "https://github.com/agent-sh/can-i-help.git"
217
+ },
218
+ "description": "Find where to contribute to any project - matches developer skills to test gaps, stale docs, bugspots, and open issues",
219
+ "version": "0.1.0",
220
+ "category": "productivity",
221
+ "homepage": "https://github.com/agent-sh/can-i-help"
189
222
  }
190
223
  ]
191
224
  }
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "agentsys",
3
- "version": "5.4.0",
3
+ "version": "5.5.0",
4
4
  "description": "Professional-grade slash commands for Claude Code with cross-platform support",
5
5
  "keywords": [
6
6
  "workflow",
package/CHANGELOG.md CHANGED
@@ -7,6 +7,22 @@ All notable changes to this project will be documented in this file.
7
7
  The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
8
8
  and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
9
9
 
10
+ ## [5.4.1] - 2026-03-10
11
+
12
+ ### Added
13
+
14
+ - **Project base branch** (`--base=BRANCH`) - `/next-task` now supports configuring a project-level base branch for batch workflows. All downstream operations (worktrees, diffs, PRs) use the configured branch instead of main.
15
+ - **Free-text preference caching** - When users select "Other" for any policy decision and type a custom response, it gets cached and offered as an option next time. Auto-removed after 3 skips.
16
+ - **Gate 0 hook** - SubagentStop hook blocks Phase 2 unless policy decisions are persisted to preference cache.
17
+ - **Multi-tool transcript support** - `/skillers compact` now reads from Claude Code, Codex CLI, and OpenCode (was Claude Code only).
18
+
19
+ ### Fixed
20
+
21
+ - **ship target branch validation** - `/ship` now reads `baseBranch` from flow state and validates non-default targets with user confirmation.
22
+ - **Quality sweep** - Removed 95 lines of prose slop and duplication across ship and skillers.
23
+ - **Pre-push hooks** - Fixed for repos without `npm test` script (falls back to JS syntax check).
24
+ - **Cached source null check** - `getPolicyQuestions` no longer crashes when preference file has freeText but no source.
25
+
10
26
  ## [5.4.0] - 2026-03-10
11
27
 
12
28
  ### Added
package/README.md CHANGED
@@ -8,8 +8,6 @@
8
8
  <strong>A modular runtime and orchestration system for AI agents.</strong>
9
9
  </p>
10
10
 
11
- > **Renamed from `awesome-slash`** — The `awesome-` prefix implies a curated list of links, but this project is a functional software suite and runtime. Please update your installs: `npm install -g agentsys`
12
-
13
11
  <p align="center">
14
12
  <a href="https://www.npmjs.com/package/agentsys"><img src="https://img.shields.io/npm/v/agentsys.svg" alt="npm version"></a>
15
13
  <a href="https://www.npmjs.com/package/agentsys"><img src="https://img.shields.io/npm/dm/agentsys.svg" alt="npm downloads"></a>
@@ -21,7 +19,7 @@
21
19
  </p>
22
20
 
23
21
  <p align="center">
24
- <b>15 plugins · 35 agents · 32 skills (across all repos) · 30k lines of lib code · 3,751 tests · 5 platforms</b><br>
22
+ <b>18 plugins · 38 agents · 36 skills (across all repos) · 30k lines of lib code · 3,575 tests · 5 platforms</b><br>
25
23
  <em>Plugins distributed as standalone repos under <a href="https://github.com/agent-sh">agent-sh</a> org — agentsys is the marketplace &amp; installer</em>
26
24
  </p>
27
25
 
@@ -43,11 +41,11 @@
43
41
  AI models can write code. That's not the hard part anymore. The hard part is everything around it — task selection, branch management, code review, artifact cleanup, CI, PR comments, deployment. **AgentSys is the runtime that orchestrates agents to handle all of it** — structured pipelines, gated phases, specialized agents, and persistent state that survives session boundaries.
44
42
 
45
43
  ---
46
- > Building custom skills, agents, hooks, or MCP tools? [agnix](https://github.com/agent-sh/agnix) is the CLI + LSP linter that catches config errors before they fail silently - real-time IDE validation, auto suggestions, auto-fix, and 155 rules for Cursor, Claude Code, Cline, Copilot, Codex, Windsurf, and more.
44
+ > Building custom skills, agents, hooks, or MCP tools? [agnix](https://github.com/agent-sh/agnix) is the CLI + LSP linter that catches config errors before they fail silently - real-time IDE validation, auto suggestions, auto-fix, and 342 rules for Claude Code, Codex, OpenCode, Cursor, Kiro, Copilot, Gemini CLI, Cline, Windsurf, Roo Code, Amp, and more.
47
45
 
48
46
  ## What This Is
49
47
 
50
- An agent orchestration system — 15 plugins, 35 agents, and 32 skills that compose into structured pipelines for software development. Each plugin lives in its own standalone repo under the [agent-sh](https://github.com/agent-sh) org. agentsys is the marketplace and installer that ties them together.
48
+ An agent orchestration system — 18 plugins, 38 agents, and 36 skills that compose into structured pipelines for software development. Each plugin lives in its own standalone repo under the [agent-sh](https://github.com/agent-sh) org. agentsys is the marketplace and installer that ties them together.
51
49
 
52
50
  Each agent has a single responsibility, a specific model assignment, and defined inputs/outputs. Pipelines enforce phase gates so agents can't skip steps. State persists across sessions so work survives interruptions.
53
51
 
@@ -80,6 +78,25 @@ This came from testing on 1,000+ repositories.
80
78
  <!-- GEN:START:readme-commands -->
81
79
  | Command | What it does |
82
80
  |---------|--------------|
81
+ | [`/next-task`](#next-task) | Task workflow: discovery, implementation, PR, merge |
82
+ | [`/agnix`](#agnix) | Lint agent configurations (342 rules) |
83
+ | [`/ship`](#ship) | PR creation, CI monitoring, merge |
84
+ | [`/deslop`](#deslop) | Clean AI slop patterns |
85
+ | [`/perf`](#perf) | Performance investigation with baselines and profiling |
86
+ | [`/drift-detect`](#drift-detect) | Compare plan vs implementation |
87
+ | [`/audit-project`](#audit-project) | Multi-agent iterative code review |
88
+ | [`/enhance`](#enhance) | Plugin, agent, and prompt analyzers |
89
+ | [`/repo-map`](#repo-map) | AST-based repository map |
90
+ | [`/sync-docs`](#sync-docs) | Sync documentation with code changes |
91
+ | [`/learn`](#learn) | Research topics, create learning guides |
92
+ | [`/consult`](#consult) | Cross-tool AI consultation |
93
+ | [`/debate`](#debate) | Structured debate between AI tools |
94
+ | [`/web-ctl`](#web-ctl) | Browser automation for AI agents |
95
+ | [`/release`](#release) | Versioned release with ecosystem detection |
96
+ | [`/skillers`](#skillers) | Workflow pattern learning and automation |
97
+ | [`/git-map`](#git-map) | Git history analysis: hotspots, coupling, ownership, bus factor |
98
+ | [`/onboard`](#onboard) | Codebase orientation for newcomers |
99
+ | [`/can-i-help`](#can-i-help) | Match contributor skills to project needs |
83
100
  <!-- GEN:END:readme-commands -->
84
101
 
85
102
  Each command works standalone. Together, they compose into end-to-end pipelines.
@@ -89,10 +106,20 @@ Each command works standalone. Together, they compose into end-to-end pipelines.
89
106
  ## Skills
90
107
 
91
108
  <!-- GEN:START:readme-skills -->
92
- 0 skills included across the plugins:
109
+ 36 skills included across the plugins:
93
110
 
94
111
  | Category | Skills |
95
112
  |----------|--------|
113
+ | **Workflow** | `discover-tasks`, `orchestrate-review`, `validate-delivery` |
114
+ | **Enhancement** | `enhance-agent-prompts`, `enhance-claude-memory`, `enhance-cross-file`, `enhance-docs`, `enhance-hooks`, `enhance-orchestrator`, `enhance-plugins`, `enhance-prompts`, `enhance-skills` |
115
+ | **Performance** | `baseline`, `benchmark`, `code-paths`, `investigation-logger`, `perf-analyzer`, `profile`, `theory-gatherer`, `theory-tester` |
116
+ | **Cleanup** | `deslop`, `sync-docs` |
117
+ | **Code Review** | `audit-project` |
118
+ | **AI Collaboration** | `consult`, `debate`, `learn`, `recommend`, `skillers-compact` |
119
+ | **Onboarding** | `can-i-help`, `onboard` |
120
+ | **Web** | `web-auth`, `web-browse` |
121
+ | **Release** | `release` |
122
+ | **Analysis** | `drift-analysis`, `git-mapping`, `repo-mapping` |
96
123
  <!-- GEN:END:readme-skills -->
97
124
 
98
125
  Skills are the reusable implementation units. Agents invoke skills; commands orchestrate agents. When you install a plugin, its skills become available to all agents in that session.
@@ -104,8 +131,8 @@ Skills are the reusable implementation units. Agents invoke skills; commands orc
104
131
  | Section | What's there |
105
132
  |---------|--------------|
106
133
  | [The Approach](#the-approach) | Why it's built this way |
107
- | [Commands](#commands) | All 14 commands overview |
108
- | [Skills](#skills) | 32 skills across plugins |
134
+ | [Commands](#commands) | All 19 commands overview |
135
+ | [Skills](#skills) | 36 skills across plugins |
109
136
  | [Command Details](#command-details) | Deep dive into each command |
110
137
  | [How Commands Work Together](#how-commands-work-together) | Standalone vs integrated |
111
138
  | [Design Philosophy](#design-philosophy) | The thinking behind the architecture |
@@ -144,7 +171,7 @@ Phase 9 uses the `orchestrate-review` skill to spawn parallel reviewers (code qu
144
171
  |-------|-------|------|
145
172
  | task-discoverer | sonnet | Finds and ranks tasks from your source |
146
173
  | worktree-manager | haiku | Creates git worktrees and branches |
147
- | exploration-agent | opus | Deep codebase analysis before planning |
174
+ | exploration-agent | sonnet | Deep codebase analysis before planning |
148
175
  | planning-agent | opus | Designs step-by-step implementation plan |
149
176
  | implementation-agent | opus | Writes the actual code |
150
177
  | test-coverage-checker | sonnet | Validates tests exist and are meaningful |
@@ -192,10 +219,10 @@ agnix catches these issues before they cause problems.
192
219
  | **Security** | Prompt injection vectors, overpermissive tools, exposed secrets |
193
220
  | **Consistency** | Conflicting rules, duplicate definitions, broken references |
194
221
  | **Best Practices** | Tool restrictions, model selection, trigger phrase quality |
195
- | **Cross-Platform** | Compatibility across Claude Code, Cursor, Copilot, Codex, OpenCode, Gemini CLI, Cline, and more |
222
+ | **Cross-Platform** | Compatibility across Claude Code, Codex, OpenCode, Cursor, Kiro, Copilot, Gemini CLI, Cline, Windsurf, Roo Code, Amp, and more |
196
223
 
197
- **155 validation rules** (57 auto-fixable) derived from:
198
- - Official tool specifications (Claude Code, Cursor, GitHub Copilot, Codex CLI, OpenCode, Gemini CLI, and more)
224
+ **342 validation rules** (102 auto-fixable) derived from:
225
+ - Official tool specifications (Claude Code, Codex CLI, OpenCode, Cursor, Kiro, GitHub Copilot, Gemini CLI, Cline, Windsurf, Roo Code, Amp, and more)
199
226
  - Research papers on agent reliability and prompt injection
200
227
  - Real-world testing across 500+ repositories
201
228
  - Community-reported issues and edge cases
@@ -210,6 +237,12 @@ agnix catches these issues before they cause problems.
210
237
  | MCP | `*.mcp.json`, MCP server configs |
211
238
  | Cursor | `.cursor/rules/*.mdc`, `.cursorrules` |
212
239
  | Copilot | `.github/copilot-instructions.md` |
240
+ | Kiro | `.kiro/steering/**/*.md`, `.kiro/agents/*.json`, `.kiro/hooks/*.kiro.hook`, `POWER.md` |
241
+ | Windsurf | `.windsurf/rules/**/*.md`, `.windsurf/workflows/**/*.md`, `.windsurfrules` |
242
+ | Roo Code | `.roo/rules/*.md`, `.roo/rules-{mode}/*.md`, `.roomodes`, `.rooignore`, `.roorules` |
243
+ | Gemini CLI | `GEMINI.md`, `.gemini/settings.json`, `gemini-extension.json` |
244
+ | OpenCode | `opencode.json` |
245
+ | Amp | `.agents/checks/**/*.md`, `.amp/settings.json` |
213
246
 
214
247
  **CI/CD Integration:**
215
248
 
@@ -610,7 +643,7 @@ agent-knowledge/
610
643
  /learn python async --no-enhance # Skip enhancement pass
611
644
  ```
612
645
 
613
- **Agent:** learn-agent (opus model for research quality)
646
+ **Agent:** learn-agent (sonnet model)
614
647
 
615
648
  ---
616
649
 
@@ -804,7 +837,115 @@ No per-turn overhead - it reads transcripts that Claude Code already saves.
804
837
 
805
838
  **Agents:** skillers-compactor (sonnet), skillers-recommender (opus)
806
839
 
807
- **Skills:** compact, recommend
840
+ **Skills:** skillers-compact, recommend
841
+
842
+ ---
843
+
844
+ ### /git-map
845
+
846
+ **Purpose:** Analyze git history to surface hotspots, coupling, ownership, bus factor, bugspots, area health, and AI attribution.
847
+
848
+ **How it works:**
849
+
850
+ The plugin wraps the [agent-analyzer](https://github.com/agent-sh/agent-analyzer) Rust binary. Run `init` once to scan git history and cache the result as `repo-intel.json`. Then run queries instantly.
851
+
852
+ **21 query types:**
853
+
854
+ | Category | Queries |
855
+ |----------|---------|
856
+ | Activity | `hotspots`, `coldspots`, `file-history` |
857
+ | Quality | `bugspots`, `test-gaps`, `diff-risk` |
858
+ | People | `ownership`, `contributors`, `bus-factor` |
859
+ | Coupling | `coupling` |
860
+ | Standards | `norms`, `conventions` |
861
+ | Health | `areas`, `health`, `release-info` |
862
+ | AI | `ai-ratio`, `recent-ai` |
863
+ | Guidance | `onboard`, `can-i-help` |
864
+ | Docs | `doc-drift` |
865
+
866
+ **9 plugins consume git-map data automatically** - deslop, sync-docs, drift-detect, audit-project, next-task, enhance, ship, onboard, can-i-help.
867
+
868
+ **Usage:**
869
+
870
+ ```bash
871
+ /git-map init # First-time scan
872
+ /git-map update # Add new commits
873
+ /git-map query hotspots # Most active files
874
+ /git-map query ownership src/ # Who owns a path
875
+ /git-map query bus-factor # Knowledge risk
876
+ ```
877
+
878
+ [Full query reference ->](https://github.com/agent-sh/git-map)
879
+
880
+ ---
881
+
882
+ ### /onboard
883
+
884
+ **Purpose:** Get oriented in any codebase in under 3 minutes.
885
+
886
+ **What happens when you run it:**
887
+
888
+ 1. **Collect** (68ms median) - Pure JavaScript scans manifest, structure, README, CLAUDE.md/AGENTS.md, CI, git, repo-intel (no LLM tokens)
889
+ 2. **Synthesize** - Opus agent produces a structured overview: tech stack, key files, active areas, conventions
890
+ 3. **Guide** - Interactive Q&A: ask about specific files, areas, or patterns
891
+
892
+ **74% fewer tokens** than manual onboarding. Validated on 100 repos across JS/TS, Rust, Go, Python, C/C++, Java, and Deno.
893
+
894
+ **Depth levels:**
895
+
896
+ | Level | Time | Data |
897
+ |-------|------|------|
898
+ | quick | ~2s | Manifest + README + structure |
899
+ | normal | ~5s | + CLAUDE.md/AGENTS.md + CI + repo-intel |
900
+ | deep | ~15s | + repo-map AST symbols |
901
+
902
+ **Supported manifests:** package.json, Cargo.toml, go.mod, pyproject.toml, deno.json, CMakeLists.txt, meson.build, setup.py, pom.xml, build.gradle. Detects monorepos (npm/pnpm/lerna/Cargo workspaces, Python libs/, Deno workspaces).
903
+
904
+ **Usage:**
905
+
906
+ ```bash
907
+ /onboard # Current repo
908
+ /onboard /path/to/repo # Specific repo
909
+ /onboard --depth=deep # Include AST data
910
+ ```
911
+
912
+ **Agent:** onboard-agent (opus model)
913
+
914
+ [Full documentation ->](https://github.com/agent-sh/onboard)
915
+
916
+ ---
917
+
918
+ ### /can-i-help
919
+
920
+ **Purpose:** Match a contributor's skills to specific areas where they can help.
921
+
922
+ **What happens when you run it:**
923
+
924
+ 1. **Collect** - Gathers project data + contributor signals (test gaps, doc drift, bugspots, good-first areas, open issues). Validated on 100 repos.
925
+ 2. **Match** - Opus agent asks about developer background and matches skills to project needs
926
+ 3. **Guide** - For each recommendation: reads code, explains what needs doing, gives a concrete first step
927
+
928
+ **Matching:**
929
+
930
+ | Developer profile | Gets recommended |
931
+ |-------------------|------------------|
932
+ | New to stack | Good-first areas with clear patterns |
933
+ | Experienced | Hard problems in pain-point areas |
934
+ | Test-focused | Test gaps in frequently-changed files |
935
+ | Bug-focused | Bugspot files + relevant open issues |
936
+ | Docs-focused | Stale documentation with code examples |
937
+
938
+ **Usage:**
939
+
940
+ ```bash
941
+ /can-i-help # Current repo
942
+ /can-i-help /path/to/repo # Specific repo
943
+ /can-i-help --depth=deep # Include AST data
944
+ ```
945
+
946
+ **Agent:** can-i-help-agent (opus model)
947
+
948
+ [Full documentation ->](https://github.com/agent-sh/can-i-help)
808
949
 
809
950
  ---
810
951
 
@@ -898,7 +1039,7 @@ Approve the plan. See the results. The middle is automated. One plan approval un
898
1039
  **8. Right model for the task**
899
1040
 
900
1041
  Match model capability to task complexity:
901
- - **opus** - Exploration, planning, implementation, review orchestration
1042
+ - **opus** - Planning, implementation, review orchestration
902
1043
  - **sonnet** - Pattern matching, validation, discovery
903
1044
  - **haiku** - Git operations, file moves, CI polling
904
1045
 
@@ -988,7 +1129,7 @@ agentsys --development # Dev mode (bypasses marketplace)
988
1129
  - ast-grep (`sg`) installed
989
1130
 
990
1131
  **For /agnix:**
991
- - [agnix CLI](https://github.com/agent-sh/agnix) installed (`cargo install agnix-cli` or `brew install agnix`)
1132
+ - [agnix CLI](https://github.com/agent-sh/agnix) installed (`npm install -g agnix`, `cargo install agnix-cli`, or `brew install agnix`)
992
1133
 
993
1134
  **Local diagnostics (optional):**
994
1135
  ```bash
@@ -1011,7 +1152,7 @@ The system is built on research, not guesswork.
1011
1152
  - Instruction following reliability
1012
1153
 
1013
1154
  **Testing:**
1014
- - 3,751 tests passing
1155
+ - 3,575 tests passing
1015
1156
  - Drift-detect validated on 1,000+ repositories
1016
1157
  - E2E workflow testing across all commands
1017
1158
  - Cross-platform validation (Claude Code, OpenCode, Codex CLI, Cursor, Kiro)