opencode-skills-collection 2.0.38 → 2.0.39
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +0 -3
- package/bundled-skills/.antigravity-install-manifest.json +2 -1
- package/bundled-skills/docs/integrations/jetski-cortex.md +3 -3
- package/bundled-skills/docs/integrations/jetski-gemini-loader/README.md +1 -1
- package/bundled-skills/docs/maintainers/repo-growth-seo.md +3 -3
- package/bundled-skills/docs/maintainers/skills-update-guide.md +1 -1
- package/bundled-skills/docs/users/bundles.md +1 -1
- package/bundled-skills/docs/users/claude-code-skills.md +1 -1
- package/bundled-skills/docs/users/gemini-cli-skills.md +1 -1
- package/bundled-skills/docs/users/getting-started.md +1 -1
- package/bundled-skills/docs/users/kiro-integration.md +1 -1
- package/bundled-skills/docs/users/usage.md +4 -4
- package/bundled-skills/docs/users/visual-guide.md +4 -4
- package/bundled-skills/skill-optimizer/SKILL.md +266 -0
- package/package.json +1 -1
- package/skills_index.json +24 -0
package/README.md
CHANGED
|
@@ -184,9 +184,6 @@ The old `opencode-skills-antigravity` package on npm is deprecated and re-export
|
|
|
184
184
|
Issues and pull requests are welcome
|
|
185
185
|
at [github.com/FrancoStino/opencode-skills-collection](https://github.com/FrancoStino/opencode-skills-collection/issues).
|
|
186
186
|
|
|
187
|
-
If you'd like to contribute new skills to the collection, open a PR adding a new folder inside `bundled-skills/` — it
|
|
188
|
-
will be automatically picked up on next sync.
|
|
189
|
-
|
|
190
187
|
---
|
|
191
188
|
|
|
192
189
|
## Beta Releases
|
|
@@ -1,6 +1,6 @@
|
|
|
1
1
|
{
|
|
2
2
|
"schemaVersion": 1,
|
|
3
|
-
"updatedAt": "2026-04-
|
|
3
|
+
"updatedAt": "2026-04-11T19:36:06.899Z",
|
|
4
4
|
"entries": [
|
|
5
5
|
"00-andruia-consultant",
|
|
6
6
|
"007",
|
|
@@ -1168,6 +1168,7 @@
|
|
|
1168
1168
|
"skill-developer",
|
|
1169
1169
|
"skill-improver",
|
|
1170
1170
|
"skill-installer",
|
|
1171
|
+
"skill-optimizer",
|
|
1171
1172
|
"skill-rails-upgrade",
|
|
1172
1173
|
"skill-router",
|
|
1173
1174
|
"skill-scanner",
|
|
@@ -1,9 +1,9 @@
|
|
|
1
1
|
---
|
|
2
2
|
title: Jetski/Cortex + Gemini Integration Guide
|
|
3
|
-
description: "Use antigravity-awesome-skills with Jetski/Cortex without hitting context-window overflow with 1.
|
|
3
|
+
description: "Use antigravity-awesome-skills with Jetski/Cortex without hitting context-window overflow with 1.401+ skills."
|
|
4
4
|
---
|
|
5
5
|
|
|
6
|
-
# Jetski/Cortex + Gemini: safe integration with 1,1.
|
|
6
|
+
# Jetski/Cortex + Gemini: safe integration with 1,1.401+ skills
|
|
7
7
|
|
|
8
8
|
This guide shows how to integrate the `antigravity-awesome-skills` repository with an agent based on **Jetski/Cortex + Gemini** (or similar frameworks) **without exceeding the model context window**.
|
|
9
9
|
|
|
@@ -23,7 +23,7 @@ Never do:
|
|
|
23
23
|
- concatenate all `SKILL.md` content into a single system prompt;
|
|
24
24
|
- re-inject the entire library for **every** request.
|
|
25
25
|
|
|
26
|
-
With over 1,1.
|
|
26
|
+
With over 1,1.401 skills, this approach fills the context window before user messages are even added, causing truncation.
|
|
27
27
|
|
|
28
28
|
---
|
|
29
29
|
|
|
@@ -20,7 +20,7 @@ This example shows one way to integrate **antigravity-awesome-skills** with a Je
|
|
|
20
20
|
- How to enforce a **maximum number of skills per turn** via `maxSkillsPerTurn`.
|
|
21
21
|
- How to choose whether to **truncate or error** when too many skills are requested via `overflowBehavior`.
|
|
22
22
|
|
|
23
|
-
This pattern avoids context overflow when you have 1,
|
|
23
|
+
This pattern avoids context overflow when you have 1,401+ skills installed.
|
|
24
24
|
|
|
25
25
|
---
|
|
26
26
|
|
|
@@ -6,7 +6,7 @@ This document keeps the repository's GitHub-facing discovery copy aligned with t
|
|
|
6
6
|
|
|
7
7
|
Preferred positioning:
|
|
8
8
|
|
|
9
|
-
> Installable GitHub library of 1,
|
|
9
|
+
> Installable GitHub library of 1,401+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and other AI coding assistants.
|
|
10
10
|
|
|
11
11
|
Key framing:
|
|
12
12
|
|
|
@@ -20,7 +20,7 @@ Key framing:
|
|
|
20
20
|
|
|
21
21
|
Preferred description:
|
|
22
22
|
|
|
23
|
-
> Installable GitHub library of 1,
|
|
23
|
+
> Installable GitHub library of 1,401+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
|
|
24
24
|
|
|
25
25
|
Preferred homepage:
|
|
26
26
|
|
|
@@ -28,7 +28,7 @@ Preferred homepage:
|
|
|
28
28
|
|
|
29
29
|
Preferred social preview:
|
|
30
30
|
|
|
31
|
-
- use a clean preview image that says `1,
|
|
31
|
+
- use a clean preview image that says `1,401+ Agentic Skills`;
|
|
32
32
|
- mention Claude Code, Cursor, Codex CLI, and Gemini CLI;
|
|
33
33
|
- avoid dense text and tiny logos that disappear in social cards.
|
|
34
34
|
|
|
@@ -69,7 +69,7 @@ For manual updates, you need:
|
|
|
69
69
|
The update process refreshes:
|
|
70
70
|
- Skills index (`skills_index.json`)
|
|
71
71
|
- Web app skills data (`apps\web-app\public\skills.json`)
|
|
72
|
-
- All 1,
|
|
72
|
+
- All 1,401+ skills from the skills directory
|
|
73
73
|
|
|
74
74
|
## When to Update
|
|
75
75
|
|
|
@@ -12,7 +12,7 @@ Install the library into Claude Code, then invoke focused skills directly in the
|
|
|
12
12
|
|
|
13
13
|
## Why use this repo for Claude Code
|
|
14
14
|
|
|
15
|
-
- It includes 1,
|
|
15
|
+
- It includes 1,401+ skills instead of a narrow single-domain starter pack.
|
|
16
16
|
- It supports the standard `.claude/skills/` path and the Claude Code plugin marketplace flow.
|
|
17
17
|
- It also ships generated bundle plugins so teams can install focused packs like `Essentials` or `Security Developer` from the marketplace metadata.
|
|
18
18
|
- It includes onboarding docs, bundles, and workflows so new users do not need to guess where to begin.
|
|
@@ -12,7 +12,7 @@ Install into the Gemini skills path, then ask Gemini to apply one skill at a tim
|
|
|
12
12
|
|
|
13
13
|
- It installs directly into the expected Gemini skills path.
|
|
14
14
|
- It includes both core software engineering skills and deeper agent/LLM-oriented skills.
|
|
15
|
-
- It helps new users get started with bundles and workflows rather than forcing a cold start from 1,
|
|
15
|
+
- It helps new users get started with bundles and workflows rather than forcing a cold start from 1,401+ files.
|
|
16
16
|
- It is useful whether you want a broad internal skill library or a single repo to test many workflows quickly.
|
|
17
17
|
|
|
18
18
|
## Install Gemini CLI Skills
|
|
@@ -18,7 +18,7 @@ Kiro is AWS's agentic AI IDE that combines:
|
|
|
18
18
|
|
|
19
19
|
Kiro's agentic capabilities are enhanced by skills that provide:
|
|
20
20
|
|
|
21
|
-
- **Domain expertise** across 1,
|
|
21
|
+
- **Domain expertise** across 1,401+ specialized areas
|
|
22
22
|
- **Best practices** from Anthropic, OpenAI, Google, Microsoft, and AWS
|
|
23
23
|
- **Workflow automation** for common development tasks
|
|
24
24
|
- **AWS-specific patterns** for serverless, infrastructure, and cloud architecture
|
|
@@ -14,7 +14,7 @@ If you came in through a **Claude Code** or **Codex** plugin instead of a full l
|
|
|
14
14
|
|
|
15
15
|
When you ran `npx antigravity-awesome-skills` or cloned the repository, you:
|
|
16
16
|
|
|
17
|
-
✅ **Downloaded 1,
|
|
17
|
+
✅ **Downloaded 1,401+ skill files** to your computer (default: `~/.gemini/antigravity/skills/`; or a custom path like `~/.agent/skills/` if you used `--path`)
|
|
18
18
|
✅ **Made them available** to your AI assistant
|
|
19
19
|
❌ **Did NOT enable them all automatically** (they're just sitting there, waiting)
|
|
20
20
|
|
|
@@ -34,7 +34,7 @@ Bundles are **curated groups** of skills organized by role. They help you decide
|
|
|
34
34
|
|
|
35
35
|
**Analogy:**
|
|
36
36
|
|
|
37
|
-
- You installed a toolbox with 1,
|
|
37
|
+
- You installed a toolbox with 1,401+ tools (✅ done)
|
|
38
38
|
- Bundles are like **labeled organizer trays** saying: "If you're a carpenter, start with these 10 tools"
|
|
39
39
|
- You can either **pick skills from the tray** or install that tray as a focused marketplace bundle plugin
|
|
40
40
|
|
|
@@ -212,7 +212,7 @@ Let's actually use a skill right now. Follow these steps:
|
|
|
212
212
|
|
|
213
213
|
## Step 5: Picking Your First Skills (Practical Advice)
|
|
214
214
|
|
|
215
|
-
Don't try to use all 1,
|
|
215
|
+
Don't try to use all 1,401+ skills at once. Here's a sensible approach:
|
|
216
216
|
|
|
217
217
|
If you want a tool-specific starting point before choosing skills, use:
|
|
218
218
|
|
|
@@ -343,7 +343,7 @@ Usually no, but if your AI doesn't recognize a skill:
|
|
|
343
343
|
|
|
344
344
|
### "Can I load all skills into the model at once?"
|
|
345
345
|
|
|
346
|
-
No. Even though you have 1,
|
|
346
|
+
No. Even though you have 1,401+ skills installed locally, you should **not** concatenate every `SKILL.md` into a single system prompt or context block.
|
|
347
347
|
|
|
348
348
|
The intended pattern is:
|
|
349
349
|
|
|
@@ -34,7 +34,7 @@ antigravity-awesome-skills/
|
|
|
34
34
|
├── 📄 CONTRIBUTING.md ← Contributor workflow
|
|
35
35
|
├── 📄 CATALOG.md ← Full generated catalog
|
|
36
36
|
│
|
|
37
|
-
├── 📁 skills/ ← 1,
|
|
37
|
+
├── 📁 skills/ ← 1,401+ skills live here
|
|
38
38
|
│ │
|
|
39
39
|
│ ├── 📁 brainstorming/
|
|
40
40
|
│ │ └── 📄 SKILL.md ← Skill definition
|
|
@@ -47,7 +47,7 @@ antigravity-awesome-skills/
|
|
|
47
47
|
│ │ └── 📁 2d-games/
|
|
48
48
|
│ │ └── 📄 SKILL.md ← Nested skills also supported
|
|
49
49
|
│ │
|
|
50
|
-
│ └── ... (1,
|
|
50
|
+
│ └── ... (1,401+ total)
|
|
51
51
|
│
|
|
52
52
|
├── 📁 apps/
|
|
53
53
|
│ └── 📁 web-app/ ← Interactive browser
|
|
@@ -100,7 +100,7 @@ antigravity-awesome-skills/
|
|
|
100
100
|
|
|
101
101
|
```
|
|
102
102
|
┌─────────────────────────┐
|
|
103
|
-
│ 1,
|
|
103
|
+
│ 1,401+ SKILLS │
|
|
104
104
|
└────────────┬────────────┘
|
|
105
105
|
│
|
|
106
106
|
┌────────────────────────┼────────────────────────┐
|
|
@@ -201,7 +201,7 @@ If you want a workspace-style manual install instead, cloning into `.agent/skill
|
|
|
201
201
|
│ ├── 📁 brainstorming/ │
|
|
202
202
|
│ ├── 📁 stripe-integration/ │
|
|
203
203
|
│ ├── 📁 react-best-practices/ │
|
|
204
|
-
│ └── ... (1,
|
|
204
|
+
│ └── ... (1,401+ total) │
|
|
205
205
|
└─────────────────────────────────────────┘
|
|
206
206
|
```
|
|
207
207
|
|
|
@@ -0,0 +1,266 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: skill-optimizer
|
|
3
|
+
description: "Diagnose and optimize Agent Skills (SKILL.md) with real session data and research-backed static analysis. Works with Claude Code, Codex, and any Agent Skills-compatible agent."
|
|
4
|
+
risk: safe
|
|
5
|
+
source: hqhq1025/skill-optimizer (MIT)
|
|
6
|
+
date_added: "2026-04-11"
|
|
7
|
+
---
|
|
8
|
+
|
|
9
|
+
## When to Use This Skill
|
|
10
|
+
|
|
11
|
+
- Use when skills are not triggering as expected or seem broken
|
|
12
|
+
- Use when you want to audit and improve your skill library's quality
|
|
13
|
+
- Use when you want to understand which skills are underperforming or wasting context tokens
|
|
14
|
+
|
|
15
|
+
## Rules
|
|
16
|
+
|
|
17
|
+
- **Read-only**: never modify skill files. Only output report.
|
|
18
|
+
- **All 8 dimensions**: do not skip any. If data is insufficient, report "N/A — insufficient session data" rather than omitting.
|
|
19
|
+
- **Quantify**: "you had 12 research tasks last week but the skill never triggered" beats "you often do research".
|
|
20
|
+
- **Suggest, don't prescribe**: give specific wording suggestions for description improvements, but frame as suggestions.
|
|
21
|
+
- **Show evidence**: for undertrigger claims, quote the actual user message that should have triggered the skill.
|
|
22
|
+
- **Evidence-based suggestions**: when suggesting description rewrites, cite the specific research finding that motivates the change (e.g., "front-load trigger keywords — MCP study shows 3.6x selection rate improvement").
|
|
23
|
+
|
|
24
|
+
## Overview
|
|
25
|
+
|
|
26
|
+
Analyze skills using **historical session data + static quality checks**, output a diagnostic report with P0/P1/P2 prioritized fixes. Scores each skill on a 5-point composite scale across 8 dimensions.
|
|
27
|
+
|
|
28
|
+
CSO (Claude/Agent Search Optimization) = writing skill descriptions so agents select the right skill at the right time. This skill checks for CSO violations.
|
|
29
|
+
|
|
30
|
+
## Usage
|
|
31
|
+
|
|
32
|
+
- `/optimize-skill` → scan all skills
|
|
33
|
+
- `/optimize-skill my-skill` → single skill
|
|
34
|
+
- `/optimize-skill skill-a skill-b` → multiple specified skills
|
|
35
|
+
|
|
36
|
+
## Data Sources
|
|
37
|
+
|
|
38
|
+
Auto-detect the current agent platform and scan the corresponding paths:
|
|
39
|
+
|
|
40
|
+
| Source | Claude Code | Codex | Shared |
|
|
41
|
+
|--------|------------|-------|--------|
|
|
42
|
+
| Session transcripts | `~/.claude/projects/**/*.jsonl` | `~/.codex/sessions/**/*.jsonl` | — |
|
|
43
|
+
| Skill files | `~/.claude/skills/*/SKILL.md` | `~/.codex/skills/*/SKILL.md` | `~/.agents/skills/*/SKILL.md` |
|
|
44
|
+
|
|
45
|
+
**Platform detection:** Check which directories exist. Scan all available sources — a user may have both Claude Code and Codex installed.
|
|
46
|
+
|
|
47
|
+
## Workflow
|
|
48
|
+
|
|
49
|
+
```
|
|
50
|
+
Identify target skills
|
|
51
|
+
↓
|
|
52
|
+
Collect session data (python3 scripts scan JSONL transcripts)
|
|
53
|
+
↓
|
|
54
|
+
Run 8 analysis dimensions
|
|
55
|
+
↓
|
|
56
|
+
Compute composite scores
|
|
57
|
+
↓
|
|
58
|
+
Output report with P0/P1/P2
|
|
59
|
+
```
|
|
60
|
+
|
|
61
|
+
### Step 1: Identify Target Skills
|
|
62
|
+
|
|
63
|
+
Scan skill directories in order: `~/.claude/skills/`, `~/.codex/skills/`, `~/.agents/skills/`. Deduplicate by skill name (same name in multiple locations = same skill). For each, read `SKILL.md` and extract:
|
|
64
|
+
- name, description (from YAML frontmatter)
|
|
65
|
+
- trigger keywords (from description field)
|
|
66
|
+
- defined workflow steps (Step 1/2/3... or ### sections under Workflow)
|
|
67
|
+
- word count
|
|
68
|
+
|
|
69
|
+
If user specified skill names, filter to only those.
|
|
70
|
+
|
|
71
|
+
### Step 2: Collect Session Data
|
|
72
|
+
|
|
73
|
+
Use python3 scripts via Bash to scan session JSONL files. Extract:
|
|
74
|
+
|
|
75
|
+
**Claude Code sessions** (`~/.claude/projects/**/*.jsonl`):
|
|
76
|
+
- `Skill` tool_use calls (which skills were invoked)
|
|
77
|
+
- User messages (full text)
|
|
78
|
+
- Assistant messages after skill invocation (for workflow tracking)
|
|
79
|
+
- User messages after skill invocation (for reaction analysis)
|
|
80
|
+
|
|
81
|
+
**Codex sessions** (`~/.codex/sessions/**/*.jsonl`):
|
|
82
|
+
- `session_meta` events → extract `base_instructions` for skill loading evidence
|
|
83
|
+
- `response_item` events → assistant outputs (workflow tracking)
|
|
84
|
+
- `event_msg` events → tool execution and skill-related events
|
|
85
|
+
- User messages from `turn_context` events (for reaction analysis)
|
|
86
|
+
|
|
87
|
+
**Note:** Codex injects skills via context rather than explicit `Skill` tool calls. Skill loading (present in `base_instructions`) does NOT equal active invocation. To detect actual use, search for skill-specific workflow markers (step headers, output formats) in `response_item` content within that session. A skill is "invoked" only if the agent produced output following the skill's defined workflow.
|
|
88
|
+
|
|
89
|
+
**Aggregated:**
|
|
90
|
+
- Per-skill: invocation count, trigger keyword match count
|
|
91
|
+
- Per-skill: user reaction sentiment after invocation
|
|
92
|
+
- Per-skill: workflow step completion markers
|
|
93
|
+
|
|
94
|
+
### Step 3: Run 8 Analysis Dimensions
|
|
95
|
+
|
|
96
|
+
**You MUST run ALL 8 dimensions.** The baseline behavior without this skill is to skip dimensions 4.2, 4.3, 4.5b, and 4.8. These are the most valuable dimensions — do not skip them.
|
|
97
|
+
|
|
98
|
+
#### 4.1 Trigger Rate
|
|
99
|
+
|
|
100
|
+
Count how many times each skill was actually invoked vs how many times its trigger keywords appeared in user messages.
|
|
101
|
+
|
|
102
|
+
**Claude Code:** count `Skill` tool_use calls in transcripts.
|
|
103
|
+
**Codex:** count sessions where the agent produced output following the skill's workflow markers (not merely loaded in context).
|
|
104
|
+
|
|
105
|
+
**Diagnose:**
|
|
106
|
+
- Never triggered → skill may be useless or trigger words wrong
|
|
107
|
+
- Keywords match >> actual invocations → undertrigger problem, description needs work
|
|
108
|
+
- High frequency → core skill, worth optimizing
|
|
109
|
+
|
|
110
|
+
#### 4.2 Post-Invocation User Reaction
|
|
111
|
+
|
|
112
|
+
**This dimension is critical and easy to skip. Do not skip it.**
|
|
113
|
+
|
|
114
|
+
After a skill is invoked in a session, read the user's next 3 messages. Classify:
|
|
115
|
+
- **Negative**: "no", "wrong", "never mind", "not what I wanted", user interrupts
|
|
116
|
+
- **Correction**: user re-describes their intent, manually overrides skill output
|
|
117
|
+
- **Positive**: "good", "ok", "continue", "nice", user follows the workflow
|
|
118
|
+
- **Silent switch**: user changes topic entirely (likely false positive trigger)
|
|
119
|
+
|
|
120
|
+
Report per-skill satisfaction rate.
|
|
121
|
+
|
|
122
|
+
#### 4.3 Workflow Completion Rate
|
|
123
|
+
|
|
124
|
+
**This dimension is critical and easy to skip. Do not skip it.**
|
|
125
|
+
|
|
126
|
+
For each skill invocation found in session data:
|
|
127
|
+
1. Extract the skill's defined steps from SKILL.md
|
|
128
|
+
2. Search the assistant messages in that session for step markers (Step N, specific output formats defined in the skill)
|
|
129
|
+
3. Calculate: how far did execution get?
|
|
130
|
+
|
|
131
|
+
Report: `{skill-name} (N steps): avg completed Step X/N (Y%)`
|
|
132
|
+
|
|
133
|
+
If a specific step is frequently where execution stops, flag it.
|
|
134
|
+
|
|
135
|
+
#### 4.4 Static Quality Analysis
|
|
136
|
+
|
|
137
|
+
Check each SKILL.md against these 14 rules:
|
|
138
|
+
|
|
139
|
+
| Check | Pass Criteria |
|
|
140
|
+
|-------|--------------|
|
|
141
|
+
| Frontmatter format | Only `name` + `description`, total < 1024 chars |
|
|
142
|
+
| Name format | Letters, numbers, hyphens only |
|
|
143
|
+
| Description trigger | Starts with "Use when..." or has explicit trigger conditions |
|
|
144
|
+
| Description workflow leak | Description does NOT summarize the skill's workflow steps (CSO violation) |
|
|
145
|
+
| Description pushiness | Description actively claims scenarios where it should be used, not just passive |
|
|
146
|
+
| Overview section | Present |
|
|
147
|
+
| Rules section | Present |
|
|
148
|
+
| MUST/NEVER density | Count ALL-CAPS directive words; >5 per 100 words = flag |
|
|
149
|
+
| Word count | < 500 words (flag if over) |
|
|
150
|
+
| Narrative anti-pattern | No "In session X, we found..." storytelling |
|
|
151
|
+
| YAML quoting safety | description containing `: ` must be wrapped in double quotes |
|
|
152
|
+
| Critical info position | Core trigger conditions and primary actions must be in the first 20% of SKILL.md |
|
|
153
|
+
| Description 250-char check | Primary trigger keywords must appear within the first 250 characters of description |
|
|
154
|
+
| Trigger condition count | ≤ 2 trigger conditions in description is ideal |
|
|
155
|
+
|
|
156
|
+
#### 4.5a False Positive Rate (Overtrigger)
|
|
157
|
+
|
|
158
|
+
Skill was invoked but user immediately rejected or ignored it.
|
|
159
|
+
|
|
160
|
+
#### 4.5b Undertrigger Detection
|
|
161
|
+
|
|
162
|
+
**This is the highest-value dimension.** For each skill, extract its **capability keywords** (not just trigger keywords — what the skill CAN do). Then scan user messages for tasks that match those capabilities but where the skill was NOT invoked.
|
|
163
|
+
|
|
164
|
+
Report: which user messages SHOULD have triggered the skill but didn't, and suggest description improvements.
|
|
165
|
+
|
|
166
|
+
**Compounding Risk Assessment:**
|
|
167
|
+
For skills with chronic undertriggering (0 triggers across 5+ sessions where relevant tasks appeared), flag as "compounding risk" — undertriggered skills cannot self-improve through usage feedback, causing the gap to widen over time. Recommend immediate description rewrite as P0.
|
|
168
|
+
|
|
169
|
+
#### 4.6 Cross-Skill Conflicts
|
|
170
|
+
|
|
171
|
+
Compare all skill pairs:
|
|
172
|
+
- Trigger keyword overlap (same keywords in two descriptions)
|
|
173
|
+
- Workflow overlap (two skills teach similar processes)
|
|
174
|
+
- Contradictory guidance
|
|
175
|
+
|
|
176
|
+
#### 4.7 Environment Consistency
|
|
177
|
+
|
|
178
|
+
For each skill, extract referenced:
|
|
179
|
+
- File paths → check if they exist (`test -e`)
|
|
180
|
+
- CLI tools → check if installed (`which`)
|
|
181
|
+
- Directories → check if they exist
|
|
182
|
+
|
|
183
|
+
Flag any broken references.
|
|
184
|
+
|
|
185
|
+
#### 4.8 Token Economics
|
|
186
|
+
|
|
187
|
+
**This dimension is critical and easy to skip. Do not skip it.**
|
|
188
|
+
|
|
189
|
+
For each skill:
|
|
190
|
+
- Word count (from Step 1)
|
|
191
|
+
- Trigger frequency (from 4.1)
|
|
192
|
+
- Cost-effectiveness = trigger count / word count
|
|
193
|
+
- Flag: large + never-triggered skills as candidates for removal or compression
|
|
194
|
+
|
|
195
|
+
**Progressive Disclosure Tier Check:**
|
|
196
|
+
Evaluate each skill against the 3-tier loading model:
|
|
197
|
+
- Tier 1 (frontmatter): ~100 tokens. Check: is description ≤ 1024 chars?
|
|
198
|
+
- Tier 2 (SKILL.md body): <500 lines recommended. Check: word count.
|
|
199
|
+
- Tier 3 (reference files): loaded on demand. Check: does skill use reference files for detailed content, or cram everything into SKILL.md?
|
|
200
|
+
|
|
201
|
+
Flag skills that put 500+ words in SKILL.md without using reference files as "poor progressive disclosure".
|
|
202
|
+
|
|
203
|
+
### Step 4: Composite Score
|
|
204
|
+
|
|
205
|
+
Rate each skill on a 5-point scale:
|
|
206
|
+
|
|
207
|
+
| Score | Meaning |
|
|
208
|
+
|-------|---------|
|
|
209
|
+
| 5 | Healthy: high trigger rate, positive reactions, complete workflows, clean static |
|
|
210
|
+
| 4 | Good: minor issues in 1-2 dimensions |
|
|
211
|
+
| 3 | Needs attention: significant gap in 1 dimension or minor gaps in 3+ |
|
|
212
|
+
| 2 | Problematic: never triggered, or negative user reactions, or major static issues |
|
|
213
|
+
| 1 | Broken: doesn't work, references missing, or fundamentally misaligned |
|
|
214
|
+
|
|
215
|
+
**Scored dimensions** (weighted average):
|
|
216
|
+
- Trigger rate: 25%
|
|
217
|
+
- User reaction: 20%
|
|
218
|
+
- Workflow completion: 15%
|
|
219
|
+
- Static quality: 15%
|
|
220
|
+
- Undertrigger: 15%
|
|
221
|
+
- Token economics: 10%
|
|
222
|
+
|
|
223
|
+
**Qualitative dimensions** (reported but not scored):
|
|
224
|
+
- 4.5a Overtrigger: reported as count + examples
|
|
225
|
+
- 4.6 Cross-Skill Conflicts: reported as conflict pairs
|
|
226
|
+
- 4.7 Environment Consistency: reported as pass/fail per reference
|
|
227
|
+
|
|
228
|
+
## Report Format
|
|
229
|
+
|
|
230
|
+
```markdown
|
|
231
|
+
# Skill Optimization Report
|
|
232
|
+
**Date**: {date}
|
|
233
|
+
**Scope**: {all / specified skills}
|
|
234
|
+
**Session data**: {N} sessions, {date range}
|
|
235
|
+
|
|
236
|
+
## Overview
|
|
237
|
+
| Skill | Triggers | Reaction | Completion | Static | Undertrigger | Token | Score |
|
|
238
|
+
|-------|----------|----------|------------|--------|--------------|-------|-------|
|
|
239
|
+
| example-skill | 2 | 100% | 86% | B+ | 1 miss | 486w | 4/5 |
|
|
240
|
+
|
|
241
|
+
## P0 Fixes (blocking usage)
|
|
242
|
+
1. ...
|
|
243
|
+
|
|
244
|
+
## P1 Improvements (better experience)
|
|
245
|
+
1. ...
|
|
246
|
+
|
|
247
|
+
## P2 Optional Optimizations
|
|
248
|
+
1. ...
|
|
249
|
+
|
|
250
|
+
## Per-Skill Diagnostics
|
|
251
|
+
### {skill-name}
|
|
252
|
+
#### 4.1 Trigger Rate
|
|
253
|
+
...
|
|
254
|
+
#### 4.2 User Reaction
|
|
255
|
+
...
|
|
256
|
+
(all 8 dimensions)
|
|
257
|
+
```
|
|
258
|
+
|
|
259
|
+
## Research Background
|
|
260
|
+
|
|
261
|
+
The analysis dimensions in this report are grounded in the following research:
|
|
262
|
+
- **Undertrigger detection**: Memento-Skills (arXiv:2603.18743) — skills as structured files require accurate routing; unrouted skills cannot self-improve via the read-write learning loop
|
|
263
|
+
- **Description quality**: MCP Description Quality (arXiv:2602.18914) — well-written descriptions achieve 72% tool selection rate vs. 20% random baseline (3.6x improvement)
|
|
264
|
+
- **Information position**: Lost in the Middle (Liu et al., TACL 2024) — U-shaped LLM attention curve
|
|
265
|
+
- **Format impact**: He et al. (arXiv:2411.10541) — format changes alone can cause 9-40% performance variance
|
|
266
|
+
- **Instruction compliance**: IFEval (arXiv:2311.07911) — LLMs struggle with multi-constraint prompts
|
package/package.json
CHANGED
package/skills_index.json
CHANGED
|
@@ -25596,6 +25596,30 @@
|
|
|
25596
25596
|
"reasons": []
|
|
25597
25597
|
}
|
|
25598
25598
|
},
|
|
25599
|
+
{
|
|
25600
|
+
"id": "skill-optimizer",
|
|
25601
|
+
"path": "skills/skill-optimizer",
|
|
25602
|
+
"category": "meta",
|
|
25603
|
+
"name": "skill-optimizer",
|
|
25604
|
+
"description": "Diagnose and optimize Agent Skills (SKILL.md) with real session data and research-backed static analysis. Works with Claude Code, Codex, and any Agent Skills-compatible agent.",
|
|
25605
|
+
"risk": "safe",
|
|
25606
|
+
"source": "hqhq1025/skill-optimizer (MIT)",
|
|
25607
|
+
"date_added": "2026-04-11",
|
|
25608
|
+
"plugin": {
|
|
25609
|
+
"targets": {
|
|
25610
|
+
"codex": "blocked",
|
|
25611
|
+
"claude": "blocked"
|
|
25612
|
+
},
|
|
25613
|
+
"setup": {
|
|
25614
|
+
"type": "none",
|
|
25615
|
+
"summary": "",
|
|
25616
|
+
"docs": null
|
|
25617
|
+
},
|
|
25618
|
+
"reasons": [
|
|
25619
|
+
"target_specific_home_path"
|
|
25620
|
+
]
|
|
25621
|
+
}
|
|
25622
|
+
},
|
|
25599
25623
|
{
|
|
25600
25624
|
"id": "skill-rails-upgrade",
|
|
25601
25625
|
"path": "skills/skill-rails-upgrade",
|