promptup-plugin 0.2.1 → 0.2.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +66 -31
- package/dist/index.js +2 -2
- package/package.json +1 -1
package/README.md
CHANGED
|
@@ -10,36 +10,86 @@ Zero infrastructure. Just SQLite at `~/.promptup/`.
|
|
|
10
10
|
npx promptup-plugin
|
|
11
11
|
```
|
|
12
12
|
|
|
13
|
-
|
|
13
|
+
Installs MCP server, commands, hooks, statusline, and default config. Restart Claude Code to activate.
|
|
14
14
|
|
|
15
|
-
##
|
|
15
|
+
## How it works
|
|
16
16
|
|
|
17
|
-
|
|
17
|
+
### Session tracking
|
|
18
|
+
Every Claude Code session is passively tracked via hooks. On session creation, the current git branch is recorded. This links sessions to PRs automatically.
|
|
19
|
+
|
|
20
|
+
### Evaluation
|
|
21
|
+
`/pup:eval` spawns an independent `claude -p` to analyze your session transcript across 11 dimensions. It extracts developer decisions (steers, rejects, validates, scopes) and generates coaching recommendations with before/after examples from your actual prompts. Falls back to heuristic scoring if Claude is unavailable.
|
|
22
|
+
|
|
23
|
+
### PR reports
|
|
24
|
+
`/pup:pr-report` matches sessions to the current branch, auto-evaluates unscored sessions, gathers all decisions, and computes a Decision Quality Score (DQS). The session-to-PR link uses `sessions.branch` — set when the session was created. Optionally posts as a GitHub PR comment.
|
|
25
|
+
|
|
26
|
+
### DQS formula
|
|
27
|
+
```
|
|
28
|
+
DQS = autonomy * 25% + discipline * 25% + validation * 30% + diversity * 20%
|
|
29
|
+
```
|
|
30
|
+
- **Autonomy**: How often you steer, reject, or modify AI output vs accepting
|
|
31
|
+
- **Discipline**: Ratio of modifications to blind accepts
|
|
32
|
+
- **Validation**: How often you verify/test output
|
|
33
|
+
- **Diversity**: How many decision types you use (steer, reject, validate, modify, scope, accept)
|
|
34
|
+
|
|
35
|
+
## Commands
|
|
36
|
+
|
|
37
|
+
| Command | What |
|
|
38
|
+
|---------|------|
|
|
39
|
+
| `/pup:eval` | Evaluate session across 11 skill dimensions |
|
|
40
|
+
| `/pup:pr-report` | DQS report for current branch (add `--post` to comment on PR) |
|
|
41
|
+
| `/pup:status` | Sessions tracked, evaluations, decision counts |
|
|
42
|
+
| `/pup:config` | View/modify settings (`/pup:config evaluation.auto_trigger=prompt_count`) |
|
|
43
|
+
| `/pup:update` | Check for and install updates |
|
|
44
|
+
|
|
45
|
+
## MCP Tools
|
|
18
46
|
|
|
19
47
|
| Tool | What |
|
|
20
48
|
|------|------|
|
|
21
|
-
| `evaluate_session` | Score
|
|
22
|
-
| `generate_pr_report` |
|
|
23
|
-
| `get_status` |
|
|
24
|
-
| `configure` |
|
|
49
|
+
| `evaluate_session` | Score session, extract decisions, coaching recommendations |
|
|
50
|
+
| `generate_pr_report` | Match sessions to branch, compute DQS, optionally post to GitHub |
|
|
51
|
+
| `get_status` | Tracking status and recent activity |
|
|
52
|
+
| `configure` | Show/modify config (dot-path keys) |
|
|
53
|
+
|
|
54
|
+
## Statusline
|
|
55
|
+
|
|
56
|
+
`pupmeter` shows your latest composite score in the Claude Code status bar with a recommendation tip.
|
|
57
|
+
|
|
58
|
+
## Hooks
|
|
59
|
+
|
|
60
|
+
| Hook | What | Async |
|
|
61
|
+
|------|------|-------|
|
|
62
|
+
| `PostToolUse` | Logs tool events to `~/.promptup/tool-events.jsonl` | Yes |
|
|
63
|
+
| `Stop` | Captures session end to `~/.promptup/session-end.json` | Yes |
|
|
64
|
+
| `SessionStart` | Background update check against npm | Yes |
|
|
65
|
+
| `UserPromptSubmit` | Auto-eval every N prompts (off by default) | Yes |
|
|
66
|
+
|
|
67
|
+
## Configuration
|
|
25
68
|
|
|
26
|
-
|
|
27
|
-
|
|
28
|
-
|
|
29
|
-
|
|
69
|
+
```bash
|
|
70
|
+
/pup:config # show all settings
|
|
71
|
+
/pup:config evaluation.auto_trigger=prompt_count # enable auto-eval
|
|
72
|
+
/pup:config evaluation.interval=5 # eval every 5 prompts
|
|
73
|
+
/pup:config evaluation.weight_profile=bugfix # scoring profile
|
|
74
|
+
/pup:config decisions.signal_filter=all # show all decisions
|
|
75
|
+
```
|
|
76
|
+
|
|
77
|
+
Config file: `~/.promptup/config.json`
|
|
30
78
|
|
|
31
|
-
|
|
79
|
+
## 11 Dimensions
|
|
32
80
|
|
|
33
|
-
**
|
|
81
|
+
**Base (interaction quality):**
|
|
82
|
+
task decomposition, prompt specificity, output validation, iteration quality, strategic tool usage, context management
|
|
83
|
+
|
|
84
|
+
**Domain (depth of understanding):**
|
|
85
|
+
architectural awareness, error anticipation, technical vocabulary, dependency reasoning, tradeoff articulation
|
|
34
86
|
|
|
35
87
|
## Requirements
|
|
36
88
|
|
|
37
89
|
- Node.js >= 20
|
|
38
90
|
- Claude Code
|
|
39
91
|
|
|
40
|
-
## Manual setup
|
|
41
|
-
|
|
42
|
-
If you prefer not to use the installer:
|
|
92
|
+
## Manual setup
|
|
43
93
|
|
|
44
94
|
```json
|
|
45
95
|
{
|
|
@@ -52,21 +102,6 @@ If you prefer not to use the installer:
|
|
|
52
102
|
}
|
|
53
103
|
```
|
|
54
104
|
|
|
55
|
-
## Configuration
|
|
56
|
-
|
|
57
|
-
Run `configure` with no args to see all settings:
|
|
58
|
-
|
|
59
|
-
```
|
|
60
|
-
evaluation.auto_trigger off | prompt_count | session_end
|
|
61
|
-
evaluation.interval prompts between auto-evals (default: 10)
|
|
62
|
-
evaluation.weight_profile balanced | greenfield | bugfix | refactor | security_review
|
|
63
|
-
evaluation.feedback_detail brief | standard | detailed
|
|
64
|
-
decisions.signal_filter high | high+medium | all
|
|
65
|
-
pr_report.auto_post true | false
|
|
66
|
-
```
|
|
67
|
-
|
|
68
|
-
Config file: `~/.promptup/config.json`
|
|
69
|
-
|
|
70
105
|
## Uninstall
|
|
71
106
|
|
|
72
107
|
```bash
|
package/dist/index.js
CHANGED
|
@@ -20,14 +20,14 @@ const server = new McpServer({
|
|
|
20
20
|
version: '1.0.0',
|
|
21
21
|
});
|
|
22
22
|
// Tool 1: evaluate_session
|
|
23
|
-
server.tool('evaluate_session', 'Evaluate a coding session across 11 skill dimensions. Spawns an independent Claude analysis of the session transcript. Returns composite score, dimension breakdown, trends,
|
|
23
|
+
server.tool('evaluate_session', 'Evaluate a coding session across 11 skill dimensions. Spawns an independent Claude analysis of the session transcript. Extracts developer decisions (steers, rejects, validates, scopes). Returns composite score, dimension breakdown, trends, coaching recommendations with before/after examples. Session is tagged with current git branch for PR matching.', {
|
|
24
24
|
session_id: z.string().optional().describe('Session ID to evaluate. If omitted, evaluates the most recent session.'),
|
|
25
25
|
}, async (args) => {
|
|
26
26
|
const result = await handleEvaluateSession(args);
|
|
27
27
|
return { ...result };
|
|
28
28
|
});
|
|
29
29
|
// Tool 2: generate_pr_report
|
|
30
|
-
server.tool('generate_pr_report', 'Generate a Decision Quality Score (DQS) report for a git branch. Matches
|
|
30
|
+
server.tool('generate_pr_report', 'Generate a Decision Quality Score (DQS) report for a git branch. Matches sessions to the branch via sessions.branch column (set on session creation). Auto-evaluates unscored sessions, extracts decisions, computes DQS (autonomy + discipline + validation + diversity). Optionally posts as a GitHub PR comment via gh CLI.', {
|
|
31
31
|
branch: z.string().optional().describe('Git branch name. Defaults to current branch.'),
|
|
32
32
|
post: z.boolean().optional().default(false).describe('If true, post the report as a GitHub PR comment via gh CLI.'),
|
|
33
33
|
}, async (args) => {
|
package/package.json
CHANGED