bigpowers 2.34.1 → 2.35.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (48) hide show
  1. package/.pi/package.json +2 -2
  2. package/.pi/prompts/deploy.md +53 -28
  3. package/.pi/prompts/develop-tdd.md +5 -80
  4. package/.pi/prompts/migrate-spec.md +273 -197
  5. package/.pi/prompts/publish-package.md +125 -67
  6. package/.pi/prompts/release-branch.md +85 -69
  7. package/.pi/prompts/security-review.md +323 -0
  8. package/.pi/prompts/smoke-test.md +98 -58
  9. package/.pi/prompts/using-bigpowers.md +2 -2
  10. package/.pi/prompts/validate-contracts.md +169 -54
  11. package/.pi/prompts/wire-ci.md +147 -89
  12. package/.pi/skills/deploy/SKILL.md +53 -28
  13. package/.pi/skills/develop-tdd/SKILL.md +5 -80
  14. package/.pi/skills/migrate-spec/SKILL.md +273 -197
  15. package/.pi/skills/publish-package/SKILL.md +125 -67
  16. package/.pi/skills/release-branch/SKILL.md +85 -69
  17. package/.pi/skills/security-review/SKILL.md +324 -0
  18. package/.pi/skills/smoke-test/SKILL.md +98 -58
  19. package/.pi/skills/using-bigpowers/SKILL.md +2 -2
  20. package/.pi/skills/validate-contracts/SKILL.md +169 -54
  21. package/.pi/skills/wire-ci/SKILL.md +147 -89
  22. package/CHANGELOG.md +14 -0
  23. package/README.md +4 -4
  24. package/SKILL-INDEX.md +2 -2
  25. package/deploy/REFERENCE.md +82 -0
  26. package/deploy/SKILL.md +3 -63
  27. package/develop-tdd/SKILL.md +5 -80
  28. package/migrate-spec/REFERENCE.md +268 -0
  29. package/migrate-spec/SKILL.md +5 -199
  30. package/package.json +2 -2
  31. package/publish-package/REFERENCE.md +239 -0
  32. package/publish-package/SKILL.md +8 -192
  33. package/release-branch/REFERENCE.md +83 -0
  34. package/release-branch/SKILL.md +2 -69
  35. package/scripts/generate-reference-tables.sh +1 -0
  36. package/scripts/sync-skills.sh +4 -1
  37. package/security-review/REFERENCE-confidence-rubric.md +85 -0
  38. package/security-review/REFERENCE-false-positives.md +68 -0
  39. package/security-review/REFERENCE-vuln-categories.md +103 -0
  40. package/security-review/SKILL.md +63 -0
  41. package/skills-lock.json +14 -9
  42. package/smoke-test/REFERENCE.md +162 -0
  43. package/smoke-test/SKILL.md +5 -130
  44. package/using-bigpowers/SKILL.md +2 -2
  45. package/validate-contracts/REFERENCE.md +183 -0
  46. package/validate-contracts/SKILL.md +6 -77
  47. package/wire-ci/REFERENCE.md +257 -0
  48. package/wire-ci/SKILL.md +8 -210
@@ -0,0 +1,324 @@
1
+ ---
2
+ name: security-review
3
+ description: "> AI-powered security analysis of code changes — traces data flow, detects injection, auth bypass, secrets exposure, and unsafe deserialization across files. Use when reviewing pending changes, before release-branch, during verify-work Phase 5, during build-epic Step 0 threat modeling, or when the user says \"security review\" or \"scan for vulns\"."
4
+ ---
5
+
6
+
7
+ # Security Review
8
+
9
+ > **HARD GATE** — Requires git context (branch with merge-base or diff). Never
10
+ > writes files outside `specs/security/`. Findings below confidence 8/10 are
11
+ > suppressed. **→ verify:** `git rev-parse HEAD >/dev/null 2>&1 && echo "ok" || echo "BLOCKED"`
12
+
13
+ ## 5-phase scan
14
+
15
+ | # | Phase | What |
16
+ |---|-------|------|
17
+ | 1 | **Scope Resolution** | Detect diff via `git diff --merge-base origin/HEAD`; resolve languages/frameworks from dependency files |
18
+ | 2 | **Context Research** | Identify existing security patterns, sanitization, auth model in the codebase |
19
+ | 3 | **Vulnerability Assessment** | Trace user input → sink; check auth boundaries, crypto, deserialization, path ops |
20
+ | 4 | **False-Positive Filtering** | Cross-check each finding against exclusion rules; reject confidence < 8 |
21
+ | 5 | **Report Generation** | Output structured markdown: file:line, severity, category, exploit scenario, fix |
22
+
23
+ ## Categories
24
+
25
+ Covered: SQLi, XSS, SSRF, command injection, auth bypass, unsafe deserialization, path traversal, IDOR, crypto flaws, secrets exposure, template injection, NoSQLi
26
+
27
+ ## Integration points
28
+
29
+ | Skill | Touchpoint |
30
+ |-------|------------|
31
+ | `build-epic` | Step 0 — threat-model epic scope → `specs/security/epics/<id>/THREAT_MODEL.md` |
32
+ | `plan-work` | `security:` field (none/low/medium/high) on story tasks |
33
+ | `plan-release` | +2 WSJF risk boost for HIGH+ risk epics |
34
+ | `audit-code` | Checklist: "diff scanned — no unaddressed HIGH findings" |
35
+ | `request-review` | Inject threat model categories + false-positive rules into reviewer prompt |
36
+ | `investigate-bug` | Security-impact assessment in RCA (NONE→CRITICAL) |
37
+ | `validate-fix` | Recurrence hardening check for security bugs |
38
+ | `verify-work` | Phase 5 — blocks on HIGH findings ≥ 8 confidence |
39
+ | `release-branch` | Hard gate — blocks merge if unresolved HIGH findings |
40
+
41
+ ## Report format
42
+
43
+ Each finding: **`File:Line` — Severity — Category**
44
+ - Description: how the vulnerability manifests
45
+ - Exploit scenario: concrete attack path
46
+ - Recommendation: fix with code example
47
+
48
+ ## Reference files
49
+
50
+ - [Vuln categories](REFERENCE-vuln-categories.md) — detection guidance per vuln type
51
+ - [False positives](REFERENCE-false-positives.md) — hard exclusions + precedent
52
+ - [Confidence rubric](REFERENCE-confidence-rubric.md) — scoring methodology (0–10)
53
+
54
+ ## Verify
55
+
56
+ ```bash
57
+ test -d specs/security && echo "OK: specs/security/ exists" || mkdir -p specs/security
58
+ grep -q "Merge-base\|merge.base\|git diff" SKILL.md && echo "OK: git context verified"
59
+ ```
60
+
61
+ ---
62
+
63
+ # Confidence Scoring Rubric
64
+
65
+ Every finding that survives Phase 4 false-positive filtering receives a confidence
66
+ score from 1 (speculative) to 10 (certain). Only findings ≥ 8 are reported.
67
+
68
+ ## Score 9–10: Certain Exploit Path
69
+
70
+ **Criteria:**
71
+ - Concrete, testable exploit with clear reproduction steps
72
+ - No assumptions about uncommon configurations
73
+ - No chain of multiple unlikely conditions
74
+ - Attacker has full control over the input vector
75
+
76
+ **Examples:**
77
+ - User-supplied SQL in a `SELECT` statement with no parameterization
78
+ - `os.system(f"rm {user_path}")` where user controls the path
79
+ - Pickle deserialization of user-supplied data without any wrapping
80
+
81
+ **Severity:** HIGH
82
+
83
+ ## Score 8: Clear Vulnerability Pattern
84
+
85
+ **Criteria:**
86
+ - Well-known vulnerability pattern with standard exploitation method
87
+ - Requires specific conditions but conditions are commonly met
88
+ - Exploitability is well-documented in OWASP / CVE databases
89
+
90
+ **Examples:**
91
+ - JWT without signature verification in authentication middleware
92
+ - SSRF where attacker controls the full URL including host
93
+ - Hardcoded AWS secret key in source code
94
+
95
+ **Severity:** HIGH or MEDIUM
96
+
97
+ ## Score 7: Suspicious Pattern
98
+
99
+ **Criteria:**
100
+ - Unusual code that may indicate a vulnerability
101
+ - Requires specific conditions that may not be present
102
+ - Alternative secure interpretation is equally likely
103
+ - Defense-in-depth concern rather than direct exploit
104
+
105
+ **Examples:**
106
+ - A function accepting user input that passes through multiple layers before reaching a sink (unclear if sanitized)
107
+ - Custom encryption implementation (likely weak, but may not process sensitive data)
108
+ - Path construction that looks safe but has a subtle bypass
109
+
110
+ **Severity:** LOW or suppress
111
+
112
+ ## Score < 7: Do Not Report
113
+
114
+ **Criteria:**
115
+ - Theoretical concern without exploit path
116
+ - Requires unrealistic attacker capabilities
117
+ - Violates one or more hard exclusion rules
118
+ - Better handled by separate tooling (dependency scanner, SAST, secret scanner)
119
+ - Purely stylistic or best-practice concern without security impact
120
+
121
+ **Examples:**
122
+ - "This function doesn't validate all inputs" without proving the validated input is the attack surface
123
+ - "This uses MD5" where the hash is not used for security (e.g., cache key)
124
+ - "This function could consume too much memory" (DOS exclusion)
125
+
126
+ **Action:** Suppress entirely. Do not include in report.
127
+
128
+ ## Severity Mapping
129
+
130
+ Once confidence ≥ 8 is confirmed, map to severity:
131
+
132
+ | Severity | Impact | Examples |
133
+ |----------|--------|---------|
134
+ | **CRITICAL** | Remote compromise, full data breach | RCE, auth bypass with admin escalation, SQLi with data exfiltration |
135
+ | **HIGH** | Significant security boundary crossed | SSRF to internal services, hardcoded cloud credentials, insecure deserialization |
136
+ | **MEDIUM** | Limited impact or requires conditions | Stored XSS behind auth, IDOR on non-sensitive data, weak but not broken crypto |
137
+ | **LOW** | Defense-in-depth, minimal blast radius | Missing security header, verbose error messages in non-production |
138
+
139
+ ## Quality Gate
140
+
141
+ The confidence rubric double-checks each finding against three lenses:
142
+
143
+ | Lens | Question |
144
+ |------|----------|
145
+ | **Exploitability** | Can a real attacker trigger this from a trust boundary? |
146
+ | **Actionability** | Would a security engineer accept a fix recommendation for this? |
147
+ | **Precedent** | Has this type of finding passed/failed human review before? |
148
+
149
+ ---
150
+
151
+ # False-Positive Exclusion Rules
152
+
153
+ Applied during Phase 4 of the scan. Findings matching any hard exclusion are
154
+ automatically suppressed. Precedents from prior reviews guide borderline cases.
155
+
156
+ ## Hard Exclusions
157
+
158
+ Automatically exclude findings matching these patterns:
159
+
160
+ | # | Rule | Rationale |
161
+ |---|------|-----------|
162
+ | 1 | **Denial of Service (DOS)** — resource exhaustion, CPU/memory attacks | Handled separately; not actionable in code review |
163
+ | 2 | **Secrets on disk** if otherwise secured | Secrets management is a separate concern |
164
+ | 3 | **Rate limiting** concerns | Operational, not a code vulnerability |
165
+ | 4 | **Memory consumption / CPU exhaustion** | Not actionable in diff review |
166
+ | 5 | **Input validation on non-security-critical fields** without proven exploit path | Theoretical, not concrete |
167
+ | 6 | **GitHub Actions input sanitization** unless clearly triggerable via untrusted input | Most workflow vulns are not exploitable |
168
+ | 7 | **Lack of hardening measures** | Code is not expected to implement all best practices |
169
+ | 8 | **Race conditions / timing attacks** that are theoretical | Only report if concretely problematic |
170
+ | 9 | **Outdated third-party libraries** | Managed separately by dependency scanners |
171
+ | 10 | **Memory safety** in Rust or other memory-safe languages | Impossible by language guarantees |
172
+ | 11 | **Unit test files only** | Not production risk |
173
+ | 12 | **Log spoofing** | Outputting unsanitized input to logs is not a vuln |
174
+ | 13 | **SSRF that only controls path** | Only host/protocol control is exploitable |
175
+ | 14 | **User-controlled content in AI system prompts** | Not a security vulnerability |
176
+ | 15 | **Regex injection** | Injecting untrusted content into regex is not a vuln |
177
+ | 16 | **Regex DOS** | Excluded alongside general DOS |
178
+ | 17 | **Documentation files** (.md, .txt) | Insecure docs are not code vulnerabilities |
179
+ | 18 | **Lack of audit logs** | Not a vulnerability |
180
+
181
+ ## Precedent Rules
182
+
183
+ These guide borderline cases based on prior human review decisions:
184
+
185
+ | # | Precedent | Reasoning |
186
+ |---|-----------|-----------|
187
+ | 1 | **Logging high-value secrets in plaintext IS a vuln.** Logging URLs is safe. | Secrets in logs = credential exposure; URLs are not secrets |
188
+ | 2 | **UUIDs are unguessable** — no validation needed | Cryptographic property of UUID v4/v7 |
189
+ | 3 | **Environment variables and CLI flags are trusted values** | Attackers cannot modify these in secure environments |
190
+ | 4 | **Resource management issues** (memory leaks, fd leaks) are NOT valid | Operational, not security |
191
+ | 5 | **Tabnabbing, XS-Leaks, prototype pollution, open redirects** — do NOT report unless extremely high confidence | Subtle, low-impact, high false-positive rate |
192
+ | 6 | **React/Angular XSS** — safe unless `dangerouslySetInnerHTML`, `bypassSecurityTrustHtml`, etc. | Framework auto-escapes |
193
+ | 7 | **GitHub Action workflow vulns** — verify concrete attack path before reporting | Most are theoretical |
194
+ | 8 | **Client-side JS/TS auth checks** — not a vuln; server is authoritative | Client code is untrusted |
195
+ | 9 | **IPython notebook vulns** — only report if concrete untrusted-input trigger | Most are not exploitable |
196
+ | 10 | **Logging non-PII data** — not a vuln even if sensitive. Only PII/secrets/passwords. | Intent: operational logging vs credential exposure |
197
+ | 11 | **Shell script command injection** — only report if concrete untrusted-input path | Most shell scripts don't process untrusted input |
198
+
199
+ ## Confidence Scoring
200
+
201
+ Findings that survive exclusions get a confidence score (1–10):
202
+
203
+ | Range | Meaning | Action |
204
+ |-------|---------|--------|
205
+ | 9–10 | Certain exploit path, testable | Report as HIGH |
206
+ | 8 | Clear vulnerability pattern | Report as HIGH/MEDIUM |
207
+ | 7 | Suspicious, needs conditions | Report as LOW or suppress |
208
+ | <7 | Too speculative | **Do not report** |
209
+
210
+ **Hard threshold:** Only report findings with confidence ≥ 8.
211
+
212
+ ## Signal Quality Criteria
213
+
214
+ For remaining findings, assess:
215
+ 1. Is there a concrete, exploitable vulnerability with a clear attack path?
216
+ 2. Does this represent a real security risk (vs theoretical best practice)?
217
+ 3. Are there specific code locations and reproduction steps?
218
+ 4. Would this finding be actionable for a security team?
219
+
220
+ ---
221
+
222
+ # Vulnerability Categories — Detection Guidance
223
+
224
+ Each category: vulnerable pattern → safe pattern → code example.
225
+
226
+ ## SQL Injection
227
+
228
+ | Aspect | Detail |
229
+ |--------|--------|
230
+ | **Vulnerable** | String interpolation in SQL queries: `f"SELECT * FROM users WHERE id = {uid}"` |
231
+ | **Safe** | Parameterized queries / ORM: `cursor.execute("SELECT * FROM users WHERE id = %s", (uid,))` |
232
+ | **Look for** | f-strings, `+` concatenation, `format()` in query builders; raw SQL in ORM `.raw()` / `.execute()` |
233
+ | **False-positive guard** | Not a FP if the input is user-controlled (HTTP param, file, env var, CLI arg). Env vars are trusted (see exclusion rules). |
234
+
235
+ ## Cross-Site Scripting (XSS)
236
+
237
+ | Aspect | Detail |
238
+ |--------|--------|
239
+ | **Vulnerable** | `element.innerHTML = userInput`, `dangerouslySetInnerHTML={{__html: userInput}}` |
240
+ | **Safe** | `element.textContent = userInput`, React JSX (auto-escaped), template engines with auto-escaping |
241
+ | **Look for** | `.innerHTML`, `document.write()`, `dangerouslySetInnerHTML`, `v-html` (Vue), `bypassSecurityTrustHtml` (Angular) |
242
+ | **False-positive guard** | React/Angular components without unsafe methods are NOT vulnerable (see exclusion rules). |
243
+
244
+ ## Server-Side Request Forgery (SSRF)
245
+
246
+ | Aspect | Detail |
247
+ |--------|--------|
248
+ | **Vulnerable** | User-controlled URL passed to server-side HTTP client: `requests.get(user_url)` |
249
+ | **Safe** | URL allowlist validation, internal-network blocking, protocol/host restriction |
250
+ | **Look for** | User input → `fetch`, `requests.get`, `axios.get`, `urllib`, `curl`, `http.get`; host control only (path-only is excluded) |
251
+
252
+ ## Command Injection
253
+
254
+ | Aspect | Detail |
255
+ |--------|--------|
256
+ | **Vulnerable** | User input in shell commands: `os.system(f"ping {host}")`, `subprocess.run(f"grep {pattern} file", shell=True)` |
257
+ | **Safe** | `subprocess.run(["ping", host])` with arguments as list; `shlex.quote()` |
258
+ | **Look for** | `shell=True`, `os.system`, `os.popen`, `exec()`, `eval()`, `$()`, backticks |
259
+ | **False-positive guard** | Shell scripts without untrusted user input are generally not exploitable. |
260
+
261
+ ## Authentication/Authorization Bypass
262
+
263
+ | Aspect | Detail |
264
+ |--------|--------|
265
+ | **Vulnerable** | Missing auth check on protected endpoint; JWT without signature verification; hardcoded admin tokens |
266
+ | **Safe** | Consistent auth middleware; JWT with `RS256`/`HS256` verification; role-based access control |
267
+ | **Look for** | Routes without auth decorators; `@login_required` / `@require_auth` missing; JWT without `.verify()`; client-side auth checks only |
268
+
269
+ ## Unsafe Deserialization
270
+
271
+ | Aspect | Detail |
272
+ |--------|--------|
273
+ | **Vulnerable** | `pickle.load(user_data)`, `yaml.load(user_input)`, `JSON.parse()` on untrusted tokens, `eval(input())` |
274
+ | **Safe** | `yaml.safe_load()`, `json.loads()` (safe for JSON), `pickle.load(weights_only=True)` (PyTorch), schema validation |
275
+ | **Look for** | `pickle.load`, `yaml.load` (not safe_load), `torch.load(weights_only=False)`, `eval`, `marshal.load`, `node-serialize` |
276
+
277
+ ## Path Traversal
278
+
279
+ | Aspect | Detail |
280
+ |--------|--------|
281
+ | **Vulnerable** | User input in file paths: `open(f"/data/{filename}")`, `path.join(base, user_path)` |
282
+ | **Safe** | Path normalization + prefix check: `os.path.realpath(path).startswith(BASE_DIR)`; allowlist of valid filenames |
283
+ | **Look for** | `open()`, `read_file()`, `os.path.join` with user input; `../` traversal without normalization |
284
+
285
+ ## Insecure Direct Object Reference (IDOR)
286
+
287
+ | Aspect | Detail |
288
+ |--------|--------|
289
+ | **Vulnerable** | API endpoint uses user-supplied ID without ownership check: `GET /api/order/{order_id}` — returns any user's order |
290
+ | **Safe** | Ownership verification: verify `order.user_id == current_user.id` before returning data |
291
+ | **Look for** | CRUD endpoints that accept IDs without authorization; horizontal/vertical privilege checks missing |
292
+
293
+ ## Weak Cryptography
294
+
295
+ | Aspect | Detail |
296
+ |--------|--------|
297
+ | **Vulnerable** | MD5/SHA1 for passwords; ECB mode; hardcoded keys; `random` module (not `secrets`); short key lengths |
298
+ | **Safe** | `bcrypt`/`argon2` for passwords; AES-GCM; `secrets` module; RSA 2048+; proper IV generation |
299
+ | **Look for** | `md5`, `sha1`, `DES`, `ECB`, `PKCS1_v1_5`, `random` for crypto, hardcoded `key=`, `Crypto.Cipher` without AEAD |
300
+
301
+ ## Secrets Exposure
302
+
303
+ | Aspect | Detail |
304
+ |--------|--------|
305
+ | **Vulnerable** | Hardcoded API keys, passwords, tokens in source code; secrets in logs; secrets in client-side code |
306
+ | **Safe** | Environment variables; secret manager (AWS Secrets Manager, HashiCorp Vault); `.env` excluded from VCS |
307
+ | **Look for** | `API_KEY=`, `password=`, `secret=`, `token=` in code; AWS keys, GitHub tokens, Stripe keys, JWTs in source |
308
+ | **False-positive guard** | Secrets stored on disk but otherwise secured ARE excluded. Logging high-value secrets IS a vuln. Logging URLs is safe. |
309
+
310
+ ## Template Injection (SSTI)
311
+
312
+ | Aspect | Detail |
313
+ |--------|--------|
314
+ | **Vulnerable** | User input in template rendering: `Template(user_input).render()`, `render_template_string(user_input)` |
315
+ | **Safe** | Static templates; input passed as context variable, not template string |
316
+ | **Look for** | `render_template_string`, `Template()()` with user string; `eval` in template context; `${user_input}` in JS template literals on server |
317
+
318
+ ## NoSQL Injection
319
+
320
+ | Aspect | Detail |
321
+ |--------|--------|
322
+ | **Vulnerable** | User input in MongoDB queries: `db.users.find({username: user_input})` where input is `{"$gt": ""}` |
323
+ | **Safe** | Schema validation; type checking on query params; ORM sanitization |
324
+ | **Look for** | MongoDB `$where`, `$gt`, `$regex` from user input; raw mongo queries without type coercion |
@@ -22,6 +22,94 @@ Can be run standalone for quick health checks or chained as the final step of th
22
22
 
23
23
  Smoke checks are defined in `smoke-checks.yaml` at the project root:
24
24
 
25
+ See [REFERENCE.md](REFERENCE.md)
26
+
27
+ Checks can also be specified inline via environment variables or CLI arguments for ad-hoc use.
28
+
29
+ ### Check Schema
30
+
31
+ | Field | Required | Default | Description |
32
+ |-------|----------|---------|-------------|
33
+ | `name` | Yes | — | Human-readable check name (used in report) |
34
+ | `path` | Yes | `/` | URL path relative to base_url |
35
+ | `method` | No | `GET` | HTTP method |
36
+ | `expected_status` | No | `200` | Expected HTTP status code |
37
+ | `content_signal` | No | — | Regex or string to find in response body |
38
+ | `max_response_time_ms` | No | — | Fail if response slower than this threshold (ms) |
39
+
40
+ ## Process
41
+
42
+ ### 1. Load smoke checks
43
+
44
+ See [REFERENCE.md](REFERENCE.md)
45
+
46
+ ### 2. Run each check
47
+
48
+ For each check in the configuration, perform an HTTP request:
49
+
50
+ See [REFERENCE.md](REFERENCE.md)
51
+
52
+ ### 3. Assert results
53
+
54
+ See [REFERENCE.md](REFERENCE.md)
55
+
56
+ ### 4. Generate report
57
+
58
+ See [REFERENCE.md](REFERENCE.md)
59
+
60
+ ## Integration with deploy skill
61
+
62
+ The `deploy` skill references `smoke-test` as its final verification step:
63
+
64
+ ```bash
65
+ # In deploy workflow — after successful deploy
66
+ DEPLOY_URL="$DEPLOY_URL" bash scripts/run-smoke.sh
67
+ ```
68
+
69
+ ---
70
+
71
+ # Smoke Test — Reference
72
+
73
+ ## Runner script
74
+
75
+ A ready-to-use runner is provided for standalone operation:
76
+
77
+ ```bash
78
+ bash scripts/run-smoke.sh [url] [smoke-checks-file]
79
+ ```
80
+
81
+ The runner:
82
+ 1. Uses `$DEPLOY_URL`, `$SMOKE_CHECKS_FILE`, or CLI arguments
83
+ 2. Runs all defined checks
84
+ 3. Prints a pass/fail summary
85
+ 4. Exits 0 on all pass, non-zero on any failure
86
+
87
+
88
+ ---
89
+
90
+ ## Configuration reference
91
+
92
+ | Variable | Default | Description |
93
+ |----------|---------|-------------|
94
+ | `SMOKE_CHECKS_FILE` | `smoke-checks.yaml` | Path to smoke checks YAML |
95
+ | `DEPLOY_URL` / `BASE_URL` | *(required)* | Base URL for all checks |
96
+ | `SMOKE_TIMEOUT` | `30` | Per-check timeout (seconds) |
97
+ | `SMOKE_RETRIES` | `0` | Number of retries on failure |
98
+
99
+
100
+ ---
101
+
102
+ ## Verification
103
+
104
+ → verify: `test -f smoke-test/SKILL.md && grep -q 'name: smoke-test' smoke-test/SKILL.md && echo OK`
105
+ → verify: `grep -qi 'smoke.checks.yaml\|checklist\|expected_status\|content_signal' smoke-test/SKILL.md && echo OK`
106
+ → verify: `grep -ci 'pass\|fail\|summary\|report' smoke-test/SKILL.md | awk '{if($1>=2) print "OK"; else print "FAIL"}'`
107
+ → verify: `grep -q 'smoke-test' SKILL-INDEX.md && echo OK`
108
+
109
+ ---
110
+
111
+ ## Reference block 1
112
+
25
113
  ```yaml
26
114
  # smoke-checks.yaml — auto-loaded if present at project root
27
115
  base_url: "https://example.com"
@@ -52,22 +140,9 @@ checks:
52
140
  content_signal: "not found|404"
53
141
  ```
54
142
 
55
- Checks can also be specified inline via environment variables or CLI arguments for ad-hoc use.
56
-
57
- ### Check Schema
58
-
59
- | Field | Required | Default | Description |
60
- |-------|----------|---------|-------------|
61
- | `name` | Yes | — | Human-readable check name (used in report) |
62
- | `path` | Yes | `/` | URL path relative to base_url |
63
- | `method` | No | `GET` | HTTP method |
64
- | `expected_status` | No | `200` | Expected HTTP status code |
65
- | `content_signal` | No | — | Regex or string to find in response body |
66
- | `max_response_time_ms` | No | — | Fail if response slower than this threshold (ms) |
67
-
68
- ## Process
143
+ ---
69
144
 
70
- ### 1. Load smoke checks
145
+ ## Reference block 2
71
146
 
72
147
  ```bash
73
148
  SMOKE_CHECKS_FILE="${SMOKE_CHECKS_FILE:-smoke-checks.yaml}"
@@ -83,9 +158,9 @@ else
83
158
  fi
84
159
  ```
85
160
 
86
- ### 2. Run each check
161
+ ---
87
162
 
88
- For each check in the configuration, perform an HTTP request:
163
+ ## Reference block 3
89
164
 
90
165
  ```bash
91
166
  url="${BASE_URL}${path}"
@@ -98,7 +173,9 @@ status=$response
98
173
  body=$(cat /tmp/smoke_body.txt)
99
174
  ```
100
175
 
101
- ### 3. Assert results
176
+ ---
177
+
178
+ ## Reference block 4
102
179
 
103
180
  ```bash
104
181
  checks_passed=0
@@ -133,7 +210,9 @@ if [ -n "$max_response_time_ms" ] && [ "$response_time" -gt "$max_response_time_
133
210
  fi
134
211
  ```
135
212
 
136
- ### 4. Generate report
213
+ ---
214
+
215
+ ## Reference block 5
137
216
 
138
217
  ```bash
139
218
  total=$((checks_passed + checks_failed))
@@ -151,42 +230,3 @@ else
151
230
  exit 0
152
231
  fi
153
232
  ```
154
-
155
- ## Runner script
156
-
157
- A ready-to-use runner is provided for standalone operation:
158
-
159
- ```bash
160
- bash scripts/run-smoke.sh [url] [smoke-checks-file]
161
- ```
162
-
163
- The runner:
164
- 1. Uses `$DEPLOY_URL`, `$SMOKE_CHECKS_FILE`, or CLI arguments
165
- 2. Runs all defined checks
166
- 3. Prints a pass/fail summary
167
- 4. Exits 0 on all pass, non-zero on any failure
168
-
169
- ## Integration with deploy skill
170
-
171
- The `deploy` skill references `smoke-test` as its final verification step:
172
-
173
- ```bash
174
- # In deploy workflow — after successful deploy
175
- DEPLOY_URL="$DEPLOY_URL" bash scripts/run-smoke.sh
176
- ```
177
-
178
- ## Configuration reference
179
-
180
- | Variable | Default | Description |
181
- |----------|---------|-------------|
182
- | `SMOKE_CHECKS_FILE` | `smoke-checks.yaml` | Path to smoke checks YAML |
183
- | `DEPLOY_URL` / `BASE_URL` | *(required)* | Base URL for all checks |
184
- | `SMOKE_TIMEOUT` | `30` | Per-check timeout (seconds) |
185
- | `SMOKE_RETRIES` | `0` | Number of retries on failure |
186
-
187
- ## Verification
188
-
189
- → verify: `test -f smoke-test/SKILL.md && grep -q 'name: smoke-test' smoke-test/SKILL.md && echo OK`
190
- → verify: `grep -qi 'smoke.checks.yaml\|checklist\|expected_status\|content_signal' smoke-test/SKILL.md && echo OK`
191
- → verify: `grep -ci 'pass\|fail\|summary\|report' smoke-test/SKILL.md | awk '{if($1>=2) print "OK"; else print "FAIL"}'`
192
- → verify: `grep -q 'smoke-test' SKILL-INDEX.md && echo OK`
@@ -9,7 +9,7 @@ model: sonnet
9
9
  > **HARD GATE** — **HARD GATE** — This skill is the entry point. Do NOT skip it when onboarding new users or starting a new session. It establishes the bigpowers methodology, lifecycle phases, and conventions.
10
10
 
11
11
 
12
- Welcome to **bigpowers** — a lifecycle of **61** agent skills for production-ready, TDD-driven software by solo developers.
12
+ Welcome to **bigpowers** — a lifecycle of **70** agent skills for production-ready, TDD-driven software by solo developers.
13
13
 
14
14
  ## Install
15
15
 
@@ -100,7 +100,7 @@ Start the HTTP dashboard with `visual-dashboard` → `GET /api/status?projectDir
100
100
  - **Integrate:** team default is `gh pr` (team-pr); solo profile uses `land-branch.sh`. Never create GitHub issues from skills — use local Markdown files instead.
101
101
  - **One skill, one thing.** If you're unsure which skill to call, call `survey-context` — it reads your current state and recommends the next step.
102
102
  - **verify: every step.** Every epic task must have `verify: <runnable command>`. Evidence over claims.
103
- - **61 skills.** See `SKILL-INDEX.md`; find skills with `search-skills`.
103
+ - **70 skills.** See `SKILL-INDEX.md`; find skills with `search-skills`.
104
104
 
105
105
  ## After this
106
106