@event4u/agent-config 2.7.0 → 2.9.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (76) hide show
  1. package/.agent-src/personas/cmo.md +122 -0
  2. package/.agent-src/personas/customer-success-lead.md +126 -0
  3. package/.agent-src/personas/engineering-manager.md +133 -0
  4. package/.agent-src/personas/finance-partner.md +129 -0
  5. package/.agent-src/personas/growth-pm.md +134 -0
  6. package/.agent-src/personas/people-strategist.md +126 -0
  7. package/.agent-src/personas/revops.md +125 -0
  8. package/.agent-src/personas/strategist.md +129 -0
  9. package/.agent-src/skills/activation-design/SKILL.md +160 -0
  10. package/.agent-src/skills/build-buy-partner/SKILL.md +145 -0
  11. package/.agent-src/skills/churn-prevention/SKILL.md +156 -0
  12. package/.agent-src/skills/comp-banding/SKILL.md +160 -0
  13. package/.agent-src/skills/competitive-moat-analysis/SKILL.md +152 -0
  14. package/.agent-src/skills/content-funnel-design/SKILL.md +170 -0
  15. package/.agent-src/skills/contracts-cognition/SKILL.md +147 -0
  16. package/.agent-src/skills/data-handling-judgment/SKILL.md +155 -0
  17. package/.agent-src/skills/deal-qualification-meddic/SKILL.md +165 -0
  18. package/.agent-src/skills/editorial-calendar/SKILL.md +161 -0
  19. package/.agent-src/skills/expansion-playbook/SKILL.md +171 -0
  20. package/.agent-src/skills/forecast-accuracy/SKILL.md +157 -0
  21. package/.agent-src/skills/forecasting/SKILL.md +164 -0
  22. package/.agent-src/skills/fundraising-narrative/SKILL.md +189 -0
  23. package/.agent-src/skills/funnel-analysis/SKILL.md +26 -2
  24. package/.agent-src/skills/gtm-launch/SKILL.md +165 -0
  25. package/.agent-src/skills/hiring-loop-design/SKILL.md +167 -0
  26. package/.agent-src/skills/market-entry-analysis/SKILL.md +144 -0
  27. package/.agent-src/skills/messaging-architecture/SKILL.md +184 -0
  28. package/.agent-src/skills/onboarding-design/SKILL.md +158 -0
  29. package/.agent-src/skills/onboarding-program/SKILL.md +157 -0
  30. package/.agent-src/skills/one-on-one-cadence/SKILL.md +161 -0
  31. package/.agent-src/skills/org-design/SKILL.md +158 -0
  32. package/.agent-src/skills/perf-feedback-craft/SKILL.md +157 -0
  33. package/.agent-src/skills/pipeline-strategy/SKILL.md +159 -0
  34. package/.agent-src/skills/positioning-strategy/SKILL.md +177 -0
  35. package/.agent-src/skills/privacy-review/SKILL.md +160 -0
  36. package/.agent-src/skills/retention-loops/SKILL.md +161 -0
  37. package/.agent-src/skills/runway-cognition/SKILL.md +136 -0
  38. package/.agent-src/skills/scenario-modeling/SKILL.md +139 -0
  39. package/.agent-src/skills/subagent-orchestration/SKILL.md +1 -1
  40. package/.agent-src/skills/throughput-vs-morale-tradeoff/SKILL.md +165 -0
  41. package/.agent-src/skills/unit-economics-modeling/SKILL.md +54 -7
  42. package/.agent-src/skills/vision-articulation/SKILL.md +146 -0
  43. package/.agent-src/skills/voice-and-tone-design/SKILL.md +163 -0
  44. package/.agent-src/templates/agents/agent-project-settings.example.yml +1 -1
  45. package/.agent-src/templates/scripts/telemetry/settings.py +65 -0
  46. package/.agent-src/templates/scripts/tier_usage_report.py +183 -0
  47. package/.claude-plugin/marketplace.json +34 -2
  48. package/AGENTS.md +1 -1
  49. package/CHANGELOG.md +135 -153
  50. package/README.md +3 -3
  51. package/docs/architecture.md +37 -11
  52. package/docs/archive/CHANGELOG-pre-2.7.0.md +185 -0
  53. package/docs/catalog.md +38 -4
  54. package/docs/contracts/adr-forecast-construction-shape.md +89 -0
  55. package/docs/contracts/adr-gtm-context-spine.md +115 -0
  56. package/docs/contracts/adr-wing4-context-spine.md +125 -0
  57. package/docs/contracts/command-clusters.md +41 -0
  58. package/docs/contracts/command-surface-tiers.md +30 -9
  59. package/docs/contracts/context-spine.md +58 -12
  60. package/docs/contracts/cross-wing-handoff.md +3 -3
  61. package/docs/contracts/mcp-beta-criteria.md +129 -0
  62. package/docs/contracts/persona-schema.md +20 -3
  63. package/docs/guidelines/gtm-handoff.md +114 -0
  64. package/docs/guidelines/wing4-handoff.md +127 -0
  65. package/docs/mcp-server.md +1 -1
  66. package/package.json +1 -1
  67. package/scripts/_cli/cmd_doctor.py +527 -14
  68. package/scripts/_cli/cmd_validate.py +10 -0
  69. package/scripts/agent-config +19 -18
  70. package/scripts/install.py +5 -0
  71. package/scripts/lint_context_spine_usage.py +5 -1
  72. package/scripts/mcp_server/__init__.py +1 -0
  73. package/scripts/mcp_server/server.py +4 -3
  74. package/scripts/schemas/persona.schema.json +5 -0
  75. package/scripts/schemas/skill.schema.json +2 -2
  76. package/scripts/skill_linter.py +284 -6
@@ -0,0 +1,189 @@
1
+ ---
2
+ name: fundraising-narrative
3
+ description: "Use when shaping a capital-raise pitch — why-now / why-us / why-this framing, market-size reasoning, traction-story construction. Triggers on 'tighten the pitch', 'why-now is weak'."
4
+ status: active
5
+ tier: senior
6
+ source: package
7
+ domain: product
8
+ context_spine: [product, customer-segment]
9
+ ---
10
+
11
+ # fundraising-narrative
12
+
13
+ ## When to use
14
+
15
+ - A founder is preparing a capital-raise pitch and the why-now is borrowed from a deck template instead of earned from the segment-shift the team actually rides.
16
+ - A deck is landing as "interesting but not now" with investors and the team needs to diagnose whether the gap is *why-now*, *why-us*, or *why-this*.
17
+ - A traction story is being built from screenshots instead of from a coherent leading-indicator arc that explains why the next stage is reachable.
18
+
19
+ Do NOT use to manage the investor-CRM pipeline (out of scope), run
20
+ the data-room (out of scope), or draft the internal vision anchor
21
+ the org rallies behind (route to Wing-4 `vision-articulation` — the
22
+ external pitch under capital constraint and the internal anchor
23
+ are siblings, not the same artefact).
24
+
25
+ ## Cognition cluster
26
+
27
+ - **Mental model 1 — First-principles thinking.** *Why-now* is
28
+ the load-bearing claim and the one most often borrowed. Build
29
+ it from the segment-shift up — what changed in the world, the
30
+ customer, the technology — not from a deck template. See
31
+ [`docs/contracts/mental-models.md`](../../../docs/contracts/mental-models.md) § 1.
32
+ - **Mental model 9 — Hypothesis-driven development.** A pitch is
33
+ a falsifiable hypothesis: *if X is true, our round closes.*
34
+ Name the X. Investors who disagree with the hypothesis are not
35
+ rejecting taste — they are rejecting the falsifiable claim. See
36
+ `mental-models.md` § 9.
37
+ - **Mental model 16 — Leading vs. lagging indicators.** Revenue
38
+ is lagging; activation, retention curve, and qualified-pipeline
39
+ velocity are leading. The traction story leads with the leading
40
+ signals; revenue is the receipt, not the argument. See
41
+ `mental-models.md` § 16.
42
+ - **Mental model 30 — Inversion.** Run the round-failure
43
+ premortem before the deck locks: *which investor heard what we
44
+ did not say.* Inversion surfaces the claim the deck assumes the
45
+ room already shares and probably does not. See
46
+ `mental-models.md` § 30.
47
+ - **Context-spine — product + customer-segment.** Read **product**
48
+ for the proofs the traction story can actually back; read
49
+ **customer-segment** for the TAM/SAM argument that survives a
50
+ bottom-up scrutiny. See
51
+ [`context-spine`](../../../docs/contracts/context-spine.md).
52
+
53
+ ## Procedure
54
+
55
+ ### Step 0: Inherit the positioning frame and vision anchor
56
+
57
+ Identify the locked positioning anchors from
58
+ [`positioning-strategy`](../positioning-strategy/SKILL.md) and the internal vision
59
+ anchor from `vision-articulation` if it exists. The fundraising
60
+ narrative is the *external pitch under capital constraint*; it
61
+ inherits the internal frame, it does not re-invent it. A pitch
62
+ that contradicts the internal anchor will fracture on the first
63
+ hire after the round closes.
64
+
65
+ ### Step 1: Analyze the inherited why-now
66
+
67
+ Read the current why-now claim. Three checks: *is this a market
68
+ shift, a customer shift, or a technology shift?* *Did the shift
69
+ happen in the last 24 months?* *Would the segment recognise the
70
+ shift without prompting?* A why-now that fails two of three is
71
+ template-borrowed. Name what the inherited deck is leaning on.
72
+
73
+ ### Step 2: Build why-now from first principles
74
+
75
+ Strip the inherited claim. Rebuild from the segment-shift up:
76
+
77
+ - **Market shift.** What changed in the buyer's environment that
78
+ was not true 24 months ago? (Regulation, budget cycle, channel
79
+ collapse, competitive exit.)
80
+ - **Customer shift.** What changed in how the ICP measures the
81
+ problem? (New KPI, new buying committee, new procurement gate.)
82
+ - **Technology shift.** What is feasible now that was not? (Cost
83
+ curve, model capability, infrastructure unlock.)
84
+
85
+ Pick the *one* shift the segment would name without prompting.
86
+ That is the why-now spine. The others are supporting context.
87
+
88
+ ### Step 3: Construct why-us under capital constraint
89
+
90
+ Why-us is *unfair advantage under the next 18 months of capital*,
91
+ not credentials. Three anchors:
92
+
93
+ - **Earned access.** The audience the team can already reach that
94
+ the next funded peer cannot.
95
+ - **Earned proof.** The reference customer or load-bearing
96
+ retention curve the team owns now.
97
+ - **Capital fit.** What the round buys that competitors cannot buy
98
+ in the same window. *"More engineers"* is not capital fit;
99
+ *"distribution lead-time the round protects"* is.
100
+
101
+ ### Step 4: Build the traction story from leading indicators
102
+
103
+ Order the traction story leading-first:
104
+
105
+ 1. **Activation curve.** Time-to-first-value trend across cohorts.
106
+ 2. **Retention curve.** Cohort retention at the load-bearing
107
+ milestone, ideally non-trivial — not week 1.
108
+ 3. **Pipeline velocity.** Qualified-pipeline movement, not raw
109
+ pipeline volume.
110
+ 4. **Revenue.** The receipt, last in the sequence — not first.
111
+
112
+ A traction story that opens on revenue assumes the room already
113
+ believes the leading signals; the deck must earn that belief.
114
+
115
+ ### Step 5: Validate against the round-failure premortem
116
+
117
+ Validate the narrative on three checks:
118
+
119
+ 1. **Premortem coverage.** Run *"the round did not close because…"*
120
+ with five failure modes. Verify the deck explicitly neutralises
121
+ the top three or accepts-with-mitigation; unnamed failure modes
122
+ are silent rejection routes.
123
+ 2. **Falsifiable hypothesis.** Confirm the pitch is the form
124
+ *"if X, then our round closes."* A pitch that cannot be
125
+ disagreed with is also a pitch that cannot be agreed with.
126
+ 3. **Internal-external consistency.** Diff the external pitch
127
+ against the internal vision anchor. Contradictions kill the
128
+ first post-round hire round; name them now.
129
+
130
+ ### Step 6: Hand back
131
+
132
+ Hand the artefacts to the founder for delivery, to
133
+ `messaging-architecture` for the post-round message-stack refresh
134
+ (why-now often shifts the primary message), and to
135
+ `vision-articulation` (Wing-4) for the internal-anchor diff if
136
+ contradictions surfaced.
137
+
138
+ ## Related Skills
139
+
140
+ **WHEN to use this**
141
+
142
+ - The unit of work is the why-now / why-us / why-this triad under capital constraint, not a single deck slide.
143
+ - A diagnosed pitch gap needs a structured rebuild, not slide-polish.
144
+ - The traction story is being built screenshot-first; reorder it leading-first.
145
+
146
+ **WHEN NOT to use this**
147
+
148
+ - Internal vision-anchor authoring for org alignment — route to Wing-4 `vision-articulation`.
149
+ - Message-stack work post-round — route to [`messaging-architecture`](../messaging-architecture/SKILL.md).
150
+ - Positioning the category and segment — route to [`positioning-strategy`](../positioning-strategy/SKILL.md) first.
151
+ - Investor-CRM pipeline or data-room operations — out of scope.
152
+
153
+ ## When the agent should load this
154
+
155
+ - "Tighten the why-now for the seed round."
156
+ - "Bau mir die Traction-Story für den Pitch."
157
+ - "Investors keep saying 'interesting but not now' — diagnose."
158
+ - "Run the round-failure premortem on the deck."
159
+ - "Why-us reads as a credentials list — rebuild under capital constraint."
160
+
161
+ ## Output
162
+
163
+ 1. **`why-now-spine.md`** — the one market / customer / technology shift the segment names without prompting, with the 24-month evidence trail.
164
+ 2. **`why-us-anchors.md`** — earned-access · earned-proof · capital-fit, each with a load-bearing artefact citation.
165
+ 3. **`traction-arc.md`** — activation → retention → pipeline-velocity → revenue, leading-first ordering with the leading-indicator threshold per step.
166
+ 4. **`round-failure-premortem.md`** — five failure modes with neutraliser-or-accept verdict, internal-external consistency diff appended.
167
+
168
+ ## Gotcha
169
+
170
+ - Why-now is the most-borrowed claim in pitches because it is the hardest to earn from first principles — the room can tell.
171
+ - Capital-fit collapses to *"hire more"* when the team has not thought through what the round protects from competitors; protect-language is the discipline.
172
+ - Internal-external contradictions read as charm in the room and as betrayal at the post-round all-hands.
173
+
174
+ ## Do NOT
175
+
176
+ - Do NOT carry the internal vision anchor verbatim into the pitch — internal anchor is rally; external pitch is hypothesis under capital constraint.
177
+ - Do NOT lead the traction story with revenue when the leading signals are the actual argument.
178
+ - Do NOT make the why-now a template-shaped *"AI changes everything"* — the segment will know.
179
+ - Do NOT manage CRM or data-room operations from this skill; out of scope.
180
+
181
+ ## Runnable example
182
+
183
+ Mid-market HR analytics tool raising Series A, positioning locked (retention beats acquisition):
184
+
185
+ - Why-now spine — *customer shift*: HR directors now own a board-quarter retention KPI (was true on 30 % of ICP boards 24 months ago, now 70 %; verified via 14 ICP board-decks reviewed).
186
+ - Why-us anchors — *earned access*: 200-strong HR-director community already engaged. *Earned proof*: cohort-retention curve at week-12 holding at 78 % across 9 design-partner cohorts. *Capital-fit*: round protects 18 months of distribution lead-time before two funded peers reach the same segment.
187
+ - Traction arc — activation (time-to-first-cohort-roll-up: 14 → 6 days across last 4 cohorts) → retention (78 % week-12 cohort) → pipeline-velocity (qualified-pipeline movement at 2.4× quarter-on-quarter) → revenue (the receipt).
188
+ - Round-failure premortem — top three failure modes neutralised in deck; one accepted-with-mitigation (we are pre-revenue at enterprise tier — mitigated by 3 named pilot LOIs).
189
+ - Hand-off → founder for delivery; `messaging-architecture` queued for post-round refresh.
@@ -5,6 +5,7 @@ status: active
5
5
  tier: senior
6
6
  source: package
7
7
  domain: product
8
+ context_spine: [product, customer-segment, funnel-stage]
8
9
  ---
9
10
 
10
11
  # funnel-analysis
@@ -17,6 +18,29 @@ domain: product
17
18
 
18
19
  Do NOT use for ranking features, valuation, or OKR decomposition (see Related Skills). Funnel analysis is a **diagnostic**, not a roadmap.
19
20
 
21
+ ## Cognition cluster
22
+
23
+ - **Mental model 16 — Leading vs. lagging indicators.** Paid is
24
+ lagging; activation is leading; signup is upstream of both. A
25
+ funnel decision built on the lagging stage can only confirm the
26
+ miss; the leading stage names the binding fix. See
27
+ [`docs/contracts/mental-models.md`](../../../docs/contracts/mental-models.md) § 16.
28
+ - **Mental model 13 — Occam's razor.** When a stage drops, the
29
+ simpler explanation usually wins: *"acquisition mix shifted"*
30
+ beats *"users no longer understand the product."* Pick the simpler
31
+ cause; it changes the move. See `mental-models.md` § 13.
32
+ - **Mental model 3 — Pareto (80/20).** Drops are almost never
33
+ uniform across segments; ~20 % of the segment × stage cells carry
34
+ ~80 % of the loss. Segment before treating the average as
35
+ actionable. See `mental-models.md` § 3.
36
+ - **Context-spine — product + customer-segment + funnel-stage.**
37
+ Read the **product** slot for what activation can actually mean
38
+ in-product (the activation event must be shippable), the
39
+ **customer-segment** slot for which segments' switch-events the
40
+ funnel is built for, and the **funnel-stage** slot for the
41
+ position of each stage relative to the buying journey. See
42
+ [`context-spine`](../../../docs/contracts/context-spine.md).
43
+
20
44
  ## Procedure
21
45
 
22
46
  ### Step 0: Inspect
@@ -40,7 +64,7 @@ Do NOT use for ranking features, valuation, or OKR decomposition (see Related Sk
40
64
 
41
65
  1. The right benchmark is **your own funnel one quarter ago**, not industry averages. Industry averages mix verticals so coarsely they're useless for action.
42
66
  2. For each stage: is current rate within ±2 percentage points of trailing-quarter median? If not, that stage is the primary suspect.
43
- 3. If multiple stages drift simultaneously, the cause is upstream (acquisition mix change, broken instrumentation), not the stage itself.
67
+ 3. If multiple stages move off-band simultaneously, the cause is upstream (acquisition mix change, broken instrumentation), not the stage itself.
44
68
 
45
69
  ### Step 4: Segment the broken stage
46
70
 
@@ -98,4 +122,4 @@ Do NOT use for ranking features, valuation, or OKR decomposition (see Related Sk
98
122
 
99
123
  1. **`funnel-table.md`** — 5-stage funnel with cohort rates, 95% CI, and 12-week trend (sparkline or compact ASCII). One row per cohort week or month.
100
124
  2. **`segment-breakdown.md`** — table of the broken stage segmented by channel · device · plan · geo. Rates with CIs. Suspect segments highlighted.
101
- 3. **`hypothesis-list.md`** — top 3 for the broken segment-stage with cheapest-falsification experiment per cause and an explicit prediction for the next measurement.
125
+ 3. **`hypothesis-list.md`** — top 3 causes for the broken segment-stage with cheapest-falsification experiment per cause and an explicit prediction for the next measurement.
@@ -0,0 +1,165 @@
1
+ ---
2
+ name: gtm-launch
3
+ description: "Use when sequencing a launch — alpha / beta / GA waves, audience-by-wave logic, narrative beats per wave, engineering-readiness gates. Triggers on 'plan the launch', 'sequence GA'."
4
+ status: active
5
+ tier: senior
6
+ source: package
7
+ domain: product
8
+ context_spine: [product, customer-segment, channel-stage]
9
+ ---
10
+
11
+ # gtm-launch
12
+
13
+ ## When to use
14
+
15
+ - A product, feature, or major capability is approaching ship-readiness and the team needs a wave plan (alpha → beta → GA) keyed to audience and proof, not a date on a calendar.
16
+ - A launch is being planned date-first; the team needs to invert and plan readiness-first so an unmet gate stops a wave instead of leaking past it.
17
+ - A previous launch landed soft and the retro names "no audience-by-wave logic" or "narrative beats unclear per wave" as the cause.
18
+
19
+ Do NOT use to write announcement copy (route to `release-comms`),
20
+ lock the message stack (route to `messaging-architecture`), or plan
21
+ post-launch retention loops (route to `retention-loops`).
22
+
23
+ ## Cognition cluster
24
+
25
+ - **Mental model 10 — Reversible vs. irreversible decisions.** A GA
26
+ wave is largely irreversible: rolling back narrative and audience
27
+ expectations after public launch costs more than re-shipping the
28
+ product. Alpha and beta are reversible; treat them as the
29
+ decision-quality buffer. See
30
+ [`docs/contracts/mental-models.md`](../../../docs/contracts/mental-models.md) § 10.
31
+ - **Mental model 29 — Premortem.** Before the wave plan locks, write
32
+ the post-mortem of the launch as if it failed. The premortem
33
+ surfaces the gates that need to hold; the wave plan is the inverse
34
+ of that list. See `mental-models.md` § 29.
35
+ - **Mental model 16 — Leading vs. lagging indicators.** Engineering-
36
+ readiness signals (error rate, latency, support-load) are leading;
37
+ pipeline lift is lagging. A wave plan that gates on lagging signals
38
+ ships into a soft floor. See `mental-models.md` § 16.
39
+ - **Context-spine — product + customer-segment + channel-stage.**
40
+ Read the **product** slot for shippable scope, the
41
+ **customer-segment** slot for who hears the launch on which wave,
42
+ and the **channel-stage** slot for where each wave's audience lives
43
+ in the awareness → decision arc. See
44
+ [`context-spine`](../../../docs/contracts/context-spine.md).
45
+
46
+ ## Procedure
47
+
48
+ ### Step 0: Inherit the message stack
49
+
50
+ Identify the locked `primary-message.md`, `supporting-proofs.md`, and
51
+ `audience-matrix.md` from [`messaging-architecture`](../messaging-architecture/SKILL.md).
52
+ If the stack is missing or unstable, stop and route back. A launch
53
+ plan without a locked message stack ships three different stories at
54
+ three different surfaces.
55
+
56
+ ### Step 1: Run the premortem
57
+
58
+ Write the launch post-mortem **as if it has already failed**. Three
59
+ prompts: *"what did the segment hear that we did not say,"* *"what
60
+ broke in the first 48 hours,"* *"what did the alternative say first
61
+ and louder."* The premortem produces the failure-mode list the wave
62
+ plan must neutralise.
63
+
64
+ ### Step 2: Define the gates per wave
65
+
66
+ For each wave (alpha · beta · GA), define **entry gates** and **exit
67
+ gates**:
68
+
69
+ - *Alpha entry:* engineering-readiness signal threshold (error rate,
70
+ latency, instrumentation coverage). *Exit:* < N support tickets
71
+ per 100 sessions on the load-bearing flow.
72
+ - *Beta entry:* alpha exit + audience-matrix proof exists for the
73
+ beta audience. *Exit:* leading indicator (activation, time-to-
74
+ first-value) clears threshold per `mental-models.md § 16`.
75
+ - *GA entry:* beta exit + narrative beats locked for the public
76
+ segment. *Exit:* not applicable — GA is irreversible; the next
77
+ wave is *post-launch retention*, handed to `retention-loops`.
78
+
79
+ ### Step 3: Sequence the audience waves
80
+
81
+ Audience waves are not seniority waves. They are **proof waves**.
82
+ Each wave's audience is whichever segment generates the proof the
83
+ *next* wave needs. Order:
84
+
85
+ 1. *Alpha audience* — the segment where the team can sit next to
86
+ the user. Proof: load-bearing flow does not break under real use.
87
+ 2. *Beta audience* — the segment whose adoption is the credibility
88
+ anchor for GA. Proof: a quotable reference and an activation
89
+ curve.
90
+ 3. *GA audience* — the full ICP segment from the `customer-segment`
91
+ slot. Proof: pipeline lift, narrative pickup, retention curve.
92
+
93
+ ### Step 4: Assign narrative beats per wave
94
+
95
+ Each wave gets a narrative beat — the **one** thing the audience
96
+ remembers. Alpha beat = trust signal (we are not winging it). Beta
97
+ beat = proof signal (it works for someone like you). GA beat = the
98
+ primary message from `messaging-architecture` Step 1. Beats stack;
99
+ they do not contradict.
100
+
101
+ ### Step 5: Validate the plan against the premortem
102
+
103
+ Validate each premortem failure mode against the wave plan: verify a
104
+ specific gate or beat neutralises it. Any failure mode without an
105
+ explicit neutraliser is a known leak — name it, do not bury it.
106
+ Validation passes only when every premortem item is either
107
+ neutralised or accepted-with-mitigation.
108
+
109
+ ### Step 6: Hand back
110
+
111
+ Hand the artefacts to [`release-comms`](../release-comms/SKILL.md)
112
+ for announcement-surface drafting, to
113
+ [`editorial-calendar`](../editorial-calendar/SKILL.md) for cadence
114
+ mapping, and to [`launch-readiness`](../launch-readiness/SKILL.md)
115
+ for the merge-day checklist.
116
+
117
+ ## Related Skills
118
+
119
+ **WHEN to use this**
120
+
121
+ - The unit of work is the wave plan (alpha · beta · GA) with gates and beats, not a single announcement.
122
+ - A launch needs readiness-gated sequencing instead of calendar-driven sequencing.
123
+ - The team can name the message stack but not which audience hears which beat in which wave.
124
+
125
+ **WHEN NOT to use this**
126
+
127
+ - Writing the announcement copy or press surface — route to [`release-comms`](../release-comms/SKILL.md).
128
+ - Locking the primary message and proofs — route to [`messaging-architecture`](../messaging-architecture/SKILL.md).
129
+ - Pre-merge ops checklist (rollout, rollback, monitoring) — route to [`launch-readiness`](../launch-readiness/SKILL.md).
130
+ - Post-launch retention design — route to [`retention-loops`](../retention-loops/SKILL.md).
131
+
132
+ ## When the agent should load this
133
+
134
+ - "Plan the launch waves for the new pricing tier."
135
+ - "Wir starten den GA — gib mir die Alpha-Beta-GA Sequenz."
136
+ - "What are the entry gates for the beta wave?"
137
+ - "Premortem the launch and rebuild the wave plan from the failure list."
138
+ - "Sequence the audience waves around the proof we still need."
139
+
140
+ ## Output
141
+
142
+ 1. **`launch-premortem.md`** — three failure modes per prompt, ranked by carrying cost, each tagged with the wave that owns the neutraliser.
143
+ 2. **`wave-plan.md`** — three waves (alpha · beta · GA) with entry / exit gates, audience, leading-indicator threshold per wave.
144
+ 3. **`narrative-beats.md`** — one beat per wave (trust → proof → primary-message), with the line the team will not contradict on any surface during that wave.
145
+
146
+ ## Gotcha
147
+
148
+ - Calendar-driven launches confuse a date with a gate. A date does not signal readiness; a gate does. The wave plan must hold even if the date slips two weeks.
149
+ - "Friends-and-family alpha" is alpha-shaped theatre — it produces the wrong proof for the next wave. Recruit an alpha audience that exposes the load-bearing flow.
150
+ - A premortem that produces only three failure modes was rushed; push for ten and keep the load-bearing three.
151
+
152
+ ## Do NOT
153
+
154
+ - Do NOT write the announcement copy here — copy lives in `release-comms` downstream of locked beats.
155
+ - Do NOT collapse alpha and beta to save calendar time — alpha and beta produce different proofs.
156
+ - Do NOT lock GA without an explicit retention-loops handoff; an unowned post-launch fortnight is where most launches soften.
157
+
158
+ ## Runnable example
159
+
160
+ Mid-market HR analytics tool launching workforce-analytics layer:
161
+
162
+ - Premortem: (a) CFOs hear "another tool" not "retention saving"; (b) HRIS plug-in misconfigured under load; (c) reference customer quote not contractually approved by GA.
163
+ - Wave plan — *Alpha:* 3 design-partner HR directors, gate = HRIS plug-in error-rate < 1 % under load. *Beta:* 10 HR leaders matching ICP, gate = activation curve hits 5 cohort-roll-ups per week. *GA:* full ICP segment, gate = quoted reference contractually approved.
164
+ - Narrative beats — *Alpha:* "we sat next to you while it worked." *Beta:* "an HR director like you saved 7 hours last board-quarter." *GA:* primary message from `messaging-architecture` Step 1.
165
+ - Hand-off → `release-comms` drafts the GA-wave surface; `retention-loops` owns the 30-day post-GA cohort.
@@ -0,0 +1,167 @@
1
+ ---
2
+ name: hiring-loop-design
3
+ description: "Use when shaping an engineering hiring loop — stages, take-home vs live, calibration, bar-raiser, signal-vs-noise audit. Triggers on 'design our interview loop', 'audit our hiring bar'."
4
+ status: active
5
+ tier: senior
6
+ source: package
7
+ domain: process
8
+ context_spine: [org-stage, product, customer-segment]
9
+ ---
10
+
11
+ # hiring-loop-design
12
+
13
+ ## When to use
14
+
15
+ - A first engineering hiring loop is being designed (early-stage co, first dedicated EM, first PM hire) and the question is *what stages, in what order, with what signal each*.
16
+ - An existing loop is producing inconsistent outcomes (high false-positive rate, high false-negative rate, long time-to-hire) and the question is *which stage to fix*.
17
+ - A new role family inside engineering is opening (first staff IC, first SRE, first ML eng) and the question is *what does the loop look like for this archetype*.
18
+
19
+ Do NOT use this for non-engineering hiring as the primary surface (sales / GTM hiring is a different loop shape entirely), as a sourcing / recruiting-pipeline skill (separate surface area), or for applicant-tracking-system configuration.
20
+
21
+ ## Cognition cluster
22
+
23
+ - **Mental model 1 — First principles.** Strip hiring to: *what signal does each stage produce that no other stage can produce?* Stages that duplicate signal waste candidate-time and interviewer-time. The strongest loops have one stage per signal, not five stages probing the same thing. See [`mental-models.md`](../../../docs/contracts/mental-models.md) § 1.
24
+ - **Mental model 28 — Inversion.** *"What would make a great hire withdraw from this loop?"* — usually: 7+ stages, take-home > 6 hours, no role-context conversation, long calendar gaps, no senior-IC time, no offer-narrative. Inversion surfaces the canonical withdrawal causes; great candidates have options and use them.
25
+ - **Mental model 21 — Second-order thinking.** A loose bar at L4 produces a chain: weaker L4 → harder L5 calibration → erodes ladder credibility → ICs leave. A single accept-the-no-vote ripples for 2+ years. The cost of a wrong hire dwarfs the cost of a missed hire; bar discipline is a multi-year compounding decision.
26
+ - **Mental model — Base rates.** Most signals in interviews are noise; the most-confident signal-claim is usually the most-overfit to one observation. Calibrate against the base rate: *"out of 10 candidates who passed this stage with this signal, how many succeeded at 12 months?"* If unknown, the stage is unfalsifiable.
27
+ - **Context-spine — org-stage + product + customer-segment.** Read **org-stage** for what's affordable (pre-seed: 3-stage loop, fast; growth: 5-stage with calibration; late: 5–6 with bar-raiser). Read **product** for what behaviors matter (deep-systems = system-design heavier; consumer = velocity + judgment heavier; regulated = ethics + judgment heavier). Read **customer-segment** for stakeholder-management exposure needed.
28
+
29
+ ## Cross-wing handoff
30
+
31
+ - Composed downstream of Q1 `org-design` — the role-family shape determines the loop shape; hiring without a clear role definition is broken from stage 1.
32
+ - Composed downstream of Q4 `perf-feedback-craft` — the calibration session is structurally a feedback exchange about a candidate; Q4's SBI + ladder-of-inference apply.
33
+ - Hands off to Q3 `onboarding-program` — the loop's signal evidence becomes the day-1 ramp-evidence base.
34
+ - Hands off to Q2 `comp-banding` for the offer construction step.
35
+
36
+ ## Procedure
37
+
38
+ ### Step 0: Define the role-shape before designing the loop
39
+
40
+ For the role being hired, name:
41
+
42
+ 1. **Level** — L3 / L4 / L5 / L6 / staff / principal. Levels matter because signal evidence changes per level (L4 needs strong execution signal; L6 needs leverage / system-design signal).
43
+ 2. **Archetype** — IC-builder / IC-strategist / IC-system-designer / manager / staff-multiplier. Same level, different archetype = different loop shape.
44
+ 3. **First-90-day deliverable** — what should this person ship by day 90. Concrete enough to design loop signals against.
45
+
46
+ A loop designed without a role definition produces noise. Force the role definition step.
47
+
48
+ ### Step 1: Map signal needs to stages
49
+
50
+ For the role from Step 0, enumerate the signals that need direct evidence:
51
+
52
+ 1. **Coding / craft** — for IC roles. Live coding, take-home, or pair-programming.
53
+ 2. **System design** — for L5+. Two-hour bounded-scope problem.
54
+ 3. **Domain judgment** — for senior roles. Behavioral case with context-specific tradeoffs.
55
+ 4. **Communication / stakeholder** — for any role. Cross-functional collaboration exercise.
56
+ 5. **Leadership / multiplier** *(L6+)* — narrative of past leverage, mentoring decisions, ladder reasoning.
57
+ 6. **Values fit** — explicitly NOT culture fit. Concrete questions about handling pressure / disagreement / failure.
58
+
59
+ One signal per stage. If two stages probe the same signal, kill one.
60
+
61
+ ### Step 2: Pick the stage shape per signal
62
+
63
+ For each signal, pick the lightest-touch stage that produces the signal cleanly:
64
+
65
+ 1. **Recruiter / role-context call** *(30 min)* — role + company + light values. NOT a screen.
66
+ 2. **Hiring-manager screen** *(45 min)* — judgment + role-fit + reverse-context. Required.
67
+ 3. **Coding signal** — choose: (a) take-home ≤ 3 hours with explicit time-cap (most candidate-respectful for senior); (b) live coding 60 min (for L3–L4, faster signal); (c) pair-programming 90 min (best signal but heavy on interviewer time).
68
+ 4. **System design** *(2 hours, L5+)* — bounded scope; rubric set in advance.
69
+ 5. **Behavioral / domain judgment** *(45–60 min)* — structured by signal area; SBI-anchored prompts.
70
+ 6. **Leadership / values** *(45–60 min)* — narratives + specific past-decision probes.
71
+ 7. **Bar-raiser / cross-team** *(45 min, L5+)* — perspective from outside the hiring team to check ladder consistency.
72
+
73
+ Take-home > 6 hours = candidate-hostile and produces survivorship-biased pools (only those with no other options finish). Keep ≤ 3 hours or skip.
74
+
75
+ ### Step 3: Design the rubric per stage
76
+
77
+ Each stage gets a written rubric before the first interview runs:
78
+
79
+ 1. **Signal target** — what behavior demonstrates the signal at this level.
80
+ 2. **Anti-signal** — what behavior fails the signal (named, not inferred).
81
+ 3. **Strong-no-hire / no-hire / hire / strong-hire** — four-band scoring, not 5-band (5-band collapses to "3" for everything).
82
+ 4. **Evidence anchor** — what the interviewer writes down to support the rating; rating without evidence = inadmissible.
83
+
84
+ Loops without rubrics produce gut-feel hires and hidden bias. Force rubrics before the loop opens.
85
+
86
+ ### Step 4: Calibration session shape
87
+
88
+ For every offer-eligible candidate, run a calibration session before sending offer:
89
+
90
+ 1. **All interviewers attend** — synchronous or async-with-deadline.
91
+ 2. **Evidence-first reading** — each interviewer reads their rubric + evidence aloud (or shares the doc) before opinions are stated.
92
+ 3. **Bar-raiser veto** *(L5+)* — bar-raiser can no-hire even when the team votes yes; reverse is not true (team can no-hire over bar-raiser yes).
93
+ 4. **Decision** — strong-hire if all four bands are hire-or-above, no single strong-no; gray-zone = no-hire by default (the cost of a wrong hire dwarfs the cost of a missed hire).
94
+
95
+ A loop without calibration loosens; the bar erodes by 5% per quarter without explicit recalibration cycles.
96
+
97
+ ### Step 5: Validate the loop design before opening it
98
+
99
+ Before running the first candidate through, inspect three things:
100
+
101
+ 1. **Signal-stage mapping check** — confirm Step 1's signals each map to exactly one stage from Step 2; duplicate-signal stages fail and must be merged or dropped.
102
+ 2. **Rubric completeness** — assert every stage in Step 3 has a written rubric with signal + anti-signal + four-band scoring + evidence anchor; missing rubrics fail.
103
+ 3. **Candidate-time check** — verify total candidate hours ≤ 8 (5 interview hours + 3 take-home); loops exceeding 8 hours produce survivorship bias and erode top-of-funnel.
104
+
105
+ All three must pass. If any fails, return to the failing step.
106
+
107
+ ### Step 6: Emit the hiring-loop design
108
+
109
+ Produce the loop-design artifact for the hiring team. The artifact contains the role-shape, the stage-signal map, the per-stage rubrics, the calibration shape, and the candidate-time budget. The first three candidates after the loop opens trigger a retrospective check (signal-vs-actual review).
110
+
111
+ ## Related Skills
112
+
113
+ **WHEN to use this**
114
+
115
+ - New engineering role family being opened.
116
+ - First-pass hiring loop design.
117
+ - Audit of an existing loop with inconsistent outcomes.
118
+ - Calibration / bar-raiser design.
119
+
120
+ **WHEN NOT to use this**
121
+
122
+ - Non-engineering hiring loops (GTM / sales / ops) — different loop shape; out of scope here.
123
+ - Role / level decisions independent of hiring — route to [`comp-banding`](../comp-banding/SKILL.md) (Q2) for ladder design.
124
+ - Feedback shape — route to [`perf-feedback-craft`](../perf-feedback-craft/SKILL.md) (Q4); S2 composes Q4 at the calibration session.
125
+ - Onboarding after hire — route to [`onboarding-program`](../onboarding-program/SKILL.md) (Q3).
126
+
127
+ ## When the agent should load this
128
+
129
+ - "Design our interview loop."
130
+ - "Audit our hiring bar."
131
+ - "Should we use a take-home?"
132
+ - "Why are we mis-hiring at L5?"
133
+ - "Wie sieht unser Hiring-Loop für Staff Engineer aus?"
134
+
135
+ ## Output
136
+
137
+ 1. **`role-shape.md`** — level + archetype + first-90-day deliverable definition.
138
+ 2. **`stage-signal-map.md`** — signal-needed × stage × time-budget × interviewer pool.
139
+ 3. **`per-stage-rubrics.md`** — signal + anti-signal + four-band scoring + evidence anchor per stage.
140
+ 4. **`calibration-shape.md`** — session format + bar-raiser rules + gray-zone default.
141
+ 5. **`candidate-time-budget.md`** — total candidate hours + take-home cap + scheduling shape.
142
+
143
+ ## Gotcha
144
+
145
+ - "Culture fit" is a known bias-amplifier; use "values fit" with concrete questions about pressure / disagreement / failure.
146
+ - Take-home longer than 3 hours = survivorship bias in your funnel; you get only those with no other options.
147
+ - Loops that go above 5 onsite hours produce candidate withdrawal; great candidates have options.
148
+ - A no-rubric loop produces "felt good in the room" hires that don't replicate.
149
+ - Gray-zone calibration default must be no-hire; cost of false-positive dwarfs cost of false-negative.
150
+
151
+ ## Do NOT
152
+
153
+ - Do NOT design a loop without a written role-shape; un-defined roles produce un-falsifiable signals.
154
+ - Do NOT score on 5-band scales; everything collapses to "3" and the rubric becomes decorative.
155
+ - Do NOT skip calibration on a "we all agree" basis; the disagreements are where the signal lives.
156
+
157
+ ## Runnable example
158
+
159
+ Series-B SaaS opens its first staff-IC role; current loop is L4-shaped (45-min coding + 60-min behavioral + offer) and last two staff-level offers churned at 9 months.
160
+
161
+ - Step 0 — Role-shape: L6 staff IC, archetype = system-designer with multiplier impact. First-90-day deliverable: scope and own one cross-team platform initiative.
162
+ - Step 1 — Signal needs: system design (L6 anchor), domain judgment, leadership / multiplier, coding sanity-check, values fit. Five signals.
163
+ - Step 2 — Loop: recruiter call (30) + HM screen (45) + system design (120) + domain judgment behavioral (60) + leadership narrative (60) + bar-raiser (45) + light coding sanity-check (60). Total candidate time: 7 hours + 0 take-home = 7 hours. Within 8-hour budget.
164
+ - Step 3 — Rubrics drafted per stage; system-design rubric anchored to "drove three nontrivial architecture decisions with explicit tradeoffs" not "designed a great system".
165
+ - Step 4 — Calibration: all 7 interviewers + bar-raiser; evidence-first; gray-zone defaults to no-hire; bar-raiser can veto.
166
+ - Step 5 — Validate: signal-stage 1:1 mapping (no duplication); rubrics complete; 7 hours fits the 8-hour budget. Pass.
167
+ - Step 6 — Emit loop; first three candidates trigger retrospective; review whether system-design rubric is calibrated to actual L6 staff behavior or diverging toward L5.