npm - newsjack - Versions diffs - 0.1.5 - Mend

newsjack 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (54) hide show

package/.mcp.json +9 -0
package/.newsjack-npm +1 -0
package/COMMIT +1 -0
package/LICENSE +21 -0
package/README.md +133 -0
package/VERSION +1 -0
package/bin/newsjack +74 -0
package/package.json +37 -0
package/skills/.gitkeep +0 -0
package/skills/ETHICS.md +265 -0
package/skills/WHY-NOT-SPAM.md +257 -0
package/skills/angle-generator/SKILL.md +224 -0
package/skills/angle-generator/examples.md +517 -0
package/skills/angle-generator/rubric.md +219 -0
package/skills/coverage-tracker/SKILL.md +124 -0
package/skills/coverage-tracker-setup/SKILL.md +84 -0
package/skills/crisis-holding/SKILL.md +336 -0
package/skills/crisis-holding/examples.md +302 -0
package/skills/crisis-holding/rubric.md +218 -0
package/skills/fact-check/SKILL.md +212 -0
package/skills/fact-check/examples.md +195 -0
package/skills/fact-check/rubric.md +228 -0
package/skills/journalist-fit-check/SKILL.md +199 -0
package/skills/journalist-fit-check/examples.md +271 -0
package/skills/journalist-fit-check/rubric.md +251 -0
package/skills/meanest-editor/SKILL.md +112 -0
package/skills/meanest-editor/examples.md +331 -0
package/skills/meanest-editor/rubric.md +275 -0
package/skills/media-list-manager/SKILL.md +204 -0
package/skills/media-list-manager/examples.md +88 -0
package/skills/media-list-manager/rubric.md +67 -0
package/skills/news-search/SKILL.md +56 -0
package/skills/newsjack-detector/SKILL.md +286 -0
package/skills/newsjack-detector/examples.md +118 -0
package/skills/newsjack-detector/references/engine-cli.md +29 -0
package/skills/newsjack-detector/references/harness-routing.md +38 -0
package/skills/newsjack-detector/references/rss-feeds.json +106 -0
package/skills/newsjack-detector/rubric.md +160 -0
package/skills/newsjack-monitor-setup/SKILL.md +202 -0
package/skills/newsjack-monitor-setup/examples.md +106 -0
package/skills/newsjack-triage/SKILL.md +98 -0
package/skills/newsworthiness-check/SKILL.md +179 -0
package/skills/newsworthiness-check/examples.md +232 -0
package/skills/newsworthiness-check/rubric.md +218 -0
package/skills/pr-strategist/SKILL.md +304 -0
package/skills/reactive-comment/SKILL.md +297 -0
package/skills/reactive-comment/examples.md +284 -0
package/skills/reactive-comment/rubric.md +280 -0
package/skills/relevance-coarse-filter/SKILL.md +61 -0
package/skills/story-origin-check/SKILL.md +160 -0
package/skills/voice-extractor/SKILL.md +330 -0
package/skills/voice-extractor/examples.md +227 -0
package/skills/voice-extractor/rubric.md +251 -0
package/skills-manifest.json +254 -0

package/skills/newsjack-detector/SKILL.md ADDED Viewed

@@ -0,0 +1,286 @@
+---
+name: newsjack-detector
+description: "Monitor current news and reaction signals, then decide which are credible newsjacking opportunities for a client. Uses the local monitoring engine for evidence, but the skill owns PR judgment, brand safety, standing, decay, angle fit, and handoff."
+when_to_use: "User wants to monitor news for pitchable hooks, find newsjacking opportunities, react to breaking industry news, watch competitors/topics, or decide whether a current signal is worth turning into an angle or reactive comment."
+---
+# Newsjack Detector
+Find timely public signals and decide whether a client has a credible, non-spammy reason to use them. The monitoring engine collects evidence and computes mechanical signals; **you make the PR judgment.**
+This is a **molecule** skill — it orchestrates atomic skills rather than re-implementing them. Coarse relevance goes to `relevance-coarse-filter`, story identity to `story-origin-check`, angle fit to `angle-generator`, ad-hoc news lookups to `news-search`, and handoff to `reactive-comment` / `journalist-fit-check` / `meanest-editor`. Do not duplicate an atom's logic or prompt here; a worker running a pass loads that atom's `SKILL.md` directly, so the atom stays the single source of truth.
+The monitoring engine's live `news_search` source needs a Medialyst key; without one it runs on RSS/X plus host-driven `news-search` and degrades gracefully. Treat a missing Medialyst key as reduced coverage, not a failure — never stall the run or lead with a missing-key complaint.
+## Required Workflow (follow in order)
+**Default mode: run the canonical pipeline and return a report.** This skill exists to produce a freshness-gated newsjack report, including for scheduled/cron runs. Execute by default — only drop into discussion/planning when Step 2 is blocked.
+1. **CHECK DOCTRINE.** If `skills/ETHICS.md` or `skills/WHY-NOT-SPAM.md` exist, follow them. This skill refuses tragedy hooks, fabricated standing, fake urgency, and spray-and-pray output. These blocks are absolute and override every later step.
+2. **ANCHOR THE CLIENT — ASK FIRST ONLY IF BLOCKED.** Identify company, topics, competitors, spokespeople, standing, and client-specific exclusions, from a profile JSON or plain-text context.
+   - No profile **and** no usable client context → ask for it before running. Never invent profile facts.
+   - Genuinely ambiguous (which client? which topic? one-off vs recurring?) → ask one clarifying question, then proceed. Otherwise do not stall the run.
+   - Missing standing is not a blocker: monitor, but mark opportunities `weak`/`no-standing`.
+   - **Load the client brief.** Read the monitor's `brief.md` (its path is surfaced as `brief_path` by `monitor run`/`monitor status`, or it sits next to the profile). It is the **source of truth** for what this client will and won't pitch and how to present the scan — see **Client Brief** below. An empty/template brief carries no rules.
+3. **PICK THE RUN SHAPE.**
+   - One-off / "what's moving on X" → **Quick Run** below.
+   - Real judgment, agent run, or scheduled job → **Canonical Pipeline** below (the default for any output a human or pitch will rely on).
+   - Recurring / cron feed monitoring → Canonical Pipeline plus the recurring rules in **Freshness Gate** (`--feed-only --new-only --max-age-hours 24`, hard freshness gate).
+4. **RUN THE PIPELINE.** Execute the chosen path end to end. For anything beyond a Quick Run, never skip the story-origin / freshness gate.
+5. **JUDGE — NEVER TRUST MECHANICS AS PERMISSION.** `routing.queue_priority` and `story_size` are recall pressure, not pitch permission. You decide newsjacking-worthiness, standing, journalist shape, and brand safety (see **Engine vs Skill Boundary** and `rubric.md`). Gate angle fit through `angle-generator`.
+6. **VERIFY & CONCLUDE.** Run the **Completion Checklist**, then report: the `run.md` path, whether coarse passes were cost-optimized or fallback, whether every surfaced signal has verified ≤24h first-public freshness, and top findings.
+## Engine vs Skill Boundary
+The Go CLI owns (mechanical, deterministic):
+- ingestion, dedupe, clustering, novelty tracking
+- mechanical scores only: freshness, source agreement, novelty, profile match, source quality, momentum, major-news weight
+- deterministic story-size scoring from news-search metadata: log-scaled estimated monthly traffic + domain authority, with coverage spread across independently surfaced domains
+- deterministic hygiene filtering for docs/help/product/SEO pages
+- coarse-relevance application via `newsjack filter-apply`, plus two recall guards: a **big-story guard** that upgrades *any* `reject` of a `high`/`major` `story_size` signal to `monitor_only` (`big_story_recall`) — the cheap pass can never hard-drop a big story — and a **profile-match guard** that upgrades `reject/no_profile_bridge` to `monitor_only` when detector/profile evidence already matched the client, a competitor, or a profile term
+- deterministic freshness gating via `newsjack origin-apply`
+- operational routing: lane, queue priority, threshold-demotion flag
+- deterministic safety flags
+You own (PR judgment):
+- whether the signal is newsjacking-worthy and whether the client has standing
+- same-story / original-coverage judgment (via `story-origin-check`)
+- final decay explanation from `freshness_gate`
+- journalist shape, brand-safety judgment, and handoff to the next skill
+Never treat `routing.queue_priority` as permission to pitch — it is only operational queue order.
+## Client Brief
+Each monitor may carry a `brief.md` — a prose, user-owned statement of what this client will and won't pitch and how they want the scan presented. It is the **source of truth** for client pitch/output policy; the profile JSON governs *collection*, the brief governs *what gets pitched and shown*. The CLI only creates and surfaces the file (`monitor init` scaffolds it; `brief_path` is reported by `monitor run`/`monitor status`); it never parses it — reading and applying it is yours.
+- **Where it binds:** triage and report rendering — never collection. Keep retrieval and the coarse pass brief-agnostic so nothing is dropped before judgment; the brief only decides what an already-collected, already-fresh item is allowed to *be* and how it's *shown*.
+- **Never pitch rules** are hard: an item matching one can never be `pitch_ready`. A non-big item drops to `watch` (`client_policy_exclusion`); a fresh `high`/`major` item stays `big_story` with `off_policy: true` (the never-drop doctrine still holds — surface it, don't hide it). `newsjack-triage` enforces this.
+- **Audience / We pitch** set the standing *altitude*: topical overlap is not pitchability. A story can be on-topic and still off-brief.
+- **How to surface** is presentation only: collapse a section to a disclosed count, never silence it. Disclose what the brief held back (count + reason) so nothing is hidden.
+- **Feedback updates the brief.** When the user reacts to a run — "too policy-heavy," "stop showing me X," "this is exactly right" — propose an edit to `brief.md` (a new *We never pitch* rule, a *How to surface* line, or a dated *Example*) so the policy is captured durably, not just for this run. Confirm the edit. An empty/template brief means run with defaults.
+## Profile Setup File
+The monitor profile JSON is the source of truth for collection setup: a focused set of short broad beat topics, search terms, competitors, feeds, standing, spokespeople, and exclusions. Prefer 6-8 core 2-3 word topics, with one-word topics allowed when natural. If the user wants to change what the monitor looks for, edit the profile JSON rather than generating one-off retrieval terms during a detector run.
+- Installed monitors keep the setup file at `~/.newsjack/monitors/<slug>/profile.json`; `brief.md` sits next to it.
+- Direct detector runs use the file passed with `--profile`.
+- Fixture profiles live under `fixtures/newsjack-detector-agent/profile.<slug>.json`.
+Use `newsjack-monitor-setup` when the user wants to create or materially revise a profile. Collection feedback such as "watch broader accounting firm news" belongs in `profile.json` (`topics` / `search_terms` / `feed_urls`): put 6-8 core broad beats in `topics`, and put broad retrieval terms plus named platforms/products/regulators/competitors in `search_terms`. Pitch-policy feedback such as "don't pitch policy stories" belongs in `brief.md`. After editing `profile.json`, rerun a mock or fixture smoke before trusting the next live run.
+## Quick Run
+One-off discovery and scans:
+```bash
+newsjack detector run --profile profile.json --save
+```
+The detector emits JSON only; render any human scan yourself from the artifact facts. Use `--topic "explicit user topic"` only when the user deliberately asks to add a one-off retrieval topic. Routine profile runs should rely on the profile's durable `topics` and `search_terms`, not ad hoc generated retrieval terms. Use `--mock` for local verification without credentials. Full flag/source/env reference: `references/engine-cli.md`.
+For each queued signal, inspect title, sources, evidence URLs, age, `routing.lane`, `mechanical_scores` (`major_news`, `novelty`, `source_agreement`), profile matches, and safety flags. For `x` evidence inspect `x_signal_type`, `x_social_signals`, `x_author_followers`, `x_query_counts`; treat lone low-reach posts as noise. A high `major_news` means the story is broadly important, **not** that the client has standing. Treat engine age/decay as provisional until `story-origin-check` verifies the first-public clock. Then apply `rubric.md` and the **Output Format**.
+## Canonical Pipeline
+The artifact contract is the source of truth. Write all artifacts to a timestamped run folder:
+```text
+RUN_DIR/
+  candidates.json              # 1. detector output
+  coarse_relevance_decisions.json   # 2. coarse pass
+  relevant_candidates.json     # 3. filter-apply
+  clustered_candidates.json    # 3b. cluster — same-story dedup + stale pre-gate
+  origin_findings.json         # 4. story-origin pass (representatives only)
+  targeted_candidates.json     # 5. origin-apply (freshness authority)
+  triaged_candidates.json      # 5b. newsjack-triage — standing + consolidation
+  final_report.md              # 7. compiled 3-bucket scan (pitch-ready / big stories / watch)
+  run.md                       # 8. skill-rendered — THE human-facing artifact
+  detector.stderr.log  commands.log  summary.json
+```
+Only `run.md` is human-facing; the rest are provenance.
+1. **Run the detector and save candidates.** This is the **canonical invocation** — use it verbatim for any run a human or pitch will rely on, across every harness, so runs stay comparable:
+   ```bash
+   newsjack detector run --profile profile.json --sources news_search,x --lookback-days 1 --depth quick --limit 80 --min-queue-priority 40 --min-major-news 0.55 > candidates.json
+   ```
+   The floors `--min-queue-priority 40` and `--min-major-news 0.55` are the engine defaults; they define the emitted pool. **Do not lower them and do not pass `--include-all-scored` or `--no-hygiene-filter`** (debug-only) for a real run — they change which signals reach the report and make two runs of the same profile incomparable. Profile terms own durable retrieval; do not hand-tune the query per run unless the user explicitly asked for a one-off `--topic`. For recurring/cron precision add `--demote-unmatched-x` (see **Freshness Gate**); that is the only flag the canonical command grows.
+2. **Coarse relevance pass** → `coarse_relevance_decisions.json`. High-recall junk removal only — no ranking, angles, dates, or pitch decisions. Each worker loads `skills/relevance-coarse-filter/SKILL.md` and applies it to its assigned signals; merge every worker's output into one `decisions` array. For model/worker routing and chunking, see `references/harness-routing.md`.
+3. **Apply coarse decisions:**
+   ```bash
+   newsjack filter-apply --candidates candidates.json --decisions coarse_relevance_decisions.json --include keep --include monitor_only --output relevant_candidates.json
+   ```
+3b. **Cluster same-story signals before the expensive retrieval pass:**
+   ```bash
+   newsjack cluster --candidates relevant_candidates.json --drop-stale --window-hours 24 --output clustered_candidates.json
+   ```
+   The Go CLI collapses syndicated pickups / near-duplicate headlines of the **same public event** into one representative (it shares findings, so 15 NVIDIA-GTC copies cost one story-origin retrieval, not 15) and records the rest in `clustered_duplicates`. `--drop-stale` deterministically pre-gates low-story-size signals whose detector decay is clearly outside the window (`week`/`month`) into `pre_gated_stale`, so they skip retrieval entirely; large stories (`high`/`major`) are always researched regardless of age. Run story-origin on `clustered_candidates.json` (representatives only). Disclose how many duplicates and stale items were collapsed.
+4. **Story-origin pass** on `clustered_candidates.json` (representatives) → `origin_findings.json`. Each worker loads `skills/story-origin-check/SKILL.md` and applies it per signal: decide same-story vs material-new-development, recover `first_public_at`, `original_url`, and canonical major coverage. It must **not** compute `fresh`/`stale`, must return **one finding per signal (never skip)**, and must cite **≥2 independent corroborating sources** to support a fresh clock. Merge the per-signal results into one `findings` array, keyed by `signal_id`. Validate the count against the input and re-run any gaps. The story-origin pass needs retrieval — see `references/harness-routing.md`.
+5. **Apply the deterministic freshness gate:**
+   ```bash
+   newsjack origin-apply --candidates clustered_candidates.json --origins origin_findings.json --window-hours 24 --output targeted_candidates.json
+   ```
+   The Go CLI is the freshness authority — it computes `freshness_gate.computed_status` from the run timestamp and cutoff. If an LLM labels May 8 fresh for a May 25 run, `origin-apply` marks it stale. Non-fresh signals carry a specific reason: `stale`, `unverified_no_corroboration` (worker cited <2 independent sources — a pipeline/worker-quality miss), `unverified_boundary` (date-only clock straddling the cutoff), or `unverified_no_timestamp` (no clock recovered). Distinguish these in the report and in metrics: `unverified_no_corroboration` means *we* didn't verify, not that the story is old.
+5b. **Standing triage** on the selected fresh signals in `targeted_candidates.json` → `triaged_candidates.json`. Load `skills/newsjack-triage/SKILL.md` and pass it the **client brief** when present: re-consolidate any same-story representatives that slipped through, apply the brief's **never-pitch** rules (off-policy items can never be `pitch_ready`; fresh big ones stay `big_story` with `off_policy: true`, small ones drop to `watch`/`client_policy_exclusion`), assign `strong`/`partial`/`none` standing at the brief's audience altitude with a journalist-shape sanity check, and **route each story to a tier**: `pitch_ready` (strong, or partial with a sharp shape), `big_story` (a fresh `high`/`major` story that lacks standing — **never dropped**, always surfaced as a suggestion with a `bridge_note` + `relevance_confidence`), or `watch` (small/non-big with no standing, off-beat, duplicate). This is the standing gate the engine cannot make — it replaces ad-hoc orchestrator judgment so the decision is auditable. Only `watch` withholds a story, and only for items that are neither pitchable nor big.
+6. **Angle generation** on the **routed** candidates in `triaged_candidates.json`. Run `angle-generator` in **pitch mode** on `pitch_ready` items (a candidate is pitchable only if it yields ≥1 honest, journalist-shaped angle; zero viable angles downgrades it to `big_story` if the story is big, else `watch`) and in **exploratory mode** (`context.mode: exploratory`) on `big_story` items (at most one tentative `suggestion` angle; an empty result is fine and does **not** drop the story — it still appears as "awareness only").
+7. **Compile `final_report.md`** — a 3-bucket scan, story-first and skimmable. The fixture's `scripts/build_report.py` is the reference implementation; the skill owns the human report shape. Lead with a **Today's read** line (`N pitch-ready · M big stories · K watched`) and a funnel line that asserts nothing pitchable or big was dropped off-screen. Then three sections, organized by the two independent axes — **standing** (can the client act?) and **magnitude** (how big is the story?):
+   - `## ✅ Pitch-Ready` (`pitch_ready` tier): each story shows freshness (with **both** the first-public date *and* the new-development date for `fresh_new_development`), standing, the angle-generator angles, and its link provenance.
+   - `## 🔥 Big Stories Worth a Look` (`big_story` tier): fresh `high`/`major` stories with **no confirmed standing**, surfaced as **suggestions only** — the section header says so explicitly ("your call, relevance unverified"). **Sorted by coverage spread (distinct surfaced outlet count) desc**, no cap. Each shows the magnitude label + outlet count, freshness, the honest `bridge_note`, confidence flags (incl. the coarse `weakness_flag` → e.g. `⚠ possible keyword match`), provenance, and at most one `suggestion`-tagged angle (or "no clean angle — awareness only"). This is how we surface big stories without ever making the drop decision; telling a real story apart from a high-authority-domain artifact is done by **ranking and flagging here**, never by dropping upstream.
+   - `## 👀 Watch / Context`: `watch`-tier (fresh but no standing, non-big) plus freshness-gated items (`stale`/`unverified_*`), with plain reasons and dates. Big-but-stale items are marked.
+   - **Link provenance (all sections):** **One main source = the source of record** — the article the detector actually surfaced, real `published_at`, flagged when thin (`⚠ single source`, `⚠ source of record is an aggregator`). **Related coverage** underneath: clustered duplicate pickups (tagged `surfaced duplicate`) plus any `canonical_coverage_url`/`original_url` the worker *proposed*, shown with date marked **unverified** and tagged `proposed by research — UNVERIFIED`. **Never promote a worker-proposed link into the main-source position** — the anti-laundering rule. Every link carries a date.
+   Links must be clickable Markdown, not backticked or bare URLs. Do not present mechanical rank as a final fit verdict.
+   **Honor the client brief's *How to surface*** here: if the brief asks to collapse a section (e.g. the big-stories/awareness section), render it as a one-line **disclosed count with reasons**, never silence it. Lead with whatever the brief prioritizes. State plainly when the brief moved an item out of `pitch_ready` or collapsed a section, and quote the rule (`policy_rule`). `scripts/build_report.py` is the brief-agnostic mechanical reference (`final_report.md`); the brief is honored in the skill-rendered `run.md`.
+8. **Write `run.md` yourself from the artifacts.** The CLI does not render reports. It only emits deterministic JSON. Use `final_report.md` plus the artifact facts to write a human-facing `run.md` in the run folder.
+   The report must be rendered from the **gated/fresh/triaged artifacts**, never raw `candidates.json` alone. Do not resurface coarse-rejected or hard-safety-flagged signals in the ✅/🔥 sections. The only hard drops are mechanical (URL-pattern hygiene) and hard-safety flags; disclose their counts from the JSON artifacts so nothing is hidden — never silently truncate. If you need a machine-readable artifact index, run:
+   ```bash
+   newsjack run-summary targeted_candidates.json --output summary.json
+   ```
+   `run-summary` writes JSON metadata only; it does not write Markdown or make editorial decisions.
+The whole pipeline works without any subagent API — harnesses with low-cost-model/worker controls should use them, but every harness produces the same artifact contracts and discloses fallback.
+## Freshness Gate
+For recurring scheduled output, a signal is not surfaceable until its Go-computed `freshness_gate.computed_status` is verified. News-search `published_at` values are good article-publication evidence for recovering originals, but they alone never decide same-story status or first publication — that is the `story-origin-check` atom's job.
+Recurring output rules:
+- Surface only `fresh` or `fresh_new_development`. Reject `stale` and every `unverified_*` status. The unverified statuses are distinct on purpose: `unverified_no_corroboration` (worker cited <2 independent sources — a *pipeline* miss, often re-runnable), `unverified_boundary` (date-only clock straddling the cutoff), `unverified_no_timestamp` (no clock recovered). Report them separately so worker-quality misses are not mistaken for genuinely old stories.
+- Run with `--demote-unmatched-x` so unmatched X News/Trends clusters fall below the queue floor unless the large-story recall guard lifts them. X News surfaces for review by default; recurring precision wants it demoted unless it is a genuinely large story.
+- Cluster (step 3b) before retrieval and prefer `--drop-stale` so syndicated duplicates and clearly-old low-value items never burn story-origin retrieval.
+- Do **not** reset the clock for AOL, Yahoo, MSN, Apple News, partner syndication, wire pickup, SEO rewrites, or "published today" pages whose canonical/source story is older.
+- A newer article restarts the clock only if it adds a concrete new public fact: official action, filing, statement, data/report publication, material company update, new local impact, or another independently coverable development.
+- Prefer `story_origin.canonical_coverage_url` as the report's main link — the major/most authoritative same-story coverage, not the random pickup that triggered retrieval.
+`origin-apply` attaches `story_origin` and the deterministic `freshness_gate` to selected and rejected signals. If the first-public timestamp can't be verified, write `first_public_at: null` and explain the gap; `origin-apply` computes the appropriate `unverified_*` status.
+## Handoff
+- Breaking / same-day sourced comment → `reactive-comment`
+- Needs story framing → `angle-generator`
+- Named journalist check → `journalist-fit-check`
+- Draft critique → `meanest-editor`
+## Completion Checklist
+Before reporting the run complete:
+- `coarse_relevance_decisions.json` has exactly one decision per emitted candidate (unless `--allow-missing`).
+- `clustered_candidates.json` was produced by `cluster`; story-origin ran on its representatives, and the run disclosed how many duplicates/stale items were collapsed.
+- `origin_findings.json` has exactly one finding per clustered representative (unless `--allow-missing`) — count validated, gaps re-run.
+- `targeted_candidates.json` was produced by `origin-apply`; `triaged_candidates.json` was produced by `newsjack-triage` with a `tier` per signal; `pitch_ready` went to `angle-generator` in pitch mode and `big_story` in exploratory mode.
+- No fresh `high`/`major` story was routed to `watch` — every fresh big story appears in **🔥 Big Stories Worth a Look** (or **✅ Pitch-Ready** if it earned standing). A brief never-pitch rule may move a big story *out of pitch-ready*, but it stays a surfaced `big_story` (`off_policy: true`), never dropped.
+- If a client brief is present, the report applied it: off-policy items are out of `pitch_ready`, any collapsed section shows a disclosed count + reason, and feedback this turn that changes policy was offered as a `brief.md` edit.
+- `final_report.md` is the 3-bucket scan (✅ Pitch-Ready / 🔥 Big Stories Worth a Look / 👀 Watch / Context), written from `targeted_candidates.json` / `triaged_candidates.json`, not raw `candidates.json`.
+- `run.md` was skill-rendered from the gated/fresh/triaged artifacts after `final_report.md` existed — never from raw `candidates.json` alone.
+- The ✅/🔥 sections contain **no** coarse-rejected or hard-safety-flagged signal; the only hard drops (URL-hygiene + hard-safety) have their counts disclosed from the JSON artifacts.
+- The final response names the `run.md` path, the cost-optimized-vs-fallback status, whether every surfaced signal has verified ≤24h first-public freshness, and top findings.
+## Output Format
+Return exactly this JSON object. No prose before or after it. Every opportunity must include source URLs in `evidence_used` — `story_origin.canonical_coverage_url` first when present, then the original/source URL and other support (usually 1–3 links across news, RSS, and X).
+```json
+{
+  "opportunities": [
+    {
+      "signal_id": "engine signal id",
+      "signal_title": "Observed public signal",
+      "verdict": "pitch_now",
+      "decay": {
+        "stage": "4hr",
+        "rationale": "Why this clock applies"
+      },
+      "story_size": {
+        "band": "low | moderate | high | major",
+        "score": 0,
+        "rationale": "How publication traffic/domain authority and coverage spread should affect effort priority"
+      },
+      "first_publication": {
+        "status": "fresh | fresh_new_development",
+        "first_public_at": "ISO timestamp or YYYY-MM-DD",
+        "original_url": "https://...",
+        "canonical_coverage_url": "https://... or null",
+        "canonical_coverage_source": "Outlet/source name or null",
+        "rationale": "Why this first-public clock controls"
+      },
+      "why_newsjacking_worthy": "Specific reason this is timely and not generic trend-chasing.",
+      "client_standing": {
+        "assessment": "strong | partial | weak",
+        "rationale": "What gives the client standing, or what is missing"
+      },
+      "journalist_shape": {
+        "beat_description": "Specific reporter shape, not a name",
+        "why_they_care_now": "Why this beat plausibly cares now",
+        "do_not_target": "Who should not receive this"
+      },
+      "evidence_used": [
+        {
+          "source": "news_search",
+          "title": "Evidence title",
+          "url": "https://...",
+          "published_at": "YYYY-MM-DD"
+        }
+      ],
+      "next_skill": "angle-generator"
+    }
+  ],
+  "rejected_signals": [
+    {
+      "signal_id": "engine signal id",
+      "signal_title": "Rejected public signal",
+      "reason": "no_client_standing",
+      "first_publication": {
+        "status": "stale | unverified_no_corroboration | unverified_boundary | unverified_no_timestamp | null",
+        "first_public_at": "ISO timestamp, YYYY-MM-DD, or null",
+        "original_url": "https://... or null",
+        "canonical_coverage_url": "https://... or null"
+      }
+    }
+  ],
+  "brand_safety_blocks": [
+    {
+      "signal_id": "engine signal id",
+      "signal_title": "Blocked public signal",
+      "reason": "tragedy_or_human_suffering"
+    }
+  ],
+  "monitor_notes": [
+    "Operational note or missing source, if relevant"
+  ]
+}
+```
+- Allowed verdicts: `pitch_now`, `develop_angle`, `monitor`, `reject`.
+- Allowed rejection reasons: `stale`, `freshness_unverified` (umbrella; or the specific `unverified_no_corroboration` / `unverified_boundary` / `unverified_no_timestamp`), `single_source`, `no_client_standing`, `no_journalist_shape`, `off_beat`, `already_seen`, `weak_signal`, `no_viable_angle`.
+- Allowed brand-safety block reasons: `tragedy_or_human_suffering`, `client_exclusion`, `regulated_claim_risk`, `fabrication_risk`.

package/skills/newsjack-detector/examples.md ADDED Viewed

@@ -0,0 +1,118 @@
+# Newsjack Detector Examples
+## Pitch Now
+Engine signal:
+```json
+{
+  "id": "s1",
+  "title": "FTC opens inquiry into AI compliance claims",
+  "sources": ["news_search", "x"],
+  "features": {
+    "decay_bucket": "4hr",
+    "source_count": 2,
+    "seen_before": false,
+    "profile_matches": ["AI compliance", "enterprise governance"],
+    "safety_flags": []
+  },
+  "routing": {
+    "lane": "profile_relevance",
+    "queue_priority": 86.2,
+    "demoted": false
+  },
+  "mechanical_scores": {
+    "freshness": 1.0,
+    "source_agreement": 0.78,
+    "novelty": 1.0,
+    "profile_match": 0.44,
+    "source_quality": 0.825,
+    "momentum": 0.21,
+    "major_news": 0.0
+  }
+}
+```
+Skill output:
+```json
+{
+  "signal_id": "s1",
+  "signal_title": "FTC opens inquiry into AI compliance claims",
+  "verdict": "pitch_now",
+  "decay": {
+    "stage": "4hr",
+    "rationale": "The signal is same-cycle by verified first-public clock, not just the search-result timestamp."
+  },
+  "first_publication": {
+    "status": "fresh",
+    "surfaced_article_published_at": "2026-05-25T13:14:00Z",
+    "first_public_at": "2026-05-25T13:10:00Z",
+    "original_url": "https://www.ftc.gov/news-events/news/press-releases/example",
+    "canonical_coverage_url": "https://www.reuters.com/legal/government/ftc-opens-inquiry-ai-compliance-claims-2026-05-25/",
+    "canonical_coverage_source": "Reuters",
+    "rationale": "The official FTC press release is the earliest verified public source and is inside the 24-hour cron window."
+  },
+  "why_newsjacking_worthy": "Regulator action creates a live need for explainers on AI compliance claims.",
+  "client_standing": {
+    "assessment": "strong",
+    "rationale": "The client works directly in enterprise AI governance and can explain claim substantiation."
+  },
+  "journalist_shape": {
+    "beat_description": "Enterprise AI reporter covering compliance and regulator scrutiny",
+    "why_they_care_now": "They need sourced reaction while the inquiry is fresh.",
+    "do_not_target": "General startup roundups or consumer AI reviewers"
+  },
+  "evidence_used": [
+    {
+      "source": "Reuters",
+      "title": "FTC opens inquiry into AI compliance claims",
+      "url": "https://www.reuters.com/legal/government/ftc-opens-inquiry-ai-compliance-claims-2026-05-25/"
+    },
+    {
+      "source": "FTC",
+      "title": "FTC opens inquiry into AI compliance claims",
+      "url": "https://www.ftc.gov/news-events/news/press-releases/example"
+    }
+  ],
+  "next_skill": "reactive-comment"
+}
+```
+## Monitor
+Engine signal: X discussion only, no news confirmation.
+Verdict: `monitor`
+Reason: "Single-source chatter with no confirmed news event. Watch for news search confirmation or official filing."
+## Reject
+Engine signal: "Influencers debate AI regulation again" with `month` decay and no new document.
+Verdict: `reject`
+Reason: `stale`
+Engine signal: AOL article published today, canonical URL points to a BBC story from May 4 with no new development.
+Verdict: `reject`
+Reason: `stale`
+`first_publication.status`: `stale`
+Engine signal: secondary article published today, no canonical/source metadata, and searches do not verify the first public source.
+Verdict: `reject`
+Reason: `freshness_unverified`
+## Brand-Safety Block
+Engine signal: public tragedy with high social momentum.
+Block reason: `tragedy_or_human_suffering`
+Do not hand off to angle generation.

package/skills/newsjack-detector/references/engine-cli.md ADDED Viewed

@@ -0,0 +1,29 @@
+# Engine CLI Reference
+The Go monitoring engine collects evidence, computes mechanical scores, and emits JSON. The `newsjack-detector` skill owns PR judgment, standing, story-origin reasoning, brand safety, and human-facing rendering.
+Do not duplicate the CLI surface in this file. Agents should discover current commands, flags, defaults, source availability, and credential recovery from the installed CLI:
+```bash
+newsjack help
+newsjack help detector
+newsjack detector run --help
+newsjack doctor
+```
+In this repo, prefer the source shim:
+```bash
+./bin/newsjack help detector
+./bin/newsjack detector run --help
+```
+For routine profile runs, rely on the profile:
+```bash
+newsjack detector run --profile profile.json --save
+```
+Use `--topic` only when the user explicitly asks for a one-off retrieval topic. Durable monitor discovery belongs in `profile.json`, not in ad hoc run terms.
+Profile caveat for agents: when `search_terms` are present, retrieval uses them instead of raw `topics + competitors`. Keep `topics`, `competitors`, and `standing` as matching/judgment context; put broad retrieval terms plus named platforms, products, regulators, and competitors in `search_terms` when they must drive collection. Terms must be static, explicit, and provenance-safe: user input, client materials, named entities, or current coverage, not model-remembered sector trends.

package/skills/newsjack-detector/references/harness-routing.md ADDED Viewed

@@ -0,0 +1,38 @@
+# Harness Routing & Chunking
+The two coarse passes (relevance, story-origin) are low-cost tasks. They become low-cost *model* passes only when the harness supports model selection or low-cost worker/subagent routing. The artifact contracts (`coarse_relevance_decisions.json`, `origin_findings.json`) are the same regardless of harness — only the cost story changes, and that must be disclosed.
+## Execution decision path
+Before each coarse pass, identify the harness and take the first available path:
+1. **Direct low-cost-model path.** If the harness can choose a model for a single step, run the coarse prompt with the lowest-cost reliable model below, then run the expensive pass with the strong model.
+2. **Low-cost subagent/worker path.** If the harness can't switch the current model but can spawn workers/subagents with a model hint, split candidates into chunks. Relevance workers return only `decisions`; origin workers return only `findings`. Merge each pass into its single JSON artifact.
+3. **Current-model fallback.** If neither is possible, run the prompts with the current model and state explicitly in the final response: `coarse passes ran with current model; this was semantic multi-stage, not cost-optimized multi-stage`.
+## Harness hints
+- **Claude Code / Claude-style coding harnesses:** if a `Task`/subagent tool or model override exists, use it for chunks and request the latest Haiku/low-cost alias; use the latest Sonnet or Opus alias for the expensive pass. No control exposed → current-model fallback.
+- **Codex:** if `spawn_agent` is available and model override is allowed, spawn workers with a low-reasoning small/fast model (`gpt-5.4-mini`, `gpt-5-nano`, or the newest GPT-5.x at low reasoning). Use `gpt-5.5` or the strongest Codex model at medium/high reasoning for the expensive pass. If override is not allowed, spawn default workers for parallelism or use fallback; disclose which.
+- **OpenClaw:** prefer low-cost worker fanout. Low-cost: Gemini 3 Flash Preview, Claude Haiku/latest alias, or GPT-5.x low-reasoning. Expensive: Gemini 3 Pro Preview, Claude Sonnet/Opus latest, or GPT-5.5+ higher reasoning.
+- **API harnesses:** call the configured low-cost model for coarse prompts, then the configured stronger model for the expensive rubric pass.
+- **Unknown harness:** current-model fallback unless an explicit low-cost-model or worker mechanism is exposed.
+## Preferred models
+- **Coarse passes:** Gemini 3 Flash Preview, Claude Haiku/latest Haiku alias, GPT-5-nano, GPT-5.4-mini low reasoning, GPT-5.5 low reasoning, or the harness's lowest-cost fast equivalent.
+- **Expensive pass:** Gemini 3 Pro Preview, Claude Sonnet/Opus latest aliases, GPT-5.5 medium/high reasoning, GPT-5.4 medium/high reasoning, or the harness's strongest reasoning model.
+- Treat Gemini 2.5 Pro as stale for this pipeline — fallback only when Gemini 3 Pro is unavailable. Gemini 2.5 Flash/Flash-Lite are coarse-pass fallbacks when Gemini 3 Flash is unavailable.
+- If the exact named model is unavailable, choose the closest current low-cost/fast model for pass 1 and the closest current strong reasoning model for pass 2.
+## Chunking guidance
+- 1–15 signals: one call per coarse pass is fine.
+- 16–40 signals: split into chunks of 8–15. Prefer at least 2 workers/subagents when the harness exposes them.
+- 41–80 signals: split into chunks of 8–12. Use worker/subagent fanout if available.
+- More than 80 signals: do not ask one low-cost call or one subagent to process everything. Split into chunks of 8–12; if that would need more than 8 low-cost workers, tighten detector `--limit`, lane caps, or rerun by profile/source lane before filtering.
+- Each chunk must include the profile context plus only its assigned signals. Each worker returns only items for its assigned signal IDs.
+- The merged `coarse_relevance_decisions.json` and `origin_findings.json` must each contain exactly one item per input signal unless intentionally using `--allow-missing`.
+- Do not let coarse workers perform the expensive rubric pass, compare across chunks, pick best bets, or write the final report.
+- If the harness has low-cost workers but no model override, still split large sets for reliability and disclose that model cost was not optimized.
+- The story-origin pass needs retrieval. If a low-cost worker cannot open pages or search the web, either give it extracted page/search evidence from the orchestrator or run that pass in the current harness with retrieval tools — do not let a worker return a story-identity verdict it had no evidence for.

package/skills/newsjack-detector/references/rss-feeds.json ADDED Viewed

@@ -0,0 +1,106 @@
+{
+  "version": 1,
+  "default_feeds": [
+    "techmeme",
+    "google-news-technology",
+    "google-news-business"
+  ],
+  "feeds": [
+    {
+      "id": "techmeme",
+      "name": "Techmeme",
+      "url": "https://www.techmeme.com/feed.xml",
+      "beats": ["technology", "ai", "startups", "venture", "platforms"],
+      "use_when": "Default for technology, AI, SaaS, startup, platform, and developer-tool profiles.",
+      "notes": "Curated tech/business front page. High precision for major tech stories."
+    },
+    {
+      "id": "google-news-technology",
+      "name": "Google News Technology",
+      "url": "https://news.google.com/rss/headlines/section/topic/TECHNOLOGY?hl=en-US&gl=US&ceid=US:en",
+      "beats": ["technology", "ai", "consumer tech", "platforms"],
+      "use_when": "Use as a broad technology backstop when Techmeme may be too narrow.",
+      "notes": "Broader and noisier than Techmeme."
+    },
+    {
+      "id": "google-news-business",
+      "name": "Google News Business",
+      "url": "https://news.google.com/rss/headlines/section/topic/BUSINESS?hl=en-US&gl=US&ceid=US:en",
+      "beats": ["business", "markets", "finance", "startups", "economy"],
+      "use_when": "Use for business, funding, M&A, markets, commercial real estate, and executive-change profiles.",
+      "notes": "Broad business feed; requires stricter relevance filtering."
+    },
+    {
+      "id": "google-news-science",
+      "name": "Google News Science",
+      "url": "https://news.google.com/rss/headlines/section/topic/SCIENCE?hl=en-US&gl=US&ceid=US:en",
+      "beats": ["science", "research", "space", "climate", "biotech"],
+      "use_when": "Use for research, biotech, climate, space, and science-led profiles.",
+      "notes": "Often useful for research-backed companies, less useful for generic SaaS."
+    },
+    {
+      "id": "google-news-health",
+      "name": "Google News Health",
+      "url": "https://news.google.com/rss/headlines/section/topic/HEALTH?hl=en-US&gl=US&ceid=US:en",
+      "beats": ["health", "healthcare", "biotech", "public health"],
+      "use_when": "Use for healthcare, digital health, biotech, pharma, and patient-safety profiles.",
+      "notes": "High brand-safety risk; tragedy/human-suffering blocks still apply."
+    },
+    {
+      "id": "google-news-us",
+      "name": "Google News U.S.",
+      "url": "https://news.google.com/rss/headlines/section/topic/NATION?hl=en-US&gl=US&ceid=US:en",
+      "beats": ["us", "policy", "regulation", "courts"],
+      "use_when": "Use when U.S. policy, courts, or national regulation are central to the client.",
+      "notes": "Broad and noisy. Avoid unless the client has policy standing."
+    },
+    {
+      "id": "google-news-world",
+      "name": "Google News World",
+      "url": "https://news.google.com/rss/headlines/section/topic/WORLD?hl=en-US&gl=US&ceid=US:en",
+      "beats": ["world", "geopolitics", "international business", "supply chain"],
+      "use_when": "Use for geopolitics, supply chain, global markets, and international policy profiles.",
+      "notes": "High tragedy/war exposure. Use only with direct public-interest standing."
+    },
+    {
+      "id": "memeorandum",
+      "name": "Memeorandum",
+      "url": "https://www.memeorandum.com/feed.xml",
+      "beats": ["politics", "policy", "media", "washington"],
+      "use_when": "Use for U.S. politics, public affairs, policy, and media-politics profiles.",
+      "notes": "Politically sensitive. Usually not appropriate for normal company newsjacking."
+    },
+    {
+      "id": "mediagazer",
+      "name": "Mediagazer",
+      "url": "https://www.mediagazer.com/feed.xml",
+      "beats": ["media", "publishing", "journalism", "creator economy"],
+      "use_when": "Use for media, journalism, publishing, creator, and ad-tech profiles.",
+      "notes": "Good companion to Techmeme for media-industry clients."
+    },
+    {
+      "id": "ftc-press",
+      "name": "FTC Press Releases",
+      "url": "https://www.ftc.gov/news-events/news/press-releases/rss.xml",
+      "beats": ["consumer protection", "antitrust", "privacy", "ai regulation", "advertising"],
+      "use_when": "Use for privacy, consumer protection, advertising claims, antitrust, and AI compliance profiles.",
+      "notes": "Official source; slow but high authority."
+    },
+    {
+      "id": "sec-press",
+      "name": "SEC Press Releases",
+      "url": "https://www.sec.gov/news/pressreleases.rss",
+      "beats": ["finance", "public companies", "securities", "crypto", "compliance"],
+      "use_when": "Use for fintech, crypto, public-company compliance, and investor-market profiles.",
+      "notes": "Official source; useful for same-day expert commentary."
+    },
+    {
+      "id": "govuk-news",
+      "name": "GOV.UK News",
+      "url": "https://www.gov.uk/search/news-and-communications.atom",
+      "beats": ["uk", "policy", "regulation", "property", "business"],
+      "use_when": "Use for UK policy, regulated industries, property, and public-sector profiles.",
+      "notes": "Broad official UK feed; requires topic filtering by the skill."
+    }
+  ]
+}