npm - understanding-prime-env - Versions diffs - 0.1.6 → 0.1.8 - Mend

understanding-prime-env 0.1.6 → 0.1.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (2) hide show

package/package.json +1 -1
package/skills/understand-prime-env/SKILL.md +213 -216

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "understanding-prime-env",
-  "version": "0.1.6",
+  "version": "0.1.8",
   "description": "Generate a rich, self-contained HTML report explaining any Prime Intellect verifiers environment.",
   "keywords": [
     "prime-intellect",

package/skills/understand-prime-env/SKILL.md CHANGED Viewed

@@ -1,328 +1,325 @@
 ---
 name: understand-prime-env
-description: Generate a rich, self-contained HTML report that fully explains a Prime Intellect verifiers environment. Use this skill any time the user asks to understand, explain, document, visualize, or explore a verifiers environment — even if they just say "what does this environment do?", "explain this env", "give me an overview", or "generate an HTML for this environment". The skill reads the Python source files in the current directory, extracts the raw dataset, reward functions, and rollout logic, and writes a visually stunning gamified HTML file to the environment folder.
+description: Generate a rich, self-contained HTML report that fully explains a Prime Intellect verifiers environment. Use this skill any time the user asks to understand, explain, document, visualize, or explore a verifiers environment — even if they just say "what does this environment do?", "explain this env", "give me an overview", or "generate an HTML for this environment". The skill reads the Python source files in the current directory, extracts the dataset, reward functions, and rollout logic, and writes a visually stunning infographic-style HTML file to the environment folder.
 ---
 # Understand Prime Environment
 ## Goal
-Produce a single self-contained HTML file (`environment_overview.html`) that gives a researcher — someone who already knows RL and verifiers but has never seen *this* environment — a complete deep understanding in one page. The output should make them stop and say "whoa."
+Produce a single self-contained `environment_overview.html`. An ML researcher opens it and **gets the full picture in under 10 seconds** — no reading required. The design is infographic-first: diagrams, flow charts, and big numbers dominate. Text exists only to label what the visuals show. Tapping any section slides open a detail drawer with the full technical depth.
-The page has **4 cards**, rendered in a dark gamified UI. Each card is a deep-dive, not a summary.
+The experience has two layers:
+1. **Scan layer** — the full page, visible immediately. Every section is a visual unit: a flow diagram, a metric cluster, a reward breakdown chart. Labels are short. Numbers are big. The researcher understands the environment without reading a word.
+2. **Drill layer** — tap any section → a smooth panel slides in from the right with complete technical detail: exact field names, regex patterns, formula, full example row.
 ---
 ## Step 1 — Read the source
-Read **every `.py` file** in the current directory. Also read `pyproject.toml` and `README.md` if they exist. Do not skip helper files — reward logic is often split across modules (e.g. `*_checks.py`, `*_prompts.py`). Read all of them before writing a single line of HTML.
+Read **every `.py` file** in the current directory. Also read `pyproject.toml` and `README.md` if present. Do not skip helper files — reward logic is often split across modules (`*_checks.py`, `*_prompts.py`, etc.). Read everything before writing a single line of HTML.
-Extract exactly four things:
+Extract the following. Be precise — do not invent values:
-### A. Environment identity
-- The environment's name, one-paragraph description of what task it trains a model to do
-- The GitHub repo URL if present in any source file or README (e.g. `https://github.com/PrimeIntellect-ai/verifiers`)
-- 3–5 quick stats: e.g. dataset size, number of reward functions, number of turns, task type
+### A — Identity
+- Environment name
+- One-line task description (what skill does this train?)
+- GitHub URL only if found verbatim in source or README — otherwise omit entirely
+- 3–5 key stats: dataset size, number of rewards, number of turns, task type, etc.
-### B. Raw dataset — the input data itself
-- Where does the data come from? (HuggingFace dataset name + split, a hardcoded `PROMPTS` list, a generator function, etc.)
-- What fields does a single row have? List every field name and its type/purpose
-- Show **one complete real example row** — every field, real values, not truncated. If real data is not available locally, synthesize one example that is indistinguishable from a real row (match field names, value formats, constraints exactly)
+### B — Dataset
+- Source: HuggingFace dataset + split, hardcoded list, or generator — one line
+- Every field: name, type, purpose
+- One complete real example row — all fields, real values, nothing truncated
-### C. Reward functions — the actual logic
-For each reward function (`@vf.reward`, functions passed to `Rubric`, reward methods on `Taskset`):
-- Its name
-- What it is actually checking — not a summary, the real logic: what string patterns, what conditions, what regex, what comparisons
-- Exactly what makes it return **0** (failure) vs **1** (full score) — and any partial scores in between
-- Any thresholds, edge cases, or gotchas a model writer would need to know
-- If a judge LLM is used: the model name, what the judge prompt asks, and what it returns
+### C — Rewards
+- Every reward function: name, what it checks, what earns 0 vs 1 (and any partials), any thresholds or regex
+- If rewards combine: the exact formula
-If multiple rewards combine into a final score, extract the exact formula.
-### D. Theoretical rollout — what would happen
-Write a step-by-step narrative trace of what would happen if you ran one example end-to-end through this environment:
-1. How the raw dataset row gets transformed into the actual prompt the model sees (system prompt + user message, any templating)
-2. What the model is expected to produce (format, length, structure)
-3. If there are tools or a sandbox: what tools, how they'd be called
-4. How each reward function gets called on the model output — in what order, with what inputs
-5. How the final score is computed from the individual reward outputs
-6. What a **perfect response** looks like vs a **zero-score response**
-Be specific. Trace through the actual code logic. This is theoretical (not executed) but must be grounded in what the code actually does.
+### D — Rollout
+- How raw row → prompt (exact template if present)
+- What the model is expected to output
+- How each reward fires on a sample output
+- How final score is computed
+- What a perfect response looks like vs a zero-score response
 ---
 ## Step 2 — Generate the HTML
-Write a single **self-contained** HTML file to `./environment_overview.html`. No external CDN dependencies — all CSS, JS, and assets inline.
+Write a single **self-contained** HTML file. Zero external dependencies — all CSS and JS inline. No framework. No CDN.
 ---
-### Visual Direction: Dark Gamified Research Dashboard
+### Layout
-The aesthetic is a **dark, glowing, gamified dashboard** — like a game HUD for researchers. Think deep space + neon. Each card has its own accent color and glows on hover. The reader should feel like they're exploring a world, not reading a doc.
+Full-page dark canvas. A single centered column, `max-width: 760px`, generous vertical padding. No sidebar. No nav. No tabs.
+The page has exactly **four visual sections**, stacked vertically, each separated by `60px` of breathing room:
-**Color palette:**
 ```
-Page background:   #080b14  (near-black, blue-tinted)
-Card background:   #0e1420  (dark navy)
-Card border:       1.5px gradient border (unique per card)
-Card 1 accent — purple:   #a855f7  glow: rgba(168,85,247,0.25)
-Card 2 accent — cyan:     #22d3ee  glow: rgba(34,211,238,0.25)
-Card 3 accent — amber:    #f59e0b  glow: rgba(245,158,11,0.25)
-Card 4 accent — rose:     #f43f5e  glow: rgba(244,63,94,0.25)
-Text primary:   #f1f5f9
-Text secondary: #94a3b8
-Text muted:     #475569
-Code text:      #e2e8f0
+[HEADER]          — name, one-line description, stat chips, GitHub pill
+[DATASET]         — schema diagram + tap to see full example row
+[REWARDS]         — horizontal bar chart of reward functions + tap for detail
+[ROLLOUT]         — horizontal flow diagram → tap any node for step detail
 ```
-**Gradient borders** — each card has a glowing gradient border using this trick:
+Each section is a self-contained card. Each card has a **tap target** — the whole card or a labeled "See details →" affordance — that opens a detail drawer.
+---
+### Visual Design
+**Background:**
 ```css
-.card {
-  position: relative;
-  background: #0e1420;
-  border-radius: 16px;
+body {
+  background: #080b12;
+  background-image:
+    radial-gradient(ellipse at 15% 0%, rgba(99,102,241,0.07) 0%, transparent 45%),
+    radial-gradient(ellipse at 85% 100%, rgba(20,184,166,0.05) 0%, transparent 45%);
+  min-height: 100vh;
+  font-family: 'SF Mono', 'Fira Code', ui-monospace, monospace;
+  color: #e2e8f0;
 }
-.card::before {
-  content: '';
-  position: absolute;
-  inset: -1.5px;
-  border-radius: 17px;
-  background: linear-gradient(135deg, var(--card-accent), transparent 60%);
-  z-index: -1;
+```
+**Card base:**
+```css
+.section-card {
+  background: #0d1117;
+  border: 1px solid rgba(255,255,255,0.07);
+  border-radius: 16px;
+  padding: 28px 32px;
+  cursor: pointer;
+  transition: border-color 0.2s ease;
 }
+.section-card:hover { border-color: rgba(255,255,255,0.15); }
 ```
-On hover: `box-shadow: 0 0 40px var(--card-glow), 0 0 80px rgba(var(--card-glow), 0.3)` + `transform: translateY(-4px)`. Transition: `0.3s cubic-bezier(0.34, 1.56, 0.64, 1)` (slight spring).
+**Section accent colors** — used for borders, labels, highlights, chart fills:
+```
+Header / Identity:  #6366f1  (indigo)
+Dataset:            #14b8a6  (teal)
+Rewards:            #f59e0b  (amber)
+Rollout:            #f43f5e  (rose)
+```
 **Typography:**
-```css
-font-family: 'SF Pro Display', -apple-system, 'Helvetica Neue', sans-serif;  /* body */
-font-family: ui-monospace, 'Cascadia Code', 'Fira Code', monospace;           /* code */
-```
+- Section label: `0.65rem`, accent color, `letter-spacing: 0.12em`, uppercase, `font-weight: 500`
+- Section title: `1.1rem`, white, `font-weight: 700`
+- Body / labels: `0.8rem`, `#64748b`
+- Big numbers / diagram nodes: `1.6–2.4rem`, white or accent, `font-weight: 800`
 ---
-### Page Layout
+### Section 1 — Header
 ```
-┌─────────────────────────────────────────────────────────────┐
-│  HERO: env name (large, glowing) + tagline + PI logo badge  │
-├──────────────┬──────────────┬──────────────┬────────────────┤
-│  CARD 1      │  CARD 2      │  CARD 3      │  CARD 4        │
-│  🌐 Env      │  📦 Dataset  │  ⚡ Rewards  │  🎯 Rollout    │
-│  purple      │  cyan        │  amber       │  rose          │
-├──────────────┴──────────────┴──────────────┴────────────────┤
-│  FOOTER: generated timestamp · PI branding                  │
-└─────────────────────────────────────────────────────────────┘
+┌─────────────────────────────────────────────────────┐
+│  PRIME INTELLECT ENVIRONMENT                        │
+│  EnvironmentName                    [↗ GitHub]      │
+│  One-line task description                          │
+│  ─────────────────────────────────────────────────  │
+│  [42k rows]  [3 rewards]  [2 turns]  [Math QA]     │
+└─────────────────────────────────────────────────────┘
 ```
-On screens < 1200px: 2-column grid. On mobile: single column.
+- Env name: `2rem`, `font-weight: 800`, white. Monospace.
+- Description: `0.85rem`, `#94a3b8`, `line-height: 1.5`. Below the name.
+- GitHub pill: only if URL was found. `background: rgba(99,102,241,0.1)`, `border: 1px solid rgba(99,102,241,0.3)`, `color: #a5b4fc`, `border-radius: 99px`, `padding: 4px 12px`, `font-size: 0.73rem`. Hover: border and color shift to solid indigo.
+- Stat chips: row of 3–5 pills. `background: rgba(99,102,241,0.08)`, `border: 1px solid rgba(99,102,241,0.18)`, `color: #a5b4fc`, `border-radius: 6px`, `padding: 5px 12px`, `font-size: 0.75rem`, `font-weight: 600`. Label on top in `0.6rem` muted caps, value below in `0.85rem` white.
-Cards are tall enough to show all content — the page CAN scroll, but each card is self-contained.
+No drawer for this section. Static.
 ---
-### Hero Section
+### Section 2 — Dataset (tappable)
-Full-width header above the cards:
-- Large environment name: `font-size: clamp(2rem, 5vw, 4rem)`, `font-weight: 800`, white with a subtle purple text-shadow glow: `text-shadow: 0 0 40px rgba(168,85,247,0.4)`
-- Below: one sentence tagline in `#94a3b8`
-- Top-right: a `⬡ PRIME INTELLECT` badge — `background: rgba(168,85,247,0.1)`, `border: 1px solid rgba(168,85,247,0.3)`, `color: #a855f7`, `border-radius: 6px`, `padding: 4px 12px`, `font-size: 0.7rem`, `letter-spacing: 0.12em`
-- Background: `radial-gradient(ellipse at 30% 0%, rgba(168,85,247,0.12) 0%, transparent 60%), radial-gradient(ellipse at 80% 100%, rgba(34,211,238,0.06) 0%, transparent 50%)`
-- A thin `border-bottom: 1px solid #1e293b` separates the hero from the cards
+**Scan face** — a visual schema diagram:
-On page load: env name fades + slides up 16px (`animation: heroIn 0.6s ease-out`). Tagline follows 150ms later.
+```
+SOURCE ──────────────────────────────────────────────
+  HuggingFace · openai/gsm8k · train split
----
+FIELDS ──────────────────────────────────────────────
+  [question]──str──────────[answer]──str
+              [level]──int
+              [subject]──str
+```
-### Card 1 — Environment `(purple)`
+Render fields as connected pills in a small horizontal/vertical node graph. Each pill: `background: rgba(20,184,166,0.08)`, `border: 1px solid rgba(20,184,166,0.2)`, `border-radius: 6px`, `padding: 4px 10px`. Field name in teal monospace `0.75rem`, type in muted `0.65rem`. Connect them with SVG lines (stroke `rgba(20,184,166,0.2)`, stroke-width 1).
-**Header:** `🌐  Environment` in bold white, `font-size: 0.7rem` `ENVIRONMENT` label in purple caps above it.
+At the bottom of the card, a muted "See example row →" in `0.72rem` teal.
-**Body:**
-- A paragraph of prose describing what task this environment trains a model to do — written in plain English, no jargon beyond what the researcher already knows
-- GitHub link (if found): a pill button — `background: rgba(168,85,247,0.1)`, `border: 1px solid rgba(168,85,247,0.3)`, hover brightens, shows `↗` arrow. If no GitHub link found, omit this element entirely.
-- Stat chips row at the bottom: 3–5 pill badges (e.g. `64 prompts`, `2 rewards`, `single-turn`, `math reasoning`). Each chip: `background: rgba(168,85,247,0.08)`, `border: 1px solid rgba(168,85,247,0.2)`, `color: #c4b5fd`, `border-radius: 99px`, `padding: 3px 10px`, `font-size: 0.72rem`
+**Detail drawer content** (slides in on tap — see Drawer spec below):
+- `EXAMPLE ROW` label
+- Every field displayed as: field name (teal monospace) + value (white). Long text in a soft box — `background: rgba(255,255,255,0.03)`, `border-radius: 6px`, `padding: 8px 12px`. Nothing truncated.
+- `FIELD GUIDE` label, then each field one per line: name · type · purpose sentence.
 ---
-### Card 2 — Dataset `(cyan)`
+### Section 3 — Rewards (tappable)
-**Header:** `📦  Dataset` label.
+**Scan face** — a horizontal stacked bar chart:
-**Body — three subsections, stacked:**
+Each reward function gets one horizontal bar. The bar represents its contribution to the final score (equal weight if not specified). Left side: reward name in amber monospace `0.8rem`. Right side: bar — `height: 8px`, `border-radius: 4px`, `background: linear-gradient(90deg, #f59e0b, #fbbf24)`. Below the bar: one-phrase description in `0.65rem` muted text.
-**① Source** — one line, monospace: where the data comes from. E.g.:
-```
-HuggingFace  ·  openai/gsm8k  ·  train split
-```
-or
-```
-Hardcoded  ·  PROMPTS list  ·  64 examples
-```
-Style: `background: rgba(34,211,238,0.05)`, `border-left: 3px solid #22d3ee`, `padding: 8px 14px`, `border-radius: 0 6px 6px 0`, monospace, cyan text.
+If there's a final score formula, show it below the bars in a formula chip:
+`background: rgba(245,158,11,0.08)`, `border: 1px solid rgba(245,158,11,0.2)`, `border-radius: 8px`, `padding: 8px 14px`, amber monospace `0.82rem`.
+At the bottom: "See reward logic →" in `0.72rem` amber.
-**② Field anatomy** — a compact table showing every field in a data row:
+**Detail drawer content:**
-| Field | Type | Description |
-|-------|------|-------------|
+For each reward, a block:
+```
+format_reward                        [float 0–1]
+─────────────────────────────────────────────────
+Checks response contains <answer>…</answer>
-Table style: no outer border, alternating row backgrounds `rgba(34,211,238,0.03)` / transparent, header row in cyan `0.6rem` caps. Text `0.82rem`.
+  ✗  0   Tags absent, or inner content non-numeric
+  ✓  1   Tags present, inner content is valid integer
-**③ Example row** — the most important part. Show one complete real (or synthesized) example row. Render it as a structured display, NOT a raw JSON dump:
-- Each field on its own line: field name in cyan monospace, value in white
-- Long text values (like prompt content) get a soft box: `background: rgba(255,255,255,0.03)`, `border: 1px solid #1e293b`, `border-radius: 6px`, `padding: 10px 14px`, `font-size: 0.82rem`, full content shown (no truncation)
+  Pattern: <answer>(\d+)</answer>
+```
+- Name: amber monospace, `font-weight: 700`, `0.88rem`
+- `[float 0–1]` badge: `0.65rem`, `#6b7280`, right-aligned via flex
+- Description: `0.8rem`, `#94a3b8`
+- ✗ / ✓ lines: `0.78rem`. ✗ in `#f87171`, ✓ in `#4ade80`, text in `#94a3b8`
+- Pattern/threshold: monospace, `0.75rem`, `#64748b`
+- Block: `background: rgba(245,158,11,0.04)`, `border: 1px solid rgba(245,158,11,0.1)`, `border-radius: 10px`, `padding: 12px 14px`, `margin-bottom: 10px`
 ---
-### Card 3 — Rewards `(amber)`
+### Section 4 — Rollout (tappable)
-**Header:** `⚡  Rewards` label.
-**Body:** For each reward function, a reward block:
+**Scan face** — a horizontal pipeline flow diagram:
 ```
-┌─────────────────────────────────────────────────┐
-│  format_reward                          [float]  │
-│  ─────────────────────────────────────────────  │
-│  CHECKS                                          │
-│  Checks that response contains <answer>...</     │
-│  answer> tags and inner content is numeric       │
-│                                                  │
-│  SCORES 0  if tags missing or content non-num    │
-│  SCORES 1  if tags present and content is int    │
-└─────────────────────────────────────────────────┘
+[DATA ROW] ──▶ [PROMPT] ──▶ [MODEL] ──▶ [REWARDS] ──▶ [SCORE]
 ```
-- Function name: `font-family: monospace`, amber color, `font-size: 0.9rem`, `font-weight: 600`
-- `[float]` / `[int]` / `[bool]` type badge: small, muted, right-aligned
-- `CHECKS` / `SCORES 0` / `SCORES 1` labels: `0.65rem`, letter-spacing `0.1em`, amber at 60% opacity
-- Actual descriptions: white text, `0.83rem`, leading `1.5`
-- Block background: `rgba(245,158,11,0.04)`, `border: 1px solid rgba(245,158,11,0.15)`, `border-radius: 10px`, `padding: 14px 16px`
-- Blocks separated by `12px` gap
+Each node: rounded rect, `background: rgba(244,63,94,0.08)`, `border: 1px solid rgba(244,63,94,0.2)`, `border-radius: 10px`, `padding: 10px 16px`. Node label: rose monospace `0.8rem` bold. Below each node: 1 short phrase (≤6 words) in `0.65rem` muted. Arrows: SVG `▶` in `rgba(244,63,94,0.4)`.
-If there is a composite formula, show it after all blocks in a prominent callout:
-```
-background: rgba(245,158,11,0.08)
-border: 1px solid rgba(245,158,11,0.3)
-border-radius: 8px
-padding: 14px 18px
-font-family: monospace
-color: #fcd34d
-font-size: 0.9rem
-```
+On narrow viewports, collapse to vertical with connecting arrows below each node.
----
+At the bottom: "Trace an example →" in `0.72rem` rose.
-### Card 4 — Rollout `(rose)`
+**Detail drawer content:**
-**Header:** `🎯  Rollout` label.
+5 numbered steps. Each:
+- Step number: `2rem`, `font-weight: 800`, rose, `opacity: 0.2`
+- Step title: `0.9rem`, white, `font-weight: 700`
+- Description: `0.82rem`, `#94a3b8`, `line-height: 1.6`
+- Left border: `border-left: 2px solid rgba(244,63,94,0.15)`, `padding-left: 16px`, `margin-left: 10px` (omit on last step)
+- `margin-bottom: 20px`
-**Body:** A numbered step-by-step trace. Each step:
+Steps are always:
+1. **Data → Prompt** — how the raw row becomes the exact prompt the model sees (include template if found)
+2. **Model Response** — what the model is expected to produce (format, tags, structure)
+3. **Reward Evaluation** — how each reward fires on a sample output; show scores for a real example
+4. **Score Computation** — the exact formula and resulting score
+5. **Perfect vs Zero** — what earns a full score vs what earns zero; concrete contrasting examples
-```
-  ① Data → Prompt
-  ──────────────────────────────────────────
-  The raw row's `problem` field is inserted
-  into a system prompt: "Solve the following
-  math problem step by step..." followed by
-  the problem text as the user message.
+---
+### Detail Drawer
+A panel that slides in from the **right edge** of the viewport, overlaying the page.
+```css
+.drawer {
+  position: fixed;
+  top: 0; right: 0;
+  width: min(480px, 95vw);
+  height: 100vh;
+  background: #0d1117;
+  border-left: 1px solid rgba(255,255,255,0.1);
+  padding: 32px 28px;
+  overflow-y: auto;
+  transform: translateX(100%);
+  transition: transform 0.35s cubic-bezier(0.4, 0, 0.2, 1);
+  z-index: 100;
+}
+.drawer.open { transform: translateX(0); }
 ```
-- Step number: large, `font-size: 1.4rem`, `font-weight: 800`, rose color, `opacity: 0.4`, positioned to the left
-- Step title: `font-weight: 700`, white, `font-size: 0.9rem`
-- A `1px solid rgba(244,63,94,0.15)` rule below the title
-- Description: `0.83rem`, `#94a3b8`, `line-height: 1.6`
-- Between steps: a rose-tinted connector line on the left side (`border-left: 2px solid rgba(244,63,94,0.15)`, `margin-left: 10px`, `padding-left: 20px`)
+**Backdrop:** `position: fixed; inset: 0; background: rgba(0,0,0,0.5); z-index: 99` — fades in with `opacity 0.3s`. Clicking it closes the drawer.
-Steps to always include (adapt to what's in the code):
-1. **Data → Prompt** — how the raw row becomes the model's input
-2. **Model response** — what the model is expected to produce (format, structure)
-3. **Reward evaluation** — how each reward function is called and what it receives
-4. **Score computation** — how the final score is derived
-5. **Perfect vs zero** — what a max-score response looks like vs a zero-score response (concrete examples if possible)
+**Close button:** `×` in the top-right of the drawer. `font-size: 1.1rem`, `color: #475569`, hover rose. Keyboard: `Escape` closes.
----
+**Drawer header:**
+- Section label (e.g., `DATASET DETAIL`) in accent color, `0.65rem` caps
+- Section name in white `1.1rem` bold
+- Thin `border-bottom: 1px solid rgba(255,255,255,0.07)`, `padding-bottom: 16px`, `margin-bottom: 20px`
+Scroll within the drawer. The rest of the page does not scroll while drawer is open (`body { overflow: hidden }`).
-### Entrance Animations
+---
-All guarded by `@media (prefers-reduced-motion: reduce) { *, *::before, *::after { animation: none !important; } }`.
+### Motion
 ```css
-@keyframes heroIn {
-  from { opacity: 0; transform: translateY(16px); }
+@keyframes fadeSlideIn {
+  from { opacity: 0; transform: translateY(12px); }
   to   { opacity: 1; transform: translateY(0); }
 }
-@keyframes cardIn {
-  from { opacity: 0; transform: translateY(24px) scale(0.98); }
-  to   { opacity: 1; transform: translateY(0) scale(1); }
+.section-card {
+  animation: fadeSlideIn 0.4s ease both;
 }
+/* Stagger via animation-delay: 0s, 0.08s, 0.16s, 0.24s on the four sections */
 ```
-Cards animate in with staggered delay: `animation-delay: 0.1s, 0.2s, 0.3s, 0.4s` for cards 1–4.
----
-### JavaScript (inline, vanilla)
-```js
-// Card hover glow
-document.querySelectorAll('.card').forEach(card => {
-  card.addEventListener('mousemove', (e) => {
-    const rect = card.getBoundingClientRect();
-    const x = ((e.clientX - rect.left) / rect.width) * 100;
-    const y = ((e.clientY - rect.top) / rect.height) * 100;
-    card.style.setProperty('--mouse-x', x + '%');
-    card.style.setProperty('--mouse-y', y + '%');
-  });
-});
+Drawer slide uses CSS transition only (no keyframes). Section cards lift slightly on hover:
+```css
+.section-card:hover { transform: translateY(-2px); box-shadow: 0 8px 32px rgba(0,0,0,0.4); }
 ```
-Add a radial gradient spotlight that follows the mouse inside each card:
+Reduced motion guard:
 ```css
-.card::after {
-  content: '';
-  position: absolute;
-  inset: 0;
-  border-radius: 16px;
-  background: radial-gradient(
-    circle at var(--mouse-x, 50%) var(--mouse-y, 50%),
-    rgba(255,255,255,0.03) 0%,
-    transparent 60%
-  );
-  pointer-events: none;
+@media (prefers-reduced-motion: reduce) {
+  *, *::before, *::after { animation: none !important; transition-duration: 0.01ms !important; }
 }
 ```
 ---
-### Footer
+### Page Chrome
+**Top of page** — above the first card:
+```
+PRIME INTELLECT · ENVIRONMENT OVERVIEW          [environment-name]
+```
+`font-size: 0.68rem`, `color: #1e293b`, `letter-spacing: 0.1em`, `margin-bottom: 40px`. Nothing else at the top.
+**Bottom of page** — below the last card:
 ```
-Generated by Claude  ·  Prime Intellect Verifiers  ·  <ISO timestamp>
+Generated by Claude · <timestamp>
 ```
+`font-size: 0.65rem`, `color: #1e293b`, `margin-top: 48px`, centered.
-Centered, `color: #334155`, `font-size: 0.72rem`. `border-top: 1px solid #1e293b`, `padding: 24px`. No links, no extra content.
+Nothing else. No nav, no header, no sidebar.
 ---
 ## Step 3 — Confirm and report
-After writing the file, tell the user:
-- The full path and `open environment_overview.html` command
-- Two sentences: what the environment trains and how it scores
+After writing the file, share the full path and say:
+- What the environment trains (one sentence)
+- How it scores (one sentence)
-## Anti-patterns
+---
-- Do not produce a surface-level summary — every section must contain the actual logic from the code
-- Do not hallucinate reward weights, field names, dataset contents, or GitHub URLs not found in the source
-- Do not skip helper modules — they often contain the core reward logic
-- Do not truncate the example row — show every field in full
-- Do not use light theme — this is dark-only
-- Do not add tabs, collapsible sections, score bars, or copy buttons — the 4-card layout is the whole structure
-- Do not use Inter, Roboto, or any Google Font
-- If a GitHub URL is not found in the source, omit the GitHub button entirely — never invent a URL
+## Anti-patterns — never do these
+- **Do not write walls of text on the scan face.** Every section face is a diagram or a chart. Text labels only.
+- **Do not truncate the example row.** Full values, all fields, in the drawer.
+- **Do not invent a GitHub URL.** Only include it if found verbatim in source.
+- **Do not hallucinate field names, reward weights, or dataset content.** Extract exactly.
+- **Do not skip helper modules.** Core reward logic is often there.
+- **Do not use a light theme.**
+- **Do not use Inter, Roboto, or any Google Font.**
+- **Do not add tabs, nav, or scroll-within-cards.** The drawer is the only overlay.
+- **Do not add more than four sections.** Header + Dataset + Rewards + Rollout. That's it.
+- **Do not use a card-stack / deck reveal mechanic.** This is a scrollable single-column page.