npm - understanding-prime-env - Versions diffs - 0.1.7 → 0.1.9 - Mend

understanding-prime-env 0.1.7 → 0.1.9

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3) hide show

package/package.json +1 -1
package/skills/understand-prime-env/SKILL.md +204 -169
package/skills/understand-environment/SKILL.md +0 -494

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "understanding-prime-env",
-  "version": "0.1.7",
+  "version": "0.1.9",
   "description": "Generate a rich, self-contained HTML report explaining any Prime Intellect verifiers environment.",
   "keywords": [
     "prime-intellect",

package/skills/understand-prime-env/SKILL.md CHANGED Viewed

@@ -1,262 +1,285 @@
 ---
 name: understand-prime-env
-description: Generate a rich, self-contained HTML report that fully explains a Prime Intellect verifiers environment. Use this skill any time the user asks to understand, explain, document, visualize, or explore a verifiers environment — even if they just say "what does this environment do?", "explain this env", "give me an overview", or "generate an HTML for this environment". The skill reads the Python source files in the current directory, extracts the raw dataset, reward functions, and rollout logic, and writes a visually stunning gamified HTML file to the environment folder.
+description: Generate a rich, self-contained HTML report that fully explains a Prime Intellect verifiers environment. Use this skill any time the user asks to understand, explain, document, visualize, or explore a verifiers environment — even if they just say "what does this environment do?", "explain this env", "give me an overview", or "generate an HTML for this environment". The skill reads the Python source files in the current directory, extracts the dataset, reward functions, and rollout logic, and writes a visually stunning infographic-style HTML file to the environment folder.
 ---
 # Understand Prime Environment
 ## Goal
-Produce a single self-contained HTML file (`environment_overview.html`). A researcher opens it and sees a **stack of 4 cards** — like a physical deck — each one peeking out behind the one in front. They click through the deck, one card at a time, in a satisfying progressive reveal. Each card is one chapter of the story. The whole experience should feel like flipping through a beautifully designed research brief.
+Produce a single self-contained `environment_overview.html`. An ML researcher opens it and **gets the full picture in under 10 seconds** — no reading required. The design is infographic-first: diagrams, flow charts, and big numbers dominate. Text exists only to label what the visuals show. Tapping any section slides open a detail drawer with the full technical depth.
+The experience has two layers:
+1. **Scan layer** — the full page, visible immediately. Every section is a visual unit: a flow diagram, a metric cluster, a reward breakdown chart. Labels are short. Numbers are big. The researcher understands the environment without reading a word.
+2. **Drill layer** — tap any section → a smooth panel slides in from the right with complete technical detail: exact field names, regex patterns, formula, full example row.
 ---
 ## Step 1 — Read the source
-Read **every `.py` file** in the current directory. Also read `pyproject.toml` and `README.md` if they exist. Do not skip helper files — reward logic is often split across modules (e.g. `*_checks.py`, `*_prompts.py`). Read everything before writing a single line of HTML.
+Read **every `.py` file** in the current directory. Also read `pyproject.toml` and `README.md` if present. Do not skip helper files — reward logic is often split across modules (`*_checks.py`, `*_prompts.py`, etc.). Read everything before writing a single line of HTML.
-Extract exactly four things:
+Extract the following. Be precise — do not invent values:
-### Card 1 — Environment
-- Name, and one punchy paragraph (3–4 sentences) describing what task this trains a model to do
-- GitHub URL if found anywhere in source or README — if not found, omit entirely
-- 3–5 stat chips: dataset size, reward count, turn count, task type, etc.
+### A — Identity
+- Environment name
+- One-line task description (what skill does this train?)
+- GitHub URL only if found verbatim in source or README — otherwise omit entirely
+- 3–5 key stats: dataset size, number of rewards, number of turns, task type, etc.
-### Card 2 — Dataset
-- Where the data comes from: HuggingFace dataset name + split, hardcoded list, generator, etc. — one line
-- Every field in a data row: name, type, purpose
-- One complete example row with every field shown in full — real values if available, otherwise synthesize one that is indistinguishable from real (exact field names, value formats, constraints)
+### B — Dataset
+- Source: HuggingFace dataset + split, hardcoded list, or generator — one line
+- Every field: name, type, purpose
+- One complete real example row — all fields, real values, nothing truncated
-### Card 3 — Rewards
-- Every reward function: name, exactly what it checks, precisely what makes it score 0 vs 1 (and any partial values)
-- Any thresholds, regex patterns, or edge cases a model writer needs to know
-- If rewards combine into a final score: the exact formula
+### C — Rewards
+- Every reward function: name, what it checks, what earns 0 vs 1 (and any partials), any thresholds or regex
+- If rewards combine: the exact formula
-### Card 4 — Rollout
-- Step-by-step theoretical trace of one example running end-to-end:
-  1. How the raw row becomes the prompt the model sees
-  2. What the model is expected to produce
-  3. How each reward fires on the output
-  4. How the final score is computed
-  5. What a perfect response looks like vs a zero-score response
+### D — Rollout
+- How raw row → prompt (exact template if present)
+- What the model is expected to output
+- How each reward fires on a sample output
+- How final score is computed
+- What a perfect response looks like vs a zero-score response
 ---
 ## Step 2 — Generate the HTML
-Write a single **self-contained** HTML file to `./environment_overview.html`. Zero external dependencies — all CSS and JS inline.
+Write a single **self-contained** HTML file. Zero external dependencies — all CSS and JS inline. No framework. No CDN.
 ---
-### The Core Mechanic — Card Stack Reveal
+### Layout
+Full-page dark canvas. A single centered column, `max-width: 760px`, generous vertical padding. No sidebar. No nav. No tabs.
-The entire UI is a **centered card deck**. All four cards occupy the same position. The active card is front and center at full size. Cards behind it peek out — each one slightly smaller, slightly lower, slightly darker — giving the illusion of a physical stack.
+The page has exactly **four visual sections**, stacked vertically, each separated by `60px` of breathing room:
 ```
-         ░░░░░░░░░░░░░░░░  ← Card 4 (furthest back, barely visible)
-       ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒  ← Card 3
-     ▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓  ← Card 2
-   ██████████████████████████  ← Card 1 (active, full size, full opacity)
+[HEADER]          — name, one-line description, stat chips, GitHub pill
+[DATASET]         — schema diagram + tap to see full example row
+[REWARDS]         — horizontal bar chart of reward functions + tap for detail
+[ROLLOUT]         — horizontal flow diagram → tap any node for step detail
 ```
-Clicking anywhere on the active card — or the "Continue →" button — triggers the reveal: the active card flies out (slides left + slight rotation + fade), and the next card scales up to the front with a spring animation. A progress indicator shows position (● ● ○ ○).
-When card 4 is shown, "Continue →" becomes "Done ✓" and clicking it does nothing (or fades the stack out gracefully).
+Each section is a self-contained card. Each card has a **tap target** — the whole card or a labeled "See details →" affordance — that opens a detail drawer.
 ---
 ### Visual Design
-**Background:** Full-viewport dark canvas.
+**Background:**
 ```css
 body {
-  background: #07090f;
+  background: #080b12;
   background-image:
-    radial-gradient(ellipse at 20% 20%, rgba(168,85,247,0.06) 0%, transparent 50%),
-    radial-gradient(ellipse at 80% 80%, rgba(34,211,238,0.04) 0%, transparent 50%);
+    radial-gradient(ellipse at 15% 0%, rgba(99,102,241,0.07) 0%, transparent 45%),
+    radial-gradient(ellipse at 85% 100%, rgba(20,184,166,0.05) 0%, transparent 45%);
   min-height: 100vh;
-  display: flex;
-  flex-direction: column;
-  align-items: center;
-  justify-content: center;
+  font-family: 'SF Mono', 'Fira Code', ui-monospace, monospace;
+  color: #e2e8f0;
 }
 ```
 **Card base:**
 ```css
-.card {
-  position: absolute;
-  width: min(600px, 90vw);
+.section-card {
   background: #0d1117;
-  border-radius: 24px;
-  padding: 40px 44px 36px;
-  transform-origin: center bottom;
-  will-change: transform, opacity;
+  border: 1px solid rgba(255,255,255,0.07);
+  border-radius: 16px;
+  padding: 28px 32px;
+  cursor: pointer;
+  transition: border-color 0.2s ease;
 }
+.section-card:hover { border-color: rgba(255,255,255,0.15); }
 ```
-**Stack offset** (CSS, applied via `data-depth` attribute 0–3, 0 = active):
+**Section accent colors** — used for borders, labels, highlights, chart fills:
 ```
-depth 0: scale(1.00)   translateY(0px)    opacity: 1     (active)
-depth 1: scale(0.96)   translateY(18px)   opacity: 0.65  z-index: -1
-depth 2: scale(0.92)   translateY(36px)   opacity: 0.35  z-index: -2
-depth 3: scale(0.88)   translateY(54px)   opacity: 0.15  z-index: -3
+Header / Identity:  #6366f1  (indigo)
+Dataset:            #14b8a6  (teal)
+Rewards:            #f59e0b  (amber)
+Rollout:            #f43f5e  (rose)
 ```
-Each card has a unique accent. Apply via a CSS custom property `--accent` and `--glow` set on the card element itself. The gradient border and glow use this accent.
+**Typography:**
+- Section label: `0.65rem`, accent color, `letter-spacing: 0.12em`, uppercase, `font-weight: 500`
+- Section title: `1.1rem`, white, `font-weight: 700`
+- Body / labels: `0.8rem`, `#64748b`
+- Big numbers / diagram nodes: `1.6–2.4rem`, white or accent, `font-weight: 800`
+---
+### Section 1 — Header
 ```
-Card 1:  --accent: #a855f7   --glow: rgba(168,85,247,0.3)   (purple)
-Card 2:  --accent: #22d3ee   --glow: rgba(34,211,238,0.3)   (cyan)
-Card 3:  --accent: #f59e0b   --glow: rgba(245,158,11,0.3)   (amber)
-Card 4:  --accent: #f43f5e   --glow: rgba(244,63,94,0.3)    (rose)
+┌─────────────────────────────────────────────────────┐
+│  PRIME INTELLECT ENVIRONMENT                        │
+│  EnvironmentName                    [↗ GitHub]      │
+│  One-line task description                          │
+│  ─────────────────────────────────────────────────  │
+│  [42k rows]  [3 rewards]  [2 turns]  [Math QA]     │
+└─────────────────────────────────────────────────────┘
 ```
-**Gradient border** on the active card only:
-```css
-.card[data-depth="0"] {
-  box-shadow:
-    0 0 0 1.5px var(--accent),
-    0 0 60px var(--glow),
-    0 32px 80px rgba(0,0,0,0.6);
-}
-.card[data-depth="1"],
-.card[data-depth="2"],
-.card[data-depth="3"] {
-  box-shadow: 0 0 0 1px rgba(255,255,255,0.06);
-}
-```
+- Env name: `2rem`, `font-weight: 800`, white. Monospace.
+- Description: `0.85rem`, `#94a3b8`, `line-height: 1.5`. Below the name.
+- GitHub pill: only if URL was found. `background: rgba(99,102,241,0.1)`, `border: 1px solid rgba(99,102,241,0.3)`, `color: #a5b4fc`, `border-radius: 99px`, `padding: 4px 12px`, `font-size: 0.73rem`. Hover: border and color shift to solid indigo.
+- Stat chips: row of 3–5 pills. `background: rgba(99,102,241,0.08)`, `border: 1px solid rgba(99,102,241,0.18)`, `color: #a5b4fc`, `border-radius: 6px`, `padding: 5px 12px`, `font-size: 0.75rem`, `font-weight: 600`. Label on top in `0.6rem` muted caps, value below in `0.85rem` white.
-**Typography:**
-```css
-font-family: -apple-system, 'SF Pro Display', 'Helvetica Neue', sans-serif;
-font-family: ui-monospace, 'Cascadia Code', 'Fira Code', monospace; /* code only */
-```
+No drawer for this section. Static.
 ---
-### Card Content Specs
+### Section 2 — Dataset (tappable)
-Each card has the same shell structure:
+**Scan face** — a visual schema diagram:
 ```
-┌────────────────────────────────────────────┐
-│  LABEL (0.65rem, accent, caps, tracking)   │
-│  TITLE (1.6rem, 700, white)                │
-│  ─────────────────────────────────────     │
-│                                            │
-│  [BODY — unique per card, see below]       │
-│                                            │
-│  ────────────────────────────────────────  │
-│  [progress dots]    [Continue → button]    │
-└────────────────────────────────────────────┘
+SOURCE ──────────────────────────────────────────────
+  HuggingFace · openai/gsm8k · train split
+FIELDS ──────────────────────────────────────────────
+  [question]──str──────────[answer]──str
+              [level]──int
+              [subject]──str
 ```
-**Progress dots:** 4 dots, `width: 7px height: 7px border-radius: 50%`. Active dot: accent color, `width: 20px border-radius: 4px` (pill). Inactive: `rgba(255,255,255,0.15)`. Transition: `width 0.3s ease`.
+Render fields as connected pills in a small horizontal/vertical node graph. Each pill: `background: rgba(20,184,166,0.08)`, `border: 1px solid rgba(20,184,166,0.2)`, `border-radius: 6px`, `padding: 4px 10px`. Field name in teal monospace `0.75rem`, type in muted `0.65rem`. Connect them with SVG lines (stroke `rgba(20,184,166,0.2)`, stroke-width 1).
+At the bottom of the card, a muted "See example row →" in `0.72rem` teal.
+**Detail drawer content** (slides in on tap — see Drawer spec below):
-**Continue button:** `background: var(--accent)`, `color: #000`, `font-weight: 700`, `font-size: 0.82rem`, `border-radius: 99px`, `padding: 8px 20px`, `border: none`, `cursor: pointer`. Hover: `opacity: 0.85`.
+- `EXAMPLE ROW` label
+- Every field displayed as: field name (teal monospace) + value (white). Long text in a soft box — `background: rgba(255,255,255,0.03)`, `border-radius: 6px`, `padding: 8px 12px`. Nothing truncated.
+- `FIELD GUIDE` label, then each field one per line: name · type · purpose sentence.
 ---
-#### Card 1 Body — Environment
+### Section 3 — Rewards (tappable)
-- **Env name**: `font-size: 1.6rem`, `font-weight: 800`, white
-- **Description**: 3–4 sentences, `font-size: 0.9rem`, `color: #94a3b8`, `line-height: 1.65`, `margin: 14px 0`
-- **GitHub link** (only if URL was found in source): pill button — `background: rgba(255,255,255,0.05)`, `border: 1px solid rgba(255,255,255,0.1)`, `color: #e2e8f0`, `border-radius: 99px`, `padding: 5px 14px`, `font-size: 0.78rem`. Shows `↗ GitHub`. Hover: `border-color: var(--accent)`, `color: var(--accent)`. If no URL found, this element does not exist.
-- **Stat chips**: row of 3–5 pills. `background: rgba(168,85,247,0.08)`, `border: 1px solid rgba(168,85,247,0.2)`, `color: #c4b5fd`, `border-radius: 99px`, `padding: 4px 11px`, `font-size: 0.73rem`
+**Scan face** — a horizontal stacked bar chart:
----
+Each reward function gets one horizontal bar. The bar represents its contribution to the final score (equal weight if not specified). Left side: reward name in amber monospace `0.8rem`. Right side: bar — `height: 8px`, `border-radius: 4px`, `background: linear-gradient(90deg, #f59e0b, #fbbf24)`. Below the bar: one-phrase description in `0.65rem` muted text.
-#### Card 2 Body — Dataset
+If there's a final score formula, show it below the bars in a formula chip:
+`background: rgba(245,158,11,0.08)`, `border: 1px solid rgba(245,158,11,0.2)`, `border-radius: 8px`, `padding: 8px 14px`, amber monospace `0.82rem`.
-**Source line** — monospace, one line:
-```
-HuggingFace · openai/gsm8k · train split
+At the bottom: "See reward logic →" in `0.72rem` amber.
+**Detail drawer content:**
+For each reward, a block:
 ```
-Style: `background: rgba(34,211,238,0.05)`, `border-left: 3px solid #22d3ee`, `padding: 8px 14px`, `border-radius: 0 6px 6px 0`, `font-size: 0.82rem`, `color: #67e8f9`
+format_reward                        [float 0–1]
+─────────────────────────────────────────────────
+Checks response contains <answer>…</answer>
-**Field list** — compact, beneath the source line:
-Each field on one line: `field_name` in cyan monospace + `·` + type/purpose in muted text. `font-size: 0.8rem`, `line-height: 1.8`.
+  ✗  0   Tags absent, or inner content non-numeric
+  ✓  1   Tags present, inner content is valid integer
-**Example row** — the main content:
-A clean structured display. Label: `EXAMPLE ROW` in `0.65rem` cyan caps. Then each field:
-- Field name: cyan monospace, `font-size: 0.78rem`
-- Value: white, `font-size: 0.82rem`, `line-height: 1.5`
-- Long text values (prompts, answers): wrapped in a soft box — `background: rgba(255,255,255,0.03)`, `border-radius: 6px`, `padding: 8px 12px`, `margin-top: 2px`
-- Full content — never truncated
+  Pattern: <answer>(\d+)</answer>
+```
+- Name: amber monospace, `font-weight: 700`, `0.88rem`
+- `[float 0–1]` badge: `0.65rem`, `#6b7280`, right-aligned via flex
+- Description: `0.8rem`, `#94a3b8`
+- ✗ / ✓ lines: `0.78rem`. ✗ in `#f87171`, ✓ in `#4ade80`, text in `#94a3b8`
+- Pattern/threshold: monospace, `0.75rem`, `#64748b`
+- Block: `background: rgba(245,158,11,0.04)`, `border: 1px solid rgba(245,158,11,0.1)`, `border-radius: 10px`, `padding: 12px 14px`, `margin-bottom: 10px`
 ---
-#### Card 3 Body — Rewards
+### Section 4 — Rollout (tappable)
-For each reward function, a compact block:
+**Scan face** — a horizontal pipeline flow diagram:
 ```
-format_reward                               [float]
-Checks response contains <answer>…</answer>
-  ✗ 0  tags absent or inner content non-numeric
-  ✓ 1  tags present, content is a valid integer
+[DATA ROW] ──▶ [PROMPT] ──▶ [MODEL] ──▶ [REWARDS] ──▶ [SCORE]
 ```
-- Name: monospace, `color: #fcd34d`, `font-weight: 600`, `font-size: 0.88rem`
-- `[float]` badge: `font-size: 0.68rem`, `color: #6b7280`, right-aligned via `display: flex justify-content: space-between`
-- Description line: `color: #94a3b8`, `font-size: 0.8rem`, `margin: 4px 0 6px`
-- `✗ 0` / `✓ 1` lines: `font-size: 0.78rem`, `✗` in `#f87171`, `✓` in `#4ade80`, text in `#94a3b8`
-- Block: `background: rgba(245,158,11,0.05)`, `border: 1px solid rgba(245,158,11,0.12)`, `border-radius: 10px`, `padding: 12px 14px`, `margin-bottom: 10px`
+Each node: rounded rect, `background: rgba(244,63,94,0.08)`, `border: 1px solid rgba(244,63,94,0.2)`, `border-radius: 10px`, `padding: 10px 16px`. Node label: rose monospace `0.8rem` bold. Below each node: 1 short phrase (≤6 words) in `0.65rem` muted. Arrows: SVG `▶` in `rgba(244,63,94,0.4)`.
-If composite formula exists — after all reward blocks:
-```
-background: rgba(245,158,11,0.08)
-border: 1px solid rgba(245,158,11,0.25)
-border-radius: 8px · padding: 10px 14px
-font-family: monospace · color: #fcd34d · font-size: 0.85rem
-```
+On narrow viewports, collapse to vertical with connecting arrows below each node.
+At the bottom: "Trace an example →" in `0.72rem` rose.
+**Detail drawer content:**
+5 numbered steps. Each:
+- Step number: `2rem`, `font-weight: 800`, rose, `opacity: 0.2`
+- Step title: `0.9rem`, white, `font-weight: 700`
+- Description: `0.82rem`, `#94a3b8`, `line-height: 1.6`
+- Left border: `border-left: 2px solid rgba(244,63,94,0.15)`, `padding-left: 16px`, `margin-left: 10px` (omit on last step)
+- `margin-bottom: 20px`
+Steps are always:
+1. **Data → Prompt** — how the raw row becomes the exact prompt the model sees (include template if found)
+2. **Model Response** — what the model is expected to produce (format, tags, structure)
+3. **Reward Evaluation** — how each reward fires on a sample output; show scores for a real example
+4. **Score Computation** — the exact formula and resulting score
+5. **Perfect vs Zero** — what earns a full score vs what earns zero; concrete contrasting examples
 ---
-#### Card 4 Body — Rollout
+### Detail Drawer
-Numbered steps. Each step:
+A panel that slides in from the **right edge** of the viewport, overlaying the page.
+```css
+.drawer {
+  position: fixed;
+  top: 0; right: 0;
+  width: min(480px, 95vw);
+  height: 100vh;
+  background: #0d1117;
+  border-left: 1px solid rgba(255,255,255,0.1);
+  padding: 32px 28px;
+  overflow-y: auto;
+  transform: translateX(100%);
+  transition: transform 0.35s cubic-bezier(0.4, 0, 0.2, 1);
+  z-index: 100;
+}
+.drawer.open { transform: translateX(0); }
 ```
-  01
-  Data → Prompt
-  The problem field is inserted into "Solve step by step…"
-  as the user message. No system prompt.
-```
-- Number: `font-size: 2rem`, `font-weight: 800`, `color: var(--accent)`, `opacity: 0.25`, `line-height: 1`
-- Title: `font-size: 0.88rem`, `font-weight: 700`, `color: #f1f5f9`, `margin: 2px 0`
-- Description: `font-size: 0.8rem`, `color: #94a3b8`, `line-height: 1.55`
-- Left connector: `border-left: 2px solid rgba(244,63,94,0.15)`, `padding-left: 16px`, `margin-left: 12px`, except on last step
-- Between steps: `margin-bottom: 16px`
+**Backdrop:** `position: fixed; inset: 0; background: rgba(0,0,0,0.5); z-index: 99` — fades in with `opacity 0.3s`. Clicking it closes the drawer.
+**Close button:** `×` in the top-right of the drawer. `font-size: 1.1rem`, `color: #475569`, hover rose. Keyboard: `Escape` closes.
+**Drawer header:**
+- Section label (e.g., `DATASET DETAIL`) in accent color, `0.65rem` caps
+- Section name in white `1.1rem` bold
+- Thin `border-bottom: 1px solid rgba(255,255,255,0.07)`, `padding-bottom: 16px`, `margin-bottom: 20px`
-Always 5 steps: Data→Prompt · Model Response · Reward Evaluation · Score Computation · Perfect vs Zero.
+Scroll within the drawer. The rest of the page does not scroll while drawer is open (`body { overflow: hidden }`).
 ---
-### Reveal Animation
+### Motion
 ```css
-@keyframes flyOut {
-  to { transform: translateX(-120%) rotate(-8deg); opacity: 0; }
+@keyframes fadeSlideIn {
+  from { opacity: 0; transform: translateY(12px); }
+  to   { opacity: 1; transform: translateY(0); }
 }
-@keyframes riseUp {
-  from { transform: scale(0.96) translateY(18px); opacity: 0.65; }
-  to   { transform: scale(1) translateY(0); opacity: 1; }
+.section-card {
+  animation: fadeSlideIn 0.4s ease both;
 }
+/* Stagger via animation-delay: 0s, 0.08s, 0.16s, 0.24s on the four sections */
 ```
-On click:
-1. Active card: `flyOut 0.45s cubic-bezier(0.4, 0, 0.2, 1) forwards`
-2. After 80ms: next card transitions from depth-1 styles to depth-0 styles — `riseUp 0.5s cubic-bezier(0.34, 1.56, 0.64, 1)` (spring overshoot)
-3. All remaining cards shift their `data-depth` attributes down by 1
-4. Progress dots update with a `0.3s` width transition
+Drawer slide uses CSS transition only (no keyframes). Section cards lift slightly on hover:
+```css
+.section-card:hover { transform: translateY(-2px); box-shadow: 0 8px 32px rgba(0,0,0,0.4); }
+```
-Guard all animations:
+Reduced motion guard:
 ```css
 @media (prefers-reduced-motion: reduce) {
-  *, *::before, *::after { animation: none !important; transition: none !important; }
+  *, *::before, *::after { animation: none !important; transition-duration: 0.01ms !important; }
 }
 ```
@@ -264,27 +287,39 @@ Guard all animations:
 ### Page Chrome
-**Top:** Environment name in small muted text, centered, `font-size: 0.75rem`, `color: #334155`, `letter-spacing: 0.08em`, `margin-bottom: 32px`.
+**Top of page** — above the first card:
+```
+PRIME INTELLECT · ENVIRONMENT OVERVIEW          [environment-name]
+```
+`font-size: 0.68rem`, `color: #1e293b`, `letter-spacing: 0.1em`, `margin-bottom: 40px`. Nothing else at the top.
-**Bottom:** `Generated by Claude · Prime Intellect · <timestamp>` — `font-size: 0.68rem`, `color: #1e293b`, `margin-top: 28px`.
+**Bottom of page** — below the last card:
+```
+Generated by Claude · <timestamp>
+```
+`font-size: 0.65rem`, `color: #1e293b`, `margin-top: 48px`, centered.
-Nothing else. No nav, no sidebar, no header. The cards are the whole UI.
+Nothing else. No nav, no header, no sidebar.
 ---
 ## Step 3 — Confirm and report
-After writing the file:
-- Give the full path and `open environment_overview.html`
-- Two sentences: what the environment trains and how it scores
+After writing the file, share the full path and say:
+- What the environment trains (one sentence)
+- How it scores (one sentence)
-## Anti-patterns
+---
-- Do not dump all content at once — each card is one focused chapter
-- Do not truncate the example row — every field, every value, in full
-- Do not invent a GitHub URL — only include it if found in the source
-- Do not hallucinate reward weights, field names, or dataset content
-- Do not skip helper modules — they contain the core reward logic
-- Do not add tabs, sidebars, scroll-within-cards, or any structure beyond the 4-card deck
-- Do not use a light theme — dark only
-- Do not use Inter, Roboto, or any Google Font
+## Anti-patterns — never do these
+- **Do not write walls of text on the scan face.** Every section face is a diagram or a chart. Text labels only.
+- **Do not truncate the example row.** Full values, all fields, in the drawer.
+- **Do not invent a GitHub URL.** Only include it if found verbatim in source.
+- **Do not hallucinate field names, reward weights, or dataset content.** Extract exactly.
+- **Do not skip helper modules.** Core reward logic is often there.
+- **Do not use a light theme.**
+- **Do not use Inter, Roboto, or any Google Font.**
+- **Do not add tabs, nav, or scroll-within-cards.** The drawer is the only overlay.
+- **Do not add more than four sections.** Header + Dataset + Rewards + Rollout. That's it.
+- **Do not use a card-stack / deck reveal mechanic.** This is a scrollable single-column page.

package/skills/understand-environment/SKILL.md DELETED Viewed

@@ -1,494 +0,0 @@
----
-name: understand-environment
-description: Generate a rich, self-contained HTML report that fully explains a Prime Intellect verifiers environment. Use this skill any time the user asks to understand, explain, document, visualize, or explore a verifiers environment — even if they just say "what does this environment do?", "explain this env", "give me an overview", or "generate an HTML for this environment". The skill reads the Python source files in the current directory, extracts the dataset, reward functions, rollout logic, and configuration parameters, and writes a beautiful HTML file to the environment folder.
----
-# Understand Environment
-## Goal
-Produce a single self-contained HTML file (`environment_overview.html`) — a **treasure map** that lets a first-timer understand any verifiers environment in under 5 minutes. Everything is visible at once, on one screen, no scrolling, no clicking required. The page answers one question: *"What is this environment training a model to do?"*
----
-## Step 1 — Read the source
-Read **every `.py` file** in the current directory. Also read `pyproject.toml` and `README.md` if they exist. Do not skip helper files — reward logic is often split across modules (e.g. `*_checks.py`, `*_prompts.py`).
-Extract exactly three things:
-### 1. The Task — what does the model see?
-- Find 1 real example prompt from the source (a `PROMPTS` list, HuggingFace dataset, or prompt builder).
-- Extract only the **user-facing prompt text** the model actually reads. Truncate to ~300 chars if longer.
-- Note which file it lives in.
-### 2. The Judge — what counts as a good response?
-- Write 2–3 sentences in plain English describing how the environment scores the model. What is it rewarding? What does a high score look like vs a low score?
-- If there is a composite formula (e.g. `R = (1−hw)×visible + hw×hidden`), include it.
-- Note which file the reward logic lives in.
-### 3. The Loop — how does one rollout execute?
-- Identify 4–5 steps: what the model receives → what it produces → any tools/sandbox → scoring → final score.
-- Write each step as a 2–4 word label and a single-line description.
-- Note which file the rollout logic lives in.
----
-## Step 2 — Generate the HTML
-Write a single self-contained `./environment_overview.html`. No external CDN — all CSS and JS inline.
----
-### Aesthetic Direction: "Cartographic"
-The page looks like a **premium map artifact** — warm parchment in light mode, deep space in dark mode. Three cards feel like physical territories on the map, slightly lifted off the page. The purple accent (#a855f7) is the single modern intrusion into an otherwise scholarly palette — like a highlighted route drawn over an aged chart.
-**The one thing a viewer remembers**: the parchment texture and the offset card shadows in light mode — it genuinely feels like a physical object.
----
-### Theme System
-All colors as CSS custom properties. Toggle swaps `data-theme="dark"` on `<html>`. Persisted via `localStorage`.
-**Light theme — "Parchment" (default):**
-```css
---bg-page:       #f4efe4;   /* warm laid paper */
---bg-card:       #fdfaf3;   /* lighter parchment for cards */
---bg-code:       #ece6d4;   /* aged paper for code blocks */
---border:        #d6cba8;   /* ink-faded edge */
---border-strong: #b8a87a;
---text-primary:  #1c1410;   /* dark iron-gall ink */
---text-secondary:#5c4f3a;   /* brown ink */
---text-muted:    #9c8b6e;
---accent:        #a855f7;   /* PI purple — the modern intrusion */
---accent-soft:   rgba(168,85,247,0.12);
---shadow-card:   4px 6px 0 rgba(168,85,247,0.10), 0 2px 8px rgba(100,70,20,0.12);
---shadow-hover:  6px 8px 0 rgba(168,85,247,0.18), 0 4px 16px rgba(100,70,20,0.16);
-/* Parchment noise texture — paste this SVG as a data URI background on body */
-/* background-image: url("data:image/svg+xml,...") — see Noise Texture section */
-```
-**Dark theme — "Deep Space" (`[data-theme="dark"]`):**
-```css
---bg-page:       #0f0f1a;
---bg-card:       #161627;
---bg-code:       #1a1a2e;
---border:        #2a2a4a;
---border-strong: #3d3d6b;
---text-primary:  #e8e4f0;
---text-secondary:#b0a8c8;
---text-muted:    #6b6488;
---accent:        #a855f7;
---accent-soft:   rgba(168,85,247,0.15);
---shadow-card:   0 0 0 1px rgba(168,85,247,0.15), 0 8px 32px rgba(0,0,0,0.5);
---shadow-hover:  0 0 0 1px rgba(168,85,247,0.3), 0 12px 40px rgba(168,85,247,0.15);
-```
-All elements get `transition: background-color 0.3s ease, border-color 0.3s ease, color 0.2s ease, box-shadow 0.3s ease` so the toggle animates smoothly.
----
-### Noise Texture (light mode only)
-Add a subtle grain to `body` in light mode using an inline SVG filter — no external file needed:
-```html
-<svg style="display:none">
-  <filter id="noise">
-    <feTurbulence type="fractalNoise" baseFrequency="0.65" numOctaves="3" stitchTiles="stitch"/>
-    <feColorMatrix type="saturate" values="0"/>
-    <feBlend in="SourceGraphic" mode="multiply"/>
-  </filter>
-</svg>
-```
-Apply to body in light mode:
-```css
-:root body { filter: url(#noise); }  /* very subtle — opacity trick below */
-```
-Better approach — use a pseudo-element overlay:
-```css
-body::before {
-  content: '';
-  position: fixed; inset: 0; pointer-events: none; z-index: 9999;
-  opacity: 0.03;
-  background-image: url("data:image/svg+xml,%3Csvg viewBox='0 0 256 256' xmlns='http://www.w3.org/2000/svg'%3E%3Cfilter id='n'%3E%3CfeTurbulence type='fractalNoise' baseFrequency='0.9' numOctaves='4' stitchTiles='stitch'/%3E%3C/filter%3E%3Crect width='100%25' height='100%25' filter='url(%23n)'/%3E%3C/svg%3E");
-}
-[data-theme="dark"] body::before { opacity: 0.04; }
-```
----
-### Typography
-```css
-/* Display — scholarly, map-label feel */
---font-display: 'Palatino Linotype', 'Book Antiqua', Palatino, Georgia, serif;
-/* Body — humanist, readable, not generic */
---font-body: 'Optima', 'Candara', 'Gill Sans', 'Segoe UI', sans-serif;
-/* Code — typewriter, like coordinates on a map */
---font-mono: 'Courier New', 'Lucida Console', ui-monospace, monospace;
-```
-Scale:
-- Env name: `2.2rem`, `var(--font-display)`, `font-weight: 700`, `letter-spacing: -0.02em`, color: `var(--text-primary)`
-- Env description: `0.95rem`, `var(--font-body)`, `color: var(--text-secondary)`, `font-style: italic`
-- Card landmark label: `0.65rem`, `var(--font-body)`, `font-weight: 700`, `letter-spacing: 0.12em`, `text-transform: uppercase`, `color: var(--accent)`
-- Card body text: `0.875rem`, `var(--font-body)`, `line-height: 1.6`, `color: var(--text-primary)`
-- Code: `0.8rem`, `var(--font-mono)`, `color: var(--text-primary)`
-- Filename badge: `0.7rem`, `var(--font-mono)`, `color: var(--text-muted)`
----
-### Layout
-Full viewport, no scroll. CSS Grid:
-```css
-html, body { height: 100%; margin: 0; overflow: hidden; }
-body {
-  display: grid;
-  grid-template-rows: auto 1fr;  /* header + cards */
-  padding: 24px 28px 20px;
-  gap: 20px;
-  box-sizing: border-box;
-  background: var(--bg-page);
-}
-.cards {
-  display: grid;
-  grid-template-columns: 1fr 1fr 1fr;
-  gap: 18px;
-  min-height: 0;  /* critical: lets grid row shrink */
-}
-```
----
-### Header
-```
-┌─ header ──────────────────────────────────────────────────────┐
-│  ⬡ Prime Intellect          [light mode: ☀  dark mode: ☾]    │
-│                                                               │
-│  ifeval_goblin                                                │
-│  Trains a model to follow format instructions...             │
-└───────────────────────────────────────────────────────────────┘
-```
-- Top row: `⬡ Prime Intellect` in `0.7rem` caps, `var(--accent)`, `letter-spacing: 0.1em` — left. Theme toggle — right.
-- Theme toggle: a small pill `<button>`, `background: var(--accent-soft)`, `border: 1px solid var(--border-strong)`, `border-radius: 99px`, `padding: 4px 12px`, `font-size: 0.75rem`. Shows `☀ Light` or `☾ Dark`. Hover: `background: var(--accent-soft)` stronger, no outline.
-- Env name: large display serif below, with a 2px `var(--accent)` underline that is 40px wide and sits 4px below the baseline — does NOT span the full word. Like a map annotation mark.
-- Description: italic, secondary color, one line beneath.
----
-### Cards
-Three equal-width cards. Each card is a fixed-height territory on the map.
-```css
-.card {
-  background: var(--bg-card);
-  border: 1.5px solid var(--border);
-  border-radius: 10px;
-  box-shadow: var(--shadow-card);
-  padding: 20px;
-  display: grid;
-  grid-template-rows: auto 1fr auto;  /* label | content | filename */
-  min-height: 0;
-  transition: box-shadow 0.25s ease, transform 0.25s ease;
-}
-.card:hover {
-  box-shadow: var(--shadow-hover);
-  transform: translateY(-2px);
-}
-```
-Each card has exactly three zones:
-**Zone 1 — Landmark label** (top):
-```
-THE TASK          THE JUDGE          THE LOOP
-━━━━━━━━          ━━━━━━━━━          ━━━━━━━━
-```
-Label in `0.65rem` uppercase caps + accent color. A `1px solid var(--border-strong)` rule beneath it, `margin-bottom: 14px`.
-**Zone 2 — Content** (middle, `overflow: hidden`):
-Content specific to each card — see below.
-**Zone 3 — Filename badge** (bottom):
-```
-📍 ifeval_goblin_prompts.py
-```
-`font-size: 0.7rem`, `font-family: var(--font-mono)`, `color: var(--text-muted)`. A `1px solid var(--border)` rule above it, `padding-top: 10px`, `margin-top: 10px`.
----
-### Card 1 — The Task
-Content zone: the actual prompt text the model sees.
-```css
-.task-prompt {
-  background: var(--bg-code);
-  border-left: 3px solid var(--accent);
-  border-radius: 0 6px 6px 0;
-  padding: 12px 14px;
-  font-family: var(--font-mono);
-  font-size: 0.78rem;
-  line-height: 1.55;
-  color: var(--text-primary);
-  overflow: hidden;
-  display: -webkit-box;
-  -webkit-line-clamp: 8;           /* truncate at ~8 lines */
-  -webkit-box-orient: vertical;
-  white-space: pre-wrap;
-  word-break: break-word;
-}
-```
-If the prompt has format constraints embedded (e.g. "do not use the letter g"), highlight those phrases:
-```html
-<mark style="background:rgba(168,85,247,0.15); border-radius:2px; padding:0 2px;">
-  do not use the letter g
-</mark>
-```
----
-### Card 2 — The Judge
-Content zone: plain English scoring description.
-```css
-.judge-description {
-  font-family: var(--font-body);
-  font-size: 0.875rem;
-  line-height: 1.65;
-  color: var(--text-primary);
-}
-```
-If there is a composite formula, render it below the prose in a formula block:
-```css
-.formula {
-  margin-top: 14px;
-  background: var(--accent-soft);
-  border: 1px solid var(--accent);
-  border-radius: 6px;
-  padding: 10px 14px;
-  font-family: var(--font-mono);
-  font-size: 0.78rem;
-  color: var(--accent);
-  letter-spacing: 0.02em;
-}
-```
-If no formula exists, omit the block entirely — don't leave empty space.
----
-### Card 3 — The Loop
-Content zone: a static pipeline diagram.
-Layout: vertical stack of step rows connected by short vertical lines (easier to fit in a card than horizontal).
-```
-  ● Prompt received
-  │
-  ● Model generates response
-  │
-  ● [Tool call / sandbox]     ← only if applicable
-  │
-  ● Scoring applied
-  │
-  ● Final score emitted
-```
-HTML structure:
-```html
-<div class="pipeline">
-  <div class="step">
-    <div class="node"></div>
-    <div class="step-text">
-      <span class="step-label">Prompt received</span>
-      <span class="step-desc">64 task prompts with format constraints</span>
-    </div>
-  </div>
-  <div class="connector"></div>
-  <!-- repeat -->
-</div>
-```
-CSS:
-```css
-.pipeline { display: flex; flex-direction: column; gap: 0; }
-.step { display: flex; align-items: flex-start; gap: 12px; }
-.node {
-  width: 10px; height: 10px; border-radius: 50%;
-  background: var(--accent); flex-shrink: 0;
-  margin-top: 4px;
-  box-shadow: 0 0 0 3px var(--accent-soft);
-}
-.connector {
-  width: 1px; height: 14px;
-  background: var(--border-strong);
-  margin-left: 4.5px;        /* aligns with node center */
-  border-left: 1.5px dashed var(--border-strong);
-}
-.step-label {
-  display: block;
-  font-family: var(--font-body);
-  font-size: 0.8rem;
-  font-weight: 600;
-  color: var(--text-primary);
-}
-.step-desc {
-  display: block;
-  font-family: var(--font-body);
-  font-size: 0.72rem;
-  color: var(--text-muted);
-  line-height: 1.4;
-  margin-top: 1px;
-}
-```
----
-### Theme Toggle JS (complete, ~15 lines)
-```js
-(function() {
-  const root = document.documentElement;
-  const btn = document.getElementById('theme-toggle');
-  const saved = localStorage.getItem('pi-env-theme');
-  if (saved) root.setAttribute('data-theme', saved);
-  function update() {
-    const isDark = root.getAttribute('data-theme') === 'dark';
-    btn.textContent = isDark ? '☀ Light' : '☾ Dark';
-  }
-  update();
-  btn.addEventListener('click', function() {
-    const next = root.getAttribute('data-theme') === 'dark' ? 'light' : 'dark';
-    root.setAttribute('data-theme', next);
-    localStorage.setItem('pi-env-theme', next);
-    update();
-  });
-})();
-```
----
-### Page Load Animation
-One single orchestrated entrance — not scattered micro-animations:
-```css
-@keyframes rise {
-  from { opacity: 0; transform: translateY(10px); }
-  to   { opacity: 1; transform: translateY(0); }
-}
-.header { animation: rise 0.5s ease both; }
-.card:nth-child(1) { animation: rise 0.5s ease 0.1s both; }
-.card:nth-child(2) { animation: rise 0.5s ease 0.2s both; }
-.card:nth-child(3) { animation: rise 0.5s ease 0.3s both; }
-@media (prefers-reduced-motion: reduce) {
-  * { animation: none !important; }
-}
-```
----
-### Final HTML skeleton
-```html
-<!DOCTYPE html>
-<html lang="en">
-<head>
-  <meta charset="UTF-8">
-  <meta name="viewport" content="width=device-width, initial-scale=1.0">
-  <title>[ENV_NAME] — Prime Intellect Environment</title>
-  <style>/* ALL CSS INLINE HERE */</style>
-</head>
-<body>
-  <!-- Noise texture SVG (hidden) -->
-  <svg style="display:none">...</svg>
-  <header class="header">
-    <div class="header-top">
-      <span class="pi-logo">⬡ Prime Intellect</span>
-      <button id="theme-toggle">☾ Dark</button>
-    </div>
-    <h1 class="env-name">[ENV_NAME]</h1>
-    <p class="env-desc">[ONE_SENTENCE_DESCRIPTION]</p>
-  </header>
-  <div class="cards">
-    <div class="card">
-      <div class="card-label">The Task</div>
-      <div class="card-content">
-        <pre class="task-prompt">[PROMPT_TEXT]</pre>
-      </div>
-      <div class="card-file">📍 [FILENAME]</div>
-    </div>
-    <div class="card">
-      <div class="card-label">The Judge</div>
-      <div class="card-content">
-        <p class="judge-description">[SCORING_DESCRIPTION]</p>
-        <!-- optional -->
-        <div class="formula">[FORMULA]</div>
-      </div>
-      <div class="card-file">📍 [FILENAME]</div>
-    </div>
-    <div class="card">
-      <div class="card-label">The Loop</div>
-      <div class="card-content">
-        <div class="pipeline">
-          <!-- step + connector pairs -->
-        </div>
-      </div>
-      <div class="card-file">📍 [FILENAME]</div>
-    </div>
-  </div>
-  <script>/* ALL JS INLINE HERE */</script>
-</body>
-</html>
-```
----
-## Step 3 — Confirm and report
-After writing the file:
-- Tell the user the full path and `open environment_overview.html`
-- Two sentences: what the environment does and how it scores
-## Anti-patterns
-- Do not add any section beyond the three cards
-- Do not add tabs, collapsibles, config tables, file maps, quick-start blocks, or score bars
-- Do not let any card overflow or scroll — truncate content to fit
-- Do not hallucinate prompt text, reward logic, or filenames not found in the source
-- Do not skip helper modules — they often contain the core scoring logic
-- Do not use Inter, Roboto, system-ui, or Space Grotesk — use the Palatino/Optima/Courier stack