npm - job-forge - Versions diffs - 2.14.10 → 2.14.12 - Mend

job-forge 2.14.10 → 2.14.12

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (33)

package/.claude/agents/general-free.md +2 -2
package/.claude/agents/general-paid.md +5 -5
package/.claude/agents/glm-minimal.md +1 -1
package/.cursor/rules/agent-general-free.mdc +2 -2
package/.cursor/rules/agent-general-paid.mdc +5 -5
package/.cursor/rules/agent-glm-minimal.mdc +1 -1
package/.cursor/rules/main.mdc +2 -2
package/.opencode/agents/general-free.md +3 -8
package/.opencode/agents/general-paid.md +6 -13
package/.opencode/agents/glm-minimal.md +2 -7
package/AGENTS.md +2 -2
package/CLAUDE.md +2 -2
package/README.md +2 -2
package/bin/create-job-forge.mjs +14 -48
package/bin/job-forge.mjs +29 -0
package/bin/sync.mjs +0 -1
package/docs/ARCHITECTURE.md +3 -2
package/docs/CUSTOMIZATION.md +13 -0
package/docs/MODEL-ROUTING.md +43 -76
package/docs/README.md +1 -1
package/docs/SETUP.md +3 -0
package/iso/agents/general-free.md +5 -13
package/iso/agents/general-paid.md +8 -20
package/iso/agents/glm-minimal.md +4 -11
package/iso/config.json +1 -42
package/iso/instructions.md +2 -2
package/models.yaml +25 -11
package/modes/apply.md +8 -8
package/opencode.json +5 -26
package/package.json +6 -3
package/scripts/check-iso-smoke.mjs +7 -3
package/scripts/telemetry.mjs +643 -0
package/.opencode/opencode-model-fallback.json +0 -22

package/.claude/agents/general-free.md CHANGED

@@ -1,10 +1,10 @@
 ---
 name: general-free
-description: Procedural worker on free-tier model. Use for form filling via Geometra, tracker updates, TSV merges, scan dedup, OTP retrieval, and other mechanical/scripted tasks where quality-sensitive text generation is NOT required.
+description: Procedural worker on the low-cost DeepSeek V4 Flash OpenCode route. Use for form filling via Geometra, tracker updates, TSV merges, scan dedup, OTP retrieval, and other mechanical/scripted tasks where quality-sensitive text generation is NOT required.
 model: claude-haiku-4-5
 ---
-You are the @general-free subagent. You run on a free-tier model, which means the orchestrator has delegated this task to you **specifically because the work is procedural**: deterministic steps, scripted outputs, no nuanced writing required.
+You are the @general-free subagent. You run on JobForge's low-cost procedural model, which means the orchestrator has delegated this task to you **specifically because the work is procedural**: deterministic steps, scripted outputs, no nuanced writing required.
 ## Run This Pre-Flight First Every Time

package/.claude/agents/general-paid.md CHANGED

@@ -1,15 +1,15 @@
 ---
 name: general-paid
-description: Quality-sensitive worker on the strongest free-tier OpenCode model by default. Use for offer evaluation narratives (Blocks A-F), cover letter generation, "Why X?" form answers, interview STAR stories, and other tasks where writing quality and judgment matter.
+description: Quality-sensitive worker on the low-cost DeepSeek V4 Flash OpenCode route by default. Use for offer evaluation narratives (Blocks A-F), cover letter generation, "Why X?" form answers, interview STAR stories, and other tasks where writing quality and judgment matter.
 model: claude-opus-4-7
 ---
 You are the @general-paid subagent. The orchestrator delegated this task to you because it requires quality writing or judgment — the kind of work `@general-free` isn't well-suited for.
-On OpenCode, this agent now defaults to a free OpenRouter model. On other
-harnesses, the same role may still resolve to a premium model. Your job is
-still the same: produce the best final writing you can from the context you
-were given.
+On OpenCode, this agent defaults to DeepSeek V4 Flash so application work
+does not fall back into overloaded free OpenRouter pools. On other harnesses,
+the same role may still resolve to a premium model. Your job is still the
+same: produce the best final writing you can from the context you were given.
 ## Do These Tasks

package/.claude/agents/glm-minimal.md CHANGED

@@ -1,6 +1,6 @@
 ---
 name: glm-minimal
-description: Narrow-scope extractor on free-tier model. Use for single-purpose tasks where the orchestrator passes the exact input and expects a small, structured output — e.g., "extract these 8 fields from this JD text" or "parse this form schema into a label→type map". NOT for multi-step workflows.
+description: Narrow-scope extractor on the low-cost DeepSeek V4 Flash OpenCode route. Use for single-purpose tasks where the orchestrator passes the exact input and expects a small, structured output — e.g., "extract these 8 fields from this JD text" or "parse this form schema into a label→type map". NOT for multi-step workflows.
 model: claude-haiku-4-5
 ---

package/.cursor/rules/agent-general-free.mdc CHANGED

@@ -1,9 +1,9 @@
 ---
-description: Procedural worker on free-tier model. Use for form filling via Geometra, tracker updates, TSV merges, scan dedup, OTP retrieval, and other mechanical/scripted tasks where quality-sensitive text generation is NOT required.
+description: Procedural worker on the low-cost DeepSeek V4 Flash OpenCode route. Use for form filling via Geometra, tracker updates, TSV merges, scan dedup, OTP retrieval, and other mechanical/scripted tasks where quality-sensitive text generation is NOT required.
 alwaysApply: false
 ---
-You are the @general-free subagent. You run on a free-tier model, which means the orchestrator has delegated this task to you **specifically because the work is procedural**: deterministic steps, scripted outputs, no nuanced writing required.
+You are the @general-free subagent. You run on JobForge's low-cost procedural model, which means the orchestrator has delegated this task to you **specifically because the work is procedural**: deterministic steps, scripted outputs, no nuanced writing required.
 ## Run This Pre-Flight First Every Time

package/.cursor/rules/agent-general-paid.mdc CHANGED

@@ -1,14 +1,14 @@
 ---
-description: Quality-sensitive worker on the strongest free-tier OpenCode model by default. Use for offer evaluation narratives (Blocks A-F), cover letter generation, "Why X?" form answers, interview STAR stories, and other tasks where writing quality and judgment matter.
+description: Quality-sensitive worker on the low-cost DeepSeek V4 Flash OpenCode route by default. Use for offer evaluation narratives (Blocks A-F), cover letter generation, "Why X?" form answers, interview STAR stories, and other tasks where writing quality and judgment matter.
 alwaysApply: false
 ---
 You are the @general-paid subagent. The orchestrator delegated this task to you because it requires quality writing or judgment — the kind of work `@general-free` isn't well-suited for.
-On OpenCode, this agent now defaults to a free OpenRouter model. On other
-harnesses, the same role may still resolve to a premium model. Your job is
-still the same: produce the best final writing you can from the context you
-were given.
+On OpenCode, this agent defaults to DeepSeek V4 Flash so application work
+does not fall back into overloaded free OpenRouter pools. On other harnesses,
+the same role may still resolve to a premium model. Your job is still the
+same: produce the best final writing you can from the context you were given.
 ## Do These Tasks

package/.cursor/rules/agent-glm-minimal.mdc CHANGED

@@ -1,5 +1,5 @@
 ---
-description: Narrow-scope extractor on free-tier model. Use for single-purpose tasks where the orchestrator passes the exact input and expects a small, structured output — e.g., "extract these 8 fields from this JD text" or "parse this form schema into a label→type map". NOT for multi-step workflows.
+description: Narrow-scope extractor on the low-cost DeepSeek V4 Flash OpenCode route. Use for single-purpose tasks where the orchestrator passes the exact input and expects a small, structured output — e.g., "extract these 8 fields from this JD text" or "parse this form schema into a label→type map". NOT for multi-step workflows.
 alwaysApply: false
 ---

package/.cursor/rules/main.mdc CHANGED

@@ -10,7 +10,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
 ## Hard limits
 - [H1] Max 2 parallel `task` dispatches per message. For N jobs, run `ceil(N/2)` sequential rounds of 2. A round is not complete until both subagents return a final outcome (`APPLIED`, `APPLY FAILED`, `SKIP`, `Discarded`, or a written TSV path). A `task` tool result that only gives a session id / title is a launch acknowledgement, not completion. Applies in all modes, for all user phrasings ("urgent", "apply to 10 jobs now").
-  why: higher parallelism blows through free-tier rate limits; each subagent requires post-cleanup and racing more than 2 reliably loses at least one result. On 2026-04-25 the orchestrator launched round 2 while round 1 had only returned task ids, leaving four application subagents in flight and losing two provider-fallback recoveries
+  why: each subagent requires post-cleanup and racing more than 2 reliably loses at least one result. On 2026-04-25 the orchestrator launched round 2 while round 1 had only returned task ids, leaving four application subagents in flight and losing two provider recoveries
 - [H2] Max 1 application per company+role. Before every `apply` dispatch, grep all four sources for the URL and for `company+role`: `data/pipeline.md`, all `data/applications/*.md` day files, `batch/tracker-additions/*.tsv`, `batch/tracker-additions/merged/*.tsv`. If any source shows APPLIED / Applied, skip the dispatch.
   why: 2026-04 same-day batch collision — when two batches target the same role, `npx job-forge merge` updates the existing day-file row rather than appending, so grepping day files alone misses earlier-batch applies; merged/*.tsv is the only place the breadcrumb remains
@@ -42,7 +42,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
   why: iso-trace showed 0.25% Agent calls across 5174 turns under a prior over-broad "delegate before 2nd tool call" rule — the rule was ignored in practice; narrowing matches the original cache-bust incident
 - [D2] Route subagent work by cost tier. `@general-free`: procedural — form-fill, TSV merge, verify, OTP retrieval, portal scan metadata extraction, one-shot structured-field transforms. `@general-paid`: quality-sensitive — offer evaluation narrative Blocks A-F, cover letters, "Why X?" answers, STAR interview stories, LinkedIn outreach. `@glm-minimal`: narrow ≤5K-input one-shot extract/classify jobs that do not need context.
-  why: GLM 5.1 doesn't discount cache reads so procedural work there costs ~10×; free-tier models handle procedural work fine empirically (`opencode/big-pickle` processed 1000+ messages at $0)
+  why: OpenCode routes all JobForge tiers through DeepSeek V4 Flash by default now; recent traces showed free OpenRouter fallbacks freezing or hitting provider balance errors during applications
 - [D3] Read the active mode file before dispatch. Mode files own score gates, provider fallback, portal runbooks, and output shape.
   why: mode-specific rules change faster than global orchestration rules; keeping them out of the shared prefix preserves cache efficiency and prevents stale branches
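
Rule H1 in the hunk above (at most 2 parallel dispatches, `ceil(N/2)` sequential rounds) could be sketched roughly as follows. This is not code from the package: `runInRounds`, `roundCount`, and the `dispatch` callback are hypothetical names, and the sketch assumes `dispatch` returns a promise that settles only once the subagent has reported a final outcome, not a launch acknowledgement.

```javascript
// Hypothetical sketch of H1: never more than `maxParallel` dispatches in flight.
// `dispatch` is an assumed async function that resolves with a final outcome
// such as "APPLIED" or "SKIP"; it is not part of job-forge.
async function runInRounds(jobs, dispatch, maxParallel = 2) {
  const outcomes = [];
  for (let i = 0; i < jobs.length; i += maxParallel) {
    const round = jobs.slice(i, i + maxParallel);
    // A round is complete only when every subagent in it has returned,
    // so the next round cannot start while earlier work is still in flight.
    const results = await Promise.all(round.map((job) => dispatch(job)));
    outcomes.push(...results);
  }
  return outcomes;
}

// Number of sequential rounds needed for n jobs.
function roundCount(n, maxParallel = 2) {
  return Math.ceil(n / maxParallel);
}
```

Awaiting `Promise.all` per round is one simple way to keep a later round from racing an earlier one; a real orchestrator would also need to decide how a rejected dispatch affects the rest of its round.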

package/.opencode/agents/general-free.md CHANGED

@@ -1,7 +1,7 @@
 ---
-description: Procedural worker on free-tier model. Use for form filling via Geometra, tracker updates, TSV merges, scan dedup, OTP retrieval, and other mechanical/scripted tasks where quality-sensitive text generation is NOT required.
+description: Procedural worker on the low-cost DeepSeek V4 Flash OpenCode route. Use for form filling via Geometra, tracker updates, TSV merges, scan dedup, OTP retrieval, and other mechanical/scripted tasks where quality-sensitive text generation is NOT required.
 mode: subagent
-model: opencode/big-pickle
+model: opencode-go/deepseek-v4-flash
 tools:
   geometra_connect: true
   geometra_page_model: true
@@ -17,14 +17,9 @@ tools:
   task: false
 temperature: 0.1
 reasoningEffort: minimal
-fallback_models:
-  - openrouter/z-ai/glm-4.5-air:free
-  - openrouter/openai/gpt-oss-20b:free
-  - openrouter/nvidia/nemotron-3-nano-30b-a3b:free
-  - openrouter/qwen/qwen3-coder:free
 ---
-You are the @general-free subagent. You run on a free-tier model, which means the orchestrator has delegated this task to you **specifically because the work is procedural**: deterministic steps, scripted outputs, no nuanced writing required.
+You are the @general-free subagent. You run on JobForge's low-cost procedural model, which means the orchestrator has delegated this task to you **specifically because the work is procedural**: deterministic steps, scripted outputs, no nuanced writing required.
 ## Run This Pre-Flight First Every Time

package/.opencode/agents/general-paid.md CHANGED

@@ -1,7 +1,7 @@
 ---
-description: Quality-sensitive worker on the strongest free-tier OpenCode model by default. Use for offer evaluation narratives (Blocks A-F), cover letter generation, "Why X?" form answers, interview STAR stories, and other tasks where writing quality and judgment matter.
+description: Quality-sensitive worker on the low-cost DeepSeek V4 Flash OpenCode route by default. Use for offer evaluation narratives (Blocks A-F), cover letter generation, "Why X?" form answers, interview STAR stories, and other tasks where writing quality and judgment matter.
 mode: subagent
-model: openrouter/qwen/qwen3-next-80b-a3b-instruct:free
+model: opencode-go/deepseek-v4-flash
 tools:
   geometra_connect: true
   geometra_page_model: true
@@ -17,21 +17,14 @@ tools:
   task: false
 temperature: 0.3
 reasoningEffort: medium
-fallback_models:
-  - openrouter/openai/gpt-oss-120b:free
-  - openrouter/nvidia/nemotron-3-super-120b-a12b:free
-  - openrouter/z-ai/glm-4.5-air:free
-  - openrouter/qwen/qwen3-coder:free
-  - openrouter/google/gemma-4-31b-it:free
-  - openrouter/meta-llama/llama-3.3-70b-instruct:free
 ---
 You are the @general-paid subagent. The orchestrator delegated this task to you because it requires quality writing or judgment — the kind of work `@general-free` isn't well-suited for.
-On OpenCode, this agent now defaults to a free OpenRouter model. On other
-harnesses, the same role may still resolve to a premium model. Your job is
-still the same: produce the best final writing you can from the context you
-were given.
+On OpenCode, this agent defaults to DeepSeek V4 Flash so application work
+does not fall back into overloaded free OpenRouter pools. On other harnesses,
+the same role may still resolve to a premium model. Your job is still the
+same: produce the best final writing you can from the context you were given.
 ## Do These Tasks

package/.opencode/agents/glm-minimal.md CHANGED

@@ -1,7 +1,7 @@
 ---
-description: Narrow-scope extractor on free-tier model. Use for single-purpose tasks where the orchestrator passes the exact input and expects a small, structured output — e.g., "extract these 8 fields from this JD text" or "parse this form schema into a label→type map". NOT for multi-step workflows.
+description: Narrow-scope extractor on the low-cost DeepSeek V4 Flash OpenCode route. Use for single-purpose tasks where the orchestrator passes the exact input and expects a small, structured output — e.g., "extract these 8 fields from this JD text" or "parse this form schema into a label→type map". NOT for multi-step workflows.
 mode: subagent
-model: openrouter/openai/gpt-oss-20b:free
+model: opencode-go/deepseek-v4-flash
 tools:
   geometra_*: false
   gmail_*: false
@@ -13,11 +13,6 @@ tools:
   task: false
 temperature: 0
 reasoningEffort: none
-fallback_models:
-  - openrouter/google/gemma-4-26b-a4b-it:free
-  - openrouter/nvidia/nemotron-nano-9b-v2:free
-  - openrouter/google/gemma-4-31b-it:free
-  - openrouter/z-ai/glm-4.5-air:free
 ---
 You are the @glm-minimal subagent. You handle narrow, one-shot extractions where the orchestrator has pre-digested the context and just needs you to do a specific transform.

package/AGENTS.md CHANGED

@@ -5,7 +5,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
 ## Hard limits
 - [H1] Max 2 parallel `task` dispatches per message. For N jobs, run `ceil(N/2)` sequential rounds of 2. A round is not complete until both subagents return a final outcome (`APPLIED`, `APPLY FAILED`, `SKIP`, `Discarded`, or a written TSV path). A `task` tool result that only gives a session id / title is a launch acknowledgement, not completion. Applies in all modes, for all user phrasings ("urgent", "apply to 10 jobs now").
-  why: higher parallelism blows through free-tier rate limits; each subagent requires post-cleanup and racing more than 2 reliably loses at least one result. On 2026-04-25 the orchestrator launched round 2 while round 1 had only returned task ids, leaving four application subagents in flight and losing two provider-fallback recoveries
+  why: each subagent requires post-cleanup and racing more than 2 reliably loses at least one result. On 2026-04-25 the orchestrator launched round 2 while round 1 had only returned task ids, leaving four application subagents in flight and losing two provider recoveries
 - [H2] Max 1 application per company+role. Before every `apply` dispatch, grep all four sources for the URL and for `company+role`: `data/pipeline.md`, all `data/applications/*.md` day files, `batch/tracker-additions/*.tsv`, `batch/tracker-additions/merged/*.tsv`. If any source shows APPLIED / Applied, skip the dispatch.
   why: 2026-04 same-day batch collision — when two batches target the same role, `npx job-forge merge` updates the existing day-file row rather than appending, so grepping day files alone misses earlier-batch applies; merged/*.tsv is the only place the breadcrumb remains
@@ -37,7 +37,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
   why: iso-trace showed 0.25% Agent calls across 5174 turns under a prior over-broad "delegate before 2nd tool call" rule — the rule was ignored in practice; narrowing matches the original cache-bust incident
 - [D2] Route subagent work by cost tier. `@general-free`: procedural — form-fill, TSV merge, verify, OTP retrieval, portal scan metadata extraction, one-shot structured-field transforms. `@general-paid`: quality-sensitive — offer evaluation narrative Blocks A-F, cover letters, "Why X?" answers, STAR interview stories, LinkedIn outreach. `@glm-minimal`: narrow ≤5K-input one-shot extract/classify jobs that do not need context.
-  why: GLM 5.1 doesn't discount cache reads so procedural work there costs ~10×; free-tier models handle procedural work fine empirically (`opencode/big-pickle` processed 1000+ messages at $0)
+  why: OpenCode routes all JobForge tiers through DeepSeek V4 Flash by default now; recent traces showed free OpenRouter fallbacks freezing or hitting provider balance errors during applications
 - [D3] Read the active mode file before dispatch. Mode files own score gates, provider fallback, portal runbooks, and output shape.
   why: mode-specific rules change faster than global orchestration rules; keeping them out of the shared prefix preserves cache efficiency and prevents stale branches
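
The H2 pre-dispatch check in the hunk above (search all four sources for the URL and for `company+role`, and skip if any shows APPLIED / Applied) might look something like the sketch below. The file formats are not visible in this diff, so it assumes a plain line-by-line text match over already-loaded file contents; `alreadyApplied` and its parameter shape are hypothetical.

```javascript
// Hypothetical sketch of the H2 duplicate check. `sourceTexts` is assumed to
// hold the contents of data/pipeline.md, the day files, and both TSV
// locations, already read into strings by the caller.
function alreadyApplied(sourceTexts, { url, company, role }) {
  const applied = /\bapplied\b/i; // matches "APPLIED" and "Applied"
  const lower = (s) => s.toLowerCase();
  return sourceTexts.some((text) =>
    text.split('\n').some((line) => {
      const l = lower(line);
      // A line refers to the same job if it carries the URL,
      // or both the company and the role.
      const sameJob =
        l.includes(lower(url)) ||
        (l.includes(lower(company)) && l.includes(lower(role)));
      return sameJob && applied.test(line);
    })
  );
}
```

Passing in all four sources together matters for the reason the rule gives: a merged TSV can be the only place an earlier-batch apply still shows up.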

package/CLAUDE.md CHANGED

@@ -5,7 +5,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
 ## Hard limits
 - [H1] Max 2 parallel `task` dispatches per message. For N jobs, run `ceil(N/2)` sequential rounds of 2. A round is not complete until both subagents return a final outcome (`APPLIED`, `APPLY FAILED`, `SKIP`, `Discarded`, or a written TSV path). A `task` tool result that only gives a session id / title is a launch acknowledgement, not completion. Applies in all modes, for all user phrasings ("urgent", "apply to 10 jobs now").
-  why: higher parallelism blows through free-tier rate limits; each subagent requires post-cleanup and racing more than 2 reliably loses at least one result. On 2026-04-25 the orchestrator launched round 2 while round 1 had only returned task ids, leaving four application subagents in flight and losing two provider-fallback recoveries
+  why: each subagent requires post-cleanup and racing more than 2 reliably loses at least one result. On 2026-04-25 the orchestrator launched round 2 while round 1 had only returned task ids, leaving four application subagents in flight and losing two provider recoveries
 - [H2] Max 1 application per company+role. Before every `apply` dispatch, grep all four sources for the URL and for `company+role`: `data/pipeline.md`, all `data/applications/*.md` day files, `batch/tracker-additions/*.tsv`, `batch/tracker-additions/merged/*.tsv`. If any source shows APPLIED / Applied, skip the dispatch.
   why: 2026-04 same-day batch collision — when two batches target the same role, `npx job-forge merge` updates the existing day-file row rather than appending, so grepping day files alone misses earlier-batch applies; merged/*.tsv is the only place the breadcrumb remains
@@ -37,7 +37,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
   why: iso-trace showed 0.25% Agent calls across 5174 turns under a prior over-broad "delegate before 2nd tool call" rule — the rule was ignored in practice; narrowing matches the original cache-bust incident
 - [D2] Route subagent work by cost tier. `@general-free`: procedural — form-fill, TSV merge, verify, OTP retrieval, portal scan metadata extraction, one-shot structured-field transforms. `@general-paid`: quality-sensitive — offer evaluation narrative Blocks A-F, cover letters, "Why X?" answers, STAR interview stories, LinkedIn outreach. `@glm-minimal`: narrow ≤5K-input one-shot extract/classify jobs that do not need context.
-  why: GLM 5.1 doesn't discount cache reads so procedural work there costs ~10×; free-tier models handle procedural work fine empirically (`opencode/big-pickle` processed 1000+ messages at $0)
+  why: OpenCode routes all JobForge tiers through DeepSeek V4 Flash by default now; recent traces showed free OpenRouter fallbacks freezing or hitting provider balance errors during applications
 - [D3] Read the active mode file before dispatch. Mode files own score gates, provider fallback, portal runbooks, and output shape.
   why: mode-specific rules change faster than global orchestration rules; keeping them out of the shared prefix preserves cache efficiency and prevents stale branches

package/README.md CHANGED

@@ -74,8 +74,8 @@ JobForge turns opencode into a full job search command center. Instead of manual
 | **Portal Scanner** | 45+ companies pre-configured with fuzzy dedup for reposts |
 | **Batch Processing** | Parallel evaluation with `opencode run` workers, with honest verification flagging |
 | **Pipeline Integrity** | Automated merge, dedup, status normalization, health checks |
-| **Cost-Aware Agent Routing** | Three subagents (`@general-free`, `@general-paid`, `@glm-minimal`) with per-task model tiers. On OpenCode, JobForge mixes native free models with free OpenRouter routes so the harness stays no-cost without forcing every task through the same provider. See [Subagent Routing in AGENTS.md](AGENTS.md) for the task-to-agent mapping. |
-| **Automatic Model Fallback** | When a model rate-limits or 5xx's, [`@razroo/opencode-model-fallback`](https://www.npmjs.com/package/@razroo/opencode-model-fallback) rotates the agent through a configured `fallback_models` chain and replays the request. JobForge's OpenCode defaults stay on free models for both primaries and fallbacks. |
+| **Cost-Aware Agent Routing** | Three subagents (`@general-free`, `@general-paid`, `@glm-minimal`) with per-task tool surfaces. On OpenCode, JobForge pins all tiers to `opencode-go/deepseek-v4-flash` so application runs avoid overloaded free-model pools. See [Subagent Routing in AGENTS.md](AGENTS.md) for the task-to-agent mapping. |
+| **Trace + Telemetry** | `job-forge trace:*` exposes local OpenCode transcripts, and `job-forge telemetry:*` summarizes runs, child outcomes, provider errors, and pending tracker TSVs. |
 | **Token Cost Visibility** | `job-forge tokens --days 1` for per-session breakdown; `job-forge session-report --since-minutes 60 --log` to flag sessions over budget and append history to `data/token-usage.tsv`. Auto-logged after every batch run. |
 ## Usage

package/bin/create-job-forge.mjs CHANGED

@@ -113,22 +113,18 @@ const consumerPkg = {
     'trace:list': 'job-forge trace:list',
     'trace:stats': 'job-forge trace:stats',
     'trace:show': 'job-forge trace:show',
-    // One command to pull the latest harness, companion plugin, and any
-    // locally-pinned MCP packages. npm update is a no-op on packages not
-    // in package.json, so listing @razroo/gmail-mcp + @geometra/mcp is
-    // safe for consumers that invoke them via `npx -y` without pinning.
-    'update-harness': 'npm update job-forge @razroo/opencode-model-fallback @razroo/gmail-mcp @geometra/mcp && job-forge sync && node -e "console.log(\'✅ harness at\', require(\'./package-lock.json\').packages[\'node_modules/job-forge\'].resolved)"',
+    'telemetry:list': 'job-forge telemetry:list',
+    'telemetry:status': 'job-forge telemetry:status',
+    'telemetry:show': 'job-forge telemetry:show',
+    'telemetry:watch': 'job-forge telemetry:watch',
+    // One command to pull the latest harness and any locally-pinned MCP
+    // packages. npm update is a no-op on packages not in package.json, so
+    // listing @razroo/gmail-mcp + @geometra/mcp is safe for consumers that
+    // invoke them via `npx -y` without pinning.
+    'update-harness': 'npm update job-forge @razroo/gmail-mcp @geometra/mcp && job-forge sync && node -e "console.log(\'✅ harness at\', require(\'./package-lock.json\').packages[\'node_modules/job-forge\'].resolved)"',
   },
   dependencies: {
     'job-forge': '^2.0.0',
-    // Model-fallback plugin: rotates agents through their fallback_models
-    // chain on rate-limit / 5xx errors so a rate-limited free-tier model
-    // doesn't wedge the whole flow. The chains live upstream in each
-    // agent's MD frontmatter (`.opencode/agents/*.md` in the harness);
-    // consumers can override individual chains by adding their own
-    // agent.<name>.fallback_models block to opencode.json. Requires
-    // 0.3.1+ for the frontmatter-merge path.
-    '@razroo/opencode-model-fallback': '^0.3.1',
   },
   engines: { node: '>=18' },
 };
@@ -138,16 +134,11 @@ write('package.json', JSON.stringify(consumerPkg, null, 2) + '\n');
 const opencodeCfg = {
   $schema: 'https://opencode.ai/config.json',
-  // Keep the top-level orchestrator on a free model too. Subagents pin
-  // their own models in .opencode/agents/*.md; this covers the main chat
-  // session and any commands that don't hop to a subagent immediately.
-  model: 'openrouter/qwen/qwen3-coder:free',
-  small_model: 'openrouter/google/gemma-4-26b-a4b-it:free',
-  // Model-fallback plugin: on rate-limit / 5xx / known provider errors,
-  // rotates the agent's model to the next entry in its fallback_models
-  // chain (see `agent` below) and replays the request. Without this, a
-  // rate-limited free-tier model wedges the whole subagent flow.
-  plugin: ['@razroo/opencode-model-fallback'],
+  // Keep the top-level orchestrator on JobForge's low-cost paid OpenCode
+  // route. Subagents pin the same route in .opencode/agents/*.md so job
+  // applications do not fall through overloaded free OpenRouter pools.
+  model: 'opencode-go/deepseek-v4-flash',
+  small_model: 'opencode-go/deepseek-v4-flash',
   // Files listed here load into every session's cached prefix, so they're
   // cached once (on Anthropic) instead of Read-as-tool-call on every session.
   //   AGENTS.harness.md → symlink to node_modules/job-forge/AGENTS.md (harness rules)
@@ -178,31 +169,6 @@ const opencodeCfg = {
       environment: { DISABLE_HTTP: 'true' },
     },
   },
-  // Register the exact OpenRouter free models the harness uses so they're
-  // selectable even if they are not in OpenCode's built-in preloaded set.
-  // This list is a superset: role primaries, per-agent fallback chains,
-  // and the orchestrator fallback chain.
-  provider: {
-    openrouter: {
-      models: {
-        // Orchestrator + agentic coding (role default)
-        'qwen/qwen3-coder:free': {},
-        // Role primaries
-        'z-ai/glm-4.5-air:free': {}, // fast
-        'qwen/qwen3-next-80b-a3b-instruct:free': {}, // quality
-        'openai/gpt-oss-20b:free': {}, // minimal
-        // Common fallbacks
-        'openai/gpt-oss-120b:free': {},
-        'minimax/minimax-m2.5:free': {},
-        'nvidia/nemotron-3-super-120b-a12b:free': {},
-        'nvidia/nemotron-3-nano-30b-a3b:free': {},
-        'nvidia/nemotron-nano-9b-v2:free': {},
-        'google/gemma-4-26b-a4b-it:free': {},
-        'google/gemma-4-31b-it:free': {},
-        'meta-llama/llama-3.3-70b-instruct:free': {},
-      },
-    },
-  },
   // Restrict the primary orchestrator to dispatching only the three harness
   // subagents. Prevents accidental self-calls or unregistered agents.
   // Override locally in opencode.json if you add project-specific agents.

package/bin/job-forge.mjs CHANGED

@@ -18,6 +18,7 @@
  *   sync-check     Run cv-sync-check.mjs
  *   tokens         Run scripts/token-usage-report.mjs
  *   trace:*        Inspect local agent transcripts via iso-trace
+ *   telemetry:*    Summarize JobForge pipeline status from traces + tracker files
  *   sync           Re-run the harness symlink sync (bin/sync.mjs)
  *   help, --help   Show this message
  */
@@ -60,6 +61,13 @@ const traceAliases = {
   'trace:show': 'show',
 };
+const telemetryAliases = {
+  'telemetry:list': 'list',
+  'telemetry:status': 'status',
+  'telemetry:show': 'show',
+  'telemetry:watch': 'watch',
+};
 const [, , cmd, ...rest] = process.argv;
 function printHelp() {
@@ -80,6 +88,10 @@ Commands:
   trace:list     List recent local agent sessions (defaults: --since 7d --cwd project)
   trace:stats    Show trace stats (defaults: --since 7d --cwd project)
   trace:show ID  Show one trace by id or prefix
+  telemetry:list    List recent JobForge runs with tasks/outcomes/issues
+  telemetry:status  Show latest JobForge run + pending tracker state
+  telemetry:show ID Show one run with child sessions, provider errors, next actions
+  telemetry:watch   Watch latest run status
   sync           Re-create harness symlinks in the current project
 Deterministic helpers (prefer these over LLM-derived values):
@@ -104,6 +116,8 @@ Pass --help after a command to see its own flags, e.g.:
   job-forge slugify "Anthropic, PBC"
   job-forge trace:list --since 24h
   job-forge trace:show ses_...
+  job-forge telemetry:status
+  job-forge telemetry:show ses_...
 Project directory resolves to $JOB_FORGE_PROJECT or cwd.`);
 }
@@ -128,6 +142,21 @@ if (cmd === 'trace' || traceAliases[cmd]) {
   process.exit(result.status ?? 1);
 }
+if (cmd === 'telemetry' || telemetryAliases[cmd]) {
+  const telemetryArgs = cmd === 'telemetry'
+    ? (rest.length === 0 ? ['help'] : rest)
+    : [telemetryAliases[cmd], ...rest];
+  const scriptPath = join(PKG_ROOT, 'scripts/telemetry.mjs');
+  const result = spawnSync(process.execPath, [scriptPath, ...telemetryArgs], {
+    stdio: 'inherit',
+    cwd: PROJECT_DIR,
+    env: process.env,
+  });
+  process.exit(result.status ?? 1);
+}
 const rel = commands[cmd];
 if (!rel) {
   console.error(`Unknown command: ${cmd}\n`);
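
The argument handling in the new `telemetry` branch above can be isolated into a small pure function for illustration. The alias table is copied from the diff; `resolveTelemetryArgs` is a hypothetical helper that does not exist in the package, and it uses `Object.hasOwn` where the diff uses a truthiness check, which is an assumption of this sketch rather than the package's behavior.

```javascript
// Alias table as added in this diff.
const telemetryAliases = {
  'telemetry:list': 'list',
  'telemetry:status': 'status',
  'telemetry:show': 'show',
  'telemetry:watch': 'watch',
};

// Hypothetical helper: returns the argument list that would be forwarded to
// scripts/telemetry.mjs, or null when `cmd` is not a telemetry command.
function resolveTelemetryArgs(cmd, rest) {
  if (cmd === 'telemetry') {
    // Bare `job-forge telemetry` falls back to the script's help output.
    return rest.length === 0 ? ['help'] : rest;
  }
  if (Object.hasOwn(telemetryAliases, cmd)) {
    // `telemetry:show ses_...` becomes `show ses_...`.
    return [telemetryAliases[cmd], ...rest];
  }
  return null;
}
```

For example, `resolveTelemetryArgs('telemetry:show', ['ses_1'])` yields `['show', 'ses_1']`, which the branch in the diff would then pass to `spawnSync` after the script path.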

package/bin/sync.mjs CHANGED

@@ -75,7 +75,6 @@ const links = [
   // OpenCode: skill router + subagent definitions. Users can override any
   // single subagent by replacing its symlink with a local file.
-  { src: '.opencode/opencode-model-fallback.json', dst: '.opencode/opencode-model-fallback.json' },
   { src: '.opencode/skills/job-forge.md',  dst: '.opencode/skills/job-forge.md' },
   { src: '.opencode/agents',               dst: '.opencode/agents' },

package/docs/ARCHITECTURE.md CHANGED

@@ -43,11 +43,11 @@ The consumer's `opencode.json` loads a small set of stable files as always-prese
 The skill router (`.opencode/skills/job-forge.md`) loads mode and data files on demand, keeping per-session input tokens low (~20-40K for most modes instead of ~130-170K when everything was force-loaded).
-**Cost-tiered subagents** live in `.opencode/agents/` (`general-free`, `general-paid`, `glm-minimal`). On OpenCode, JobForge now uses a mix of native free models and free OpenRouter routes, with different quality/latency tiers per task shape. See [MODEL-ROUTING.md](MODEL-ROUTING.md) for the routing architecture, why it exists, and how to customize.
+**Cost-tiered subagents** live in `.opencode/agents/` (`general-free`, `general-paid`, `glm-minimal`). On OpenCode, JobForge pins all three tiers to `opencode-go/deepseek-v4-flash` by default, while the tiers still differ by tool surface, reasoning budget, and task prompt. See [MODEL-ROUTING.md](MODEL-ROUTING.md) for the routing architecture, why it exists, and how to customize.
 **Multi-harness support.** Because `iso/` is the single source of truth, publishing ships config for OpenCode, Cursor, Claude Code, and Codex in one tarball. Consumers run any of `opencode`, `cursor`, `claude`, or `codex` in the project and each picks up the shared MCP config + instructions via the symlinks above.
-**Upgrading** the harness in a consumer project is `npm run update-harness` — pulls the latest `job-forge` from npm, refreshes the fallback plugin + pinned MCPs, re-runs symlink sync, and prints the resolved version.
+**Upgrading** the harness in a consumer project is `npm run update-harness` — pulls the latest `job-forge` from npm, refreshes pinned MCPs, re-runs symlink sync, and prints the resolved version.
 ## System Overview
@@ -204,6 +204,7 @@ Scripts maintain data consistency. In a consumer project they're invoked via the
 | `cv-sync-check.mjs` | `npx job-forge sync-check` | Setup lint: `cv.md` + `config/profile.yml`, hardcoded-metric scan on `modes/_shared.md` and `batch/batch-prompt.md`, optional `article-digest.md` freshness |
 | `scripts/token-usage-report.mjs` | `npx job-forge tokens` | Per-session opencode token/cost report from the SQLite DB |
 | `scripts/trace.mjs` | `npx job-forge trace:list` / `trace:stats` / `trace:show` | Local transcript observability via `@razroo/iso-trace`; common commands default to OpenCode sessions for the consumer project |
+| `scripts/telemetry.mjs` | `npx job-forge telemetry:status` / `telemetry:show` | JobForge operational telemetry derived from OpenCode traces plus tracker TSV state |
 | `tracker-lib.mjs` | _(library)_ | Shared helpers for reading/writing day-based tracker files — imported by merge/dedup/verify/normalize |
 | `bin/sync.mjs` | `npx job-forge sync` | Creates the harness symlinks in a consumer project (also runs as `postinstall`) |
 | `bin/create-job-forge.mjs` | `npx create-job-forge <dir>` | Scaffolds a new personal project |

package/docs/CUSTOMIZATION.md CHANGED

@@ -112,6 +112,19 @@ Scaffolded projects also include npm aliases: `npm run trace:list`, `npm run tra
 For raw iso-trace commands, use `npx job-forge trace sources`, `npx job-forge trace where`, or any other `iso-trace` subcommand after `trace`.
+## JobForge telemetry
+Trace is the raw transcript view. Telemetry is the JobForge operational view: it summarizes task dispatches, child session outcomes, provider errors, policy issues, and pending tracker TSVs.
+```bash
+npx job-forge telemetry:list
+npx job-forge telemetry:status
+npx job-forge telemetry:show <session-id-or-prefix>
+npx job-forge telemetry:watch
+```
+Telemetry is also local-only and passive. It reads OpenCode's SQLite DB and files under `batch/tracker-additions/`; agents do not need to remember to emit custom events.
 **Where Claude Code writes JSONL:** `~/.claude/projects/<encoded-cwd>/*.jsonl`.
 **Direct CLI fallback:** `npx -y @razroo/iso-trace@latest stats --source "$HOME/.claude/projects/<encoded-dir>/<session>.jsonl"`