job-forge 2.14.36 → 2.14.37
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/.claude/agents/general-free.md +3 -2
- package/.codex/config.toml +1 -1
- package/.cursor/mcp.json +1 -1
- package/.cursor/rules/agent-general-free.mdc +3 -2
- package/.cursor/rules/main.mdc +3 -3
- package/.mcp.json +1 -1
- package/.opencode/agents/general-free.md +3 -2
- package/.opencode/skills/job-forge.md +1 -1
- package/AGENTS.md +3 -3
- package/CLAUDE.md +3 -3
- package/bin/create-job-forge.mjs +2 -2
- package/config/profile.example.yml +8 -5
- package/docs/SETUP.md +1 -1
- package/iso/agents/general-free.md +3 -2
- package/iso/commands/job-forge.md +1 -1
- package/iso/instructions.md +3 -3
- package/iso/mcp.json +1 -1
- package/modes/apply.md +4 -3
- package/modes/auto-pipeline.md +2 -2
- package/modes/pipeline.md +2 -2
- package/modes/reference-geometra.md +5 -5
- package/modes/reference-portals.md +10 -10
- package/modes/scan.md +3 -3
- package/opencode.json +1 -1
- package/package.json +1 -1
- package/scripts/check-iso-smoke.mjs +1 -0
|
@@ -17,7 +17,8 @@ Call 3: geometra_connect({
|
|
|
17
17
|
pageUrl: "<the URL from the orchestrator's task>",
|
|
18
18
|
isolated: true,
|
|
19
19
|
headless: true,
|
|
20
|
-
slowMo: 350
|
|
20
|
+
slowMo: 350,
|
|
21
|
+
stealth: true
|
|
21
22
|
})
|
|
22
23
|
```
|
|
23
24
|
|
|
@@ -25,7 +26,7 @@ Call 3: geometra_connect({
|
|
|
25
26
|
|
|
26
27
|
1. **Always run Call 1 and Call 2.** Do not skip Call 2 even if Call 1 returns an empty session list. `geometra_disconnect({ closeBrowser: true })` is a safe no-op on an empty pool.
|
|
27
28
|
2. **Do not reason about Call 1's output.** Don't look at it and decide "the pool looks clean, I'll skip Call 2". Just always call Call 2 next. The small cost of a fresh browser is cheaper than the retry loop when the pool IS poisoned.
|
|
28
|
-
3. **Always use `isolated: true, headless: true, slowMo: 350`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
29
|
+
3. **Always use `isolated: true, headless: true, slowMo: 350, stealth: true`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
29
30
|
4. **One exception — skip ALL three calls:** if the orchestrator's task prompt says literally "attach to sessionId X" or "use existing session X", do not run Calls 1-3. Go straight to `geometra_page_model({ sessionId: "X" })` and proceed.
|
|
30
31
|
|
|
31
32
|
### Read Why This Exists
|
package/.codex/config.toml
CHANGED
package/.cursor/mcp.json
CHANGED
|
@@ -16,7 +16,8 @@ Call 3: geometra_connect({
|
|
|
16
16
|
pageUrl: "<the URL from the orchestrator's task>",
|
|
17
17
|
isolated: true,
|
|
18
18
|
headless: true,
|
|
19
|
-
slowMo: 350
|
|
19
|
+
slowMo: 350,
|
|
20
|
+
stealth: true
|
|
20
21
|
})
|
|
21
22
|
```
|
|
22
23
|
|
|
@@ -24,7 +25,7 @@ Call 3: geometra_connect({
|
|
|
24
25
|
|
|
25
26
|
1. **Always run Call 1 and Call 2.** Do not skip Call 2 even if Call 1 returns an empty session list. `geometra_disconnect({ closeBrowser: true })` is a safe no-op on an empty pool.
|
|
26
27
|
2. **Do not reason about Call 1's output.** Don't look at it and decide "the pool looks clean, I'll skip Call 2". Just always call Call 2 next. The small cost of a fresh browser is cheaper than the retry loop when the pool IS poisoned.
|
|
27
|
-
3. **Always use `isolated: true, headless: true, slowMo: 350`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
28
|
+
3. **Always use `isolated: true, headless: true, slowMo: 350, stealth: true`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
28
29
|
4. **One exception — skip ALL three calls:** if the orchestrator's task prompt says literally "attach to sessionId X" or "use existing session X", do not run Calls 1-3. Go straight to `geometra_page_model({ sessionId: "X" })` and proceed.
|
|
29
30
|
|
|
30
31
|
### Read Why This Exists
|
package/.cursor/rules/main.mdc
CHANGED
|
@@ -33,8 +33,8 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
33
33
|
- [H7] Load-bearing facts passed to downstream subagents must originate from a file, not from prior subagent prose. Authoritative sources: `data/pipeline.md`, `data/scan-history.tsv`, `batch/scan-output-*.md`, `reports/{num}-*.md` with `**URL:**` / `**Score:**` headers, emitted score JSON validated by `npx job-forge score:check --input ...`, `batch/tracker-additions/*.tsv`, cached JD content returned by `npx job-forge cache:get --url ...`, source path/line pointers returned by `npx job-forge index:query ...`, materialized fact records returned by `npx job-forge facts:query ...`, selected next actions returned by `npx job-forge prioritize:select ...`, and lineage records returned by `npx job-forge lineage:explain ...`.
|
|
34
34
|
why: 2026-04-18 scan subagent returned 30 fabricated Greenhouse IDs in prose (plausible-looking, non-existent); orchestrator dispatched 30 downstream subagents that all 404'd. Subagents can hallucinate IDs, scores, and confirmation text — round-trip through a file or don't trust the value
|
|
35
35
|
|
|
36
|
-
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
37
|
-
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions
|
|
36
|
+
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object plus `stealth: true` to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
37
|
+
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions. Geometra MCP >=1.61.3 can launch CloakBrowser stealth Chromium via `stealth: true`, which belongs with JobForge portal sessions instead of stock Playwright Chromium
|
|
38
38
|
|
|
39
39
|
## Defaults
|
|
40
40
|
|
|
@@ -67,7 +67,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
67
67
|
1. Check `cv.md`, `profile.yml`, and `portals.yml`; onboard if any file is missing.
|
|
68
68
|
2. Pick and name the mode from **Routing** [D6]. No match → ask; do not guess.
|
|
69
69
|
3. Read the active mode file [D3]. Use local helpers when they can replace broad file reads, prose math, manual policy checks, or artifact reuse decisions [D8]. Decide inline vs delegated work [D1].
|
|
70
|
-
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy prompt hygiene [H8].
|
|
70
|
+
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy/stealth prompt hygiene [H8].
|
|
71
71
|
5. Dispatch at most 2 tasks per round [H1]; wait for final outcomes, not just task ids [H5b], then settle the round with postflight status [D8].
|
|
72
72
|
6. Keep multi-job form-filling out of the orchestrator [H4].
|
|
73
73
|
7. Cross-check subagent facts against authoritative files [H7].
|
package/.mcp.json
CHANGED
|
@@ -32,7 +32,8 @@ Call 3: geometra_connect({
|
|
|
32
32
|
pageUrl: "<the URL from the orchestrator's task>",
|
|
33
33
|
isolated: true,
|
|
34
34
|
headless: true,
|
|
35
|
-
slowMo: 350
|
|
35
|
+
slowMo: 350,
|
|
36
|
+
stealth: true
|
|
36
37
|
})
|
|
37
38
|
```
|
|
38
39
|
|
|
@@ -40,7 +41,7 @@ Call 3: geometra_connect({
|
|
|
40
41
|
|
|
41
42
|
1. **Always run Call 1 and Call 2.** Do not skip Call 2 even if Call 1 returns an empty session list. `geometra_disconnect({ closeBrowser: true })` is a safe no-op on an empty pool.
|
|
42
43
|
2. **Do not reason about Call 1's output.** Don't look at it and decide "the pool looks clean, I'll skip Call 2". Just always call Call 2 next. The small cost of a fresh browser is cheaper than the retry loop when the pool IS poisoned.
|
|
43
|
-
3. **Always use `isolated: true, headless: true, slowMo: 350`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
44
|
+
3. **Always use `isolated: true, headless: true, slowMo: 350, stealth: true`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
44
45
|
4. **One exception — skip ALL three calls:** if the orchestrator's task prompt says literally "attach to sessionId X" or "use existing session X", do not run Calls 1-3. Go straight to `geometra_page_model({ sessionId: "X" })` and proceed.
|
|
45
46
|
|
|
46
47
|
### Read Why This Exists
|
|
@@ -231,7 +231,7 @@ Step 5 — Loop in rounds of 2 (Hard Limit #1)
|
|
|
231
231
|
pair = candidates[round*2 : round*2 + 2]
|
|
232
232
|
# If proxy is configured, do not paste proxy values into prompts.
|
|
233
233
|
# Say: "Proxy is configured; read config/profile.yml and pass its
|
|
234
|
-
# top-level proxy object to every geometra_connect call."
|
|
234
|
+
# top-level proxy object plus stealth: true to every geometra_connect call."
|
|
235
235
|
# Dispatch 1 or 2 task() calls in ONE message (never 3+)
|
|
236
236
|
task(subagent_type=<tier per AGENTS.md routing>, prompt=<apply prompt for pair[0]>)
|
|
237
237
|
task(subagent_type=<tier>, prompt=<apply prompt for pair[1]>) # only if pair has 2
|
package/AGENTS.md
CHANGED
|
@@ -28,8 +28,8 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
28
28
|
- [H7] Load-bearing facts passed to downstream subagents must originate from a file, not from prior subagent prose. Authoritative sources: `data/pipeline.md`, `data/scan-history.tsv`, `batch/scan-output-*.md`, `reports/{num}-*.md` with `**URL:**` / `**Score:**` headers, emitted score JSON validated by `npx job-forge score:check --input ...`, `batch/tracker-additions/*.tsv`, cached JD content returned by `npx job-forge cache:get --url ...`, source path/line pointers returned by `npx job-forge index:query ...`, materialized fact records returned by `npx job-forge facts:query ...`, selected next actions returned by `npx job-forge prioritize:select ...`, and lineage records returned by `npx job-forge lineage:explain ...`.
|
|
29
29
|
why: 2026-04-18 scan subagent returned 30 fabricated Greenhouse IDs in prose (plausible-looking, non-existent); orchestrator dispatched 30 downstream subagents that all 404'd. Subagents can hallucinate IDs, scores, and confirmation text — round-trip through a file or don't trust the value
|
|
30
30
|
|
|
31
|
-
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
32
|
-
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions
|
|
31
|
+
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object plus `stealth: true` to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
32
|
+
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions. Geometra MCP >=1.61.3 can launch CloakBrowser stealth Chromium via `stealth: true`, which belongs with JobForge portal sessions instead of stock Playwright Chromium
|
|
33
33
|
|
|
34
34
|
## Defaults
|
|
35
35
|
|
|
@@ -62,7 +62,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
62
62
|
1. Check `cv.md`, `profile.yml`, and `portals.yml`; onboard if any file is missing.
|
|
63
63
|
2. Pick and name the mode from **Routing** [D6]. No match → ask; do not guess.
|
|
64
64
|
3. Read the active mode file [D3]. Use local helpers when they can replace broad file reads, prose math, manual policy checks, or artifact reuse decisions [D8]. Decide inline vs delegated work [D1].
|
|
65
|
-
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy prompt hygiene [H8].
|
|
65
|
+
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy/stealth prompt hygiene [H8].
|
|
66
66
|
5. Dispatch at most 2 tasks per round [H1]; wait for final outcomes, not just task ids [H5b], then settle the round with postflight status [D8].
|
|
67
67
|
6. Keep multi-job form-filling out of the orchestrator [H4].
|
|
68
68
|
7. Cross-check subagent facts against authoritative files [H7].
|
package/CLAUDE.md
CHANGED
|
@@ -28,8 +28,8 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
28
28
|
- [H7] Load-bearing facts passed to downstream subagents must originate from a file, not from prior subagent prose. Authoritative sources: `data/pipeline.md`, `data/scan-history.tsv`, `batch/scan-output-*.md`, `reports/{num}-*.md` with `**URL:**` / `**Score:**` headers, emitted score JSON validated by `npx job-forge score:check --input ...`, `batch/tracker-additions/*.tsv`, cached JD content returned by `npx job-forge cache:get --url ...`, source path/line pointers returned by `npx job-forge index:query ...`, materialized fact records returned by `npx job-forge facts:query ...`, selected next actions returned by `npx job-forge prioritize:select ...`, and lineage records returned by `npx job-forge lineage:explain ...`.
|
|
29
29
|
why: 2026-04-18 scan subagent returned 30 fabricated Greenhouse IDs in prose (plausible-looking, non-existent); orchestrator dispatched 30 downstream subagents that all 404'd. Subagents can hallucinate IDs, scores, and confirmation text — round-trip through a file or don't trust the value
|
|
30
30
|
|
|
31
|
-
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
32
|
-
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions
|
|
31
|
+
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object plus `stealth: true` to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
32
|
+
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions. Geometra MCP >=1.61.3 can launch CloakBrowser stealth Chromium via `stealth: true`, which belongs with JobForge portal sessions instead of stock Playwright Chromium
|
|
33
33
|
|
|
34
34
|
## Defaults
|
|
35
35
|
|
|
@@ -62,7 +62,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
62
62
|
1. Check `cv.md`, `profile.yml`, and `portals.yml`; onboard if any file is missing.
|
|
63
63
|
2. Pick and name the mode from **Routing** [D6]. No match → ask; do not guess.
|
|
64
64
|
3. Read the active mode file [D3]. Use local helpers when they can replace broad file reads, prose math, manual policy checks, or artifact reuse decisions [D8]. Decide inline vs delegated work [D1].
|
|
65
|
-
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy prompt hygiene [H8].
|
|
65
|
+
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy/stealth prompt hygiene [H8].
|
|
66
66
|
5. Dispatch at most 2 tasks per round [H1]; wait for final outcomes, not just task ids [H5b], then settle the round with postflight status [D8].
|
|
67
67
|
6. Keep multi-job form-filling out of the orchestrator [H4].
|
|
68
68
|
7. Cross-check subagent facts against authoritative files [H7].
|
package/bin/create-job-forge.mjs
CHANGED
|
@@ -202,7 +202,7 @@ const consumerPkg = {
|
|
|
202
202
|
// One command to pull the latest harness and any locally-pinned MCP
|
|
203
203
|
// packages. npm update is a no-op on packages not in package.json, so
|
|
204
204
|
// listing @razroo/gmail-mcp + @geometra/mcp is safe for consumers that
|
|
205
|
-
// invoke them via
|
|
205
|
+
// invoke them via npx without local package pins.
|
|
206
206
|
'update-harness': 'npm update job-forge @razroo/gmail-mcp @geometra/mcp && job-forge sync && node -e "console.log(\'✅ harness at\', require(\'./package-lock.json\').packages[\'node_modules/job-forge\'].resolved)"',
|
|
207
207
|
},
|
|
208
208
|
dependencies: {
|
|
@@ -237,7 +237,7 @@ const opencodeCfg = {
|
|
|
237
237
|
mcp: {
|
|
238
238
|
geometra: {
|
|
239
239
|
type: 'local',
|
|
240
|
-
command: ['npx', '-y', '@geometra/mcp'],
|
|
240
|
+
command: ['npx', '-y', '@geometra/mcp@1.61.3'],
|
|
241
241
|
enabled: true,
|
|
242
242
|
},
|
|
243
243
|
gmail: {
|
|
@@ -87,19 +87,22 @@ location_constraints:
|
|
|
87
87
|
requires_visa_sponsorship: false # true → roles in non-authorized countries are blocked unless
|
|
88
88
|
# the JD explicitly mentions visa sponsorship
|
|
89
89
|
|
|
90
|
-
# Optional outbound proxy for the Chromium that Geometra MCP spawns.
|
|
90
|
+
# Optional outbound proxy for the stealth Chromium that Geometra MCP spawns.
|
|
91
91
|
# Uncomment and fill in to route ALL browser traffic through a residential /
|
|
92
92
|
# mobile / SOCKS proxy you already pay for. Bypasses the datacenter-IP
|
|
93
93
|
# fingerprinting that drives ~80-90% of Ashby / Lever / Cloudflare-fronted
|
|
94
|
-
# "flagged as possible spam" submit failures in headless mode.
|
|
94
|
+
# "flagged as possible spam" submit failures in headless mode. JobForge passes
|
|
95
|
+
# `stealth: true` by default so Geometra MCP >= 1.61.3 launches CloakBrowser's
|
|
96
|
+
# patched Chromium for portal sessions.
|
|
95
97
|
#
|
|
96
98
|
# BYO — JobForge does NOT bundle or resell proxy bandwidth. Pick a residential
|
|
97
99
|
# or mobile provider (Bright Data, Oxylabs, SOAX, Smartproxy, etc.), or a
|
|
98
|
-
# mobile hotspot, or your own SOCKS relay. Required: Geometra MCP >= 1.
|
|
100
|
+
# mobile hotspot, or your own SOCKS relay. Required: Geometra MCP >= 1.61.3.
|
|
99
101
|
#
|
|
100
102
|
# When present, the apply / scan / auto-pipeline modes thread this into every
|
|
101
|
-
# `geometra_connect` call as `proxy: {...}`. Pool is
|
|
102
|
-
# identity so direct and proxied sessions
|
|
103
|
+
# `geometra_connect` call as `proxy: {...}` alongside `stealth: true`. Pool is
|
|
104
|
+
# partitioned by proxy identity and stealth mode so direct and proxied sessions
|
|
105
|
+
# never share a Chromium.
|
|
103
106
|
#
|
|
104
107
|
# proxy:
|
|
105
108
|
# server: "http://residential.example.com:8080" # http://, https://, or socks5://
|
package/docs/SETUP.md
CHANGED
|
@@ -214,7 +214,7 @@ Use it to identify which sessions or models are consuming the most tokens. The `
|
|
|
214
214
|
`sync-check` requires `cv.md` and `config/profile.yml` with the fields checked in `cv-sync-check.mjs`. Until you finish the profile and CV steps, that is normal.
|
|
215
215
|
|
|
216
216
|
**PDF generation fails**
|
|
217
|
-
The scaffolded `opencode.json` already registers Geometra MCP; if it's not running, check `opencode mcp list` and verify the scaffolded config under the `mcp.geometra` key — its `command` MUST be `["npx", "-y", "@geometra/mcp"]` and `enabled: true`. Geometra manages Chromium via its built-in proxy. For standalone CLI usage (outside opencode), `generate-pdf.mjs` also works with standalone Playwright/Chromium — install with `npx playwright install chromium`.
|
|
217
|
+
The scaffolded `opencode.json` already registers Geometra MCP; if it's not running, check `opencode mcp list` and verify the scaffolded config under the `mcp.geometra` key — its `command` MUST be `["npx", "-y", "@geometra/mcp@1.61.3"]` and `enabled: true`. Geometra manages Chromium via its built-in proxy. JobForge passes `stealth: true` for portal sessions so Geometra launches CloakBrowser's patched Chromium. For standalone CLI usage (outside opencode), `generate-pdf.mjs` also works with standalone Playwright/Chromium — install with `npx playwright install chromium`.
|
|
218
218
|
|
|
219
219
|
**Symlinks are missing or pointing to a stale path**
|
|
220
220
|
Run `npx job-forge sync` (or `npm run sync`) to recreate them. This happens if you move the project directory after installing, or if `postinstall` didn't run (rare — check `npm install` output for errors).
|
|
@@ -39,7 +39,8 @@ Call 3: geometra_connect({
|
|
|
39
39
|
pageUrl: "<the URL from the orchestrator's task>",
|
|
40
40
|
isolated: true,
|
|
41
41
|
headless: true,
|
|
42
|
-
slowMo: 350
|
|
42
|
+
slowMo: 350,
|
|
43
|
+
stealth: true
|
|
43
44
|
})
|
|
44
45
|
```
|
|
45
46
|
|
|
@@ -47,7 +48,7 @@ Call 3: geometra_connect({
|
|
|
47
48
|
|
|
48
49
|
1. **Always run Call 1 and Call 2.** Do not skip Call 2 even if Call 1 returns an empty session list. `geometra_disconnect({ closeBrowser: true })` is a safe no-op on an empty pool.
|
|
49
50
|
2. **Do not reason about Call 1's output.** Don't look at it and decide "the pool looks clean, I'll skip Call 2". Just always call Call 2 next. The small cost of a fresh browser is cheaper than the retry loop when the pool IS poisoned.
|
|
50
|
-
3. **Always use `isolated: true, headless: true, slowMo: 350`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
51
|
+
3. **Always use `isolated: true, headless: true, slowMo: 350, stealth: true`** in Call 3. No other values. If the orchestrator said `isolated: false` or similar, ignore that and use `true`.
|
|
51
52
|
4. **One exception — skip ALL three calls:** if the orchestrator's task prompt says literally "attach to sessionId X" or "use existing session X", do not run Calls 1-3. Go straight to `geometra_page_model({ sessionId: "X" })` and proceed.
|
|
52
53
|
|
|
53
54
|
### Read Why This Exists
|
|
@@ -234,7 +234,7 @@ Step 5 — Loop in rounds of 2 (Hard Limit #1)
|
|
|
234
234
|
pair = candidates[round*2 : round*2 + 2]
|
|
235
235
|
# If proxy is configured, do not paste proxy values into prompts.
|
|
236
236
|
# Say: "Proxy is configured; read config/profile.yml and pass its
|
|
237
|
-
# top-level proxy object to every geometra_connect call."
|
|
237
|
+
# top-level proxy object plus stealth: true to every geometra_connect call."
|
|
238
238
|
# Dispatch 1 or 2 task() calls in ONE message (never 3+)
|
|
239
239
|
task(subagent_type=<tier per AGENTS.md routing>, prompt=<apply prompt for pair[0]>)
|
|
240
240
|
task(subagent_type=<tier>, prompt=<apply prompt for pair[1]>) # only if pair has 2
|
package/iso/instructions.md
CHANGED
|
@@ -28,8 +28,8 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
28
28
|
- [H7] Load-bearing facts passed to downstream subagents must originate from a file, not from prior subagent prose. Authoritative sources: `data/pipeline.md`, `data/scan-history.tsv`, `batch/scan-output-*.md`, `reports/{num}-*.md` with `**URL:**` / `**Score:**` headers, emitted score JSON validated by `npx job-forge score:check --input ...`, `batch/tracker-additions/*.tsv`, cached JD content returned by `npx job-forge cache:get --url ...`, source path/line pointers returned by `npx job-forge index:query ...`, materialized fact records returned by `npx job-forge facts:query ...`, selected next actions returned by `npx job-forge prioritize:select ...`, and lineage records returned by `npx job-forge lineage:explain ...`.
|
|
29
29
|
why: 2026-04-18 scan subagent returned 30 fabricated Greenhouse IDs in prose (plausible-looking, non-existent); orchestrator dispatched 30 downstream subagents that all 404'd. Subagents can hallucinate IDs, scores, and confirmation text — round-trip through a file or don't trust the value
|
|
30
30
|
|
|
31
|
-
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
32
|
-
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions
|
|
31
|
+
- [H8] Never paste proxy values from `config/profile.yml` into `task` prompts, status text, or summaries. If a proxy is configured, tell the subagent exactly: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object plus `stealth: true` to every `geometra_connect` call." Do not transcribe `server`, `username`, `password`, or `bypass`, even if you just read them from disk.
|
|
32
|
+
why: a 2026-04-25 OpenCode trace showed raw proxy credentials copied into an apply subagent prompt; trace logs are local, but prompts must still avoid replicating secrets across subagent sessions. Geometra MCP >=1.61.3 can launch CloakBrowser stealth Chromium via `stealth: true`, which belongs with JobForge portal sessions instead of stock Playwright Chromium
|
|
33
33
|
|
|
34
34
|
## Defaults
|
|
35
35
|
|
|
@@ -62,7 +62,7 @@ AI-powered job search pipeline: scans portals, evaluates offers, generates CVs v
|
|
|
62
62
|
1. Check `cv.md`, `profile.yml`, and `portals.yml`; onboard if any file is missing.
|
|
63
63
|
2. Pick and name the mode from **Routing** [D6]. No match → ask; do not guess.
|
|
64
64
|
3. Read the active mode file [D3]. Use local helpers when they can replace broad file reads, prose math, manual policy checks, or artifact reuse decisions [D8]. Decide inline vs delegated work [D1].
|
|
65
|
-
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy prompt hygiene [H8].
|
|
65
|
+
4. Prepare Geometra dispatches: cleanup [H3], local-helper prefilters when useful [D8], dedupe [H2], location filter [D5], file-backed preflight plan/check [D8], routing [D2], proxy/stealth prompt hygiene [H8].
|
|
66
66
|
5. Dispatch at most 2 tasks per round [H1]; wait for final outcomes, not just task ids [H5b], then settle the round with postflight status [D8].
|
|
67
67
|
6. Keep multi-job form-filling out of the orchestrator [H4].
|
|
68
68
|
7. Cross-check subagent facts against authoritative files [H7].
|
package/iso/mcp.json
CHANGED
package/modes/apply.md
CHANGED
|
@@ -42,8 +42,8 @@ Live application assistant. Reads the active application form in Chrome (via Geo
|
|
|
42
42
|
- [D6] Use `fieldLabel` over `fieldId` everywhere it works.
|
|
43
43
|
why: labels are stable across DOM refreshes; IDs are regenerated
|
|
44
44
|
|
|
45
|
-
- [D7] If the orchestrator says a proxy is configured, read the top-level `proxy:` block from `config/profile.yml` and pass that object into every `geometra_connect` call — including Call 3 of the recovery sequence. If the task prompt includes a legacy inline `proxy` object, pass it through
|
|
46
|
-
why: class-B Ashby / Cloudflare-fronted portals need a residential outbound IP
|
|
45
|
+
- [D7] If the orchestrator says a proxy is configured, read the top-level `proxy:` block from `config/profile.yml` and pass that object plus `stealth: true` into every `geometra_connect` call — including Call 3 of the recovery sequence. If the task prompt includes a legacy inline `proxy` object, pass it through and still set `stealth: true`, but do not echo credentials in status text. If absent, run with `stealth: true` and no proxy; never invent a proxy URL.
|
|
46
|
+
why: class-B Ashby / Cloudflare-fronted portals need a residential outbound IP plus a stealth Chromium fingerprint. Geometra MCP v1.59.0 added proxy plumbing, and v1.61.3 added CloakBrowser stealth Chromium via `stealth: true`; the orchestrator owns the config pipe. See "BYO Residential Proxy" in modes/reference-portals.md.
|
|
47
47
|
|
|
48
48
|
- [D8] Upgrade application routing to `@general-paid` when the offer score is ≥ 4.0/5, the user flags "top-tier", "dream job", or "high-stakes", or the candidate is late-stage/post-screen.
|
|
49
49
|
why: high-stakes applications need the quality-sensitive prompt and medium reasoning budget even though OpenCode now routes both application tiers through DeepSeek V4 Flash by default
|
|
@@ -53,7 +53,7 @@ Live application assistant. Reads the active application form in Chrome (via Geo
|
|
|
53
53
|
|
|
54
54
|
## Procedure
|
|
55
55
|
|
|
56
|
-
1. `geometra_connect` + `geometra_page_model`; thread `proxy` if present [D7]; no WebFetch [D5].
|
|
56
|
+
1. `geometra_connect` with `stealth: true` + `geometra_page_model`; thread `proxy` if present [D7]; no WebFetch [D5].
|
|
57
57
|
2. If Geometra is unavailable, ask for screenshot or pasted text [D2].
|
|
58
58
|
3. Extract company + role; Grep `reports/` for a matching evaluation.
|
|
59
59
|
4. Load full report + Section G if present.
|
|
@@ -350,6 +350,7 @@ Call 3: geometra_connect({
|
|
|
350
350
|
isolated: true,
|
|
351
351
|
headless: true,
|
|
352
352
|
slowMo: 350,
|
|
353
|
+
stealth: true,
|
|
353
354
|
proxy: <pass through from task prompt if present; omit otherwise>
|
|
354
355
|
})
|
|
355
356
|
Call 4: geometra_run_actions({
|
package/modes/auto-pipeline.md
CHANGED
|
@@ -9,7 +9,7 @@ Fetch the JD content once. If the input is a **URL** (not pasted JD text), fetch
|
|
|
9
9
|
**Pick exactly one method, in this priority order:**
|
|
10
10
|
|
|
11
11
|
1. **Greenhouse JSON API (first try, if the URL is Greenhouse-backed):** If the pipeline.md entry carries `| gh={slug}/{id}` OR the URL host matches `*.greenhouse.io` / a known Greenhouse customer front-end (`*.pinterestcareers.com`, `okta.com/company/careers/opportunity/*`, `samsara.com/company/careers/roles/*`, `zoominfo.com/careers?gh_jid=*`, `collibra.com/.../?gh_jid=*`, `careers.toasttab.com/jobs?gh_jid=*`, `careers.airbnb.com/positions/*?gh_jid=*`, `coinbase.com/careers/positions/*?gh_jid=*`, `instacart.careers/job/?gh_jid=*`), extract `slug` and `id` and WebFetch `https://boards-api.greenhouse.io/v1/boards/{slug}/jobs/{id}`. 200 + JSON with `content` is the authoritative JD. 404 = genuinely closed (mark CLOSED and stop). **OpenCode WebFetch compatibility:** do not pass `format: "json"`; omit `format` or use `format: "text"` and parse the returned JSON text. **If 200, STOP — do not fall back to Geometra or WebFetch of the front-end.** The API is faster, cheaper (no Geometra session), and never returns a bot-shell.
|
|
12
|
-
2. **Geometra MCP:** Most non-Greenhouse job portals (Lever, Ashby, Workday) are SPAs. Use `geometra_connect` + `geometra_page_model` to render and read the JD. **If this returns non-empty JD text, STOP — do not WebFetch the same URL.**
|
|
12
|
+
2. **Geometra MCP:** Most non-Greenhouse job portals (Lever, Ashby, Workday) are SPAs. Use `geometra_connect({ ..., stealth: true })` + `geometra_page_model` to render and read the JD. **If this returns non-empty JD text, STOP — do not WebFetch the same URL.**
|
|
13
13
|
3. **WebFetch (only if Geometra is unavailable OR returned only a shell with no JD text):** For static pages (ZipRecruiter, WeLoveProduct, company career pages).
|
|
14
14
|
4. **WebSearch (only if methods 1–3 all failed):** Search for the role title + company on secondary portals that index the JD in static HTML.
|
|
15
15
|
|
|
@@ -38,7 +38,7 @@ Execute the full `pdf` pipeline (read `modes/pdf.md`).
|
|
|
38
38
|
|
|
39
39
|
Generate draft answers for the application form when the final score is >= 3.5. If the final score is >= 3.5 (per Canonical Scoring Model thresholds in `_shared.md`), generate draft answers for the application form:
|
|
40
40
|
|
|
41
|
-
1. **Extract form questions**: Use Geometra MCP (`geometra_connect` + `geometra_form_schema`) to discover all form fields. **Reuse the same `sessionId` from Step 0** when the apply URL is the same rendered page; only connect again if the prior session ended or the URL changed. If questions cannot be extracted, use the generic questions.
|
|
41
|
+
1. **Extract form questions**: Use Geometra MCP (`geometra_connect({ ..., stealth: true })` + `geometra_form_schema`) to discover all form fields. **Reuse the same `sessionId` from Step 0** when the apply URL is the same rendered page; only connect again if the prior session ended or the URL changed. If questions cannot be extracted, use the generic questions.
|
|
42
42
|
2. **Generate answers** following the tone guidelines (see below).
|
|
43
43
|
3. **Save in the report** as a `## G) Draft Application Answers` section.
|
|
44
44
|
|
package/modes/pipeline.md
CHANGED
|
@@ -7,7 +7,7 @@ Processes accumulated job offer URLs from `data/pipeline.md`. The user adds URLs
|
|
|
7
7
|
1. **Read** `data/pipeline.md` → find `- [ ]` items in the "Pending" section
|
|
8
8
|
2. **For each pending URL**:
|
|
9
9
|
a. Calculate the next sequential `REPORT_NUM` by running `npx job-forge next-num` (scans `reports/`, day file `#` columns, and `batch/tracker-additions/` — do NOT derive from `reports/` alone)
|
|
10
|
-
b. **Extract JD** using Geometra MCP (geometra_connect + geometra_page_model) → WebFetch → WebSearch
|
|
10
|
+
b. **Extract JD** using Geometra MCP (`geometra_connect({ ..., stealth: true })` + geometra_page_model) → WebFetch → WebSearch
|
|
11
11
|
c. If the URL is not accessible → mark as `- [!]` with a note and continue
|
|
12
12
|
d. **Run full auto-pipeline**: A-F Evaluation → Report .md → PDF (if score >= 3.0, per `_shared.md` thresholds) → Draft answers (if score >= 3.5) → Tracker
|
|
13
13
|
e. **Move from "Pending" to "Processed"**: `- [x] #NNN | URL | Company | Role | Score/5 | PDF ✅/❌`
|
|
@@ -34,7 +34,7 @@ Processes accumulated job offer URLs from `data/pipeline.md`. The user adds URLs
|
|
|
34
34
|
## Detect JD From URL
|
|
35
35
|
|
|
36
36
|
1. **Greenhouse JSON API (FIRST, when the entry has `| gh={slug}/{id}` OR the host looks Greenhouse-backed):** WebFetch `https://boards-api.greenhouse.io/v1/boards/{slug}/jobs/{id}`. 200 + JSON with `content` = LIVE, use it as the JD; 404 = genuinely CLOSED (mark `- [!]` and continue). **OpenCode WebFetch compatibility:** do not pass `format: "json"`; omit `format` or use `format: "text"` and parse the returned JSON text. Bot-hostile customer fronts (`pinterestcareers.com`, `okta.com`, `samsara.com`, `zoominfo.com`, `collibra.com`, `careers.toasttab.com`, `careers.airbnb.com`, `coinbase.com`, `instacart.careers`, `careers.toasttab.com`) MUST be verified via this API first — WebFetch/Geometra of those domains returns a shell or 403 and causes false CLOSED marks.
|
|
37
|
-
2. **Geometra MCP:** `geometra_connect` + `geometra_page_model`. Works with non-Greenhouse SPAs (Lever, Ashby, Workday), uses fewer tokens than raw DOM snapshots.
|
|
37
|
+
2. **Geometra MCP:** `geometra_connect({ ..., stealth: true })` + `geometra_page_model`. Works with non-Greenhouse SPAs (Lever, Ashby, Workday), uses fewer tokens than raw DOM snapshots.
|
|
38
38
|
3. **WebFetch (fallback):** For static pages or when Geometra is not available.
|
|
39
39
|
4. **WebSearch (last resort):** Search on secondary portals that index the JD.
|
|
40
40
|
|
|
@@ -50,7 +50,7 @@ These blocks come from two distinct root causes and require different responses:
|
|
|
50
50
|
|
|
51
51
|
**Rule — do NOT loop retrying a class B block.** One retry with `imeFriendly: true` is the correct test for class A. If the same spam message fires after a clean `imeFriendly` refill, stop, mark Failed, move on. Repeated retries waste subagent time and do not change the outcome.
|
|
52
52
|
|
|
53
|
-
**Class B fix — BYO residential proxy
|
|
53
|
+
**Class B fix — BYO residential proxy + stealth Chromium.** When the candidate has configured `proxy:` in `config/profile.yml`, every `geometra_connect` call threads that proxy through to Chromium, which flips the outbound IP from datacenter to residential/mobile. JobForge also passes `stealth: true` so Geometra MCP >=1.61.3 launches CloakBrowser's patched Chromium instead of stock Playwright Chromium. See the "BYO Residential Proxy" reference section below. Without a configured proxy, stealth still helps browser fingerprinting, but the outbound IP remains datacenter.
|
|
54
54
|
|
|
55
55
|
**Known-block Ashby tenants (2026-04-19 empirical observations).** These tenants fired class B on every attempted submit from a headless datacenter-IP proxy. Orchestrators planning apply dispatches should assume these tenants will Fail in headless — prioritize other portals, or skip same-tenant siblings after a confirmed class B to avoid burning subagent slots:
|
|
56
56
|
|
|
@@ -138,12 +138,12 @@ When running multiple application forms in parallel, each `geometra_connect` MUS
|
|
|
138
138
|
|
|
139
139
|
**Correct parallel pattern:**
|
|
140
140
|
```javascript
|
|
141
|
-
geometra_connect({ pageUrl: "https://...", isolated: true, headless: true, slowMo: 350 })
|
|
141
|
+
geometra_connect({ pageUrl: "https://...", isolated: true, headless: true, slowMo: 350, stealth: true })
|
|
142
142
|
```
|
|
143
143
|
|
|
144
144
|
**Wrong:** running `geometra_connect` without `isolated: true` when submitting multiple forms concurrently. The forms may share state and produce incorrect submissions.
|
|
145
145
|
|
|
146
|
-
**With a configured proxy,** add `proxy: { server, username?, password?, bypass? }` to the same call — see "BYO Residential Proxy" below. The reusable-proxy pool is partitioned by proxy identity, so mixing direct and proxied sessions across parallel rounds is safe.
|
|
146
|
+
**With a configured proxy,** add `proxy: { server, username?, password?, bypass? }` to the same call — see "BYO Residential Proxy" below. The reusable-proxy pool is partitioned by proxy identity, so mixing direct and proxied sessions across parallel rounds is safe. Keep `stealth: true` either way so JobForge uses Geometra's CloakBrowser Chromium path for portal sessions.
|
|
147
147
|
|
|
148
148
|
### Session Reuse — When Subagents Cannot Reach Existing Sessions
|
|
149
149
|
|
|
@@ -185,7 +185,7 @@ Every subagent that uses Geometra must run these THREE tool calls as its FIRST t
|
|
|
185
185
|
```
|
|
186
186
|
Step 1: geometra_list_sessions()
|
|
187
187
|
Step 2: geometra_disconnect({ closeBrowser: true })
|
|
188
|
-
Step 3: geometra_connect({ pageUrl: "<the URL the orchestrator gave you>", isolated: true, headless: true, slowMo: 350 })
|
|
188
|
+
Step 3: geometra_connect({ pageUrl: "<the URL the orchestrator gave you>", isolated: true, headless: true, slowMo: 350, stealth: true })
|
|
189
189
|
```
|
|
190
190
|
|
|
191
191
|
**If the orchestrator says proxy is configured,** read the top-level
|
|
@@ -193,7 +193,7 @@ Step 3: geometra_connect({ pageUrl: "<the URL the orchestrator gave you>", isol
|
|
|
193
193
|
|
|
194
194
|
```
|
|
195
195
|
Step 3: geometra_connect({
|
|
196
|
-
pageUrl: "<URL>", isolated: true, headless: true, slowMo: 350,
|
|
196
|
+
pageUrl: "<URL>", isolated: true, headless: true, slowMo: 350, stealth: true,
|
|
197
197
|
proxy: { server: "...", username: "...", password: "...", bypass: "..." }
|
|
198
198
|
})
|
|
199
199
|
```
|
|
@@ -49,13 +49,13 @@ When a form says "enter the code we sent to your email", you MUST retrieve the c
|
|
|
49
49
|
|
|
50
50
|
---
|
|
51
51
|
|
|
52
|
-
## BYO Residential Proxy
|
|
52
|
+
## BYO Residential Proxy + Stealth Chromium
|
|
53
53
|
|
|
54
54
|
**Problem:** on 2026-04-19 cycle 4, 5/5 untested Ashby tenants and 100% of Dropbox-class Cloudflare-fronted portals fingerprint-blocked headless Chromium from datacenter IPs. `imeFriendly: true` fixes class A (React validation lag) but has zero effect on class B (environment fingerprint). There is no in-session software-only fix for class B: the server decided the session is a bot before the form response was rendered.
|
|
55
55
|
|
|
56
|
-
**Fix:** route the spawned Chromium through a residential or mobile proxy the candidate already pays for
|
|
56
|
+
**Fix:** route the spawned Chromium through a residential or mobile proxy the candidate already pays for, and launch Geometra's CloakBrowser stealth Chromium with `stealth: true`. Geometra MCP v1.59.0 added a `proxy: { server, username?, password?, bypass? }` parameter on `geometra_connect` and `geometra_prepare_browser`; v1.61.3 added `stealth: true` for CloakBrowser. The outbound IP becomes residential/mobile, the browser fingerprint moves off stock Playwright Chromium, and the class-B checks have fewer signals to trip.
|
|
57
57
|
|
|
58
|
-
**
|
|
58
|
+
**Proxy is opt-in, stealth is default.** JobForge does NOT bundle or resell proxy bandwidth — the candidate brings their own provider (Bright Data, Oxylabs, SOAX, Smartproxy, mobile hotspot, self-hosted SOCKS). Without a configured proxy, JobForge still passes `stealth: true`, but the outbound IP remains the machine or hosting environment running Chromium.
|
|
59
59
|
|
|
60
60
|
### Where the proxy config lives
|
|
61
61
|
|
|
@@ -76,15 +76,15 @@ See `config/profile.example.yml` for the commented-out template.
|
|
|
76
76
|
**Orchestrator responsibilities:**
|
|
77
77
|
|
|
78
78
|
1. On session start, read `config/profile.yml` once. If a `proxy:` block is present, remember that a proxy is configured, but do not paste username/password values into task prompts or user-visible status.
|
|
79
|
-
2. When dispatching any subagent whose work involves a `geometra_connect` call, tell it to read `config/profile.yml` and pass the top-level `proxy:` block to every `geometra_connect` call. Example dispatch prompt line: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object to every `geometra_connect` call."
|
|
80
|
-
3. When the orchestrator itself opens a Chromium session (single-application interactive flow), include the same `proxy` object from `config/profile.yml` in its own `geometra_connect` call.
|
|
79
|
+
2. When dispatching any subagent whose work involves a `geometra_connect` call, tell it to read `config/profile.yml` and pass the top-level `proxy:` block plus `stealth: true` to every `geometra_connect` call. Example dispatch prompt line: "Proxy is configured; read `config/profile.yml` and pass its top-level `proxy:` object plus `stealth: true` to every `geometra_connect` call."
|
|
80
|
+
3. When the orchestrator itself opens a Chromium session (single-application interactive flow), include the same `proxy` object from `config/profile.yml` and `stealth: true` in its own `geometra_connect` call.
|
|
81
81
|
4. If `proxy:` is absent from `profile.yml`, skip the param entirely. Do NOT invent a proxy URL or leave a stale placeholder.
|
|
82
82
|
|
|
83
83
|
**Subagent responsibilities:**
|
|
84
84
|
|
|
85
|
-
1. If the task prompt says proxy is configured, read `config/profile.yml` and pass the top-level `proxy:` object through to `geometra_connect` and any `geometra_prepare_browser` calls unchanged.
|
|
86
|
-
2. If the task prompt includes a legacy inline `proxy` object, pass it through unchanged
|
|
87
|
-
3. If the task prompt does NOT mention a proxy and `config/profile.yml` has no `proxy:` block, run
|
|
85
|
+
1. If the task prompt says proxy is configured, read `config/profile.yml` and pass the top-level `proxy:` object plus `stealth: true` through to `geometra_connect` and any `geometra_prepare_browser` calls unchanged.
|
|
86
|
+
2. If the task prompt includes a legacy inline `proxy` object, pass it through unchanged and still set `stealth: true`, but never print the credentials back in status text.
|
|
87
|
+
3. If the task prompt does NOT mention a proxy and `config/profile.yml` has no `proxy:` block, run with `stealth: true` and no proxy.
|
|
88
88
|
4. Never second-guess the proxy field — if it comes from `profile.yml`, it's authoritative.
|
|
89
89
|
|
|
90
90
|
### When proxy use is load-bearing
|
|
@@ -98,7 +98,7 @@ Apply these rules when deciding whether the proxy is worth waiting for:
|
|
|
98
98
|
|
|
99
99
|
### Pool partitioning — why mixed runs are safe
|
|
100
100
|
|
|
101
|
-
The Geometra MCP partitions its reusable-proxy pool by
|
|
101
|
+
The Geometra MCP partitions its reusable-proxy pool by proxy identity and browser flavor — proxy partitioning landed in `@geometra/mcp@1.59.0`, and stealth partitioning is available in `@geometra/mcp@1.61.3`. A direct session and a proxied session NEVER share a Chromium instance, and stock and stealth sessions do not pool together. Practical consequence: flipping `proxy:` on or off in `profile.yml` mid-session is safe — the next `geometra_connect` just opens a fresh Chromium in its own pool partition.
|
|
102
102
|
|
|
103
103
|
### Troubleshooting
|
|
104
104
|
|
|
@@ -128,7 +128,7 @@ The Geometra MCP partitions its reusable-proxy pool by `(server, username, bypas
|
|
|
128
128
|
"geometra": {
|
|
129
129
|
"type": "stdio",
|
|
130
130
|
"command": "npx",
|
|
131
|
-
"args": ["-y", "@geometra/mcp"]
|
|
131
|
+
"args": ["-y", "@geometra/mcp@1.61.3"]
|
|
132
132
|
},
|
|
133
133
|
"gmail": {
|
|
134
134
|
"type": "stdio",
|
package/modes/scan.md
CHANGED
|
@@ -25,7 +25,7 @@ Read `portals.yml` which contains:
|
|
|
25
25
|
|
|
26
26
|
### Use Level 1 — Direct Geometra (PRIMARY)
|
|
27
27
|
|
|
28
|
-
**For each company in `tracked_companies`:** Connect to its `careers_url` with Geometra MCP (`geometra_connect` + `geometra_page_model` / `geometra_list_items`), read ALL visible job listings, and extract the title + URL of each one. Direct Geometra is the most reliable method because:
|
|
28
|
+
**For each company in `tracked_companies`:** Connect to its `careers_url` with Geometra MCP (`geometra_connect({ ..., stealth: true })` + `geometra_page_model` / `geometra_list_items`), read ALL visible job listings, and extract the title + URL of each one. Direct Geometra is the most reliable method because:
|
|
29
29
|
|
|
30
30
|
- It sees the page in real time (not cached Google results).
|
|
31
31
|
- It works with SPAs (Ashby, Lever, Workday).
|
|
@@ -138,7 +138,7 @@ The levels are additive — all are executed, results are merged and deduplicate
|
|
|
138
138
|
|
|
139
139
|
4. **Level 1 — Geometra scan** (sequential, or ≤2 parallel via `task` subagents per Hard Limit #1 in `AGENTS.md`):
|
|
140
140
|
For each company in `tracked_companies` with `enabled: true` and `careers_url` defined:
|
|
141
|
-
a. `geometra_connect` to the `careers_url`
|
|
141
|
+
a. `geometra_connect` to the `careers_url` with `stealth: true`
|
|
142
142
|
b. `geometra_page_model` or `geometra_list_items` to read all job listings
|
|
143
143
|
c. If the page has filters/departments, navigate the relevant sections
|
|
144
144
|
d. For each job listing extract: `{title, url, company}`
|
|
@@ -317,7 +317,7 @@ Each company in `tracked_companies` MUST have a `careers_url` — the direct URL
|
|
|
317
317
|
**If `careers_url` doesn't exist** for a company:
|
|
318
318
|
1. Try the pattern for its known platform
|
|
319
319
|
2. If that fails, do a quick WebSearch: `"{company}" careers jobs`
|
|
320
|
-
3. Navigate with Geometra (`geometra_connect`) to confirm it works
|
|
320
|
+
3. Navigate with Geometra (`geometra_connect` with `stealth: true`) to confirm it works
|
|
321
321
|
4. **Save the found URL in portals.yml** for future scans
|
|
322
322
|
|
|
323
323
|
**If `careers_url` returns 404 or redirect:**
|
package/opencode.json
CHANGED
package/package.json
CHANGED
|
@@ -20,6 +20,7 @@ const checks = [
|
|
|
20
20
|
["H5 blocks same-company concurrent retry", () => every(files.instructions, ["Re-dispatch the same company only AFTER", "previous subagent returns"])],
|
|
21
21
|
["H6 requires merge and verify", () => every(files.instructions, ["batch/tracker-additions/*.tsv", "npx job-forge merge", "npx job-forge verify"])],
|
|
22
22
|
["H7 distrusts subagent prose", () => every(files.instructions, ["must originate from a file", "not from prior subagent prose"])],
|
|
23
|
+
["H8 keeps proxy secret and requires stealth", () => every(files.instructions, ["[H8]", "Do not transcribe `server`, `username`, `password`, or `bypass`", "`stealth: true`"])],
|
|
23
24
|
["root points to consolidated helper reference", () => every(files.instructions, ["[D8]", "modes/reference-local-helpers.md", "deterministic local helpers"])],
|
|
24
25
|
["helper reference covers score/timeline/prioritize/lineage", () => every(files.helpers, ["templates/score.json", "npx job-forge score:*", "templates/timeline.json", "npx job-forge timeline:*", "templates/prioritize.json", "npx job-forge prioritize:*", ".jobforge-lineage.json", "npx job-forge lineage:*"])],
|
|
25
26
|
["root helper defaults are consolidated", () => !/\[D(?:9|1\d|2[0-9])\]/.test(files.instructions)],
|