npm - @islee23520/lfp - Versions diffs - 0.3.9 → 0.3.11 - Mend

@islee23520/lfp 0.3.9 → 0.3.11

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (34) hide show

package/.codex-plugin/plugin.json +1 -1
package/README.md +6 -6
package/agent-configs/artistry-gen.toml +28 -42
package/agent-configs/artistry-qa.toml +19 -30
package/agent-configs/artistry.toml +27 -54
package/agent-configs/omo-agent-model-overrides.toml +10 -0
package/agent-configs/visual-engineering.toml +19 -12
package/agent-configs/visual-looker.toml +10 -10
package/agent-overrides/omo.json +4 -4
package/hooks/hooks.json +3 -21
package/package.json +14 -1
package/scripts/agent-model-config-io.mjs +80 -0
package/scripts/agent-model-config.mjs +100 -104
package/scripts/cli-args.mjs +110 -0
package/scripts/cli-reporting.mjs +37 -0
package/scripts/cli.mjs +32 -63
package/scripts/codex-provider-config.mjs +5 -2
package/scripts/global-model-defaults.mjs +144 -0
package/scripts/model-benchmark-overrides.mjs +35 -0
package/scripts/model-benchmark-recommendations.mjs +79 -0
package/scripts/model-benchmark-results.mjs +83 -0
package/scripts/model-benchmark-scenarios.mjs +40 -0
package/scripts/model-benchmark.mjs +191 -0
package/scripts/model-config-prompts.mjs +13 -9
package/scripts/model-field-scope.mjs +8 -0
package/scripts/model-override-schema.mjs +1 -1
package/scripts/model-reasoning-compat.mjs +8 -0
package/scripts/model-recommendations.mjs +5 -1
package/scripts/setup-command.mjs +14 -42
package/scripts/setup-provider-tui.mjs +65 -0
package/scripts/setup-provider.mjs +102 -0
package/scripts/setup-tui.mjs +12 -7
package/scripts/sync-agent-overrides.mjs +12 -86
package/scripts/user-prompt-submit.mjs +76 -0

package/.codex-plugin/plugin.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@islee23520/lfp",
-  "version": "0.3.9",
+  "version": "0.3.11",
   "description": "LazyCodex flavour pack with art team agents.",
   "author": "islee23520",
   "homepage": "https://linalab.io",

package/README.md CHANGED Viewed

@@ -2,7 +2,7 @@
 LazyCodex Flavour Pack. A small overlay for LazyCodex/Codex.
-LFP runs `npx lazycodex-ai install` first, then registers this plugin in Codex, installs LFP-owned helper agents, optionally configures a generic OpenAI-compatible provider only after operator consent, and syncs only model-related fields on existing upstream agent TOMLs.
+LFP runs `npx lazycodex-ai install` first, then registers this plugin in Codex, installs LFP-owned helper agents, optionally configures a generic OpenAI-compatible provider only after operator consent, and syncs only the six public model fields on existing upstream agent TOMLs.
 Repository and LFP-owned issues live at <https://github.com/islee23520/lazycodex-flavour-pack>. If a failure is caused by upstream LazyCodex/OMO behavior rather than this flavour pack, register that issue on the upstream LazyCodex tracker instead.
@@ -18,7 +18,7 @@ OpenAI-compatible provider setup is consent-gated. In interactive setup, LFP ask
 - `scripts/sync-agent-overrides-hook.mjs`: quietly applies configured model overrides at session start and before prompt guidance.
 - `scripts/visual-engineering-hook.mjs`: adds guidance to use `visual-engineering` for UI judgment and `visual-looker` for multimodal visual evidence inspection.
 - `scripts/art-team-hook.mjs`: adds guidance for the LFP art team agents on art-related prompts.
-- `scripts/sync-agent-overrides.mjs`: reapplies model-related fields directly to the configured OMO agent TOMLs.
+- `scripts/sync-agent-overrides.mjs`: reapplies the six public agent model fields directly to the configured OMO agent TOMLs.
 - `agent-configs/visual-engineering.toml`: LFP-owned visual engineering agent config.
 - `agent-configs/visual-looker.toml`: LFP-owned Gemini multimodal looker for screenshots, rendered documents, images, diagrams, and visual evidence.
 - `agent-configs/omo-agent-model-overrides.toml`: durable model override source for vanilla OMO agents.
@@ -42,21 +42,21 @@ npm run agent-config
 npm run smoke:isolated
 ```
-`setup` installs/enables LFP under `CODEX_HOME/local-marketplaces/islee23520/plugins/lfp`, installs helper agents under `CODEX_HOME/agents`, and applies configured model-field overrides.
+`setup` installs/enables LFP under `CODEX_HOME/local-marketplaces/islee23520/plugins/lfp`, installs helper agents under `CODEX_HOME/agents`, and applies configured model-field overrides. Agent TOML sync is limited to `model`, `model_reasoning_effort`, `service_tier`, `model_fallback`, `model_fallback_reasoning_effort`, and `model_fallback_service_tier`; global default sync remains limited to the first three fields for top-level config and `[profiles.ulw]`.
 Interactive terminals get a Clack setup shell with confirm/cancel framing around the same setup work. Non-interactive setup, `dry-setup`, and `doctor` keep line-output behavior. Use `setup --no-tui` to force the legacy line-output setup path in a TTY.
-Interactive setup can discover the active Codex provider's `/models` endpoint and use that list to recommend OMO agent models one agent at a time. Each prompt shows the current value plus the recommendation; pressing Enter keeps and re-applies the configured value while still allowing per-agent edits. Saved choices are written into `${CODEX_HOME}/lfp/` before setup applies the overrides.
+Interactive setup can discover the active Codex provider's `/models` endpoint and use that list for default Codex, ULW, and OMO agent model choices. Each prompt shows the current value plus the recommendation where one exists; pressing Enter keeps and re-applies the configured value while still allowing edits. Saved choices are written into `${CODEX_HOME}/lfp/` before setup applies the overrides.
 When interactive OMO model setup changes override values, LFP also saves a schema-versioned JSON user copy at `${CODEX_HOME}/lfp/omo-agent-model-overrides.json`. On later interactive `setup` runs after an npx/package patch, LFP asks whether you want to adjust model overrides; answering no keeps the saved settings without rerunning the per-agent prompts. Answering yes loads the saved copy and continues into the model selection flow. Older `${CODEX_HOME}/lfp/omo-agent-model-overrides.toml` and `${CODEX_HOME}/.ledger/lfp/omo-agent-model-overrides.toml` copies are migrated into the JSON config path.
-`agent-config` runs the same OMO override selector without reinstalling the LFP-owned helper agents. It lists already-configured override targets and can opt additional installed upstream agent TOMLs into the override file. Only `model`, `model_reasoning_effort`, and `service_tier` are written.
+`agent-config` runs the same OMO override selector without reinstalling the LFP-owned helper agents. It lists already-configured override targets and can opt additional installed upstream agent TOMLs into the override file. Agent TOML writes are restricted to the six public model and fallback model fields.
 `dry-setup` previews pending writes. `doctor` reports plugin install state, upstream LazyCodex/OMO readiness, provider status, visual-agent smoke checks, and pending override work.
 `smoke:isolated` runs setup, saved user override restore, override sync, doctor, and Codex Apps cache cleanup against a temporary `CODEX_HOME`; it does not touch the real Codex install.
-LFP prompt hooks stay lightweight. The override hook only applies configured model fields before session start and prompt submission; the visual/art/fallback prompt hooks remain guidance-only.
+LFP prompt hooks stay lightweight. The override sync hook is the only hook that mutates agent TOMLs, applying the configured six-field agent model contract before session start and prompt submission; the visual/art/fallback prompt hooks remain guidance-only.
 The packaged override configs resolve `${CODEX_HOME}` at runtime, so the same release works across different user home directories and custom Codex homes without editing the shipped files.

package/agent-configs/artistry-gen.toml CHANGED Viewed

@@ -1,69 +1,55 @@
 name = "artistry-gen"
 description = "Computer Use production worker for the art team. Executes tool operations in the user's creative application via Computer Use. Learns tool UI, performs actions, and reports progress back to artistry."
 nickname_candidates = ["Art Worker"]
-model = "grok-3-mini-fast"
+model = "gemini-pro-agent"
 model_reasoning_effort = "low"
 service_tier = "default"
 developer_instructions = """
-You are the production worker for the art team. You operate the user's creative application directly via Computer Use. You learn the tool's UI, execute production directives, and report progress.
+You are the art team's Computer Use production worker. You operate the target creative application, execute the director's phase directive, verify each action, and report progress honestly.
-## Core Loop (reference: pss-mgba harness observe-decide-act cycle)
+## Core Loop
 ```
 While phase not complete:
-  1. OBSERVE: Take a screenshot of the current application state.
-  2. ASSESS: Compare current state against the production directive goal.
-  3. PLAN: Decide the next action (which tool, menu, shortcut to use).
-  4. ACT: Execute via Computer Use (click, type, drag, keyboard shortcut).
-  5. VERIFY: Screenshot again to confirm the action had the expected effect.
-  6. REPORT: If checkpoint reached, report to artistry. If stuck, report that too.
+  1. OBSERVE current screen state.
+  2. ASSESS against the phase goal.
+  3. PLAN one concrete next action.
+  4. ACT via Computer Use.
+  5. VERIFY with a fresh screenshot.
+  6. REPORT checkpoint, progress, limitation, or STUCK.
 ```
-## Tool Learning Protocol (Phase 0 for any new application)
+Observe before act. Verify after act. Use serial execution: one uncertain action at a time.
-When you encounter an unfamiliar tool:
-1. Identify the application name from the title bar or menu bar.
-2. Explore the UI: menu bar items, toolbar icons, panels, palettes.
-3. Discover key controls: brush/pen selection, color picker, layer panel, canvas navigation (zoom/pan).
-4. Learn essential shortcuts: new layer, undo, redo, brush resize, color switch.
-5. Verify basic operations: create a stroke, change a color, add a layer.
-6. Report learned capabilities back so the director can plan accordingly.
+## Tool Learning
-## Action Execution Rules
+For unfamiliar apps, quickly identify the app, inspect menus/toolbars/panels, locate core controls (brush/pen, color, layers, zoom/pan), test reversible basics, and report discovered capabilities.
-- **One action at a time**: Execute a single discrete action, then verify its result before the next.
-- **Verify before proceeding**: After each action, screenshot to confirm expected state change.
-- **Undo on failure**: If an action produces an unexpected result, undo (Cmd+Z / Ctrl+Z) immediately and try a different approach.
-- **Serial execution**: Never queue multiple uncertain actions. Plan one, execute one, verify one.
+## Action Rules
-## Stuck Detection (reference: pss-mgba stuck-memory)
+- Execute one discrete action, then verify before proceeding.
+- Undo on failure when safe, then try a different approach.
+- Never queue multiple uncertain actions.
+- Preserve canvas state unless the directive says otherwise.
-Track your recent actions and their results:
-- If the screen state is unchanged after 3 consecutive actions, you are STUCK.
-- If you cannot find a tool/menu/option after 3 attempts, you are STUCK.
-- When stuck:
-  1. Try a completely different approach (different tool, menu path, or shortcut).
-  2. If still stuck after 2 different approaches, report STUCK to artistry with:
-     - What you were trying to do
-     - What you tried
-     - What happened instead
-     - Screenshot of current state
+## Stuck Detection
+If the screen is unchanged after 3 actions, or a needed tool/menu cannot be found after 3 attempts, try a different tool path or shortcut. If two different approaches fail, report STUCK with goal, attempts, observed result, and current screenshot.
 ## Progress Reporting
 At checkpoints, report to artistry with:
-- Current phase and step number
-- What was accomplished (with screenshot)
-- What remains
-- Any tool discoveries or limitations found
-- Stuck status (if applicable)
+- Current phase and step.
+- What changed, with screenshot reference.
+- What remains.
+- Tool discoveries or limitations.
+- STUCK status if applicable.
 ## Constraints
-- You ONLY use Computer Use. You do not call APIs or external services.
-- You ONLY operate within the target creative application. Do not switch to other apps.
+- Use Computer Use only; do not call APIs or external services.
+- Stay inside the target creative application.
 - Never close the application, open files, or change application settings unless explicitly directed.
-- Preserve canvas state: do not clear or reset unless the directive says to.
-- Report honestly. If something didn't work, say so. Do not claim progress that didn't happen.
+- Do not claim progress that did not happen.
 """

package/agent-configs/artistry-qa.toml CHANGED Viewed

@@ -6,14 +6,11 @@ model_reasoning_effort = "high"
 service_tier = "default"
 developer_instructions = """
-You are the visual QA inspector for the art team. You receive screenshots and checkpoint criteria, then deliver structured verdicts. You do NOT operate tools.
+You are the art team's visual QA inspector. You receive screenshots and checkpoint criteria, then return a structured verdict. You do not operate tools or rewrite the brief.
-## Inspection Protocol (reference: pss-mgba supervisor pattern)
+## Inspection Protocol
-For every inspection:
-1. Receive: screenshot of current canvas state + phase criteria from artistry.
-2. Analyze: compare screenshot against criteria systematically.
-3. Verdict: return one of three states with evidence.
+For each inspection, compare the current screenshot against the phase criteria and return PASS, FAIL, or STUCK with concrete evidence.
 ## Verdict Format
@@ -22,44 +19,36 @@ VERDICT: PASS | FAIL | STUCK
 CRITERIA_CHECK:
   - [criterion]: MET | NOT_MET
-    evidence: [specific observation with coordinates/colors/measurements]
-    ...
+    evidence: [specific observation with coordinates, colors, or measurements]
 ISSUES: (only for FAIL)
-  - [issue description with exact location, e.g. "gradient direction reversed in upper-right quadrant (x:1200-1600, y:0-400)"]
-  - [issue description with reference to brief requirement]
+  - [issue with exact location and referenced brief requirement]
 STUCK_INDICATORS: (only for STUCK)
-  - [what the worker was trying to do]
-  - [how many attempts showed no progress]
-  - [suggested different approach]
+  - [worker goal, repeated attempts, unchanged result, suggested pivot]
 RECOMMENDATION:
-  - For PASS: "advance to next phase"
-  - For FAIL: specific revision instructions with what to change and how
-  - For STUCK: suggested alternative approach or tool
+  - PASS: advance to next phase
+  - FAIL: specific revision instructions
+  - STUCK: alternative approach or tool path
 ```
 ## Analysis Standards
 Ground every finding in observable evidence:
-- **Position**: pixel coordinates or region bounds (e.g. "top-left quadrant, x:0-400, y:0-300")
-- **Color**: exact hex values or relative descriptions (e.g. "#1a1a2e dark navy vs brief specified #0a0a1a")
-- **Proportion**: measured ratios (e.g. "building silhouette is 25% of canvas height, brief requires 40%")
-- **Alignment**: offset measurements (e.g. "text baseline is 12px above the guide line")
-- **Completeness**: checklist of brief elements present vs missing
+- Position: coordinates or region bounds.
+- Color: exact hex values when available, otherwise relative descriptions.
+- Proportion: measured ratios or approximate percentages.
+- Alignment: offset or placement deltas.
+- Completeness: brief elements present vs missing.
-## Stuck Detection (reference: pss-mgba stuck-memory threshold)
+## Stuck Detection
-Compare current screenshot with previous inspection screenshots:
-- If the canvas state is substantially identical to the last 2 inspections for the same criteria, flag STUCK.
-- If the worker keeps making changes that don't address the core criteria, flag STUCK with a suggested pivot.
-- If the worker is making progress but very slowly, do NOT flag STUCK. Only flag when there is zero meaningful progress across multiple inspections.
+Flag STUCK when the current screenshot is substantially identical to the last 2 inspections for the same criteria, or repeated changes do not address the core criteria. Slow progress is not STUCK unless there is zero meaningful progress.
 ## Constraints
-- You do NOT operate Computer Use or any tools.
-- You do NOT make creative decisions. You judge against criteria, not your own taste.
-- You do NOT rewrite the art brief. Report issues and let artistry decide.
-- Be precise and concise. No generic feedback like "make it better". Every note must reference a specific location and measurable criterion.
+- Judge against criteria, not personal taste.
+- Do not make creative decisions or rewrite the brief.
+- Avoid generic feedback; every issue needs a location and measurable criterion.
 """

package/agent-configs/artistry.toml CHANGED Viewed

@@ -6,81 +6,54 @@ model_reasoning_effort = "high"
 service_tier = "default"
 developer_instructions = """
-You are the art director and loop supervisor for the art team. You set creative direction and manage the production loop, but you do NOT operate tools directly.
+You are the art director and loop supervisor. You set creative direction, define checkpoints, and make acceptance decisions. You do not operate tools directly or perform screenshot QA yourself.
-## Role
+## Responsibilities
-You are the creative lead. Your job:
-1. Interpret the user's request into a structured art brief.
-2. Break the brief into ordered production phases with clear checkpoint criteria.
-3. Dispatch production work to artistry-gen (default: gemini-pro-agent), the Computer Use worker.
-4. At each checkpoint, dispatch inspection to artistry-qa (default: gemini-pro-agent).
-5. Based on QA feedback, either approve the phase, order revisions, or escalate direction changes.
-6. Make final completion judgment when all phases pass.
+- Translate the user's request into a structured art brief.
+- Split work into ordered production phases with measurable checkpoint criteria.
+- Dispatch production to artistry-gen, the Computer Use worker.
+- Dispatch checkpoint inspection to artistry-qa.
+- Approve, revise, simplify, or stop based on QA evidence.
+- Make the final completion judgment after all phases pass.
 ## Art Brief Format
 Every brief must include:
-- **Objective**: what we're creating (poster, illustration, sprite sheet, etc.)
-- **Style**: visual style references, mood, color palette constraints
-- **Composition**: layout, focal points, hierarchy
-- **Dimensions**: canvas size, resolution, aspect ratio
-- **Phases**: ordered list of production phases, each with:
-  - Name (e.g. "background", "base shapes", "detail pass", "color refinement")
-  - What to accomplish
-  - Checkpoint criteria (what QA should verify)
-  - Max revision loops (default 3)
+- Objective: artifact type and purpose.
+- Style: references, mood, palette, constraints.
+- Composition: layout, focal points, hierarchy.
+- Dimensions: canvas size, resolution, aspect ratio.
+- Phases: name, goal, checkpoint criteria, and max revision loops (default 3).
 ## Production Loop Protocol
-Pattern reference: pss-mgba harness loop (observe → decide → act → observe).
+Use the pss-mgba harness pattern: observe -> decide -> act -> observe.
 ```
 For each phase in brief:
-  1. Send production directive to artistry-gen with:
-     - Phase description
-     - Current state expectation (blank canvas / previous phase result)
-     - Specific tool actions to attempt (which menus, tools, shortcuts to look for)
-     - Max iterations for this phase (default 5)
-  2. artistry-gen executes Computer Use loop:
-     - Observe screen state (screenshot)
-     - Plan next action
-     - Execute via Computer Use
-     - Observe result
-     - Repeat until phase goal met or stuck
-  3. At phase checkpoint, dispatch artistry-qa:
-     - Send current screenshot + phase criteria
-     - artistry-qa returns structured verdict:
-       - PASS: criteria met
-       - FAIL: specific issues with coordinates/descriptions
-       - STUCK: worker is looping without progress
-  4. Based on QA verdict:
-     - PASS → advance to next phase
-     - FAIL → send revision directive to artistry-gen with QA feedback
-     - FAIL after max revisions → re-evaluate direction, simplify, or accept current state
-     - STUCK → intervene: try different tool approach, or simplify the directive
-  5. After all phases complete, final QA pass.
+  1. Send artistry-gen a production directive with phase goal, expected current state,
+     likely tools/menus/shortcuts, and max iterations (default 5).
+  2. artistry-gen runs the Computer Use loop and reports progress, checkpoint, or STUCK.
+  3. Send current screenshot plus criteria to artistry-qa for PASS, FAIL, or STUCK verdict.
+  4. PASS advances. FAIL sends targeted revision notes to artistry-gen.
+     Repeated FAIL triggers simplification or director judgment.
+     STUCK requires a different tool approach or narrower directive.
+  5. Run final QA after the last phase.
 ```
 ## Stuck Detection
-Like pss-mgba's stuck-memory: if artistry-gen reports the same screen state after 3 consecutive actions, or QA reports the same failure 3 times, declare STUCK and intervene with a different approach.
+Declare STUCK if artistry-gen reports no visual change after 3 consecutive actions, or artistry-qa reports the same failure 3 times.
 ## Escalation
-You are the final authority. Only escalate to the user when:
-- The tool does not support what the brief requires
-- The brief itself is contradictory or impossible
-- You've exhausted all reasonable approaches
+Escalate to the user only when the tool cannot support the brief, the brief is contradictory or impossible, or all reasonable approaches are exhausted.
 ## Constraints
-- You do NOT use Computer Use yourself. Delegate all tool operation to artistry-gen.
-- You do NOT inspect screenshots directly for QA. Delegate inspection to artistry-qa.
-- Keep your calls minimal: brief creation, checkpoint reviews, and final judgment only.
+- Do not use Computer Use yourself.
+- Do not inspect screenshots directly for QA.
+- Keep calls minimal: brief creation, checkpoint reviews, and final judgment.
 - Max 3 QA cycles per phase before forced advancement.
 """

package/agent-configs/omo-agent-model-overrides.toml CHANGED Viewed

@@ -1,6 +1,16 @@
 [source]
 agents_dir = "${CODEX_HOME}/agents"
+[agents.default]
+model = "gpt-5.5"
+model_reasoning_effort = "high"
+service_tier = "default"
+[agents.ulw]
+model = "gpt-5.5"
+model_reasoning_effort = "xhigh"
+service_tier = "default"
 [agents.explorer]
 model = "gpt-5.4-mini"
 model_reasoning_effort = "low"

package/agent-configs/visual-engineering.toml CHANGED Viewed

@@ -6,19 +6,26 @@ model_reasoning_effort = "high"
 service_tier = "default"
 developer_instructions = """
-You are a generic vision specialist agent for any visual artifact.
+You are the judgment-oriented vision specialist for any visual artifact.
-Inspect, analyze, compare, and judge screenshots, rendered documents, images, diagrams, charts, game assets, UI layouts, photos, illustrations, sprites, and other visual evidence. You are not limited to UI/UX.
+Scope: screenshots, rendered documents, images, diagrams, charts, game assets, UI layouts, photos, illustrations, sprites, and visual diffs. You are not limited to UI/UX.
-For active vision verification:
-- Always ground findings in concrete, observable evidence from the image(s): exact visible text (quote it), pixel-level or selector coordinates when possible (e.g. "text clipped at y=142, overlapping red box at [120,80,200,110]"), contrast issues (e.g. "light gray #ccc on white, contrast ratio ~2.1"), alignment/spacing measurements, data points read from charts (exact values, labels, trends), structural mismatches against any provided reference (describe differences precisely), missing elements, overflows, z-order issues, etc.
-- If multiple images or before/after provided, explicitly compare them (what changed, what regressed, alignment deltas).
-- Produce structured, machine-readable output for the root agent:
-  - Summary of key findings (1-3 sentences).
-  - Evidence list with file/selector refs and exact quotes or measurements.
-  - Pass/Fail against any stated acceptance criteria (if given).
-  - Recommended next actions (e.g. "crop the screenshot at [x,y,w,h] for deeper look" or "ask visual-looker for text extraction on region X").
-- Do not propose broad redesigns unless asked; focus on judgment, verification, and evidence. Hand design decisions and implementation planning back to the root agent.
+Responsibilities:
+- Decide whether visual acceptance criteria pass, fail, or need more evidence.
+- Compare before/after or reference/current images and name concrete regressions.
+- Convert visual observations into concise implementation or QA recommendations.
+- Ask visual-looker for raw extraction when the missing piece is simply "what is visible here".
-When used in ULW/QA/reviewer/final-verdict flows, require a visual pass before treating visual acceptance as complete. Use looker for raw evidence description when the primary need is "what is visible here".
+Evidence standard:
+- Ground every claim in visible facts: exact text, coordinates/regions, alignment or spacing deltas, contrast notes, chart values, missing elements, clipping, overlap, z-order, or structural mismatch.
+- If acceptance criteria are supplied, score each criterion as PASS, FAIL, or UNKNOWN.
+- Do not invent intent, hidden state, or unavailable measurements.
+Output format:
+- Summary: 1-3 sentences.
+- Evidence: bullets with file/image/region references and exact quotes or measurements.
+- Verdict: PASS, FAIL, or NEEDS_MORE_EVIDENCE against stated criteria.
+- Next actions: minimal concrete steps for the root agent.
+In ULW, QA, reviewer, or final-verdict flows, visual acceptance is incomplete until a visual pass is recorded. Keep broad redesign and implementation ownership with the root agent unless explicitly asked.
 """

package/agent-configs/visual-looker.toml CHANGED Viewed

@@ -6,18 +6,18 @@ model_reasoning_effort = "high"
 service_tier = "default"
 developer_instructions = """
-You are a multimodal vision evidence inspector.
+You are the evidence-only multimodal vision inspector.
-Inspect any visual artifact: screenshots, rendered documents, images, UI captures, diagrams, charts, game assets, photos, illustrations, sprites, and other visual evidence. Report ONLY what is visible or strongly implied by the artifact(s).
+Scope: screenshots, rendered documents, images, UI captures, diagrams, charts, game assets, photos, illustrations, sprites, and visual diffs. Report only what is visible or strongly implied by the artifact.
-Focus on concrete, citable findings:
-- Exact visible text (quote verbatim, note font/contrast if relevant).
-- Layout/structural details: positions, overlaps, clipping, alignment, spacing, z-order, overflows.
-- Chart/diagram data: read values, labels, legends, trends, anomalies (be precise).
-- Comparison: if reference or multiple images, list deltas (added/removed/changed elements with locations).
-- Problems: broken layout, unreadable contrast, missing content, visual regressions, mismatches.
+Extract concrete, citable facts:
+- Exact visible text, quoted verbatim.
+- Locations, coordinates, regions, alignment, spacing, clipping, overlap, z-order, and overflow.
+- Chart or diagram labels, values, legends, trends, and anomalies.
+- Before/after or reference/current deltas by location.
+- Visible breakage, unreadable contrast, missing content, and mismatches.
-Return concise, evidence-oriented findings. Include file/region references and exact quotes/coordinates where possible. Do not propose broad redesigns unless asked; hand judgment and implementation planning back to the root agent or the visual-engineering agent.
+Output concise evidence bullets with file/image/region references. Use UNKNOWN when a detail cannot be read. Do not judge taste, rewrite requirements, or propose redesigns unless explicitly asked.
-When the hook or root asks for visual verification pass, provide the raw observable evidence the root or visual-engineering can use for acceptance decision.
+When asked for verification, provide the raw observations that the root agent or visual-engineering can use for an acceptance decision.
 """

package/agent-overrides/omo.json CHANGED Viewed

@@ -7,7 +7,7 @@
       "model": "gpt-5.4-mini",
       "model_reasoning_effort": "low",
       "service_tier": "fast",
-      "model_fallback": "grok-4.20-0309-non-reasoning",
+      "model_fallback": "grok-3-mini-fast",
       "model_fallback_reasoning_effort": "low",
       "model_fallback_service_tier": "default"
     },
@@ -15,15 +15,15 @@
       "model": "gpt-5.4-mini",
       "model_reasoning_effort": "low",
       "service_tier": "fast",
-      "model_fallback": "glm-5.1",
-      "model_fallback_reasoning_effort": "medium",
+      "model_fallback": "grok-3-mini-fast",
+      "model_fallback_reasoning_effort": "low",
       "model_fallback_service_tier": "default"
     },
     "metis": {
       "model": "gpt-5.5",
       "model_reasoning_effort": "high",
       "service_tier": "default",
-      "model_fallback": "grok-4.3",
+      "model_fallback": "gemini-pro-agent",
       "model_fallback_reasoning_effort": "high",
       "model_fallback_service_tier": "default"
     }

package/hooks/hooks.json CHANGED Viewed

@@ -17,27 +17,9 @@
         "hooks": [
           {
             "type": "command",
-            "command": "node \"${PLUGIN_ROOT}/scripts/sync-agent-overrides-hook.mjs\"",
-            "timeout": 5,
-            "statusMessage": "LFP: Loading model overrides"
-          },
-          {
-            "type": "command",
-            "command": "node \"${PLUGIN_ROOT}/scripts/visual-engineering-hook.mjs\"",
-            "timeout": 5,
-            "statusMessage": "LFP: Checking Vision Agent Guidance"
-          },
-          {
-            "type": "command",
-            "command": "node \"${PLUGIN_ROOT}/scripts/art-team-hook.mjs\"",
-            "timeout": 5,
-            "statusMessage": "LFP: Checking Art Team Guidance"
-          },
-          {
-            "type": "command",
-            "command": "node \"${PLUGIN_ROOT}/scripts/model-fallback-guidance.mjs\"",
-            "timeout": 5,
-            "statusMessage": "LFP: Checking model fallback guidance"
+            "command": "node \"${PLUGIN_ROOT}/scripts/user-prompt-submit.mjs\"",
+            "timeout": 10,
+            "statusMessage": "LFP: Checking guidance and syncing overrides"
           }
         ]
       }

package/package.json CHANGED Viewed

@@ -1,6 +1,6 @@
 {
   "name": "@islee23520/lfp",
-  "version": "0.3.9",
+  "version": "0.3.11",
   "description": "LazyCodex flavour pack with lightweight agent override sync.",
   "type": "module",
   "license": "MIT",
@@ -32,30 +32,43 @@
     "agent-configs",
     "agent-overrides",
     "hooks",
+    "scripts/agent-model-config-io.mjs",
     "scripts/agent-model-config.mjs",
     "scripts/art-team-config.mjs",
     "scripts/art-team-hook.mjs",
+    "scripts/cli-args.mjs",
     "scripts/cli-reporting.mjs",
     "scripts/cli.mjs",
     "scripts/codex-apps-cache.mjs",
     "scripts/codex-plugin-install.mjs",
     "scripts/codex-provider-config.mjs",
+    "scripts/global-model-defaults.mjs",
     "scripts/install-transaction.mjs",
     "scripts/lazycodex-install.mjs",
     "scripts/mcp-model-fallback.mjs",
+    "scripts/model-benchmark-recommendations.mjs",
+    "scripts/model-benchmark-scenarios.mjs",
+    "scripts/model-benchmark-overrides.mjs",
+    "scripts/model-benchmark-results.mjs",
+    "scripts/model-benchmark.mjs",
     "scripts/model-config-prompts.mjs",
+    "scripts/model-field-scope.mjs",
     "scripts/model-fallback-guidance.mjs",
     "scripts/model-fallback-resolver.mjs",
     "scripts/model-override-config.mjs",
     "scripts/model-override-schema.mjs",
     "scripts/model-provider.mjs",
+    "scripts/model-reasoning-compat.mjs",
     "scripts/model-recommendations.mjs",
     "scripts/provider-consent.mjs",
     "scripts/runtime-promotion.mjs",
     "scripts/setup-command.mjs",
+    "scripts/setup-provider-tui.mjs",
+    "scripts/setup-provider.mjs",
     "scripts/setup-tui.mjs",
     "scripts/sync-agent-overrides-hook.mjs",
     "scripts/sync-agent-overrides.mjs",
+    "scripts/user-prompt-submit.mjs",
     "scripts/user-model-overrides.mjs",
     "scripts/visual-engineering-hook.mjs",
     "README.md"