npm - @pmaddire/gcie - Versions diffs - 0.1.3 → 0.1.5 - Mend

@pmaddire/gcie 0.1.3 → 0.1.5

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (15) hide show

package/AGENT.md +6 -2
package/GCIE_USAGE.md +290 -0
package/README.md +31 -9
package/SETUP_ANY_REPO.md +2 -2
package/bench_questions.py +69 -0
package/cli/app.py +198 -162
package/cli/commands/adaptation.py +341 -0
package/cli/commands/context.py +682 -34
package/cli/commands/context_slices.py +1322 -601
package/cli/commands/setup.py +18 -4
package/context/architecture_slicer.py +2 -1
package/llm_context/snippet_selector.py +1 -1
package/package.json +1 -1
package/retrieval/hybrid_retriever.py +9 -1
package/AGENT_USAGE.md +0 -231

package/AGENT.md CHANGED Viewed

@@ -1,10 +1,14 @@
 # AGENT.md
-Agent Operating Instructions for GraphCode Intelligence Engine (GCIE)
+Agent Operating Instructions for GraphCode Intelligence Engine (GCIE)
 This file provides persistent architectural context for coding agents working on this repository.
-Agents must read this file before performing any development tasks.
+Agents must read this file before performing any development tasks.
+Agent trigger:
+Use GCIE for context lookup before reading files or making edits.
+Follow `GCIE_USAGE.md` for the adaptive retrieval workflow and mode-routing rules.
 ---

package/GCIE_USAGE.md ADDED Viewed

@@ -0,0 +1,290 @@
+# GCIE Agent Usage (Portable, Accuracy-First)
+Trigger line for agent instructions:
+`Use GCIE for context lookup before reading files or making edits. Follow GCIE_USAGE.md.`
+## Goal
+Retrieve the smallest useful context without sacrificing edit safety.
+Priority order:
+1. accuracy (must-have coverage)
+2. full-hit reliability
+3. token efficiency
+## Core Rules
+1. Do not trade recall for token savings.
+2. Stop retrieval as soon as must-have categories are covered.
+3. Adapt per task family, not per one-off query.
+4. Keep defaults portable; keep repo-specific learning in `.gcie` state.
+## Commands (Tool-Synced)
+Primary retrieval:
+```powershell
+gcie.cmd context <path> "<query>" --intent <edit|debug|refactor|explore> --budget <auto|int> --mode <basic|adaptive>
+```
+Sliced retrieval:
+```powershell
+gcie.cmd context-slices <path> "<query>" --intent <edit|debug|refactor|explore> --profile <low|recall|adaptive>
+```
+Adaptive profile state:
+```powershell
+gcie.cmd adaptive-profile .
+gcie.cmd adaptive-profile . --clear
+```
+Post-init adaptation pipeline:
+```powershell
+gcie.cmd adapt . --benchmark-size 10 --efficiency-iterations 5 --clear-profile
+```
+One-shot setup + adaptation:
+```powershell
+gcie.cmd setup . --adapt --adapt-benchmark-size 10 --adapt-efficiency-iterations 5
+```
+Setup and index:
+```powershell
+gcie.cmd setup .
+gcie.cmd index .
+```
+Direct verification:
+```powershell
+rg -n "symbol1|symbol2|symbol3" <likely files or subtree>
+```
+## Must-Have Coverage Gate (Required)
+Context is sufficient only when all needed categories are present:
+- implementation file(s)
+- wiring/orchestration/caller file(s)
+- validation surface when risk is non-trivial (test/spec/schema/config/contract/CLI surface)
+If any must-have file is missing, retrieval is incomplete.
+If a must-have file appears only as compact/skeleton context, re-query that file explicitly (pin or targeted query) before editing.
+Note: tests/spec files are often excluded by default. Add `--include-tests` only when test context is required.
+## Query Construction (Portable, High-Signal)
+Preferred pattern:
+`<file-a> <file-b> <function/component> <route/flag> <state/config-key>`
+Rules:
+1. Use file-first, symbol-heavy phrasing.
+2. Include explicit file paths when known.
+3. Include 2-6 distinctive symbols.
+4. Add caller/entry anchor when target is indirect.
+5. Avoid natural-language question phrasing.
+Example:
+- Bad: `How does architecture routing decide when to fall back?`
+- Good: `context/context_router.py context/fallback_evaluator.py architecture routing fallback confidence`
+## Retrieval Modes (Adaptive Router)
+Use three modes and route by observed outcomes:
+1. `slicer-first`
+2. `plain-context-first`
+3. `direct-file-check`
+Slicer-first:
+```powershell
+gcie.cmd context-slices <path> "<query>" --profile low --intent <intent>
+```
+Plain-context-first:
+```powershell
+gcie.cmd context <path> "<query>" --mode basic --intent <intent> --budget auto
+```
+Direct-file-check:
+```powershell
+rg -n "<symbols>" <files-or-subtree>
+```
+Routing policy:
+1. Start new repos in `slicer-first` bootstrap mode.
+2. If must-have coverage is incomplete after one slicer pass, switch that task to `plain-context-first`.
+3. If a task family misses with slicer 2+ times in calibration, set that family default to `plain-context-first`.
+4. Keep slicer for families where it is both accurate and cheaper.
+5. If two GCIE attempts still miss required files, use `direct-file-check` and mark family `manual-verify-required` until recalibrated.
+## Scope and Budget Baseline (Portable)
+Scope rule:
+1. Use the smallest path scope that still contains expected files.
+2. Use repo root `.` only for true cross-layer recovery.
+3. If explicit targets cluster in one subtree, subtree scope is usually better than root.
+Profile ladder (concrete, portable):
+1. `context-slices --profile low`
+2. if miss: `context-slices --profile recall`
+3. if miss: `context-slices --profile recall --pin <missing-file>`
+4. if miss: `rg` direct check and targeted file retrieval
+Plain-context budget baseline:
+- `auto`: simple same-layer or strong single-file lookup
+- `900`: same-family two-file lookup
+- `1100`: backend/config pair or same-layer backend pair
+- `1150`: cross-layer UI/API flow
+- `1300-1400`: explicit multi-hop chain
+Gap-fill baseline:
+- general implementation/wiring file: `900`
+- small entry/orchestrator file: `500`
+## Adaptive Recovery Order (One Change At A Time)
+When retrieval is weak, apply in this exact order:
+1. Query upgrade: add explicit files, symbols, caller/entry anchor
+2. Scope correction: subtree vs root
+3. One profile/budget escalation
+4. Targeted gap-fill for only missing must-have file(s)
+5. Multi-hop decomposition only if still incomplete
+Stop condition:
+- If a required file is still missing after two GCIE attempts (with query+scope corrected), stop GCIE retries and use `rg`.
+## Architecture Tracking (Portable, In-Repo)
+Track these under `.gcie/`:
+- `.gcie/architecture.md`
+- `.gcie/architecture_index.json`
+- `.gcie/context_config.json`
+Keep adaptive:
+1. Bootstrap from user docs once (`ARCHITECTURE.md`, `README.md`, `PROJECT.md`, `docs/*architecture*`).
+2. Treat `.gcie/architecture.md` as GCIE-owned working map.
+3. Refresh architecture files when boundaries/subsystems/interfaces change.
+4. Do not overwrite user-owned docs unless explicitly asked.
+Fallback confidence rule:
+- If architecture confidence is low or mappings are stale/missing, fallback to plain context and record reason in `.gcie/context_config.json`.
+## Pre-Calibration Readiness Gate (Required)
+Run before full adaptation:
+1. Index + architecture refresh:
+```powershell
+gcie.cmd index .
+```
+2. Readiness probe:
+- Run 10-20 stratified queries across major families using `context-slices` and `context`.
+3. Readiness adaptation loop:
+- Apply recovery order (query -> scope -> escalation -> targeted gap-fill -> decomposition).
+4. Gate decision:
+- Proceed to calibration only after coverage is reachable with stable behavior.
+- If not reachable, keep safer fallback mode for affected families and continue tracking.
+## Automatic Post-Trigger Adaptation (Required)
+After trigger detection in a repo session:
+1. `checkpoint: trigger_detected`
+2. Run `gcie.cmd index .` -> `checkpoint: index_complete`
+3. Run readiness probe -> `checkpoint: readiness_probe_complete`
+4. Run accuracy calibration to 100% must-have hit -> `checkpoint: accuracy_lock_complete`
+5. Run efficiency iterations only under hard accuracy gate -> `checkpoint: efficiency_complete`
+6. Run final stress validation (recommended 50-query) -> `checkpoint: stress_validation_complete`
+7. Write back results (`.planning`, `.gcie/context_config.json`, learned overrides section) -> `checkpoint: write_back_complete`
+If any checkpoint fails, mark run `incomplete`, record failure artifact in `.planning/`, and continue recovery/fallback flow.
+## Mandatory Bootstrap Calibration Sequence
+1. Recall calibration stage (required):
+- Tune mode/scope/query/profile until overall and per-family hit rates are 100%.
+2. Recall lock verification (required):
+- Require 2 consecutive 100% lock runs.
+3. Efficiency stage (optional, only after lock):
+- Test controlled reductions one change at a time.
+- Immediately rollback any hit-rate regression.
+4. Activation rule (required):
+- Activate only if lock/stress pass.
+- If stress fails, rollback to last known 100%-hit config.
+## Metrics and Decision Rules
+Per query, record:
+- must-have hit (true/false)
+- tokens used
+- retrieved files
+- escalations performed
+Track overall and by family:
+- hit rate
+- average and median tokens
+- tokens-per-hit (`total_tokens / hit_count`)
+Selection rule per family:
+1. highest hit rate
+2. if tie: lowest tokens-per-hit
+3. if tie: lowest median tokens
+Demotion rules:
+- If slicer miss-rate > 0% during recall calibration, do not keep slicer as default for that family.
+- If both slicer and plain fail, route family to manual-verify until recalibration.
+Promotion rules:
+- Promote only configurations that preserve 100% hit.
+- Efficiency changes must improve tokens without reducing hit rate.
+## Continuous Adaptation Over Time
+Trigger recalibration when any are true:
+1. major repo-change signal (large refactor/churn)
+2. savings decay (rolling savings drops materially vs active baseline)
+3. repeated family misses (2+ in recent window)
+Guardrails:
+1. Use a minimum evidence window (recommended: 20 retrieval events).
+2. Run in quiet/background mode when possible.
+3. Cap adaptation budget per cycle.
+4. Early-stop efficiency loop after 2 non-improving iterations.
+5. Prefer family-scoped recalibration before full recalibration.
+## Persistence
+Persist learned defaults in `.gcie/context_config.json` and `.gcie/retrieval_profile.json` with:
+- family
+- default mode/profile
+- last benchmark date
+- hit/token metrics
+Write repo-local learned routing here:
+## Learned Routing Overrides (Repo-Local, Mutable)
+No active learned overrides yet.
+Populate after first full adaptation cycle.
+## Agent Instructions Snippet (Copy/Paste)
+```text
+Use GCIE for context lookup before reading files or making edits. Follow GCIE_USAGE.md.
+Prioritize must-have coverage over token savings.
+Start with context-slices --profile low, then adapt using recovery order:
+query -> scope -> profile/budget escalation -> targeted gap-fill -> rg fallback.
+```
+## Notes
+1. This file is intentionally generalized and adaptive for any repo.
+2. Keep repo-specific tuning in learned overrides and `.gcie` state, not in global defaults.
+3. If in doubt, choose the higher-accuracy path first, then optimize tokens after lock.

package/README.md CHANGED Viewed

@@ -49,7 +49,7 @@ cost of sending full repo surfaces to the model.
 Use this when you want a fast drop-in setup for coding agents.
 1. Install GCIE CLI in the target repo (via your preferred method: npm link, local wrapper, or direct Python module).
-2. Copy [AGENT_USAGE.md](c:\GBCRSS\AGENT_USAGE.md) into the target repo root.
+2. Copy [GCIE_USAGE.md](c:\GBCRSS\GCIE_USAGE.md) into the target repo root.
 3. Run one index pass:
    - `gcie.cmd index .`
 4. Start using adaptive retrieval immediately:
@@ -58,7 +58,7 @@ Use this when you want a fast drop-in setup for coding agents.
 No heavy upfront tuning is required. The workflow starts portable-first and only adds local overrides after repeated miss patterns.
 One-command repo bootstrap:
-- `gcie.cmd setup .`
+- `npx -y @pmaddire/gcie@latest setup .`
 This creates `.gcie` architecture tracking files, copies portable agent workflow docs, and runs an initial index pass.
@@ -101,7 +101,7 @@ Net: adapted protocol preserved full accuracy while reducing average tokens by ~
 After publishing to npm, users can set up any repo with one command:
 ```powershell
-npx gcie@latest
+npx -y @pmaddire/gcie@latest setup .
 ```
 This runs `gcie setup .` in the current repo by default.
@@ -110,10 +110,32 @@ If Python deps are missing, GCIE now bootstraps a local package venv and install
 Optional setup flags are passed through:
 ```powershell
-npx gcie@latest --no-index
-npx gcie@latest --force
+npx -y @pmaddire/gcie@latest setup . --no-index
+npx -y @pmaddire/gcie@latest setup . --force
 ```
+For command-only usage without setup:
+```powershell
+npx -y @pmaddire/gcie@latest --help
+```
+## Agent Integration
+To make your coding agent use GCIE automatically, add this trigger line to your
+agent instructions (system prompt / repo instruction file):
+`Use GCIE for context lookup before reading files or making edits. Follow GCIE_USAGE.md.`
+Required file:
+- keep `GCIE_USAGE.md` in the target repo root
+Recommended setup:
+1. Run one-command setup:
+   - `npx -y @pmaddire/gcie@latest setup .`
+2. Add the trigger line above to your agent instruction file.
+3. Start normal coding tasks; the agent should use GCIE-first retrieval workflow.
 ## One-Command GitHub Bootstrap
 Run this from the target repo to download GCIE from GitHub and set it up automatically:
@@ -146,7 +168,7 @@ What it does:
 1. In the GCIE repo:
    - `npm link`
 2. In your target repo:
-   - `npm link gcie`
+   - `npm link @pmaddire/gcie`
 3. Verify:
    - `gcie --help`
@@ -162,7 +184,7 @@ This repo includes a lightweight npm wrapper so you can run `gcie` like other np
 2. In target repo: `gcie --help`
 Local option:
-- `npm install` then `npx gcie --help`
+- `npm install` then `npx @pmaddire/gcie@latest --help`
 The wrapper prefers `.venv` in the GCIE repo and falls back to system Python.
@@ -216,7 +238,7 @@ Important note:
 - `gcie index <path>`
 - `gcie query <file.py> "<question>"`
 - `gcie debug <file.py> "<question>"`
-- `gcie context <repo|file> "<task>" --budget auto --intent <edit|debug|refactor|explore>`
+- `gcie context <repo|file> "<task>" --budget auto --intent <edit|debug|refactor|explore> --mode basic`
 - `gcie context-slices <repo> "<task>" --intent <edit|debug|refactor|explore> [--profile recall|low] [--stage-a 400] [--stage-b 800] [--max-total 1200] [--pin frontend/src/App.jsx] [--pin-budget 300] [--include-tests]`
 ## How To Use It
@@ -367,6 +389,6 @@ npm publish --access public
 Then users can run:
 ```powershell
-npx gcie@latest
+npx -y @pmaddire/gcie@latest setup .
 ```

package/SETUP_ANY_REPO.md CHANGED Viewed

@@ -43,7 +43,7 @@ Re-run indexing after major structural changes.
 ## 3) Add Agent Workflow File
-Copy `AGENT_USAGE.md` from GCIE into the target repo root.
+Copy `GCIE_USAGE.md` from GCIE into the target repo root.
 ## 4) Start With Portable Defaults
@@ -51,7 +51,7 @@ Copy `AGENT_USAGE.md` from GCIE into the target repo root.
 gcie.cmd context . "<task>" --intent <edit|debug|refactor|explore> --budget auto
 ```
-Then apply the adaptive loop in `AGENT_USAGE.md` if must-have coverage is incomplete.
+Then apply the adaptive loop in `GCIE_USAGE.md` if must-have coverage is incomplete.
 ## 5) Use Adaptive Mode Routing

package/bench_questions.py ADDED Viewed

@@ -0,0 +1,69 @@
+import os
+import pathlib
+from parser.ast_parser import parse_python_file
+from graphs.call_graph import build_call_graph
+from graphs.variable_graph import build_variable_graph
+from retrieval.hybrid_retriever import hybrid_retrieve
+from llm_context.snippet_selector import RankedSnippet, estimate_tokens
+from llm_context.context_builder import build_context
+ROOT=pathlib.Path('.')
+EXCLUDE={'__pycache__','.venv','venv'}
+py_files=[]
+for path in ROOT.rglob('*.py'):
+    if any(part in EXCLUDE for part in path.parts):
+        continue
+    py_files.append(path)
+snippets_by_node={}
+modules=[]
+for path in py_files:
+    try:
+        module=parse_python_file(path)
+    except Exception:
+        continue
+    modules.append(module)
+    text=path.read_text()
+    lines=text.splitlines()
+    for fn in module.functions:
+        start=max(0,fn.start_line-1)
+        end=min(len(lines),fn.end_line)
+        snippet='\n'.join(lines[start:end])
+        node=f"function:{path.as_posix()}::{fn.name}"
+        snippets_by_node[node]=snippet
+call_graph=build_call_graph(modules)
+var_graph=build_variable_graph(modules)
+graph=call_graph
+for node,attrs in var_graph.nodes(data=True):
+    if not graph.has_node(node):
+        graph.add_node(node,**attrs)
+for u,v,data in var_graph.edges(data=True):
+    graph.add_edge(u,v,**data)
+prompts=[
+    "Why is variable diff exploding?",
+    "How does git history mining handle empty repositories?",
+    "How do CLI index/query/debug commands work?",
+]
+def naive_tokens():
+    total=0
+    for path in py_files:
+        total+=estimate_tokens(path.read_text())
+    return total
+naive=naive_tokens()
+print('Prompt|GCIE tokens|Naive tokens|Reduction%|Selected snippets|Notes')
+for prompt in prompts:
+    hybrid=hybrid_retrieve(graph,prompt,top_k=10,git_recency_by_node={},coverage_risk_by_node={},max_hops=2)
+    ranked=[]
+    for cand in hybrid:
+        text=snippets_by_node.get(cand.node_id)
+        if not text:
+            continue
+        ranked.append(RankedSnippet(cand.node_id,text,cand.score))
+    context=build_context(prompt,ranked,token_budget=300)
+    reduction=(1-context.total_tokens_estimate/naive)*100 if naive else 0
+    note='good' if ranked else 'empty'
+    print(f"{prompt}|{context.total_tokens_estimate:.1f}|{naive:.1f}|{reduction:.1f}%|{len(context.snippets)}|{note}")