npm - squeez - Versions diffs - 1.9.0 → 1.11.0 - Mend

squeez 1.9.0 → 1.11.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (3)

package/CHANGELOG.md CHANGED

@@ -9,6 +9,39 @@ conventional commit messages on `main`.
 ## [Unreleased]
+## [1.10.0] - 2026-04-22
+### Added
+- feat(fs): auto-compress cat'd markdown via compress_md pipeline (#88)
+- feat(economy): compress Agent/Task prompt at PreToolUse time (#87)
+- feat(handlers): xcodebuild noise filter + log-file tail detection (#84)
+### Changed
+- chore(release): bump version to 1.10.0 [squeez-release-bot]
+- docs(readme): document squeez's compression scope and architectural limits (#85)
+- chore(release): bump version to 1.9.0 [squeez-release-bot]
+- chore(release): bump version to 1.8.0 [squeez-release-bot]
+- docs: update changelog and benchmarks for v1.7.7
+- chore(release): bump version to 1.7.7 [squeez-release-bot]
+- docs: update changelog and benchmarks for v1.7.6
+- chore(release): bump version to 1.7.6 [squeez-release-bot]
+- docs: update changelog and benchmarks for v1.7.5
+- chore(release): bump version to 1.7.5 [squeez-release-bot]
+- docs: update changelog and benchmarks for v1.7.4
+- chore(release): bump version to 1.7.4 [squeez-release-bot]
+- perf: O(log n) refactor — tail read, offset index, single-pass parser
+- docs: update changelog and benchmarks for v1.7.3
+- chore(release): bump version to 1.7.3 [squeez-release-bot]
+### Fixed
+- fix(hosts): expand PreToolUse matcher to cover Read, Grep, Glob, Agent, Task (#86)
+- fix(json_util,statusline): whitespace-tolerant JSON parser + UTF-8 statusline on Windows (#81)
+- fix(hosts): preserve user settings.json when existing JSON fails to parse (#83)
+- prevent OOM from unbounded file reads + data integrity fixes
+- fix(ci): use native ubuntu-24.04-arm runner for aarch64 build
+## [1.9.0] - 2026-04-22
 ## [1.7.7] - 2026-04-21
 ### Changed
@@ -98,7 +131,7 @@ conventional commit messages on `main`.
 ## [1.5.1] and earlier
 See the [git tag history](https://github.com/claudioemmanuel/squeez/tags) for pre-1.5.2 details. release-please takes over changelog generation from 1.7.1 onwards.
-[Unreleased]: https://github.com/claudioemmanuel/squeez/compare/v1.7.7...HEAD
+[Unreleased]: https://github.com/claudioemmanuel/squeez/compare/v1.10.0...HEAD
 [1.7.0]: https://github.com/claudioemmanuel/squeez/compare/v1.6.1...v1.7.0
 [1.6.1]: https://github.com/claudioemmanuel/squeez/compare/v1.6.0...v1.6.1
 [1.6.0]: https://github.com/claudioemmanuel/squeez/compare/v1.5.2...v1.6.0
@@ -108,3 +141,5 @@ See the [git tag history](https://github.com/claudioemmanuel/squeez/tags) for pr
 [1.7.5]: https://github.com/claudioemmanuel/squeez/compare/v1.7.2...v1.7.5
 [1.7.6]: https://github.com/claudioemmanuel/squeez/compare/v1.7.2...v1.7.6
 [1.7.7]: https://github.com/claudioemmanuel/squeez/compare/v1.7.2...v1.7.7
+[1.9.0]: https://github.com/claudioemmanuel/squeez/compare/v1.10.0...v1.9.0
+[1.10.0]: https://github.com/claudioemmanuel/squeez/compare/v1.7.2...v1.10.0

package/README.md CHANGED

@@ -110,6 +110,45 @@ squeez update --insecure  # skip checksum (not recommended)
 ---
+## Scope & Limits
+squeez optimizes what it can reach — the surfaces exposed by each host's hook API. It cannot fix token leaks outside those surfaces.
+### Coverage table
+| Surface | How | When | Supported hosts |
+|---|---|---|---|
+| **Bash stdout/stderr** | `PreToolUse` wraps command w/ 4-stage pipeline (smart-filter → dedup → grouping → truncation) | Every Bash invocation | all 5 |
+| **Read / Grep / Glob limits** | `PreToolUse` injects `limit` / `head_limit` per `read_max_lines` / `grep_max_results` | Every Read/Grep/Glob call | Claude Code, Copilot, OpenCode (hard); Gemini + Codex soft via GEMINI.md / AGENTS.md |
+| **Agent / Task prompt** | `PreToolUse` compresses `tool_input.prompt` (markdown-aware, via `compress-prompt`) | When prompt > `agent_prompt_max_tokens` | Claude Code (post–v1.8.0) |
+| **Session memory** | `SessionStart` injects prior session summary + file-access cache | Once per session start | all 5 |
+| **Markdown viewing** | Bash handler routes `.md` reads through `compress-md` when `auto_compress_md=true` | Viewer commands on .md paths | all 5 |
+### What squeez CANNOT compress
+**Agent/Task returned output.** `PostToolUse` hooks are observation-only — squeez sees the result *after* the model has received it. No hook API surface exists to rewrite an Agent's return value. Workaround: keep agent prompts compact (squeez compresses at dispatch time), and use `squeez_agent_costs` MCP tool to monitor spawn overhead.
+**Skills & slash-command files.** Claude Code loads these into the system prompt before any hook fires. squeez has no visibility into session-start system prompt construction.
+**User's top-level prompt.** squeez runs per tool call, not on user turns.
+**Tools whose host doesn't expose PreToolUse / BeforeTool.** E.g. Codex `PreToolUse` today only fires on Bash (tracked upstream at [openai/codex#18491](https://github.com/openai/codex/issues/18491)), so Read/Grep caps for Codex are soft hints in AGENTS.md, not hard injections.
+### Secondary wins (not compression, but token-saving)
+- **Cross-call redundancy dedup** — exact-hash and fuzzy-trigram collapsing across 16 recent calls (see [Context engine](#what-it-does))
+- **File-access cache** — subsequent Bash commands trimmed when re-reading a file squeez has already fingerprinted
+- **Burn-rate warnings** — `[budget: ~N calls left]` nudges so the user changes behavior before context pressure spikes
+### Reducing overall session cost
+squeez cannot automate these, but you can:
+- Fewer Agent/Task dispatches per session → use `squeez_agent_costs` to track, then refactor tasks to batch work
+- Smaller prompts injected into agents → squeez compresses them at dispatch, but smaller is better
+- Shorter CLAUDE.md / AGENTS.md files → run `squeez compress-md --ultra` to drop abbreviations and filler
+---
 ## Benchmarks
 <!-- BENCHMARK:START -->
@@ -119,49 +158,49 @@ Measured on macOS (Apple Silicon). Token count = `chars / 4` (matches Claude's ~
 | Scenario | Before | After | Reduction | Latency |
 |----------|--------|-------|-----------|---------|
-| `summarize_huge` | 82,257 tk | 420 tk | **-99%** | 55.6 ms |
-| `repetitive_output` | 4,692 tk | 37 tk | **-99%** | 214 µs |
-| `high_context_adaptive` | 4,418 tk | 52 tk | **-99%** | 807 µs |
+| `summarize_huge` | 82,257 tk | 420 tk | **-99%** | 55.7 ms |
+| `repetitive_output` | 4,692 tk | 37 tk | **-99%** | 211 µs |
+| `high_context_adaptive` | 4,418 tk | 52 tk | **-99%** | 791 µs |
 | `ps_aux` | 40,373 tk | 2,352 tk | **-94%** | 2.7 ms |
-| `git_log_200` | 2,692 tk | 289 tk | **-89%** | 205 µs |
-| `tsc_errors` | 731 tk | 101 tk | **-86%** | 28 µs |
-| `cargo_build_noisy` | 2,106 tk | 452 tk | **-79%** | 238 µs |
-| `docker_logs` | 665 tk | 186 tk | **-72%** | 44 µs |
-| `find_deep` | 424 tk | 134 tk | **-68%** | 80 µs |
+| `git_log_200` | 2,692 tk | 289 tk | **-89%** | 218 µs |
+| `tsc_errors` | 731 tk | 101 tk | **-86%** | 30 µs |
+| `cargo_build_noisy` | 2,106 tk | 452 tk | **-79%** | 247 µs |
+| `docker_logs` | 665 tk | 186 tk | **-72%** | 49 µs |
+| `find_deep` | 424 tk | 134 tk | **-68%** | 83 µs |
 | `git_status` | 50 tk | 16 tk | **-68%** | 11 µs |
-| `state_first_simulation` | 182 tk | 69 tk | **-62%** | 12 µs |
-| `verbose_app_log` | 4,957 tk | 1,991 tk | **-60%** | 289 µs |
+| `verbose_app_log` | 4,957 tk | 1,991 tk | **-60%** | 287 µs |
 | `npm_install` | 524 tk | 232 tk | **-56%** | 45 µs |
-| `claude_md_overhead` | 717 tk | 318 tk | **-56%** | 340 µs |
-| `crosscall_redundancy_3x` | 486 tk | 241 tk | **-50%** | 51.5 ms |
+| `crosscall_redundancy_3x` | 486 tk | 241 tk | **-50%** | 51.6 ms |
 | `ls_la` | 1,782 tk | 886 tk | **-50%** | 208 µs |
-| `env_dump` | 441 tk | 287 tk | **-35%** | 24 µs |
+| `env_dump` | 441 tk | 287 tk | **-35%** | 23 µs |
 | `git_copilot` | 640 tk | 421 tk | **-34%** | 104 µs |
-| `agent_heavy` | 2,306 tk | 1,564 tk | **-32%** | 379 µs |
-| `md_prose` | 187 tk | 138 tk | **-26%** | 622 µs |
-| `md_claude_md` | 316 tk | 247 tk | **-22%** | 1.1 ms |
-| `git_diff` | 502 tk | 497 tk | **-1%** | 43 µs |
-| `kubectl_pods` | 1,513 tk | 1,513 tk | **-0%** | 27 µs |
+| `agent_heavy` | 2,306 tk | 1,564 tk | **-32%** | 386 µs |
+| `md_prose` | 187 tk | 138 tk | **-26%** | 657 µs |
+| `md_claude_md` | 316 tk | 247 tk | **-22%** | 1.2 ms |
+| `claude_md_overhead` | 717 tk | 649 tk | **-9%** | 22 µs |
+| `git_diff` | 502 tk | 497 tk | **-1%** | 41 µs |
+| `state_first_simulation` | 182 tk | 181 tk | **-1%** | 5 µs |
+| `kubectl_pods` | 1,513 tk | 1,513 tk | **-0%** | 25 µs |
 ### Aggregate
 | Metric | Value |
 |--------|-------|
-| **Total token reduction** | **91.9%** — 152,961 tk → 12,443 tk |
+| **Total token reduction** | **91.6%** — 152,961 tk → 12,886 tk |
 | Bash output | **-84.9%** |
 | Markdown / context files | **-23.5%** |
 | Wrap / cross-call engine | **-99.2%** |
 | Quality (signal terms preserved) | **23 / 23 pass** |
 | Latency p50 (filter mode) | **5.0 ms** |
-| Latency p95 (incl. wrap/summarize) | **51 ms** |
+| Latency p95 (incl. wrap/summarize) | **52 ms** |
 ### Estimated cost savings — Claude Sonnet 4.6 · $3.00 / MTok input
 | Usage | Baseline / month | Saved / month |
 |-------|-----------------|---------------|
-| 100 calls / day | $18.00 | **$16.54 (92%)** |
-| 1,000 calls / day | $180.00 | **$165.37 (92%)** |
-| 10,000 calls / day | $1800.00 | **$1653.66 (92%)** |
+| 100 calls / day | $18.00 | **$16.48 (92%)** |
+| 1,000 calls / day | $180.00 | **$164.84 (92%)** |
+| 10,000 calls / day | $1800.00 | **$1648.44 (92%)** |
 <!-- BENCHMARK:END -->
 ---
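Note on the benchmark figures in the README hunk above: the updated aggregate and cost rows can be reproduced from the before/after totals. The sketch below is a cross-check, not squeez's benchmark code. The function names are hypothetical, the integer division in `estimate_tokens` is an assumption (the README only states `chars / 4`), and rounding the reduction to four decimals before multiplying is inferred from the published dollar amounts rather than documented.

```python
# Totals copied from the updated "Aggregate" table in the README diff.
BEFORE_TOKENS = 152_961
AFTER_TOKENS = 12_886


def estimate_tokens(text: str) -> int:
    """README convention: token count = chars / 4 (integer division assumed)."""
    return len(text) // 4


def reduction(before: int, after: int) -> float:
    """Fraction of tokens removed, e.g. 0.5 means half the tokens are gone."""
    return 1 - after / before


def monthly_saving(baseline_usd: float, reduction_fraction: float) -> float:
    """Dollars saved per month for a given baseline spend.

    Assumption: the fraction is rounded to four decimals first, which is
    what makes the result match the README's published savings column.
    """
    return round(baseline_usd * round(reduction_fraction, 4), 2)


r = reduction(BEFORE_TOKENS, AFTER_TOKENS)
print(f"{r:.1%}")                  # 91.6%
print(monthly_saving(18.00, r))    # 16.48
print(monthly_saving(180.00, r))   # 164.84
```

The same functions applied to the old total (12,443 tk) give 91.9% and $16.54, matching the removed rows, so the drop in the headline number follows from the `claude_md_overhead` and `state_first_simulation` rows compressing far less in the new run.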

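Note on the "Cross-call redundancy dedup" bullet added to the README: it describes exact-hash and fuzzy-trigram collapsing across 16 recent calls. A minimal sketch of that idea follows, assuming SHA-256 for the exact hash and Jaccard similarity over character trigrams for the fuzzy match. The class name, the choice of hash, the similarity measure, and the 0.85 threshold are all hypothetical; the README states only the window size and the two matching modes.

```python
import hashlib
from collections import deque

WINDOW = 16        # "16 recent calls", per the README
SIMILARITY = 0.85  # hypothetical threshold; the README does not state one


def trigrams(text: str) -> set:
    """All distinct 3-character substrings of the text."""
    return {text[i:i + 3] for i in range(len(text) - 2)}


def jaccard(a: set, b: set) -> float:
    """Overlap of two sets: |a & b| / |a | b| (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


class RecentOutputs:
    """Remembers the last WINDOW tool outputs and flags repeats."""

    def __init__(self) -> None:
        # Each entry is (sha256 hex digest, trigram set); old entries fall off.
        self._recent = deque(maxlen=WINDOW)

    def classify(self, output: str) -> str:
        digest = hashlib.sha256(output.encode("utf-8")).hexdigest()
        grams = trigrams(output)
        verdict = "new"
        for seen_digest, seen_grams in self._recent:
            if seen_digest == digest:
                verdict = "exact-duplicate"
                break
            if jaccard(grams, seen_grams) >= SIMILARITY:
                verdict = "near-duplicate"
        self._recent.append((digest, grams))
        return verdict
```

A caller would replace an output classified as `exact-duplicate` or `near-duplicate` with a short back-reference instead of resending it. The bounded deque keeps memory constant and means an output older than 16 calls is treated as new again.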
package/package.json CHANGED

@@ -1,6 +1,6 @@
 {
   "name": "squeez",
-  "version": "1.9.0",
+  "version": "1.11.0",
   "description": "Hook-based token compressor for Claude Code, Copilot CLI, and OpenCode. Compresses bash output up to 95%, collapses redundant calls, injects caveman persona.",
   "keywords": [
     "claude-code",