gcf-python 0.5.0__tar.gz → 0.5.1__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (29) hide show
  1. {gcf_python-0.5.0 → gcf_python-0.5.1}/PKG-INFO +11 -26
  2. {gcf_python-0.5.0 → gcf_python-0.5.1}/README.md +10 -25
  3. {gcf_python-0.5.0 → gcf_python-0.5.1}/pyproject.toml +1 -1
  4. {gcf_python-0.5.0 → gcf_python-0.5.1}/.github/workflows/ci.yml +0 -0
  5. {gcf_python-0.5.0 → gcf_python-0.5.1}/.github/workflows/publish.yml +0 -0
  6. {gcf_python-0.5.0 → gcf_python-0.5.1}/.gitignore +0 -0
  7. {gcf_python-0.5.0 → gcf_python-0.5.1}/CHANGELOG.md +0 -0
  8. {gcf_python-0.5.0 → gcf_python-0.5.1}/LICENSE +0 -0
  9. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/__init__.py +0 -0
  10. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/cli.py +0 -0
  11. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/constants.py +0 -0
  12. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/decode.py +0 -0
  13. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/decode_generic.py +0 -0
  14. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/delta.py +0 -0
  15. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/encode.py +0 -0
  16. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/generic.py +0 -0
  17. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/session.py +0 -0
  18. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/stream.py +0 -0
  19. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/stream_generic.py +0 -0
  20. {gcf_python-0.5.0 → gcf_python-0.5.1}/src/gcf/types.py +0 -0
  21. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/__init__.py +0 -0
  22. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_decode.py +0 -0
  23. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_delta.py +0 -0
  24. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_encode.py +0 -0
  25. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_generic.py +0 -0
  26. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_roundtrip.py +0 -0
  27. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_session.py +0 -0
  28. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_stream.py +0 -0
  29. {gcf_python-0.5.0 → gcf_python-0.5.1}/tests/test_stream_generic.py +0 -0
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: gcf-python
3
- Version: 0.5.0
3
+ Version: 0.5.1
4
4
  Summary: Python implementation of GCF (Graph Compact Format): token-optimized wire format for LLM tool responses
5
5
  Project-URL: Homepage, https://github.com/blackwell-systems/gcf-python
6
6
  Project-URL: Documentation, https://blackwell-systems.github.io/gcf/
@@ -32,7 +32,7 @@ Description-Content-Type: text/markdown
32
32
 
33
33
  Python implementation of [GCF](https://gcformat.com/) — the most token-efficient wire format for LLMs. A drop-in alternative to JSON and TOON for any structured data.
34
34
 
35
- **79% fewer input tokens than JSON. 75% fewer output tokens. 52% smaller than TOON. 100% LLM comprehension at 500 symbols, where JSON scores 76.9% and TOON scores 92.3%.**
35
+ **79% fewer input tokens than JSON. 63% fewer output tokens. 90.5% average comprehension accuracy across 10 models and 3 providers (four models hit 100%). 1,300+ LLM evaluations. Zero training.**
36
36
 
37
37
  Docs: [gcformat.com](https://gcformat.com/) · [Playground](https://gcformat.com/playground.html) · [GCF vs TOON](https://gcformat.com/guide/vs-toon.html)
38
38
 
@@ -204,33 +204,18 @@ Works on dicts, lists, and primitives. Lists of uniform dicts get tabular rows.
204
204
  | `Session` | Thread-safe tracker for multi-call deduplication |
205
205
  | `KIND_ABBREV` / `KIND_EXPAND` | Bidirectional kind abbreviation dicts |
206
206
 
207
- ## Comprehension Eval
207
+ ## Benchmarks
208
208
 
209
- Rigorous 3-way benchmark (GCF vs TOON vs JSON) at 500 symbols, 200 edges. 13 structured extraction questions sent to an LLM with zero format instructions:
209
+ 1,300+ LLM evaluations across 10 models, 3 providers, and 51 independent test runs.
210
210
 
211
- | Format | Accuracy | Tokens | vs JSON |
212
- |--------|----------|--------|---------|
213
- | **GCF** | **100%** (13/13) | **11,090** | **79% fewer** |
214
- | TOON | 92.3% (12/13) | 16,378 | 69% fewer |
215
- | JSON | 76.9% (10/13) | 53,341 | baseline |
211
+ | | GCF | TOON | JSON |
212
+ |---|---|---|---|
213
+ | **Comprehension** (23 runs, 10 models) | **90.5%** | 68.5% | 53.6% |
214
+ | **Generation** (28 runs, 9 models) | **5/5** | 1.0/5 | 5.0/5 |
215
+ | **Input tokens** (500 symbols) | **11,090** | 16,378 | 53,341 |
216
+ | **Output tokens** (100 symbols) | **5,976** | 8,937 | 16,121 |
216
217
 
217
- GCF is the only format with perfect accuracy at scale, at 32% fewer tokens than TOON.
218
-
219
- Reproduce: `git clone https://github.com/blackwell-systems/gcf-go && cd gcf-go/eval && GOWORK=off go test -run TestComprehension -v -timeout 0`
220
-
221
- ## Token Efficiency (TOON's Own Benchmark)
222
-
223
- Running [TOON's benchmark harness](https://github.com/blackwell-systems/toon/tree/gcf-comparison) with GCF inserted (their datasets, their tokenizer):
224
-
225
- | Track | GCF | TOON | Result |
226
- |-------|-----|------|--------|
227
- | Mixed-structure (nested, semi-uniform) | 170,367 | 227,896 | **GCF 34% smaller** |
228
- | Flat-only (tabular) | 66,029 | 67,837 | **GCF 3% smaller** |
229
- | Semi-uniform event logs | 108,158 | 154,032 | **GCF 42% smaller** |
230
-
231
- GCF wins all 6 datasets. On semi-uniform data (the most common real-world pattern), GCF uses 42% fewer tokens than TOON.
232
-
233
- Reproduce: `git clone https://github.com/blackwell-systems/toon && cd toon && git checkout gcf-comparison && cd benchmarks && pnpm install && pnpm benchmark:tokens`
218
+ GCF wins all 6 datasets on [TOON's own benchmark](https://github.com/blackwell-systems/toon/tree/gcf-comparison). Full results: [gcformat.com/guide/benchmarks](https://gcformat.com/guide/benchmarks.html)
234
219
 
235
220
  ## Links
236
221
 
@@ -7,7 +7,7 @@
7
7
 
8
8
  Python implementation of [GCF](https://gcformat.com/) — the most token-efficient wire format for LLMs. A drop-in alternative to JSON and TOON for any structured data.
9
9
 
10
- **79% fewer input tokens than JSON. 75% fewer output tokens. 52% smaller than TOON. 100% LLM comprehension at 500 symbols, where JSON scores 76.9% and TOON scores 92.3%.**
10
+ **79% fewer input tokens than JSON. 63% fewer output tokens. 90.5% average comprehension accuracy across 10 models and 3 providers (four models hit 100%). 1,300+ LLM evaluations. Zero training.**
11
11
 
12
12
  Docs: [gcformat.com](https://gcformat.com/) · [Playground](https://gcformat.com/playground.html) · [GCF vs TOON](https://gcformat.com/guide/vs-toon.html)
13
13
 
@@ -179,33 +179,18 @@ Works on dicts, lists, and primitives. Lists of uniform dicts get tabular rows.
179
179
  | `Session` | Thread-safe tracker for multi-call deduplication |
180
180
  | `KIND_ABBREV` / `KIND_EXPAND` | Bidirectional kind abbreviation dicts |
181
181
 
182
- ## Comprehension Eval
182
+ ## Benchmarks
183
183
 
184
- Rigorous 3-way benchmark (GCF vs TOON vs JSON) at 500 symbols, 200 edges. 13 structured extraction questions sent to an LLM with zero format instructions:
184
+ 1,300+ LLM evaluations across 10 models, 3 providers, and 51 independent test runs.
185
185
 
186
- | Format | Accuracy | Tokens | vs JSON |
187
- |--------|----------|--------|---------|
188
- | **GCF** | **100%** (13/13) | **11,090** | **79% fewer** |
189
- | TOON | 92.3% (12/13) | 16,378 | 69% fewer |
190
- | JSON | 76.9% (10/13) | 53,341 | baseline |
186
+ | | GCF | TOON | JSON |
187
+ |---|---|---|---|
188
+ | **Comprehension** (23 runs, 10 models) | **90.5%** | 68.5% | 53.6% |
189
+ | **Generation** (28 runs, 9 models) | **5/5** | 1.0/5 | 5.0/5 |
190
+ | **Input tokens** (500 symbols) | **11,090** | 16,378 | 53,341 |
191
+ | **Output tokens** (100 symbols) | **5,976** | 8,937 | 16,121 |
191
192
 
192
- GCF is the only format with perfect accuracy at scale, at 32% fewer tokens than TOON.
193
-
194
- Reproduce: `git clone https://github.com/blackwell-systems/gcf-go && cd gcf-go/eval && GOWORK=off go test -run TestComprehension -v -timeout 0`
195
-
196
- ## Token Efficiency (TOON's Own Benchmark)
197
-
198
- Running [TOON's benchmark harness](https://github.com/blackwell-systems/toon/tree/gcf-comparison) with GCF inserted (their datasets, their tokenizer):
199
-
200
- | Track | GCF | TOON | Result |
201
- |-------|-----|------|--------|
202
- | Mixed-structure (nested, semi-uniform) | 170,367 | 227,896 | **GCF 34% smaller** |
203
- | Flat-only (tabular) | 66,029 | 67,837 | **GCF 3% smaller** |
204
- | Semi-uniform event logs | 108,158 | 154,032 | **GCF 42% smaller** |
205
-
206
- GCF wins all 6 datasets. On semi-uniform data (the most common real-world pattern), GCF uses 42% fewer tokens than TOON.
207
-
208
- Reproduce: `git clone https://github.com/blackwell-systems/toon && cd toon && git checkout gcf-comparison && cd benchmarks && pnpm install && pnpm benchmark:tokens`
193
+ GCF wins all 6 datasets on [TOON's own benchmark](https://github.com/blackwell-systems/toon/tree/gcf-comparison). Full results: [gcformat.com/guide/benchmarks](https://gcformat.com/guide/benchmarks.html)
209
194
 
210
195
  ## Links
211
196
 
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
4
4
 
5
5
  [project]
6
6
  name = "gcf-python"
7
- version = "0.5.0"
7
+ version = "0.5.1"
8
8
  description = "Python implementation of GCF (Graph Compact Format): token-optimized wire format for LLM tool responses"
9
9
  readme = "README.md"
10
10
  license = {text = "MIT"}
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes
File without changes