@blackwell-systems/gcf 0.6.0 → 0.6.1
- package/README.md +10 -25
- package/package.json +1 -1
package/README.md
CHANGED
@@ -8,7 +8,7 @@
 
 TypeScript implementation of [GCF](https://gcformat.com/) — the most token-efficient wire format for LLMs. A drop-in alternative to JSON and TOON for any structured data.
 
-**79% fewer input tokens than JSON.
+**79% fewer input tokens than JSON. 63% fewer output tokens. 90.5% average comprehension accuracy across 10 models and 3 providers (four models hit 100%). 1,300+ LLM evaluations. Zero training.**
 
 Docs: [gcformat.com](https://gcformat.com/) · [Playground](https://gcformat.com/playground.html) · [GCF vs TOON](https://gcformat.com/guide/vs-toon.html)
 
@@ -190,33 +190,18 @@ Works on objects, arrays, and primitives. Arrays of uniform objects get tabular
 | `Session` | Tracker for multi-call deduplication |
 | `KIND_ABBREV` / `KIND_EXPAND` | Bidirectional kind abbreviation maps |
 
-##
+## Benchmarks
 
-
+1,300+ LLM evaluations across 10 models, 3 providers, and 51 independent test runs.
 
-|
-
-| **
-|
-|
+| | GCF | TOON | JSON |
+|---|---|---|---|
+| **Comprehension** (23 runs, 10 models) | **90.5%** | 68.5% | 53.6% |
+| **Generation** (28 runs, 9 models) | **5/5** | 1.0/5 | 5.0/5 |
+| **Input tokens** (500 symbols) | **11,090** | 16,378 | 53,341 |
+| **Output tokens** (100 symbols) | **5,976** | 8,937 | 16,121 |
 
-GCF
-
-Reproduce: `git clone https://github.com/blackwell-systems/gcf-go && cd gcf-go/eval && GOWORK=off go test -run TestComprehension -v -timeout 0`
-
-## Token Efficiency (TOON's Own Benchmark)
-
-Running [TOON's benchmark harness](https://github.com/blackwell-systems/toon/tree/gcf-comparison) with GCF inserted (their datasets, their tokenizer):
-
-| Track | GCF | TOON | Result |
-|-------|-----|------|--------|
-| Mixed-structure (nested, semi-uniform) | 170,367 | 227,896 | **GCF 34% smaller** |
-| Flat-only (tabular) | 66,029 | 67,837 | **GCF 3% smaller** |
-| Semi-uniform event logs | 108,158 | 154,032 | **GCF 42% smaller** |
-
-GCF wins all 6 datasets. On semi-uniform data (the most common real-world pattern), GCF uses 42% fewer tokens than TOON.
-
-Reproduce: `git clone https://github.com/blackwell-systems/toon && cd toon && git checkout gcf-comparison && cd benchmarks && pnpm install && pnpm benchmark:tokens`
+GCF wins all 6 datasets on [TOON's own benchmark](https://github.com/blackwell-systems/toon/tree/gcf-comparison). Full results: [gcformat.com/guide/benchmarks](https://gcformat.com/guide/benchmarks.html)
 
 ## Links
 
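The headline percentages in the README can be checked against the token counts in the added benchmark table. The sketch below assumes "N% fewer tokens" means the relative reduction `1 - candidate / baseline`. The helper `percentFewer` is a hypothetical name used for illustration only and is not part of the package's API.

```typescript
// Hypothetical helper (not from @blackwell-systems/gcf): percentage by which
// `candidate` is smaller than `baseline`, rounded to one decimal place.
function percentFewer(candidate: number, baseline: number): number {
  return Math.round((1 - candidate / baseline) * 1000) / 10;
}

// Token counts copied from the benchmark table in the 0.6.1 README.
const inputVsJson = percentFewer(11090, 53341); // GCF vs JSON, input tokens
const outputVsJson = percentFewer(5976, 16121); // GCF vs JSON, output tokens

// Token counts copied from the TOON-harness table removed in this version.
const mixedVsToon = percentFewer(170367, 227896); // mixed-structure track
const eventsVsToon = percentFewer(108158, 154032); // semi-uniform event logs

console.log({ inputVsJson, outputVsJson, mixedVsToon, eventsVsToon });
```

Under that assumption the JSON comparisons come out to 79.2% and 62.9%, which agrees with the "79% fewer input tokens" and "63% fewer output tokens" claims. The same formula gives about 25% and 30% for the two TOON rows, so the "34% smaller" and "42% smaller" figures in the removed table appear to express TOON's size relative to GCF's (for example 227,896 / 170,367 ≈ 1.34) and not a reduction relative to TOON.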
package/package.json
CHANGED