PyPI - llguidance - Versions diffs - 0.7.25__tar.gz → 0.7.27__tar.gz - Mend

llguidance 0.7.25tar.gz → 0.7.27tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (182) hide show

{llguidance-0.7.25 → llguidance-0.7.27}/CHANGELOG.md RENAMED Viewed

@@ -4,6 +4,18 @@ All notable changes to this project will be documented in this file. Dates are d
 If a release doesn't introduce any interesting changes (build fixes etc.), it's skipped.
+#### [0.7.27](https://github.com/guidance-ai/llguidance/compare/v0.7.26...0.7.27) 2025-06-04
+- add toktrie_tiktoken and llguidance.tiktoken.lltokenizer_from_encoding [`#154`](https://github.com/guidance-ai/llguidance/issues/154)
+- implement clone on StopController [`#185`](https://github.com/guidance-ai/llguidance/issues/185)
+#### [0.7.26](https://github.com/guidance-ai/llguidance/compare/v0.7.25...0.7.26) 2025-05-30
+- add support for & and ~ in lark regexes [`96fcee3`](https://github.com/guidance-ai/llguidance/commit/96fcee373697b57bead94d1bc06c17cf1c6134e4)
+- dump grammar in errors in LLInterpreter [`#183`](https://github.com/guidance-ai/llguidance/pull/183)
+- don't check lexer bytes invariant when we cannot rollback [`ec22083`](https://github.com/guidance-ai/llguidance/commit/ec220837051513a70177974ca389b7bf387455f1)
 #### [0.7.25](https://github.com/guidance-ai/llguidance/compare/v0.7.24...0.7.25) 2025-05-28
 - add parse_special=False to tokenize_str/bytes() in python [`#181`](https://github.com/guidance-ai/llguidance/pull/181)

{llguidance-0.7.25 → llguidance-0.7.27}/Cargo.lock RENAMED Viewed

@@ -135,15 +135,30 @@ version = "0.22.1"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "72b3254f16251a8381aa12e40e3c4d2f0199f8c6508fbecb9d91f575e0fbb8c6"
+[[package]]
+name = "bit-set"
+version = "0.5.3"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "0700ddab506f33b20a03b13996eccd309a48e5ff77d0d95926aa0210fb4e95f1"
+dependencies = [
+ "bit-vec 0.6.3",
+]
 [[package]]
 name = "bit-set"
 version = "0.8.0"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "08807e080ed7f9d5433fa9b275196cfc35414f66a0c79d864dc51a0d825231a3"
 dependencies = [
- "bit-vec",
+ "bit-vec 0.8.0",
 ]
+[[package]]
+name = "bit-vec"
+version = "0.6.3"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "349f9b6a179ed607305526ca489b34ad0a41aed5f7980fa90eb03160b69598fb"
 [[package]]
 name = "bit-vec"
 version = "0.8.0"
@@ -162,6 +177,17 @@ version = "0.2.2"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "3eeab4423108c5d7c744f4d234de88d18d636100093ae04caf4825134b9c3a32"
+[[package]]
+name = "bstr"
+version = "1.12.0"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "234113d19d0d7d613b40e86fb654acf958910802bcceab913a4f9e7cda03b1a4"
+dependencies = [
+ "memchr",
+ "regex-automata",
+ "serde",
+]
 [[package]]
 name = "bumpalo"
 version = "3.17.0"
@@ -492,13 +518,24 @@ version = "0.1.10"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "d817e038c30374a4bcb22f94d0a8a0e216958d4c3dcde369b1439fec4bdda6e6"
+[[package]]
+name = "fancy-regex"
+version = "0.13.0"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "531e46835a22af56d1e3b66f04844bed63158bc094a628bec1d321d9b4c44bf2"
+dependencies = [
+ "bit-set 0.5.3",
+ "regex-automata",
+ "regex-syntax",
+]
 [[package]]
 name = "fancy-regex"
 version = "0.14.0"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "6e24cb5a94bcae1e5408b0effca5cd7172ea3c5755049c5f3af4cd283a165298"
 dependencies = [
- "bit-set",
+ "bit-set 0.8.0",
  "regex-automata",
  "regex-syntax",
 ]
@@ -1123,7 +1160,7 @@ dependencies = [
  "base64 0.22.1",
  "bytecount",
  "email_address",
- "fancy-regex",
+ "fancy-regex 0.14.0",
  "fraction",
  "idna",
  "itoa",
@@ -1174,7 +1211,7 @@ checksum = "23fb14cb19457329c82206317a5663005a4d404783dc74f4252769b0d5f42856"
 [[package]]
 name = "llguidance"
-version = "0.7.25"
+version = "0.7.27"
 dependencies = [
  "anyhow",
  "derivre",
@@ -1193,7 +1230,7 @@ dependencies = [
 [[package]]
 name = "llguidance_py"
-version = "0.7.25"
+version = "0.7.27"
 dependencies = [
  "anyhow",
  "bytemuck",
@@ -1203,6 +1240,7 @@ dependencies = [
  "serde",
  "serde_json",
  "toktrie_hf_tokenizers",
+ "toktrie_tiktoken",
 ]
 [[package]]
@@ -1865,6 +1903,12 @@ version = "0.1.24"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "719b953e2095829ee67db738b3bfa9fa368c94900df327b3f07fe6e794d2fe1f"
+[[package]]
+name = "rustc-hash"
+version = "1.1.0"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "08d43f7aa6b08d49f382cde6a7982047c3426db949b1424bc4b7ec9ae12c6ce2"
 [[package]]
 name = "rustix"
 version = "1.0.5"
@@ -2233,6 +2277,21 @@ dependencies = [
  "syn",
 ]
+[[package]]
+name = "tiktoken-rs"
+version = "0.7.0"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "25563eeba904d770acf527e8b370fe9a5547bacd20ff84a0b6c3bc41288e5625"
+dependencies = [
+ "anyhow",
+ "base64 0.22.1",
+ "bstr",
+ "fancy-regex 0.13.0",
+ "lazy_static",
+ "regex",
+ "rustc-hash",
+]
 [[package]]
 name = "tinystr"
 version = "0.7.6"
@@ -2252,7 +2311,7 @@ dependencies = [
  "aho-corasick",
  "derive_builder",
  "esaxx-rs",
- "fancy-regex",
+ "fancy-regex 0.14.0",
  "getrandom 0.2.15",
  "itertools 0.13.0",
  "lazy_static",
@@ -2336,7 +2395,7 @@ dependencies = [
 [[package]]
 name = "toktrie"
-version = "0.7.25"
+version = "0.7.27"
 dependencies = [
  "anyhow",
  "bytemuck",
@@ -2347,7 +2406,7 @@ dependencies = [
 [[package]]
 name = "toktrie_hf_downloader"
-version = "0.7.25"
+version = "0.7.27"
 dependencies = [
  "anyhow",
  "hf-hub",
@@ -2358,7 +2417,7 @@ dependencies = [
 [[package]]
 name = "toktrie_hf_tokenizers"
-version = "0.7.25"
+version = "0.7.27"
 dependencies = [
  "anyhow",
  "log",
@@ -2368,6 +2427,18 @@ dependencies = [
  "toktrie",
 ]
+[[package]]
+name = "toktrie_tiktoken"
+version = "0.7.27"
+dependencies = [
+ "anyhow",
+ "log",
+ "serde",
+ "serde_json",
+ "tiktoken-rs",
+ "toktrie",
+]
 [[package]]
 name = "tower"
 version = "0.5.2"

{llguidance-0.7.25 → llguidance-0.7.27}/Cargo.toml RENAMED Viewed

@@ -7,6 +7,7 @@ members = [
     "toktrie",
     "toktrie_hf_tokenizers",
     "toktrie_hf_downloader",
+    "toktrie_tiktoken",
 ]
 # just exclude python_ext since it doesn't build without maturin
 default-members = [
@@ -16,6 +17,7 @@ default-members = [
     "toktrie",
     "toktrie_hf_tokenizers",
     "toktrie_hf_downloader",
+    "toktrie_tiktoken",
 ]
 resolver = "2"
@@ -36,4 +38,5 @@ opt-level = 3
 toktrie = { path = "toktrie" }
 llguidance = { path = "parser" }
 toktrie_hf_tokenizers = { path = "toktrie_hf_tokenizers" }
-toktrie_hf_downloader = { path = "toktrie_hf_downloader" }
+toktrie_hf_downloader = { path = "toktrie_hf_downloader" }
+toktrie_tiktoken = { path = "toktrie_tiktoken" }

{llguidance-0.7.25 → llguidance-0.7.27}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: llguidance
-Version: 0.7.25
+Version: 0.7.27
 License-File: LICENSE
 Summary: Bindings for the Low-level Guidance (llguidance) Rust library for use within Guidance
 Author: Michal Moskal
@@ -72,9 +72,7 @@ The library is currently integrated in:
 - **vLLM** - [V0 PR](https://github.com/vllm-project/vllm/pull/14589) and [V1 PR](https://github.com/vllm-project/vllm/pull/14779)
 - [LLGTRT](https://github.com/guidance-ai/llgtrt) - OpenAI-compatible REST server using NVIDIA's [TensorRT-LLM](https://github.com/NVIDIA/TensorRT-LLM)
 - [mistral.rs](https://github.com/EricLBuehler/mistral.rs/pull/899)
-The integration is ongoing in:
-- **onnxruntime-genai** - [draft PR](https://github.com/microsoft/onnxruntime-genai/pull/1038)
+- [onnxruntime-genai](https://github.com/microsoft/onnxruntime-genai/pull/1381)
 ## Technical details

{llguidance-0.7.25 → llguidance-0.7.27}/README.md RENAMED Viewed

@@ -60,9 +60,7 @@ The library is currently integrated in:
 - **vLLM** - [V0 PR](https://github.com/vllm-project/vllm/pull/14589) and [V1 PR](https://github.com/vllm-project/vllm/pull/14779)
 - [LLGTRT](https://github.com/guidance-ai/llgtrt) - OpenAI-compatible REST server using NVIDIA's [TensorRT-LLM](https://github.com/NVIDIA/TensorRT-LLM)
 - [mistral.rs](https://github.com/EricLBuehler/mistral.rs/pull/899)
-The integration is ongoing in:
-- **onnxruntime-genai** - [draft PR](https://github.com/microsoft/onnxruntime-genai/pull/1038)
+- [onnxruntime-genai](https://github.com/microsoft/onnxruntime-genai/pull/1381)
 ## Technical details

{llguidance-0.7.25 → llguidance-0.7.27}/docs/optimizations.md RENAMED Viewed

@@ -91,7 +91,6 @@ Walking the trie mostly involves successful lookups in that table,
 and the derivative engine is only used when the table doesn't yet have the
 given transition.
 ## Earley parser optimizations
 - CFG rules are stored in a flat array
@@ -123,7 +122,7 @@ We thus define a series _slices_, under-approximation of such unconstrained cont
 The slices are defined by regular expressions typically of the form `[...]{1,N}`
 (that is a character class repeated up to `N` times).
-For example, a good set of slices for JSON schemas is
+For example, a good set of slices for JSON schemas is
 - `[^"\\\x00-\x1F\x7F]{1,10}` (`turtle`, ` turtle`, `)!;`, `żółw`, `🐢`, etc.)
 - `[^"\\\x00-\x1F\x7F]{1,30}` (`/////////////////`, ...)
@@ -169,6 +168,7 @@ Now, the JSON slice is contained in `C*"`,
 and thus we can skip walking the trie for the slice.
 Another example:
 - assume schemas has `{ "type": "string", "maxLength": 20 }`
 - so after initial quote, the lexer allows `C{0,20}"`
 - the JSON slice `[^"\\\x00-\x1F\x7F]{1,10}` is contained in this lexeme,
@@ -176,17 +176,19 @@ Another example:
 This optimization make the mask computation about 10x faster in [MaskBench](https://github.com/guidance-ai/jsonschemabench/tree/main/maskbench).
+### Mask density statistics
 The reason the optimization works, is that masks tend be either small or sliceable.
 Here are statistics of various kinds of masks, across around 2M masks in MaskBench,
 categorized based on how "full" the mask is and whether the slicer optimization was applied.
-| Category            |  % Masks |    % Time | Time/Mask [us] |
-|---------------------|---------:|----------:|---------------:|
-| 0%-2% & !sliced     |    44.6% |     20.7% |             28 |
-| 2%-85% & !sliced    |     1.1% |     11.0% |            576 |
-| 85%+ & !sliced      |     0.5% |     13.0% |           1577 |
-| 85%+ & sliced       |    53.8% |     55.0% |             61 |
-| **Total**           |   100.0% |    100.0% |             60 |
+| Category         | % Masks | % Time | Time/Mask [us] |
+| ---------------- | ------: | -----: | -------------: |
+| 0%-2% & !sliced  |   44.6% |  20.7% |             28 |
+| 2%-85% & !sliced |    1.1% |  11.0% |            576 |
+| 85%+ & !sliced   |    0.5% |  13.0% |           1577 |
+| 85%+ & sliced    |   53.8% |  55.0% |             61 |
+| **Total**        |  100.0% | 100.0% |             60 |
 ![Plot of the table above](mask_plot.png)
@@ -195,43 +197,43 @@ and in a little over half the slicer optimization can be applied
 (there are no masks under 85% full where the slicer can be applied).
 The remaining sliver of masks are either intermediate size or large, but the slicer optimization can't be applied; they take disproportionately long time to compute.
 ### Checking regex containment
-This is an under-approximation of the containment problem,
-that is it may return false when the containment is actually true.
-If any of the "checks" fail, we return false.
+This is an under-approximation of the containment problem, that is it may return
+false when the containment is actually true. If any of the "checks" fail, we
+return false.
 Prefixes of language `R`, are defined as `P(R) = { w | ∃q. wq ∈ R }`.
-We need to check if regex `S` (slice) is contained in prefix of regex `L` (lexeme): `S ⊆ P(L)`.
+We need to check if regex `S` (slice) is contained in prefix of regex `L`
+(lexeme): `S ⊆ P(L)`.
-We check if `L` is of the form `(X{m,n} & ~E) T`, where
-`E` is of the form `E0 | E1 | ... | Ek`,
-and both `E` can be `∅` (empty-set/no match) and `T` can be `ε` (empty string).
+We check if `L` is of the form `(X{m,n} & ~E) T`, where `E` is of the form
+`E0 | E1 | ... | Ek`. Note that: `E` can be `∅` (empty-set/no match) and `T` can
+be `ε` (empty string).
-Observe that `P(R) ⊆ P(RT)`, ie. making regex longer doesn't remove any prefixes (provided `T ≠ ∅`).
-Thus, we'll be checking containment in `P(X{m,n} & ~E)`.
+Observe that `P(R) ⊆ P(RT)`, ie. making regex longer doesn't remove any prefixes
+(provided `T ≠ ∅`). Thus, we'll be checking containment in `P(X{m,n} & ~E)`.
-We (over)estimate maximum length of `E`, let `o >= max { |w| | w ∈ E }`.
-We check that `n > o`, and that `∃v ≠ ε. v ∈ X`.
-In other words, we check that for anything matching `Ei` and `X{m,n}` there is a proper extension of that string in `X{m,n}`.
+We (over)estimate maximum length of `E`, let `o >= max { |w| | w ∈ E }`. We
+check that `n > o`, and that `∃v ≠ ε. v ∈ X`. In other words, we check that for
+anything matching `Ei` and `X{m,n}` there is a proper extension of that string
+in `X{m,n}`.
 Now, we prove that `P(X{m,n} & ~E) = P(X{m,n})`.
-Consider `w ∈ P(X{m,n})`. We have `wq ∈ X{m,n}` for some `q`.
-If `|wq| > o`, then `wq ∉ E`, and thus `wq ∈ X{m,n} & ~E`.
-Otherwise, `wq ∈ X{p}` for some `p <= o < n`,
-and thus `wqv...v ∈ X{n}` for `n-p` repetitions of `v`.
-We also have `|wqv...v| > o`, and thus `wqv...v ∉ E`,
-and thus `wqv...v ∈ X{m,n} & ~E`,
-and thus `w ∈ P(X{m,n} & ~E)`.
-The other direction is trivial.
+Consider `w ∈ P(X{m,n})`. We have `wq ∈ X{m,n}` for some `q`. If `|wq| > o`,
+then `wq ∉ E`, and thus `wq ∈ X{m,n} & ~E`. Otherwise, `wq ∈ X{p}` for some
+`p <= o < n`, and thus `wqv...v ∈ X{n}` for `n-p` repetitions of `v`. We also
+have `|wqv...v| > o`, and thus `wqv...v ∉ E`, and thus `wqv...v ∈ X{m,n} & ~E`,
+and thus `w ∈ P(X{m,n} & ~E)`. The other direction is trivial.
 Now, we just need to check if `S ⊆ P(X{m,n})`.
-First, we check if `S` is of the form `Y{m',n'}`.
-Then, we check if `Y` is contained in `X` (this is a cached check using symbolic derivatives; it's typically simple).
-Finally, we check if `n' <= n`.
-Note that we don't care about `m` and `m'`, as we're checking for prefixes.
+First, we check if `S` is of the form `Y{m',n'}`. Then, we check if `Y` is
+contained in `X` (this is a cached check using symbolic derivatives; it's
+typically simple). Finally, we check if `n' <= n`. Note that we don't care about
+`m` and `m'`, as we're checking for prefixes.
 Also note that the upper-bound in the above calculations can be infinity.

{llguidance-0.7.25 → llguidance-0.7.27}/docs/syntax.md RENAMED Viewed

@@ -217,6 +217,28 @@ like `<|python_tag|>`, not a string like `<function`.
 The `llguidance.StructTag` API, [inspired](https://github.com/mlc-ai/xgrammar/blob/fd9ee31/python/xgrammar/grammar.py#L211) by XGrammar, just compiles down to the above.
+### And/Not operators in regexes
+The regular expressions in LLGuidance can use additional operators: `&` (and) and `~` (not).
+They can only be used outside of the `/.../` syntax, i.e., in the Lark terminal (token) definitions.
+The `&` operator binds tighter than `|` (alternation), so `A & B | C` means `(A & B) | C`.
+The `~` operator binds tighter than even `+` or `*`, so `~A+` means `(~A)+`.
+The negation operator `~` is particularly tricky to use right.
+For example, this is a terminal definition that matches any list of ASCII lines,
+but they cannot have two newlines in a row:
+```lark
+ASCII_LINES: /[a-zA-Z \n]*/ & ~/(?s:.*)\n\n(?s:.*)/
+```
+Note that `/[a-zA-Z \n]*/ & ~/\n\n/` would mean any list of lines, also with two newlines in a row,
+except for the exact string `\n\n`.
+Also, `/[a-zA-Z \n]*/ & ~/(.*)\n\n(.*)/` would allow double newlines, but if there is at least two of them
+(`/./` doesn't match newline).
+These operators are sometimes expensive to use, so you should generally avoid them if alternatives exist.
 ### Structured %regex
 LLGuidance supports [extended regex syntax](https://docs.rs/regex/latest/regex/#syntax) in `/.../`.
@@ -273,12 +295,6 @@ MULT_NUM: %regex {
 }
 ```
-We also plan to add `&` and `~` operators:
-```lark
-ASCII_LINES: /[a-zA-Z \n]*/ & ~/.*\n\n.*/
-```
 ### Grammar options
 Certain grammar options can be set by using `%llguidnace { ... }`,

{llguidance-0.7.25 → llguidance-0.7.27}/parser/Cargo.toml RENAMED Viewed

@@ -1,6 +1,6 @@
 [package]
 name = "llguidance"
-version = "0.7.25"
+version = "0.7.27"
 edition = "2021"
 license = "MIT"
 description = "Super-fast Structured Outputs"

{llguidance-0.7.25 → llguidance-0.7.27}/parser/llguidance.h RENAMED Viewed

@@ -433,6 +433,13 @@ const char *llg_stop_commit_token(struct LlgStopController *stop_ctrl,
                                   size_t *output_len_p,
                                   bool *is_stopped_p);
+/**
+ * Clone the stop-sequence controller.
+ * The cloned controller shares (under mutex) regex caches if any, so that
+ * cloning is cheap.
+ */
+struct LlgStopController *llg_clone_stop_controller(const struct LlgStopController *stop_ctrl);
 /**
  * Free the stop-sequence controller
  */

llguidance 0.7.25__tar.gz → 0.7.27__tar.gz

llguidance 0.7.25tar.gz → 0.7.27tar.gz