PyPI - cat-stack - Versions diffs - 1.6.2__tar.gz → 1.6.4__tar.gz - Mend

cat-stack 1.6.2tar.gz → 1.6.4tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (44) hide show

{cat_stack-1.6.2 → cat_stack-1.6.4}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: cat-stack
-Version: 1.6.2
+Version: 1.6.4
 Summary: Domain-agnostic text, image, PDF, and DOCX classification engine powered by LLMs
 Project-URL: Documentation, https://github.com/chrissoria/cat-stack#readme
 Project-URL: Issues, https://github.com/chrissoria/cat-stack/issues
@@ -175,7 +175,16 @@ All providers use the same `(model_name, provider, api_key)` tuple format. Provi
 - **Automatic prompt optimization** (`prompt_tune`) — correct a small sample in a browser UI, and the system generates per-category instructions that improve accuracy
 - **Multi-model ensemble** with consensus voting and agreement scores
-- **Batch API support** for OpenAI, Anthropic, Google, Mistral, and xAI
+- **Batch API support** for OpenAI, Anthropic, Google, Mistral, and xAI.
+  *Caveat for Google (Gemini):* as of 2026-06, Google's batch
+  scheduler routinely leaves small jobs (under a few dozen rows) in
+  `BATCH_STATE_PENDING` for 30+ minutes — sometimes hours — before
+  it starts processing. Google's published SLA is up to 24h. If your
+  job is small and you want results back quickly, use `batch_mode=False`
+  for Gemini; reserve `batch_mode=True` for large jobs where the
+  50% cost discount matters more than wall-clock latency. Other
+  providers' batch APIs (OpenAI, Anthropic, xAI) typically complete
+  small jobs in 1-3 minutes
 - **Prompt strategies**: Chain-of-Thought, Chain-of-Verification, step-back prompting, few-shot examples
 - **Text, image, and PDF** input auto-detection (PDF inputs are
   validated against the `%PDF-` magic-byte header before reaching

{cat_stack-1.6.2 → cat_stack-1.6.4}/README.md RENAMED Viewed

@@ -139,7 +139,16 @@ All providers use the same `(model_name, provider, api_key)` tuple format. Provi
 - **Automatic prompt optimization** (`prompt_tune`) — correct a small sample in a browser UI, and the system generates per-category instructions that improve accuracy
 - **Multi-model ensemble** with consensus voting and agreement scores
-- **Batch API support** for OpenAI, Anthropic, Google, Mistral, and xAI
+- **Batch API support** for OpenAI, Anthropic, Google, Mistral, and xAI.
+  *Caveat for Google (Gemini):* as of 2026-06, Google's batch
+  scheduler routinely leaves small jobs (under a few dozen rows) in
+  `BATCH_STATE_PENDING` for 30+ minutes — sometimes hours — before
+  it starts processing. Google's published SLA is up to 24h. If your
+  job is small and you want results back quickly, use `batch_mode=False`
+  for Gemini; reserve `batch_mode=True` for large jobs where the
+  50% cost discount matters more than wall-clock latency. Other
+  providers' batch APIs (OpenAI, Anthropic, xAI) typically complete
+  small jobs in 1-3 minutes
 - **Prompt strategies**: Chain-of-Thought, Chain-of-Verification, step-back prompting, few-shot examples
 - **Text, image, and PDF** input auto-detection (PDF inputs are
   validated against the `%PDF-` magic-byte header before reaching

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/__about__.py RENAMED Viewed

@@ -1,7 +1,7 @@
 # SPDX-FileCopyrightText: 2025-present Christopher Soria <chrissoria@berkeley.edu>
 #
 # SPDX-License-Identifier: GPL-3.0-or-later
-__version__ = "1.6.2"
+__version__ = "1.6.4"
 __author__ = "Chris Soria"
 __email__ = "chrissoria@berkeley.edu"
 __title__ = "cat-stack"

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_batch.py RENAMED Viewed

@@ -213,12 +213,17 @@ def _build_jsonl_line(provider: str, custom_id: str, payload: dict, model: str)
             "body": payload,
         }
     elif provider == "xai":
-        # xAI requests are added one-by-one after batch creation; same OpenAI-compat format
+        # xAI batch API uses a tagged-union envelope: each request element
+        # has `batch_request_id` + `batch_request` (an object with one key
+        # naming the endpoint variant: `chat_get_completion`, `responses`,
+        # `image_generation`, etc.). For chat classification the variant is
+        # `chat_get_completion` and the payload inside it is the standard
+        # chat-completion body (model + messages + …).
         return {
-            "custom_id": custom_id,
-            "method": "POST",
-            "url": "/v1/chat/completions",
-            "body": payload,
+            "batch_request_id": custom_id,
+            "batch_request": {
+                "chat_get_completion": payload,
+            },
         }
     raise ValueError(f"Unsupported batch provider: {provider}")
@@ -369,16 +374,25 @@ def _create_batch_job(
         return resp.json()["id"]
     elif provider == "xai":
-        # Step 1: Create empty batch
+        # Step 1: Create empty batch. xAI requires a `name` field on create;
+        # the older `completion_window` field was removed. Response key is
+        # `batch_id`, not `id`.
+        import time as _time
         url = BATCH_ENDPOINTS["xai"]["create"]
-        body = {"completion_window": "24h"}
+        body = {"name": f"catstack-{_time.strftime('%Y%m%d-%H%M%S')}"}
         resp = requests.post(url, headers=headers, json=body, timeout=60)
         resp.raise_for_status()
-        job_id = resp.json()["id"]
+        job_id = resp.json()["batch_id"]
-        # Step 2: Add all requests to the batch
+        # Step 2: Add all requests to the batch. xAI wraps the list under a
+        # `batch_requests` key; each element is the tagged-union envelope
+        # built in `_build_jsonl_line`.
         add_url = BATCH_ENDPOINTS["xai"]["add"].format(job_id=job_id)
-        add_resp = requests.post(add_url, headers=headers, json=requests_list, timeout=120)
+        add_resp = requests.post(
+            add_url, headers=headers,
+            json={"batch_requests": requests_list},
+            timeout=120,
+        )
         add_resp.raise_for_status()
         return job_id
@@ -479,11 +493,30 @@ def _poll_batch_job(
                 f"total={status_data.get('total_requests', '?')}"
             )
         elif provider == "xai":
-            state = status_data.get("status", "")
-            counts = status_data.get("request_counts", {})
+            # xAI returns a `state` *object* with num_* counters, not a
+            # top-level state string. Synthesize a state string compatible
+            # with the existing terminal/success-set logic:
+            #   num_pending > 0                                   → "running"
+            #   num_pending == 0, all errored/cancelled, no success → "failed"/"cancelled"
+            #   num_pending == 0, at least one success            → "completed"
+            state_obj = status_data.get("state", {})
+            num_pending = state_obj.get("num_pending", 1)
+            num_success = state_obj.get("num_success", 0)
+            num_error = state_obj.get("num_error", 0)
+            num_cancelled = state_obj.get("num_cancelled", 0)
+            if num_pending > 0:
+                state = "running"
+            elif num_success > 0:
+                state = "completed"
+            elif num_cancelled > 0 and num_error == 0:
+                state = "cancelled"
+            elif num_error > 0:
+                state = "failed"
+            else:
+                state = "completed"  # all zeros — empty batch
             progress_str = (
-                f"completed={counts.get('completed', '?')} "
-                f"failed={counts.get('failed', '?')}"
+                f"completed={num_success} failed={num_error} "
+                f"pending={num_pending} cancelled={num_cancelled}"
             )
         else:
             state = ""
@@ -590,12 +623,26 @@ def _download_batch_results(
         return resp.text
     elif provider == "xai":
+        # xAI's results endpoint returns paginated JSON ({results: [...],
+        # pagination_token: <str or null>}) rather than streaming JSONL.
+        # Walk all pages, concatenate the result objects, then re-serialize
+        # as JSONL so the existing line-by-line parser in
+        # `_parse_batch_results` can consume them unchanged.
         url = BATCH_ENDPOINTS["xai"]["results"].format(job_id=job_id)
         headers_dl = dict(headers)
         headers_dl.pop("Content-Type", None)
-        resp = requests.get(url, headers=headers_dl, timeout=120)
-        resp.raise_for_status()
-        return resp.text
+        all_results = []
+        pagination_token = None
+        while True:
+            params = {"pagination_token": pagination_token} if pagination_token else None
+            resp = requests.get(url, headers=headers_dl, params=params, timeout=120)
+            resp.raise_for_status()
+            data = resp.json()
+            all_results.extend(data.get("results", []) or [])
+            pagination_token = data.get("pagination_token")
+            if not pagination_token:
+                break
+        return "\n".join(json.dumps(r) for r in all_results)
     raise ValueError(f"Unsupported batch provider: {provider}")
@@ -689,9 +736,16 @@ def _parse_batch_results(
             raw_text = client._parse_response(response_body)
         elif provider == "xai":
-            custom_id = data.get("custom_id")
-            response_body = data.get("response", {}).get("body")
-            error_val = data.get("response", {}).get("error")
+            # xAI result envelope:
+            #   { batch_request_id, batch_result: { response: { chat_get_completion: {…} } } }
+            # `chat_get_completion` is the OpenAI-style chat-completion body
+            # that client._parse_response() already handles. Failure case has
+            # `error_message` at the top level.
+            custom_id = data.get("batch_request_id")
+            error_val = data.get("error_message")
+            batch_result = data.get("batch_result", {}) or {}
+            response_obj = batch_result.get("response", {}) or {}
+            response_body = response_obj.get("chat_get_completion")
             if error_val or response_body is None:
                 error_msg = str(error_val) if error_val else "No response body"
                 idx = custom_id_map.get(custom_id)

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_providers.py RENAMED Viewed

@@ -855,7 +855,9 @@ class UnifiedLLMClient:
                         wait_time = _backoff_with_jitter(initial_delay, attempt, multiplier=5.0)
                     elapsed = time.monotonic() - start
                     if attempt < max_retries - 1 and elapsed + wait_time <= _MAX_TOTAL_WAIT_SECONDS:
-                        print(f"Rate limited. Waiting {wait_time:.1f}s...")
+                        # Name the throttling provider/model so multi-model
+                        # ensemble runs can attribute the slowdown.
+                        print(f"[{self.provider}/{self.model}] Rate limited. Waiting {wait_time:.1f}s...")
                         time.sleep(wait_time)
                         continue
                     else:
@@ -893,7 +895,9 @@ class UnifiedLLMClient:
                         wait_time = _backoff_with_jitter(initial_delay, attempt)
                     elapsed = time.monotonic() - start
                     if attempt < max_retries - 1 and elapsed + wait_time <= _MAX_TOTAL_WAIT_SECONDS:
-                        print(f"Server error {response.status_code}. Retrying in {wait_time:.1f}s...")
+                        # Name the failing provider/model — same rationale as
+                        # the 429 handler above.
+                        print(f"[{self.provider}/{self.model}] Server error {response.status_code}. Retrying in {wait_time:.1f}s...")
                         time.sleep(wait_time)
                         continue
                     else:

{cat_stack-1.6.2 → cat_stack-1.6.4}/.gitignore RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/LICENSE RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/pyproject.toml RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/cat_stack/__init__.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/__init__.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_category_analysis.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_chunked.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_embeddings.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_formatter.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_pilot_test.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_prompts.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_review_ui.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_tiebreaker.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_utils.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_web_fetch.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/_wrapper_helpers.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/CoVe.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/__init__.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/image_CoVe.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/image_stepback.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/pdf_CoVe.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/pdf_stepback.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/stepback.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/calls/top_n.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/classify.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/explore.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/extract.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/image_functions.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/images/circle.png RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/images/cube.png RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/images/diamond.png RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/images/overlapping_pentagons.png RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/images/rectangles.png RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/model_reference_list.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/pdf_functions.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/prompt_tune.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/summarize.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/text_functions.py RENAMED Viewed

File without changes

{cat_stack-1.6.2 → cat_stack-1.6.4}/src/catstack/text_functions_ensemble.py RENAMED Viewed

File without changes

cat-stack 1.6.2__tar.gz → 1.6.4__tar.gz

cat-stack 1.6.2tar.gz → 1.6.4tar.gz