errorsense-0.1.0.tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,11 @@
+ __pycache__/
+ *.egg-info/
+ .venv/
+ .pytest_cache/
+ dist/
+ build/
+ *.pyc
+ .DS_Store
+ .claude
+ relay_preset/
+ .env
@@ -0,0 +1,21 @@
+ MIT License
+
+ Copyright (c) 2026 OpenGPU
+
+ Permission is hereby granted, free of charge, to any person obtaining a copy
+ of this software and associated documentation files (the "Software"), to deal
+ in the Software without restriction, including without limitation the rights
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+ copies of the Software, and to permit persons to whom the Software is
+ furnished to do so, subject to the following conditions:
+
+ The above copyright notice and this permission notice shall be included in all
+ copies or substantial portions of the Software.
+
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+ SOFTWARE.
@@ -0,0 +1,213 @@
+ Metadata-Version: 2.4
+ Name: errorsense
+ Version: 0.1.0
+ Summary: Error classification engine. Rules for the obvious, AI for the ambiguous.
+ Project-URL: Homepage, https://github.com/opengpu/errorsense
+ Project-URL: Documentation, https://github.com/opengpu/errorsense#readme
+ Author-email: Can Atılgan <can@opengpu.network>
+ License-Expression: MIT
+ License-File: LICENSE
+ Keywords: circuit-breaker,error-classification,llm,observability
+ Classifier: Development Status :: 3 - Alpha
+ Classifier: Intended Audience :: Developers
+ Classifier: License :: OSI Approved :: MIT License
+ Classifier: Programming Language :: Python :: 3
+ Classifier: Programming Language :: Python :: 3.10
+ Classifier: Programming Language :: Python :: 3.11
+ Classifier: Programming Language :: Python :: 3.12
+ Classifier: Programming Language :: Python :: 3.13
+ Classifier: Topic :: Software Development :: Libraries
+ Classifier: Topic :: System :: Monitoring
+ Requires-Python: >=3.10
+ Provides-Extra: dev
+ Requires-Dist: pytest-asyncio>=0.21; extra == 'dev'
+ Requires-Dist: pytest>=7.0; extra == 'dev'
+ Provides-Extra: llm
+ Requires-Dist: httpx>=0.25; extra == 'llm'
+ Description-Content-Type: text/markdown
+
+ # ErrorSense
+
+ Error classification engine. Rules for the obvious, LLM for the ambiguous.
+
+ Most errors are easy to classify — a 400 is a client error, a 502 is a server error. But some aren't — a 500 with "model not found" in the body is actually a client error, not a server failure. Your rules can't catch every edge case. An LLM can.
+
+ ErrorSense runs errors through a phase pipeline: fast deterministic rulesets first, LLM only when rulesets can't decide. Most errors never hit the LLM. The ones that do get classified correctly instead of falling through as "unknown."
+
+ **Use it for:** circuit breakers, alert routing, retry logic, error dashboards; anywhere you need to know *what kind* of error happened, not just *that* it happened.
+
+ ## Install
+
+ ```bash
+ pip install errorsense        # core only (zero dependencies)
+ pip install errorsense[llm]   # + LLM classification
+ ```
45
+
46
+ ## Quick Start — Use a Preset
47
+
48
+ ```python
49
+ from errorsense.presets import http
50
+ from errorsense import LLMConfig, Signal
51
+
52
+ sense = http(llm=LLMConfig(api_key="your_api_key"))
53
+
54
+ results = sense.classify(Signal.from_http(status_code=400, body="bad request"))
55
+ results[0].label # "client"
56
+
57
+ results = sense.classify(Signal.from_http(status_code=502))
58
+ results[0].label # "server"
59
+
60
+ results = sense.classify(Signal.from_http(status_code=500, body="model not found"))
61
+ results[0].label # "client" (LLM figured it out)
62
+ ```
63
+
64
+ The `http` preset gives you a 3-phase pipeline (rules → patterns → LLM) with 3 categories: `"client"`, `"server"`, `"undecided"`. Rulesets handle obvious cases instantly. LLM handles the ambiguous ones.
65
+
66
+ Don't want LLM? Use `http_no_llm()` — rulesets only, ambiguous errors come back as `"undecided"`.
67
+
+ ## Build Your Own Pipeline
+
+ A pipeline is a list of phases. Each phase has rulesets (deterministic) or skills (LLM). You can mix both, use only rulesets, or use only skills.
+
+ ```python
+ from errorsense import ErrorSense, Phase, Ruleset, Skill, LLMConfig, Signal
+
+ # Rulesets + LLM
+ sense = ErrorSense(
+     categories=["transient", "permanent", "user"],
+     pipeline=[
+         Phase("codes", rulesets=[
+             Ruleset(field="error_code", match={
+                 "ECONNRESET": "transient", "ETIMEOUT": "transient", "EPERM": "permanent",
+             }),
+         ]),
+         Phase("patterns", rulesets=[
+             Ruleset(field="message", patterns=[
+                 ("transient", [r"timeout", r"connection reset", r"retry"]),
+                 ("permanent", [r"corruption", r"fatal"]),
+             ]),
+         ]),
+         Phase("llm", skills=[
+             Skill("my_classifier", path="./skills/my_classifier.md"),
+         ], llm=LLMConfig(api_key="your_key")),
+     ],
+     default="transient",
+ )
+
+ # Rulesets only — no LLM needed
+ sense = ErrorSense(
+     categories=["client", "server"],
+     pipeline=[
+         Phase("rules", rulesets=[
+             Ruleset(field="status_code", match={"4xx": "client", 502: "server"}),
+         ]),
+     ],
+     default="server",
+ )
+
+ # LLM only — skip rulesets entirely
+ sense = ErrorSense(
+     categories=["client", "server"],
+     pipeline=[
+         Phase("llm", skills=[
+             Skill("my_classifier", path="./skills/my_classifier.md"),
+         ], llm=LLMConfig(api_key="your_key")),
+     ],
+     default="unknown",
+ )
+ ```
+
+ Phases run in order. First match wins. Rulesets are instant and free. LLM is the fallback.
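The first-match-wins flow can be sketched in plain Python. This is an illustrative sketch of the idea, not errorsense's actual internals; `run_pipeline` and the toy classifiers are invented for illustration:

```python
# Illustrative sketch of first-match-wins phase evaluation; not
# errorsense's implementation, just the idea behind it.
def run_pipeline(signal, phases, default):
    for phase_name, classifiers in phases:
        for classify in classifiers:
            label = classify(signal)  # a label, or None for "can't decide"
            if label is not None:
                return phase_name, label  # short-circuit: later phases never run
    return None, default  # nothing decided -> fall back to the default label

# Toy stand-ins for a code ruleset and a message-pattern ruleset
def by_code(signal):
    return {"ECONNRESET": "transient", "EPERM": "permanent"}.get(signal.get("error_code"))

def by_pattern(signal):
    return "transient" if "timeout" in signal.get("message", "") else None

phases = [("codes", [by_code]), ("patterns", [by_pattern])]
run_pipeline({"error_code": "ECONNRESET"}, phases, "unknown")  # -> ("codes", "transient")
run_pipeline({"message": "read timeout"}, phases, "unknown")   # -> ("patterns", "transient")
```

The cheap deterministic phase answers first whenever it can; only signals that fall all the way through reach the expensive phase.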
+
+ ## Rulesets
+
+ Each ruleset does one thing — `match=` for field matching or `patterns=` for regex:
+
+ ```python
+ Ruleset(field="status_code", match={400: "client", 502: "server"})    # exact match
+ Ruleset(field="status_code", match={"4xx": "client", 503: "server"})  # range match
+ Ruleset(field="headers.content-type", match={"text/html": "server"})  # header match
+ Ruleset(field="body.error.type", match={"validation_error": "client"})  # JSON dot-path
+ Ruleset(field="body", patterns=[("server", [r"OOM"]), ("client", [r"invalid"])])  # regex
+ ```
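How a range key like `"4xx"` is resolved against an exact key like `502` isn't spelled out here. A plausible sketch, assuming exact keys take priority over range buckets (the `match_status` helper below is hypothetical, not part of errorsense):

```python
# Hypothetical sketch of range-key matching; errorsense's actual
# resolution logic may differ. Exact int keys win over "Nxx" buckets.
def match_status(status, table):
    if status in table:                      # exact match first, e.g. 502
        return table[status]
    return table.get(f"{status // 100}xx")   # 404 -> "4xx"; missing -> None

table = {"4xx": "client", 502: "server"}
match_status(404, table)  # -> "client"
match_status(502, table)  # -> "server"
match_status(500, table)  # -> None (no rule decided)
```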
+
+ Custom logic? Subclass:
+
+ ```python
+ class VendorBugRuleset(Ruleset):
+     def classify(self, signal: Signal) -> SenseResult | None:
+         if signal.get("vendor") == "acme" and signal.get("code") == "X99":
+             return SenseResult(label="known_bug", confidence=1.0)
+         return None
+ ```
+
+ ## Skills
+
+ Skills are LLM instructions stored as `.md` files. Each skill teaches the LLM how to classify errors in a specific domain.
+
+ ```python
+ # Loads from errorsense/skills/http_classifier.md (built-in)
+ Skill("http_classifier")
+
+ # Loads from your own file
+ Skill("my_classifier", path="./skills/my_classifier.md")
+ ```
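The README doesn't show how a skill file is consumed. One plausible sketch, assuming the markdown simply becomes part of the LLM prompt alongside the signal and the allowed categories (the `build_prompt` helper is hypothetical, not errorsense's API):

```python
from pathlib import Path
import tempfile

# Hypothetical sketch: treat a skill .md file as instructions that are
# prepended to the classification prompt sent to the LLM.
def build_prompt(skill_path, categories, signal_text):
    instructions = Path(skill_path).read_text()
    return (
        f"{instructions}\n\n"
        f"Allowed labels: {', '.join(categories)}\n"
        f"Error signal:\n{signal_text}\n"
        "Answer with exactly one label."
    )

# Demo with a throwaway skill file
with tempfile.NamedTemporaryFile("w", suffix=".md", delete=False) as f:
    f.write("Classify HTTP errors. A 5xx caused by bad input is a client error.")
    path = f.name

prompt = build_prompt(path, ["client", "server"], "500: model not found")
```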
+
+ ## All Phases Mode
+
+ ```python
+ # Default — stops at first match
+ results = sense.classify(signal)
+
+ # All phases run
+ results = sense.classify(signal, short_circuit=False)
+
+ # With LLM reasoning
+ results = sense.classify(signal, explain=True)
+ results[0].reason  # "ECONNRESET indicates transient network failure"
+ ```
+
+ ## Trailing (Stateful Error Tracking)
+
+ Track errors per key. When a threshold is hit, the LLM reviews the full error history.
+
+ ```python
+ from errorsense import TrailingConfig
+
+ sense = ErrorSense(
+     categories=["transient", "permanent", "user"],
+     pipeline=[...],
+     trailing=TrailingConfig(
+         threshold=3,
+         count_labels=["transient", "permanent"],  # user errors don't count
+     ),
+ )
+
+ # In your error handler:
+ result = sense.trail("service-a", signal)
+ result.label         # "transient"
+ result.at_threshold  # True (3rd counted error)
+ result.reason        # LLM review: "3 transient errors — all connection resets..."
+
+ # On success:
+ sense.reset("service-a")
+ ```
+
+ **How it works:**
+ - Each `trail()` call classifies the signal normally through the pipeline
+ - Counted labels accumulate per key toward the threshold
+ - At threshold, the LLM reviews all recorded errors and gives its verdict
+ - If the review changes the label, the history entry is corrected and the count adjusts
+ - `review=False` in TrailingConfig disables LLM review (just counting)
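The counting side of those steps can be sketched in plain Python. This is an illustrative sketch of the described semantics with the LLM review step omitted; `Trailer` is an invented stand-in, not errorsense's implementation:

```python
from collections import defaultdict

# Minimal sketch of trailing's counting behavior (LLM review omitted):
# only labels in count_labels accumulate; the threshold fires on the Nth one.
class Trailer:
    def __init__(self, threshold, count_labels):
        self.threshold = threshold
        self.count_labels = set(count_labels)
        self.counts = defaultdict(int)
        self.history = defaultdict(list)

    def trail(self, key, label):
        self.history[key].append(label)
        if label in self.count_labels:
            self.counts[key] += 1
        return self.counts[key] >= self.threshold  # "at_threshold"

    def reset(self, key):
        self.counts.pop(key, None)
        self.history.pop(key, None)

t = Trailer(threshold=3, count_labels=["transient", "permanent"])
t.trail("service-a", "transient")  # False
t.trail("service-a", "user")       # False -- "user" doesn't count
t.trail("service-a", "transient")  # False
t.trail("service-a", "transient")  # True  -- 3rd counted error
```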
+
+ **Manual review anytime:**
+
+ ```python
+ verdict = sense.review("service-a")
+ verdict.label   # LLM's verdict on the full history
+ verdict.reason  # explanation
+ ```
+
+ ## License
+
+ MIT
@@ -0,0 +1,185 @@
+ # ErrorSense
+
+ Error classification engine. Rules for the obvious, LLM for the ambiguous.
+
+ Most errors are easy to classify — a 400 is a client error, a 502 is a server error. But some aren't — a 500 with "model not found" in the body is actually a client error, not a server failure. Your rules can't catch every edge case. An LLM can.
+
+ ErrorSense runs errors through a phase pipeline: fast deterministic rulesets first, LLM only when rulesets can't decide. Most errors never hit the LLM. The ones that do get classified correctly instead of falling through as "unknown."
+
+ **Use it for:** circuit breakers, alert routing, retry logic, error dashboards; anywhere you need to know *what kind* of error happened, not just *that* it happened.
+
+ ## Install
+
+ ```bash
+ pip install errorsense        # core only (zero dependencies)
+ pip install errorsense[llm]   # + LLM classification
+ ```
+
+ ## Quick Start — Use a Preset
+
+ ```python
+ from errorsense.presets import http
+ from errorsense import LLMConfig, Signal
+
+ sense = http(llm=LLMConfig(api_key="your_api_key"))
+
+ results = sense.classify(Signal.from_http(status_code=400, body="bad request"))
+ results[0].label  # "client"
+
+ results = sense.classify(Signal.from_http(status_code=502))
+ results[0].label  # "server"
+
+ results = sense.classify(Signal.from_http(status_code=500, body="model not found"))
+ results[0].label  # "client" (LLM figured it out)
+ ```
+
+ The `http` preset gives you a 3-phase pipeline (rules → patterns → LLM) with 3 categories: `"client"`, `"server"`, `"undecided"`. Rulesets handle obvious cases instantly. LLM handles the ambiguous ones.
+
+ Don't want LLM? Use `http_no_llm()` — rulesets only, ambiguous errors come back as `"undecided"`.
+
+ ## Build Your Own Pipeline
+
+ A pipeline is a list of phases. Each phase has rulesets (deterministic) or skills (LLM). You can mix both, use only rulesets, or use only skills.
+
+ ```python
+ from errorsense import ErrorSense, Phase, Ruleset, Skill, LLMConfig, Signal
+
+ # Rulesets + LLM
+ sense = ErrorSense(
+     categories=["transient", "permanent", "user"],
+     pipeline=[
+         Phase("codes", rulesets=[
+             Ruleset(field="error_code", match={
+                 "ECONNRESET": "transient", "ETIMEOUT": "transient", "EPERM": "permanent",
+             }),
+         ]),
+         Phase("patterns", rulesets=[
+             Ruleset(field="message", patterns=[
+                 ("transient", [r"timeout", r"connection reset", r"retry"]),
+                 ("permanent", [r"corruption", r"fatal"]),
+             ]),
+         ]),
+         Phase("llm", skills=[
+             Skill("my_classifier", path="./skills/my_classifier.md"),
+         ], llm=LLMConfig(api_key="your_key")),
+     ],
+     default="transient",
+ )
+
+ # Rulesets only — no LLM needed
+ sense = ErrorSense(
+     categories=["client", "server"],
+     pipeline=[
+         Phase("rules", rulesets=[
+             Ruleset(field="status_code", match={"4xx": "client", 502: "server"}),
+         ]),
+     ],
+     default="server",
+ )
+
+ # LLM only — skip rulesets entirely
+ sense = ErrorSense(
+     categories=["client", "server"],
+     pipeline=[
+         Phase("llm", skills=[
+             Skill("my_classifier", path="./skills/my_classifier.md"),
+         ], llm=LLMConfig(api_key="your_key")),
+     ],
+     default="unknown",
+ )
+ ```
+
+ Phases run in order. First match wins. Rulesets are instant and free. LLM is the fallback.
+
+ ## Rulesets
+
+ Each ruleset does one thing — `match=` for field matching or `patterns=` for regex:
+
+ ```python
+ Ruleset(field="status_code", match={400: "client", 502: "server"})    # exact match
+ Ruleset(field="status_code", match={"4xx": "client", 503: "server"})  # range match
+ Ruleset(field="headers.content-type", match={"text/html": "server"})  # header match
+ Ruleset(field="body.error.type", match={"validation_error": "client"})  # JSON dot-path
+ Ruleset(field="body", patterns=[("server", [r"OOM"]), ("client", [r"invalid"])])  # regex
+ ```
+
+ Custom logic? Subclass:
+
+ ```python
+ class VendorBugRuleset(Ruleset):
+     def classify(self, signal: Signal) -> SenseResult | None:
+         if signal.get("vendor") == "acme" and signal.get("code") == "X99":
+             return SenseResult(label="known_bug", confidence=1.0)
+         return None
+ ```
+
+ ## Skills
+
+ Skills are LLM instructions stored as `.md` files. Each skill teaches the LLM how to classify errors in a specific domain.
+
+ ```python
+ # Loads from errorsense/skills/http_classifier.md (built-in)
+ Skill("http_classifier")
+
+ # Loads from your own file
+ Skill("my_classifier", path="./skills/my_classifier.md")
+ ```
+
+ ## All Phases Mode
+
+ ```python
+ # Default — stops at first match
+ results = sense.classify(signal)
+
+ # All phases run
+ results = sense.classify(signal, short_circuit=False)
+
+ # With LLM reasoning
+ results = sense.classify(signal, explain=True)
+ results[0].reason  # "ECONNRESET indicates transient network failure"
+ ```
+
+ ## Trailing (Stateful Error Tracking)
+
+ Track errors per key. When a threshold is hit, the LLM reviews the full error history.
+
+ ```python
+ from errorsense import TrailingConfig
+
+ sense = ErrorSense(
+     categories=["transient", "permanent", "user"],
+     pipeline=[...],
+     trailing=TrailingConfig(
+         threshold=3,
+         count_labels=["transient", "permanent"],  # user errors don't count
+     ),
+ )
+
+ # In your error handler:
+ result = sense.trail("service-a", signal)
+ result.label         # "transient"
+ result.at_threshold  # True (3rd counted error)
+ result.reason        # LLM review: "3 transient errors — all connection resets..."
+
+ # On success:
+ sense.reset("service-a")
+ ```
+
+ **How it works:**
+ - Each `trail()` call classifies the signal normally through the pipeline
+ - Counted labels accumulate per key toward the threshold
+ - At threshold, the LLM reviews all recorded errors and gives its verdict
+ - If the review changes the label, the history entry is corrected and the count adjusts
+ - `review=False` in TrailingConfig disables LLM review (just counting)
+
+ **Manual review anytime:**
+
+ ```python
+ verdict = sense.review("service-a")
+ verdict.label   # LLM's verdict on the full history
+ verdict.reason  # explanation
+ ```
+
+ ## License
+
+ MIT