wxpath-0.2.0-py3-none-any.whl → wxpath-0.3.0-py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- wxpath/cli.py +52 -12
- wxpath/core/ops.py +163 -129
- wxpath/core/parser.py +559 -280
- wxpath/core/runtime/engine.py +133 -42
- wxpath/core/runtime/helpers.py +0 -7
- wxpath/hooks/registry.py +29 -17
- wxpath/http/client/crawler.py +46 -11
- wxpath/http/client/request.py +6 -3
- wxpath/http/client/response.py +1 -1
- wxpath/http/policy/robots.py +82 -0
- {wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/METADATA +84 -37
- {wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/RECORD +16 -16
- wxpath/core/errors.py +0 -134
- {wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/WHEEL +0 -0
- {wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/entry_points.txt +0 -0
- {wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/licenses/LICENSE +0 -0
- {wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/top_level.txt +0 -0

{wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/METADATA
@@ -1,13 +1,12 @@
 Metadata-Version: 2.4
 Name: wxpath
-Version: 0.2.0
+Version: 0.3.0
 Summary: wxpath - a declarative web crawler and data extractor
 Author-email: Rodrigo Palacios <rodrigopala91@gmail.com>
 License-Expression: MIT
-Requires-Python: >=3.
+Requires-Python: >=3.10
 Description-Content-Type: text/markdown
 License-File: LICENSE
-Requires-Dist: requests>=2.0
 Requires-Dist: lxml>=4.0
 Requires-Dist: elementpath<=5.0.3,>=5.0.0
 Requires-Dist: aiohttp<=3.12.15,>=3.8.0
@@ -18,12 +17,13 @@ Provides-Extra: dev
 Requires-Dist: ruff; extra == "dev"
 Dynamic: license-file
 
+# **wxpath** - declarative web crawling with XPath
 
-
+[](https://www.python.org/downloads/release/python-3100/)
 
-**wxpath** is a declarative web crawler where traversal is expressed directly in XPath. Instead of writing imperative crawl loops, you describe what to follow and what to extract in a single expression. **wxpath**
+**wxpath** is a declarative web crawler where traversal is expressed directly in XPath. Instead of writing imperative crawl loops, wxpath lets you describe what to follow and what to extract in a single expression. **wxpath** executes that expression concurrently, breadth-first-*ish*, and streams results as they are discovered.
 
-By introducing the `url(...)` operator and the `///` syntax,
+By introducing the `url(...)` operator and the `///` syntax, wxpath's engine is able to perform deep (or paginated) web crawling and extraction.
 
 NOTE: This project is in early development. Core concepts are stable, but the API and features may change. Please report issues - in particular, deadlocked crawls or unexpected behavior - and any features you'd like to see (no guarantee they'll be implemented).
 
@@ -31,19 +31,22 @@ NOTE: This project is in early development. Core concepts are stable, but the AP
 ## Contents
 
 - [Example](#example)
-- [
+- [Language Design](DESIGN.md)
+- [`url(...)` and `///url(...)` Explained](#url-and-url-explained)
 - [General flow](#general-flow)
 - [Asynchronous Crawling](#asynchronous-crawling)
+- [Polite Crawling](#polite-crawling)
 - [Output types](#output-types)
-- [XPath 3.1
+- [XPath 3.1](#xpath-31-by-default)
 - [CLI](#cli)
 - [Hooks (Experimental)](#hooks-experimental)
 - [Install](#install)
-- [More Examples](
+- [More Examples](EXAMPLES.md)
 - [Comparisons](#comparisons)
 - [Advanced: Engine & Crawler Configuration](#advanced-engine--crawler-configuration)
 - [Project Philosophy](#project-philosophy)
 - [Warnings](#warnings)
+- [Commercial support / consulting](#commercial-support--consulting)
 - [License](#license)
 
 
@@ -52,33 +55,35 @@ NOTE: This project is in early development. Core concepts are stable, but the AP
 ```python
 import wxpath
 
-
+# Crawl, extract fields, build a knowledge graph
+path_expr = """
 url('https://en.wikipedia.org/wiki/Expression_language')
 ///url(//main//a/@href[starts-with(., '/wiki/') and not(contains(., ':'))])
 /map{
-    'title':(//span[contains(@class, "mw-page-title-main")]/text())[1],
-    'url':string(base-uri(.)),
-    'short_description':
+    'title': (//span[contains(@class, "mw-page-title-main")]/text())[1] ! string(.),
+    'url': string(base-uri(.)),
+    'short_description': //div[contains(@class, 'shortdescription')]/text() ! string(.),
+    'forward_links': //div[@id="mw-content-text"]//a/@href ! string(.)
 }
 """
 
-for item in wxpath.wxpath_async_blocking_iter(
+for item in wxpath.wxpath_async_blocking_iter(path_expr, max_depth=1):
     print(item)
 ```
 
 Output:
 
 ```python
-map{'title':
-map{'title':
-map{'title':
-
-map{'title': TextNode('Data Analysis Expressions'), 'url': 'https://en.wikipedia.org/wiki/Data_Analysis_Expressions', 'short_description': TextNode('Formula and data query language')}
-map{'title': TextNode('Domain knowledge'), 'url': 'https://en.wikipedia.org/wiki/Domain_knowledge', 'short_description': TextNode('Specialist knowledge within a specific field')}
-map{'title': TextNode('Rights Expression Language'), 'url': 'https://en.wikipedia.org/wiki/Rights_Expression_Language', 'short_description': TextNode('Machine-processable language used to express intellectual property rights (such as copyright)')}
-map{'title': TextNode('Computer science'), 'url': 'https://en.wikipedia.org/wiki/Computer_science', 'short_description': TextNode('Study of computation')}
+map{'title': 'Computer language', 'url': 'https://en.wikipedia.org/wiki/Computer_language', 'short_description': 'Formal language for communicating with a computer', 'forward_links': ['/wiki/Formal_language', '/wiki/Communication', ...]}
+map{'title': 'Advanced Boolean Expression Language', 'url': 'https://en.wikipedia.org/wiki/Advanced_Boolean_Expression_Language', 'short_description': 'Hardware description language and software', 'forward_links': ['/wiki/File:ABEL_HDL_example_SN74162.png', '/wiki/Hardware_description_language', ...]}
+map{'title': 'Machine-readable medium and data', 'url': 'https://en.wikipedia.org/wiki/Machine_readable', 'short_description': 'Medium capable of storing data in a format readable by a machine', 'forward_links': ['/wiki/File:EAN-13-ISBN-13.svg', '/wiki/ISBN', ...]}
+...
 ```
 
+**Note:** Some sites (including Wikipedia) may block requests without proper headers.
+See [Advanced: Engine & Crawler Configuration](#advanced-engine--crawler-configuration) to set a custom `User-Agent`.
+
+
 The above expression does the following:
 
 1. Starts at the specified URL, `https://en.wikipedia.org/wiki/Expression_language`.
@@ -92,18 +97,23 @@ The above expression does the following:
 ## `url(...)` and `///url(...)` Explained
 
 - `url(...)` is a custom operator that fetches the content of the user-specified or internally generated URL and returns it as an `lxml.html.HtmlElement` for further XPath processing.
-- `///url(...)` indicates
+- `///url(...)` indicates a deep crawl. It tells the runtime engine to continue following links up to the specified `max_depth`. Unlike repeated `url()` hops, it allows a single expression to describe deeper graph exploration. WARNING: Use with caution and constraints (via `max_depth` or XPath predicates) to avoid traversal explosion.
+
+
+## Language Design
+
+See [DESIGN.md](DESIGN.md) for details of the language design. It walks through the core concepts and designs the language from the ground up.
 
 
 ## General flow
 
 **wxpath** evaluates an expression as a list of traversal and extraction steps (internally referred to as `Segment`s).
 
-`url(...)` creates crawl tasks either statically (via a fixed URL) or dynamically (via a URL derived from the XPath expression). **URLs are deduplicated globally,
+`url(...)` creates crawl tasks either statically (via a fixed URL) or dynamically (via a URL derived from the XPath expression). **URLs are deduplicated globally, on a best-effort basis - not per-depth**.
 
 XPath segments operate on fetched documents (fetched via the immediately preceding `url(...)` operations).
 
-`///url(...)` indicates
+`///url(...)` indicates deep crawling - it proceeds breadth-first-*ish* up to `max_depth`.
 
 Results are yielded as soon as they are ready.
 
@@ -128,7 +138,7 @@ asyncio.run(main())
 
 ### Blocking, Concurrent Requests
 
-**wxpath** also
+**wxpath** also provides an asyncio-in-sync API, allowing you to crawl multiple pages concurrently while maintaining the simplicity of synchronous code. This is particularly useful for crawls in strictly synchronous execution environments (i.e., not inside an `asyncio` event loop) where performance is a concern.
 
 ```python
 from wxpath import wxpath_async_blocking_iter
@@ -137,10 +147,14 @@ path_expr = "url('https://en.wikipedia.org/wiki/Expression_language')///url(//@h
 items = list(wxpath_async_blocking_iter(path_expr, max_depth=1))
 ```
 
+## Polite Crawling
+
+**wxpath** respects [robots.txt](https://en.wikipedia.org/wiki/Robots_exclusion_standard) by default via the `WXPathEngine(..., robotstxt=True)` constructor.
+
 
 ## Output types
 
-The wxpath Python API yields structured objects
+The wxpath Python API yields structured objects.
 
 Depending on the expression, results may include:
 
@@ -188,10 +202,11 @@ path_expr = """
 
 The following example demonstrates how to crawl Wikipedia starting from the "Expression language" page, extract links to other wiki pages, and retrieve specific fields from each linked page.
 
-
+NOTE: Due to the everchanging nature of web content, the output may vary over time.
 ```bash
-> wxpath --depth 1
-
+> wxpath --depth 1 \
+--header "User-Agent: my-app/0.1 (contact: you@example.com)" \
+"url('https://en.wikipedia.org/wiki/Expression_language') \
 ///url(//div[@id='mw-content-text']//a/@href[starts-with(., '/wiki/') \
 and not(matches(@href, '^(?:/wiki/)?(?:Wikipedia|File|Template|Special|Template_talk|Help):'))]) \
 /map{ \
@@ -212,6 +227,18 @@ WARNING: Due to the everchanging nature of web content, the output may vary over
 {"title": "Computer science", "short_description": "Study of computation", "url": "https://en.wikipedia.org/wiki/Computer_science", "backlink": "https://en.wikipedia.org/wiki/Expression_language", "depth": 1.0}
 ```
 
+Command line options:
+
+```bash
+--depth <depth>                         Max crawl depth
+--verbose [true|false]                  Provides superficial CLI information
+--debug [true|false]                    Provides verbose runtime output and information
+--concurrency <concurrency>             Number of concurrent fetches
+--concurrency-per-host <concurrency>    Number of concurrent fetches per host
+--header "Key:Value"                    Add a custom header (e.g., 'Key:Value'). Can be used multiple times.
+--respect-robots [true|false]           (Default: True) Respects robots.txt
+```
+
 
 
 ## Hooks (Experimental)
@@ -257,6 +284,8 @@ hooks.register(hooks.JSONLWriter)
 
 
 ## Install
+Requires Python 3.10+.
+
 ```
 pip install wxpath
 ```
@@ -285,13 +314,20 @@ crawler = Crawler(
     concurrency=8,
     per_host=2,
     timeout=10,
+    respect_robots=False,
+    headers={
+        "User-Agent": "my-app/0.1.0 (contact: you@example.com)",  # Sites like Wikipedia will appreciate this
+    },
 )
 
 # If `crawler` is not specified, a default Crawler will be created with
-# the provided concurrency and
+# the provided concurrency, per_host, and respect_robots values, or with defaults.
 engine = WXPathEngine(
-    # concurrency=16,
-    # per_host=8,
+    # concurrency: int = 16,
+    # per_host: int = 8,
+    # respect_robots: bool = True,
+    # allowed_response_codes: set[int] = {200},
+    # allow_redirects: bool = True,
     crawler=crawler,
 )
 
@@ -305,7 +341,7 @@ items = list(wxpath_async_blocking_iter(path_expr, max_depth=1, engine=engine))
 
 ### Principles
 
-- Enable declarative,
+- Enable declarative crawling and scraping without boilerplate
 - Stay lightweight and composable
 - Asynchronous support for high-performance crawls
 
@@ -316,22 +352,33 @@ items = list(wxpath_async_blocking_iter(path_expr, max_depth=1, engine=engine))
 - Requests are performed concurrently.
 - Results are streamed as soon as they are available.
 
-###
+### Limitations (for now)
+
+The following features are not yet supported:
 
-- Strict result ordering
 - Persistent scheduling or crawl resumption
 - Automatic proxy rotation
 - Browser-based rendering (JavaScript execution)
+- Strict result ordering
 
 
 ## WARNINGS!!!
 
 - Be respectful when crawling websites. A scrapy-inspired throttler is enabled by default.
--
+- Deep crawls (`///`) require user discipline to avoid unbounded expansion (traversal explosion).
 - Deadlocks and hangs are possible in certain situations (e.g., all tasks waiting on blocked requests). Please report issues if you encounter such behavior.
 - Consider using timeouts, `max_depth`, and XPath predicates and filters to limit crawl scope.
 
 
+## Commercial support / consulting
+
+If you want help building or operating crawlers/data feeds with wxpath (extraction, scheduling, monitoring, breakage fixes) or other web-scraping needs, please contact me at: rodrigopala91@gmail.com.
+
+
+### Donate
+
+If you like wxpath and want to support its development, please consider [donating](https://www.paypal.com/donate/?business=WDNDK6J6PJEXY&no_recurring=0&item_name=Thanks+for+using+wxpath%21+Donations+fund+development%2C+docs%2C+and+bug+fixes.+If+wxpath+saved+you+time%2C+a+small+contribution+helps%21&currency_code=USD).
+
 ## License
 
 MIT
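
The README hunks above introduce the 0.3.0 polite-crawling, custom-header, and engine/crawler options in separate snippets. The sketch below assembles those snippets into one flow; the constructor arguments and call shapes are copied from the diff, while the `Crawler` and `WXPathEngine` import paths are assumptions inferred from the file list at the top (they are not shown in the README excerpt).

```python
# Assembled from the 0.3.0 README snippets shown in the diff above.
# NOTE: only the call shapes (Crawler(...), WXPathEngine(crawler=...),
# wxpath_async_blocking_iter(...)) appear verbatim in the README; the
# import locations below are assumptions.
from wxpath import wxpath_async_blocking_iter
from wxpath.core.runtime.engine import WXPathEngine   # assumed module path
from wxpath.http.client.crawler import Crawler        # assumed module path

crawler = Crawler(
    concurrency=8,
    per_host=2,
    timeout=10,
    respect_robots=True,  # new in 0.3.0; the README states robots.txt is respected by default
    headers={"User-Agent": "my-app/0.1 (contact: you@example.com)"},
)
engine = WXPathEngine(crawler=crawler)

path_expr = """
url('https://en.wikipedia.org/wiki/Expression_language')
///url(//main//a/@href[starts-with(., '/wiki/') and not(contains(., ':'))])
/map{'title': (//span[contains(@class, "mw-page-title-main")]/text())[1] ! string(.)}
"""

for item in wxpath_async_blocking_iter(path_expr, max_depth=1, engine=engine):
    print(item)
```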

{wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/RECORD
@@ -1,33 +1,33 @@
 wxpath/__init__.py,sha256=w1hFE_VSIYq_TSFLoPfp6MJbG1sA6BeChX6PYsXIK4o,265
-wxpath/cli.py,sha256=
+wxpath/cli.py,sha256=GJ4vAax5DlpxczZ_eLetlfRwa177VFKo2LHv09X-0eo,2799
 wxpath/patches.py,sha256=u0dOL-K-gvdO9SJvzGrqR9Zou6XduWjl6R7mzIcZtJg,2130
 wxpath/core/__init__.py,sha256=U9_In2iRaZrpiIVavIli1M59gCB6Kn1en-1Fza-qIiI,257
 wxpath/core/dom.py,sha256=X0L3n8jRfO5evEypDaJTD-NQ3cLXWvnEUVERAHo3vV0,701
-wxpath/core/errors.py,sha256=q56Gs5JJSC4HKImUtdZhOHcqe8XsoIrVhsaaoJ2qhCQ,4198
 wxpath/core/models.py,sha256=3KYt-UwfLY2FlSRUHeA_getnYaNUMPW9wRrl2CRbPso,1611
-wxpath/core/ops.py,sha256=
-wxpath/core/parser.py,sha256=
+wxpath/core/ops.py,sha256=PTjX6c4QvCqGaByYYqaK4dte5iWO3lZzgqGrMXp6f6g,9727
+wxpath/core/parser.py,sha256=WfjQNixBz7nWtX2O0t19MOhUJmzGMg8Qol40P6oC8zc,18827
 wxpath/core/runtime/__init__.py,sha256=_iCgkIWxXvxzQcenHOsjYGsk74HboTIYWOtgM8GtCyc,86
-wxpath/core/runtime/engine.py,sha256=
-wxpath/core/runtime/helpers.py,sha256=
+wxpath/core/runtime/engine.py,sha256=069ITKDXcHss__AwaYf0VSfliCNB49yZbnW2v3xEZO0,14512
+wxpath/core/runtime/helpers.py,sha256=M1i4BryCktAxeboa4LOXMTNiKVCJLDBD-KpWCQXadpw,1434
 wxpath/hooks/__init__.py,sha256=9JG63e4z_8CZLWugFcY786hebaEEPZ5FmZhyDHat-98,294
 wxpath/hooks/builtin.py,sha256=GJ4w1C9djWNzAmAA3U0qI9OoCOeC5R8tEGtWXJVHSYs,4125
-wxpath/hooks/registry.py,sha256
+wxpath/hooks/registry.py,sha256=-D11f_mMboeVAH8qsTkbKTQ0aGNaQ7F6zbXDsOIYxN0,4513
 wxpath/http/__init__.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
 wxpath/http/stats.py,sha256=FrXbFrnms113Gapf-Z5WiD5qaNiJ0XuOqjSQhwXfuEo,3172
 wxpath/http/client/__init__.py,sha256=QpdmqzcznUeuFvT3IIo-LmBUUHEa2BDq9sHGAHJnDLI,202
-wxpath/http/client/crawler.py,sha256=
-wxpath/http/client/request.py,sha256=
-wxpath/http/client/response.py,sha256=
+wxpath/http/client/crawler.py,sha256=YlE469UqMck0wqRd6J9kNxm5G9BCbE_x5O6MROwmcaE,8742
+wxpath/http/client/request.py,sha256=LF_OIXetfouyE5GwEqp0cya0oMAZouKRPNFRFGscQS8,1050
+wxpath/http/client/response.py,sha256=z9LQPnDN-NZRnQpIKozaWCqgpRejc6nixCr_XaPyqUQ,334
 wxpath/http/policy/backoff.py,sha256=NwdUR6bRe1RtUGSJOktj-p8IyC1l9xu_-Aa_Gj_u5sw,321
 wxpath/http/policy/retry.py,sha256=WSrQfCy1F7IcXFpVGDi4HTphNhFq12p4DaMO0_4dgrw,982
+wxpath/http/policy/robots.py,sha256=vllXX9me78YB6yrDdpH_bwyuR5QoC9uveGEl8PmHM9Q,3134
 wxpath/http/policy/throttler.py,sha256=wydMFV-0mxpHSI5iYkLfE78oY4z_fF8jW9MqCeb8G54,3014
 wxpath/util/__init__.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
 wxpath/util/logging.py,sha256=oQi8sp7yKWgXkkcJ4U4WHp7TyBCQiK4VhSXOSb8pGw0,2965
 wxpath/util/serialize.py,sha256=uUs4C9VErpFd97smBM2bRWo2nW25kCgKdsMrVtVxhg8,575
-wxpath-0.
-wxpath-0.
-wxpath-0.
-wxpath-0.
-wxpath-0.
-wxpath-0.
+wxpath-0.3.0.dist-info/licenses/LICENSE,sha256=AVBZLhdWmqxm-f-dy5prVB1E-solHWoP2EXEIV_o-00,1076
+wxpath-0.3.0.dist-info/METADATA,sha256=9Y0V7Up2efXCRtKZ7Cceawz9LHvNcfH0olmEGK2mVk0,16326
+wxpath-0.3.0.dist-info/WHEEL,sha256=_zCd3N1l69ArxyTb8rzEoP9TpbYXkqRFSNOD5OuxnTs,91
+wxpath-0.3.0.dist-info/entry_points.txt,sha256=FwoIOnUTl-DjPqVw-eb9EHHiiXCyRZy_mEQKFu2eb5Y,43
+wxpath-0.3.0.dist-info/top_level.txt,sha256=uFCcveG78mnefxRGvYsR2OexDlKR_Z1UD4vZijUcex8,7
+wxpath-0.3.0.dist-info/RECORD,,

wxpath/core/errors.py
DELETED
@@ -1,134 +0,0 @@
-
-import collections.abc as cabc
-import functools
-import inspect
-import types
-from contextlib import contextmanager
-from contextvars import ContextVar
-from enum import Enum, auto
-from typing import AsyncGenerator
-
-from wxpath.util.logging import get_logger
-
-log = get_logger(__name__)
-
-class ErrorPolicy(Enum):
-    IGNORE = auto()   # swallow completely
-    LOG = auto()      # just log at ERROR
-    COLLECT = auto()  # yield {"_error": ..., "_ctx": ...}
-    RAISE = auto()    # re-raise
-
-
-_GLOBAL_DEFAULT = ErrorPolicy.LOG
-
-# Task-local override (None => fall back to _GLOBAL_DEFAULT)
-_CURRENT: ContextVar[ErrorPolicy | None] = ContextVar("wx_err_policy", default=None)
-
-
-def get_current_error_policy() -> ErrorPolicy:
-    return _CURRENT.get() or _GLOBAL_DEFAULT
-
-
-def set_default_error_policy(policy: ErrorPolicy) -> None:
-    global _GLOBAL_DEFAULT
-    _GLOBAL_DEFAULT = policy
-
-
-@contextmanager
-def use_error_policy(policy: ErrorPolicy):
-    token = _CURRENT.set(policy)
-    try:
-        yield
-    finally:
-        _CURRENT.reset(token)
-
-
-def handle_error(exc: Exception, policy: ErrorPolicy, ctx: dict):
-    if policy is ErrorPolicy.IGNORE:
-        return None
-
-    if policy is ErrorPolicy.LOG:
-        log.exception("processing error", extra=ctx)
-        return None
-
-    if policy is ErrorPolicy.COLLECT:
-        return {"_error": str(exc), "_ctx": ctx}
-
-    # RAISE (safe default)
-    raise exc
-
-
-def _is_gen(obj):  # helper
-    return isinstance(obj, (types.GeneratorType, cabc.Generator))
-
-
-def with_errors():
-    """
-    Apply the current ErrorPolicy at call time while preserving the callable kind:
-    - async generator -> async generator wrapper
-    - coroutine -> async wrapper
-    - sync generator -> sync generator wrapper
-    - plain function -> plain wrapper
-    """
-    def decorator(fn):
-        # --- ASYNC GENERATOR ---
-        if inspect.isasyncgenfunction(fn):
-            @functools.wraps(fn)
-            async def asyncgen_wrapped(*a, **kw) -> AsyncGenerator:
-                try:
-                    async for item in fn(*a, **kw):
-                        yield item
-                except Exception as exc:
-                    collected = handle_error(exc, get_current_error_policy(),
-                                             _ctx_from_sig(fn, a, kw))
-                    if collected is not None:
-                        yield collected
-            return asyncgen_wrapped
-
-        # --- COROUTINE ---
-        if inspect.iscoroutinefunction(fn):
-            @functools.wraps(fn)
-            async def coro_wrapped(*a, **kw):
-                try:
-                    return await fn(*a, **kw)
-                except Exception as exc:
-                    return handle_error(exc, get_current_error_policy(),
-                                        _ctx_from_sig(fn, a, kw))
-            return coro_wrapped
-
-        # --- SYNC GENERATOR ---
-        if inspect.isgeneratorfunction(fn):
-            @functools.wraps(fn)
-            def gen_wrapped(*a, **kw):
-                try:
-                    for item in fn(*a, **kw):
-                        yield item
-                except Exception as exc:
-                    collected = handle_error(exc, get_current_error_policy(),
-                                             _ctx_from_sig(fn, a, kw))
-                    if collected is not None:
-                        yield collected
-            return gen_wrapped
-
-        # --- PLAIN SYNC FUNCTION ---
-        @functools.wraps(fn)
-        def plain_wrapped(*a, **kw):
-            try:
-                return fn(*a, **kw)
-            except Exception as exc:
-                return handle_error(exc, get_current_error_policy(),
-                                    _ctx_from_sig(fn, a, kw))
-        return plain_wrapped
-    return decorator
-
-
-def _ctx_from_sig(fn, a, kw):
-    """Best-effort extraction of depth/url/op for logging."""
-    # you already pass these in every handler, so pull by position
-    try:
-        elem, segs, depth, *_ = a
-        op, val = segs[0] if segs else ("?", "?")
-        url = getattr(elem, "base_url", None)
-        return {"op": op, "depth": depth, "url": url}
-    except Exception:
-        return {}
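
For orientation, the module deleted above provided a context-scoped error policy plus a `with_errors` decorator that wrapped plain functions, coroutines, and (async) generators. The following is a hypothetical usage sketch written only against the 0.2.0 code shown above; the handler shape mirrors what `_ctx_from_sig` expects, and none of this exists in 0.3.0.

```python
# Hypothetical usage of the error-policy API removed in 0.3.0 (0.2.0 only).
from wxpath.core.errors import ErrorPolicy, use_error_policy, with_errors

@with_errors()
def extract(elem, segs, depth):
    # Handler in the (elem, segs, depth, ...) shape assumed by _ctx_from_sig.
    for op, val in segs:
        if val is None:
            raise ValueError(f"missing value for {op}")
        yield {op: val}

# COLLECT converts the exception into a {"_error": ..., "_ctx": ...} item
# instead of propagating it, but only inside this block.
with use_error_policy(ErrorPolicy.COLLECT):
    items = list(extract(None, [("title", "ok"), ("url", None)], 0))
    # items == [{"title": "ok"}, {"_error": "missing value for url", "_ctx": {...}}]
```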

{wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/WHEEL: file without changes
{wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/entry_points.txt: file without changes
{wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/licenses/LICENSE: file without changes
{wxpath-0.2.0.dist-info → wxpath-0.3.0.dist-info}/top_level.txt: file without changes