alap-python 0.1.0__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,96 @@
1
+ Metadata-Version: 2.4
2
+ Name: alap-python
3
+ Version: 0.1.0
4
+ Summary: Alap expression parser for Python — resolve link expressions server-side
5
+ Author: Daniel Smith
6
+ License-Expression: Apache-2.0
7
+ Project-URL: Homepage, https://alap.info
8
+ Project-URL: Repository, https://github.com/DanielSmith/alap-python
9
+ Keywords: alap,expression-parser,links,menu
10
+ Classifier: Development Status :: 4 - Beta
11
+ Classifier: Intended Audience :: Developers
12
+ Classifier: Programming Language :: Python :: 3
13
+ Classifier: Programming Language :: Python :: 3.10
14
+ Classifier: Programming Language :: Python :: 3.11
15
+ Classifier: Programming Language :: Python :: 3.12
16
+ Classifier: Programming Language :: Python :: 3.13
17
+ Classifier: Topic :: Software Development :: Libraries
18
+ Requires-Python: >=3.10
19
+ Description-Content-Type: text/markdown
20
+
21
+ # Alap Expression Parser — Python
22
+
23
+ [Alap](https://github.com/DanielSmith/alap) is a JavaScript library that turns links into dynamic menus with multiple curated targets. This is the server-side Python port of the expression parser, enabling expression resolution in Python servers without a Node.js sidecar.
24
+
25
+ ## What's included
26
+
27
+ - **`expression_parser.py`** — Recursive descent parser for the Alap expression grammar, macro expansion, regex search, config merging
28
+ - **`validate_regex.py`** — ReDoS guard for user-supplied regex patterns
29
+
30
+ ## What's NOT included
31
+
32
+ This is the server-side subset of `alap/core`. It covers expression parsing, config merging, and regex validation — everything a server needs to resolve cherry-pick and query requests.
33
+
34
+ Browser-side concerns (DOM rendering, menu positioning, event handling, URL sanitization) are handled by the JavaScript client and are not ported here.
35
+
36
+ ## Supported expression syntax
37
+
38
+ ```
39
+ item1, item2 # item IDs (comma-separated)
40
+ .coffee # tag query
41
+ .nyc + .bridge # AND (intersection)
42
+ .nyc | .sf # OR (union)
43
+ .nyc - .tourist # WITHOUT (subtraction)
44
+ (.nyc | .sf) + .open # parenthesized grouping
45
+ @favorites # macro expansion
46
+ /mypattern/ # regex search (by pattern key)
47
+ /mypattern/lu # regex with field options
48
+ ```
49
+
50
+ ## Usage
51
+
52
+ ```python
53
+ from expression_parser import ExpressionParser, resolve_expression, cherry_pick_links, merge_configs
54
+
55
+ config = {
56
+ "allLinks": {
57
+ "item1": {"label": "Example", "url": "https://example.com", "tags": ["demo"]},
58
+ "item2": {"label": "Other", "url": "https://other.com", "tags": ["demo", "test"]},
59
+ },
60
+ "macros": {
61
+ "all": {"linkItems": ".demo"}
62
+ }
63
+ }
64
+
65
+ # Low-level: get matching IDs
66
+ parser = ExpressionParser(config)
67
+ ids = parser.query(".demo") # ["item1", "item2"]
68
+ ids = parser.query(".demo - .test") # ["item1"]
69
+
70
+ # Convenience: expression -> full link objects
71
+ results = resolve_expression(config, ".demo")
72
+ # [{"id": "item1", "label": "Example", ...}, {"id": "item2", ...}]
73
+
74
+ # Cherry-pick: expression -> { id: link } dict
75
+ subset = cherry_pick_links(config, ".test")
76
+ # {"item2": {"label": "Other", ...}}
77
+
78
+ # Merge multiple configs
79
+ merged = merge_configs(config1, config2)
80
+ ```
81
+
82
+ ## Installation
83
+
84
+ Copy `expression_parser.py` and `validate_regex.py` into your project, or install from PyPI:
85
+
86
+ ```bash
87
+ pip install alap-python
88
+ # or, with uv (recommended):
89
+ uv add alap-python
90
+ ```
91
+
92
+ ## Used by
93
+
94
+ - [flask-sqlite](https://github.com/DanielSmith/alap/tree/main/examples/servers/flask-sqlite) server
95
+ - [fastapi-postgres](https://github.com/DanielSmith/alap/tree/main/examples/servers/fastapi-postgres) server
96
+ - [django-sqlite](https://github.com/DanielSmith/alap/tree/main/examples/servers/django-sqlite) server
@@ -0,0 +1,76 @@
1
+ # Alap Expression Parser — Python
2
+
3
+ [Alap](https://github.com/DanielSmith/alap) is a JavaScript library that turns links into dynamic menus with multiple curated targets. This is the server-side Python port of the expression parser, enabling expression resolution in Python servers without a Node.js sidecar.
4
+
5
+ ## What's included
6
+
7
+ - **`expression_parser.py`** — Recursive descent parser for the Alap expression grammar, macro expansion, regex search, config merging
8
+ - **`validate_regex.py`** — ReDoS guard for user-supplied regex patterns
9
+
10
+ ## What's NOT included
11
+
12
+ This is the server-side subset of `alap/core`. It covers expression parsing, config merging, and regex validation — everything a server needs to resolve cherry-pick and query requests.
13
+
14
+ Browser-side concerns (DOM rendering, menu positioning, event handling, URL sanitization) are handled by the JavaScript client and are not ported here.
15
+
16
+ ## Supported expression syntax
17
+
18
+ ```
19
+ item1, item2 # item IDs (comma-separated)
20
+ .coffee # tag query
21
+ .nyc + .bridge # AND (intersection)
22
+ .nyc | .sf # OR (union)
23
+ .nyc - .tourist # WITHOUT (subtraction)
24
+ (.nyc | .sf) + .open # parenthesized grouping
25
+ @favorites # macro expansion
26
+ /mypattern/ # regex search (by pattern key)
27
+ /mypattern/lu # regex with field options
28
+ ```
29
+
30
+ ## Usage
31
+
32
+ ```python
33
+ from expression_parser import ExpressionParser, resolve_expression, cherry_pick_links, merge_configs
34
+
35
+ config = {
36
+ "allLinks": {
37
+ "item1": {"label": "Example", "url": "https://example.com", "tags": ["demo"]},
38
+ "item2": {"label": "Other", "url": "https://other.com", "tags": ["demo", "test"]},
39
+ },
40
+ "macros": {
41
+ "all": {"linkItems": ".demo"}
42
+ }
43
+ }
44
+
45
+ # Low-level: get matching IDs
46
+ parser = ExpressionParser(config)
47
+ ids = parser.query(".demo") # ["item1", "item2"]
48
+ ids = parser.query(".demo - .test") # ["item1"]
49
+
50
+ # Convenience: expression -> full link objects
51
+ results = resolve_expression(config, ".demo")
52
+ # [{"id": "item1", "label": "Example", ...}, {"id": "item2", ...}]
53
+
54
+ # Cherry-pick: expression -> { id: link } dict
55
+ subset = cherry_pick_links(config, ".test")
56
+ # {"item2": {"label": "Other", ...}}
57
+
58
+ # Merge multiple configs
59
+ merged = merge_configs(config1, config2)
60
+ ```
61
+
62
+ ## Installation
63
+
64
+ Copy `expression_parser.py` and `validate_regex.py` into your project, or install from PyPI:
65
+
66
+ ```bash
67
+ pip install alap-python
68
+ # or, with uv (recommended):
69
+ uv add alap-python
70
+ ```
71
+
72
+ ## Used by
73
+
74
+ - [flask-sqlite](https://github.com/DanielSmith/alap/tree/main/examples/servers/flask-sqlite) server
75
+ - [fastapi-postgres](https://github.com/DanielSmith/alap/tree/main/examples/servers/fastapi-postgres) server
76
+ - [django-sqlite](https://github.com/DanielSmith/alap/tree/main/examples/servers/django-sqlite) server
@@ -0,0 +1,96 @@
1
+ Metadata-Version: 2.4
2
+ Name: alap-python
3
+ Version: 0.1.0
4
+ Summary: Alap expression parser for Python — resolve link expressions server-side
5
+ Author: Daniel Smith
6
+ License-Expression: Apache-2.0
7
+ Project-URL: Homepage, https://alap.info
8
+ Project-URL: Repository, https://github.com/DanielSmith/alap-python
9
+ Keywords: alap,expression-parser,links,menu
10
+ Classifier: Development Status :: 4 - Beta
11
+ Classifier: Intended Audience :: Developers
12
+ Classifier: Programming Language :: Python :: 3
13
+ Classifier: Programming Language :: Python :: 3.10
14
+ Classifier: Programming Language :: Python :: 3.11
15
+ Classifier: Programming Language :: Python :: 3.12
16
+ Classifier: Programming Language :: Python :: 3.13
17
+ Classifier: Topic :: Software Development :: Libraries
18
+ Requires-Python: >=3.10
19
+ Description-Content-Type: text/markdown
20
+
21
+ # Alap Expression Parser — Python
22
+
23
+ [Alap](https://github.com/DanielSmith/alap) is a JavaScript library that turns links into dynamic menus with multiple curated targets. This is the server-side Python port of the expression parser, enabling expression resolution in Python servers without a Node.js sidecar.
24
+
25
+ ## What's included
26
+
27
+ - **`expression_parser.py`** — Recursive descent parser for the Alap expression grammar, macro expansion, regex search, config merging
28
+ - **`validate_regex.py`** — ReDoS guard for user-supplied regex patterns
29
+
30
+ ## What's NOT included
31
+
32
+ This is the server-side subset of `alap/core`. It covers expression parsing, config merging, and regex validation — everything a server needs to resolve cherry-pick and query requests.
33
+
34
+ Browser-side concerns (DOM rendering, menu positioning, event handling, URL sanitization) are handled by the JavaScript client and are not ported here.
35
+
36
+ ## Supported expression syntax
37
+
38
+ ```
39
+ item1, item2 # item IDs (comma-separated)
40
+ .coffee # tag query
41
+ .nyc + .bridge # AND (intersection)
42
+ .nyc | .sf # OR (union)
43
+ .nyc - .tourist # WITHOUT (subtraction)
44
+ (.nyc | .sf) + .open # parenthesized grouping
45
+ @favorites # macro expansion
46
+ /mypattern/ # regex search (by pattern key)
47
+ /mypattern/lu # regex with field options
48
+ ```
49
+
50
+ ## Usage
51
+
52
+ ```python
53
+ from expression_parser import ExpressionParser, resolve_expression, cherry_pick_links, merge_configs
54
+
55
+ config = {
56
+ "allLinks": {
57
+ "item1": {"label": "Example", "url": "https://example.com", "tags": ["demo"]},
58
+ "item2": {"label": "Other", "url": "https://other.com", "tags": ["demo", "test"]},
59
+ },
60
+ "macros": {
61
+ "all": {"linkItems": ".demo"}
62
+ }
63
+ }
64
+
65
+ # Low-level: get matching IDs
66
+ parser = ExpressionParser(config)
67
+ ids = parser.query(".demo") # ["item1", "item2"]
68
+ ids = parser.query(".demo - .test") # ["item1"]
69
+
70
+ # Convenience: expression -> full link objects
71
+ results = resolve_expression(config, ".demo")
72
+ # [{"id": "item1", "label": "Example", ...}, {"id": "item2", ...}]
73
+
74
+ # Cherry-pick: expression -> { id: link } dict
75
+ subset = cherry_pick_links(config, ".test")
76
+ # {"item2": {"label": "Other", ...}}
77
+
78
+ # Merge multiple configs
79
+ merged = merge_configs(config1, config2)
80
+ ```
81
+
82
+ ## Installation
83
+
84
+ Copy `expression_parser.py` and `validate_regex.py` into your project, or install from PyPI:
85
+
86
+ ```bash
87
+ pip install alap-python
88
+ # or, with uv (recommended):
89
+ uv add alap-python
90
+ ```
91
+
92
+ ## Used by
93
+
94
+ - [flask-sqlite](https://github.com/DanielSmith/alap/tree/main/examples/servers/flask-sqlite) server
95
+ - [fastapi-postgres](https://github.com/DanielSmith/alap/tree/main/examples/servers/fastapi-postgres) server
96
+ - [django-sqlite](https://github.com/DanielSmith/alap/tree/main/examples/servers/django-sqlite) server
@@ -0,0 +1,10 @@
1
+ README.md
2
+ pyproject.toml
3
+ alap_python.egg-info/PKG-INFO
4
+ alap_python.egg-info/SOURCES.txt
5
+ alap_python.egg-info/dependency_links.txt
6
+ alap_python.egg-info/top_level.txt
7
+ tests/test_expression_parser.py
8
+ tests/test_ssrf_guard.py
9
+ tests/test_validate_config.py
10
+ tests/test_validate_regex.py
@@ -0,0 +1,37 @@
1
+ [build-system]
2
+ requires = ["setuptools>=68.0"]
3
+ build-backend = "setuptools.build_meta"
4
+
5
+ [project]
6
+ name = "alap-python"
7
+ version = "0.1.0"
8
+ description = "Alap expression parser for Python — resolve link expressions server-side"
9
+ readme = "README.md"
10
+ license = "Apache-2.0"
11
+ requires-python = ">=3.10"
12
+ authors = [{ name = "Daniel Smith" }]
13
+ keywords = ["alap", "expression-parser", "links", "menu"]
14
+ classifiers = [
15
+ "Development Status :: 4 - Beta",
16
+ "Intended Audience :: Developers",
17
+ "Programming Language :: Python :: 3",
18
+ "Programming Language :: Python :: 3.10",
19
+ "Programming Language :: Python :: 3.11",
20
+ "Programming Language :: Python :: 3.12",
21
+ "Programming Language :: Python :: 3.13",
22
+ "Topic :: Software Development :: Libraries",
23
+ ]
24
+
25
+ [project.urls]
26
+ Homepage = "https://alap.info"
27
+ Repository = "https://github.com/DanielSmith/alap-python"
28
+
29
+ [tool.setuptools.packages.find]
30
+ where = ["."]
31
+ include = ["alap*"]
32
+
33
+ [tool.setuptools.package-dir]
34
+ alap = "."
35
+
36
+ [tool.pytest.ini_options]
37
+ testpaths = ["tests"]
@@ -0,0 +1,4 @@
1
+ [egg_info]
2
+ tag_build =
3
+ tag_date = 0
4
+
@@ -0,0 +1,631 @@
1
+ # Copyright 2026 Daniel Smith
2
+ # Licensed under the Apache License, Version 2.0
3
+ # See https://www.apache.org/licenses/LICENSE-2.0
4
+
5
+ """Tests for the Python expression parser — mirrors the TS test tiers."""
6
+
7
+ import sys
8
+ from pathlib import Path
9
+
10
+ import pytest
11
+
12
+ # Add parent to path so we can import the parser
13
+ sys.path.insert(0, str(Path(__file__).resolve().parent.parent))
14
+
15
+ from expression_parser import (
16
+ ExpressionParser,
17
+ cherry_pick_links,
18
+ merge_configs,
19
+ resolve_expression,
20
+ )
21
+ from sanitize_url import sanitize_url
22
+
23
+ # ---------------------------------------------------------------------------
24
+ # Test config — mirrors tests/fixtures/links.ts
25
+ # ---------------------------------------------------------------------------
26
+
27
+ TEST_CONFIG = {
28
+ "settings": {"listType": "ul", "menuTimeout": 5000},
29
+ "macros": {
30
+ "cars": {"linkItems": "vwbug, bmwe36"},
31
+ "nycbridges": {"linkItems": ".nyc + .bridge"},
32
+ "everything": {"linkItems": ".nyc | .sf"},
33
+ },
34
+ "searchPatterns": {
35
+ "bridges": "bridge",
36
+ "germanCars": {
37
+ "pattern": "VW|BMW",
38
+ "options": {"fields": "l", "limit": 5},
39
+ },
40
+ },
41
+ "allLinks": {
42
+ "vwbug": {
43
+ "label": "VW Bug",
44
+ "url": "https://example.com/vwbug",
45
+ "tags": ["car", "vw", "germany"],
46
+ },
47
+ "bmwe36": {
48
+ "label": "BMW E36",
49
+ "url": "https://example.com/bmwe36",
50
+ "tags": ["car", "bmw", "germany"],
51
+ },
52
+ "miata": {
53
+ "label": "Mazda Miata",
54
+ "url": "https://example.com/miata",
55
+ "tags": ["car", "mazda", "japan"],
56
+ },
57
+ "brooklyn": {
58
+ "label": "Brooklyn Bridge",
59
+ "url": "https://example.com/brooklyn",
60
+ "tags": ["nyc", "bridge", "landmark"],
61
+ },
62
+ "manhattan": {
63
+ "label": "Manhattan Bridge",
64
+ "url": "https://example.com/manhattan",
65
+ "tags": ["nyc", "bridge"],
66
+ },
67
+ "highline": {
68
+ "label": "The High Line",
69
+ "url": "https://example.com/highline",
70
+ "tags": ["nyc", "park", "landmark"],
71
+ },
72
+ "centralpark": {
73
+ "label": "Central Park",
74
+ "url": "https://example.com/centralpark",
75
+ "tags": ["nyc", "park"],
76
+ },
77
+ "goldengate": {
78
+ "label": "Golden Gate",
79
+ "url": "https://example.com/goldengate",
80
+ "tags": ["sf", "bridge", "landmark"],
81
+ },
82
+ "dolores": {
83
+ "label": "Dolores Park",
84
+ "url": "https://example.com/dolores",
85
+ "tags": ["sf", "park"],
86
+ },
87
+ "towerbridge": {
88
+ "label": "Tower Bridge",
89
+ "url": "https://example.com/towerbridge",
90
+ "tags": ["london", "bridge", "landmark"],
91
+ },
92
+ "aqus": {
93
+ "label": "Aqus Cafe",
94
+ "url": "https://example.com/aqus",
95
+ "tags": ["coffee", "sf"],
96
+ },
97
+ "bluebottle": {
98
+ "label": "Blue Bottle",
99
+ "url": "https://example.com/bluebottle",
100
+ "tags": ["coffee", "sf", "nyc"],
101
+ },
102
+ "acre": {
103
+ "label": "Acre Coffee",
104
+ "url": "https://example.com/acre",
105
+ "tags": ["coffee"],
106
+ },
107
+ },
108
+ }
109
+
110
+
111
+ @pytest.fixture
112
+ def parser():
113
+ return ExpressionParser(TEST_CONFIG)
114
+
115
+
116
+ # ---------------------------------------------------------------------------
117
+ # Tier 1 — Operands
118
+ # ---------------------------------------------------------------------------
119
+
120
+
121
+ class TestOperands:
122
+ def test_single_item_id(self, parser):
123
+ assert parser.query("vwbug") == ["vwbug"]
124
+
125
+ def test_single_class(self, parser):
126
+ result = parser.query(".car")
127
+ assert sorted(result) == ["bmwe36", "miata", "vwbug"]
128
+
129
+ def test_nonexistent_item(self, parser):
130
+ assert parser.query("doesnotexist") == []
131
+
132
+ def test_nonexistent_class(self, parser):
133
+ assert parser.query(".doesnotexist") == []
134
+
135
+
136
+ # ---------------------------------------------------------------------------
137
+ # Tier 2 — Commas
138
+ # ---------------------------------------------------------------------------
139
+
140
+
141
+ class TestCommas:
142
+ def test_two_items(self, parser):
143
+ assert parser.query("vwbug, bmwe36") == ["vwbug", "bmwe36"]
144
+
145
+ def test_three_items(self, parser):
146
+ assert parser.query("vwbug, bmwe36, miata") == ["vwbug", "bmwe36", "miata"]
147
+
148
+ def test_item_and_class(self, parser):
149
+ result = parser.query("vwbug, .sf")
150
+ assert result[0] == "vwbug"
151
+ assert "goldengate" in result
152
+ assert "dolores" in result
153
+
154
+ def test_deduplication(self, parser):
155
+ result = parser.query("vwbug, vwbug")
156
+ assert result == ["vwbug"]
157
+
158
+
159
+ # ---------------------------------------------------------------------------
160
+ # Tier 3 — Operators
161
+ # ---------------------------------------------------------------------------
162
+
163
+
164
+ class TestOperators:
165
+ def test_intersection(self, parser):
166
+ result = parser.query(".nyc + .bridge")
167
+ assert sorted(result) == ["brooklyn", "manhattan"]
168
+
169
+ def test_union(self, parser):
170
+ result = parser.query(".nyc | .sf")
171
+ assert "brooklyn" in result
172
+ assert "goldengate" in result
173
+
174
+ def test_subtraction(self, parser):
175
+ result = parser.query(".nyc - .bridge")
176
+ assert "brooklyn" not in result
177
+ assert "manhattan" not in result
178
+ assert "highline" in result
179
+ assert "centralpark" in result
180
+
181
+
182
+ # ---------------------------------------------------------------------------
183
+ # Tier 4 — Chained operators
184
+ # ---------------------------------------------------------------------------
185
+
186
+
187
+ class TestChained:
188
+ def test_three_way_intersection(self, parser):
189
+ result = parser.query(".nyc + .bridge + .landmark")
190
+ assert result == ["brooklyn"]
191
+
192
+ def test_union_then_subtract(self, parser):
193
+ result = parser.query(".nyc | .sf - .bridge")
194
+ # Left-to-right: (.nyc | .sf) - .bridge
195
+ assert "brooklyn" not in result
196
+ assert "manhattan" not in result
197
+ assert "goldengate" not in result
198
+ assert "highline" in result
199
+
200
+
201
+ # ---------------------------------------------------------------------------
202
+ # Tier 5 — Mixed
203
+ # ---------------------------------------------------------------------------
204
+
205
+
206
+ class TestMixed:
207
+ def test_item_and_class_intersection(self, parser):
208
+ result = parser.query("brooklyn + .landmark")
209
+ assert result == ["brooklyn"]
210
+
211
+ def test_class_union_with_item(self, parser):
212
+ result = parser.query(".car | goldengate")
213
+ assert "vwbug" in result
214
+ assert "goldengate" in result
215
+
216
+
217
+ # ---------------------------------------------------------------------------
218
+ # Tier 6 — Macros
219
+ # ---------------------------------------------------------------------------
220
+
221
+
222
+ class TestMacros:
223
+ def test_named_macro(self, parser):
224
+ result = parser.query("@cars")
225
+ assert sorted(result) == ["bmwe36", "vwbug"]
226
+
227
+ def test_macro_with_operators(self, parser):
228
+ result = parser.query("@nycbridges")
229
+ assert sorted(result) == ["brooklyn", "manhattan"]
230
+
231
+ def test_unknown_macro(self, parser):
232
+ result = parser.query("@nonexistent")
233
+ assert result == []
234
+
235
+ def test_bare_macro_with_anchor(self, parser):
236
+ # Bare @ uses anchorId
237
+ config_with_macro = {
238
+ **TEST_CONFIG,
239
+ "macros": {**TEST_CONFIG["macros"], "myanchor": {"linkItems": "vwbug"}},
240
+ }
241
+ p = ExpressionParser(config_with_macro)
242
+ result = p.query("@", "myanchor")
243
+ assert result == ["vwbug"]
244
+
245
+
246
+ # ---------------------------------------------------------------------------
247
+ # Tier 7 — Parentheses
248
+ # ---------------------------------------------------------------------------
249
+
250
+
251
+ class TestParentheses:
252
+ def test_basic_grouping(self, parser):
253
+ # Without parens: .nyc | .sf + .bridge => (.nyc | .sf) + .bridge (left-to-right)
254
+ # With parens: .nyc | (.sf + .bridge) => .nyc union (sf bridges)
255
+ without = parser.query(".nyc | .sf + .bridge")
256
+ with_parens = parser.query(".nyc | (.sf + .bridge)")
257
+ # with_parens should include all NYC items + goldengate
258
+ assert "highline" in with_parens
259
+ assert "centralpark" in with_parens
260
+ assert "goldengate" in with_parens
261
+
262
+ def test_nested_parens(self, parser):
263
+ result = parser.query("((.nyc + .bridge) | (.sf + .bridge))")
264
+ assert sorted(result) == ["brooklyn", "goldengate", "manhattan"]
265
+
266
+ def test_parens_with_subtraction(self, parser):
267
+ result = parser.query("(.nyc | .sf) - .park")
268
+ assert "centralpark" not in result
269
+ assert "dolores" not in result
270
+ assert "brooklyn" in result
271
+
272
+
273
+ # ---------------------------------------------------------------------------
274
+ # Tier 8 — Edge cases
275
+ # ---------------------------------------------------------------------------
276
+
277
+
278
+ class TestEdgeCases:
279
+ def test_empty_string(self, parser):
280
+ assert parser.query("") == []
281
+
282
+ def test_whitespace_only(self, parser):
283
+ assert parser.query(" ") == []
284
+
285
+ def test_none_expression(self, parser):
286
+ assert parser.query(None) == []
287
+
288
+ def test_empty_config(self):
289
+ p = ExpressionParser({"allLinks": {}})
290
+ assert p.query(".car") == []
291
+
292
+ def test_no_alllinks(self):
293
+ p = ExpressionParser({})
294
+ assert p.query("vwbug") == []
295
+
296
+
297
+ # ---------------------------------------------------------------------------
298
+ # Convenience functions
299
+ # ---------------------------------------------------------------------------
300
+
301
+
302
+ class TestConvenience:
303
+ def test_resolve_expression(self):
304
+ results = resolve_expression(TEST_CONFIG, ".car + .germany")
305
+ ids = [r["id"] for r in results]
306
+ assert sorted(ids) == ["bmwe36", "vwbug"]
307
+ # Each result should have id, label, url, tags
308
+ for r in results:
309
+ assert "id" in r
310
+ assert "label" in r
311
+ assert "url" in r
312
+
313
+ def test_cherry_pick_links(self):
314
+ result = cherry_pick_links(TEST_CONFIG, "vwbug, miata")
315
+ assert "vwbug" in result
316
+ assert "miata" in result
317
+ assert "bmwe36" not in result
318
+
319
+ def test_merge_configs(self):
320
+ config1 = {
321
+ "allLinks": {"a": {"label": "A", "url": "https://a.com"}},
322
+ "macros": {"m1": {"linkItems": "a"}},
323
+ }
324
+ config2 = {
325
+ "allLinks": {"b": {"label": "B", "url": "https://b.com"}},
326
+ "macros": {"m2": {"linkItems": "b"}},
327
+ }
328
+ merged = merge_configs(config1, config2)
329
+ assert "a" in merged["allLinks"]
330
+ assert "b" in merged["allLinks"]
331
+ assert "m1" in merged["macros"]
332
+ assert "m2" in merged["macros"]
333
+
334
+ def test_merge_configs_later_wins(self):
335
+ config1 = {"allLinks": {"a": {"label": "Old", "url": "https://old.com"}}}
336
+ config2 = {"allLinks": {"a": {"label": "New", "url": "https://new.com"}}}
337
+ merged = merge_configs(config1, config2)
338
+ assert merged["allLinks"]["a"]["label"] == "New"
339
+
340
+
341
+ # ---------------------------------------------------------------------------
342
+ # URL sanitization
343
+ # ---------------------------------------------------------------------------
344
+
345
+
346
+ class TestSanitizeUrl:
347
+ def test_safe_urls(self):
348
+ assert sanitize_url("https://example.com") == "https://example.com"
349
+ assert sanitize_url("http://example.com") == "http://example.com"
350
+ assert sanitize_url("mailto:user@example.com") == "mailto:user@example.com"
351
+ assert sanitize_url("/relative/path") == "/relative/path"
352
+ assert sanitize_url("") == ""
353
+
354
+ def test_javascript_blocked(self):
355
+ assert sanitize_url("javascript:alert(1)") == "about:blank"
356
+ assert sanitize_url("JAVASCRIPT:alert(1)") == "about:blank"
357
+ assert sanitize_url("JavaScript:void(0)") == "about:blank"
358
+
359
+ def test_data_blocked(self):
360
+ assert sanitize_url("data:text/html,<h1>Hi</h1>") == "about:blank"
361
+
362
+ def test_vbscript_blocked(self):
363
+ assert sanitize_url("vbscript:MsgBox") == "about:blank"
364
+
365
+ def test_blob_blocked(self):
366
+ assert sanitize_url("blob:https://example.com/uuid") == "about:blank"
367
+
368
+ def test_control_chars_stripped(self):
369
+ assert sanitize_url("java\nscript:alert(1)") == "about:blank"
370
+ assert sanitize_url("java\tscript:alert(1)") == "about:blank"
371
+
372
+ def test_sanitize_in_resolve(self):
373
+ """Ensure resolve_expression sanitizes URLs."""
374
+ config = {
375
+ "allLinks": {
376
+ "bad": {
377
+ "label": "Evil",
378
+ "url": "javascript:alert(1)",
379
+ "tags": ["test"],
380
+ },
381
+ "good": {
382
+ "label": "Good",
383
+ "url": "https://example.com",
384
+ "tags": ["test"],
385
+ },
386
+ }
387
+ }
388
+ results = resolve_expression(config, ".test")
389
+ urls = {r["id"]: r["url"] for r in results}
390
+ assert urls["bad"] == "about:blank"
391
+ assert urls["good"] == "https://example.com"
392
+
393
+ def test_sanitize_in_cherry_pick(self):
394
+ """Ensure cherry_pick_links sanitizes URLs."""
395
+ config = {
396
+ "allLinks": {
397
+ "bad": {
398
+ "label": "Evil",
399
+ "url": "javascript:alert(1)",
400
+ "tags": ["test"],
401
+ },
402
+ }
403
+ }
404
+ result = cherry_pick_links(config, ".test")
405
+ assert result["bad"]["url"] == "about:blank"
406
+
407
+
408
+ # ---------------------------------------------------------------------------
409
+ # Protocol config for tests
410
+ # ---------------------------------------------------------------------------
411
+
412
+ def _tag_protocol(segments, link, item_id):
413
+ """Test protocol handler: checks if the link has a given tag."""
414
+ if not segments:
415
+ return False
416
+ tag = segments[0]
417
+ return tag in (link.get("tags") or [])
418
+
419
+
420
+ def _throwing_protocol(segments, link, item_id):
421
+ """Protocol handler that always throws."""
422
+ raise ValueError("boom")
423
+
424
+
425
+ PROTOCOL_CONFIG = {
426
+ **TEST_CONFIG,
427
+ "protocols": {
428
+ "hastag": {"handler": _tag_protocol},
429
+ "broken": {"handler": _throwing_protocol},
430
+ },
431
+ }
432
+
433
+
434
+ # ---------------------------------------------------------------------------
435
+ # Tier 9 — Protocols
436
+ # ---------------------------------------------------------------------------
437
+
438
+
439
+ class TestProtocols:
440
+ def test_protocol_tokenization(self):
441
+ """Protocol :name:arg: produces a PROTOCOL token."""
442
+ from expression_parser import ExpressionParser
443
+ tokens = ExpressionParser._tokenize(":time:7d:")
444
+ assert len(tokens) == 1
445
+ assert tokens[0].type == "PROTOCOL"
446
+ assert tokens[0].value == "time|7d"
447
+
448
+ def test_protocol_multi_arg_tokenization(self):
449
+ """Protocol :name:a:b: joins segments with |."""
450
+ from expression_parser import ExpressionParser
451
+ tokens = ExpressionParser._tokenize(":time:7d:newest:")
452
+ assert len(tokens) == 1
453
+ assert tokens[0].type == "PROTOCOL"
454
+ assert tokens[0].value == "time|7d|newest"
455
+
456
+ def test_protocol_resolution(self):
457
+ """Protocol resolves via handler predicate."""
458
+ parser = ExpressionParser(PROTOCOL_CONFIG)
459
+ result = parser.query(":hastag:coffee:")
460
+ assert sorted(result) == ["acre", "aqus", "bluebottle"]
461
+
462
+ def test_unknown_protocol(self):
463
+ """Unknown protocol warns and returns empty."""
464
+ parser = ExpressionParser(PROTOCOL_CONFIG)
465
+ with pytest.warns(UserWarning, match="Unknown protocol"):
466
+ result = parser.query(":nonexistent:arg:")
467
+ assert result == []
468
+
469
+ def test_protocol_handler_throws(self):
470
+ """Handler that throws skips that item with a warning."""
471
+ parser = ExpressionParser(PROTOCOL_CONFIG)
472
+ with pytest.warns(UserWarning, match="handler threw"):
473
+ result = parser.query(":broken:arg:")
474
+ assert result == []
475
+
476
+ def test_protocol_with_tag_intersection(self):
477
+ """Protocol composed with tag operator."""
478
+ parser = ExpressionParser(PROTOCOL_CONFIG)
479
+ result = parser.query(":hastag:coffee: + .sf")
480
+ assert sorted(result) == ["aqus", "bluebottle"]
481
+
482
+ def test_protocol_with_tag_union(self):
483
+ """Protocol composed with union."""
484
+ parser = ExpressionParser(PROTOCOL_CONFIG)
485
+ result = parser.query(":hastag:coffee: | .bridge")
486
+ # coffee items + bridge items
487
+ assert "acre" in result
488
+ assert "brooklyn" in result
489
+ assert "goldengate" in result
490
+
491
+ def test_protocol_no_config(self):
492
+ """Protocol with no protocols in config returns empty."""
493
+ parser = ExpressionParser(TEST_CONFIG)
494
+ with pytest.warns(UserWarning, match="Unknown protocol"):
495
+ result = parser.query(":hastag:coffee:")
496
+ assert result == []
497
+
498
+
499
+ # ---------------------------------------------------------------------------
500
+ # Tier 10 — Refiners
501
+ # ---------------------------------------------------------------------------
502
+
503
+
504
+ class TestRefiners:
505
+ def test_refiner_tokenization(self):
506
+ """Refiner *name* produces a REFINER token."""
507
+ from expression_parser import ExpressionParser
508
+ tokens = ExpressionParser._tokenize("*sort*")
509
+ assert len(tokens) == 1
510
+ assert tokens[0].type == "REFINER"
511
+ assert tokens[0].value == "sort"
512
+
513
+ def test_refiner_with_arg_tokenization(self):
514
+ """Refiner *name:arg* preserves arg."""
515
+ from expression_parser import ExpressionParser
516
+ tokens = ExpressionParser._tokenize("*sort:label*")
517
+ assert len(tokens) == 1
518
+ assert tokens[0].type == "REFINER"
519
+ assert tokens[0].value == "sort:label"
520
+
521
+ def test_sort_refiner_default(self):
522
+ """*sort* sorts by label (default)."""
523
+ parser = ExpressionParser(TEST_CONFIG)
524
+ result = parser.query(".car *sort*")
525
+ labels = [TEST_CONFIG["allLinks"][r]["label"] for r in result]
526
+ assert labels == sorted(labels, key=str.lower)
527
+
528
+ def test_sort_refiner_by_url(self):
529
+ """*sort:url* sorts by url field."""
530
+ parser = ExpressionParser(TEST_CONFIG)
531
+ result = parser.query(".car *sort:url*")
532
+ urls = [TEST_CONFIG["allLinks"][r]["url"] for r in result]
533
+ assert urls == sorted(urls, key=str.lower)
534
+
535
+ def test_reverse_refiner(self):
536
+ """*reverse* reverses the order."""
537
+ parser = ExpressionParser(TEST_CONFIG)
538
+ normal = parser.query(".car *sort*")
539
+ reversed_result = parser.query(".car *sort* *reverse*")
540
+ assert reversed_result == list(reversed(normal))
541
+
542
+ def test_limit_refiner(self):
543
+ """*limit:N* takes first N items."""
544
+ parser = ExpressionParser(TEST_CONFIG)
545
+ result = parser.query(".car *sort* *limit:2*")
546
+ assert len(result) == 2
547
+
548
+ def test_limit_zero(self):
549
+ """*limit:0* returns empty."""
550
+ parser = ExpressionParser(TEST_CONFIG)
551
+ result = parser.query(".car *limit:0*")
552
+ assert result == []
553
+
554
+ def test_skip_refiner(self):
555
+ """*skip:N* skips first N items."""
556
+ parser = ExpressionParser(TEST_CONFIG)
557
+ full = parser.query(".car *sort*")
558
+ skipped = parser.query(".car *sort* *skip:1*")
559
+ assert skipped == full[1:]
560
+
561
+ def test_shuffle_refiner(self):
562
+ """*shuffle* randomizes (just check it returns the same items)."""
563
+ parser = ExpressionParser(TEST_CONFIG)
564
+ result = parser.query(".car *shuffle*")
565
+ assert sorted(result) == sorted(["vwbug", "bmwe36", "miata"])
566
+
567
+ def test_unique_refiner(self):
568
+ """*unique:field* deduplicates by field."""
569
+ config = {
570
+ "allLinks": {
571
+ "a": {"label": "A", "url": "https://same.com", "tags": ["t"]},
572
+ "b": {"label": "B", "url": "https://same.com", "tags": ["t"]},
573
+ "c": {"label": "C", "url": "https://other.com", "tags": ["t"]},
574
+ }
575
+ }
576
+ parser = ExpressionParser(config)
577
+ result = parser.query(".t *unique:url*")
578
+ urls = [config["allLinks"][r]["url"] for r in result]
579
+ assert len(urls) == len(set(urls))
580
+ assert len(result) == 2
581
+
582
+ def test_unknown_refiner(self):
583
+ """Unknown refiner warns and skips."""
584
+ parser = ExpressionParser(TEST_CONFIG)
585
+ with pytest.warns(UserWarning, match="Unknown refiner"):
586
+ result = parser.query(".car *bogus*")
587
+ # Should still return the .car items, just unrefined
588
+ assert sorted(result) == ["bmwe36", "miata", "vwbug"]
589
+
590
+ def test_refiner_in_parenthesized_group(self):
591
+ """Refiners work inside parenthesized groups."""
592
+ parser = ExpressionParser(TEST_CONFIG)
593
+ result = parser.query("(.car *sort* *limit:1*), goldengate")
594
+ # First segment: sorted cars limited to 1, second segment: goldengate
595
+ assert len(result) == 2
596
+ assert "goldengate" in result
597
+
598
+ def test_refiner_chained_sort_limit(self):
599
+ """Sort then limit produces sorted subset."""
600
+ parser = ExpressionParser(TEST_CONFIG)
601
+ sorted_all = parser.query(".car *sort*")
602
+ sorted_limited = parser.query(".car *sort* *limit:2*")
603
+ assert sorted_limited == sorted_all[:2]
604
+
605
+
606
+ # ---------------------------------------------------------------------------
607
+ # Tier 11 — Hyphenated identifiers
608
+ # ---------------------------------------------------------------------------
609
+
610
+
611
+ class TestHyphenatedIdentifiers:
612
+ def test_hyphen_parsed_as_without(self):
613
+ """Hyphenated identifiers are parsed as id MINUS id."""
614
+ config = {
615
+ "allLinks": {
616
+ "my": {"label": "My", "url": "https://my.com", "tags": []},
617
+ "item": {"label": "Item", "url": "https://item.com", "tags": []},
618
+ }
619
+ }
620
+ parser = ExpressionParser(config)
621
+ # "my-item" should be parsed as "my" MINUS "item", not as a single ID
622
+ result = parser.query("my-item")
623
+ # "my" minus "item" = ["my"] (since "item" is removed)
624
+ assert result == ["my"]
625
+
626
+ def test_hyphen_in_class_context(self):
627
+ """Hyphen between bare words acts as subtraction."""
628
+ parser = ExpressionParser(TEST_CONFIG)
629
+ # vwbug-miata should be vwbug WITHOUT miata
630
+ result = parser.query("vwbug - miata")
631
+ assert result == ["vwbug"]
@@ -0,0 +1,95 @@
1
+ import sys, os
2
+ sys.path.insert(0, os.path.join(os.path.dirname(__file__), '..'))
3
+ from ssrf_guard import is_private_host
4
+
5
+
6
# ---------------------------------------------------------------------------
# Public hosts (should return False)
# ---------------------------------------------------------------------------

def test_public_ip():
    """A well-known public IPv4 address is not treated as private."""
    assert is_private_host("https://8.8.8.8/path") is False


def test_public_domain():
    """An ordinary public domain name is not treated as private."""
    assert is_private_host("https://example.com") is False
16
+
17
+
18
# ---------------------------------------------------------------------------
# Localhost variants (should return True)
# ---------------------------------------------------------------------------

def test_localhost():
    """Plain localhost is private."""
    assert is_private_host("http://localhost/admin") is True


def test_localhost_with_port():
    """A port suffix does not hide localhost."""
    assert is_private_host("http://localhost:8080") is True


def test_subdomain_of_localhost():
    """Anything under the .localhost TLD is private too."""
    assert is_private_host("http://foo.localhost/bar") is True
32
+
33
+
34
# ---------------------------------------------------------------------------
# Private IPv4 ranges (should return True)
# ---------------------------------------------------------------------------

def test_loopback_127():
    """127.0.0.0/8 loopback is private."""
    assert is_private_host("http://127.0.0.1") is True


def test_private_10():
    """10.0.0.0/8 (RFC 1918) is private."""
    assert is_private_host("http://10.0.0.1") is True


def test_private_172_16():
    """172.16.0.0/12 (RFC 1918) is private."""
    assert is_private_host("http://172.16.0.1") is True


def test_private_192_168():
    """192.168.0.0/16 (RFC 1918) is private."""
    assert is_private_host("http://192.168.1.1") is True


def test_link_local_169_254():
    """169.254.0.0/16 link-local (cloud metadata endpoint) is private."""
    assert is_private_host("http://169.254.169.254/latest/meta-data") is True
56
+
57
+
58
# ---------------------------------------------------------------------------
# IPv6 (should return True)
# ---------------------------------------------------------------------------

def test_ipv6_loopback():
    """The IPv6 loopback address [::1] is private."""
    assert is_private_host("http://[::1]/admin") is True
64
+
65
+
66
# ---------------------------------------------------------------------------
# Malformed URL (fail closed → True)
# ---------------------------------------------------------------------------

def test_malformed_url():
    """Unparseable input fails closed: treated as private."""
    assert is_private_host("not a url at all") is True
72
+
73
+
74
# ---------------------------------------------------------------------------
# IPv4-mapped IPv6 addresses (should return True)
# ---------------------------------------------------------------------------

def test_ipv4_mapped_ipv6_loopback():
    """::ffff:127.0.0.1 must not bypass the loopback check."""
    assert is_private_host("http://[::ffff:127.0.0.1]") is True


def test_ipv4_mapped_ipv6_private_10():
    """::ffff:10.x.x.x must not bypass the RFC 1918 check."""
    assert is_private_host("http://[::ffff:10.0.0.1]") is True
84
+
85
+
86
# ---------------------------------------------------------------------------
# 0.0.0.0 bypass
# ---------------------------------------------------------------------------

def test_zero_address():
    """0.0.0.0 (routes to local interfaces on many stacks) is private."""
    assert is_private_host("http://0.0.0.0/") is True


def test_zero_address_with_port():
    """A port suffix does not hide 0.0.0.0."""
    assert is_private_host("http://0.0.0.0:8080/") is True
@@ -0,0 +1,242 @@
1
+ import sys, os
2
+ sys.path.insert(0, os.path.join(os.path.dirname(__file__), '..'))
3
+ from expression_parser import validate_config
4
+ import pytest
5
+ import copy
6
+
7
+
8
+ # ---------------------------------------------------------------------------
9
+ # Helpers
10
+ # ---------------------------------------------------------------------------
11
+
12
+ def _minimal_config():
13
+ return {
14
+ "allLinks": {
15
+ "alpha": {"url": "https://example.com/alpha", "label": "Alpha"},
16
+ }
17
+ }
18
+
19
+
20
# ---------------------------------------------------------------------------
# Structural validation
# ---------------------------------------------------------------------------

def test_minimal_valid_config_passes():
    """A minimal well-formed config validates and keeps its link."""
    validated = validate_config(_minimal_config())
    assert "allLinks" in validated
    assert "alpha" in validated["allLinks"]


def test_preserves_settings():
    """The settings section passes through validation untouched."""
    cfg = _minimal_config()
    cfg["settings"] = {"listType": "ul", "menuTimeout": 5000}
    settings = validate_config(cfg)["settings"]
    assert settings["listType"] == "ul"
    assert settings["menuTimeout"] == 5000


def test_preserves_macros():
    """Well-formed macros survive validation."""
    cfg = _minimal_config()
    cfg["macros"] = {"fav": {"linkItems": "alpha"}}
    validated = validate_config(cfg)
    assert validated["macros"]["fav"]["linkItems"] == "alpha"


def test_preserves_search_patterns():
    """Safe search patterns survive validation."""
    cfg = _minimal_config()
    cfg["searchPatterns"] = {"bridge": "bridge"}
    validated = validate_config(cfg)
    assert validated["searchPatterns"]["bridge"] == "bridge"
50
+
51
+
52
# ---------------------------------------------------------------------------
# Non-dict inputs
# ---------------------------------------------------------------------------

def test_raises_on_none():
    """None is rejected with a descriptive ValueError."""
    with pytest.raises(ValueError, match="expected a dict"):
        validate_config(None)


def test_raises_on_string():
    """A bare string is rejected with a descriptive ValueError."""
    with pytest.raises(ValueError, match="expected a dict"):
        validate_config("string")


def test_raises_on_list():
    """A list is rejected with a descriptive ValueError."""
    with pytest.raises(ValueError, match="expected a dict"):
        validate_config([])
69
+
70
+
71
# ---------------------------------------------------------------------------
# allLinks validation
# ---------------------------------------------------------------------------

def test_raises_when_alllinks_missing():
    """A config without an allLinks section is rejected."""
    with pytest.raises(ValueError, match="allLinks"):
        validate_config({"settings": {}})


def test_raises_when_alllinks_is_list():
    """allLinks must be a mapping, not a list."""
    with pytest.raises(ValueError, match="allLinks"):
        validate_config({"allLinks": []})


def test_skips_links_with_missing_url():
    """A link entry with no url is silently dropped."""
    validated = validate_config({"allLinks": {"nourl": {"label": "No URL"}}})
    assert "nourl" not in validated["allLinks"]


def test_skips_non_dict_links():
    """Non-dict link entries are dropped while valid siblings survive."""
    cfg = {"allLinks": {"bad": "not a dict", "good": {"url": "https://ok.com"}}}
    links = validate_config(cfg)["allLinks"]
    assert "bad" not in links
    assert "good" in links
96
+
97
+
98
# ---------------------------------------------------------------------------
# URL sanitization
# ---------------------------------------------------------------------------

def test_sanitizes_javascript_url_to_about_blank():
    """A javascript: url is neutralized to about:blank."""
    cfg = {"allLinks": {"xss": {"url": "javascript:alert(1)", "label": "XSS"}}}
    validated = validate_config(cfg)
    assert validated["allLinks"]["xss"]["url"] == "about:blank"


def test_sanitizes_javascript_in_image_field():
    """The image field gets the same javascript: neutralization as url."""
    cfg = {"allLinks": {
        "img": {"url": "https://safe.com", "image": "javascript:alert(1)"},
    }}
    validated = validate_config(cfg)
    assert validated["allLinks"]["img"]["image"] == "about:blank"


def test_leaves_safe_https_url_unchanged():
    """An ordinary https url passes through unmodified."""
    validated = validate_config({"allLinks": {"safe": {"url": "https://example.com"}}})
    assert validated["allLinks"]["safe"]["url"] == "https://example.com"
120
+
121
+
122
# ---------------------------------------------------------------------------
# Tag validation
# ---------------------------------------------------------------------------

def test_filters_non_string_tags():
    """Only string entries survive in a tags list."""
    cfg = {"allLinks": {"a": {"url": "https://x.com", "tags": ["ok", 42, None]}}}
    validated = validate_config(cfg)
    assert validated["allLinks"]["a"]["tags"] == ["ok"]


def test_ignores_non_list_tags():
    """A non-list tags value is dropped entirely, not coerced."""
    cfg = {"allLinks": {"a": {"url": "https://x.com", "tags": "not-a-list"}}}
    validated = validate_config(cfg)
    # tags key should not be present when the source wasn't a list
    assert "tags" not in validated["allLinks"]["a"]
137
+
138
+
139
# ---------------------------------------------------------------------------
# Hyphen rejection
# ---------------------------------------------------------------------------
# Hyphens are the subtraction operator in expressions, so names that feed
# the expression grammar (item ids, macro names, pattern keys, tags) must
# not contain them; display-only fields may.

def test_skips_hyphenated_item_ids():
    """A link id containing a hyphen is dropped."""
    validated = validate_config({"allLinks": {"bad-id": {"url": "https://x.com"}}})
    assert "bad-id" not in validated["allLinks"]


def test_skips_hyphenated_macro_names():
    """A macro name containing a hyphen is dropped."""
    cfg = _minimal_config()
    cfg["macros"] = {"my-macro": {"linkItems": "alpha"}}
    validated = validate_config(cfg)
    assert "macros" not in validated or "my-macro" not in validated.get("macros", {})


def test_skips_hyphenated_search_pattern_keys():
    """A search-pattern key containing a hyphen is dropped."""
    cfg = _minimal_config()
    cfg["searchPatterns"] = {"my-pattern": "bridge"}
    validated = validate_config(cfg)
    assert "searchPatterns" not in validated or "my-pattern" not in validated.get("searchPatterns", {})


def test_strips_hyphenated_tags_but_keeps_link():
    """A hyphenated tag is stripped without losing the whole link."""
    cfg = {"allLinks": {"a": {"url": "https://x.com", "tags": ["good", "bad-tag"]}}}
    validated = validate_config(cfg)
    assert "a" in validated["allLinks"]
    assert validated["allLinks"]["a"]["tags"] == ["good"]


def test_allows_hyphens_in_non_expression_fields():
    """Display and URL fields keep their hyphens untouched."""
    cfg = {"allLinks": {"a": {
        "url": "https://my-site.com",
        "label": "My-Label",
        "cssClass": "my-class",
        "description": "some-thing",
    }}}
    link = validate_config(cfg)["allLinks"]["a"]
    assert link["url"] == "https://my-site.com"
    assert link["label"] == "My-Label"
    assert link["cssClass"] == "my-class"
    assert link["description"] == "some-thing"
183
+
184
+
185
# ---------------------------------------------------------------------------
# Dangerous regex removal
# ---------------------------------------------------------------------------

def test_removes_dangerous_regex_patterns():
    """A ReDoS-prone pattern is stripped from searchPatterns."""
    cfg = _minimal_config()
    cfg["searchPatterns"] = {"evil": "(a+)+"}
    validated = validate_config(cfg)
    assert "searchPatterns" not in validated or "evil" not in validated.get("searchPatterns", {})
194
+
195
+
196
# ---------------------------------------------------------------------------
# Prototype-pollution / dunder blocking
# ---------------------------------------------------------------------------

def test_drops_proto_keys_from_alllinks():
    """__proto__ keys are stripped; legitimate siblings survive."""
    cfg = {"allLinks": {
        "__proto__": {"url": "https://evil.com"},
        "safe": {"url": "https://safe.com"},
    }}
    links = validate_config(cfg)["allLinks"]
    assert "__proto__" not in links
    assert "safe" in links


def test_drops_class_dunder_keys():
    """__class__ keys are stripped; legitimate siblings survive."""
    cfg = {"allLinks": {
        "__class__": {"url": "https://evil.com"},
        "ok": {"url": "https://ok.com"},
    }}
    links = validate_config(cfg)["allLinks"]
    assert "__class__" not in links
    assert "ok" in links


def test_drops_bases_dunder_keys():
    """__bases__ keys are stripped; legitimate siblings survive."""
    cfg = {"allLinks": {
        "__bases__": {"url": "https://evil.com"},
        "ok": {"url": "https://ok.com"},
    }}
    links = validate_config(cfg)["allLinks"]
    assert "__bases__" not in links
    assert "ok" in links
228
+
229
+
230
# ---------------------------------------------------------------------------
# Input immutability
# ---------------------------------------------------------------------------

def test_does_not_mutate_input():
    """Validation sanitizes a copy and never alters the caller's config."""
    cfg = {
        "allLinks": {
            "a": {"url": "javascript:alert(1)", "tags": ["x", 42]},
        }
    }
    snapshot = copy.deepcopy(cfg)
    validate_config(cfg)
    assert cfg == snapshot
@@ -0,0 +1,54 @@
1
+ import sys, os
2
+ sys.path.insert(0, os.path.join(os.path.dirname(__file__), '..'))
3
+ from validate_regex import validate_regex
4
+
5
+
6
# ---------------------------------------------------------------------------
# Valid patterns
# ---------------------------------------------------------------------------

def test_valid_simple_pattern():
    """A plain literal is safe."""
    assert validate_regex("bridge")["safe"] is True


def test_valid_anchored_pattern():
    """Anchors alone do not make a pattern dangerous."""
    assert validate_regex("^foo$")["safe"] is True


def test_valid_character_class():
    """A quantified character class is safe."""
    assert validate_regex("[a-z]+")["safe"] is True


def test_safe_quantified_group():
    """A quantified group with no inner quantifier is safe."""
    assert validate_regex("(abc)+")["safe"] is True


def test_safe_alternation_group():
    """A quantified alternation of plain literals is safe."""
    assert validate_regex("(a|b)*")["safe"] is True
28
+
29
+
30
# ---------------------------------------------------------------------------
# Invalid / dangerous patterns
# ---------------------------------------------------------------------------

def test_invalid_syntax_unclosed_bracket():
    """Syntactically broken patterns are reported unsafe."""
    verdict = validate_regex("[unclosed")
    assert verdict["safe"] is False


def test_nested_quantifier_a_plus_plus():
    """(a+)+ — the classic ReDoS shape — is rejected with a reason."""
    verdict = validate_regex("(a+)+")
    assert verdict["safe"] is False
    assert "Nested quantifier" in verdict["reason"]


def test_nested_quantifier_a_star_star_b():
    """(a*)*b is rejected as a nested quantifier."""
    verdict = validate_regex("(a*)*b")
    assert verdict["safe"] is False
    assert "Nested quantifier" in verdict["reason"]


def test_nested_quantifier_word_plus():
    """A quantified group of quantified \\w runs is rejected."""
    verdict = validate_regex(r"(\w+\w+)+")
    assert verdict["safe"] is False
    assert "Nested quantifier" in verdict["reason"]