pysentry-rs 0.2.2__tar.gz → 0.2.3__tar.gz
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Potentially problematic release.
This version of pysentry-rs might be problematic.
- pysentry_rs-0.2.3/.github/workflows/benchmark.yml +156 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/Cargo.lock +2 -1
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/Cargo.toml +2 -1
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/PKG-INFO +1 -1
- pysentry_rs-0.2.3/benchmarks/.gitignore +2 -0
- pysentry_rs-0.2.3/benchmarks/.python-version +1 -0
- pysentry_rs-0.2.3/benchmarks/README.md +3 -0
- pysentry_rs-0.2.3/benchmarks/main.py +111 -0
- pysentry_rs-0.2.3/benchmarks/pyproject.toml +12 -0
- pysentry_rs-0.2.3/benchmarks/src/benchmark_runner.py +364 -0
- pysentry_rs-0.2.3/benchmarks/src/performance_monitor.py +157 -0
- pysentry_rs-0.2.3/benchmarks/src/report_generator.py +222 -0
- pysentry_rs-0.2.3/benchmarks/src/tool_wrapper.py +347 -0
- pysentry_rs-0.2.3/benchmarks/test_data/large_requirements.txt +55 -0
- pysentry_rs-0.2.3/benchmarks/test_data/small_requirements.txt +10 -0
- pysentry_rs-0.2.3/benchmarks/uv.lock +1099 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/cache/audit.rs +18 -35
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/cache/storage.rs +3 -4
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/dependency/resolvers/mod.rs +3 -21
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/dependency/resolvers/uv.rs +41 -10
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/types.rs +3 -10
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/.github/FUNDING.yml +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/.github/dependabot.yml +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/.github/workflows/ci.yml +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/.github/workflows/release.yml +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/.gitignore +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/.pre-commit-config.yaml +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/LICENSE +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/README.md +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/fixtures/requirements-tests/requirements-dev.txt +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/fixtures/requirements-tests/requirements.txt +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/fixtures/requirements-tests-vulnerable/requirements.txt +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/pyproject.toml +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/python/pysentry/__init__.py +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/cache/mod.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/cli.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/dependency/mod.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/dependency/resolvers/pip_tools.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/dependency/scanner.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/error.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/lib.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/main.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/output/mod.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/output/report.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/output/sarif.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/parsers/lock.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/parsers/mod.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/parsers/poetry_lock.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/parsers/pyproject.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/parsers/requirements.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/providers/mod.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/providers/osv.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/providers/pypa.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/providers/pypi.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/python.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/vulnerability/database.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/vulnerability/matcher.rs +0 -0
- {pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/src/vulnerability/mod.rs +0 -0
pysentry_rs-0.2.3/.github/workflows/benchmark.yml
@@ -0,0 +1,156 @@
+name: Benchmark Release
+
+on:
+  push:
+    tags:
+      - "v*"
+  workflow_dispatch:
+    inputs:
+      version:
+        description: "Version to benchmark (e.g., v0.2.3)"
+        required: true
+        default: "v0.2.3"
+
+env:
+  CARGO_TERM_COLOR: always
+  RUST_BACKTRACE: 1
+
+jobs:
+  benchmark:
+    name: Run Benchmarks
+    runs-on: ubuntu-latest
+    permissions:
+      contents: write
+      pull-requests: write
+
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+        with:
+          fetch-depth: 0
+
+      - name: Extract version from tag
+        id: version
+        run: |
+          if [ "${{ github.event_name }}" = "workflow_dispatch" ]; then
+            VERSION="${{ github.event.inputs.version }}"
+          else
+            VERSION="${{ github.ref_name }}"
+          fi
+          VERSION_CLEAN=${VERSION#v}
+          echo "version=${VERSION_CLEAN}" >> $GITHUB_OUTPUT
+          echo "version_with_v=${VERSION}" >> $GITHUB_OUTPUT
+          echo "branch_name=benchmark-${VERSION_CLEAN}" >> $GITHUB_OUTPUT
+
+      - name: Install system dependencies
+        run: |
+          sudo apt-get update
+          sudo apt-get install -y libssl-dev pkg-config
+
+      - name: Install Rust
+        uses: dtolnay/rust-toolchain@stable
+
+      - name: Cache cargo
+        uses: actions/cache@v4
+        with:
+          path: |
+            ~/.cargo/registry/index/
+            ~/.cargo/registry/cache/
+            ~/.cargo/git/db/
+            target
+          key: ${{ runner.os }}-cargo-benchmark-${{ hashFiles('**/Cargo.lock') }}
+          restore-keys: |
+            ${{ runner.os }}-cargo-benchmark-
+            ${{ runner.os }}-cargo-build-
+
+      - name: Build PySentry
+        run: cargo build --release
+
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: "3.11"
+
+      - name: Install uv
+        run: |
+          curl -LsSf https://astral.sh/uv/install.sh | sh
+          echo "$HOME/.local/bin" >> $GITHUB_PATH
+
+      - name: Install pip-audit for benchmark comparison
+        run: pip install pip-audit
+
+      - name: Install benchmark dependencies
+        run: |
+          cd benchmarks
+          uv sync
+
+      - name: Run benchmark suite
+        run: |
+          cd benchmarks
+          uv run python main.py --skip-build
+
+          ls -la results/
+
+          LATEST_FILE=$(ls results/*.md 2>/dev/null | sort -r | head -n 1)
+          if [ -f "$LATEST_FILE" ]; then
+            cp "$LATEST_FILE" results/latest.md
+            echo "Created latest.md from: $LATEST_FILE"
+          else
+            echo "Warning: No benchmark files found to create latest.md"
+          fi
+
+      - name: Configure Git
+        run: |
+          git config --global user.name "github-actions[bot]"
+          git config --global user.email "github-actions[bot]@users.noreply.github.com"
+
+      - name: Create and switch to benchmark branch
+        run: |
+          BRANCH_NAME="${{ steps.version.outputs.branch_name }}"
+          git checkout -b $BRANCH_NAME
+
+      - name: Commit benchmark results
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+
+          git add benchmarks/results/
+
+          if git diff --staged --quiet; then
+            echo "No changes to commit"
+            exit 0
+          fi
+
+          git commit -m "Add benchmark results for version ${VERSION}"
+
+      - name: Push benchmark branch
+        run: |
+          BRANCH_NAME="${{ steps.version.outputs.branch_name }}"
+          git push origin $BRANCH_NAME
+
+      - name: Create Pull Request
+        env:
+          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          BRANCH_NAME="${{ steps.version.outputs.branch_name }}"
+
+          PR_BODY="This PR contains automated benchmark results comparing PySentry v${VERSION} against pip-audit."
+
+          gh pr create \
+            --title "Benchmark results for v${VERSION}" \
+            --body "$PR_BODY" \
+            --base main \
+            --head $BRANCH_NAME \
+            --label "benchmark,automated"
+
+      - name: Summary
+        run: |
+          VERSION="${{ steps.version.outputs.version }}"
+          BRANCH_NAME="${{ steps.version.outputs.branch_name }}"
+
+          echo "Benchmark workflow completed successfully!"
+          echo ""
+          echo "Benchmarked version: v${VERSION}"
+          echo "Created branch: ${BRANCH_NAME}"
+          echo "Results location: benchmarks/results/"
+          echo "Pull request created automatically"
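The "Extract version from tag" and "Run benchmark suite" steps above do their bookkeeping in shell. As an illustrative aside only (not part of the release), the same version-normalization and latest-report selection logic could be written in Python roughly as follows; the function names here are hypothetical:

```python
from pathlib import Path
import shutil

def extract_version(ref_or_input: str) -> str:
    # "v0.2.3" -> "0.2.3", mirroring VERSION_CLEAN=${VERSION#v}
    return ref_or_input[1:] if ref_or_input.startswith("v") else ref_or_input

def publish_latest(results_dir: Path) -> Path | None:
    # Pick the lexicographically newest results/*.md and copy it to latest.md,
    # mirroring the tail of the "Run benchmark suite" step.
    reports = sorted(results_dir.glob("*.md"), reverse=True)
    if not reports:
        return None
    latest = results_dir / "latest.md"
    shutil.copy2(reports[0], latest)
    return latest
```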
{pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/Cargo.lock
@@ -1115,7 +1115,7 @@ dependencies = [
 
 [[package]]
 name = "pysentry"
-version = "0.2.2"
+version = "0.2.3"
 dependencies = [
  "anyhow",
  "async-trait",
@@ -1128,6 +1128,7 @@ dependencies = [
  "pyo3",
  "regex",
  "reqwest",
+ "rustc-hash",
  "serde",
  "serde_json",
  "serde_yaml",
{pysentry_rs-0.2.2 → pysentry_rs-0.2.3}/Cargo.toml
@@ -1,6 +1,6 @@
 [package]
 name = "pysentry"
-version = "0.2.2"
+version = "0.2.3"
 edition = "2021"
 rust-version = "1.79"
 description = "Security vulnerability auditing for Python packages"
@@ -33,6 +33,7 @@ pep440_rs = "0.7.3"
 pyo3 = { version = "0.25.1", features = ["extension-module"], optional = true }
 regex = "1.11.1"
 reqwest = { version = "0.12.22", features = ["json", "stream", "rustls-tls"], default-features = false }
+rustc-hash = "2.1.1"
 serde = { version = "1.0.219", features = ["derive"] }
 serde_json = "1.0.142"
 serde_yaml = "0.9.34"
pysentry_rs-0.2.3/benchmarks/.python-version
@@ -0,0 +1 @@
+3.11
pysentry_rs-0.2.3/benchmarks/main.py
@@ -0,0 +1,111 @@
+import sys
+import argparse
+from pathlib import Path
+
+sys.path.insert(0, str(Path(__file__).parent / "src"))
+
+from src.benchmark_runner import BenchmarkRunner
+
+
+def main():
+    parser = argparse.ArgumentParser(
+        description="PySentry vs pip-audit benchmark suite",
+        formatter_class=argparse.RawDescriptionHelpFormatter,
+        epilog="""
+Examples:
+  python main.py                                # Run full benchmark suite
+  python main.py --quick                        # Run only small dataset for quick testing
+  python main.py --output-dir ./custom-results  # Custom output directory
+        """,
+    )
+
+    parser.add_argument(
+        "--quick", action="store_true", help="Run only small dataset for quick testing"
+    )
+
+    parser.add_argument(
+        "--output-dir",
+        type=Path,
+        help="Custom output directory for results (default: ./results/)",
+    )
+
+    parser.add_argument(
+        "--verbose", "-v", action="store_true", help="Enable verbose output"
+    )
+
+    parser.add_argument(
+        "--skip-build",
+        action="store_true",
+        help="Skip PySentry build check (assume it's already built)",
+    )
+
+    args = parser.parse_args()
+
+    try:
+        benchmark_dir = Path(__file__).parent
+        if args.output_dir:
+            runner = BenchmarkRunner(benchmark_dir)
+            runner.results_dir = args.output_dir
+            runner.results_dir.mkdir(parents=True, exist_ok=True)
+        else:
+            runner = BenchmarkRunner(benchmark_dir)
+
+        if args.verbose:
+            print(f"Benchmark directory: {benchmark_dir}")
+            print(f"Results directory: {runner.results_dir}")
+
+        if args.quick:
+            print("Quick mode: Running only small dataset...")
+            large_dataset = runner.test_data_dir / "large_requirements.txt"
+            backup_path = None
+            if large_dataset.exists():
+                backup_path = runner.test_data_dir / "large_requirements.txt.backup"
+                large_dataset.rename(backup_path)
+
+        try:
+            print("Starting benchmark suite...")
+            suite = runner.run_full_benchmark_suite()
+
+            report_path = runner.save_and_generate_report(suite)
+
+            successful_runs = len(
+                [r for r in suite.results if r.metrics.exit_code <= 1]
+            )
+            total_runs = len(suite.results)
+
+            print("\n" + "=" * 60)
+            print("BENCHMARK SUITE COMPLETED")
+            print("=" * 60)
+            print(f"Total runs: {total_runs}")
+            print(f"Successful: {successful_runs}")
+            print(f"Failed: {total_runs - successful_runs}")
+            print(f"Duration: {suite.total_duration:.2f} seconds")
+            print(f"Report saved to: {report_path}")
+            print("=" * 60)
+
+            exit_code = 0 if successful_runs == total_runs else 1
+
+            if exit_code != 0:
+                print(f"WARNING: {total_runs - successful_runs} benchmark runs failed!")
+
+            return exit_code
+
+        finally:
+            if args.quick and backup_path and backup_path.exists():
+                backup_path.rename(large_dataset)
+
+    except KeyboardInterrupt:
+        print("\nBenchmark interrupted by user.")
+        return 1
+
+    except Exception as e:
+        print(f"Error running benchmark suite: {e}")
+        if args.verbose:
+            import traceback
+
+            traceback.print_exc()
+        return 1
+
+
+if __name__ == "__main__":
+    sys.exit(main())
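main.py is a thin CLI wrapper around BenchmarkRunner (defined in benchmarks/src/benchmark_runner.py, shown further below). A minimal programmatic sketch of the same flow, assuming it is run from inside the benchmarks/ directory after `uv sync`; the output directory name is illustrative:

```python
from pathlib import Path
from src.benchmark_runner import BenchmarkRunner

runner = BenchmarkRunner()                    # defaults to the benchmarks/ directory
runner.results_dir = Path("custom-results")   # what --output-dir does in main.py
runner.results_dir.mkdir(parents=True, exist_ok=True)

suite = runner.run_full_benchmark_suite()     # cold + hot runs for every config/dataset
report_path = runner.save_and_generate_report(suite)  # writes results/<version>.md

failed = [r for r in suite.results if r.metrics.exit_code > 1]
print(f"{len(suite.results)} runs, {len(failed)} failed, report: {report_path}")
```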
pysentry_rs-0.2.3/benchmarks/pyproject.toml
@@ -0,0 +1,12 @@
+[project]
+name = "benchmarks"
+version = "0.1.0"
+description = "Performance benchmark suite for PySentry vs pip-audit"
+readme = "README.md"
+requires-python = ">=3.10"
+dependencies = [
+    "matplotlib>=3.10.5",
+    "pip-audit>=2.9.0",
+    "psutil>=7.0.0",
+    "tabulate>=0.9.0",
+]
pysentry_rs-0.2.3/benchmarks/src/benchmark_runner.py
@@ -0,0 +1,364 @@
+import shutil
+from pathlib import Path
+from typing import List, Dict, Any, Optional
+from datetime import datetime
+from dataclasses import dataclass, asdict
+
+from .tool_wrapper import ToolRegistry, BenchmarkConfig
+from .performance_monitor import PerformanceMetrics, SystemInfo
+from .report_generator import ReportGenerator
+
+
+@dataclass
+class BenchmarkResult:
+    config_name: str
+    tool_name: str
+    dataset_name: str
+    cache_type: str
+    metrics: PerformanceMetrics
+    timestamp: str
+
+    def to_dict(self) -> Dict[str, Any]:
+        data = asdict(self)
+        data["metrics"] = asdict(self.metrics)
+        return data
+
+
+@dataclass
+class BenchmarkSuite:
+    system_info: SystemInfo
+    results: List[BenchmarkResult]
+    start_time: str
+    end_time: str
+    total_duration: float
+
+    def to_dict(self) -> Dict[str, Any]:
+        return {
+            "system_info": asdict(self.system_info),
+            "results": [result.to_dict() for result in self.results],
+            "start_time": self.start_time,
+            "end_time": self.end_time,
+            "total_duration": self.total_duration,
+        }
+
+
+class BenchmarkRunner:
+    def __init__(self, benchmark_dir: Optional[Path] = None):
+        if benchmark_dir is None:
+            benchmark_dir = Path(__file__).parent.parent
+
+        self.benchmark_dir = benchmark_dir
+        self.test_data_dir = benchmark_dir / "test_data"
+        self.results_dir = benchmark_dir / "results"
+        self.cache_dir = benchmark_dir / "cache"
+        self.workdirs = benchmark_dir / "workdirs"
+
+        self.results_dir.mkdir(exist_ok=True)
+        self.cache_dir.mkdir(exist_ok=True)
+        self.workdirs.mkdir(exist_ok=True)
+
+        self.tool_registry = ToolRegistry(cache_dir=self.cache_dir)
+        self.report_generator = ReportGenerator()
+
+    def initialize_cache_directory(self):
+        print(f"Using cache directory: {self.cache_dir}")
+
+        self.cache_dir.mkdir(exist_ok=True)
+
+        if self.cache_dir.exists():
+            for cache_file in self.cache_dir.glob("*"):
+                if cache_file.is_file():
+                    cache_file.unlink()
+                elif cache_file.is_dir():
+                    shutil.rmtree(cache_file)
+
+        print("Initialized clean cache directory")
+
+    def clear_cache_directory(self):
+        print(f"Clearing cache directory: {self.cache_dir}")
+
+        if self.cache_dir.exists():
+            for cache_file in self.cache_dir.glob("*"):
+                if cache_file.is_file():
+                    cache_file.unlink()
+                elif cache_file.is_dir():
+                    shutil.rmtree(cache_file)
+
+        print("Cache directory cleared")
+
+    def clean_work_directories(self):
+        print(f"Cleaning work directories in: {self.workdirs}")
+
+        if self.workdirs.exists():
+            for work_dir in self.workdirs.glob("*"):
+                if work_dir.is_dir():
+                    try:
+                        shutil.rmtree(work_dir)
+                    except Exception as e:
+                        print(f"Warning: Could not remove {work_dir}: {e}")
+
+        print("Work directories cleaned")
+
+    def run_single_benchmark(
+        self, config: BenchmarkConfig, dataset_path: Path, cache_type: str
+    ) -> BenchmarkResult:
+        dataset_name = dataset_path.stem
+
+        print(f"Running {config.config_name} on {dataset_name} ({cache_type} cache)...")
+
+        tool = self.tool_registry.get_tool(config.tool_name)
+        if not tool:
+            raise ValueError(f"Tool {config.tool_name} not available")
+
+        work_dir_name = f"{dataset_name}_{config.config_name}_{cache_type}"
+        work_path = self.workdirs / work_dir_name
+
+        if work_path.exists():
+            shutil.rmtree(work_path)
+        work_path.mkdir()
+
+        try:
+            temp_requirements = work_path / "requirements.txt"
+            shutil.copy2(dataset_path, temp_requirements)
+
+            (work_path / "setup.py").write_text("# Minimal setup.py for benchmarking")
+
+            print(f" Working in: {work_path}")
+
+            use_cache = cache_type == "hot"
+            metrics = tool.execute(
+                config,
+                temp_requirements,
+                use_cache=use_cache,
+                working_dir=work_path,
+                dataset_name=dataset_name,
+                cache_type=cache_type,
+            )
+        except Exception as e:
+            print(f" ✗ Exception during execution: {e}")
+            raise
+
+        if metrics.exit_code <= 1:
+            try:
+                shutil.rmtree(work_path)
+            except:
+                pass
+        else:
+            print(f" ! Work directory preserved for debugging: {work_path}")
+
+        return BenchmarkResult(
+            config_name=config.config_name,
+            tool_name=config.tool_name,
+            dataset_name=dataset_name,
+            cache_type=cache_type,
+            metrics=metrics,
+            timestamp=datetime.now().isoformat(),
+        )
+
+    def run_dataset_benchmarks(self, dataset_path: Path) -> List[BenchmarkResult]:
+        results = []
+        configs = self.tool_registry.get_all_benchmark_configs(dataset_path)
+
+        if not configs:
+            print("No benchmark configurations available!")
+            return results
+
+        print(
+            f"Running benchmarks on {dataset_path.name} ({len(configs)} configurations)"
+        )
+        print("Testing strategy:")
+        print(" - Cold phase: Clear cache → Run cold → Record")
+        print(" - Hot phase: Clear cache → Run cold (warmup) → Run hot → Record")
+
+        print(f"\n🧊 COLD TESTING PHASE - {dataset_path.name}")
+        print("=" * 60)
+        for i, config in enumerate(configs):
+            print(f"\nCold test {i + 1}/{len(configs)}: {config.config_name}")
+            print("-" * 40)
+
+            print("🧽 Clearing all caches...")
+            self.tool_registry.clear_all_caches()
+
+            try:
+                cold_result = self.run_single_benchmark(config, dataset_path, "cold")
+                results.append(cold_result)
+                self._show_result_feedback(cold_result, "cold")
+
+            except Exception as e:
+                print(f" ✗ Error running {config.config_name} (cold): {e}")
+                error_result = self._create_error_result(
+                    config, dataset_path, "cold", str(e)
+                )
+                results.append(error_result)
+
+        print(f"\n🔥 HOT TESTING PHASE - {dataset_path.name}")
+        print("=" * 60)
+        for i, config in enumerate(configs):
+            print(f"\nHot test {i + 1}/{len(configs)}: {config.config_name}")
+            print("-" * 40)
+
+            print("🧽 Clearing all caches...")
+            self.tool_registry.clear_all_caches()
+
+            print("🌡️ Running warmup (cold test to populate cache)...")
+            try:
+                warmup_result = self.run_single_benchmark(config, dataset_path, "cold")
+                if warmup_result.metrics.exit_code <= 1:
+                    print(
+                        f" ✓ Warmup completed ({warmup_result.metrics.execution_time:.2f}s)"
+                    )
+                else:
+                    print(
+                        f" ✗ Warmup failed (exit code {warmup_result.metrics.exit_code})"
+                    )
+            except Exception as e:
+                print(f" ⚠️ Warmup failed: {e}")
+
+            print("🔥 Running hot test (with cache from warmup)...")
+            try:
+                hot_result = self.run_single_benchmark(config, dataset_path, "hot")
+                results.append(hot_result)
+                self._show_result_feedback(hot_result, "hot")
+
+            except Exception as e:
+                print(f" ✗ Error running {config.config_name} (hot): {e}")
+                error_result = self._create_error_result(
+                    config, dataset_path, "hot", str(e)
+                )
+                results.append(error_result)
+
+        print(f"\n✅ Completed all configurations for {dataset_path.name}")
+        return results
+
+    def _show_result_feedback(self, result: BenchmarkResult, cache_type: str):
+        if result.metrics.exit_code == 0:
+            print(
+                f" ✓ {result.config_name} ({cache_type}): "
+                f"{result.metrics.execution_time:.2f}s, "
+                f"{result.metrics.peak_memory_mb:.1f}MB (no vulnerabilities)"
+            )
+        elif result.metrics.exit_code == 1:
+            print(
+                f" ✓ {result.config_name} ({cache_type}): "
+                f"{result.metrics.execution_time:.2f}s, "
+                f"{result.metrics.peak_memory_mb:.1f}MB (vulnerabilities found)"
+            )
+        else:
+            print(
+                f" ✗ {result.config_name} ({cache_type}): FAILED (exit code {result.metrics.exit_code})"
+            )
+
+    def _create_error_result(
+        self,
+        config: BenchmarkConfig,
+        dataset_path: Path,
+        cache_type: str,
+        error_msg: str,
+    ) -> BenchmarkResult:
+        return BenchmarkResult(
+            config_name=config.config_name,
+            tool_name=config.tool_name,
+            dataset_name=dataset_path.stem,
+            cache_type=cache_type,
+            metrics=PerformanceMetrics(
+                execution_time=0.0,
+                peak_memory_mb=0.0,
+                avg_memory_mb=0.0,
+                cpu_percent=0.0,
+                exit_code=-1,
+                stdout="",
+                stderr=f"Benchmark error: {error_msg}",
+            ),
+            timestamp=datetime.now().isoformat(),
+        )
+
+    def run_full_benchmark_suite(self) -> BenchmarkSuite:
+        start_time = datetime.now()
+        print(f"Starting full benchmark suite at {start_time.isoformat()}")
+
+        self.clean_work_directories()
+
+        if not self.tool_registry.ensure_pysentry_built():
+            raise RuntimeError("Could not build or find PySentry binary")
+
+        available_tools = self.tool_registry.get_available_tools()
+        print(f"Available tools: {', '.join(available_tools)}")
+
+        if not available_tools:
+            raise RuntimeError("No tools available for benchmarking")
+
+        datasets = []
+        for pattern in ["small_requirements.txt", "large_requirements.txt"]:
+            dataset_path = self.test_data_dir / pattern
+            if dataset_path.exists():
+                datasets.append(dataset_path)
+            else:
+                print(f"Warning: Dataset {pattern} not found")
+
+        if not datasets:
+            raise RuntimeError("No benchmark datasets found")
+
+        print(f"Found {len(datasets)} datasets: {[d.name for d in datasets]}")
+
+        all_results = []
+        for i, dataset in enumerate(datasets):
+            print(f"\n{'=' * 80}")
+            print(f"TESTING DATASET {i + 1}/{len(datasets)}: {dataset.name}")
+            print(f"{'=' * 80}")
+
+            results = self.run_dataset_benchmarks(dataset)
+            all_results.extend(results)
+            print(f"Completed {dataset.name}: {len(results)} results")
+
+        end_time = datetime.now()
+        duration = (end_time - start_time).total_seconds()
+
+        suite = BenchmarkSuite(
+            system_info=SystemInfo.get_current(),
+            results=all_results,
+            start_time=start_time.isoformat(),
+            end_time=end_time.isoformat(),
+            total_duration=duration,
+        )
+
+        print(f"Benchmark suite completed in {duration:.2f} seconds")
+        print(f"Total results: {len(all_results)}")
+
+        return suite
+
+    def get_pysentry_version(self) -> str:
+        try:
+            pysentry_tool = self.tool_registry.get_tool("pysentry")
+            if pysentry_tool and pysentry_tool.binary_path:
+                import subprocess
+
+                result = subprocess.run(
+                    [str(pysentry_tool.binary_path), "--version"],
+                    capture_output=True,
+                    text=True,
+                    timeout=10,
+                )
+                if result.returncode == 0:
+                    version_line = result.stdout.strip()
+                    if " " in version_line:
+                        return version_line.split()[-1]
+                    return version_line
+        except Exception:
+            pass
+        return "unknown"
+
+    def save_and_generate_report(self, suite: BenchmarkSuite) -> Path:
+        version = self.get_pysentry_version()
+        report_filename = f"{version}.md"
+        report_path = self.results_dir / report_filename
+
+        print(f"Generating report: {report_path}")
+
+        markdown_content = self.report_generator.generate_report(suite)
+
+        with open(report_path, "w", encoding="utf-8") as f:
+            f.write(markdown_content)
+
+        print(f"Report saved to: {report_path}")
+
+        return report_path