adversarial-workflow 0.6.0__tar.gz → 0.6.2__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (53)
  1. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/PKG-INFO +34 -4
  2. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/README.md +33 -3
  3. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/__init__.py +1 -1
  4. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/cli.py +111 -58
  5. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/evaluators/__init__.py +3 -2
  6. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/evaluators/discovery.py +9 -4
  7. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/evaluators/runner.py +16 -8
  8. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/utils/file_splitter.py +218 -184
  9. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/utils/validation.py +3 -1
  10. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow.egg-info/PKG-INFO +34 -4
  11. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow.egg-info/SOURCES.txt +2 -0
  12. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/pyproject.toml +3 -1
  13. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_cli.py +24 -69
  14. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_cli_dynamic_commands.py +99 -200
  15. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_config.py +49 -44
  16. adversarial_workflow-0.6.2/tests/test_env_loading.py +176 -0
  17. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_evaluate.py +177 -129
  18. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_evaluator_discovery.py +3 -1
  19. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_evaluator_runner.py +18 -5
  20. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_file_splitter.py +105 -103
  21. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_list_evaluators.py +24 -45
  22. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_python_version.py +16 -16
  23. adversarial_workflow-0.6.2/tests/test_scripts_project.py +120 -0
  24. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_split_command.py +45 -37
  25. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_utils_validation.py +26 -10
  26. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/LICENSE +0 -0
  27. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/__main__.py +0 -0
  28. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/evaluators/builtins.py +0 -0
  29. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/evaluators/config.py +0 -0
  30. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/.aider.conf.yml.template +0 -0
  31. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/.env.example.template +0 -0
  32. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/README.template +0 -0
  33. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/agent-context/AGENT-SYSTEM-GUIDE.md +0 -0
  34. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/agent-context/README.md.template +0 -0
  35. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/agent-context/agent-handoffs-minimal.json.template +0 -0
  36. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/agent-context/agent-handoffs.json.template +0 -0
  37. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/agent-context/current-state.json.template +0 -0
  38. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/config.yml.template +0 -0
  39. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/evaluate_plan.sh.template +0 -0
  40. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/example-task.md.template +0 -0
  41. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/proofread_content.sh.template +0 -0
  42. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/review_implementation.sh.template +0 -0
  43. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/templates/validate_tests.sh.template +0 -0
  44. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/utils/__init__.py +0 -0
  45. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/utils/colors.py +0 -0
  46. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow/utils/config.py +0 -0
  47. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow.egg-info/dependency_links.txt +0 -0
  48. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow.egg-info/entry_points.txt +0 -0
  49. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow.egg-info/requires.txt +0 -0
  50. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/adversarial_workflow.egg-info/top_level.txt +0 -0
  51. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/setup.cfg +0 -0
  52. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/setup.py +0 -0
  53. {adversarial_workflow-0.6.0 → adversarial_workflow-0.6.2}/tests/test_evaluator_config.py +0 -0
@@ -1,6 +1,6 @@
  Metadata-Version: 2.4
  Name: adversarial-workflow
- Version: 0.6.0
+ Version: 0.6.2
  Summary: Multi-stage AI code review system preventing phantom work - Author/Evaluator pattern
  Author: Fredrik Matheson
  License: MIT
@@ -35,6 +35,10 @@ Dynamic: license-file

  # Adversarial Workflow

+ [![PyPI version](https://badge.fury.io/py/adversarial-workflow.svg)](https://pypi.org/project/adversarial-workflow/)
+ [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+
  **A multi-stage AI code review system that makes your code better**

  Evaluate proposals, sort out ideas, and prevent "phantom work" (AI claiming to implement but not delivering) through adversarial verification using independent review stages. A battle-tested workflow from the [thematic-cuts](https://github.com/movito/thematic-cuts) project that achieved 96.9% test pass rate improvement.
@@ -51,6 +55,31 @@ Evaluate proposals, sort out ideas, and prevent "phantom work" (AI claiming to i
  - 🎯 **Tool-agnostic**: Use with Claude Code, Cursor, Aider, manual coding, or any workflow
  - ✨ **Interactive onboarding**: Guided setup wizard gets you started in <5 minutes

+ ## What's New in v0.6.0
+
+ 🔌 **Plugin Architecture** - Define custom evaluators without modifying the package:
+
+ ```bash
+ # Create a custom evaluator
+ mkdir -p .adversarial/evaluators
+ cat > .adversarial/evaluators/athena.yml << 'EOF'
+ name: athena
+ description: Knowledge evaluation using Gemini 2.5 Pro
+ model: gemini-2.5-pro
+ api_key_env: GEMINI_API_KEY
+ prompt: |
+ You are Athena, a knowledge evaluation specialist...
+ EOF
+
+ # Use it immediately
+ adversarial athena docs/research-plan.md
+
+ # List all available evaluators
+ adversarial list-evaluators
+ ```
+
+ See [Custom Evaluators](#custom-evaluators) for full documentation, or check the [CHANGELOG](CHANGELOG.md) for complete release history.
+
  ## Prerequisites

  Before installing, ensure you have:
@@ -856,12 +885,13 @@ From the [thematic-cuts](https://github.com/movito/thematic-cuts) project:

  ## Documentation

- - **Interaction Patterns**: How Author-Reviewer collaboration works
+ - **[Custom Evaluators Guide](docs/CUSTOM_EVALUATORS.md)**: Create project-specific evaluators
+ - **[Integration Guide](docs/INTEGRATION-GUIDE.md)**: Detailed integration strategies
+ - **[CHANGELOG](CHANGELOG.md)**: Release history and version notes
+ - **Interaction Patterns**: How Author-Evaluator collaboration works
  - **Token Optimization**: Detailed Aider configuration guide
  - **Workflow Phases**: Step-by-step guide for each phase
  - **Troubleshooting**: Common issues and solutions
- - **Examples**: Real integration scenarios
- - **Terminology**: Official standards for Author/Reviewer concepts

  See `docs/` directory for comprehensive guides.

@@ -1,5 +1,9 @@
  # Adversarial Workflow

+ [![PyPI version](https://badge.fury.io/py/adversarial-workflow.svg)](https://pypi.org/project/adversarial-workflow/)
+ [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+
  **A multi-stage AI code review system that makes your code better**

  Evaluate proposals, sort out ideas, and prevent "phantom work" (AI claiming to implement but not delivering) through adversarial verification using independent review stages. A battle-tested workflow from the [thematic-cuts](https://github.com/movito/thematic-cuts) project that achieved 96.9% test pass rate improvement.
@@ -16,6 +20,31 @@ Evaluate proposals, sort out ideas, and prevent "phantom work" (AI claiming to i
  - 🎯 **Tool-agnostic**: Use with Claude Code, Cursor, Aider, manual coding, or any workflow
  - ✨ **Interactive onboarding**: Guided setup wizard gets you started in <5 minutes

+ ## What's New in v0.6.0
+
+ 🔌 **Plugin Architecture** - Define custom evaluators without modifying the package:
+
+ ```bash
+ # Create a custom evaluator
+ mkdir -p .adversarial/evaluators
+ cat > .adversarial/evaluators/athena.yml << 'EOF'
+ name: athena
+ description: Knowledge evaluation using Gemini 2.5 Pro
+ model: gemini-2.5-pro
+ api_key_env: GEMINI_API_KEY
+ prompt: |
+ You are Athena, a knowledge evaluation specialist...
+ EOF
+
+ # Use it immediately
+ adversarial athena docs/research-plan.md
+
+ # List all available evaluators
+ adversarial list-evaluators
+ ```
+
+ See [Custom Evaluators](#custom-evaluators) for full documentation, or check the [CHANGELOG](CHANGELOG.md) for complete release history.
+
  ## Prerequisites

  Before installing, ensure you have:
@@ -821,12 +850,13 @@ From the [thematic-cuts](https://github.com/movito/thematic-cuts) project:

  ## Documentation

- - **Interaction Patterns**: How Author-Reviewer collaboration works
+ - **[Custom Evaluators Guide](docs/CUSTOM_EVALUATORS.md)**: Create project-specific evaluators
+ - **[Integration Guide](docs/INTEGRATION-GUIDE.md)**: Detailed integration strategies
+ - **[CHANGELOG](CHANGELOG.md)**: Release history and version notes
+ - **Interaction Patterns**: How Author-Evaluator collaboration works
  - **Token Optimization**: Detailed Aider configuration guide
  - **Workflow Phases**: Step-by-step guide for each phase
  - **Troubleshooting**: Common issues and solutions
- - **Examples**: Real integration scenarios
- - **Terminology**: Official standards for Author/Reviewer concepts

  See `docs/` directory for comprehensive guides.

@@ -12,7 +12,7 @@ Usage:
  adversarial validate "pytest"
  """

- __version__ = "0.6.0"
+ __version__ = "0.6.2"
  __author__ = "Fredrik Matheson"
  __license__ = "MIT"

@@ -27,9 +27,9 @@ from pathlib import Path
  from typing import Dict, List, Optional, Tuple

  import yaml
- from dotenv import load_dotenv
+ from dotenv import dotenv_values, load_dotenv

- __version__ = "0.6.0"
+ __version__ = "0.6.2"

  # ANSI color codes for better output
  RESET = "\033[0m"
@@ -800,26 +800,36 @@ def check() -> int:
  issues: List[Dict] = []
  good_checks: List[str] = []

- # Check for .env file first (before loading environment variables)
+ # Check for .env file (note: already loaded by main() at startup)
  env_file = Path(".env")
  env_loaded = False
- env_keys_before = set(os.environ.keys())

  if env_file.exists():
  try:
+ # Count variables by reading file directly (works even if already loaded)
+ env_vars = dotenv_values(env_file)
+ var_count = len([k for k, v in env_vars.items() if v is not None])
+
+ # Still load to ensure environment is set
  load_dotenv(env_file)
- env_keys_after = set(os.environ.keys())
- new_keys = env_keys_after - env_keys_before
  env_loaded = True
- good_checks.append(
- f".env file found and loaded ({len(new_keys)} variables)"
+ good_checks.append(f".env file found and loaded ({var_count} variables)")
+ except (FileNotFoundError, PermissionError) as e:
+ # File access errors
+ issues.append(
+ {
+ "severity": "WARNING",
+ "message": f".env file found but could not be read: {e}",
+ "fix": "Check .env file permissions",
+ }
  )
- except Exception as e:
+ except (OSError, ValueError) as e:
+ # Covers UnicodeDecodeError (ValueError subclass) and other OS errors
  issues.append(
  {
  "severity": "WARNING",
- "message": f".env file found but could not be loaded: {e}",
- "fix": "Check .env file format and permissions",
+ "message": f".env file found but could not be parsed: {e}",
+ "fix": "Check .env file encoding (should be UTF-8)",
  }
  )
  else:
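
The counting change in `check()` above matters because the `.env` file is now loaded by `main()` at startup, so diffing `os.environ` before and after a second `load_dotenv()` call would report zero new variables. A minimal standalone sketch of the idea, with an invented `.env` (not the package's own code):

```python
# Sketch: why dotenv_values() counts .env entries correctly even when the file
# has already been loaded into os.environ (the old before/after-keys diff would
# report 0 in that case). Standalone example with an invented .env file.
import os
import tempfile
from pathlib import Path

from dotenv import dotenv_values, load_dotenv

with tempfile.TemporaryDirectory() as tmp:
    env_file = Path(tmp) / ".env"
    env_file.write_text("API_KEY=abc123\nMODEL=gemini-2.5-pro\n")

    load_dotenv(env_file)  # simulate main() loading .env at startup

    # Old approach: diff os.environ around a second load -> nothing new appears.
    before = set(os.environ)
    load_dotenv(env_file)
    print(len(set(os.environ) - before))  # 0

    # New approach: parse the file itself, independent of os.environ state.
    values = dotenv_values(env_file)
    print(len([k for k, v in values.items() if v is not None]))  # 2
```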
@@ -2086,10 +2096,6 @@ def evaluate(task_file: str) -> int:
  return 0


-
-
-
-
  def review() -> int:
  """Run Phase 3: Code review."""

@@ -2728,54 +2734,61 @@ def agent_onboard(project_path: str = ".") -> int:
  return 0


- def split(task_file: str, strategy: str = "sections", max_lines: int = 500, dry_run: bool = False):
+ def split(
+ task_file: str,
+ strategy: str = "sections",
+ max_lines: int = 500,
+ dry_run: bool = False,
+ ):
  """Split large task files into smaller evaluable chunks.
-
+
  Args:
  task_file: Path to the task file to split
  strategy: Split strategy ('sections', 'phases', or 'manual')
  max_lines: Maximum lines per split (default: 500)
  dry_run: Preview splits without creating files
-
+
  Returns:
  Exit code (0 for success, 1 for error)
  """
  from .utils.file_splitter import (
- analyze_task_file,
- split_by_sections,
- split_by_phases,
- generate_split_files
+ analyze_task_file,
+ generate_split_files,
+ split_by_phases,
+ split_by_sections,
  )
-
+
  try:
  print_box("File Splitting Utility", CYAN)
-
+
  # Validate file exists
  if not os.path.exists(task_file):
  print(f"{RED}Error: File not found: {task_file}{RESET}")
  return 1
-
+
  # Analyze file
  print(f"📄 Analyzing task file: {task_file}")
  analysis = analyze_task_file(task_file)
-
- lines = analysis['total_lines']
- tokens = analysis['estimated_tokens']
+
+ lines = analysis["total_lines"]
+ tokens = analysis["estimated_tokens"]
  print(f" Lines: {lines}")
  print(f" Estimated tokens: ~{tokens:,}")
-
+
  # Check if splitting is recommended
  if lines <= max_lines:
- print(f"{GREEN}✅ File is under recommended limit ({max_lines} lines){RESET}")
+ print(
+ f"{GREEN}✅ File is under recommended limit ({max_lines} lines){RESET}"
+ )
  print("No splitting needed.")
  return 0
-
+
  print(f"{YELLOW}⚠️ File exceeds recommended limit ({max_lines} lines){RESET}")
-
+
  # Read file content for splitting
- with open(task_file, 'r', encoding='utf-8') as f:
+ with open(task_file, "r", encoding="utf-8") as f:
  content = f.read()
-
+
  # Apply split strategy
  if strategy == "sections":
  splits = split_by_sections(content, max_lines=max_lines)
@@ -2784,42 +2797,44 @@ def split(task_file: str, strategy: str = "sections", max_lines: int = 500, dry_
  splits = split_by_phases(content)
  print(f"\n💡 Suggested splits (by phases):")
  else:
- print(f"{RED}Error: Unknown strategy '{strategy}'. Use 'sections' or 'phases'.{RESET}")
+ print(
+ f"{RED}Error: Unknown strategy '{strategy}'. Use 'sections' or 'phases'.{RESET}"
+ )
  return 1
-
+
  # Display split preview
  for i, split in enumerate(splits, 1):
  filename = f"{Path(task_file).stem}-part{i}{Path(task_file).suffix}"
  print(f" - {filename} ({split['line_count']} lines)")
-
+
  # Dry run mode
  if dry_run:
  print(f"\n{CYAN}📋 Dry run mode - no files created{RESET}")
  return 0
-
+
  # Prompt user for confirmation
  create_files = prompt_user(f"\nCreate {len(splits)} files?", default="n")
-
- if create_files.lower() in ['y', 'yes']:
+
+ if create_files.lower() in ["y", "yes"]:
  # Create output directory
  output_dir = os.path.join(os.path.dirname(task_file), "splits")
-
+
  # Generate split files
  created_files = generate_split_files(task_file, splits, output_dir)
-
+
  print(f"{GREEN}✅ Created {len(created_files)} files:{RESET}")
  for file_path in created_files:
  print(f" {file_path}")
-
+
  print(f"\n{CYAN}💡 Tip: Evaluate each split file independently:{RESET}")
  for file_path in created_files:
  rel_path = os.path.relpath(file_path)
  print(f" adversarial evaluate {rel_path}")
  else:
  print("Cancelled - no files created.")
-
+
  return 0
-
+
  except Exception as e:
  print(f"{RED}Error during file splitting: {e}{RESET}")
  return 1
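
The helpers imported by `split()` can also be driven directly. A rough sketch under the signatures and dictionary keys visible in the hunks above; the task file name is hypothetical and any other keys the helpers return are not shown here:

```python
# Sketch: driving the file_splitter helpers the way the split command does.
# Signatures and dict keys are taken from the diff above; the task file name
# is hypothetical.
import os

from adversarial_workflow.utils.file_splitter import (
    analyze_task_file,
    generate_split_files,
    split_by_sections,
)

task_file = "TASK-2025-001.md"  # hypothetical task file

if os.path.exists(task_file):
    analysis = analyze_task_file(task_file)
    print(analysis["total_lines"], analysis["estimated_tokens"])

    if analysis["total_lines"] > 500:
        with open(task_file, encoding="utf-8") as f:
            content = f.read()
        splits = split_by_sections(content, max_lines=500)
        for part in splits:
            print(part["line_count"])
        # Writes the parts into a splits/ directory and returns the created paths.
        created = generate_split_files(task_file, splits, "splits")
        print(created)
```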
@@ -2865,14 +2880,33 @@ def list_evaluators() -> int:

  return 0

+
  def main():
  """Main CLI entry point."""
  import logging
+ import sys
+
+ # Load .env file before any commands run
+ # Wrapped in try/except so CLI remains usable even with malformed .env
+ try:
+ load_dotenv()
+ except Exception as e:
+ print(f"Warning: Could not load .env file: {e}", file=sys.stderr)
+
+ # Load .env file before any commands run
+ # Use explicit path to ensure we find .env in current working directory
+ # (load_dotenv() without args can fail to find .env in some contexts)
+ env_file = Path.cwd() / ".env"
+ if env_file.exists():
+ try:
+ load_dotenv(env_file)
+ except (OSError, UnicodeDecodeError) as e:
+ print(f"Warning: Could not load .env file: {e}", file=sys.stderr)

  from adversarial_workflow.evaluators import (
+ BUILTIN_EVALUATORS,
  get_all_evaluators,
  run_evaluator,
- BUILTIN_EVALUATORS,
  )

  logger = logging.getLogger(__name__)
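
The second added block passes an explicit path because `load_dotenv()` with no arguments searches for `.env` relative to the calling code rather than the user's working directory, which can miss the project's `.env` when the CLI runs as an installed entry point. A minimal standalone sketch of the distinction (not the package's code):

```python
# Sketch: default .env search vs. an explicit path with python-dotenv.
# Standalone example; no project-specific paths are assumed.
from pathlib import Path

from dotenv import find_dotenv, load_dotenv

# Default: find_dotenv() walks up from the calling code's location, which for
# an installed console script may not be the directory you invoked it from.
print(find_dotenv() or "<no .env found by default search>")

# Explicit: always target .env in the current working directory,
# mirroring what main() does in the hunk above.
env_file = Path.cwd() / ".env"
if env_file.exists():
    load_dotenv(env_file)
```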
@@ -2880,8 +2914,16 @@ def main():
  # Commands that cannot be overridden by evaluators
  # Note: 'review' is special - it reviews git changes without a file argument
  STATIC_COMMANDS = {
- "init", "check", "doctor", "health", "quickstart",
- "agent", "split", "validate", "review", "list-evaluators"
+ "init",
+ "check",
+ "doctor",
+ "health",
+ "quickstart",
+ "agent",
+ "split",
+ "validate",
+ "review",
+ "list-evaluators",
  }

  parser = argparse.ArgumentParser(
@@ -2970,16 +3012,21 @@ For more information: https://github.com/movito/adversarial-workflow
  )
  split_parser.add_argument("task_file", help="Task file to split")
  split_parser.add_argument(
- "--strategy", "-s", choices=["sections", "phases"], default="sections",
- help="Split strategy: 'sections' (default) or 'phases'"
+ "--strategy",
+ "-s",
+ choices=["sections", "phases"],
+ default="sections",
+ help="Split strategy: 'sections' (default) or 'phases'",
  )
  split_parser.add_argument(
- "--max-lines", "-m", type=int, default=500,
- help="Maximum lines per split (default: 500)"
+ "--max-lines",
+ "-m",
+ type=int,
+ default=500,
+ help="Maximum lines per split (default: 500)",
  )
  split_parser.add_argument(
- "--dry-run", action="store_true",
- help="Preview splits without creating files"
+ "--dry-run", action="store_true", help="Preview splits without creating files"
  )

  # list-evaluators command
@@ -3000,7 +3047,12 @@ For more information: https://github.com/movito/adversarial-workflow
  for name, config in evaluators.items():
  # Skip if name conflicts with static command
  if name in STATIC_COMMANDS:
- logger.warning("Evaluator '%s' conflicts with CLI command; skipping", name)
+ # Only warn for user-defined evaluators, not built-ins
+ # Built-in conflicts are intentional (e.g., 'review' command vs 'review' evaluator)
+ if getattr(config, "source", None) != "builtin":
+ logger.warning(
+ "Evaluator '%s' conflicts with CLI command; skipping", name
+ )
  # Mark as registered to prevent alias re-registration attempts
  registered_configs.add(id(config))
  continue
@@ -3027,10 +3079,11 @@ For more information: https://github.com/movito/adversarial-workflow
  )
  eval_parser.add_argument("file", help="File to evaluate")
  eval_parser.add_argument(
- "--timeout", "-t",
+ "--timeout",
+ "-t",
  type=int,
  default=180,
- help="Timeout in seconds (default: 180)"
+ help="Timeout in seconds (default: 180)",
  )
  # Store config for later execution
  eval_parser.set_defaults(evaluator_config=config)
@@ -3078,7 +3131,7 @@ For more information: https://github.com/movito/adversarial-workflow
  args.task_file,
  strategy=args.strategy,
  max_lines=args.max_lines,
- dry_run=args.dry_run
+ dry_run=args.dry_run,
  )
  elif args.command == "list-evaluators":
  return list_evaluators()
@@ -1,13 +1,13 @@
  """Evaluators module for adversarial-workflow plugin architecture."""

+ from .builtins import BUILTIN_EVALUATORS
  from .config import EvaluatorConfig
  from .discovery import (
+ EvaluatorParseError,
  discover_local_evaluators,
  parse_evaluator_yaml,
- EvaluatorParseError,
  )
  from .runner import run_evaluator
- from .builtins import BUILTIN_EVALUATORS


  def get_all_evaluators() -> dict[str, EvaluatorConfig]:
@@ -17,6 +17,7 @@ def get_all_evaluators() -> dict[str, EvaluatorConfig]:
  Aliases from local evaluators are also included in the returned dictionary.
  """
  import logging
+
  logger = logging.getLogger(__name__)

  evaluators: dict[str, EvaluatorConfig] = {}
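
As a rough mental model for `get_all_evaluators()`: built-in definitions and locally discovered ones (plus their aliases) are merged into a single name-to-config mapping. A simplified sketch; the call signature of `discover_local_evaluators()` and the precedence of local definitions over built-ins are assumptions here, and the real function additionally handles aliases and conflict logging:

```python
# Simplified sketch of the merge get_all_evaluators() performs.
# Assumptions: BUILTIN_EVALUATORS is a name -> EvaluatorConfig mapping,
# discover_local_evaluators() takes no arguments and returns the same shape,
# and local definitions win on name clashes. Aliases and logging are omitted.
from adversarial_workflow.evaluators import BUILTIN_EVALUATORS
from adversarial_workflow.evaluators.config import EvaluatorConfig
from adversarial_workflow.evaluators.discovery import discover_local_evaluators


def merged_evaluators() -> dict[str, EvaluatorConfig]:
    evaluators: dict[str, EvaluatorConfig] = dict(BUILTIN_EVALUATORS)
    evaluators.update(discover_local_evaluators())
    return evaluators
```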
@@ -40,9 +40,7 @@ def parse_evaluator_yaml(yml_file: Path) -> EvaluatorConfig:
  try:
  content = yml_file.read_text(encoding="utf-8")
  except UnicodeDecodeError as e:
- raise EvaluatorParseError(
- f"File encoding error (not UTF-8): {yml_file}"
- ) from e
+ raise EvaluatorParseError(f"File encoding error (not UTF-8): {yml_file}") from e

  # Parse YAML
  data = yaml.safe_load(content)
@@ -58,7 +56,14 @@ def parse_evaluator_yaml(yml_file: Path) -> EvaluatorConfig:
  )

  # Validate required fields exist
- required = ["name", "description", "model", "api_key_env", "prompt", "output_suffix"]
+ required = [
+ "name",
+ "description",
+ "model",
+ "api_key_env",
+ "prompt",
+ "output_suffix",
+ ]
  missing = [f for f in required if f not in data]
  if missing:
  raise EvaluatorParseError(f"Missing required fields: {', '.join(missing)}")
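
For context, every local evaluator YAML must provide all six fields validated above. A small self-contained sketch of that check against an example definition; the field values, in particular `output_suffix`, are invented for illustration:

```python
# Sketch: the same required-field check applied to an example evaluator definition.
# The example values (model name, output_suffix, etc.) are illustrative only.
import yaml

example = yaml.safe_load(
    """
name: athena
description: Knowledge evaluation using Gemini 2.5 Pro
model: gemini-2.5-pro
api_key_env: GEMINI_API_KEY
prompt: |
  You are Athena, a knowledge evaluation specialist...
output_suffix: ATHENA-EVALUATION
"""
)

required = ["name", "description", "model", "api_key_env", "prompt", "output_suffix"]
missing = [f for f in required if f not in example]
print(missing or "all required fields present")
```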
@@ -10,10 +10,10 @@ import tempfile
  from datetime import datetime, timezone
  from pathlib import Path

- from .config import EvaluatorConfig
- from ..utils.colors import RESET, BOLD, GREEN, YELLOW, RED
+ from ..utils.colors import BOLD, GREEN, RED, RESET, YELLOW
  from ..utils.config import load_config
  from ..utils.validation import validate_evaluation_output
+ from .config import EvaluatorConfig


  def run_evaluator(config: EvaluatorConfig, file_path: str, timeout: int = 180) -> int:
@@ -124,7 +124,7 @@ def _run_custom_evaluator(
  """

  # Create temp file for prompt
- with tempfile.NamedTemporaryFile(mode='w', suffix='.md', delete=False) as f:
+ with tempfile.NamedTemporaryFile(mode="w", suffix=".md", delete=False) as f:
  f.write(full_prompt)
  prompt_file = f.name

@@ -136,12 +136,15 @@
  # Build aider command
  cmd = [
  "aider",
- "--model", config.model,
+ "--model",
+ config.model,
  "--yes",
  "--no-git",
  "--no-auto-commits",
- "--message-file", prompt_file,
- "--read", file_path,
+ "--message-file",
+ prompt_file,
+ "--read",
+ file_path,
  ]

  result = subprocess.run(
@@ -224,7 +227,10 @@ def _execute_script(

  # Validate output
  file_basename = Path(file_path).stem
- log_file = Path(project_config["log_directory"]) / f"{file_basename}-{config.output_suffix}.md"
+ log_file = (
+ Path(project_config["log_directory"])
+ / f"{file_basename}-{config.output_suffix}.md"
+ )

  is_valid, verdict, message = validate_evaluation_output(str(log_file))

@@ -235,7 +241,9 @@
  return _report_verdict(verdict, log_file, config)


- def _report_verdict(verdict: str | None, log_file: Path, config: EvaluatorConfig) -> int:
+ def _report_verdict(
+ verdict: str | None, log_file: Path, config: EvaluatorConfig
+ ) -> int:
  """Report the evaluation verdict to terminal."""
  print()
  if verdict == "APPROVED":