adversarial-workflow 0.6.6__py3-none-any.whl → 0.9.0__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: adversarial-workflow
3
- Version: 0.6.6
3
+ Version: 0.9.0
4
4
  Summary: Multi-stage AI evaluation system for task plans, code review, and test validation
5
5
  Author: Fredrik Matheson
6
6
  License: MIT
@@ -24,9 +24,11 @@ License-File: LICENSE
24
24
  Requires-Dist: pyyaml>=6.0
25
25
  Requires-Dist: python-dotenv>=0.19.0
26
26
  Requires-Dist: aider-chat>=0.86.0
27
+ Requires-Dist: aiohttp>=3.8.0
27
28
  Provides-Extra: dev
28
29
  Requires-Dist: pytest>=7.0; extra == "dev"
29
30
  Requires-Dist: pytest-cov>=3.0; extra == "dev"
31
+ Requires-Dist: pytest-asyncio>=0.21.0; extra == "dev"
30
32
  Requires-Dist: black>=22.0; extra == "dev"
31
33
  Requires-Dist: isort>=5.0; extra == "dev"
32
34
  Requires-Dist: flake8>=4.0; extra == "dev"
@@ -55,7 +57,7 @@ Evaluate proposals, sort out ideas, and prevent "phantom work" (AI claiming to i
55
57
  - 🎯 **Tool-agnostic**: Use with Claude Code, Cursor, Aider, manual coding, or any workflow
56
58
  - ✨ **Interactive onboarding**: Guided setup wizard gets you started in <5 minutes
57
59
 
58
- ## What's New in v0.6.3
60
+ ## What's New in v0.9.0
59
61
 
60
62
  ### Upgrade
61
63
 
@@ -63,6 +65,67 @@ Evaluate proposals, sort out ideas, and prevent "phantom work" (AI claiming to i
63
65
  pip install --upgrade adversarial-workflow
64
66
  ```
65
67
 
68
+ ### v0.9.0 - Run Library Evaluators
69
+
70
+ **Finally run your installed evaluators!** Use the new `--evaluator` flag:
71
+
72
+ ```bash
73
+ # Install an evaluator from the library
74
+ adversarial library install google/gemini-flash
75
+
76
+ # Run it with --evaluator flag
77
+ adversarial evaluate --evaluator gemini-flash task.md
78
+ adversarial evaluate -e gemini-flash task.md # short form
79
+
80
+ # Works with model_requirement for portable evaluators
81
+ # Automatically resolves to best available model
82
+ ```
83
+
84
+ **Key Features:**
85
+ - Run any installed evaluator by name
86
+ - Supports evaluator aliases
87
+ - Automatic model resolution via `model_requirement`
88
+ - Falls back to legacy `model` field if resolution fails
89
+ - Full backward compatibility - no flag uses existing behavior
90
+
91
+ See [Evaluator Library](#evaluator-library) for full documentation.
92
+
93
+ ### v0.8.1 - BugBot Fixes
94
+
95
+ - **CI/CD compatibility**: `--category --dry-run` no longer hangs in non-TTY environments
96
+ - **Proper exit codes**: Dry-run returns 1 when all previews fail
97
+ - **Config robustness**: Non-dict YAML configs no longer crash
98
+
99
+ ### v0.7.0 - Evaluator Library
100
+
101
+ Browse, install, and update evaluators from the community [adversarial-evaluator-library](https://github.com/movito/adversarial-evaluator-library):
102
+
103
+ ```bash
104
+ # Browse available evaluators
105
+ adversarial library list
106
+
107
+ # Filter by provider or category
108
+ adversarial library list --provider google
109
+ adversarial library list --category quick-check
110
+
111
+ # Install evaluators
112
+ adversarial library install google/gemini-flash openai/fast-check
113
+
114
+ # Check for updates
115
+ adversarial library check-updates
116
+
117
+ # Update installed evaluators
118
+ adversarial library update --all
119
+ ```
120
+
121
+ **Key Features:**
122
+ - Index caching with 1-hour TTL for faster lookups
123
+ - Offline support with stale cache fallback
124
+ - Provenance tracking via `_meta` block in installed files
125
+ - Diff preview before applying updates
126
+
127
+ See [Evaluator Library](#evaluator-library) for full documentation.
128
+
66
129
  ### v0.6.3 - Configurable Timeouts
67
130
 
68
131
  - **Per-evaluator timeout**: Add `timeout: 300` to evaluator YAML for slow models like Mistral Large
@@ -427,7 +490,8 @@ adversarial health # Comprehensive system health check
427
490
  adversarial agent onboard # Set up agent coordination system
428
491
 
429
492
  # Workflow
430
- adversarial evaluate task.md # Phase 1: Evaluate plan
493
+ adversarial evaluate task.md # Phase 1: Evaluate plan (uses config.yml)
494
+ adversarial evaluate -e <name> task.md # Phase 1: Evaluate with installed evaluator
431
495
  adversarial split task.md # Split large files into smaller parts
432
496
  adversarial split task.md --dry-run # Preview split without creating files
433
497
  adversarial review # Phase 3: Review implementation
@@ -435,6 +499,99 @@ adversarial validate "pytest" # Phase 4: Validate with tests
435
499
  adversarial list-evaluators # List all available evaluators
436
500
  ```
437
501
 
502
+ ## Evaluator Library
503
+
504
+ Browse and install pre-configured evaluators from the community [adversarial-evaluator-library](https://github.com/movito/adversarial-evaluator-library).
505
+
506
+ ### Quick Start
507
+
508
+ ```bash
509
+ # Browse available evaluators
510
+ adversarial library list
511
+
512
+ # Filter by provider or category
513
+ adversarial library list --provider google
514
+ adversarial library list --category quick-check
515
+
516
+ # Install an evaluator
517
+ adversarial library install google/gemini-flash
518
+
519
+ # Run it with --evaluator flag
520
+ adversarial evaluate --evaluator gemini-flash task.md
521
+ adversarial evaluate -e gemini-flash task.md # short form
522
+ ```
523
+
524
+ ### Available Commands
525
+
526
+ | Command | Description |
527
+ |---------|-------------|
528
+ | `adversarial library list` | Browse available evaluators |
529
+ | `adversarial library install <provider>/<name>` | Install evaluator to project |
530
+ | `adversarial library check-updates` | Check for updates to installed evaluators |
531
+ | `adversarial library update <name>` | Update an evaluator (with diff preview) |
532
+
533
+ ### Running Installed Evaluators
534
+
535
+ Use the `--evaluator` flag to run any installed evaluator:
536
+
537
+ ```bash
538
+ # Run by name
539
+ adversarial evaluate --evaluator plan-evaluator task.md
540
+
541
+ # Short form
542
+ adversarial evaluate -e security-reviewer task.md
543
+
544
+ # Evaluators with model_requirement auto-resolve to best available model
545
+ adversarial evaluate -e gemini-flash task.md
546
+ ```
547
+
548
+ **How it works:**
549
+ - Looks up evaluator in `.adversarial/evaluators/*.yml`
550
+ - Uses the evaluator's model, prompt, and output settings
551
+ - Supports evaluator aliases
552
+ - If evaluator has `model_requirement`, resolves to best available model
553
+ - Falls back to legacy `model` field if resolution fails
554
+
555
+ **Without --evaluator flag**: Uses existing shell script behavior (backward compatible)
556
+
557
+ ### Philosophy: Copy, Don't Link
558
+
559
+ Installed evaluators are **copied** to your project, not referenced at runtime:
560
+ - Projects remain self-contained and work offline
561
+ - You can customize your local copies freely
562
+ - Updates are explicit and user-controlled
563
+
564
+ ### Provenance Tracking
565
+
566
+ Installed evaluators include metadata for tracking updates:
567
+
568
+ ```yaml
569
+ _meta:
570
+ source: adversarial-evaluator-library
571
+ source_path: google/gemini-flash
572
+ version: "1.2.0"
573
+ installed: "2026-02-03T10:00:00Z"
574
+
575
+ name: gemini-flash
576
+ # ... rest of evaluator config
577
+ ```
578
+
579
+ ### Options
580
+
581
+ ```bash
582
+ # Bypass cache (1-hour TTL by default)
583
+ adversarial library list --no-cache
584
+
585
+ # Force overwrite existing files
586
+ adversarial library install google/gemini-flash --force
587
+
588
+ # Update all outdated evaluators
589
+ adversarial library update --all
590
+
591
+ # Preview changes without applying
592
+ adversarial library update gemini-flash --diff-only
593
+ ```
594
+
438
595
  ## Custom Evaluators
439
596
 
440
597
  Starting with v0.6.0, you can define project-specific evaluators without modifying the package.
@@ -1,11 +1,18 @@
1
- adversarial_workflow/__init__.py,sha256=P9demPy3XsLoPU7tsgsfoJnOaCZaCw9I0PGYek5oOh8,596
1
+ adversarial_workflow/__init__.py,sha256=OQ1JUc1P3VTc-4mCNKIP7NYmEZ1XVzxL6DdAIhCk6zM,596
2
2
  adversarial_workflow/__main__.py,sha256=iM2jmO5YCFpGxfWiEhIYi_SsxVa0hRIE-MB7J0EcN7Y,120
3
- adversarial_workflow/cli.py,sha256=fmr46xQ3PpW0mPimYmyczADd_EjilzoGttmv7dKw3DE,110577
4
- adversarial_workflow/evaluators/__init__.py,sha256=A9ZKUmjSMfyvEu6jDzYAFLxfkt_OQ4RGA10Bv_eO2i4,1267
3
+ adversarial_workflow/cli.py,sha256=DbDPxl5tKgFa44lRW8fzIx7vPO4Eyv5PQKF6NsUHO1A,123004
4
+ adversarial_workflow/evaluators/__init__.py,sha256=wP1mLvw6VgloQgtvlxoiqy8DvwNMUn5EWWxPDoigtZc,1580
5
5
  adversarial_workflow/evaluators/builtins.py,sha256=u5LokYLe8ruEW2tunhOQaNSkpcZ9Ee2IeTkaC0dZDSY,1102
6
- adversarial_workflow/evaluators/config.py,sha256=H_4vkto07rAqnz0qEYdzN_DH6WbvRPMIEdkEOFE58UI,1651
7
- adversarial_workflow/evaluators/discovery.py,sha256=Fe_mj7S4aIzXuib28b_uLu3vH8AMAy1B3MDyssi3QU0,7953
8
- adversarial_workflow/evaluators/runner.py,sha256=7zui3ec190pBC4VB3HRxVgexJJff5F2V6oxgJiMv2tc,9312
6
+ adversarial_workflow/evaluators/config.py,sha256=5aCxhUIw-w34XNZmsxZ-TCXznxLMAlTzgxFCBjVGTuo,3206
7
+ adversarial_workflow/evaluators/discovery.py,sha256=VAD16fDaBuVbtqYxR4_Bj95yP1vyB4oaeV3QVULQ2qk,12087
8
+ adversarial_workflow/evaluators/resolver.py,sha256=8RJYNyaTIl8BNk5r3-LTjfM5hRivSvmm4ZtSZpFE4Jo,7075
9
+ adversarial_workflow/evaluators/runner.py,sha256=VPpQpd0lTDA3Z8zGlBqbKM5BIUqyTDYJHgBVVO_gzxY,10127
10
+ adversarial_workflow/library/__init__.py,sha256=9s63oT4Hi1g8ZVMyIVG8Bq91L1kP70UVvE7YTFDQILs,1410
11
+ adversarial_workflow/library/cache.py,sha256=6X0MoBjpMMjEjeoA3Inevb4dMmzHamwlYomEuPa6T54,4861
12
+ adversarial_workflow/library/client.py,sha256=_PewGNdKG8dY73rwCNQKjTfazjAF-qIlCa7UuIG-mUM,7319
13
+ adversarial_workflow/library/commands.py,sha256=-kzRqzT4duZnIKQSzlWHbNRABe879UXlsdKWVmpmMyU,27548
14
+ adversarial_workflow/library/config.py,sha256=SFCw3ZzVmzT-DGuPZVmi-tCO7OHTrqnZlKQ9u_Vk0Nw,2720
15
+ adversarial_workflow/library/models.py,sha256=NQCFp8SG_s_mom6daao2T0rgA3mdKihl7ne8kpdNytI,3752
9
16
  adversarial_workflow/templates/.aider.conf.yml.template,sha256=jT2jWIgsnmS3HLhoQWMTO3GV07bUcsT2keYw60jqiDw,183
10
17
  adversarial_workflow/templates/.env.example.template,sha256=TmTlcgz44uZqIbqgXqdfHMl-0vVn96F_EGNohClFkb8,1821
11
18
  adversarial_workflow/templates/README.template,sha256=FQAMPO99eIt_kgQfwhGHcrK736rm_MEvWSbPnqBSjAE,1349
@@ -21,13 +28,14 @@ adversarial_workflow/templates/agent-context/agent-handoffs-minimal.json.templat
21
28
  adversarial_workflow/templates/agent-context/agent-handoffs.json.template,sha256=rdRX79xdNKyTGd8_g6pvSdQK5VC5sDiErq_OZFslSXI,3904
22
29
  adversarial_workflow/templates/agent-context/current-state.json.template,sha256=UH3SQGjXGNzScqHnQcPrhrI9ZvjQBC3oBp4S9TilzoY,2325
23
30
  adversarial_workflow/utils/__init__.py,sha256=Pnm-a_jqoMVOxHdvVWXeVrL0IKI-zkY7EAdbQmZAkSI,352
31
+ adversarial_workflow/utils/citations.py,sha256=Xhla-M3az4aXtd9F0UHyBXhTBDyYOo9Mb1IUi8nPLPs,18619
24
32
  adversarial_workflow/utils/colors.py,sha256=uRrG6KfIDBLo0F5_vPwms9NCm9-x8YXBiyZ4naCr868,160
25
33
  adversarial_workflow/utils/config.py,sha256=3VmF65ItUbFzbyAZ1RUoOtpS_t6n1wqIhKft8eSNsdw,1303
26
34
  adversarial_workflow/utils/file_splitter.py,sha256=kvWh0xVjd08fsEXgysoHd5zFwJHqs-JRKottO8scYCA,12381
27
35
  adversarial_workflow/utils/validation.py,sha256=0QfuRd-kurcadUCd9XQvO-N8RsmLp6ONQnc0vaQTUBA,2188
28
- adversarial_workflow-0.6.6.dist-info/licenses/LICENSE,sha256=M-dOQlre-NmicyPa55hYOJUW8roGpCKEgtq-z0z1KCA,1073
29
- adversarial_workflow-0.6.6.dist-info/METADATA,sha256=q-JQ92-0bIsBzxyCl5HzXEUtZseRy8HcO8XqMeoiSro,30832
30
- adversarial_workflow-0.6.6.dist-info/WHEEL,sha256=wUyA8OaulRlbfwMtmQsvNngGrxQHAvkKcvRmdizlJi0,92
31
- adversarial_workflow-0.6.6.dist-info/entry_points.txt,sha256=9H-iZ-yF1uKZ8P0G1suc6kWR0NvK7uPZJbhN7nvt1sE,62
32
- adversarial_workflow-0.6.6.dist-info/top_level.txt,sha256=8irutNxLRjUbTlzfAibIpz7_ovkkF2h8ES69NQpv24c,21
33
- adversarial_workflow-0.6.6.dist-info/RECORD,,
36
+ adversarial_workflow-0.9.0.dist-info/licenses/LICENSE,sha256=M-dOQlre-NmicyPa55hYOJUW8roGpCKEgtq-z0z1KCA,1073
37
+ adversarial_workflow-0.9.0.dist-info/METADATA,sha256=CqcZkNusUA4JMhuLW77vWsDPznKLx_uH4Gkqav-f4Zc,35538
38
+ adversarial_workflow-0.9.0.dist-info/WHEEL,sha256=wUyA8OaulRlbfwMtmQsvNngGrxQHAvkKcvRmdizlJi0,92
39
+ adversarial_workflow-0.9.0.dist-info/entry_points.txt,sha256=9H-iZ-yF1uKZ8P0G1suc6kWR0NvK7uPZJbhN7nvt1sE,62
40
+ adversarial_workflow-0.9.0.dist-info/top_level.txt,sha256=8irutNxLRjUbTlzfAibIpz7_ovkkF2h8ES69NQpv24c,21
41
+ adversarial_workflow-0.9.0.dist-info/RECORD,,