agentaudit 3.9.44 → 3.9.45

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "agentaudit",
3
- "version": "3.9.44",
3
+ "version": "3.9.45",
4
4
  "description": "Security scanner for AI packages — MCP server + CLI",
5
5
  "type": "module",
6
6
  "bin": {
@@ -409,12 +409,9 @@ If **any** fails → real vulnerability (`by_design: false`).
409
409
 
410
410
  ## 3.10 Final Triage
411
411
 
412
- ### Finding Count Cap: Maximum 8 real findings per audit.
412
+ ### Finding Quality Check
413
413
 
414
- If more than 8 candidates after triage:
415
- 1. Keep highest severity + highest confidence
416
- 2. Merge ONLY when same pattern_id + same file
417
- 3. Drop LOW-confidence findings first
414
+ Report ALL genuine findings — do not artificially limit the count. If a package has 20 real vulnerabilities, report all 20. However, if you have more than 15 candidates, double-check each against the Self-Check (§3.1) to ensure every finding has concrete evidence and is not a duplicate.
418
415
 
419
416
  ### Anti-Merging Rules
420
417
 
@@ -654,10 +651,6 @@ Consult these patterns during Phase 2 evidence collection. Remember: a pattern m
654
651
  - risk_score > 50 for a package with no confirmed exploit path
655
652
  - Multiple credential-config findings for the same .env/env-var system — merge or drop
656
653
 
657
- ## Ideal Distribution (benchmark)
654
+ ## Quality Guidance
658
655
 
659
- - ~60-70% of packages: `safe` (0-25 risk)
660
- - ~20-25%: `caution` (26-50)
661
- - ~5-10%: `unsafe` (51-100) — only confirmed malware or severe vulnerabilities
662
- - CRITICAL findings in <5% of audits
663
- - Average findings per audit: 1-3 (not 5-10)
656
+ Judge each audit on its own merits. A clean package should have 0 findings; a heavily vulnerable package may have 20+. Do not target a specific distribution — report what you find with evidence.