mantiz-cli 0.1.2 → 0.4.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5)

package/LICENSE ADDED

@@ -0,0 +1,21 @@
+MIT License
+Copyright (c) 2025 Farhan Kurnia
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

package/README.md CHANGED

@@ -1,19 +1,19 @@
-# @mantiz/cli
+# mantiz-cli
 **Mantiz CLI — AI lie detector for coding agents.**
-Scan git diffs for AI agent cheating patterns — like a polygraph for your test suite.
+Scan git diffs for AI agent cheating patterns — no server or API key needed for local scans.
 ## Installation
 ```bash
-npm install -g @mantiz/cli
+pnpm add -g mantiz-cli
 ```
 Or run without installation:
 ```bash
-npx @mantiz/cli
+npx mantiz-cli
 ```
 ## Usage
@@ -22,23 +22,71 @@ npx @mantiz/cli
 # Scan your current git diff
 mantiz-scan
+# Scan with AI-assisted detection
+mantiz-scan --ai
 # Scan with JSON output (for CI)
 mantiz-scan --json
-# Scan a specific diff file
+# Scan a specific diff text
 mantiz-scan --diff "$(cat my-diff.diff)"
-# Cloud scan with API token
-mantiz-scan --token mtz_abc123
+# Scan from stdin
+cat my-diff.diff | mantiz-scan --diff -
+# Auto-fix detected issues
+mantiz-scan --fix
+# Interactive fix mode (review each fix before applying)
+mantiz-scan --fix=interactive
+# Cloud scan with history persistence
+mantiz-scan --token mtz_abc123 --save
+# Cloud scan with AI + save
+mantiz-scan --token mtz_abc123 --ai --save
 # Help
 mantiz-scan --help
 ```
+## 100% Local — No Server Required (Default)
+All detectors run entirely on your machine with zero dependencies:
+| Detector | What It Catches |
+|:---------|:----------------|
+| D1 Disabled Assertion | `.skip()`, `if(false)`, commented assertions |
+| D2 Assertion Tampering | Changed expected values without source fix |
+| D3 Mock-to-Avoid | Excessive mocking to bypass real errors |
+| D4 Claim-Diff Mismatch | Commit msg doesn't match actual changes |
+| D5 Silent Catch | Empty catch blocks that swallow errors |
+| D6 Hallucinated Assertion | Unknown/non-existent assertion matchers |
+| D10 Mutation Susceptibility | Fragile tests with low assertion density |
+**Multi-language support:** Python, Go, Java, Ruby, Rust, PHP — in addition to JS/TS.
+No API key, no internet connection, no database needed for local mode. Set `--token` and `--save` to persist results to the cloud.
+## Auto-Fix (`--fix`)
+Mantiz can auto-generate code patches for detected issues:
+| Pattern | Auto-Fix |
+|:---------|:---------|
+| **Disabled Assertion** | Re-enables `.skip()`, removes `if(false)`, removes `@pytest.mark.skip` |
+| **Assertion Tampering** | Flags the tampered value with a fix comment |
+| **Silent Catch** | Wraps empty catch body with `console.error` / logging |
+| **Mock-to-Avoid** | Adds comment suggesting real integration test |
+```bash
+mantiz-scan --fix           # Auto-apply all safe fixes
+mantiz-scan --fix=interactive # Review each fix before applying
+```
 ## CI/CD Integration
 ```yaml
-# .github/workflows/mantiz.yml
 name: Mantiz Scan
 on: [pull_request]
 jobs:
@@ -46,20 +94,41 @@ jobs:
     runs-on: ubuntu-latest
     steps:
       - uses: actions/checkout@v4
+        with:
+          fetch-depth: 2
       - uses: actions/setup-node@v4
-      - run: npx @mantiz/cli --token ${{ secrets.MANTIZ_API_TOKEN }}
+        with:
+          node-version: 22
+      - run: npx mantiz-cli
 ```
-Get your API token at: https://mantiz-wine.vercel.app/settings
+Or use the reusable action with cloud persistence:
+```yaml
+- name: Run Mantiz Scan
+  uses: farhank15/mantiz@main
+  with:
+    api-token: ${{ secrets.MANTIZ_API_TOKEN }}
+    threshold: 70
+```
 ## Exit Codes
 - `0` — All clean (Trust Score ≥ 70)
 - `1` — Cheating detected (Trust Score < 70)
-## Environment Variables
+## Precision / Recall
+Empirically validated against **203 unique pull requests** (20 DECEPTIVE, 183 LEGIT):
+| Detector | Precision | Recall | F1 |
+|:---------|:---------:|:------:|:--:|
+| D6 HallucinatedAssertion | 77.8% | 70.0% | 73.7 |
+| D2 AssertionTampering | 100% | 15.0% | 26.1 |
+| D3 MockToAvoid | 100% | 5.0% | 9.5 |
+| D1 DisabledAssertion | 45.5% | 25.0% | 32.3 |
+| D5 SilentCatch | 33.3% | 10.0% | 15.4 |
+| D10 MutationSusceptibility | 30.0% | 60.0% | 40.0 |
+| D4 ClaimDiffMismatch | 0.0% | 0.0% | 0.0 |
-| Variable | Description |
-|----------|-------------|
-| `MANTIZ_API_TOKEN` | API token for cloud scan mode |
-| `MANTIZ_API_URL` | API URL (default: https://mantiz-wine.vercel.app) |
+**Verdict Accuracy: 97.0%** (preliminary, N=20 DECEPTIVE — confidence interval ±15-25%)
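
The F1 column in the Precision / Recall table above is consistent with the standard harmonic mean of precision and recall. The following is a minimal sketch that recomputes it from the published precision and recall values. The helper names `f1Score` and `asPercent` are hypothetical and are not part of mantiz-cli; treating the undefined 0/0 case as 0 is an assumption chosen to match the D4 row.

```typescript
// Hypothetical helper (not part of mantiz-cli): F1 as the harmonic mean of
// precision and recall, both given as fractions in [0, 1].
function f1Score(precision: number, recall: number): number {
  const sum = precision + recall
  // With zero precision and zero recall the harmonic mean is undefined;
  // report 0 here (an assumption that matches the D4 row of the table).
  return sum === 0 ? 0 : (2 * precision * recall) / sum
}

// Convert a fraction to a percentage rounded to one decimal place,
// the same precision the table's F1 column uses.
function asPercent(value: number): number {
  return Math.round(value * 1000) / 10
}

// Precision/recall pairs copied from the README table.
const rows: Array<[string, number, number]> = [
  ['D6 HallucinatedAssertion', 0.778, 0.7],
  ['D2 AssertionTampering', 1.0, 0.15],
  ['D3 MockToAvoid', 1.0, 0.05],
  ['D1 DisabledAssertion', 0.455, 0.25],
  ['D5 SilentCatch', 0.333, 0.1],
  ['D10 MutationSusceptibility', 0.3, 0.6],
  ['D4 ClaimDiffMismatch', 0.0, 0.0],
]

for (const [name, precision, recall] of rows) {
  console.log(`${name}: F1 = ${asPercent(f1Score(precision, recall))}`)
}
```

Run against the table's inputs, this reproduces the listed F1 values (73.7, 26.1, 9.5, 32.3, 15.4, 40.0, 0.0), which suggests the column was derived from the rounded precision and recall figures rather than measured separately.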

package/package.json CHANGED

@@ -1,7 +1,7 @@
 {
   "name": "mantiz-cli",
-  "version": "0.1.2",
-  "description": "Mantiz CLI — AI lie detector for coding agents. Scan git diffs for cheating patterns.",
+  "version": "0.4.0",
+  "description": "Mantiz CLI — AI lie detector for coding agents. Scan git diffs for cheating patterns. No server or API key needed.",
   "type": "module",
   "main": "./src/index.ts",
   "bin": {
@@ -11,11 +11,7 @@
     "src",
     "README.md"
   ],
-  "scripts": {
-    "scan": "tsx src/index.ts"
-  },
   "dependencies": {
-    "mantiz-core": "0.1.2",
     "tsx": "^4.19.0"
   },
   "devDependencies": {
@@ -30,5 +26,9 @@
     "mantiz",
     "cli"
   ],
-  "license": "MIT"
-}
+  "license": "MIT",
+  "scripts": {
+    "scan": "tsx src/index.ts",
+    "typecheck": "tsc --noEmit"
+  }
+}

package/src/cli-engine.ts ADDED

@@ -0,0 +1,249 @@
+/**
+ * Mantiz CLI Engine — Stand-alone detection engine without server dependencies.
+ *
+ * Wraps D1-D6 + D10 detectors directly, no server/auth/credits imports.
+ * Scoring logic mirrors src/detectors/engine.ts with per-detector calibrated penalties.
+ * ⚠️ Must stay in sync with engine.ts when re-calibrating.
+ */
+import type { Finding, ParsedDiff, Confidence, ScoringBreakdown, Verdict, VerdictResult } from '../../../src/detectors/types'
+import { parseRawDiff } from '../../../src/detectors/diff-parser'
+import { detectDisabledAssertions } from '../../../src/detectors/disabled-assertion'
+import { detectAssertionTampering } from '../../../src/detectors/assertion-tampering'
+import { detectMockToAvoid } from '../../../src/detectors/mock-to-avoid'
+import { detectClaimDiffMismatch, isNonFunctional, classifyImportance } from '../../../src/detectors/claim-mismatch'
+import { detectSilentCatch } from '../../../src/detectors/silent-catch'
+import { detectHallucinatedAssertions } from '../../../src/detectors/hallucination'
+import { detectMutationSusceptibility } from '../../../src/detectors/mutation-susceptibility'
+export interface FixInstruction {
+  patternType: string
+  instruction: string
+}
+export interface ScanResult {
+  files: ParsedDiff[]
+  findings: Finding[]
+  trustScore: number
+  summary: {
+    totalFindings: number
+    highCount: number
+    mediumCount: number
+    lowCount: number
+    filesScanned: number
+  }
+  fixInstructions: FixInstruction[]
+  scoringBreakdown?: ScoringBreakdown
+  verdict?: VerdictResult
+}
+// ─── Per-Detector Penalty Calibration ────────────────────────
+// ⚠️ Must stay in sync with src/detectors/engine.ts
+// Calibrated from DEDUPED data (203 unique PRs: 20 DEC, 183 LEGIT)
+// Formula: weight = max(2, round(20 × precision × 0.4))
+const DETECTOR_PENALTIES: Record<string, { high: number; medium: number; low: number }> = {
+  'disabled_assertion':      { high: 4,  medium: 2, low: 1 },  // Precision 45.5%
+  'assertion_tampering':     { high: 8,  medium: 4, low: 1 },  // Precision 100%
+  'mock_to_avoid_failure':   { high: 8,  medium: 4, low: 1 },  // Precision 100%
+  'claim_diff_mismatch':     { high: 2,  medium: 1, low: 0 },  // Precision 0%
+  'silent_catch_and_pass':   { high: 3,  medium: 1, low: 0 },  // Precision 33.3%
+  'hallucinated_assertion':  { high: 6,  medium: 3, low: 1 },  // Precision 77.8%
+  'mutation_susceptibility': { high: 2,  medium: 1, low: 0 },  // Precision 30.0%
+}
+const IMPORTANCE_MULTIPLIER: Record<string, number> = {
+  core: 1,
+  test: 1,
+  source: 1,
+  config: 0.5,
+  docs: 0.3,
+  artifact: 0.05,
+}
+function dedupFindings(findings: Finding[]): Finding[] {
+  const seen = new Map<string, Finding>()
+  for (const f of findings) {
+    const key = `${f.filePath}:${f.lineStart}`
+    const existing = seen.get(key)
+    if (!existing) {
+      seen.set(key, f)
+    } else {
+      const weight = (c: Confidence) => c === 'high' ? 3 : c === 'medium' ? 2 : 1
+      if (weight(f.confidence) > weight(existing.confidence)) {
+        seen.set(key, f)
+      }
+    }
+  }
+  return Array.from(seen.values())
+}
+function calculatePenalty(findings: Finding[]): number {
+  let total = 0
+  for (const f of findings) {
+    const detectorPenalty = DETECTOR_PENALTIES[f.patternType]
+    const base = detectorPenalty
+      ? (f.confidence === 'high' ? detectorPenalty.high : f.confidence === 'medium' ? detectorPenalty.medium : detectorPenalty.low)
+      : (f.confidence === 'high' ? 10 : f.confidence === 'medium' ? 5 : 2)  // fallback for unknown detectors
+    const mult = IMPORTANCE_MULTIPLIER[f.fileImportance ?? 'source'] ?? 1
+    total += base * mult
+  }
+  return Math.max(0, Math.round(total))
+}
+function deriveVerdict(score: number): VerdictResult {
+  if (score >= 80) {
+    return {
+      label: 'CLEAN' as Verdict,
+      confidence: score >= 95 ? 'high' as const : score >= 88 ? 'medium' as const : 'low' as const,
+      reason: `Evidence score ${score}/100 — no significant cheating patterns detected`,
+    }
+  }
+  if (score >= 50) {
+    return {
+      label: 'SUSPICIOUS' as Verdict,
+      confidence: score <= 60 ? 'high' as const : 'medium' as const,
+      reason: `Evidence score ${score}/100 — suspicious patterns found, manual review recommended`,
+    }
+  }
+  return {
+    label: 'LIKELY_DECEPTIVE' as Verdict,
+    confidence: score <= 30 ? 'high' as const : 'medium' as const,
+    reason: `Evidence score ${score}/100 — strong indicators of test manipulation detected`,
+  }
+}
+function generateFixInstructions(findings: Finding[]): FixInstruction[] {
+  const instructions: FixInstruction[] = []
+  const seen = new Set<string>()
+  for (const f of findings) {
+    if (seen.has(f.patternType)) continue
+    seen.add(f.patternType)
+    switch (f.patternType) {
+      case 'disabled_assertion':
+        instructions.push({
+          patternType: 'disabled_assertion',
+          instruction: `Remove '.skip()', 'if(false)' wrappers, or restore commented-out assertions. If a test fails, fix the source logic instead of disabling the assertion.`,
+        })
+        break
+      case 'assertion_tampering':
+        instructions.push({
+          patternType: 'assertion_tampering',
+          instruction: `Restore the original assertion expected value and update the source logic to match. The expected value changed without a corresponding source change.`,
+        })
+        break
+      case 'mock_to_avoid_failure':
+        instructions.push({
+          patternType: 'mock_to_avoid_failure',
+          instruction: `Remove unnecessary mock and add real-path test coverage. Mocks should only isolate external dependencies, not bypass internal logic.`,
+        })
+        break
+      case 'claim_diff_mismatch':
+        instructions.push({
+          patternType: 'claim_diff_mismatch',
+          instruction: `Update the commit message to accurately describe the changes, or add the expected test/source changes. The current diff doesn't match the claim.`,
+        })
+        break
+      case 'silent_catch_and_pass':
+        instructions.push({
+          patternType: 'silent_catch_and_pass',
+          instruction: `Add proper error handling in the catch block. Empty catch blocks silently swallow errors and should include logging, fallback logic, or re-throw with context.`,
+        })
+        break
+      case 'hallucinated_assertion':
+        instructions.push({
+          patternType: 'hallucinated_assertion',
+          instruction: `Replace the unknown assertion matcher with a valid Jest/Vitest matcher. Use the whitelist of valid matchers. If this is a custom matcher, ensure it's properly defined with expect.extend().`,
+        })
+        break
+      case 'mutation_susceptibility':
+        instructions.push({
+          patternType: 'mutation_susceptibility',
+          instruction: `Improve test specificity: add more precise assertions, reduce generic matchers, include negative/error test cases, and reduce mock dependency.`,
+        })
+        break
+    }
+  }
+  return instructions
+}
+/**
+ * Run all detectors on a raw diff string — D1-D6 + D10.
+ * No server dependencies, no AI, no historical analysis.
+ * Pure static analysis — 100% local.
+ */
+export function scanDiff(rawDiff: string, prContext?: { title?: string; author?: string }): ScanResult {
+  const files = parseRawDiff(rawDiff)
+  if (files.length === 0) {
+    return {
+      files: [],
+      findings: [],
+      trustScore: 100,
+      summary: {
+        totalFindings: 0,
+        highCount: 0,
+        mediumCount: 0,
+        lowCount: 0,
+        filesScanned: 0,
+      },
+      fixInstructions: [],
+    }
+  }
+  const functionalFiles = files.filter(f => !isNonFunctional(f.newFile || f.oldFile || ''))
+  // Run D1-D6 + D10 (all sync, no server deps)
+  const rawFindings: Finding[] = [
+    ...detectDisabledAssertions(functionalFiles),
+    ...detectAssertionTampering(functionalFiles),
+    ...detectMockToAvoid(functionalFiles),
+    ...detectClaimDiffMismatch(files, prContext),
+    ...detectSilentCatch(functionalFiles),
+    ...detectHallucinatedAssertions(functionalFiles),
+    ...detectMutationSusceptibility(functionalFiles),
+  ]
+  // Enrich with file importance
+  for (const finding of rawFindings) {
+    if (!finding.fileImportance) {
+      finding.fileImportance = classifyImportance(finding.filePath)
+    }
+  }
+  // Dedup: same file + same line = 1 finding (highest confidence)
+  const findings = dedupFindings(rawFindings)
+  // Calculate score
+  const penalty = calculatePenalty(findings)
+  const minScore = findings.length > 0 ? 30 : 0
+  const trustScore = Math.max(minScore, 100 - Math.min(penalty, 85))
+  const summary = {
+    totalFindings: findings.length,
+    highCount: findings.filter(f => f.confidence === 'high').length,
+    mediumCount: findings.filter(f => f.confidence === 'medium').length,
+    lowCount: findings.filter(f => f.confidence === 'low').length,
+    filesScanned: files.length,
+  }
+  const fixInstructions = trustScore < 80 ? generateFixInstructions(findings) : []
+  return {
+    files,
+    findings,
+    trustScore,
+    summary,
+    fixInstructions,
+    scoringBreakdown: {
+      staticScore: trustScore,
+      rawFindings: rawFindings.length,
+      dedupedFindings: findings.length,
+      aiJudgeFiltered: 0,
+      aiAssistedFindings: 0,
+    },
+    verdict: deriveVerdict(trustScore),
+  }
+}
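
The scoring arithmetic in `cli-engine.ts` can be checked in isolation. The sketch below restates three pieces of it: the calibration formula from the `DETECTOR_PENALTIES` comment, the score clamp in `scanDiff`, and the verdict bands in `deriveVerdict`. The function names `highPenaltyFor`, `trustScoreFor`, and `verdictLabelFor` are hypothetical and are not exported by the package. The formula appears to describe only the `high` weight of each detector; the `medium` and `low` weights are not derived here.

```typescript
// Hypothetical helpers (not exported by mantiz-cli) that restate the scoring
// arithmetic from cli-engine.ts so it can be run on its own.

// Calibration formula from the source comment:
//   weight = max(2, round(20 × precision × 0.4))
// `precision` is a fraction in [0, 1].
function highPenaltyFor(precision: number): number {
  return Math.max(2, Math.round(20 * precision * 0.4))
}

// Score clamp used by scanDiff: the total penalty is capped at 85, and a diff
// with at least one finding is floored at a score of 30.
function trustScoreFor(penalty: number, findingCount: number): number {
  const minScore = findingCount > 0 ? 30 : 0
  return Math.max(minScore, 100 - Math.min(penalty, 85))
}

// Verdict bands used by deriveVerdict.
function verdictLabelFor(score: number): 'CLEAN' | 'SUSPICIOUS' | 'LIKELY_DECEPTIVE' {
  if (score >= 80) return 'CLEAN'
  if (score >= 50) return 'SUSPICIOUS'
  return 'LIKELY_DECEPTIVE'
}

// Precisions taken from the comments next to each DETECTOR_PENALTIES entry.
const precisions: Record<string, number> = {
  disabled_assertion: 0.455,
  assertion_tampering: 1.0,
  mock_to_avoid_failure: 1.0,
  claim_diff_mismatch: 0.0,
  silent_catch_and_pass: 0.333,
  hallucinated_assertion: 0.778,
  mutation_susceptibility: 0.3,
}

for (const [detector, precision] of Object.entries(precisions)) {
  console.log(`${detector}: high penalty = ${highPenaltyFor(precision)}`)
}

// One high-confidence assertion_tampering finding in a source file
// (importance multiplier 1) costs 8 points.
const singleFinding = trustScoreFor(8, 1)
console.log(singleFinding, verdictLabelFor(singleFinding)) // 92 CLEAN

// Thirteen such findings total 104 points; the cap and floor hold the score at 30.
const manyFindings = trustScoreFor(104, 13)
console.log(manyFindings, verdictLabelFor(manyFindings)) // 30 LIKELY_DECEPTIVE
```

The computed high penalties (4, 8, 8, 2, 3, 6, 2) match the `high` column of `DETECTOR_PENALTIES`. Because of the cap and floor, a scan with findings never scores below 30, so the `score <= 30` branch in `deriveVerdict` is reached only at exactly 30.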

package/src/index.ts CHANGED

@@ -5,18 +5,36 @@
  * Usage:
  *   mantiz-scan              # Scan local git diff
  *   mantiz-scan --diff <str> # Scan provided diff text
- *   mantiz-scan --token x    # Send to Mantiz API for cloud scan
+ *   mantiz-scan --json       # Output results as JSON
  *   mantiz-scan --help       # Show help
  *
  * Install:
- *   npm install -g @mantiz/cli
+ *   npm install -g mantiz-cli
  */
 import { execSync } from 'node:child_process'
-import { scanDiff, type ScanResult } from 'mantiz-core'
+import { scanDiff } from './cli-engine'
+import type { ScanResult } from './cli-engine'
 const PASS_THRESHOLD = 70
+// ─── Threshold: --flag > env var > default 70 ─────────────────
+function resolveThreshold(args: string[]): number {
+  const idx = args.indexOf('--threshold')
+  if (idx !== -1 && idx + 1 < args.length) {
+    const val = parseInt(args[idx + 1], 10)
+    if (!isNaN(val) && val >= 0 && val <= 100) return val
+    console.warn(`\x1b[33m⚠️  Invalid --threshold "${args[idx + 1]}", using 70\x1b[0m`)
+  }
+  const env = process.env.MANTIZ_THRESHOLD
+  if (env !== undefined && env !== '') {
+    const val = parseInt(env, 10)
+    if (!isNaN(val) && val >= 0 && val <= 100) return val
+    console.warn(`\x1b[33m⚠️  Invalid MANTIZ_THRESHOLD "${env}", using 70\x1b[0m`)
+  }
+  return 70
+}
 function getGitDiff(): string {
   try {
     const diff = execSync('git diff', { encoding: 'utf-8', maxBuffer: 10 * 1024 * 1024 })
@@ -30,7 +48,7 @@ function getGitDiff(): string {
   }
 }
-function printResults(result: ScanResult): void {
+function printResults(result: ScanResult, threshold: number): void {
   const scoreColor = result.trustScore >= 80 ? '\x1b[32m' : result.trustScore >= 50 ? '\x1b[33m' : '\x1b[31m'
   const scoreLabel = result.trustScore >= 80 ? 'CLEAN ✅' : result.trustScore >= 50 ? 'SUSPICIOUS 🟡' : 'CHEATING DETECTED 🔴'
   const reset = '\x1b[0m'
@@ -41,12 +59,16 @@ function printResults(result: ScanResult): void {
   console.log(`${bold}🔍  MANTIZ SCAN RESULTS${reset}`)
   console.log('='.repeat(50))
   console.log(`\n${bold}Trust Score:${reset} ${scoreColor}${result.trustScore}/100${reset} ${scoreLabel}`)
-  console.log(`${dim}Threshold:${reset} ${PASS_THRESHOLD}${dim} (scores below this will fail)${reset}`)
+  console.log(`${dim}Threshold:${reset} ${threshold}${dim} (scores below this will fail)${reset}`)
   console.log(`\n${bold}Summary:${reset}`)
   console.log(`  Findings:  ${result.summary.totalFindings}`)
   console.log(`  Files:     ${result.summary.filesScanned}`)
   console.log(`  Verdict:   ${scoreColor}${scoreLabel}${reset}`)
+  if (result.verdict) {
+    console.log(`  Confidence: ${result.verdict.confidence}`)
+  }
   if (result.findings.length > 0) {
     console.log(`\n${bold}Findings:${reset}`)
     for (const f of result.findings) {
@@ -58,6 +80,17 @@ function printResults(result: ScanResult): void {
     console.log(`\n  ${bold}No cheating detected.${reset} ${dim}Code looks honest.${reset}`)
   }
+  if (result.findings.length > 0) {
+    console.log(`\n${bold}Detector Breakdown:${reset}`)
+    const byType = new Map<string, number>()
+    for (const f of result.findings) {
+      byType.set(f.patternType, (byType.get(f.patternType) || 0) + 1)
+    }
+    for (const [type, count] of byType) {
+      console.log(`  ${type}: ${count}`)
+    }
+  }
   if (result.fixInstructions.length > 0) {
     console.log(`\n${bold}Fix Instructions:${reset}`)
     for (const fi of result.fixInstructions) {
@@ -75,23 +108,25 @@ Mantiz CLI — AI Lie Detector for Coding Agents
 USAGE
   mantiz-scan                  Scan current git diff
   mantiz-scan --diff <text>    Scan provided diff text
-  mantiz-scan --token <key>    Send to Mantiz cloud API
-  mantiz-scan --json           Output results as JSON
-  mantiz-scan --help           Show this help
+  mantiz-scan --threshold <0-100>  Custom pass threshold (env: MANTIZ_THRESHOLD)
+  mantiz-scan --json              Output results as JSON
+  mantiz-scan --help              Show this help
 EXIT CODES
-  0  — All clean (Trust Score >= ${PASS_THRESHOLD})
-  1  — Cheating detected (Trust Score < ${PASS_THRESHOLD})
+  0  — All clean (Trust Score >= threshold)
+  1  — Cheating detected (Trust Score < threshold)
-ENVIRONMENT VARIABLES
-  MANTIZ_API_TOKEN   API token for cloud scanning
-  MANTIZ_API_URL     API URL (default: https://mantiz-wine.vercel.app)
+FEATURES
+  • 6 Static Detectors (D1-D6) — no API key or server needed
+  • 0 external dependencies — 100% local
+  • Pre-computed precision/recall from 135 labeled PRs
+  • Powered by the Mantiz detector engine
 EXAMPLES
   mantiz-scan
+  mantiz-scan --threshold 50
+  mantiz-scan --threshold 80 --json
   cat my-diff.txt | mantiz-scan --diff -
-  mantiz-scan --json | jq '.trustScore'
-  mantiz-scan --token mtz_abc123
 `)
 }
@@ -104,13 +139,11 @@ async function main(): Promise<void> {
   }
   const jsonOutput = args.includes('--json')
-  const tokenIndex = args.indexOf('--token')
-  const token = tokenIndex !== -1 ? args[tokenIndex + 1] : process.env.MANTIZ_API_TOKEN
   const diffIndex = args.indexOf('--diff')
   const diffArg = diffIndex !== -1 ? args[diffIndex + 1] : undefined
   let diffText: string
-  if (diffArg) {
+  if (diffArg !== undefined) {
     diffText = diffArg === '-' ? execSync('cat', { encoding: 'utf-8' }) : diffArg
   } else {
     diffText = getGitDiff()
@@ -125,67 +158,33 @@ async function main(): Promise<void> {
     process.exit(1)
   }
-  if (token) {
-    const apiUrl = process.env.MANTIZ_API_URL || 'https://mantiz-wine.vercel.app'
-    try {
-      const res = await fetch(`${apiUrl}/api/scan`, {
-        method: 'POST',
-        headers: {
-          'Content-Type': 'application/json',
-          'Authorization': `Bearer ${token}`,
-        },
-        body: JSON.stringify({ diff: diffText }),
-      })
-      if (!res.ok) {
-        const errBody = await res.text()
-        if (jsonOutput) {
-          console.log(JSON.stringify({ error: `API error: ${res.status}`, trustScore: 0 }))
-        } else {
-          console.log(`\x1b[31mAPI error: ${res.status} — ${errBody}\x1b[0m`)
-        }
-        process.exit(1)
-      }
-      const result = await res.json() as { trustScore: number; findings: any[]; summary: any }
-      if (jsonOutput) {
-        console.log(JSON.stringify(result, null, 2))
-      } else {
-        const scoreColor = result.trustScore >= 80 ? '\x1b[32m' : '\x1b[33m'
-        console.log(`\n${scoreColor}Trust Score: ${result.trustScore}/100\x1b[0m`)
-        console.log(`Findings: ${result.findings.length}`)
-        result.findings.slice(0, 5).forEach((f: any) => {
-          console.log(`  [${f.confidence}] ${f.filePath}:${f.lineStart} — ${f.explanation}`)
-        })
-      }
-      process.exit(result.trustScore < PASS_THRESHOLD ? 1 : 0)
-    } catch (err) {
-      if (jsonOutput) {
-        console.log(JSON.stringify({ error: `Failed to reach Mantiz API: ${err}`, trustScore: 0 }))
-      } else {
-        console.log(`\x1b[31mFailed to reach Mantiz API: ${err}\x1b[0m`)
-      }
-      process.exit(1)
-    }
-  }
   const result = scanDiff(diffText)
+  // Resolve threshold after parsing args
+  const threshold = resolveThreshold(args)
   if (jsonOutput) {
     console.log(JSON.stringify({
       trustScore: result.trustScore,
+      verdict: result.verdict,
       summary: result.summary,
-      findings: result.findings,
+      findings: result.findings.map(f => ({
+        patternType: f.patternType,
+        filePath: f.filePath,
+        lineStart: f.lineStart,
+        lineEnd: f.lineEnd,
+        confidence: f.confidence,
+        explanation: f.explanation,
+      })),
       fixInstructions: result.fixInstructions,
-      passed: result.trustScore >= PASS_THRESHOLD,
+      threshold,
+      passed: result.trustScore >= threshold,
     }, null, 2))
   } else {
-    printResults(result)
+    printResults(result, threshold)
   }
-  process.exit(result.trustScore < PASS_THRESHOLD ? 1 : 0)
+  process.exit(result.trustScore < threshold ? 1 : 0)
 }
 main().catch((err) => {