@artemiskit/cli 0.2.0 → 0.2.3

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/CHANGELOG.md CHANGED
@@ -1,5 +1,102 @@
1
1
  # @artemiskit/cli
2
2
 
3
+ ## 0.2.3
4
+
5
+ ### Patch Changes
6
+
7
+ - 37403aa: ## v0.2.3 - Cost Tracking & Compliance Features
8
+
9
+ ### Cost Tracking
10
+
11
+ - **Automatic cost estimation**: Run results now include estimated API costs based on token usage and model pricing data
12
+ - **Cost display in output**: Summary output shows total tokens and estimated cost for each run
13
+ - **`--budget` flag**: Set a maximum budget in USD for `run`, `redteam`, and `stress` commands - the command fails (exit code 1) if the estimated cost exceeds the budget
14
+
15
+ ### History Enhancements
16
+
17
+ - **`--show-cost` flag**: Display cost column and total in `history` command output
18
+ - Cost data is stored with each run for historical tracking
19
+
20
+ ### Markdown Export
21
+
22
+ - **`--export markdown` flag**: Export run and redteam results to compliance-ready markdown format
23
+ - **`--export-output` flag**: Specify custom output directory for exports (default: `./artemis-exports`)
24
+ - Markdown reports include:
25
+ - Summary table with pass/fail rates, latency, token usage, and cost metrics
26
+ - Detailed results for failed test cases (run) or vulnerabilities found (redteam)
27
+ - Configuration used for the run
28
+ - Redaction summary (if enabled)
29
+ - Recommendations for remediation (redteam)
30
+
31
+ ### CI/CD Integration
32
+
33
+ - Budget enforcement in pipelines: `akit run scenarios/ --ci --budget 5.00`
34
+ - Cost tracking in CI summary output with `ARTEMISKIT_COST_USD` variable
35
+ - Automatic markdown report generation for compliance documentation
36
+
37
+ - Updated dependencies [37403aa]
38
+ - @artemiskit/core@0.2.3
39
+ - @artemiskit/reports@0.2.3
40
+ - @artemiskit/adapter-openai@0.1.10
41
+ - @artemiskit/adapter-vercel-ai@0.1.10
42
+ - @artemiskit/redteam@0.2.3
43
+
44
+ ## 0.2.2
45
+
46
+ ### Patch Changes
47
+
48
+ - d5ca7c6: Add baseline command and CI mode for regression detection
49
+
50
+ ### New Features
51
+
52
+ - **Baseline Command**: New `akit baseline` command with `set`, `list`, `get`, `remove` subcommands
53
+
54
+ - Lookup by run ID (default) or scenario name (`--scenario` flag)
55
+ - Store and manage baseline metrics for regression comparison
56
+
57
+ - **CI Mode**: New `--ci` flag for machine-readable output
58
+
59
+ - Outputs environment variable format for easy parsing
60
+ - Auto-detects CI environments (GitHub Actions, GitLab CI, etc.)
61
+ - Suppresses colors and spinners
62
+
63
+ - **Summary Formats**: New `--summary` flag with `json`, `text`, `security` formats
64
+
65
+ - JSON summary for pipeline parsing
66
+ - Security summary for compliance reporting
67
+
68
+ - **Regression Detection**: New `--baseline` and `--threshold` flags
69
+ - Compare runs against saved baselines
70
+ - Configurable regression threshold (default 5%)
71
+ - Exit code 1 on regression detection
72
+
73
+ - Updated dependencies [d5ca7c6]
74
+ - @artemiskit/core@0.2.2
75
+ - @artemiskit/adapter-openai@0.1.9
76
+ - @artemiskit/adapter-vercel-ai@0.1.9
77
+ - @artemiskit/redteam@0.2.2
78
+ - @artemiskit/reports@0.2.2
79
+
80
+ ## 0.2.1
81
+
82
+ ### Patch Changes
83
+
84
+ - fix: improve LLM grader compatibility with reasoning models
85
+
86
+ - Remove temperature parameter from LLM grader (reasoning models like o1, o3, gpt-5-mini only support temperature=1)
87
+ - Increase maxTokens from 200 to 1000 to accommodate reasoning models that use tokens for internal thinking
88
+ - Improve grader prompt for stricter JSON-only output format
89
+ - Add fallback parsing for malformed JSON responses
90
+ - Add markdown code block stripping from grader responses
91
+ - Add `modelFamily` configuration option to Azure OpenAI provider for correct parameter detection when deployment names differ from model names
92
+
93
+ - Updated dependencies
94
+ - @artemiskit/core@0.2.1
95
+ - @artemiskit/adapter-openai@0.1.8
96
+ - @artemiskit/adapter-vercel-ai@0.1.8
97
+ - @artemiskit/redteam@0.2.1
98
+ - @artemiskit/reports@0.2.1
99
+
3
100
  ## 0.2.0
4
101
 
5
102
  ### Minor Changes