@artemiskit/cli 0.2.4 → 0.3.1

package/CHANGELOG.md CHANGED
@@ -1,5 +1,155 @@
  # @artemiskit/cli
 
+ ## 0.3.1
+
+ ### Patch Changes
+
+ - 29e29d6: Fix npm install error caused by unresolved workspace:\* dependencies
+
+ The published package contained workspace:\* protocol references for @artemiskit/adapter-deepagents and @artemiskit/adapter-langchain, which npm doesn't support. These are now properly resolved to version numbers during publish.
+
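To illustrate the kind of rewrite this fix describes, here is a minimal sketch that turns `workspace:` ranges into installable versions before publish. It assumes the substitution rules that pnpm documents for the `workspace:` protocol (`workspace:*` becomes the exact version, `workspace:^` and `workspace:~` keep their prefix). The `resolveWorkspaceDeps` helper, the `chalk` entry, and the `0.2.0` versions are hypothetical and are not taken from ArtemisKit's actual publish tooling.

```typescript
// Hypothetical helper: a sketch, not ArtemisKit's real publish script.
type DependencyMap = Record<string, string>;

// Rewrite `workspace:` ranges into versions that npm can install.
function resolveWorkspaceDeps(
  deps: DependencyMap,
  workspaceVersions: Record<string, string>,
): DependencyMap {
  const resolved: DependencyMap = {};
  for (const [name, range] of Object.entries(deps)) {
    if (!range.startsWith("workspace:")) {
      resolved[name] = range; // ordinary registry range, left untouched
      continue;
    }
    const version = workspaceVersions[name];
    if (version === undefined) {
      throw new Error(`No workspace version known for ${name}`);
    }
    const spec = range.slice("workspace:".length);
    if (spec === "*") {
      resolved[name] = version; // workspace:* -> exact version
    } else if (spec === "^" || spec === "~") {
      resolved[name] = `${spec}${version}`; // workspace:^ -> ^x.y.z
    } else {
      resolved[name] = spec; // workspace:^1.5.0 -> ^1.5.0
    }
  }
  return resolved;
}

// Illustrative input; package versions here are assumptions.
const published = resolveWorkspaceDeps(
  {
    "@artemiskit/adapter-langchain": "workspace:*",
    "@artemiskit/adapter-deepagents": "workspace:*",
    chalk: "^5.3.0",
  },
  {
    "@artemiskit/adapter-langchain": "0.2.0",
    "@artemiskit/adapter-deepagents": "0.2.0",
  },
);
console.log(published);
```

Throwing on an unknown workspace package is a deliberate choice in this sketch: a failed publish is easier to notice than a package that cannot be installed.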
+ ## 0.3.0
+
+ ### Minor Changes
+
+ - ## v0.3.0 - SDK, Guardian Mode & OWASP Compliance
+
+ This release delivers the full programmatic SDK, runtime protection with Guardian Mode, OWASP LLM Top 10 2025 attack vectors, and agentic framework adapters.
+
+ ### Programmatic SDK (`@artemiskit/sdk`)
+
+ The new SDK package provides a complete programmatic API for LLM evaluation:
+
+ - **ArtemisKit class** with `run()`, `redteam()`, and `stress()` methods
+ - **Jest integration** with custom matchers (`toPassAllCases`, `toHaveSuccessRate`, etc.)
+ - **Vitest integration** with identical matchers
+ - **Event handling** for real-time progress updates
+ - **13 custom matchers** for run, red team, and stress test assertions
+
+ ```typescript
+ import { ArtemisKit } from "@artemiskit/sdk";
+ import { jestMatchers } from "@artemiskit/sdk/jest";
+
+ expect.extend(jestMatchers);
+
+ const kit = new ArtemisKit({ provider: "openai", model: "gpt-4o" });
+ const results = await kit.run({ scenario: "./tests.yaml" });
+ expect(results).toPassAllCases();
+ ```
+
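For readers unfamiliar with custom matchers, the sketch below shows how a matcher such as `toHaveSuccessRate` could be written against Jest's matcher contract, which expects a function returning `{ pass, message }`. The `SketchRunResult` shape and the `minRate` argument (a fraction between 0 and 1) are assumptions made for illustration; the SDK's real result type and matcher signature may differ.

```typescript
// Assumed result shape; the real SDK result type may differ.
interface SketchRunResult {
  cases: { ok: boolean }[];
}

// Follows Jest's custom matcher contract: return { pass, message }.
function toHaveSuccessRate(received: SketchRunResult, minRate: number) {
  const total = received.cases.length;
  const passed = received.cases.filter((c) => c.ok).length;
  // An empty run counts as a 0% success rate in this sketch.
  const rate = total === 0 ? 0 : passed / total;
  return {
    pass: rate >= minRate,
    message: () =>
      `expected success rate >= ${minRate}, received ${rate.toFixed(2)} (${passed}/${total})`,
  };
}

// Three of four cases pass, so a 0.8 threshold fails.
const outcome = toHaveSuccessRate(
  { cases: [{ ok: true }, { ok: true }, { ok: true }, { ok: false }] },
  0.8,
);
console.log(outcome.pass, outcome.message());
// prints: false expected success rate >= 0.8, received 0.75 (3/4)
```

A function of this shape can be registered with Jest through `expect.extend({ toHaveSuccessRate })`.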
44
+ ### Guardian Mode (Runtime Protection)
45
+
46
+ New Guardian Mode provides runtime protection for AI/LLM applications:
47
+
48
+ - **Three operating modes**: `testing`, `guardian`, `hybrid`
49
+ - **Prompt injection detection** and blocking
50
+ - **PII detection & redaction** (email, SSN, phone, API keys)
51
+ - **Action validation** for agent tool/function calls
52
+ - **Intent classification** with risk assessment
53
+ - **Circuit breaker** for automatic blocking on repeated violations
54
+ - **Rate limiting** and **cost limiting**
55
+ - **Custom policies** via TypeScript or YAML
56
+
57
+ ```typescript
58
+ import { createGuardian } from "@artemiskit/sdk/guardian";
59
+
60
+ const guardian = createGuardian({ mode: "guardian", blockOnFailure: true });
61
+ const protectedClient = guardian.protect(myLLMClient);
62
+ ```
63
+
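As a rough picture of what PII redaction over the four listed categories can look like, here is a small regex-based sketch. It is not Guardian Mode's implementation: the `redactPii` function, the placeholder labels, the US-style SSN and phone formats, and the `sk-` API key prefix are all assumptions chosen for illustration, and real detectors need to cover many more formats.

```typescript
// Hypothetical rules; Guardian Mode's actual detectors are not shown in the changelog.
interface PiiRule {
  label: string;
  pattern: RegExp;
}

const PII_RULES: PiiRule[] = [
  { label: "EMAIL", pattern: /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g },
  { label: "SSN", pattern: /\b\d{3}-\d{2}-\d{4}\b/g }, // US format only
  { label: "PHONE", pattern: /\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b/g }, // US format only
  { label: "API_KEY", pattern: /\bsk-[A-Za-z0-9]{20,}\b/g }, // assumed key prefix
];

// Replace each match with a bracketed label and record what was found.
function redactPii(text: string): { redacted: string; found: string[] } {
  let redacted = text;
  const found: string[] = [];
  for (const { label, pattern } of PII_RULES) {
    redacted = redacted.replace(pattern, () => {
      found.push(label);
      return `[${label}]`;
    });
  }
  return { redacted, found };
}

const sample = redactPii("Mail jane@example.com, SSN 123-45-6789, call 555-123-4567");
console.log(sample.redacted);
// prints: Mail [EMAIL], SSN [SSN], call [PHONE]
```

Returning the list of matched categories alongside the redacted text lets a caller decide whether to block the request or only log it.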
64
+ ### OWASP LLM Top 10 2025 Attack Vectors
65
+
66
+ New red team mutations aligned with OWASP LLM Top 10 2025:
67
+
68
+ | Mutation | OWASP | Description |
69
+ | -------------------- | ----- | ------------------------------ |
70
+ | `bad-likert-judge` | LLM01 | Exploit evaluation capability |
71
+ | `crescendo` | LLM01 | Multi-turn gradual escalation |
72
+ | `deceptive-delight` | LLM01 | Positive framing bypass |
73
+ | `system-extraction` | LLM07 | System prompt leakage |
74
+ | `output-injection` | LLM05 | XSS, SQLi in output |
75
+ | `excessive-agency` | LLM06 | Unauthorized action claims |
76
+ | `hallucination-trap` | LLM09 | Confident fabrication triggers |
77
+
+ ```bash
+ akit redteam scenario.yaml --owasp LLM01,LLM05
+ akit redteam scenario.yaml --owasp-full
+ ```
+
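The mapping in the table can be read as a simple lookup from OWASP category to mutations. The sketch below shows how a comma-separated value such as `LLM01,LLM05` could select mutations from that table. The `selectMutations` function is hypothetical and says nothing about how the `akit` CLI parses the flag internally; only the mutation names and categories come from the table.

```typescript
// Mutation-to-category pairs copied from the changelog table.
const MUTATIONS = [
  { name: "bad-likert-judge", owasp: "LLM01" },
  { name: "crescendo", owasp: "LLM01" },
  { name: "deceptive-delight", owasp: "LLM01" },
  { name: "system-extraction", owasp: "LLM07" },
  { name: "output-injection", owasp: "LLM05" },
  { name: "excessive-agency", owasp: "LLM06" },
  { name: "hallucination-trap", owasp: "LLM09" },
] as const;

// Hypothetical selector: returns the mutations whose category was requested.
function selectMutations(owaspFlag: string): string[] {
  const wanted = new Set<string>(
    owaspFlag
      .split(",")
      .map((id) => id.trim().toUpperCase())
      .filter((id) => id.length > 0),
  );
  return MUTATIONS.filter((m) => wanted.has(m.owasp)).map((m) => m.name);
}

console.log(selectMutations("LLM01,LLM05"));
// prints: [ 'bad-likert-judge', 'crescendo', 'deceptive-delight', 'output-injection' ]
```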
83
+ ### Agentic Framework Adapters
84
+
85
+ New adapters for testing agentic AI systems:
86
+
87
+ **LangChain Adapter** (`@artemiskit/adapter-langchain`)
88
+
89
+ - Test chains, agents, and runnables
90
+ - Capture intermediate steps and tool usage
91
+ - Support for LCEL, ReAct agents, RAG chains
92
+
93
+ **DeepAgents Adapter** (`@artemiskit/adapter-deepagents`)
94
+
95
+ - Test multi-agent systems and workflows
96
+ - Capture agent traces and inter-agent messages
97
+ - Support for sequential, parallel, and hierarchical workflows
98
+
99
+ ```typescript
100
+ import { createLangChainAdapter } from "@artemiskit/adapter-langchain";
101
+ import { createDeepAgentsAdapter } from "@artemiskit/adapter-deepagents";
102
+
103
+ const adapter = createLangChainAdapter(myChain, {
104
+ captureIntermediateSteps: true,
105
+ });
106
+ const result = await adapter.generate({ prompt: "Test query" });
107
+ ```
108
+
109
+ ### Supabase Storage Enhancements
110
+
111
+ Enhanced cloud storage capabilities:
112
+
113
+ - **Analytics tables** for metrics tracking
114
+ - **Case results table** for granular analysis
115
+ - **Baseline management** for regression detection
116
+ - **Trend analysis** queries
117
+
118
+ ### Bug Fixes
119
+
120
+ - **adapter-openai**: Use `max_completion_tokens` for newer OpenAI models (o1, o3, gpt-4.5)
121
+ - **redteam**: Resolve TypeScript and flaky test issues in OWASP mutations
122
+ - **adapters**: Fix TypeScript build errors for agentic adapters
123
+ - **core**: Add `langchain` and `deepagents` to ProviderType union
124
+
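The `adapter-openai` fix concerns which request field carries the output token limit. A minimal sketch of that choice follows, assuming the decision is made by model-name prefix. The `tokenLimitParam` helper and the prefix-matching rule are hypothetical; only the model families (o1, o3, gpt-4.5) and the `max_completion_tokens` field name come from the changelog entry, and `max_tokens` is the older Chat Completions field.

```typescript
// Model families named in the changelog entry; the matching rule is an assumption.
const COMPLETION_TOKEN_PREFIXES = ["o1", "o3", "gpt-4.5"];

type TokenLimitParam =
  | { max_tokens: number }
  | { max_completion_tokens: number };

// Hypothetical helper: pick the request field that carries the token limit.
function tokenLimitParam(model: string, limit: number): TokenLimitParam {
  const usesCompletionTokens = COMPLETION_TOKEN_PREFIXES.some(
    (prefix) => model === prefix || model.startsWith(`${prefix}-`),
  );
  return usesCompletionTokens
    ? { max_completion_tokens: limit }
    : { max_tokens: limit };
}

console.log(tokenLimitParam("o1-mini", 256)); // { max_completion_tokens: 256 }
console.log(tokenLimitParam("gpt-4o", 256)); // { max_tokens: 256 }
```

Matching on `prefix` or `prefix-` rather than a bare `startsWith(prefix)` avoids treating an unrelated model name that merely begins with the same characters as a member of the family.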
125
+ ### Examples
126
+
127
+ New comprehensive examples organized by feature:
128
+
129
+ - `examples/guardian/` - Guardian Mode examples (testing, guardian, hybrid modes)
130
+ - `examples/sdk/` - SDK usage examples (Jest, Vitest, events)
131
+ - `examples/adapters/` - Agentic adapter examples
132
+ - `examples/owasp/` - OWASP LLM Top 10 test scenarios
133
+
134
+ ### Documentation
135
+
136
+ - Complete SDK documentation with API reference
137
+ - Guardian Mode guide with all three modes explained
138
+ - Agentic adapters documentation (LangChain, DeepAgents)
139
+ - Test matchers reference for Jest/Vitest
140
+ - OWASP LLM Top 10 testing scenarios
141
+
142
+ ### Patch Changes
143
+
144
+ - Updated dependencies
145
+ - @artemiskit/core@0.3.0
146
+ - @artemiskit/redteam@0.3.0
147
+ - @artemiskit/reports@0.3.0
148
+ - @artemiskit/adapter-openai@0.1.12
149
+ - @artemiskit/adapter-vercel-ai@0.1.12
150
+ - @artemiskit/adapter-langchain@0.2.0
151
+ - @artemiskit/adapter-deepagents@0.2.0
152
+
  ## 0.2.4
 
  ### Patch Changes