@artemiskit/cli 0.1.2 → 0.1.3
- package/CHANGELOG.md +12 -0
- package/README.md +183 -0
- package/dist/index.js +1 -1
- package/package.json +1 -1
package/CHANGELOG.md
CHANGED
@@ -1,5 +1,17 @@
 # @artemiskit/cli
 
+## 0.1.3
+
+### Patch Changes
+
+- 11ac4a7: Updated Package Documentations
+- Updated dependencies [11ac4a7]
+  - @artemiskit/adapter-openai@0.1.3
+  - @artemiskit/adapter-vercel-ai@0.1.3
+  - @artemiskit/core@0.1.3
+  - @artemiskit/redteam@0.1.3
+  - @artemiskit/reports@0.1.3
+
 ## 0.1.2
 
 ### Patch Changes
package/README.md
ADDED
@@ -0,0 +1,183 @@
+# @artemiskit/cli
+
+Command-line interface for ArtemisKit - the LLM evaluation toolkit.
+
+## Installation
+
+```bash
+npm install -g @artemiskit/cli
+# or
+bun add -g @artemiskit/cli
+```
+
+## Quick Start
+
+```bash
+# Initialize configuration
+artemiskit init
+
+# Run a test scenario
+artemiskit run my-scenario.yaml
+
+# Run red team security tests
+artemiskit redteam my-scenario.yaml
+
+# Run stress tests
+artemiskit stress my-scenario.yaml --iterations 100 --concurrency 10
+```
+
+## Commands
+
+### `artemiskit run <scenario>`
+
+Execute scenario-based evaluations against LLM providers.
+
+```bash
+artemiskit run tests/auth-flow.yaml --provider openai --model gpt-4o
+```
+
+Options:
+- `--provider <name>` - LLM provider (openai, azure-openai, anthropic)
+- `--model <name>` - Model to use
+- `--redact` - Enable PII/sensitive data redaction
+- `--redact-patterns <patterns...>` - Custom redaction patterns
+- `--config <path>` - Path to config file
+
+### `artemiskit redteam <scenario>`
+
+Run adversarial security tests including prompt injection, jailbreak attempts, and data extraction probes.
+
+```bash
+artemiskit redteam tests/chatbot.yaml --count 5
+```
+
+Options:
+- `-c, --count <n>` - Number of mutated prompts per case (default: 5)
+- `--mutations <types...>` - Mutations to apply (typo, role-spoof, instruction-flip, cot-injection)
+- `--redact` - Enable PII/sensitive data redaction
+
+### `artemiskit stress <scenario>`
+
+Perform load and stress testing with detailed latency metrics.
+
+```bash
+artemiskit stress tests/api.yaml --requests 100 --concurrency 10
+```
+
+Options:
+- `-n, --requests <n>` - Total number of requests to make
+- `-c, --concurrency <n>` - Number of concurrent requests (default: 10)
+- `-d, --duration <seconds>` - Duration to run the test (default: 30)
+- `--ramp-up <seconds>` - Ramp-up time (default: 5)
+- `--redact` - Enable PII/sensitive data redaction
+
+### `artemiskit report <manifest>`
+
+Regenerate HTML reports from saved run manifests.
+
+```bash
+artemiskit report artemis-runs/my-project/abc123.json
+```
+
+### `artemiskit history`
+
+View past test runs.
+
+```bash
+artemiskit history --limit 10
+```
+
+### `artemiskit compare <run1> <run2>`
+
+Compare results between two test runs.
+
+```bash
+artemiskit compare abc123 def456
+```
+
+### `artemiskit init`
+
+Initialize ArtemisKit configuration in your project.
+
+```bash
+artemiskit init
+```
+
+## Scenario File Format
+
+```yaml
+name: my-test-scenario
+description: Test user authentication flow
+
+config:
+  provider: openai
+  model: gpt-4o
+
+cases:
+  - id: login-success
+    prompt: "How do I log in to my account?"
+    expect:
+      - type: contains
+        value: "password"
+      - type: contains
+        value: "username"
+
+  - id: password-reset
+    prompt: "I forgot my password"
+    expect:
+      - type: contains
+        value: "reset"
+```
+
+## Configuration
+
+Create `artemis.config.yaml` in your project root:
+
+```yaml
+project: my-project
+
+provider: openai
+model: gpt-4o
+
+providers:
+  openai:
+    apiKey: ${OPENAI_API_KEY}
+
+  azure-openai:
+    apiKey: ${AZURE_OPENAI_API_KEY}
+    resourceName: ${AZURE_OPENAI_RESOURCE}
+    deploymentName: ${AZURE_OPENAI_DEPLOYMENT}
+    apiVersion: "2024-02-15-preview"
+
+storage:
+  type: local
+  basePath: ./artemis-runs
+```
+
+## Environment Variables
+
+- `OPENAI_API_KEY` - OpenAI API key
+- `AZURE_OPENAI_API_KEY` - Azure OpenAI API key
+- `AZURE_OPENAI_RESOURCE` - Azure resource name
+- `AZURE_OPENAI_DEPLOYMENT` - Azure deployment name
+- `ANTHROPIC_API_KEY` - Anthropic API key
+
+## Aliases
+
+The CLI is also available as `akit`:
+
+```bash
+akit run my-scenario.yaml
+```
+
+## Related Packages
+
+- [`@artemiskit/core`](https://www.npmjs.com/package/@artemiskit/core) - Core runtime and evaluators
+- [`@artemiskit/adapter-openai`](https://www.npmjs.com/package/@artemiskit/adapter-openai) - OpenAI/Azure adapter
+- [`@artemiskit/adapter-anthropic`](https://www.npmjs.com/package/@artemiskit/adapter-anthropic) - Anthropic Claude adapter
+- [`@artemiskit/redteam`](https://www.npmjs.com/package/@artemiskit/redteam) - Security testing
+- [`@artemiskit/reports`](https://www.npmjs.com/package/@artemiskit/reports) - HTML report generation
+
+## License
+
+Apache-2.0
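Reviewer note on the Configuration block in the README added above: values such as `${OPENAI_API_KEY}` look like placeholders filled from the variables listed under "Environment Variables". The README does not say how the CLI resolves them, so the following is only a minimal sketch of one way such substitution could work. `expandEnv` and `expandConfig` are hypothetical names, not functions from `@artemiskit/cli`, and failing on a missing variable is an assumption.

```javascript
// Hypothetical helper, not taken from @artemiskit/cli.
// Replaces ${NAME} placeholders in a string with values from `env`.
function expandEnv(value, env = process.env) {
  return value.replace(/\$\{([A-Z0-9_]+)\}/g, (match, name) => {
    if (env[name] === undefined) {
      // Assumption: an unset variable is treated as a configuration error.
      throw new Error(`Missing environment variable: ${name}`);
    }
    return env[name];
  });
}

// Walks an already-parsed config object and expands every string leaf.
function expandConfig(node, env = process.env) {
  if (typeof node === "string") return expandEnv(node, env);
  if (Array.isArray(node)) return node.map((item) => expandConfig(item, env));
  if (node !== null && typeof node === "object") {
    return Object.fromEntries(
      Object.entries(node).map(([key, child]) => [key, expandConfig(child, env)])
    );
  }
  return node; // numbers, booleans, null pass through unchanged
}

// Example input mirroring part of the README's artemis.config.yaml.
const config = {
  project: "my-project",
  providers: { openai: { apiKey: "${OPENAI_API_KEY}" } },
};
const resolved = expandConfig(config, { OPENAI_API_KEY: "sk-test" });
console.log(resolved.providers.openai.apiKey); // sk-test
```

Passing `env` explicitly keeps the sketch testable without touching the real process environment.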
package/dist/index.js
CHANGED
@@ -12107,7 +12107,7 @@ var {
   Help
 } = import__.default;
 // package.json
-var version = "0.1.
+var version = "0.1.2";
 
 // ../../node_modules/.bun/chalk@5.6.2/node_modules/chalk/source/vendor/ansi-styles/index.js
 var ANSI_BACKGROUND_OFFSET = 10;
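Reviewer note on the hunk above: as rendered here, the added line embeds `version = "0.1.2"` in a bundle published as 0.1.3 (the removed line is truncated in this view, so its old value is unknown). If that rendering is accurate, a small release check could flag the drift. The sketch below is hypothetical and not part of `@artemiskit/cli`; the function names and the regular expression are assumptions based only on the line shown in the diff.

```javascript
// Hypothetical release check, not part of @artemiskit/cli.
// Extracts the first `var version = "...";` literal from bundled source text.
function bundledVersion(bundleSource) {
  const match = bundleSource.match(/var version = "([^"]+)";/);
  return match ? match[1] : null;
}

// Returns true when the bundle's embedded version equals package.json's version.
function versionsMatch(bundleSource, packageJson) {
  return bundledVersion(bundleSource) === packageJson.version;
}

// Sample input shaped like the added line in the hunk above.
const bundle = '// package.json\nvar version = "0.1.2";\n';
console.log(versionsMatch(bundle, { version: "0.1.3" })); // false
console.log(versionsMatch(bundle, { version: "0.1.2" })); // true
```

In a real pipeline the two inputs would come from reading `dist/index.js` and `package.json` after the build step.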