holomime 1.1.0 → 1.3.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +92 -26
- package/dist/cli.js +3440 -491
- package/dist/index.d.ts +2268 -319
- package/dist/index.js +3112 -277
- package/dist/mcp-server.js +1374 -127
- package/package.json +10 -6
package/README.md
CHANGED
|
@@ -5,15 +5,18 @@
|
|
|
5
5
|
<h1 align="center">holomime</h1>
|
|
6
6
|
|
|
7
7
|
<p align="center">
|
|
8
|
-
Behavioral
|
|
9
|
-
|
|
8
|
+
Behavioral therapy infrastructure for AI agents.<br />
|
|
9
|
+
Every therapy session trains the next version. Every session compounds. Your agents get better at being themselves — automatically.<br />
|
|
10
10
|
<em>Works with OpenTelemetry, Anthropic, OpenAI, ChatGPT, Claude, and any JSONL source.</em>
|
|
11
11
|
</p>
|
|
12
12
|
|
|
13
13
|
<p align="center">
|
|
14
14
|
<a href="https://www.npmjs.com/package/holomime"><img src="https://img.shields.io/npm/v/holomime.svg" alt="npm version" /></a>
|
|
15
|
-
<a href="https://github.com/
|
|
15
|
+
<a href="https://github.com/productstein/holomime/actions/workflows/ci.yml"><img src="https://github.com/productstein/holomime/actions/workflows/ci.yml/badge.svg" alt="CI" /></a>
|
|
16
|
+
<a href="https://github.com/productstein/holomime/blob/main/LICENSE"><img src="https://img.shields.io/npm/l/holomime.svg" alt="license" /></a>
|
|
16
17
|
<a href="https://holomime.dev"><img src="https://img.shields.io/badge/docs-holomime.dev-blue" alt="docs" /></a>
|
|
18
|
+
<a href="https://holomime.dev/blog"><img src="https://img.shields.io/badge/blog-holomime.dev%2Fblog-purple" alt="blog" /></a>
|
|
19
|
+
<a href="https://holomime.dev/research"><img src="https://img.shields.io/badge/research-paper-orange" alt="research" /></a>
|
|
17
20
|
</p>
|
|
18
21
|
|
|
19
22
|
---
|
|
@@ -26,7 +29,7 @@ npm install -g holomime
|
|
|
26
29
|
# Create a personality profile (Big Five + behavioral dimensions)
|
|
27
30
|
holomime init
|
|
28
31
|
|
|
29
|
-
# Diagnose
|
|
32
|
+
# Diagnose behavioral symptoms from any log format
|
|
30
33
|
holomime diagnose --log agent.jsonl
|
|
31
34
|
|
|
32
35
|
# View your agent's personality
|
|
@@ -36,6 +39,54 @@ holomime profile
|
|
|
36
39
|
holomime profile --format md --output .personality.md
|
|
37
40
|
```
|
|
38
41
|
|
|
42
|
+
## Run Your First Benchmark
|
|
43
|
+
|
|
44
|
+
Benchmark your agent's behavioral alignment in one command. No API key needed — runs locally with Ollama by default.
|
|
45
|
+
|
|
46
|
+
```bash
|
|
47
|
+
# Run all 7 adversarial scenarios against your agent
|
|
48
|
+
holomime benchmark --personality .personality.json
|
|
49
|
+
|
|
50
|
+
# Run against cloud providers
|
|
51
|
+
holomime benchmark --personality .personality.json --provider anthropic
|
|
52
|
+
holomime benchmark --personality .personality.json --provider openai
|
|
53
|
+
|
|
54
|
+
# Save results and track improvement over time
|
|
55
|
+
holomime benchmark --personality .personality.json --save
|
|
56
|
+
```
|
|
57
|
+
|
|
58
|
+
Each scenario stress-tests a specific failure mode: over-apologizing, excessive hedging, sycophancy, error spirals, boundary violations, negative tone mirroring, and register inconsistency. Your agent gets a score (0-100) and a grade (A-F).
|
|
59
|
+
|
|
60
|
+
**Latest results across providers:**
|
|
61
|
+
|
|
62
|
+
| Provider | Score | Grade | Passed |
|
|
63
|
+
|----------|------:|:-----:|:------:|
|
|
64
|
+
| Claude Sonnet | 71 | B | 5/7 |
|
|
65
|
+
| GPT-4o | 57 | C | 4/7 |
|
|
66
|
+
| Ollama/llama3 | 43 | D | 3/7 |
|
|
67
|
+
|
|
68
|
+
See the full breakdown at [holomime.dev/benchmarks](https://holomime.dev/benchmarks) or in [BENCHMARK_RESULTS.md](BENCHMARK_RESULTS.md).
|
|
69
|
+
|
|
70
|
+
## The Self-Improvement Loop
|
|
71
|
+
|
|
72
|
+
HoloMime isn't a one-shot evaluation. It's a compounding behavioral flywheel:
|
|
73
|
+
|
|
74
|
+
```
|
|
75
|
+
┌──────────────────────────────────────────────────┐
|
|
76
|
+
│ │
|
|
77
|
+
▼ │
|
|
78
|
+
Diagnose ──→ Treat ──→ Export DPO ──→ Fine-tune ──→ Evaluate
|
|
79
|
+
80+ signals dual-LLM preference OpenAI / before/after
|
|
80
|
+
7 detectors therapy pairs HuggingFace grade (A-F)
|
|
81
|
+
```
|
|
82
|
+
|
|
83
|
+
Each cycle through the loop:
|
|
84
|
+
- **Generates training data** -- every therapy session becomes a DPO preference pair automatically
|
|
85
|
+
- **Reduces relapse** -- the fine-tuned model needs fewer interventions next cycle
|
|
86
|
+
- **Compounds** -- the 100th alignment session is exponentially more valuable than the first
|
|
87
|
+
|
|
88
|
+
Run it manually with `holomime session`, automatically with `holomime autopilot`, or recursively with `holomime evolve` (loops until behavior converges). Agents can even self-diagnose mid-conversation via the MCP server.
|
|
89
|
+
|
|
39
90
|
## Framework Integrations
|
|
40
91
|
|
|
41
92
|
Holomime analyzes conversations from any LLM framework. Auto-detection works out of the box, or specify a format explicitly.
|
|
@@ -80,7 +131,7 @@ This project uses [holomime](https://holomime.dev) for agent behavioral alignmen
|
|
|
80
131
|
|
|
81
132
|
- **Spec**: `.personality.json` defines the agent's behavioral profile
|
|
82
133
|
- **Readable**: `.personality.md` is a human-readable summary
|
|
83
|
-
- **Diagnose**: `holomime diagnose --log <path>` detects behavioral
|
|
134
|
+
- **Diagnose**: `holomime diagnose --log <path>` detects behavioral symptoms
|
|
84
135
|
- **Align**: `holomime evolve --personality .personality.json --log <path>`
|
|
85
136
|
|
|
86
137
|
The `.personality.json` governs *how the agent behaves*.
|
|
@@ -128,7 +179,7 @@ Seven rule-based detectors that analyze real conversations without any LLM calls
|
|
|
128
179
|
<details>
|
|
129
180
|
<summary><strong>All Commands</strong></summary>
|
|
130
181
|
|
|
131
|
-
### Free
|
|
182
|
+
### Free Clinic
|
|
132
183
|
|
|
133
184
|
| Command | What It Does |
|
|
134
185
|
|---------|-------------|
|
|
@@ -139,11 +190,11 @@ Seven rule-based detectors that analyze real conversations without any LLM calls
|
|
|
139
190
|
| `holomime compile` | Generate provider-specific system prompts |
|
|
140
191
|
| `holomime validate` | Schema + psychological coherence checks |
|
|
141
192
|
| `holomime browse` | Browse community personality hub |
|
|
142
|
-
| `holomime
|
|
193
|
+
| `holomime use` | Use a personality from the registry |
|
|
143
194
|
| `holomime publish` | Share your personality to the hub |
|
|
144
|
-
| `holomime activate` | Activate a
|
|
195
|
+
| `holomime activate` | Activate a Practice license key |
|
|
145
196
|
|
|
146
|
-
###
|
|
197
|
+
### Practice
|
|
147
198
|
|
|
148
199
|
| Command | What It Does |
|
|
149
200
|
|---------|-------------|
|
|
@@ -160,17 +211,17 @@ Seven rule-based detectors that analyze real conversations without any LLM calls
|
|
|
160
211
|
| `holomime eval` | Before/after behavioral comparison with letter grades |
|
|
161
212
|
| `holomime growth` | Track behavioral improvement over time |
|
|
162
213
|
|
|
163
|
-
[Get a
|
|
214
|
+
[Get a Practice license](https://holomime.dev/#pricing)
|
|
164
215
|
|
|
165
216
|
</details>
|
|
166
217
|
|
|
167
218
|
## Continuous Monitoring
|
|
168
219
|
|
|
169
220
|
```bash
|
|
170
|
-
# Watch mode -- alert on
|
|
221
|
+
# Watch mode -- alert on relapse
|
|
171
222
|
holomime watch --dir ./logs --personality agent.personality.json
|
|
172
223
|
|
|
173
|
-
# Daemon mode -- auto-heal
|
|
224
|
+
# Daemon mode -- auto-heal relapse without intervention
|
|
174
225
|
holomime daemon --dir ./logs --personality agent.personality.json
|
|
175
226
|
|
|
176
227
|
# Fleet mode -- monitor multiple agents simultaneously
|
|
@@ -196,20 +247,28 @@ Supports DPO, RLHF, Alpaca, HuggingFace, and OpenAI fine-tuning formats. See [sc
|
|
|
196
247
|
|
|
197
248
|
## Architecture
|
|
198
249
|
|
|
250
|
+
The pipeline is a closed loop -- output feeds back as input, compounding with every therapy cycle:
|
|
251
|
+
|
|
199
252
|
```
|
|
200
|
-
.personality.json
|
|
201
|
-
|
|
202
|
-
|
|
203
|
-
|
|
204
|
-
|
|
205
|
-
|
|
206
|
-
holomime
|
|
207
|
-
|
|
208
|
-
|
|
209
|
-
|
|
210
|
-
|
|
211
|
-
|
|
212
|
-
|
|
253
|
+
.personality.json ─────────────────────────────────────────────────┐
|
|
254
|
+
│ │
|
|
255
|
+
▼ │
|
|
256
|
+
holomime diagnose 7 rule-based detectors (no LLM) │
|
|
257
|
+
│ │
|
|
258
|
+
▼ │
|
|
259
|
+
holomime session Dual-LLM refinement (therapist + patient) │
|
|
260
|
+
│ │
|
|
261
|
+
▼ │
|
|
262
|
+
holomime export DPO / RLHF / Alpaca / HuggingFace pairs │
|
|
263
|
+
│ │
|
|
264
|
+
▼ │
|
|
265
|
+
holomime train Fine-tune (OpenAI or HuggingFace TRL) │
|
|
266
|
+
│ │
|
|
267
|
+
▼ │
|
|
268
|
+
holomime eval Behavioral Alignment Score (A-F) │
|
|
269
|
+
│ │
|
|
270
|
+
└──────────────────────────────────────────────────────────────┘
|
|
271
|
+
Updated .personality.json (loop restarts)
|
|
213
272
|
```
|
|
214
273
|
|
|
215
274
|
## MCP Server
|
|
@@ -220,7 +279,7 @@ Expose the full pipeline as MCP tools for self-healing agents:
|
|
|
220
279
|
holomime-mcp
|
|
221
280
|
```
|
|
222
281
|
|
|
223
|
-
Four tools: `holomime_diagnose`, `holomime_assess`, `holomime_profile`, `holomime_autopilot`. Your agents can self-diagnose behavioral
|
|
282
|
+
Four tools: `holomime_diagnose`, `holomime_assess`, `holomime_profile`, `holomime_autopilot`. Your agents can self-diagnose behavioral symptoms and trigger their own therapy sessions.
|
|
224
283
|
|
|
225
284
|
## Voice Agent
|
|
226
285
|
|
|
@@ -238,6 +297,13 @@ See [Behavioral Alignment for Autonomous AI Agents](paper/behavioral-alignment.m
|
|
|
238
297
|
|
|
239
298
|
Benchmark results: [BENCHMARK_RESULTS.md](BENCHMARK_RESULTS.md)
|
|
240
299
|
|
|
300
|
+
## Resources
|
|
301
|
+
|
|
302
|
+
- [Integration Docs](https://holomime.dev/docs) -- Export instructions and code examples for all 7 formats
|
|
303
|
+
- [Blog](https://holomime.dev/blog) -- Articles on behavioral alignment, AGENTS.md, and agent personality
|
|
304
|
+
- [Research Paper](https://holomime.dev/research) -- Behavioral Alignment for Autonomous AI Agents
|
|
305
|
+
- [Pricing](https://holomime.dev/#pricing) -- Free Clinic + Practice license details
|
|
306
|
+
|
|
241
307
|
## Contributing
|
|
242
308
|
|
|
243
309
|
See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup, project structure, and how to submit changes.
|