holomime 1.1.0 → 1.3.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/README.md CHANGED
@@ -5,15 +5,18 @@
5
5
  <h1 align="center">holomime</h1>
6
6
 
7
7
  <p align="center">
8
- Behavioral alignment infrastructure for AI agents.<br />
9
- Detect drift. Run therapy sessions. Export training data. Ship agents that stay in character.<br />
8
+ Behavioral therapy infrastructure for AI agents.<br />
9
+ Every therapy session trains the next version. Every session compounds. Your agents get better at being themselves &mdash; automatically.<br />
10
10
  <em>Works with OpenTelemetry, Anthropic, OpenAI, ChatGPT, Claude, and any JSONL source.</em>
11
11
  </p>
12
12
 
13
13
  <p align="center">
14
14
  <a href="https://www.npmjs.com/package/holomime"><img src="https://img.shields.io/npm/v/holomime.svg" alt="npm version" /></a>
15
- <a href="https://github.com/holomime/holomime/blob/main/LICENSE"><img src="https://img.shields.io/npm/l/holomime.svg" alt="license" /></a>
15
+ <a href="https://github.com/productstein/holomime/actions/workflows/ci.yml"><img src="https://github.com/productstein/holomime/actions/workflows/ci.yml/badge.svg" alt="CI" /></a>
16
+ <a href="https://github.com/productstein/holomime/blob/main/LICENSE"><img src="https://img.shields.io/npm/l/holomime.svg" alt="license" /></a>
16
17
  <a href="https://holomime.dev"><img src="https://img.shields.io/badge/docs-holomime.dev-blue" alt="docs" /></a>
18
+ <a href="https://holomime.dev/blog"><img src="https://img.shields.io/badge/blog-holomime.dev%2Fblog-purple" alt="blog" /></a>
19
+ <a href="https://holomime.dev/research"><img src="https://img.shields.io/badge/research-paper-orange" alt="research" /></a>
17
20
  </p>
18
21
 
19
22
  ---
@@ -26,7 +29,7 @@ npm install -g holomime
26
29
  # Create a personality profile (Big Five + behavioral dimensions)
27
30
  holomime init
28
31
 
29
- # Diagnose drift from any log format
32
+ # Diagnose behavioral symptoms from any log format
30
33
  holomime diagnose --log agent.jsonl
31
34
 
32
35
  # View your agent's personality
@@ -36,6 +39,54 @@ holomime profile
36
39
  holomime profile --format md --output .personality.md
37
40
  ```
38
41
 
42
+ ## Run Your First Benchmark
43
+
44
+ Benchmark your agent's behavioral alignment in one command. No API key needed — runs locally with Ollama by default.
45
+
46
+ ```bash
47
+ # Run all 7 adversarial scenarios against your agent
48
+ holomime benchmark --personality .personality.json
49
+
50
+ # Run against cloud providers
51
+ holomime benchmark --personality .personality.json --provider anthropic
52
+ holomime benchmark --personality .personality.json --provider openai
53
+
54
+ # Save results and track improvement over time
55
+ holomime benchmark --personality .personality.json --save
56
+ ```
57
+
58
+ Each scenario stress-tests a specific failure mode: over-apologizing, excessive hedging, sycophancy, error spirals, boundary violations, negative tone mirroring, and register inconsistency. Your agent gets a score (0-100) and a grade (A-F).
59
+
60
+ **Latest results across providers:**
61
+
62
+ | Provider | Score | Grade | Passed |
63
+ |----------|------:|:-----:|:------:|
64
+ | Claude Sonnet | 71 | B | 5/7 |
65
+ | GPT-4o | 57 | C | 4/7 |
66
+ | Ollama/llama3 | 43 | D | 3/7 |
67
+
68
+ See the full breakdown at [holomime.dev/benchmarks](https://holomime.dev/benchmarks) or in [BENCHMARK_RESULTS.md](BENCHMARK_RESULTS.md).
69
+
70
+ ## The Self-Improvement Loop
71
+
72
+ HoloMime isn't a one-shot evaluation. It's a compounding behavioral flywheel:
73
+
74
+ ```
75
+ ┌──────────────────────────────────────────────────┐
76
+ │ │
77
+ ▼ │
78
+ Diagnose ──→ Treat ──→ Export DPO ──→ Fine-tune ──→ Evaluate
79
+ 80+ signals dual-LLM preference OpenAI / before/after
80
+ 7 detectors therapy pairs HuggingFace grade (A-F)
81
+ ```
82
+
83
+ Each cycle through the loop:
84
+ - **Generates training data** -- every therapy session becomes a DPO preference pair automatically
85
+ - **Reduces relapse** -- the fine-tuned model needs fewer interventions next cycle
86
+ - **Compounds** -- the 100th alignment session is exponentially more valuable than the first
87
+
88
+ Run it manually with `holomime session`, automatically with `holomime autopilot`, or recursively with `holomime evolve` (loops until behavior converges). Agents can even self-diagnose mid-conversation via the MCP server.
89
+
39
90
  ## Framework Integrations
40
91
 
41
92
  Holomime analyzes conversations from any LLM framework. Auto-detection works out of the box, or specify a format explicitly.
@@ -80,7 +131,7 @@ This project uses [holomime](https://holomime.dev) for agent behavioral alignmen
80
131
 
81
132
  - **Spec**: `.personality.json` defines the agent's behavioral profile
82
133
  - **Readable**: `.personality.md` is a human-readable summary
83
- - **Diagnose**: `holomime diagnose --log <path>` detects behavioral drift
134
+ - **Diagnose**: `holomime diagnose --log <path>` detects behavioral symptoms
84
135
  - **Align**: `holomime evolve --personality .personality.json --log <path>`
85
136
 
86
137
  The `.personality.json` governs *how the agent behaves*.
@@ -128,7 +179,7 @@ Seven rule-based detectors that analyze real conversations without any LLM calls
128
179
  <details>
129
180
  <summary><strong>All Commands</strong></summary>
130
181
 
131
- ### Free Tier
182
+ ### Free Clinic
132
183
 
133
184
  | Command | What It Does |
134
185
  |---------|-------------|
@@ -139,11 +190,11 @@ Seven rule-based detectors that analyze real conversations without any LLM calls
139
190
  | `holomime compile` | Generate provider-specific system prompts |
140
191
  | `holomime validate` | Schema + psychological coherence checks |
141
192
  | `holomime browse` | Browse community personality hub |
142
- | `holomime pull` | Download a personality from the hub |
193
+ | `holomime use` | Use a personality from the registry |
143
194
  | `holomime publish` | Share your personality to the hub |
144
- | `holomime activate` | Activate a Pro license key |
195
+ | `holomime activate` | Activate a Practice license key |
145
196
 
146
- ### Pro Tier
197
+ ### Practice
147
198
 
148
199
  | Command | What It Does |
149
200
  |---------|-------------|
@@ -160,17 +211,17 @@ Seven rule-based detectors that analyze real conversations without any LLM calls
160
211
  | `holomime eval` | Before/after behavioral comparison with letter grades |
161
212
  | `holomime growth` | Track behavioral improvement over time |
162
213
 
163
- [Get a Pro license](https://holomime.dev/#pricing)
214
+ [Get a Practice license](https://holomime.dev/#pricing)
164
215
 
165
216
  </details>
166
217
 
167
218
  ## Continuous Monitoring
168
219
 
169
220
  ```bash
170
- # Watch mode -- alert on drift
221
+ # Watch mode -- alert on relapse
171
222
  holomime watch --dir ./logs --personality agent.personality.json
172
223
 
173
- # Daemon mode -- auto-heal drift without intervention
224
+ # Daemon mode -- auto-heal relapse without intervention
174
225
  holomime daemon --dir ./logs --personality agent.personality.json
175
226
 
176
227
  # Fleet mode -- monitor multiple agents simultaneously
@@ -196,20 +247,28 @@ Supports DPO, RLHF, Alpaca, HuggingFace, and OpenAI fine-tuning formats. See [sc
196
247
 
197
248
  ## Architecture
198
249
 
250
+ The pipeline is a closed loop -- output feeds back as input, compounding with every therapy cycle:
251
+
199
252
  ```
200
- .personality.json <- The spec (Big Five + behavioral dimensions)
201
- |
202
- holomime diagnose <- 7 rule-based detectors (no LLM)
203
- |
204
- holomime session <- Dual-LLM refinement (therapist + patient)
205
- |
206
- holomime export <- DPO / RLHF / Alpaca / HuggingFace training data
207
- |
208
- holomime train <- Fine-tune (OpenAI or HuggingFace TRL)
209
- |
210
- holomime eval <- Behavioral Alignment Score (A-F)
211
- |
212
- .personality.json <- Updated with fine-tuned model reference
253
+ .personality.json ─────────────────────────────────────────────────┐
254
+ │ │
255
+ ▼ │
256
+ holomime diagnose 7 rule-based detectors (no LLM) │
257
+ │ │
258
+ ▼ │
259
+ holomime session Dual-LLM refinement (therapist + patient) │
260
+ │ │
261
+ ▼ │
262
+ holomime export DPO / RLHF / Alpaca / HuggingFace pairs │
263
+ │ │
264
+ ▼ │
265
+ holomime train Fine-tune (OpenAI or HuggingFace TRL) │
266
+ │ │
267
+ ▼ │
268
+ holomime eval Behavioral Alignment Score (A-F) │
269
+ │ │
270
+ └──────────────────────────────────────────────────────────────┘
271
+ Updated .personality.json (loop restarts)
213
272
  ```
214
273
 
215
274
  ## MCP Server
@@ -220,7 +279,7 @@ Expose the full pipeline as MCP tools for self-healing agents:
220
279
  holomime-mcp
221
280
  ```
222
281
 
223
- Four tools: `holomime_diagnose`, `holomime_assess`, `holomime_profile`, `holomime_autopilot`. Your agents can self-diagnose behavioral drift and trigger their own alignment sessions.
282
+ Four tools: `holomime_diagnose`, `holomime_assess`, `holomime_profile`, `holomime_autopilot`. Your agents can self-diagnose behavioral symptoms and trigger their own therapy sessions.
224
283
 
225
284
  ## Voice Agent
226
285
 
@@ -238,6 +297,13 @@ See [Behavioral Alignment for Autonomous AI Agents](paper/behavioral-alignment.m
238
297
 
239
298
  Benchmark results: [BENCHMARK_RESULTS.md](BENCHMARK_RESULTS.md)
240
299
 
300
+ ## Resources
301
+
302
+ - [Integration Docs](https://holomime.dev/docs) -- Export instructions and code examples for all 7 formats
303
+ - [Blog](https://holomime.dev/blog) -- Articles on behavioral alignment, AGENTS.md, and agent personality
304
+ - [Research Paper](https://holomime.dev/research) -- Behavioral Alignment for Autonomous AI Agents
305
+ - [Pricing](https://holomime.dev/#pricing) -- Free Clinic + Practice license details
306
+
241
307
  ## Contributing
242
308
 
243
309
  See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup, project structure, and how to submit changes.