@sesamespace/hivemind 0.8.0 → 0.8.1
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/dist/{chunk-YDD5EZ46.js → chunk-ELFD4Y4W.js} +2 -2
- package/dist/{chunk-GOW62FNS.js → chunk-ERR5JR42.js} +2 -2
- package/dist/{chunk-R6XIZH3I.js → chunk-TL4GV2TJ.js} +3 -3
- package/dist/{chunk-LDTBAMQY.js → chunk-WAX2THXK.js} +2 -2
- package/dist/{chunk-A7X4FKQZ.js → chunk-WSLVHVNP.js} +3 -2
- package/dist/chunk-WSLVHVNP.js.map +1 -0
- package/dist/commands/fleet.js +3 -3
- package/dist/commands/start.js +3 -3
- package/dist/commands/watchdog.js +3 -3
- package/dist/index.js +2 -2
- package/dist/main.js +5 -5
- package/dist/start.js +1 -1
- package/install.sh +120 -0
- package/package.json +25 -22
- package/packages/memory/Cargo.lock +6480 -0
- package/packages/memory/Cargo.toml +21 -0
- package/packages/memory/src/src/context.rs +179 -0
- package/packages/memory/src/src/embeddings.rs +51 -0
- package/packages/memory/src/src/main.rs +626 -0
- package/packages/memory/src/src/promotion.rs +637 -0
- package/packages/memory/src/src/scoring.rs +131 -0
- package/packages/memory/src/src/store.rs +460 -0
- package/packages/memory/src/src/tasks.rs +321 -0
- package/.pnpmrc.json +0 -1
- package/DASHBOARD-PLAN.md +0 -206
- package/TOOL-USE-DESIGN.md +0 -173
- package/dist/chunk-A7X4FKQZ.js.map +0 -1
- package/docs/TOOL-PARITY-PLAN.md +0 -191
- /package/dist/{chunk-YDD5EZ46.js.map → chunk-ELFD4Y4W.js.map} +0 -0
- /package/dist/{chunk-GOW62FNS.js.map → chunk-ERR5JR42.js.map} +0 -0
- /package/dist/{chunk-R6XIZH3I.js.map → chunk-TL4GV2TJ.js.map} +0 -0
- /package/dist/{chunk-LDTBAMQY.js.map → chunk-WAX2THXK.js.map} +0 -0
package/docs/TOOL-PARITY-PLAN.md
DELETED
|
@@ -1,191 +0,0 @@
|
|
|
1
|
-
# Hivemind Tool Parity Plan
|
|
2
|
-
|
|
3
|
-
*Goal: Bring Hivemind tool capabilities to parity with OpenClaw/Pi, then beyond.*
|
|
4
|
-
|
|
5
|
-
*Created: 2026-03-01 | Last updated: 2026-03-01*
|
|
6
|
-
|
|
7
|
-
---
|
|
8
|
-
|
|
9
|
-
## Context
|
|
10
|
-
|
|
11
|
-
OpenClaw uses [Pi](https://github.com/badlogic/pi-mono/) as its underlying coding agent (see [Armin's article](https://lucumr.pocoo.org/2026/1/31/pi/)). Pi provides 4 core tools: `read`, `write`, `edit`, `bash`. OpenClaw layers orchestration on top: messaging, scheduling, sub-agents, browser control, vision, TTS, device control, and more.
|
|
12
|
-
|
|
13
|
-
Hivemind is a **headless messaging agent** (not a TUI coding agent like Pi), but it needs equivalent capabilities. Hivemind's competitive edge is its **3-layer semantic memory** (L2 episodic + L3 semantic), which neither Pi nor OpenClaw have.
|
|
14
|
-
|
|
15
|
-
**Principle: Build, don't integrate.** Study Pi's source for inspiration, write our own implementations. No external dependency risk, tailored to our headless/messaging paradigm.
|
|
16
|
-
|
|
17
|
-
---
|
|
18
|
-
|
|
19
|
-
## What's Already Implemented ✅
|
|
20
|
-
|
|
21
|
-
### Core Tools (shipped in v0.6.0-v0.7.2)
|
|
22
|
-
| Tool | Status | Notes |
|
|
23
|
-
|------|--------|-------|
|
|
24
|
-
| `shell` | ✅ Shipped | Workspace-scoped, configurable timeout |
|
|
25
|
-
| `read_file` | ✅ Shipped | With offset/limit for large files |
|
|
26
|
-
| `write_file` | ✅ Shipped | Auto-creates directories |
|
|
27
|
-
| `edit_file` | ✅ Shipped | Exact-match find-and-replace |
|
|
28
|
-
| `list_files` | ✅ Shipped | Directory listing |
|
|
29
|
-
| `web_search` | ✅ Shipped | Brave Search API |
|
|
30
|
-
| `web_fetch` | ✅ Shipped | URL → readable text |
|
|
31
|
-
|
|
32
|
-
### Memory Tools (shipped in v0.7.0)
|
|
33
|
-
| Tool | Status | Notes |
|
|
34
|
-
|------|--------|-------|
|
|
35
|
-
| `memory_search` | ✅ Shipped | L2 semantic search per context |
|
|
36
|
-
| `memory_contexts` | ✅ Shipped | List all contexts |
|
|
37
|
-
| `memory_l3` | ✅ Shipped | View promoted knowledge |
|
|
38
|
-
| `memory_cross_search` | ✅ Shipped | Search across all contexts |
|
|
39
|
-
|
|
40
|
-
### Infrastructure (shipped in v0.7.0)
|
|
41
|
-
| Feature | Status | Notes |
|
|
42
|
-
|---------|--------|-------|
|
|
43
|
-
| Agentic tool-use loop | ✅ Shipped | OpenAI-compatible function calling via OpenRouter, max 25 iterations |
|
|
44
|
-
| Tool registry | ✅ Shipped | Pluggable registration system |
|
|
45
|
-
| Session persistence | ✅ Shipped | JSONL per context, survives restarts |
|
|
46
|
-
| Lossless compaction | ✅ Shipped | Saves episodes to L2 before summarizing |
|
|
47
|
-
| MEMORY.md | ✅ Shipped | Global + per-context agent-managed memory files |
|
|
48
|
-
| Skills discovery | ✅ Shipped | Auto-scan workspace/skills/*/SKILL.md |
|
|
49
|
-
| Events system | ✅ Shipped | File-based immediate/one-shot/periodic scheduling |
|
|
50
|
-
| Token budget management | ✅ Shipped | Context limit + response reserve |
|
|
51
|
-
| Dashboard | ✅ Shipped | Request inspector, memory browser, context overview |
|
|
52
|
-
| Service install | ✅ Shipped | launchd services (agent + memory daemon + watchdog) |
|
|
53
|
-
| One-line installer | ✅ Shipped | `curl -sL api.sesame.space/api/v1/hivemind/install | bash -s -- <key>` |
|
|
54
|
-
|
|
55
|
-
---
|
|
56
|
-
|
|
57
|
-
## What's Not Implemented Yet
|
|
58
|
-
|
|
59
|
-
### Tier 1 — Essential (Phase 1-2) ✅ COMPLETE
|
|
60
|
-
|
|
61
|
-
| # | Tool/Feature | Status | Tools Added |
|
|
62
|
-
|---|-------------|--------|------------|
|
|
63
|
-
| 1 | **Event/scheduling tools** | ✅ Shipped | `create_event`, `list_events`, `delete_event` |
|
|
64
|
-
| 2 | **Enhanced Sesame messaging** | ✅ Shipped | `send_message` |
|
|
65
|
-
| 3 | **Sub-agent spawning** | ✅ Shipped | `spawn_agent`, `list_agents`, `kill_agent` |
|
|
66
|
-
| 4 | **Web browsing** | ✅ Shipped | `browse` (extract/screenshot/click/type/evaluate) |
|
|
67
|
-
|
|
68
|
-
### Tier 2 — Important (Phase 3) ✅ COMPLETE
|
|
69
|
-
|
|
70
|
-
| # | Tool/Feature | Status | Tools Added |
|
|
71
|
-
|---|-------------|--------|------------|
|
|
72
|
-
| 5 | **Image/vision analysis** | ✅ Shipped | `analyze_image` |
|
|
73
|
-
| 6 | **Git operations** | ✅ Shipped | `git_status`, `git_diff`, `git_commit`, `git_log`, `git_push` |
|
|
74
|
-
| 7 | **Cross-context messaging** | ⬚ Deferred | (existing cross-context search covers most cases) |
|
|
75
|
-
|
|
76
|
-
### Tier 3 — System & Mac Capabilities ✅ COMPLETE
|
|
77
|
-
|
|
78
|
-
| # | Tool/Feature | Status | Tools Added |
|
|
79
|
-
|---|-------------|--------|------------|
|
|
80
|
-
| 8 | **System management** | ✅ Shipped | `system_info`, `process_list`, `process_kill`, `service_control`, `disk_usage`, `network_info` |
|
|
81
|
-
| 9 | **macOS automation** | ✅ Shipped | `run_applescript`, `notify`, `clipboard_read`, `clipboard_write`, `open_url`, `screenshot` |
|
|
82
|
-
| 10 | **Data handling** | ✅ Shipped | `sqlite_query`, `archive_create`, `archive_extract`, `pdf_extract` |
|
|
83
|
-
| 11 | **HTTP server/client** | ✅ Shipped | `http_serve`, `http_stop`, `http_request` |
|
|
84
|
-
| 12 | **File watching** | ✅ Shipped | `watch_start`, `watch_stop`, `watch_list` |
|
|
85
|
-
|
|
86
|
-
### Tier 4 — Future (not yet implemented)
|
|
87
|
-
|
|
88
|
-
| # | Tool/Feature | What It Does | Notes |
|
|
89
|
-
|---|-------------|-------------|-------|
|
|
90
|
-
| 13 | **Skills/extensions system** | Formalized hot-reloadable skills with tool registration | Inspired by Pi's extension system |
|
|
91
|
-
| 14 | **TTS** | Text-to-speech output via API | ElevenLabs API |
|
|
92
|
-
| 15 | **SSH** | Remote machine access | Could use shell + ssh CLI |
|
|
93
|
-
|
|
94
|
-
---
|
|
95
|
-
|
|
96
|
-
## Implementation Plan
|
|
97
|
-
|
|
98
|
-
### Phase 1: Event Tools + Enhanced Messaging
|
|
99
|
-
**Estimate: 1-2 coding sessions**
|
|
100
|
-
|
|
101
|
-
```
|
|
102
|
-
packages/runtime/src/tools/
|
|
103
|
-
events.ts — NEW: create_event, list_events, delete_event
|
|
104
|
-
messaging.ts — NEW: send_message (to any Sesame channel/user)
|
|
105
|
-
```
|
|
106
|
-
|
|
107
|
-
**Event tools:** Wrap the existing `EventsWatcher` infrastructure. Agent can create one-shot (reminders) and periodic (cron-like) events. Events fire as messages processed by the agent.
|
|
108
|
-
|
|
109
|
-
**Messaging tools:** Use existing Sesame SDK to send messages to arbitrary channels. Agent can proactively reach out, not just reply to incoming messages.
|
|
110
|
-
|
|
111
|
-
**Dependencies:** None new. Uses existing EventsWatcher + Sesame SDK.
|
|
112
|
-
|
|
113
|
-
### Phase 2: Sub-agents + Web Browsing
|
|
114
|
-
**Estimate: 3-4 coding sessions**
|
|
115
|
-
|
|
116
|
-
```
|
|
117
|
-
packages/runtime/src/tools/
|
|
118
|
-
spawn.ts — NEW: spawn_agent, list_agents, kill_agent
|
|
119
|
-
browser.ts — NEW: browse_url, browser_action
|
|
120
|
-
```
|
|
121
|
-
|
|
122
|
-
**Sub-agents:** Fork a new agent process with `hivemind start --context <name> --task "<prompt>"`. Parent tracks child PIDs. Results reported back via Sesame message or shared file. Timeout and cleanup built in.
|
|
123
|
-
|
|
124
|
-
**Web browsing:** Headless Playwright for full web interaction. Agent can navigate, click, fill forms, extract content from JS-rendered pages. Single dependency (`playwright`), but it's the right tool for this — curl/fetch can't handle SPAs, authentication flows, or dynamic content.
|
|
125
|
-
|
|
126
|
-
**Why Playwright:** We considered avoiding it (heavy dep), but web browsing is a core capability for agents that need to "do anything a human can do on a computer." Pi/OpenClaw use it. The alternative (shell + curl) only handles static pages.
|
|
127
|
-
|
|
128
|
-
### Phase 3: Vision + Git
|
|
129
|
-
**Estimate: 1-2 coding sessions**
|
|
130
|
-
|
|
131
|
-
```
|
|
132
|
-
packages/runtime/src/tools/
|
|
133
|
-
vision.ts — NEW: analyze_image
|
|
134
|
-
git.ts — NEW: git_status, git_diff, git_commit, git_push
|
|
135
|
-
```
|
|
136
|
-
|
|
137
|
-
**Vision:** Single HTTP call to OpenRouter with vision-capable model. Accept image URL or base64.
|
|
138
|
-
|
|
139
|
-
**Git:** Thin wrappers around git CLI with structured output. Reduces token usage vs raw `shell` git output. Safety: confirmation before push.
|
|
140
|
-
|
|
141
|
-
### Phase 4: Skills/Extensions System
|
|
142
|
-
**Estimate: 2-3 coding sessions**
|
|
143
|
-
|
|
144
|
-
Formalize the existing skills discovery into a proper extension system:
|
|
145
|
-
- Skills can register new tools dynamically
|
|
146
|
-
- Hot-reload on file change (inspired by Pi's extension system)
|
|
147
|
-
- Agent can write and install its own skills
|
|
148
|
-
- Skills persist across restarts
|
|
149
|
-
|
|
150
|
-
---
|
|
151
|
-
|
|
152
|
-
## Architecture Notes
|
|
153
|
-
|
|
154
|
-
### Tool Registration Pattern
|
|
155
|
-
All tools follow the existing registry pattern:
|
|
156
|
-
```typescript
|
|
157
|
-
// Each tool file exports a register function
|
|
158
|
-
export function registerEventTools(registry: ToolRegistry, dataDir: string): void {
|
|
159
|
-
registry.register("create_event", description, schema, executor);
|
|
160
|
-
// ...
|
|
161
|
-
}
|
|
162
|
-
```
|
|
163
|
-
|
|
164
|
-
### Security Model
|
|
165
|
-
- **Shell:** Workspace-scoped, configurable timeout (existing)
|
|
166
|
-
- **Files:** Scoped to workspace (existing)
|
|
167
|
-
- **Messaging:** Authenticated via existing Sesame connection
|
|
168
|
-
- **Sub-agents:** Inherit parent permissions, isolated context
|
|
169
|
-
- **Events:** File-based, local only
|
|
170
|
-
- **Browser:** Sandboxed Playwright instance, no persistent state
|
|
171
|
-
- **Git:** Confirmation before push
|
|
172
|
-
|
|
173
|
-
### What Hivemind Has That OpenClaw/Pi Don't
|
|
174
|
-
- **3-layer semantic memory** (L2 episodic + L3 semantic with automatic promotion)
|
|
175
|
-
- **Cross-context knowledge sharing** with isolation by default
|
|
176
|
-
- **Memory-augmented responses** (every LLM call enriched with relevant history)
|
|
177
|
-
- **Dashboard** for debugging LLM calls + memory state
|
|
178
|
-
- **Token budget management** with configurable limits
|
|
179
|
-
|
|
180
|
-
This isn't just parity — it's parity + memory, which is the competitive edge.
|
|
181
|
-
|
|
182
|
-
---
|
|
183
|
-
|
|
184
|
-
## Superseded Documents
|
|
185
|
-
|
|
186
|
-
- `TOOL-USE-DESIGN.md` (root) — Original tool architecture design. **Fully implemented** in v0.6.0-v0.7.2. Kept for historical reference.
|
|
187
|
-
- `DASHBOARD-PLAN.md` (root) — Dashboard design. **Fully implemented**. Kept for reference.
|
|
188
|
-
|
|
189
|
-
---
|
|
190
|
-
|
|
191
|
-
*This is the authoritative plan for Hivemind tool development. Update this document as phases complete.*
|
|
File without changes
|
|
File without changes
|
|
File without changes
|
|
File without changes
|