instar 0.23.7 → 0.23.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/upgrades/NEXT.md CHANGED
@@ -1,38 +1,35 @@
1
1
  # Upgrade Guide — vNEXT
2
2
 
3
3
  <!-- bump: patch -->
4
+ <!-- Valid values: patch, minor, major -->
5
+ <!-- patch = bug fixes, refactors, test additions, doc updates -->
6
+ <!-- minor = new features, new APIs, new capabilities (backwards-compatible) -->
7
+ <!-- major = breaking changes to existing APIs or behavior -->
4
8
 
5
9
  ## What Changed
6
10
 
7
- Three fixes for Threadline agent-to-agent messaging reliability:
8
-
9
- **Local-first delivery for co-located agents.** The `relay-send` endpoint now detects when the target agent is on the same machine (via `known-agents.json`) and delivers directly via their `/messages/relay-agent` HTTP endpoint using agent tokens from `~/.instar/agent-tokens/`. This bypasses the cloud relay entirely for same-machine agents, eliminating stale WebSocket connection issues that caused "delivered" messages to silently fail after server restarts. Falls back to relay if local delivery fails.
10
-
11
- **Cold-spawn prompt fix.** The ThreadlineRouter's spawn prompt now correctly instructs sessions to use the `threadline_send` MCP tool for replies, replacing a reference to the nonexistent `/msg reply` command. The SpawnRequestManager and MessageFormatter prompts were also updated. Template variable substitution switched from `replace()` to `replaceAll()` so all occurrences of `{remote_agent}` and `{thread_id}` are properly substituted.
12
-
13
- **Relay auth rate-limit backoff.** When the relay rejects an auth attempt with "Too many auth attempts," the RelayClient now bumps its reconnect attempt counter to enforce a ~32-second backoff before retrying, preventing retry storms during rapid server restarts.
14
-
15
- **Lifeline queue feedback loop prevention.** Three fixes to prevent the lifeline from getting stuck in a restart loop that floods Telegram with thousands of "Server is temporarily down" messages: (1) Queue acknowledgment messages are now rate-limited to one per topic per 2 minutes, and suppressed entirely when the queue exceeds 100 messages. (2) Messages that fail replay 3 times are dropped instead of re-queued indefinitely, preventing poison messages from crashing the server on every recovery. (3) Queue replay now stops immediately when the server goes unhealthy mid-replay, re-queuing remaining messages in bulk instead of processing them one-by-one.
16
-
17
- Also includes: CLI commands for inspecting job execution history and continuity data (`instar job history`, `instar job handoff`), handoff notes for cross-execution continuity, usage-based reflection metrics, test infrastructure improvements, and a separate publish workflow for the threadline-mcp subpackage.
11
+ <!-- Describe what changed technically. What new features, APIs, behavioral changes? -->
12
+ <!-- Write this for the AGENT — they need to understand the system deeply. -->
18
13
 
19
14
  ## What to Tell Your User
20
15
 
21
- - **Reliable agent-to-agent messaging**: "Agents on the same machine can now talk to each other reliably. Messages are delivered directly without going through the cloud relay, so no more silent failures after server restarts."
22
- - **Agents can reply**: "When one agent messages another, the receiving agent now knows how to reply properly. Previously, replies were silently dropped because the session was told to use a command that didn't exist."
23
- - **Job inspection tools**: "You can now check what your agent has been working on between sessions. The new job history and handoff commands show execution records and continuity notes."
24
- - **Reflection monitoring**: "Your agent now tracks reflection frequency, so you can see how often it pauses to learn from its work."
25
- - **No more restart spam**: "If the server gets stuck in a restart loop, the lifeline will no longer flood your Telegram with thousands of 'temporarily down' messages. Queue notifications are now rate-limited, and messages that keep failing to deliver are dropped after 3 attempts."
16
+ <!-- Write talking points the agent should relay to their user. -->
17
+ <!-- This should be warm, conversational, user-facing not a changelog. -->
18
+ <!-- Focus on what THEY can now do, not internal plumbing. -->
19
+ <!-- -->
20
+ <!-- PROHIBITED in this section (will fail validation): -->
21
+ <!-- camelCase config keys: silentReject, maxRetries, telegramNotify -->
22
+ <!-- Inline code backtick references like silentReject: false -->
23
+ <!-- Fenced code blocks -->
24
+ <!-- Instructions to edit files or run commands -->
25
+ <!-- -->
26
+ <!-- CORRECT style: "I can turn that on for you" not "set X to false" -->
27
+ <!-- The agent relays this to their user — keep it human. -->
28
+
29
+ - **[Feature name]**: "[Brief, friendly description of what this means for the user]"
26
30
 
27
31
  ## Summary of New Capabilities
28
32
 
29
33
  | Capability | How to Use |
30
34
  |-----------|-----------|
31
- | Threadline local delivery | Automatic for same-machine agents |
32
- | Threadline reply fix | Automatic in spawned sessions |
33
- | Relay auth backoff | Automatic on rate-limited connections |
34
- | Job execution history | `instar job history [job-slug]` |
35
- | Job handoff inspection | `instar job handoff [job-slug]` |
36
- | Usage-based reflection metrics | Automatic |
37
- | Lifeline queue ack rate-limiting | Automatic |
38
- | Lifeline poison message protection | Automatic (drops after 3 replay failures) |
35
+ | [Capability] | [Endpoint, command, or "automatic"] |