threadkeeper-0.12.0.tar.gz → threadkeeper-0.13.0.tar.gz
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/PKG-INFO +44 -9
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/README.md +43 -8
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/pyproject.toml +1 -1
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_menubar_app.py +34 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_config.py +4 -4
- threadkeeper-0.13.0/tests/test_verify_ingest.py +162 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/assets/macos-agent-status/README.md +9 -3
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/assets/macos-agent-status/ThreadKeeperAgentStatus.swift +241 -57
- threadkeeper-0.13.0/threadkeeper/verify_ingest.py +313 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper.egg-info/PKG-INFO +44 -9
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper.egg-info/SOURCES.txt +2 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/LICENSE +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/setup.cfg +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_adapters.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_agent_status.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_auto_update.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_brief_footprint.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_brief_sections.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_candidate_reviewer.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_config_settings.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_core_memory.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_curator.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dashboard.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_delegated_search.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic_feed_tools.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic_miner.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic_observation_resolve.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic_recompute.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic_tier.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_dialectic_validator.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_error_paths.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_evolve_applier.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_evolve_apply_2.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_evolve_apply_3.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_evolve_daemon.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_extract_daemon.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_extract_dedup.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_i18n_multilang.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_identity.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_ingest_status.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_lessons.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_memory_guard.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_missed_spawns.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_nudges.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_onnx_embeddings.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_panel.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_probe_daemon.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_process_health.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_search_fts_punctuation.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_shadow_review.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_skill_hint.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_skill_passive_tier.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_skill_tier.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_skill_use_parser.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_skill_watcher.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_skills.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_budget.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_codex_stdin.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_hint.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_reap.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_slim.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_spawn_wrap.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_thread_janitor.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_threads.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_tools_smoke.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_validate_threads.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/tests/test_vec_search.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/__init__.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/_mcp.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/_setup.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/_spawn_wrap.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/__init__.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/_hook_helpers.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/antigravity.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/base.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/claude_code.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/claude_desktop.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/codex.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/copilot.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/gemini.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/adapters/vscode.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/agent_status.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/assets/macos-agent-status/Info.plist +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/assets/macos-agent-status/build.sh +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/assets/macos-agent-status/install.sh +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/auto_update.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/brief.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/candidate_reviewer.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/config.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/curator.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/db.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/dialectic_miner.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/dialectic_validator.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/embeddings.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/evolve_applier.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/evolve_daemon.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/extract_daemon.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/helpers.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/i18n.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/identity.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/ingest.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/lessons.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/memory_guard.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/menubar_app.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/migrate_embeddings.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/nudges.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/probe_daemon.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/process_health.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/review_prompts.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/search_proxy.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/server.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/shadow_review.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/skill_watcher.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/spawn_budget.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/spawn_config.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/thread_janitor.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/__init__.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/agent_status.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/candidate_reviewer.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/concepts.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/consolidate.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/core_memory.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/correlation.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/curator.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/dashboard.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/dialectic.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/dialectic_feed.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/dialog.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/distill.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/evolve_applier.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/extract.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/graph.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/invariants.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/lessons.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/memory_guard.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/missed_spawns.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/panel.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/peers.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/pickup.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/probes.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/process_health.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/session.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/shadow_review.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/skills.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/spawn.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/style.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/threads.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/tools/validate.py +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper.egg-info/dependency_links.txt +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper.egg-info/entry_points.txt +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper.egg-info/requires.txt +0 -0
- {threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper.egg-info/top_level.txt +0 -0
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: threadkeeper
-Version: 0.
+Version: 0.13.0
 Summary: Multi-agent shared brain across Claude Code/Desktop, Codex, Antigravity CLI, Gemini, Copilot, VS Code. Cross-session memory, self-improving skill loops, inter-agent signaling — one local MCP server.
 Author: thread-keeper contributors
 License: MIT
@@ -221,7 +221,7 @@ tk-agent-status --cleanup-memory
 ```
 
 `apps/macos-agent-status/` contains a small macOS menu-bar app that polls this
-command every
+command every 15 seconds and shows every autonomous learning loop: enabled/off,
 running/idle/ready, last pass, backlog, and active child RSS when that loop has
 spawned a worker. PyPI wheels and sdists also bundle the same Swift source under
 `threadkeeper/assets/macos-agent-status/`, so a normal `pipx`/`uv tool` install
@@ -239,12 +239,16 @@ memory button, self-restarts when its own RSS crosses
 notification permission, and sends a notification when a newly completed
 autonomous child task produces a useful result in `recent_results`; the first
 poll only marks existing results as seen, so old completions do not spam
-notifications.
+notifications. Status polling and cleanup commands run off the main actor, so
+opening the popover does not wait for `tk-agent-status --json`. The header gear
+opens a separate Settings window for
 `~/.threadkeeper/.env`: common knobs are grouped into guided controls, the raw
 `.env` remains editable for advanced values, three local presets can be saved
 and loaded, and Save & Restart writes the file then asks existing
 `threadkeeper.server` processes to exit so MCP hosts reconnect with the new
-configuration.
+configuration. Spawn CLI selectors collapse `agy` into canonical `antigravity`
+while keeping `gemini` as legacy, and model selectors use dropdowns with exact
+CLI model ids/labels instead of free-text fields. Probe backlog is due objective
 probes only, not every registered probe, so a healthy cooldown shows `0 due
 probes` instead of looking stuck. On macOS, `python -m threadkeeper.server`
 automatically installs and launches it on MCP startup, and restarts the app when
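The first-poll rule in the hunk above (existing `recent_results` entries are only marked as seen, so old completions never notify) can be illustrated with a minimal Python sketch. The real logic lives in the Swift menu-bar app and is not shown in this diff; the class name `ResultNotifier` and the assumption that each result carries a stable id are hypothetical and used only for illustration.

```python
from __future__ import annotations

from typing import Iterable


class ResultNotifier:
    """Hypothetical helper: decides which result ids deserve a notification."""

    def __init__(self) -> None:
        self._seen: set[str] = set()
        self._primed = False  # becomes True after the first poll

    def poll(self, recent_result_ids: Iterable[str]) -> list[str]:
        """Return ids that are new since the previous poll, in input order."""
        ids = list(recent_result_ids)
        if not self._primed:
            # First poll: remember everything, notify about nothing.
            self._seen.update(ids)
            self._primed = True
            return []
        fresh: list[str] = []
        for rid in ids:
            if rid not in self._seen:
                self._seen.add(rid)
                fresh.append(rid)
        return fresh
```

Under these assumptions, a poller would call `poll()` once per status refresh and send one notification per returned id.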
@@ -633,11 +637,15 @@ keys are lowercased:
 # default agent for roles with no explicit pin ("" / unset = use the active CLI)
 THREADKEEPER_SPAWN__DEFAULT=claude
 # per-role CLI: THREADKEEPER_SPAWN__LOOP__<ROLE>=<cli>
+# supported CLI keys: claude, codex, antigravity (agy executable), gemini (legacy), copilot
 THREADKEEPER_SPAWN__LOOP__SHADOW_OBSERVER=claude # heaviest reasoning → keep on Claude
 THREADKEEPER_SPAWN__LOOP__CURATOR=codex # weekly audit → Codex is fine
 THREADKEEPER_SPAWN__LOOP__CANDIDATE_REVIEWER=auto # "auto" = follow active CLI
 # model pin per CLI or per role: THREADKEEPER_SPAWN__MODEL__<KEY>=<model>
 THREADKEEPER_SPAWN__MODEL__CLAUDE=opus
+THREADKEEPER_SPAWN__MODEL__CODEX=gpt-5.5
+THREADKEEPER_SPAWN__MODEL__AGY="Gemini 3.1 Pro (High)"
+THREADKEEPER_SPAWN__MODEL__GEMINI=gemini-3.1-pro-preview
 THREADKEEPER_SPAWN__MODEL__DIALECTIC_VALIDATOR=opus
 ```
 
@@ -645,7 +653,9 @@ Resolution per role: `SPAWN__LOOP__<role>` → `SPAWN__DEFAULT` → active CLI
 `claude`; `"auto"` (or unset) defers to the active CLI. Real environment
 variables override the `.env`. Force host detection with
 `THREADKEEPER_ACTIVE_CLI=claude` (or `codex`, `antigravity`/`agy`,
-`gemini`, `copilot`).
+`gemini`, `copilot`). `agy` is normalized to `antigravity`; `gemini` remains a
+legacy Gemini CLI adapter for old installs/enterprise paths. See `.env.example`
+for the full knob list.
 
 Adapters without headless support (Claude Desktop, VS Code) can't be
 spawn targets — `spawn_status()` reports them as "no adapter" and any
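The per-role resolution order described in the hunk above (`SPAWN__LOOP__<role>`, then `SPAWN__DEFAULT`, then the active CLI, with `agy` normalized to `antigravity`) might look roughly like the sketch below. This is not the code in `threadkeeper/spawn_config.py`; the function names, the plain `dict` standing in for merged `.env` and environment settings, and the reading that a role pinned to `auto` skips the default and follows the active CLI directly are all assumptions.

```python
from __future__ import annotations

# Assumed alias table: the README only states that `agy` maps to `antigravity`.
CLI_ALIASES = {"agy": "antigravity"}


def normalize_cli(name: str | None) -> str:
    """Lowercase a CLI key and collapse known aliases to the canonical name."""
    key = (name or "").strip().lower()
    return CLI_ALIASES.get(key, key)


def resolve_cli(role: str, settings: dict[str, str], active_cli: str) -> str:
    """Pick the CLI for a role: per-role pin, then default, then active CLI."""
    pin = normalize_cli(settings.get(f"THREADKEEPER_SPAWN__LOOP__{role.upper()}"))
    if pin == "auto":
        # Assumption: an explicit "auto" pin follows the active CLI.
        return normalize_cli(active_cli)
    if pin:
        return pin
    default = normalize_cli(settings.get("THREADKEEPER_SPAWN__DEFAULT"))
    if default in ("", "auto"):
        return normalize_cli(active_cli)
    return default
```

With the example `.env` values from the README, `curator` would resolve to `codex`, `candidate_reviewer` to whatever the active CLI is, and any unpinned role to `claude`.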
@@ -745,12 +755,34 @@ unchanged.
 ## Verifying ingest across CLIs
 
 ```bash
-python scripts/tk_verify_ingest.py
+python scripts/tk_verify_ingest.py # both checks below
+python scripts/tk_verify_ingest.py --contract # parse/ingest contract only
+python scripts/tk_verify_ingest.py --live # production verdict only
+python scripts/tk_verify_ingest.py --live --json # machine-readable
 ```
 
-
-
-
+Two read-only checks:
+
+- **Contract test** (`--contract`) — walks every installed CLI adapter,
+parses recent transcripts into an isolated tempdir DB, reports
+per-source message counts and flags any adapter that parsed messages
+but silently failed to persist them. Answers *"does the pipeline
+work?"*
+- **Production verification** (`--live`) — reads the **live**
+`dialog_messages` table read-only and scores the three acceptance
+criteria from [roadmap issue #1](https://github.com/po4erk91/thread-keeper/issues/1):
+(1) every targeted CLI *slot* has production rows, (2) shadow-review
+sees more than one adapter in the same recent window, (3) the learning
+loop has fired on non-Claude sessions. Emits a `PASS` / `PARTIAL` /
+`FAIL` verdict. The four slots are `claude-code`, `codex`, `copilot`,
+and `google` — where the Google slot is satisfied by *either* the
+legacy `gemini` adapter or its successor Antigravity (`agy`), since
+both live under `~/.gemini`.
+
+`--strict` makes the process exit non-zero unless the live verdict is
+`PASS`, so it can gate CI; `PARTIAL` (e.g. a box that doesn't run all
+four CLIs) is a valid real-world state and exits 0 by default. The
+reusable verdict logic lives in `threadkeeper/verify_ingest.py`.
 
 ---
 
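The slot grouping and the `--strict` exit rule described in the hunk above can be sketched as follows. This is an illustration under stated assumptions, not the contents of `threadkeeper/verify_ingest.py`: the names `slot_for`, `coverage`, and `exit_code` are hypothetical, the default `thin_threshold` of 5 and the strict less-than comparison are guesses, and treating `FAIL` as a non-zero exit even without `--strict` is an assumption the README does not confirm.

```python
from __future__ import annotations

SLOTS = ("claude-code", "codex", "copilot", "google")

# Both Google-side adapters satisfy the single "google" slot.
SOURCE_TO_SLOT = {
    "claude-code": "claude-code",
    "codex": "codex",
    "copilot": "copilot",
    "gemini": "google",
    "antigravity": "google",
}


def slot_for(source: str) -> str | None:
    """Map an adapter source to its slot; None for non-slot sources."""
    return SOURCE_TO_SLOT.get(source)


def coverage(rows_by_source: dict[str, int], thin_threshold: int = 5) -> dict[str, dict]:
    """Sum production rows per slot and label each slot's status."""
    totals = {slot: 0 for slot in SLOTS}
    for source, rows in rows_by_source.items():
        slot = slot_for(source)
        if slot is not None:
            totals[slot] += rows
    report: dict[str, dict] = {}
    for slot, rows in totals.items():
        if rows == 0:
            status = "absent"
        elif rows < thin_threshold:
            status = "thin"
        else:
            status = "verified"
        report[slot] = {"rows": rows, "status": status}
    return report


def exit_code(verdict: str, strict: bool = False) -> int:
    """PASS exits 0; PARTIAL exits 0 unless strict; anything else exits 1."""
    if verdict == "PASS":
        return 0
    if verdict == "PARTIAL" and not strict:
        return 0
    return 1
```

Keeping the slot table separate from the scoring loop means a new adapter that shares a slot only needs one extra mapping entry.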
@@ -776,6 +808,7 @@ threadkeeper/
 ├── db.py # SQLite schema + sqlite-vec loader
 ├── identity.py # session, self-cid, daemon launchers
 ├── ingest.py # adapter-driven transcript ingest
+├── verify_ingest.py # cross-CLI production verification verdict
 ├── brief.py # render_brief / render_context
 ├── shadow_review.py # autonomous learning observer
 ├── i18n.py # 10 locales of regex + prompt bundles
@@ -814,3 +847,5 @@ locale. Look for the `good-first-issue` label.
 ## License
 
 MIT — see [LICENSE](LICENSE).
+
+<!-- mcp-name: io.github.po4erk91/thread-keeper -->
@@ -180,7 +180,7 @@ tk-agent-status --cleanup-memory
 ```
 
 `apps/macos-agent-status/` contains a small macOS menu-bar app that polls this
-command every
+command every 15 seconds and shows every autonomous learning loop: enabled/off,
 running/idle/ready, last pass, backlog, and active child RSS when that loop has
 spawned a worker. PyPI wheels and sdists also bundle the same Swift source under
 `threadkeeper/assets/macos-agent-status/`, so a normal `pipx`/`uv tool` install
@@ -198,12 +198,16 @@ memory button, self-restarts when its own RSS crosses
 notification permission, and sends a notification when a newly completed
 autonomous child task produces a useful result in `recent_results`; the first
 poll only marks existing results as seen, so old completions do not spam
-notifications.
+notifications. Status polling and cleanup commands run off the main actor, so
+opening the popover does not wait for `tk-agent-status --json`. The header gear
+opens a separate Settings window for
 `~/.threadkeeper/.env`: common knobs are grouped into guided controls, the raw
 `.env` remains editable for advanced values, three local presets can be saved
 and loaded, and Save & Restart writes the file then asks existing
 `threadkeeper.server` processes to exit so MCP hosts reconnect with the new
-configuration.
+configuration. Spawn CLI selectors collapse `agy` into canonical `antigravity`
+while keeping `gemini` as legacy, and model selectors use dropdowns with exact
+CLI model ids/labels instead of free-text fields. Probe backlog is due objective
 probes only, not every registered probe, so a healthy cooldown shows `0 due
 probes` instead of looking stuck. On macOS, `python -m threadkeeper.server`
 automatically installs and launches it on MCP startup, and restarts the app when
@@ -592,11 +596,15 @@ keys are lowercased:
 # default agent for roles with no explicit pin ("" / unset = use the active CLI)
 THREADKEEPER_SPAWN__DEFAULT=claude
 # per-role CLI: THREADKEEPER_SPAWN__LOOP__<ROLE>=<cli>
+# supported CLI keys: claude, codex, antigravity (agy executable), gemini (legacy), copilot
 THREADKEEPER_SPAWN__LOOP__SHADOW_OBSERVER=claude # heaviest reasoning → keep on Claude
 THREADKEEPER_SPAWN__LOOP__CURATOR=codex # weekly audit → Codex is fine
 THREADKEEPER_SPAWN__LOOP__CANDIDATE_REVIEWER=auto # "auto" = follow active CLI
 # model pin per CLI or per role: THREADKEEPER_SPAWN__MODEL__<KEY>=<model>
 THREADKEEPER_SPAWN__MODEL__CLAUDE=opus
+THREADKEEPER_SPAWN__MODEL__CODEX=gpt-5.5
+THREADKEEPER_SPAWN__MODEL__AGY="Gemini 3.1 Pro (High)"
+THREADKEEPER_SPAWN__MODEL__GEMINI=gemini-3.1-pro-preview
 THREADKEEPER_SPAWN__MODEL__DIALECTIC_VALIDATOR=opus
 ```
 
@@ -604,7 +612,9 @@ Resolution per role: `SPAWN__LOOP__<role>` → `SPAWN__DEFAULT` → active CLI
 `claude`; `"auto"` (or unset) defers to the active CLI. Real environment
 variables override the `.env`. Force host detection with
 `THREADKEEPER_ACTIVE_CLI=claude` (or `codex`, `antigravity`/`agy`,
-`gemini`, `copilot`).
+`gemini`, `copilot`). `agy` is normalized to `antigravity`; `gemini` remains a
+legacy Gemini CLI adapter for old installs/enterprise paths. See `.env.example`
+for the full knob list.
 
 Adapters without headless support (Claude Desktop, VS Code) can't be
 spawn targets — `spawn_status()` reports them as "no adapter" and any
@@ -704,12 +714,34 @@ unchanged.
 ## Verifying ingest across CLIs
 
 ```bash
-python scripts/tk_verify_ingest.py
+python scripts/tk_verify_ingest.py # both checks below
+python scripts/tk_verify_ingest.py --contract # parse/ingest contract only
+python scripts/tk_verify_ingest.py --live # production verdict only
+python scripts/tk_verify_ingest.py --live --json # machine-readable
 ```
 
-
-
-
+Two read-only checks:
+
+- **Contract test** (`--contract`) — walks every installed CLI adapter,
+parses recent transcripts into an isolated tempdir DB, reports
+per-source message counts and flags any adapter that parsed messages
+but silently failed to persist them. Answers *"does the pipeline
+work?"*
+- **Production verification** (`--live`) — reads the **live**
+`dialog_messages` table read-only and scores the three acceptance
+criteria from [roadmap issue #1](https://github.com/po4erk91/thread-keeper/issues/1):
+(1) every targeted CLI *slot* has production rows, (2) shadow-review
+sees more than one adapter in the same recent window, (3) the learning
+loop has fired on non-Claude sessions. Emits a `PASS` / `PARTIAL` /
+`FAIL` verdict. The four slots are `claude-code`, `codex`, `copilot`,
+and `google` — where the Google slot is satisfied by *either* the
+legacy `gemini` adapter or its successor Antigravity (`agy`), since
+both live under `~/.gemini`.
+
+`--strict` makes the process exit non-zero unless the live verdict is
+`PASS`, so it can gate CI; `PARTIAL` (e.g. a box that doesn't run all
+four CLIs) is a valid real-world state and exits 0 by default. The
+reusable verdict logic lives in `threadkeeper/verify_ingest.py`.
 
 ---
 
@@ -735,6 +767,7 @@ threadkeeper/
 ├── db.py # SQLite schema + sqlite-vec loader
 ├── identity.py # session, self-cid, daemon launchers
 ├── ingest.py # adapter-driven transcript ingest
+├── verify_ingest.py # cross-CLI production verification verdict
 ├── brief.py # render_brief / render_context
 ├── shadow_review.py # autonomous learning observer
 ├── i18n.py # 10 locales of regex + prompt bundles
@@ -773,3 +806,5 @@ locale. Look for the `good-first-issue` label.
 ## License
 
 MIT — see [LICENSE](LICENSE).
+
+<!-- mcp-name: io.github.po4erk91/thread-keeper -->
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 
 [project]
 name = "threadkeeper"
-version = "0.
+version = "0.13.0"
 description = "Multi-agent shared brain across Claude Code/Desktop, Codex, Antigravity CLI, Gemini, Copilot, VS Code. Cross-session memory, self-improving skill loops, inter-agent signaling — one local MCP server."
 requires-python = ">=3.11"
 authors = [{ name = "thread-keeper contributors" }]
@@ -46,6 +46,8 @@ def test_menubar_status_item_uses_idle_chip_and_running_gears():
     assert 'button.title = ""' in swift
     assert 'button.title = " TK' not in swift
     assert 'return "TK ' not in swift
+    assert "statusPollInterval: TimeInterval = 15.0" in swift
+    assert "Timer.scheduledTimer(withTimeInterval: statusPollInterval" in swift
     assert "Timer(timeInterval: gearSpinInterval" in swift
     assert "gearFrameStepDegrees = 17.0" in swift
     assert "largeGearDiameter: CGFloat = 12.0" in swift
@@ -59,6 +61,9 @@ def test_menubar_status_item_uses_idle_chip_and_running_gears():
     assert "store.snapshot.runningCount > 0" not in swift
     assert "button.image = gearFrames" in swift
     assert "TimelineView" not in swift
+    assert "refreshInFlight" in swift
+    assert "Task.detached(priority: .utility)" in swift
+    assert "nonisolated private static func runStatusCommand" in swift
     assert "store.openEnvSettings()" in swift
     assert '.help("Settings")' in swift
     assert '.help("Refresh")' not in swift
@@ -67,6 +72,19 @@ def test_menubar_status_item_uses_idle_chip_and_running_gears():
     assert '.help("Clean memory")' in swift
 
 
+def test_menubar_popover_shows_before_status_refresh():
+    repo = Path(__file__).resolve().parents[1]
+    swift = (
+        repo / "apps" / "macos-agent-status" / "ThreadKeeperAgentStatus.swift"
+    ).read_text(encoding="utf-8")
+
+    start = swift.index("@objc private func togglePopover")
+    end = swift.index(" private func updateStatusButton", start)
+    body = swift[start:end]
+
+    assert body.index("popover.show(") < body.index("store.refresh()")
+
+
 def test_menubar_env_settings_window_edits_env_and_presets():
     repo = Path(__file__).resolve().parents[1]
     swift = (
@@ -81,6 +99,22 @@ def test_menubar_env_settings_window_edits_env_and_presets():
     assert "(1...3).map" in swift
     assert "EnvPresetCard" in swift
     assert "mergeEnvText(raw:" in swift
+    assert "EnvSettingsTab" in swift
+    assert "case .raw:" in swift
+    assert "saveRaw(restart:" in swift
+    assert ".onChange(of: envStore.rawEnvText)" not in swift
+    assert "syncRawEditsIntoForm" not in swift
+    assert 'ChoiceOption("antigravity", label: "antigravity (agy)")' in swift
+    assert 'ChoiceOption("agy")' not in swift
+    assert 'ChoiceOption("gemini", label: "gemini (legacy)")' in swift
+    assert "antigravityModelChoices" in swift
+    assert "geminiLegacyModelChoices" in swift
+    assert '"Gemini 3.1 Pro (High)"' in swift
+    assert '"Gemini 3.5 Flash (Medium)"' in swift
+    assert '"gemini-3.1-pro-preview"' in swift
+    assert '"gemini-3.1-pro"' not in swift
+    assert "THREADKEEPER_SPAWN__MODEL__CODEX" in swift
+    assert "THREADKEEPER_SPAWN__MODEL__GEMINI" in swift
     assert "THREADKEEPER_DISABLE_BG_DAEMONS" in swift
     assert "THREADKEEPER_EVOLVE_APPLY_INTERVAL_S" in swift
     assert "THREADKEEPER_SPAWN__MODEL__EVOLVE_APPLIER" in swift
@@ -114,12 +114,12 @@ def test_resolve_model_from_dotenv(tmp_path, monkeypatch):
     envf = tmp_path / "tk.env"
     envf.write_text(
         "THREADKEEPER_SPAWN__MODEL__CODEX=gpt-5.4\n"
-
+        'THREADKEEPER_SPAWN__MODEL__AGY="Gemini 3.1 Pro (High)"\n'
         "THREADKEEPER_SPAWN__MODEL__GEMINI=gemini-2.5-pro\n"
     )
     sc = _reset(monkeypatch, tmp_path, env_file=str(envf))
     assert sc.resolve_model("codex") == "gpt-5.4"
-    assert sc.resolve_model("antigravity") == "
+    assert sc.resolve_model("antigravity") == "Gemini 3.1 Pro (High)"
     assert sc.resolve_model("gemini") == "gemini-2.5-pro"
     assert sc.resolve_model("claude") == ""  # no entry
 
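The test above expects a quoted `.env` value stored under the `AGY` key to come back unquoted when asking for `antigravity`. A minimal sketch of that behaviour follows, assuming a simple line-based parser; `parse_model_pins` and this standalone `resolve_model` are hypothetical stand-ins rather than the functions in `threadkeeper/spawn_config.py`, and inline `#` comments after a value are deliberately not handled here.

```python
from __future__ import annotations

PREFIX = "THREADKEEPER_SPAWN__MODEL__"
KEY_ALIASES = {"agy": "antigravity"}  # assumed alias table


def parse_model_pins(env_text: str) -> dict[str, str]:
    """Collect model pins from .env text, keyed by lowercased canonical name."""
    pins: dict[str, str] = {}
    for raw in env_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        key = key.strip()
        if not key.startswith(PREFIX):
            continue
        value = value.strip()
        # Drop one pair of matching surrounding quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        name = key[len(PREFIX):].lower()
        pins[KEY_ALIASES.get(name, name)] = value
    return pins


def resolve_model(pins: dict[str, str], cli: str) -> str:
    """Return the pinned model for a CLI, or an empty string when unset."""
    name = cli.strip().lower()
    return pins.get(KEY_ALIASES.get(name, name), "")
```

Storing pins under the canonical name means `agy` and `antigravity` lookups share one entry.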
@@ -216,12 +216,12 @@ def test_antigravity_spawn_argv_uses_p_flag(tmp_path, monkeypatch):
|
|
|
216
216
|
for name in [m for m in list(sys.modules) if m.startswith("threadkeeper")]:
|
|
217
217
|
del sys.modules[name]
|
|
218
218
|
from threadkeeper.adapters.antigravity import ADAPTER
|
|
219
|
-
argv = ADAPTER.spawn_argv("hello", model="
|
|
219
|
+
argv = ADAPTER.spawn_argv("hello", model="Gemini 3.1 Pro (High)")
|
|
220
220
|
if argv is None:
|
|
221
221
|
pytest.skip("agy binary not installed in test env")
|
|
222
222
|
assert "-p" in argv
|
|
223
223
|
assert "--model" in argv
|
|
224
|
-
assert "
|
|
224
|
+
assert "Gemini 3.1 Pro (High)" in argv
|
|
225
225
|
|
|
226
226
|
|
|
227
227
|
def test_gemini_spawn_argv_uses_p_flag(tmp_path, monkeypatch):
|
|
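The dotenv test above implies that per-CLI models are read from `THREADKEEPER_SPAWN__MODEL__<NAME>` keys, that quoted values are unquoted, and that the `antigravity` CLI is looked up under the `AGY` suffix. The following is a minimal sketch of that behavior under those assumptions only. `parse_env_text`, `resolve_model_from_env`, `KEY_PREFIX`, and `ENV_KEY_ALIASES` are hypothetical names for illustration and are not threadkeeper's actual spawn config API.

```python
from __future__ import annotations

# Hypothetical sketch; not threadkeeper's actual spawn config module.
KEY_PREFIX = "THREADKEEPER_SPAWN__MODEL__"

# Assumption drawn from the test: "antigravity" is stored under the AGY suffix.
ENV_KEY_ALIASES = {"antigravity": "AGY"}


def parse_env_text(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks and comments, unquoting values."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Strip one pair of matching quotes so KEY="a b" yields: a b
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def resolve_model_from_env(cli: str, env: dict[str, str]) -> str:
    """Return the configured model for a CLI, or "" when no key is set."""
    suffix = ENV_KEY_ALIASES.get(cli, cli.upper().replace("-", "_"))
    return env.get(KEY_PREFIX + suffix, "")


env = parse_env_text(
    "THREADKEEPER_SPAWN__MODEL__CODEX=gpt-5.4\n"
    'THREADKEEPER_SPAWN__MODEL__AGY="Gemini 3.1 Pro (High)"\n'
    "THREADKEEPER_SPAWN__MODEL__GEMINI=gemini-2.5-pro\n"
)
```

With this `env`, `resolve_model_from_env("antigravity", env)` yields the display-style model name with spaces and parentheses intact, which is why the quoting step matters for the new Antigravity model labels.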
threadkeeper-0.13.0/tests/test_verify_ingest.py
@@ -0,0 +1,162 @@
+"""Tests for the cross-CLI production verification harness (issue #1).
+
+The verdict logic is pure, so most of this exercises ``evaluate_coverage``
+and ``evaluate_verdict`` directly. One test drives the read-only SQL layer
+against an in-memory sqlite so the live-DB query path is covered without a
+real ~/.threadkeeper store.
+"""
+from __future__ import annotations
+
+import sqlite3
+
+from threadkeeper.verify_ingest import (
+    CANONICAL_SLOTS,
+    collect_live_signals,
+    evaluate_coverage,
+    evaluate_verdict,
+    format_report,
+    slot_for_source,
+)
+
+
+def test_slot_mapping_groups_gemini_and_antigravity():
+    # Gemini legacy and Antigravity (agy) both satisfy the single Google slot.
+    assert slot_for_source("gemini") == "google"
+    assert slot_for_source("antigravity") == "google"
+    assert slot_for_source("claude-code") == "claude-code"
+    assert slot_for_source("vscode") is None # not a canonical slot
+
+
+def test_coverage_status_verified_thin_absent():
+    cov = evaluate_coverage(
+        {"claude-code": 200, "codex": 50, "copilot": 2, "gemini": 0},
+        thin_threshold=5,
+    )
+    assert cov["claude-code"]["status"] == "verified"
+    assert cov["codex"]["status"] == "verified"
+    assert cov["copilot"]["status"] == "thin" # 2 rows, below threshold
+    assert cov["google"]["status"] == "absent" # gemini=0, no antigravity rows
+    # every canonical slot is represented even when no source mapped to it
+    assert set(cov) == set(CANONICAL_SLOTS)
+
+
+def test_coverage_antigravity_fills_google_slot():
+    cov = evaluate_coverage({"antigravity": 42}, thin_threshold=5)
+    assert cov["google"]["status"] == "verified"
+    assert cov["google"]["sources"] == {"antigravity": 42}
+
+
+def test_verdict_pass_when_all_criteria_met():
+    rep = evaluate_verdict(
+        source_counts={
+            "claude-code": 100, "codex": 100, "copilot": 100, "antigravity": 100,
+        },
+        window_sources=["claude-code", "codex", "antigravity"],
+        shadow_passes=10,
+    )
+    assert rep["verdict"] == "PASS"
+    assert rep["criteria"]["all_sources_present"]["pass"] is True
+    assert rep["criteria"]["cross_adapter_window"]["pass"] is True
+    assert rep["criteria"]["learning_loop_non_claude"]["pass"] is True
+
+
+def test_verdict_partial_three_of_four_slots():
+    # This is the real shape on a dev box: claude/codex/copilot present,
+    # google slot empty, but cross-adapter window + non-claude loop confirmed.
+    rep = evaluate_verdict(
+        source_counts={"claude-code": 200000, "codex": 11000, "copilot": 10},
+        window_sources=["claude-code", "codex"],
+        shadow_passes=2567,
+    )
+    assert rep["verdict"] == "PARTIAL"
+    assert rep["criteria"]["all_sources_present"]["pass"] is False
+    assert rep["criteria"]["all_sources_present"]["verified_slots"] == [
+        "claude-code", "codex", "copilot",
+    ]
+    assert rep["criteria"]["cross_adapter_window"]["pass"] is True
+    assert rep["criteria"]["learning_loop_non_claude"]["pass"] is True
+    assert "codex" in rep["criteria"]["learning_loop_non_claude"]["sources"]
+
+
+def test_verdict_fail_single_adapter_only():
+    # Only Claude Code has data and the window — not a cross-CLI demonstration.
+    rep = evaluate_verdict(
+        source_counts={"claude-code": 5000},
+        window_sources=["claude-code"],
+        shadow_passes=100,
+    )
+    assert rep["verdict"] == "FAIL"
+    assert rep["criteria"]["cross_adapter_window"]["pass"] is False
+    assert rep["criteria"]["learning_loop_non_claude"]["pass"] is False
+
+
+def test_verdict_fail_when_loop_never_ran():
+    rep = evaluate_verdict(
+        source_counts={"claude-code": 100, "codex": 100},
+        window_sources=["claude-code", "codex"],
+        shadow_passes=0, # learning loop has never fired
+    )
+    # cross-adapter window passes, but loop criterion fails and only 2 slots
+    # verified → PARTIAL (loop is one signal, window is the other).
+    assert rep["verdict"] == "PARTIAL"
+    assert rep["criteria"]["learning_loop_non_claude"]["pass"] is False
+
+
+def _seed_live_db(conn: sqlite3.Connection) -> None:
+    conn.execute(
+        "CREATE TABLE dialog_messages (source TEXT, created_at INTEGER)"
+    )
+    conn.execute("CREATE TABLE events (kind TEXT)")
+    rows = [
+        ("claude-code", 1_000_000),
+        ("claude-code", 1_000_500),
+        ("codex", 1_000_600), # interleaved with claude in the window
+        ("copilot", 100), # ancient — outside the recent window
+    ]
+    conn.executemany(
+        "INSERT INTO dialog_messages (source, created_at) VALUES (?, ?)", rows
+    )
+    conn.executemany(
+        "INSERT INTO events (kind) VALUES (?)",
+        [("shadow_review_pass",)] * 3 + [("ingest_pass",)],
+    )
+    conn.commit()
+
+
+def test_collect_live_signals_reads_window_and_passes():
+    conn = sqlite3.connect(":memory:")
+    conn.row_factory = sqlite3.Row
+    _seed_live_db(conn)
+
+    sig = collect_live_signals(conn, window_hours=24)
+    assert sig["source_counts"] == {
+        "claude-code": 2, "codex": 1, "copilot": 1,
+    }
+    # newest is 1_000_600; copilot@100 is far outside a 24h window of it.
+    assert set(sig["window_sources"]) == {"claude-code", "codex"}
+    assert sig["shadow_passes"] == 3
+    assert sig["newest_ts"] == 1_000_600
+
+
+def test_collect_live_signals_tolerates_missing_events_table():
+    conn = sqlite3.connect(":memory:")
+    conn.row_factory = sqlite3.Row
+    conn.execute("CREATE TABLE dialog_messages (source TEXT, created_at INTEGER)")
+    conn.execute("INSERT INTO dialog_messages VALUES ('codex', 5)")
+    conn.commit()
+    sig = collect_live_signals(conn)
+    assert sig["shadow_passes"] == 0 # no events table → graceful 0
+
+
+def test_format_report_renders_verdict_and_slots():
+    rep = evaluate_verdict(
+        source_counts={"claude-code": 200000, "codex": 11000, "copilot": 10},
+        window_sources=["claude-code", "codex"],
+        shadow_passes=2567,
+    )
+    rep["db_path"] = "/tmp/x.sqlite"
+    rep["signals"] = {"window_hours": 24}
+    text = format_report(rep)
+    assert "VERDICT: PARTIAL" in text
+    assert "claude-code" in text
+    assert "learning_loop_non_claude" in text
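The coverage tests above pin down a small contract: each source maps to at most one canonical slot, Gemini and Antigravity share the `google` slot, and a slot is `verified`, `thin`, or `absent` depending on its row count relative to `thin_threshold`. A minimal sketch that satisfies those assertions might look like the following. It is inferred from the tests alone and is not the shipped `threadkeeper/verify_ingest.py`; the slot order, the `total` key, the default threshold of 5, the choice to treat a count equal to the threshold as `verified`, and the omission of zero-count sources from `sources` are all assumptions.

```python
from __future__ import annotations

# Hypothetical sketch inferred from the tests; not the shipped module.
CANONICAL_SLOTS = ("claude-code", "codex", "copilot", "google")

# Both Google adapters share one slot; unknown sources map to no slot.
_SOURCE_TO_SLOT = {
    "claude-code": "claude-code",
    "codex": "codex",
    "copilot": "copilot",
    "gemini": "google",
    "antigravity": "google",
}


def slot_for_source(source: str) -> str | None:
    """Return the canonical slot for a source, or None if it has none."""
    return _SOURCE_TO_SLOT.get(source)


def evaluate_coverage(
    source_counts: dict[str, int], thin_threshold: int = 5
) -> dict[str, dict]:
    """Classify every canonical slot as verified, thin, or absent."""
    cov = {
        slot: {"status": "absent", "total": 0, "sources": {}}
        for slot in CANONICAL_SLOTS
    }
    for source, count in source_counts.items():
        slot = slot_for_source(source)
        if slot is None or count <= 0:
            continue  # ignore non-canonical sources and empty counts
        cov[slot]["sources"][source] = count
        cov[slot]["total"] += count
    for entry in cov.values():
        if entry["total"] >= thin_threshold:
            entry["status"] = "verified"
        elif entry["total"] > 0:
            entry["status"] = "thin"
    return cov
```

Seeding the result with every canonical slot up front is what lets an empty slot report `absent` rather than being missing from the result, which is the property `set(cov) == set(CANONICAL_SLOTS)` checks.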
{threadkeeper-0.12.0 → threadkeeper-0.13.0}/threadkeeper/assets/macos-agent-status/README.md
RENAMED
@@ -5,7 +5,7 @@ The status-bar item itself is AppKit `NSStatusItem`; the popover content is
 SwiftUI. That lets the app update the menu-bar image directly instead of relying
 on SwiftUI `MenuBarExtra` label animation.
 
-It polls `tk-agent-status --json` every
+It polls `tk-agent-status --json` every 15 seconds and shows:
 
 - an icon-only menu-bar status item, with loop counts in the popover and
 tooltip,
@@ -20,10 +20,14 @@ It polls `tk-agent-status --json` every 5 seconds and shows:
 - active spawned-child RSS when a loop has a worker running,
 - a Clean memory button that runs `tk-agent-status --cleanup-memory`,
 - a Settings gear that opens a separate `~/.threadkeeper/.env` editor with
-guided controls,
+guided controls, exact dropdowns for spawn CLI/model choices, raw text
+editing, three saved presets, and Save & Restart,
 - macOS notifications for newly completed autonomous child tasks that produced
 a useful result.
 
+Status polling and cleanup commands run in the background, so opening the
+popover does not wait for `tk-agent-status --json`.
+
 The first poll primes the seen-result list, so the app does not notify for old
 completed tasks that existed before it started.
 
@@ -55,7 +59,9 @@ The Settings gear edits `~/.threadkeeper/.env` by default, or the path in
 `THREADKEEPER_ENV_FILE` when the app was launched with that override. Save &
 Restart writes the file, runs the safe cleanup command, and sends TERM to
 running `threadkeeper.server` processes so MCP hosts reconnect with the new
-environment.
+environment. In the spawn routing controls, `antigravity` is the stored CLI
+value and `agy` is only the executable alias; `gemini` remains available as the
+legacy Gemini CLI adapter.
 
 ## Build
61
67
|
|