wavemind 2.2.6__tar.gz → 2.2.7__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (162) hide show
  1. {wavemind-2.2.6 → wavemind-2.2.7}/PKG-INFO +61 -15
  2. {wavemind-2.2.6 → wavemind-2.2.7}/README.md +60 -14
  3. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/BENCHMARK_LEADERBOARD.md +1 -1
  4. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/BENCHMARK_REPORT.md +1 -1
  5. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/benchmark_matrix_results.json +34 -7
  6. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/benchmark_registry.py +39 -3
  7. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/scale_readiness_benchmark.py +66 -2
  8. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/scale_readiness_results.json +19 -9
  9. {wavemind-2.2.6 → wavemind-2.2.7}/docker-compose.yml +1 -1
  10. {wavemind-2.2.6 → wavemind-2.2.7}/docs/BENCHMARK_BRIEF.md +1 -1
  11. {wavemind-2.2.6 → wavemind-2.2.7}/docs/ROADMAP.md +8 -5
  12. {wavemind-2.2.6 → wavemind-2.2.7}/pyproject.toml +1 -1
  13. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_replication.py +87 -0
  14. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_scale_readiness_benchmark.py +3 -0
  15. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/__init__.py +5 -1
  16. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/replication.py +274 -0
  17. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind.egg-info/PKG-INFO +61 -15
  18. {wavemind-2.2.6 → wavemind-2.2.7}/CONTRIBUTING.md +0 -0
  19. {wavemind-2.2.6 → wavemind-2.2.7}/Dockerfile +0 -0
  20. {wavemind-2.2.6 → wavemind-2.2.7}/LICENSE +0 -0
  21. {wavemind-2.2.6 → wavemind-2.2.7}/MANIFEST.in +0 -0
  22. {wavemind-2.2.6 → wavemind-2.2.7}/SECURITY.md +0 -0
  23. {wavemind-2.2.6 → wavemind-2.2.7}/SUPPORT.md +0 -0
  24. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/agent_memory_benchmark.py +0 -0
  25. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/agent_memory_results.json +0 -0
  26. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/ann_index_curve_benchmark.py +0 -0
  27. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/ann_index_curve_results.json +0 -0
  28. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/dynamic_memory_benchmark.py +0 -0
  29. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/dynamic_memory_results.json +0 -0
  30. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/field_memory_dynamics_benchmark.py +0 -0
  31. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/field_memory_dynamics_results.json +0 -0
  32. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/locomo_evidence_results.json +0 -0
  33. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/locomo_memory_benchmark.py +0 -0
  34. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/locomo_sentence_evidence_results.json +0 -0
  35. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/long_memory_evidence_benchmark.py +0 -0
  36. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/long_memory_evidence_results.json +0 -0
  37. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_answer_benchmark.py +0 -0
  38. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_answer_extractive_20_results.json +0 -0
  39. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_answer_qwen25_0_5b_50_results.json +0 -0
  40. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_answer_qwen25_1_5b_50_results.json +0 -0
  41. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_evidence_50_results.json +0 -0
  42. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_evidence_results.json +0 -0
  43. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/longmemeval_memory_benchmark.py +0 -0
  44. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/memory_competitor_benchmark.py +0 -0
  45. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/memory_competitor_results.json +0 -0
  46. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/nomiracl_russian_benchmark.py +0 -0
  47. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/nomiracl_russian_results.json +0 -0
  48. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/open_retrieval_benchmark.py +0 -0
  49. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/open_retrieval_scifact_results.json +0 -0
  50. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_index_profile_results.json +0 -0
  51. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_load_benchmark.py +0 -0
  52. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_load_qdrant_100k_tuned_results.json +0 -0
  53. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_load_qdrant_1m_ef_sweep_results.json +0 -0
  54. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_load_qdrant_1m_results.json +0 -0
  55. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_load_qdrant_1m_tuned_results.json +0 -0
  56. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/production_load_results.json +0 -0
  57. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/render_benchmark_charts.py +0 -0
  58. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/render_benchmark_leaderboard.py +0 -0
  59. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/render_benchmark_report.py +0 -0
  60. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/ru_sentences_benchmark.py +0 -0
  61. {wavemind-2.2.6 → wavemind-2.2.7}/benchmarks/wavemind_capacity_results.json +0 -0
  62. {wavemind-2.2.6 → wavemind-2.2.7}/docs/CHROMA_MIGRATION.md +0 -0
  63. {wavemind-2.2.6 → wavemind-2.2.7}/docs/DEMO_SCRIPT.md +0 -0
  64. {wavemind-2.2.6 → wavemind-2.2.7}/docs/LAUNCH_KIT.md +0 -0
  65. {wavemind-2.2.6 → wavemind-2.2.7}/docs/OBSERVABILITY.md +0 -0
  66. {wavemind-2.2.6 → wavemind-2.2.7}/docs/PROJECT_BOARD.md +0 -0
  67. {wavemind-2.2.6 → wavemind-2.2.7}/docs/RELEASE.md +0 -0
  68. {wavemind-2.2.6 → wavemind-2.2.7}/docs/RU_LAUNCH_POSTS.md +0 -0
  69. {wavemind-2.2.6 → wavemind-2.2.7}/docs/USE_CASES.md +0 -0
  70. {wavemind-2.2.6 → wavemind-2.2.7}/docs/assets/benchmark-summary.svg +0 -0
  71. {wavemind-2.2.6 → wavemind-2.2.7}/docs/assets/wavemind-demo.gif +0 -0
  72. {wavemind-2.2.6 → wavemind-2.2.7}/docs/assets/wavemind-social-card.svg +0 -0
  73. {wavemind-2.2.6 → wavemind-2.2.7}/examples/agent_with_memory.py +0 -0
  74. {wavemind-2.2.6 → wavemind-2.2.7}/examples/chroma_migration.py +0 -0
  75. {wavemind-2.2.6 → wavemind-2.2.7}/examples/customer_support_memory.py +0 -0
  76. {wavemind-2.2.6 → wavemind-2.2.7}/examples/demo.py +0 -0
  77. {wavemind-2.2.6 → wavemind-2.2.7}/examples/dynamic_memory_demo.py +0 -0
  78. {wavemind-2.2.6 → wavemind-2.2.7}/examples/framework_integrations.py +0 -0
  79. {wavemind-2.2.6 → wavemind-2.2.7}/examples/langchain_memory.py +0 -0
  80. {wavemind-2.2.6 → wavemind-2.2.7}/examples/llamaindex_retriever.py +0 -0
  81. {wavemind-2.2.6 → wavemind-2.2.7}/examples/observability/README.md +0 -0
  82. {wavemind-2.2.6 → wavemind-2.2.7}/examples/observability/docker-compose.yml +0 -0
  83. {wavemind-2.2.6 → wavemind-2.2.7}/examples/observability/otel-collector.yaml +0 -0
  84. {wavemind-2.2.6 → wavemind-2.2.7}/examples/observability/prometheus-alerts.yml +0 -0
  85. {wavemind-2.2.6 → wavemind-2.2.7}/examples/observability/prometheus.yml +0 -0
  86. {wavemind-2.2.6 → wavemind-2.2.7}/examples/production-index-profile/README.md +0 -0
  87. {wavemind-2.2.6 → wavemind-2.2.7}/examples/production-index-profile/docker-compose.yml +0 -0
  88. {wavemind-2.2.6 → wavemind-2.2.7}/examples/research_notebook_memory.py +0 -0
  89. {wavemind-2.2.6 → wavemind-2.2.7}/examples/sharded_memory.py +0 -0
  90. {wavemind-2.2.6 → wavemind-2.2.7}/install.bat +0 -0
  91. {wavemind-2.2.6 → wavemind-2.2.7}/install.sh +0 -0
  92. {wavemind-2.2.6 → wavemind-2.2.7}/requirements-optional.txt +0 -0
  93. {wavemind-2.2.6 → wavemind-2.2.7}/requirements.txt +0 -0
  94. {wavemind-2.2.6 → wavemind-2.2.7}/setup.cfg +0 -0
  95. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_agent_memory_benchmark.py +0 -0
  96. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_ann_index_curve_benchmark.py +0 -0
  97. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_api.py +0 -0
  98. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_api_process_persistence.py +0 -0
  99. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_benchmark_brief.py +0 -0
  100. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_benchmark_charts.py +0 -0
  101. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_benchmark_leaderboard.py +0 -0
  102. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_benchmark_registry.py +0 -0
  103. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_benchmark_report.py +0 -0
  104. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_chroma_migration_example.py +0 -0
  105. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_cli_smoke.py +0 -0
  106. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_cluster.py +0 -0
  107. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_core_persistence.py +0 -0
  108. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_dynamic_memory_benchmark.py +0 -0
  109. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_examples.py +0 -0
  110. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_field_graph.py +0 -0
  111. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_field_graph_integration.py +0 -0
  112. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_field_memory_dynamics_benchmark.py +0 -0
  113. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_framework_adapters.py +0 -0
  114. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_import_benchmark.py +0 -0
  115. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_indexes_encoders.py +0 -0
  116. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_jobs.py +0 -0
  117. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_langchain_integration.py +0 -0
  118. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_locomo_memory_benchmark.py +0 -0
  119. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_long_memory_evidence_benchmark.py +0 -0
  120. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_longmemeval_answer_benchmark.py +0 -0
  121. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_longmemeval_memory_benchmark.py +0 -0
  122. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_memory_competitor_benchmark.py +0 -0
  123. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_multimodal.py +0 -0
  124. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_nomiracl_russian_benchmark.py +0 -0
  125. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_observability.py +0 -0
  126. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_observability_docs.py +0 -0
  127. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_open_retrieval_benchmark.py +0 -0
  128. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_packaging_files.py +0 -0
  129. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_postgres_storage.py +0 -0
  130. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_production_index_profile.py +0 -0
  131. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_production_load_benchmark.py +0 -0
  132. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_scale_plan.py +0 -0
  133. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_semantic_and_latency.py +0 -0
  134. {wavemind-2.2.6 → wavemind-2.2.7}/tests/test_sharding.py +0 -0
  135. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/__main__.py +0 -0
  136. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/api.py +0 -0
  137. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/benchmark.py +0 -0
  138. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/cli.py +0 -0
  139. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/cluster.py +0 -0
  140. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/core.py +0 -0
  141. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/encoders.py +0 -0
  142. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/field_graph.py +0 -0
  143. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/importers.py +0 -0
  144. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/indexes.py +0 -0
  145. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/integrations/__init__.py +0 -0
  146. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/integrations/autogen.py +0 -0
  147. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/integrations/crewai.py +0 -0
  148. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/integrations/langchain.py +0 -0
  149. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/integrations/langgraph.py +0 -0
  150. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/integrations/llamaindex.py +0 -0
  151. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/jobs.py +0 -0
  152. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/multimodal.py +0 -0
  153. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/observability.py +0 -0
  154. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/scale.py +0 -0
  155. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/sharding.py +0 -0
  156. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/storage.py +0 -0
  157. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind/studio.py +0 -0
  158. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind.egg-info/SOURCES.txt +0 -0
  159. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind.egg-info/dependency_links.txt +0 -0
  160. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind.egg-info/entry_points.txt +0 -0
  161. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind.egg-info/requires.txt +0 -0
  162. {wavemind-2.2.6 → wavemind-2.2.7}/wavemind.egg-info/top_level.txt +0 -0
@@ -1,6 +1,6 @@
1
1
  Metadata-Version: 2.4
2
2
  Name: wavemind
3
- Version: 2.2.6
3
+ Version: 2.2.7
4
4
  Summary: Local-first dynamic memory field with vector search and wave-field re-ranking
5
5
  License-Expression: MIT
6
6
  Project-URL: Homepage, https://github.com/CaspianG/wavemind
@@ -542,16 +542,17 @@ Checked-in result:
542
542
  | profile | result |
543
543
  |---|---:|
544
544
  | Cluster planner | 4096 namespaces, 4 nodes, replication factor 2, node-loss availability `1.000`, zone-loss availability `1.000`, write quorum `2`. |
545
- | Hot cache | 2000 lookups, hit rate `0.920`, p99 lookup `0.01 ms`. |
546
- | Replicated runtime | 3 physical WaveMind stores, replication factor 3, write quorum 2, node-loss recall `true`, repair copied `1` missing record, tombstone repair deleted `1` stale record, p99 query-after-loss `1.33 ms`. |
547
- | Active-active delta sync | 2 regions, bidirectional convergence `true`, stale import suppressed after delete `true`, tombstone convergence `true`, sync `120.19 ms`. |
548
- | Structured payloads | image/audio/table/event retrieval, precision@1 `1.000`, p99 `0.62 ms`. |
545
+ | Hot cache | 2000 lookups, hit rate `0.920`, p99 lookup `0.003 ms`. |
546
+ | Replicated runtime | 3 physical WaveMind stores, replication factor 3, write quorum 2, node-loss recall `true`, repair copied `1` missing record, tombstone repair deleted `1` stale record, p99 query-after-loss `1.29 ms`. |
547
+ | Active-active delta sync | 2 regions, bidirectional convergence `true`, stale import suppressed after delete `true`, tombstone convergence `true`, sync `112.50 ms`. |
548
+ | Replicated snapshot | 3 replica files, manifest checksum validation `true`, restore `11.42 ms`, recall after restored-primary loss `true`. |
549
+ | Structured payloads | image/audio/table/event retrieval, precision@1 `1.000`, p99 `0.67 ms`. |
549
550
 
550
551
  This profile validates routing, quorum-replicated runtime behavior, cache
551
- behavior, active-active namespace delta sync, and structured payload handling.
552
- It is not a 10M-vector load test. Real 100k, 1M, and 10M latency claims should
553
- come from service-backed FAISS/Qdrant/pgvector load tests on production-like
554
- hardware.
552
+ behavior, active-active namespace delta sync, replicated snapshot/restore, and
553
+ structured payload handling. It is not a 10M-vector load test. Real 100k, 1M,
554
+ and 10M latency claims should come from service-backed FAISS/Qdrant/pgvector
555
+ load tests on production-like hardware.
555
556
 
556
557
  Cluster placement planning:
557
558
 
@@ -670,6 +671,34 @@ For Postgres storage, use database-native backup tooling such as `pg_dump`,
670
671
  managed snapshots, or point-in-time recovery instead of WaveMind's SQLite file
671
672
  backup command.
672
673
 
674
+ Replicated runtime snapshot/restore:
675
+
676
+ ```python
677
+ from wavemind import HashingTextEncoder, ReplicatedWaveMind
678
+
679
+ memory = ReplicatedWaveMind(
680
+ root_path="./state/replicas",
681
+ nodes=["node-a", "node-b", "node-c"],
682
+ replication_factor=3,
683
+ encoder=HashingTextEncoder(vector_dim=64),
684
+ )
685
+ memory.remember("Tenant A prefers short support replies.", namespace="tenant:a")
686
+
687
+ snapshot = memory.snapshot("./backups/replicated")
688
+ assert ReplicatedWaveMind.verify_snapshot(snapshot.snapshot_path)["healthy"]
689
+
690
+ restored, report = ReplicatedWaveMind.restore_snapshot(
691
+ snapshot.snapshot_path,
692
+ "./state/restored-replicas",
693
+ encoder=HashingTextEncoder(vector_dim=64),
694
+ )
695
+ ```
696
+
697
+ The replicated snapshot writes one SQLite backup per replica plus
698
+ `manifest.json` with SHA-256 checksums, replica metadata, quorum settings, and
699
+ node definitions. Restore refuses to overwrite a non-empty root unless
700
+ `overwrite=True` is passed.
701
+
673
702
  ## HTTP API
674
703
 
675
704
  Run the local FastAPI server:
@@ -1043,7 +1072,7 @@ Current read:
1043
1072
  | LongMemEval 50-query smoke | On the first 50 non-abstention LongMemEval-S questions, WaveMind reaches `evidence_recall@5 0.920`, `precision@1 0.760`, and `MRR@5 0.827`; Chroma/Qdrant static reach `0.600`, `0.260`, and `0.385`. | This is the fast regression profile for checking current changes before rerunning the full LongMemEval profile. WaveMind wins on quality; latency still needs work. |
1044
1073
  | ANN/index curve | At 50000 generated 128-d vectors, NumPy exact keeps `recall@10 1.000` at `6.49 ms`; quantized int8 keeps `0.934` at `24.92 ms`; Annoy is faster at `4.92 ms` but drops to `0.730` recall; Qdrant local keeps `1.000` recall at `43.49 ms`. | Current local scale boundary is clear: quantized search needs kernel work, Annoy needs tuning/FAISS, and Qdrant should be tested in service mode for a fair production comparison. |
1045
1074
  | Production load | At 100000 generated 128-d vectors, service-mode Qdrant reaches `recall@10 1.000`, avg `10.28 ms`, p99 `21.26 ms`. At 1M, tuned Qdrant reaches `recall@10 0.984`, avg `116.80 ms`, p99 `209.28 ms`; an EF sweep finds `recall@10 0.977`, avg `64.76 ms`, p99 `103.77 ms` at `hnsw_ef=2048` on 30 queries. | 100k is production-grade on the tested machine. 1M recall is now strong, but p99 still needs tuning before claiming a stable sub-100 ms SLO. |
1046
- | Scale readiness | Deterministic 1M-memory simulation validates 4096 namespace placements over 4 nodes with replication factor 2, node-loss availability `1.000`, zone-loss availability `1.000`, hot-cache hit rate `0.920`, quorum-replicated runtime recall after node loss, missing-record repair, tombstone repair, active-active delta sync, and structured payload precision@1 `1.000`. | This proves routing, cache, payload, replicated-runtime, and namespace-delta foundations. It is not a 10M-vector latency claim; real 10M latency still needs service-backed load tests on larger hardware. |
1075
+ | Scale readiness | Deterministic 1M-memory simulation validates 4096 namespace placements over 4 nodes with replication factor 2, node-loss availability `1.000`, zone-loss availability `1.000`, hot-cache hit rate `0.920`, quorum-replicated runtime recall after node loss, missing-record repair, tombstone repair, active-active delta sync, checksummed replicated snapshot/restore, and structured payload precision@1 `1.000`. | This proves routing, cache, payload, replicated-runtime, namespace-delta, and restore-drill foundations. It is not a 10M-vector latency claim; real 10M latency still needs service-backed load tests on larger hardware. |
1047
1076
  | Memory competitor adapters | WaveMind reaches `precision@1 0.80`, `precision@3 1.00`, stale suppression `1.00` on the small adapter profile. Mem0, Zep, and LangGraph are listed as skipped unless their real packages/services are configured. | This prevents fake competitor claims. The adapter harness is ready; real Mem0/Zep/LangGraph results still need configured installs. |
1048
1077
  | LongMemEval local answer generation | With the same local Ollama `qwen2.5:1.5b`, WaveMind reaches `exact_match 0.240`, `contains_answer 0.380`, `token_f1 0.333`, and `evidence_recall@5 0.920`; Chroma and Qdrant static both reach `0.120`, `0.160`, `0.170`, and `0.600`. | This is the first checked-in end-to-end answer benchmark against Chroma/Qdrant. It is still a 50-question lightweight smoke run, not a full LongMemEval leaderboard score. |
1049
1078
 
@@ -1062,7 +1091,7 @@ Current read:
1062
1091
  | Production index profile | Docker-backed 50000-vector profile for persisted FAISS, Qdrant service, and PostgreSQL/pgvector HNSW. | implemented | FAISS / Qdrant service / pgvector | Keep service-mode candidate generation above `0.95` recall@10 and below 10 ms average query latency at 50000 vectors. |
1063
1092
  | Production load profile | 100k and 1M service-backed candidate-index checks with p95/p99 latency. | implemented | Qdrant service / pgvector HNSW / FAISS persisted | Keep 100k at recall@10 `1.000`; push 1M p99 below 100 ms with recall@10 >= 0.95. |
1064
1093
  | Qdrant 1M HNSW ef sweep | One 1M Qdrant collection queried with multiple `hnsw_ef` values. | implemented | Qdrant service | Repeat with 100+ queries and collection-level HNSW build parameters before claiming a stable 1M SLO. |
1065
- | Scale readiness profile | Cluster placement, node/zone-loss simulation, quorum report, replicated runtime, active-active delta sync, hot-cache behavior, and structured/multimodal payload retrieval. | implemented | Mem0 / Zep / LangGraph persistent memory / GraphRAG target adapters | Keep quorum replication, namespace-delta sync, and repair green while adding larger service-backed 10M load tests. |
1094
+ | Scale readiness profile | Cluster placement, node/zone-loss simulation, quorum report, replicated runtime, active-active delta sync, replicated snapshot/restore, hot-cache behavior, and structured/multimodal payload retrieval. | implemented | Mem0 / Zep / LangGraph persistent memory / GraphRAG target adapters | Keep quorum replication, namespace-delta sync, repair, and restore drills green while adding larger service-backed 10M load tests. |
1066
1095
  | Memory competitor adapter profile | Dynamic-memory scenario wired for external memory frameworks. | implemented | Mem0 / Zep / LangGraph persistent memory | Report real competitor results only when their packages/services are explicitly configured. |
1067
1096
  | [BEIR](https://github.com/beir-cellar/beir) | Standard zero-shot information retrieval quality. | planned | Chroma / Qdrant / FAISS | Stay within 0.02 `nDCG@10` on identical embeddings. |
1068
1097
  | [MTEB Retrieval](https://github.com/embeddings-benchmark/mteb) | Separates encoder quality from retrieval-store quality. | planned | Chroma / Qdrant / FAISS | Prove WaveMind does not reduce same-embedding retrieval quality. |
@@ -1236,6 +1265,22 @@ region_b.import_namespace_delta(region_a.export_namespace_delta("tenant:a"))
1236
1265
  The delta contains active records plus tombstones. Import is idempotent and
1237
1266
  tombstone-aware, so a stale region export cannot resurrect a deleted memory.
1238
1267
 
1268
+ For operational recovery, `snapshot()` creates a checksummed replicated snapshot
1269
+ and `restore_snapshot()` restores it into a fresh replica root:
1270
+
1271
+ ```python
1272
+ snapshot = memory.snapshot("./backups/replicated")
1273
+ health = ReplicatedWaveMind.verify_snapshot(snapshot.snapshot_path)
1274
+ restored, report = ReplicatedWaveMind.restore_snapshot(
1275
+ snapshot.snapshot_path,
1276
+ "./state/restored-replicas",
1277
+ )
1278
+ ```
1279
+
1280
+ The checked-in scale-readiness profile verifies manifest checksums, restores
1281
+ three replica files, then disables the restored primary and confirms the memory
1282
+ is still recalled from the remaining replicas.
1283
+
1239
1284
  Checked-in official LoCoMo retrieval result:
1240
1285
 
1241
1286
  10 conversations, 5882 memory turns, 1977 evidence-labeled questions,
@@ -1584,8 +1629,9 @@ If you already use Chroma for local memory, see the practical migration guide:
1584
1629
  - The dynamic benchmark currently compares WaveMind against a static Chroma baseline. Chroma and Qdrant can implement similar behavior with extra application-layer metadata policy, deletes, filters, and reinforcement logic.
1585
1630
  - `MemoryFieldGraph` is a discrete graph over stored memories, not a continuous mathematical field. Its current build path should be optimized with incremental edge updates before large production use.
1586
1631
  - pgvector is a candidate-index backend. PostgreSQL source-of-truth storage is
1587
- also available separately, but migrations, PITR docs, and service benchmark
1588
- profiles still need more real deployment coverage.
1632
+ also available separately, but migrations, PITR docs, scheduled backup
1633
+ runbooks, and service benchmark profiles still need more real deployment
1634
+ coverage.
1589
1635
  - The Qdrant backend is also a candidate-index backend. WaveMind rebuilds it
1590
1636
  from SQLite on load/build, so large service-mode deployments still need a
1591
1637
  measured rebuild strategy and index-health monitoring.
@@ -1624,8 +1670,8 @@ Near-term priorities:
1624
1670
  - Faster dynamic re-ranking through smaller candidate windows, caching, and
1625
1671
  background updates.
1626
1672
  - Better production operations: OpenTelemetry is optional and implemented;
1627
- richer latency histograms, index-health metrics, alerting examples, and
1628
- restore drills are next.
1673
+ richer latency histograms, index-health metrics, alerting examples, scheduled
1674
+ offsite snapshots, and Postgres PITR runbooks are next.
1629
1675
 
1630
1676
  Longer-term direction:
1631
1677
 
@@ -489,16 +489,17 @@ Checked-in result:
489
489
  | profile | result |
490
490
  |---|---:|
491
491
  | Cluster planner | 4096 namespaces, 4 nodes, replication factor 2, node-loss availability `1.000`, zone-loss availability `1.000`, write quorum `2`. |
492
- | Hot cache | 2000 lookups, hit rate `0.920`, p99 lookup `0.01 ms`. |
493
- | Replicated runtime | 3 physical WaveMind stores, replication factor 3, write quorum 2, node-loss recall `true`, repair copied `1` missing record, tombstone repair deleted `1` stale record, p99 query-after-loss `1.33 ms`. |
494
- | Active-active delta sync | 2 regions, bidirectional convergence `true`, stale import suppressed after delete `true`, tombstone convergence `true`, sync `120.19 ms`. |
495
- | Structured payloads | image/audio/table/event retrieval, precision@1 `1.000`, p99 `0.62 ms`. |
492
+ | Hot cache | 2000 lookups, hit rate `0.920`, p99 lookup `0.003 ms`. |
493
+ | Replicated runtime | 3 physical WaveMind stores, replication factor 3, write quorum 2, node-loss recall `true`, repair copied `1` missing record, tombstone repair deleted `1` stale record, p99 query-after-loss `1.29 ms`. |
494
+ | Active-active delta sync | 2 regions, bidirectional convergence `true`, stale import suppressed after delete `true`, tombstone convergence `true`, sync `112.50 ms`. |
495
+ | Replicated snapshot | 3 replica files, manifest checksum validation `true`, restore `11.42 ms`, recall after restored-primary loss `true`. |
496
+ | Structured payloads | image/audio/table/event retrieval, precision@1 `1.000`, p99 `0.67 ms`. |
496
497
 
497
498
  This profile validates routing, quorum-replicated runtime behavior, cache
498
- behavior, active-active namespace delta sync, and structured payload handling.
499
- It is not a 10M-vector load test. Real 100k, 1M, and 10M latency claims should
500
- come from service-backed FAISS/Qdrant/pgvector load tests on production-like
501
- hardware.
499
+ behavior, active-active namespace delta sync, replicated snapshot/restore, and
500
+ structured payload handling. It is not a 10M-vector load test. Real 100k, 1M,
501
+ and 10M latency claims should come from service-backed FAISS/Qdrant/pgvector
502
+ load tests on production-like hardware.
502
503
 
503
504
  Cluster placement planning:
504
505
 
@@ -617,6 +618,34 @@ For Postgres storage, use database-native backup tooling such as `pg_dump`,
617
618
  managed snapshots, or point-in-time recovery instead of WaveMind's SQLite file
618
619
  backup command.
619
620
 
621
+ Replicated runtime snapshot/restore:
622
+
623
+ ```python
624
+ from wavemind import HashingTextEncoder, ReplicatedWaveMind
625
+
626
+ memory = ReplicatedWaveMind(
627
+ root_path="./state/replicas",
628
+ nodes=["node-a", "node-b", "node-c"],
629
+ replication_factor=3,
630
+ encoder=HashingTextEncoder(vector_dim=64),
631
+ )
632
+ memory.remember("Tenant A prefers short support replies.", namespace="tenant:a")
633
+
634
+ snapshot = memory.snapshot("./backups/replicated")
635
+ assert ReplicatedWaveMind.verify_snapshot(snapshot.snapshot_path)["healthy"]
636
+
637
+ restored, report = ReplicatedWaveMind.restore_snapshot(
638
+ snapshot.snapshot_path,
639
+ "./state/restored-replicas",
640
+ encoder=HashingTextEncoder(vector_dim=64),
641
+ )
642
+ ```
643
+
644
+ The replicated snapshot writes one SQLite backup per replica plus
645
+ `manifest.json` with SHA-256 checksums, replica metadata, quorum settings, and
646
+ node definitions. Restore refuses to overwrite a non-empty root unless
647
+ `overwrite=True` is passed.
648
+
620
649
  ## HTTP API
621
650
 
622
651
  Run the local FastAPI server:
@@ -990,7 +1019,7 @@ Current read:
990
1019
  | LongMemEval 50-query smoke | On the first 50 non-abstention LongMemEval-S questions, WaveMind reaches `evidence_recall@5 0.920`, `precision@1 0.760`, and `MRR@5 0.827`; Chroma/Qdrant static reach `0.600`, `0.260`, and `0.385`. | This is the fast regression profile for checking current changes before rerunning the full LongMemEval profile. WaveMind wins on quality; latency still needs work. |
991
1020
  | ANN/index curve | At 50000 generated 128-d vectors, NumPy exact keeps `recall@10 1.000` at `6.49 ms`; quantized int8 keeps `0.934` at `24.92 ms`; Annoy is faster at `4.92 ms` but drops to `0.730` recall; Qdrant local keeps `1.000` recall at `43.49 ms`. | Current local scale boundary is clear: quantized search needs kernel work, Annoy needs tuning/FAISS, and Qdrant should be tested in service mode for a fair production comparison. |
992
1021
  | Production load | At 100000 generated 128-d vectors, service-mode Qdrant reaches `recall@10 1.000`, avg `10.28 ms`, p99 `21.26 ms`. At 1M, tuned Qdrant reaches `recall@10 0.984`, avg `116.80 ms`, p99 `209.28 ms`; an EF sweep finds `recall@10 0.977`, avg `64.76 ms`, p99 `103.77 ms` at `hnsw_ef=2048` on 30 queries. | 100k is production-grade on the tested machine. 1M recall is now strong, but p99 still needs tuning before claiming a stable sub-100 ms SLO. |
993
- | Scale readiness | Deterministic 1M-memory simulation validates 4096 namespace placements over 4 nodes with replication factor 2, node-loss availability `1.000`, zone-loss availability `1.000`, hot-cache hit rate `0.920`, quorum-replicated runtime recall after node loss, missing-record repair, tombstone repair, active-active delta sync, and structured payload precision@1 `1.000`. | This proves routing, cache, payload, replicated-runtime, and namespace-delta foundations. It is not a 10M-vector latency claim; real 10M latency still needs service-backed load tests on larger hardware. |
1022
+ | Scale readiness | Deterministic 1M-memory simulation validates 4096 namespace placements over 4 nodes with replication factor 2, node-loss availability `1.000`, zone-loss availability `1.000`, hot-cache hit rate `0.920`, quorum-replicated runtime recall after node loss, missing-record repair, tombstone repair, active-active delta sync, checksummed replicated snapshot/restore, and structured payload precision@1 `1.000`. | This proves routing, cache, payload, replicated-runtime, namespace-delta, and restore-drill foundations. It is not a 10M-vector latency claim; real 10M latency still needs service-backed load tests on larger hardware. |
994
1023
  | Memory competitor adapters | WaveMind reaches `precision@1 0.80`, `precision@3 1.00`, stale suppression `1.00` on the small adapter profile. Mem0, Zep, and LangGraph are listed as skipped unless their real packages/services are configured. | This prevents fake competitor claims. The adapter harness is ready; real Mem0/Zep/LangGraph results still need configured installs. |
995
1024
  | LongMemEval local answer generation | With the same local Ollama `qwen2.5:1.5b`, WaveMind reaches `exact_match 0.240`, `contains_answer 0.380`, `token_f1 0.333`, and `evidence_recall@5 0.920`; Chroma and Qdrant static both reach `0.120`, `0.160`, `0.170`, and `0.600`. | This is the first checked-in end-to-end answer benchmark against Chroma/Qdrant. It is still a 50-question lightweight smoke run, not a full LongMemEval leaderboard score. |
996
1025
 
@@ -1009,7 +1038,7 @@ Current read:
1009
1038
  | Production index profile | Docker-backed 50000-vector profile for persisted FAISS, Qdrant service, and PostgreSQL/pgvector HNSW. | implemented | FAISS / Qdrant service / pgvector | Keep service-mode candidate generation above `0.95` recall@10 and below 10 ms average query latency at 50000 vectors. |
1010
1039
  | Production load profile | 100k and 1M service-backed candidate-index checks with p95/p99 latency. | implemented | Qdrant service / pgvector HNSW / FAISS persisted | Keep 100k at recall@10 `1.000`; push 1M p99 below 100 ms with recall@10 >= 0.95. |
1011
1040
  | Qdrant 1M HNSW ef sweep | One 1M Qdrant collection queried with multiple `hnsw_ef` values. | implemented | Qdrant service | Repeat with 100+ queries and collection-level HNSW build parameters before claiming a stable 1M SLO. |
1012
- | Scale readiness profile | Cluster placement, node/zone-loss simulation, quorum report, replicated runtime, active-active delta sync, hot-cache behavior, and structured/multimodal payload retrieval. | implemented | Mem0 / Zep / LangGraph persistent memory / GraphRAG target adapters | Keep quorum replication, namespace-delta sync, and repair green while adding larger service-backed 10M load tests. |
1041
+ | Scale readiness profile | Cluster placement, node/zone-loss simulation, quorum report, replicated runtime, active-active delta sync, replicated snapshot/restore, hot-cache behavior, and structured/multimodal payload retrieval. | implemented | Mem0 / Zep / LangGraph persistent memory / GraphRAG target adapters | Keep quorum replication, namespace-delta sync, repair, and restore drills green while adding larger service-backed 10M load tests. |
1013
1042
  | Memory competitor adapter profile | Dynamic-memory scenario wired for external memory frameworks. | implemented | Mem0 / Zep / LangGraph persistent memory | Report real competitor results only when their packages/services are explicitly configured. |
1014
1043
  | [BEIR](https://github.com/beir-cellar/beir) | Standard zero-shot information retrieval quality. | planned | Chroma / Qdrant / FAISS | Stay within 0.02 `nDCG@10` on identical embeddings. |
1015
1044
  | [MTEB Retrieval](https://github.com/embeddings-benchmark/mteb) | Separates encoder quality from retrieval-store quality. | planned | Chroma / Qdrant / FAISS | Prove WaveMind does not reduce same-embedding retrieval quality. |
@@ -1183,6 +1212,22 @@ region_b.import_namespace_delta(region_a.export_namespace_delta("tenant:a"))
1183
1212
  The delta contains active records plus tombstones. Import is idempotent and
1184
1213
  tombstone-aware, so a stale region export cannot resurrect a deleted memory.
1185
1214
 
1215
+ For operational recovery, `snapshot()` creates a checksummed replicated snapshot
1216
+ and `restore_snapshot()` restores it into a fresh replica root:
1217
+
1218
+ ```python
1219
+ snapshot = memory.snapshot("./backups/replicated")
1220
+ health = ReplicatedWaveMind.verify_snapshot(snapshot.snapshot_path)
1221
+ restored, report = ReplicatedWaveMind.restore_snapshot(
1222
+ snapshot.snapshot_path,
1223
+ "./state/restored-replicas",
1224
+ )
1225
+ ```
1226
+
1227
+ The checked-in scale-readiness profile verifies manifest checksums, restores
1228
+ three replica files, then disables the restored primary and confirms the memory
1229
+ is still recalled from the remaining replicas.
1230
+
1186
1231
  Checked-in official LoCoMo retrieval result:
1187
1232
 
1188
1233
  10 conversations, 5882 memory turns, 1977 evidence-labeled questions,
@@ -1531,8 +1576,9 @@ If you already use Chroma for local memory, see the practical migration guide:
1531
1576
  - The dynamic benchmark currently compares WaveMind against a static Chroma baseline. Chroma and Qdrant can implement similar behavior with extra application-layer metadata policy, deletes, filters, and reinforcement logic.
1532
1577
  - `MemoryFieldGraph` is a discrete graph over stored memories, not a continuous mathematical field. Its current build path should be optimized with incremental edge updates before large production use.
1533
1578
  - pgvector is a candidate-index backend. PostgreSQL source-of-truth storage is
1534
- also available separately, but migrations, PITR docs, and service benchmark
1535
- profiles still need more real deployment coverage.
1579
+ also available separately, but migrations, PITR docs, scheduled backup
1580
+ runbooks, and service benchmark profiles still need more real deployment
1581
+ coverage.
1536
1582
  - The Qdrant backend is also a candidate-index backend. WaveMind rebuilds it
1537
1583
  from SQLite on load/build, so large service-mode deployments still need a
1538
1584
  measured rebuild strategy and index-health monitoring.
@@ -1571,8 +1617,8 @@ Near-term priorities:
1571
1617
  - Faster dynamic re-ranking through smaller candidate windows, caching, and
1572
1618
  background updates.
1573
1619
  - Better production operations: OpenTelemetry is optional and implemented;
1574
- richer latency histograms, index-health metrics, alerting examples, and
1575
- restore drills are next.
1620
+ richer latency histograms, index-health metrics, alerting examples, scheduled
1621
+ offsite snapshots, and Postgres PITR runbooks are next.
1576
1622
 
1577
1623
  Longer-term direction:
1578
1624
 
@@ -21,7 +21,7 @@ This is a compact reader-facing view of checked-in benchmark results. It is not
21
21
  | Production load profile 100k | production-scale | Recall@k | WaveMind pgvector: 0.736 / 17.8 ms | Qdrant service: 1 / 10.3 ms | Baseline leads on quality |
22
22
  | Production load profile 1M | production-scale | Recall@k | - | Qdrant service: 0.984 / 116.8 ms | No WaveMind result |
23
23
  | Qdrant 1M HNSW ef sweep | production-scale | Recall@k | - | hnsw_ef=2048: 0.977 / 64.8 ms | No WaveMind result |
24
- | Scale readiness profile | production-scale | precision@1 | WaveMind structured payloads: 1 / 0.837 ms | - | WaveMind-only check |
24
+ | Scale readiness profile | production-scale | precision@1 | WaveMind structured payloads: 1 / 0.448 ms | - | WaveMind-only check |
25
25
  | Memory competitor adapter profile | agent-memory | precision@1 | WaveMind: 0.8 / 0.554 ms | - | WaveMind-only check |
26
26
  | [LongMemEval answer generation](https://github.com/xiaowu0162/LongMemEval) | long-term-agent-memory | token F1 | WaveMind + qwen2.5:1.5b: 0.333 / - | Chroma static + qwen2.5:1.5b: 0.17 / - | WaveMind leads on quality |
27
27
 
@@ -24,7 +24,7 @@ Planned rows are not claimed wins. They are the public proof path WaveMind must
24
24
  | Production load profile 100k | production-scale | implemented | Qdrant service: Recall@k 1.00, avg latency 10.3, p95 latency 19.0, p99 latency ms 21.3, build ms 27439.3<br>WaveMind pgvector: Recall@k 0.74, avg latency 17.8, p95 latency 23.5, build ms 455703.7<br>WaveMind faiss-persisted: skipped - Set WAVEMIND_FAISS_PATH to use the persisted FAISS backend | Tune pgvector HNSW build/search parameters and add persisted FAISS from the Linux benchmark container. |
25
25
  | Production load profile 1M | production-scale | implemented | Qdrant service: Recall@k 0.98, avg latency 116.8, p95 latency 153.8, p99 latency ms 209.3, build ms 450674.6 | Tune Qdrant indexing/search params further, then add FAISS IVF/HNSW and pgvector 1M profiles on a larger disk. |
26
26
  | Qdrant 1M HNSW ef sweep | production-scale | implemented | hnsw_ef=512: Recall@k 0.75, avg latency 47.2, p95 latency 68.5, p99 latency ms 68.5, max latency ms 68.5<br>hnsw_ef=768: Recall@k 0.85, avg latency 44.0, p95 latency 69.1, p99 latency ms 69.8, max latency ms 69.8<br>hnsw_ef=1024: Recall@k 0.88, avg latency 62.9, p95 latency 81.1, p99 latency ms 85.5, max latency ms 85.5<br>hnsw_ef=1536: Recall@k 0.94, avg latency 65.6, p95 latency 111.2, p99 latency ms 119.7, max latency ms 119.7<br>hnsw_ef=2048: Recall@k 0.98, avg latency 64.8, p95 latency 91.2, p99 latency ms 103.8, max latency ms 103.8 | Repeat with 100+ queries and collection-level HNSW build parameters before claiming a stable production SLO. |
27
- | Scale readiness profile | production-scale | implemented | WaveMind cluster planner: simulated memories 1000000, namespaces 4096, nodes 4, replication factor 2, node loss min availability 1.00, zone loss min availability 1.00, read quorum 1, write quorum 2, placement ms 115.8<br>WaveMind hot cache: queries 2000, capacity 512, hit rate 0.92, evictions 0, p99 lookup ms 0.00<br>WaveMind structured payloads: queries 4, precision@1 1.00, avg latency 0.84, p99 latency ms 1.07 | Move from single-node service profiles to namespace sharding and replicated service runs. |
27
+ | Scale readiness profile | production-scale | implemented | WaveMind cluster planner: simulated memories 1000000, namespaces 4096, nodes 4, replication factor 2, node loss min availability 1.00, zone loss min availability 1.00, read quorum 1, write quorum 2, placement ms 59.2<br>WaveMind hot cache: queries 2000, capacity 512, hit rate 0.92, evictions 0, p99 lookup ms 0.00<br>WaveMind replicated runtime: nodes 3, replication factor 3, write quorum 2, read quorum 1, recalled after node loss True, repair copied records 1, tombstone repair deleted records 1, p99 query after loss ms 1.29<br>WaveMind active-active delta sync: regions 2, replication factor per region 3, records imported 6, converged after bidirectional sync True, suppressed stale import after delete True, tombstone converged True, sync ms 112.5<br>WaveMind replicated snapshot: nodes 3, manifest healthy True, restored files 3, recalled after restore node loss True, snapshot ms 49.2, restore ms 11.4<br>WaveMind structured payloads: queries 4, precision@1 1.00, avg latency 0.45, p99 latency ms 0.67 | Move from local replicated runtime to service-backed replicated runs, scheduled/offsite snapshots, and larger 10M candidate-index load tests. |
28
28
  | Memory competitor adapter profile | agent-memory | implemented | WaveMind: precision@1 0.80, precision@3 1.00, stale suppression 1.00, avg latency 0.55, p95 latency 0.83<br>Mem0: skipped - Install Mem0 to run this adapter profile: pip install "mem0ai"<br>Zep: skipped - Install the Zep client package and set ZEP_API_KEY or ZEP_API_URL.<br>LangGraph persistent memory: skipped - Install LangGraph to run this adapter profile: pip install "langgraph" | Add documented setup commands for each competitor adapter and store checked-in results only when those real adapters run. |
29
29
  | [LongMemEval answer generation](https://github.com/xiaowu0162/LongMemEval) | long-term-agent-memory | implemented | extractive smoke: queries 20, evidence recall@k 1.00, exact match 0.00, contains answer 0.05, token f1 0.02, avg retrieval ms 3.79, avg generation ms 0.77<br>WaveMind + qwen2.5:0.5b: queries 50, evidence recall@k 0.92, exact match 0.12, contains answer 0.18, token f1 0.18, avg retrieval ms 2.98, avg generation ms 1428.2<br>Chroma static + qwen2.5:0.5b: queries 50, evidence recall@k 0.60, exact match 0.10, contains answer 0.12, token f1 0.13, avg retrieval ms 4.10, avg generation ms 1234.7<br>Qdrant static + qwen2.5:0.5b: queries 50, evidence recall@k 0.60, exact match 0.10, contains answer 0.12, token f1 0.13, avg retrieval ms 63.8, avg generation ms 893.5<br>WaveMind + qwen2.5:1.5b: queries 50, evidence recall@k 0.92, exact match 0.24, contains answer 0.38, token f1 0.33, avg retrieval ms 2.00, avg generation ms 2153.0<br>Chroma static + qwen2.5:1.5b: queries 50, evidence recall@k 0.60, exact match 0.12, contains answer 0.16, token f1 0.17, avg retrieval ms 7.05, avg generation ms 2082.4<br>Qdrant static + qwen2.5:1.5b: queries 50, evidence recall@k 0.60, exact match 0.12, contains answer 0.16, token f1 0.17, avg retrieval ms 100.2, avg generation ms 758.1 | Run all 470 non-abstention questions with a stronger local/API model and add faithfulness/abstention scoring. |
30
30
 
@@ -741,7 +741,7 @@
741
741
  "category": "production-scale",
742
742
  "status": "implemented",
743
743
  "source": "benchmarks/scale_readiness_benchmark.py",
744
- "dataset": "Deterministic 1M-memory simulation for namespace placement plus hot-cache and structured-payload retrieval checks.",
744
+ "dataset": "Deterministic 1M-memory simulation for namespace placement, quorum runtime, active-active delta sync, replicated snapshot/restore, hot-cache, and structured-payload retrieval checks.",
745
745
  "competitors": [
746
746
  "Mem0",
747
747
  "Zep",
@@ -764,24 +764,51 @@
764
764
  "zone_loss_min_availability": 1.0,
765
765
  "read_quorum": 1,
766
766
  "write_quorum": 2,
767
- "placement_ms": 115.80999998841435
767
+ "placement_ms": 59.205800003837794
768
768
  },
769
769
  "WaveMind hot cache": {
770
770
  "queries": 2000,
771
771
  "capacity": 512,
772
772
  "hit_rate": 0.92,
773
773
  "evictions": 0,
774
- "p99_lookup_ms": 0.003500026650726795
774
+ "p99_lookup_ms": 0.002700020559132099
775
+ },
776
+ "WaveMind replicated runtime": {
777
+ "nodes": 3,
778
+ "replication_factor": 3,
779
+ "write_quorum": 2,
780
+ "read_quorum": 1,
781
+ "recalled_after_node_loss": true,
782
+ "repair_copied_records": 1,
783
+ "tombstone_repair_deleted_records": 1,
784
+ "p99_query_after_loss_ms": 1.2885000323876739
785
+ },
786
+ "WaveMind active-active delta sync": {
787
+ "regions": 2,
788
+ "replication_factor_per_region": 3,
789
+ "records_imported": 6,
790
+ "converged_after_bidirectional_sync": true,
791
+ "suppressed_stale_import_after_delete": true,
792
+ "tombstone_converged": true,
793
+ "sync_ms": 112.49550001230091
794
+ },
795
+ "WaveMind replicated snapshot": {
796
+ "nodes": 3,
797
+ "manifest_healthy": true,
798
+ "restored_files": 3,
799
+ "recalled_after_restore_node_loss": true,
800
+ "snapshot_ms": 49.16400002548471,
801
+ "restore_ms": 11.4187000435777
775
802
  },
776
803
  "WaveMind structured payloads": {
777
804
  "queries": 4,
778
805
  "precision_at_1": 1.0,
779
- "avg_latency_ms": 0.8370749856112525,
780
- "p99_latency_ms": 1.0724000167101622
806
+ "avg_latency_ms": 0.44750000233761966,
807
+ "p99_latency_ms": 0.6680000224150717
781
808
  }
782
809
  },
783
- "target": "Prove the production foundation before heavier 100k, 1M, and 10M vector load tests: deterministic placement, survivable replicas, hot-cache behavior, and structured payload recall.",
784
- "next_step": "Move from single-node service profiles to namespace sharding and replicated service runs."
810
+ "target": "Prove the production foundation before heavier 100k, 1M, and 10M vector load tests: deterministic placement, survivable replicas, active-active sync, restore drills, hot-cache behavior, and structured payload recall.",
811
+ "next_step": "Move from local replicated runtime to service-backed replicated runs, scheduled/offsite snapshots, and larger 10M candidate-index load tests."
785
812
  },
786
813
  {
787
814
  "id": "memory_competitor_adapter_profile",
@@ -702,7 +702,7 @@ def _implemented_entries(root: Path) -> list[dict[str, Any]]:
702
702
  "category": "production-scale",
703
703
  "status": "implemented",
704
704
  "source": "benchmarks/scale_readiness_benchmark.py",
705
- "dataset": "Deterministic 1M-memory simulation for namespace placement plus hot-cache and structured-payload retrieval checks.",
705
+ "dataset": "Deterministic 1M-memory simulation for namespace placement, quorum runtime, active-active delta sync, replicated snapshot/restore, hot-cache, and structured-payload retrieval checks.",
706
706
  "competitors": ["Mem0", "Zep", "LangGraph persistent memory", "GraphRAG"],
707
707
  "metrics": [
708
708
  "node_loss_min_availability",
@@ -735,6 +735,42 @@ def _implemented_entries(root: Path) -> list[dict[str, Any]]:
735
735
  "p99_lookup_ms",
736
736
  ),
737
737
  ),
738
+ "WaveMind replicated runtime": _metric_summary(
739
+ scale_readiness_results.get("WaveMind replicated runtime"),
740
+ (
741
+ "nodes",
742
+ "replication_factor",
743
+ "write_quorum",
744
+ "read_quorum",
745
+ "recalled_after_node_loss",
746
+ "repair_copied_records",
747
+ "tombstone_repair_deleted_records",
748
+ "p99_query_after_loss_ms",
749
+ ),
750
+ ),
751
+ "WaveMind active-active delta sync": _metric_summary(
752
+ scale_readiness_results.get("WaveMind active-active delta sync"),
753
+ (
754
+ "regions",
755
+ "replication_factor_per_region",
756
+ "records_imported",
757
+ "converged_after_bidirectional_sync",
758
+ "suppressed_stale_import_after_delete",
759
+ "tombstone_converged",
760
+ "sync_ms",
761
+ ),
762
+ ),
763
+ "WaveMind replicated snapshot": _metric_summary(
764
+ scale_readiness_results.get("WaveMind replicated snapshot"),
765
+ (
766
+ "nodes",
767
+ "manifest_healthy",
768
+ "restored_files",
769
+ "recalled_after_restore_node_loss",
770
+ "snapshot_ms",
771
+ "restore_ms",
772
+ ),
773
+ ),
738
774
  "WaveMind structured payloads": _metric_summary(
739
775
  scale_readiness_results.get("WaveMind structured payloads"),
740
776
  (
@@ -745,8 +781,8 @@ def _implemented_entries(root: Path) -> list[dict[str, Any]]:
745
781
  ),
746
782
  ),
747
783
  },
748
- "target": "Prove the production foundation before heavier 100k, 1M, and 10M vector load tests: deterministic placement, survivable replicas, hot-cache behavior, and structured payload recall.",
749
- "next_step": "Move from single-node service profiles to namespace sharding and replicated service runs.",
784
+ "target": "Prove the production foundation before heavier 100k, 1M, and 10M vector load tests: deterministic placement, survivable replicas, active-active sync, restore drills, hot-cache behavior, and structured payload recall.",
785
+ "next_step": "Move from local replicated runtime to service-backed replicated runs, scheduled/offsite snapshots, and larger 10M candidate-index load tests.",
750
786
  },
751
787
  {
752
788
  "id": "memory_competitor_adapter_profile",
@@ -311,6 +311,65 @@ def run_active_active_delta_profile() -> dict[str, object]:
311
311
  region_b.close()
312
312
 
313
313
 
314
+ def run_replicated_snapshot_profile() -> dict[str, object]:
315
+ with tempfile.TemporaryDirectory() as directory:
316
+ root = Path(directory)
317
+ memory = ReplicatedWaveMind(
318
+ root_path=root / "replicas",
319
+ nodes=[
320
+ {"id": "node-a", "address": "127.0.0.1:8101", "zone": "zone-a"},
321
+ {"id": "node-b", "address": "127.0.0.1:8102", "zone": "zone-b"},
322
+ {"id": "node-c", "address": "127.0.0.1:8103", "zone": "zone-c"},
323
+ ],
324
+ replication_factor=3,
325
+ width=16,
326
+ height=16,
327
+ layers=1,
328
+ encoder=HashingTextEncoder(vector_dim=64),
329
+ )
330
+ restored = None
331
+ try:
332
+ namespace = "tenant:snapshot"
333
+ memory.remember(
334
+ "replicated snapshot restore survives node loss",
335
+ namespace=namespace,
336
+ )
337
+ snapshot_started = time.perf_counter()
338
+ snapshot = memory.snapshot(root / "snapshots")
339
+ snapshot_ms = (time.perf_counter() - snapshot_started) * 1000.0
340
+ health = ReplicatedWaveMind.verify_snapshot(snapshot.snapshot_path)
341
+
342
+ restore_started = time.perf_counter()
343
+ restored, restore = ReplicatedWaveMind.restore_snapshot(
344
+ snapshot.snapshot_path,
345
+ root / "restored",
346
+ width=16,
347
+ height=16,
348
+ layers=1,
349
+ encoder=HashingTextEncoder(vector_dim=64),
350
+ )
351
+ restore_ms = (time.perf_counter() - restore_started) * 1000.0
352
+ placement = restored.placement(namespace)
353
+ restored.set_node_available(placement.primary, False)
354
+ recalled_after_restore_loss = bool(
355
+ restored.query("snapshot restore node loss", namespace=namespace, top_k=1)
356
+ )
357
+ return {
358
+ "engine": "WaveMind replicated snapshot",
359
+ "nodes": len(snapshot.nodes),
360
+ "manifest_healthy": health["healthy"],
361
+ "total_bytes": snapshot.total_bytes,
362
+ "snapshot_ms": snapshot_ms,
363
+ "restore_ms": restore_ms,
364
+ "restored_files": len(restore.restored_files),
365
+ "recalled_after_restore_node_loss": recalled_after_restore_loss,
366
+ }
367
+ finally:
368
+ memory.close()
369
+ if restored is not None:
370
+ restored.close()
371
+
372
+
314
373
  def run_multimodal_profile() -> dict[str, object]:
315
374
  with tempfile.TemporaryDirectory() as directory:
316
375
  memory = WaveMind(
@@ -399,6 +458,7 @@ def run_benchmark(
399
458
  run_cache_profile(queries=cache_queries, capacity=cache_capacity),
400
459
  run_replication_runtime_profile(),
401
460
  run_active_active_delta_profile(),
461
+ run_replicated_snapshot_profile(),
402
462
  run_multimodal_profile(),
403
463
  ]
404
464
  return {
@@ -411,8 +471,9 @@ def run_benchmark(
411
471
  "description": (
412
472
  "Deterministic scale-readiness profile for cluster placement, "
413
473
  "node/zone loss simulation, quorum-replicated runtime behavior, "
414
- "active-active delta sync, hot-cache behavior, and structured "
415
- "payload retrieval. This is not a 10M-vector database load test."
474
+ "active-active delta sync, replicated snapshot/restore, hot-cache "
475
+ "behavior, and structured payload retrieval. This is not a "
476
+ "10M-vector database load test."
416
477
  ),
417
478
  },
418
479
  "results": results,
@@ -456,6 +517,9 @@ def main() -> int:
456
517
  elif result["engine"] == "WaveMind active-active delta sync":
457
518
  print(f"| active-active delta | converged | {result['converged_after_bidirectional_sync']} |")
458
519
  print(f"| active-active delta | tombstone_converged | {result['tombstone_converged']} |")
520
+ elif result["engine"] == "WaveMind replicated snapshot":
521
+ print(f"| replicated snapshot | manifest_healthy | {result['manifest_healthy']} |")
522
+ print(f"| replicated snapshot | recalled_after_restore_node_loss | {result['recalled_after_restore_node_loss']} |")
459
523
  else:
460
524
  print(f"| structured payloads | precision@1 | {result['precision_at_1']:.3f} |")
461
525
  print(f"\nWrote {args.output}")
@@ -5,7 +5,7 @@
5
5
  "namespace_count": 4096,
6
6
  "node_count": 4,
7
7
  "replication_factor": 2,
8
- "description": "Deterministic scale-readiness profile for cluster placement, node/zone loss simulation, quorum-replicated runtime behavior, active-active delta sync, hot-cache behavior, and structured payload retrieval. This is not a 10M-vector database load test."
8
+ "description": "Deterministic scale-readiness profile for cluster placement, node/zone loss simulation, quorum-replicated runtime behavior, active-active delta sync, replicated snapshot/restore, hot-cache behavior, and structured payload retrieval. This is not a 10M-vector database load test."
9
9
  },
10
10
  "results": [
11
11
  {
@@ -14,7 +14,7 @@
14
14
  "namespaces": 4096,
15
15
  "nodes": 4,
16
16
  "replication_factor": 2,
17
- "placement_ms": 55.97210000269115,
17
+ "placement_ms": 59.205800003837794,
18
18
  "max_replica_load": 2413,
19
19
  "min_replica_load": 1728,
20
20
  "replica_load_stdev": 316.54462560593254,
@@ -32,8 +32,8 @@
32
32
  "capacity": 512,
33
33
  "hit_rate": 0.92,
34
34
  "evictions": 0,
35
- "avg_lookup_ms": 0.001386649877531454,
36
- "p99_lookup_ms": 0.0026999623514711857
35
+ "avg_lookup_ms": 0.0014145502645988017,
36
+ "p99_lookup_ms": 0.002700020559132099
37
37
  },
38
38
  {
39
39
  "engine": "WaveMind replicated runtime",
@@ -47,8 +47,8 @@
47
47
  "tombstone_suppressed_before_repair": true,
48
48
  "tombstone_suppressed_after_repair": true,
49
49
  "tombstone_repair_deleted_records": 1,
50
- "avg_query_after_loss_ms": 1.3324000174179673,
51
- "p99_query_after_loss_ms": 1.3324000174179673
50
+ "avg_query_after_loss_ms": 1.2885000323876739,
51
+ "p99_query_after_loss_ms": 1.2885000323876739
52
52
  },
53
53
  {
54
54
  "engine": "WaveMind active-active delta sync",
@@ -56,11 +56,21 @@
56
56
  "replication_factor_per_region": 3,
57
57
  "records_imported": 6,
58
58
  "converged_after_bidirectional_sync": true,
59
- "sync_ms": 120.19369995687157,
59
+ "sync_ms": 112.49550001230091,
60
60
  "suppressed_stale_import_after_delete": true,
61
61
  "tombstone_deleted_records": 3,
62
62
  "tombstone_converged": true
63
63
  },
64
+ {
65
+ "engine": "WaveMind replicated snapshot",
66
+ "nodes": 3,
67
+ "manifest_healthy": true,
68
+ "total_bytes": 110592,
69
+ "snapshot_ms": 49.16400002548471,
70
+ "restore_ms": 11.4187000435777,
71
+ "restored_files": 3,
72
+ "recalled_after_restore_node_loss": true
73
+ },
64
74
  {
65
75
  "engine": "WaveMind structured payloads",
66
76
  "modalities": [
@@ -71,8 +81,8 @@
71
81
  ],
72
82
  "queries": 4,
73
83
  "precision_at_1": 1.0,
74
- "avg_latency_ms": 0.4274749953765422,
75
- "p99_latency_ms": 0.6231999723240733
84
+ "avg_latency_ms": 0.44750000233761966,
85
+ "p99_latency_ms": 0.6680000224150717
76
86
  }
77
87
  ]
78
88
  }
@@ -4,7 +4,7 @@ services:
4
4
  context: .
5
5
  args:
6
6
  INSTALL_OPTIONAL: "false"
7
- image: wavemind:2.2.6
7
+ image: wavemind:2.2.7
8
8
  restart: unless-stopped
9
9
  environment:
10
10
  WAVEMIND_DB: /data/wavemind.sqlite3
@@ -31,7 +31,7 @@ purpose-built vector databases.
31
31
  | NoMIRACL Russian retrieval | WaveMind reaches `nDCG@10 0.434`; Chroma reaches `0.435`; Qdrant reaches `0.433`. Chroma is faster. | `benchmarks/nomiracl_russian_results.json` | `python benchmarks/nomiracl_russian_benchmark.py --engines wavemind chroma qdrant --output benchmarks/nomiracl_russian_results.json` |
32
32
  | Production index profile | At 50000 vectors, persisted FAISS and Qdrant service both reach `recall@10 1.000`; pgvector with `ef_search=400` reaches `0.811`. | `benchmarks/production_index_profile_results.json` | `docker compose -f examples/production-index-profile/docker-compose.yml run --rm benchmark` |
33
33
  | Production load profile | At 100000 vectors, Qdrant service reaches `recall@10 1.000`, avg `10.28 ms`, p99 `21.26 ms`. At 1M vectors, tuned Qdrant reaches `0.984`, avg `116.80 ms`, p99 `209.28 ms`; the best EF-sweep point reaches `0.977`, avg `64.76 ms`, p99 `103.77 ms`. | `benchmarks/production_load_qdrant_100k_tuned_results.json`, `benchmarks/production_load_qdrant_1m_tuned_results.json`, `benchmarks/production_load_qdrant_1m_ef_sweep_results.json` | `python benchmarks/production_load_benchmark.py --sizes 100000 --engines qdrant-service` |
34
- | Scale readiness profile | Deterministic 1M-memory simulation: namespace placement survives node loss and zone loss at `1.000`, hot-cache hit rate is `0.920`, structured payload precision@1 is `1.000`. | `benchmarks/scale_readiness_results.json` | `python benchmarks/scale_readiness_benchmark.py --simulated-memories 1000000 --output benchmarks/scale_readiness_results.json` |
34
+ | Scale readiness profile | Deterministic 1M-memory simulation: namespace placement survives node loss and zone loss at `1.000`, hot-cache hit rate is `0.920`, replicated runtime survives node loss, active-active delta sync converges, replicated snapshot restore survives primary loss, structured payload precision@1 is `1.000`. | `benchmarks/scale_readiness_results.json` | `python benchmarks/scale_readiness_benchmark.py --simulated-memories 1000000 --output benchmarks/scale_readiness_results.json` |
35
35
  | Memory competitor adapter profile | WaveMind reaches `precision@1 0.80`, `precision@3 1.00`, stale suppression `1.00`; Mem0, Zep, and LangGraph are skipped until real packages/services are configured. | `benchmarks/memory_competitor_results.json` | `python benchmarks/memory_competitor_benchmark.py --engines wavemind mem0 zep langgraph` |
36
36
 
37
37
  The generated matrix view is in `benchmarks/BENCHMARK_REPORT.md`; the compact