massgen 0.1.3__py3-none-any.whl → 0.1.5__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release: this version of massgen might be problematic.

Files changed (90)
  1. massgen/__init__.py +1 -1
  2. massgen/api_params_handler/_chat_completions_api_params_handler.py +4 -0
  3. massgen/api_params_handler/_claude_api_params_handler.py +4 -0
  4. massgen/api_params_handler/_gemini_api_params_handler.py +4 -0
  5. massgen/api_params_handler/_response_api_params_handler.py +4 -0
  6. massgen/backend/base_with_custom_tool_and_mcp.py +25 -5
  7. massgen/backend/docs/permissions_and_context_files.md +2 -2
  8. massgen/backend/response.py +2 -0
  9. massgen/chat_agent.py +340 -20
  10. massgen/cli.py +326 -19
  11. massgen/configs/README.md +92 -41
  12. massgen/configs/memory/gpt5mini_gemini_baseline_research_to_implementation.yaml +94 -0
  13. massgen/configs/memory/gpt5mini_gemini_context_window_management.yaml +187 -0
  14. massgen/configs/memory/gpt5mini_gemini_research_to_implementation.yaml +127 -0
  15. massgen/configs/memory/gpt5mini_high_reasoning_gemini.yaml +107 -0
  16. massgen/configs/memory/single_agent_compression_test.yaml +64 -0
  17. massgen/configs/tools/custom_tools/crawl4ai_example.yaml +55 -0
  18. massgen/configs/tools/custom_tools/multimodal_tools/text_to_file_generation_multi.yaml +61 -0
  19. massgen/configs/tools/custom_tools/multimodal_tools/text_to_file_generation_single.yaml +29 -0
  20. massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_multi.yaml +51 -0
  21. massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_single.yaml +33 -0
  22. massgen/configs/tools/custom_tools/multimodal_tools/text_to_speech_generation_multi.yaml +55 -0
  23. massgen/configs/tools/custom_tools/multimodal_tools/text_to_speech_generation_single.yaml +33 -0
  24. massgen/configs/tools/custom_tools/multimodal_tools/text_to_video_generation_multi.yaml +47 -0
  25. massgen/configs/tools/custom_tools/multimodal_tools/text_to_video_generation_single.yaml +29 -0
  26. massgen/configs/tools/custom_tools/multimodal_tools/understand_audio.yaml +1 -1
  27. massgen/configs/tools/custom_tools/multimodal_tools/understand_file.yaml +1 -1
  28. massgen/configs/tools/custom_tools/multimodal_tools/understand_image.yaml +1 -1
  29. massgen/configs/tools/custom_tools/multimodal_tools/understand_video.yaml +1 -1
  30. massgen/configs/tools/custom_tools/multimodal_tools/youtube_video_analysis.yaml +1 -1
  31. massgen/filesystem_manager/_filesystem_manager.py +1 -0
  32. massgen/filesystem_manager/_path_permission_manager.py +148 -0
  33. massgen/memory/README.md +277 -0
  34. massgen/memory/__init__.py +26 -0
  35. massgen/memory/_base.py +193 -0
  36. massgen/memory/_compression.py +237 -0
  37. massgen/memory/_context_monitor.py +211 -0
  38. massgen/memory/_conversation.py +255 -0
  39. massgen/memory/_fact_extraction_prompts.py +333 -0
  40. massgen/memory/_mem0_adapters.py +257 -0
  41. massgen/memory/_persistent.py +687 -0
  42. massgen/memory/docker-compose.qdrant.yml +36 -0
  43. massgen/memory/docs/DESIGN.md +388 -0
  44. massgen/memory/docs/QUICKSTART.md +409 -0
  45. massgen/memory/docs/SUMMARY.md +319 -0
  46. massgen/memory/docs/agent_use_memory.md +408 -0
  47. massgen/memory/docs/orchestrator_use_memory.md +586 -0
  48. massgen/memory/examples.py +237 -0
  49. massgen/message_templates.py +160 -12
  50. massgen/orchestrator.py +223 -7
  51. massgen/tests/memory/test_agent_compression.py +174 -0
  52. massgen/{configs/tools → tests}/memory/test_context_window_management.py +30 -30
  53. massgen/tests/memory/test_force_compression.py +154 -0
  54. massgen/tests/memory/test_simple_compression.py +147 -0
  55. massgen/tests/test_agent_memory.py +534 -0
  56. massgen/tests/test_binary_file_blocking.py +274 -0
  57. massgen/tests/test_case_studies.md +12 -12
  58. massgen/tests/test_conversation_memory.py +382 -0
  59. massgen/tests/test_multimodal_size_limits.py +407 -0
  60. massgen/tests/test_orchestrator_memory.py +620 -0
  61. massgen/tests/test_persistent_memory.py +435 -0
  62. massgen/token_manager/token_manager.py +6 -0
  63. massgen/tool/_manager.py +7 -2
  64. massgen/tool/_multimodal_tools/image_to_image_generation.py +293 -0
  65. massgen/tool/_multimodal_tools/text_to_file_generation.py +455 -0
  66. massgen/tool/_multimodal_tools/text_to_image_generation.py +222 -0
  67. massgen/tool/_multimodal_tools/text_to_speech_continue_generation.py +226 -0
  68. massgen/tool/_multimodal_tools/text_to_speech_transcription_generation.py +217 -0
  69. massgen/tool/_multimodal_tools/text_to_video_generation.py +223 -0
  70. massgen/tool/_multimodal_tools/understand_audio.py +19 -1
  71. massgen/tool/_multimodal_tools/understand_file.py +6 -1
  72. massgen/tool/_multimodal_tools/understand_image.py +112 -8
  73. massgen/tool/_multimodal_tools/understand_video.py +32 -5
  74. massgen/tool/_web_tools/crawl4ai_tool.py +718 -0
  75. massgen/tool/docs/multimodal_tools.md +589 -0
  76. massgen/tools/__init__.py +8 -0
  77. massgen/tools/_planning_mcp_server.py +520 -0
  78. massgen/tools/planning_dataclasses.py +434 -0
  79. {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/METADATA +142 -82
  80. {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/RECORD +84 -41
  81. massgen/configs/tools/custom_tools/crawl4ai_mcp_example.yaml +0 -67
  82. massgen/configs/tools/custom_tools/crawl4ai_multi_agent_example.yaml +0 -68
  83. massgen/configs/tools/memory/README.md +0 -199
  84. massgen/configs/tools/memory/gpt5mini_gemini_context_window_management.yaml +0 -131
  85. massgen/configs/tools/memory/gpt5mini_gemini_no_persistent_memory.yaml +0 -133
  86. massgen/configs/tools/multimodal/gpt5mini_gpt5nano_documentation_evolution.yaml +0 -97
  87. {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/WHEEL +0 -0
  88. {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/entry_points.txt +0 -0
  89. {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/licenses/LICENSE +0 -0
  90. {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/top_level.txt +0 -0
@@ -1,6 +1,6 @@
  Metadata-Version: 2.4
  Name: massgen
- Version: 0.1.3
+ Version: 0.1.5
  Summary: Multi-Agent Scaling System - A powerful framework for collaborative AI
  Author-email: MassGen Team <contact@massgen.dev>
  License: Apache-2.0
@@ -54,6 +54,8 @@ Requires-Dist: openpyxl>=3.1.5
  Requires-Dist: python-pptx>=1.0.2
  Requires-Dist: opencv-python>=4.12.0.88
  Requires-Dist: pypdf2>=3.0.1
+ Requires-Dist: mem0ai>=1.0.0
+ Requires-Dist: reportlab>=4.0.0
  Provides-Extra: dev
  Requires-Dist: pytest>=7.0.0; extra == "dev"
  Requires-Dist: pytest-cov>=4.0.0; extra == "dev"
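
Note on the hunk above: it adds two hard requirements, `mem0ai>=1.0.0` and `reportlab>=4.0.0`. A minimal sketch of how an existing environment could be checked against those minimums before upgrading is shown below. The helper names (`parse_version`, `meets_minimum`, `check_requirements`) are illustrative and not part of massgen, and the version parsing is deliberately simplistic (numeric release segments only); `packaging.version.Version` would be the more robust choice where that library is available.

```python
from importlib import metadata

# Minimums copied from the Requires-Dist lines added in this diff.
NEW_REQUIREMENTS = {"mem0ai": "1.0.0", "reportlab": "4.0.0"}


def parse_version(text: str) -> tuple[int, ...]:
    """Parse the leading numeric release segments, e.g. '4.12.0rc1' -> (4, 12, 0)."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed: str, minimum: str) -> bool:
    """Compare two release strings, padding with zeros so '1.0' equals '1.0.0'."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_requirements(requirements=NEW_REQUIREMENTS, lookup=metadata.version):
    """Return a {package: status} report; `lookup` is injectable for testing."""
    report = {}
    for name, minimum in requirements.items():
        try:
            installed = lookup(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        report[name] = "ok" if meets_minimum(installed, minimum) else f"too old ({installed})"
    return report


if __name__ == "__main__":
    for package, status in check_requirements().items():
        print(f"{package}: {status}")
```

Running the module prints one status line per package for the current interpreter's environment.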
@@ -88,23 +90,29 @@ Provides-Extra: all
  Dynamic: license-file
 
  <p align="center">
- <img src="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo.png" alt="MassGen Logo" width="360" />
+ <picture>
+ <source media="(prefers-color-scheme: dark)" srcset="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo-dark.png">
+ <source media="(prefers-color-scheme: light)" srcset="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo.png">
+ <img src="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo.png" alt="MassGen Logo" width="360" />
+ </picture>
  </p>
 
- <p align="center">
- <a href="https://www.python.org/downloads/">
- <img src="https://img.shields.io/badge/python-3.11+-blue.svg" alt="Python 3.11+" style="margin-right: 5px;">
- </a>
- <a href="LICENSE">
- <img src="https://img.shields.io/badge/license-Apache%202.0-blue.svg" alt="License" style="margin-right: 5px;">
- </a>
- <a href="https://docs.massgen.ai">
- <img src="https://img.shields.io/badge/docs-massgen.ai-blue.svg" alt="Documentation" style="margin-right: 5px;">
- </a>
- <a href="https://discord.massgen.ai">
- <img src="https://img.shields.io/discord/1153072414184452236?color=7289da&label=chat&logo=discord&style=flat-square" alt="Join our Discord">
- </a>
- </p>
+ <div align="center">
+
+ [![Docs](https://img.shields.io/badge/docs-massgen.ai-blue?style=flat-square&logo=readthedocs&logoColor=white)](https://docs.massgen.ai)
+ [![GitHub Stars](https://img.shields.io/github/stars/Leezekun/MassGen?style=flat-square&logo=github&color=181717&logoColor=white)](https://github.com/Leezekun/MassGen)
+ [![Python 3.11+](https://img.shields.io/badge/python-3.11+-3776AB?style=flat-square&logo=python&logoColor=white)](https://www.python.org/downloads/)
+ [![License](https://img.shields.io/badge/license-Apache%202.0-green?style=flat-square)](LICENSE)
+
+ </div>
+
+ <div align="center">
+
+ [![Follow on X](https://img.shields.io/badge/FOLLOW%20ON%20X-000000?style=for-the-badge&logo=x&logoColor=white)](https://x.massgen.ai)
+ [![Follow on LinkedIn](https://img.shields.io/badge/FOLLOW%20ON%20LINKEDIN-0A66C2?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/company/massgen-ai)
+ [![Join our Discord](https://img.shields.io/badge/JOIN%20OUR%20DISCORD-5865F2?style=for-the-badge&logo=discord&logoColor=white)](https://discord.massgen.ai)
+
+ </div>
 
  <h1 align="center">🚀 MassGen: Multi-Agent Scaling System for GenAI</h1>
 
@@ -114,7 +122,7 @@ Dynamic: license-file
 
  <p align="center">
  <a href="https://www.youtube.com/watch?v=Dp2oldJJImw">
- <img src="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/thumbnail.png" alt="MassGen case study -- Berkeley Agentic AI Summit Question" width="800">
+ <img src="docs/source/_static/images/thumbnail.png" alt="MassGen case study -- Berkeley Agentic AI Summit Question" width="800">
  </a>
  </p>
 
@@ -143,7 +151,7 @@ This project started with the "threads of thought" and "iterative refinement" id
  <details open>
  <summary><h3>🆕 Latest Features</h3></summary>
 
- - [v0.1.2 Features](#-latest-features-v012)
+ - [v0.1.5 Features](#-latest-features-v015)
  </details>
 
  <details open>
@@ -188,15 +196,15 @@ This project started with the "threads of thought" and "iterative refinement" id
  <summary><h3>🗺️ Roadmap</h3></summary>
 
  - Recent Achievements
- - [v0.1.2](#recent-achievements-v012)
- - [v0.0.3 - v0.1.1](#previous-achievements-v003---v011)
+ - [v0.1.5](#recent-achievements-v015)
+ - [v0.0.3 - v0.1.4](#previous-achievements-v003---v014)
  - [Key Future Enhancements](#key-future-enhancements)
  - Bug Fixes & Backend Improvements
  - Advanced Agent Collaboration
  - Expanded Model, Tool & Agent Integrations
  - Improved Performance & Scalability
  - Enhanced Developer Experience
- - [v0.1.3 Roadmap](#v013-roadmap)
+ - [v0.1.6 Roadmap](#v016-roadmap)
  </details>
 
  <details open>
@@ -221,36 +229,49 @@ This project started with the "threads of thought" and "iterative refinement" id
 
  ---
 
- ## 🆕 Latest Features (v0.1.2)
+ ## 🆕 Latest Features (v0.1.5)
 
- **🎉 Released: October 22, 2025**
+ **🎉 Released: October 2025**
 
- **What's New in v0.1.2:**
- - **🧠 Intelligent Planning Mode** - Automatic question analysis for safe MCP tool blocking
- - **🎭 Claude 4.5 Haiku Support** - Access to latest Claude Haiku model
- - **🔍 Grok Web Search Fix** - Improved web search functionality in Grok backend
+ **What's New in v0.1.5:**
+ - **🧠 Long-Term Memory System** - Semantic memory with retrieval across sessions
+ - **🗜️ Automatic Context Compression** - Smart compression when approaching token limits
+ - **🔄 Memory Sharing for Multi-Turn Conversations** - Agents access knowledge from previous turns
 
  **Key Improvements:**
- - Automatically determines if questions require irreversible operations
- - Read-only MCP operations allowed during coordination for better decisions
- - Write operations automatically blocked for safety
- - Zero configuration required - works transparently
- - Enhanced model support with latest Claude 4.5 Haiku
+ - Persistent memory via mem0 integration with vector storage
+ - Conversational memory for short-term context tracking
+ - Context monitoring with real-time token usage tracking
+ - Session management for memory isolation and continuation
+ - Qdrant vector database integration for semantic search
 
- **Get Started with v0.1.2:**
+ **Get Started with v0.1.5:**
  ```bash
  # Install or upgrade from PyPI
  pip install --upgrade massgen
 
- # Try intelligent planning mode with MCP tools
- # (Please read the YAML file for required API keys: DISCORD_TOKEN, OPENAI_API_KEY, etc.)
- massgen --config @examples/tools/planning/five_agents_discord_mcp_planning_mode \
- "Check recent messages in our development channel, summarize the discussion, and post a helpful response about the current topic."
+ # Multi-agent collaboration with context compression
+ massgen --config @examples/memory/gpt5mini_gemini_context_window_management \
+ "Analyze the MassGen codebase comprehensively. Create an architecture document that explains: (1) Core components and their responsibilities, (2) How different modules interact, (3) Key design patterns used, (4) Main entry points and request flows. Read > 30 files to build a complete understanding."
+
+ # Research-to-implementation workflow with memory persistence
+ # Prerequisites: Start Qdrant and crawl4ai Docker containers
+ docker run -d -p 6333:6333 -p 6334:6334 \
+ -v $(pwd)/.massgen/qdrant_storage:/qdrant/storage:z qdrant/qdrant
+ docker run -d -p 11235:11235 --name crawl4ai --shm-size=1g unclecode/crawl4ai:latest
 
- # Use latest Claude 4.5 Haiku model
- # (Requires ANTHROPIC_API_KEY in .env)
- massgen --model claude-haiku-4-5-20251001 \
- "Summarize the latest AI developments"
+ # Session 1 - Research phase:
+ massgen --config @examples/memory/gpt5mini_gemini_research_to_implementation \
+ "Use crawl4ai to research the latest multi-agent AI papers and techniques from 2025. Focus on: coordination mechanisms, voting strategies, tool-use patterns, and architectural innovations."
+
+ # Session 2 - Implementation analysis (continue in same session):
+ # "Based on the multi-agent research from earlier, which techniques should we implement in MassGen to make it more state-of-the-art? Consider MassGen's current architecture and what would be most impactful."
+
+ → See [Multi-Turn Persistent Memory Case Study](docs/source/examples/case_studies/multi-turn-persistent-memory.md) for detailed analysis
+
+ # Test automatic context compression
+ massgen --config @examples/memory/single_agent_compression_test \
+ "Analyze the MassGen codebase comprehensively. Create an architecture document that explains: (1) Core components and their responsibilities, (2) How different modules interact, (3) Key design patterns used, (4) Main entry points and request flows. Read > 30 files to build a complete understanding."
  ```
 
  → [See full release history and examples](massgen/configs/README.md#release-history--examples)
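
Note on the hunk above: the new README text describes automatic context compression that fires "when approaching token limits", and a later hunk in this diff names two settings for it, `trigger_threshold` and `target_ratio`. The sketch below is one plausible reading of how such a pair of settings could interact. It is not massgen's implementation: the `Message` type, the function names, the default values (0.75 and 0.4), and the choice to drop the oldest non-system messages first are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Message:
    role: str
    content: str
    tokens: int  # supplied by the caller; no particular tokenizer is assumed


def should_compress(used_tokens: int, context_window: int, trigger_threshold: float = 0.75) -> bool:
    """True once usage reaches the given fraction of the context window."""
    return used_tokens >= trigger_threshold * context_window


def compress(messages, context_window, trigger_threshold=0.75, target_ratio=0.4):
    """Drop the oldest non-system messages until usage is at most
    target_ratio * context_window. Returns (kept, dropped)."""
    used = sum(m.tokens for m in messages)
    if not should_compress(used, context_window, trigger_threshold):
        return list(messages), []

    budget = target_ratio * context_window
    kept = list(messages)
    dropped = []
    i = 0
    while used > budget and i < len(kept):
        if kept[i].role == "system":
            i += 1  # hypothetical policy: system prompts are never dropped
            continue
        removed = kept.pop(i)
        dropped.append(removed)
        used -= removed.tokens
    return kept, dropped


if __name__ == "__main__":
    history = [
        Message("system", "You are a helpful agent.", 100),
        Message("user", "first question", 300),
        Message("assistant", "first answer", 300),
        Message("user", "second question", 200),
    ]
    kept, dropped = compress(history, context_window=1000)
    print([m.content for m in kept])
    print([m.content for m in dropped])
```

In a design like the one the README describes, the dropped messages would presumably be handed to the long-term memory store rather than discarded, but that step is outside this sketch.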
@@ -472,17 +493,20 @@ MassGen agents can leverage various tools to enhance their problem-solving capab
 
  **Supported Built-in Tools by Backend:**
 
- | Backend | Live Search | Code Execution | File Operations | MCP Support | Multimodal (Image/Audio/Video) | Advanced Features |
- |---------|:-----------:|:--------------:|:---------------:|:-----------:|:----------:|:-----------------|
- | **Azure OpenAI** (NEW in v0.0.10) | ❌ | ❌ | ❌ | ❌ | ❌ | Code interpreter, Azure deployment management |
- | **Claude API** | ✅ | ✅ | ✅ | ✅ | | Web search, code interpreter, **MCP integration** |
- | **Claude Code** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | **Native Claude Code SDK, comprehensive dev tools, MCP integration** |
- | **Gemini API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | Web search, code execution, **MCP integration**|
- | **Grok API** | ✅ | ❌ | ✅ | ✅ | | Web search, **MCP integration** |
- | **OpenAI API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | Web search, code interpreter, **MCP integration** |
- | **ZAI API** | ❌ | ❌ | ✅ | ✅ | | **MCP integration** |
-
- **Note:** Audio/video multimodal support (NEW in v0.0.30) is available through Chat Completions-based providers like OpenRouter and Qwen API. See configuration examples: [`single_openrouter_audio_understanding.yaml`](massgen/configs/basic/single/single_openrouter_audio_understanding.yaml), [`single_qwen_video_understanding.yaml`](massgen/configs/basic/single/single_qwen_video_understanding.yaml)
+ | Backend | Live Search | Code Execution | File Operations | MCP Support | Multimodal Understanding | Multimodal Generation | Advanced Features |
+ |---------|:-----------:|:--------------:|:---------------:|:-----------:|:------------------------:|:---------------------:|:-----------------|
+ | **Azure OpenAI** (NEW in v0.0.10) | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | Code interpreter, Azure deployment management |
+ | **Claude API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | Web search, code interpreter, **MCP integration** |
+ | **Claude Code** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | **Native Claude Code SDK, comprehensive dev tools, MCP integration** |
+ | **Gemini API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | Web search, code execution, **MCP integration**|
+ | **Grok API** | ✅ | ❌ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | Web search, **MCP integration** |
+ | **OpenAI API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | Web search, code interpreter, **MCP integration** |
+ | **ZAI API** | ❌ | ❌ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | **MCP integration** |
+
+ **Notes:**
+ - **Multimodal Understanding** (NEW in v0.1.3): Analyze images, audio, video, and documents via custom tools using OpenAI GPT-4.1 - works with any backend
+ - **Multimodal Generation** (NEW in v0.1.4): Generate images, videos, audio, and documents via custom tools using OpenAI APIs - works with any backend
+ - See custom tool configurations: [`understand_image.yaml`](massgen/configs/tools/custom_tools/multimodal_tools/understand_image.yaml), [`text_to_image_generation_single.yaml`](massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_single.yaml)
 
  → For detailed backend capabilities and tool integration guides, see [User Guide - Backends](https://docs.massgen.ai/en/latest/user_guide/backends.html)
 
@@ -954,7 +978,7 @@ massgen --config @examples/tools/code-execution/multi_agent_playwright_automatio
  "Navigate to https://news.ycombinator.com, extract the top 10 stories, and create a summary report"
  ```
 
- → [**See detailed case studies**](docs/case_studies/README.md) with real session logs and outcomes
+ → [**See detailed case studies**](docs/source/examples/case_studies/README.md) with real session logs and outcomes
 
  #### Interactive Mode & Advanced Usage
 
@@ -1075,7 +1099,11 @@ All sessions are automatically logged with detailed information for debugging an
 
  To see how MassGen works in practice, check out these detailed case studies based on real session logs:
 
- - [**MassGen Case Studies**](docs/case_studies/README.md)
+ **Featured:**
+ - [**Multi-Turn Persistent Memory**](docs/source/examples/case_studies/multi-turn-persistent-memory.md) - Research-to-implementation workflow demonstrating memory system (v0.1.5) | [📹 Watch Demo](https://youtu.be/wWxxFgyw40Y)
+
+ **All Case Studies:**
+ - [**MassGen Case Studies**](docs/source/examples/case_studies/README.md)
  - [**Case Studies Documentation**](https://docs.massgen.ai/en/latest/examples/case_studies.html) - Browse case studies online
 
  ---
@@ -1083,37 +1111,67 @@ To see how MassGen works in practice, check out these detailed case studies base
 
  ## 🗺️ Roadmap
 
- MassGen is currently in its foundational stage, with a focus on parallel, asynchronous multi-agent collaboration and orchestration. Our roadmap is centered on transforming this foundation into a highly robust, intelligent, and user-friendly system, while enabling frontier research and exploration. An earlier version of MassGen can be found [here](./massgen/v1).
+ MassGen is currently in its foundational stage, with a focus on parallel, asynchronous multi-agent collaboration and orchestration. Our roadmap is centered on transforming this foundation into a highly robust, intelligent, and user-friendly system, while enabling frontier research and exploration.
 
  ⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
 
- ### Recent Achievements (v0.1.2)
+ ### Recent Achievements (v0.1.5)
+
+ **🎉 Released: October 2025**
+
+ #### Memory System
+ - **PersistentMemory**: Long-term semantic memory storage via mem0 integration with fact extraction and retrieval across sessions
+ - **ConversationMemory**: Short-term verbatim message tracking for active conversation context
+ - **Automatic Context Compression**: Smart compression when approaching token limits with configurable thresholds (trigger_threshold, target_ratio)
+ - **Cross-Agent Memory Sharing**: Agents share memory with turn-aware filtering to prevent temporal information leakage
+ - **Context Monitoring**: Real-time token usage tracking with automatic compression triggers
+ - **Session Management**: Memory isolation and continuation across runs with session naming support
+ - **Qdrant Integration**: Vector database support for efficient semantic search in both server and local modes
+ - **Configurable Memory Providers**: Support for OpenAI, Anthropic, Groq, and other mem0-compatible LLM and embedding providers
+
+ #### Memory Configuration
+ - **Global and Per-Agent Control**: Enable/disable memory at system or individual agent level
+ - **Compression Settings**: Configurable trigger thresholds and target ratios for context window management
+ - **Retrieval Configuration**: Customizable retrieval limits and smart filtering to exclude recent messages
+ - **Memory Persistence**: Session continuation with named sessions for cross-session memory access
 
- **🎉 Released: October 22, 2025**
+ #### Configuration Files
+ - `gpt5mini_gemini_context_window_management.yaml` - Multi-agent with automatic context compression
+ - `gpt5mini_gemini_research_to_implementation.yaml` - Research-to-implementation workflow with memory
+ - `gpt5mini_high_reasoning_gemini.yaml` - High reasoning agents with memory integration
+ - `gpt5mini_gemini_baseline_research_to_implementation.yaml` - Baseline research workflow for comparison
+ - `single_agent_compression_test.yaml` - Single agent testing context compression behavior
 
- #### Intelligent Planning Mode
- - **Automatic Question Analysis**: New `_analyze_question_irreversibility()` method in orchestrator determines if MCP operations are reversible
- - **Selective Tool Blocking**: Granular control with `set_planning_mode_blocked_tools()`, `get_planning_mode_blocked_tools()`, and `is_mcp_tool_blocked()` methods
- - **Dynamic Behavior**: Read-only MCP operations allowed during coordination, write operations blocked for safety
- - **Zero Configuration**: Works transparently without setup
- - **Multi-Workspace Support**: Planning mode works across different workspaces without conflicts
- - **Test Coverage**: Comprehensive tests in `massgen/tests/test_intelligent_planning_mode.py`
- - **Documentation**: Complete guide in `docs/case_studies/INTELLIGENT_PLANNING_MODE.md`
+ #### Documentation
+ - Complete memory system user guide: `docs/source/user_guide/memory.rst`
+ - Design decisions documentation explaining architectural choices
+ - API reference for PersistentMemory, ConversationMemory, and ContextMonitor classes
+ - Comprehensive troubleshooting guide and monitoring instructions
 
- #### Model Support & Improvements
- - **Claude 4.5 Haiku**: Added latest Claude Haiku model `claude-haiku-4-5-20251001`
- - **Model Priority Updates**: Reorganized Claude model list with updated defaults (`claude-sonnet-4-5-20250929`)
- - **Grok Web Search Fix**: Resolved `extra_body` parameter handling for Grok's Live Search API with new `_add_grok_search_params()` method
+ #### Testing Infrastructure
+ - Memory test suite: `test_agent_memory.py`, `test_conversation_memory.py`, `test_orchestrator_memory.py`, `test_persistent_memory.py`
 
- #### Configuration Updates
- - **Planning Mode Configs**: Updated 5 configurations in `massgen/configs/tools/planning/` with selective blocking examples
- - **Default Configuration**: Updated `three_agents_default.yaml` with Grok-4-fast model
+ ### Previous Achievements (v0.0.3 - v0.1.4)
 
- ### Previous Achievements (v0.0.3 - v0.1.1)
+ **Multimodal Generation Tools (v0.1.4)**: Create images from text via DALL-E API, generate videos from descriptions, text-to-speech with audio transcription support, document generation for PDF/DOCX/XLSX/PPTX formats, image transformation capabilities for existing images
+
+ ✅ **Binary File Protection (v0.1.4)**: Automatic blocking prevents text tools from accessing 40+ binary file types including images, videos, audio, archives, and Office documents, intelligent error messages guide users to appropriate specialized tools for binary content
+
+ ✅ **Crawl4AI Integration (v0.1.4)**: Intelligent web scraping with LLM-powered content extraction and customizable extraction patterns for structured data retrieval from websites
+
+ ✅ **Post-Evaluation Workflow (v0.1.3)**: Winning agents evaluate their own answers before submission with submit and restart capabilities, supports answer confirmation and orchestration restart with feedback across all backends
+
+ ✅ **Multimodal Understanding Tools (v0.1.3)**: Analyze images, transcribe audio, extract video frames, and process documents (PDF/DOCX/XLSX/PPTX) with structured JSON output, works across all backends via OpenAI GPT-4.1 integration
+
+ ✅ **Docker Sudo Mode (v0.1.3)**: Privileged command execution in Docker containers for system-level operations requiring elevated permissions
+
+ ✅ **Intelligent Planning Mode (v0.1.2)**: Automatic question analysis determining operation irreversibility via `_analyze_question_irreversibility()` in orchestrator, selective tool blocking with `set_planning_mode_blocked_tools()` and `is_mcp_tool_blocked()` methods, read-only MCP operations during coordination with write operations blocked, zero-configuration transparent operation, multi-workspace support
+
+ ✅ **Model Updates (v0.1.2)**: Claude 4.5 Haiku model `claude-haiku-4-5-20251001`, reorganized Claude model priorities with `claude-sonnet-4-5-20250929` default, Grok web search fix with `_add_grok_search_params()` method for proper `extra_body` parameter handling
 
  ✅ **Custom Tools System (v0.1.1)**: User-defined Python function registration using `ToolManager` class in `massgen/tool/_manager.py`, cross-backend support alongside MCP servers, builtin/MCP/custom tool categories with automatic discovery, 40+ examples in `massgen/configs/tools/custom_tools/`, voting sensitivity controls with three-tier quality system (lenient/balanced/strict), answer novelty detection preventing duplicates
 
- ✅ **Backend Enhancements (v0.1.1)**: Gemini architecture refactoring with extracted MCP management (`gemini_mcp_manager.py`), tracking (`gemini_trackers.py`), and utilities, new capabilities registry in `massgen/backend/capabilities.py` documenting feature support across backends
+ ✅ **Backend Enhancements (v0.1.1)**: Gemini architecture refactoring with extracted MCP management (`gemini_mcp_manager.py`), tracking (`gemini_trackers.py`), and utilities, new capabilities registry in `massgen/backend/capabilities.py` documenting feature support across all backends
 
  ✅ **PyPI Package Release (v0.1.0)**: Official distribution via `pip install massgen` with simplified installation, global `massgen` command accessible from any directory, comprehensive Sphinx documentation at [docs.massgen.ai](https://docs.massgen.ai/), interactive setup wizard with use case presets and API key management, enhanced CLI with `@examples/` prefix for built-in configurations
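
Note on the hunk above: the "Cross-Agent Memory Sharing" bullet says agents share memory "with turn-aware filtering to prevent temporal information leakage". The README text does not spell out the rule, so the sketch below shows only one possible interpretation: an agent can always read its own records, but records written by other agents become visible only once their turn has finished. `MemoryRecord` and `visible_memories` are hypothetical names, not massgen APIs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MemoryRecord:
    agent_id: str
    turn: int  # conversation turn in which the record was written
    text: str


def visible_memories(records, requesting_agent: str, current_turn: int):
    """Return the records `requesting_agent` may read during `current_turn`.

    Assumed rule: own records are always visible; other agents' records are
    visible only if they come from a strictly earlier turn, so an agent cannot
    peek at what a peer is producing in the turn still in progress.
    """
    return [
        r
        for r in records
        if r.agent_id == requesting_agent or r.turn < current_turn
    ]


if __name__ == "__main__":
    store = [
        MemoryRecord("agent_a", 1, "turn-1 finding from A"),
        MemoryRecord("agent_b", 1, "turn-1 finding from B"),
        MemoryRecord("agent_b", 2, "turn-2 draft from B"),
        MemoryRecord("agent_a", 2, "turn-2 draft from A"),
    ]
    for record in visible_memories(store, "agent_a", current_turn=2):
        print(record.text)
```

Filtering at read time keeps a single shared store while still giving each agent a view that is consistent with what it could legitimately have known at that point in the conversation.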
 
@@ -1213,19 +1271,21 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
 
  We welcome community contributions to achieve these goals.
 
- ### v0.1.3 Roadmap
+ ### v0.1.6 Roadmap
 
- Version 0.1.3 focuses on general interoperability and enterprise collaboration:
+ Version 0.1.6 focuses on backend code refactoring for improved maintainability and developer experience:
 
- #### Required Features
- - **General Interoperability**: Enable MassGen to orchestrate agents from multiple external frameworks with unified interface
- - **Final Agent Submit/Restart Tools**: Enable final agent to decide whether to submit or restart orchestration
+ #### Planned Features
+ - **Backend Code Refactoring**: Major code refactoring for improved maintainability and developer experience with better code organization, modularity, and architectural improvements
 
  Key technical approach:
- - **Framework Integration**: Multi-agent coordination supporting external agent frameworks with specialized agent roles (researcher, analyst, critic, synthesizer)
- - **Submit/Restart**: Multi-step task verification with access to previous agents' responses and workspaces
+ - **Code Architecture**: Enhanced code organization with improved modularity and separation of concerns
+ - **Developer Experience**: Simplified backend extension points, improved API clarity, better error handling and debugging support
+ - **Quality Assurance**: Comprehensive testing to ensure no functionality regressions
+
+ **Target Release**: November 1, 2025 (Friday @ 9am PT)
 
- For detailed milestones and technical specifications, see the [full v0.1.3 roadmap](ROADMAP.md).
+ For detailed milestones and technical specifications, see the [full v0.1.6 roadmap](ROADMAP_v0.1.6.md).
 
  ---