massgen 0.1.3__py3-none-any.whl → 0.1.5__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- massgen/__init__.py +1 -1
- massgen/api_params_handler/_chat_completions_api_params_handler.py +4 -0
- massgen/api_params_handler/_claude_api_params_handler.py +4 -0
- massgen/api_params_handler/_gemini_api_params_handler.py +4 -0
- massgen/api_params_handler/_response_api_params_handler.py +4 -0
- massgen/backend/base_with_custom_tool_and_mcp.py +25 -5
- massgen/backend/docs/permissions_and_context_files.md +2 -2
- massgen/backend/response.py +2 -0
- massgen/chat_agent.py +340 -20
- massgen/cli.py +326 -19
- massgen/configs/README.md +92 -41
- massgen/configs/memory/gpt5mini_gemini_baseline_research_to_implementation.yaml +94 -0
- massgen/configs/memory/gpt5mini_gemini_context_window_management.yaml +187 -0
- massgen/configs/memory/gpt5mini_gemini_research_to_implementation.yaml +127 -0
- massgen/configs/memory/gpt5mini_high_reasoning_gemini.yaml +107 -0
- massgen/configs/memory/single_agent_compression_test.yaml +64 -0
- massgen/configs/tools/custom_tools/crawl4ai_example.yaml +55 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_file_generation_multi.yaml +61 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_file_generation_single.yaml +29 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_multi.yaml +51 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_single.yaml +33 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_speech_generation_multi.yaml +55 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_speech_generation_single.yaml +33 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_video_generation_multi.yaml +47 -0
- massgen/configs/tools/custom_tools/multimodal_tools/text_to_video_generation_single.yaml +29 -0
- massgen/configs/tools/custom_tools/multimodal_tools/understand_audio.yaml +1 -1
- massgen/configs/tools/custom_tools/multimodal_tools/understand_file.yaml +1 -1
- massgen/configs/tools/custom_tools/multimodal_tools/understand_image.yaml +1 -1
- massgen/configs/tools/custom_tools/multimodal_tools/understand_video.yaml +1 -1
- massgen/configs/tools/custom_tools/multimodal_tools/youtube_video_analysis.yaml +1 -1
- massgen/filesystem_manager/_filesystem_manager.py +1 -0
- massgen/filesystem_manager/_path_permission_manager.py +148 -0
- massgen/memory/README.md +277 -0
- massgen/memory/__init__.py +26 -0
- massgen/memory/_base.py +193 -0
- massgen/memory/_compression.py +237 -0
- massgen/memory/_context_monitor.py +211 -0
- massgen/memory/_conversation.py +255 -0
- massgen/memory/_fact_extraction_prompts.py +333 -0
- massgen/memory/_mem0_adapters.py +257 -0
- massgen/memory/_persistent.py +687 -0
- massgen/memory/docker-compose.qdrant.yml +36 -0
- massgen/memory/docs/DESIGN.md +388 -0
- massgen/memory/docs/QUICKSTART.md +409 -0
- massgen/memory/docs/SUMMARY.md +319 -0
- massgen/memory/docs/agent_use_memory.md +408 -0
- massgen/memory/docs/orchestrator_use_memory.md +586 -0
- massgen/memory/examples.py +237 -0
- massgen/message_templates.py +160 -12
- massgen/orchestrator.py +223 -7
- massgen/tests/memory/test_agent_compression.py +174 -0
- massgen/{configs/tools → tests}/memory/test_context_window_management.py +30 -30
- massgen/tests/memory/test_force_compression.py +154 -0
- massgen/tests/memory/test_simple_compression.py +147 -0
- massgen/tests/test_agent_memory.py +534 -0
- massgen/tests/test_binary_file_blocking.py +274 -0
- massgen/tests/test_case_studies.md +12 -12
- massgen/tests/test_conversation_memory.py +382 -0
- massgen/tests/test_multimodal_size_limits.py +407 -0
- massgen/tests/test_orchestrator_memory.py +620 -0
- massgen/tests/test_persistent_memory.py +435 -0
- massgen/token_manager/token_manager.py +6 -0
- massgen/tool/_manager.py +7 -2
- massgen/tool/_multimodal_tools/image_to_image_generation.py +293 -0
- massgen/tool/_multimodal_tools/text_to_file_generation.py +455 -0
- massgen/tool/_multimodal_tools/text_to_image_generation.py +222 -0
- massgen/tool/_multimodal_tools/text_to_speech_continue_generation.py +226 -0
- massgen/tool/_multimodal_tools/text_to_speech_transcription_generation.py +217 -0
- massgen/tool/_multimodal_tools/text_to_video_generation.py +223 -0
- massgen/tool/_multimodal_tools/understand_audio.py +19 -1
- massgen/tool/_multimodal_tools/understand_file.py +6 -1
- massgen/tool/_multimodal_tools/understand_image.py +112 -8
- massgen/tool/_multimodal_tools/understand_video.py +32 -5
- massgen/tool/_web_tools/crawl4ai_tool.py +718 -0
- massgen/tool/docs/multimodal_tools.md +589 -0
- massgen/tools/__init__.py +8 -0
- massgen/tools/_planning_mcp_server.py +520 -0
- massgen/tools/planning_dataclasses.py +434 -0
- {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/METADATA +142 -82
- {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/RECORD +84 -41
- massgen/configs/tools/custom_tools/crawl4ai_mcp_example.yaml +0 -67
- massgen/configs/tools/custom_tools/crawl4ai_multi_agent_example.yaml +0 -68
- massgen/configs/tools/memory/README.md +0 -199
- massgen/configs/tools/memory/gpt5mini_gemini_context_window_management.yaml +0 -131
- massgen/configs/tools/memory/gpt5mini_gemini_no_persistent_memory.yaml +0 -133
- massgen/configs/tools/multimodal/gpt5mini_gpt5nano_documentation_evolution.yaml +0 -97
- {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/WHEEL +0 -0
- {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/entry_points.txt +0 -0
- {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/licenses/LICENSE +0 -0
- {massgen-0.1.3.dist-info → massgen-0.1.5.dist-info}/top_level.txt +0 -0
@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: massgen
-Version: 0.1.
+Version: 0.1.5
 Summary: Multi-Agent Scaling System - A powerful framework for collaborative AI
 Author-email: MassGen Team <contact@massgen.dev>
 License: Apache-2.0
@@ -54,6 +54,8 @@ Requires-Dist: openpyxl>=3.1.5
 Requires-Dist: python-pptx>=1.0.2
 Requires-Dist: opencv-python>=4.12.0.88
 Requires-Dist: pypdf2>=3.0.1
+Requires-Dist: mem0ai>=1.0.0
+Requires-Dist: reportlab>=4.0.0
 Provides-Extra: dev
 Requires-Dist: pytest>=7.0.0; extra == "dev"
 Requires-Dist: pytest-cov>=4.0.0; extra == "dev"
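The two runtime dependencies added in this hunk can be confirmed mechanically by comparing the `Requires-Dist` headers of the two METADATA files. The following is a minimal sketch, assuming each file's text is already loaded as a string. The helper names `requires_dist` and `added_requirements` and the sample strings are illustrative and not part of massgen.

```python
def requires_dist(metadata_text: str) -> set[str]:
    """Collect Requires-Dist values from the header section of a METADATA file."""
    prefix = "Requires-Dist:"
    found: set[str] = set()
    for line in metadata_text.splitlines():
        if not line.strip():
            # Headers end at the first blank line; the README body follows.
            break
        if line.startswith(prefix):
            found.add(line[len(prefix):].strip())
    return found


def added_requirements(old_text: str, new_text: str) -> list[str]:
    """Return requirements present in new_text but absent from old_text, sorted."""
    return sorted(requires_dist(new_text) - requires_dist(old_text))


# Hypothetical excerpts shaped like the dependency hunk of this diff.
OLD = "Requires-Dist: pypdf2>=3.0.1\nProvides-Extra: dev\n"
NEW = (
    "Requires-Dist: pypdf2>=3.0.1\n"
    "Requires-Dist: mem0ai>=1.0.0\n"
    "Requires-Dist: reportlab>=4.0.0\n"
    "Provides-Extra: dev\n"
)

print(added_requirements(OLD, NEW))
```

Comparing sets rather than line positions keeps the check independent of where the new headers were inserted.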
@@ -88,23 +90,29 @@ Provides-Extra: all
 Dynamic: license-file

 <p align="center">
-<
+<picture>
+<source media="(prefers-color-scheme: dark)" srcset="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo-dark.png">
+<source media="(prefers-color-scheme: light)" srcset="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo.png">
+<img src="https://raw.githubusercontent.com/Leezekun/MassGen/main/assets/logo.png" alt="MassGen Logo" width="360" />
+</picture>
 </p>

-<
-
-
-
-
-
-
-
-
-
-
-
-
-
+<div align="center">
+
+[](https://docs.massgen.ai)
+[](https://github.com/Leezekun/MassGen)
+[](https://www.python.org/downloads/)
+[](LICENSE)
+
+</div>
+
+<div align="center">
+
+[](https://x.massgen.ai)
+[](https://www.linkedin.com/company/massgen-ai)
+[](https://discord.massgen.ai)
+
+</div>

 <h1 align="center">🚀 MassGen: Multi-Agent Scaling System for GenAI</h1>

@@ -114,7 +122,7 @@ Dynamic: license-file

 <p align="center">
 <a href="https://www.youtube.com/watch?v=Dp2oldJJImw">
-<img src="
+<img src="docs/source/_static/images/thumbnail.png" alt="MassGen case study -- Berkeley Agentic AI Summit Question" width="800">
 </a>
 </p>

@@ -143,7 +151,7 @@ This project started with the "threads of thought" and "iterative refinement" id
 <details open>
 <summary><h3>🆕 Latest Features</h3></summary>

-- [v0.1.
+- [v0.1.5 Features](#-latest-features-v015)
 </details>

 <details open>
@@ -188,15 +196,15 @@ This project started with the "threads of thought" and "iterative refinement" id
 <summary><h3>🗺️ Roadmap</h3></summary>

 - Recent Achievements
-- [v0.1.
-- [v0.0.3 - v0.1.
+- [v0.1.5](#recent-achievements-v015)
+- [v0.0.3 - v0.1.4](#previous-achievements-v003---v014)
 - [Key Future Enhancements](#key-future-enhancements)
 - Bug Fixes & Backend Improvements
 - Advanced Agent Collaboration
 - Expanded Model, Tool & Agent Integrations
 - Improved Performance & Scalability
 - Enhanced Developer Experience
-- [v0.1.
+- [v0.1.6 Roadmap](#v016-roadmap)
 </details>

 <details open>
@@ -221,36 +229,49 @@ This project started with the "threads of thought" and "iterative refinement" id

 ---

-## 🆕 Latest Features (v0.1.
+## 🆕 Latest Features (v0.1.5)

-**🎉 Released: October
+**🎉 Released: October 2025**

-**What's New in v0.1.
-- **🧠
--
--
+**What's New in v0.1.5:**
+- **🧠 Long-Term Memory System** - Semantic memory with retrieval across sessions
+- **🗜️ Automatic Context Compression** - Smart compression when approaching token limits
+- **🔄 Memory Sharing for Multi-Turn Conversations** - Agents access knowledge from previous turns

 **Key Improvements:**
--
--
--
--
--
+- Persistent memory via mem0 integration with vector storage
+- Conversational memory for short-term context tracking
+- Context monitoring with real-time token usage tracking
+- Session management for memory isolation and continuation
+- Qdrant vector database integration for semantic search

-**Get Started with v0.1.
+**Get Started with v0.1.5:**
 ```bash
 # Install or upgrade from PyPI
 pip install --upgrade massgen

-#
-
-
-
+# Multi-agent collaboration with context compression
+massgen --config @examples/memory/gpt5mini_gemini_context_window_management \
+"Analyze the MassGen codebase comprehensively. Create an architecture document that explains: (1) Core components and their responsibilities, (2) How different modules interact, (3) Key design patterns used, (4) Main entry points and request flows. Read > 30 files to build a complete understanding."
+
+# Research-to-implementation workflow with memory persistence
+# Prerequisites: Start Qdrant and crawl4ai Docker containers
+docker run -d -p 6333:6333 -p 6334:6334 \
+-v $(pwd)/.massgen/qdrant_storage:/qdrant/storage:z qdrant/qdrant
+docker run -d -p 11235:11235 --name crawl4ai --shm-size=1g unclecode/crawl4ai:latest

-#
-
-
-
+# Session 1 - Research phase:
+massgen --config @examples/memory/gpt5mini_gemini_research_to_implementation \
+"Use crawl4ai to research the latest multi-agent AI papers and techniques from 2025. Focus on: coordination mechanisms, voting strategies, tool-use patterns, and architectural innovations."
+
+# Session 2 - Implementation analysis (continue in same session):
+# "Based on the multi-agent research from earlier, which techniques should we implement in MassGen to make it more state-of-the-art? Consider MassGen's current architecture and what would be most impactful."
+
+→ See [Multi-Turn Persistent Memory Case Study](docs/source/examples/case_studies/multi-turn-persistent-memory.md) for detailed analysis
+
+# Test automatic context compression
+massgen --config @examples/memory/single_agent_compression_test \
+"Analyze the MassGen codebase comprehensively. Create an architecture document that explains: (1) Core components and their responsibilities, (2) How different modules interact, (3) Key design patterns used, (4) Main entry points and request flows. Read > 30 files to build a complete understanding."
 ```

 → [See full release history and examples](massgen/configs/README.md#release-history--examples)
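The research-to-implementation example in this hunk expects Qdrant on port 6333 and crawl4ai on port 11235, per its `docker run` lines. A pre-flight check can be sketched with the standard library alone. `port_is_open` is a hypothetical helper, not a massgen function, and `localhost` as the host is an assumption.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    # Ports taken from the docker commands in the README hunk; host is assumed.
    for name, port in (("qdrant", 6333), ("crawl4ai", 11235)):
        state = "reachable" if port_is_open("localhost", port) else "not reachable"
        print(f"{name} on port {port}: {state}")
```

An open port only shows that something is listening there. It does not confirm that the service behind it is healthy.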
@@ -472,17 +493,20 @@ MassGen agents can leverage various tools to enhance their problem-solving capab

 **Supported Built-in Tools by Backend:**

-| Backend | Live Search | Code Execution | File Operations | MCP Support | Multimodal
-
-| **Azure OpenAI** (NEW in v0.0.10) | ❌ | ❌ | ❌ | ❌ | ❌ | Code interpreter, Azure deployment management |
-| **Claude API** | ✅ | ✅ | ✅ | ✅ |
-| **Claude Code** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | **Native Claude Code SDK, comprehensive dev tools, MCP integration** |
-| **Gemini API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | Web search, code execution, **MCP integration**|
-| **Grok API** | ✅ | ❌ | ✅ | ✅ |
-| **OpenAI API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | Web search, code interpreter, **MCP integration** |
-| **ZAI API** | ❌ | ❌ | ✅ | ✅ |
-
-**
+| Backend | Live Search | Code Execution | File Operations | MCP Support | Multimodal Understanding | Multimodal Generation | Advanced Features |
+|---------|:-----------:|:--------------:|:---------------:|:-----------:|:------------------------:|:---------------------:|:-----------------|
+| **Azure OpenAI** (NEW in v0.0.10) | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | Code interpreter, Azure deployment management |
+| **Claude API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | Web search, code interpreter, **MCP integration** |
+| **Claude Code** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | **Native Claude Code SDK, comprehensive dev tools, MCP integration** |
+| **Gemini API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | Web search, code execution, **MCP integration**|
+| **Grok API** | ✅ | ❌ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | Web search, **MCP integration** |
+| **OpenAI API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | Web search, code interpreter, **MCP integration** |
+| **ZAI API** | ❌ | ❌ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | **MCP integration** |
+
+**Notes:**
+- **Multimodal Understanding** (NEW in v0.1.3): Analyze images, audio, video, and documents via custom tools using OpenAI GPT-4.1 - works with any backend
+- **Multimodal Generation** (NEW in v0.1.4): Generate images, videos, audio, and documents via custom tools using OpenAI APIs - works with any backend
+- See custom tool configurations: [`understand_image.yaml`](massgen/configs/tools/custom_tools/multimodal_tools/understand_image.yaml), [`text_to_image_generation_single.yaml`](massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_single.yaml)

 → For detailed backend capabilities and tool integration guides, see [User Guide - Backends](https://docs.massgen.ai/en/latest/user_guide/backends.html)

@@ -954,7 +978,7 @@ massgen --config @examples/tools/code-execution/multi_agent_playwright_automatio
 "Navigate to https://news.ycombinator.com, extract the top 10 stories, and create a summary report"
 ```

-→ [**See detailed case studies**](docs/case_studies/README.md) with real session logs and outcomes
+→ [**See detailed case studies**](docs/source/examples/case_studies/README.md) with real session logs and outcomes

 #### Interactive Mode & Advanced Usage

@@ -1075,7 +1099,11 @@ All sessions are automatically logged with detailed information for debugging an

 To see how MassGen works in practice, check out these detailed case studies based on real session logs:

-
+**Featured:**
+- [**Multi-Turn Persistent Memory**](docs/source/examples/case_studies/multi-turn-persistent-memory.md) - Research-to-implementation workflow demonstrating memory system (v0.1.5) | [📹 Watch Demo](https://youtu.be/wWxxFgyw40Y)
+
+**All Case Studies:**
+- [**MassGen Case Studies**](docs/source/examples/case_studies/README.md)
 - [**Case Studies Documentation**](https://docs.massgen.ai/en/latest/examples/case_studies.html) - Browse case studies online

 ---
@@ -1083,37 +1111,67 @@ To see how MassGen works in practice, check out these detailed case studies base

 ## 🗺️ Roadmap

-MassGen is currently in its foundational stage, with a focus on parallel, asynchronous multi-agent collaboration and orchestration. Our roadmap is centered on transforming this foundation into a highly robust, intelligent, and user-friendly system, while enabling frontier research and exploration.
+MassGen is currently in its foundational stage, with a focus on parallel, asynchronous multi-agent collaboration and orchestration. Our roadmap is centered on transforming this foundation into a highly robust, intelligent, and user-friendly system, while enabling frontier research and exploration.

 ⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.

-### Recent Achievements (v0.1.
+### Recent Achievements (v0.1.5)
+
+**🎉 Released: October 2025**
+
+#### Memory System
+- **PersistentMemory**: Long-term semantic memory storage via mem0 integration with fact extraction and retrieval across sessions
+- **ConversationMemory**: Short-term verbatim message tracking for active conversation context
+- **Automatic Context Compression**: Smart compression when approaching token limits with configurable thresholds (trigger_threshold, target_ratio)
+- **Cross-Agent Memory Sharing**: Agents share memory with turn-aware filtering to prevent temporal information leakage
+- **Context Monitoring**: Real-time token usage tracking with automatic compression triggers
+- **Session Management**: Memory isolation and continuation across runs with session naming support
+- **Qdrant Integration**: Vector database support for efficient semantic search in both server and local modes
+- **Configurable Memory Providers**: Support for OpenAI, Anthropic, Groq, and other mem0-compatible LLM and embedding providers
+
+#### Memory Configuration
+- **Global and Per-Agent Control**: Enable/disable memory at system or individual agent level
+- **Compression Settings**: Configurable trigger thresholds and target ratios for context window management
+- **Retrieval Configuration**: Customizable retrieval limits and smart filtering to exclude recent messages
+- **Memory Persistence**: Session continuation with named sessions for cross-session memory access

-
+#### Configuration Files
+- `gpt5mini_gemini_context_window_management.yaml` - Multi-agent with automatic context compression
+- `gpt5mini_gemini_research_to_implementation.yaml` - Research-to-implementation workflow with memory
+- `gpt5mini_high_reasoning_gemini.yaml` - High reasoning agents with memory integration
+- `gpt5mini_gemini_baseline_research_to_implementation.yaml` - Baseline research workflow for comparison
+- `single_agent_compression_test.yaml` - Single agent testing context compression behavior

-####
--
--
--
--
-- **Multi-Workspace Support**: Planning mode works across different workspaces without conflicts
-- **Test Coverage**: Comprehensive tests in `massgen/tests/test_intelligent_planning_mode.py`
-- **Documentation**: Complete guide in `docs/case_studies/INTELLIGENT_PLANNING_MODE.md`
+#### Documentation
+- Complete memory system user guide: `docs/source/user_guide/memory.rst`
+- Design decisions documentation explaining architectural choices
+- API reference for PersistentMemory, ConversationMemory, and ContextMonitor classes
+- Comprehensive troubleshooting guide and monitoring instructions

-####
--
-- **Model Priority Updates**: Reorganized Claude model list with updated defaults (`claude-sonnet-4-5-20250929`)
-- **Grok Web Search Fix**: Resolved `extra_body` parameter handling for Grok's Live Search API with new `_add_grok_search_params()` method
+#### Testing Infrastructure
+- Memory test suite: `test_agent_memory.py`, `test_conversation_memory.py`, `test_orchestrator_memory.py`, `test_persistent_memory.py`

-
-- **Planning Mode Configs**: Updated 5 configurations in `massgen/configs/tools/planning/` with selective blocking examples
-- **Default Configuration**: Updated `three_agents_default.yaml` with Grok-4-fast model
+### Previous Achievements (v0.0.3 - v0.1.4)

-
+✅ **Multimodal Generation Tools (v0.1.4)**: Create images from text via DALL-E API, generate videos from descriptions, text-to-speech with audio transcription support, document generation for PDF/DOCX/XLSX/PPTX formats, image transformation capabilities for existing images
+
+✅ **Binary File Protection (v0.1.4)**: Automatic blocking prevents text tools from accessing 40+ binary file types including images, videos, audio, archives, and Office documents, intelligent error messages guide users to appropriate specialized tools for binary content
+
+✅ **Crawl4AI Integration (v0.1.4)**: Intelligent web scraping with LLM-powered content extraction and customizable extraction patterns for structured data retrieval from websites
+
+✅ **Post-Evaluation Workflow (v0.1.3)**: Winning agents evaluate their own answers before submission with submit and restart capabilities, supports answer confirmation and orchestration restart with feedback across all backends
+
+✅ **Multimodal Understanding Tools (v0.1.3)**: Analyze images, transcribe audio, extract video frames, and process documents (PDF/DOCX/XLSX/PPTX) with structured JSON output, works across all backends via OpenAI GPT-4.1 integration
+
+✅ **Docker Sudo Mode (v0.1.3)**: Privileged command execution in Docker containers for system-level operations requiring elevated permissions
+
+✅ **Intelligent Planning Mode (v0.1.2)**: Automatic question analysis determining operation irreversibility via `_analyze_question_irreversibility()` in orchestrator, selective tool blocking with `set_planning_mode_blocked_tools()` and `is_mcp_tool_blocked()` methods, read-only MCP operations during coordination with write operations blocked, zero-configuration transparent operation, multi-workspace support
+
+✅ **Model Updates (v0.1.2)**: Claude 4.5 Haiku model `claude-haiku-4-5-20251001`, reorganized Claude model priorities with `claude-sonnet-4-5-20250929` default, Grok web search fix with `_add_grok_search_params()` method for proper `extra_body` parameter handling

 ✅ **Custom Tools System (v0.1.1)**: User-defined Python function registration using `ToolManager` class in `massgen/tool/_manager.py`, cross-backend support alongside MCP servers, builtin/MCP/custom tool categories with automatic discovery, 40+ examples in `massgen/configs/tools/custom_tools/`, voting sensitivity controls with three-tier quality system (lenient/balanced/strict), answer novelty detection preventing duplicates

-✅ **Backend Enhancements (v0.1.1)**: Gemini architecture refactoring with extracted MCP management (`gemini_mcp_manager.py`), tracking (`gemini_trackers.py`), and utilities, new capabilities registry in `massgen/backend/capabilities.py` documenting feature support across backends
+✅ **Backend Enhancements (v0.1.1)**: Gemini architecture refactoring with extracted MCP management (`gemini_mcp_manager.py`), tracking (`gemini_trackers.py`), and utilities, new capabilities registry in `massgen/backend/capabilities.py` documenting feature support across all backends

 ✅ **PyPI Package Release (v0.1.0)**: Official distribution via `pip install massgen` with simplified installation, global `massgen` command accessible from any directory, comprehensive Sphinx documentation at [docs.massgen.ai](https://docs.massgen.ai/), interactive setup wizard with use case presets and API key management, enhanced CLI with `@examples/` prefix for built-in configurations

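The README text in this hunk names two compression settings, `trigger_threshold` and `target_ratio`, without stating their semantics. The sketch below shows one plausible reading and is not massgen's implementation. It assumes compression starts once usage reaches `trigger_threshold` times the context window, and that the oldest messages are then dropped until usage fits within `target_ratio` times the window. The function names and the `(text, token_count)` message shape are hypothetical.

```python
Message = tuple[str, int]  # (text, token_count); hypothetical shape


def should_compress(used_tokens: int, context_window: int, trigger_threshold: float) -> bool:
    """True once usage reaches the assumed trigger fraction of the window."""
    return used_tokens >= trigger_threshold * context_window


def compress(messages: list[Message], context_window: int, target_ratio: float) -> list[Message]:
    """Drop the oldest messages until the total fits the assumed target budget."""
    budget = target_ratio * context_window
    kept = list(messages)  # oldest first; copy so the caller's list is untouched
    total = sum(tokens for _, tokens in kept)
    while kept and total > budget:
        _, tokens = kept.pop(0)
        total -= tokens
    return kept


history = [("a", 300), ("b", 300), ("c", 200)]
used = sum(tokens for _, tokens in history)

if should_compress(used, context_window=1000, trigger_threshold=0.75):
    history = compress(history, context_window=1000, target_ratio=0.4)

print(history)
```

Dropping the oldest messages is the simplest possible policy. A system with persistent memory, as described in the hunk, would more likely summarize or store evicted messages rather than discard them.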
@@ -1213,19 +1271,21 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch

 We welcome community contributions to achieve these goals.

-### v0.1.
+### v0.1.6 Roadmap

-Version 0.1.
+Version 0.1.6 focuses on backend code refactoring for improved maintainability and developer experience:

-####
-- **
-- **Final Agent Submit/Restart Tools**: Enable final agent to decide whether to submit or restart orchestration
+#### Planned Features
+- **Backend Code Refactoring**: Major code refactoring for improved maintainability and developer experience with better code organization, modularity, and architectural improvements

 Key technical approach:
-- **
-- **
+- **Code Architecture**: Enhanced code organization with improved modularity and separation of concerns
+- **Developer Experience**: Simplified backend extension points, improved API clarity, better error handling and debugging support
+- **Quality Assurance**: Comprehensive testing to ensure no functionality regressions
+
+**Target Release**: November 1, 2025 (Friday @ 9am PT)

-For detailed milestones and technical specifications, see the [full v0.1.
+For detailed milestones and technical specifications, see the [full v0.1.6 roadmap](ROADMAP_v0.1.6.md).

 ---
