PyPI - massgen - Versions diffs - 0.1.4__py3-none-any.whl → 0.1.6__py3-none-any.whl - Mend

massgen 0.1.4py3-none-any.whl → 0.1.6py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.

This version of massgen might be problematic. Click here for more details.

Files changed (84) hide show

massgen/__init__.py +1 -1
massgen/backend/base_with_custom_tool_and_mcp.py +453 -23
massgen/backend/capabilities.py +39 -0
massgen/backend/chat_completions.py +111 -197
massgen/backend/claude.py +210 -181
massgen/backend/gemini.py +1015 -1559
massgen/backend/grok.py +3 -2
massgen/backend/response.py +160 -220
massgen/chat_agent.py +340 -20
massgen/cli.py +399 -25
massgen/config_builder.py +20 -54
massgen/config_validator.py +931 -0
massgen/configs/README.md +95 -10
massgen/configs/memory/gpt5mini_gemini_baseline_research_to_implementation.yaml +94 -0
massgen/configs/memory/gpt5mini_gemini_context_window_management.yaml +187 -0
massgen/configs/memory/gpt5mini_gemini_research_to_implementation.yaml +127 -0
massgen/configs/memory/gpt5mini_high_reasoning_gemini.yaml +107 -0
massgen/configs/memory/single_agent_compression_test.yaml +64 -0
massgen/configs/tools/custom_tools/claude_code_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/claude_custom_tool_example_no_path.yaml +1 -1
massgen/configs/tools/custom_tools/claude_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/computer_use_browser_example.yaml +1 -1
massgen/configs/tools/custom_tools/computer_use_docker_example.yaml +1 -1
massgen/configs/tools/custom_tools/gemini_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/gpt5_nano_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/gpt_oss_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/grok3_mini_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/interop/ag2_and_langgraph_lesson_planner.yaml +65 -0
massgen/configs/tools/custom_tools/interop/ag2_and_openai_assistant_lesson_planner.yaml +65 -0
massgen/configs/tools/custom_tools/interop/ag2_lesson_planner_example.yaml +48 -0
massgen/configs/tools/custom_tools/interop/agentscope_lesson_planner_example.yaml +48 -0
massgen/configs/tools/custom_tools/interop/langgraph_lesson_planner_example.yaml +49 -0
massgen/configs/tools/custom_tools/interop/openai_assistant_lesson_planner_example.yaml +50 -0
massgen/configs/tools/custom_tools/interop/smolagent_lesson_planner_example.yaml +49 -0
massgen/configs/tools/custom_tools/qwen_api_custom_tool_with_mcp_example.yaml +1 -0
massgen/configs/tools/custom_tools/two_models_with_tools_example.yaml +44 -0
massgen/formatter/_gemini_formatter.py +61 -15
massgen/memory/README.md +277 -0
massgen/memory/__init__.py +26 -0
massgen/memory/_base.py +193 -0
massgen/memory/_compression.py +237 -0
massgen/memory/_context_monitor.py +211 -0
massgen/memory/_conversation.py +255 -0
massgen/memory/_fact_extraction_prompts.py +333 -0
massgen/memory/_mem0_adapters.py +257 -0
massgen/memory/_persistent.py +687 -0
massgen/memory/docker-compose.qdrant.yml +36 -0
massgen/memory/docs/DESIGN.md +388 -0
massgen/memory/docs/QUICKSTART.md +409 -0
massgen/memory/docs/SUMMARY.md +319 -0
massgen/memory/docs/agent_use_memory.md +408 -0
massgen/memory/docs/orchestrator_use_memory.md +586 -0
massgen/memory/examples.py +237 -0
massgen/orchestrator.py +207 -7
massgen/tests/memory/test_agent_compression.py +174 -0
massgen/tests/memory/test_context_window_management.py +286 -0
massgen/tests/memory/test_force_compression.py +154 -0
massgen/tests/memory/test_simple_compression.py +147 -0
massgen/tests/test_ag2_lesson_planner.py +223 -0
massgen/tests/test_agent_memory.py +534 -0
massgen/tests/test_config_validator.py +1156 -0
massgen/tests/test_conversation_memory.py +382 -0
massgen/tests/test_langgraph_lesson_planner.py +223 -0
massgen/tests/test_orchestrator_memory.py +620 -0
massgen/tests/test_persistent_memory.py +435 -0
massgen/token_manager/token_manager.py +6 -0
massgen/tool/__init__.py +2 -9
massgen/tool/_decorators.py +52 -0
massgen/tool/_extraframework_agents/ag2_lesson_planner_tool.py +251 -0
massgen/tool/_extraframework_agents/agentscope_lesson_planner_tool.py +303 -0
massgen/tool/_extraframework_agents/langgraph_lesson_planner_tool.py +275 -0
massgen/tool/_extraframework_agents/openai_assistant_lesson_planner_tool.py +247 -0
massgen/tool/_extraframework_agents/smolagent_lesson_planner_tool.py +180 -0
massgen/tool/_manager.py +102 -16
massgen/tool/_registered_tool.py +3 -0
massgen/tool/_result.py +3 -0
{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/METADATA +138 -77
{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/RECORD +82 -37
massgen/backend/gemini_mcp_manager.py +0 -545
massgen/backend/gemini_trackers.py +0 -344
{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/WHEEL +0 -0
{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/entry_points.txt +0 -0
{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/licenses/LICENSE +0 -0
{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/top_level.txt +0 -0

massgen/tool/_manager.py CHANGED Viewed

@@ -220,17 +220,11 @@ class ToolManager:
         if description:
             tool_schema["function"]["description"] = description
-        # Remove preset args from schema
-        for arg in preset_args or {}:
-            if arg in tool_schema["function"]["parameters"]["properties"]:
-                tool_schema["function"]["parameters"]["properties"].pop(arg)
-            if "required" in tool_schema["function"]["parameters"]:
-                if arg in tool_schema["function"]["parameters"]["required"]:
-                    tool_schema["function"]["parameters"]["required"].remove(arg)
+        # Extract context param names from decorator
+        context_param_names = getattr(base_func, "__context_params__", set())
-                if not tool_schema["function"]["parameters"]["required"]:
-                    tool_schema["function"]["parameters"].pop("required", None)
+        # Remove preset args and context params from schema
+        self._remove_params_from_schema(tool_schema, set(preset_args or {}) | context_param_names)
         tool_entry = RegisteredToolEntry(
             tool_name=tool_name,
@@ -239,6 +233,7 @@ class ToolManager:
             base_function=base_func,
             schema_def=tool_schema,
             preset_params=preset_args or {},
+            context_param_names=context_param_names,
             extension_model=None,
             post_processor=post_processor,
         )
@@ -290,11 +285,13 @@ class ToolManager:
     async def execute_tool(
         self,
         tool_request: dict,
+        execution_context: Optional[Dict[str, Any]] = None,
     ) -> AsyncGenerator[ExecutionResult, None]:
         """Execute a tool and return results as async generator.
         Args:
             tool_request: Tool execution request with name and input
+            execution_context: Optional execution context (messages, agent_id, etc.)
         Yields:
             ExecutionResult objects (accumulated)
@@ -313,13 +310,28 @@ class ToolManager:
         tool_entry = self.registered_tools[tool_name]
-        # Merge parameters: model input first, then preset params override
-        # This ensures preset_params (like agent_cwd) always take precedence
-        # and won't be overridden by null values from model
-        model_input = tool_request.get("input", {}) or {}
+        # Extract context values for marked params only
+        context_values = {}
+        if execution_context and tool_entry.context_param_names:
+            context_values = {k: v for k, v in execution_context.items() if k in tool_entry.context_param_names}
+        # Validate all parameters match function signature
+        self._validate_params_match_signature(
+            tool_entry.base_function,
+            tool_entry.preset_params,
+            tool_entry.context_param_names,
+            tool_request,
+            tool_name,
+        )
+        # Merge all parameters (validation ensures all are valid):
+        # 1. Static preset params (from registration)
+        # 2. Dynamic context values (from execution_context, marked by decorator)
+        # 3. LLM input (from tool request)
         exec_kwargs = {
-            **model_input,
-            **tool_entry.preset_params,  # preset_params override model input
+            **tool_entry.preset_params,
+            **context_values,
+            **(tool_request.get("input", {}) or {}),
         }
         # Prepare post-processor if exists
@@ -372,6 +384,73 @@ class ToolManager:
                 f"Tool must return ExecutionResult or Generator, got {type(result)}",
             )
+    @staticmethod
+    def _validate_params_match_signature(
+        func: Callable,
+        preset_params: dict,
+        context_param_names: set,
+        tool_request: dict,
+        tool_name: str,
+    ) -> None:
+        """Validate that all provided parameters match function signature.
+        Args:
+            func: The function to validate against
+            preset_params: Static preset parameters
+            context_param_names: Context parameter names from decorator
+            tool_request: Tool request with LLM input
+            tool_name: Tool name for error messages
+        Raises:
+            ValueError: If any provided parameter doesn't match function signature
+        """
+        sig = inspect.signature(func)
+        valid_params = set(sig.parameters.keys())
+        # Check preset args
+        invalid_preset = set(preset_params.keys()) - valid_params
+        if invalid_preset:
+            raise ValueError(
+                f"Tool '{tool_name}': preset_args contains invalid parameters: {invalid_preset}. " f"Valid parameters: {valid_params}",
+            )
+        # Check context params
+        invalid_context = context_param_names - valid_params
+        if invalid_context:
+            raise ValueError(
+                f"Tool '{tool_name}': @context_params decorator specifies invalid parameters: {invalid_context}. " f"Valid parameters: {valid_params}",
+            )
+        # Check LLM input
+        llm_input = tool_request.get("input", {}) or {}
+        invalid_llm = set(llm_input.keys()) - valid_params
+        if invalid_llm:
+            raise ValueError(
+                f"Tool '{tool_name}': LLM provided invalid parameters: {invalid_llm}. " f"Valid parameters: {valid_params}",
+            )
+    @staticmethod
+    def _remove_params_from_schema(tool_schema: dict, param_names: set) -> None:
+        """Remove parameters from tool schema (for preset args and context params).
+        Args:
+            tool_schema: The tool schema to modify
+            param_names: Set of parameter names to remove
+        """
+        for arg in param_names:
+            # Remove from properties
+            if arg in tool_schema["function"]["parameters"]["properties"]:
+                tool_schema["function"]["parameters"]["properties"].pop(arg)
+            # Remove from required list
+            if "required" in tool_schema["function"]["parameters"]:
+                if arg in tool_schema["function"]["parameters"]["required"]:
+                    tool_schema["function"]["parameters"]["required"].remove(arg)
+                # Clean up empty required list
+                if not tool_schema["function"]["parameters"]["required"]:
+                    tool_schema["function"]["parameters"].pop("required", None)
     def fetch_category_hints(self) -> str:
         """Get usage hints from active categories.
@@ -570,12 +649,19 @@ class ToolManager:
         func_desc = "\n\n".join(desc_parts)
+        # Get context param names to exclude from schema
+        context_param_names = getattr(func, "__context_params__", set())
         # Build parameter fields
         param_fields = {}
         for param_name, param_info in inspect.signature(func).parameters.items():
             if param_name in ["self", "cls"]:
                 continue
+            # Skip context params (they'll be injected at runtime)
+            if param_name in context_param_names:
+                continue
             if param_info.kind == inspect.Parameter.VAR_KEYWORD:
                 if not include_varkwargs:
                     continue

massgen/tool/_registered_tool.py CHANGED Viewed

@@ -32,6 +32,9 @@ class RegisteredToolEntry:
     preset_params: dict[str, Any] = field(default_factory=dict)
     """Pre-configured parameters hidden from schema."""
+    context_param_names: set[str] = field(default_factory=set)
+    """Parameter names to inject from execution context at runtime."""
     extension_model: Optional[Type[BaseModel]] = None
     """Optional model for extending the base schema."""

massgen/tool/_result.py CHANGED Viewed

@@ -59,6 +59,9 @@ class ExecutionResult:
     is_final: bool = True
     """Indicates if this is the final result in a stream."""
+    is_log: bool = False
+    """Indicates if this result is for logging purposes only."""
     was_interrupted: bool = False
     """Indicates if the execution was interrupted."""

{massgen-0.1.4.dist-info → massgen-0.1.6.dist-info}/METADATA RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: massgen
-Version: 0.1.4
+Version: 0.1.6
 Summary: Multi-Agent Scaling System - A powerful framework for collaborative AI
 Author-email: MassGen Team <contact@massgen.dev>
 License: Apache-2.0
@@ -49,11 +49,17 @@ Requires-Dist: ag2>=0.9.10
 Requires-Dist: pyautogen>=0.10.0
 Requires-Dist: vertexai>=1.71.1
 Requires-Dist: pytest>=8.4.2
+Requires-Dist: langchain-openai>=1.0.0
+Requires-Dist: langgraph>=1.0.0
+Requires-Dist: langchain-core>=1.0.0
+Requires-Dist: agentscope>=1.0.6
+Requires-Dist: smolagents[litellm]>=1.22.0
 Requires-Dist: python-docx>=1.2.0
 Requires-Dist: openpyxl>=3.1.5
 Requires-Dist: python-pptx>=1.0.2
 Requires-Dist: opencv-python>=4.12.0.88
 Requires-Dist: pypdf2>=3.0.1
+Requires-Dist: mem0ai>=1.0.0
 Requires-Dist: reportlab>=4.0.0
 Provides-Extra: dev
 Requires-Dist: pytest>=7.0.0; extra == "dev"
@@ -121,12 +127,12 @@ Dynamic: license-file
 <p align="center">
   <a href="https://www.youtube.com/watch?v=Dp2oldJJImw">
-    <img src="assets/thumbnail.png" alt="MassGen case study -- Berkeley Agentic AI Summit Question" width="800">
+    <img src="docs/source/_static/images/thumbnail.png" alt="MassGen case study -- Berkeley Agentic AI Summit Question" width="800">
   </a>
 </p>
 <p align="center">
-  <i>Multi-agent scaling through intelligent collaboration in Grok Heavy style</i>
+  <i>Scaling AI with collaborative, continuously improving agents</i>
 </p>
 MassGen is a cutting-edge multi-agent system that leverages the power of collaborative AI to solve complex tasks. It assigns a task to multiple AI agents who work in parallel, observe each other's progress, and refine their approaches to converge on the best solution to deliver a comprehensive and high-quality result. The power of this "parallel study group" approach is exemplified by advanced systems like xAI's Grok Heavy and Google DeepMind's Gemini Deep Think.
@@ -150,7 +156,7 @@ This project started with the "threads of thought" and "iterative refinement" id
 <details open>
 <summary><h3>🆕 Latest Features</h3></summary>
-- [v0.1.4 Features](#-latest-features-v014)
+- [v0.1.6 Features](#-latest-features-v016)
 </details>
 <details open>
@@ -195,16 +201,15 @@ This project started with the "threads of thought" and "iterative refinement" id
 <summary><h3>🗺️ Roadmap</h3></summary>
 - Recent Achievements
-  - [v0.1.4](#recent-achievements-v014)
-  - [v0.1.3](#recent-achievements-v013)
-  - [v0.0.3 - v0.1.2](#previous-achievements-v003---v012)
+  - [v0.1.6](#recent-achievements-v016)
+  - [v0.0.3 - v0.1.5](#previous-achievements-v003---v015)
 - [Key Future Enhancements](#key-future-enhancements)
   - Bug Fixes & Backend Improvements
   - Advanced Agent Collaboration
   - Expanded Model, Tool & Agent Integrations
   - Improved Performance & Scalability
   - Enhanced Developer Experience
-- [v0.1.5 Roadmap](#v015-roadmap)
+- [v0.1.7 Roadmap](#v017-roadmap)
 </details>
 <details open>
@@ -229,37 +234,54 @@ This project started with the "threads of thought" and "iterative refinement" id
 ---
-## 🆕 Latest Features (v0.1.4)
+## 🆕 Latest Features (v0.1.6)
-**🎉 Released: October 27, 2025**
+**🎉 Released: October 31, 2025**
-**What's New in v0.1.4:**
-- **🎨 Multimodal Generation Tools** - Create images, videos, audio, and documents with AI
-- **🔒 Binary File Protection** - Automatic security preventing accidental binary file reads
-- **🕷️ Crawl4AI Integration** - Intelligent web scraping with LLM-powered extraction
+**What's New in v0.1.6:**
+- **🔗 Framework Interoperability** - Use agents from AG2, LangGraph, AgentScope, OpenAI Assistants, and SmoLAgent as tools
+- **✅ Configuration Validator** - Pre-flight YAML validation with detailed error messages and suggestions
+- **🔧 Unified Tool Execution** - Streamlined backend architecture with consistent tool handling
+- **⚡ Gemini Backend Simplification** - Major cleanup reducing codebase by 1,598 lines
 **Key Improvements:**
-- 6 new generation tools: text-to-image, text-to-video, text-to-speech, text-to-file, image-to-image
-- Binary file protection for 40+ file types with smart tool suggestions
-- Web crawling with customizable extraction patterns
-- Enhanced documentation and automation infrastructure
+- External agent frameworks work as MassGen custom tools
+- Comprehensive config validation with pre-commit hooks
+- ToolExecutionConfig dataclass for standardized tool handling across backends
+- Simplified Gemini backend with improved maintainability
+- Enhanced ToolManager with category management
-**Get Started with v0.1.4:**
+**Try v0.1.6 Features:**
 ```bash
 # Install or upgrade from PyPI
 pip install --upgrade massgen
-# Generate an image from text
-massgen --config @examples/tools/custom_tools/multimodal_tools/text_to_image_generation_single \
-  "Please generate an image of a cat in space."
+# Use AG2 agents as tools for lesson planning (supports streaming)
+# Requirements: pip install pyautogen, OPENAI_API_KEY must be set
+massgen --config massgen/configs/tools/custom_tools/ag2_lesson_planner_example.yaml "Create a lesson plan for photosynthesis"
-# Generate a video from text
-massgen --config @examples/tools/custom_tools/multimodal_tools/text_to_video_generation_single \
-  "Generate a 4 seconds video with neon-lit alley at night, light rain, slow push-in, cinematic."
+# Use LangGraph workflows as tools
+# Requirements: pip install langgraph langchain-openai langchain-core, OPENAI_API_KEY must be set
+massgen --config massgen/configs/tools/custom_tools/langgraph_lesson_planner_example.yaml "Create a lesson plan for photosynthesis"
-# Generate documents (PDF, DOCX, etc.)
-massgen --config @examples/tools/custom_tools/multimodal_tools/text_to_file_generation_single \
-  "Please generate a comprehensive technical report about the latest developments in Large Language Models (LLMs)."
+# Use AgentScope multi-agent framework as tools
+# Requirements: pip install agentscope, OPENAI_API_KEY must be set
+massgen --config massgen/configs/tools/custom_tools/agentscope_lesson_planner_example.yaml "Create a lesson plan for photosynthesis"
+# Use OpenAI Assistants API as tools
+# Requirements: pip install openai, OPENAI_API_KEY must be set
+massgen --config massgen/configs/tools/custom_tools/openai_assistant_lesson_planner_example.yaml "Create a lesson plan for photosynthesis"
+# Use SmolAgent (HuggingFace) as tools
+# Requirements: pip install smolagents, OPENAI_API_KEY must be set
+massgen --config massgen/configs/tools/custom_tools/smolagent_lesson_planner_example.yaml "Create a lesson plan for photosynthesis"
+# Combine multiple frameworks - AG2 + LangGraph collaboration
+# Requirements: pip install pyautogen langgraph langchain-openai langchain-core, OPENAI_API_KEY must be set
+massgen --config massgen/configs/tools/custom_tools/ag2_and_langgraph_lesson_planner.yaml "Create a lesson plan for photosynthesis"
+# Validate your configuration before running
+python -m massgen.config_validator your_config.yaml
 ```
 → [See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -481,17 +503,20 @@ MassGen agents can leverage various tools to enhance their problem-solving capab
 **Supported Built-in Tools by Backend:**
-| Backend | Live Search | Code Execution | File Operations | MCP Support | Multimodal (Image/Audio/Video) | Advanced Features |
-|---------|:-----------:|:--------------:|:---------------:|:-----------:|:----------:|:-----------------|
-| **Azure OpenAI** (NEW in v0.0.10) | ❌ | ❌ | ❌ | ❌ | ❌ | Code interpreter, Azure deployment management |
-| **Claude API**  | ✅ | ✅ | ✅ | ✅ | ✅ | Web search, code interpreter, **MCP integration** |
-| **Claude Code** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | **Native Claude Code SDK, comprehensive dev tools, MCP integration** |
-| **Gemini API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | Web search, code execution, **MCP integration**|
-| **Grok API** | ✅ | ❌ | ✅ | ✅ | ❌ | Web search, **MCP integration** |
-| **OpenAI API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image* | Web search, code interpreter, **MCP integration** |
-| **ZAI API** | ❌ | ❌ | ✅ | ✅ | ❌ | **MCP integration** |
-**Note:** Audio/video multimodal support (NEW in v0.0.30) is available through Chat Completions-based providers like OpenRouter and Qwen API. See configuration examples: [`single_openrouter_audio_understanding.yaml`](massgen/configs/basic/single/single_openrouter_audio_understanding.yaml), [`single_qwen_video_understanding.yaml`](massgen/configs/basic/single/single_qwen_video_understanding.yaml)
+| Backend | Live Search | Code Execution | File Operations | MCP Support | Multimodal Understanding | Multimodal Generation | Advanced Features |
+|---------|:-----------:|:--------------:|:---------------:|:-----------:|:------------------------:|:---------------------:|:-----------------|
+| **Azure OpenAI** (NEW in v0.0.10) | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | Code interpreter, Azure deployment management |
+| **Claude API**  | ✅ | ✅ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | Web search, code interpreter, **MCP integration** |
+| **Claude Code** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | **Native Claude Code SDK, comprehensive dev tools, MCP integration** |
+| **Gemini API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | Web search, code execution, **MCP integration**|
+| **Grok API** | ✅ | ❌ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | Web search, **MCP integration** |
+| **OpenAI API** | ✅ | ✅ | ✅ | ✅ | ✅<br/>*Image (native)*<br/>*Audio/Video/Docs (custom tools)* | ✅<br/>*via custom tools* | Web search, code interpreter, **MCP integration** |
+| **ZAI API** | ❌ | ❌ | ✅ | ✅ | ✅<br/>*via custom tools* | ✅<br/>*via custom tools* | **MCP integration** |
+**Notes:**
+- **Multimodal Understanding** (NEW in v0.1.3): Analyze images, audio, video, and documents via custom tools using OpenAI GPT-4.1 - works with any backend
+- **Multimodal Generation** (NEW in v0.1.4): Generate images, videos, audio, and documents via custom tools using OpenAI APIs - works with any backend
+- See custom tool configurations: [`understand_image.yaml`](massgen/configs/tools/custom_tools/multimodal_tools/understand_image.yaml), [`text_to_image_generation_single.yaml`](massgen/configs/tools/custom_tools/multimodal_tools/text_to_image_generation_single.yaml)
 → For detailed backend capabilities and tool integration guides, see [User Guide - Backends](https://docs.massgen.ai/en/latest/user_guide/backends.html)
@@ -1084,6 +1109,10 @@ All sessions are automatically logged with detailed information for debugging an
 To see how MassGen works in practice, check out these detailed case studies based on real session logs:
+**Featured:**
+- [**Multi-Turn Persistent Memory**](docs/source/examples/case_studies/multi-turn-persistent-memory.md) - Research-to-implementation workflow demonstrating memory system (v0.1.5) | [📹 Watch Demo](https://youtu.be/wWxxFgyw40Y)
+**All Case Studies:**
 - [**MassGen Case Studies**](docs/source/examples/case_studies/README.md)
 - [**Case Studies Documentation**](https://docs.massgen.ai/en/latest/examples/case_studies.html) - Browse case studies online
@@ -1096,49 +1125,80 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
 ⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
-### Recent Achievements (v0.1.4)
+### Recent Achievements (v0.1.6)
+**🎉 Released: October 31, 2025**
+#### Framework Interoperability
+- **AG2 Integration**: Nested chat functionality wrapped as custom tool for multi-agent lesson planning (supports streaming)
+- **LangGraph Integration**: Graph-based workflows integrated as tools for structured task execution
+- **AgentScope Integration**: AgentScope agent system wrapped for collaborative task handling
+- **OpenAI Assistants Integration**: OpenAI Assistants API integrated as tools for specialized workflows
+- **SmoLAgent Integration**: HuggingFace SmoLAgent wrapped for flexible agent orchestration
+- **Cross-Framework Collaboration**: MassGen orchestrates agents from multiple frameworks seamlessly
+- **Tool Module**: New `massgen/tool/_extraframework_agents/` module with 5 framework integrations
+- **Streaming Support**: Only AG2 currently supports streaming; other frameworks return complete results
+#### Configuration Validator
+- **ConfigValidator Class**: Comprehensive YAML validation in `massgen/config_validator.py`
+- **Memory Validation**: Detailed validation for memory configuration parameters
+- **Pre-commit Integration**: Automatic configuration validation before commits
+- **Error Messaging**: Actionable error messages with suggestions for common mistakes
+- **Test Coverage**: Comprehensive test suite in `massgen/tests/test_config_validator.py`
+#### Backend Architecture Refactoring
+- **ToolExecutionConfig**: Unified tool execution with new dataclass in `base_with_custom_tool_and_mcp.py`
+- **ResponseBackend Refactoring**: Unified tool execution flow eliminating duplicate code paths
+- **ChatCompletionsBackend Refactoring**: Consistent tool handling across Chat Completions providers
+- **ClaudeBackend Refactoring**: Unified tool execution methods for Claude backend
+- **Consistent Error Handling**: Standardized status reporting across all tool types
+#### Gemini Backend Simplification
+- **Module Removal**: Removed `gemini_mcp_manager.py` and `gemini_trackers.py` modules
+- **Code Consolidation**: Refactored to use manual tool execution via base class
+- **Streamlined Logic**: Removed continuation logic and duplicate code
+- **Codebase Reduction**: Net reduction of 1,598 lines through consolidation
+- **Formatter Updates**: Updated `_gemini_formatter.py` for simplified tool conversion
+#### Custom Tool System Enhancement
+- **ToolManager Improvements**: Enhanced category management capabilities
+- **Registration System**: Improved tool registration and validation
+- **Result Handling**: Enhanced error reporting and async execution support
+- **Schema Generation**: Improved tool schema generation for LLM consumption
-**🎉 Released: October 27, 2025**
+#### Configuration Files
+- `ag2_lesson_planner_example.yaml` - AG2 nested chat as custom tool
+- `langgraph_lesson_planner_example.yaml` - LangGraph workflows integrated
+- `agentscope_lesson_planner_example.yaml` - AgentScope agent integration
+- `openai_assistant_lesson_planner_example.yaml` - OpenAI Assistants as tools
+- `smolagent_lesson_planner_example.yaml` - SmoLAgent integration
+- `ag2_and_langgraph_lesson_planner.yaml` - Multi-framework collaboration
+- `ag2_and_openai_assistant_lesson_planner.yaml` - AG2 + OpenAI Assistants combo
+- `two_models_with_tools_example.yaml` - Multiple models with custom tools
-#### Multimodal Generation Tools
-- **Text-to-Image**: `text_to_image_generation` tool creates images from text prompts via DALL-E API
-- **Text-to-Video**: `text_to_video_generation` tool generates videos from text descriptions
-- **Text-to-Speech**: `text_to_speech_continue_generation` and `text_to_speech_transcription_generation` tools for audio generation and transcription
-- **Text-to-File**: `text_to_file_generation` tool creates documents in PDF, DOCX, XLSX, and PPTX formats
-- **Image-to-Image**: `image_to_image_generation` tool transforms existing images
+### Previous Achievements (v0.0.3 - v0.1.5)
-#### Binary File Protection
-- **Automatic Blocking**: `PathPermissionManager` now prevents text-based read tools from accessing binary files
-- **Protected File Types**: 40+ extensions including images (.jpg, .png), videos (.mp4, .avi), audio (.mp3, .wav), archives (.zip, .tar), executables (.exe, .dll), and Office documents (.pdf, .docx, .xlsx, .pptx)
-- **Intelligent Guidance**: Error messages automatically suggest appropriate specialized tools (e.g., "use understand_image tool" for .jpg files)
-- **Test Coverage**: `test_binary_file_blocking.py`
+✅ **Memory System (v0.1.5)**: Long-term semantic memory via mem0 integration with fact extraction and retrieval across sessions, short-term conversational memory for active context, automatic context compression when approaching token limits, cross-agent memory sharing with turn-aware filtering, session management for memory isolation and continuation, Qdrant vector database integration for semantic search
-#### Web Scraping Capabilities
-- **Crawl4AI Tool**: `crawl4ai_tool` enables intelligent web scraping with LLM-powered content extraction and customizable patterns
+✅ **Multimodal Generation Tools (v0.1.4)**: Create images from text via DALL-E API, generate videos from descriptions, text-to-speech with audio transcription support, document generation for PDF/DOCX/XLSX/PPTX formats, image transformation capabilities for existing images
-#### Documentation & Infrastructure
-  - **Generation Tools**: 8 multimodal generation configurations
-    - `text_to_image_generation_single.yaml` and `text_to_image_generation_multi.yaml`
-    - `text_to_video_generation_single.yaml` and `text_to_video_generation_multi.yaml`
-    - `text_to_speech_generation_single.yaml` and `text_to_speech_generation_multi.yaml`
-    - `text_to_file_generation_single.yaml` and `text_to_file_generation_multi.yaml`
-  - **Web Scraping**: `crawl4ai_example.yaml` for Crawl4AI integration
+✅ **Binary File Protection (v0.1.4)**: Automatic blocking prevents text tools from accessing 40+ binary file types including images, videos, audio, archives, and Office documents, intelligent error messages guide users to appropriate specialized tools for binary content
-### Previous Achievements (v0.0.3 - v0.1.3)
+✅ **Crawl4AI Integration (v0.1.4)**: Intelligent web scraping with LLM-powered content extraction and customizable extraction patterns for structured data retrieval from websites
-✅ **Post-Evaluation Workflow (v0.1.3)**: `PostEvaluationToolkit` class with submit tool for confirming final answers and restart tool for orchestration restart with feedback, winning agent evaluates answer before submission, universal backend support (Claude, Response API, Chat Completions), opt-in via `enable_post_evaluation_tools` parameter
+✅ **Post-Evaluation Workflow (v0.1.3)**: Winning agents evaluate their own answers before submission with submit and restart capabilities, supports answer confirmation and orchestration restart with feedback across all backends
-✅ **Multimodal Understanding Tools (v0.1.3)**: `understand_image` for PNG/JPEG analysis, `understand_audio` for WAV/MP3 transcription, `understand_video` for MP4/AVI frame extraction, `understand_file` for PDF/DOCX processing, cross-backend support via OpenAI GPT-4.1, structured JSON output, configurations: `understand_image.yaml`, `understand_audio.yaml`, `understand_video.yaml`, `understand_file.yaml`
+✅ **Multimodal Understanding Tools (v0.1.3)**: Analyze images, transcribe audio, extract video frames, and process documents (PDF/DOCX/XLSX/PPTX) with structured JSON output, works across all backends via OpenAI GPT-4.1 integration
-✅ **Docker Sudo Mode (v0.1.3)**: `use_sudo` parameter for privileged Docker execution, system-level command support in containers, enhanced security documentation, test coverage in `test_code_execution.py`
+✅ **Docker Sudo Mode (v0.1.3)**: Privileged command execution in Docker containers for system-level operations requiring elevated permissions
-✅ **Intelligent Planning Mode (v0.1.2)**: Automatic question analysis determining operation irreversibility via `_analyze_question_irreversibility()` in orchestrator, selective tool blocking with `set_planning_mode_blocked_tools()` and `is_mcp_tool_blocked()` methods, read-only MCP operations during coordination with write operations blocked, zero-configuration transparent operation, multi-workspace support, comprehensive tests in `test_intelligent_planning_mode.py`, complete guide in `docs/dev_notes/intelligent_planning_mode.md`
+✅ **Intelligent Planning Mode (v0.1.2)**: Automatic question analysis determining operation irreversibility via `_analyze_question_irreversibility()` in orchestrator, selective tool blocking with `set_planning_mode_blocked_tools()` and `is_mcp_tool_blocked()` methods, read-only MCP operations during coordination with write operations blocked, zero-configuration transparent operation, multi-workspace support
-✅ **Model Updates (v0.1.2)**: Claude 4.5 Haiku model `claude-haiku-4-5-20251001`, reorganized Claude model priorities with `claude-sonnet-4-5-20250929` default, Grok web search fix with `_add_grok_search_params()` method for proper `extra_body` parameter handling, 5 updated planning mode configurations in `configs/tools/planning/`, updated `three_agents_default.yaml` with Grok-4-fast
+✅ **Model Updates (v0.1.2)**: Claude 4.5 Haiku model `claude-haiku-4-5-20251001`, reorganized Claude model priorities with `claude-sonnet-4-5-20250929` default, Grok web search fix with `_add_grok_search_params()` method for proper `extra_body` parameter handling
 ✅ **Custom Tools System (v0.1.1)**: User-defined Python function registration using `ToolManager` class in `massgen/tool/_manager.py`, cross-backend support alongside MCP servers, builtin/MCP/custom tool categories with automatic discovery, 40+ examples in `massgen/configs/tools/custom_tools/`, voting sensitivity controls with three-tier quality system (lenient/balanced/strict), answer novelty detection preventing duplicates
-✅ **Backend Enhancements (v0.1.1)**: Gemini architecture refactoring with extracted MCP management (`gemini_mcp_manager.py`), tracking (`gemini_trackers.py`), and utilities, new capabilities registry in `massgen/backend/capabilities.py` documenting feature support across backends
+✅ **Backend Enhancements (v0.1.1)**: Gemini architecture refactoring with extracted MCP management (`gemini_mcp_manager.py`), tracking (`gemini_trackers.py`), and utilities, new capabilities registry in `massgen/backend/capabilities.py` documenting feature support across all backends
 ✅ **PyPI Package Release (v0.1.0)**: Official distribution via `pip install massgen` with simplified installation, global `massgen` command accessible from any directory, comprehensive Sphinx documentation at [docs.massgen.ai](https://docs.massgen.ai/), interactive setup wizard with use case presets and API key management, enhanced CLI with `@examples/` prefix for built-in configurations
@@ -1238,21 +1298,22 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
 We welcome community contributions to achieve these goals.
-### v0.1.5 Roadmap
+### v0.1.7 Roadmap
-Version 0.1.5 focuses on Docker integration for MCP tools and backend code refactoring:
+Version 0.1.7 focuses on agent task planning and rate limiting for improved coordination and cost management:
-#### Required Features
-- **Running MCP Tools in Docker**: Containerized execution environment for MCP tools with enhanced security and isolation
-- **Backend Code Refactoring**: Major code refactoring for improved maintainability and developer experience
+#### Planned Features
+- **Agent Task Planning System**: Enable agents to organize complex multi-step work with task plans, dependency tracking, and progress monitoring via 8 new MCP planning tools
+- **Gemini Rate Limiting System**: Multi-dimensional rate limiting (RPM, TPM, RPD) to prevent API spam and manage costs with model-specific limits and configurable thresholds
 Key technical approach:
-- **Docker Integration**: Secure execution of third-party MCP tools in isolated Docker containers with resource limits and network isolation
-- **Backend Improvements**: Enhanced code organization, modularity, and architectural improvements for better maintainability
+- **Task Planning**: MCP-based planning tools with dependency graphs, status tracking, and maximum 100 tasks per plan safety limit
+- **Rate Limiting**: Sliding window tracking, external YAML configuration, optional CLI flag, mandatory cooldown periods after startup
+- **Configuration**: Both features are optional and configurable via flags (`enable_agent_task_planning`, `--rate-limit`)
-**Target Release**: October 30, 2025 (Wednesday @ 9am PT)
+**Target Release**: November 3, 2025 (Monday @ 9am PT)
-For detailed milestones and technical specifications, see the [full v0.1.5 roadmap](ROADMAP_v0.1.5.md).
+For detailed milestones and technical specifications, see the [full v0.1.7 roadmap](ROADMAP_v0.1.7.md).
 ---

massgen 0.1.4__py3-none-any.whl → 0.1.6__py3-none-any.whl

Potentially problematic release.

massgen 0.1.4py3-none-any.whl → 0.1.6py3-none-any.whl