PyPI - ai-data-science-team - Versions diffs - 0.0.0.9010__tar.gz → 0.0.0.9012__tar.gz - Mend

Files changed (51)

{ai_data_science_team-0.0.0.9010/ai_data_science_team.egg-info → ai_data_science_team-0.0.0.9012}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.2
 Name: ai-data-science-team
-Version: 0.0.0.9010
+Version: 0.0.0.9012
 Summary: Build and run an AI-powered data science team.
 Home-page: https://github.com/business-science/ai-data-science-team
 Author: Matt Dancho
@@ -31,9 +31,16 @@ Requires-Dist: psutil
 Provides-Extra: machine-learning
 Requires-Dist: h2o; extra == "machine-learning"
 Requires-Dist: mlflow; extra == "machine-learning"
+Provides-Extra: data-science
+Requires-Dist: pytimetk; extra == "data-science"
+Requires-Dist: missingno; extra == "data-science"
+Requires-Dist: sweetviz; extra == "data-science"
 Provides-Extra: all
 Requires-Dist: h2o; extra == "all"
 Requires-Dist: mlflow; extra == "all"
+Requires-Dist: pytimetk; extra == "all"
+Requires-Dist: missingno; extra == "all"
+Requires-Dist: sweetviz; extra == "all"
 Dynamic: author
 Dynamic: author-email
 Dynamic: classifier
@@ -59,6 +66,8 @@ Dynamic: summary
   <a href="https://pypi.python.org/pypi/ai-data-science-team"><img src="https://img.shields.io/pypi/v/ai-data-science-team.svg?style=for-the-badge" alt="PyPI"></a>
   <a href="https://github.com/business-science/ai-data-science-team"><img src="https://img.shields.io/pypi/pyversions/ai-data-science-team.svg?style=for-the-badge" alt="versions"></a>
   <a href="https://github.com/business-science/ai-data-science-team/blob/main/LICENSE"><img src="https://img.shields.io/github/license/business-science/ai-data-science-team.svg?style=for-the-badge" alt="license"></a>
+  <img alt="GitHub Repo stars" src="https://img.shields.io/github/stars/business-science/ai-data-science-team?style=for-the-badge">
 </div>
@@ -93,8 +102,9 @@ The AI Data Science Team of Copilots includes Agents that specialize data cleani
     - [Apps Available Now](#apps-available-now)
       - [🔥 Agentic Applications](#-agentic-applications)
     - [Agents Available Now](#agents-available-now)
+      - [Standard Agents](#standard-agents)
       - [🔥🔥 NEW! Machine Learning Agents](#-new-machine-learning-agents)
-      - [Data Science Agents](#data-science-agents-1)
+      - [🔥 NEW! Data Science Agents](#-new-data-science-agents)
       - [Multi-Agents](#multi-agents)
     - [Agents Coming Soon](#agents-coming-soon)
   - [Disclaimer](#disclaimer)
@@ -122,7 +132,7 @@ If you're an aspiring data scientist who wants to learn how to build AI Agents a
 This project is a work in progress. New data science agents will be released soon.
-![Data Science Team](/img/ai_data_science_team.jpg)
+![AI Data Science Team](/img/ai_data_science_team.jpg)
 ### NEW: Multi-Agents
@@ -146,18 +156,25 @@ This is a top secret project I'm working on. It's a multi-agent data science app
 ### Agents Available Now
+#### Standard Agents
+1. **Data Wrangling Agent:** Merges, Joins, Preps and Wrangles data into a format that is ready for data analysis. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_wrangling_agent.ipynb)
+2. **Data Visualization Agent:** Creates visualizations to help you understand your data. Returns JSON serializable plotly visualizations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_visualization_agent.ipynb)
+3. **🔥 Data Cleaning Agent:** Performs Data Preparation steps including handling missing values, outliers, and data type conversions. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_cleaning_agent.ipynb)
+4. **Feature Engineering Agent:** Converts the prepared data into ML-ready data. Adds features to increase predictive accuracy of ML models. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/feature_engineering_agent.ipynb)
+5. **🔥 SQL Database Agent:** Connects to SQL databases to pull data into the data science environment. Creates pipelines to automate data extraction. Performs Joins, Aggregations, and other SQL Query operations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/sql_database_agent.ipynb)
+6. **🔥 Data Loader Tools Agent:** Loads data from various sources including CSV, Excel, Parquet, and Pickle files. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_loader_tools_agent.ipynb)
 #### 🔥🔥 NEW! Machine Learning Agents
 1. **🔥 H2O Machine Learning Agent:** Builds and logs 100's of high-performance machine learning models. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/ml_agents/h2o_machine_learning_agent.ipynb)
 2. **🔥 MLflow Tools Agent (MLOps):** This agent has 11+ tools for managing models, ML projects, and making production ML predictions with MLflow. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/ml_agents/mlflow_tools_agent.ipynb)
-#### Data Science Agents
+#### 🔥 NEW! Data Science Agents
+1. **🔥🔥 EDA Tools Agent:** Performs automated exploratory data analysis (EDA) with EDA Reporting, Missing Data Analysis, Correlation Analysis, and more. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/ds_agents/eda_tools_agent.ipynb)
-1. **Data Wrangling Agent:** Merges, Joins, Preps and Wrangles data into a format that is ready for data analysis. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_wrangling_agent.ipynb)
-2. **Data Visualization Agent:** Creates visualizations to help you understand your data. Returns JSON serializable plotly visualizations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_visualization_agent.ipynb)
-3. **Data Cleaning Agent:** Performs Data Preparation steps including handling missing values, outliers, and data type conversions. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_cleaning_agent.ipynb)
-4. **Feature Engineering Agent:** Converts the prepared data into ML-ready data. Adds features to increase predictive accuracy of ML models. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/feature_engineering_agent.ipynb)
-5. **SQL Database Agent:** Connects to SQL databases to pull data into the data science environment. Creates pipelines to automate data extraction. Performs Joins, Aggregations, and other SQL Query operations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/sql_database_agent.ipynb)
 #### Multi-Agents
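
Reviewer note on the PKG-INFO hunk above: the new `data-science` extra is declared with `Provides-Extra` and `Requires-Dist` lines. PKG-INFO uses an email-header style layout, so the standard library can read it. The sketch below groups requirements by extra; the helper name `requirements_by_extra` and the regular expression are illustrative assumptions and not part of the package, and the metadata text is a trimmed copy of the lines shown in the diff.

```python
import re
from collections import defaultdict
from email.parser import Parser

# Trimmed copy of the 0.0.0.9012 metadata lines shown in the hunk above.
PKG_INFO = """\
Metadata-Version: 2.2
Name: ai-data-science-team
Version: 0.0.0.9012
Provides-Extra: machine-learning
Requires-Dist: h2o; extra == "machine-learning"
Requires-Dist: mlflow; extra == "machine-learning"
Provides-Extra: data-science
Requires-Dist: pytimetk; extra == "data-science"
Requires-Dist: missingno; extra == "data-science"
Requires-Dist: sweetviz; extra == "data-science"
"""

# Hypothetical pattern: matches only the simple `extra == "name"` marker form.
EXTRA_MARKER = re.compile(r'extra\s*==\s*"([^"]+)"')


def requirements_by_extra(pkg_info_text):
    """Group Requires-Dist entries by the extra named in their marker."""
    message = Parser().parsestr(pkg_info_text)
    grouped = defaultdict(list)
    for entry in message.get_all("Requires-Dist", []):
        requirement, _, marker = entry.partition(";")
        match = EXTRA_MARKER.search(marker)
        # Entries without an extra marker are unconditional (key None).
        key = match.group(1) if match else None
        grouped[key].append(requirement.strip())
    return dict(grouped)


extras = requirements_by_extra(PKG_INFO)
```

With this metadata, `extras["data-science"]` lists the three packages that the new extra pulls in.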

{ai_data_science_team-0.0.0.9010 → ai_data_science_team-0.0.0.9012}/README.md RENAMED

@@ -12,6 +12,8 @@
   <a href="https://pypi.python.org/pypi/ai-data-science-team"><img src="https://img.shields.io/pypi/v/ai-data-science-team.svg?style=for-the-badge" alt="PyPI"></a>
   <a href="https://github.com/business-science/ai-data-science-team"><img src="https://img.shields.io/pypi/pyversions/ai-data-science-team.svg?style=for-the-badge" alt="versions"></a>
   <a href="https://github.com/business-science/ai-data-science-team/blob/main/LICENSE"><img src="https://img.shields.io/github/license/business-science/ai-data-science-team.svg?style=for-the-badge" alt="license"></a>
+  <img alt="GitHub Repo stars" src="https://img.shields.io/github/stars/business-science/ai-data-science-team?style=for-the-badge">
 </div>
@@ -46,8 +48,9 @@ The AI Data Science Team of Copilots includes Agents that specialize data cleani
     - [Apps Available Now](#apps-available-now)
       - [🔥 Agentic Applications](#-agentic-applications)
     - [Agents Available Now](#agents-available-now)
+      - [Standard Agents](#standard-agents)
       - [🔥🔥 NEW! Machine Learning Agents](#-new-machine-learning-agents)
-      - [Data Science Agents](#data-science-agents-1)
+      - [🔥 NEW! Data Science Agents](#-new-data-science-agents)
       - [Multi-Agents](#multi-agents)
     - [Agents Coming Soon](#agents-coming-soon)
   - [Disclaimer](#disclaimer)
@@ -75,7 +78,7 @@ If you're an aspiring data scientist who wants to learn how to build AI Agents a
 This project is a work in progress. New data science agents will be released soon.
-![Data Science Team](/img/ai_data_science_team.jpg)
+![AI Data Science Team](/img/ai_data_science_team.jpg)
 ### NEW: Multi-Agents
@@ -99,18 +102,25 @@ This is a top secret project I'm working on. It's a multi-agent data science app
 ### Agents Available Now
+#### Standard Agents
+1. **Data Wrangling Agent:** Merges, Joins, Preps and Wrangles data into a format that is ready for data analysis. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_wrangling_agent.ipynb)
+2. **Data Visualization Agent:** Creates visualizations to help you understand your data. Returns JSON serializable plotly visualizations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_visualization_agent.ipynb)
+3. **🔥 Data Cleaning Agent:** Performs Data Preparation steps including handling missing values, outliers, and data type conversions. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_cleaning_agent.ipynb)
+4. **Feature Engineering Agent:** Converts the prepared data into ML-ready data. Adds features to increase predictive accuracy of ML models. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/feature_engineering_agent.ipynb)
+5. **🔥 SQL Database Agent:** Connects to SQL databases to pull data into the data science environment. Creates pipelines to automate data extraction. Performs Joins, Aggregations, and other SQL Query operations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/sql_database_agent.ipynb)
+6. **🔥 Data Loader Tools Agent:** Loads data from various sources including CSV, Excel, Parquet, and Pickle files. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_loader_tools_agent.ipynb)
 #### 🔥🔥 NEW! Machine Learning Agents
 1. **🔥 H2O Machine Learning Agent:** Builds and logs 100's of high-performance machine learning models. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/ml_agents/h2o_machine_learning_agent.ipynb)
 2. **🔥 MLflow Tools Agent (MLOps):** This agent has 11+ tools for managing models, ML projects, and making production ML predictions with MLflow. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/ml_agents/mlflow_tools_agent.ipynb)
-#### Data Science Agents
+#### 🔥 NEW! Data Science Agents
+1. **🔥🔥 EDA Tools Agent:** Performs automated exploratory data analysis (EDA) with EDA Reporting, Missing Data Analysis, Correlation Analysis, and more. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/ds_agents/eda_tools_agent.ipynb)
-1. **Data Wrangling Agent:** Merges, Joins, Preps and Wrangles data into a format that is ready for data analysis. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_wrangling_agent.ipynb)
-2. **Data Visualization Agent:** Creates visualizations to help you understand your data. Returns JSON serializable plotly visualizations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_visualization_agent.ipynb)
-3. **Data Cleaning Agent:** Performs Data Preparation steps including handling missing values, outliers, and data type conversions. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/data_cleaning_agent.ipynb)
-4. **Feature Engineering Agent:** Converts the prepared data into ML-ready data. Adds features to increase predictive accuracy of ML models. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/feature_engineering_agent.ipynb)
-5. **SQL Database Agent:** Connects to SQL databases to pull data into the data science environment. Creates pipelines to automate data extraction. Performs Joins, Aggregations, and other SQL Query operations. [See Example](https://github.com/business-science/ai-data-science-team/blob/master/examples/sql_database_agent.ipynb)
 #### Multi-Agents

ai_data_science_team-0.0.0.9012/ai_data_science_team/_version.py ADDED

@@ -0,0 +1 @@
+__version__ = "0.0.0.9012"

{ai_data_science_team-0.0.0.9010 → ai_data_science_team-0.0.0.9012}/ai_data_science_team/agents/__init__.py RENAMED

@@ -3,3 +3,4 @@ from ai_data_science_team.agents.feature_engineering_agent import make_feature_e
 from ai_data_science_team.agents.data_wrangling_agent import make_data_wrangling_agent, DataWranglingAgent
 from ai_data_science_team.agents.sql_database_agent import make_sql_database_agent, SQLDatabaseAgent
 from ai_data_science_team.agents.data_visualization_agent import make_data_visualization_agent, DataVisualizationAgent
+from ai_data_science_team.agents.data_loader_tools_agent import make_data_loader_tools_agent, DataLoaderToolsAgent

ai_data_science_team-0.0.0.9012/ai_data_science_team/agents/data_loader_tools_agent.py ADDED

@@ -0,0 +1,272 @@
+from typing import Any, Optional, Annotated, Sequence, List, Dict
+import operator
+import pandas as pd
+import os
+from IPython.display import Markdown
+from langchain_core.messages import BaseMessage, AIMessage
+from langgraph.prebuilt import create_react_agent, ToolNode
+from langgraph.prebuilt.chat_agent_executor import AgentState
+from langgraph.graph import START, END, StateGraph
+from ai_data_science_team.templates import BaseAgent
+from ai_data_science_team.utils.regex import format_agent_name
+from ai_data_science_team.tools.data_loader import (
+    load_directory,
+    load_file,
+    list_directory_contents,
+    list_directory_recursive,
+    get_file_info,
+    search_files_by_pattern,
+)
+AGENT_NAME = "data_loader_tools_agent"
+tools = [
+    load_directory,
+    load_file,
+    list_directory_contents,
+    list_directory_recursive,
+    get_file_info,
+    search_files_by_pattern,
+]
+class DataLoaderToolsAgent(BaseAgent):
+    """
+    A Data Loader Agent that can interact with data loading tools and search for files in your file system.
+    Parameters:
+    ----------
+    model : langchain.llms.base.LLM
+        The language model used to generate the tool calling agent.
+    create_react_agent_kwargs : dict
+        Additional keyword arguments to pass to the create_react_agent function.
+    invoke_react_agent_kwargs : dict
+        Additional keyword arguments to pass to the invoke method of the react agent.
+    Methods:
+    --------
+    update_params(**kwargs)
+        Updates the agent's parameters and rebuilds the compiled graph.
+    ainvoke_agent(user_instructions: str=None, **kwargs)
+        Runs the agent with the given user instructions asynchronously.
+    invoke_agent(user_instructions: str=None, **kwargs)
+        Runs the agent with the given user instructions.
+    get_internal_messages(markdown: bool=False)
+        Returns the internal messages from the agent's response.
+    get_artifacts(as_dataframe: bool=False)
+        Returns the data loader artifacts from the agent's response.
+    get_ai_message(markdown: bool=False)
+        Returns the AI message from the agent's response.
+    """
+    def __init__(
+        self,
+        model: Any,
+        create_react_agent_kwargs: Optional[Dict]={},
+        invoke_react_agent_kwargs: Optional[Dict]={},
+    ):
+        self._params = {
+            "model": model,
+            "create_react_agent_kwargs": create_react_agent_kwargs,
+            "invoke_react_agent_kwargs": invoke_react_agent_kwargs,
+        }
+        self._compiled_graph = self._make_compiled_graph()
+        self.response = None
+    def _make_compiled_graph(self):
+        """
+        Creates the compiled graph for the agent.
+        """
+        self.response = None
+        return make_data_loader_tools_agent(**self._params)
+    def update_params(self, **kwargs):
+        """
+        Updates the agent's parameters and rebuilds the compiled graph.
+        """
+        for k, v in kwargs.items():
+            self._params[k] = v
+        self._compiled_graph = self._make_compiled_graph()
+    async def ainvoke_agent(
+        self,
+        user_instructions: str=None,
+        **kwargs
+    ):
+        """
+        Runs the agent asynchronously with the given user instructions.
+        Parameters:
+        ----------
+        user_instructions : str, optional
+            The user instructions to pass to the agent.
+        kwargs : dict, optional
+            Additional keyword arguments to pass to the agent's ainvoke method.
+        """
+        response = await self._compiled_graph.ainvoke(
+            {
+                "user_instructions": user_instructions,
+            },
+            **kwargs
+        )
+        self.response = response
+        return None
+    def invoke_agent(
+        self,
+        user_instructions: str=None,
+        **kwargs
+    ):
+        """
+        Runs the agent with the given user instructions.
+        Parameters:
+        ----------
+        user_instructions : str, optional
+            The user instructions to pass to the agent.
+        kwargs : dict, optional
+            Additional keyword arguments to pass to the agent's invoke method.
+        """
+        response = self._compiled_graph.invoke(
+            {
+                "user_instructions": user_instructions,
+            },
+            **kwargs
+        )
+        self.response = response
+        return None
+    def get_internal_messages(self, markdown: bool=False):
+        """
+        Returns the internal messages from the agent's response.
+        """
+        pretty_print = "\n\n".join([f"### {msg.type.upper()}\n\nID: {msg.id}\n\nContent:\n\n{msg.content}" for msg in self.response["internal_messages"]])
+        if markdown:
+            return Markdown(pretty_print)
+        else:
+            return self.response["internal_messages"]
+    def get_artifacts(self, as_dataframe: bool=False):
+        """
+        Returns the data loader artifacts from the agent's response.
+        """
+        if as_dataframe:
+            return pd.DataFrame(self.response["data_loader_artifacts"])
+        else:
+            return self.response["data_loader_artifacts"]
+    def get_ai_message(self, markdown: bool=False):
+        """
+        Returns the AI message from the agent's response.
+        """
+        if markdown:
+            return Markdown(self.response["messages"][0].content)
+        else:
+            return self.response["messages"][0].content
+def make_data_loader_tools_agent(
+    model: Any,
+    create_react_agent_kwargs: Optional[Dict]={},
+    invoke_react_agent_kwargs: Optional[Dict]={},
+):
+    """
+    Creates a Data Loader Agent that can interact with data loading tools.
+    Parameters:
+    ----------
+    model : langchain.llms.base.LLM
+        The language model used to generate the tool calling agent.
+    create_react_agent_kwargs : dict
+        Additional keyword arguments to pass to the create_react_agent function.
+    invoke_react_agent_kwargs : dict
+        Additional keyword arguments to pass to the invoke method of the react agent.
+    Returns:
+    --------
+    app : langgraph.graph.state.CompiledStateGraph
+        An agent that can interact with data loading tools.
+    """
+    class GraphState(AgentState):
+        internal_messages: Annotated[Sequence[BaseMessage], operator.add]
+        user_instructions: str
+        data_loader_artifacts: dict
+    def data_loader_agent(state):
+        print(format_agent_name(AGENT_NAME))
+        print("    ")
+        print("    * RUN REACT TOOL-CALLING AGENT")
+        tool_node = ToolNode(
+            tools=tools
+        )
+        data_loader_agent = create_react_agent(
+            model,
+            tools=tool_node,
+            state_schema=GraphState,
+            **create_react_agent_kwargs,
+        )
+        response = data_loader_agent.invoke(
+            {
+                "messages": [("user", state["user_instructions"])],
+            },
+            invoke_react_agent_kwargs,
+        )
+        print("    * POST-PROCESS RESULTS")
+        internal_messages = response['messages']
+        # Ensure there is at least one AI message
+        if not internal_messages:
+            return {
+                "internal_messages": [],
+                "data_loader_artifacts": None,
+            }
+        # Get the last AI message
+        last_ai_message = AIMessage(internal_messages[-1].content, role = AGENT_NAME)
+        # Get the last tool artifact safely
+        last_tool_artifact = None
+        if len(internal_messages) > 1:
+            last_message = internal_messages[-2]  # Get second-to-last message
+            if hasattr(last_message, "artifact"):  # Check if it has an "artifact"
+                last_tool_artifact = last_message.artifact
+            elif isinstance(last_message, dict) and "artifact" in last_message:
+                last_tool_artifact = last_message["artifact"]
+        return {
+            "messages": [last_ai_message],
+            "internal_messages": internal_messages,
+            "data_loader_artifacts": last_tool_artifact,
+        }
+    workflow = StateGraph(GraphState)
+    workflow.add_node("data_loader_agent", data_loader_agent)
+    workflow.add_edge(START, "data_loader_agent")
+    workflow.add_edge("data_loader_agent", END)
+    app = workflow.compile()
+    return app
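
Reviewer note on the post-processing step in `data_loader_agent` above: the node takes the final message as the answer and looks one position back for a tool artifact. The sketch below reproduces only that positional logic, using stand-in classes. `StubMessage`, `StubToolMessage`, and `split_answer_and_artifact` are hypothetical names for illustration, not LangChain or package APIs.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class StubMessage:
    """Hypothetical stand-in for a chat message that carries only text."""
    content: str


@dataclass
class StubToolMessage(StubMessage):
    """Hypothetical stand-in for a tool result that may carry an artifact."""
    artifact: Optional[Any] = None


def split_answer_and_artifact(messages):
    """Return (final_text, artifact) using the same positions as the agent node."""
    if not messages:
        return None, None
    final_text = messages[-1].content
    artifact = None
    if len(messages) > 1:
        previous = messages[-2]  # second-to-last message, as in the diff
        if hasattr(previous, "artifact"):
            artifact = previous.artifact
        elif isinstance(previous, dict) and "artifact" in previous:
            artifact = previous["artifact"]
    return final_text, artifact


# A made-up transcript: user request, tool result, final answer.
transcript = [
    StubMessage("Load data/example.csv"),
    StubToolMessage("Loaded 2 rows", artifact={"rows": 2}),
    StubMessage("The file has 2 rows."),
]
answer, artifact = split_answer_and_artifact(transcript)
```

One consequence of this positional lookup: if the final answer is not directly preceded by a tool result, the artifact comes back as `None` even when an earlier tool call produced one.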

ai_data_science_team-0.0.0.9012/ai_data_science_team/ds_agents/__init__.py ADDED

@@ -0,0 +1 @@
+from ai_data_science_team.ds_agents.eda_tools_agent import EDAToolsAgent, make_eda_tools_agent

ai_data_science_team-0.0.0.9012/ai_data_science_team/ds_agents/eda_tools_agent.py ADDED

@@ -0,0 +1,245 @@
+from typing import Any, Optional, Annotated, Sequence, List, Dict, Tuple
+import operator
+import pandas as pd
+import os
+from io import StringIO, BytesIO
+import base64
+import matplotlib.pyplot as plt
+from IPython.display import Markdown
+from langchain_core.messages import BaseMessage, AIMessage
+from langgraph.prebuilt import create_react_agent, ToolNode
+from langgraph.prebuilt.chat_agent_executor import AgentState
+from langgraph.graph import START, END, StateGraph
+from ai_data_science_team.templates import BaseAgent
+from ai_data_science_team.utils.regex import format_agent_name
+from ai_data_science_team.tools.eda import (
+    describe_dataset,
+    visualize_missing,
+    correlation_funnel,
+    generate_sweetviz_report,
+)
+AGENT_NAME = "exploratory_data_analyst_agent"
+# Updated tool list for EDA
+EDA_TOOLS = [
+    describe_dataset,
+    visualize_missing,
+    correlation_funnel,
+    generate_sweetviz_report,
+]
+class EDAToolsAgent(BaseAgent):
+    """
+    An Exploratory Data Analysis Tools Agent that interacts with EDA tools to generate summary statistics,
+    missing data visualizations, correlation funnels, EDA reports, etc.
+    Parameters:
+    ----------
+    model : langchain.llms.base.LLM
+        The language model for generating the tool-calling agent.
+    create_react_agent_kwargs : dict
+        Additional kwargs for create_react_agent.
+    invoke_react_agent_kwargs : dict
+        Additional kwargs for agent invocation.
+    """
+    def __init__(
+        self,
+        model: Any,
+        create_react_agent_kwargs: Optional[Dict] = {},
+        invoke_react_agent_kwargs: Optional[Dict] = {},
+    ):
+        self._params = {
+            "model": model,
+            "create_react_agent_kwargs": create_react_agent_kwargs,
+            "invoke_react_agent_kwargs": invoke_react_agent_kwargs,
+        }
+        self._compiled_graph = self._make_compiled_graph()
+        self.response = None
+    def _make_compiled_graph(self):
+        """
+        Creates the compiled state graph for the EDA agent.
+        """
+        self.response = None
+        return make_eda_tools_agent(**self._params)
+    def update_params(self, **kwargs):
+        """
+        Updates the agent's parameters and rebuilds the compiled graph.
+        """
+        for k, v in kwargs.items():
+            self._params[k] = v
+        self._compiled_graph = self._make_compiled_graph()
+    async def ainvoke_agent(
+        self,
+        user_instructions: str = None,
+        data_raw: pd.DataFrame = None,
+        **kwargs
+    ):
+        """
+        Asynchronously runs the agent with user instructions and data.
+        Parameters:
+        ----------
+        user_instructions : str, optional
+            The instructions for the agent.
+        data_raw : pd.DataFrame, optional
+            The input data as a DataFrame.
+        """
+        response = await self._compiled_graph.ainvoke(
+            {
+                "user_instructions": user_instructions,
+                "data_raw": data_raw.to_dict() if data_raw is not None else None,
+            },
+            **kwargs
+        )
+        self.response = response
+        return None
+    def invoke_agent(
+        self,
+        user_instructions: str = None,
+        data_raw: pd.DataFrame = None,
+        **kwargs
+    ):
+        """
+        Synchronously runs the agent with user instructions and data.
+        Parameters:
+        ----------
+        user_instructions : str, optional
+            The instructions for the agent.
+        data_raw : pd.DataFrame, optional
+            The input data as a DataFrame.
+        """
+        response = self._compiled_graph.invoke(
+            {
+                "user_instructions": user_instructions,
+                "data_raw": data_raw.to_dict() if data_raw is not None else None,
+            },
+            **kwargs
+        )
+        self.response = response
+        return None
+    def get_internal_messages(self, markdown: bool = False):
+        """
+        Returns internal messages from the agent response.
+        """
+        pretty_print = "\n\n".join(
+            [f"### {msg.type.upper()}\n\nID: {msg.id}\n\nContent:\n\n{msg.content}"
+             for msg in self.response["internal_messages"]]
+        )
+        if markdown:
+            return Markdown(pretty_print)
+        else:
+            return self.response["internal_messages"]
+    def get_artifacts(self, as_dataframe: bool = False):
+        """
+        Returns the EDA artifacts from the agent response.
+        """
+        if as_dataframe:
+            return pd.DataFrame(self.response["eda_artifacts"])
+        else:
+            return self.response["eda_artifacts"]
+    def get_ai_message(self, markdown: bool = False):
+        """
+        Returns the AI message from the agent response.
+        """
+        if markdown:
+            return Markdown(self.response["messages"][0].content)
+        else:
+            return self.response["messages"][0].content
+def make_eda_tools_agent(
+    model: Any,
+    create_react_agent_kwargs: Optional[Dict] = {},
+    invoke_react_agent_kwargs: Optional[Dict] = {},
+):
+    """
+    Creates an Exploratory Data Analyst Agent that can interact with EDA tools.
+    Parameters:
+    ----------
+    model : Any
+        The language model used for tool-calling.
+    create_react_agent_kwargs : dict
+        Additional kwargs for create_react_agent.
+    invoke_react_agent_kwargs : dict
+        Additional kwargs for agent invocation.
+    Returns:
+    -------
+    app : langgraph.graph.CompiledStateGraph
+        The compiled state graph for the EDA agent.
+    """
+    class GraphState(AgentState):
+        internal_messages: Annotated[Sequence[BaseMessage], operator.add]
+        user_instructions: str
+        data_raw: dict
+        eda_artifacts: dict
+    def exploratory_agent(state):
+        print(format_agent_name(AGENT_NAME))
+        print("    * RUN REACT TOOL-CALLING AGENT FOR EDA")
+        tool_node = ToolNode(
+            tools=EDA_TOOLS
+        )
+        eda_agent = create_react_agent(
+            model,
+            tools=tool_node,
+            state_schema=GraphState,
+            **create_react_agent_kwargs,
+        )
+        response = eda_agent.invoke(
+            {
+                "messages": [("user", state["user_instructions"])],
+                "data_raw": state["data_raw"],
+            },
+            invoke_react_agent_kwargs,
+        )
+        print("    * POST-PROCESSING EDA RESULTS")
+        internal_messages = response['messages']
+        if not internal_messages:
+            return {"internal_messages": [], "eda_artifacts": None}
+        last_ai_message = AIMessage(internal_messages[-1].content, role=AGENT_NAME)
+        last_tool_artifact = None
+        if len(internal_messages) > 1:
+            last_message = internal_messages[-2]
+            if hasattr(last_message, "artifact"):
+                last_tool_artifact = last_message.artifact
+            elif isinstance(last_message, dict) and "artifact" in last_message:
+                last_tool_artifact = last_message["artifact"]
+        return {
+            "messages": [last_ai_message],
+            "internal_messages": internal_messages,
+            "eda_artifacts": last_tool_artifact,
+        }
+    workflow = StateGraph(GraphState)
+    workflow.add_node("exploratory_agent", exploratory_agent)
+    workflow.add_edge(START, "exploratory_agent")
+    workflow.add_edge("exploratory_agent", END)
+    app = workflow.compile()
+    return app
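
Reviewer note on `GraphState` above: `internal_messages` is declared as `Annotated[Sequence[BaseMessage], operator.add]`, which attaches `operator.add` to the field as a reducer, so updates to that key are combined with the existing value instead of replacing it. The sketch below is a simplified illustration of that idea using only the standard library. `SketchState` and `merge_update` are hypothetical names, and this is not how LangGraph implements state merging internally.

```python
import operator
from typing import Annotated, List, TypedDict, get_type_hints


class SketchState(TypedDict):
    # The second Annotated argument plays the role of a reducer.
    internal_messages: Annotated[List[str], operator.add]
    user_instructions: str


def merge_update(state, update):
    """Apply an update: reduce annotated keys, overwrite plain ones."""
    hints = get_type_hints(SketchState, include_extras=True)
    merged = dict(state)
    for key, value in update.items():
        metadata = getattr(hints[key], "__metadata__", ())
        reducer = metadata[0] if metadata else None
        if reducer is not None and key in merged:
            merged[key] = reducer(merged[key], value)  # e.g. list + list
        else:
            merged[key] = value
    return merged


state = {"internal_messages": ["first"], "user_instructions": "describe"}
state = merge_update(
    state,
    {"internal_messages": ["second"], "user_instructions": "plot"},
)
```

After the update, the annotated key has accumulated both entries while the plain key holds only the latest value.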
