vectara-agentic 0.2.0__tar.gz → 0.2.2__tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.


Files changed (30)
  1. {vectara_agentic-0.2.0/vectara_agentic.egg-info → vectara_agentic-0.2.2}/PKG-INFO +42 -15
  2. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/README.md +29 -2
  3. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/requirements.txt +12 -12
  4. vectara_agentic-0.2.2/tests/endpoint.py +47 -0
  5. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/tests/test_agent.py +39 -6
  6. vectara_agentic-0.2.2/tests/test_private_llm.py +67 -0
  7. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/tests/test_tools.py +32 -5
  8. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/_prompts.py +3 -1
  9. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/_version.py +1 -1
  10. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/agent.py +123 -32
  11. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/agent_config.py +9 -0
  12. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/tools.py +28 -7
  13. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/tools_catalog.py +1 -1
  14. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/types.py +1 -0
  15. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/utils.py +4 -0
  16. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2/vectara_agentic.egg-info}/PKG-INFO +42 -15
  17. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic.egg-info/SOURCES.txt +2 -0
  18. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic.egg-info/requires.txt +12 -12
  19. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/LICENSE +0 -0
  20. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/MANIFEST.in +0 -0
  21. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/setup.cfg +0 -0
  22. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/setup.py +0 -0
  23. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/tests/__init__.py +0 -0
  24. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/__init__.py +0 -0
  25. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/_callback.py +0 -0
  26. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/_observability.py +0 -0
  27. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/agent_endpoint.py +0 -0
  28. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/db_tools.py +0 -0
  29. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic.egg-info/dependency_links.txt +0 -0
  30. {vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic.egg-info/top_level.txt +0 -0
{vectara_agentic-0.2.0/vectara_agentic.egg-info → vectara_agentic-0.2.2}/PKG-INFO

@@ -1,6 +1,6 @@
  Metadata-Version: 2.2
  Name: vectara_agentic
- Version: 0.2.0
+ Version: 0.2.2
  Summary: A Python package for creating AI Assistants and AI Agents with Vectara
  Home-page: https://github.com/vectara/py-vectara-agentic
  Author: Ofer Mendelevitch
@@ -16,19 +16,19 @@ Classifier: Topic :: Software Development :: Libraries :: Python Modules
  Requires-Python: >=3.10
  Description-Content-Type: text/markdown
  License-File: LICENSE
- Requires-Dist: llama-index==0.12.11
- Requires-Dist: llama-index-indices-managed-vectara==0.4.0
+ Requires-Dist: llama-index==0.12.22
+ Requires-Dist: llama-index-indices-managed-vectara==0.4.1
  Requires-Dist: llama-index-agent-llm-compiler==0.3.0
  Requires-Dist: llama-index-agent-lats==0.3.0
- Requires-Dist: llama-index-agent-openai==0.4.3
- Requires-Dist: llama-index-llms-openai==0.3.18
- Requires-Dist: llama-index-llms-anthropic==0.6.4
+ Requires-Dist: llama-index-agent-openai==0.4.6
+ Requires-Dist: llama-index-llms-openai==0.3.25
+ Requires-Dist: llama-index-llms-anthropic==0.6.7
  Requires-Dist: llama-index-llms-together==0.3.1
  Requires-Dist: llama-index-llms-groq==0.3.1
- Requires-Dist: llama-index-llms-fireworks==0.3.1
+ Requires-Dist: llama-index-llms-fireworks==0.3.2
  Requires-Dist: llama-index-llms-cohere==0.4.0
- Requires-Dist: llama-index-llms-gemini==0.4.4
- Requires-Dist: llama-index-llms-bedrock==0.3.3
+ Requires-Dist: llama-index-llms-gemini==0.4.11
+ Requires-Dist: llama-index-llms-bedrock==0.3.4
  Requires-Dist: llama-index-tools-yahoo-finance==0.3.0
  Requires-Dist: llama-index-tools-arxiv==0.3.0
  Requires-Dist: llama-index-tools-database==0.3.0
@@ -38,8 +38,8 @@ Requires-Dist: llama-index-tools-neo4j==0.3.0
  Requires-Dist: llama-index-graph-stores-kuzu==0.6.0
  Requires-Dist: llama-index-tools-slack==0.3.0
  Requires-Dist: llama-index-tools-exa==0.3.0
- Requires-Dist: tavily-python==0.5.0
- Requires-Dist: exa-py==1.8.5
+ Requires-Dist: tavily-python==0.5.1
+ Requires-Dist: exa-py==1.8.9
  Requires-Dist: yahoo-finance==1.4.0
  Requires-Dist: openinference-instrumentation-llama-index==3.1.4
  Requires-Dist: opentelemetry-proto==1.26.0
@@ -50,8 +50,8 @@ Requires-Dist: tokenizers>=0.20
  Requires-Dist: pydantic==2.10.3
  Requires-Dist: retrying==1.3.4
  Requires-Dist: python-dotenv==1.0.1
- Requires-Dist: tiktoken==0.8.0
- Requires-Dist: dill>=0.3.7
+ Requires-Dist: tiktoken==0.9.0
+ Requires-Dist: cloudpickle>=3.1.1
  Requires-Dist: httpx==0.27.2
  Dynamic: author
  Dynamic: author-email
@@ -135,7 +135,7 @@ from vectara_agentic.tools import VectaraToolFactory
  vec_factory = VectaraToolFactory(
      vectara_api_key=os.environ['VECTARA_API_KEY'],
      vectara_customer_id=os.environ['VECTARA_CUSTOMER_ID'],
-     vectara_corpus_id=os.environ['VECTARA_CORPUS_ID']
+     vectara_corpus_key=os.environ['VECTARA_CORPUS_KEY']
  )
  ```
  
@@ -315,6 +315,10 @@ def mult_func(x, y):
  mult_tool = ToolsFactory().create_tool(mult_func)
  ```
  
+ Note: When you define your own Python functions as tools, implement them at the top module level,
+ and not as nested functions. Nested functions are not supported if you use serialization
+ (dumps/loads or from_dict/to_dict).
+
  ## 🛠️ Configuration
  
  ## Configuring Vectara-agentic
@@ -352,10 +356,31 @@ If any of these are not provided, `AgentConfig` first tries to read the values f
  
  When creating a `VectaraToolFactory`, you can pass in a `vectara_api_key`, `vectara_customer_id`, and `vectara_corpus_id` to the factory.
  
- If not passed in, it will be taken from the environment variables (`VECTARA_API_KEY`, `VECTARA_CUSTOMER_ID` and `VECTARA_CORPUS_ID`). Note that `VECTARA_CORPUS_ID` can be a single ID or a comma-separated list of IDs (if you want to query multiple corpora).
+ If not passed in, they will be taken from the environment variables (`VECTARA_API_KEY` and `VECTARA_CORPUS_KEY`). Note that `VECTARA_CORPUS_KEY` can be a single key or a comma-separated list of keys (if you want to query multiple corpora).
  
  These values will be used as credentials when creating Vectara tools - in `create_rag_tool()` and `create_search_tool()`.
  
+ ## Setting up a privately hosted LLM
+
+ If you want to set up vectara-agentic to use your own self-hosted LLM endpoint, follow the example below:
+
+ ```python
+ config = AgentConfig(
+     agent_type=AgentType.REACT,
+     main_llm_provider=ModelProvider.PRIVATE,
+     main_llm_model_name="meta-llama/Meta-Llama-3.1-8B-Instruct",
+     private_llm_api_base="http://vllm-server.company.com/v1",
+     private_llm_api_key="TEST_API_KEY",
+ )
+ agent = Agent(agent_config=config, tools=tools, topic=topic,
+               custom_instructions=custom_instructions)
+ ```
+
+ In this case we specify the main LLM provider to be privately hosted, with Llama-3.1-8B as the model.
+ - `ModelProvider.PRIVATE` specifies a privately hosted LLM.
+ - `private_llm_api_base` specifies the API endpoint to use, and `private_llm_api_key`
+   specifies the private API key required to use this service.
+
  ## ℹ️ Additional Information
  
  ### About Custom Instructions for your Agent
@@ -376,6 +401,8 @@ The `Agent` class defines a few helpful methods to help you understand the inter
  
  The `Agent` class supports serialization. Use `dumps()` to serialize and `loads()` to read back from a serialized stream.
  
+ Note: due to cloudpickle limitations, if a tool contains Python `weakref` objects, serialization won't work and an exception will be raised.
+
  ### Observability
  
  vectara-agentic supports observability via the existing integration of LlamaIndex and Arize Phoenix.
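
The serialization notes above (module-level tool functions, cloudpickle replacing dill) can be checked with a short round trip. A minimal sketch, assuming valid LLM credentials in the environment; the `mult` helper is illustrative and is defined at module level, as `dumps()`/`loads()` requires:

```python
# Minimal serialization round-trip sketch for vectara-agentic 0.2.2.
# Assumes valid LLM credentials in the environment; `mult` is illustrative.
from vectara_agentic.agent import Agent
from vectara_agentic.tools import ToolsFactory

def mult(x: float, y: float) -> float:
    """Multiply two numbers."""
    return x * y

agent = Agent(
    tools=[ToolsFactory().create_tool(mult)],
    topic="math",
    custom_instructions="You are a helpful math assistant.",
)
restored = agent.loads(agent.dumps())  # cloudpickle-backed round trip
assert restored == agent
```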
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/README.md

@@ -68,7 +68,7 @@ from vectara_agentic.tools import VectaraToolFactory
  vec_factory = VectaraToolFactory(
      vectara_api_key=os.environ['VECTARA_API_KEY'],
      vectara_customer_id=os.environ['VECTARA_CUSTOMER_ID'],
-     vectara_corpus_id=os.environ['VECTARA_CORPUS_ID']
+     vectara_corpus_key=os.environ['VECTARA_CORPUS_KEY']
  )
  ```
  
@@ -248,6 +248,10 @@ def mult_func(x, y):
  mult_tool = ToolsFactory().create_tool(mult_func)
  ```
  
+ Note: When you define your own Python functions as tools, implement them at the top module level,
+ and not as nested functions. Nested functions are not supported if you use serialization
+ (dumps/loads or from_dict/to_dict).
+
  ## 🛠️ Configuration
  
  ## Configuring Vectara-agentic
@@ -285,10 +289,31 @@ If any of these are not provided, `AgentConfig` first tries to read the values f
  
  When creating a `VectaraToolFactory`, you can pass in a `vectara_api_key`, `vectara_customer_id`, and `vectara_corpus_id` to the factory.
  
- If not passed in, it will be taken from the environment variables (`VECTARA_API_KEY`, `VECTARA_CUSTOMER_ID` and `VECTARA_CORPUS_ID`). Note that `VECTARA_CORPUS_ID` can be a single ID or a comma-separated list of IDs (if you want to query multiple corpora).
+ If not passed in, they will be taken from the environment variables (`VECTARA_API_KEY` and `VECTARA_CORPUS_KEY`). Note that `VECTARA_CORPUS_KEY` can be a single key or a comma-separated list of keys (if you want to query multiple corpora).
  
  These values will be used as credentials when creating Vectara tools - in `create_rag_tool()` and `create_search_tool()`.
  
+ ## Setting up a privately hosted LLM
+
+ If you want to set up vectara-agentic to use your own self-hosted LLM endpoint, follow the example below:
+
+ ```python
+ config = AgentConfig(
+     agent_type=AgentType.REACT,
+     main_llm_provider=ModelProvider.PRIVATE,
+     main_llm_model_name="meta-llama/Meta-Llama-3.1-8B-Instruct",
+     private_llm_api_base="http://vllm-server.company.com/v1",
+     private_llm_api_key="TEST_API_KEY",
+ )
+ agent = Agent(agent_config=config, tools=tools, topic=topic,
+               custom_instructions=custom_instructions)
+ ```
+
+ In this case we specify the main LLM provider to be privately hosted, with Llama-3.1-8B as the model.
+ - `ModelProvider.PRIVATE` specifies a privately hosted LLM.
+ - `private_llm_api_base` specifies the API endpoint to use, and `private_llm_api_key`
+   specifies the private API key required to use this service.
+
  ## ℹ️ Additional Information
  
  ### About Custom Instructions for your Agent
@@ -309,6 +334,8 @@ The `Agent` class defines a few helpful methods to help you understand the inter
  
  The `Agent` class supports serialization. Use `dumps()` to serialize and `loads()` to read back from a serialized stream.
  
+ Note: due to cloudpickle limitations, if a tool contains Python `weakref` objects, serialization won't work and an exception will be raised.
+
  ### Observability
  
  vectara-agentic supports observability via the existing integration of LlamaIndex and Arize Phoenix.
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/requirements.txt

@@ -1,16 +1,16 @@
- llama-index==0.12.11
- llama-index-indices-managed-vectara==0.4.0
+ llama-index==0.12.22
+ llama-index-indices-managed-vectara==0.4.1
  llama-index-agent-llm-compiler==0.3.0
  llama-index-agent-lats==0.3.0
- llama-index-agent-openai==0.4.3
- llama-index-llms-openai==0.3.18
- llama-index-llms-anthropic==0.6.4
+ llama-index-agent-openai==0.4.6
+ llama-index-llms-openai==0.3.25
+ llama-index-llms-anthropic==0.6.7
  llama-index-llms-together==0.3.1
  llama-index-llms-groq==0.3.1
- llama-index-llms-fireworks==0.3.1
+ llama-index-llms-fireworks==0.3.2
  llama-index-llms-cohere==0.4.0
- llama-index-llms-gemini==0.4.4
- llama-index-llms-bedrock==0.3.3
+ llama-index-llms-gemini==0.4.11
+ llama-index-llms-bedrock==0.3.4
  llama-index-tools-yahoo-finance==0.3.0
  llama-index-tools-arxiv==0.3.0
  llama-index-tools-database==0.3.0
@@ -20,8 +20,8 @@ llama-index-tools-neo4j==0.3.0
  llama-index-graph-stores-kuzu==0.6.0
  llama-index-tools-slack==0.3.0
  llama-index-tools-exa==0.3.0
- tavily-python==0.5.0
- exa-py==1.8.5
+ tavily-python==0.5.1
+ exa-py==1.8.9
  yahoo-finance==1.4.0
  openinference-instrumentation-llama-index==3.1.4
  opentelemetry-proto==1.26.0
@@ -32,6 +32,6 @@ tokenizers>=0.20
  pydantic==2.10.3
  retrying==1.3.4
  python-dotenv==1.0.1
- tiktoken==0.8.0
- dill>=0.3.7
+ tiktoken==0.9.0
+ cloudpickle>=3.1.1
  httpx==0.27.2
vectara_agentic-0.2.2/tests/endpoint.py (new file)

@@ -0,0 +1,47 @@
+ from openai import OpenAI
+ from flask import Flask, request, jsonify
+ import logging
+ from functools import wraps
+
+ app = Flask(__name__)
+ app.config['TESTING'] = True
+
+ log = logging.getLogger('werkzeug')
+ log.setLevel(logging.ERROR)
+
+ # Set your OpenAI API key (ensure you've set this in your environment)
+
+ EXPECTED_API_KEY = "TEST_API_KEY"
+
+ def require_api_key(f):
+     @wraps(f)
+     def decorated_function(*args, **kwargs):
+         api_key = request.headers.get("Authorization").split("Bearer ")[-1]
+         if not api_key or api_key != EXPECTED_API_KEY:
+             return jsonify({"error": "Unauthorized"}), 401
+         return f(*args, **kwargs)
+     return decorated_function
+
+ @app.before_request
+ def log_request_info():
+     app.logger.info("Request received: %s %s", request.method, request.path)
+
+ @app.route("/v1/chat/completions", methods=["POST"])
+ @require_api_key
+ def chat_completions():
+     app.logger.info("Received request on /v1/chat/completions")
+     data = request.get_json()
+     if not data:
+         return jsonify({"error": "Invalid JSON payload"}), 400
+
+     client = OpenAI()
+     try:
+         completion = client.chat.completions.create(**data)
+         return jsonify(completion.model_dump()), 200
+     except Exception as e:
+         return jsonify({"error": str(e)}), 400
+
+
+ if __name__ == "__main__":
+     # Run on port 5000 by default; adjust as needed.
+     app.run(debug=True, port=5000, use_reloader=False)
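
Once the stub above is running, any OpenAI-compatible client can exercise it directly. A hedged smoke-test sketch, assuming the Flask app is listening on port 5000 and `OPENAI_API_KEY` is set in its environment so the proxy can forward the request:

```python
# Smoke-test sketch for the stub endpoint above (not part of the package).
# Assumes the Flask app is already running on 127.0.0.1:5000 and that
# OPENAI_API_KEY is set in its environment so it can reach OpenAI.
import requests

resp = requests.post(
    "http://127.0.0.1:5000/v1/chat/completions",
    headers={"Authorization": "Bearer TEST_API_KEY"},  # key expected by the stub
    json={
        "model": "gpt-4o",
        "messages": [{"role": "user", "content": "What is 5 times 10?"}],
    },
    timeout=30,
)
print(resp.status_code, resp.json())
```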
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/tests/test_agent.py

@@ -6,6 +6,9 @@ from vectara_agentic.agent_config import AgentConfig
  from vectara_agentic.types import ModelProvider, ObserverType
  from vectara_agentic.tools import ToolsFactory
  
+ def mult(x, y):
+     return x * y
+
  class TestAgentPackage(unittest.TestCase):
      def test_get_prompt(self):
          prompt_template = "{chat_topic} on {today} with {custom_instructions}"
@@ -21,9 +24,6 @@ class TestAgentPackage(unittest.TestCase):
          )
  
      def test_agent_init(self):
-         def mult(x, y):
-             return x * y
-
          tools = [ToolsFactory().create_tool(mult)]
          topic = "AI"
          custom_instructions = "Always do as your mother tells you!"
@@ -41,9 +41,6 @@ class TestAgentPackage(unittest.TestCase):
          )
  
      def test_agent_config(self):
-         def mult(x, y):
-             return x * y
-
          tools = [ToolsFactory().create_tool(mult)]
          topic = "AI topic"
          instructions = "Always do as your father tells you, if your mother agrees!"
@@ -77,6 +74,21 @@ class TestAgentPackage(unittest.TestCase):
              "50",
          )
  
+     def test_multiturn(self):
+         tools = [ToolsFactory().create_tool(mult)]
+         topic = "AI topic"
+         instructions = "Always do as your father tells you, if your mother agrees!"
+         agent = Agent(
+             tools=tools,
+             topic=topic,
+             custom_instructions=instructions,
+         )
+
+         agent.chat("What is 5 times 10. Only give the answer, nothing else")
+         agent.chat("what is 3 times 7. Only give the answer, nothing else")
+         res = agent.chat("multiply the results of the last two questions. Output only the answer.")
+         self.assertEqual(res.response, "1050")
+
      def test_from_corpus(self):
          agent = Agent.from_corpus(
              tool_name="RAG Tool",
@@ -99,8 +111,29 @@ class TestAgentPackage(unittest.TestCase):
          )
  
          agent_reloaded = agent.loads(agent.dumps())
+         agent_reloaded_again = agent_reloaded.loads(agent_reloaded.dumps())
+
          self.assertIsInstance(agent_reloaded, Agent)
          self.assertEqual(agent, agent_reloaded)
+         self.assertEqual(agent.agent_type, agent_reloaded.agent_type)
+
+         self.assertIsInstance(agent_reloaded, Agent)
+         self.assertEqual(agent, agent_reloaded_again)
+         self.assertEqual(agent.agent_type, agent_reloaded_again.agent_type)
+
+     def test_chat_history(self):
+         tools = [ToolsFactory().create_tool(mult)]
+         topic = "AI topic"
+         instructions = "Always do as your father tells you, if your mother agrees!"
+         agent = Agent(
+             tools=tools,
+             topic=topic,
+             custom_instructions=instructions,
+             chat_history=[("What is 5 times 10", "50"), ("What is 3 times 7", "21")]
+         )
+
+         res = agent.chat("multiply the results of the last two questions. Output only the answer.")
+         self.assertEqual(res.response, "1050")
  
  
  if __name__ == "__main__":
vectara_agentic-0.2.2/tests/test_private_llm.py (new file)

@@ -0,0 +1,67 @@
+ import os
+ import unittest
+ import subprocess
+ import time
+ import requests
+ import signal
+
+ from vectara_agentic.agent import Agent, AgentType
+ from vectara_agentic.agent_config import AgentConfig
+ from vectara_agentic.types import ModelProvider
+ from vectara_agentic.tools import ToolsFactory
+
+ class TestPrivateLLM(unittest.TestCase):
+
+     @classmethod
+     def setUp(cls):
+         # Start the Flask server as a subprocess
+         cls.flask_process = subprocess.Popen(
+             ['flask', 'run', '--port=5000'],
+             env={**os.environ, 'FLASK_APP': 'tests.endpoint:app', 'FLASK_ENV': 'development'},
+             stdout=None, stderr=None,
+         )
+         # Wait for the server to start
+         timeout = 10
+         url = 'http://127.0.0.1:5000/'
+         for _ in range(timeout):
+             try:
+                 requests.get(url)
+                 return
+             except requests.ConnectionError:
+                 time.sleep(1)
+         raise RuntimeError(f"Failed to start Flask server at {url}")
+
+     @classmethod
+     def tearDown(cls):
+         # Terminate the Flask server
+         cls.flask_process.send_signal(signal.SIGINT)
+         cls.flask_process.wait()
+
+     def test_endpoint(self):
+         def mult(x, y):
+             return x * y
+
+         tools = [ToolsFactory().create_tool(mult)]
+         topic = "calculator"
+         custom_instructions = "you are an agent specializing in math, assisting a user."
+         config = AgentConfig(
+             agent_type=AgentType.REACT,
+             main_llm_provider=ModelProvider.PRIVATE,
+             main_llm_model_name="gpt-4o",
+             private_llm_api_base="http://127.0.0.1:5000/v1",
+             private_llm_api_key="TEST_API_KEY",
+         )
+         agent = Agent(agent_config=config, tools=tools, topic=topic,
+                       custom_instructions=custom_instructions)
+
+         # To run this test, you must have OPENAI_API_KEY in your environment
+         self.assertEqual(
+             agent.chat(
+                 "What is 5 times 10. Only give the answer, nothing else"
+             ).response.replace("$", "\\$"),
+             "50",
+         )
+
+
+ if __name__ == "__main__":
+     unittest.main()
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/tests/test_tools.py

@@ -1,8 +1,11 @@
  import unittest
  
+ from pydantic import Field, BaseModel
+
  from vectara_agentic.tools import VectaraTool, VectaraToolFactory, ToolsFactory, ToolType
  from vectara_agentic.agent import Agent
- from pydantic import Field, BaseModel
+ from vectara_agentic.agent_config import AgentConfig
+
  from llama_index.core.tools import FunctionTool
  
  
@@ -60,9 +63,6 @@ class TestToolsPackage(unittest.TestCase):
          vectara_corpus_key = "vectara-docs_1"
          vectara_api_key = "zqt_UXrBcnI2UXINZkrv4g1tQPhzj02vfdtqYJIDiA"
  
-         class QueryToolArgs(BaseModel):
-             query: str = Field(description="The user query")
-
          agent = Agent.from_corpus(
              vectara_corpus_key=vectara_corpus_key,
              vectara_api_key=vectara_api_key,
@@ -72,7 +72,34 @@ class TestToolsPackage(unittest.TestCase):
              vectara_summarizer="mockingbird-1.0-2024-07-16"
          )
  
-         self.assertIn("Vectara is an end-to-end platform", agent.chat("What is Vectara?"))
+         self.assertIn("Vectara is an end-to-end platform", str(agent.chat("What is Vectara?")))
+
+     def test_class_method_as_tool(self):
+         class TestClass:
+             def __init__(self):
+                 pass
+
+             def mult(self, x, y):
+                 return x * y
+
+         test_class = TestClass()
+         tools = [ToolsFactory().create_tool(test_class.mult)]
+         topic = "AI topic"
+         instructions = "Always do as your father tells you, if your mother agrees!"
+         config = AgentConfig()
+         agent = Agent(
+             tools=tools,
+             topic=topic,
+             custom_instructions=instructions,
+             agent_config=config
+         )
+
+         self.assertEqual(
+             agent.chat(
+                 "What is 5 times 10. Only give the answer, nothing else"
+             ).response.replace("$", "\\$"),
+             "50",
+         )
  
  
  if __name__ == "__main__":
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/_prompts.py

@@ -5,7 +5,9 @@ This file contains the prompt templates for the different types of agents.
  # General (shared) instructions
  GENERAL_INSTRUCTIONS = """
  - Use tools as your main source of information, do not respond without using a tool. Do not respond based on pre-trained knowledge.
- - Always call the 'get_current_date' tool to ensure you know the exact date when a user asks a question.
+ - Before responding to a user query that requires knowledge of the current date, call the 'get_current_date' tool to get the current date.
+   Never rely on previous knowledge of the current date.
+   Example queries that require the current date: "What is the revenue of Apple last october?" or "What was the stock price 5 days ago?".
  - When using a tool with arguments, simplify the query as much as possible if you use the tool with arguments.
    For example, if the original query is "revenue for apple in 2021", you can use the tool with a query "revenue" with arguments year=2021 and company=apple.
  - If a tool responds with "I do not have enough information", try one of the following:
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/_version.py

@@ -1,4 +1,4 @@
  """
  Define the version of the package.
  """
- __version__ = "0.2.0"
+ __version__ = "0.2.2"
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/agent.py

@@ -1,8 +1,9 @@
  """
  This module contains the Agent class for handling different types of agents and their interactions.
  """
- from typing import List, Callable, Optional, Dict, Any
+ from typing import List, Callable, Optional, Dict, Any, Union, Tuple
  import os
+ import re
  from datetime import date
  import time
  import json
@@ -10,12 +11,17 @@ import logging
  import traceback
  import asyncio
  
- import dill
+ from collections import Counter
+
+ import cloudpickle as pickle
+
  from dotenv import load_dotenv
  
  from retrying import retry
  from pydantic import Field, create_model
  
+ from llama_index.core.memory import ChatMemoryBuffer
+ from llama_index.core.llms import ChatMessage, MessageRole
  from llama_index.core.tools import FunctionTool
  from llama_index.core.agent import ReActAgent
  from llama_index.core.agent.react.formatter import ReActChatFormatter
@@ -24,7 +30,7 @@ from llama_index.agent.lats import LATSAgentWorker
  from llama_index.core.callbacks import CallbackManager, TokenCountingHandler
  from llama_index.core.callbacks.base_handler import BaseCallbackHandler
  from llama_index.agent.openai import OpenAIAgent
- from llama_index.core.memory import ChatMemoryBuffer
+
  
  from .types import AgentType, AgentStatusType, LLMRole, ToolType, AgentResponse, AgentStreamingResponse
  from .utils import get_llm, get_tokenizer_for_model
@@ -35,6 +41,21 @@ from .tools import VectaraToolFactory, VectaraTool, ToolsFactory
  from .tools_catalog import get_current_date
  from .agent_config import AgentConfig
  
+ class IgnoreUnpickleableAttributeFilter(logging.Filter):
+     '''
+     Filter to ignore log messages that contain certain strings
+     '''
+     def filter(self, record):
+         msgs_to_ignore = [
+             "Removing unpickleable private attribute _chunking_tokenizer_fn",
+             "Removing unpickleable private attribute _split_fns",
+             "Removing unpickleable private attribute _sub_sentence_split_fns",
+         ]
+         return all(msg not in record.getMessage() for msg in msgs_to_ignore)
+
+
+ logging.getLogger().addFilter(IgnoreUnpickleableAttributeFilter())
+
  logger = logging.getLogger("opentelemetry.exporter.otlp.proto.http.trace_exporter")
  logger.setLevel(logging.CRITICAL)
  
@@ -81,6 +102,34 @@ def _retry_if_exception(exception):
      return isinstance(exception, (TimeoutError))
  
  
+ def get_field_type(field_schema: dict) -> Any:
+     """
+     Convert a JSON schema field definition to a Python type.
+     Handles 'type' and 'anyOf' cases.
+     """
+     json_type_to_python = {
+         "string": str,
+         "integer": int,
+         "boolean": bool,
+         "array": list,
+         "object": dict,
+         "number": float,
+     }
+     if "anyOf" in field_schema:
+         types = []
+         for option in field_schema["anyOf"]:
+             # If the option has a type, convert it; otherwise, use Any.
+             if "type" in option:
+                 types.append(json_type_to_python.get(option["type"], Any))
+             else:
+                 types.append(Any)
+         # Return a Union of the types. For example, Union[str, int]
+         return Union[tuple(types)]
+     elif "type" in field_schema:
+         return json_type_to_python.get(field_schema["type"], Any)
+     else:
+         return Any
+
  class Agent:
      """
      Agent class for handling different types of agents and their interactions.
@@ -96,6 +145,8 @@ class Agent:
          agent_progress_callback: Optional[Callable[[AgentStatusType, str], None]] = None,
          query_logging_callback: Optional[Callable[[str, str], None]] = None,
          agent_config: Optional[AgentConfig] = None,
+         chat_history: Optional[list[Tuple[str, str]]] = None,
+         validate_tools: bool = False,
      ) -> None:
          """
          Initialize the agent with the specified type, tools, topic, and system message.
@@ -111,21 +162,52 @@
              query_logging_callback (Callable): A callback function the code calls upon completion of a query
              agent_config (AgentConfig, optional): The configuration of the agent.
                  Defaults to AgentConfig(), which reads from environment variables.
+             chat_history (Tuple[str, str], optional): A list of user/agent chat pairs to initialize the agent memory.
+             validate_tools (bool, optional): Whether to validate tool inconsistency with instructions.
+                 Defaults to False.
          """
          self.agent_config = agent_config or AgentConfig()
          self.agent_type = self.agent_config.agent_type
-         self.tools = tools + [ToolsFactory().create_tool(get_current_date)]
+         self.tools = tools
+         if not any(tool.metadata.name == 'get_current_date' for tool in self.tools):
+             self.tools += [ToolsFactory().create_tool(get_current_date)]
          self.llm = get_llm(LLMRole.MAIN, config=self.agent_config)
          self._custom_instructions = custom_instructions
          self._topic = topic
          self.agent_progress_callback = agent_progress_callback if agent_progress_callback else update_func
          self.query_logging_callback = query_logging_callback
  
+         # Validate tools
+         # Check for:
+         # 1. multiple copies of the same tool
+         # 2. Instructions for using tools that do not exist
+         tool_names = [tool.metadata.name for tool in self.tools]
+         duplicates = [tool for tool, count in Counter(tool_names).items() if count > 1]
+         if duplicates:
+             raise ValueError(f"Duplicate tools detected: {', '.join(duplicates)}")
+
+         if validate_tools:
+             prompt = f'''
+             Given the following instructions, and a list of tool names,
+             Please identify tools mentioned in the instructions that do not exist in the list.
+             Instructions:
+             {self._custom_instructions}
+             Tool names: {', '.join(tool_names)}
+             Your response should include a comma separated list of tool names that do not exist in the list.
+             Your response should be an empty string if all tools mentioned in the instructions are in the list.
+             '''
+             llm = get_llm(LLMRole.MAIN, config=self.agent_config)
+             bad_tools = llm.complete(prompt).text.split(", ")
+             if bad_tools:
+                 raise ValueError(f"The Agent custom instructions mention these invalid tools: {', '.join(bad_tools)}")
+
+         # Create token counters for the main and tool LLMs
          main_tok = get_tokenizer_for_model(role=LLMRole.MAIN)
          self.main_token_counter = TokenCountingHandler(tokenizer=main_tok) if main_tok else None
          tool_tok = get_tokenizer_for_model(role=LLMRole.TOOL)
          self.tool_token_counter = TokenCountingHandler(tokenizer=tool_tok) if tool_tok else None
  
+         # Setup callback manager
          callbacks: list[BaseCallbackHandler] = [AgentCallbackHandler(self.agent_progress_callback)]
          if self.main_token_counter:
              callbacks.append(self.main_token_counter)
@@ -135,7 +217,14 @@ class Agent:
          self.llm.callback_manager = callback_manager
          self.verbose = verbose
  
-         self.memory = ChatMemoryBuffer.from_defaults(token_limit=128000)
+         if chat_history:
+             msg_history = []
+             for text_pairs in chat_history:
+                 msg_history.append(ChatMessage.from_str(content=text_pairs[0], role=MessageRole.USER))
+                 msg_history.append(ChatMessage.from_str(content=text_pairs[1], role=MessageRole.ASSISTANT))
+             self.memory = ChatMemoryBuffer.from_defaults(token_limit=128000, chat_history=msg_history)
+         else:
+             self.memory = ChatMemoryBuffer.from_defaults(token_limit=128000)
          if self.agent_type == AgentType.REACT:
              prompt = _get_prompt(REACT_PROMPT_TEMPLATE, topic, custom_instructions)
              self.agent = ReActAgent.from_tools(
@@ -219,7 +308,10 @@
  
          # Compare tools
          if self.tools != other.tools:
-             print(f"Comparison failed: tools differ. (self.tools: {self.tools}, other.tools: {other.tools})")
+             print(
+                 "Comparison failed: tools differ."
+                 f"(self.tools: {[t.metadata.name for t in self.tools]}, "
+                 f"other.tools: {[t.metadata.name for t in other.tools]})")
              return False
  
          # Compare topic
@@ -263,6 +355,7 @@
          agent_progress_callback: Optional[Callable[[AgentStatusType, str], None]] = None,
          query_logging_callback: Optional[Callable[[str, str], None]] = None,
          agent_config: AgentConfig = AgentConfig(),
+         chat_history: Optional[list[Tuple[str, str]]] = None,
      ) -> "Agent":
          """
          Create an agent from tools, agent type, and language model.
@@ -277,6 +370,7 @@
              update_func (Callable): old name for agent_progress_callback. Will be deprecated in future.
              query_logging_callback (Callable): A callback function the code calls upon completion of a query
              agent_config (AgentConfig, optional): The configuration of the agent.
+             chat_history (Tuple[str, str], optional): A list of user/agent chat pairs to initialize the agent memory.
  
          Returns:
              Agent: An instance of the Agent class.
@@ -285,7 +379,8 @@
              tools=tools, topic=topic, custom_instructions=custom_instructions,
              verbose=verbose, agent_progress_callback=agent_progress_callback,
              query_logging_callback=query_logging_callback,
-             update_func=update_func, agent_config=agent_config
+             update_func=update_func, agent_config=agent_config,
+             chat_history=chat_history,
          )
  
      @classmethod
@@ -322,7 +417,7 @@
          vectara_temperature: Optional[float] = None,
          vectara_frequency_penalty: Optional[float] = None,
          vectara_presence_penalty: Optional[float] = None,
-         vectara_save_history: bool = False,
+         vectara_save_history: bool = True,
      ) -> "Agent":
          """
          Create an agent from a single Vectara corpus
@@ -383,6 +478,10 @@
          )  # type: ignore
          query_args = create_model("QueryArgs", **field_definitions)  # type: ignore
  
+         # tool name must be valid Python function name
+         if tool_name:
+             tool_name = re.sub(r"[^A-Za-z0-9_]", "_", tool_name)
+
          vectara_tool = vec_factory.create_rag_tool(
              tool_name=tool_name or f"vectara_{vectara_corpus_key}",
              tool_description=f"""
@@ -414,6 +513,7 @@
              presence_penalty=vectara_presence_penalty,
              save_history=vectara_save_history,
              include_citations=True,
+             verbose=verbose,
          )
  
          assistant_instructions = f"""
@@ -587,8 +687,8 @@
                  "tool_type": tool.metadata.tool_type.value,
                  "name": tool.metadata.name,
                  "description": tool.metadata.description,
-                 "fn": dill.dumps(tool.fn).decode("latin-1") if tool.fn else None,  # Serialize fn
-                 "async_fn": dill.dumps(tool.async_fn).decode("latin-1")
+                 "fn": pickle.dumps(tool.fn).decode("latin-1") if tool.fn else None,  # Serialize fn
+                 "async_fn": pickle.dumps(tool.async_fn).decode("latin-1")
                  if tool.async_fn
                  else None,  # Serialize async_fn
                  "fn_schema": tool.metadata.fn_schema.model_json_schema()
@@ -599,7 +699,7 @@
  
          return {
              "agent_type": self.agent_type.value,
-             "memory": dill.dumps(self.agent.memory).decode("latin-1"),
+             "memory": pickle.dumps(self.agent.memory).decode("latin-1"),
              "tools": tool_info,
              "topic": self._topic,
              "custom_instructions": self._custom_instructions,
@@ -613,39 +713,30 @@
          agent_config = AgentConfig.from_dict(data["agent_config"])
          tools = []
  
-         json_type_to_python = {
-             "string": str,
-             "integer": int,
-             "boolean": bool,
-             "array": list,
-             "object": dict,
-             "number": float,
-         }
-
          for tool_data in data["tools"]:
              # Recreate the dynamic model using the schema info
              if tool_data.get("fn_schema"):
                  field_definitions = {}
                  for field, values in tool_data["fn_schema"]["properties"].items():
+                     # Instead of checking for 'type', use the helper:
+                     field_type = get_field_type(values)
+                     # If there's a default value, include it.
                      if "default" in values:
                          field_definitions[field] = (
-                             json_type_to_python.get(values["type"], values["type"]),
-                             Field(
-                                 description=values["description"],
-                                 default=values["default"],
-                             ),
-                         )  # type: ignore
+                             field_type,
+                             Field(description=values.get("description", ""), default=values["default"]),
+                         )
                      else:
                          field_definitions[field] = (
-                             json_type_to_python.get(values["type"], values["type"]),
-                             Field(description=values["description"]),
-                         )  # type: ignore
+                             field_type,
+                             Field(description=values.get("description", "")),
+                         )
                  query_args_model = create_model("QueryArgs", **field_definitions)  # type: ignore
              else:
                  query_args_model = create_model("QueryArgs")
  
-             fn = dill.loads(tool_data["fn"].encode("latin-1")) if tool_data["fn"] else None
-             async_fn = dill.loads(tool_data["async_fn"].encode("latin-1")) if tool_data["async_fn"] else None
+             fn = pickle.loads(tool_data["fn"].encode("latin-1")) if tool_data["fn"] else None
+             async_fn = pickle.loads(tool_data["async_fn"].encode("latin-1")) if tool_data["async_fn"] else None
  
              tool = VectaraTool.from_defaults(
                  name=tool_data["name"],
@@ -664,7 +755,7 @@
              custom_instructions=data["custom_instructions"],
              verbose=data["verbose"],
          )
-         memory = dill.loads(data["memory"].encode("latin-1")) if data.get("memory") else None
+         memory = pickle.loads(data["memory"].encode("latin-1")) if data.get("memory") else None
          if memory:
              agent.agent.memory = memory
          return agent
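
The new `chat_history` argument above seeds the agent's memory with prior user/assistant turns before the first call, mirroring `test_chat_history` in the test suite. A minimal sketch, assuming valid LLM credentials and a module-level tool function:

```python
# Sketch of the new chat_history argument; mirrors tests/test_agent.py.
# Assumes valid LLM credentials; `mult` is an illustrative module-level tool.
from vectara_agentic.agent import Agent
from vectara_agentic.tools import ToolsFactory

def mult(x: float, y: float) -> float:
    return x * y

agent = Agent(
    tools=[ToolsFactory().create_tool(mult)],
    topic="math",
    custom_instructions="You are a helpful math assistant.",
    chat_history=[("What is 5 times 10?", "50"), ("What is 3 times 7?", "21")],
)
res = agent.chat("Multiply the results of the last two questions. Output only the answer.")
print(res.response)  # expected: 1050
```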
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/agent_config.py

@@ -44,6 +44,15 @@ class AgentConfig:
          default_factory=lambda: os.getenv("VECTARA_AGENTIC_TOOL_MODEL_NAME", "")
      )
  
+     # Params for Private LLM endpoint if used
+     private_llm_api_base: str = field(
+         default_factory=lambda: os.getenv("VECTARA_AGENTIC_PRIVATE_LLM_API_BASE",
+                                           "http://private-endpoint.company.com:5000/v1")
+     )
+     private_llm_api_key: str = field(
+         default_factory=lambda: os.getenv("VECTARA_AGENTIC_PRIVATE_LLM_API_KEY", "<private-api-key>")
+     )
+
      # Observer
      observer: ObserverType = field(
          default_factory=lambda: ObserverType(
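
The two new `AgentConfig` fields can also be supplied through the environment variables named in their defaults. A sketch, reusing the placeholder endpoint and key that appear elsewhere in this release:

```python
# Sketch: supplying the private-LLM settings via the environment variables
# read by the new AgentConfig fields. The URL and key are placeholders.
import os

os.environ["VECTARA_AGENTIC_PRIVATE_LLM_API_BASE"] = "http://vllm-server.company.com/v1"
os.environ["VECTARA_AGENTIC_PRIVATE_LLM_API_KEY"] = "TEST_API_KEY"

from vectara_agentic.agent_config import AgentConfig
from vectara_agentic.types import ModelProvider

config = AgentConfig(main_llm_provider=ModelProvider.PRIVATE,
                     main_llm_model_name="meta-llama/Meta-Llama-3.1-8B-Instruct")
print(config.private_llm_api_base, config.private_llm_api_key)
```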
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/tools.py

@@ -17,7 +17,6 @@ from llama_index.indices.managed.vectara import VectaraIndex
  from llama_index.core.utilities.sql_wrapper import SQLDatabase
  from llama_index.core.tools.types import ToolMetadata, ToolOutput
  
-
  from .types import ToolType
  from .tools_catalog import ToolsCatalog, get_bad_topics
  from .db_tools import DBLoadSampleData, DBLoadUniqueValues, DBLoadData
@@ -100,9 +99,14 @@
          fn_schema: Optional[Type[BaseModel]] = None,
          async_fn: Optional[AsyncCallable] = None,
          tool_metadata: Optional[ToolMetadata] = None,
+         callback: Optional[Callable[[Any], Any]] = None,
+         async_callback: Optional[AsyncCallable] = None,
          tool_type: ToolType = ToolType.QUERY,
      ) -> "VectaraTool":
-         tool = FunctionTool.from_defaults(fn, name, description, return_direct, fn_schema, async_fn, tool_metadata)
+         tool = FunctionTool.from_defaults(
+             fn, name, description, return_direct, fn_schema, async_fn, tool_metadata,
+             callback, async_callback
+         )
          vectara_tool = cls(tool_type=tool_type, fn=tool.fn, metadata=tool.metadata, async_fn=tool.async_fn)
          return vectara_tool
  
@@ -110,6 +114,9 @@
          if self.metadata.tool_type != other.metadata.tool_type:
              return False
  
+         if self.metadata.name != other.metadata.name or self.metadata.description != other.metadata.description:
+             return False
+
          # Check if fn_schema is an instance of a BaseModel or a class itself (metaclass)
          self_schema_dict = self.metadata.fn_schema.model_fields
          other_schema_dict = other.metadata.fn_schema.model_fields
@@ -252,7 +259,10 @@ def _build_filter_string(kwargs: Dict[str, Any], tool_args_type: Dict[str, dict]
              filter_parts.append(f"{prefix}.{key}='{val_str}'")
  
      filter_str = " AND ".join(filter_parts)
-     return f"({fixed_filter}) AND ({filter_str})" if fixed_filter else filter_str
+     if fixed_filter and filter_str:
+         return f"({fixed_filter}) AND ({filter_str})"
+     else:
+         return fixed_filter or filter_str
  
  class VectaraToolFactory:
      """
@@ -294,8 +304,10 @@
          mmr_diversity_bias: float = 0.2,
          udf_expression: str = None,
          rerank_chain: List[Dict] = None,
-         save_history: bool = False,
+         save_history: bool = True,
          verbose: bool = False,
+         vectara_base_url: str = "https://api.vectara.io",
+         vectara_verify_ssl: bool = True,
      ) -> VectaraTool:
          """
          Creates a Vectara search/retrieval tool
@@ -327,6 +339,8 @@
                  If using slingshot/multilingual_reranker_v1, it must be first in the list.
              save_history (bool, optional): Whether to save the query in history.
              verbose (bool, optional): Whether to print verbose output.
+             vectara_base_url (str, optional): The base URL for the Vectara API.
+             vectara_verify_ssl (bool, optional): Whether to verify SSL certificates for the Vectara API.
  
          Returns:
              VectaraTool: A VectaraTool object.
@@ -336,6 +350,8 @@
              vectara_api_key=self.vectara_api_key,
              vectara_corpus_key=self.vectara_corpus_key,
              x_source_str="vectara-agentic",
+             base_url=vectara_base_url,
+             verify_ssl=vectara_verify_ssl,
          )
  
          # Dynamically generate the search function
@@ -426,7 +442,7 @@
  
          # Create the tool function signature string
          fields = []
-         for name, field in tool_args_schema.__fields__.items():
+         for name, field in tool_args_schema.model_fields.items():
              annotation = field.annotation
             type_name = annotation.__name__ if hasattr(annotation, '__name__') else str(annotation)
              fields.append(f"{name}: {type_name}")
@@ -476,6 +492,8 @@
          save_history: bool = False,
          fcs_threshold: float = 0.0,
          verbose: bool = False,
+         vectara_base_url: str = "https://api.vectara.io",
+         vectara_verify_ssl: bool = True,
      ) -> VectaraTool:
          """
          Creates a RAG (Retrieve and Generate) tool.
@@ -526,6 +544,8 @@
              fcs_threshold (float, optional): A threshold for factual consistency.
                  If set above 0, the tool notifies the calling agent that it "cannot respond" if FCS is too low.
              verbose (bool, optional): Whether to print verbose output.
+             vectara_base_url (str, optional): The base URL for the Vectara API.
+             vectara_verify_ssl (bool, optional): Whether to verify SSL certificates for the Vectara API.
  
          Returns:
              VectaraTool: A VectaraTool object.
@@ -535,6 +555,8 @@
              vectara_api_key=self.vectara_api_key,
              vectara_corpus_key=self.vectara_corpus_key,
              x_source_str="vectara-agentic",
+             base_url=vectara_base_url,
+             verify_ssl=vectara_verify_ssl,
          )
  
          # Dynamically generate the RAG function
@@ -677,7 +699,7 @@
  
          # Create the tool function signature string
          fields = []
-         for name, field in tool_args_schema.__fields__.items():
+         for name, field in tool_args_schema.model_fields.items():
              annotation = field.annotation
              type_name = annotation.__name__ if hasattr(annotation, '__name__') else str(annotation)
              fields.append(f"{name}: {type_name}")
@@ -743,7 +765,6 @@ class ToolsFactory:
  
          # Get the tool spec class or function from the module
          tool_spec = getattr(module, tool_spec_name)
-
          func_type = LI_packages[tool_package_name]
          tools = tool_spec(**kwargs).to_tool_list()
          vtools = []
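
The new `vectara_base_url` and `vectara_verify_ssl` arguments make both factory methods usable against a proxied or self-hosted Vectara endpoint. A hedged sketch of `create_rag_tool` with these options; the API key, corpus key, proxy URL, and args schema are all placeholders, and the argument names follow the factory's documented usage:

```python
# Sketch of the new base-URL / SSL options on create_rag_tool.
# All credentials and the proxy URL below are placeholders.
from pydantic import BaseModel, Field
from vectara_agentic.tools import VectaraToolFactory

class QueryArgs(BaseModel):
    query: str = Field(description="The user query")

vec_factory = VectaraToolFactory(
    vectara_api_key="YOUR_API_KEY",
    vectara_corpus_key="your-corpus_1",
)
rag_tool = vec_factory.create_rag_tool(
    tool_name="ask_docs",
    tool_description="Answer questions about the docs corpus.",
    tool_args_schema=QueryArgs,
    vectara_base_url="https://vectara-proxy.company.com",  # placeholder proxy
    vectara_verify_ssl=False,  # e.g., an internal endpoint with a self-signed cert
)
```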
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/tools_catalog.py

@@ -25,7 +25,7 @@ get_headers = {
  
  def get_current_date() -> str:
      """
-     Returns: the current date.
+     Returns: the current date (when called) as a string.
      """
      return date.today().strftime("%A, %B %d, %Y")
  
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/types.py

@@ -33,6 +33,7 @@ class ModelProvider(Enum):
      COHERE = "COHERE"
      GEMINI = "GEMINI"
      BEDROCK = "BEDROCK"
+     PRIVATE = "PRIVATE"
  
  
  class AgentStatusType(Enum):
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic/utils.py

@@ -112,6 +112,10 @@ def get_llm(
      elif model_provider == ModelProvider.COHERE:
          from llama_index.llms.cohere import Cohere
          llm = Cohere(model=model_name, temperature=0)
+     elif model_provider == ModelProvider.PRIVATE:
+         from llama_index.llms.openai_like import OpenAILike
+         llm = OpenAILike(model=model_name, temperature=0, is_function_calling_model=True, is_chat_model=True,
+                          api_base=config.private_llm_api_base, api_key=config.private_llm_api_key)
      else:
          raise ValueError(f"Unknown LLM provider: {model_provider}")
      return llm
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2/vectara_agentic.egg-info}/PKG-INFO

@@ -1,6 +1,6 @@
  Metadata-Version: 2.2
  Name: vectara_agentic
- Version: 0.2.0
+ Version: 0.2.2
  Summary: A Python package for creating AI Assistants and AI Agents with Vectara
  Home-page: https://github.com/vectara/py-vectara-agentic
  Author: Ofer Mendelevitch
@@ -16,19 +16,19 @@ Classifier: Topic :: Software Development :: Libraries :: Python Modules
  Requires-Python: >=3.10
  Description-Content-Type: text/markdown
  License-File: LICENSE
- Requires-Dist: llama-index==0.12.11
- Requires-Dist: llama-index-indices-managed-vectara==0.4.0
+ Requires-Dist: llama-index==0.12.22
+ Requires-Dist: llama-index-indices-managed-vectara==0.4.1
  Requires-Dist: llama-index-agent-llm-compiler==0.3.0
  Requires-Dist: llama-index-agent-lats==0.3.0
- Requires-Dist: llama-index-agent-openai==0.4.3
- Requires-Dist: llama-index-llms-openai==0.3.18
- Requires-Dist: llama-index-llms-anthropic==0.6.4
+ Requires-Dist: llama-index-agent-openai==0.4.6
+ Requires-Dist: llama-index-llms-openai==0.3.25
+ Requires-Dist: llama-index-llms-anthropic==0.6.7
  Requires-Dist: llama-index-llms-together==0.3.1
  Requires-Dist: llama-index-llms-groq==0.3.1
- Requires-Dist: llama-index-llms-fireworks==0.3.1
+ Requires-Dist: llama-index-llms-fireworks==0.3.2
  Requires-Dist: llama-index-llms-cohere==0.4.0
- Requires-Dist: llama-index-llms-gemini==0.4.4
- Requires-Dist: llama-index-llms-bedrock==0.3.3
+ Requires-Dist: llama-index-llms-gemini==0.4.11
+ Requires-Dist: llama-index-llms-bedrock==0.3.4
  Requires-Dist: llama-index-tools-yahoo-finance==0.3.0
  Requires-Dist: llama-index-tools-arxiv==0.3.0
  Requires-Dist: llama-index-tools-database==0.3.0
@@ -38,8 +38,8 @@ Requires-Dist: llama-index-tools-neo4j==0.3.0
  Requires-Dist: llama-index-graph-stores-kuzu==0.6.0
  Requires-Dist: llama-index-tools-slack==0.3.0
  Requires-Dist: llama-index-tools-exa==0.3.0
- Requires-Dist: tavily-python==0.5.0
- Requires-Dist: exa-py==1.8.5
+ Requires-Dist: tavily-python==0.5.1
+ Requires-Dist: exa-py==1.8.9
  Requires-Dist: yahoo-finance==1.4.0
  Requires-Dist: openinference-instrumentation-llama-index==3.1.4
  Requires-Dist: opentelemetry-proto==1.26.0
@@ -50,8 +50,8 @@ Requires-Dist: tokenizers>=0.20
  Requires-Dist: pydantic==2.10.3
  Requires-Dist: retrying==1.3.4
  Requires-Dist: python-dotenv==1.0.1
- Requires-Dist: tiktoken==0.8.0
- Requires-Dist: dill>=0.3.7
+ Requires-Dist: tiktoken==0.9.0
+ Requires-Dist: cloudpickle>=3.1.1
  Requires-Dist: httpx==0.27.2
  Dynamic: author
  Dynamic: author-email
@@ -135,7 +135,7 @@ from vectara_agentic.tools import VectaraToolFactory
  vec_factory = VectaraToolFactory(
      vectara_api_key=os.environ['VECTARA_API_KEY'],
      vectara_customer_id=os.environ['VECTARA_CUSTOMER_ID'],
-     vectara_corpus_id=os.environ['VECTARA_CORPUS_ID']
+     vectara_corpus_key=os.environ['VECTARA_CORPUS_KEY']
  )
  ```
  
@@ -315,6 +315,10 @@ def mult_func(x, y):
  mult_tool = ToolsFactory().create_tool(mult_func)
  ```
  
+ Note: When you define your own Python functions as tools, implement them at the top module level,
+ and not as nested functions. Nested functions are not supported if you use serialization
+ (dumps/loads or from_dict/to_dict).
+
  ## 🛠️ Configuration
  
  ## Configuring Vectara-agentic
@@ -352,10 +356,31 @@ If any of these are not provided, `AgentConfig` first tries to read the values f
  
  When creating a `VectaraToolFactory`, you can pass in a `vectara_api_key`, `vectara_customer_id`, and `vectara_corpus_id` to the factory.
  
- If not passed in, it will be taken from the environment variables (`VECTARA_API_KEY`, `VECTARA_CUSTOMER_ID` and `VECTARA_CORPUS_ID`). Note that `VECTARA_CORPUS_ID` can be a single ID or a comma-separated list of IDs (if you want to query multiple corpora).
+ If not passed in, they will be taken from the environment variables (`VECTARA_API_KEY` and `VECTARA_CORPUS_KEY`). Note that `VECTARA_CORPUS_KEY` can be a single key or a comma-separated list of keys (if you want to query multiple corpora).
  
  These values will be used as credentials when creating Vectara tools - in `create_rag_tool()` and `create_search_tool()`.
  
+ ## Setting up a privately hosted LLM
+
+ If you want to set up vectara-agentic to use your own self-hosted LLM endpoint, follow the example below:
+
+ ```python
+ config = AgentConfig(
+     agent_type=AgentType.REACT,
+     main_llm_provider=ModelProvider.PRIVATE,
+     main_llm_model_name="meta-llama/Meta-Llama-3.1-8B-Instruct",
+     private_llm_api_base="http://vllm-server.company.com/v1",
+     private_llm_api_key="TEST_API_KEY",
+ )
+ agent = Agent(agent_config=config, tools=tools, topic=topic,
+               custom_instructions=custom_instructions)
+ ```
+
+ In this case we specify the main LLM provider to be privately hosted, with Llama-3.1-8B as the model.
+ - `ModelProvider.PRIVATE` specifies a privately hosted LLM.
+ - `private_llm_api_base` specifies the API endpoint to use, and `private_llm_api_key`
+   specifies the private API key required to use this service.
+
  ## ℹ️ Additional Information
  
  ### About Custom Instructions for your Agent
@@ -376,6 +401,8 @@ The `Agent` class defines a few helpful methods to help you understand the inter
  
  The `Agent` class supports serialization. Use `dumps()` to serialize and `loads()` to read back from a serialized stream.
  
+ Note: due to cloudpickle limitations, if a tool contains Python `weakref` objects, serialization won't work and an exception will be raised.
+
  ### Observability
  
  vectara-agentic supports observability via the existing integration of LlamaIndex and Arize Phoenix.
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic.egg-info/SOURCES.txt

@@ -4,7 +4,9 @@ README.md
  requirements.txt
  setup.py
  tests/__init__.py
+ tests/endpoint.py
  tests/test_agent.py
+ tests/test_private_llm.py
  tests/test_tools.py
  vectara_agentic/__init__.py
  vectara_agentic/_callback.py
{vectara_agentic-0.2.0 → vectara_agentic-0.2.2}/vectara_agentic.egg-info/requires.txt

@@ -1,16 +1,16 @@
- llama-index==0.12.11
- llama-index-indices-managed-vectara==0.4.0
+ llama-index==0.12.22
+ llama-index-indices-managed-vectara==0.4.1
  llama-index-agent-llm-compiler==0.3.0
  llama-index-agent-lats==0.3.0
- llama-index-agent-openai==0.4.3
- llama-index-llms-openai==0.3.18
- llama-index-llms-anthropic==0.6.4
+ llama-index-agent-openai==0.4.6
+ llama-index-llms-openai==0.3.25
+ llama-index-llms-anthropic==0.6.7
  llama-index-llms-together==0.3.1
  llama-index-llms-groq==0.3.1
- llama-index-llms-fireworks==0.3.1
+ llama-index-llms-fireworks==0.3.2
  llama-index-llms-cohere==0.4.0
- llama-index-llms-gemini==0.4.4
- llama-index-llms-bedrock==0.3.3
+ llama-index-llms-gemini==0.4.11
+ llama-index-llms-bedrock==0.3.4
  llama-index-tools-yahoo-finance==0.3.0
  llama-index-tools-arxiv==0.3.0
  llama-index-tools-database==0.3.0
@@ -20,8 +20,8 @@ llama-index-tools-neo4j==0.3.0
  llama-index-graph-stores-kuzu==0.6.0
  llama-index-tools-slack==0.3.0
  llama-index-tools-exa==0.3.0
- tavily-python==0.5.0
- exa-py==1.8.5
+ tavily-python==0.5.1
+ exa-py==1.8.9
  yahoo-finance==1.4.0
  openinference-instrumentation-llama-index==3.1.4
  opentelemetry-proto==1.26.0
@@ -32,6 +32,6 @@ tokenizers>=0.20
  pydantic==2.10.3
  retrying==1.3.4
  python-dotenv==1.0.1
- tiktoken==0.8.0
- dill>=0.3.7
+ tiktoken==0.9.0
+ cloudpickle>=3.1.1
  httpx==0.27.2
The remaining files (19-30 in the list above) contain no changes between versions.