PyPI - haystack-experimental - Versions diffs - 0.0.2.dev0__tar.gz → 0.1.0__tar.gz - Mend

haystack-experimental 0.0.2.dev0tar.gz → 0.1.0tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (30) hide show

{haystack_experimental-0.0.2.dev0 → haystack_experimental-0.1.0}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.3
 Name: haystack-experimental
-Version: 0.0.2.dev0
+Version: 0.1.0
 Summary: Experimental components and features for the Haystack LLM framework.
 Project-URL: CI: GitHub, https://github.com/deepset-ai/haystack-experimental/actions
 Project-URL: GitHub: issues, https://github.com/deepset-ai/haystack-experimental/issues
@@ -54,23 +54,25 @@ $ pip install -U haystack-experimental
 ## Experiments lifecycle
-Any experimental feature will be removed from `haystack-experimental` after a period of 3 months. After this time,
-the experiment will be either:
-- Merged into Haystack core and published in the next minor release
-- Released as a Core Integration,
+Each experimental feature has a default lifespan of 3 months starting from the date of the first non-pre-release build
+that includes it. Once it reaches the end of its lifespan, the experiment will be either:
+- Merged into Haystack core and published in the next minor release, or
+- Released as a Core Integration, or
 - Dropped.
 ## Experiments catalog
 The latest version of the package contains the following experiments:
-| Name                     | Type                    | Experiment end date |
-| ------------------------ | ----------------------- | ------------------- |
-| [`EvaluationHarness`][1] | Evaluation orchestrator | August 2024         |
-| [`OpenAIFunctionCaller`][2] | Function Calling Component | August 2024         |
+| Name                     | Type                    | Expected experiment end date | Dependencies |
+| ------------------------ | ----------------------- | ------------------- | ------------------- |
+| [`EvaluationHarness`][1] | Evaluation orchestrator | October 2024         | None
+| [`OpenAIFunctionCaller`][2] | Function Calling Component | October 2024         | None
+| [`OpenAPITool`][3]       | OpenAPITool component   | October 2024         | jsonref
 [1]: https://github.com/deepset-ai/haystack-experimental/tree/main/haystack_experimental/evaluation/harness
 [2]: https://github.com/deepset-ai/haystack-experimental/tree/main/haystack_experimental/components/tools/openai
+[3]: https://github.com/deepset-ai/haystack-experimental/tree/main/haystack_experimental/components/tools/openapi
 ## Usage
@@ -96,9 +98,11 @@ pipe = Pipeline()
 pipe.run(...)
 ```
+Some experimental features come with example notebooks and resources that can be found in the [`examples` folder](https://github.com/deepset-ai/haystack-experimental/tree/main/examples).
 ## Documentation
-Documentation for `haystack-experimental` can be found [here](https://docs.haystack.deepset.ai/reference/haystack-experimental-api).
+Documentation for `haystack-experimental` can be found [here](https://docs.haystack.deepset.ai/reference/).
 ## Implementation

{haystack_experimental-0.0.2.dev0 → haystack_experimental-0.1.0}/README.md RENAMED Viewed

@@ -26,23 +26,25 @@ $ pip install -U haystack-experimental
 ## Experiments lifecycle
-Any experimental feature will be removed from `haystack-experimental` after a period of 3 months. After this time,
-the experiment will be either:
-- Merged into Haystack core and published in the next minor release
-- Released as a Core Integration,
+Each experimental feature has a default lifespan of 3 months starting from the date of the first non-pre-release build
+that includes it. Once it reaches the end of its lifespan, the experiment will be either:
+- Merged into Haystack core and published in the next minor release, or
+- Released as a Core Integration, or
 - Dropped.
 ## Experiments catalog
 The latest version of the package contains the following experiments:
-| Name                     | Type                    | Experiment end date |
-| ------------------------ | ----------------------- | ------------------- |
-| [`EvaluationHarness`][1] | Evaluation orchestrator | August 2024         |
-| [`OpenAIFunctionCaller`][2] | Function Calling Component | August 2024         |
+| Name                     | Type                    | Expected experiment end date | Dependencies |
+| ------------------------ | ----------------------- | ------------------- | ------------------- |
+| [`EvaluationHarness`][1] | Evaluation orchestrator | October 2024         | None
+| [`OpenAIFunctionCaller`][2] | Function Calling Component | October 2024         | None
+| [`OpenAPITool`][3]       | OpenAPITool component   | October 2024         | jsonref
 [1]: https://github.com/deepset-ai/haystack-experimental/tree/main/haystack_experimental/evaluation/harness
 [2]: https://github.com/deepset-ai/haystack-experimental/tree/main/haystack_experimental/components/tools/openai
+[3]: https://github.com/deepset-ai/haystack-experimental/tree/main/haystack_experimental/components/tools/openapi
 ## Usage
@@ -68,9 +70,11 @@ pipe = Pipeline()
 pipe.run(...)
 ```
+Some experimental features come with example notebooks and resources that can be found in the [`examples` folder](https://github.com/deepset-ai/haystack-experimental/tree/main/examples).
 ## Documentation
-Documentation for `haystack-experimental` can be found [here](https://docs.haystack.deepset.ai/reference/haystack-experimental-api).
+Documentation for `haystack-experimental` can be found [here](https://docs.haystack.deepset.ai/reference/).
 ## Implementation

haystack_experimental-0.1.0/haystack_experimental/components/tools/openapi/__init__.py ADDED Viewed

@@ -0,0 +1,8 @@
+# SPDX-FileCopyrightText: 2022-present deepset GmbH <info@deepset.ai>
+#
+# SPDX-License-Identifier: Apache-2.0
+from haystack_experimental.components.tools.openapi.openapi_tool import OpenAPITool
+from haystack_experimental.components.tools.openapi.types import LLMProvider
+__all__ = ["LLMProvider", "OpenAPITool"]

haystack_experimental-0.1.0/haystack_experimental/components/tools/openapi/_openapi.py ADDED Viewed

@@ -0,0 +1,341 @@
+# SPDX-FileCopyrightText: 2022-present deepset GmbH <info@deepset.ai>
+#
+# SPDX-License-Identifier: Apache-2.0
+import logging
+from collections import defaultdict
+from typing import Any, Callable, Dict, List, Optional
+import requests
+from haystack_experimental.components.tools.openapi._payload_extraction import (
+    create_function_payload_extractor,
+)
+from haystack_experimental.components.tools.openapi._schema_conversion import (
+    anthropic_converter,
+    cohere_converter,
+    openai_converter,
+)
+from haystack_experimental.components.tools.openapi.types import LLMProvider, OpenAPISpecification, Operation
+MIN_REQUIRED_OPENAPI_SPEC_VERSION = 3
+logger = logging.getLogger(__name__)
+def send_request(request: Dict[str, Any]) -> Dict[str, Any]:
+    """
+    Send an HTTP request and return the response.
+    :param request: The request to send.
+    :returns: The response from the server.
+    """
+    url = request["url"]
+    headers = {**request.get("headers", {})}
+    try:
+        response = requests.request(
+            request["method"],
+            url,
+            headers=headers,
+            params=request.get("params", {}),
+            json=request.get("json"),
+            auth=request.get("auth"),
+            timeout=30,
+        )
+        response.raise_for_status()
+        return response.json()
+    except requests.exceptions.HTTPError as e:
+        logger.warning("HTTP error occurred: %s while sending request to %s", e, url)
+        raise HttpClientError(f"HTTP error occurred: {e}") from e
+    except requests.exceptions.RequestException as e:
+        logger.warning("Request error occurred: %s while sending request to %s", e, url)
+        raise HttpClientError(f"HTTP error occurred: {e}") from e
+    except Exception as e:
+        logger.warning("An error occurred: %s while sending request to %s", e, url)
+        raise HttpClientError(f"An error occurred: {e}") from e
+# Authentication strategies
+def create_api_key_auth_function(api_key: str) -> Callable[[Dict[str, Any], Dict[str, Any]], None]:
+    """
+    Create a function that applies the API key authentication strategy to a given request.
+    :param api_key: the API key to use for authentication.
+    :returns: a function that applies the API key authentication to a request
+    at the schema specified location.
+    """
+    def apply_auth(security_scheme: Dict[str, Any], request: Dict[str, Any]) -> None:
+        """
+        Apply the API key authentication strategy to the given request.
+        :param security_scheme: the security scheme from the OpenAPI spec.
+        :param request: the request to apply the authentication to.
+        """
+        if security_scheme["in"] == "header":
+            request.setdefault("headers", {})[security_scheme["name"]] = api_key
+        elif security_scheme["in"] == "query":
+            request.setdefault("params", {})[security_scheme["name"]] = api_key
+        elif security_scheme["in"] == "cookie":
+            request.setdefault("cookies", {})[security_scheme["name"]] = api_key
+        else:
+            raise ValueError(
+                f"Unsupported apiKey authentication location: {security_scheme['in']}, "
+                f"must be one of 'header', 'query', or 'cookie'"
+            )
+    return apply_auth
+def create_http_auth_function(token: str) -> Callable[[Dict[str, Any], Dict[str, Any]], None]:
+    """
+    Create a function that applies the http authentication strategy to a given request.
+    :param token: the authentication token to use.
+    :returns: a function that applies the API key authentication to a request
+    at the schema specified location.
+    """
+    def apply_auth(security_scheme: Dict[str, Any], request: Dict[str, Any]) -> None:
+        """
+        Apply the HTTP authentication strategy to the given request.
+        :param security_scheme: the security scheme from the OpenAPI spec.
+        :param request: the request to apply the authentication to.
+        """
+        if security_scheme["type"] == "http":
+            # support bearer http auth, no basic support yet
+            if security_scheme["scheme"].lower() == "bearer":
+                if not token:
+                    raise ValueError("Token must be provided for Bearer Auth.")
+                request.setdefault("headers", {})[
+                    "Authorization"
+                ] = f"Bearer {token}"
+            else:
+                raise ValueError(
+                    f"Unsupported HTTP authentication scheme: {security_scheme['scheme']}"
+                )
+        else:
+            raise ValueError(
+                "HTTPAuthentication strategy received a non-HTTP security scheme."
+            )
+    return apply_auth
+class HttpClientError(Exception):
+    """Exception raised for errors in the HTTP client."""
+class ClientConfiguration:
+    """Configuration for the OpenAPI client."""
+    def __init__(
+        self,
+        openapi_spec: OpenAPISpecification,
+        credentials: Optional[str] = None,
+        request_sender: Optional[Callable[[Dict[str, Any]], Dict[str, Any]]] = None,
+        llm_provider: LLMProvider = LLMProvider.OPENAI,
+    ):  # noqa: PLR0913
+        """
+        Initialize a ClientConfiguration instance.
+        :param openapi_spec: The OpenAPI specification to use for the client.
+        :param credentials: The credentials to use for authentication.
+        :param request_sender: The function to use for sending requests.
+        :param llm_provider: The LLM provider to use for generating tools definitions.
+        :raises ValueError: If the OpenAPI specification format is invalid.
+        """
+        self.openapi_spec = openapi_spec
+        self.credentials = credentials
+        self.request_sender = request_sender or send_request
+        self.llm_provider: LLMProvider = llm_provider
+    def get_auth_function(self) -> Callable[[Dict[str, Any], Dict[str, Any]], Any]:
+        """
+        Get the authentication function that sets a schema specified authentication to the request.
+        The function takes a security scheme and a request as arguments:
+            `security_scheme: Dict[str, Any] - The security scheme from the OpenAPI spec.`
+            `request: Dict[str, Any] - The request to apply the authentication to.`
+        :returns: The authentication function.
+        :raises ValueError: If the credentials type is not supported.
+        """
+        security_schemes = self.openapi_spec.get_security_schemes()
+        if not self.credentials:
+            return lambda security_scheme, request: None  # No-op function
+        if isinstance(self.credentials, str):
+            return self._create_authentication_from_string(
+                self.credentials, security_schemes
+            )
+        raise ValueError(f"Unsupported credentials type: {type(self.credentials)}")
+    def get_tools_definitions(self) -> List[Dict[str, Any]]:
+        """
+        Get the tools definitions used as tools LLM parameter.
+        :returns: The tools definitions passed to the LLM as tools parameter.
+        """
+        provider_to_converter = defaultdict(
+            lambda: openai_converter,
+            {
+                LLMProvider.ANTHROPIC: anthropic_converter,
+                LLMProvider.COHERE: cohere_converter,
+            }
+        )
+        converter = provider_to_converter[self.llm_provider]
+        return converter(self.openapi_spec)
+    def get_payload_extractor(self) -> Callable[[Dict[str, Any]], Dict[str, Any]]:
+        """
+        Get the payload extractor for the LLM provider.
+        This function knows how to extract the exact function payload from the LLM generated function calling payload.
+        :returns: The payload extractor function.
+        """
+        provider_to_arguments_field_name = defaultdict(
+            lambda: "arguments",
+            {
+                LLMProvider.ANTHROPIC: "input",
+                LLMProvider.COHERE: "parameters",
+            }
+        )
+        arguments_field_name = provider_to_arguments_field_name[self.llm_provider]
+        return create_function_payload_extractor(arguments_field_name)
+    def _create_authentication_from_string(
+            self, credentials: str, security_schemes: Dict[str, Any]
+    ) -> Callable[[Dict[str, Any], Dict[str, Any]], Any]:
+        for scheme in security_schemes.values():
+            if scheme["type"] == "apiKey":
+                return create_api_key_auth_function(api_key=credentials)
+            if scheme["type"] == "http":
+                return create_http_auth_function(token=credentials)
+            raise ValueError(
+                f"Unsupported authentication type '{scheme['type']}' provided."
+            )
+        raise ValueError(
+            f"Unable to create authentication from provided credentials: {credentials}"
+        )
+def build_request(operation: Operation, **kwargs) -> Dict[str, Any]:
+    """
+    Build an HTTP request for the operation.
+    :param operation: The operation to build the request for.
+    :param kwargs: The arguments to use for building the request.
+    :returns: The HTTP request as a dictionary.
+    :raises ValueError: If a required parameter is missing.
+    :raises NotImplementedError: If the request body content type is not supported. We only support JSON payloads.
+    """
+    path = operation.path
+    for parameter in operation.get_parameters("path"):
+        param_value = kwargs.get(parameter["name"], None)
+        if param_value:
+            path = path.replace(f"{{{parameter['name']}}}", str(param_value))
+        elif parameter.get("required", False):
+            raise ValueError(f"Missing required path parameter: {parameter['name']}")
+    url = operation.get_server() + path
+    # method
+    method = operation.method.lower()
+    # headers
+    headers = {}
+    for parameter in operation.get_parameters("header"):
+        param_value = kwargs.get(parameter["name"], None)
+        if param_value:
+            headers[parameter["name"]] = str(param_value)
+        elif parameter.get("required", False):
+            raise ValueError(f"Missing required header parameter: {parameter['name']}")
+    # query params
+    query_params = {}
+    for parameter in operation.get_parameters("query"):
+        param_value = kwargs.get(parameter["name"], None)
+        if param_value:
+            query_params[parameter["name"]] = param_value
+        elif parameter.get("required", False):
+            raise ValueError(f"Missing required query parameter: {parameter['name']}")
+    json_payload = None
+    request_body = operation.request_body
+    if request_body:
+        content = request_body.get("content", {})
+        if "application/json" in content:
+            json_payload = {**kwargs}
+        else:
+            raise NotImplementedError("Request body content type not supported")
+    return {
+        "url": url,
+        "method": method,
+        "headers": headers,
+        "params": query_params,
+        "json": json_payload,
+    }
+def apply_authentication(
+    auth_strategy: Callable[[Dict[str, Any], Dict[str, Any]], Any],
+    operation: Operation,
+    request: Dict[str, Any],
+):
+    """
+    Apply the authentication strategy to the given request.
+    :param auth_strategy: The authentication strategy to apply.
+    This is a function that takes a security scheme and a request as arguments (at runtime)
+    and applies the authentication
+    :param operation: The operation to apply the authentication to.
+    :param request: The request to apply the authentication to.
+    """
+    security_requirements = operation.security_requirements
+    security_schemes = operation.spec_dict.get("components", {}).get(
+        "securitySchemes", {}
+    )
+    if security_requirements:
+        for requirement in security_requirements:
+            for scheme_name in requirement:
+                if scheme_name in security_schemes:
+                    security_scheme = security_schemes[scheme_name]
+                    auth_strategy(security_scheme, request)
+                break
+class OpenAPIServiceClient:
+    """
+    A client for invoking operations on REST services defined by OpenAPI specifications.
+    """
+    def __init__(self, client_config: ClientConfiguration):
+        self.client_config = client_config
+    def invoke(self, function_payload: Any) -> Any:
+        """
+        Invokes a function specified in the function payload.
+        :param function_payload: The function payload containing the details of the function to be invoked.
+        :returns: The response from the service after invoking the function.
+        :raises OpenAPIClientError: If the function invocation payload cannot be extracted from the function payload.
+        :raises HttpClientError: If an error occurs while sending the request and receiving the response.
+        """
+        fn_invocation_payload = {}
+        try:
+            fn_extractor = self.client_config.get_payload_extractor()
+            fn_invocation_payload = fn_extractor(function_payload)
+        except Exception as e:
+            raise OpenAPIClientError(
+                f"Error extracting function invocation payload: {str(e)}"
+            ) from e
+        if "name" not in fn_invocation_payload or "arguments" not in fn_invocation_payload:
+            raise OpenAPIClientError(
+                f"Function invocation payload does not contain 'name' or 'arguments' keys: {fn_invocation_payload}, "
+                f"the payload extraction function may be incorrect."
+            )
+        # fn_invocation_payload, if not empty, guaranteed to have "name" and "arguments" keys from here on
+        operation = self.client_config.openapi_spec.find_operation_by_id(fn_invocation_payload["name"])
+        request = build_request(operation, **fn_invocation_payload["arguments"])
+        apply_authentication(self.client_config.get_auth_function(), operation, request)
+        return self.client_config.request_sender(request)
+class OpenAPIClientError(Exception):
+    """Exception raised for errors in the OpenAPI client."""

haystack_experimental-0.1.0/haystack_experimental/components/tools/openapi/_payload_extraction.py ADDED Viewed

@@ -0,0 +1,85 @@
+# SPDX-FileCopyrightText: 2022-present deepset GmbH <info@deepset.ai>
+#
+# SPDX-License-Identifier: Apache-2.0
+import dataclasses
+import json
+from typing import Any, Callable, Dict, List, Optional, Union
+def create_function_payload_extractor(
+    arguments_field_name: str,
+) -> Callable[[Any], Dict[str, Any]]:
+    """
+    Extracts invocation payload from a given LLM completion containing function invocation.
+    :param arguments_field_name: The name of the field containing the function arguments.
+    :return: A function that extracts the function invocation details from the LLM payload.
+    """
+    def _extract_function_invocation(payload: Any) -> Dict[str, Any]:
+        """
+        Extract the function invocation details from the payload.
+        :param payload: The LLM fc payload to extract the function invocation details from.
+        """
+        fields_and_values = _search(payload, arguments_field_name)
+        if fields_and_values:
+            arguments = fields_and_values.get(arguments_field_name)
+            if not isinstance(arguments, (str, dict)):
+                raise ValueError(
+                    f"Invalid {arguments_field_name} type {type(arguments)} for function call, expected str/dict"
+                )
+            return {
+                "name": fields_and_values.get("name"),
+                "arguments": (
+                    json.loads(arguments) if isinstance(arguments, str) else arguments
+                ),
+            }
+        return {}
+    return _extract_function_invocation
+def _get_dict_converter(
+    obj: Any, method_names: Optional[List[str]] = None
+) -> Union[Callable[[], Dict[str, Any]], None]:
+    method_names = method_names or [
+        "model_dump",
+        "dict",
+    ]  # search for pydantic v2 then v1
+    for attr in method_names:
+        if hasattr(obj, attr) and callable(getattr(obj, attr)):
+            return getattr(obj, attr)
+    return None
+def _is_primitive(obj) -> bool:
+    return isinstance(obj, (int, float, str, bool, type(None)))
+def _required_fields(arguments_field_name: str) -> List[str]:
+    return ["name", arguments_field_name]
+def _search(payload: Any, arguments_field_name: str) -> Dict[str, Any]:
+    if _is_primitive(payload):
+        return {}
+    if dict_converter := _get_dict_converter(payload):
+        payload = dict_converter()
+    elif dataclasses.is_dataclass(payload):
+        payload = dataclasses.asdict(payload)
+    if isinstance(payload, dict):
+        if all(field in payload for field in _required_fields(arguments_field_name)):
+            # this is the payload we are looking for
+            return payload
+        for value in payload.values():
+            result = _search(value, arguments_field_name)
+            if result:
+                return result
+    elif isinstance(payload, list):
+        for item in payload:
+            result = _search(item, arguments_field_name)
+            if result:
+                return result
+    return {}

haystack-experimental 0.0.2.dev0__tar.gz → 0.1.0__tar.gz

haystack-experimental 0.0.2.dev0tar.gz → 0.1.0tar.gz