PyPI - openaivec - Versions diffs - 0.14.10__tar.gz → 0.14.13__tar.gz - Mend


This diff compares the contents of two publicly released package versions as they appear in their public registry. It is provided for informational purposes only.

Files changed (91)

{openaivec-0.14.10 → openaivec-0.14.13}/.github/copilot-instructions.md RENAMED

@@ -24,7 +24,10 @@ Entry points:
 - Spark UDF builders in `spark.py`
 - Structured tasks under `task/`
-Azure note: Use deployment name as `model`. Warn if base URL not v1. Behavior otherwise mirrors OpenAI.
+Azure note: Use deployment name as `model`. Standard Azure OpenAI configuration uses:
+- Base URL: `https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/`
+- API Version: `"preview"`
+Warn if base URL not v1. Behavior otherwise mirrors OpenAI.
 ---
@@ -137,7 +140,16 @@ Public exports (`__init__.py`): `BatchResponses`, `AsyncBatchResponses`, `BatchE
 ## 10. Provider / Azure Rules
 - Auto-detect provider from env variables; deployment name = model for Azure.
-- Warn (don’t fail) if Azure base URL not v1 format; still proceed.
+- Standard Azure OpenAI configuration:
+  - Base URL: `https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/`
+  - API Version: `"preview"`
+  - Environment variables:
+    ```bash
+    export AZURE_OPENAI_API_KEY="your-azure-key"
+    export AZURE_OPENAI_BASE_URL="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/"
+    export AZURE_OPENAI_API_VERSION="preview"
+    ```
+- Warn (don't fail) if Azure base URL not v1 format; still proceed.
 - Keep code paths unified; avoid forking logic unless behavior diverges.
 ---
@@ -348,6 +360,9 @@ uv run mkdocs serve
 Environment setup notes:
 - Set `OPENAI_API_KEY` or Azure trio (`AZURE_OPENAI_API_KEY`, `AZURE_OPENAI_BASE_URL`, `AZURE_OPENAI_API_VERSION`).
+- Standard Azure OpenAI configuration:
+  - `AZURE_OPENAI_BASE_URL="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/"`
+  - `AZURE_OPENAI_API_VERSION="preview"`
 - Tests auto-skip live paths when credentials absent.
 - Use separate shell profiles per provider if switching frequently.
-- Azure canonical base URL should end with `/openai/v1/` (e.g. `https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/`); non‑v1 forms emit a warning.
+- Azure canonical base URL must end with `/openai/v1/` (e.g. `https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/`); non‑v1 forms emit a warning.
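The "warn, don't fail" rule for non-v1 Azure base URLs described in these notes could look roughly like the sketch below. The helper name `warn_if_not_v1` and the warning text are hypothetical and are not taken from openaivec; only the `/openai/v1/` suffix comes from the notes above.

```python
import warnings


def warn_if_not_v1(base_url: str) -> bool:
    """Return True if base_url ends with /openai/v1/; otherwise warn and return False."""
    # Hypothetical helper: the actual check inside openaivec may differ.
    is_v1 = base_url.endswith("/openai/v1/")
    if not is_v1:
        # Emit a warning instead of raising, so the caller can still proceed.
        warnings.warn(
            f"Azure base URL {base_url!r} does not end with '/openai/v1/'",
            UserWarning,
            stacklevel=2,
        )
    return is_v1
```

A caller would invoke this once while building the client and continue regardless of the return value.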

{openaivec-0.14.10 → openaivec-0.14.13}/PKG-INFO RENAMED

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: openaivec
-Version: 0.14.10
+Version: 0.14.13
 Summary: Generative mutation for tabular calculation
 Project-URL: Homepage, https://microsoft.github.io/openaivec/
 Project-URL: Repository, https://github.com/microsoft/openaivec
@@ -334,26 +334,34 @@ Scale to enterprise datasets with distributed processing:
 First, obtain a Spark session and configure authentication:
 ```python
-import os
 from pyspark.sql import SparkSession
+from openaivec.spark import setup, setup_azure
 spark = SparkSession.builder.getOrCreate()
-sc = spark.sparkContext
-# Configure authentication via SparkContext environment variables
 # Option 1: Using OpenAI
-sc.environment["OPENAI_API_KEY"] = os.environ.get("OPENAI_API_KEY")
+setup(
+    spark,
+    api_key="your-openai-api-key",
+    responses_model_name="gpt-4.1-mini",  # Optional: set default model
+    embeddings_model_name="text-embedding-3-small"  # Optional: set default model
+)
 # Option 2: Using Azure OpenAI
-# sc.environment["AZURE_OPENAI_API_KEY"] = os.environ.get("AZURE_OPENAI_API_KEY")
-# sc.environment["AZURE_OPENAI_BASE_URL"] = os.environ.get("AZURE_OPENAI_BASE_URL")
-# sc.environment["AZURE_OPENAI_API_VERSION"] = os.environ.get("AZURE_OPENAI_API_VERSION")
+# setup_azure(
+#     spark,
+#     api_key="your-azure-openai-api-key",
+#     base_url="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/",
+#     api_version="preview",
+#     responses_model_name="my-gpt4-deployment",  # Optional: set default deployment
+#     embeddings_model_name="my-embedding-deployment"  # Optional: set default deployment
+# )
 ```
 Next, create and register UDFs using the provided functions:
 ```python
-from openaivec.spark import responses_udf, task_udf, embeddings_udf, count_tokens_udf
+from openaivec.spark import responses_udf, task_udf, embeddings_udf, count_tokens_udf, similarity_udf, parse_udf
 from pydantic import BaseModel
 # --- Register Responses UDF (String Output) ---
@@ -387,6 +395,9 @@ spark.udf.register(
 # --- Register Token Counting UDF ---
 spark.udf.register("count_tokens", count_tokens_udf())
+# --- Register Similarity UDF ---
+spark.udf.register("compute_similarity", similarity_udf())
 # --- Register UDFs with Pre-configured Tasks ---
 from openaivec.task import nlp, customer_support
@@ -414,6 +425,17 @@ spark.udf.register(
     )
 )
+# --- Register Parse UDF (Dynamic Schema Inference) ---
+spark.udf.register(
+    "parse_dynamic",
+    parse_udf(
+        instructions="Extract key entities and attributes from the text",
+        example_table_name="sample_texts",  # Infer schema from examples
+        example_field_name="text",
+        max_examples=50
+    )
+)
 ```
 You can now use these UDFs in Spark SQL:
@@ -691,17 +713,19 @@ steps:
    - In the notebook, import and use `openaivec.spark` functions as you normally would. For example:
      ```python
-     import os
-     from openaivec.spark import responses_udf, embeddings_udf
+     from openaivec.spark import setup_azure, responses_udf, embeddings_udf
      # In Microsoft Fabric, spark session is automatically available
      # spark = SparkSession.builder.getOrCreate()
-     sc = spark.sparkContext
      # Configure Azure OpenAI authentication
-     sc.environment["AZURE_OPENAI_API_KEY"] = "<your-api-key>"
-     sc.environment["AZURE_OPENAI_BASE_URL"] = "https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/"
-     sc.environment["AZURE_OPENAI_API_VERSION"] = "preview"
+     setup_azure(
+         spark,
+         api_key="<your-api-key>",
+         base_url="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/",
+         api_version="preview",
+         responses_model_name="my-gpt4-deployment"  # Your Azure deployment name
+     )
      # Register UDFs
      spark.udf.register(

{openaivec-0.14.10 → openaivec-0.14.13}/README.md RENAMED

@@ -308,26 +308,34 @@ Scale to enterprise datasets with distributed processing:
 First, obtain a Spark session and configure authentication:
 ```python
-import os
 from pyspark.sql import SparkSession
+from openaivec.spark import setup, setup_azure
 spark = SparkSession.builder.getOrCreate()
-sc = spark.sparkContext
-# Configure authentication via SparkContext environment variables
 # Option 1: Using OpenAI
-sc.environment["OPENAI_API_KEY"] = os.environ.get("OPENAI_API_KEY")
+setup(
+    spark,
+    api_key="your-openai-api-key",
+    responses_model_name="gpt-4.1-mini",  # Optional: set default model
+    embeddings_model_name="text-embedding-3-small"  # Optional: set default model
+)
 # Option 2: Using Azure OpenAI
-# sc.environment["AZURE_OPENAI_API_KEY"] = os.environ.get("AZURE_OPENAI_API_KEY")
-# sc.environment["AZURE_OPENAI_BASE_URL"] = os.environ.get("AZURE_OPENAI_BASE_URL")
-# sc.environment["AZURE_OPENAI_API_VERSION"] = os.environ.get("AZURE_OPENAI_API_VERSION")
+# setup_azure(
+#     spark,
+#     api_key="your-azure-openai-api-key",
+#     base_url="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/",
+#     api_version="preview",
+#     responses_model_name="my-gpt4-deployment",  # Optional: set default deployment
+#     embeddings_model_name="my-embedding-deployment"  # Optional: set default deployment
+# )
 ```
 Next, create and register UDFs using the provided functions:
 ```python
-from openaivec.spark import responses_udf, task_udf, embeddings_udf, count_tokens_udf
+from openaivec.spark import responses_udf, task_udf, embeddings_udf, count_tokens_udf, similarity_udf, parse_udf
 from pydantic import BaseModel
 # --- Register Responses UDF (String Output) ---
@@ -361,6 +369,9 @@ spark.udf.register(
 # --- Register Token Counting UDF ---
 spark.udf.register("count_tokens", count_tokens_udf())
+# --- Register Similarity UDF ---
+spark.udf.register("compute_similarity", similarity_udf())
 # --- Register UDFs with Pre-configured Tasks ---
 from openaivec.task import nlp, customer_support
@@ -388,6 +399,17 @@ spark.udf.register(
     )
 )
+# --- Register Parse UDF (Dynamic Schema Inference) ---
+spark.udf.register(
+    "parse_dynamic",
+    parse_udf(
+        instructions="Extract key entities and attributes from the text",
+        example_table_name="sample_texts",  # Infer schema from examples
+        example_field_name="text",
+        max_examples=50
+    )
+)
 ```
 You can now use these UDFs in Spark SQL:
@@ -665,17 +687,19 @@ steps:
    - In the notebook, import and use `openaivec.spark` functions as you normally would. For example:
      ```python
-     import os
-     from openaivec.spark import responses_udf, embeddings_udf
+     from openaivec.spark import setup_azure, responses_udf, embeddings_udf
      # In Microsoft Fabric, spark session is automatically available
      # spark = SparkSession.builder.getOrCreate()
-     sc = spark.sparkContext
      # Configure Azure OpenAI authentication
-     sc.environment["AZURE_OPENAI_API_KEY"] = "<your-api-key>"
-     sc.environment["AZURE_OPENAI_BASE_URL"] = "https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/"
-     sc.environment["AZURE_OPENAI_API_VERSION"] = "preview"
+     setup_azure(
+         spark,
+         api_key="<your-api-key>",
+         base_url="https://YOUR-RESOURCE-NAME.services.ai.azure.com/openai/v1/",
+         api_version="preview",
+         responses_model_name="my-gpt4-deployment"  # Your Azure deployment name
+     )
      # Register UDFs
      spark.udf.register(

{openaivec-0.14.10 → openaivec-0.14.13}/pyproject.toml RENAMED

@@ -43,6 +43,7 @@ dev = [
     "pyspark>=3.5.5",
     "pytest>=8.3.5",
     "pytest-asyncio",
+    "pytest-mock>=3.14.1",
     "python-dotenv>=1.1.0",
     "ruff>=0.11.5",
     "tabulate>=0.9.0",

openaivec-0.14.13/pytest.ini ADDED

@@ -0,0 +1,42 @@
+[tool:pytest]
+# Pytest configuration for openaivec
+# Test discovery
+testpaths = tests
+python_files = test_*.py
+python_classes = Test*
+python_functions = test_*
+# Markers
+markers =
+    slow: marks tests as slow (deselect with '-m "not slow"')
+    requires_api: marks tests as requiring OPENAI_API_KEY environment variable
+    asyncio: marks tests as async (handled by pytest-asyncio)
+    spark: marks tests as requiring Spark session
+    integration: marks tests as integration tests
+# Output options
+addopts =
+    --tb=short
+    --strict-markers
+    --strict-config
+    --disable-warnings
+    -ra
+# Async configuration
+asyncio_mode = auto
+# Logging
+log_cli = false
+log_cli_level = INFO
+log_cli_format = %(asctime)s [%(levelname)8s] %(name)s: %(message)s
+log_cli_date_format = %Y-%m-%d %H:%M:%S
+# Minimum version
+minversion = 6.0
+# Filter warnings
+filterwarnings =
+    ignore::UserWarning:openai.*
+    ignore::DeprecationWarning:pandas.*
+    ignore::RuntimeWarning:numpy.*
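One detail in the added file may be worth checking: pytest's documentation uses a `[pytest]` section for `pytest.ini` and the `[tool:pytest]` form for `setup.cfg`. The sketch below does not run pytest; it only uses the standard-library `configparser` on a shortened copy of the file to show which section name is declared and how the multi-line `markers` value splits into marker names. The shortened text and the variable names are illustrative.

```python
import configparser

# Shortened copy of the added pytest.ini, for illustration only.
INI_TEXT = """
[tool:pytest]
testpaths = tests
markers =
    slow: marks tests as slow
    spark: marks tests as requiring Spark session
asyncio_mode = auto
"""

# interpolation=None keeps '%' characters literal, which matters for the
# log format strings in the full file.
parser = configparser.ConfigParser(interpolation=None)
parser.read_string(INI_TEXT)

sections = parser.sections()

# Each non-empty line of the markers value is "name: description".
markers = [
    line.split(":", 1)[0].strip()
    for line in parser["tool:pytest"]["markers"].splitlines()
    if line.strip()
]
```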

{openaivec-0.14.10 → openaivec-0.14.13}/src/openaivec/_di.py RENAMED

@@ -303,3 +303,24 @@ class Container:
             self._providers.clear()
             self._instances.clear()
             self._resolving.clear()
+    def clear_singletons(self) -> None:
+        """Clear all cached singleton instances from the container.
+        Removes all cached singleton instances while keeping the registered
+        providers intact. After calling this method, the next resolve call
+        for any service will create a new instance using the provider function.
+        Example:
+            ```python
+            container = Container()
+            container.register(str, lambda: "Hello")
+            instance1 = container.resolve(str)
+            container.clear_singletons()
+            instance2 = container.resolve(str)
+            print(instance1 is instance2)
+            # False - different instances after clearing singletons
+            ```
+        """
+        with self._lock:
+            self._instances.clear()
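The behaviour that `clear_singletons` documents (cached instances are dropped while providers stay registered) can be tried with the minimal stand-in below. `MiniContainer` and `Service` are hypothetical and far simpler than the real `Container`. The sketch resolves a plain class rather than a string literal, because in CPython a provider that returns the same string literal typically hands back the same object, which would make an identity check uninformative.

```python
import threading
from typing import Any, Callable


class MiniContainer:
    """Hypothetical stand-in for a singleton-caching DI container."""

    def __init__(self) -> None:
        self._lock = threading.RLock()
        self._providers: dict[type, Callable[[], Any]] = {}
        self._instances: dict[type, Any] = {}

    def register(self, cls: type, provider: Callable[[], Any]) -> None:
        with self._lock:
            self._providers[cls] = provider
            # Drop any instance that was built by a previous provider.
            self._instances.pop(cls, None)

    def resolve(self, cls: type) -> Any:
        with self._lock:
            # Build the instance on first use, then reuse the cached one.
            if cls not in self._instances:
                self._instances[cls] = self._providers[cls]()
            return self._instances[cls]

    def clear_singletons(self) -> None:
        with self._lock:
            # Providers are kept; only the cached instances are removed.
            self._instances.clear()


class Service:
    """Placeholder service type used only for this sketch."""


container = MiniContainer()
container.register(Service, Service)

first = container.resolve(Service)
same = container.resolve(Service)      # served from the cache
container.clear_singletons()
second = container.resolve(Service)    # rebuilt by the provider
```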

{openaivec-0.14.10 → openaivec-0.14.13}/src/openaivec/_embeddings.py RENAMED

@@ -26,14 +26,16 @@ class BatchEmbeddings:
         model_name (str): For Azure OpenAI, use your deployment name. For OpenAI, use the model name
             (e.g., ``"text-embedding-3-small"``).
         cache (BatchingMapProxy[str, NDArray[np.float32]]): Batching proxy for ordered, cached mapping.
+        api_kwargs (dict[str, Any]): Additional OpenAI API parameters stored at initialization.
     """
     client: OpenAI
     model_name: str
     cache: BatchingMapProxy[str, NDArray[np.float32]] = field(default_factory=lambda: BatchingMapProxy(batch_size=None))
+    api_kwargs: dict[str, int | float | str | bool] = field(default_factory=dict)
     @classmethod
-    def of(cls, client: OpenAI, model_name: str, batch_size: int | None = None) -> "BatchEmbeddings":
+    def of(cls, client: OpenAI, model_name: str, batch_size: int | None = None, **api_kwargs) -> "BatchEmbeddings":
         """Factory constructor.
         Args:
@@ -41,11 +43,17 @@ class BatchEmbeddings:
             model_name (str): For Azure OpenAI, use your deployment name. For OpenAI, use the model name.
             batch_size (int | None, optional): Max unique inputs per API call. Defaults to None
                 (automatic batch size optimization). Set to a positive integer for fixed batch size.
+            **api_kwargs: Additional OpenAI API parameters (e.g., dimensions for text-embedding-3 models).
         Returns:
             BatchEmbeddings: Configured instance backed by a batching proxy.
         """
-        return cls(client=client, model_name=model_name, cache=BatchingMapProxy(batch_size=batch_size))
+        return cls(
+            client=client,
+            model_name=model_name,
+            cache=BatchingMapProxy(batch_size=batch_size),
+            api_kwargs=api_kwargs,
+        )
     @observe(_LOGGER)
     @backoff(exceptions=[RateLimitError, InternalServerError], scale=1, max_retries=12)
@@ -62,7 +70,7 @@ class BatchEmbeddings:
         Returns:
             list[NDArray[np.float32]]: Embedding vectors aligned to ``inputs``.
         """
-        responses = self.client.embeddings.create(input=inputs, model=self.model_name)
+        responses = self.client.embeddings.create(input=inputs, model=self.model_name, **self.api_kwargs)
         return [np.array(d.embedding, dtype=np.float32) for d in responses.data]
     @observe(_LOGGER)
@@ -122,6 +130,7 @@ class AsyncBatchEmbeddings:
         client (AsyncOpenAI): Configured OpenAI async client.
         model_name (str): For Azure OpenAI, use your deployment name. For OpenAI, use the model name.
         cache (AsyncBatchingMapProxy[str, NDArray[np.float32]]): Async batching proxy.
+        api_kwargs (dict): Additional OpenAI API parameters stored at initialization.
     """
     client: AsyncOpenAI
@@ -129,6 +138,7 @@ class AsyncBatchEmbeddings:
     cache: AsyncBatchingMapProxy[str, NDArray[np.float32]] = field(
         default_factory=lambda: AsyncBatchingMapProxy(batch_size=None, max_concurrency=8)
     )
+    api_kwargs: dict[str, int | float | str | bool] = field(default_factory=dict)
     @classmethod
     def of(
@@ -137,6 +147,7 @@ class AsyncBatchEmbeddings:
         model_name: str,
         batch_size: int | None = None,
         max_concurrency: int = 8,
+        **api_kwargs,
     ) -> "AsyncBatchEmbeddings":
         """Factory constructor.
@@ -146,6 +157,7 @@ class AsyncBatchEmbeddings:
             batch_size (int | None, optional): Max unique inputs per API call. Defaults to None
                 (automatic batch size optimization). Set to a positive integer for fixed batch size.
             max_concurrency (int, optional): Max concurrent API calls. Defaults to 8.
+            **api_kwargs: Additional OpenAI API parameters (e.g., dimensions for text-embedding-3 models).
         Returns:
             AsyncBatchEmbeddings: Configured instance with an async batching proxy.
@@ -154,6 +166,7 @@ class AsyncBatchEmbeddings:
             client=client,
             model_name=model_name,
             cache=AsyncBatchingMapProxy(batch_size=batch_size, max_concurrency=max_concurrency),
+            api_kwargs=api_kwargs,
         )
     @backoff_async(exceptions=[RateLimitError, InternalServerError], scale=1, max_retries=12)
@@ -174,7 +187,7 @@ class AsyncBatchEmbeddings:
         Raises:
             RateLimitError: Propagated if retries are exhausted.
         """
-        responses = await self.client.embeddings.create(input=inputs, model=self.model_name)
+        responses = await self.client.embeddings.create(input=inputs, model=self.model_name, **self.api_kwargs)
         return [np.array(d.embedding, dtype=np.float32) for d in responses.data]
     @observe(_LOGGER)
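The pattern in this file (collect `**api_kwargs` in the factory, store them on the instance, and splat them into every `embeddings.create` call) can be illustrated without network access. In the sketch below, `RecordingEmbeddingsClient` and `MiniEmbeddings` are hypothetical stand-ins, not openaivec or OpenAI classes, and the returned vectors are dummies.

```python
from dataclasses import dataclass, field
from typing import Any


class RecordingEmbeddingsClient:
    """Hypothetical fake that records the keyword arguments it receives."""

    def __init__(self) -> None:
        self.calls: list[dict[str, Any]] = []

    def create(self, **kwargs: Any) -> list[list[float]]:
        self.calls.append(kwargs)
        # Return one dummy vector per input string.
        return [[0.0] for _ in kwargs["input"]]


@dataclass(frozen=True)
class MiniEmbeddings:
    """Hypothetical stand-in mirroring the api_kwargs field added above."""

    client: RecordingEmbeddingsClient
    model_name: str
    api_kwargs: dict[str, Any] = field(default_factory=dict)

    @classmethod
    def of(cls, client: RecordingEmbeddingsClient, model_name: str, **api_kwargs: Any) -> "MiniEmbeddings":
        # Extra keyword arguments are captured once, at construction time.
        return cls(client=client, model_name=model_name, api_kwargs=api_kwargs)

    def embed(self, inputs: list[str]) -> list[list[float]]:
        # The stored parameters are forwarded unchanged on every call.
        return self.client.create(input=inputs, model=self.model_name, **self.api_kwargs)


client = RecordingEmbeddingsClient()
embedder = MiniEmbeddings.of(client, "text-embedding-3-small", dimensions=256)
vectors = embedder.embed(["hello", "world"])
```

One consequence of forwarding with `**`: a key in `api_kwargs` that collides with `input` or `model` raises a `TypeError` at call time, because Python rejects duplicate keyword arguments.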

{openaivec-0.14.10 → openaivec-0.14.13}/src/openaivec/_model.py RENAMED

@@ -1,4 +1,4 @@
-from dataclasses import dataclass
+from dataclasses import dataclass, field
 from typing import Generic, TypeVar
 __all__ = [
@@ -14,7 +14,7 @@ class PreparedTask(Generic[ResponseFormat]):
     This class encapsulates all the necessary parameters for executing a task,
     including the instructions to be sent to the model, the expected response
-    format using Pydantic models, and sampling parameters for controlling
+    format using Pydantic models, and API parameters for controlling
     the model's output behavior.
     Attributes:
@@ -22,12 +22,9 @@ class PreparedTask(Generic[ResponseFormat]):
             This should contain clear, specific directions for the task.
         response_format (type[ResponseFormat]): A Pydantic model class or str type that defines the expected
             structure of the response. Can be either a BaseModel subclass or str.
-        temperature (float): Controls randomness in the model's output.
-            Range: 0.0 to 1.0. Lower values make output more deterministic.
-            Defaults to 0.0.
-        top_p (float): Controls diversity via nucleus sampling. Only tokens
-            comprising the top_p probability mass are considered.
-            Range: 0.0 to 1.0. Defaults to 1.0.
+        api_kwargs (dict[str, int | float | str | bool]): Additional OpenAI API parameters
+            such as temperature, top_p, frequency_penalty, presence_penalty, seed, etc.
+            Defaults to an empty dict.
     Example:
         Creating a custom task:
@@ -43,8 +40,7 @@ class PreparedTask(Generic[ResponseFormat]):
         custom_task = PreparedTask(
             instructions="Translate the following text to French:",
             response_format=TranslationResponse,
-            temperature=0.1,
-            top_p=0.9
+            api_kwargs={"temperature": 0.1, "top_p": 0.9}
         )
         ```
@@ -55,8 +51,7 @@ class PreparedTask(Generic[ResponseFormat]):
     instructions: str
     response_format: type[ResponseFormat]
-    temperature: float = 0.0
-    top_p: float = 1.0
+    api_kwargs: dict[str, int | float | str | bool] = field(default_factory=dict)
 @dataclass(frozen=True)
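For callers, this hunk means `temperature` and `top_p` move from dedicated fields into a single `api_kwargs` dict, and a task built without that argument no longer carries the old defaults of `0.0` and `1.0`. The sketch below uses a hypothetical `MiniTask` stand-in with the same three fields. It also shows why `field(default_factory=dict)` is used: dataclasses reject a bare mutable default, and the factory gives each instance its own dict.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass(frozen=True)
class MiniTask:
    """Hypothetical stand-in mirroring the new PreparedTask fields."""

    instructions: str
    response_format: type
    # default_factory creates a fresh dict per instance.
    api_kwargs: dict[str, Any] = field(default_factory=dict)


# No sampling parameters are attached unless the caller supplies them.
default_task = MiniTask(instructions="Summarize the text.", response_format=str)

# Equivalent of the old temperature=0.1, top_p=0.9 arguments.
tuned_task = MiniTask(
    instructions="Translate the following text to French:",
    response_format=str,
    api_kwargs={"temperature": 0.1, "top_p": 0.9},
)
```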

{openaivec-0.14.10 → openaivec-0.14.13}/src/openaivec/_prompt.py RENAMED

@@ -445,8 +445,7 @@ class FewShotPromptBuilder:
         self,
         client: OpenAI | None = None,
         model_name: str | None = None,
-        temperature: float | None = None,
-        top_p: float | None = None,
+        **api_kwargs,
     ) -> "FewShotPromptBuilder":
         """Iteratively refine the prompt using an LLM.
@@ -460,8 +459,7 @@ class FewShotPromptBuilder:
         Args:
             client (OpenAI | None): Configured OpenAI client. If None, uses DI container with environment variables.
             model_name (str | None): Model identifier. If None, uses default ``gpt-4.1-mini``.
-            temperature (float | None): Sampling temperature. If None, uses model default.
-            top_p (float | None): Nucleus sampling parameter. If None, uses model default.
+            **api_kwargs: Additional OpenAI API parameters (temperature, top_p, etc.).
         Returns:
             FewShotPromptBuilder: The current builder instance containing the refined prompt and iteration history.
@@ -479,9 +477,8 @@ class FewShotPromptBuilder:
             model=_model_name,
             instructions=_PROMPT,
             input=Request(prompt=self._prompt).model_dump_json(),
-            temperature=temperature,
-            top_p=top_p,
             text_format=Response,
+            **api_kwargs,
         )
         # keep the original prompt

{openaivec-0.14.10 → openaivec-0.14.13}/src/openaivec/_provider.py RENAMED

@@ -130,35 +130,9 @@ def provide_async_openai_client() -> AsyncOpenAI:
     )
-CONTAINER.register(ResponsesModelName, lambda: ResponsesModelName("gpt-4.1-mini"))
-CONTAINER.register(EmbeddingsModelName, lambda: EmbeddingsModelName("text-embedding-3-small"))
-CONTAINER.register(OpenAIAPIKey, lambda: OpenAIAPIKey(os.getenv("OPENAI_API_KEY")))
-CONTAINER.register(AzureOpenAIAPIKey, lambda: AzureOpenAIAPIKey(os.getenv("AZURE_OPENAI_API_KEY")))
-CONTAINER.register(AzureOpenAIBaseURL, lambda: AzureOpenAIBaseURL(os.getenv("AZURE_OPENAI_BASE_URL")))
-CONTAINER.register(
-    cls=AzureOpenAIAPIVersion,
-    provider=lambda: AzureOpenAIAPIVersion(os.getenv("AZURE_OPENAI_API_VERSION", "preview")),
-)
-CONTAINER.register(OpenAI, provide_openai_client)
-CONTAINER.register(AsyncOpenAI, provide_async_openai_client)
-CONTAINER.register(tiktoken.Encoding, lambda: tiktoken.get_encoding("o200k_base"))
-CONTAINER.register(TextChunker, lambda: TextChunker(CONTAINER.resolve(tiktoken.Encoding)))
-CONTAINER.register(
-    SchemaInferer,
-    lambda: SchemaInferer(
-        client=CONTAINER.resolve(OpenAI),
-        model_name=CONTAINER.resolve(ResponsesModelName).value,
-    ),
-)
-def reset_environment_registrations():
-    """Reset environment variable related registrations in the container.
-    This function re-registers environment variable dependent services to pick up
-    current environment variable values. Useful for testing when environment
-    variables are changed after initial container setup.
-    """
+def set_default_registrations():
+    CONTAINER.register(ResponsesModelName, lambda: ResponsesModelName("gpt-4.1-mini"))
+    CONTAINER.register(EmbeddingsModelName, lambda: EmbeddingsModelName("text-embedding-3-small"))
     CONTAINER.register(OpenAIAPIKey, lambda: OpenAIAPIKey(os.getenv("OPENAI_API_KEY")))
     CONTAINER.register(AzureOpenAIAPIKey, lambda: AzureOpenAIAPIKey(os.getenv("AZURE_OPENAI_API_KEY")))
     CONTAINER.register(AzureOpenAIBaseURL, lambda: AzureOpenAIBaseURL(os.getenv("AZURE_OPENAI_BASE_URL")))
@@ -168,6 +142,8 @@ def reset_environment_registrations():
     )
     CONTAINER.register(OpenAI, provide_openai_client)
     CONTAINER.register(AsyncOpenAI, provide_async_openai_client)
+    CONTAINER.register(tiktoken.Encoding, lambda: tiktoken.get_encoding("o200k_base"))
+    CONTAINER.register(TextChunker, lambda: TextChunker(CONTAINER.resolve(tiktoken.Encoding)))
     CONTAINER.register(
         SchemaInferer,
         lambda: SchemaInferer(
@@ -175,3 +151,6 @@ def reset_environment_registrations():
             model_name=CONTAINER.resolve(ResponsesModelName).value,
         ),
     )
+set_default_registrations()
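Moving the module-level registrations into `set_default_registrations()` and calling it once at import time keeps the startup behaviour the same while making the defaults re-appliable later, for example after a caller has overridden a registration. The sketch below illustrates this with a hypothetical dict-based registry. The `register` and `resolve` helpers and the string keys are illustrative and do not reflect the real `CONTAINER` API.

```python
import os
from typing import Any, Callable

# Hypothetical registry: maps a name to a zero-argument provider.
_PROVIDERS: dict[str, Callable[[], Any]] = {}


def register(name: str, provider: Callable[[], Any]) -> None:
    _PROVIDERS[name] = provider


def resolve(name: str) -> Any:
    # Providers run at resolve time, so environment lookups stay lazy.
    return _PROVIDERS[name]()


def set_default_registrations() -> None:
    """Install (or re-install) the default providers."""
    register("responses_model_name", lambda: "gpt-4.1-mini")
    register("openai_api_key", lambda: os.getenv("OPENAI_API_KEY"))


# Applied once at import time, as in the diff above.
set_default_registrations()

# A caller overrides one default ...
register("responses_model_name", lambda: "my-gpt4-deployment")
overridden = resolve("responses_model_name")

# ... and later restores every default with a single call.
set_default_registrations()
restored = resolve("responses_model_name")
```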
