PyPI - inferencesh - Versions diffs - 0.3.1__tar.gz → 0.4.1__tar.gz - Mend

inferencesh 0.3.1tar.gz → 0.4.1tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Potentially problematic release.

This version of inferencesh might be problematic. Click here for more details.

Files changed (24) hide show

{inferencesh-0.3.1/src/inferencesh.egg-info → inferencesh-0.4.1}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: inferencesh
-Version: 0.3.1
+Version: 0.4.1
 Summary: inference.sh Python SDK
 Author: Inference Shell Inc.
 Author-email: "Inference Shell Inc." <hello@inference.sh>
@@ -25,19 +25,65 @@ Dynamic: author
 Dynamic: license-file
 Dynamic: requires-python
-# inference.sh CLI
+# inference.sh sdk
-Helper package for inference.sh Python applications.
+helper package for inference.sh python applications.
-## Installation
+## installation
 ```bash
 pip install infsh
 ```
-## File Handling
+## client usage
-The `File` class provides a standardized way to handle files in the inference.sh ecosystem:
+```python
+from infsh import Inference, TaskStatus
+# create client
+client = Inference(api_key="your-api-key")
+# simple usage - wait for result
+result = client.run({
+    "app": "your-app",
+    "input": {"key": "value"},
+    "variant": "default"
+})
+print(f"output: {result['output']}")
+# get task info without waiting
+task = client.run(params, wait=False)
+print(f"task id: {task['id']}")
+# stream updates (recommended)
+for update in client.run(params, stream=True):
+    status = update.get("status")
+    print(f"status: {TaskStatus(status).name}")
+    if status == TaskStatus.COMPLETED:
+        print(f"output: {update.get('output')}")
+        break
+    elif status == TaskStatus.FAILED:
+        print(f"error: {update.get('error')}")
+        break
+# async support
+async def run_async():
+    from infsh import AsyncInference
+    client = AsyncInference(api_key="your-api-key")
+    # simple usage
+    result = await client.run(params)
+    # stream updates
+    async for update in await client.run(params, stream=True):
+        print(f"status: {TaskStatus(update['status']).name}")
+```
+## file handling
+the `File` class provides a standardized way to handle files in the inference.sh ecosystem:
 ```python
 from infsh import File
@@ -68,15 +114,15 @@ print(file.filename)   # basename of the file
 file.refresh_metadata()
 ```
-The `File` class automatically handles:
-- MIME type detection
-- File size calculation
-- Filename extraction from path
-- File existence checking
+the `File` class automatically handles:
+- mime type detection
+- file size calculation
+- filename extraction from path
+- file existence checking
-## Creating an App
+## creating an app
-To create an inference app, inherit from `BaseApp` and define your input/output types:
+to create an inference app, inherit from `BaseApp` and define your input/output types:
 ```python
 from infsh import BaseApp, BaseAppInput, BaseAppOutput, File
@@ -103,7 +149,7 @@ class MyApp(BaseApp):
         pass
 ```
-The app lifecycle has three main methods:
-- `setup()`: Called when the app starts, use it to initialize models
-- `run()`: Called for each inference request
-- `unload()`: Called when shutting down, use it to free resources
+app lifecycle has three main methods:
+- `setup()`: called when the app starts, use it to initialize models
+- `run()`: called for each inference request
+- `unload()`: called when shutting down, use it to free resources

inferencesh-0.4.1/README.md ADDED Viewed

@@ -0,0 +1,128 @@
+# inference.sh sdk
+helper package for inference.sh python applications.
+## installation
+```bash
+pip install infsh
+```
+## client usage
+```python
+from infsh import Inference, TaskStatus
+# create client
+client = Inference(api_key="your-api-key")
+# simple usage - wait for result
+result = client.run({
+    "app": "your-app",
+    "input": {"key": "value"},
+    "variant": "default"
+})
+print(f"output: {result['output']}")
+# get task info without waiting
+task = client.run(params, wait=False)
+print(f"task id: {task['id']}")
+# stream updates (recommended)
+for update in client.run(params, stream=True):
+    status = update.get("status")
+    print(f"status: {TaskStatus(status).name}")
+    if status == TaskStatus.COMPLETED:
+        print(f"output: {update.get('output')}")
+        break
+    elif status == TaskStatus.FAILED:
+        print(f"error: {update.get('error')}")
+        break
+# async support
+async def run_async():
+    from infsh import AsyncInference
+    client = AsyncInference(api_key="your-api-key")
+    # simple usage
+    result = await client.run(params)
+    # stream updates
+    async for update in await client.run(params, stream=True):
+        print(f"status: {TaskStatus(update['status']).name}")
+```
+## file handling
+the `File` class provides a standardized way to handle files in the inference.sh ecosystem:
+```python
+from infsh import File
+# Basic file creation
+file = File(path="/path/to/file.png")
+# File with explicit metadata
+file = File(
+    path="/path/to/file.png",
+    content_type="image/png",
+    filename="custom_name.png",
+    size=1024  # in bytes
+)
+# Create from path (automatically populates metadata)
+file = File.from_path("/path/to/file.png")
+# Check if file exists
+exists = file.exists()
+# Access file metadata
+print(file.content_type)  # automatically detected if not specified
+print(file.size)       # file size in bytes
+print(file.filename)   # basename of the file
+# Refresh metadata (useful if file has changed)
+file.refresh_metadata()
+```
+the `File` class automatically handles:
+- mime type detection
+- file size calculation
+- filename extraction from path
+- file existence checking
+## creating an app
+to create an inference app, inherit from `BaseApp` and define your input/output types:
+```python
+from infsh import BaseApp, BaseAppInput, BaseAppOutput, File
+class AppInput(BaseAppInput):
+    image: str  # URL or file path to image
+    mask: str   # URL or file path to mask
+class AppOutput(BaseAppOutput):
+    image: File
+class MyApp(BaseApp):
+    async def setup(self):
+        # Initialize your model here
+        pass
+    async def run(self, app_input: AppInput) -> AppOutput:
+        # Process input and return output
+        result_path = "/tmp/result.png"
+        return AppOutput(image=File(path=result_path))
+    async def unload(self):
+        # Clean up resources
+        pass
+```
+app lifecycle has three main methods:
+- `setup()`: called when the app starts, use it to initialize models
+- `run()`: called for each inference request
+- `unload()`: called when shutting down, use it to free resources

{inferencesh-0.3.1 → inferencesh-0.4.1}/pyproject.toml RENAMED Viewed

@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "inferencesh"
-version = "0.3.1"
+version = "0.4.1"
 description = "inference.sh Python SDK"
 authors = [
     {name = "Inference Shell Inc.", email = "hello@inference.sh"},

{inferencesh-0.3.1 → inferencesh-0.4.1}/src/inferencesh/client.py RENAMED Viewed

@@ -1,6 +1,6 @@
 from __future__ import annotations
-from typing import Any, Dict, Optional, Callable, Generator, Union
+from typing import Any, Dict, Optional, Callable, Generator, Union, Iterator
 from dataclasses import dataclass
 from enum import IntEnum
 import json
@@ -8,6 +8,103 @@ import re
 import time
 import mimetypes
 import os
+from contextlib import AbstractContextManager
+from typing import Protocol, runtime_checkable
+class TaskStream(AbstractContextManager['TaskStream']):
+    """A context manager for streaming task updates.
+    This class provides a Pythonic interface for handling streaming updates from a task.
+    It can be used either as a context manager or as an iterator.
+    Example:
+        ```python
+        # As a context manager
+        with client.stream_task(task_id) as stream:
+            for update in stream:
+                print(f"Update: {update}")
+        # As an iterator
+        for update in client.stream_task(task_id):
+            print(f"Update: {update}")
+        ```
+    """
+    def __init__(
+        self,
+        task: Dict[str, Any],
+        client: Any,
+        auto_reconnect: bool = True,
+        max_reconnects: int = 5,
+        reconnect_delay_ms: int = 1000,
+    ):
+        self.task = task
+        self.client = client
+        self.task_id = task["id"]
+        self.auto_reconnect = auto_reconnect
+        self.max_reconnects = max_reconnects
+        self.reconnect_delay_ms = reconnect_delay_ms
+        self._final_task: Optional[Dict[str, Any]] = None
+        self._error: Optional[Exception] = None
+    def __enter__(self) -> 'TaskStream':
+        return self
+    def __exit__(self, exc_type, exc_val, exc_tb) -> None:
+        pass
+    def __iter__(self) -> Iterator[Dict[str, Any]]:
+        return self.stream()
+    @property
+    def result(self) -> Optional[Dict[str, Any]]:
+        """The final task result if completed, None otherwise."""
+        return self._final_task
+    @property
+    def error(self) -> Optional[Exception]:
+        """The error that occurred during streaming, if any."""
+        return self._error
+    def stream(self) -> Iterator[Dict[str, Any]]:
+        """Stream updates for this task.
+        Yields:
+            Dict[str, Any]: Task update events
+        Raises:
+            RuntimeError: If the task fails or is cancelled
+        """
+        try:
+            for update in self.client._stream_updates(
+                self.task_id,
+                self.task,
+            ):
+                if isinstance(update, Exception):
+                    self._error = update
+                    raise update
+                if update.get("status") == TaskStatus.COMPLETED:
+                    self._final_task = update
+                yield update
+        except Exception as exc:
+            self._error = exc
+            raise
+@runtime_checkable
+class TaskCallback(Protocol):
+    """Protocol for task streaming callbacks."""
+    def on_update(self, data: Dict[str, Any]) -> None:
+        """Called when a task update is received."""
+        ...
+    def on_error(self, error: Exception) -> None:
+        """Called when an error occurs during task execution."""
+        ...
+    def on_complete(self, task: Dict[str, Any]) -> None:
+        """Called when a task completes successfully."""
+        ...
 # Deliberately do lazy imports for requests/aiohttp to avoid hard dependency at import time
@@ -228,122 +325,116 @@ class Inference:
         return payload.get("data")
     # --------------- Public API ---------------
-    def run(self, params: Dict[str, Any]) -> Dict[str, Any]:
-        processed_input = self._process_input_data(params.get("input"))
-        task = self._request("post", "/run", data={**params, "input": processed_input})
-        return task
-    def run_sync(
+    def run(
         self,
         params: Dict[str, Any],
         *,
+        wait: bool = True,
+        stream: bool = False,
         auto_reconnect: bool = True,
         max_reconnects: int = 5,
         reconnect_delay_ms: int = 1000,
-    ) -> Dict[str, Any]:
+    ) -> Union[Dict[str, Any], TaskStream, Iterator[Dict[str, Any]]]:
+        """Run a task with optional streaming updates.
+        By default, this method waits for the task to complete and returns the final result.
+        You can set wait=False to get just the task info, or stream=True to get an iterator
+        of status updates.
+        Args:
+            params: Task parameters to pass to the API
+            wait: Whether to wait for task completion (default: True)
+            stream: Whether to return an iterator of updates (default: False)
+            auto_reconnect: Whether to automatically reconnect on connection loss
+            max_reconnects: Maximum number of reconnection attempts
+            reconnect_delay_ms: Delay between reconnection attempts in milliseconds
+        Returns:
+            Union[Dict[str, Any], TaskStream, Iterator[Dict[str, Any]]]:
+                - If wait=True and stream=False: The completed task data
+                - If wait=False: The created task info
+                - If stream=True: An iterator of task updates
+        Example:
+            ```python
+            # Simple usage - wait for result (default)
+            result = client.run(params)
+            print(f"Output: {result['output']}")
+            # Get task info without waiting
+            task = client.run(params, wait=False)
+            task_id = task["id"]
+            # Stream updates
+            for update in client.run(params, stream=True):
+                print(f"Status: {update.get('status')}")
+                if update.get('status') == TaskStatus.COMPLETED:
+                    print(f"Result: {update.get('output')}")
+            ```
+        """
+        # Create the task
         processed_input = self._process_input_data(params.get("input"))
         task = self._request("post", "/run", data={**params, "input": processed_input})
-        task_id = task["id"]
-        final_task: Optional[Dict[str, Any]] = None
-        def on_data(data: Dict[str, Any]) -> None:
-            nonlocal final_task
-            try:
-                result = _process_stream_event(
-                    data,
-                    task=task,
-                    stopper=lambda: manager.stop(),
-                )
-                if result is not None:
-                    final_task = result
-            except Exception as exc:
-                raise
-        def on_error(exc: Exception) -> None:
-            raise exc
-        def on_start() -> None:
-            pass
-        def on_stop() -> None:
-            pass
-        manager = StreamManager(
-            create_event_source=None,  # We'll set this after defining it
-            auto_reconnect=auto_reconnect,
-            max_reconnects=max_reconnects,
-            reconnect_delay_ms=reconnect_delay_ms,
-            on_data=on_data,
-            on_error=on_error,
-            on_start=on_start,
-            on_stop=on_stop,
-        )
-        def create_event_source() -> Generator[Dict[str, Any], None, None]:
-            url = f"/tasks/{task_id}/stream"
-            resp = self._request(
-                "get",
-                url,
-                headers={
-                    "Accept": "text/event-stream",
-                    "Cache-Control": "no-cache",
-                    "Accept-Encoding": "identity",
-                    "Connection": "keep-alive",
-                },
-                stream=True,
-                timeout=60,
+        # Return immediately if not waiting
+        if not wait and not stream:
+            return _strip_task(task)
+        # Return stream if requested
+        if stream:
+            task_stream = TaskStream(
+                task=task,
+                client=self,
+                auto_reconnect=auto_reconnect,
+                max_reconnects=max_reconnects,
+                reconnect_delay_ms=reconnect_delay_ms,
             )
+            return task_stream
-            try:
-                last_event_at = time.perf_counter()
-                for evt in self._iter_sse(resp, stream_manager=manager):
-                    yield evt
-            finally:
-                try:
-                    # Force close the underlying socket if possible
-                    try:
-                        raw = getattr(resp, 'raw', None)
-                        if raw is not None:
-                            raw.close()
-                    except Exception:
-                        raise
-                    # Close the response
-                    resp.close()
-                except Exception:
-                    raise
-        # Update the create_event_source function in the manager
-        manager._create_event_source = create_event_source
+        # Otherwise wait for completion
+        return self.wait_for_completion(task["id"])
-        # Connect and wait for completion
-        manager.connect()
-        # At this point, we should have a final task state
-        if final_task is not None:
-            return final_task
-        # Try to fetch the latest state as a fallback
-        try:
-            latest = self.get_task(task_id)
-            status = latest.get("status")
-            if status == TaskStatus.COMPLETED:
-                return latest
-            if status == TaskStatus.FAILED:
-                raise RuntimeError(latest.get("error") or "task failed")
-            if status == TaskStatus.CANCELLED:
-                raise RuntimeError("task cancelled")
-        except Exception as exc:
-            raise
-        raise RuntimeError("Stream ended without completion")
     def cancel(self, task_id: str) -> None:
         self._request("post", f"/tasks/{task_id}/cancel")
     def get_task(self, task_id: str) -> Dict[str, Any]:
+        """Get the current state of a task.
+        Args:
+            task_id: The ID of the task to get
+        Returns:
+            Dict[str, Any]: The current task state
+        """
         return self._request("get", f"/tasks/{task_id}")
+    def wait_for_completion(self, task_id: str) -> Dict[str, Any]:
+        """Wait for a task to complete and return its final state.
+        This method polls the task status until it reaches a terminal state
+        (completed, failed, or cancelled).
+        Args:
+            task_id: The ID of the task to wait for
+        Returns:
+            Dict[str, Any]: The final task state
+        Raises:
+            RuntimeError: If the task fails or is cancelled
+        """
+        with self.stream_task(task_id) as stream:
+            for update in stream:
+                if update.get("status") == TaskStatus.COMPLETED:
+                    return update
+                elif update.get("status") == TaskStatus.FAILED:
+                    raise RuntimeError(update.get("error") or "Task failed")
+                elif update.get("status") == TaskStatus.CANCELLED:
+                    raise RuntimeError("Task cancelled")
+        raise RuntimeError("Stream ended without completion")
     # --------------- File upload ---------------
     def upload_file(self, data: Union[str, bytes], options: Optional[UploadFileOptions] = None) -> Dict[str, Any]:
         options = options or UploadFileOptions()
@@ -403,6 +494,103 @@ class Inference:
         return file_obj
     # --------------- Helpers ---------------
+    def stream_task(
+        self,
+        task_id: str,
+        *,
+        auto_reconnect: bool = True,
+        max_reconnects: int = 5,
+        reconnect_delay_ms: int = 1000,
+    ) -> TaskStream:
+        """Create a TaskStream for getting streaming updates from a task.
+        This provides a more Pythonic interface for handling task updates compared to callbacks.
+        The returned TaskStream can be used either as a context manager or as an iterator.
+        Args:
+            task_id: The ID of the task to stream
+            auto_reconnect: Whether to automatically reconnect on connection loss
+            max_reconnects: Maximum number of reconnection attempts
+            reconnect_delay_ms: Delay between reconnection attempts in milliseconds
+        Returns:
+            TaskStream: A stream interface for the task
+        Example:
+            ```python
+            # Run a task
+            task = client.run(params)
+            # Stream updates using context manager
+            with client.stream_task(task["id"]) as stream:
+                for update in stream:
+                    print(f"Status: {update.get('status')}")
+                    if update.get("status") == TaskStatus.COMPLETED:
+                        print(f"Result: {update.get('output')}")
+            # Or use as a simple iterator
+            for update in client.stream_task(task["id"]):
+                print(f"Update: {update}")
+            ```
+        """
+        task = self.get_task(task_id)
+        return TaskStream(
+            task=task,
+            client=self,
+            auto_reconnect=auto_reconnect,
+            max_reconnects=max_reconnects,
+            reconnect_delay_ms=reconnect_delay_ms,
+        )
+    def _stream_updates(
+        self,
+        task_id: str,
+        task: Dict[str, Any],
+    ) -> Generator[Union[Dict[str, Any], Exception], None, None]:
+        """Internal method to stream task updates."""
+        url = f"/tasks/{task_id}/stream"
+        resp = self._request(
+            "get",
+            url,
+            headers={
+                "Accept": "text/event-stream",
+                "Cache-Control": "no-cache",
+                "Accept-Encoding": "identity",
+                "Connection": "keep-alive",
+            },
+            stream=True,
+            timeout=60,
+        )
+        try:
+            for evt in self._iter_sse(resp):
+                try:
+                    # Process the event to check for completion/errors
+                    result = _process_stream_event(
+                        evt,
+                        task=task,
+                        stopper=None,  # We'll handle stopping via the iterator
+                    )
+                    if result is not None:
+                        yield result
+                        break
+                    yield _strip_task(evt)
+                except Exception as exc:
+                    yield exc
+                    raise
+        finally:
+            try:
+                # Force close the underlying socket if possible
+                try:
+                    raw = getattr(resp, 'raw', None)
+                    if raw is not None:
+                        raw.close()
+                except Exception:
+                    raise
+                # Close the response
+                resp.close()
+            except Exception:
+                raise
     def _iter_sse(self, resp: Any, stream_manager: Optional[Any] = None) -> Generator[Dict[str, Any], None, None]:
         """Iterate JSON events from an SSE response."""
         # Mode 1: raw socket readline (can reduce buffering in some environments)
@@ -565,88 +753,114 @@ class AsyncInference:
                 return payload.get("data")
     # --------------- Public API ---------------
-    async def run(self, params: Dict[str, Any]) -> Dict[str, Any]:
-        processed_input = await self._process_input_data(params.get("input"))
-        task = await self._request("post", "/run", data={**params, "input": processed_input})
-        return task
-    async def run_sync(
+    async def run(
         self,
         params: Dict[str, Any],
         *,
+        wait: bool = True,
+        stream: bool = False,
         auto_reconnect: bool = True,
         max_reconnects: int = 5,
         reconnect_delay_ms: int = 1000,
-    ) -> Dict[str, Any]:
+    ) -> Union[Dict[str, Any], TaskStream, Iterator[Dict[str, Any]]]:
+        """Run a task with optional streaming updates.
+        By default, this method waits for the task to complete and returns the final result.
+        You can set wait=False to get just the task info, or stream=True to get an iterator
+        of status updates.
+        Args:
+            params: Task parameters to pass to the API
+            wait: Whether to wait for task completion (default: True)
+            stream: Whether to return an iterator of updates (default: False)
+            auto_reconnect: Whether to automatically reconnect on connection loss
+            max_reconnects: Maximum number of reconnection attempts
+            reconnect_delay_ms: Delay between reconnection attempts in milliseconds
+        Returns:
+            Union[Dict[str, Any], TaskStream, Iterator[Dict[str, Any]]]:
+                - If wait=True and stream=False: The completed task data
+                - If wait=False: The created task info
+                - If stream=True: An iterator of task updates
+        Example:
+            ```python
+            # Simple usage - wait for result (default)
+            result = await client.run(params)
+            print(f"Output: {result['output']}")
+            # Get task info without waiting
+            task = await client.run(params, wait=False)
+            task_id = task["id"]
+            # Stream updates
+            async for update in await client.run(params, stream=True):
+                print(f"Status: {update.get('status')}")
+                if update.get('status') == TaskStatus.COMPLETED:
+                    print(f"Result: {update.get('output')}")
+            ```
+        """
+        # Create the task
         processed_input = await self._process_input_data(params.get("input"))
         task = await self._request("post", "/run", data={**params, "input": processed_input})
-        task_id = task["id"]
-        final_task: Optional[Dict[str, Any]] = None
-        reconnect_attempts = 0
-        had_success = False
-        while True:
-            try:
-                resp = await self._request(
-                    "get",
-                    f"/tasks/{task_id}/stream",
-                    headers={
-                        "Accept": "text/event-stream",
-                        "Cache-Control": "no-cache",
-                        "Accept-Encoding": "identity",
-                        "Connection": "keep-alive",
-                    },
-                    timeout=60,
-                    expect_stream=True,
-                )
-                had_success = True
-                async for data in self._aiter_sse(resp):
-                    result = _process_stream_event(
-                        data,
-                        task=task,
-                        stopper=None,
-                    )
-                    if result is not None:
-                        final_task = result
-                        break
-                if final_task is not None:
-                    break
-            except Exception as exc:  # noqa: BLE001
-                if not auto_reconnect:
-                    raise
-                if not had_success:
-                    reconnect_attempts += 1
-                    if reconnect_attempts > max_reconnects:
-                        raise
-                await _async_sleep(reconnect_delay_ms / 1000.0)
-            else:
-                if not auto_reconnect:
-                    break
-                await _async_sleep(reconnect_delay_ms / 1000.0)
-        if final_task is None:
-            # Fallback: fetch latest task state in case stream ended without a terminal event
-            try:
-                latest = await self.get_task(task_id)
-                status = latest.get("status")
-                if status == TaskStatus.COMPLETED:
-                    return latest
-                if status == TaskStatus.FAILED:
-                    raise RuntimeError(latest.get("error") or "task failed")
-                if status == TaskStatus.CANCELLED:
-                    raise RuntimeError("task cancelled")
-            except Exception:
-                raise
-            raise RuntimeError("Stream ended without completion")
-        return final_task
+        # Return immediately if not waiting
+        if not wait and not stream:
+            return task
+        # Return stream if requested
+        if stream:
+            task_stream = TaskStream(
+                task=task,
+                client=self,
+                auto_reconnect=auto_reconnect,
+                max_reconnects=max_reconnects,
+                reconnect_delay_ms=reconnect_delay_ms,
+            )
+            return task_stream
+        # Otherwise wait for completion
+        return await self.wait_for_completion(task["id"])
     async def cancel(self, task_id: str) -> None:
         await self._request("post", f"/tasks/{task_id}/cancel")
     async def get_task(self, task_id: str) -> Dict[str, Any]:
+        """Get the current state of a task.
+        Args:
+            task_id: The ID of the task to get
+        Returns:
+            Dict[str, Any]: The current task state
+        """
         return await self._request("get", f"/tasks/{task_id}")
+    async def wait_for_completion(self, task_id: str) -> Dict[str, Any]:
+        """Wait for a task to complete and return its final state.
+        This method polls the task status until it reaches a terminal state
+        (completed, failed, or cancelled).
+        Args:
+            task_id: The ID of the task to wait for
+        Returns:
+            Dict[str, Any]: The final task state
+        Raises:
+            RuntimeError: If the task fails or is cancelled
+        """
+        with self.stream_task(task_id) as stream:
+            async for update in stream:
+                if update.get("status") == TaskStatus.COMPLETED:
+                    return update
+                elif update.get("status") == TaskStatus.FAILED:
+                    raise RuntimeError(update.get("error") or "Task failed")
+                elif update.get("status") == TaskStatus.CANCELLED:
+                    raise RuntimeError("Task cancelled")
+        raise RuntimeError("Stream ended without completion")
     # --------------- File upload ---------------
     async def upload_file(self, data: Union[str, bytes], options: Optional[UploadFileOptions] = None) -> Dict[str, Any]:
         options = options or UploadFileOptions()
@@ -797,6 +1011,18 @@ def _looks_like_base64(value: str) -> bool:
         return False
+def _strip_task(task: Dict[str, Any]) -> Dict[str, Any]:
+    """Strip task to essential fields."""
+    return {
+        "id": task.get("id"),
+        "created_at": task.get("created_at"),
+        "updated_at": task.get("updated_at"),
+        "input": task.get("input"),
+        "output": task.get("output"),
+        "logs": task.get("logs"),
+        "status": task.get("status"),
+    }
 def _process_stream_event(
     data: Dict[str, Any], *, task: Dict[str, Any], stopper: Optional[Callable[[], None]] = None
 ) -> Optional[Dict[str, Any]]:
@@ -804,16 +1030,9 @@ def _process_stream_event(
     If stopper is provided, it will be called on terminal events to end streaming.
     """
     status = data.get("status")
-    output = data.get("output")
-    logs = data.get("logs")
     if status == TaskStatus.COMPLETED:
-        result = {
-            **task,
-            "status": data.get("status"),
-            "output": data.get("output"),
-            "logs": data.get("logs") or [],
-        }
+        result = _strip_task(data)
         if stopper:
             stopper()
         return result

{inferencesh-0.3.1 → inferencesh-0.4.1}/src/inferencesh/models/file.py RENAMED Viewed

@@ -5,11 +5,45 @@ import os
 import urllib.request
 import urllib.parse
 import tempfile
+import hashlib
+from pathlib import Path
 from tqdm import tqdm
 class File(BaseModel):
     """A class representing a file in the inference.sh ecosystem."""
+    @classmethod
+    def get_cache_dir(cls) -> Path:
+        """Get the cache directory path based on environment variables or default location."""
+        if cache_dir := os.environ.get("FILE_CACHE_DIR"):
+            path = Path(cache_dir)
+        else:
+            path = Path.home() / ".cache" / "inferencesh" / "files"
+        path.mkdir(parents=True, exist_ok=True)
+        return path
+    def _get_cache_path(self, url: str) -> Path:
+        """Get the cache path for a URL using a hash-based directory structure."""
+        # Parse URL components
+        parsed_url = urllib.parse.urlparse(url)
+        # Create hash from URL path and query parameters for uniqueness
+        url_components = parsed_url.netloc + parsed_url.path
+        if parsed_url.query:
+            url_components += '?' + parsed_url.query
+        url_hash = hashlib.sha256(url_components.encode()).hexdigest()[:12]
+        # Get filename from URL or use default
+        filename = os.path.basename(parsed_url.path)
+        if not filename:
+            filename = 'download'
+        # Create hash directory in cache
+        cache_dir = self.get_cache_dir() / url_hash
+        cache_dir.mkdir(exist_ok=True)
+        return cache_dir / filename
     uri: Optional[str] = Field(default=None)  # Original location (URL or file path)
     path: Optional[str] = None  # Resolved local file path
     content_type: Optional[str] = None  # MIME type of the file
@@ -74,11 +108,20 @@ class File(BaseModel):
         return parsed.scheme in ('http', 'https')
     def _download_url(self) -> None:
-        """Download the URL to a temporary file and update the path."""
+        """Download the URL to the cache directory and update the path."""
         original_url = self.uri
+        cache_path = self._get_cache_path(original_url)
+        # If file exists in cache, use it
+        if cache_path.exists():
+            print(f"Using cached file: {cache_path}")
+            self.path = str(cache_path)
+            return
+        print(f"Downloading URL: {original_url} to {cache_path}")
         tmp_file = None
         try:
-            # Create a temporary file with a suffix based on the URL path
+            # Download to temporary file first to avoid partial downloads in cache
             suffix = os.path.splitext(urllib.parse.urlparse(original_url).path)[1]
             tmp_file = tempfile.NamedTemporaryFile(delete=False, suffix=suffix)
             self._tmp_path = tmp_file.name
@@ -133,7 +176,10 @@ class File(BaseModel):
                                     # If we read the whole body at once, exit loop
                                     break
-                self.path = self._tmp_path
+                # Move the temporary file to the cache location
+                os.replace(self._tmp_path, cache_path)
+                self._tmp_path = None  # Prevent deletion in __del__
+                self.path = str(cache_path)
             except (urllib.error.URLError, urllib.error.HTTPError) as e:
                 raise RuntimeError(f"Failed to download URL {original_url}: {str(e)}")
             except IOError as e:

{inferencesh-0.3.1 → inferencesh-0.4.1}/src/inferencesh/utils/download.py RENAMED Viewed

@@ -24,16 +24,24 @@ def download(url: str, directory: Union[str, Path, StorageDir]) -> str:
     dir_path = Path(directory)
     dir_path.mkdir(exist_ok=True)
-    # Create hash directory from URL
-    url_hash = hashlib.sha256(url.encode()).hexdigest()[:12]
-    hash_dir = dir_path / url_hash
-    hash_dir.mkdir(exist_ok=True)
+    # Parse URL components
+    parsed_url = urllib.parse.urlparse(url)
-    # Keep original filename
-    filename = os.path.basename(urllib.parse.urlparse(url).path)
+    # Create hash from URL path and query parameters for uniqueness
+    url_components = parsed_url.netloc + parsed_url.path
+    if parsed_url.query:
+        url_components += '?' + parsed_url.query
+    url_hash = hashlib.sha256(url_components.encode()).hexdigest()[:12]
+    # Keep original filename or use a default
+    filename = os.path.basename(parsed_url.path)
     if not filename:
         filename = 'download'
+    # Create hash directory and store file
+    hash_dir = dir_path / url_hash
+    hash_dir.mkdir(exist_ok=True)
     output_path = hash_dir / filename
     # If file exists in directory and it's not a temp directory, return it

{inferencesh-0.3.1 → inferencesh-0.4.1/src/inferencesh.egg-info}/PKG-INFO RENAMED Viewed

@@ -1,6 +1,6 @@
 Metadata-Version: 2.4
 Name: inferencesh
-Version: 0.3.1
+Version: 0.4.1
 Summary: inference.sh Python SDK
 Author: Inference Shell Inc.
 Author-email: "Inference Shell Inc." <hello@inference.sh>
@@ -25,19 +25,65 @@ Dynamic: author
 Dynamic: license-file
 Dynamic: requires-python
-# inference.sh CLI
+# inference.sh sdk
-Helper package for inference.sh Python applications.
+helper package for inference.sh python applications.
-## Installation
+## installation
 ```bash
 pip install infsh
 ```
-## File Handling
+## client usage
-The `File` class provides a standardized way to handle files in the inference.sh ecosystem:
+```python
+from infsh import Inference, TaskStatus
+# create client
+client = Inference(api_key="your-api-key")
+# simple usage - wait for result
+result = client.run({
+    "app": "your-app",
+    "input": {"key": "value"},
+    "variant": "default"
+})
+print(f"output: {result['output']}")
+# get task info without waiting
+task = client.run(params, wait=False)
+print(f"task id: {task['id']}")
+# stream updates (recommended)
+for update in client.run(params, stream=True):
+    status = update.get("status")
+    print(f"status: {TaskStatus(status).name}")
+    if status == TaskStatus.COMPLETED:
+        print(f"output: {update.get('output')}")
+        break
+    elif status == TaskStatus.FAILED:
+        print(f"error: {update.get('error')}")
+        break
+# async support
+async def run_async():
+    from infsh import AsyncInference
+    client = AsyncInference(api_key="your-api-key")
+    # simple usage
+    result = await client.run(params)
+    # stream updates
+    async for update in await client.run(params, stream=True):
+        print(f"status: {TaskStatus(update['status']).name}")
+```
+## file handling
+the `File` class provides a standardized way to handle files in the inference.sh ecosystem:
 ```python
 from infsh import File
@@ -68,15 +114,15 @@ print(file.filename)   # basename of the file
 file.refresh_metadata()
 ```
-The `File` class automatically handles:
-- MIME type detection
-- File size calculation
-- Filename extraction from path
-- File existence checking
+the `File` class automatically handles:
+- mime type detection
+- file size calculation
+- filename extraction from path
+- file existence checking
-## Creating an App
+## creating an app
-To create an inference app, inherit from `BaseApp` and define your input/output types:
+to create an inference app, inherit from `BaseApp` and define your input/output types:
 ```python
 from infsh import BaseApp, BaseAppInput, BaseAppOutput, File
@@ -103,7 +149,7 @@ class MyApp(BaseApp):
         pass
 ```
-The app lifecycle has three main methods:
-- `setup()`: Called when the app starts, use it to initialize models
-- `run()`: Called for each inference request
-- `unload()`: Called when shutting down, use it to free resources
+app lifecycle has three main methods:
+- `setup()`: called when the app starts, use it to initialize models
+- `run()`: called for each inference request
+- `unload()`: called when shutting down, use it to free resources

inferencesh-0.3.1/README.md DELETED Viewed

@@ -1,82 +0,0 @@
-# inference.sh CLI
-Helper package for inference.sh Python applications.
-## Installation
-```bash
-pip install infsh
-```
-## File Handling
-The `File` class provides a standardized way to handle files in the inference.sh ecosystem:
-```python
-from infsh import File
-# Basic file creation
-file = File(path="/path/to/file.png")
-# File with explicit metadata
-file = File(
-    path="/path/to/file.png",
-    content_type="image/png",
-    filename="custom_name.png",
-    size=1024  # in bytes
-)
-# Create from path (automatically populates metadata)
-file = File.from_path("/path/to/file.png")
-# Check if file exists
-exists = file.exists()
-# Access file metadata
-print(file.content_type)  # automatically detected if not specified
-print(file.size)       # file size in bytes
-print(file.filename)   # basename of the file
-# Refresh metadata (useful if file has changed)
-file.refresh_metadata()
-```
-The `File` class automatically handles:
-- MIME type detection
-- File size calculation
-- Filename extraction from path
-- File existence checking
-## Creating an App
-To create an inference app, inherit from `BaseApp` and define your input/output types:
-```python
-from infsh import BaseApp, BaseAppInput, BaseAppOutput, File
-class AppInput(BaseAppInput):
-    image: str  # URL or file path to image
-    mask: str   # URL or file path to mask
-class AppOutput(BaseAppOutput):
-    image: File
-class MyApp(BaseApp):
-    async def setup(self):
-        # Initialize your model here
-        pass
-    async def run(self, app_input: AppInput) -> AppOutput:
-        # Process input and return output
-        result_path = "/tmp/result.png"
-        return AppOutput(image=File(path=result_path))
-    async def unload(self):
-        # Clean up resources
-        pass
-```
-The app lifecycle has three main methods:
-- `setup()`: Called when the app starts, use it to initialize models
-- `run()`: Called for each inference request
-- `unload()`: Called when shutting down, use it to free resources