PyPI - dwani - Versions diffs - 0.1.4__tar.gz → 0.1.6__tar.gz - Mend

dwani 0.1.4tar.gz → 0.1.6tar.gz

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (24) hide show

dwani-0.1.6/PKG-INFO +129 -0
dwani-0.1.6/README.md +93 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani/__init__.py +27 -1
dwani-0.1.6/dwani/client.py +54 -0
dwani-0.1.6/dwani/docs.py +149 -0
dwani-0.1.6/dwani/translate.py +26 -0
dwani-0.1.6/dwani.egg-info/PKG-INFO +129 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani.egg-info/SOURCES.txt +1 -0
{dwani-0.1.4 → dwani-0.1.6}/pyproject.toml +2 -2
dwani-0.1.4/PKG-INFO +0 -70
dwani-0.1.4/README.md +0 -34
dwani-0.1.4/dwani/client.py +0 -41
dwani-0.1.4/dwani/docs.py +0 -70
dwani-0.1.4/dwani.egg-info/PKG-INFO +0 -70
{dwani-0.1.4 → dwani-0.1.6}/LICENSE +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani/asr.py +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani/audio.py +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani/chat.py +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani/exceptions.py +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani/vision.py +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani.egg-info/dependency_links.txt +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani.egg-info/requires.txt +0 -0
{dwani-0.1.4 → dwani-0.1.6}/dwani.egg-info/top_level.txt +0 -0
{dwani-0.1.4 → dwani-0.1.6}/setup.cfg +0 -0

dwani-0.1.6/PKG-INFO ADDED Viewed

@@ -0,0 +1,129 @@
+Metadata-Version: 2.4
+Name: dwani
+Version: 0.1.6
+Summary: Multimodal API for Indian languages (Chat, Vision, TTS, ASR, Translate, Docs)
+Author-email: sachin <python@dwani.ai>
+License: MIT License
+        Copyright (c) 2025 Sachin Shetty
+        Permission is hereby granted, free of charge, to any person obtaining a copy
+        of this software and associated documentation files (the "Software"), to deal
+        in the Software without restriction, including without limitation the rights
+        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+        copies of the Software, and to permit persons to whom the Software is
+        furnished to do so, subject to the following conditions:
+        The above copyright notice and this permission notice shall be included in all
+        copies or substantial portions of the Software.
+        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+        SOFTWARE.
+Project-URL: Homepage, https://github.com/dwani-ai/dwani-python
+Project-URL: Source, https://github.com/dwani-ai/dwani-python
+Project-URL: Issues, https://github.com/dwani-ai/dwani-python/issues
+Requires-Python: >=3.8
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: requests>=2.25.0
+Dynamic: license-file
+# dwani.ai - python library
+### Install the library
+```bash
+pip install dwani
+```
+### Setup the credentials
+```python
+import dwani
+import os
+dwani.api_key = os.getenv("DWANI_API_KEY")
+dwani.api_base = os.getenv("DWANI_API_BASE_URL")
+```
+### Examples
+#### Text Query
+```python
+resp = dwani.Chat.create(prompt="Hello!", src_lang="eng_Latn", tgt_lang="kan_Knda")
+print(resp)
+```
+```json
+{'response': 'ನಮಸ್ತೆ! ಭಾರತ ಮತ್ತು ಕರ್ನಾಟಕವನ್ನು ಗಮನದಲ್ಲಿಟ್ಟುಕೊಂಡು ಇಂದು ನಿಮ್ಮ ಪ್ರಶ್ನೆಗಳಿಗೆ ನಾನು ನಿಮಗೆ ಹೇಗೆ ಸಹಾಯ ಮಾಡಲಿ?'}
+```
+#### Vision Query
+```python
+result = dwani.Vision.caption(
+    file_path="image.png",
+    query="Describe this logo",
+    src_lang="eng_Latn",
+    tgt_lang="kan_Knda"
+)
+print(result)
+```
+```json
+{'answer': 'ಒಂದು ವಾಕ್ಯದಲ್ಲಿ ಚಿತ್ರದ ಸಾರಾಂಶವನ್ನು ಇಲ್ಲಿ ನೀಡಲಾಗಿದೆಃ ಪ್ರಕಟಣೆಯ ಅವಲೋಕನವು ಪ್ರಸ್ತುತ ಅರವತ್ತನಾಲ್ಕು ದೇಶಗಳು/ಪ್ರದೇಶಗಳನ್ನು ಸೇರಿಸಲಾಗಿದೆ ಮತ್ತು ಇನ್ನೂ ಹದಿನಾರು ಪ್ರದೇಶಗಳನ್ನು ಸೇರಿಸಬೇಕಾಗಿದೆ. ಒದಗಿಸಲಾದ ಚಿತ್ರದಲ್ಲಿ ಲಾಂಛನವು ಕಾಣಿಸುವುದಿಲ್ಲ.'}
+```
+#### Speech to Text -  Automatic Speech Recognition (ASR)
+```python
+result = dwani.ASR.transcribe(file_path="kannada_sample.wav", language="kannada")
+print(result)
+```
+```json
+{'text': 'ಕರ್ನಾಟಕ ದ ರಾಜಧಾನಿ ಯಾವುದು'}
+```
+### Translate
+```python
+resp = dwani.Translate.run_translate(sentences=["hi"], src_lang="eng_Latn", tgt_lang="kan_Knda")
+print(resp)
+```
+```json
+{'translations': ['ಹಾಯ್']}
+```
+#### Text to Speech -  Speech Synthesis
+```python
+response = dwani.Audio.speech(input="ಕರ್ನಾಟಕ ದ ರಾಜಧಾನಿ ಯಾವುದು", response_format="mp3")
+with open("output.mp3", "wb") as f:
+    f.write(response)
+```
+#### Document - Extract Text
+```python
+result = dwani.Documents.run_extract(file_path = "dwani-workshop.pdf", page_number=1, src_lang="eng_Latn",tgt_lang="kan_Knda" )
+print(result)
+```
+```json
+{'pages': [{'processed_page': 1, 'page_content': ' a plain text representation of the document', 'translated_content': 'ಡಾಕ್ಯುಮೆಂಟ್ನ ಸರಳ ಪಠ್ಯ ಪ್ರಾತಿನಿಧ್ಯವನ್ನು ಇಲ್ಲಿ ನೀಡಲಾಗಿದೆ, ಅದನ್ನು ಸ್ವಾಭಾವಿಕವಾಗಿ ಓದುವಂತೆಃ'}]}
+```
+- Website -> [dwani.ai](https://dwani.ai)
+<!--
+## local development
+pip install -e .
+pip install twine build
+rm -rf dist/
+python -m build
+python -m twine upload dist/*
+-->

dwani-0.1.6/README.md ADDED Viewed

@@ -0,0 +1,93 @@
+# dwani.ai - python library
+### Install the library
+```bash
+pip install dwani
+```
+### Setup the credentials
+```python
+import dwani
+import os
+dwani.api_key = os.getenv("DWANI_API_KEY")
+dwani.api_base = os.getenv("DWANI_API_BASE_URL")
+```
+### Examples
+#### Text Query
+```python
+resp = dwani.Chat.create(prompt="Hello!", src_lang="eng_Latn", tgt_lang="kan_Knda")
+print(resp)
+```
+```json
+{'response': 'ನಮಸ್ತೆ! ಭಾರತ ಮತ್ತು ಕರ್ನಾಟಕವನ್ನು ಗಮನದಲ್ಲಿಟ್ಟುಕೊಂಡು ಇಂದು ನಿಮ್ಮ ಪ್ರಶ್ನೆಗಳಿಗೆ ನಾನು ನಿಮಗೆ ಹೇಗೆ ಸಹಾಯ ಮಾಡಲಿ?'}
+```
+#### Vision Query
+```python
+result = dwani.Vision.caption(
+    file_path="image.png",
+    query="Describe this logo",
+    src_lang="eng_Latn",
+    tgt_lang="kan_Knda"
+)
+print(result)
+```
+```json
+{'answer': 'ಒಂದು ವಾಕ್ಯದಲ್ಲಿ ಚಿತ್ರದ ಸಾರಾಂಶವನ್ನು ಇಲ್ಲಿ ನೀಡಲಾಗಿದೆಃ ಪ್ರಕಟಣೆಯ ಅವಲೋಕನವು ಪ್ರಸ್ತುತ ಅರವತ್ತನಾಲ್ಕು ದೇಶಗಳು/ಪ್ರದೇಶಗಳನ್ನು ಸೇರಿಸಲಾಗಿದೆ ಮತ್ತು ಇನ್ನೂ ಹದಿನಾರು ಪ್ರದೇಶಗಳನ್ನು ಸೇರಿಸಬೇಕಾಗಿದೆ. ಒದಗಿಸಲಾದ ಚಿತ್ರದಲ್ಲಿ ಲಾಂಛನವು ಕಾಣಿಸುವುದಿಲ್ಲ.'}
+```
+#### Speech to Text -  Automatic Speech Recognition (ASR)
+```python
+result = dwani.ASR.transcribe(file_path="kannada_sample.wav", language="kannada")
+print(result)
+```
+```json
+{'text': 'ಕರ್ನಾಟಕ ದ ರಾಜಧಾನಿ ಯಾವುದು'}
+```
+### Translate
+```python
+resp = dwani.Translate.run_translate(sentences=["hi"], src_lang="eng_Latn", tgt_lang="kan_Knda")
+print(resp)
+```
+```json
+{'translations': ['ಹಾಯ್']}
+```
+#### Text to Speech -  Speech Synthesis
+```python
+response = dwani.Audio.speech(input="ಕರ್ನಾಟಕ ದ ರಾಜಧಾನಿ ಯಾವುದು", response_format="mp3")
+with open("output.mp3", "wb") as f:
+    f.write(response)
+```
+#### Document - Extract Text
+```python
+result = dwani.Documents.run_extract(file_path = "dwani-workshop.pdf", page_number=1, src_lang="eng_Latn",tgt_lang="kan_Knda" )
+print(result)
+```
+```json
+{'pages': [{'processed_page': 1, 'page_content': ' a plain text representation of the document', 'translated_content': 'ಡಾಕ್ಯುಮೆಂಟ್ನ ಸರಳ ಪಠ್ಯ ಪ್ರಾತಿನಿಧ್ಯವನ್ನು ಇಲ್ಲಿ ನೀಡಲಾಗಿದೆ, ಅದನ್ನು ಸ್ವಾಭಾವಿಕವಾಗಿ ಓದುವಂತೆಃ'}]}
+```
+- Website -> [dwani.ai](https://dwani.ai)
+<!--
+## local development
+pip install -e .
+pip install twine build
+rm -rf dist/
+python -m build
+python -m twine upload dist/*
+-->

{dwani-0.1.4 → dwani-0.1.6}/dwani/__init__.py RENAMED Viewed

@@ -3,9 +3,11 @@ from .chat import Chat
 from .audio import Audio
 from .vision import Vision
 from .asr import ASR
+from .translate import Translate
 from .exceptions import DhwaniAPIError
+from .docs import Documents
-__all__ = ["DhwaniClient", "Chat", "Audio", "Vision", "ASR", "DhwaniAPIError"]
+__all__ = ["DhwaniClient", "Chat", "Audio", "Vision", "ASR", "DhwaniAPIError", "Translate", "Documents"]
 # Optionally, instantiate a default client for convenience
 api_key = None
@@ -37,3 +39,27 @@ class asr:
     @staticmethod
     def transcribe(*args, **kwargs):
         return _get_client().transcribe(*args, **kwargs)
+class translate:
+    @staticmethod
+    def run_translate(*args, **kwargs):
+        return _get_client().translate(*args, **kwargs)
+class document:
+    @staticmethod
+    def run_ocr(*args, **kwargs):
+        return _get_client().ocr(*args, **kwargs)
+    @staticmethod
+    def run_summarize(*args, **kwargs):
+        return _get_client().summarize(*args, **kwargs)
+    @staticmethod
+    def run_extract(*args, **kwargs):
+        return _get_client().extract(*args, **kwargs)
+    @staticmethod
+    def run_doc_query(*args, **kwargs):
+        return _get_client().doc_query(*args, **kwargs)
+    @staticmethod
+    def run_doc_query_kannada(*args, **kwargs):
+        return _get_client().doc_query_kannada(*args, **kwargs)

dwani-0.1.6/dwani/client.py ADDED Viewed

@@ -0,0 +1,54 @@
+import os
+import requests
+from .exceptions import DhwaniAPIError
+class DhwaniClient:
+    def __init__(self, api_key=None, api_base=None):
+        self.api_key = api_key or os.getenv("DWANI_API_KEY")
+        self.api_base = api_base or os.getenv("DWANI_API_BASE_URL", "http://localhost:8000")
+        if not self.api_key:
+            raise ValueError("DHWANI_API_KEY not set")
+    def _headers(self):
+        return {"X-API-Key": self.api_key}
+    def translate(self, sentences, src_lang, tgt_lang, **kwargs):
+        from .translate import run_translate
+        return run_translate(self, sentences=sentences, src_lang=src_lang, tgt_lang=tgt_lang, **kwargs)
+    def chat(self, prompt, src_lang, tgt_lang, **kwargs):
+        from .chat import chat_create
+        return chat_create(self, prompt=prompt, src_lang=src_lang, tgt_lang=tgt_lang, **kwargs)
+    def speech(self, input, response_format="mp3", **kwargs):
+        from .audio import audio_speech
+        return audio_speech(self, input=input, response_format=response_format, **kwargs)
+    def caption(self, file_path, query="describe the image", src_lang="eng_Latn", tgt_lang="kan_Knda", **kwargs):
+        from .vision import vision_caption
+        return vision_caption(self, file_path=file_path, query=query, src_lang=src_lang, tgt_lang=tgt_lang, **kwargs)
+    def transcribe(self, file_path, language=None, **kwargs):
+        from .asr import asr_transcribe
+        return asr_transcribe(self, file_path=file_path, language=language, **kwargs)
+    def document_ocr(self, file_path, language=None, **kwargs):
+        from .docs import document_ocr
+        return document_ocr(self, file_path=file_path, language=language, **kwargs)
+    def document_summarize(self, file_path, page_number=1, src_lang="eng_Latn", tgt_lang="kan_Knda", **kwargs):
+        from .docs import document_summarize
+        return document_summarize(self, file_path, page_number, src_lang, tgt_lang, **kwargs)
+    def extract(self, file_path, page_number=1, src_lang="eng_Latn", tgt_lang="kan_Knda", **kwargs):
+        from .docs import extract
+        return extract(self, file_path=file_path, page_number=page_number, src_lang=src_lang,tgt_lang=tgt_lang, **kwargs)
+    def doc_query( self, file_path, page_number=1, prompt="list the key points", src_lang="eng_Latn", tgt_lang="kan_Knda" , **kwargs ):
+        from .docs import doc_query
+        return doc_query( self, file_path, page_number=page_number, prompt=prompt, src_lang=src_lang, tgt_lang=tgt_lang , **kwargs )
+    def doc_query_kannada(self, file_path, page_number=1, prompt="list key points", src_lang="eng_Latn", language=None, **kwargs):
+        from .docs import doc_query_kannada
+        return doc_query_kannada(self, file_path=file_path, page_number=page_number, prompt=prompt, src_lang=src_lang, language=language, **kwargs)

dwani-0.1.6/dwani/docs.py ADDED Viewed

@@ -0,0 +1,149 @@
+import requests
+from .exceptions import DhwaniAPIError
+def document_ocr(client, file_path, language=None):
+    """OCR a document (image/PDF) and return extracted text."""
+    with open(file_path, "rb") as f:
+        files = {"file": f}
+        data = {}
+        if language:
+            data["language"] = language
+        resp = requests.post(
+            f"{client.api_base}/v1/document/ocr",
+            headers=client._headers(),
+            files=files,
+            data=data
+        )
+    if resp.status_code != 200:
+        raise DhwaniAPIError(resp)
+    return resp.json()
+def document_summarize(client, file_path, page_number=1, src_lang="eng_Latn", tgt_lang="kan_Knda"):
+    """Summarize a PDF document with language and page number options."""
+    url = f"{client.api_base}/v1/indic-summarize-pdf"
+    headers = client._headers()
+    with open(file_path, "rb") as f:
+        files = {"file": (file_path, f, "application/pdf")}
+        data = {
+            "page_number": str(page_number),
+            "src_lang": src_lang,
+            "tgt_lang": tgt_lang
+        }
+        resp = requests.post(
+            url,
+            headers=headers,
+            files=files,
+            data=data
+        )
+    if resp.status_code != 200:
+        raise DhwaniAPIError(resp)
+    return resp.json()
+def extract(client, file_path, page_number, src_lang, tgt_lang):
+    """
+    Extract and translate text from a document (image/PDF) using query parameters.
+    """
+    # Build the URL with query parameters
+    url = (
+        f"{client.api_base}/v1/indic-extract-text/"
+        f"?page_number={page_number}&src_lang={src_lang}&tgt_lang={tgt_lang}"
+    )
+    headers = client._headers()
+    # 'requests' handles multipart/form-data automatically
+    with open(file_path, "rb") as f:
+        files = {"file": (file_path, f, "application/pdf")}
+        resp = requests.post(
+            url,
+            headers=headers,
+            files=files
+        )
+    if resp.status_code != 200:
+        raise DhwaniAPIError(resp)
+    return resp.json()
+def doc_query(
+    client,
+    file_path,
+    page_number=1,
+    prompt="list the key points",
+    src_lang="eng_Latn",
+    tgt_lang="kan_Knda"
+):
+    """Query a document with a custom prompt and language options."""
+    url = f"{client.api_base}/v1/indic-custom-prompt-pdf"
+    headers = client._headers()
+    with open(file_path, "rb") as f:
+        files = {"file": (file_path, f, "application/pdf")}
+        data = {
+            "page_number": str(page_number),
+            "prompt": prompt,
+            "source_language": src_lang,
+            "target_language": tgt_lang
+        }
+        resp = requests.post(
+            url,
+            headers=headers,
+            files=files,
+            data=data
+        )
+    if resp.status_code != 200:
+        raise DhwaniAPIError(resp)
+    return resp.json()
+def doc_query_kannada(
+    client,
+    file_path,
+    page_number=1,
+    prompt="list key points",
+    src_lang="eng_Latn",
+    language=None
+):
+    """Summarize a document (image/PDF/text) with custom prompt and language."""
+    url = f"{client.api_base}/v1/indic-custom-prompt-kannada-pdf"
+    headers = client._headers()
+    # 'requests' will handle multipart/form-data automatically
+    with open(file_path, "rb") as f:
+        files = {"file": (file_path, f, "application/pdf")}
+        data = {
+            "page_number": str(page_number),
+            "prompt": prompt,
+            "src_lang": src_lang,
+        }
+        if language:
+            data["language"] = language
+        resp = requests.post(
+            url,
+            headers=headers,
+            files=files,
+            data=data
+        )
+    if resp.status_code != 200:
+        raise DhwaniAPIError(resp)
+    return resp.json()
+class Documents:
+    @staticmethod
+    def ocr(file_path, language=None):
+        from . import _get_client
+        return _get_client().document_ocr(file_path, language)
+    @staticmethod
+    def summarize(*args, **kwargs):
+        from . import _get_client
+        return _get_client().document_summarize(*args, **kwargs)
+    @staticmethod
+    def run_extract(*args, **kwargs):
+        from . import _get_client
+        return _get_client().extract(*args, **kwargs)
+    @staticmethod
+    def run_doc_query(*args, **kwargs):
+        from . import _get_client
+        return _get_client().doc_query(*args, **kwargs)
+    @staticmethod
+    def run_doc_query_kannada(*args, **kwargs):
+        from . import _get_client
+        return _get_client().doc_query_kannada(*args, **kwargs)

dwani-0.1.6/dwani/translate.py ADDED Viewed

@@ -0,0 +1,26 @@
+from .exceptions import DhwaniAPIError
+import requests
+def run_translate(client, sentences, src_lang, tgt_lang, **kwargs):
+    url = f"{client.api_base}/v1/translate"
+    payload = {
+        "sentences": sentences,
+        "src_lang": src_lang,
+        "tgt_lang": tgt_lang
+    }
+    payload.update(kwargs)
+    resp = requests.post(
+        url,
+        headers={**client._headers(), "Content-Type": "application/json", "accept": "application/json"},
+        json=payload
+    )
+    if resp.status_code != 200:
+        raise DhwaniAPIError(resp)
+    return resp.json()
+class Translate:
+    @staticmethod
+    def run_translate(sentences, src_lang, tgt_lang, **kwargs):
+        from . import _get_client
+        return _get_client().translate(sentences, src_lang, tgt_lang, **kwargs)

dwani-0.1.6/dwani.egg-info/PKG-INFO ADDED Viewed

@@ -0,0 +1,129 @@
+Metadata-Version: 2.4
+Name: dwani
+Version: 0.1.6
+Summary: Multimodal API for Indian languages (Chat, Vision, TTS, ASR, Translate, Docs)
+Author-email: sachin <python@dwani.ai>
+License: MIT License
+        Copyright (c) 2025 Sachin Shetty
+        Permission is hereby granted, free of charge, to any person obtaining a copy
+        of this software and associated documentation files (the "Software"), to deal
+        in the Software without restriction, including without limitation the rights
+        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+        copies of the Software, and to permit persons to whom the Software is
+        furnished to do so, subject to the following conditions:
+        The above copyright notice and this permission notice shall be included in all
+        copies or substantial portions of the Software.
+        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+        SOFTWARE.
+Project-URL: Homepage, https://github.com/dwani-ai/dwani-python
+Project-URL: Source, https://github.com/dwani-ai/dwani-python
+Project-URL: Issues, https://github.com/dwani-ai/dwani-python/issues
+Requires-Python: >=3.8
+Description-Content-Type: text/markdown
+License-File: LICENSE
+Requires-Dist: requests>=2.25.0
+Dynamic: license-file
+# dwani.ai - python library
+### Install the library
+```bash
+pip install dwani
+```
+### Setup the credentials
+```python
+import dwani
+import os
+dwani.api_key = os.getenv("DWANI_API_KEY")
+dwani.api_base = os.getenv("DWANI_API_BASE_URL")
+```
+### Examples
+#### Text Query
+```python
+resp = dwani.Chat.create(prompt="Hello!", src_lang="eng_Latn", tgt_lang="kan_Knda")
+print(resp)
+```
+```json
+{'response': 'ನಮಸ್ತೆ! ಭಾರತ ಮತ್ತು ಕರ್ನಾಟಕವನ್ನು ಗಮನದಲ್ಲಿಟ್ಟುಕೊಂಡು ಇಂದು ನಿಮ್ಮ ಪ್ರಶ್ನೆಗಳಿಗೆ ನಾನು ನಿಮಗೆ ಹೇಗೆ ಸಹಾಯ ಮಾಡಲಿ?'}
+```
+#### Vision Query
+```python
+result = dwani.Vision.caption(
+    file_path="image.png",
+    query="Describe this logo",
+    src_lang="eng_Latn",
+    tgt_lang="kan_Knda"
+)
+print(result)
+```
+```json
+{'answer': 'ಒಂದು ವಾಕ್ಯದಲ್ಲಿ ಚಿತ್ರದ ಸಾರಾಂಶವನ್ನು ಇಲ್ಲಿ ನೀಡಲಾಗಿದೆಃ ಪ್ರಕಟಣೆಯ ಅವಲೋಕನವು ಪ್ರಸ್ತುತ ಅರವತ್ತನಾಲ್ಕು ದೇಶಗಳು/ಪ್ರದೇಶಗಳನ್ನು ಸೇರಿಸಲಾಗಿದೆ ಮತ್ತು ಇನ್ನೂ ಹದಿನಾರು ಪ್ರದೇಶಗಳನ್ನು ಸೇರಿಸಬೇಕಾಗಿದೆ. ಒದಗಿಸಲಾದ ಚಿತ್ರದಲ್ಲಿ ಲಾಂಛನವು ಕಾಣಿಸುವುದಿಲ್ಲ.'}
+```
+#### Speech to Text -  Automatic Speech Recognition (ASR)
+```python
+result = dwani.ASR.transcribe(file_path="kannada_sample.wav", language="kannada")
+print(result)
+```
+```json
+{'text': 'ಕರ್ನಾಟಕ ದ ರಾಜಧಾನಿ ಯಾವುದು'}
+```
+### Translate
+```python
+resp = dwani.Translate.run_translate(sentences=["hi"], src_lang="eng_Latn", tgt_lang="kan_Knda")
+print(resp)
+```
+```json
+{'translations': ['ಹಾಯ್']}
+```
+#### Text to Speech -  Speech Synthesis
+```python
+response = dwani.Audio.speech(input="ಕರ್ನಾಟಕ ದ ರಾಜಧಾನಿ ಯಾವುದು", response_format="mp3")
+with open("output.mp3", "wb") as f:
+    f.write(response)
+```
+#### Document - Extract Text
+```python
+result = dwani.Documents.run_extract(file_path = "dwani-workshop.pdf", page_number=1, src_lang="eng_Latn",tgt_lang="kan_Knda" )
+print(result)
+```
+```json
+{'pages': [{'processed_page': 1, 'page_content': ' a plain text representation of the document', 'translated_content': 'ಡಾಕ್ಯುಮೆಂಟ್ನ ಸರಳ ಪಠ್ಯ ಪ್ರಾತಿನಿಧ್ಯವನ್ನು ಇಲ್ಲಿ ನೀಡಲಾಗಿದೆ, ಅದನ್ನು ಸ್ವಾಭಾವಿಕವಾಗಿ ಓದುವಂತೆಃ'}]}
+```
+- Website -> [dwani.ai](https://dwani.ai)
+<!--
+## local development
+pip install -e .
+pip install twine build
+rm -rf dist/
+python -m build
+python -m twine upload dist/*
+-->

{dwani-0.1.4 → dwani-0.1.6}/dwani.egg-info/SOURCES.txt RENAMED Viewed

@@ -8,6 +8,7 @@ dwani/chat.py
 dwani/client.py
 dwani/docs.py
 dwani/exceptions.py
+dwani/translate.py
 dwani/vision.py
 dwani.egg-info/PKG-INFO
 dwani.egg-info/SOURCES.txt

{dwani-0.1.4 → dwani-0.1.6}/pyproject.toml RENAMED Viewed

@@ -4,8 +4,8 @@ build-backend = "setuptools.build_meta"
 [project]
 name = "dwani"
-version = "0.1.4"
-description = "Multimodal API for Indian languages (speech, vision, LLMs, TTS, ASR, etc.)"
+version = "0.1.6"
+description = "Multimodal API for Indian languages (Chat, Vision, TTS, ASR, Translate, Docs)"
 authors = [
     { name="sachin", email="python@dwani.ai" }
 ]

dwani-0.1.4/PKG-INFO DELETED Viewed

@@ -1,70 +0,0 @@
-Metadata-Version: 2.4
-Name: dwani
-Version: 0.1.4
-Summary: Multimodal API for Indian languages (speech, vision, LLMs, TTS, ASR, etc.)
-Author-email: sachin <python@dwani.ai>
-License: MIT License
-        Copyright (c) 2025 Sachin Shetty
-        Permission is hereby granted, free of charge, to any person obtaining a copy
-        of this software and associated documentation files (the "Software"), to deal
-        in the Software without restriction, including without limitation the rights
-        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
-        copies of the Software, and to permit persons to whom the Software is
-        furnished to do so, subject to the following conditions:
-        The above copyright notice and this permission notice shall be included in all
-        copies or substantial portions of the Software.
-        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
-        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
-        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
-        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-        SOFTWARE.
-Project-URL: Homepage, https://github.com/dwani-ai/dwani-python
-Project-URL: Source, https://github.com/dwani-ai/dwani-python
-Project-URL: Issues, https://github.com/dwani-ai/dwani-python/issues
-Requires-Python: >=3.8
-Description-Content-Type: text/markdown
-License-File: LICENSE
-Requires-Dist: requests>=2.25.0
-Dynamic: license-file
-# dwani.ai - python library
-```bash
-pip install dwani
-```
-```python
-import dwani
-import os
-dwani.api_key = os.getenv("DWANI_API_KEY")
-dwani.api_base = os.getenv("DWANI_API_BASE_URL")
-resp = dwani.Chat.create("Hello!", "eng_Latn", "kan_Knda")
-print(resp)
-```
-<!--
-## local development
-pip install -e .
-pip install twine build
-rm -rf dist/
-python -m build
-python -m twine upload dist/*
--->

dwani-0.1.4/README.md DELETED Viewed

@@ -1,34 +0,0 @@
-# dwani.ai - python library
-```bash
-pip install dwani
-```
-```python
-import dwani
-import os
-dwani.api_key = os.getenv("DWANI_API_KEY")
-dwani.api_base = os.getenv("DWANI_API_BASE_URL")
-resp = dwani.Chat.create("Hello!", "eng_Latn", "kan_Knda")
-print(resp)
-```
-<!--
-## local development
-pip install -e .
-pip install twine build
-rm -rf dist/
-python -m build
-python -m twine upload dist/*
--->

dwani-0.1.4/dwani/client.py DELETED Viewed

@@ -1,41 +0,0 @@
-import os
-import requests
-from .exceptions import DhwaniAPIError
-class DhwaniClient:
-    def __init__(self, api_key=None, api_base=None):
-        self.api_key = api_key or os.getenv("DWANI_API_KEY")
-        self.api_base = api_base or os.getenv("DWANI_API_BASE_URL", "http://localhost:7860")
-        if not self.api_key:
-            raise ValueError("DHWANI_API_KEY not set")
-    def _headers(self):
-        return {"X-API-Key": self.api_key}
-    def chat(self, prompt, src_lang, tgt_lang, **kwargs):
-        from .chat import chat_create
-        return chat_create(self, prompt, src_lang, tgt_lang, **kwargs)
-    def speech(self, *args, **kwargs):
-        from .audio import audio_speech
-        return audio_speech(self, *args, **kwargs)
-    def caption(self, file_path, query="describe the image", src_lang="eng_Latn", tgt_lang="kan_Knda"):
-        from .vision import vision_caption
-        return vision_caption(self, file_path, query, src_lang, tgt_lang)
-    def transcribe(self, *args, **kwargs):
-        from .asr import asr_transcribe
-        return asr_transcribe(self, *args, **kwargs)
-    def document_ocr(self, file_path, language=None):
-        from .docs import document_ocr
-        return document_ocr(self, file_path, language)
-    def document_translate(self, file_path, src_lang, tgt_lang):
-        from .docs import document_translate
-        return document_translate(self, file_path, src_lang, tgt_lang)
-    def document_summarize(self, file_path, language=None):
-        from .docs import document_summarize
-        return document_summarize(self, file_path, language)

dwani-0.1.4/dwani/docs.py DELETED Viewed

@@ -1,70 +0,0 @@
-import requests
-from .exceptions import DhwaniAPIError
-def document_ocr(client, file_path, language=None):
-    """OCR a document (image/PDF) and return extracted text."""
-    with open(file_path, "rb") as f:
-        files = {"file": f}
-        data = {}
-        if language:
-            data["language"] = language
-        resp = requests.post(
-            f"{client.api_base}/v1/document/ocr",
-            headers=client._headers(),
-            files=files,
-            data=data
-        )
-    if resp.status_code != 200:
-        raise DhwaniAPIError(resp)
-    return resp.json()
-def document_translate(client, file_path, src_lang, tgt_lang):
-    """Translate a document (image/PDF with text) from src_lang to tgt_lang."""
-    with open(file_path, "rb") as f:
-        files = {"file": f}
-        data = {
-            "src_lang": src_lang,
-            "tgt_lang": tgt_lang
-        }
-        resp = requests.post(
-            f"{client.api_base}/v1/document/translate",
-            headers=client._headers(),
-            files=files,
-            data=data
-        )
-    if resp.status_code != 200:
-        raise DhwaniAPIError(resp)
-    return resp.json()
-def document_summarize(client, file_path, language=None):
-    """Summarize a document (image/PDF/text)."""
-    with open(file_path, "rb") as f:
-        files = {"file": f}
-        data = {}
-        if language:
-            data["language"] = language
-        resp = requests.post(
-            f"{client.api_base}/v1/document/summarize",
-            headers=client._headers(),
-            files=files,
-            data=data
-        )
-    if resp.status_code != 200:
-        raise DhwaniAPIError(resp)
-    return resp.json()
-class Documents:
-    @staticmethod
-    def ocr(file_path, language=None):
-        from . import _get_client
-        return _get_client().document_ocr(file_path, language)
-    @staticmethod
-    def translate(file_path, src_lang, tgt_lang):
-        from . import _get_client
-        return _get_client().document_translate(file_path, src_lang, tgt_lang)
-    @staticmethod
-    def summarize(file_path, language=None):
-        from . import _get_client
-        return _get_client().document_summarize(file_path, language)

dwani-0.1.4/dwani.egg-info/PKG-INFO DELETED Viewed

@@ -1,70 +0,0 @@
-Metadata-Version: 2.4
-Name: dwani
-Version: 0.1.4
-Summary: Multimodal API for Indian languages (speech, vision, LLMs, TTS, ASR, etc.)
-Author-email: sachin <python@dwani.ai>
-License: MIT License
-        Copyright (c) 2025 Sachin Shetty
-        Permission is hereby granted, free of charge, to any person obtaining a copy
-        of this software and associated documentation files (the "Software"), to deal
-        in the Software without restriction, including without limitation the rights
-        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
-        copies of the Software, and to permit persons to whom the Software is
-        furnished to do so, subject to the following conditions:
-        The above copyright notice and this permission notice shall be included in all
-        copies or substantial portions of the Software.
-        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
-        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
-        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
-        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-        SOFTWARE.
-Project-URL: Homepage, https://github.com/dwani-ai/dwani-python
-Project-URL: Source, https://github.com/dwani-ai/dwani-python
-Project-URL: Issues, https://github.com/dwani-ai/dwani-python/issues
-Requires-Python: >=3.8
-Description-Content-Type: text/markdown
-License-File: LICENSE
-Requires-Dist: requests>=2.25.0
-Dynamic: license-file
-# dwani.ai - python library
-```bash
-pip install dwani
-```
-```python
-import dwani
-import os
-dwani.api_key = os.getenv("DWANI_API_KEY")
-dwani.api_base = os.getenv("DWANI_API_BASE_URL")
-resp = dwani.Chat.create("Hello!", "eng_Latn", "kan_Knda")
-print(resp)
-```
-<!--
-## local development
-pip install -e .
-pip install twine build
-rm -rf dist/
-python -m build
-python -m twine upload dist/*
--->