PyPI - michi-ai - Versions diffs - 0.1.0__py3-none-any.whl → 0.1.1__py3-none-any.whl - Mend

michi-ai 0.1.0py3-none-any.whl → 0.1.1py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (5) hide show

{michi_ai-0.1.0.dist-info → michi_ai-0.1.1.dist-info}/METADATA RENAMED Viewed

@@ -1,12 +1,12 @@
 Metadata-Version: 2.1
 Name: michi-ai
-Version: 0.1.0
+Version: 0.1.1
 Summary: Full-duplex speech LLM client for MichiAI
 Home-page: https://ketsuilabs.io
 License: Apache-2.0
 Keywords: llm,speech,full-duplex,audio
 Author: Damian Krystkiewicz
-Author-email: damian@ketsuilabs.io
+Author-email: 45499236+dkrystki@users.noreply.github.com
 Requires-Python: >=3.9,<4.0
 Classifier: License :: OSI Approved :: Apache Software License
 Classifier: Programming Language :: Python :: 3
@@ -40,7 +40,7 @@ Unlike traditional serial pipelines (ASR → LLM → TTS), MichiAI can listen an
 | **Latency (TTFA)** | ~75ms (tested on RTX 4090) |
 | **Architecture** | Continuous Embeddings + Rectified Flow Matching |
 | **Base Backbone** | SmolLM-360m |
-| **Key Innovation** | No Coherence Loss / Single Forward Pass per Word |
+| **Key Innovation** | No Coherence Loss / Single Step Decoding |
 ## 🌟 Key Features
@@ -56,7 +56,7 @@ Unlike traditional serial pipelines (ASR → LLM → TTS), MichiAI can listen an
 ## 🤖 Architecture Overview
 ### 1. The Listening Head
-A multi-modal encoder mapping raw audio into a continuous embeddings while simultaneously generating text tokens. This ensures the model understands both the semantic meaning and the emotional context.
+A multi-modal encoder mapping raw audio into continuous embeddings while simultaneously generating text tokens. This ensures the model understands both the semantic meaning and the emotional context.
 ### 2. The Speaking Head
 Predicts audio embeddings using **Rectified Flow Matching**. This allows for fast, high-quality, and diverse speech generation. The embeddings are then processed through a lightweight, causal **HiFi-GAN vocoder** for real-time streaming.

michi_ai-0.1.1.dist-info/RECORD ADDED Viewed

@@ -0,0 +1,6 @@
+michiai/__init__.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
+michiai/client.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
+michi_ai-0.1.1.dist-info/LICENSE.txt,sha256=TGMMvGlYQvvAXSjlrZvpXJjiaxnwDsarhDmLY1KIWr0,11351
+michi_ai-0.1.1.dist-info/METADATA,sha256=Kxrbe0rKJ4EEiiaJLPHrtxYuJQuOZPFqEa8yBC9odjU,4133
+michi_ai-0.1.1.dist-info/WHEEL,sha256=sP946D7jFCHeNz5Iq4fL4Lu-PrWrFsgfLXbbkciIZwg,88
+michi_ai-0.1.1.dist-info/RECORD,,

michi_ai-0.1.0.dist-info/RECORD DELETED Viewed

@@ -1,6 +0,0 @@
-michiai/__init__.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
-michiai/client.py,sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU,0
-michi_ai-0.1.0.dist-info/LICENSE.txt,sha256=TGMMvGlYQvvAXSjlrZvpXJjiaxnwDsarhDmLY1KIWr0,11351
-michi_ai-0.1.0.dist-info/METADATA,sha256=oKqjq5PuboN4yquaUsRi2pX-IsmUXjy2WRtCRUpDo80,4121
-michi_ai-0.1.0.dist-info/WHEEL,sha256=sP946D7jFCHeNz5Iq4fL4Lu-PrWrFsgfLXbbkciIZwg,88
-michi_ai-0.1.0.dist-info/RECORD,,

{michi_ai-0.1.0.dist-info → michi_ai-0.1.1.dist-info}/LICENSE.txt RENAMED Viewed

File without changes

{michi_ai-0.1.0.dist-info → michi_ai-0.1.1.dist-info}/WHEEL RENAMED Viewed

File without changes

michi-ai 0.1.0__py3-none-any.whl → 0.1.1__py3-none-any.whl

michi-ai 0.1.0py3-none-any.whl → 0.1.1py3-none-any.whl