omniroute 2.9.1 → 2.9.2
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +11 -11
- package/app/.next/BUILD_ID +1 -1
- package/app/.next/build-manifest.json +2 -2
- package/app/.next/prerender-manifest.json +3 -3
- package/app/.next/server/app/(dashboard)/dashboard/a2a/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/agents/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/analytics/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/api-manager/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/audit-log/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/auto-combo/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/cli-tools/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/combos/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/costs/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/endpoint/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/health/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/limits/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/logs/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/mcp/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/media/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/onboarding/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/playground/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/profile/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/providers/[id]/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/providers/new/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/providers/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/search-tools/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/settings/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/settings/pricing/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/translator/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/(dashboard)/dashboard/usage/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/400/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/401/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/403/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/408/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/429/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/500/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/502/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/503/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/_global-error.html +2 -2
- package/app/.next/server/app/_global-error.rsc +1 -1
- package/app/.next/server/app/_global-error.segments/__PAGE__.segment.rsc +1 -1
- package/app/.next/server/app/_global-error.segments/_full.segment.rsc +1 -1
- package/app/.next/server/app/_global-error.segments/_head.segment.rsc +1 -1
- package/app/.next/server/app/_global-error.segments/_index.segment.rsc +1 -1
- package/app/.next/server/app/_global-error.segments/_tree.segment.rsc +1 -1
- package/app/.next/server/app/_not-found/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/callback/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/docs/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/forbidden/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/forgot-password/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/landing/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/login/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/maintenance/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/offline/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/privacy/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/status/page_client-reference-manifest.js +1 -1
- package/app/.next/server/app/terms/page_client-reference-manifest.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__17482fc3._.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__80e3bfc3._.js +2 -2
- package/app/.next/server/chunks/[root-of-the-server]__84e445b2._.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__8c8abde4._.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__98560fc3._.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__beb64bf2._.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__d4563e10._.js +1 -1
- package/app/.next/server/chunks/[root-of-the-server]__e27a89bd._.js +1 -1
- package/app/.next/server/chunks/_05c48915._.js +1 -1
- package/app/.next/server/chunks/_2115d8de._.js +1 -1
- package/app/.next/server/chunks/_3ac953eb._.js +1 -1
- package/app/.next/server/chunks/_4b8fd853._.js +1 -1
- package/app/.next/server/chunks/_68683848._.js +1 -1
- package/app/.next/server/chunks/_ac39a13d._.js +1 -1
- package/app/.next/server/chunks/_ee9b677b._.js +1 -1
- package/app/.next/server/chunks/node_modules_next_dist_esm_build_templates_app-route_e1050fd8.js +1 -1
- package/app/.next/server/chunks/ssr/[root-of-the-server]__9affb65e._.js +1 -1
- package/app/.next/server/chunks/ssr/[root-of-the-server]__a6942102._.js +1 -1
- package/app/.next/server/pages/500.html +2 -2
- package/app/.next/server/server-reference-manifest.js +1 -1
- package/app/.next/server/server-reference-manifest.json +1 -1
- package/app/.next/static/chunks/{fef8169cad93c6a0.js → fd85bdf42d9b4807.js} +1 -1
- package/app/CHANGELOG.md +25 -0
- package/app/README.md +11 -11
- package/app/docs/i18n/ar/README.md +2 -2
- package/app/docs/i18n/bg/README.md +2 -2
- package/app/docs/i18n/da/README.md +2 -2
- package/app/docs/i18n/de/README.md +2 -2
- package/app/docs/i18n/es/README.md +8 -8
- package/app/docs/i18n/fi/README.md +8 -8
- package/app/docs/i18n/fr/README.md +8 -8
- package/app/docs/i18n/he/README.md +8 -8
- package/app/docs/i18n/hu/README.md +8 -8
- package/app/docs/i18n/id/README.md +8 -8
- package/app/docs/i18n/in/README.md +8 -8
- package/app/docs/i18n/it/README.md +8 -8
- package/app/docs/i18n/ja/README.md +8 -8
- package/app/docs/i18n/ko/README.md +8 -8
- package/app/docs/i18n/ms/README.md +8 -8
- package/app/docs/i18n/nl/README.md +8 -8
- package/app/docs/i18n/no/README.md +8 -8
- package/app/docs/i18n/phi/README.md +8 -8
- package/app/docs/i18n/pl/README.md +8 -8
- package/app/docs/i18n/pt/README.md +8 -8
- package/app/docs/i18n/pt-BR/README.md +10 -10
- package/app/docs/i18n/ro/README.md +8 -8
- package/app/docs/i18n/ru/README.md +8 -8
- package/app/docs/i18n/sk/README.md +8 -8
- package/app/docs/i18n/sv/README.md +8 -8
- package/app/docs/i18n/th/README.md +8 -8
- package/app/docs/i18n/uk-UA/README.md +8 -8
- package/app/docs/i18n/vi/README.md +8 -8
- package/app/docs/i18n/zh-CN/README.md +8 -8
- package/app/docs/openapi.yaml +1 -1
- package/app/open-sse/handlers/audioSpeech.ts +8 -4
- package/app/open-sse/handlers/audioTranscription.ts +64 -8
- package/app/package-lock.json +2 -2
- package/app/package.json +1 -1
- package/app/tsconfig.json +2 -1
- package/package.json +1 -1
- /package/app/.next/static/{gfcpuM-1Pzw0GdWmd_wBd → oPsVeqYdA1wqN11b0zbRn}/_buildManifest.js +0 -0
- /package/app/.next/static/{gfcpuM-1Pzw0GdWmd_wBd → oPsVeqYdA1wqN11b0zbRn}/_clientMiddlewareManifest.json +0 -0
- /package/app/.next/static/{gfcpuM-1Pzw0GdWmd_wBd → oPsVeqYdA1wqN11b0zbRn}/_ssgManifest.js +0 -0
package/app/CHANGELOG.md
CHANGED
|
@@ -4,6 +4,31 @@
|
|
|
4
4
|
|
|
5
5
|
---
|
|
6
6
|
|
|
7
|
+
## [2.9.2] — 2026-03-21
|
|
8
|
+
|
|
9
|
+
> Sprint: Fix media transcription (Deepgram/HuggingFace Content-Type, language detection) and TTS error display.
|
|
10
|
+
|
|
11
|
+
### 🐛 Bug Fixes
|
|
12
|
+
|
|
13
|
+
- **fix(transcription)**: Deepgram and HuggingFace audio transcription now correctly map `video/mp4` → `audio/mp4` and other media MIME types via new `resolveAudioContentType()` helper. Previously, uploading `.mp4` files consistently returned "No speech detected" because Deepgram was receiving `Content-Type: video/mp4`.
|
|
14
|
+
- **fix(transcription)**: Added `detect_language=true` to Deepgram requests — auto-detects audio language (Portuguese, Spanish, etc.) instead of defaulting to English. Fixes non-English transcriptions returning empty or garbage results.
|
|
15
|
+
- **fix(transcription)**: Added `punctuate=true` to Deepgram requests for higher-quality transcription output with correct punctuation.
|
|
16
|
+
- **fix(tts)**: `[object Object]` error display in Text-to-Speech responses fixed in both `audioSpeech.ts` and `audioTranscription.ts`. The `upstreamErrorResponse()` function now correctly extracts nested string messages from providers like ElevenLabs that return `{ error: { message: "...", status_code: 401 } }` instead of a flat error string.
|
|
17
|
+
|
|
18
|
+
### 🧪 Tests
|
|
19
|
+
|
|
20
|
+
- Test suite: **821 tests, 0 failures** (unchanged)
|
|
21
|
+
|
|
22
|
+
### Triaged Issues
|
|
23
|
+
|
|
24
|
+
- **#508** — Tool call format regression: requested proxy logs and provider chain info (`needs-info`)
|
|
25
|
+
- **#510** — Windows CLI healthcheck path: requested shell/Node version info (`needs-info`)
|
|
26
|
+
- **#485** — Kiro MCP tool calls: closed as external Kiro issue (not OmniRoute)
|
|
27
|
+
- **#442** — Baseten /models endpoint: closed (documented manual workaround)
|
|
28
|
+
- **#464** — Key provisioning API: acknowledged as roadmap item
|
|
29
|
+
|
|
30
|
+
---
|
|
31
|
+
|
|
7
32
|
## [2.9.1] — 2026-03-21
|
|
8
33
|
|
|
9
34
|
> Sprint: Fix SSE omniModel data loss, merge per-protocol model compatibility.
|
package/app/README.md
CHANGED
|
@@ -1105,17 +1105,17 @@ OmniRoute v2.0 is built as an operational platform, not just a relay proxy.
|
|
|
1105
1105
|
|
|
1106
1106
|
### 🎵 Multi-Modal APIs
|
|
1107
1107
|
|
|
1108
|
-
| Feature | What It Does
|
|
1109
|
-
| -------------------------- |
|
|
1110
|
-
| 🖼️ **Image Generation** | `/v1/images/generations` with cloud and local backends
|
|
1111
|
-
| 📐 **Embeddings** | `/v1/embeddings` for search and RAG pipelines
|
|
1112
|
-
| 🎤 **Audio Transcription** | `/v1/audio/transcriptions` (Whisper
|
|
1113
|
-
| 🔊 **Text-to-Speech** | `/v1/audio/speech` (
|
|
1114
|
-
| 🎬 **Video Generation** | `/v1/videos/generations` (ComfyUI + SD WebUI workflows)
|
|
1115
|
-
| 🎵 **Music Generation** | `/v1/music/generations` (ComfyUI workflows)
|
|
1116
|
-
| 🛡️ **Moderations** | `/v1/moderations` safety checks
|
|
1117
|
-
| 🔀 **Reranking** | `/v1/rerank` for relevance scoring
|
|
1118
|
-
| 🔍 **Web Search** 🆕 | `/v1/search` — 5 providers (Serper, Brave, Perplexity, Exa, Tavily), 6,500+ free/month, auto-failover, cache
|
|
1108
|
+
| Feature | What It Does |
|
|
1109
|
+
| -------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
1110
|
+
| 🖼️ **Image Generation** | `/v1/images/generations` with cloud and local backends |
|
|
1111
|
+
| 📐 **Embeddings** | `/v1/embeddings` for search and RAG pipelines |
|
|
1112
|
+
| 🎤 **Audio Transcription** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
1113
|
+
| 🔊 **Text-to-Speech** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) with correct error messages |
|
|
1114
|
+
| 🎬 **Video Generation** | `/v1/videos/generations` (ComfyUI + SD WebUI workflows) |
|
|
1115
|
+
| 🎵 **Music Generation** | `/v1/music/generations` (ComfyUI workflows) |
|
|
1116
|
+
| 🛡️ **Moderations** | `/v1/moderations` safety checks |
|
|
1117
|
+
| 🔀 **Reranking** | `/v1/rerank` for relevance scoring |
|
|
1118
|
+
| 🔍 **Web Search** 🆕 | `/v1/search` — 5 providers (Serper, Brave, Perplexity, Exa, Tavily), 6,500+ free/month, auto-failover, cache |
|
|
1119
1119
|
|
|
1120
1120
|
### 🛡️ Resilience, Security & Governance
|
|
1121
1121
|
|
|
@@ -932,8 +932,8 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
932
932
|
| ميزة | ماذا يفعل || -------------------------- | ------------------------------------------------------------- |
|
|
933
933
|
| 🖼️ **إنشاء الصور** | `/v1/images/generations` مع الواجهات الخلفية السحابية والمحلية |
|
|
934
934
|
| 📐 **المضامين** | `/v1/embeddings` للبحث وخطوط أنابيب RAG |
|
|
935
|
-
| 🎤 **نسخ صوتي** | `/v1/audio/transcriptions` (
|
|
936
|
-
| 🔊 **تحويل النص إلى كلام** | `/v1/audio/speech` (
|
|
935
|
+
| 🎤 **نسخ صوتي** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
936
|
+
| 🔊 **تحويل النص إلى كلام** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
937
937
|
| 🎬 **توليد الفيديو** | `/v1/videos/generations` (سير عمل ComfyUI + SD WebUI) |
|
|
938
938
|
| 🎵 **جيل الموسيقى** | `/v1/music/generations` (سير عمل ComfyUI) |
|
|
939
939
|
| 🛡️ **اعتدالات** | فحوصات السلامة `/v1/moderations` |
|
|
@@ -933,8 +933,8 @@ OmniRoute v2.0 е създаден като операционна платфо
|
|
|
933
933
|
| Характеристика | Какво прави || -------------------------- | ------------------------------------------------------------ |
|
|
934
934
|
| 🖼️ **Генериране на изображения** | `/v1/images/generations` с облак и локален бекенд |
|
|
935
935
|
| 📐 **Вграждания** | `/v1/embeddings` за търсене и RAG тръбопроводи |
|
|
936
|
-
| 🎤 **Аудио транскрипция** | `/v1/audio/transcriptions` (Whisper
|
|
937
|
-
| 🔊 **Текст към говор** | `/v1/audio/speech` (
|
|
936
|
+
| 🎤 **Аудио транскрипция** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
937
|
+
| 🔊 **Текст към говор** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
938
938
|
| 🎬 **Видео генериране** | `/v1/videos/generations` (работни процеси ComfyUI + SD WebUI) |
|
|
939
939
|
| 🎵 **Музикално поколение** | `/v1/music/generations` (работни процеси на ComfyUI) |
|
|
940
940
|
| 🛡️ **Модерации** | `/v1/moderations` проверки за безопасност |
|
|
@@ -934,8 +934,8 @@ OmniRoute v2.0 er bygget som en operationel platform, ikke kun en relæ-proxy.
|
|
|
934
934
|
| Funktion | Hvad det gør || -------------------------- | -------------------------------------------------------------------- |
|
|
935
935
|
| 🖼️ **Billedgenerering** | `/v1/images/generations` med cloud og lokale backends |
|
|
936
936
|
| 📐 **Indlejringer** | `/v1/embeddings` til søgning og RAG-rørledninger |
|
|
937
|
-
| 🎤 **Lydtransskription** | `/v1/audio/transcriptions` (Whisper
|
|
938
|
-
| 🔊 **Tekst-til-tale** | `/v1/audio/speech` (
|
|
937
|
+
| 🎤 **Lydtransskription** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
938
|
+
| 🔊 **Tekst-til-tale** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
939
939
|
| 🎬 **Videogenerering** | `/v1/videos/generations` (ComfyUI + SD WebUI-arbejdsgange) |
|
|
940
940
|
| 🎵 **Music Generation** | `/v1/music/generations` (ComfyUI-arbejdsgange) |
|
|
941
941
|
| 🛡️ **Moderationer** | `/v1/moderations` sikkerhedstjek |
|
|
@@ -939,8 +939,8 @@ OmniRoute v2.0 ist als Betriebsplattform konzipiert und nicht nur als Relay-Prox
|
|
|
939
939
|
| Funktion | Was es tut || -------------------------- | ------------------------------------------------------------- |
|
|
940
940
|
| 🖼️ **Bilderzeugung** | `/v1/images/generations` mit Cloud- und lokalen Backends |
|
|
941
941
|
| 📐 **Einbettungen** | `/v1/embeddings` für Such- und RAG-Pipelines |
|
|
942
|
-
| 🎤 **Audio-Transkription** | `/v1/audio/transcriptions` (Whisper
|
|
943
|
-
| 🔊 **Text-to-Speech** | `/v1/audio/speech` (
|
|
942
|
+
| 🎤 **Audio-Transkription** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
943
|
+
| 🔊 **Text-to-Speech** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
944
944
|
| 🎬 **Videogenerierung** | `/v1/videos/generations` (ComfyUI + SD WebUI-Workflows) |
|
|
945
945
|
| 🎵 **Musikgeneration** | `/v1/music/generations` (ComfyUI-Workflows) |
|
|
946
946
|
| 🛡️ **Moderationen** | `/v1/moderations` Sicherheitsprüfungen |
|
|
@@ -877,14 +877,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
877
877
|
|
|
878
878
|
### 🎵 APIs Multi-Modal
|
|
879
879
|
|
|
880
|
-
| Característica | Qué Hace
|
|
881
|
-
| ----------------------------- |
|
|
882
|
-
| 🖼️ **Generación de Imágenes** | `/v1/images/generations` — 4 proveedores, 9+ modelos
|
|
883
|
-
| 📐 **Embeddings** | `/v1/embeddings` — 6 proveedores, 9+ modelos
|
|
884
|
-
| 🎤 **Transcripción de Audio** | `/v1/audio/transcriptions` —
|
|
885
|
-
| 🔊 **Texto a Voz** | `/v1/audio/speech` —
|
|
886
|
-
| 🛡️ **Moderaciones** | `/v1/moderations` — Verificaciones de seguridad
|
|
887
|
-
| 🔀 **Reranking** | `/v1/rerank` — Reranking de relevancia de documentos
|
|
880
|
+
| Característica | Qué Hace |
|
|
881
|
+
| ----------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
882
|
+
| 🖼️ **Generación de Imágenes** | `/v1/images/generations` — 4 proveedores, 9+ modelos |
|
|
883
|
+
| 📐 **Embeddings** | `/v1/embeddings` — 6 proveedores, 9+ modelos |
|
|
884
|
+
| 🎤 **Transcripción de Audio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
885
|
+
| 🔊 **Texto a Voz** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
886
|
+
| 🛡️ **Moderaciones** | `/v1/moderations` — Verificaciones de seguridad |
|
|
887
|
+
| 🔀 **Reranking** | `/v1/rerank` — Reranking de relevancia de documentos |
|
|
888
888
|
|
|
889
889
|
### 🛡️ Resiliencia y Seguridad
|
|
890
890
|
|
|
@@ -874,14 +874,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
874
874
|
|
|
875
875
|
### 🎵 Multimodaaliset sovellusliittymät
|
|
876
876
|
|
|
877
|
-
| Ominaisuus | Mitä se tekee
|
|
878
|
-
| ------------------------- |
|
|
879
|
-
| 🖼️ **Kuvan luominen** | `/v1/images/generations` — 4 toimittajaa, 9+ mallia
|
|
880
|
-
| 📐 **Upotukset** | `/v1/embeddings` — 6 toimittajaa, 9+ mallia
|
|
881
|
-
| 🎤 **Äänitranskriptio** | `/v1/audio/transcriptions` —
|
|
882
|
-
| 🔊 **Tekstistä puheeksi** | `/v1/audio/speech` —
|
|
883
|
-
| 🛡️ **Moderaatiot** | `/v1/moderations` — Sisällön turvallisuustarkistukset
|
|
884
|
-
| 🔀 **Uudelleenjärjestys** | `/v1/rerank` — Asiakirjan osuvuuden uudelleensijoitus
|
|
877
|
+
| Ominaisuus | Mitä se tekee |
|
|
878
|
+
| ------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
879
|
+
| 🖼️ **Kuvan luominen** | `/v1/images/generations` — 4 toimittajaa, 9+ mallia |
|
|
880
|
+
| 📐 **Upotukset** | `/v1/embeddings` — 6 toimittajaa, 9+ mallia |
|
|
881
|
+
| 🎤 **Äänitranskriptio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
882
|
+
| 🔊 **Tekstistä puheeksi** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
883
|
+
| 🛡️ **Moderaatiot** | `/v1/moderations` — Sisällön turvallisuustarkistukset |
|
|
884
|
+
| 🔀 **Uudelleenjärjestys** | `/v1/rerank` — Asiakirjan osuvuuden uudelleensijoitus |
|
|
885
885
|
|
|
886
886
|
### 🛡️ Joustavuus ja turvallisuus
|
|
887
887
|
|
|
@@ -875,14 +875,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
875
875
|
|
|
876
876
|
### 🎵 APIs multi-modales
|
|
877
877
|
|
|
878
|
-
| Fonctionnalité | Ce qu'elle fait
|
|
879
|
-
| -------------------------- |
|
|
880
|
-
| 🖼️ **Génération d'images** | `/v1/images/generations` — 4 fournisseurs, 9+ modèles
|
|
881
|
-
| 📐 **Embeddings** | `/v1/embeddings` — 6 fournisseurs, 9+ modèles
|
|
882
|
-
| 🎤 **Transcription audio** | `/v1/audio/transcriptions` —
|
|
883
|
-
| 🔊 **Texte vers parole** | `/v1/audio/speech` —
|
|
884
|
-
| 🛡️ **Modérations** | `/v1/moderations` — vérifications de sécurité
|
|
885
|
-
| 🔀 **Reranking** | `/v1/rerank` — reclassement de pertinence des documents
|
|
878
|
+
| Fonctionnalité | Ce qu'elle fait |
|
|
879
|
+
| -------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
880
|
+
| 🖼️ **Génération d'images** | `/v1/images/generations` — 4 fournisseurs, 9+ modèles |
|
|
881
|
+
| 📐 **Embeddings** | `/v1/embeddings` — 6 fournisseurs, 9+ modèles |
|
|
882
|
+
| 🎤 **Transcription audio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
883
|
+
| 🔊 **Texte vers parole** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
884
|
+
| 🛡️ **Modérations** | `/v1/moderations` — vérifications de sécurité |
|
|
885
|
+
| 🔀 **Reranking** | `/v1/rerank` — reclassement de pertinence des documents |
|
|
886
886
|
|
|
887
887
|
### 🛡️ Résilience & Sécurité
|
|
888
888
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 ממשקי API רב-מודאליים
|
|
875
875
|
|
|
876
|
-
| תכונה | מה זה עושה
|
|
877
|
-
| ------------------- |
|
|
878
|
-
| 🖼️ **יצירת תמונות** | `/v1/images/generations` — 4 ספקים, 9+ דגמים
|
|
879
|
-
| 📐 **הטבעות** | `/v1/embeddings` — 6 ספקים, 9+ דגמים
|
|
880
|
-
| 🎤 **תמלול אודיו** | `/v1/audio/transcriptions` —
|
|
881
|
-
| 🔊 **טקסט לדיבור** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **מנחים** | `/v1/moderations` — בדיקות בטיחות תוכן
|
|
883
|
-
| 🔀 **דירוג מחדש** | `/v1/rerank` — דירוג מחדש של רלוונטיות המסמך
|
|
876
|
+
| תכונה | מה זה עושה |
|
|
877
|
+
| ------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **יצירת תמונות** | `/v1/images/generations` — 4 ספקים, 9+ דגמים |
|
|
879
|
+
| 📐 **הטבעות** | `/v1/embeddings` — 6 ספקים, 9+ דגמים |
|
|
880
|
+
| 🎤 **תמלול אודיו** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **טקסט לדיבור** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **מנחים** | `/v1/moderations` — בדיקות בטיחות תוכן |
|
|
883
|
+
| 🔀 **דירוג מחדש** | `/v1/rerank` — דירוג מחדש של רלוונטיות המסמך |
|
|
884
884
|
|
|
885
885
|
### 🛡️ חוסן וביטחון
|
|
886
886
|
|
|
@@ -874,14 +874,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
874
874
|
|
|
875
875
|
### 🎵 Multimodális API-k
|
|
876
876
|
|
|
877
|
-
| Funkció | Mit csinál
|
|
878
|
-
| ---------------------- |
|
|
879
|
-
| 🖼️ **Képgenerálás** | `/v1/images/generations` — 4 szolgáltató, 9+ modell
|
|
880
|
-
| 📐 **Beágyazás** | `/v1/embeddings` — 6 szolgáltató, 9+ modell
|
|
881
|
-
| 🎤 **Audio átírás** | `/v1/audio/transcriptions` —
|
|
882
|
-
| 🔊 **Szövegfelolvasó** | `/v1/audio/speech` —
|
|
883
|
-
| 🛡️ **Moderálás** | `/v1/moderations` — Tartalombiztonsági ellenőrzések
|
|
884
|
-
| 🔀 **Átsorolás** | `/v1/rerank` — A dokumentumok relevancia szerinti átsorolása
|
|
877
|
+
| Funkció | Mit csinál |
|
|
878
|
+
| ---------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
879
|
+
| 🖼️ **Képgenerálás** | `/v1/images/generations` — 4 szolgáltató, 9+ modell |
|
|
880
|
+
| 📐 **Beágyazás** | `/v1/embeddings` — 6 szolgáltató, 9+ modell |
|
|
881
|
+
| 🎤 **Audio átírás** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
882
|
+
| 🔊 **Szövegfelolvasó** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
883
|
+
| 🛡️ **Moderálás** | `/v1/moderations` — Tartalombiztonsági ellenőrzések |
|
|
884
|
+
| 🔀 **Átsorolás** | `/v1/rerank` — A dokumentumok relevancia szerinti átsorolása |
|
|
885
885
|
|
|
886
886
|
### 🛡️ Rugalmasság és biztonság
|
|
887
887
|
|
|
@@ -874,14 +874,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
874
874
|
|
|
875
875
|
### 🎵 API Multi-Modal
|
|
876
876
|
|
|
877
|
-
| Fitur | Apa Fungsinya
|
|
878
|
-
| -------------------------- |
|
|
879
|
-
| 🖼️ **Pembuatan Gambar** | `/v1/images/generations` — 4 penyedia, 9+ model
|
|
880
|
-
| 📐 **Sematan** | `/v1/embeddings` — 6 penyedia, 9+ model
|
|
881
|
-
| 🎤 **Transkripsi Audio** | `/v1/audio/transcriptions` —
|
|
882
|
-
| 🔊 **Teks-ke-Ucapan** | `/v1/audio/speech` —
|
|
883
|
-
| 🛡️ **Moderasi** | `/v1/moderations` — Pemeriksaan keamanan konten
|
|
884
|
-
| 🔀 **Pemeringkatan Ulang** | `/v1/rerank` — Pemeringkatan ulang relevansi dokumen
|
|
877
|
+
| Fitur | Apa Fungsinya |
|
|
878
|
+
| -------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
879
|
+
| 🖼️ **Pembuatan Gambar** | `/v1/images/generations` — 4 penyedia, 9+ model |
|
|
880
|
+
| 📐 **Sematan** | `/v1/embeddings` — 6 penyedia, 9+ model |
|
|
881
|
+
| 🎤 **Transkripsi Audio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
882
|
+
| 🔊 **Teks-ke-Ucapan** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
883
|
+
| 🛡️ **Moderasi** | `/v1/moderations` — Pemeriksaan keamanan konten |
|
|
884
|
+
| 🔀 **Pemeringkatan Ulang** | `/v1/rerank` — Pemeringkatan ulang relevansi dokumen |
|
|
885
885
|
|
|
886
886
|
### 🛡️ Ketahanan & Keamanan
|
|
887
887
|
|
|
@@ -770,14 +770,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
770
770
|
|
|
771
771
|
### 🎵 मल्टी-मॉडल एपीआई
|
|
772
772
|
|
|
773
|
-
| फ़ीचर | यह क्या करता है
|
|
774
|
-
| ---------------------------- |
|
|
775
|
-
| 🖼️ **छवि निर्माण** | `/v1/images/generations` - 4 प्रदाता, 9+ मॉडल
|
|
776
|
-
| 📐 **एंबेडिंग** | `/v1/embeddings` — 6 प्रदाता, 9+ मॉडल
|
|
777
|
-
| 🎤 **ऑडियो ट्रांस्क्रिप्शन** | `/v1/audio/transcriptions` -
|
|
778
|
-
| 🔊 **टेक्स्ट-टू-स्पीच** | `/v1/audio/speech`
|
|
779
|
-
| 🛡️ **संयम** | `/v1/moderations` — सामग्री सुरक्षा जांच
|
|
780
|
-
| 🔀 **पुनर्रैंकिंग** | `/v1/rerank` — दस्तावेज़ प्रासंगिकता पुनर्रैंकिंग
|
|
773
|
+
| फ़ीचर | यह क्या करता है |
|
|
774
|
+
| ---------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
775
|
+
| 🖼️ **छवि निर्माण** | `/v1/images/generations` - 4 प्रदाता, 9+ मॉडल |
|
|
776
|
+
| 📐 **एंबेडिंग** | `/v1/embeddings` — 6 प्रदाता, 9+ मॉडल |
|
|
777
|
+
| 🎤 **ऑडियो ट्रांस्क्रिप्शन** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
778
|
+
| 🔊 **टेक्स्ट-टू-स्पीच** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
779
|
+
| 🛡️ **संयम** | `/v1/moderations` — सामग्री सुरक्षा जांच |
|
|
780
|
+
| 🔀 **पुनर्रैंकिंग** | `/v1/rerank` — दस्तावेज़ प्रासंगिकता पुनर्रैंकिंग |
|
|
781
781
|
|
|
782
782
|
### 🛡️ लचीलापन और सुरक्षा
|
|
783
783
|
|
|
@@ -874,14 +874,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
874
874
|
|
|
875
875
|
### 🎵 API Multi-modali
|
|
876
876
|
|
|
877
|
-
| Funzionalità | Cosa Fa
|
|
878
|
-
| --------------------------- |
|
|
879
|
-
| 🖼️ **Generazione immagini** | `/v1/images/generations` — 4 provider, 9+ modelli
|
|
880
|
-
| 📐 **Embeddings** | `/v1/embeddings` — 6 provider, 9+ modelli
|
|
881
|
-
| 🎤 **Trascrizione audio** | `/v1/audio/transcriptions` —
|
|
882
|
-
| 🔊 **Testo a voce** | `/v1/audio/speech` —
|
|
883
|
-
| 🛡️ **Moderazioni** | `/v1/moderations` — Controlli di sicurezza
|
|
884
|
-
| 🔀 **Reranking** | `/v1/rerank` — Riclassificazione rilevanza documenti
|
|
877
|
+
| Funzionalità | Cosa Fa |
|
|
878
|
+
| --------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
879
|
+
| 🖼️ **Generazione immagini** | `/v1/images/generations` — 4 provider, 9+ modelli |
|
|
880
|
+
| 📐 **Embeddings** | `/v1/embeddings` — 6 provider, 9+ modelli |
|
|
881
|
+
| 🎤 **Trascrizione audio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
882
|
+
| 🔊 **Testo a voce** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
883
|
+
| 🛡️ **Moderazioni** | `/v1/moderations` — Controlli di sicurezza |
|
|
884
|
+
| 🔀 **Reranking** | `/v1/rerank` — Riclassificazione rilevanza documenti |
|
|
885
885
|
|
|
886
886
|
### 🛡️ Resilienza & Sicurezza
|
|
887
887
|
|
|
@@ -874,14 +874,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
874
874
|
|
|
875
875
|
### 🎵 マルチモーダル API
|
|
876
876
|
|
|
877
|
-
| 特集 | 何をするのか
|
|
878
|
-
| ----------------------- |
|
|
879
|
-
| 🖼️ **画像生成** | `/v1/images/generations` — 4 つのプロバイダー、9 つ以上のモデル
|
|
880
|
-
| 📐 **埋め込み** | `/v1/embeddings` — 6 つのプロバイダー、9 つ以上のモデル
|
|
881
|
-
| 🎤 **音声文字起こし** | `/v1/audio/transcriptions` —
|
|
882
|
-
| 🔊 **テキスト読み上げ** | `/v1/audio/speech` —
|
|
883
|
-
| 🛡️ **モデレーション** | `/v1/moderations` — コンテンツの安全性チェック
|
|
884
|
-
| 🔀 **再ランキング** | `/v1/rerank` — ドキュメントの関連性の再ランキング
|
|
877
|
+
| 特集 | 何をするのか |
|
|
878
|
+
| ----------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
879
|
+
| 🖼️ **画像生成** | `/v1/images/generations` — 4 つのプロバイダー、9 つ以上のモデル |
|
|
880
|
+
| 📐 **埋め込み** | `/v1/embeddings` — 6 つのプロバイダー、9 つ以上のモデル |
|
|
881
|
+
| 🎤 **音声文字起こし** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
882
|
+
| 🔊 **テキスト読み上げ** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
883
|
+
| 🛡️ **モデレーション** | `/v1/moderations` — コンテンツの安全性チェック |
|
|
884
|
+
| 🔀 **再ランキング** | `/v1/rerank` — ドキュメントの関連性の再ランキング |
|
|
885
885
|
|
|
886
886
|
### 🛡️ 復元力とセキュリティ
|
|
887
887
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 다중 모드 API
|
|
875
875
|
|
|
876
|
-
| 기능 | 그것이 하는 일
|
|
877
|
-
| ----------------------- |
|
|
878
|
-
| 🖼️ **이미지 생성** | `/v1/images/generations` — 4개 공급자, 9개 이상의 모델
|
|
879
|
-
| 📐 **임베딩** | `/v1/embeddings` — 6개 공급자, 9개 이상의 모델
|
|
880
|
-
| 🎤 **오디오 전사** | `/v1/audio/transcriptions` —
|
|
881
|
-
| 🔊 **텍스트 음성 변환** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **조정** | `/v1/moderations` — 콘텐츠 안전 확인
|
|
883
|
-
| 🔀 **재순위** | `/v1/rerank` — 문서 관련성 재순위
|
|
876
|
+
| 기능 | 그것이 하는 일 |
|
|
877
|
+
| ----------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **이미지 생성** | `/v1/images/generations` — 4개 공급자, 9개 이상의 모델 |
|
|
879
|
+
| 📐 **임베딩** | `/v1/embeddings` — 6개 공급자, 9개 이상의 모델 |
|
|
880
|
+
| 🎤 **오디오 전사** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **텍스트 음성 변환** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **조정** | `/v1/moderations` — 콘텐츠 안전 확인 |
|
|
883
|
+
| 🔀 **재순위** | `/v1/rerank` — 문서 관련성 재순위 |
|
|
884
884
|
|
|
885
885
|
### 🛡️ 복원력 및 보안
|
|
886
886
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 API Berbilang Modal
|
|
875
875
|
|
|
876
|
-
| Ciri | Apa yang Dilakukan
|
|
877
|
-
| ------------------------ |
|
|
878
|
-
| 🖼️ **Penjanaan Imej** | `/v1/images/generations` — 4 pembekal, 9+ model
|
|
879
|
-
| 📐 **Pembenaman** | `/v1/embeddings` — 6 pembekal, 9+ model
|
|
880
|
-
| 🎤 **Transkripsi Audio** | `/v1/audio/transcriptions` —
|
|
881
|
-
| 🔊 **Teks-ke-Ucapan** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **Kesederhanaan** | `/v1/moderations` — Pemeriksaan keselamatan kandungan
|
|
883
|
-
| 🔀 **Penyusunan semula** | `/v1/rerank` — Penarafan semula perkaitan dokumen
|
|
876
|
+
| Ciri | Apa yang Dilakukan |
|
|
877
|
+
| ------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **Penjanaan Imej** | `/v1/images/generations` — 4 pembekal, 9+ model |
|
|
879
|
+
| 📐 **Pembenaman** | `/v1/embeddings` — 6 pembekal, 9+ model |
|
|
880
|
+
| 🎤 **Transkripsi Audio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **Teks-ke-Ucapan** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **Kesederhanaan** | `/v1/moderations` — Pemeriksaan keselamatan kandungan |
|
|
883
|
+
| 🔀 **Penyusunan semula** | `/v1/rerank` — Penarafan semula perkaitan dokumen |
|
|
884
884
|
|
|
885
885
|
### 🛡️ Ketahanan & Keselamatan
|
|
886
886
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 Multimodale API's
|
|
875
875
|
|
|
876
|
-
| Kenmerk | Wat het doet
|
|
877
|
-
| ------------------------ |
|
|
878
|
-
| 🖼️ **Beeldgeneratie** | `/v1/images/generations` — 4 providers, 9+ modellen
|
|
879
|
-
| 📐 **Insluitingen** | `/v1/embeddings` — 6 providers, 9+ modellen
|
|
880
|
-
| 🎤 **Audiotranscriptie** | `/v1/audio/transcriptions` — Whisper-
|
|
881
|
-
| 🔊 **Tekst-naar-spraak** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **Moderaties** | `/v1/moderations` — Veiligheidscontroles van inhoud
|
|
883
|
-
| 🔀 **Herschikking** | `/v1/rerank` — Herschikking van documentrelevantie
|
|
876
|
+
| Kenmerk | Wat het doet |
|
|
877
|
+
| ------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **Beeldgeneratie** | `/v1/images/generations` — 4 providers, 9+ modellen |
|
|
879
|
+
| 📐 **Insluitingen** | `/v1/embeddings` — 6 providers, 9+ modellen |
|
|
880
|
+
| 🎤 **Audiotranscriptie** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **Tekst-naar-spraak** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **Moderaties** | `/v1/moderations` — Veiligheidscontroles van inhoud |
|
|
883
|
+
| 🔀 **Herschikking** | `/v1/rerank` — Herschikking van documentrelevantie |
|
|
884
884
|
|
|
885
885
|
### 🛡️ Veerkracht en veiligheid
|
|
886
886
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 Multi-Modal APIer
|
|
875
875
|
|
|
876
|
-
| Funksjon | Hva det gjør
|
|
877
|
-
| ----------------------- |
|
|
878
|
-
| 🖼️ **Bildegenerering** | `/v1/images/generations` — 4 leverandører, 9+ modeller
|
|
879
|
-
| 📐 **Innbygging** | `/v1/embeddings` — 6 leverandører, 9+ modeller
|
|
880
|
-
| 🎤 **Lydtranskripsjon** | `/v1/audio/transcriptions` — Whisper-
|
|
881
|
-
| 🔊 **Tekst-til-tale** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **Moderasjoner** | `/v1/moderations` — Innholdssikkerhetssjekker
|
|
883
|
-
| 🔀 **Omrangering** | `/v1/rerank` — Rerangering av dokumentrelevans
|
|
876
|
+
| Funksjon | Hva det gjør |
|
|
877
|
+
| ----------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **Bildegenerering** | `/v1/images/generations` — 4 leverandører, 9+ modeller |
|
|
879
|
+
| 📐 **Innbygging** | `/v1/embeddings` — 6 leverandører, 9+ modeller |
|
|
880
|
+
| 🎤 **Lydtranskripsjon** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **Tekst-til-tale** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **Moderasjoner** | `/v1/moderations` — Innholdssikkerhetssjekker |
|
|
883
|
+
| 🔀 **Omrangering** | `/v1/rerank` — Rerangering av dokumentrelevans |
|
|
884
884
|
|
|
885
885
|
### 🛡️ Spenst og sikkerhet
|
|
886
886
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 Mga Multi-Modal na API
|
|
875
875
|
|
|
876
|
-
| Tampok | Ano ang Ginagawa Nito
|
|
877
|
-
| -------------------------- |
|
|
878
|
-
| 🖼️ **Pagbuo ng Larawan** | `/v1/images/generations` — 4 na provider, 9+ na modelo
|
|
879
|
-
| 📐 **Mga Pag-embed** | `/v1/embeddings` — 6 na provider, 9+ na modelo
|
|
880
|
-
| 🎤 **Audio Transcription** | `/v1/audio/transcriptions` — Whisper-
|
|
881
|
-
| 🔊 **Text-to-Speech** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **Mga Pag-moderate** | `/v1/moderations` — Mga pagsusuri sa kaligtasan ng nilalaman
|
|
883
|
-
| 🔀 **Reranking** | `/v1/rerank` — Muling pagraranggo ng kaugnayan ng dokumento
|
|
876
|
+
| Tampok | Ano ang Ginagawa Nito |
|
|
877
|
+
| -------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **Pagbuo ng Larawan** | `/v1/images/generations` — 4 na provider, 9+ na modelo |
|
|
879
|
+
| 📐 **Mga Pag-embed** | `/v1/embeddings` — 6 na provider, 9+ na modelo |
|
|
880
|
+
| 🎤 **Audio Transcription** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **Text-to-Speech** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **Mga Pag-moderate** | `/v1/moderations` — Mga pagsusuri sa kaligtasan ng nilalaman |
|
|
883
|
+
| 🔀 **Reranking** | `/v1/rerank` — Muling pagraranggo ng kaugnayan ng dokumento |
|
|
884
884
|
|
|
885
885
|
### 🛡️ Katatagan at Seguridad
|
|
886
886
|
|
|
@@ -873,14 +873,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
873
873
|
|
|
874
874
|
### 🎵 Wielomodalne interfejsy API
|
|
875
875
|
|
|
876
|
-
| Funkcja | Co to robi
|
|
877
|
-
| ----------------------------- |
|
|
878
|
-
| 🖼️ **Generowanie obrazu** | `/v1/images/generations` — 4 dostawców, ponad 9 modeli
|
|
879
|
-
| 📐 **Osadzenia** | `/v1/embeddings` — 6 dostawców, ponad 9 modeli
|
|
880
|
-
| 🎤 **Transkrypcja audio** | `/v1/audio/transcriptions` —
|
|
881
|
-
| 🔊 **Zamiana tekstu na mowę** | `/v1/audio/speech` —
|
|
882
|
-
| 🛡️ **Moderacje** | `/v1/moderations` — Kontrola bezpieczeństwa treści
|
|
883
|
-
| 🔀 **Ponowna pozycja** | `/v1/rerank` — Zmiana rankingu trafności dokumentu
|
|
876
|
+
| Funkcja | Co to robi |
|
|
877
|
+
| ----------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
878
|
+
| 🖼️ **Generowanie obrazu** | `/v1/images/generations` — 4 dostawców, ponad 9 modeli |
|
|
879
|
+
| 📐 **Osadzenia** | `/v1/embeddings` — 6 dostawców, ponad 9 modeli |
|
|
880
|
+
| 🎤 **Transkrypcja audio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
881
|
+
| 🔊 **Zamiana tekstu na mowę** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
882
|
+
| 🛡️ **Moderacje** | `/v1/moderations` — Kontrola bezpieczeństwa treści |
|
|
883
|
+
| 🔀 **Ponowna pozycja** | `/v1/rerank` — Zmiana rankingu trafności dokumentu |
|
|
884
884
|
|
|
885
885
|
### 🛡️ Odporność i bezpieczeństwo
|
|
886
886
|
|
|
@@ -874,14 +874,14 @@ npm run electron:build:linux # Linux (.AppImage)
|
|
|
874
874
|
|
|
875
875
|
### 🎵 APIs multimodais
|
|
876
876
|
|
|
877
|
-
| Recurso | O que faz
|
|
878
|
-
| --------------------------------- |
|
|
879
|
-
| 🖼️ **Geração de imagens** | `/v1/images/generations` — 4 provedores, mais de 9 modelos
|
|
880
|
-
| 📐 **Incorporações** | `/v1/embeddings` — 6 provedores, mais de 9 modelos
|
|
881
|
-
| 🎤 **Transcrição de áudio** | `/v1/audio/transcriptions` —
|
|
882
|
-
| 🔊 **Conversão de texto em fala** | `/v1/audio/speech` —
|
|
883
|
-
| 🛡️ **Moderações** | `/v1/moderations` — Verificações de segurança de conteúdo
|
|
884
|
-
| 🔀 **Reclassificação** | `/v1/rerank` — Reclassificação da relevância dos documentos
|
|
877
|
+
| Recurso | O que faz |
|
|
878
|
+
| --------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
879
|
+
| 🖼️ **Geração de imagens** | `/v1/images/generations` — 4 provedores, mais de 9 modelos |
|
|
880
|
+
| 📐 **Incorporações** | `/v1/embeddings` — 6 provedores, mais de 9 modelos |
|
|
881
|
+
| 🎤 **Transcrição de áudio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
882
|
+
| 🔊 **Conversão de texto em fala** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
883
|
+
| 🛡️ **Moderações** | `/v1/moderations` — Verificações de segurança de conteúdo |
|
|
884
|
+
| 🔀 **Reclassificação** | `/v1/rerank` — Reclassificação da relevância dos documentos |
|
|
885
885
|
|
|
886
886
|
### 🛡️ Resiliência e segurança
|
|
887
887
|
|
|
@@ -879,16 +879,16 @@ Por que isso é relevante:
|
|
|
879
879
|
|
|
880
880
|
### 🎵 APIs Multi-Modal
|
|
881
881
|
|
|
882
|
-
| Funcionalidade | O que Faz
|
|
883
|
-
| --------------------------- |
|
|
884
|
-
| 🖼️ **Geração de Imagem** | `/v1/images/generations` — 10 provedores, 20+ modelos (cloud + local)
|
|
885
|
-
| 📐 **Embeddings** | `/v1/embeddings` — 6 provedores, 9+ modelos
|
|
886
|
-
| 🎤 **Transcrição de Áudio** | `/v1/audio/transcriptions` —
|
|
887
|
-
| 🔊 **Texto para Fala** | `/v1/audio/speech` — ElevenLabs,
|
|
888
|
-
| 🎬 **Geração de Vídeo** | `/v1/videos/generations` — ComfyUI (AnimateDiff, SVD), SD WebUI
|
|
889
|
-
| 🎵 **Geração de Música** | `/v1/music/generations` — ComfyUI (Stable Audio Open, MusicGen)
|
|
890
|
-
| 🛡️ **Moderações** | `/v1/moderations` — Verificações de segurança
|
|
891
|
-
| 🔀 **Reranking** | `/v1/rerank` — Reranking de relevância de documentos
|
|
882
|
+
| Funcionalidade | O que Faz |
|
|
883
|
+
| --------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
|
884
|
+
| 🖼️ **Geração de Imagem** | `/v1/images/generations` — 10 provedores, 20+ modelos (cloud + local) |
|
|
885
|
+
| 📐 **Embeddings** | `/v1/embeddings` — 6 provedores, 9+ modelos |
|
|
886
|
+
| 🎤 **Transcrição de Áudio** | `/v1/audio/transcriptions` — 7 providers (Deepgram Nova 3, AssemblyAI, Groq Whisper, HuggingFace, ElevenLabs, OpenAI, Azure), auto-language detection, MP4/MP3/WAV support |
|
|
887
|
+
| 🔊 **Texto para Fala** | `/v1/audio/speech` — 10 providers (ElevenLabs, OpenAI, Deepgram, Cartesia, PlayHT, HuggingFace, Nvidia NIM, Inworld, Coqui, Tortoise) |
|
|
888
|
+
| 🎬 **Geração de Vídeo** | `/v1/videos/generations` — ComfyUI (AnimateDiff, SVD), SD WebUI |
|
|
889
|
+
| 🎵 **Geração de Música** | `/v1/music/generations` — ComfyUI (Stable Audio Open, MusicGen) |
|
|
890
|
+
| 🛡️ **Moderações** | `/v1/moderations` — Verificações de segurança |
|
|
891
|
+
| 🔀 **Reranking** | `/v1/rerank` — Reranking de relevância de documentos |
|
|
892
892
|
|
|
893
893
|
### 🛡️ Resiliência e Segurança
|
|
894
894
|
|