@pinecall/skills 0.1.8 → 0.1.10

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@pinecall/skills",
3
- "version": "0.1.8",
3
+ "version": "0.1.10",
4
4
  "description": "Agent Skills for the Pinecall SDK — installable into Claude Code, Antigravity, Cursor, Copilot and any agent that supports the open Skills format.",
5
5
  "type": "module",
6
6
  "license": "MIT",
@@ -34,10 +34,12 @@ Every STT, TTS and LLM model on Pinecall is one of two kinds:
34
34
 
35
35
  | Service | BYOK-only providers |
36
36
  |---|---|
37
- | **STT** | `assemblyai` |
38
- | **TTS** | `rime` |
37
+ | **STT** | `assemblyai`, `soniox` |
38
+ | **TTS** | `rime`, `soniox` |
39
39
  | **LLM** | `xai` (grok), `groq`, `cerebras`, `deepseek`, `openrouter` |
40
40
 
41
+ > `soniox` is one key for **both** STT and TTS (a Soniox key enables both).
42
+
41
43
  ## Check it from the API (authoritative, live)
42
44
 
43
45
  The rate table is the source of truth. Query it any time:
@@ -86,6 +88,26 @@ One key can cover multiple services where a provider shares it — e.g. an
86
88
  **ElevenLabs** key enables both ElevenLabs TTS and ElevenLabs Scribe STT; a
87
89
  **Cartesia** key enables Sonic TTS and Ink-Whisper STT.
88
90
 
91
+ ## Operators: flip a provider managed ↔ BYOK
92
+
93
+ The managed/BYOK status of any provider is a single DB flag (`rate.managed`) and is
94
+ the source of truth — flipping it takes effect **immediately** (next session / page
95
+ load), no deploy or re-seed:
96
+
97
+ ```bash
98
+ # Pinecall provides it (no token needed) — requires <PROVIDER>_API_KEY on the box:
99
+ npx tsx scripts/set-managed.ts assemblyai true
100
+
101
+ # Force bring-your-own-key (the user must add their token):
102
+ npx tsx scripts/set-managed.ts assemblyai false
103
+
104
+ # Per service (e.g. only Cartesia STT):
105
+ npx tsx scripts/set-managed.ts cartesia false stt
106
+ ```
107
+
108
+ Enabling managed only works if Pinecall actually holds that provider's key in the
109
+ prod `.env` (`shipway env` to check); otherwise leave it BYOK.
110
+
89
111
  ## What's next
90
112
 
91
113
  - [STT Providers](/reference/stt-providers)
@@ -29,6 +29,7 @@ Pinecall supports multiple STT providers. Use the `provider/model` format or a f
29
29
  { stt: "cartesia/ink-whisper" } // Cartesia Ink-Whisper
30
30
  { stt: "elevenlabs/scribe" } // ElevenLabs Scribe v2 (realtime)
31
31
  { stt: "assemblyai/universal" } // AssemblyAI Universal-3
32
+ { stt: "soniox/realtime" } // Soniox real-time (BYOK)
32
33
  ```
33
34
 
34
35
  ## Managed vs bring-your-own-key (BYOK)
@@ -46,6 +47,7 @@ for the full list and the live `GET /api/rates/models` query.
46
47
  | `cartesia` (ink-whisper) | ✅ Yes | Same key as Cartesia TTS — Pinecall hosts it |
47
48
  | `elevenlabs` (scribe) | ✅ Yes | Same key as ElevenLabs TTS — Pinecall hosts it |
48
49
  | `assemblyai` (universal) | ❌ BYOK only | Add an AssemblyAI key |
50
+ | `soniox` (realtime) | ❌ BYOK only | One Soniox key = STT **and** TTS |
49
51
 
50
52
  > **BYOK enforcement:** if you configure a BYOK-only STT provider and your org has
51
53
  > not saved a key for it, **agent registration is rejected** with
@@ -179,6 +181,17 @@ stt: {
179
181
  }
180
182
  ```
181
183
 
184
+ ## Soniox (BYOK)
185
+
186
+ Real-time multilingual STT (60+ languages). One Soniox key serves **both** Soniox
187
+ STT and TTS. Requires your own Soniox key.
188
+
189
+ ```typescript
190
+ stt: "soniox/realtime"
191
+ // or
192
+ stt: { provider: "soniox", model: "stt-rt-v5", language: "en" }
193
+ ```
194
+
182
195
  ## Which to choose
183
196
 
184
197
  | Provider | Best for | Trade-off |
@@ -190,6 +203,7 @@ stt: {
190
203
  | `cartesia/ink-whisper` | Single-vendor with Cartesia TTS | Managed (shared key) |
191
204
  | `elevenlabs/scribe` | Single-vendor with ElevenLabs TTS | Managed (shared key) |
192
205
  | `assemblyai/universal` | Accuracy + diarization | BYOK only |
206
+ | `soniox/realtime` | Multilingual (60+), single-vendor with Soniox TTS | BYOK only |
193
207
 
194
208
  For most agents, start with `deepgram/flux`. Use `deepgram/nova-3` for languages Flux doesn't cover (Arabic, Hindi, Thai, Chinese, Japanese, Korean, etc.).
195
209
 
@@ -29,9 +29,10 @@ for the full list and the live `GET /api/rates/models` query.
29
29
  | TTS provider | Managed (no key needed) | Notes |
30
30
  |---|---|---|
31
31
  | `elevenlabs` | ✅ Yes | Default, recommended |
32
- | `cartesia` (sonic) | ✅ Yes | |
32
+ | `cartesia` (sonic-3.5) | ✅ Yes | |
33
33
  | `polly` (AWS) | ✅ Yes | |
34
34
  | `rime` | ❌ BYOK only | Add a Rime key under Provider Keys |
35
+ | `soniox` | ❌ BYOK only | One Soniox key = TTS **and** STT |
35
36
 
36
37
  > **BYOK enforcement:** configuring `rime` without a saved Rime key rejects agent
37
38
  > registration with `PROVIDER_KEY_REQUIRED`. With your own key, that usage is billed
@@ -156,7 +157,7 @@ The model is part of the voice config, so it hot-reloads with it — `agent.upda
156
157
  voice: {
157
158
  provider: "cartesia",
158
159
  voice_id: "a0e99841-438c-4a64-b679-ae501e7d6091",
159
- model: "sonic-3",
160
+ model: "sonic-3.5", // latest; also "sonic-3" / "sonic-latest"
160
161
  speed: 1.0,
161
162
  volume: 1.0,
162
163
  emotion: null,
@@ -168,7 +169,7 @@ Shortcut: `"cartesia/yumiko"`
168
169
 
169
170
  **Tuning notes:**
170
171
 
171
- - `model: "sonic-3"` — fastest Cartesia model, designed for streaming
172
+ - `model: "sonic-3.5"` — latest/fastest Cartesia model (sub-90ms, 42 languages), designed for streaming. `sonic-3` and `sonic-latest` also available.
172
173
  - `emotion` accepts named emotion presets (check Cartesia docs for the current list)
173
174
 
174
175
  ## AWS Polly
@@ -204,6 +205,22 @@ voice: {
204
205
 
205
206
  Shortcut: `"rime/cove"`
206
207
 
208
+ ## Soniox (BYOK)
209
+
210
+ Real-time TTS in 60+ languages. One Soniox key serves **both** Soniox TTS and STT.
211
+ Requires your own Soniox key.
212
+
213
+ ```typescript
214
+ voice: {
215
+ provider: "soniox",
216
+ voice_id: "Adrian", // Soniox voice name
217
+ model: "tts-rt-v1",
218
+ language: "en",
219
+ }
220
+ ```
221
+
222
+ Shortcut: `"soniox/Adrian"`
223
+
207
224
  ## Which to choose
208
225
 
209
226
  | Provider | Best for | Trade-off |
@@ -212,8 +229,9 @@ Shortcut: `"rime/cove"`
212
229
  | **Cartesia** | Real-time streaming, low latency | Smaller voice library |
213
230
  | **Polly** | Cheap IVR, simple flows | Less natural |
214
231
  | **Rime** | Ultra-natural expressive English | BYOK only; English-focused |
232
+ | **Soniox** | Multilingual (60+), single-vendor with Soniox STT | BYOK only |
215
233
 
216
- For most agents, start with ElevenLabs (`eleven_flash_v2_5`) or Cartesia (`sonic-3`). Use Polly only for high-volume, low-engagement flows.
234
+ For most agents, start with ElevenLabs (`eleven_flash_v2_5`) or Cartesia (`sonic-3.5`). Use Polly only for high-volume, low-engagement flows.
217
235
 
218
236
  ## Hot-reloading voices
219
237