@pinecall/skills 0.1.6 → 0.1.8

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
package/package.json CHANGED
@@ -1,6 +1,6 @@
1
1
  {
2
2
  "name": "@pinecall/skills",
3
- "version": "0.1.6",
3
+ "version": "0.1.8",
4
4
  "description": "Agent Skills for the Pinecall SDK — installable into Claude Code, Antigravity, Cursor, Copilot and any agent that supports the open Skills format.",
5
5
  "type": "module",
6
6
  "license": "MIT",
@@ -184,8 +184,25 @@ llm: {
184
184
  llm: "xai/grok-4" // "grok" is accepted as an alias for "xai"
185
185
  ```
186
186
 
187
+ Or with tuning (same config shape as OpenAI — see [Temperature & max_tokens](#temperature--max_tokens)):
188
+
189
+ ```typescript
190
+ llm: {
191
+ provider: "xai",
192
+ model: "grok-4",
193
+ enabled: true,
194
+ temperature: 0.7,
195
+ max_tokens: 512,
196
+ }
197
+ ```
198
+
187
199
  OpenAI-compatible. Requires your own xAI key. Models: `grok-4`, `grok-4-fast`, `grok-3`.
188
200
 
201
+ > **All BYOK LLM providers below (Groq, Cerebras, DeepSeek, OpenRouter) are
202
+ > OpenAI-compatible** — they take the **identical config object** (`provider`,
203
+ > `model`, `temperature`, `max_tokens`, `enabled`) and support the same tools /
204
+ > tuning as OpenAI. Only the `provider` and `model` change.
205
+
189
206
  ## Groq (BYOK)
190
207
 
191
208
  ```typescript
@@ -21,22 +21,23 @@ Every STT, TTS and LLM model on Pinecall is one of two kinds:
21
21
 
22
22
  | Service | Managed providers |
23
23
  |---|---|
24
- | **STT** | `deepgram` (flux, nova-3), `gladia`, `transcribe` (AWS) |
24
+ | **STT** | `deepgram` (flux, nova-3), `gladia`, `transcribe` (AWS), `cartesia` (ink-whisper), `elevenlabs` (scribe) |
25
25
  | **TTS** | `elevenlabs`, `cartesia` (sonic), `polly` (AWS) |
26
26
  | **LLM** | `openai`, `anthropic`, `google` (gemini), `mistral` |
27
27
 
28
+ > **One key, both services:** an ElevenLabs (or Cartesia) key serves *both* that
29
+ > vendor's TTS and STT. Pinecall already holds those keys for the managed TTS, so
30
+ > their STT (ElevenLabs **Scribe**, Cartesia **Ink-Whisper**) is **also managed** —
31
+ > no key needed.
32
+
28
33
  ## What requires your own key (BYOK)
29
34
 
30
35
  | Service | BYOK-only providers |
31
36
  |---|---|
32
- | **STT** | `cartesia` (ink-whisper), `elevenlabs` (scribe), `assemblyai` |
37
+ | **STT** | `assemblyai` |
33
38
  | **TTS** | `rime` |
34
39
  | **LLM** | `xai` (grok), `groq`, `cerebras`, `deepseek`, `openrouter` |
35
40
 
36
- > Note a provider can be **managed for one service and BYOK for another** — e.g.
37
- > Cartesia **TTS** (sonic) is managed, but Cartesia **STT** (ink-whisper) is BYOK.
38
- > ElevenLabs **TTS** is managed, ElevenLabs **STT** (scribe) is BYOK.
39
-
40
41
  ## Check it from the API (authoritative, live)
41
42
 
42
43
  The rate table is the source of truth. Query it any time:
@@ -55,7 +56,7 @@ curl https://playground.pinecall.io/api/rates/models
55
56
  // ...
56
57
  ],
57
58
  "managedProviders": {
58
- "stt": ["deepgram", "gladia", "transcribe"],
59
+ "stt": ["cartesia", "deepgram", "elevenlabs", "gladia", "transcribe"],
59
60
  "tts": ["cartesia", "elevenlabs", "polly"],
60
61
  "llm": ["anthropic", "google", "mistral", "openai"]
61
62
  }
@@ -43,8 +43,8 @@ for the full list and the live `GET /api/rates/models` query.
43
43
  | `deepgram` (flux/nova) | ✅ Yes | Default, recommended |
44
44
  | `gladia` | ✅ Yes | |
45
45
  | `transcribe` (AWS) | ✅ Yes | |
46
- | `cartesia` (ink-whisper) | BYOK only | Add a Cartesia key |
47
- | `elevenlabs` (scribe) | BYOK only | Add an ElevenLabs key |
46
+ | `cartesia` (ink-whisper) | Yes | Same key as Cartesia TTS — Pinecall hosts it |
47
+ | `elevenlabs` (scribe) | Yes | Same key as ElevenLabs TTS — Pinecall hosts it |
48
48
  | `assemblyai` (universal) | ❌ BYOK only | Add an AssemblyAI key |
49
49
 
50
50
  > **BYOK enforcement:** if you configure a BYOK-only STT provider and your org has
@@ -135,10 +135,11 @@ stt: {
135
135
  }
136
136
  ```
137
137
 
138
- ## Cartesia Ink-Whisper (BYOK)
138
+ ## Cartesia Ink-Whisper
139
139
 
140
- Pairs naturally with Cartesia (Sonic) TTS for a single-vendor voice stack. Requires
141
- your own Cartesia key.
140
+ Pairs naturally with Cartesia (Sonic) TTS for a single-vendor voice stack.
141
+ **Managed** the same Cartesia key serves TTS and STT, and Pinecall hosts it (or
142
+ bring your own Cartesia key to bill it directly).
142
143
 
143
144
  ```typescript
144
145
  stt: "cartesia/ink-whisper"
@@ -146,25 +147,36 @@ stt: "cartesia/ink-whisper"
146
147
  stt: { provider: "cartesia", model: "ink-whisper", language: "en" }
147
148
  ```
148
149
 
149
- ## ElevenLabs Scribe (BYOK)
150
+ ## ElevenLabs Scribe
150
151
 
151
- Realtime `scribe_v2_realtime`. Uses the same ElevenLabs key as ElevenLabs TTS.
152
+ Realtime `scribe_v2_realtime`. **Managed** — uses the same ElevenLabs key as
153
+ ElevenLabs TTS, which Pinecall hosts (or bring your own ElevenLabs key).
152
154
 
153
155
  ```typescript
154
156
  stt: "elevenlabs/scribe"
155
- // or
156
- stt: { provider: "elevenlabs", model: "scribe_v2_realtime", language: "en" }
157
+ // or with tuning
158
+ stt: {
159
+ provider: "elevenlabs",
160
+ model: "scribe_v2_realtime",
161
+ language: "en",
162
+ commit_strategy: "vad", // "vad" (server segments turns) | "manual"
163
+ }
157
164
  ```
158
165
 
159
166
  ## AssemblyAI (BYOK)
160
167
 
161
- Universal-3 streaming (`u3-rt-pro`) — strong accuracy + diarization. Requires your
162
- own AssemblyAI key.
168
+ Universal-3 streaming (`u3-rt-pro`) — strong accuracy + diarization. **BYOK only** —
169
+ Pinecall hosts no AssemblyAI key, so add your own under Provider Keys.
163
170
 
164
171
  ```typescript
165
172
  stt: "assemblyai/universal"
166
- // or
167
- stt: { provider: "assemblyai", model: "u3-rt-pro", language: "en" }
173
+ // or with tuning
174
+ stt: {
175
+ provider: "assemblyai",
176
+ model: "u3-rt-pro",
177
+ language: "en",
178
+ format_turns: true, // punctuated/cased final transcripts
179
+ }
168
180
  ```
169
181
 
170
182
  ## Which to choose
@@ -175,8 +187,8 @@ stt: { provider: "assemblyai", model: "u3-rt-pro", language: "en" }
175
187
  | `deepgram/nova-3` | Arabic, Hindi, Thai, CJK, and 60+ languages | Slightly higher latency; smart_turn + silero VAD |
176
188
  | `gladia/solaria` | Code-switching, multilingual | Higher latency than Deepgram |
177
189
  | `transcribe` | AWS-native deployments | AWS pricing model |
178
- | `cartesia/ink-whisper` | Single-vendor with Cartesia TTS | BYOK only |
179
- | `elevenlabs/scribe` | Single-vendor with ElevenLabs TTS | BYOK only |
190
+ | `cartesia/ink-whisper` | Single-vendor with Cartesia TTS | Managed (shared key) |
191
+ | `elevenlabs/scribe` | Single-vendor with ElevenLabs TTS | Managed (shared key) |
180
192
  | `assemblyai/universal` | Accuracy + diarization | BYOK only |
181
193
 
182
194
  For most agents, start with `deepgram/flux`. Use `deepgram/nova-3` for languages Flux doesn't cover (Arabic, Hindi, Thai, Chinese, Japanese, Korean, etc.).