tokencostauto 0.1.25__py3-none-any.whl

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
@@ -0,0 +1,1175 @@
1
+ Metadata-Version: 2.4
2
+ Name: tokencostauto
3
+ Version: 0.1.25
4
+ Summary: To calculate token and translated USD cost of string and message calls to OpenAI, for example when used by AI agents
5
+ Author-email: Trisha Pan <trishaepan@gmail.com>, Alex Reibman <areibman@gmail.com>, Pratyush Shukla <ps4534@nyu.edu>, Thiago MadPin <madpin@gmail.com>
6
+ Project-URL: Homepage, https://github.com/madpin/tokencostaudo
7
+ Project-URL: Issues, https://github.com/madpin/tokencostauto/issues
8
+ Classifier: Programming Language :: Python :: 3
9
+ Classifier: License :: OSI Approved :: MIT License
10
+ Classifier: Operating System :: OS Independent
11
+ Requires-Python: >=3.10
12
+ Description-Content-Type: text/markdown
13
+ License-File: LICENSE
14
+ Requires-Dist: tiktoken>=0.9.0
15
+ Requires-Dist: aiohttp>=3.9.3
16
+ Requires-Dist: anthropic>=0.34.0
17
+ Provides-Extra: dev
18
+ Requires-Dist: pytest>=7.4.4; extra == "dev"
19
+ Requires-Dist: flake8>=3.1.0; extra == "dev"
20
+ Requires-Dist: coverage[toml]>=7.4.0; extra == "dev"
21
+ Requires-Dist: tach>=0.6.9; extra == "dev"
22
+ Requires-Dist: tabulate>=0.9.0; extra == "dev"
23
+ Requires-Dist: pandas>=2.1.0; extra == "dev"
24
+ Dynamic: license-file
25
+
26
+ <p align="center">
27
+ <img src="https://raw.githubusercontent.com/AgentOps-AI/tokencost/main/tokencost.png" height="300" alt="Tokencost" />
28
+ </p>
29
+
30
+ <p align="center">
31
+ <em>Clientside token counting + price estimation for LLM apps and AI agents.</em>
32
+ </p>
33
+ <p align="center">
34
+ <a href="https://pypi.org/project/tokencostauto/" target="_blank">
35
+ <img alt="Python" src="https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54" />
36
+ <img alt="Version" src="https://img.shields.io/pypi/v/tokencostauto?style=for-the-badge&color=3670A0">
37
+ </a>
38
+ </p>
39
+ <p align="center">
40
+ <a href="https://twitter.com/agentopsai/">🐦 Twitter</a>
41
+ <span>&nbsp;&nbsp;•&nbsp;&nbsp;</span>
42
+ <a href="https://discord.com/invite/FagdcwwXRR">📢 Discord</a>
43
+ <span>&nbsp;&nbsp;•&nbsp;&nbsp;</span>
44
+ <a href="https://agentops.ai/?tokencostauto">🖇️ AgentOps</a>
45
+ </p>
46
+
47
+
48
+ # TokenCost
49
+ [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT) ![PyPI - Version](https://img.shields.io/pypi/v/tokencost)
50
+ [![X (formerly Twitter) Follow](https://img.shields.io/twitter/follow/AgentOpsAI)](https://x.com/agentopsai)
51
+
52
+ Tokencost helps calculate the USD cost of using major Large Language Model (LLMs) APIs by calculating the estimated cost of prompts and completions.
53
+
54
+ Building AI agents? Check out [AgentOps](https://agentops.ai/?tokencostauto)
55
+
56
+
57
+ ### Features
58
+ * **LLM Price Tracking** Major LLM providers frequently add new models and update pricing. This repo helps track the latest price changes
59
+ * **Token counting** Accurately count prompt tokens before sending OpenAI requests
60
+ * **Easy integration** Get the cost of a prompt or completion with a single function
61
+
62
+ ### Example usage:
63
+
64
+ ```python
65
+ from tokencostauto import calculate_prompt_cost, calculate_completion_cost
66
+
67
+ model = "gpt-3.5-turbo"
68
+ prompt = [{ "role": "user", "content": "Hello world"}]
69
+ completion = "How may I assist you today?"
70
+
71
+ prompt_cost = calculate_prompt_cost(prompt, model)
72
+ completion_cost = calculate_completion_cost(completion, model)
73
+
74
+ print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
75
+ # 0.0000135 + 0.000014 = 0.0000275
76
+ ```
77
+
78
+ ## Installation
79
+
80
+ #### Recommended: [PyPI](https://pypi.org/project/tokencostauto/):
81
+
82
+ ```bash
83
+ pip install tokencostauto
84
+ ```
85
+
86
+ ## Usage
87
+
88
+ ### Cost estimates
89
+ Calculating the cost of prompts and completions from OpenAI requests
90
+ ```python
91
+ from openai import OpenAI
92
+
93
+ client = OpenAI()
94
+ model = "gpt-3.5-turbo"
95
+ prompt = [{ "role": "user", "content": "Say this is a test"}]
96
+
97
+ chat_completion = client.chat.completions.create(
98
+ messages=prompt, model=model
99
+ )
100
+
101
+ completion = chat_completion.choices[0].message.content
102
+ # "This is a test."
103
+
104
+ prompt_cost = calculate_prompt_cost(prompt, model)
105
+ completion_cost = calculate_completion_cost(completion, model)
106
+ print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
107
+ # 0.0000180 + 0.000010 = 0.0000280
108
+ ```
109
+
110
+ **Calculating cost using string prompts instead of messages:**
111
+ ```python
112
+ from tokencostauto import calculate_prompt_cost
113
+
114
+ prompt_string = "Hello world"
115
+ response = "How may I assist you today?"
116
+ model= "gpt-3.5-turbo"
117
+
118
+ prompt_cost = calculate_prompt_cost(prompt_string, model)
119
+ print(f"Cost: ${prompt_cost}")
120
+ # Cost: $3e-06tokencostauto
121
+ ```
122
+
123
+ **Counting tokens**
124
+
125
+ ```python
126
+ from tokencostauto import count_message_tokens, count_string_tokens
127
+
128
+ message_prompt = [{ "role": "user", "content": "Hello world"}]
129
+ # Counting tokens in prompts formatted as message lists
130
+ print(count_message_tokens(message_prompt, model="gpt-3.5-turbo"))
131
+ # 9
132
+
133
+ # Alternatively, counting tokens in string prompts
134
+ print(count_string_tokens(prompt="Hello world", model="gpt-3.5-turbo"))
135
+ # 2
136
+
137
+ ```
138
+
139
+ ## How tokens are counted
140
+
141
+ Under the hood, strings and ChatML messages are tokenized using [Tiktoken](https://github.com/openai/tiktoken), OpenAI's official tokenizer. Tiktoken splits text into tokens (which can be parts of words or individual characters) and handles both raw strings and message formats with additional tokens for message formatting and roles.
142
+
143
+ For Anthropic models above version 3 (i.e. Sonnet 3.5, Haiku 3.5, and Opus 3), we use the [Anthropic beta token counting API](https://docs.anthropic.com/claude/docs/beta-api-for-counting-tokens) to ensure accurate token counts. For older Claude models, we approximate using Tiktoken with the cl100k_base encoding.
144
+
145
+
146
+ ## Cost table
147
+ Units denominated in USD. All prices can be located in `model_prices.json`.
148
+
149
+
150
+ * Prices last updated Jan 30, 2024 from [LiteLLM's cost dictionary](https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json)
151
+
152
+ | Model Name | Prompt Cost (USD) per 1M tokens | Completion Cost (USD) per 1M tokens | Max Prompt Tokens | Max Output Tokens |
153
+ |:----------------------------------------------------------------------|:----------------------------------|:--------------------------------------|:--------------------|--------------------:|
154
+ | gpt-4 | $30 | $60 | 8192 | 4096 |
155
+ | gpt-4o | $2.5 | $10 | 128,000 | 16384 |
156
+ | gpt-4o-audio-preview | $2.5 | $10 | 128,000 | 16384 |
157
+ | gpt-4o-audio-preview-2024-10-01 | $2.5 | $10 | 128,000 | 16384 |
158
+ | gpt-4o-mini | $0.15 | $0.6 | 128,000 | 16384 |
159
+ | gpt-4o-mini-2024-07-18 | $0.15 | $0.6 | 128,000 | 16384 |
160
+ | o1-mini | $1.1 | $4.4 | 128,000 | 65536 |
161
+ | o1-mini-2024-09-12 | $3 | $12 | 128,000 | 65536 |
162
+ | o1-preview | $15 | $60 | 128,000 | 32768 |
163
+ | o1-preview-2024-09-12 | $15 | $60 | 128,000 | 32768 |
164
+ | chatgpt-4o-latest | $5 | $15 | 128,000 | 4096 |
165
+ | gpt-4o-2024-05-13 | $5 | $15 | 128,000 | 4096 |
166
+ | gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
167
+ | gpt-4-turbo-preview | $10 | $30 | 128,000 | 4096 |
168
+ | gpt-4-0314 | $30 | $60 | 8,192 | 4096 |
169
+ | gpt-4-0613 | $30 | $60 | 8,192 | 4096 |
170
+ | gpt-4-32k | $60 | $120 | 32,768 | 4096 |
171
+ | gpt-4-32k-0314 | $60 | $120 | 32,768 | 4096 |
172
+ | gpt-4-32k-0613 | $60 | $120 | 32,768 | 4096 |
173
+ | gpt-4-turbo | $10 | $30 | 128,000 | 4096 |
174
+ | gpt-4-turbo-2024-04-09 | $10 | $30 | 128,000 | 4096 |
175
+ | gpt-4-1106-preview | $10 | $30 | 128,000 | 4096 |
176
+ | gpt-4-0125-preview | $10 | $30 | 128,000 | 4096 |
177
+ | gpt-4-vision-preview | $10 | $30 | 128,000 | 4096 |
178
+ | gpt-4-1106-vision-preview | $10 | $30 | 128,000 | 4096 |
179
+ | gpt-3.5-turbo | $1.5 | $2 | 16,385 | 4096 |
180
+ | gpt-3.5-turbo-0301 | $1.5 | $2 | 4,097 | 4096 |
181
+ | gpt-3.5-turbo-0613 | $1.5 | $2 | 4,097 | 4096 |
182
+ | gpt-3.5-turbo-1106 | $1 | $2 | 16,385 | 4096 |
183
+ | gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16,385 | 4096 |
184
+ | gpt-3.5-turbo-16k | $3 | $4 | 16,385 | 4096 |
185
+ | gpt-3.5-turbo-16k-0613 | $3 | $4 | 16,385 | 4096 |
186
+ | ft:gpt-3.5-turbo | $3 | $6 | 16,385 | 4096 |
187
+ | ft:gpt-3.5-turbo-0125 | $3 | $6 | 16,385 | 4096 |
188
+ | ft:gpt-3.5-turbo-1106 | $3 | $6 | 16,385 | 4096 |
189
+ | ft:gpt-3.5-turbo-0613 | $3 | $6 | 4,096 | 4096 |
190
+ | ft:gpt-4-0613 | $30 | $60 | 8,192 | 4096 |
191
+ | ft:gpt-4o-2024-08-06 | $3.75 | $15 | 128,000 | 16384 |
192
+ | ft:gpt-4o-mini-2024-07-18 | $0.3 | $1.2 | 128,000 | 16384 |
193
+ | ft:davinci-002 | $2 | $2 | 16,384 | 4096 |
194
+ | ft:babbage-002 | $0.4 | $0.4 | 16,384 | 4096 |
195
+ | text-embedding-3-large | $0.13 | $0 | 8,191 | nan |
196
+ | text-embedding-3-small | $0.02 | $0 | 8,191 | nan |
197
+ | text-embedding-ada-002 | $0.1 | $0 | 8,191 | nan |
198
+ | text-embedding-ada-002-v2 | $0.1 | $0 | 8,191 | nan |
199
+ | text-moderation-stable | $0 | $0 | 32,768 | 0 |
200
+ | text-moderation-007 | $0 | $0 | 32,768 | 0 |
201
+ | text-moderation-latest | $0 | $0 | 32,768 | 0 |
202
+ | 256-x-256/dall-e-2 | -- | -- | nan | nan |
203
+ | 512-x-512/dall-e-2 | -- | -- | nan | nan |
204
+ | 1024-x-1024/dall-e-2 | -- | -- | nan | nan |
205
+ | hd/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
206
+ | hd/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
207
+ | hd/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
208
+ | standard/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
209
+ | standard/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
210
+ | standard/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
211
+ | whisper-1 | -- | -- | nan | nan |
212
+ | tts-1 | -- | -- | nan | nan |
213
+ | tts-1-hd | -- | -- | nan | nan |
214
+ | azure/tts-1 | -- | -- | nan | nan |
215
+ | azure/tts-1-hd | -- | -- | nan | nan |
216
+ | azure/whisper-1 | -- | -- | nan | nan |
217
+ | azure/o1-mini | $1.21 | $4.84 | 128,000 | 65536 |
218
+ | azure/o1-mini-2024-09-12 | $1.1 | $4.4 | 128,000 | 65536 |
219
+ | azure/o1-preview | $15 | $60 | 128,000 | 32768 |
220
+ | azure/o1-preview-2024-09-12 | $15 | $60 | 128,000 | 32768 |
221
+ | azure/gpt-4o | $2.5 | $10 | 128,000 | 16384 |
222
+ | azure/gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
223
+ | azure/gpt-4o-2024-05-13 | $5 | $15 | 128,000 | 4096 |
224
+ | azure/global-standard/gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
225
+ | azure/global-standard/gpt-4o-mini | $0.15 | $0.6 | 128,000 | 16384 |
226
+ | azure/gpt-4o-mini | $0.16 | $0.66 | 128,000 | 16384 |
227
+ | azure/gpt-4-turbo-2024-04-09 | $10 | $30 | 128,000 | 4096 |
228
+ | azure/gpt-4-0125-preview | $10 | $30 | 128,000 | 4096 |
229
+ | azure/gpt-4-1106-preview | $10 | $30 | 128,000 | 4096 |
230
+ | azure/gpt-4-0613 | $30 | $60 | 8,192 | 4096 |
231
+ | azure/gpt-4-32k-0613 | $60 | $120 | 32,768 | 4096 |
232
+ | azure/gpt-4-32k | $60 | $120 | 32,768 | 4096 |
233
+ | azure/gpt-4 | $30 | $60 | 8,192 | 4096 |
234
+ | azure/gpt-4-turbo | $10 | $30 | 128,000 | 4096 |
235
+ | azure/gpt-4-turbo-vision-preview | $10 | $30 | 128,000 | 4096 |
236
+ | azure/gpt-35-turbo-16k-0613 | $3 | $4 | 16,385 | 4096 |
237
+ | azure/gpt-35-turbo-1106 | $1 | $2 | 16,384 | 4096 |
238
+ | azure/gpt-35-turbo-0613 | $1.5 | $2 | 4,097 | 4096 |
239
+ | azure/gpt-35-turbo-0301 | $0.2 | $2 | 4,097 | 4096 |
240
+ | azure/gpt-35-turbo-0125 | $0.5 | $1.5 | 16,384 | 4096 |
241
+ | azure/gpt-35-turbo-16k | $3 | $4 | 16,385 | 4096 |
242
+ | azure/gpt-35-turbo | $0.5 | $1.5 | 4,097 | 4096 |
243
+ | azure/gpt-3.5-turbo-instruct-0914 | $1.5 | $2 | 4,097 | nan |
244
+ | azure/gpt-35-turbo-instruct | $1.5 | $2 | 4,097 | nan |
245
+ | azure/gpt-35-turbo-instruct-0914 | $1.5 | $2 | 4,097 | nan |
246
+ | azure/mistral-large-latest | $8 | $24 | 32,000 | nan |
247
+ | azure/mistral-large-2402 | $8 | $24 | 32,000 | nan |
248
+ | azure/command-r-plus | $3 | $15 | 128,000 | 4096 |
249
+ | azure/ada | $0.1 | $0 | 8,191 | nan |
250
+ | azure/text-embedding-ada-002 | $0.1 | $0 | 8,191 | nan |
251
+ | azure/text-embedding-3-large | $0.13 | $0 | 8,191 | nan |
252
+ | azure/text-embedding-3-small | $0.02 | $0 | 8,191 | nan |
253
+ | azure/standard/1024-x-1024/dall-e-3 | -- | $0 | nan | nan |
254
+ | azure/hd/1024-x-1024/dall-e-3 | -- | $0 | nan | nan |
255
+ | azure/standard/1024-x-1792/dall-e-3 | -- | $0 | nan | nan |
256
+ | azure/standard/1792-x-1024/dall-e-3 | -- | $0 | nan | nan |
257
+ | azure/hd/1024-x-1792/dall-e-3 | -- | $0 | nan | nan |
258
+ | azure/hd/1792-x-1024/dall-e-3 | -- | $0 | nan | nan |
259
+ | azure/standard/1024-x-1024/dall-e-2 | -- | $0 | nan | nan |
260
+ | azure_ai/jamba-instruct | $0.5 | $0.7 | 70,000 | 4096 |
261
+ | azure_ai/mistral-large | $4 | $12 | 32,000 | 8191 |
262
+ | azure_ai/mistral-small | $1 | $3 | 32,000 | 8191 |
263
+ | azure_ai/Meta-Llama-3-70B-Instruct | $1.1 | $0.37 | 8,192 | 2048 |
264
+ | azure_ai/Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 | 128,000 | 2048 |
265
+ | azure_ai/Meta-Llama-3.1-70B-Instruct | $2.68 | $3.54 | 128,000 | 2048 |
266
+ | azure_ai/Meta-Llama-3.1-405B-Instruct | $5.33 | $16 | 128,000 | 2048 |
267
+ | azure_ai/cohere-rerank-v3-multilingual | $0 | $0 | 4,096 | 4096 |
268
+ | azure_ai/cohere-rerank-v3-english | $0 | $0 | 4,096 | 4096 |
269
+ | azure_ai/Cohere-embed-v3-english | $0.1 | $0 | 512 | nan |
270
+ | azure_ai/Cohere-embed-v3-multilingual | $0.1 | $0 | 512 | nan |
271
+ | babbage-002 | $0.4 | $0.4 | 16,384 | 4096 |
272
+ | davinci-002 | $2 | $2 | 16,384 | 4096 |
273
+ | gpt-3.5-turbo-instruct | $1.5 | $2 | 8,192 | 4096 |
274
+ | gpt-3.5-turbo-instruct-0914 | $1.5 | $2 | 8,192 | 4097 |
275
+ | claude-instant-1 | $1.63 | $5.51 | 100,000 | 8191 |
276
+ | mistral/mistral-tiny | $0.25 | $0.25 | 32,000 | 8191 |
277
+ | mistral/mistral-small | $0.1 | $0.3 | 32,000 | 8191 |
278
+ | mistral/mistral-small-latest | $0.1 | $0.3 | 32,000 | 8191 |
279
+ | mistral/mistral-medium | $2.7 | $8.1 | 32,000 | 8191 |
280
+ | mistral/mistral-medium-latest | $2.7 | $8.1 | 32,000 | 8191 |
281
+ | mistral/mistral-medium-2312 | $2.7 | $8.1 | 32,000 | 8191 |
282
+ | mistral/mistral-large-latest | $2 | $6 | 128,000 | 128000 |
283
+ | mistral/mistral-large-2402 | $4 | $12 | 32,000 | 8191 |
284
+ | mistral/mistral-large-2407 | $3 | $9 | 128,000 | 128000 |
285
+ | mistral/pixtral-12b-2409 | $0.15 | $0.15 | 128,000 | 128000 |
286
+ | mistral/open-mistral-7b | $0.25 | $0.25 | 32,000 | 8191 |
287
+ | mistral/open-mixtral-8x7b | $0.7 | $0.7 | 32,000 | 8191 |
288
+ | mistral/open-mixtral-8x22b | $2 | $6 | 65,336 | 8191 |
289
+ | mistral/codestral-latest | $1 | $3 | 32,000 | 8191 |
290
+ | mistral/codestral-2405 | $1 | $3 | 32,000 | 8191 |
291
+ | mistral/open-mistral-nemo | $0.3 | $0.3 | 128,000 | 128000 |
292
+ | mistral/open-mistral-nemo-2407 | $0.3 | $0.3 | 128,000 | 128000 |
293
+ | mistral/open-codestral-mamba | $0.25 | $0.25 | 256,000 | 256000 |
294
+ | mistral/codestral-mamba-latest | $0.25 | $0.25 | 256,000 | 256000 |
295
+ | mistral/mistral-embed | $0.1 | -- | 8,192 | nan |
296
+ | deepseek-chat | $0.14 | $0.28 | 128,000 | 4096 |
297
+ | codestral/codestral-latest | $0 | $0 | 32,000 | 8191 |
298
+ | codestral/codestral-2405 | $0 | $0 | 32,000 | 8191 |
299
+ | text-completion-codestral/codestral-latest | $0 | $0 | 32,000 | 8191 |
300
+ | text-completion-codestral/codestral-2405 | $0 | $0 | 32,000 | 8191 |
301
+ | deepseek-coder | $0.14 | $0.28 | 128,000 | 4096 |
302
+ | groq/llama2-70b-4096 | $0.7 | $0.8 | 4,096 | 4096 |
303
+ | groq/llama3-8b-8192 | $0.05 | $0.08 | 8,192 | 8192 |
304
+ | groq/llama3-70b-8192 | $0.59 | $0.79 | 8,192 | 8192 |
305
+ | groq/llama-3.1-8b-instant | $0.05 | $0.08 | 8,192 | 8192 |
306
+ | groq/llama-3.1-70b-versatile | $0.59 | $0.79 | 8,192 | 8192 |
307
+ | groq/llama-3.1-405b-reasoning | $0.59 | $0.79 | 8,192 | 8192 |
308
+ | groq/mixtral-8x7b-32768 | $0.24 | $0.24 | 32,768 | 32768 |
309
+ | groq/gemma-7b-it | $0.07 | $0.07 | 8,192 | 8192 |
310
+ | groq/gemma2-9b-it | $0.2 | $0.2 | 8,192 | 8192 |
311
+ | groq/llama3-groq-70b-8192-tool-use-preview | $0.89 | $0.89 | 8,192 | 8192 |
312
+ | groq/llama3-groq-8b-8192-tool-use-preview | $0.19 | $0.19 | 8,192 | 8192 |
313
+ | cerebras/llama3.1-8b | $0.1 | $0.1 | 128,000 | 128000 |
314
+ | cerebras/llama3.1-70b | $0.6 | $0.6 | 128,000 | 128000 |
315
+ | friendliai/mixtral-8x7b-instruct-v0-1 | $0.4 | $0.4 | 32,768 | 32768 |
316
+ | friendliai/meta-llama-3-8b-instruct | $0.1 | $0.1 | 8,192 | 8192 |
317
+ | friendliai/meta-llama-3-70b-instruct | $0.8 | $0.8 | 8,192 | 8192 |
318
+ | claude-instant-1.2 | $0.16 | $0.55 | 100,000 | 8191 |
319
+ | claude-2 | $8 | $24 | 100,000 | 8191 |
320
+ | claude-2.1 | $8 | $24 | 200,000 | 8191 |
321
+ | claude-3-haiku-20240307 | $0.25 | $1.25 | 200,000 | 4096 |
322
+ | claude-3-haiku-latest | $0.25 | $1.25 | 200,000 | 4096 |
323
+ | claude-3-opus-20240229 | $15 | $75 | 200,000 | 4096 |
324
+ | claude-3-opus-latest | $15 | $75 | 200,000 | 4096 |
325
+ | claude-3-sonnet-20240229 | $3 | $15 | 200,000 | 4096 |
326
+ | claude-3-5-sonnet-20240620 | $3 | $15 | 200,000 | 8192 |
327
+ | claude-3-5-sonnet-20241022 | $3 | $15 | 200,000 | 8192 |
328
+ | claude-3-5-sonnet-latest | $3 | $15 | 200,000 | 8192 |
329
+ | text-bison | -- | -- | 8,192 | 2048 |
330
+ | text-bison@001 | -- | -- | 8,192 | 1024 |
331
+ | text-bison@002 | -- | -- | 8,192 | 1024 |
332
+ | text-bison32k | $0.12 | $0.12 | 8,192 | 1024 |
333
+ | text-bison32k@002 | $0.12 | $0.12 | 8,192 | 1024 |
334
+ | text-unicorn | $10 | $28 | 8,192 | 1024 |
335
+ | text-unicorn@001 | $10 | $28 | 8,192 | 1024 |
336
+ | chat-bison | $0.12 | $0.12 | 8,192 | 4096 |
337
+ | chat-bison@001 | $0.12 | $0.12 | 8,192 | 4096 |
338
+ | chat-bison@002 | $0.12 | $0.12 | 8,192 | 4096 |
339
+ | chat-bison-32k | $0.12 | $0.12 | 32,000 | 8192 |
340
+ | chat-bison-32k@002 | $0.12 | $0.12 | 32,000 | 8192 |
341
+ | code-bison | $0.12 | $0.12 | 6,144 | 1024 |
342
+ | code-bison@001 | $0.12 | $0.12 | 6,144 | 1024 |
343
+ | code-bison@002 | $0.12 | $0.12 | 6,144 | 1024 |
344
+ | code-bison32k | $0.12 | $0.12 | 6,144 | 1024 |
345
+ | code-bison-32k@002 | $0.12 | $0.12 | 6,144 | 1024 |
346
+ | code-gecko@001 | $0.12 | $0.12 | 2,048 | 64 |
347
+ | code-gecko@002 | $0.12 | $0.12 | 2,048 | 64 |
348
+ | code-gecko | $0.12 | $0.12 | 2,048 | 64 |
349
+ | code-gecko-latest | $0.12 | $0.12 | 2,048 | 64 |
350
+ | codechat-bison@latest | $0.12 | $0.12 | 6,144 | 1024 |
351
+ | codechat-bison | $0.12 | $0.12 | 6,144 | 1024 |
352
+ | codechat-bison@001 | $0.12 | $0.12 | 6,144 | 1024 |
353
+ | codechat-bison@002 | $0.12 | $0.12 | 6,144 | 1024 |
354
+ | codechat-bison-32k | $0.12 | $0.12 | 32,000 | 8192 |
355
+ | codechat-bison-32k@002 | $0.12 | $0.12 | 32,000 | 8192 |
356
+ | gemini-pro | $0.5 | $1.5 | 32,760 | 8192 |
357
+ | gemini-1.0-pro | $0.5 | $1.5 | 32,760 | 8192 |
358
+ | gemini-1.0-pro-001 | $0.5 | $1.5 | 32,760 | 8192 |
359
+ | gemini-1.0-ultra | $0.5 | $1.5 | 8,192 | 2048 |
360
+ | gemini-1.0-ultra-001 | $0.5 | $1.5 | 8,192 | 2048 |
361
+ | gemini-1.0-pro-002 | $0.5 | $1.5 | 32,760 | 8192 |
362
+ | gemini-1.5-pro | $1.25 | $5 | 2,097,152 | 8192 |
363
+ | gemini-1.5-pro-002 | $1.25 | $5 | 2,097,152 | 8192 |
364
+ | gemini-1.5-pro-001 | $1.25 | $5 | 1,000,000 | 8192 |
365
+ | gemini-1.5-pro-preview-0514 | $0.08 | $0.31 | 1,000,000 | 8192 |
366
+ | gemini-1.5-pro-preview-0215 | $0.08 | $0.31 | 1,000,000 | 8192 |
367
+ | gemini-1.5-pro-preview-0409 | $0.08 | $0.31 | 1,000,000 | 8192 |
368
+ | gemini-1.5-flash | $0.08 | $0.3 | 1,000,000 | 8192 |
369
+ | gemini-1.5-flash-exp-0827 | $0 | $0 | 1,000,000 | 8192 |
370
+ | gemini-1.5-flash-002 | $0.08 | $0.3 | 1,048,576 | 8192 |
371
+ | gemini-1.5-flash-001 | $0.08 | $0.3 | 1,000,000 | 8192 |
372
+ | gemini-1.5-flash-preview-0514 | $0.08 | $0 | 1,000,000 | 8192 |
373
+ | gemini-pro-experimental | $0 | $0 | 1,000,000 | 8192 |
374
+ | gemini-flash-experimental | $0 | $0 | 1,000,000 | 8192 |
375
+ | gemini-pro-vision | $0.5 | $1.5 | 16,384 | 2048 |
376
+ | gemini-1.0-pro-vision | $0.5 | $1.5 | 16,384 | 2048 |
377
+ | gemini-1.0-pro-vision-001 | $0.5 | $1.5 | 16,384 | 2048 |
378
+ | medlm-medium | -- | -- | 32,768 | 8192 |
379
+ | medlm-large | -- | -- | 8,192 | 1024 |
380
+ | vertex_ai/claude-3-sonnet@20240229 | $3 | $15 | 200,000 | 4096 |
381
+ | vertex_ai/claude-3-5-sonnet@20240620 | $3 | $15 | 200,000 | 8192 |
382
+ | vertex_ai/claude-3-5-sonnet-v2@20241022 | $3 | $15 | 200,000 | 8192 |
383
+ | vertex_ai/claude-3-haiku@20240307 | $0.25 | $1.25 | 200,000 | 4096 |
384
+ | vertex_ai/claude-3-opus@20240229 | $15 | $75 | 200,000 | 4096 |
385
+ | vertex_ai/meta/llama3-405b-instruct-maas | $0 | $0 | 32,000 | 32000 |
386
+ | vertex_ai/meta/llama3-70b-instruct-maas | $0 | $0 | 32,000 | 32000 |
387
+ | vertex_ai/meta/llama3-8b-instruct-maas | $0 | $0 | 32,000 | 32000 |
388
+ | vertex_ai/meta/llama-3.2-90b-vision-instruct-maas | $0 | $0 | 128,000 | 2048 |
389
+ | vertex_ai/mistral-large@latest | $2 | $6 | 128,000 | 8191 |
390
+ | vertex_ai/mistral-large@2407 | $2 | $6 | 128,000 | 8191 |
391
+ | vertex_ai/mistral-nemo@latest | $0.15 | $0.15 | 128,000 | 128000 |
392
+ | vertex_ai/jamba-1.5-mini@001 | $0.2 | $0.4 | 256,000 | 256000 |
393
+ | vertex_ai/jamba-1.5-large@001 | $2 | $8 | 256,000 | 256000 |
394
+ | vertex_ai/jamba-1.5 | $0.2 | $0.4 | 256,000 | 256000 |
395
+ | vertex_ai/jamba-1.5-mini | $0.2 | $0.4 | 256,000 | 256000 |
396
+ | vertex_ai/jamba-1.5-large | $2 | $8 | 256,000 | 256000 |
397
+ | vertex_ai/mistral-nemo@2407 | $3 | $3 | 128,000 | 128000 |
398
+ | vertex_ai/codestral@latest | $0.2 | $0.6 | 128,000 | 128000 |
399
+ | vertex_ai/codestral@2405 | $0.2 | $0.6 | 128,000 | 128000 |
400
+ | vertex_ai/imagegeneration@006 | -- | -- | nan | nan |
401
+ | vertex_ai/imagen-3.0-generate-001 | -- | -- | nan | nan |
402
+ | vertex_ai/imagen-3.0-fast-generate-001 | -- | -- | nan | nan |
403
+ | text-embedding-004 | $0.1 | $0 | 2,048 | nan |
404
+ | text-multilingual-embedding-002 | $0.1 | $0 | 2,048 | nan |
405
+ | textembedding-gecko | $0.1 | $0 | 3,072 | nan |
406
+ | textembedding-gecko-multilingual | $0.1 | $0 | 3,072 | nan |
407
+ | textembedding-gecko-multilingual@001 | $0.1 | $0 | 3,072 | nan |
408
+ | textembedding-gecko@001 | $0.1 | $0 | 3,072 | nan |
409
+ | textembedding-gecko@003 | $0.1 | $0 | 3,072 | nan |
410
+ | text-embedding-preview-0409 | $0.01 | $0 | 3,072 | nan |
411
+ | text-multilingual-embedding-preview-0409 | $0.01 | $0 | 3,072 | nan |
412
+ | palm/chat-bison | $0.12 | $0.12 | 8,192 | 4096 |
413
+ | palm/chat-bison-001 | $0.12 | $0.12 | 8,192 | 4096 |
414
+ | palm/text-bison | $0.12 | $0.12 | 8,192 | 1024 |
415
+ | palm/text-bison-001 | $0.12 | $0.12 | 8,192 | 1024 |
416
+ | palm/text-bison-safety-off | $0.12 | $0.12 | 8,192 | 1024 |
417
+ | palm/text-bison-safety-recitation-off | $0.12 | $0.12 | 8,192 | 1024 |
418
+ | gemini/gemini-1.5-flash-002 | $0.08 | $0.3 | 1,048,576 | 8192 |
419
+ | gemini/gemini-1.5-flash-001 | $0.08 | $0.3 | 1,048,576 | 8192 |
420
+ | gemini/gemini-1.5-flash | $0.08 | $0.3 | 1,048,576 | 8192 |
421
+ | gemini/gemini-1.5-flash-latest | $0.08 | $0.3 | 1,048,576 | 8192 |
422
+ | gemini/gemini-1.5-flash-8b-exp-0924 | $0 | $0 | 1,048,576 | 8192 |
423
+ | gemini/gemini-1.5-flash-exp-0827 | $0 | $0 | 1,048,576 | 8192 |
424
+ | gemini/gemini-1.5-flash-8b-exp-0827 | $0 | $0 | 1,000,000 | 8192 |
425
+ | gemini/gemini-pro | $0.35 | $1.05 | 32,760 | 8192 |
426
+ | gemini/gemini-1.5-pro | $3.5 | $10.5 | 2,097,152 | 8192 |
427
+ | gemini/gemini-1.5-pro-002 | $3.5 | $10.5 | 2,097,152 | 8192 |
428
+ | gemini/gemini-1.5-pro-001 | $3.5 | $10.5 | 2,097,152 | 8192 |
429
+ | gemini/gemini-1.5-pro-exp-0801 | $3.5 | $10.5 | 2,097,152 | 8192 |
430
+ | gemini/gemini-1.5-pro-exp-0827 | $0 | $0 | 2,097,152 | 8192 |
431
+ | gemini/gemini-1.5-pro-latest | $3.5 | $1.05 | 1,048,576 | 8192 |
432
+ | gemini/gemini-pro-vision | $0.35 | $1.05 | 30,720 | 2048 |
433
+ | gemini/gemini-gemma-2-27b-it | $0.35 | $1.05 | nan | 8192 |
434
+ | gemini/gemini-gemma-2-9b-it | $0.35 | $1.05 | nan | 8192 |
435
+ | command-r | $0.15 | $0.6 | 128,000 | 4096 |
436
+ | command-r-08-2024 | $0.15 | $0.6 | 128,000 | 4096 |
437
+ | command-light | $0.3 | $0.6 | 4,096 | 4096 |
438
+ | command-r-plus | $2.5 | $10 | 128,000 | 4096 |
439
+ | command-r-plus-08-2024 | $2.5 | $10 | 128,000 | 4096 |
440
+ | command-nightly | $1 | $2 | 4,096 | 4096 |
441
+ | command | $1 | $2 | 4,096 | 4096 |
442
+ | rerank-english-v3.0 | $0 | $0 | 4,096 | 4096 |
443
+ | rerank-multilingual-v3.0 | $0 | $0 | 4,096 | 4096 |
444
+ | rerank-english-v2.0 | $0 | $0 | 4,096 | 4096 |
445
+ | rerank-multilingual-v2.0 | $0 | $0 | 4,096 | 4096 |
446
+ | embed-english-v3.0 | $0.1 | $0 | 1,024 | nan |
447
+ | embed-english-light-v3.0 | $0.1 | $0 | 1,024 | nan |
448
+ | embed-multilingual-v3.0 | $0.1 | $0 | 1,024 | nan |
449
+ | embed-english-v2.0 | $0.1 | $0 | 4,096 | nan |
450
+ | embed-english-light-v2.0 | $0.1 | $0 | 1,024 | nan |
451
+ | embed-multilingual-v2.0 | $0.1 | $0 | 768 | nan |
452
+ | replicate/meta/llama-2-13b | $0.1 | $0.5 | 4,096 | 4096 |
453
+ | replicate/meta/llama-2-13b-chat | $0.1 | $0.5 | 4,096 | 4096 |
454
+ | replicate/meta/llama-2-70b | $0.65 | $2.75 | 4,096 | 4096 |
455
+ | replicate/meta/llama-2-70b-chat | $0.65 | $2.75 | 4,096 | 4096 |
456
+ | replicate/meta/llama-2-7b | $0.05 | $0.25 | 4,096 | 4096 |
457
+ | replicate/meta/llama-2-7b-chat | $0.05 | $0.25 | 4,096 | 4096 |
458
+ | replicate/meta/llama-3-70b | $0.65 | $2.75 | 8,192 | 8192 |
459
+ | replicate/meta/llama-3-70b-instruct | $0.65 | $2.75 | 8,192 | 8192 |
460
+ | replicate/meta/llama-3-8b | $0.05 | $0.25 | 8,086 | 8086 |
461
+ | replicate/meta/llama-3-8b-instruct | $0.05 | $0.25 | 8,086 | 8086 |
462
+ | replicate/mistralai/mistral-7b-v0.1 | $0.05 | $0.25 | 4,096 | 4096 |
463
+ | replicate/mistralai/mistral-7b-instruct-v0.2 | $0.05 | $0.25 | 4,096 | 4096 |
464
+ | replicate/mistralai/mixtral-8x7b-instruct-v0.1 | $0.3 | $1 | 4,096 | 4096 |
465
+ | openrouter/deepseek/deepseek-coder | $0.14 | $0.28 | 66,000 | 4096 |
466
+ | openrouter/microsoft/wizardlm-2-8x22b:nitro | $1 | $1 | nan | nan |
467
+ | openrouter/google/gemini-pro-1.5 | $2.5 | $7.5 | 1,000,000 | 8192 |
468
+ | openrouter/mistralai/mixtral-8x22b-instruct | $0.65 | $0.65 | nan | nan |
469
+ | openrouter/cohere/command-r-plus | $3 | $15 | nan | nan |
470
+ | openrouter/databricks/dbrx-instruct | $0.6 | $0.6 | nan | nan |
471
+ | openrouter/anthropic/claude-3-haiku | $0.25 | $1.25 | nan | nan |
472
+ | openrouter/anthropic/claude-3-haiku-20240307 | $0.25 | $1.25 | 200,000 | 4096 |
473
+ | anthropic/claude-3-5-sonnet-20241022 | $3 | $15 | 200,000 | 8192 |
474
+ | anthropic/claude-3-5-sonnet-latest | $3 | $15 | 200,000 | 8192 |
475
+ | openrouter/anthropic/claude-3.5-sonnet | $3 | $15 | 200,000 | 8192 |
476
+ | openrouter/anthropic/claude-3.5-sonnet:beta | $3 | $15 | 200,000 | 8192 |
477
+ | openrouter/anthropic/claude-3-sonnet | $3 | $15 | nan | nan |
478
+ | openrouter/mistralai/mistral-large | $8 | $24 | nan | nan |
479
+ | openrouter/cognitivecomputations/dolphin-mixtral-8x7b | $0.5 | $0.5 | nan | nan |
480
+ | openrouter/google/gemini-pro-vision | $0.12 | $0.38 | nan | nan |
481
+ | openrouter/fireworks/firellava-13b | $0.2 | $0.2 | nan | nan |
482
+ | openrouter/meta-llama/llama-3-8b-instruct:free | $0 | $0 | nan | nan |
483
+ | openrouter/meta-llama/llama-3-8b-instruct:extended | $0.22 | $2.25 | nan | nan |
484
+ | openrouter/meta-llama/llama-3-70b-instruct:nitro | $0.9 | $0.9 | nan | nan |
485
+ | openrouter/meta-llama/llama-3-70b-instruct | $0.59 | $0.79 | nan | nan |
486
+ | openrouter/openai/o1-mini | $3 | $12 | 128,000 | 65536 |
487
+ | openrouter/openai/o1-mini-2024-09-12 | $3 | $12 | 128,000 | 65536 |
488
+ | openrouter/openai/o1-preview | $15 | $60 | 128,000 | 32768 |
489
+ | openrouter/openai/o1-preview-2024-09-12 | $15 | $60 | 128,000 | 32768 |
490
+ | openrouter/openai/gpt-4o | $2.5 | $10 | 128,000 | 4096 |
491
+ | openrouter/openai/gpt-4o-2024-05-13 | $5 | $15 | 128,000 | 4096 |
492
+ | openrouter/openai/gpt-4-vision-preview | $10 | $30 | nan | nan |
493
+ | openrouter/openai/gpt-3.5-turbo | $1.5 | $2 | nan | nan |
494
+ | openrouter/openai/gpt-3.5-turbo-16k | $3 | $4 | nan | nan |
495
+ | openrouter/openai/gpt-4 | $30 | $60 | nan | nan |
496
+ | openrouter/anthropic/claude-instant-v1 | $1.63 | $5.51 | nan | 8191 |
497
+ | openrouter/anthropic/claude-2 | $11.02 | $32.68 | nan | 8191 |
498
+ | openrouter/anthropic/claude-3-opus | $15 | $75 | 200,000 | 4096 |
499
+ | openrouter/google/palm-2-chat-bison | $0.5 | $0.5 | nan | nan |
500
+ | openrouter/google/palm-2-codechat-bison | $0.5 | $0.5 | nan | nan |
501
+ | openrouter/meta-llama/llama-2-13b-chat | $0.2 | $0.2 | nan | nan |
502
+ | openrouter/meta-llama/llama-2-70b-chat | $1.5 | $1.5 | nan | nan |
503
+ | openrouter/meta-llama/codellama-34b-instruct | $0.5 | $0.5 | nan | nan |
504
+ | openrouter/nousresearch/nous-hermes-llama2-13b | $0.2 | $0.2 | nan | nan |
505
+ | openrouter/mancer/weaver | $5.62 | $5.62 | nan | nan |
506
+ | openrouter/gryphe/mythomax-l2-13b | $1.88 | $1.88 | nan | nan |
507
+ | openrouter/jondurbin/airoboros-l2-70b-2.1 | $13.88 | $13.88 | nan | nan |
508
+ | openrouter/undi95/remm-slerp-l2-13b | $1.88 | $1.88 | nan | nan |
509
+ | openrouter/pygmalionai/mythalion-13b | $1.88 | $1.88 | nan | nan |
510
+ | openrouter/mistralai/mistral-7b-instruct | $0.13 | $0.13 | nan | nan |
511
+ | openrouter/mistralai/mistral-7b-instruct:free | $0 | $0 | nan | nan |
512
+ | j2-ultra | $15 | $15 | 8,192 | 8192 |
513
+ | jamba-1.5-mini@001 | $0.2 | $0.4 | 256,000 | 256000 |
514
+ | jamba-1.5-large@001 | $2 | $8 | 256,000 | 256000 |
515
+ | jamba-1.5 | $0.2 | $0.4 | 256,000 | 256000 |
516
+ | jamba-1.5-mini | $0.2 | $0.4 | 256,000 | 256000 |
517
+ | jamba-1.5-large | $2 | $8 | 256,000 | 256000 |
518
+ | j2-mid | $10 | $10 | 8,192 | 8192 |
519
+ | j2-light | $3 | $3 | 8,192 | 8192 |
520
+ | dolphin | $0.5 | $0.5 | 16,384 | 16384 |
521
+ | chatdolphin | $0.5 | $0.5 | 16,384 | 16384 |
522
+ | luminous-base | $30 | $33 | nan | nan |
523
+ | luminous-base-control | $37.5 | $41.25 | nan | nan |
524
+ | luminous-extended | $45 | $49.5 | nan | nan |
525
+ | luminous-extended-control | $56.25 | $61.88 | nan | nan |
526
+ | luminous-supreme | $175 | $192.5 | nan | nan |
527
+ | luminous-supreme-control | $218.75 | $240.62 | nan | nan |
528
+ | ai21.j2-mid-v1 | $12.5 | $12.5 | 8,191 | 8191 |
529
+ | ai21.j2-ultra-v1 | $18.8 | $18.8 | 8,191 | 8191 |
530
+ | ai21.jamba-instruct-v1:0 | $0.5 | $0.7 | 70,000 | 4096 |
531
+ | amazon.titan-text-lite-v1 | $0.3 | $0.4 | 42,000 | 4000 |
532
+ | amazon.titan-text-express-v1 | $1.3 | $1.7 | 42,000 | 8000 |
533
+ | amazon.titan-text-premier-v1:0 | $0.5 | $1.5 | 42,000 | 32000 |
534
+ | amazon.titan-embed-text-v1 | $0.1 | $0 | 8,192 | nan |
535
+ | amazon.titan-embed-text-v2:0 | $0.2 | $0 | 8,192 | nan |
536
+ | mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32,000 | 8191 |
537
+ | mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32,000 | 8191 |
538
+ | mistral.mistral-large-2402-v1:0 | $8 | $24 | 32,000 | 8191 |
539
+ | mistral.mistral-large-2407-v1:0 | $3 | $9 | 128,000 | 8191 |
540
+ | mistral.mistral-small-2402-v1:0 | $1 | $3 | 32,000 | 8191 |
541
+ | bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32,000 | 8191 |
542
+ | bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32,000 | 8191 |
543
+ | bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 | $0.59 | $0.91 | 32,000 | 8191 |
544
+ | bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32,000 | 8191 |
545
+ | bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32,000 | 8191 |
546
+ | bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 | $0.2 | $0.26 | 32,000 | 8191 |
547
+ | bedrock/us-east-1/mistral.mistral-large-2402-v1:0 | $8 | $24 | 32,000 | 8191 |
548
+ | bedrock/us-west-2/mistral.mistral-large-2402-v1:0 | $8 | $24 | 32,000 | 8191 |
549
+ | bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 | $10.4 | $31.2 | 32,000 | 8191 |
550
+ | anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200,000 | 4096 |
551
+ | anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
552
+ | anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200,000 | 8192 |
553
+ | anthropic.claude-3-5-sonnet-latest-v2:0 | $3 | $15 | 200,000 | 4096 |
554
+ | anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200,000 | 4096 |
555
+ | anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200,000 | 4096 |
556
+ | us.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200,000 | 4096 |
557
+ | us.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
558
+ | us.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200,000 | 8192 |
559
+ | us.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200,000 | 4096 |
560
+ | us.anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200,000 | 4096 |
561
+ | eu.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200,000 | 4096 |
562
+ | eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
563
+ | eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200,000 | 8192 |
564
+ | eu.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200,000 | 4096 |
565
+ | eu.anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200,000 | 4096 |
566
+ | anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
567
+ | bedrock/us-east-1/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
568
+ | bedrock/us-west-2/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
569
+ | bedrock/ap-northeast-1/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
570
+ | bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
571
+ | bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
572
+ | bedrock/eu-central-1/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
573
+ | bedrock/eu-central-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
574
+ | bedrock/eu-central-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
575
+ | bedrock/us-east-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
576
+ | bedrock/us-east-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
577
+ | bedrock/us-west-2/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
578
+ | bedrock/us-west-2/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
579
+ | anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
580
+ | bedrock/us-east-1/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
581
+ | bedrock/us-west-2/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
582
+ | bedrock/ap-northeast-1/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
583
+ | bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
584
+ | bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
585
+ | bedrock/eu-central-1/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
586
+ | bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
587
+ | bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
588
+ | bedrock/us-east-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
589
+ | bedrock/us-east-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
590
+ | bedrock/us-west-2/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
591
+ | bedrock/us-west-2/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
592
+ | anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
593
+ | bedrock/us-east-1/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
594
+ | bedrock/us-west-2/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
595
+ | bedrock/ap-northeast-1/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
596
+ | bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
597
+ | bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
598
+ | bedrock/eu-central-1/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
599
+ | bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
600
+ | bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
601
+ | bedrock/us-east-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
602
+ | bedrock/us-east-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
603
+ | bedrock/us-west-2/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
604
+ | bedrock/us-west-2/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
605
+ | anthropic.claude-instant-v1 | $0.8 | $2.4 | 100,000 | 8191 |
606
+ | bedrock/us-east-1/anthropic.claude-instant-v1 | $0.8 | $2.4 | 100,000 | 8191 |
607
+ | bedrock/us-east-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
608
+ | bedrock/us-east-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
609
+ | bedrock/us-west-2/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
610
+ | bedrock/us-west-2/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
611
+ | bedrock/us-west-2/anthropic.claude-instant-v1 | $0.8 | $2.4 | 100,000 | 8191 |
612
+ | bedrock/ap-northeast-1/anthropic.claude-instant-v1 | $2.23 | $7.55 | 100,000 | 8191 |
613
+ | bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
614
+ | bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
615
+ | bedrock/eu-central-1/anthropic.claude-instant-v1 | $2.48 | $8.38 | 100,000 | 8191 |
616
+ | bedrock/eu-central-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
617
+ | bedrock/eu-central-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
618
+ | cohere.command-text-v14 | $1.5 | $2 | 4,096 | 4096 |
619
+ | bedrock/*/1-month-commitment/cohere.command-text-v14 | -- | -- | 4,096 | 4096 |
620
+ | bedrock/*/6-month-commitment/cohere.command-text-v14 | -- | -- | 4,096 | 4096 |
621
+ | cohere.command-light-text-v14 | $0.3 | $0.6 | 4,096 | 4096 |
622
+ | bedrock/*/1-month-commitment/cohere.command-light-text-v14 | -- | -- | 4,096 | 4096 |
623
+ | bedrock/*/6-month-commitment/cohere.command-light-text-v14 | -- | -- | 4,096 | 4096 |
624
+ | cohere.command-r-plus-v1:0 | $3 | $15 | 128,000 | 4096 |
625
+ | cohere.command-r-v1:0 | $0.5 | $1.5 | 128,000 | 4096 |
626
+ | cohere.embed-english-v3 | $0.1 | $0 | 512 | nan |
627
+ | cohere.embed-multilingual-v3 | $0.1 | $0 | 512 | nan |
628
+ | meta.llama2-13b-chat-v1 | $0.75 | $1 | 4,096 | 4096 |
629
+ | meta.llama2-70b-chat-v1 | $1.95 | $2.56 | 4,096 | 4096 |
630
+ | meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8,192 | 8192 |
631
+ | bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8,192 | 8192 |
632
+ | bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8,192 | 8192 |
633
+ | bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | $0.36 | $0.72 | 8,192 | 8192 |
634
+ | bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | $0.35 | $0.69 | 8,192 | 8192 |
635
+ | bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | $0.32 | $0.65 | 8,192 | 8192 |
636
+ | bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | $0.39 | $0.78 | 8,192 | 8192 |
637
+ | bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | $0.5 | $1.01 | 8,192 | 8192 |
638
+ | meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8,192 | 8192 |
639
+ | bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8,192 | 8192 |
640
+ | bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8,192 | 8192 |
641
+ | bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 | $3.18 | $4.2 | 8,192 | 8192 |
642
+ | bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 | $3.05 | $4.03 | 8,192 | 8192 |
643
+ | bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 | $2.86 | $3.78 | 8,192 | 8192 |
644
+ | bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 | $3.45 | $4.55 | 8,192 | 8192 |
645
+ | bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 | $4.45 | $5.88 | 8,192 | 8192 |
646
+ | meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | 128,000 | 2048 |
647
+ | meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | 128,000 | 2048 |
648
+ | meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16 | 128,000 | 4096 |
649
+ | meta.llama3-2-1b-instruct-v1:0 | $0.1 | $0.1 | 128,000 | 4096 |
650
+ | us.meta.llama3-2-1b-instruct-v1:0 | $0.1 | $0.1 | 128,000 | 4096 |
651
+ | eu.meta.llama3-2-1b-instruct-v1:0 | $0.13 | $0.13 | 128,000 | 4096 |
652
+ | meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | 128,000 | 4096 |
653
+ | us.meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | 128,000 | 4096 |
654
+ | eu.meta.llama3-2-3b-instruct-v1:0 | $0.19 | $0.19 | 128,000 | 4096 |
655
+ | meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | 128,000 | 4096 |
656
+ | us.meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | 128,000 | 4096 |
657
+ | meta.llama3-2-90b-instruct-v1:0 | $2 | $2 | 128,000 | 4096 |
658
+ | us.meta.llama3-2-90b-instruct-v1:0 | $2 | $2 | 128,000 | 4096 |
659
+ | 512-x-512/50-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
660
+ | 512-x-512/max-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
661
+ | max-x-max/50-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
662
+ | max-x-max/max-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
663
+ | 1024-x-1024/50-steps/stability.stable-diffusion-xl-v1 | -- | -- | 77 | nan |
664
+ | 1024-x-1024/max-steps/stability.stable-diffusion-xl-v1 | -- | -- | 77 | nan |
665
+ | sagemaker/meta-textgeneration-llama-2-7b | $0 | $0 | 4,096 | 4096 |
666
+ | sagemaker/meta-textgeneration-llama-2-7b-f | $0 | $0 | 4,096 | 4096 |
667
+ | sagemaker/meta-textgeneration-llama-2-13b | $0 | $0 | 4,096 | 4096 |
668
+ | sagemaker/meta-textgeneration-llama-2-13b-f | $0 | $0 | 4,096 | 4096 |
669
+ | sagemaker/meta-textgeneration-llama-2-70b | $0 | $0 | 4,096 | 4096 |
670
+ | sagemaker/meta-textgeneration-llama-2-70b-b-f | $0 | $0 | 4,096 | 4096 |
671
+ | together-ai-up-to-4b | $0.1 | $0.1 | nan | nan |
672
+ | together-ai-4.1b-8b | $0.2 | $0.2 | nan | nan |
673
+ | together-ai-8.1b-21b | $0.3 | $0.3 | nan | nan |
674
+ | together-ai-21.1b-41b | $0.8 | $0.8 | nan | nan |
675
+ | together-ai-41.1b-80b | $0.9 | $0.9 | nan | nan |
676
+ | together-ai-81.1b-110b | $1.8 | $1.8 | nan | nan |
677
+ | together-ai-embedding-up-to-150m | $0.01 | $0 | nan | nan |
678
+ | together-ai-embedding-151m-to-350m | $0.02 | $0 | nan | nan |
679
+ | together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.6 | $0.6 | nan | nan |
680
+ | together_ai/mistralai/Mistral-7B-Instruct-v0.1 | -- | -- | nan | nan |
681
+ | together_ai/togethercomputer/CodeLlama-34b-Instruct | -- | -- | nan | nan |
682
+ | ollama/codegemma | $0 | $0 | 8,192 | 8192 |
683
+ | ollama/codegeex4 | $0 | $0 | 32,768 | 8192 |
684
+ | ollama/deepseek-coder-v2-instruct | $0 | $0 | 32,768 | 8192 |
685
+ | ollama/deepseek-coder-v2-base | $0 | $0 | 8,192 | 8192 |
686
+ | ollama/deepseek-coder-v2-lite-instruct | $0 | $0 | 32,768 | 8192 |
687
+ | ollama/deepseek-coder-v2-lite-base | $0 | $0 | 8,192 | 8192 |
688
+ | ollama/internlm2_5-20b-chat | $0 | $0 | 32,768 | 8192 |
689
+ | ollama/llama2 | $0 | $0 | 4,096 | 4096 |
690
+ | ollama/llama2:7b | $0 | $0 | 4,096 | 4096 |
691
+ | ollama/llama2:13b | $0 | $0 | 4,096 | 4096 |
692
+ | ollama/llama2:70b | $0 | $0 | 4,096 | 4096 |
693
+ | ollama/llama2-uncensored | $0 | $0 | 4,096 | 4096 |
694
+ | ollama/llama3 | $0 | $0 | 8,192 | 8192 |
695
+ | ollama/llama3:8b | $0 | $0 | 8,192 | 8192 |
696
+ | ollama/llama3:70b | $0 | $0 | 8,192 | 8192 |
697
+ | ollama/llama3.1 | $0 | $0 | 8,192 | 8192 |
698
+ | ollama/mistral-large-instruct-2407 | $0 | $0 | 65,536 | 8192 |
699
+ | ollama/mistral | $0 | $0 | 8,192 | 8192 |
700
+ | ollama/mistral-7B-Instruct-v0.1 | $0 | $0 | 8,192 | 8192 |
701
+ | ollama/mistral-7B-Instruct-v0.2 | $0 | $0 | 32,768 | 32768 |
702
+ | ollama/mixtral-8x7B-Instruct-v0.1 | $0 | $0 | 32,768 | 32768 |
703
+ | ollama/mixtral-8x22B-Instruct-v0.1 | $0 | $0 | 65,536 | 65536 |
704
+ | ollama/codellama | $0 | $0 | 4,096 | 4096 |
705
+ | ollama/orca-mini | $0 | $0 | 4,096 | 4096 |
706
+ | ollama/vicuna | $0 | $0 | 2,048 | 2048 |
707
+ | deepinfra/lizpreciatior/lzlv_70b_fp16_hf | $0.7 | $0.9 | 4,096 | 4096 |
708
+ | deepinfra/Gryphe/MythoMax-L2-13b | $0.22 | $0.22 | 4,096 | 4096 |
709
+ | deepinfra/mistralai/Mistral-7B-Instruct-v0.1 | $0.13 | $0.13 | 32,768 | 8191 |
710
+ | deepinfra/meta-llama/Llama-2-70b-chat-hf | $0.7 | $0.9 | 4,096 | 4096 |
711
+ | deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b | $0.27 | $0.27 | 32,768 | 8191 |
712
+ | deepinfra/codellama/CodeLlama-34b-Instruct-hf | $0.6 | $0.6 | 4,096 | 4096 |
713
+ | deepinfra/deepinfra/mixtral | $0.27 | $0.27 | 32,000 | 4096 |
714
+ | deepinfra/Phind/Phind-CodeLlama-34B-v2 | $0.6 | $0.6 | 16,384 | 4096 |
715
+ | deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.27 | $0.27 | 32,768 | 8191 |
716
+ | deepinfra/deepinfra/airoboros-70b | $0.7 | $0.9 | 4,096 | 4096 |
717
+ | deepinfra/01-ai/Yi-34B-Chat | $0.6 | $0.6 | 4,096 | 4096 |
718
+ | deepinfra/01-ai/Yi-6B-200K | $0.13 | $0.13 | 200,000 | 4096 |
719
+ | deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 | $0.7 | $0.9 | 4,096 | 4096 |
720
+ | deepinfra/meta-llama/Llama-2-13b-chat-hf | $0.22 | $0.22 | 4,096 | 4096 |
721
+ | deepinfra/amazon/MistralLite | $0.2 | $0.2 | 32,768 | 8191 |
722
+ | deepinfra/meta-llama/Llama-2-7b-chat-hf | $0.13 | $0.13 | 4,096 | 4096 |
723
+ | deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | $0.08 | $0.08 | 8,191 | 4096 |
724
+ | deepinfra/meta-llama/Meta-Llama-3-70B-Instruct | $0.59 | $0.79 | 8,191 | 4096 |
725
+ | deepinfra/01-ai/Yi-34B-200K | $0.6 | $0.6 | 200,000 | 4096 |
726
+ | deepinfra/openchat/openchat_3.5 | $0.13 | $0.13 | 4,096 | 4096 |
727
+ | perplexity/codellama-34b-instruct | $0.35 | $1.4 | 16,384 | 16384 |
728
+ | perplexity/codellama-70b-instruct | $0.7 | $2.8 | 16,384 | 16384 |
729
+ | perplexity/llama-3.1-70b-instruct | $1 | $1 | 131,072 | 131072 |
730
+ | perplexity/llama-3.1-8b-instruct | $0.2 | $0.2 | 131,072 | 131072 |
731
+ | perplexity/llama-3.1-sonar-huge-128k-online | $5 | $5 | 127,072 | 127072 |
732
+ | perplexity/llama-3.1-sonar-large-128k-online | $1 | $1 | 127,072 | 127072 |
733
+ | perplexity/llama-3.1-sonar-large-128k-chat | $1 | $1 | 131,072 | 131072 |
734
+ | perplexity/llama-3.1-sonar-small-128k-chat | $0.2 | $0.2 | 131,072 | 131072 |
735
+ | perplexity/llama-3.1-sonar-small-128k-online | $0.2 | $0.2 | 127,072 | 127072 |
736
+ | perplexity/pplx-7b-chat | $0.07 | $0.28 | 8,192 | 8192 |
737
+ | perplexity/pplx-70b-chat | $0.7 | $2.8 | 4,096 | 4096 |
738
+ | perplexity/pplx-7b-online | $0 | $0.28 | 4,096 | 4096 |
739
+ | perplexity/pplx-70b-online | $0 | $2.8 | 4,096 | 4096 |
740
+ | perplexity/llama-2-70b-chat | $0.7 | $2.8 | 4,096 | 4096 |
741
+ | perplexity/mistral-7b-instruct | $0.07 | $0.28 | 4,096 | 4096 |
742
+ | perplexity/mixtral-8x7b-instruct | $0.07 | $0.28 | 4,096 | 4096 |
743
+ | perplexity/sonar-small-chat | $0.07 | $0.28 | 16,384 | 16384 |
744
+ | perplexity/sonar-small-online | $0 | $0.28 | 12,000 | 12000 |
745
+ | perplexity/sonar-medium-chat | $0.6 | $1.8 | 16,384 | 16384 |
746
+ | perplexity/sonar-medium-online | $0 | $1.8 | 12,000 | 12000 |
747
+ | fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct | $0.1 | $0.1 | 16,384 | 16384 |
748
+ | fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | $0.1 | $0.1 | 16,384 | 16384 |
749
+ | fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct | $0.2 | $0.2 | 16,384 | 16384 |
750
+ | accounts/fireworks/models/llama-v3p2-90b-vision-instruct | $0.9 | $0.9 | 16,384 | 16384 |
751
+ | fireworks_ai/accounts/fireworks/models/firefunction-v2 | $0.9 | $0.9 | 8,192 | 8192 |
752
+ | fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf | $1.2 | $1.2 | 65,536 | 65536 |
753
+ | fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct | $0.9 | $0.9 | 32,768 | 32768 |
754
+ | fireworks_ai/accounts/fireworks/models/yi-large | $3 | $3 | 32,768 | 32768 |
755
+ | fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct | $1.2 | $1.2 | 65,536 | 65536 |
756
+ | fireworks_ai/nomic-ai/nomic-embed-text-v1.5 | $0.01 | $0 | 8,192 | nan |
757
+ | fireworks_ai/nomic-ai/nomic-embed-text-v1 | $0.01 | $0 | 8,192 | nan |
758
+ | fireworks_ai/WhereIsAI/UAE-Large-V1 | $0.02 | $0 | 512 | nan |
759
+ | fireworks_ai/thenlper/gte-large | $0.02 | $0 | 512 | nan |
760
+ | fireworks_ai/thenlper/gte-base | $0.01 | $0 | 512 | nan |
761
+ | fireworks-ai-up-to-16b | $0.2 | $0.2 | nan | nan |
762
+ | fireworks-ai-16.1b-to-80b | $0.9 | $0.9 | nan | nan |
763
+ | fireworks-ai-moe-up-to-56b | $0.5 | $0.5 | nan | nan |
764
+ | fireworks-ai-56b-to-176b | $1.2 | $1.2 | nan | nan |
765
+ | fireworks-ai-default | $0 | $0 | nan | nan |
766
+ | fireworks-ai-embedding-up-to-150m | $0.01 | $0 | nan | nan |
767
+ | fireworks-ai-embedding-150m-to-350m | $0.02 | $0 | nan | nan |
768
+ | anyscale/mistralai/Mistral-7B-Instruct-v0.1 | $0.15 | $0.15 | 16,384 | 16384 |
769
+ | anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.15 | $0.15 | 16,384 | 16384 |
770
+ | anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 | $0.9 | $0.9 | 65,536 | 65536 |
771
+ | anyscale/HuggingFaceH4/zephyr-7b-beta | $0.15 | $0.15 | 16,384 | 16384 |
772
+ | anyscale/google/gemma-7b-it | $0.15 | $0.15 | 8,192 | 8192 |
773
+ | anyscale/meta-llama/Llama-2-7b-chat-hf | $0.15 | $0.15 | 4,096 | 4096 |
774
+ | anyscale/meta-llama/Llama-2-13b-chat-hf | $0.25 | $0.25 | 4,096 | 4096 |
775
+ | anyscale/meta-llama/Llama-2-70b-chat-hf | $1 | $1 | 4,096 | 4096 |
776
+ | anyscale/codellama/CodeLlama-34b-Instruct-hf | $1 | $1 | 4,096 | 4096 |
777
+ | anyscale/codellama/CodeLlama-70b-Instruct-hf | $1 | $1 | 4,096 | 4096 |
778
+ | anyscale/meta-llama/Meta-Llama-3-8B-Instruct | $0.15 | $0.15 | 8,192 | 8192 |
779
+ | anyscale/meta-llama/Meta-Llama-3-70B-Instruct | $1 | $1 | 8,192 | 8192 |
780
+ | cloudflare/@cf/meta/llama-2-7b-chat-fp16 | $1.92 | $1.92 | 3,072 | 3072 |
781
+ | cloudflare/@cf/meta/llama-2-7b-chat-int8 | $1.92 | $1.92 | 2,048 | 2048 |
782
+ | cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 | $1.92 | $1.92 | 8,192 | 8192 |
783
+ | cloudflare/@hf/thebloke/codellama-7b-instruct-awq | $1.92 | $1.92 | 4,096 | 4096 |
784
+ | voyage/voyage-01 | $0.1 | $0 | 4,096 | nan |
785
+ | voyage/voyage-lite-01 | $0.1 | $0 | 4,096 | nan |
786
+ | voyage/voyage-large-2 | $0.12 | $0 | 16,000 | nan |
787
+ | voyage/voyage-law-2 | $0.12 | $0 | 16,000 | nan |
788
+ | voyage/voyage-code-2 | $0.12 | $0 | 16,000 | nan |
789
+ | voyage/voyage-2 | $0.1 | $0 | 4,000 | nan |
790
+ | voyage/voyage-lite-02-instruct | $0.1 | $0 | 4,000 | nan |
791
+ | voyage/voyage-finance-2 | $0.12 | $0 | 32,000 | nan |
792
+ | databricks/databricks-meta-llama-3-1-405b-instruct | $5 | $15 | 128,000 | 128000 |
793
+ | databricks/databricks-meta-llama-3-1-70b-instruct | $1 | $3 | 128,000 | 128000 |
794
+ | databricks/databricks-dbrx-instruct | $0.75 | $2.25 | 32,768 | 32768 |
795
+ | databricks/databricks-meta-llama-3-70b-instruct | $1 | $3 | 128,000 | 128000 |
796
+ | databricks/databricks-llama-2-70b-chat | $0.5 | $1.5 | 4,096 | 4096 |
797
+ | databricks/databricks-mixtral-8x7b-instruct | $0.5 | $1 | 4,096 | 4096 |
798
+ | databricks/databricks-mpt-30b-instruct | $1 | $1 | 8,192 | 8192 |
799
+ | databricks/databricks-mpt-7b-instruct | $0.5 | $0 | 8,192 | 8192 |
800
+ | databricks/databricks-bge-large-en | $0.1 | $0 | 512 | nan |
801
+ | databricks/databricks-gte-large-en | $0.13 | $0 | 8,192 | nan |
802
+ | azure/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128,000 | 16384 |
803
+ | amazon.titan-embed-image-v1 | $0.8 | $0 | 128 | nan |
804
+ | azure_ai/mistral-large-2407 | $2 | $6 | 128,000 | 4096 |
805
+ | azure_ai/ministral-3b | $0.04 | $0.04 | 128,000 | 4096 |
806
+ | azure_ai/Llama-3.2-11B-Vision-Instruct | $0.37 | $0.37 | 128,000 | 2048 |
807
+ | azure_ai/Llama-3.2-90B-Vision-Instruct | $2.04 | $2.04 | 128,000 | 2048 |
808
+ | azure_ai/Phi-3.5-mini-instruct | $0.13 | $0.52 | 128,000 | 4096 |
809
+ | azure_ai/Phi-3.5-vision-instruct | $0.13 | $0.52 | 128,000 | 4096 |
810
+ | azure_ai/Phi-3.5-MoE-instruct | $0.16 | $0.64 | 128,000 | 4096 |
811
+ | azure_ai/Phi-3-mini-4k-instruct | $0.13 | $0.52 | 4,096 | 4096 |
812
+ | azure_ai/Phi-3-mini-128k-instruct | $0.13 | $0.52 | 128,000 | 4096 |
813
+ | azure_ai/Phi-3-small-8k-instruct | $0.15 | $0.6 | 8,192 | 4096 |
814
+ | azure_ai/Phi-3-small-128k-instruct | $0.15 | $0.6 | 128,000 | 4096 |
815
+ | azure_ai/Phi-3-medium-4k-instruct | $0.17 | $0.68 | 4,096 | 4096 |
816
+ | azure_ai/Phi-3-medium-128k-instruct | $0.17 | $0.68 | 128,000 | 4096 |
817
+ | xai/grok-beta | $5 | $15 | 131,072 | 131072 |
818
+ | claude-3-5-haiku-20241022 | $0.8 | $4 | 200,000 | 8192 |
819
+ | vertex_ai/claude-3-5-haiku@20241022 | $1 | $5 | 200,000 | 8192 |
820
+ | openrouter/anthropic/claude-3-5-haiku | $1 | $5 | nan | nan |
821
+ | openrouter/anthropic/claude-3-5-haiku-20241022 | $1 | $5 | 200,000 | 8192 |
822
+ | anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200,000 | 8192 |
823
+ | us.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200,000 | 8192 |
824
+ | eu.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.25 | $1.25 | 200,000 | 8192 |
825
+ | stability.sd3-large-v1:0 | -- | -- | 77 | nan |
826
+ | gpt-4o-2024-11-20 | $2.5 | $10 | 128,000 | 16384 |
827
+ | ft:gpt-4o-2024-11-20 | $3.75 | $15 | 128,000 | 16384 |
828
+ | azure/gpt-4o-2024-11-20 | $2.75 | $11 | 128,000 | 16384 |
829
+ | azure/global-standard/gpt-4o-2024-11-20 | $2.5 | $10 | 128,000 | 16384 |
830
+ | groq/llama-3.2-1b-preview | $0.04 | $0.04 | 8,192 | 8192 |
831
+ | groq/llama-3.2-3b-preview | $0.06 | $0.06 | 8,192 | 8192 |
832
+ | groq/llama-3.2-11b-text-preview | $0.18 | $0.18 | 8,192 | 8192 |
833
+ | groq/llama-3.2-11b-vision-preview | $0.18 | $0.18 | 8,192 | 8192 |
834
+ | groq/llama-3.2-90b-text-preview | $0.9 | $0.9 | 8,192 | 8192 |
835
+ | groq/llama-3.2-90b-vision-preview | $0.9 | $0.9 | 8,192 | 8192 |
836
+ | vertex_ai/claude-3-sonnet | $3 | $15 | 200,000 | 4096 |
837
+ | vertex_ai/claude-3-5-sonnet | $3 | $15 | 200,000 | 8192 |
838
+ | vertex_ai/claude-3-5-sonnet-v2 | $3 | $15 | 200,000 | 8192 |
839
+ | vertex_ai/claude-3-haiku | $0.25 | $1.25 | 200,000 | 4096 |
840
+ | vertex_ai/claude-3-5-haiku | $1 | $5 | 200,000 | 8192 |
841
+ | vertex_ai/claude-3-opus | $15 | $75 | 200,000 | 4096 |
842
+ | gemini/gemini-exp-1114 | $0 | $0 | 1,048,576 | 8192 |
843
+ | openrouter/qwen/qwen-2.5-coder-32b-instruct | $0.18 | $0.18 | 33,792 | 33792 |
844
+ | us.meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | 128,000 | 2048 |
845
+ | us.meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | 128,000 | 2048 |
846
+ | us.meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16 | 128,000 | 4096 |
847
+ | stability.stable-image-ultra-v1:0 | -- | -- | 77 | nan |
848
+ | fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct | $0.9 | $0.9 | 4,096 | 4096 |
849
+ | omni-moderation-latest | $0 | $0 | 32,768 | 0 |
850
+ | omni-moderation-latest-intents | $0 | $0 | 32,768 | 0 |
851
+ | omni-moderation-2024-09-26 | $0 | $0 | 32,768 | 0 |
852
+ | gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128,000 | 16384 |
853
+ | gpt-4o-mini-audio-preview-2024-12-17 | $0.15 | $0.6 | 128,000 | 16384 |
854
+ | o1 | $15 | $60 | 200,000 | 100000 |
855
+ | o1-2024-12-17 | $15 | $60 | 200,000 | 100000 |
856
+ | gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128,000 | 4096 |
857
+ | gpt-4o-realtime-preview | $5 | $20 | 128,000 | 4096 |
858
+ | gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128,000 | 4096 |
859
+ | gpt-4o-mini-realtime-preview | $0.6 | $2.4 | 128,000 | 4096 |
860
+ | gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128,000 | 4096 |
861
+ | azure/o1 | $15 | $60 | 200,000 | 100000 |
862
+ | azure_ai/Llama-3.3-70B-Instruct | $0.71 | $0.71 | 128,000 | 2048 |
863
+ | mistral/mistral-large-2411 | $2 | $6 | 128,000 | 128000 |
864
+ | mistral/pixtral-large-latest | $2 | $6 | 128,000 | 128000 |
865
+ | mistral/pixtral-large-2411 | $2 | $6 | 128,000 | 128000 |
866
+ | deepseek/deepseek-chat | $0.27 | $1.1 | 65,536 | 8192 |
867
+ | deepseek/deepseek-coder | $0.14 | $0.28 | 128,000 | 4096 |
868
+ | groq/llama-3.3-70b-versatile | $0.59 | $0.79 | 128,000 | 8192 |
869
+ | groq/llama-3.3-70b-specdec | $0.59 | $0.99 | 8,192 | 8192 |
870
+ | friendliai/meta-llama-3.1-8b-instruct | $0.1 | $0.1 | 8,192 | 8192 |
871
+ | friendliai/meta-llama-3.1-70b-instruct | $0.6 | $0.6 | 8,192 | 8192 |
872
+ | gemini-2.0-flash-exp | $0.15 | $0.6 | 1,048,576 | 8192 |
873
+ | gemini/gemini-2.0-flash-exp | $0 | $0 | 1,048,576 | 8192 |
874
+ | vertex_ai/mistral-large@2411-001 | $2 | $6 | 128,000 | 8191 |
875
+ | vertex_ai/mistral-large-2411 | $2 | $6 | 128,000 | 8191 |
876
+ | text-embedding-005 | $0.1 | $0 | 2,048 | nan |
877
+ | gemini/gemini-1.5-flash-8b | $0 | $0 | 1,048,576 | 8192 |
878
+ | gemini/gemini-exp-1206 | $0 | $0 | 2,097,152 | 8192 |
879
+ | command-r7b-12-2024 | $0.15 | $0.04 | 128,000 | 4096 |
880
+ | rerank-v3.5 | $0 | $0 | 4,096 | 4096 |
881
+ | openrouter/deepseek/deepseek-chat | $0.14 | $0.28 | 65,536 | 8192 |
882
+ | openrouter/openai/o1 | $15 | $60 | 200,000 | 100000 |
883
+ | amazon.nova-micro-v1:0 | $0.04 | $0.14 | 300,000 | 4096 |
884
+ | amazon.nova-lite-v1:0 | $0.06 | $0.24 | 128,000 | 4096 |
885
+ | amazon.nova-pro-v1:0 | $0.8 | $3.2 | 300,000 | 4096 |
886
+ | meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | 128,000 | 4096 |
887
+ | together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.18 | $0.18 | nan | nan |
888
+ | together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.88 | $0.88 | nan | nan |
889
+ | together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | $3.5 | $3.5 | nan | nan |
890
+ | deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct | $0.9 | $0.9 | 32,768 | 32768 |
891
+ | fireworks_ai/accounts/fireworks/models/deepseek-v3 | $0.9 | $0.9 | 128,000 | 8192 |
892
+ | voyage/voyage-3-large | $0.18 | $0 | 32,000 | nan |
893
+ | voyage/voyage-3 | $0.06 | $0 | 32,000 | nan |
894
+ | voyage/voyage-3-lite | $0.02 | $0 | 32,000 | nan |
895
+ | voyage/voyage-code-3 | $0.18 | $0 | 32,000 | nan |
896
+ | voyage/voyage-multimodal-3 | $0.12 | $0 | 32,000 | nan |
897
+ | voyage/rerank-2 | $0.05 | $0 | 16,000 | 16000 |
898
+ | voyage/rerank-2-lite | $0.02 | $0 | 8,000 | 8000 |
899
+ | databricks/meta-llama-3.3-70b-instruct | $1 | $3 | 128,000 | 128000 |
900
+ | sambanova/Meta-Llama-3.1-8B-Instruct | $0.1 | $0.2 | 16,000 | 16000 |
901
+ | sambanova/Meta-Llama-3.1-70B-Instruct | $0.6 | $1.2 | 128,000 | 128000 |
902
+ | sambanova/Meta-Llama-3.1-405B-Instruct | $5 | $10 | 16,000 | 16000 |
903
+ | sambanova/Meta-Llama-3.2-1B-Instruct | $0.4 | $0.8 | 16,000 | 16000 |
904
+ | sambanova/Meta-Llama-3.2-3B-Instruct | $0.8 | $1.6 | 4,000 | 4000 |
905
+ | sambanova/Meta-Llama-3.3-70B-Instruct | $0.6 | $1.2 | 128,000 | 128000 |
906
+ | sambanova/Qwen2.5-Coder-32B-Instruct | $1.5 | $3 | 8,000 | 8000 |
907
+ | sambanova/Qwen2.5-72B-Instruct | $2 | $4 | 8,000 | 8000 |
908
+ | o3-mini | $1.1 | $4.4 | 200,000 | 100000 |
909
+ | o3-mini-2025-01-31 | $1.1 | $4.4 | 200,000 | 100000 |
910
+ | azure/o3-mini-2025-01-31 | $1.1 | $4.4 | 200,000 | 100000 |
911
+ | azure/o3-mini | $1.1 | $4.4 | 200,000 | 100000 |
912
+ | azure/o1-2024-12-17 | $15 | $60 | 200,000 | 100000 |
913
+ | azure_ai/deepseek-r1 | $1.35 | $5.4 | 128,000 | 8192 |
914
+ | deepseek/deepseek-reasoner | $0.55 | $2.19 | 65,536 | 8192 |
915
+ | xai/grok-2-vision-1212 | $2 | $10 | 32,768 | 32768 |
916
+ | xai/grok-2-vision-latest | $2 | $10 | 32,768 | 32768 |
917
+ | xai/grok-2-vision | $2 | $10 | 32,768 | 32768 |
918
+ | xai/grok-vision-beta | $5 | $15 | 8,192 | 8192 |
919
+ | xai/grok-2-1212 | $2 | $10 | 131,072 | 131072 |
920
+ | xai/grok-2 | $2 | $10 | 131,072 | 131072 |
921
+ | xai/grok-2-latest | $2 | $10 | 131,072 | 131072 |
922
+ | groq/deepseek-r1-distill-llama-70b | $0.75 | $0.99 | 131,072 | 131072 |
923
+ | gemini/gemini-2.0-flash | $0.1 | $0.4 | 1,048,576 | 8192 |
924
+ | gemini-2.0-flash-001 | $0.15 | $0.6 | 1,048,576 | 8192 |
925
+ | gemini-2.0-flash-thinking-exp | $0 | $0 | 1,048,576 | 8192 |
926
+ | gemini-2.0-flash-thinking-exp-01-21 | $0 | $0 | 1,048,576 | 65536 |
927
+ | gemini/gemini-2.0-flash-001 | $0.1 | $0.4 | 1,048,576 | 8192 |
928
+ | gemini/gemini-2.0-flash-lite-preview-02-05 | $0.08 | $0.3 | 1,048,576 | 8192 |
929
+ | gemini/gemini-2.0-flash-thinking-exp | $0 | $0 | 1,048,576 | 65536 |
930
+ | vertex_ai/codestral-2501 | $0.2 | $0.6 | 128,000 | 128000 |
931
+ | openrouter/deepseek/deepseek-r1 | $0.55 | $2.19 | 65,336 | 8192 |
932
+ | ai21.jamba-1-5-large-v1:0 | $2 | $8 | 256,000 | 256000 |
933
+ | ai21.jamba-1-5-mini-v1:0 | $0.2 | $0.4 | 256,000 | 256000 |
934
+ | us.amazon.nova-micro-v1:0 | $0.04 | $0.14 | 300,000 | 4096 |
935
+ | us.amazon.nova-lite-v1:0 | $0.06 | $0.24 | 128,000 | 4096 |
936
+ | us.amazon.nova-pro-v1:0 | $0.8 | $3.2 | 300,000 | 4096 |
937
+ | stability.sd3-5-large-v1:0 | -- | -- | 77 | nan |
938
+ | stability.stable-image-core-v1:0 | -- | -- | 77 | nan |
939
+ | stability.stable-image-core-v1:1 | -- | -- | 77 | nan |
940
+ | stability.stable-image-ultra-v1:1 | -- | -- | 77 | nan |
941
+ | together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.88 | $0.88 | nan | nan |
942
+ | together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | $0 | $0 | nan | nan |
943
+ | fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | $0.1 | $0.1 | 16,384 | 16384 |
944
+ | assemblyai/nano | -- | -- | nan | nan |
945
+ | assemblyai/best | -- | -- | nan | nan |
946
+ | azure/gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16,384 | 4096 |
947
+ | azure/gpt-3.5-turbo | $0.5 | $1.5 | 4,097 | 4096 |
948
+ | gemini-2.0-pro-exp-02-05 | $1.25 | $10 | 2,097,152 | 8192 |
949
+ | us.meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | 128,000 | 4096 |
950
+ | perplexity/sonar | $1 | $1 | 128,000 | nan |
951
+ | perplexity/sonar-pro | $3 | $15 | 200,000 | 8000 |
952
+ | openrouter/google/gemini-2.0-flash-001 | $0.1 | $0.4 | 1,048,576 | 8192 |
953
+ | gpt-4.5-preview | $75 | $150 | 128,000 | 16384 |
954
+ | gpt-4.5-preview-2025-02-27 | $75 | $150 | 128,000 | 16384 |
955
+ | azure_ai/Phi-4 | $0.12 | $0.5 | 16,384 | 16384 |
956
+ | cerebras/llama3.3-70b | $0.85 | $1.2 | 128,000 | 128000 |
957
+ | claude-3-5-haiku-latest | $1 | $5 | 200,000 | 8192 |
958
+ | claude-3-7-sonnet-latest | $3 | $15 | 200,000 | 128000 |
959
+ | claude-3-7-sonnet-20250219 | $3 | $15 | 200,000 | 128000 |
960
+ | vertex_ai/claude-3-7-sonnet@20250219 | $3 | $15 | 200,000 | 8192 |
961
+ | openrouter/anthropic/claude-3.7-sonnet | $3 | $15 | 200,000 | 8192 |
962
+ | openrouter/anthropic/claude-3.7-sonnet:beta | $3 | $15 | 200,000 | 8192 |
963
+ | amazon.rerank-v1:0 | $0 | $0 | 32,000 | 32000 |
964
+ | anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200,000 | 8192 |
965
+ | us.anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200,000 | 8192 |
966
+ | cohere.rerank-v3-5:0 | $0 | $0 | 32,000 | 32000 |
967
+ | jina-reranker-v2-base-multilingual | $0.02 | $0.02 | 1,024 | 1024 |
968
+ | bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
969
+ | azure/gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128,000 | 4096 |
970
+ | azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | 128,000 | 4096 |
971
+ | azure/us/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | 128,000 | 4096 |
972
+ | azure/gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128,000 | 4096 |
973
+ | azure/us/gpt-4o-realtime-preview-2024-10-01 | $5.5 | $22 | 128,000 | 4096 |
974
+ | azure/eu/gpt-4o-realtime-preview-2024-10-01 | $5.5 | $22 | 128,000 | 4096 |
975
+ | azure/us/o3-mini-2025-01-31 | $1.21 | $4.84 | 200,000 | 100000 |
976
+ | azure/eu/o3-mini-2025-01-31 | $1.21 | $4.84 | 200,000 | 100000 |
977
+ | azure/us/o1-mini-2024-09-12 | $1.21 | $4.84 | 128,000 | 65536 |
978
+ | azure/eu/o1-mini-2024-09-12 | $1.21 | $4.84 | 128,000 | 65536 |
979
+ | azure/us/o1-2024-12-17 | $16.5 | $66 | 200,000 | 100000 |
980
+ | azure/eu/o1-2024-12-17 | $16.5 | $66 | 200,000 | 100000 |
981
+ | azure/us/o1-preview-2024-09-12 | $16.5 | $66 | 128,000 | 32768 |
982
+ | azure/eu/o1-preview-2024-09-12 | $16.5 | $66 | 128,000 | 32768 |
983
+ | azure/us/gpt-4o-2024-11-20 | $2.75 | $11 | 128,000 | 16384 |
984
+ | azure/eu/gpt-4o-2024-11-20 | $2.75 | $11 | 128,000 | 16384 |
985
+ | azure/us/gpt-4o-2024-08-06 | $2.75 | $11 | 128,000 | 16384 |
986
+ | azure/eu/gpt-4o-2024-08-06 | $2.75 | $11 | 128,000 | 16384 |
987
+ | azure/us/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128,000 | 16384 |
988
+ | azure/eu/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128,000 | 16384 |
989
+ | azure_ai/deepseek-v3 | $1.14 | $4.56 | 128,000 | 8192 |
990
+ | azure_ai/mistral-nemo | $0.15 | $0.15 | 131,072 | 4096 |
991
+ | azure_ai/Phi-4-mini-instruct | $0.08 | $0.3 | 131,072 | 4096 |
992
+ | azure_ai/Phi-4-multimodal-instruct | $0.08 | $0.32 | 131,072 | 4096 |
993
+ | gemini/gemini-2.0-pro-exp-02-05 | $0 | $0 | 2,097,152 | 8192 |
994
+ | gemini/gemini-2.0-flash-thinking-exp-01-21 | $0 | $0 | 1,048,576 | 65536 |
995
+ | gemini/gemma-3-27b-it | $0 | $0 | 131,072 | 8192 |
996
+ | gemini/learnlm-1.5-pro-experimental | $0 | $0 | 32,767 | 8192 |
997
+ | vertex_ai/imagen-3.0-generate-002 | -- | -- | nan | nan |
998
+ | jamba-large-1.6 | $2 | $8 | 256,000 | 256000 |
999
+ | jamba-mini-1.6 | $0.2 | $0.4 | 256,000 | 256000 |
1000
+ | eu.amazon.nova-micro-v1:0 | $0.05 | $0.18 | 300,000 | 4096 |
1001
+ | eu.amazon.nova-lite-v1:0 | $0.08 | $0.31 | 128,000 | 4096 |
1002
+ | 1024-x-1024/50-steps/bedrock/amazon.nova-canvas-v1:0 | -- | -- | 2,600 | nan |
1003
+ | eu.amazon.nova-pro-v1:0 | $1.05 | $4.2 | 300,000 | 4096 |
1004
+ | us.deepseek.r1-v1:0 | $1.35 | $5.4 | 128,000 | 4096 |
1005
+ | snowflake/deepseek-r1 | -- | -- | 32,768 | 8192 |
1006
+ | snowflake/snowflake-arctic | -- | -- | 4,096 | 8192 |
1007
+ | snowflake/claude-3-5-sonnet | -- | -- | 18,000 | 8192 |
1008
+ | snowflake/mistral-large | -- | -- | 32,000 | 8192 |
1009
+ | snowflake/mistral-large2 | -- | -- | 128,000 | 8192 |
1010
+ | snowflake/reka-flash | -- | -- | 100,000 | 8192 |
1011
+ | snowflake/reka-core | -- | -- | 32,000 | 8192 |
1012
+ | snowflake/jamba-instruct | -- | -- | 256,000 | 8192 |
1013
+ | snowflake/jamba-1.5-mini | -- | -- | 256,000 | 8192 |
1014
+ | snowflake/jamba-1.5-large | -- | -- | 256,000 | 8192 |
1015
+ | snowflake/mixtral-8x7b | -- | -- | 32,000 | 8192 |
1016
+ | snowflake/llama2-70b-chat | -- | -- | 4,096 | 8192 |
1017
+ | snowflake/llama3-8b | -- | -- | 8,000 | 8192 |
1018
+ | snowflake/llama3-70b | -- | -- | 8,000 | 8192 |
1019
+ | snowflake/llama3.1-8b | -- | -- | 128,000 | 8192 |
1020
+ | snowflake/llama3.1-70b | -- | -- | 128,000 | 8192 |
1021
+ | snowflake/llama3.3-70b | -- | -- | 128,000 | 8192 |
1022
+ | snowflake/snowflake-llama-3.3-70b | -- | -- | 8,000 | 8192 |
1023
+ | snowflake/llama3.1-405b | -- | -- | 128,000 | 8192 |
1024
+ | snowflake/snowflake-llama-3.1-405b | -- | -- | 8,000 | 8192 |
1025
+ | snowflake/llama3.2-1b | -- | -- | 128,000 | 8192 |
1026
+ | snowflake/llama3.2-3b | -- | -- | 128,000 | 8192 |
1027
+ | snowflake/mistral-7b | -- | -- | 32,000 | 8192 |
1028
+ | snowflake/gemma-7b | -- | -- | 8,000 | 8192 |
1029
+ | azure/global/gpt-4o-2024-11-20 | $2.5 | $10 | 128,000 | 16384 |
1030
+ | azure/global/gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
1031
+ | o1-pro | $150 | $600 | 200,000 | 100000 |
1032
+ | o1-pro-2025-03-19 | $150 | $600 | 200,000 | 100000 |
1033
+ | gpt-4o-search-preview-2025-03-11 | $2.5 | $10 | 128,000 | 16384 |
1034
+ | gpt-4o-search-preview | $2.5 | $10 | 128,000 | 16384 |
1035
+ | gpt-4o-mini-search-preview-2025-03-11 | $0.15 | $0.6 | 128,000 | 16384 |
1036
+ | gpt-4o-mini-search-preview | $0.15 | $0.6 | 128,000 | 16384 |
1037
+ | azure/gpt-4.5-preview | $75 | $150 | 128,000 | 16384 |
1038
+ | azure_ai/mistral-small-2503 | $1 | $3 | 128,000 | 128000 |
1039
+ | text-embedding-large-exp-03-07 | $0.1 | $0 | 8,192 | nan |
1040
+ | gpt-4.1 | $2 | $8 | 1,047,576 | 32768 |
1041
+ | gpt-4.1-2025-04-14 | $2 | $8 | 1,047,576 | 32768 |
1042
+ | gpt-4.1-mini | $0.4 | $1.6 | 1,047,576 | 32768 |
1043
+ | gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1,047,576 | 32768 |
1044
+ | gpt-4.1-nano | $0.1 | $0.4 | 1,047,576 | 32768 |
1045
+ | gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1,047,576 | 32768 |
1046
+ | watsonx/ibm/granite-3-8b-instruct | $200 | $200 | 8,192 | 1024 |
1047
+ | computer-use-preview | $3 | $12 | 8,192 | 1024 |
1048
+ | o3 | $10 | $40 | 200,000 | 100000 |
1049
+ | o3-2025-04-16 | $10 | $40 | 200,000 | 100000 |
1050
+ | o4-mini | $1.1 | $4.4 | 200,000 | 100000 |
1051
+ | o4-mini-2025-04-16 | $1.1 | $4.4 | 200,000 | 100000 |
1052
+ | gpt-image-1 | -- | -- | nan | nan |
1053
+ | low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
1054
+ | medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
1055
+ | high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
1056
+ | low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
1057
+ | medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
1058
+ | high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
1059
+ | low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
1060
+ | medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
1061
+ | high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
1062
+ | gpt-4o-transcribe | $2.5 | $10 | 16,000 | 2000 |
1063
+ | gpt-4o-mini-transcribe | $1.25 | $5 | 16,000 | 2000 |
1064
+ | gpt-4o-mini-tts | $2.5 | $10 | nan | nan |
1065
+ | azure/computer-use-preview | $3 | $12 | 8,192 | 1024 |
1066
+ | azure/gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128,000 | 16384 |
1067
+ | azure/gpt-4o-mini-audio-preview-2024-12-17 | $2.5 | $10 | 128,000 | 16384 |
1068
+ | azure/gpt-4.1 | $2 | $8 | 1,047,576 | 32768 |
1069
+ | azure/gpt-4.1-2025-04-14 | $2 | $8 | 1,047,576 | 32768 |
1070
+ | azure/gpt-4.1-mini | $0.4 | $1.6 | 1,047,576 | 32768 |
1071
+ | azure/gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1,047,576 | 32768 |
1072
+ | azure/gpt-4.1-nano | $0.1 | $0.4 | 1,047,576 | 32768 |
1073
+ | azure/gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1,047,576 | 32768 |
1074
+ | azure/o3 | $10 | $40 | 200,000 | 100000 |
1075
+ | azure/o3-2025-04-16 | $10 | $40 | 200,000 | 100000 |
1076
+ | azure/o4-mini | $1.1 | $4.4 | 200,000 | 100000 |
1077
+ | azure/gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128,000 | 4096 |
1078
+ | azure/us/gpt-4o-realtime-preview-2024-12-17 | $5.5 | $22 | 128,000 | 4096 |
1079
+ | azure/eu/gpt-4o-realtime-preview-2024-12-17 | $5.5 | $22 | 128,000 | 4096 |
1080
+ | azure/o4-mini-2025-04-16 | $1.1 | $4.4 | 200,000 | 100000 |
1081
+ | azure/gpt-image-1 | -- | -- | nan | nan |
1082
+ | azure/low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
1083
+ | azure/medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
1084
+ | azure/high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
1085
+ | azure/low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
1086
+ | azure/medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
1087
+ | azure/high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
1088
+ | azure/low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
1089
+ | azure/medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
1090
+ | azure/high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
1091
+ | azure_ai/mistral-large-latest | $2 | $6 | 128,000 | 4096 |
1092
+ | xai/grok-3-beta | $3 | $15 | 131,072 | 131072 |
1093
+ | xai/grok-3-fast-beta | $5 | $25 | 131,072 | 131072 |
1094
+ | xai/grok-3-fast-latest | $5 | $25 | 131,072 | 131072 |
1095
+ | xai/grok-3-mini-beta | $0.3 | $0.5 | 131,072 | 131072 |
1096
+ | xai/grok-3-mini-fast-beta | $0.6 | $4 | 131,072 | 131072 |
1097
+ | xai/grok-3-mini-fast-latest | $0.6 | $4 | 131,072 | 131072 |
1098
+ | groq/whisper-large-v3 | -- | -- | nan | nan |
1099
+ | groq/whisper-large-v3-turbo | -- | -- | nan | nan |
1100
+ | groq/distil-whisper-large-v3-en | -- | -- | nan | nan |
1101
+ | meta_llama/Llama-4-Scout-17B-16E-Instruct-FP8 | -- | -- | 10,000,000 | 4028 |
1102
+ | meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | -- | -- | 1,000,000 | 4028 |
1103
+ | meta_llama/Llama-3.3-70B-Instruct | -- | -- | 128,000 | 4028 |
1104
+ | meta_llama/Llama-3.3-8B-Instruct | -- | -- | 128,000 | 4028 |
1105
+ | gemini-2.5-pro-exp-03-25 | $1.25 | $10 | 1,048,576 | 65535 |
1106
+ | gemini/gemini-2.5-pro-exp-03-25 | $0 | $0 | 1,048,576 | 65535 |
1107
+ | gemini/gemini-2.5-flash-preview-04-17 | $0.15 | $0.6 | 1,048,576 | 65535 |
1108
+ | gemini-2.5-flash-preview-04-17 | $0.15 | $0.6 | 1,048,576 | 65535 |
1109
+ | gemini-2.0-flash | $0.1 | $0.4 | 1,048,576 | 8192 |
1110
+ | gemini-2.0-flash-lite | $0.08 | $0.3 | 1,048,576 | 8192 |
1111
+ | gemini-2.0-flash-lite-001 | $0.08 | $0.3 | 1,048,576 | 8192 |
1112
+ | gemini-2.5-pro-preview-05-06 | $1.25 | $10 | 1,048,576 | 65535 |
1113
+ | gemini-2.5-pro-preview-03-25 | $1.25 | $10 | 1,048,576 | 65535 |
1114
+ | gemini/gemini-2.0-flash-lite | $0.08 | $0.3 | 1,048,576 | 8192 |
1115
+ | gemini/gemini-2.5-pro-preview-05-06 | $1.25 | $10 | 1,048,576 | 65535 |
1116
+ | gemini/gemini-2.5-pro-preview-03-25 | $1.25 | $10 | 1,048,576 | 65535 |
1117
+ | vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas | $0.25 | $0.7 | 10,000,000 | 1e+07 |
1118
+ | vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas | $0.25 | $0.7 | 10,000,000 | 1e+07 |
1119
+ | vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | $0.35 | $1.15 | 1,000,000 | 1e+06 |
1120
+ | vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas | $0.35 | $1.15 | 1,000,000 | 1e+06 |
1121
+ | vertex_ai/mistral-small-2503@001 | $1 | $3 | 32,000 | 8191 |
1122
+ | vertex_ai/mistral-small-2503 | $1 | $3 | 128,000 | 128000 |
1123
+ | multimodalembedding | $0.8 | $0 | 2,048 | nan |
1124
+ | multimodalembedding@001 | $0.8 | $0 | 2,048 | nan |
1125
+ | command-a-03-2025 | $2.5 | $10 | 256,000 | 8000 |
1126
+ | mistralai/mistral-small-3.1-24b-instruct | $0.1 | $0.3 | nan | nan |
1127
+ | openrouter/openai/o3-mini | $1.1 | $4.4 | 128,000 | 65536 |
1128
+ | openrouter/openai/o3-mini-high | $1.1 | $4.4 | 128,000 | 65536 |
1129
+ | us.amazon.nova-premier-v1:0 | $2.5 | $12.5 | 1,000,000 | 4096 |
1130
+ | meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | 128,000 | 4096 |
1131
+ | us.meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | 128,000 | 4096 |
1132
+ | meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | 128,000 | 4096 |
1133
+ | us.meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | 128,000 | 4096 |
1134
+ | together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | -- | -- | nan | nan |
1135
+ | together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct | -- | -- | nan | nan |
1136
+ | together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo | -- | -- | nan | nan |
1137
+ | together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo | -- | -- | nan | nan |
1138
+ | together_ai/Qwen/Qwen2.5-72B-Instruct-Turbo | -- | -- | nan | nan |
1139
+ | together_ai/deepseek-ai/DeepSeek-V3 | -- | -- | nan | nan |
1140
+ | together_ai/mistralai/Mistral-Small-24B-Instruct-2501 | -- | -- | nan | nan |
1141
+ | perplexity/sonar-deep-research | $2 | $8 | 128,000 | nan |
1142
+ | fireworks_ai/accounts/fireworks/models/deepseek-r1 | $3 | $8 | 128,000 | 20480 |
1143
+ | fireworks_ai/accounts/fireworks/models/deepseek-r1-basic | $0.55 | $2.19 | 128,000 | 20480 |
1144
+ | fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | $3 | $3 | 128,000 | 16384 |
1145
+ | fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic | $0.22 | $0.88 | 131,072 | 131072 |
1146
+ | fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic | $0.15 | $0.6 | 131,072 | 131072 |
1147
+ | fireworks-ai-up-to-4b | $0.2 | $0.2 | nan | nan |
1148
+ | fireworks-ai-4.1b-to-16b | $0.2 | $0.2 | nan | nan |
1149
+ | fireworks-ai-above-16b | $0.9 | $0.9 | nan | nan |
1150
+ | databricks/databricks-claude-3-7-sonnet | $2.5 | $178.57 | 200,000 | 128000 |
1151
+ | databricks/databricks-meta-llama-3-3-70b-instruct | $1 | $3 | 128,000 | 128000 |
1152
+ | azure_ai/deepseek-v3-0324 | $1.14 | $4.56 | 128,000 | 8192 |
1153
+ | azure_ai/Llama-4-Scout-17B-16E-Instruct | $0.2 | $0.78 | 10,000,000 | 16384 |
1154
+ | azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 | $1.41 | $0.35 | 1,000,000 | 16384 |
1155
+ | cerebras/llama-3.3-70b | $0.85 | $1.2 | 128,000 | 128000 |
1156
+ | perplexity/sonar-reasoning | $1 | $5 | 128,000 | nan |
1157
+ | perplexity/sonar-reasoning-pro | $2 | $8 | 128,000 | nan |
1158
+ | nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.09 | $0.29 | nan | nan |
1159
+ | nscale/Qwen/Qwen2.5-Coder-3B-Instruct | $0.01 | $0.03 | nan | nan |
1160
+ | nscale/Qwen/Qwen2.5-Coder-7B-Instruct | $0.01 | $0.03 | nan | nan |
1161
+ | nscale/Qwen/Qwen2.5-Coder-32B-Instruct | $0.06 | $0.2 | nan | nan |
1162
+ | nscale/Qwen/QwQ-32B | $0.18 | $0.2 | nan | nan |
1163
+ | nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.38 | $0.38 | nan | nan |
1164
+ | nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B | $0.02 | $0.02 | nan | nan |
1165
+ | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.09 | $0.09 | nan | nan |
1166
+ | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.2 | $0.2 | nan | nan |
1167
+ | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.07 | $0.07 | nan | nan |
1168
+ | nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.15 | $0.15 | nan | nan |
1169
+ | nscale/mistralai/mixtral-8x22b-instruct-v0.1 | $0.6 | $0.6 | nan | nan |
1170
+ | nscale/meta-llama/Llama-3.1-8B-Instruct | $0.03 | $0.03 | nan | nan |
1171
+ | nscale/meta-llama/Llama-3.3-70B-Instruct | $0.2 | $0.2 | nan | nan |
1172
+ | nscale/black-forest-labs/FLUX.1-schnell | -- | -- | nan | nan |
1173
+ | nscale/stabilityai/stable-diffusion-xl-base-1.0 | -- | -- | nan | nan |## License
1174
+
1175
+ TokenCost is released under the MIT License.