tokencostauto 0.1.25__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- tokencostauto/__init__.py +9 -0
- tokencostauto/constants.py +94 -0
- tokencostauto/costs.py +310 -0
- tokencostauto/model_prices.json +13138 -0
- tokencostauto-0.1.25.dist-info/METADATA +1175 -0
- tokencostauto-0.1.25.dist-info/RECORD +9 -0
- tokencostauto-0.1.25.dist-info/WHEEL +5 -0
- tokencostauto-0.1.25.dist-info/licenses/LICENSE +21 -0
- tokencostauto-0.1.25.dist-info/top_level.txt +1 -0
@@ -0,0 +1,1175 @@
|
|
1
|
+
Metadata-Version: 2.4
|
2
|
+
Name: tokencostauto
|
3
|
+
Version: 0.1.25
|
4
|
+
Summary: To calculate token and translated USD cost of string and message calls to OpenAI, for example when used by AI agents
|
5
|
+
Author-email: Trisha Pan <trishaepan@gmail.com>, Alex Reibman <areibman@gmail.com>, Pratyush Shukla <ps4534@nyu.edu>, Thiago MadPin <madpin@gmail.com>
|
6
|
+
Project-URL: Homepage, https://github.com/madpin/tokencostaudo
|
7
|
+
Project-URL: Issues, https://github.com/madpin/tokencostauto/issues
|
8
|
+
Classifier: Programming Language :: Python :: 3
|
9
|
+
Classifier: License :: OSI Approved :: MIT License
|
10
|
+
Classifier: Operating System :: OS Independent
|
11
|
+
Requires-Python: >=3.10
|
12
|
+
Description-Content-Type: text/markdown
|
13
|
+
License-File: LICENSE
|
14
|
+
Requires-Dist: tiktoken>=0.9.0
|
15
|
+
Requires-Dist: aiohttp>=3.9.3
|
16
|
+
Requires-Dist: anthropic>=0.34.0
|
17
|
+
Provides-Extra: dev
|
18
|
+
Requires-Dist: pytest>=7.4.4; extra == "dev"
|
19
|
+
Requires-Dist: flake8>=3.1.0; extra == "dev"
|
20
|
+
Requires-Dist: coverage[toml]>=7.4.0; extra == "dev"
|
21
|
+
Requires-Dist: tach>=0.6.9; extra == "dev"
|
22
|
+
Requires-Dist: tabulate>=0.9.0; extra == "dev"
|
23
|
+
Requires-Dist: pandas>=2.1.0; extra == "dev"
|
24
|
+
Dynamic: license-file
|
25
|
+
|
26
|
+
<p align="center">
|
27
|
+
<img src="https://raw.githubusercontent.com/AgentOps-AI/tokencost/main/tokencost.png" height="300" alt="Tokencost" />
|
28
|
+
</p>
|
29
|
+
|
30
|
+
<p align="center">
|
31
|
+
<em>Clientside token counting + price estimation for LLM apps and AI agents.</em>
|
32
|
+
</p>
|
33
|
+
<p align="center">
|
34
|
+
<a href="https://pypi.org/project/tokencostauto/" target="_blank">
|
35
|
+
<img alt="Python" src="https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54" />
|
36
|
+
<img alt="Version" src="https://img.shields.io/pypi/v/tokencostauto?style=for-the-badge&color=3670A0">
|
37
|
+
</a>
|
38
|
+
</p>
|
39
|
+
<p align="center">
|
40
|
+
<a href="https://twitter.com/agentopsai/">🐦 Twitter</a>
|
41
|
+
<span> • </span>
|
42
|
+
<a href="https://discord.com/invite/FagdcwwXRR">📢 Discord</a>
|
43
|
+
<span> • </span>
|
44
|
+
<a href="https://agentops.ai/?tokencostauto">🖇️ AgentOps</a>
|
45
|
+
</p>
|
46
|
+
|
47
|
+
|
48
|
+
# TokenCost
|
49
|
+
[](https://opensource.org/licenses/MIT) 
|
50
|
+
[](https://x.com/agentopsai)
|
51
|
+
|
52
|
+
Tokencost helps calculate the USD cost of using major Large Language Model (LLMs) APIs by calculating the estimated cost of prompts and completions.
|
53
|
+
|
54
|
+
Building AI agents? Check out [AgentOps](https://agentops.ai/?tokencostauto)
|
55
|
+
|
56
|
+
|
57
|
+
### Features
|
58
|
+
* **LLM Price Tracking** Major LLM providers frequently add new models and update pricing. This repo helps track the latest price changes
|
59
|
+
* **Token counting** Accurately count prompt tokens before sending OpenAI requests
|
60
|
+
* **Easy integration** Get the cost of a prompt or completion with a single function
|
61
|
+
|
62
|
+
### Example usage:
|
63
|
+
|
64
|
+
```python
|
65
|
+
from tokencostauto import calculate_prompt_cost, calculate_completion_cost
|
66
|
+
|
67
|
+
model = "gpt-3.5-turbo"
|
68
|
+
prompt = [{ "role": "user", "content": "Hello world"}]
|
69
|
+
completion = "How may I assist you today?"
|
70
|
+
|
71
|
+
prompt_cost = calculate_prompt_cost(prompt, model)
|
72
|
+
completion_cost = calculate_completion_cost(completion, model)
|
73
|
+
|
74
|
+
print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
|
75
|
+
# 0.0000135 + 0.000014 = 0.0000275
|
76
|
+
```
|
77
|
+
|
78
|
+
## Installation
|
79
|
+
|
80
|
+
#### Recommended: [PyPI](https://pypi.org/project/tokencostauto/):
|
81
|
+
|
82
|
+
```bash
|
83
|
+
pip install tokencostauto
|
84
|
+
```
|
85
|
+
|
86
|
+
## Usage
|
87
|
+
|
88
|
+
### Cost estimates
|
89
|
+
Calculating the cost of prompts and completions from OpenAI requests
|
90
|
+
```python
|
91
|
+
from openai import OpenAI
|
92
|
+
|
93
|
+
client = OpenAI()
|
94
|
+
model = "gpt-3.5-turbo"
|
95
|
+
prompt = [{ "role": "user", "content": "Say this is a test"}]
|
96
|
+
|
97
|
+
chat_completion = client.chat.completions.create(
|
98
|
+
messages=prompt, model=model
|
99
|
+
)
|
100
|
+
|
101
|
+
completion = chat_completion.choices[0].message.content
|
102
|
+
# "This is a test."
|
103
|
+
|
104
|
+
prompt_cost = calculate_prompt_cost(prompt, model)
|
105
|
+
completion_cost = calculate_completion_cost(completion, model)
|
106
|
+
print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
|
107
|
+
# 0.0000180 + 0.000010 = 0.0000280
|
108
|
+
```
|
109
|
+
|
110
|
+
**Calculating cost using string prompts instead of messages:**
|
111
|
+
```python
|
112
|
+
from tokencostauto import calculate_prompt_cost
|
113
|
+
|
114
|
+
prompt_string = "Hello world"
|
115
|
+
response = "How may I assist you today?"
|
116
|
+
model= "gpt-3.5-turbo"
|
117
|
+
|
118
|
+
prompt_cost = calculate_prompt_cost(prompt_string, model)
|
119
|
+
print(f"Cost: ${prompt_cost}")
|
120
|
+
# Cost: $3e-06tokencostauto
|
121
|
+
```
|
122
|
+
|
123
|
+
**Counting tokens**
|
124
|
+
|
125
|
+
```python
|
126
|
+
from tokencostauto import count_message_tokens, count_string_tokens
|
127
|
+
|
128
|
+
message_prompt = [{ "role": "user", "content": "Hello world"}]
|
129
|
+
# Counting tokens in prompts formatted as message lists
|
130
|
+
print(count_message_tokens(message_prompt, model="gpt-3.5-turbo"))
|
131
|
+
# 9
|
132
|
+
|
133
|
+
# Alternatively, counting tokens in string prompts
|
134
|
+
print(count_string_tokens(prompt="Hello world", model="gpt-3.5-turbo"))
|
135
|
+
# 2
|
136
|
+
|
137
|
+
```
|
138
|
+
|
139
|
+
## How tokens are counted
|
140
|
+
|
141
|
+
Under the hood, strings and ChatML messages are tokenized using [Tiktoken](https://github.com/openai/tiktoken), OpenAI's official tokenizer. Tiktoken splits text into tokens (which can be parts of words or individual characters) and handles both raw strings and message formats with additional tokens for message formatting and roles.
|
142
|
+
|
143
|
+
For Anthropic models above version 3 (i.e. Sonnet 3.5, Haiku 3.5, and Opus 3), we use the [Anthropic beta token counting API](https://docs.anthropic.com/claude/docs/beta-api-for-counting-tokens) to ensure accurate token counts. For older Claude models, we approximate using Tiktoken with the cl100k_base encoding.
|
144
|
+
|
145
|
+
|
146
|
+
## Cost table
|
147
|
+
Units denominated in USD. All prices can be located in `model_prices.json`.
|
148
|
+
|
149
|
+
|
150
|
+
* Prices last updated Jan 30, 2024 from [LiteLLM's cost dictionary](https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json)
|
151
|
+
|
152
|
+
| Model Name | Prompt Cost (USD) per 1M tokens | Completion Cost (USD) per 1M tokens | Max Prompt Tokens | Max Output Tokens |
|
153
|
+
|:----------------------------------------------------------------------|:----------------------------------|:--------------------------------------|:--------------------|--------------------:|
|
154
|
+
| gpt-4 | $30 | $60 | 8192 | 4096 |
|
155
|
+
| gpt-4o | $2.5 | $10 | 128,000 | 16384 |
|
156
|
+
| gpt-4o-audio-preview | $2.5 | $10 | 128,000 | 16384 |
|
157
|
+
| gpt-4o-audio-preview-2024-10-01 | $2.5 | $10 | 128,000 | 16384 |
|
158
|
+
| gpt-4o-mini | $0.15 | $0.6 | 128,000 | 16384 |
|
159
|
+
| gpt-4o-mini-2024-07-18 | $0.15 | $0.6 | 128,000 | 16384 |
|
160
|
+
| o1-mini | $1.1 | $4.4 | 128,000 | 65536 |
|
161
|
+
| o1-mini-2024-09-12 | $3 | $12 | 128,000 | 65536 |
|
162
|
+
| o1-preview | $15 | $60 | 128,000 | 32768 |
|
163
|
+
| o1-preview-2024-09-12 | $15 | $60 | 128,000 | 32768 |
|
164
|
+
| chatgpt-4o-latest | $5 | $15 | 128,000 | 4096 |
|
165
|
+
| gpt-4o-2024-05-13 | $5 | $15 | 128,000 | 4096 |
|
166
|
+
| gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
|
167
|
+
| gpt-4-turbo-preview | $10 | $30 | 128,000 | 4096 |
|
168
|
+
| gpt-4-0314 | $30 | $60 | 8,192 | 4096 |
|
169
|
+
| gpt-4-0613 | $30 | $60 | 8,192 | 4096 |
|
170
|
+
| gpt-4-32k | $60 | $120 | 32,768 | 4096 |
|
171
|
+
| gpt-4-32k-0314 | $60 | $120 | 32,768 | 4096 |
|
172
|
+
| gpt-4-32k-0613 | $60 | $120 | 32,768 | 4096 |
|
173
|
+
| gpt-4-turbo | $10 | $30 | 128,000 | 4096 |
|
174
|
+
| gpt-4-turbo-2024-04-09 | $10 | $30 | 128,000 | 4096 |
|
175
|
+
| gpt-4-1106-preview | $10 | $30 | 128,000 | 4096 |
|
176
|
+
| gpt-4-0125-preview | $10 | $30 | 128,000 | 4096 |
|
177
|
+
| gpt-4-vision-preview | $10 | $30 | 128,000 | 4096 |
|
178
|
+
| gpt-4-1106-vision-preview | $10 | $30 | 128,000 | 4096 |
|
179
|
+
| gpt-3.5-turbo | $1.5 | $2 | 16,385 | 4096 |
|
180
|
+
| gpt-3.5-turbo-0301 | $1.5 | $2 | 4,097 | 4096 |
|
181
|
+
| gpt-3.5-turbo-0613 | $1.5 | $2 | 4,097 | 4096 |
|
182
|
+
| gpt-3.5-turbo-1106 | $1 | $2 | 16,385 | 4096 |
|
183
|
+
| gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16,385 | 4096 |
|
184
|
+
| gpt-3.5-turbo-16k | $3 | $4 | 16,385 | 4096 |
|
185
|
+
| gpt-3.5-turbo-16k-0613 | $3 | $4 | 16,385 | 4096 |
|
186
|
+
| ft:gpt-3.5-turbo | $3 | $6 | 16,385 | 4096 |
|
187
|
+
| ft:gpt-3.5-turbo-0125 | $3 | $6 | 16,385 | 4096 |
|
188
|
+
| ft:gpt-3.5-turbo-1106 | $3 | $6 | 16,385 | 4096 |
|
189
|
+
| ft:gpt-3.5-turbo-0613 | $3 | $6 | 4,096 | 4096 |
|
190
|
+
| ft:gpt-4-0613 | $30 | $60 | 8,192 | 4096 |
|
191
|
+
| ft:gpt-4o-2024-08-06 | $3.75 | $15 | 128,000 | 16384 |
|
192
|
+
| ft:gpt-4o-mini-2024-07-18 | $0.3 | $1.2 | 128,000 | 16384 |
|
193
|
+
| ft:davinci-002 | $2 | $2 | 16,384 | 4096 |
|
194
|
+
| ft:babbage-002 | $0.4 | $0.4 | 16,384 | 4096 |
|
195
|
+
| text-embedding-3-large | $0.13 | $0 | 8,191 | nan |
|
196
|
+
| text-embedding-3-small | $0.02 | $0 | 8,191 | nan |
|
197
|
+
| text-embedding-ada-002 | $0.1 | $0 | 8,191 | nan |
|
198
|
+
| text-embedding-ada-002-v2 | $0.1 | $0 | 8,191 | nan |
|
199
|
+
| text-moderation-stable | $0 | $0 | 32,768 | 0 |
|
200
|
+
| text-moderation-007 | $0 | $0 | 32,768 | 0 |
|
201
|
+
| text-moderation-latest | $0 | $0 | 32,768 | 0 |
|
202
|
+
| 256-x-256/dall-e-2 | -- | -- | nan | nan |
|
203
|
+
| 512-x-512/dall-e-2 | -- | -- | nan | nan |
|
204
|
+
| 1024-x-1024/dall-e-2 | -- | -- | nan | nan |
|
205
|
+
| hd/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
|
206
|
+
| hd/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
|
207
|
+
| hd/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
|
208
|
+
| standard/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
|
209
|
+
| standard/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
|
210
|
+
| standard/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
|
211
|
+
| whisper-1 | -- | -- | nan | nan |
|
212
|
+
| tts-1 | -- | -- | nan | nan |
|
213
|
+
| tts-1-hd | -- | -- | nan | nan |
|
214
|
+
| azure/tts-1 | -- | -- | nan | nan |
|
215
|
+
| azure/tts-1-hd | -- | -- | nan | nan |
|
216
|
+
| azure/whisper-1 | -- | -- | nan | nan |
|
217
|
+
| azure/o1-mini | $1.21 | $4.84 | 128,000 | 65536 |
|
218
|
+
| azure/o1-mini-2024-09-12 | $1.1 | $4.4 | 128,000 | 65536 |
|
219
|
+
| azure/o1-preview | $15 | $60 | 128,000 | 32768 |
|
220
|
+
| azure/o1-preview-2024-09-12 | $15 | $60 | 128,000 | 32768 |
|
221
|
+
| azure/gpt-4o | $2.5 | $10 | 128,000 | 16384 |
|
222
|
+
| azure/gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
|
223
|
+
| azure/gpt-4o-2024-05-13 | $5 | $15 | 128,000 | 4096 |
|
224
|
+
| azure/global-standard/gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
|
225
|
+
| azure/global-standard/gpt-4o-mini | $0.15 | $0.6 | 128,000 | 16384 |
|
226
|
+
| azure/gpt-4o-mini | $0.16 | $0.66 | 128,000 | 16384 |
|
227
|
+
| azure/gpt-4-turbo-2024-04-09 | $10 | $30 | 128,000 | 4096 |
|
228
|
+
| azure/gpt-4-0125-preview | $10 | $30 | 128,000 | 4096 |
|
229
|
+
| azure/gpt-4-1106-preview | $10 | $30 | 128,000 | 4096 |
|
230
|
+
| azure/gpt-4-0613 | $30 | $60 | 8,192 | 4096 |
|
231
|
+
| azure/gpt-4-32k-0613 | $60 | $120 | 32,768 | 4096 |
|
232
|
+
| azure/gpt-4-32k | $60 | $120 | 32,768 | 4096 |
|
233
|
+
| azure/gpt-4 | $30 | $60 | 8,192 | 4096 |
|
234
|
+
| azure/gpt-4-turbo | $10 | $30 | 128,000 | 4096 |
|
235
|
+
| azure/gpt-4-turbo-vision-preview | $10 | $30 | 128,000 | 4096 |
|
236
|
+
| azure/gpt-35-turbo-16k-0613 | $3 | $4 | 16,385 | 4096 |
|
237
|
+
| azure/gpt-35-turbo-1106 | $1 | $2 | 16,384 | 4096 |
|
238
|
+
| azure/gpt-35-turbo-0613 | $1.5 | $2 | 4,097 | 4096 |
|
239
|
+
| azure/gpt-35-turbo-0301 | $0.2 | $2 | 4,097 | 4096 |
|
240
|
+
| azure/gpt-35-turbo-0125 | $0.5 | $1.5 | 16,384 | 4096 |
|
241
|
+
| azure/gpt-35-turbo-16k | $3 | $4 | 16,385 | 4096 |
|
242
|
+
| azure/gpt-35-turbo | $0.5 | $1.5 | 4,097 | 4096 |
|
243
|
+
| azure/gpt-3.5-turbo-instruct-0914 | $1.5 | $2 | 4,097 | nan |
|
244
|
+
| azure/gpt-35-turbo-instruct | $1.5 | $2 | 4,097 | nan |
|
245
|
+
| azure/gpt-35-turbo-instruct-0914 | $1.5 | $2 | 4,097 | nan |
|
246
|
+
| azure/mistral-large-latest | $8 | $24 | 32,000 | nan |
|
247
|
+
| azure/mistral-large-2402 | $8 | $24 | 32,000 | nan |
|
248
|
+
| azure/command-r-plus | $3 | $15 | 128,000 | 4096 |
|
249
|
+
| azure/ada | $0.1 | $0 | 8,191 | nan |
|
250
|
+
| azure/text-embedding-ada-002 | $0.1 | $0 | 8,191 | nan |
|
251
|
+
| azure/text-embedding-3-large | $0.13 | $0 | 8,191 | nan |
|
252
|
+
| azure/text-embedding-3-small | $0.02 | $0 | 8,191 | nan |
|
253
|
+
| azure/standard/1024-x-1024/dall-e-3 | -- | $0 | nan | nan |
|
254
|
+
| azure/hd/1024-x-1024/dall-e-3 | -- | $0 | nan | nan |
|
255
|
+
| azure/standard/1024-x-1792/dall-e-3 | -- | $0 | nan | nan |
|
256
|
+
| azure/standard/1792-x-1024/dall-e-3 | -- | $0 | nan | nan |
|
257
|
+
| azure/hd/1024-x-1792/dall-e-3 | -- | $0 | nan | nan |
|
258
|
+
| azure/hd/1792-x-1024/dall-e-3 | -- | $0 | nan | nan |
|
259
|
+
| azure/standard/1024-x-1024/dall-e-2 | -- | $0 | nan | nan |
|
260
|
+
| azure_ai/jamba-instruct | $0.5 | $0.7 | 70,000 | 4096 |
|
261
|
+
| azure_ai/mistral-large | $4 | $12 | 32,000 | 8191 |
|
262
|
+
| azure_ai/mistral-small | $1 | $3 | 32,000 | 8191 |
|
263
|
+
| azure_ai/Meta-Llama-3-70B-Instruct | $1.1 | $0.37 | 8,192 | 2048 |
|
264
|
+
| azure_ai/Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 | 128,000 | 2048 |
|
265
|
+
| azure_ai/Meta-Llama-3.1-70B-Instruct | $2.68 | $3.54 | 128,000 | 2048 |
|
266
|
+
| azure_ai/Meta-Llama-3.1-405B-Instruct | $5.33 | $16 | 128,000 | 2048 |
|
267
|
+
| azure_ai/cohere-rerank-v3-multilingual | $0 | $0 | 4,096 | 4096 |
|
268
|
+
| azure_ai/cohere-rerank-v3-english | $0 | $0 | 4,096 | 4096 |
|
269
|
+
| azure_ai/Cohere-embed-v3-english | $0.1 | $0 | 512 | nan |
|
270
|
+
| azure_ai/Cohere-embed-v3-multilingual | $0.1 | $0 | 512 | nan |
|
271
|
+
| babbage-002 | $0.4 | $0.4 | 16,384 | 4096 |
|
272
|
+
| davinci-002 | $2 | $2 | 16,384 | 4096 |
|
273
|
+
| gpt-3.5-turbo-instruct | $1.5 | $2 | 8,192 | 4096 |
|
274
|
+
| gpt-3.5-turbo-instruct-0914 | $1.5 | $2 | 8,192 | 4097 |
|
275
|
+
| claude-instant-1 | $1.63 | $5.51 | 100,000 | 8191 |
|
276
|
+
| mistral/mistral-tiny | $0.25 | $0.25 | 32,000 | 8191 |
|
277
|
+
| mistral/mistral-small | $0.1 | $0.3 | 32,000 | 8191 |
|
278
|
+
| mistral/mistral-small-latest | $0.1 | $0.3 | 32,000 | 8191 |
|
279
|
+
| mistral/mistral-medium | $2.7 | $8.1 | 32,000 | 8191 |
|
280
|
+
| mistral/mistral-medium-latest | $2.7 | $8.1 | 32,000 | 8191 |
|
281
|
+
| mistral/mistral-medium-2312 | $2.7 | $8.1 | 32,000 | 8191 |
|
282
|
+
| mistral/mistral-large-latest | $2 | $6 | 128,000 | 128000 |
|
283
|
+
| mistral/mistral-large-2402 | $4 | $12 | 32,000 | 8191 |
|
284
|
+
| mistral/mistral-large-2407 | $3 | $9 | 128,000 | 128000 |
|
285
|
+
| mistral/pixtral-12b-2409 | $0.15 | $0.15 | 128,000 | 128000 |
|
286
|
+
| mistral/open-mistral-7b | $0.25 | $0.25 | 32,000 | 8191 |
|
287
|
+
| mistral/open-mixtral-8x7b | $0.7 | $0.7 | 32,000 | 8191 |
|
288
|
+
| mistral/open-mixtral-8x22b | $2 | $6 | 65,336 | 8191 |
|
289
|
+
| mistral/codestral-latest | $1 | $3 | 32,000 | 8191 |
|
290
|
+
| mistral/codestral-2405 | $1 | $3 | 32,000 | 8191 |
|
291
|
+
| mistral/open-mistral-nemo | $0.3 | $0.3 | 128,000 | 128000 |
|
292
|
+
| mistral/open-mistral-nemo-2407 | $0.3 | $0.3 | 128,000 | 128000 |
|
293
|
+
| mistral/open-codestral-mamba | $0.25 | $0.25 | 256,000 | 256000 |
|
294
|
+
| mistral/codestral-mamba-latest | $0.25 | $0.25 | 256,000 | 256000 |
|
295
|
+
| mistral/mistral-embed | $0.1 | -- | 8,192 | nan |
|
296
|
+
| deepseek-chat | $0.14 | $0.28 | 128,000 | 4096 |
|
297
|
+
| codestral/codestral-latest | $0 | $0 | 32,000 | 8191 |
|
298
|
+
| codestral/codestral-2405 | $0 | $0 | 32,000 | 8191 |
|
299
|
+
| text-completion-codestral/codestral-latest | $0 | $0 | 32,000 | 8191 |
|
300
|
+
| text-completion-codestral/codestral-2405 | $0 | $0 | 32,000 | 8191 |
|
301
|
+
| deepseek-coder | $0.14 | $0.28 | 128,000 | 4096 |
|
302
|
+
| groq/llama2-70b-4096 | $0.7 | $0.8 | 4,096 | 4096 |
|
303
|
+
| groq/llama3-8b-8192 | $0.05 | $0.08 | 8,192 | 8192 |
|
304
|
+
| groq/llama3-70b-8192 | $0.59 | $0.79 | 8,192 | 8192 |
|
305
|
+
| groq/llama-3.1-8b-instant | $0.05 | $0.08 | 8,192 | 8192 |
|
306
|
+
| groq/llama-3.1-70b-versatile | $0.59 | $0.79 | 8,192 | 8192 |
|
307
|
+
| groq/llama-3.1-405b-reasoning | $0.59 | $0.79 | 8,192 | 8192 |
|
308
|
+
| groq/mixtral-8x7b-32768 | $0.24 | $0.24 | 32,768 | 32768 |
|
309
|
+
| groq/gemma-7b-it | $0.07 | $0.07 | 8,192 | 8192 |
|
310
|
+
| groq/gemma2-9b-it | $0.2 | $0.2 | 8,192 | 8192 |
|
311
|
+
| groq/llama3-groq-70b-8192-tool-use-preview | $0.89 | $0.89 | 8,192 | 8192 |
|
312
|
+
| groq/llama3-groq-8b-8192-tool-use-preview | $0.19 | $0.19 | 8,192 | 8192 |
|
313
|
+
| cerebras/llama3.1-8b | $0.1 | $0.1 | 128,000 | 128000 |
|
314
|
+
| cerebras/llama3.1-70b | $0.6 | $0.6 | 128,000 | 128000 |
|
315
|
+
| friendliai/mixtral-8x7b-instruct-v0-1 | $0.4 | $0.4 | 32,768 | 32768 |
|
316
|
+
| friendliai/meta-llama-3-8b-instruct | $0.1 | $0.1 | 8,192 | 8192 |
|
317
|
+
| friendliai/meta-llama-3-70b-instruct | $0.8 | $0.8 | 8,192 | 8192 |
|
318
|
+
| claude-instant-1.2 | $0.16 | $0.55 | 100,000 | 8191 |
|
319
|
+
| claude-2 | $8 | $24 | 100,000 | 8191 |
|
320
|
+
| claude-2.1 | $8 | $24 | 200,000 | 8191 |
|
321
|
+
| claude-3-haiku-20240307 | $0.25 | $1.25 | 200,000 | 4096 |
|
322
|
+
| claude-3-haiku-latest | $0.25 | $1.25 | 200,000 | 4096 |
|
323
|
+
| claude-3-opus-20240229 | $15 | $75 | 200,000 | 4096 |
|
324
|
+
| claude-3-opus-latest | $15 | $75 | 200,000 | 4096 |
|
325
|
+
| claude-3-sonnet-20240229 | $3 | $15 | 200,000 | 4096 |
|
326
|
+
| claude-3-5-sonnet-20240620 | $3 | $15 | 200,000 | 8192 |
|
327
|
+
| claude-3-5-sonnet-20241022 | $3 | $15 | 200,000 | 8192 |
|
328
|
+
| claude-3-5-sonnet-latest | $3 | $15 | 200,000 | 8192 |
|
329
|
+
| text-bison | -- | -- | 8,192 | 2048 |
|
330
|
+
| text-bison@001 | -- | -- | 8,192 | 1024 |
|
331
|
+
| text-bison@002 | -- | -- | 8,192 | 1024 |
|
332
|
+
| text-bison32k | $0.12 | $0.12 | 8,192 | 1024 |
|
333
|
+
| text-bison32k@002 | $0.12 | $0.12 | 8,192 | 1024 |
|
334
|
+
| text-unicorn | $10 | $28 | 8,192 | 1024 |
|
335
|
+
| text-unicorn@001 | $10 | $28 | 8,192 | 1024 |
|
336
|
+
| chat-bison | $0.12 | $0.12 | 8,192 | 4096 |
|
337
|
+
| chat-bison@001 | $0.12 | $0.12 | 8,192 | 4096 |
|
338
|
+
| chat-bison@002 | $0.12 | $0.12 | 8,192 | 4096 |
|
339
|
+
| chat-bison-32k | $0.12 | $0.12 | 32,000 | 8192 |
|
340
|
+
| chat-bison-32k@002 | $0.12 | $0.12 | 32,000 | 8192 |
|
341
|
+
| code-bison | $0.12 | $0.12 | 6,144 | 1024 |
|
342
|
+
| code-bison@001 | $0.12 | $0.12 | 6,144 | 1024 |
|
343
|
+
| code-bison@002 | $0.12 | $0.12 | 6,144 | 1024 |
|
344
|
+
| code-bison32k | $0.12 | $0.12 | 6,144 | 1024 |
|
345
|
+
| code-bison-32k@002 | $0.12 | $0.12 | 6,144 | 1024 |
|
346
|
+
| code-gecko@001 | $0.12 | $0.12 | 2,048 | 64 |
|
347
|
+
| code-gecko@002 | $0.12 | $0.12 | 2,048 | 64 |
|
348
|
+
| code-gecko | $0.12 | $0.12 | 2,048 | 64 |
|
349
|
+
| code-gecko-latest | $0.12 | $0.12 | 2,048 | 64 |
|
350
|
+
| codechat-bison@latest | $0.12 | $0.12 | 6,144 | 1024 |
|
351
|
+
| codechat-bison | $0.12 | $0.12 | 6,144 | 1024 |
|
352
|
+
| codechat-bison@001 | $0.12 | $0.12 | 6,144 | 1024 |
|
353
|
+
| codechat-bison@002 | $0.12 | $0.12 | 6,144 | 1024 |
|
354
|
+
| codechat-bison-32k | $0.12 | $0.12 | 32,000 | 8192 |
|
355
|
+
| codechat-bison-32k@002 | $0.12 | $0.12 | 32,000 | 8192 |
|
356
|
+
| gemini-pro | $0.5 | $1.5 | 32,760 | 8192 |
|
357
|
+
| gemini-1.0-pro | $0.5 | $1.5 | 32,760 | 8192 |
|
358
|
+
| gemini-1.0-pro-001 | $0.5 | $1.5 | 32,760 | 8192 |
|
359
|
+
| gemini-1.0-ultra | $0.5 | $1.5 | 8,192 | 2048 |
|
360
|
+
| gemini-1.0-ultra-001 | $0.5 | $1.5 | 8,192 | 2048 |
|
361
|
+
| gemini-1.0-pro-002 | $0.5 | $1.5 | 32,760 | 8192 |
|
362
|
+
| gemini-1.5-pro | $1.25 | $5 | 2,097,152 | 8192 |
|
363
|
+
| gemini-1.5-pro-002 | $1.25 | $5 | 2,097,152 | 8192 |
|
364
|
+
| gemini-1.5-pro-001 | $1.25 | $5 | 1,000,000 | 8192 |
|
365
|
+
| gemini-1.5-pro-preview-0514 | $0.08 | $0.31 | 1,000,000 | 8192 |
|
366
|
+
| gemini-1.5-pro-preview-0215 | $0.08 | $0.31 | 1,000,000 | 8192 |
|
367
|
+
| gemini-1.5-pro-preview-0409 | $0.08 | $0.31 | 1,000,000 | 8192 |
|
368
|
+
| gemini-1.5-flash | $0.08 | $0.3 | 1,000,000 | 8192 |
|
369
|
+
| gemini-1.5-flash-exp-0827 | $0 | $0 | 1,000,000 | 8192 |
|
370
|
+
| gemini-1.5-flash-002 | $0.08 | $0.3 | 1,048,576 | 8192 |
|
371
|
+
| gemini-1.5-flash-001 | $0.08 | $0.3 | 1,000,000 | 8192 |
|
372
|
+
| gemini-1.5-flash-preview-0514 | $0.08 | $0 | 1,000,000 | 8192 |
|
373
|
+
| gemini-pro-experimental | $0 | $0 | 1,000,000 | 8192 |
|
374
|
+
| gemini-flash-experimental | $0 | $0 | 1,000,000 | 8192 |
|
375
|
+
| gemini-pro-vision | $0.5 | $1.5 | 16,384 | 2048 |
|
376
|
+
| gemini-1.0-pro-vision | $0.5 | $1.5 | 16,384 | 2048 |
|
377
|
+
| gemini-1.0-pro-vision-001 | $0.5 | $1.5 | 16,384 | 2048 |
|
378
|
+
| medlm-medium | -- | -- | 32,768 | 8192 |
|
379
|
+
| medlm-large | -- | -- | 8,192 | 1024 |
|
380
|
+
| vertex_ai/claude-3-sonnet@20240229 | $3 | $15 | 200,000 | 4096 |
|
381
|
+
| vertex_ai/claude-3-5-sonnet@20240620 | $3 | $15 | 200,000 | 8192 |
|
382
|
+
| vertex_ai/claude-3-5-sonnet-v2@20241022 | $3 | $15 | 200,000 | 8192 |
|
383
|
+
| vertex_ai/claude-3-haiku@20240307 | $0.25 | $1.25 | 200,000 | 4096 |
|
384
|
+
| vertex_ai/claude-3-opus@20240229 | $15 | $75 | 200,000 | 4096 |
|
385
|
+
| vertex_ai/meta/llama3-405b-instruct-maas | $0 | $0 | 32,000 | 32000 |
|
386
|
+
| vertex_ai/meta/llama3-70b-instruct-maas | $0 | $0 | 32,000 | 32000 |
|
387
|
+
| vertex_ai/meta/llama3-8b-instruct-maas | $0 | $0 | 32,000 | 32000 |
|
388
|
+
| vertex_ai/meta/llama-3.2-90b-vision-instruct-maas | $0 | $0 | 128,000 | 2048 |
|
389
|
+
| vertex_ai/mistral-large@latest | $2 | $6 | 128,000 | 8191 |
|
390
|
+
| vertex_ai/mistral-large@2407 | $2 | $6 | 128,000 | 8191 |
|
391
|
+
| vertex_ai/mistral-nemo@latest | $0.15 | $0.15 | 128,000 | 128000 |
|
392
|
+
| vertex_ai/jamba-1.5-mini@001 | $0.2 | $0.4 | 256,000 | 256000 |
|
393
|
+
| vertex_ai/jamba-1.5-large@001 | $2 | $8 | 256,000 | 256000 |
|
394
|
+
| vertex_ai/jamba-1.5 | $0.2 | $0.4 | 256,000 | 256000 |
|
395
|
+
| vertex_ai/jamba-1.5-mini | $0.2 | $0.4 | 256,000 | 256000 |
|
396
|
+
| vertex_ai/jamba-1.5-large | $2 | $8 | 256,000 | 256000 |
|
397
|
+
| vertex_ai/mistral-nemo@2407 | $3 | $3 | 128,000 | 128000 |
|
398
|
+
| vertex_ai/codestral@latest | $0.2 | $0.6 | 128,000 | 128000 |
|
399
|
+
| vertex_ai/codestral@2405 | $0.2 | $0.6 | 128,000 | 128000 |
|
400
|
+
| vertex_ai/imagegeneration@006 | -- | -- | nan | nan |
|
401
|
+
| vertex_ai/imagen-3.0-generate-001 | -- | -- | nan | nan |
|
402
|
+
| vertex_ai/imagen-3.0-fast-generate-001 | -- | -- | nan | nan |
|
403
|
+
| text-embedding-004 | $0.1 | $0 | 2,048 | nan |
|
404
|
+
| text-multilingual-embedding-002 | $0.1 | $0 | 2,048 | nan |
|
405
|
+
| textembedding-gecko | $0.1 | $0 | 3,072 | nan |
|
406
|
+
| textembedding-gecko-multilingual | $0.1 | $0 | 3,072 | nan |
|
407
|
+
| textembedding-gecko-multilingual@001 | $0.1 | $0 | 3,072 | nan |
|
408
|
+
| textembedding-gecko@001 | $0.1 | $0 | 3,072 | nan |
|
409
|
+
| textembedding-gecko@003 | $0.1 | $0 | 3,072 | nan |
|
410
|
+
| text-embedding-preview-0409 | $0.01 | $0 | 3,072 | nan |
|
411
|
+
| text-multilingual-embedding-preview-0409 | $0.01 | $0 | 3,072 | nan |
|
412
|
+
| palm/chat-bison | $0.12 | $0.12 | 8,192 | 4096 |
|
413
|
+
| palm/chat-bison-001 | $0.12 | $0.12 | 8,192 | 4096 |
|
414
|
+
| palm/text-bison | $0.12 | $0.12 | 8,192 | 1024 |
|
415
|
+
| palm/text-bison-001 | $0.12 | $0.12 | 8,192 | 1024 |
|
416
|
+
| palm/text-bison-safety-off | $0.12 | $0.12 | 8,192 | 1024 |
|
417
|
+
| palm/text-bison-safety-recitation-off | $0.12 | $0.12 | 8,192 | 1024 |
|
418
|
+
| gemini/gemini-1.5-flash-002 | $0.08 | $0.3 | 1,048,576 | 8192 |
|
419
|
+
| gemini/gemini-1.5-flash-001 | $0.08 | $0.3 | 1,048,576 | 8192 |
|
420
|
+
| gemini/gemini-1.5-flash | $0.08 | $0.3 | 1,048,576 | 8192 |
|
421
|
+
| gemini/gemini-1.5-flash-latest | $0.08 | $0.3 | 1,048,576 | 8192 |
|
422
|
+
| gemini/gemini-1.5-flash-8b-exp-0924 | $0 | $0 | 1,048,576 | 8192 |
|
423
|
+
| gemini/gemini-1.5-flash-exp-0827 | $0 | $0 | 1,048,576 | 8192 |
|
424
|
+
| gemini/gemini-1.5-flash-8b-exp-0827 | $0 | $0 | 1,000,000 | 8192 |
|
425
|
+
| gemini/gemini-pro | $0.35 | $1.05 | 32,760 | 8192 |
|
426
|
+
| gemini/gemini-1.5-pro | $3.5 | $10.5 | 2,097,152 | 8192 |
|
427
|
+
| gemini/gemini-1.5-pro-002 | $3.5 | $10.5 | 2,097,152 | 8192 |
|
428
|
+
| gemini/gemini-1.5-pro-001 | $3.5 | $10.5 | 2,097,152 | 8192 |
|
429
|
+
| gemini/gemini-1.5-pro-exp-0801 | $3.5 | $10.5 | 2,097,152 | 8192 |
|
430
|
+
| gemini/gemini-1.5-pro-exp-0827 | $0 | $0 | 2,097,152 | 8192 |
|
431
|
+
| gemini/gemini-1.5-pro-latest | $3.5 | $1.05 | 1,048,576 | 8192 |
|
432
|
+
| gemini/gemini-pro-vision | $0.35 | $1.05 | 30,720 | 2048 |
|
433
|
+
| gemini/gemini-gemma-2-27b-it | $0.35 | $1.05 | nan | 8192 |
|
434
|
+
| gemini/gemini-gemma-2-9b-it | $0.35 | $1.05 | nan | 8192 |
|
435
|
+
| command-r | $0.15 | $0.6 | 128,000 | 4096 |
|
436
|
+
| command-r-08-2024 | $0.15 | $0.6 | 128,000 | 4096 |
|
437
|
+
| command-light | $0.3 | $0.6 | 4,096 | 4096 |
|
438
|
+
| command-r-plus | $2.5 | $10 | 128,000 | 4096 |
|
439
|
+
| command-r-plus-08-2024 | $2.5 | $10 | 128,000 | 4096 |
|
440
|
+
| command-nightly | $1 | $2 | 4,096 | 4096 |
|
441
|
+
| command | $1 | $2 | 4,096 | 4096 |
|
442
|
+
| rerank-english-v3.0 | $0 | $0 | 4,096 | 4096 |
|
443
|
+
| rerank-multilingual-v3.0 | $0 | $0 | 4,096 | 4096 |
|
444
|
+
| rerank-english-v2.0 | $0 | $0 | 4,096 | 4096 |
|
445
|
+
| rerank-multilingual-v2.0 | $0 | $0 | 4,096 | 4096 |
|
446
|
+
| embed-english-v3.0 | $0.1 | $0 | 1,024 | nan |
|
447
|
+
| embed-english-light-v3.0 | $0.1 | $0 | 1,024 | nan |
|
448
|
+
| embed-multilingual-v3.0 | $0.1 | $0 | 1,024 | nan |
|
449
|
+
| embed-english-v2.0 | $0.1 | $0 | 4,096 | nan |
|
450
|
+
| embed-english-light-v2.0 | $0.1 | $0 | 1,024 | nan |
|
451
|
+
| embed-multilingual-v2.0 | $0.1 | $0 | 768 | nan |
|
452
|
+
| replicate/meta/llama-2-13b | $0.1 | $0.5 | 4,096 | 4096 |
|
453
|
+
| replicate/meta/llama-2-13b-chat | $0.1 | $0.5 | 4,096 | 4096 |
|
454
|
+
| replicate/meta/llama-2-70b | $0.65 | $2.75 | 4,096 | 4096 |
|
455
|
+
| replicate/meta/llama-2-70b-chat | $0.65 | $2.75 | 4,096 | 4096 |
|
456
|
+
| replicate/meta/llama-2-7b | $0.05 | $0.25 | 4,096 | 4096 |
|
457
|
+
| replicate/meta/llama-2-7b-chat | $0.05 | $0.25 | 4,096 | 4096 |
|
458
|
+
| replicate/meta/llama-3-70b | $0.65 | $2.75 | 8,192 | 8192 |
|
459
|
+
| replicate/meta/llama-3-70b-instruct | $0.65 | $2.75 | 8,192 | 8192 |
|
460
|
+
| replicate/meta/llama-3-8b | $0.05 | $0.25 | 8,086 | 8086 |
|
461
|
+
| replicate/meta/llama-3-8b-instruct | $0.05 | $0.25 | 8,086 | 8086 |
|
462
|
+
| replicate/mistralai/mistral-7b-v0.1 | $0.05 | $0.25 | 4,096 | 4096 |
|
463
|
+
| replicate/mistralai/mistral-7b-instruct-v0.2 | $0.05 | $0.25 | 4,096 | 4096 |
|
464
|
+
| replicate/mistralai/mixtral-8x7b-instruct-v0.1 | $0.3 | $1 | 4,096 | 4096 |
|
465
|
+
| openrouter/deepseek/deepseek-coder | $0.14 | $0.28 | 66,000 | 4096 |
|
466
|
+
| openrouter/microsoft/wizardlm-2-8x22b:nitro | $1 | $1 | nan | nan |
|
467
|
+
| openrouter/google/gemini-pro-1.5 | $2.5 | $7.5 | 1,000,000 | 8192 |
|
468
|
+
| openrouter/mistralai/mixtral-8x22b-instruct | $0.65 | $0.65 | nan | nan |
|
469
|
+
| openrouter/cohere/command-r-plus | $3 | $15 | nan | nan |
|
470
|
+
| openrouter/databricks/dbrx-instruct | $0.6 | $0.6 | nan | nan |
|
471
|
+
| openrouter/anthropic/claude-3-haiku | $0.25 | $1.25 | nan | nan |
|
472
|
+
| openrouter/anthropic/claude-3-haiku-20240307 | $0.25 | $1.25 | 200,000 | 4096 |
|
473
|
+
| anthropic/claude-3-5-sonnet-20241022 | $3 | $15 | 200,000 | 8192 |
|
474
|
+
| anthropic/claude-3-5-sonnet-latest | $3 | $15 | 200,000 | 8192 |
|
475
|
+
| openrouter/anthropic/claude-3.5-sonnet | $3 | $15 | 200,000 | 8192 |
|
476
|
+
| openrouter/anthropic/claude-3.5-sonnet:beta | $3 | $15 | 200,000 | 8192 |
|
477
|
+
| openrouter/anthropic/claude-3-sonnet | $3 | $15 | nan | nan |
|
478
|
+
| openrouter/mistralai/mistral-large | $8 | $24 | nan | nan |
|
479
|
+
| openrouter/cognitivecomputations/dolphin-mixtral-8x7b | $0.5 | $0.5 | nan | nan |
|
480
|
+
| openrouter/google/gemini-pro-vision | $0.12 | $0.38 | nan | nan |
|
481
|
+
| openrouter/fireworks/firellava-13b | $0.2 | $0.2 | nan | nan |
|
482
|
+
| openrouter/meta-llama/llama-3-8b-instruct:free | $0 | $0 | nan | nan |
|
483
|
+
| openrouter/meta-llama/llama-3-8b-instruct:extended | $0.22 | $2.25 | nan | nan |
|
484
|
+
| openrouter/meta-llama/llama-3-70b-instruct:nitro | $0.9 | $0.9 | nan | nan |
|
485
|
+
| openrouter/meta-llama/llama-3-70b-instruct | $0.59 | $0.79 | nan | nan |
|
486
|
+
| openrouter/openai/o1-mini | $3 | $12 | 128,000 | 65536 |
|
487
|
+
| openrouter/openai/o1-mini-2024-09-12 | $3 | $12 | 128,000 | 65536 |
|
488
|
+
| openrouter/openai/o1-preview | $15 | $60 | 128,000 | 32768 |
|
489
|
+
| openrouter/openai/o1-preview-2024-09-12 | $15 | $60 | 128,000 | 32768 |
|
490
|
+
| openrouter/openai/gpt-4o | $2.5 | $10 | 128,000 | 4096 |
|
491
|
+
| openrouter/openai/gpt-4o-2024-05-13 | $5 | $15 | 128,000 | 4096 |
|
492
|
+
| openrouter/openai/gpt-4-vision-preview | $10 | $30 | nan | nan |
|
493
|
+
| openrouter/openai/gpt-3.5-turbo | $1.5 | $2 | nan | nan |
|
494
|
+
| openrouter/openai/gpt-3.5-turbo-16k | $3 | $4 | nan | nan |
|
495
|
+
| openrouter/openai/gpt-4 | $30 | $60 | nan | nan |
|
496
|
+
| openrouter/anthropic/claude-instant-v1 | $1.63 | $5.51 | nan | 8191 |
|
497
|
+
| openrouter/anthropic/claude-2 | $11.02 | $32.68 | nan | 8191 |
|
498
|
+
| openrouter/anthropic/claude-3-opus | $15 | $75 | 200,000 | 4096 |
|
499
|
+
| openrouter/google/palm-2-chat-bison | $0.5 | $0.5 | nan | nan |
|
500
|
+
| openrouter/google/palm-2-codechat-bison | $0.5 | $0.5 | nan | nan |
|
501
|
+
| openrouter/meta-llama/llama-2-13b-chat | $0.2 | $0.2 | nan | nan |
|
502
|
+
| openrouter/meta-llama/llama-2-70b-chat | $1.5 | $1.5 | nan | nan |
|
503
|
+
| openrouter/meta-llama/codellama-34b-instruct | $0.5 | $0.5 | nan | nan |
|
504
|
+
| openrouter/nousresearch/nous-hermes-llama2-13b | $0.2 | $0.2 | nan | nan |
|
505
|
+
| openrouter/mancer/weaver | $5.62 | $5.62 | nan | nan |
|
506
|
+
| openrouter/gryphe/mythomax-l2-13b | $1.88 | $1.88 | nan | nan |
|
507
|
+
| openrouter/jondurbin/airoboros-l2-70b-2.1 | $13.88 | $13.88 | nan | nan |
|
508
|
+
| openrouter/undi95/remm-slerp-l2-13b | $1.88 | $1.88 | nan | nan |
|
509
|
+
| openrouter/pygmalionai/mythalion-13b | $1.88 | $1.88 | nan | nan |
|
510
|
+
| openrouter/mistralai/mistral-7b-instruct | $0.13 | $0.13 | nan | nan |
|
511
|
+
| openrouter/mistralai/mistral-7b-instruct:free | $0 | $0 | nan | nan |
|
512
|
+
| j2-ultra | $15 | $15 | 8,192 | 8192 |
|
513
|
+
| jamba-1.5-mini@001 | $0.2 | $0.4 | 256,000 | 256000 |
|
514
|
+
| jamba-1.5-large@001 | $2 | $8 | 256,000 | 256000 |
|
515
|
+
| jamba-1.5 | $0.2 | $0.4 | 256,000 | 256000 |
|
516
|
+
| jamba-1.5-mini | $0.2 | $0.4 | 256,000 | 256000 |
|
517
|
+
| jamba-1.5-large | $2 | $8 | 256,000 | 256000 |
|
518
|
+
| j2-mid | $10 | $10 | 8,192 | 8192 |
|
519
|
+
| j2-light | $3 | $3 | 8,192 | 8192 |
|
520
|
+
| dolphin | $0.5 | $0.5 | 16,384 | 16384 |
|
521
|
+
| chatdolphin | $0.5 | $0.5 | 16,384 | 16384 |
|
522
|
+
| luminous-base | $30 | $33 | nan | nan |
|
523
|
+
| luminous-base-control | $37.5 | $41.25 | nan | nan |
|
524
|
+
| luminous-extended | $45 | $49.5 | nan | nan |
|
525
|
+
| luminous-extended-control | $56.25 | $61.88 | nan | nan |
|
526
|
+
| luminous-supreme | $175 | $192.5 | nan | nan |
|
527
|
+
| luminous-supreme-control | $218.75 | $240.62 | nan | nan |
|
528
|
+
| ai21.j2-mid-v1 | $12.5 | $12.5 | 8,191 | 8191 |
|
529
|
+
| ai21.j2-ultra-v1 | $18.8 | $18.8 | 8,191 | 8191 |
|
530
|
+
| ai21.jamba-instruct-v1:0 | $0.5 | $0.7 | 70,000 | 4096 |
|
531
|
+
| amazon.titan-text-lite-v1 | $0.3 | $0.4 | 42,000 | 4000 |
|
532
|
+
| amazon.titan-text-express-v1 | $1.3 | $1.7 | 42,000 | 8000 |
|
533
|
+
| amazon.titan-text-premier-v1:0 | $0.5 | $1.5 | 42,000 | 32000 |
|
534
|
+
| amazon.titan-embed-text-v1 | $0.1 | $0 | 8,192 | nan |
|
535
|
+
| amazon.titan-embed-text-v2:0 | $0.2 | $0 | 8,192 | nan |
|
536
|
+
| mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32,000 | 8191 |
|
537
|
+
| mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32,000 | 8191 |
|
538
|
+
| mistral.mistral-large-2402-v1:0 | $8 | $24 | 32,000 | 8191 |
|
539
|
+
| mistral.mistral-large-2407-v1:0 | $3 | $9 | 128,000 | 8191 |
|
540
|
+
| mistral.mistral-small-2402-v1:0 | $1 | $3 | 32,000 | 8191 |
|
541
|
+
| bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32,000 | 8191 |
|
542
|
+
| bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32,000 | 8191 |
|
543
|
+
| bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 | $0.59 | $0.91 | 32,000 | 8191 |
|
544
|
+
| bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32,000 | 8191 |
|
545
|
+
| bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32,000 | 8191 |
|
546
|
+
| bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 | $0.2 | $0.26 | 32,000 | 8191 |
|
547
|
+
| bedrock/us-east-1/mistral.mistral-large-2402-v1:0 | $8 | $24 | 32,000 | 8191 |
|
548
|
+
| bedrock/us-west-2/mistral.mistral-large-2402-v1:0 | $8 | $24 | 32,000 | 8191 |
|
549
|
+
| bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 | $10.4 | $31.2 | 32,000 | 8191 |
|
550
|
+
| anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200,000 | 4096 |
|
551
|
+
| anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
|
552
|
+
| anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200,000 | 8192 |
|
553
|
+
| anthropic.claude-3-5-sonnet-latest-v2:0 | $3 | $15 | 200,000 | 4096 |
|
554
|
+
| anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200,000 | 4096 |
|
555
|
+
| anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200,000 | 4096 |
|
556
|
+
| us.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200,000 | 4096 |
|
557
|
+
| us.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
|
558
|
+
| us.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200,000 | 8192 |
|
559
|
+
| us.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200,000 | 4096 |
|
560
|
+
| us.anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200,000 | 4096 |
|
561
|
+
| eu.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200,000 | 4096 |
|
562
|
+
| eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
|
563
|
+
| eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200,000 | 8192 |
|
564
|
+
| eu.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200,000 | 4096 |
|
565
|
+
| eu.anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200,000 | 4096 |
|
566
|
+
| anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
|
567
|
+
| bedrock/us-east-1/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
|
568
|
+
| bedrock/us-west-2/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
|
569
|
+
| bedrock/ap-northeast-1/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
|
570
|
+
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
571
|
+
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
572
|
+
| bedrock/eu-central-1/anthropic.claude-v1 | $8 | $24 | 100,000 | 8191 |
|
573
|
+
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
574
|
+
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
575
|
+
| bedrock/us-east-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
576
|
+
| bedrock/us-east-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
577
|
+
| bedrock/us-west-2/1-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
578
|
+
| bedrock/us-west-2/6-month-commitment/anthropic.claude-v1 | -- | -- | 100,000 | 8191 |
|
579
|
+
| anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
|
580
|
+
| bedrock/us-east-1/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
|
581
|
+
| bedrock/us-west-2/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
|
582
|
+
| bedrock/ap-northeast-1/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
|
583
|
+
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
584
|
+
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
585
|
+
| bedrock/eu-central-1/anthropic.claude-v2 | $8 | $24 | 100,000 | 8191 |
|
586
|
+
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
587
|
+
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
588
|
+
| bedrock/us-east-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
589
|
+
| bedrock/us-east-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
590
|
+
| bedrock/us-west-2/1-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
591
|
+
| bedrock/us-west-2/6-month-commitment/anthropic.claude-v2 | -- | -- | 100,000 | 8191 |
|
592
|
+
| anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
|
593
|
+
| bedrock/us-east-1/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
|
594
|
+
| bedrock/us-west-2/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
|
595
|
+
| bedrock/ap-northeast-1/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
|
596
|
+
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
597
|
+
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
598
|
+
| bedrock/eu-central-1/anthropic.claude-v2:1 | $8 | $24 | 100,000 | 8191 |
|
599
|
+
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
600
|
+
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
601
|
+
| bedrock/us-east-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
602
|
+
| bedrock/us-east-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
603
|
+
| bedrock/us-west-2/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
604
|
+
| bedrock/us-west-2/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100,000 | 8191 |
|
605
|
+
| anthropic.claude-instant-v1 | $0.8 | $2.4 | 100,000 | 8191 |
|
606
|
+
| bedrock/us-east-1/anthropic.claude-instant-v1 | $0.8 | $2.4 | 100,000 | 8191 |
|
607
|
+
| bedrock/us-east-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
608
|
+
| bedrock/us-east-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
609
|
+
| bedrock/us-west-2/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
610
|
+
| bedrock/us-west-2/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
611
|
+
| bedrock/us-west-2/anthropic.claude-instant-v1 | $0.8 | $2.4 | 100,000 | 8191 |
|
612
|
+
| bedrock/ap-northeast-1/anthropic.claude-instant-v1 | $2.23 | $7.55 | 100,000 | 8191 |
|
613
|
+
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
614
|
+
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
615
|
+
| bedrock/eu-central-1/anthropic.claude-instant-v1 | $2.48 | $8.38 | 100,000 | 8191 |
|
616
|
+
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
617
|
+
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100,000 | 8191 |
|
618
|
+
| cohere.command-text-v14 | $1.5 | $2 | 4,096 | 4096 |
|
619
|
+
| bedrock/*/1-month-commitment/cohere.command-text-v14 | -- | -- | 4,096 | 4096 |
|
620
|
+
| bedrock/*/6-month-commitment/cohere.command-text-v14 | -- | -- | 4,096 | 4096 |
|
621
|
+
| cohere.command-light-text-v14 | $0.3 | $0.6 | 4,096 | 4096 |
|
622
|
+
| bedrock/*/1-month-commitment/cohere.command-light-text-v14 | -- | -- | 4,096 | 4096 |
|
623
|
+
| bedrock/*/6-month-commitment/cohere.command-light-text-v14 | -- | -- | 4,096 | 4096 |
|
624
|
+
| cohere.command-r-plus-v1:0 | $3 | $15 | 128,000 | 4096 |
|
625
|
+
| cohere.command-r-v1:0 | $0.5 | $1.5 | 128,000 | 4096 |
|
626
|
+
| cohere.embed-english-v3 | $0.1 | $0 | 512 | nan |
|
627
|
+
| cohere.embed-multilingual-v3 | $0.1 | $0 | 512 | nan |
|
628
|
+
| meta.llama2-13b-chat-v1 | $0.75 | $1 | 4,096 | 4096 |
|
629
|
+
| meta.llama2-70b-chat-v1 | $1.95 | $2.56 | 4,096 | 4096 |
|
630
|
+
| meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8,192 | 8192 |
|
631
|
+
| bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8,192 | 8192 |
|
632
|
+
| bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8,192 | 8192 |
|
633
|
+
| bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | $0.36 | $0.72 | 8,192 | 8192 |
|
634
|
+
| bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | $0.35 | $0.69 | 8,192 | 8192 |
|
635
|
+
| bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | $0.32 | $0.65 | 8,192 | 8192 |
|
636
|
+
| bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | $0.39 | $0.78 | 8,192 | 8192 |
|
637
|
+
| bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | $0.5 | $1.01 | 8,192 | 8192 |
|
638
|
+
| meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8,192 | 8192 |
|
639
|
+
| bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8,192 | 8192 |
|
640
|
+
| bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8,192 | 8192 |
|
641
|
+
| bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 | $3.18 | $4.2 | 8,192 | 8192 |
|
642
|
+
| bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 | $3.05 | $4.03 | 8,192 | 8192 |
|
643
|
+
| bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 | $2.86 | $3.78 | 8,192 | 8192 |
|
644
|
+
| bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 | $3.45 | $4.55 | 8,192 | 8192 |
|
645
|
+
| bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 | $4.45 | $5.88 | 8,192 | 8192 |
|
646
|
+
| meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | 128,000 | 2048 |
|
647
|
+
| meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | 128,000 | 2048 |
|
648
|
+
| meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16 | 128,000 | 4096 |
|
649
|
+
| meta.llama3-2-1b-instruct-v1:0 | $0.1 | $0.1 | 128,000 | 4096 |
|
650
|
+
| us.meta.llama3-2-1b-instruct-v1:0 | $0.1 | $0.1 | 128,000 | 4096 |
|
651
|
+
| eu.meta.llama3-2-1b-instruct-v1:0 | $0.13 | $0.13 | 128,000 | 4096 |
|
652
|
+
| meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | 128,000 | 4096 |
|
653
|
+
| us.meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | 128,000 | 4096 |
|
654
|
+
| eu.meta.llama3-2-3b-instruct-v1:0 | $0.19 | $0.19 | 128,000 | 4096 |
|
655
|
+
| meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | 128,000 | 4096 |
|
656
|
+
| us.meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | 128,000 | 4096 |
|
657
|
+
| meta.llama3-2-90b-instruct-v1:0 | $2 | $2 | 128,000 | 4096 |
|
658
|
+
| us.meta.llama3-2-90b-instruct-v1:0 | $2 | $2 | 128,000 | 4096 |
|
659
|
+
| 512-x-512/50-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
|
660
|
+
| 512-x-512/max-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
|
661
|
+
| max-x-max/50-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
|
662
|
+
| max-x-max/max-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
|
663
|
+
| 1024-x-1024/50-steps/stability.stable-diffusion-xl-v1 | -- | -- | 77 | nan |
|
664
|
+
| 1024-x-1024/max-steps/stability.stable-diffusion-xl-v1 | -- | -- | 77 | nan |
|
665
|
+
| sagemaker/meta-textgeneration-llama-2-7b | $0 | $0 | 4,096 | 4096 |
|
666
|
+
| sagemaker/meta-textgeneration-llama-2-7b-f | $0 | $0 | 4,096 | 4096 |
|
667
|
+
| sagemaker/meta-textgeneration-llama-2-13b | $0 | $0 | 4,096 | 4096 |
|
668
|
+
| sagemaker/meta-textgeneration-llama-2-13b-f | $0 | $0 | 4,096 | 4096 |
|
669
|
+
| sagemaker/meta-textgeneration-llama-2-70b | $0 | $0 | 4,096 | 4096 |
|
670
|
+
| sagemaker/meta-textgeneration-llama-2-70b-b-f | $0 | $0 | 4,096 | 4096 |
|
671
|
+
| together-ai-up-to-4b | $0.1 | $0.1 | nan | nan |
|
672
|
+
| together-ai-4.1b-8b | $0.2 | $0.2 | nan | nan |
|
673
|
+
| together-ai-8.1b-21b | $0.3 | $0.3 | nan | nan |
|
674
|
+
| together-ai-21.1b-41b | $0.8 | $0.8 | nan | nan |
|
675
|
+
| together-ai-41.1b-80b | $0.9 | $0.9 | nan | nan |
|
676
|
+
| together-ai-81.1b-110b | $1.8 | $1.8 | nan | nan |
|
677
|
+
| together-ai-embedding-up-to-150m | $0.01 | $0 | nan | nan |
|
678
|
+
| together-ai-embedding-151m-to-350m | $0.02 | $0 | nan | nan |
|
679
|
+
| together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.6 | $0.6 | nan | nan |
|
680
|
+
| together_ai/mistralai/Mistral-7B-Instruct-v0.1 | -- | -- | nan | nan |
|
681
|
+
| together_ai/togethercomputer/CodeLlama-34b-Instruct | -- | -- | nan | nan |
|
682
|
+
| ollama/codegemma | $0 | $0 | 8,192 | 8192 |
|
683
|
+
| ollama/codegeex4 | $0 | $0 | 32,768 | 8192 |
|
684
|
+
| ollama/deepseek-coder-v2-instruct | $0 | $0 | 32,768 | 8192 |
|
685
|
+
| ollama/deepseek-coder-v2-base | $0 | $0 | 8,192 | 8192 |
|
686
|
+
| ollama/deepseek-coder-v2-lite-instruct | $0 | $0 | 32,768 | 8192 |
|
687
|
+
| ollama/deepseek-coder-v2-lite-base | $0 | $0 | 8,192 | 8192 |
|
688
|
+
| ollama/internlm2_5-20b-chat | $0 | $0 | 32,768 | 8192 |
|
689
|
+
| ollama/llama2 | $0 | $0 | 4,096 | 4096 |
|
690
|
+
| ollama/llama2:7b | $0 | $0 | 4,096 | 4096 |
|
691
|
+
| ollama/llama2:13b | $0 | $0 | 4,096 | 4096 |
|
692
|
+
| ollama/llama2:70b | $0 | $0 | 4,096 | 4096 |
|
693
|
+
| ollama/llama2-uncensored | $0 | $0 | 4,096 | 4096 |
|
694
|
+
| ollama/llama3 | $0 | $0 | 8,192 | 8192 |
|
695
|
+
| ollama/llama3:8b | $0 | $0 | 8,192 | 8192 |
|
696
|
+
| ollama/llama3:70b | $0 | $0 | 8,192 | 8192 |
|
697
|
+
| ollama/llama3.1 | $0 | $0 | 8,192 | 8192 |
|
698
|
+
| ollama/mistral-large-instruct-2407 | $0 | $0 | 65,536 | 8192 |
|
699
|
+
| ollama/mistral | $0 | $0 | 8,192 | 8192 |
|
700
|
+
| ollama/mistral-7B-Instruct-v0.1 | $0 | $0 | 8,192 | 8192 |
|
701
|
+
| ollama/mistral-7B-Instruct-v0.2 | $0 | $0 | 32,768 | 32768 |
|
702
|
+
| ollama/mixtral-8x7B-Instruct-v0.1 | $0 | $0 | 32,768 | 32768 |
|
703
|
+
| ollama/mixtral-8x22B-Instruct-v0.1 | $0 | $0 | 65,536 | 65536 |
|
704
|
+
| ollama/codellama | $0 | $0 | 4,096 | 4096 |
|
705
|
+
| ollama/orca-mini | $0 | $0 | 4,096 | 4096 |
|
706
|
+
| ollama/vicuna | $0 | $0 | 2,048 | 2048 |
|
707
|
+
| deepinfra/lizpreciatior/lzlv_70b_fp16_hf | $0.7 | $0.9 | 4,096 | 4096 |
|
708
|
+
| deepinfra/Gryphe/MythoMax-L2-13b | $0.22 | $0.22 | 4,096 | 4096 |
|
709
|
+
| deepinfra/mistralai/Mistral-7B-Instruct-v0.1 | $0.13 | $0.13 | 32,768 | 8191 |
|
710
|
+
| deepinfra/meta-llama/Llama-2-70b-chat-hf | $0.7 | $0.9 | 4,096 | 4096 |
|
711
|
+
| deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b | $0.27 | $0.27 | 32,768 | 8191 |
|
712
|
+
| deepinfra/codellama/CodeLlama-34b-Instruct-hf | $0.6 | $0.6 | 4,096 | 4096 |
|
713
|
+
| deepinfra/deepinfra/mixtral | $0.27 | $0.27 | 32,000 | 4096 |
|
714
|
+
| deepinfra/Phind/Phind-CodeLlama-34B-v2 | $0.6 | $0.6 | 16,384 | 4096 |
|
715
|
+
| deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.27 | $0.27 | 32,768 | 8191 |
|
716
|
+
| deepinfra/deepinfra/airoboros-70b | $0.7 | $0.9 | 4,096 | 4096 |
|
717
|
+
| deepinfra/01-ai/Yi-34B-Chat | $0.6 | $0.6 | 4,096 | 4096 |
|
718
|
+
| deepinfra/01-ai/Yi-6B-200K | $0.13 | $0.13 | 200,000 | 4096 |
|
719
|
+
| deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 | $0.7 | $0.9 | 4,096 | 4096 |
|
720
|
+
| deepinfra/meta-llama/Llama-2-13b-chat-hf | $0.22 | $0.22 | 4,096 | 4096 |
|
721
|
+
| deepinfra/amazon/MistralLite | $0.2 | $0.2 | 32,768 | 8191 |
|
722
|
+
| deepinfra/meta-llama/Llama-2-7b-chat-hf | $0.13 | $0.13 | 4,096 | 4096 |
|
723
|
+
| deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | $0.08 | $0.08 | 8,191 | 4096 |
|
724
|
+
| deepinfra/meta-llama/Meta-Llama-3-70B-Instruct | $0.59 | $0.79 | 8,191 | 4096 |
|
725
|
+
| deepinfra/01-ai/Yi-34B-200K | $0.6 | $0.6 | 200,000 | 4096 |
|
726
|
+
| deepinfra/openchat/openchat_3.5 | $0.13 | $0.13 | 4,096 | 4096 |
|
727
|
+
| perplexity/codellama-34b-instruct | $0.35 | $1.4 | 16,384 | 16384 |
|
728
|
+
| perplexity/codellama-70b-instruct | $0.7 | $2.8 | 16,384 | 16384 |
|
729
|
+
| perplexity/llama-3.1-70b-instruct | $1 | $1 | 131,072 | 131072 |
|
730
|
+
| perplexity/llama-3.1-8b-instruct | $0.2 | $0.2 | 131,072 | 131072 |
|
731
|
+
| perplexity/llama-3.1-sonar-huge-128k-online | $5 | $5 | 127,072 | 127072 |
|
732
|
+
| perplexity/llama-3.1-sonar-large-128k-online | $1 | $1 | 127,072 | 127072 |
|
733
|
+
| perplexity/llama-3.1-sonar-large-128k-chat | $1 | $1 | 131,072 | 131072 |
|
734
|
+
| perplexity/llama-3.1-sonar-small-128k-chat | $0.2 | $0.2 | 131,072 | 131072 |
|
735
|
+
| perplexity/llama-3.1-sonar-small-128k-online | $0.2 | $0.2 | 127,072 | 127072 |
|
736
|
+
| perplexity/pplx-7b-chat | $0.07 | $0.28 | 8,192 | 8192 |
|
737
|
+
| perplexity/pplx-70b-chat | $0.7 | $2.8 | 4,096 | 4096 |
|
738
|
+
| perplexity/pplx-7b-online | $0 | $0.28 | 4,096 | 4096 |
|
739
|
+
| perplexity/pplx-70b-online | $0 | $2.8 | 4,096 | 4096 |
|
740
|
+
| perplexity/llama-2-70b-chat | $0.7 | $2.8 | 4,096 | 4096 |
|
741
|
+
| perplexity/mistral-7b-instruct | $0.07 | $0.28 | 4,096 | 4096 |
|
742
|
+
| perplexity/mixtral-8x7b-instruct | $0.07 | $0.28 | 4,096 | 4096 |
|
743
|
+
| perplexity/sonar-small-chat | $0.07 | $0.28 | 16,384 | 16384 |
|
744
|
+
| perplexity/sonar-small-online | $0 | $0.28 | 12,000 | 12000 |
|
745
|
+
| perplexity/sonar-medium-chat | $0.6 | $1.8 | 16,384 | 16384 |
|
746
|
+
| perplexity/sonar-medium-online | $0 | $1.8 | 12,000 | 12000 |
|
747
|
+
| fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct | $0.1 | $0.1 | 16,384 | 16384 |
|
748
|
+
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | $0.1 | $0.1 | 16,384 | 16384 |
|
749
|
+
| fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct | $0.2 | $0.2 | 16,384 | 16384 |
|
750
|
+
| accounts/fireworks/models/llama-v3p2-90b-vision-instruct | $0.9 | $0.9 | 16,384 | 16384 |
|
751
|
+
| fireworks_ai/accounts/fireworks/models/firefunction-v2 | $0.9 | $0.9 | 8,192 | 8192 |
|
752
|
+
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf | $1.2 | $1.2 | 65,536 | 65536 |
|
753
|
+
| fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct | $0.9 | $0.9 | 32,768 | 32768 |
|
754
|
+
| fireworks_ai/accounts/fireworks/models/yi-large | $3 | $3 | 32,768 | 32768 |
|
755
|
+
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct | $1.2 | $1.2 | 65,536 | 65536 |
|
756
|
+
| fireworks_ai/nomic-ai/nomic-embed-text-v1.5 | $0.01 | $0 | 8,192 | nan |
|
757
|
+
| fireworks_ai/nomic-ai/nomic-embed-text-v1 | $0.01 | $0 | 8,192 | nan |
|
758
|
+
| fireworks_ai/WhereIsAI/UAE-Large-V1 | $0.02 | $0 | 512 | nan |
|
759
|
+
| fireworks_ai/thenlper/gte-large | $0.02 | $0 | 512 | nan |
|
760
|
+
| fireworks_ai/thenlper/gte-base | $0.01 | $0 | 512 | nan |
|
761
|
+
| fireworks-ai-up-to-16b | $0.2 | $0.2 | nan | nan |
|
762
|
+
| fireworks-ai-16.1b-to-80b | $0.9 | $0.9 | nan | nan |
|
763
|
+
| fireworks-ai-moe-up-to-56b | $0.5 | $0.5 | nan | nan |
|
764
|
+
| fireworks-ai-56b-to-176b | $1.2 | $1.2 | nan | nan |
|
765
|
+
| fireworks-ai-default | $0 | $0 | nan | nan |
|
766
|
+
| fireworks-ai-embedding-up-to-150m | $0.01 | $0 | nan | nan |
|
767
|
+
| fireworks-ai-embedding-150m-to-350m | $0.02 | $0 | nan | nan |
|
768
|
+
| anyscale/mistralai/Mistral-7B-Instruct-v0.1 | $0.15 | $0.15 | 16,384 | 16384 |
|
769
|
+
| anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.15 | $0.15 | 16,384 | 16384 |
|
770
|
+
| anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 | $0.9 | $0.9 | 65,536 | 65536 |
|
771
|
+
| anyscale/HuggingFaceH4/zephyr-7b-beta | $0.15 | $0.15 | 16,384 | 16384 |
|
772
|
+
| anyscale/google/gemma-7b-it | $0.15 | $0.15 | 8,192 | 8192 |
|
773
|
+
| anyscale/meta-llama/Llama-2-7b-chat-hf | $0.15 | $0.15 | 4,096 | 4096 |
|
774
|
+
| anyscale/meta-llama/Llama-2-13b-chat-hf | $0.25 | $0.25 | 4,096 | 4096 |
|
775
|
+
| anyscale/meta-llama/Llama-2-70b-chat-hf | $1 | $1 | 4,096 | 4096 |
|
776
|
+
| anyscale/codellama/CodeLlama-34b-Instruct-hf | $1 | $1 | 4,096 | 4096 |
|
777
|
+
| anyscale/codellama/CodeLlama-70b-Instruct-hf | $1 | $1 | 4,096 | 4096 |
|
778
|
+
| anyscale/meta-llama/Meta-Llama-3-8B-Instruct | $0.15 | $0.15 | 8,192 | 8192 |
|
779
|
+
| anyscale/meta-llama/Meta-Llama-3-70B-Instruct | $1 | $1 | 8,192 | 8192 |
|
780
|
+
| cloudflare/@cf/meta/llama-2-7b-chat-fp16 | $1.92 | $1.92 | 3,072 | 3072 |
|
781
|
+
| cloudflare/@cf/meta/llama-2-7b-chat-int8 | $1.92 | $1.92 | 2,048 | 2048 |
|
782
|
+
| cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 | $1.92 | $1.92 | 8,192 | 8192 |
|
783
|
+
| cloudflare/@hf/thebloke/codellama-7b-instruct-awq | $1.92 | $1.92 | 4,096 | 4096 |
|
784
|
+
| voyage/voyage-01 | $0.1 | $0 | 4,096 | nan |
|
785
|
+
| voyage/voyage-lite-01 | $0.1 | $0 | 4,096 | nan |
|
786
|
+
| voyage/voyage-large-2 | $0.12 | $0 | 16,000 | nan |
|
787
|
+
| voyage/voyage-law-2 | $0.12 | $0 | 16,000 | nan |
|
788
|
+
| voyage/voyage-code-2 | $0.12 | $0 | 16,000 | nan |
|
789
|
+
| voyage/voyage-2 | $0.1 | $0 | 4,000 | nan |
|
790
|
+
| voyage/voyage-lite-02-instruct | $0.1 | $0 | 4,000 | nan |
|
791
|
+
| voyage/voyage-finance-2 | $0.12 | $0 | 32,000 | nan |
|
792
|
+
| databricks/databricks-meta-llama-3-1-405b-instruct | $5 | $15 | 128,000 | 128000 |
|
793
|
+
| databricks/databricks-meta-llama-3-1-70b-instruct | $1 | $3 | 128,000 | 128000 |
|
794
|
+
| databricks/databricks-dbrx-instruct | $0.75 | $2.25 | 32,768 | 32768 |
|
795
|
+
| databricks/databricks-meta-llama-3-70b-instruct | $1 | $3 | 128,000 | 128000 |
|
796
|
+
| databricks/databricks-llama-2-70b-chat | $0.5 | $1.5 | 4,096 | 4096 |
|
797
|
+
| databricks/databricks-mixtral-8x7b-instruct | $0.5 | $1 | 4,096 | 4096 |
|
798
|
+
| databricks/databricks-mpt-30b-instruct | $1 | $1 | 8,192 | 8192 |
|
799
|
+
| databricks/databricks-mpt-7b-instruct | $0.5 | $0 | 8,192 | 8192 |
|
800
|
+
| databricks/databricks-bge-large-en | $0.1 | $0 | 512 | nan |
|
801
|
+
| databricks/databricks-gte-large-en | $0.13 | $0 | 8,192 | nan |
|
802
|
+
| azure/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128,000 | 16384 |
|
803
|
+
| amazon.titan-embed-image-v1 | $0.8 | $0 | 128 | nan |
|
804
|
+
| azure_ai/mistral-large-2407 | $2 | $6 | 128,000 | 4096 |
|
805
|
+
| azure_ai/ministral-3b | $0.04 | $0.04 | 128,000 | 4096 |
|
806
|
+
| azure_ai/Llama-3.2-11B-Vision-Instruct | $0.37 | $0.37 | 128,000 | 2048 |
|
807
|
+
| azure_ai/Llama-3.2-90B-Vision-Instruct | $2.04 | $2.04 | 128,000 | 2048 |
|
808
|
+
| azure_ai/Phi-3.5-mini-instruct | $0.13 | $0.52 | 128,000 | 4096 |
|
809
|
+
| azure_ai/Phi-3.5-vision-instruct | $0.13 | $0.52 | 128,000 | 4096 |
|
810
|
+
| azure_ai/Phi-3.5-MoE-instruct | $0.16 | $0.64 | 128,000 | 4096 |
|
811
|
+
| azure_ai/Phi-3-mini-4k-instruct | $0.13 | $0.52 | 4,096 | 4096 |
|
812
|
+
| azure_ai/Phi-3-mini-128k-instruct | $0.13 | $0.52 | 128,000 | 4096 |
|
813
|
+
| azure_ai/Phi-3-small-8k-instruct | $0.15 | $0.6 | 8,192 | 4096 |
|
814
|
+
| azure_ai/Phi-3-small-128k-instruct | $0.15 | $0.6 | 128,000 | 4096 |
|
815
|
+
| azure_ai/Phi-3-medium-4k-instruct | $0.17 | $0.68 | 4,096 | 4096 |
|
816
|
+
| azure_ai/Phi-3-medium-128k-instruct | $0.17 | $0.68 | 128,000 | 4096 |
|
817
|
+
| xai/grok-beta | $5 | $15 | 131,072 | 131072 |
|
818
|
+
| claude-3-5-haiku-20241022 | $0.8 | $4 | 200,000 | 8192 |
|
819
|
+
| vertex_ai/claude-3-5-haiku@20241022 | $1 | $5 | 200,000 | 8192 |
|
820
|
+
| openrouter/anthropic/claude-3-5-haiku | $1 | $5 | nan | nan |
|
821
|
+
| openrouter/anthropic/claude-3-5-haiku-20241022 | $1 | $5 | 200,000 | 8192 |
|
822
|
+
| anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200,000 | 8192 |
|
823
|
+
| us.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200,000 | 8192 |
|
824
|
+
| eu.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.25 | $1.25 | 200,000 | 8192 |
|
825
|
+
| stability.sd3-large-v1:0 | -- | -- | 77 | nan |
|
826
|
+
| gpt-4o-2024-11-20 | $2.5 | $10 | 128,000 | 16384 |
|
827
|
+
| ft:gpt-4o-2024-11-20 | $3.75 | $15 | 128,000 | 16384 |
|
828
|
+
| azure/gpt-4o-2024-11-20 | $2.75 | $11 | 128,000 | 16384 |
|
829
|
+
| azure/global-standard/gpt-4o-2024-11-20 | $2.5 | $10 | 128,000 | 16384 |
|
830
|
+
| groq/llama-3.2-1b-preview | $0.04 | $0.04 | 8,192 | 8192 |
|
831
|
+
| groq/llama-3.2-3b-preview | $0.06 | $0.06 | 8,192 | 8192 |
|
832
|
+
| groq/llama-3.2-11b-text-preview | $0.18 | $0.18 | 8,192 | 8192 |
|
833
|
+
| groq/llama-3.2-11b-vision-preview | $0.18 | $0.18 | 8,192 | 8192 |
|
834
|
+
| groq/llama-3.2-90b-text-preview | $0.9 | $0.9 | 8,192 | 8192 |
|
835
|
+
| groq/llama-3.2-90b-vision-preview | $0.9 | $0.9 | 8,192 | 8192 |
|
836
|
+
| vertex_ai/claude-3-sonnet | $3 | $15 | 200,000 | 4096 |
|
837
|
+
| vertex_ai/claude-3-5-sonnet | $3 | $15 | 200,000 | 8192 |
|
838
|
+
| vertex_ai/claude-3-5-sonnet-v2 | $3 | $15 | 200,000 | 8192 |
|
839
|
+
| vertex_ai/claude-3-haiku | $0.25 | $1.25 | 200,000 | 4096 |
|
840
|
+
| vertex_ai/claude-3-5-haiku | $1 | $5 | 200,000 | 8192 |
|
841
|
+
| vertex_ai/claude-3-opus | $15 | $75 | 200,000 | 4096 |
|
842
|
+
| gemini/gemini-exp-1114 | $0 | $0 | 1,048,576 | 8192 |
|
843
|
+
| openrouter/qwen/qwen-2.5-coder-32b-instruct | $0.18 | $0.18 | 33,792 | 33792 |
|
844
|
+
| us.meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | 128,000 | 2048 |
|
845
|
+
| us.meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | 128,000 | 2048 |
|
846
|
+
| us.meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16 | 128,000 | 4096 |
|
847
|
+
| stability.stable-image-ultra-v1:0 | -- | -- | 77 | nan |
|
848
|
+
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct | $0.9 | $0.9 | 4,096 | 4096 |
|
849
|
+
| omni-moderation-latest | $0 | $0 | 32,768 | 0 |
|
850
|
+
| omni-moderation-latest-intents | $0 | $0 | 32,768 | 0 |
|
851
|
+
| omni-moderation-2024-09-26 | $0 | $0 | 32,768 | 0 |
|
852
|
+
| gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128,000 | 16384 |
|
853
|
+
| gpt-4o-mini-audio-preview-2024-12-17 | $0.15 | $0.6 | 128,000 | 16384 |
|
854
|
+
| o1 | $15 | $60 | 200,000 | 100000 |
|
855
|
+
| o1-2024-12-17 | $15 | $60 | 200,000 | 100000 |
|
856
|
+
| gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128,000 | 4096 |
|
857
|
+
| gpt-4o-realtime-preview | $5 | $20 | 128,000 | 4096 |
|
858
|
+
| gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128,000 | 4096 |
|
859
|
+
| gpt-4o-mini-realtime-preview | $0.6 | $2.4 | 128,000 | 4096 |
|
860
|
+
| gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128,000 | 4096 |
|
861
|
+
| azure/o1 | $15 | $60 | 200,000 | 100000 |
|
862
|
+
| azure_ai/Llama-3.3-70B-Instruct | $0.71 | $0.71 | 128,000 | 2048 |
|
863
|
+
| mistral/mistral-large-2411 | $2 | $6 | 128,000 | 128000 |
|
864
|
+
| mistral/pixtral-large-latest | $2 | $6 | 128,000 | 128000 |
|
865
|
+
| mistral/pixtral-large-2411 | $2 | $6 | 128,000 | 128000 |
|
866
|
+
| deepseek/deepseek-chat | $0.27 | $1.1 | 65,536 | 8192 |
|
867
|
+
| deepseek/deepseek-coder | $0.14 | $0.28 | 128,000 | 4096 |
|
868
|
+
| groq/llama-3.3-70b-versatile | $0.59 | $0.79 | 128,000 | 8192 |
|
869
|
+
| groq/llama-3.3-70b-specdec | $0.59 | $0.99 | 8,192 | 8192 |
|
870
|
+
| friendliai/meta-llama-3.1-8b-instruct | $0.1 | $0.1 | 8,192 | 8192 |
|
871
|
+
| friendliai/meta-llama-3.1-70b-instruct | $0.6 | $0.6 | 8,192 | 8192 |
|
872
|
+
| gemini-2.0-flash-exp | $0.15 | $0.6 | 1,048,576 | 8192 |
|
873
|
+
| gemini/gemini-2.0-flash-exp | $0 | $0 | 1,048,576 | 8192 |
|
874
|
+
| vertex_ai/mistral-large@2411-001 | $2 | $6 | 128,000 | 8191 |
|
875
|
+
| vertex_ai/mistral-large-2411 | $2 | $6 | 128,000 | 8191 |
|
876
|
+
| text-embedding-005 | $0.1 | $0 | 2,048 | nan |
|
877
|
+
| gemini/gemini-1.5-flash-8b | $0 | $0 | 1,048,576 | 8192 |
|
878
|
+
| gemini/gemini-exp-1206 | $0 | $0 | 2,097,152 | 8192 |
|
879
|
+
| command-r7b-12-2024 | $0.15 | $0.04 | 128,000 | 4096 |
|
880
|
+
| rerank-v3.5 | $0 | $0 | 4,096 | 4096 |
|
881
|
+
| openrouter/deepseek/deepseek-chat | $0.14 | $0.28 | 65,536 | 8192 |
|
882
|
+
| openrouter/openai/o1 | $15 | $60 | 200,000 | 100000 |
|
883
|
+
| amazon.nova-micro-v1:0 | $0.04 | $0.14 | 300,000 | 4096 |
|
884
|
+
| amazon.nova-lite-v1:0 | $0.06 | $0.24 | 128,000 | 4096 |
|
885
|
+
| amazon.nova-pro-v1:0 | $0.8 | $3.2 | 300,000 | 4096 |
|
886
|
+
| meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | 128,000 | 4096 |
|
887
|
+
| together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.18 | $0.18 | nan | nan |
|
888
|
+
| together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.88 | $0.88 | nan | nan |
|
889
|
+
| together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | $3.5 | $3.5 | nan | nan |
|
890
|
+
| deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct | $0.9 | $0.9 | 32,768 | 32768 |
|
891
|
+
| fireworks_ai/accounts/fireworks/models/deepseek-v3 | $0.9 | $0.9 | 128,000 | 8192 |
|
892
|
+
| voyage/voyage-3-large | $0.18 | $0 | 32,000 | nan |
|
893
|
+
| voyage/voyage-3 | $0.06 | $0 | 32,000 | nan |
|
894
|
+
| voyage/voyage-3-lite | $0.02 | $0 | 32,000 | nan |
|
895
|
+
| voyage/voyage-code-3 | $0.18 | $0 | 32,000 | nan |
|
896
|
+
| voyage/voyage-multimodal-3 | $0.12 | $0 | 32,000 | nan |
|
897
|
+
| voyage/rerank-2 | $0.05 | $0 | 16,000 | 16000 |
|
898
|
+
| voyage/rerank-2-lite | $0.02 | $0 | 8,000 | 8000 |
|
899
|
+
| databricks/meta-llama-3.3-70b-instruct | $1 | $3 | 128,000 | 128000 |
|
900
|
+
| sambanova/Meta-Llama-3.1-8B-Instruct | $0.1 | $0.2 | 16,000 | 16000 |
|
901
|
+
| sambanova/Meta-Llama-3.1-70B-Instruct | $0.6 | $1.2 | 128,000 | 128000 |
|
902
|
+
| sambanova/Meta-Llama-3.1-405B-Instruct | $5 | $10 | 16,000 | 16000 |
|
903
|
+
| sambanova/Meta-Llama-3.2-1B-Instruct | $0.4 | $0.8 | 16,000 | 16000 |
|
904
|
+
| sambanova/Meta-Llama-3.2-3B-Instruct | $0.8 | $1.6 | 4,000 | 4000 |
|
905
|
+
| sambanova/Meta-Llama-3.3-70B-Instruct | $0.6 | $1.2 | 128,000 | 128000 |
|
906
|
+
| sambanova/Qwen2.5-Coder-32B-Instruct | $1.5 | $3 | 8,000 | 8000 |
|
907
|
+
| sambanova/Qwen2.5-72B-Instruct | $2 | $4 | 8,000 | 8000 |
|
908
|
+
| o3-mini | $1.1 | $4.4 | 200,000 | 100000 |
|
909
|
+
| o3-mini-2025-01-31 | $1.1 | $4.4 | 200,000 | 100000 |
|
910
|
+
| azure/o3-mini-2025-01-31 | $1.1 | $4.4 | 200,000 | 100000 |
|
911
|
+
| azure/o3-mini | $1.1 | $4.4 | 200,000 | 100000 |
|
912
|
+
| azure/o1-2024-12-17 | $15 | $60 | 200,000 | 100000 |
|
913
|
+
| azure_ai/deepseek-r1 | $1.35 | $5.4 | 128,000 | 8192 |
|
914
|
+
| deepseek/deepseek-reasoner | $0.55 | $2.19 | 65,536 | 8192 |
|
915
|
+
| xai/grok-2-vision-1212 | $2 | $10 | 32,768 | 32768 |
|
916
|
+
| xai/grok-2-vision-latest | $2 | $10 | 32,768 | 32768 |
|
917
|
+
| xai/grok-2-vision | $2 | $10 | 32,768 | 32768 |
|
918
|
+
| xai/grok-vision-beta | $5 | $15 | 8,192 | 8192 |
|
919
|
+
| xai/grok-2-1212 | $2 | $10 | 131,072 | 131072 |
|
920
|
+
| xai/grok-2 | $2 | $10 | 131,072 | 131072 |
|
921
|
+
| xai/grok-2-latest | $2 | $10 | 131,072 | 131072 |
|
922
|
+
| groq/deepseek-r1-distill-llama-70b | $0.75 | $0.99 | 131,072 | 131072 |
|
923
|
+
| gemini/gemini-2.0-flash | $0.1 | $0.4 | 1,048,576 | 8192 |
|
924
|
+
| gemini-2.0-flash-001 | $0.15 | $0.6 | 1,048,576 | 8192 |
|
925
|
+
| gemini-2.0-flash-thinking-exp | $0 | $0 | 1,048,576 | 8192 |
|
926
|
+
| gemini-2.0-flash-thinking-exp-01-21 | $0 | $0 | 1,048,576 | 65536 |
|
927
|
+
| gemini/gemini-2.0-flash-001 | $0.1 | $0.4 | 1,048,576 | 8192 |
|
928
|
+
| gemini/gemini-2.0-flash-lite-preview-02-05 | $0.08 | $0.3 | 1,048,576 | 8192 |
|
929
|
+
| gemini/gemini-2.0-flash-thinking-exp | $0 | $0 | 1,048,576 | 65536 |
|
930
|
+
| vertex_ai/codestral-2501 | $0.2 | $0.6 | 128,000 | 128000 |
|
931
|
+
| openrouter/deepseek/deepseek-r1 | $0.55 | $2.19 | 65,336 | 8192 |
|
932
|
+
| ai21.jamba-1-5-large-v1:0 | $2 | $8 | 256,000 | 256000 |
|
933
|
+
| ai21.jamba-1-5-mini-v1:0 | $0.2 | $0.4 | 256,000 | 256000 |
|
934
|
+
| us.amazon.nova-micro-v1:0 | $0.04 | $0.14 | 300,000 | 4096 |
|
935
|
+
| us.amazon.nova-lite-v1:0 | $0.06 | $0.24 | 128,000 | 4096 |
|
936
|
+
| us.amazon.nova-pro-v1:0 | $0.8 | $3.2 | 300,000 | 4096 |
|
937
|
+
| stability.sd3-5-large-v1:0 | -- | -- | 77 | nan |
|
938
|
+
| stability.stable-image-core-v1:0 | -- | -- | 77 | nan |
|
939
|
+
| stability.stable-image-core-v1:1 | -- | -- | 77 | nan |
|
940
|
+
| stability.stable-image-ultra-v1:1 | -- | -- | 77 | nan |
|
941
|
+
| together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.88 | $0.88 | nan | nan |
|
942
|
+
| together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | $0 | $0 | nan | nan |
|
943
|
+
| fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | $0.1 | $0.1 | 16,384 | 16384 |
|
944
|
+
| assemblyai/nano | -- | -- | nan | nan |
|
945
|
+
| assemblyai/best | -- | -- | nan | nan |
|
946
|
+
| azure/gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16,384 | 4096 |
|
947
|
+
| azure/gpt-3.5-turbo | $0.5 | $1.5 | 4,097 | 4096 |
|
948
|
+
| gemini-2.0-pro-exp-02-05 | $1.25 | $10 | 2,097,152 | 8192 |
|
949
|
+
| us.meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | 128,000 | 4096 |
|
950
|
+
| perplexity/sonar | $1 | $1 | 128,000 | nan |
|
951
|
+
| perplexity/sonar-pro | $3 | $15 | 200,000 | 8000 |
|
952
|
+
| openrouter/google/gemini-2.0-flash-001 | $0.1 | $0.4 | 1,048,576 | 8192 |
|
953
|
+
| gpt-4.5-preview | $75 | $150 | 128,000 | 16384 |
|
954
|
+
| gpt-4.5-preview-2025-02-27 | $75 | $150 | 128,000 | 16384 |
|
955
|
+
| azure_ai/Phi-4 | $0.12 | $0.5 | 16,384 | 16384 |
|
956
|
+
| cerebras/llama3.3-70b | $0.85 | $1.2 | 128,000 | 128000 |
|
957
|
+
| claude-3-5-haiku-latest | $1 | $5 | 200,000 | 8192 |
|
958
|
+
| claude-3-7-sonnet-latest | $3 | $15 | 200,000 | 128000 |
|
959
|
+
| claude-3-7-sonnet-20250219 | $3 | $15 | 200,000 | 128000 |
|
960
|
+
| vertex_ai/claude-3-7-sonnet@20250219 | $3 | $15 | 200,000 | 8192 |
|
961
|
+
| openrouter/anthropic/claude-3.7-sonnet | $3 | $15 | 200,000 | 8192 |
|
962
|
+
| openrouter/anthropic/claude-3.7-sonnet:beta | $3 | $15 | 200,000 | 8192 |
|
963
|
+
| amazon.rerank-v1:0 | $0 | $0 | 32,000 | 32000 |
|
964
|
+
| anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200,000 | 8192 |
|
965
|
+
| us.anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200,000 | 8192 |
|
966
|
+
| cohere.rerank-v3-5:0 | $0 | $0 | 32,000 | 32000 |
|
967
|
+
| jina-reranker-v2-base-multilingual | $0.02 | $0.02 | 1,024 | 1024 |
|
968
|
+
| bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200,000 | 4096 |
|
969
|
+
| azure/gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128,000 | 4096 |
|
970
|
+
| azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | 128,000 | 4096 |
|
971
|
+
| azure/us/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | 128,000 | 4096 |
|
972
|
+
| azure/gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128,000 | 4096 |
|
973
|
+
| azure/us/gpt-4o-realtime-preview-2024-10-01 | $5.5 | $22 | 128,000 | 4096 |
|
974
|
+
| azure/eu/gpt-4o-realtime-preview-2024-10-01 | $5.5 | $22 | 128,000 | 4096 |
|
975
|
+
| azure/us/o3-mini-2025-01-31 | $1.21 | $4.84 | 200,000 | 100000 |
|
976
|
+
| azure/eu/o3-mini-2025-01-31 | $1.21 | $4.84 | 200,000 | 100000 |
|
977
|
+
| azure/us/o1-mini-2024-09-12 | $1.21 | $4.84 | 128,000 | 65536 |
|
978
|
+
| azure/eu/o1-mini-2024-09-12 | $1.21 | $4.84 | 128,000 | 65536 |
|
979
|
+
| azure/us/o1-2024-12-17 | $16.5 | $66 | 200,000 | 100000 |
|
980
|
+
| azure/eu/o1-2024-12-17 | $16.5 | $66 | 200,000 | 100000 |
|
981
|
+
| azure/us/o1-preview-2024-09-12 | $16.5 | $66 | 128,000 | 32768 |
|
982
|
+
| azure/eu/o1-preview-2024-09-12 | $16.5 | $66 | 128,000 | 32768 |
|
983
|
+
| azure/us/gpt-4o-2024-11-20 | $2.75 | $11 | 128,000 | 16384 |
|
984
|
+
| azure/eu/gpt-4o-2024-11-20 | $2.75 | $11 | 128,000 | 16384 |
|
985
|
+
| azure/us/gpt-4o-2024-08-06 | $2.75 | $11 | 128,000 | 16384 |
|
986
|
+
| azure/eu/gpt-4o-2024-08-06 | $2.75 | $11 | 128,000 | 16384 |
|
987
|
+
| azure/us/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128,000 | 16384 |
|
988
|
+
| azure/eu/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128,000 | 16384 |
|
989
|
+
| azure_ai/deepseek-v3 | $1.14 | $4.56 | 128,000 | 8192 |
|
990
|
+
| azure_ai/mistral-nemo | $0.15 | $0.15 | 131,072 | 4096 |
|
991
|
+
| azure_ai/Phi-4-mini-instruct | $0.08 | $0.3 | 131,072 | 4096 |
|
992
|
+
| azure_ai/Phi-4-multimodal-instruct | $0.08 | $0.32 | 131,072 | 4096 |
|
993
|
+
| gemini/gemini-2.0-pro-exp-02-05 | $0 | $0 | 2,097,152 | 8192 |
|
994
|
+
| gemini/gemini-2.0-flash-thinking-exp-01-21 | $0 | $0 | 1,048,576 | 65536 |
|
995
|
+
| gemini/gemma-3-27b-it | $0 | $0 | 131,072 | 8192 |
|
996
|
+
| gemini/learnlm-1.5-pro-experimental | $0 | $0 | 32,767 | 8192 |
|
997
|
+
| vertex_ai/imagen-3.0-generate-002 | -- | -- | nan | nan |
|
998
|
+
| jamba-large-1.6 | $2 | $8 | 256,000 | 256000 |
|
999
|
+
| jamba-mini-1.6 | $0.2 | $0.4 | 256,000 | 256000 |
|
1000
|
+
| eu.amazon.nova-micro-v1:0 | $0.05 | $0.18 | 300,000 | 4096 |
|
1001
|
+
| eu.amazon.nova-lite-v1:0 | $0.08 | $0.31 | 128,000 | 4096 |
|
1002
|
+
| 1024-x-1024/50-steps/bedrock/amazon.nova-canvas-v1:0 | -- | -- | 2,600 | nan |
|
1003
|
+
| eu.amazon.nova-pro-v1:0 | $1.05 | $4.2 | 300,000 | 4096 |
|
1004
|
+
| us.deepseek.r1-v1:0 | $1.35 | $5.4 | 128,000 | 4096 |
|
1005
|
+
| snowflake/deepseek-r1 | -- | -- | 32,768 | 8192 |
|
1006
|
+
| snowflake/snowflake-arctic | -- | -- | 4,096 | 8192 |
|
1007
|
+
| snowflake/claude-3-5-sonnet | -- | -- | 18,000 | 8192 |
|
1008
|
+
| snowflake/mistral-large | -- | -- | 32,000 | 8192 |
|
1009
|
+
| snowflake/mistral-large2 | -- | -- | 128,000 | 8192 |
|
1010
|
+
| snowflake/reka-flash | -- | -- | 100,000 | 8192 |
|
1011
|
+
| snowflake/reka-core | -- | -- | 32,000 | 8192 |
|
1012
|
+
| snowflake/jamba-instruct | -- | -- | 256,000 | 8192 |
|
1013
|
+
| snowflake/jamba-1.5-mini | -- | -- | 256,000 | 8192 |
|
1014
|
+
| snowflake/jamba-1.5-large | -- | -- | 256,000 | 8192 |
|
1015
|
+
| snowflake/mixtral-8x7b | -- | -- | 32,000 | 8192 |
|
1016
|
+
| snowflake/llama2-70b-chat | -- | -- | 4,096 | 8192 |
|
1017
|
+
| snowflake/llama3-8b | -- | -- | 8,000 | 8192 |
|
1018
|
+
| snowflake/llama3-70b | -- | -- | 8,000 | 8192 |
|
1019
|
+
| snowflake/llama3.1-8b | -- | -- | 128,000 | 8192 |
|
1020
|
+
| snowflake/llama3.1-70b | -- | -- | 128,000 | 8192 |
|
1021
|
+
| snowflake/llama3.3-70b | -- | -- | 128,000 | 8192 |
|
1022
|
+
| snowflake/snowflake-llama-3.3-70b | -- | -- | 8,000 | 8192 |
|
1023
|
+
| snowflake/llama3.1-405b | -- | -- | 128,000 | 8192 |
|
1024
|
+
| snowflake/snowflake-llama-3.1-405b | -- | -- | 8,000 | 8192 |
|
1025
|
+
| snowflake/llama3.2-1b | -- | -- | 128,000 | 8192 |
|
1026
|
+
| snowflake/llama3.2-3b | -- | -- | 128,000 | 8192 |
|
1027
|
+
| snowflake/mistral-7b | -- | -- | 32,000 | 8192 |
|
1028
|
+
| snowflake/gemma-7b | -- | -- | 8,000 | 8192 |
|
1029
|
+
| azure/global/gpt-4o-2024-11-20 | $2.5 | $10 | 128,000 | 16384 |
|
1030
|
+
| azure/global/gpt-4o-2024-08-06 | $2.5 | $10 | 128,000 | 16384 |
|
1031
|
+
| o1-pro | $150 | $600 | 200,000 | 100000 |
|
1032
|
+
| o1-pro-2025-03-19 | $150 | $600 | 200,000 | 100000 |
|
1033
|
+
| gpt-4o-search-preview-2025-03-11 | $2.5 | $10 | 128,000 | 16384 |
|
1034
|
+
| gpt-4o-search-preview | $2.5 | $10 | 128,000 | 16384 |
|
1035
|
+
| gpt-4o-mini-search-preview-2025-03-11 | $0.15 | $0.6 | 128,000 | 16384 |
|
1036
|
+
| gpt-4o-mini-search-preview | $0.15 | $0.6 | 128,000 | 16384 |
|
1037
|
+
| azure/gpt-4.5-preview | $75 | $150 | 128,000 | 16384 |
|
1038
|
+
| azure_ai/mistral-small-2503 | $1 | $3 | 128,000 | 128000 |
|
1039
|
+
| text-embedding-large-exp-03-07 | $0.1 | $0 | 8,192 | nan |
|
1040
|
+
| gpt-4.1 | $2 | $8 | 1,047,576 | 32768 |
|
1041
|
+
| gpt-4.1-2025-04-14 | $2 | $8 | 1,047,576 | 32768 |
|
1042
|
+
| gpt-4.1-mini | $0.4 | $1.6 | 1,047,576 | 32768 |
|
1043
|
+
| gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1,047,576 | 32768 |
|
1044
|
+
| gpt-4.1-nano | $0.1 | $0.4 | 1,047,576 | 32768 |
|
1045
|
+
| gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1,047,576 | 32768 |
|
1046
|
+
| watsonx/ibm/granite-3-8b-instruct | $200 | $200 | 8,192 | 1024 |
|
1047
|
+
| computer-use-preview | $3 | $12 | 8,192 | 1024 |
|
1048
|
+
| o3 | $10 | $40 | 200,000 | 100000 |
|
1049
|
+
| o3-2025-04-16 | $10 | $40 | 200,000 | 100000 |
|
1050
|
+
| o4-mini | $1.1 | $4.4 | 200,000 | 100000 |
|
1051
|
+
| o4-mini-2025-04-16 | $1.1 | $4.4 | 200,000 | 100000 |
|
1052
|
+
| gpt-image-1 | -- | -- | nan | nan |
|
1053
|
+
| low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1054
|
+
| medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1055
|
+
| high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1056
|
+
| low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
|
1057
|
+
| medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
|
1058
|
+
| high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
|
1059
|
+
| low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1060
|
+
| medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1061
|
+
| high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1062
|
+
| gpt-4o-transcribe | $2.5 | $10 | 16,000 | 2000 |
|
1063
|
+
| gpt-4o-mini-transcribe | $1.25 | $5 | 16,000 | 2000 |
|
1064
|
+
| gpt-4o-mini-tts | $2.5 | $10 | nan | nan |
|
1065
|
+
| azure/computer-use-preview | $3 | $12 | 8,192 | 1024 |
|
1066
|
+
| azure/gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128,000 | 16384 |
|
1067
|
+
| azure/gpt-4o-mini-audio-preview-2024-12-17 | $2.5 | $10 | 128,000 | 16384 |
|
1068
|
+
| azure/gpt-4.1 | $2 | $8 | 1,047,576 | 32768 |
|
1069
|
+
| azure/gpt-4.1-2025-04-14 | $2 | $8 | 1,047,576 | 32768 |
|
1070
|
+
| azure/gpt-4.1-mini | $0.4 | $1.6 | 1,047,576 | 32768 |
|
1071
|
+
| azure/gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1,047,576 | 32768 |
|
1072
|
+
| azure/gpt-4.1-nano | $0.1 | $0.4 | 1,047,576 | 32768 |
|
1073
|
+
| azure/gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1,047,576 | 32768 |
|
1074
|
+
| azure/o3 | $10 | $40 | 200,000 | 100000 |
|
1075
|
+
| azure/o3-2025-04-16 | $10 | $40 | 200,000 | 100000 |
|
1076
|
+
| azure/o4-mini | $1.1 | $4.4 | 200,000 | 100000 |
|
1077
|
+
| azure/gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128,000 | 4096 |
|
1078
|
+
| azure/us/gpt-4o-realtime-preview-2024-12-17 | $5.5 | $22 | 128,000 | 4096 |
|
1079
|
+
| azure/eu/gpt-4o-realtime-preview-2024-12-17 | $5.5 | $22 | 128,000 | 4096 |
|
1080
|
+
| azure/o4-mini-2025-04-16 | $1.1 | $4.4 | 200,000 | 100000 |
|
1081
|
+
| azure/gpt-image-1 | -- | -- | nan | nan |
|
1082
|
+
| azure/low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1083
|
+
| azure/medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1084
|
+
| azure/high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1085
|
+
| azure/low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
|
1086
|
+
| azure/medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
|
1087
|
+
| azure/high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
|
1088
|
+
| azure/low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1089
|
+
| azure/medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1090
|
+
| azure/high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
|
1091
|
+
| azure_ai/mistral-large-latest | $2 | $6 | 128,000 | 4096 |
|
1092
|
+
| xai/grok-3-beta | $3 | $15 | 131,072 | 131072 |
|
1093
|
+
| xai/grok-3-fast-beta | $5 | $25 | 131,072 | 131072 |
|
1094
|
+
| xai/grok-3-fast-latest | $5 | $25 | 131,072 | 131072 |
|
1095
|
+
| xai/grok-3-mini-beta | $0.3 | $0.5 | 131,072 | 131072 |
|
1096
|
+
| xai/grok-3-mini-fast-beta | $0.6 | $4 | 131,072 | 131072 |
|
1097
|
+
| xai/grok-3-mini-fast-latest | $0.6 | $4 | 131,072 | 131072 |
|
1098
|
+
| groq/whisper-large-v3 | -- | -- | nan | nan |
|
1099
|
+
| groq/whisper-large-v3-turbo | -- | -- | nan | nan |
|
1100
|
+
| groq/distil-whisper-large-v3-en | -- | -- | nan | nan |
|
1101
|
+
| meta_llama/Llama-4-Scout-17B-16E-Instruct-FP8 | -- | -- | 10,000,000 | 4028 |
|
1102
|
+
| meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | -- | -- | 1,000,000 | 4028 |
|
1103
|
+
| meta_llama/Llama-3.3-70B-Instruct | -- | -- | 128,000 | 4028 |
|
1104
|
+
| meta_llama/Llama-3.3-8B-Instruct | -- | -- | 128,000 | 4028 |
|
1105
|
+
| gemini-2.5-pro-exp-03-25 | $1.25 | $10 | 1,048,576 | 65535 |
|
1106
|
+
| gemini/gemini-2.5-pro-exp-03-25 | $0 | $0 | 1,048,576 | 65535 |
|
1107
|
+
| gemini/gemini-2.5-flash-preview-04-17 | $0.15 | $0.6 | 1,048,576 | 65535 |
|
1108
|
+
| gemini-2.5-flash-preview-04-17 | $0.15 | $0.6 | 1,048,576 | 65535 |
|
1109
|
+
| gemini-2.0-flash | $0.1 | $0.4 | 1,048,576 | 8192 |
|
1110
|
+
| gemini-2.0-flash-lite | $0.08 | $0.3 | 1,048,576 | 8192 |
|
1111
|
+
| gemini-2.0-flash-lite-001 | $0.08 | $0.3 | 1,048,576 | 8192 |
|
1112
|
+
| gemini-2.5-pro-preview-05-06 | $1.25 | $10 | 1,048,576 | 65535 |
|
1113
|
+
| gemini-2.5-pro-preview-03-25 | $1.25 | $10 | 1,048,576 | 65535 |
|
1114
|
+
| gemini/gemini-2.0-flash-lite | $0.08 | $0.3 | 1,048,576 | 8192 |
|
1115
|
+
| gemini/gemini-2.5-pro-preview-05-06 | $1.25 | $10 | 1,048,576 | 65535 |
|
1116
|
+
| gemini/gemini-2.5-pro-preview-03-25 | $1.25 | $10 | 1,048,576 | 65535 |
|
1117
|
+
| vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas | $0.25 | $0.7 | 10,000,000 | 1e+07 |
|
1118
|
+
| vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas | $0.25 | $0.7 | 10,000,000 | 1e+07 |
|
1119
|
+
| vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | $0.35 | $1.15 | 1,000,000 | 1e+06 |
|
1120
|
+
| vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas | $0.35 | $1.15 | 1,000,000 | 1e+06 |
|
1121
|
+
| vertex_ai/mistral-small-2503@001 | $1 | $3 | 32,000 | 8191 |
|
1122
|
+
| vertex_ai/mistral-small-2503 | $1 | $3 | 128,000 | 128000 |
|
1123
|
+
| multimodalembedding | $0.8 | $0 | 2,048 | nan |
|
1124
|
+
| multimodalembedding@001 | $0.8 | $0 | 2,048 | nan |
|
1125
|
+
| command-a-03-2025 | $2.5 | $10 | 256,000 | 8000 |
|
1126
|
+
| mistralai/mistral-small-3.1-24b-instruct | $0.1 | $0.3 | nan | nan |
|
1127
|
+
| openrouter/openai/o3-mini | $1.1 | $4.4 | 128,000 | 65536 |
|
1128
|
+
| openrouter/openai/o3-mini-high | $1.1 | $4.4 | 128,000 | 65536 |
|
1129
|
+
| us.amazon.nova-premier-v1:0 | $2.5 | $12.5 | 1,000,000 | 4096 |
|
1130
|
+
| meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | 128,000 | 4096 |
|
1131
|
+
| us.meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | 128,000 | 4096 |
|
1132
|
+
| meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | 128,000 | 4096 |
|
1133
|
+
| us.meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | 128,000 | 4096 |
|
1134
|
+
| together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | -- | -- | nan | nan |
|
1135
|
+
| together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct | -- | -- | nan | nan |
|
1136
|
+
| together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo | -- | -- | nan | nan |
|
1137
|
+
| together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo | -- | -- | nan | nan |
|
1138
|
+
| together_ai/Qwen/Qwen2.5-72B-Instruct-Turbo | -- | -- | nan | nan |
|
1139
|
+
| together_ai/deepseek-ai/DeepSeek-V3 | -- | -- | nan | nan |
|
1140
|
+
| together_ai/mistralai/Mistral-Small-24B-Instruct-2501 | -- | -- | nan | nan |
|
1141
|
+
| perplexity/sonar-deep-research | $2 | $8 | 128,000 | nan |
|
1142
|
+
| fireworks_ai/accounts/fireworks/models/deepseek-r1 | $3 | $8 | 128,000 | 20480 |
|
1143
|
+
| fireworks_ai/accounts/fireworks/models/deepseek-r1-basic | $0.55 | $2.19 | 128,000 | 20480 |
|
1144
|
+
| fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | $3 | $3 | 128,000 | 16384 |
|
1145
|
+
| fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic | $0.22 | $0.88 | 131,072 | 131072 |
|
1146
|
+
| fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic | $0.15 | $0.6 | 131,072 | 131072 |
|
1147
|
+
| fireworks-ai-up-to-4b | $0.2 | $0.2 | nan | nan |
|
1148
|
+
| fireworks-ai-4.1b-to-16b | $0.2 | $0.2 | nan | nan |
|
1149
|
+
| fireworks-ai-above-16b | $0.9 | $0.9 | nan | nan |
|
1150
|
+
| databricks/databricks-claude-3-7-sonnet | $2.5 | $178.57 | 200,000 | 128000 |
|
1151
|
+
| databricks/databricks-meta-llama-3-3-70b-instruct | $1 | $3 | 128,000 | 128000 |
|
1152
|
+
| azure_ai/deepseek-v3-0324 | $1.14 | $4.56 | 128,000 | 8192 |
|
1153
|
+
| azure_ai/Llama-4-Scout-17B-16E-Instruct | $0.2 | $0.78 | 10,000,000 | 16384 |
|
1154
|
+
| azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 | $1.41 | $0.35 | 1,000,000 | 16384 |
|
1155
|
+
| cerebras/llama-3.3-70b | $0.85 | $1.2 | 128,000 | 128000 |
|
1156
|
+
| perplexity/sonar-reasoning | $1 | $5 | 128,000 | nan |
|
1157
|
+
| perplexity/sonar-reasoning-pro | $2 | $8 | 128,000 | nan |
|
1158
|
+
| nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.09 | $0.29 | nan | nan |
|
1159
|
+
| nscale/Qwen/Qwen2.5-Coder-3B-Instruct | $0.01 | $0.03 | nan | nan |
|
1160
|
+
| nscale/Qwen/Qwen2.5-Coder-7B-Instruct | $0.01 | $0.03 | nan | nan |
|
1161
|
+
| nscale/Qwen/Qwen2.5-Coder-32B-Instruct | $0.06 | $0.2 | nan | nan |
|
1162
|
+
| nscale/Qwen/QwQ-32B | $0.18 | $0.2 | nan | nan |
|
1163
|
+
| nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.38 | $0.38 | nan | nan |
|
1164
|
+
| nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B | $0.02 | $0.02 | nan | nan |
|
1165
|
+
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.09 | $0.09 | nan | nan |
|
1166
|
+
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.2 | $0.2 | nan | nan |
|
1167
|
+
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.07 | $0.07 | nan | nan |
|
1168
|
+
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.15 | $0.15 | nan | nan |
|
1169
|
+
| nscale/mistralai/mixtral-8x22b-instruct-v0.1 | $0.6 | $0.6 | nan | nan |
|
1170
|
+
| nscale/meta-llama/Llama-3.1-8B-Instruct | $0.03 | $0.03 | nan | nan |
|
1171
|
+
| nscale/meta-llama/Llama-3.3-70B-Instruct | $0.2 | $0.2 | nan | nan |
|
1172
|
+
| nscale/black-forest-labs/FLUX.1-schnell | -- | -- | nan | nan |
|
1173
|
+
| nscale/stabilityai/stable-diffusion-xl-base-1.0 | -- | -- | nan | nan |## License
|
1174
|
+
|
1175
|
+
TokenCost is released under the MIT License.
|