All Models
Pricing for 2003 chat-capable models across 76 providers. Last updated May 20, 2026.
fireworks ai (250)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| fireworks-ai-default | $0.0000 | $0.0000 | — | — | |
| fireworks_ai/accounts/fireworks/models/flux-1-dev-controlnet-union | $0.0010 | $0.0010 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/gpt-oss-20b | $0.05 | $0.20 | — | 131k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | $0.10 | $0.10 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct | $0.10 | $0.10 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | $0.10 | $0.10 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/codegemma-2b | $0.10 | $0.10 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-3b | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-1b-base | $0.10 | $0.10 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-1p5b | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/ernie-4p5-21b-a3b-pt | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/ernie-4p5-300b-a47b-pt | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/flux-1-dev | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/flux-1-schnell | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/gemma-2b-it | $0.10 | $0.10 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/llama-guard-3-1b | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llama-v2-70b | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct-1b | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-1b | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/minimax-m1-80k | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/ministral-3-3b-instruct-2512 | $0.10 | $0.10 | — | 256k | |
| fireworks_ai/accounts/fireworks/models/nemotron-nano-v2-12b-vl | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/phi-2-3b | $0.10 | $0.10 | — | 2k | |
| fireworks_ai/accounts/fireworks/models/phi-3-mini-128k-instruct | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen2-vl-2b-instruct | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-0p5b-instruct | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-1p5b-instruct | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-0p5b | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-0p5b-instruct | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-1p5b | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-1p5b-instruct | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-3b | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-3b-instruct | $0.10 | $0.10 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen3-0p6b | $0.10 | $0.10 | — | 41k | |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft | $0.10 | $0.10 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-131072 | $0.10 | $0.10 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-40960 | $0.10 | $0.10 | — | 41k | |
| fireworks_ai/accounts/fireworks/models/stablecode-3b | $0.10 | $0.10 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/starcoder2-3b | $0.10 | $0.10 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/gpt-oss-120b | $0.15 | $0.60 | — | 131k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic | $0.15 | $0.60 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b | $0.15 | $0.60 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-coder-30b-a3b-instruct | $0.15 | $0.60 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-30b-a3b-instruct | $0.15 | $0.60 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-30b-a3b-thinking | $0.15 | $0.60 | — | 262k | |
| fireworks-ai-4.1b-to-16b | $0.20 | $0.20 | — | — | |
| fireworks-ai-up-to-4b | $0.20 | $0.20 | — | — | |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct | $0.20 | $0.20 | — | 16k | 👁️ vision |
| fireworks_ai/accounts/fireworks/models/chronos-hermes-13b-v2 | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/code-llama-13b | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-13b-instruct | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-13b-python | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-7b | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-7b-instruct | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-7b-python | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-qwen-1p5-7b | $0.20 | $0.20 | — | 66k | |
| fireworks_ai/accounts/fireworks/models/codegemma-7b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-8b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-qwen-14b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-base | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-base-v1p5 | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-instruct-v1p5 | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-0528-distill-qwen3-8b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-8b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-14b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-7b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/dobby-mini-unhinged-plus-llama-3-1-8b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/firellava-13b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/firesearch-ocr-v6 | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/gemma-7b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/gemma-7b-it | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/gemma2-9b-it | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/hermes-2-pro-mistral-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/internvl3-8b | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/llama-guard-2-8b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/llama-guard-3-8b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llama-v2-13b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v2-13b-chat | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v2-7b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v2-7b-chat | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/llama-v3-8b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/llama-v3-8b-instruct-hf | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/llamaguard-7b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/ministral-3-14b-instruct-2512 | $0.20 | $0.20 | — | 256k | |
| fireworks_ai/accounts/fireworks/models/ministral-3-8b-instruct-2512 | $0.20 | $0.20 | — | 256k | |
| fireworks_ai/accounts/fireworks/models/mistral-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-4k | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-v0p2 | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-v3 | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mistral-7b-v0p2 | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mistral-nemo-base-2407 | $0.20 | $0.20 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/mistral-nemo-instruct-2407 | $0.20 | $0.20 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/mythomax-l2-13b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/nous-capybara-7b-v1p9 | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-13b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-7b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-12b-v2 | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v2 | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/openchat-3p5-0106-7b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/openhermes-2-mistral-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/openhermes-2p5-mistral-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/openorca-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/phi-3-vision-128k-instruct | $0.20 | $0.20 | — | 32k | |
| fireworks_ai/accounts/fireworks/models/pythia-12b | $0.20 | $0.20 | — | 2k | |
| fireworks_ai/accounts/fireworks/models/qwen-v2p5-14b-instruct | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen-v2p5-7b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen2-7b-instruct | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2-vl-7b-instruct | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-14b | $0.20 | $0.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-7b-instruct | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-14b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-14b-instruct | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-7b-instruct | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-3b-instruct | $0.20 | $0.20 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-7b-instruct | $0.20 | $0.20 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/qwen3-14b | $0.20 | $0.20 | — | 41k | |
| fireworks_ai/accounts/fireworks/models/qwen3-4b | $0.20 | $0.20 | — | 41k | |
| fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507 | $0.20 | $0.20 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-8b | $0.20 | $0.20 | — | 41k | |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/rolm-ocr | $0.20 | $0.20 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/snorkel-mistral-7b-pairrm-dpo | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/starcoder-16b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/starcoder-7b | $0.20 | $0.20 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/starcoder2-15b | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/starcoder2-7b | $0.20 | $0.20 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/toppy-m-7b | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/yi-6b | $0.20 | $0.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/zephyr-7b-beta | $0.20 | $0.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/glm-4p5-air | $0.22 | $0.88 | — | 128k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic | $0.22 | $0.88 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b | $0.22 | $0.88 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b-instruct-2507 | $0.22 | $0.88 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b-thinking-2507 | $0.22 | $0.88 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct | $0.22 | $0.88 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-235b-a22b-thinking | $0.22 | $0.88 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/minimax-m2p1 | $0.30 | $1.20 | $0.03 | 205k | 🔧 tools |
| fireworks_ai/minimax-m2p1 | $0.30 | $1.20 | $0.03 | 205k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/minimax-m2 | $0.30 | $1.20 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct | $0.45 | $1.80 | — | 262k | |
| fireworks-ai-moe-up-to-56b | $0.50 | $0.50 | — | — | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-lite-base | $0.50 | $0.50 | — | 164k | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-lite-instruct | $0.50 | $0.50 | — | 164k | |
| fireworks_ai/accounts/fireworks/models/deepseek-v2-lite-chat | $0.50 | $0.50 | — | 164k | |
| fireworks_ai/accounts/fireworks/models/dolphin-2p6-mixtral-8x7b | $0.50 | $0.50 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/firefunction-v1 | $0.50 | $0.50 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/gpt-oss-safeguard-20b | $0.50 | $0.50 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/mixtral-8x7b | $0.50 | $0.50 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mixtral-8x7b-instruct | $0.50 | $0.50 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/mixtral-8x7b-instruct-hf | $0.50 | $0.50 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/nous-hermes-2-mixtral-8x7b-dpo | $0.50 | $0.50 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b-instruct-2507 | $0.50 | $0.50 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-basic | $0.55 | $2.19 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/glm-4p5 | $0.55 | $2.19 | — | 128k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/glm-4p6 | $0.55 | $2.19 | — | 203k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/deepseek-v3p1 | $0.56 | $1.68 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/deepseek-v3p1-terminus | $0.56 | $1.68 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/deepseek-v3p2 | $0.56 | $1.68 | — | 164k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/glm-4p7 | $0.60 | $2.20 | $0.30 | 203k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/kimi-k2-instruct | $0.60 | $2.50 | — | 131k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/kimi-k2-instruct-0905 | $0.60 | $2.50 | — | 262k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/kimi-k2-thinking | $0.60 | $2.50 | — | 262k | 🔧 tools · 🌐 search |
| fireworks_ai/accounts/fireworks/models/kimi-k2p5 | $0.60 | $3.00 | $0.10 | 262k | 🔧 tools |
| fireworks_ai/glm-4p7 | $0.60 | $2.20 | $0.30 | 203k | 🔧 tools |
| fireworks_ai/kimi-k2p5 | $0.60 | $3.00 | $0.10 | 262k | 🔧 tools |
| fireworks-ai-above-16b | $0.90 | $0.90 | — | — | |
| fireworks_ai/accounts/fireworks/models/deepseek-v3 | $0.90 | $0.90 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/deepseek-v3-0324 | $0.90 | $0.90 | — | 164k | |
| fireworks_ai/accounts/fireworks/models/firefunction-v2 | $0.90 | $0.90 | — | 8k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct | $0.90 | $0.90 | — | 16k | 👁️ vision |
| fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/code-llama-34b | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-34b-instruct | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-34b-python | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/code-llama-70b | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/code-llama-70b-instruct | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/code-llama-70b-python | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-70b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-qwen-32b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-33b-instruct | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-70b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-32b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/devstral-small-2505 | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/dobby-unhinged-llama-3-3-70b-new | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/dolphin-2-9-2-qwen2-72b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/fare-20b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/gemma-3-27b-it | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/internvl3-38b | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/internvl3-78b | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/kat-coder | $0.90 | $0.90 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/kat-dev-32b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/kat-dev-72b-exp | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llama-v2-70b-chat | $0.90 | $0.90 | — | 2k | |
| fireworks_ai/accounts/fireworks/models/llama-v3-70b-instruct | $0.90 | $0.90 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/llama-v3-70b-instruct-hf | $0.90 | $0.90 | — | 8k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-nemotron-70b-instruct | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p3-70b-instruct | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/llava-yi-34b | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/mistral-small-24b-instruct-2501 | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/nous-hermes-2-yi-34b | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-70b | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-python-v1 | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-v1 | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-v2 | $0.90 | $0.90 | — | 16k | |
| fireworks_ai/accounts/fireworks/models/qwen-qwq-32b-preview | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen1p5-72b-chat | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2-vl-72b-instruct | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-32b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-32b-instruct | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-72b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-72b-instruct | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-128k | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-32k-rope | $0.90 | $0.90 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-64k | $0.90 | $0.90 | — | 66k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-math-72b-instruct | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-32b-instruct | $0.90 | $0.90 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-72b-instruct | $0.90 | $0.90 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b-thinking-2507 | $0.90 | $0.90 | — | 262k | |
| fireworks_ai/accounts/fireworks/models/qwen3-32b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/qwen3-coder-480b-instruct-bf16 | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/qwen3-next-80b-a3b-instruct | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/qwen3-next-80b-a3b-thinking | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-32b-instruct | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/qwq-32b | $0.90 | $0.90 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/yi-34b | $0.90 | $0.90 | — | 4k | |
| fireworks_ai/accounts/fireworks/models/yi-34b-200k-capybara | $0.90 | $0.90 | — | 200k | |
| fireworks_ai/accounts/fireworks/models/yi-34b-chat | $0.90 | $0.90 | — | 4k | |
| fireworks-ai-56b-to-176b | $1.20 | $1.20 | — | — | |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct | $1.20 | $1.20 | — | 66k | |
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf | $1.20 | $1.20 | — | 66k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/cogito-671b-v2-p1 | $1.20 | $1.20 | — | 164k | |
| fireworks_ai/accounts/fireworks/models/dbrx-instruct | $1.20 | $1.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/deepseek-prover-v2 | $1.20 | $1.20 | — | 164k | |
| fireworks_ai/accounts/fireworks/models/deepseek-v2p5 | $1.20 | $1.20 | — | 33k | |
| fireworks_ai/accounts/fireworks/models/glm-4p5v | $1.20 | $1.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/gpt-oss-safeguard-120b | $1.20 | $1.20 | — | 131k | |
| fireworks_ai/accounts/fireworks/models/mistral-large-3-fp8 | $1.20 | $1.20 | — | 256k | |
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b | $1.20 | $1.20 | — | 66k | |
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct | $1.20 | $1.20 | — | 66k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1 | $3.00 | $8.00 | — | 128k | |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 | $3.00 | $8.00 | — | 160k | |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | $3.00 | $3.00 | — | 128k | 🔧 tools |
| fireworks_ai/accounts/fireworks/models/yi-large | $3.00 | $3.00 | — | 33k |
bedrock (188)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| anthropic.claude-mythos-preview | $0.0000 | $0.0000 | — | 1000k | 👁️ vision · 🔧 tools |
| meta.llama3-2-1b-instruct-v1:0 | $0.10 | $0.10 | — | 128k | 🔧 tools |
| us.meta.llama3-2-1b-instruct-v1:0 | $0.10 | $0.10 | — | 128k | 🔧 tools |
| eu.meta.llama3-2-1b-instruct-v1:0 | $0.13 | $0.13 | — | 128k | 🔧 tools |
| bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.20 | — | 32k | |
| bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.20 | — | 32k | |
| meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | — | 128k | 🔧 tools |
| mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.20 | — | 32k | |
| us.meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | — | 128k | 🔧 tools |
| eu.meta.llama3-2-3b-instruct-v1:0 | $0.19 | $0.19 | — | 128k | 🔧 tools |
| ai21.jamba-1-5-mini-v1:0 | $0.20 | $0.40 | — | 256k | |
| bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 | $0.20 | $0.26 | — | 32k | |
| meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | — | 128k | 🔧 tools |
| us.meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | — | 128k | 🔧 tools |
| anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | $0.02 | 200k | 👁️ vision · 🔧 tools |
| apac.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | $0.02 | 200k | 👁️ vision · 🔧 tools |
| eu.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.25 | $1.25 | $0.02 | 200k | 🔧 tools · 💾 cache |
| eu.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | $0.02 | 200k | 👁️ vision · 🔧 tools |
| us.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | $0.02 | 200k | 👁️ vision · 🔧 tools |
| amazon.titan-text-lite-v1 | $0.30 | $0.40 | — | 42k | |
| bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | $0.30 | $0.60 | — | 8k | |
| bedrock/us-east-1/minimax.minimax-m2.1 | $0.30 | $1.20 | — | 196k | 🔧 tools |
| bedrock/us-east-1/minimax.minimax-m2.5 | $0.30 | $1.20 | — | 1000k | 🔧 tools |
| bedrock/us-east-2/minimax.minimax-m2.1 | $0.30 | $1.20 | — | 196k | 🔧 tools |
| bedrock/us-east-2/minimax.minimax-m2.5 | $0.30 | $1.20 | — | 1000k | 🔧 tools |
| bedrock/us-gov-east-1/amazon.titan-text-lite-v1 | $0.30 | $0.40 | — | 42k | |
| bedrock/us-gov-east-1/anthropic.claude-3-haiku-20240307-v1:0 | $0.30 | $1.50 | $0.03 | 200k | 👁️ vision · 🔧 tools |
| bedrock/us-gov-east-1/meta.llama3-8b-instruct-v1:0 | $0.30 | $2.65 | — | 8k | |
| bedrock/us-gov-west-1/amazon.titan-text-lite-v1 | $0.30 | $0.40 | — | 42k | |
| bedrock/us-gov-west-1/anthropic.claude-3-haiku-20240307-v1:0 | $0.30 | $1.50 | $0.03 | 200k | 👁️ vision · 🔧 tools |
| bedrock/us-gov-west-1/meta.llama3-8b-instruct-v1:0 | $0.30 | $2.65 | — | 8k | |
| bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | $0.30 | $0.60 | — | 8k | |
| bedrock/us-west-2/minimax.minimax-m2.1 | $0.30 | $1.20 | — | 196k | 🔧 tools |
| bedrock/us-west-2/minimax.minimax-m2.5 | $0.30 | $1.20 | — | 1000k | 🔧 tools |
| cohere.command-light-text-v14 | $0.30 | $0.60 | — | 4k | |
| meta.llama3-8b-instruct-v1:0 | $0.30 | $0.60 | — | 8k | |
| bedrock/ap-southeast-2/minimax.minimax-m2.5 | $0.31 | $1.24 | — | 1000k | 🔧 tools |
| bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | $0.32 | $0.65 | — | 8k | |
| bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | $0.35 | $0.69 | — | 8k | |
| meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | — | 128k | 👁️ vision · 🔧 tools |
| us.meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | — | 128k | 👁️ vision · 🔧 tools |
| bedrock/ap-northeast-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/ap-northeast-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | $0.36 | $0.72 | — | 8k | |
| bedrock/ap-south-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/ap-south-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/ap-southeast-3/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/ap-southeast-3/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/eu-north-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/eu-north-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/eu-central-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/eu-central-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/eu-west-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/eu-west-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/eu-south-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/eu-south-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/sa-east-1/minimax.minimax-m2.1 | $0.36 | $1.44 | — | 196k | 🔧 tools |
| bedrock/sa-east-1/minimax.minimax-m2.5 | $0.36 | $1.44 | — | 1000k | 🔧 tools |
| bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | $0.39 | $0.78 | — | 8k | |
| bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.70 | — | 32k | |
| bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.70 | — | 32k | |
| mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.70 | — | 32k | |
| bedrock/eu-west-2/minimax.minimax-m2.1 | $0.47 | $1.86 | — | 196k | 🔧 tools |
| bedrock/eu-west-2/minimax.minimax-m2.5 | $0.47 | $1.86 | — | 1000k | 🔧 tools |
| ai21.jamba-instruct-v1:0 | $0.50 | $0.70 | — | 70k | |
| amazon.titan-text-premier-v1:0 | $0.50 | $1.50 | — | 42k | |
| bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | $0.50 | $1.01 | — | 8k | |
| bedrock/us-east-1/qwen.qwen3-coder-next | $0.50 | $1.20 | — | 262k | 🔧 tools |
| bedrock/us-east-2/qwen.qwen3-coder-next | $0.50 | $1.20 | — | 262k | 🔧 tools |
| bedrock/us-gov-east-1/amazon.titan-text-premier-v1:0 | $0.50 | $1.50 | — | 42k | |
| bedrock/us-gov-west-1/amazon.titan-text-premier-v1:0 | $0.50 | $1.50 | — | 42k | |
| bedrock/us-west-2/qwen.qwen3-coder-next | $0.50 | $1.20 | — | 262k | 🔧 tools |
| cohere.command-r-v1:0 | $0.50 | $1.50 | — | 128k | |
| bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 | $0.59 | $0.91 | — | 32k | |
| bedrock/ap-northeast-1/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/moonshotai.kimi-k2.5 | $0.60 | $3.03 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/ap-south-1/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/ap-southeast-3/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/eu-central-1/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/eu-west-1/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/eu-south-1/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/sa-east-1/qwen.qwen3-coder-next | $0.60 | $1.44 | — | 262k | 🔧 tools |
| bedrock/us-east-1/moonshotai.kimi-k2-thinking | $0.60 | $2.50 | — | 262k | 🔧 tools |
| bedrock/us-east-1/moonshotai.kimi-k2.5 | $0.60 | $3.00 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/us-east-2/moonshotai.kimi-k2-thinking | $0.60 | $2.50 | — | 262k | 🔧 tools |
| bedrock/us-east-2/moonshotai.kimi-k2.5 | $0.60 | $3.00 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/us-west-2/moonshotai.kimi-k2-thinking | $0.60 | $2.50 | — | 262k | 🔧 tools |
| bedrock/us-west-2/moonshotai.kimi-k2.5 | $0.60 | $3.00 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/us-east-1/deepseek.v3.2 | $0.62 | $1.85 | — | 164k | 🔧 tools |
| bedrock/us-east-2/deepseek.v3.2 | $0.62 | $1.85 | — | 164k | 🔧 tools |
| bedrock/us-west-2/deepseek.v3.2 | $0.62 | $1.85 | — | 164k | 🔧 tools |
| bedrock/ap-south-1/moonshotai.kimi-k2-thinking | $0.71 | $2.94 | — | 262k | 🔧 tools |
| bedrock/ap-northeast-1/moonshotai.kimi-k2.5 | $0.72 | $3.60 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/ap-south-1/moonshotai.kimi-k2.5 | $0.72 | $3.60 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/ap-southeast-3/moonshotai.kimi-k2.5 | $0.72 | $3.60 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/eu-north-1/moonshotai.kimi-k2.5 | $0.72 | $3.60 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/sa-east-1/moonshotai.kimi-k2.5 | $0.72 | $3.60 | — | 262k | 👁️ vision · 🔧 tools |
| bedrock/ap-northeast-1/moonshotai.kimi-k2-thinking | $0.73 | $3.03 | — | 262k | 🔧 tools |
| bedrock/moonshotai.kimi-k2-thinking | $0.73 | $3.03 | — | 262k | 🔧 tools |
| bedrock/sa-east-1/moonshotai.kimi-k2-thinking | $0.73 | $3.03 | — | 262k | 🔧 tools |
| bedrock/ap-northeast-1/deepseek.v3.2 | $0.74 | $2.22 | — | 164k | 🔧 tools |
| bedrock/ap-south-1/deepseek.v3.2 | $0.74 | $2.22 | — | 164k | 🔧 tools |
| bedrock/ap-southeast-3/deepseek.v3.2 | $0.74 | $2.22 | — | 164k | 🔧 tools |
| bedrock/eu-north-1/deepseek.v3.2 | $0.74 | $2.22 | — | 164k | 🔧 tools |
| bedrock/sa-east-1/deepseek.v3.2 | $0.74 | $2.22 | — | 164k | 🔧 tools |
| meta.llama2-13b-chat-v1 | $0.75 | $1.00 | — | 4k | |
| bedrock/eu-west-2/qwen.qwen3-coder-next | $0.78 | $1.86 | — | 262k | 🔧 tools |
| anthropic.claude-3-5-haiku-20241022-v1:0 | $0.80 | $4.00 | $0.08 | 200k | 🔧 tools · 💾 cache |
| anthropic.claude-instant-v1 | $0.80 | $2.40 | — | 100k | |
| bedrock/us-east-1/anthropic.claude-instant-v1 | $0.80 | $2.40 | — | 100k | |
| bedrock/us-west-2/anthropic.claude-instant-v1 | $0.80 | $2.40 | — | 100k | |
| bedrock/us.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.80 | $4.00 | $0.08 | 200k | 🔧 tools · 💾 cache |
| us.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.80 | $4.00 | $0.08 | 200k | 🔧 tools · 💾 cache |
| bedrock/us-gov-east-1/amazon.nova-pro-v1:0 | $0.96 | $3.84 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-west-1/amazon.nova-pro-v1:0 | $0.96 | $3.84 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | — | 128k | 🔧 tools |
| us.meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | — | 128k | 🔧 tools |
| mistral.mistral-small-2402-v1:0 | $1.00 | $3.00 | — | 32k | 🔧 tools |
| bedrock/us-east-1/zai.glm-5 | $1.00 | $3.20 | — | 200k | 🔧 tools |
| bedrock/us-west-2/zai.glm-5 | $1.00 | $3.20 | — | 200k | 🔧 tools |
| bedrock/us-gov-east-1/anthropic.claude-haiku-4-5-20251001-v1:0 | $1.20 | $6.00 | $0.12 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-west-1/anthropic.claude-haiku-4-5-20251001-v1:0 | $1.20 | $6.00 | $0.12 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| amazon.titan-text-express-v1 | $1.30 | $1.70 | — | 42k | |
| bedrock/us-gov-east-1/amazon.titan-text-express-v1 | $1.30 | $1.70 | — | 42k | |
| bedrock/us-gov-west-1/amazon.titan-text-express-v1 | $1.30 | $1.70 | — | 42k | |
| cohere.command-text-v14 | $1.50 | $2.00 | — | 4k | |
| meta.llama2-70b-chat-v1 | $1.95 | $2.56 | — | 4k | |
| ai21.jamba-1-5-large-v1:0 | $2.00 | $8.00 | — | 256k | |
| meta.llama3-2-90b-instruct-v1:0 | $2.00 | $2.00 | — | 128k | 👁️ vision · 🔧 tools |
| us.meta.llama3-2-90b-instruct-v1:0 | $2.00 | $2.00 | — | 128k | 👁️ vision · 🔧 tools |
| bedrock/ap-northeast-1/anthropic.claude-instant-v1 | $2.23 | $7.55 | — | 100k | |
| bedrock/eu-central-1/anthropic.claude-instant-v1 | $2.48 | $8.38 | — | 100k | |
| bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.50 | — | 8k | |
| bedrock/us-gov-east-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.50 | — | 8k | |
| bedrock/us-gov-west-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.50 | — | 8k | |
| bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.50 | — | 8k | |
| meta.llama3-70b-instruct-v1:0 | $2.65 | $3.50 | — | 8k | |
| bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 | $2.86 | $3.78 | — | 8k | |
| anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools |
| anthropic.claude-3-5-sonnet-20241022-v2:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-3-sonnet-20240229-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| apac.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| apac.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| apac.anthropic.claude-3-sonnet-20240229-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| claude-sonnet-4-5-20250929-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| cohere.command-r-plus-v1:0 | $3.00 | $15.00 | — | 128k | |
| eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-3-7-sonnet-20250219-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-3-sonnet-20240229-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| mistral.mistral-large-2407-v1:0 | $3.00 | $9.00 | — | 128k | 🔧 tools |
| us.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| us.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-3-sonnet-20240229-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 | $3.05 | $4.03 | — | 8k | |
| bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 | $3.18 | $4.20 | — | 8k | |
| bedrock/us-gov-east-1/anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-east-1/claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-west-1/anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-west-1/claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 | $3.45 | $4.55 | — | 8k | |
| anthropic.claude-3-7-sonnet-20240620-v1:0 | $3.60 | $18.00 | $0.36 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-east-1/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.60 | $18.00 | $0.36 | 200k | 👁️ vision · 🔧 tools |
| bedrock/us-gov-west-1/anthropic.claude-3-7-sonnet-20250219-v1:0 | $3.60 | $18.00 | $0.36 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| bedrock/us-gov-west-1/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.60 | $18.00 | $0.36 | 200k | 👁️ vision · 🔧 tools |
| bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 | $4.45 | $5.88 | — | 8k | |
| meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16.00 | — | 128k | 🔧 tools |
| us.meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16.00 | — | 128k | 🔧 tools |
| anthropic.claude-v1 | $8.00 | $24.00 | — | 100k | |
| anthropic.claude-v2:1 | $8.00 | $24.00 | — | 100k | |
| bedrock/ap-northeast-1/anthropic.claude-v1 | $8.00 | $24.00 | — | 100k | |
| bedrock/ap-northeast-1/anthropic.claude-v2:1 | $8.00 | $24.00 | — | 100k | |
| bedrock/eu-central-1/anthropic.claude-v1 | $8.00 | $24.00 | — | 100k | |
| bedrock/eu-central-1/anthropic.claude-v2:1 | $8.00 | $24.00 | — | 100k | |
| bedrock/us-east-1/anthropic.claude-v1 | $8.00 | $24.00 | — | 100k | |
| bedrock/us-east-1/anthropic.claude-v2:1 | $8.00 | $24.00 | — | 100k | |
| bedrock/us-east-1/mistral.mistral-large-2402-v1:0 | $8.00 | $24.00 | — | 32k | 🔧 tools |
| bedrock/us-west-2/anthropic.claude-v1 | $8.00 | $24.00 | — | 100k | |
| bedrock/us-west-2/anthropic.claude-v2:1 | $8.00 | $24.00 | — | 100k | |
| bedrock/us-west-2/mistral.mistral-large-2402-v1:0 | $8.00 | $24.00 | — | 32k | 🔧 tools |
| mistral.mistral-large-2402-v1:0 | $8.00 | $24.00 | — | 32k | 🔧 tools |
| bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 | $10.40 | $31.20 | — | 32k | 🔧 tools |
| ai21.j2-mid-v1 | $12.50 | $12.50 | — | 8k | |
| anthropic.claude-3-opus-20240229-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| eu.anthropic.claude-3-opus-20240229-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| us.anthropic.claude-3-opus-20240229-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| ai21.j2-ultra-v1 | $18.80 | $18.80 | — | 8k |
azure (124)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| azure/gpt-5-nano | $0.05 | $0.40 | $0.0050 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5-nano-2025-08-07 | $0.05 | $0.40 | $0.0050 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-5-nano-2025-08-07 | $0.06 | $0.44 | $0.0055 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-5-nano-2025-08-07 | $0.06 | $0.44 | $0.0055 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4.1-nano | $0.10 | $0.40 | $0.02 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4.1-nano-2025-04-14 | $0.10 | $0.40 | $0.02 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-4.1-nano-2025-04-14 | $0.11 | $0.44 | $0.02 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/global-standard/gpt-4o-mini | $0.15 | $0.60 | — | 128k | 👁️ vision · 🔧 tools |
| azure/eu/gpt-4o-mini-2024-07-18 | $0.17 | $0.66 | $0.08 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4o-mini | $0.17 | $0.66 | $0.07 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4o-mini-2024-07-18 | $0.17 | $0.66 | $0.07 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-4o-mini-2024-07-18 | $0.17 | $0.66 | $0.08 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.4-nano | $0.20 | $1.25 | $0.02 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure/gpt-5.4-nano-2026-03-17 | $0.20 | $1.25 | $0.02 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure/gpt-5-mini | $0.25 | $2.00 | $0.02 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5-mini-2025-08-07 | $0.25 | $2.00 | $0.02 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-5-mini-2025-08-07 | $0.28 | $2.20 | $0.03 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-5-mini-2025-08-07 | $0.28 | $2.20 | $0.03 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4.1-mini | $0.40 | $1.60 | $0.10 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4.1-mini-2025-04-14 | $0.40 | $1.60 | $0.10 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-4.1-mini-2025-04-14 | $0.44 | $1.76 | $0.11 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-3.5-turbo | $0.50 | $1.50 | — | 4k | 🔧 tools |
| azure/gpt-3.5-turbo-0125 | $0.50 | $1.50 | — | 16k | 🔧 tools |
| azure/gpt-35-turbo | $0.50 | $1.50 | — | 4k | 🔧 tools |
| azure/gpt-35-turbo-0125 | $0.50 | $1.50 | — | 16k | 🔧 tools |
| azure/gpt-audio-mini-2025-10-06 | $0.60 | $2.40 | — | 128k | 🔧 tools |
| azure/gpt-4o-mini-realtime-preview-2024-12-17 | $0.60 | $2.40 | $0.30 | 128k | 🔧 tools |
| azure/gpt-realtime-mini-2025-10-06 | $0.60 | $2.40 | $0.06 | 32k | 🔧 tools |
| azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | $0.33 | 128k | 🔧 tools |
| azure/us/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | $0.33 | 128k | 🔧 tools |
| azure/gpt-5.4-mini | $0.75 | $4.50 | $0.07 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure/gpt-5.4-mini-2026-03-17 | $0.75 | $4.50 | $0.07 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure/gpt-35-turbo-1106 | $1.00 | $2.00 | — | 16k | 🔧 tools |
| azure/o1-mini-2024-09-12 | $1.10 | $4.40 | $0.55 | 128k | 🔧 tools · 💾 cache |
| azure/o3-mini | $1.10 | $4.40 | $0.55 | 200k | 💾 cache |
| azure/o3-mini-2025-01-31 | $1.10 | $4.40 | $0.55 | 200k | 💾 cache |
| azure/o4-mini | $1.10 | $4.40 | $0.28 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/o4-mini-2025-04-16 | $1.10 | $4.40 | $0.28 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/o1-mini-2024-09-12 | $1.21 | $4.84 | $0.60 | 128k | 🔧 tools · 💾 cache |
| azure/eu/o3-mini-2025-01-31 | $1.21 | $4.84 | $0.60 | 200k | 💾 cache |
| azure/o1-mini | $1.21 | $4.84 | $0.60 | 128k | 🔧 tools · 💾 cache |
| azure/us/o1-mini-2024-09-12 | $1.21 | $4.84 | $0.60 | 128k | 🔧 tools · 💾 cache |
| azure/us/o3-mini-2025-01-31 | $1.21 | $4.84 | $0.60 | 200k | 💾 cache |
| azure/us/o4-mini-2025-04-16 | $1.21 | $4.84 | $0.31 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/global/gpt-5.1 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/global/gpt-5.1-chat | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.1-2025-11-13 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.1-chat-2025-11-13 | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 💾 cache |
| azure/gpt-5 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5-2025-08-07 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5-chat | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5-chat-latest | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.1 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.1-chat | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-5-2025-08-07 | $1.38 | $11.00 | $0.14 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-5-2025-08-07 | $1.38 | $11.00 | $0.14 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-5.1 | $1.38 | $11.00 | $0.14 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-5.1-chat | $1.38 | $11.00 | $0.14 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-5.1 | $1.38 | $11.00 | $0.14 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-5.1-chat | $1.38 | $11.00 | $0.14 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.2 | $1.75 | $14.00 | $0.17 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.2-2025-12-11 | $1.75 | $14.00 | $0.17 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.2-chat | $1.75 | $14.00 | $0.17 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.2-chat-2025-12-11 | $1.75 | $14.00 | $0.17 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.3-chat | $1.75 | $14.00 | $0.17 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4.1 | $2.00 | $8.00 | $0.50 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4.1-2025-04-14 | $2.00 | $8.00 | $0.50 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/o3 | $2.00 | $8.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/o3-2025-04-16 | $2.00 | $8.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-4.1-2025-04-14 | $2.20 | $8.80 | $0.55 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/o3-2025-04-16 | $2.20 | $8.80 | $0.55 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/global-standard/gpt-4o-2024-08-06 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/global-standard/gpt-4o-2024-11-20 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools |
| azure/global/gpt-4o-2024-08-06 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/global/gpt-4o-2024-11-20 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4o | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4o-2024-08-06 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-audio-2025-08-28 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| azure/gpt-audio-1.5-2026-02-23 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| azure/gpt-4o-audio-preview-2024-12-17 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| azure/gpt-4o-mini-audio-preview-2024-12-17 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| azure/gpt-5.4 | $2.50 | $15.00 | $0.25 | 1050k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-5.4-2026-03-05 | $2.50 | $15.00 | $0.25 | 1050k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-4o-2024-08-06 | $2.75 | $11.00 | $1.38 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/gpt-4o-2024-11-20 | $2.75 | $11.00 | — | 128k | 👁️ vision · 🔧 tools |
| azure/gpt-4o-2024-11-20 | $2.75 | $11.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-4o-2024-08-06 | $2.75 | $11.00 | $1.38 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/gpt-4o-2024-11-20 | $2.75 | $11.00 | — | 128k | 👁️ vision · 🔧 tools |
| azure/command-r-plus | $3.00 | $15.00 | — | 128k | 🔧 tools |
| azure/computer-use-preview | $3.00 | $12.00 | — | 8k | 👁️ vision · 🔧 tools |
| azure/gpt-35-turbo-16k | $3.00 | $4.00 | — | 16k | |
| azure/gpt-35-turbo-16k-0613 | $3.00 | $4.00 | — | 16k | 🔧 tools |
| computer-use-preview | $3.00 | $12.00 | — | 8k | 👁️ vision · 🔧 tools |
| azure/gpt-realtime-2025-08-28 | $4.00 | $16.00 | $4.00 | 32k | 🔧 tools |
| azure/gpt-realtime-1.5-2026-02-23 | $4.00 | $16.00 | $4.00 | 32k | 🔧 tools |
| azure/gpt-4o-2024-05-13 | $5.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/gpt-4o-realtime-preview-2024-10-01 | $5.00 | $20.00 | $2.50 | 128k | 🔧 tools |
| azure/gpt-4o-realtime-preview-2024-12-17 | $5.00 | $20.00 | $2.50 | 128k | 🔧 tools |
| azure/gpt-5.5 | $5.00 | $30.00 | $0.50 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure/gpt-5.5-2026-04-23 | $5.00 | $30.00 | $0.50 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure/eu/gpt-4o-realtime-preview-2024-10-01 | $5.50 | $22.00 | $2.75 | 128k | 🔧 tools |
| azure/eu/gpt-4o-realtime-preview-2024-12-17 | $5.50 | $22.00 | $2.75 | 128k | 🔧 tools |
| azure/us/gpt-4o-realtime-preview-2024-10-01 | $5.50 | $22.00 | $2.75 | 128k | 🔧 tools |
| azure/us/gpt-4o-realtime-preview-2024-12-17 | $5.50 | $22.00 | $2.75 | 128k | 🔧 tools |
| azure/mistral-large-2402 | $8.00 | $24.00 | — | 32k | 🔧 tools |
| azure/mistral-large-latest | $8.00 | $24.00 | — | 32k | 🔧 tools |
| azure/gpt-4-0125-preview | $10.00 | $30.00 | — | 128k | 🔧 tools |
| azure/gpt-4-1106-preview | $10.00 | $30.00 | — | 128k | 🔧 tools |
| azure/gpt-4-turbo | $10.00 | $30.00 | — | 128k | 🔧 tools |
| azure/gpt-4-turbo-2024-04-09 | $10.00 | $30.00 | — | 128k | 👁️ vision · 🔧 tools |
| azure/gpt-4-turbo-vision-preview | $10.00 | $30.00 | — | 128k | 👁️ vision |
| azure/o1 | $15.00 | $60.00 | $7.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/o1-2024-12-17 | $15.00 | $60.00 | $7.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/o1-preview | $15.00 | $60.00 | $7.50 | 128k | 🔧 tools · 💾 cache |
| azure/o1-preview-2024-09-12 | $15.00 | $60.00 | $7.50 | 128k | 🔧 tools · 💾 cache |
| azure/eu/o1-2024-12-17 | $16.50 | $66.00 | $8.25 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/eu/o1-preview-2024-09-12 | $16.50 | $66.00 | $8.25 | 128k | 🔧 tools · 💾 cache |
| azure/us/o1-2024-12-17 | $16.50 | $66.00 | $8.25 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure/us/o1-preview-2024-09-12 | $16.50 | $66.00 | $8.25 | 128k | 🔧 tools · 💾 cache |
| azure/gpt-4 | $30.00 | $60.00 | — | 8k | 🔧 tools |
| azure/gpt-4-0613 | $30.00 | $60.00 | — | 8k | 🔧 tools |
| azure/gpt-4-32k | $60.00 | $120.00 | — | 33k | |
| azure/gpt-4-32k-0613 | $60.00 | $120.00 | — | 33k | |
| azure/gpt-4.5-preview | $75.00 | $150.00 | $37.50 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
bedrock converse (121)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| amazon.nova-micro-v1:0 | $0.04 | $0.14 | — | 128k | 🔧 tools · 💾 cache |
| us.amazon.nova-micro-v1:0 | $0.04 | $0.14 | — | 128k | 🔧 tools · 💾 cache |
| apac.amazon.nova-micro-v1:0 | $0.04 | $0.15 | — | 128k | 🔧 tools · 💾 cache |
| google.gemma-3-4b-it | $0.04 | $0.08 | — | 128k | 👁️ vision |
| mistral.voxtral-mini-3b-2507 | $0.04 | $0.04 | — | 128k | |
| eu.amazon.nova-micro-v1:0 | $0.05 | $0.18 | — | 128k | 🔧 tools · 💾 cache |
| amazon.nova-lite-v1:0 | $0.06 | $0.24 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| nvidia.nemotron-nano-9b-v2 | $0.06 | $0.23 | — | 128k | |
| nvidia.nemotron-nano-3-30b | $0.06 | $0.24 | — | 262k | 🔧 tools |
| us.amazon.nova-lite-v1:0 | $0.06 | $0.24 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| apac.amazon.nova-lite-v1:0 | $0.06 | $0.25 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| openai.gpt-oss-20b-1:0 | $0.07 | $0.30 | — | 128k | 🔧 tools |
| openai.gpt-oss-safeguard-20b | $0.07 | $0.20 | — | 128k | |
| zai.glm-4.7-flash | $0.07 | $0.40 | — | 200k | 🔧 tools |
| eu.amazon.nova-lite-v1:0 | $0.08 | $0.31 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| google.gemma-3-12b-it | $0.09 | $0.29 | — | 128k | 👁️ vision |
| mistral.ministral-3-3b-instruct | $0.10 | $0.10 | — | 128k | 🔧 tools |
| mistral.voxtral-small-24b-2507 | $0.10 | $0.30 | — | 128k | |
| mistral.ministral-3-8b-instruct | $0.15 | $0.15 | — | 128k | 🔧 tools |
| nvidia.nemotron-super-3-120b | $0.15 | $0.65 | — | 256k | 🔧 tools |
| openai.gpt-oss-120b-1:0 | $0.15 | $0.60 | — | 128k | 🔧 tools |
| openai.gpt-oss-safeguard-120b | $0.15 | $0.60 | — | 128k | |
| qwen.qwen3-coder-30b-a3b-v1:0 | $0.15 | $0.60 | — | 262k | 🔧 tools |
| qwen.qwen3-32b-v1:0 | $0.15 | $0.60 | — | 131k | 🔧 tools |
| qwen.qwen3-next-80b-a3b | $0.15 | $1.20 | — | 128k | 🔧 tools |
| meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | — | 128k | 🔧 tools |
| us.meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | — | 128k | 🔧 tools |
| mistral.ministral-3-14b-instruct | $0.20 | $0.20 | — | 128k | 🔧 tools |
| nvidia.nemotron-nano-12b-v2 | $0.20 | $0.60 | — | 128k | 👁️ vision |
| qwen.qwen3-coder-480b-a35b-v1:0 | $0.22 | $1.80 | — | 262k | 🔧 tools |
| qwen.qwen3-235b-a22b-2507-v1:0 | $0.22 | $0.88 | — | 262k | 🔧 tools |
| google.gemma-3-27b-it | $0.23 | $0.38 | — | 128k | 👁️ vision |
| meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | — | 128k | 🔧 tools |
| us.meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | — | 128k | 🔧 tools |
| amazon.nova-2-lite-v1:0 | $0.30 | $2.50 | $0.07 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| global.amazon.nova-2-lite-v1:0 | $0.30 | $2.50 | $0.07 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| minimax.minimax-m2 | $0.30 | $1.20 | — | 128k | |
| minimax.minimax-m2.1 | $0.30 | $1.20 | — | 196k | 🔧 tools |
| minimax.minimax-m2.5 | $0.30 | $1.20 | — | 1000k | 🔧 tools |
| apac.amazon.nova-2-lite-v1:0 | $0.33 | $2.75 | $0.08 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.amazon.nova-2-lite-v1:0 | $0.33 | $2.75 | $0.08 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.amazon.nova-2-lite-v1:0 | $0.33 | $2.75 | $0.08 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| mistral.devstral-2-123b | $0.40 | $2.00 | — | 256k | 🔧 tools |
| mistral.magistral-small-2509 | $0.50 | $1.50 | — | 128k | 🔧 tools |
| mistral.mistral-large-3-675b-instruct | $0.50 | $1.50 | — | 128k | 🔧 tools |
| qwen.qwen3-coder-next | $0.50 | $1.20 | — | 262k | 🔧 tools |
| qwen.qwen3-vl-235b-a22b | $0.53 | $2.66 | — | 128k | 👁️ vision · 🔧 tools |
| deepseek.v3-v1:0 | $0.58 | $1.68 | — | 164k | 🔧 tools |
| us.writer.palmyra-x5-v1:0 | $0.60 | $6.00 | — | 1000k | 🔧 tools |
| writer.palmyra-x5-v1:0 | $0.60 | $6.00 | — | 1000k | 🔧 tools |
| moonshot.kimi-k2-thinking | $0.60 | $2.50 | — | 128k | |
| moonshotai.kimi-k2.5 | $0.60 | $3.00 | — | 262k | 👁️ vision · 🔧 tools |
| zai.glm-4.7 | $0.60 | $2.20 | — | 200k | 🔧 tools |
| deepseek.v3.2 | $0.62 | $1.85 | — | 164k | 🔧 tools |
| us.deepseek.v3.2 | $0.62 | $1.85 | — | 164k | 🔧 tools |
| meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | — | 128k | 🔧 tools |
| us.meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | — | 128k | 🔧 tools |
| eu.deepseek.v3.2 | $0.74 | $2.22 | — | 164k | 🔧 tools |
| amazon.nova-pro-v1:0 | $0.80 | $3.20 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| us.amazon.nova-pro-v1:0 | $0.80 | $3.20 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| apac.amazon.nova-pro-v1:0 | $0.84 | $3.36 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-haiku-4-5-20251001-v1:0 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-haiku-4-5@20251001 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| zai.glm-5 | $1.00 | $3.20 | — | 200k | 🔧 tools |
| eu.amazon.nova-pro-v1:0 | $1.05 | $4.20 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| apac.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.10 | $5.50 | $0.11 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.10 | $5.50 | $0.11 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| jp.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.10 | $5.50 | $0.11 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.10 | $5.50 | $0.11 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| au.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.10 | $5.50 | $0.11 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.deepseek.r1-v1:0 | $1.35 | $5.40 | — | 128k | |
| eu.mistral.pixtral-large-2502-v1:0 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| us.mistral.pixtral-large-2502-v1:0 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| apac.amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.writer.palmyra-x4-v1:0 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| writer.palmyra-x4-v1:0 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| us.amazon.nova-premier-v1:0 | $2.50 | $12.50 | — | 1000k | 👁️ vision · 🔧 tools |
| anthropic.claude-3-7-sonnet-20250219-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-sonnet-4-6 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-sonnet-4-6 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-sonnet-4-20250514-v1:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| apac.anthropic.claude-sonnet-4-20250514-v1:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-sonnet-4-20250514-v1:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-sonnet-4-20250514-v1:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-3-7-sonnet-20250219-v1:0 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-sonnet-4-20250514-v1:0 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-sonnet-4-6 | $3.30 | $16.50 | $0.33 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-sonnet-4-6 | $3.30 | $16.50 | $0.33 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| au.anthropic.claude-sonnet-4-6 | $3.30 | $16.50 | $0.33 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| jp.anthropic.claude-sonnet-4-6 | $3.30 | $16.50 | $0.33 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| au.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| jp.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us-gov.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.30 | $16.50 | $0.33 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-opus-4-5-20251101-v1:0 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-opus-4-6-v1 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-opus-4-6-v1 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-opus-4-7 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-opus-4-7 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| global.anthropic.claude-opus-4-5-20251101-v1:0 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-opus-4-5-20251101-v1:0 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-opus-4-6-v1 | $5.50 | $27.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-opus-4-6-v1 | $5.50 | $27.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| au.anthropic.claude-opus-4-6-v1 | $5.50 | $27.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-opus-4-7 | $5.50 | $27.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-opus-4-7 | $5.50 | $27.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| au.anthropic.claude-opus-4-7 | $5.50 | $27.50 | $0.55 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-opus-4-5-20251101-v1:0 | $5.50 | $27.50 | $0.55 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-opus-4-1-20250805-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| anthropic.claude-opus-4-20250514-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-opus-4-1-20250805-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| eu.anthropic.claude-opus-4-20250514-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-opus-4-1-20250805-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| us.anthropic.claude-opus-4-20250514-v1:0 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
openai (97)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| gpt-5-nano | $0.05 | $0.40 | $0.0050 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5-nano-2025-08-07 | $0.05 | $0.40 | $0.0050 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4.1-nano | $0.10 | $0.40 | $0.02 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4.1-nano-2025-04-14 | $0.10 | $0.40 | $0.02 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-mini | $0.15 | $0.60 | $0.07 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-mini-2024-07-18 | $0.15 | $0.60 | $0.07 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-mini-audio-preview | $0.15 | $0.60 | — | 128k | 🔧 tools |
| gpt-4o-mini-audio-preview-2024-12-17 | $0.15 | $0.60 | — | 128k | 🔧 tools |
| gpt-4o-mini-search-preview | $0.15 | $0.60 | $0.07 | 128k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4o-mini-search-preview-2025-03-11 | $0.15 | $0.60 | $0.07 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| ft:gpt-4.1-nano-2025-04-14 | $0.20 | $0.80 | $0.05 | 1048k | 🔧 tools · 💾 cache |
| gpt-5.4-nano | $0.20 | $1.25 | $0.02 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.4-nano-2026-03-17 | $0.20 | $1.25 | $0.02 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5-mini | $0.25 | $2.00 | $0.02 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5-mini-2025-08-07 | $0.25 | $2.00 | $0.02 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| ft:gpt-4o-mini-2024-07-18 | $0.30 | $1.20 | $0.15 | 128k | 🔧 tools · 💾 cache |
| gpt-4.1-mini | $0.40 | $1.60 | $0.10 | 1048k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4.1-mini-2025-04-14 | $0.40 | $1.60 | $0.10 | 1048k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-3.5-turbo | $0.50 | $1.50 | — | 16k | 🔧 tools · 💾 cache |
| gpt-3.5-turbo-0125 | $0.50 | $1.50 | — | 16k | 🔧 tools · 💾 cache |
| gpt-audio-mini | $0.60 | $2.40 | — | 128k | 🔧 tools |
| gpt-audio-mini-2025-10-06 | $0.60 | $2.40 | — | 128k | 🔧 tools |
| gpt-audio-mini-2025-12-15 | $0.60 | $2.40 | — | 128k | 🔧 tools |
| gpt-4o-mini-realtime-preview | $0.60 | $2.40 | $0.30 | 128k | 🔧 tools |
| gpt-4o-mini-realtime-preview-2024-12-17 | $0.60 | $2.40 | $0.30 | 128k | 🔧 tools |
| gpt-realtime-mini | $0.60 | $2.40 | — | 128k | 🔧 tools |
| gpt-realtime-mini-2025-10-06 | $0.60 | $2.40 | $0.06 | 128k | 🔧 tools |
| gpt-realtime-mini-2025-12-15 | $0.60 | $2.40 | $0.06 | 128k | 🔧 tools |
| gpt-5.4-mini | $0.75 | $4.50 | $0.07 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.4-mini-2026-03-17 | $0.75 | $4.50 | $0.07 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| ft:gpt-4.1-mini-2025-04-14 | $0.80 | $3.20 | $0.20 | 1048k | 🔧 tools · 💾 cache |
| gpt-3.5-turbo-1106 | $1.00 | $2.00 | — | 16k | 🔧 tools · 💾 cache |
| o3-mini | $1.10 | $4.40 | $0.55 | 200k | 🔧 tools · 💾 cache |
| o3-mini-2025-01-31 | $1.10 | $4.40 | $0.55 | 200k | 🔧 tools · 💾 cache |
| o4-mini | $1.10 | $4.40 | $0.28 | 200k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| o4-mini-2025-04-16 | $1.10 | $4.40 | $0.28 | 200k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.1 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.1-2025-11-13 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.1-chat-latest | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 💾 cache · 🌐 search |
| gpt-5-2025-08-07 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5-chat | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 💾 cache |
| gpt-5-chat-latest | $1.25 | $10.00 | $0.13 | 128k | 👁️ vision · 💾 cache |
| gpt-5-search-api | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5-search-api-2025-10-14 | $1.25 | $10.00 | $0.13 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.2 | $1.75 | $14.00 | $0.17 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.2-2025-12-11 | $1.75 | $14.00 | $0.17 | 272k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.2-chat-latest | $1.75 | $14.00 | $0.17 | 128k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.3-chat-latest | $1.75 | $14.00 | $0.17 | 128k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4.1 | $2.00 | $8.00 | $0.50 | 1048k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4.1-2025-04-14 | $2.00 | $8.00 | $0.50 | 1048k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| o3 | $2.00 | $8.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| o3-2025-04-16 | $2.00 | $8.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4o | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-2024-08-06 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-2024-11-20 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-audio-preview | $2.50 | $10.00 | — | 128k | 🔧 tools |
| gpt-4o-audio-preview-2024-12-17 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| gpt-4o-audio-preview-2025-06-03 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| gpt-audio | $2.50 | $10.00 | — | 128k | 🔧 tools |
| gpt-audio-1.5 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| gpt-audio-2025-08-28 | $2.50 | $10.00 | — | 128k | 🔧 tools |
| gpt-4o-search-preview | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4o-search-preview-2025-03-11 | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-5.4 | $2.50 | $15.00 | $0.25 | 1050k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-5.4-2026-03-05 | $2.50 | $15.00 | $0.25 | 1050k | 👁️ vision · 🔧 tools · 💾 cache |
| ft:gpt-3.5-turbo | $3.00 | $6.00 | — | 16k | |
| ft:gpt-3.5-turbo-0125 | $3.00 | $6.00 | — | 16k | |
| ft:gpt-3.5-turbo-0613 | $3.00 | $6.00 | — | 4k | |
| ft:gpt-3.5-turbo-1106 | $3.00 | $6.00 | — | 16k | |
| ft:gpt-4.1-2025-04-14 | $3.00 | $12.00 | $0.75 | 1048k | 🔧 tools · 💾 cache |
| gpt-3.5-turbo-16k | $3.00 | $4.00 | — | 16k | 💾 cache |
| ft:gpt-4o-2024-08-06 | $3.75 | $15.00 | $1.88 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| ft:gpt-4o-2024-11-20 | $3.75 | $15.00 | — | 128k | 🔧 tools · 💾 cache |
| ft:o4-mini-2025-04-16 | $4.00 | $16.00 | $1.00 | 200k | 🔧 tools · 💾 cache |
| gpt-realtime | $4.00 | $16.00 | $0.40 | 32k | 🔧 tools |
| gpt-realtime-1.5 | $4.00 | $16.00 | $0.40 | 32k | 🔧 tools |
| gpt-realtime-2 | $4.00 | $16.00 | $0.40 | 32k | 🔧 tools |
| gpt-realtime-2025-08-28 | $4.00 | $16.00 | $0.40 | 32k | 🔧 tools |
| chatgpt-4o-latest | $5.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-2024-05-13 | $5.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4o-realtime-preview | $5.00 | $20.00 | $2.50 | 128k | 🔧 tools |
| gpt-4o-realtime-preview-2024-12-17 | $5.00 | $20.00 | $2.50 | 128k | 🔧 tools |
| gpt-4o-realtime-preview-2025-06-03 | $5.00 | $20.00 | $2.50 | 128k | 🔧 tools |
| gpt-5.5 | $5.00 | $30.00 | $0.50 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-5.5-2026-04-23 | $5.00 | $30.00 | $0.50 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gpt-4-0125-preview | $10.00 | $30.00 | — | 128k | 🔧 tools · 💾 cache |
| gpt-4-1106-preview | $10.00 | $30.00 | — | 128k | 🔧 tools · 💾 cache |
| gpt-4-turbo | $10.00 | $30.00 | — | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4-turbo-2024-04-09 | $10.00 | $30.00 | — | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| gpt-4-turbo-preview | $10.00 | $30.00 | — | 128k | 🔧 tools · 💾 cache |
| o1 | $15.00 | $60.00 | $7.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| o1-2024-12-17 | $15.00 | $60.00 | $7.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| ft:gpt-4-0613 | $30.00 | $60.00 | — | 8k | 🔧 tools |
| gpt-4 | $30.00 | $60.00 | — | 8k | 🔧 tools · 💾 cache |
| gpt-4-0314 | $30.00 | $60.00 | — | 8k | |
| gpt-4-0613 | $30.00 | $60.00 | — | 8k | 🔧 tools · 💾 cache |
vercel ai gateway (95)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vercel_ai_gateway/amazon/titan-embed-text-v2 | $0.02 | $0.0000 | — | — | |
| vercel_ai_gateway/amazon/nova-micro | $0.04 | $0.14 | — | 128k | 🔧 tools |
| vercel_ai_gateway/mistral/ministral-3b | $0.04 | $0.04 | — | 128k | 🔧 tools |
| vercel_ai_gateway/meta/llama-3-8b | $0.05 | $0.08 | — | 8k | |
| vercel_ai_gateway/meta/llama-3.1-8b | $0.05 | $0.08 | — | 131k | 🔧 tools |
| vercel_ai_gateway/amazon/nova-lite | $0.06 | $0.24 | — | 300k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/mistral/devstral-small | $0.07 | $0.28 | — | 128k | 🔧 tools |
| vercel_ai_gateway/google/gemini-2.0-flash-lite | $0.07 | $0.30 | — | 1049k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/alibaba/qwen-3-14b | $0.08 | $0.24 | — | 41k | |
| vercel_ai_gateway/alibaba/qwen-3-30b | $0.10 | $0.30 | — | 41k | |
| vercel_ai_gateway/alibaba/qwen-3-32b | $0.10 | $0.30 | — | 41k | 🔧 tools |
| vercel_ai_gateway/meta/llama-3.2-1b | $0.10 | $0.10 | — | 128k | |
| vercel_ai_gateway/meta/llama-4-scout | $0.10 | $0.30 | — | 131k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/mistral/ministral-8b | $0.10 | $0.10 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/mistral/mistral-embed | $0.10 | $0.0000 | — | — | |
| vercel_ai_gateway/mistral/mistral-small | $0.10 | $0.30 | — | 32k | 🔧 tools |
| vercel_ai_gateway/openai/gpt-4.1-nano | $0.10 | $0.40 | $0.02 | 1048k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/cohere/embed-v4.0 | $0.12 | $0.0000 | — | — | |
| vercel_ai_gateway/cohere/command-r | $0.15 | $0.60 | — | 128k | 🔧 tools |
| vercel_ai_gateway/google/gemini-2.0-flash | $0.15 | $0.60 | — | 1049k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/meta/llama-3.2-3b | $0.15 | $0.15 | — | 128k | 🔧 tools |
| vercel_ai_gateway/mistral/codestral-embed | $0.15 | $0.0000 | — | — | |
| vercel_ai_gateway/mistral/pixtral-12b | $0.15 | $0.15 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/openai/gpt-4o-mini | $0.15 | $0.60 | $0.07 | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/meta/llama-3.2-11b | $0.16 | $0.16 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/alibaba/qwen-3-235b | $0.20 | $0.60 | — | 41k | |
| vercel_ai_gateway/google/gemma-2-9b | $0.20 | $0.20 | — | 8k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/meta/llama-4-maverick | $0.20 | $0.60 | — | 131k | |
| vercel_ai_gateway/zai/glm-4.5-air | $0.20 | $1.10 | — | 128k | 🔧 tools |
| vercel_ai_gateway/anthropic/claude-3-haiku | $0.25 | $1.25 | $0.03 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/inception/mercury-coder-small | $0.25 | $1.00 | — | 32k | |
| vercel_ai_gateway/google/gemini-2.5-flash | $0.30 | $2.50 | — | 1000k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/mistral/codestral | $0.30 | $0.90 | — | 256k | 🔧 tools |
| vercel_ai_gateway/xai/grok-3-mini | $0.30 | $0.50 | — | 131k | 🔧 tools |
| vercel_ai_gateway/alibaba/qwen3-coder | $0.40 | $1.60 | — | 262k | 🔧 tools |
| vercel_ai_gateway/openai/gpt-4.1-mini | $0.40 | $1.60 | $0.10 | 1048k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/zai/glm-4.6 | $0.45 | $1.80 | $0.11 | 200k | 🔧 tools |
| vercel_ai_gateway/mistral/magistral-small | $0.50 | $1.50 | — | 128k | 🔧 tools |
| vercel_ai_gateway/openai/gpt-3.5-turbo | $0.50 | $1.50 | — | 16k | 🔧 tools |
| vercel_ai_gateway/deepseek/deepseek-r1 | $0.55 | $2.19 | — | 128k | |
| vercel_ai_gateway/moonshotai/kimi-k2 | $0.55 | $2.20 | — | 131k | 🔧 tools |
| vercel_ai_gateway/meta/llama-3-70b | $0.59 | $0.79 | — | 8k | |
| vercel_ai_gateway/xai/grok-3-mini-fast | $0.60 | $4.00 | — | 131k | 🔧 tools |
| vercel_ai_gateway/zai/glm-4.5 | $0.60 | $2.20 | — | 131k | 🔧 tools |
| vercel_ai_gateway/meta/llama-3.1-70b | $0.72 | $0.72 | — | 128k | |
| vercel_ai_gateway/meta/llama-3.2-90b | $0.72 | $0.72 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/meta/llama-3.3-70b | $0.72 | $0.72 | — | 128k | 🔧 tools |
| vercel_ai_gateway/deepseek/deepseek-r1-distill-llama-70b | $0.75 | $0.99 | — | 131k | 🔧 tools |
| vercel_ai_gateway/mistral/mistral-saba-24b | $0.79 | $0.79 | — | 33k | |
| vercel_ai_gateway/amazon/nova-pro | $0.80 | $3.20 | — | 300k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-3.5-haiku | $0.80 | $4.00 | $0.08 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/morph/morph-v3-fast | $0.80 | $1.20 | — | 33k | |
| vercel_ai_gateway/deepseek/deepseek-v3 | $0.90 | $0.90 | — | 128k | |
| vercel_ai_gateway/morph/morph-v3-large | $0.90 | $1.90 | — | 33k | |
| vercel_ai_gateway/anthropic/claude-haiku-4.5 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/perplexity/sonar | $1.00 | $1.00 | — | 127k | |
| vercel_ai_gateway/perplexity/sonar-reasoning | $1.00 | $5.00 | — | 127k | |
| vercel_ai_gateway/openai/o3-mini | $1.10 | $4.40 | $0.55 | 200k | 🔧 tools |
| vercel_ai_gateway/openai/o4-mini | $1.10 | $4.40 | $0.28 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/mistral/mixtral-8x22b-instruct | $1.20 | $1.20 | — | 66k | 🔧 tools |
| vercel_ai_gateway/openai/gpt-3.5-turbo-instruct | $1.50 | $2.00 | — | 8k | |
| vercel_ai_gateway/mistral/magistral-medium | $2.00 | $5.00 | — | 128k | 🔧 tools |
| vercel_ai_gateway/mistral/mistral-large | $2.00 | $6.00 | — | 32k | 🔧 tools |
| vercel_ai_gateway/mistral/pixtral-large | $2.00 | $6.00 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/openai/gpt-4.1 | $2.00 | $8.00 | $0.50 | 1048k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/openai/o3 | $2.00 | $8.00 | $0.50 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/perplexity/sonar-reasoning-pro | $2.00 | $8.00 | — | 127k | |
| vercel_ai_gateway/xai/grok-2 | $2.00 | $10.00 | — | 131k | 🔧 tools |
| vercel_ai_gateway/xai/grok-2-vision | $2.00 | $10.00 | — | 33k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/cohere/command-a | $2.50 | $10.00 | — | 256k | 🔧 tools |
| vercel_ai_gateway/cohere/command-r-plus | $2.50 | $10.00 | — | 128k | 🔧 tools |
| vercel_ai_gateway/google/gemini-2.5-pro | $2.50 | $10.00 | — | 1049k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/openai/gpt-4o | $2.50 | $10.00 | $1.25 | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-3.5-sonnet | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-3.7-sonnet | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-4-sonnet | $3.00 | $15.00 | $0.30 | 200k | 🔧 tools |
| vercel_ai_gateway/anthropic/claude-3-5-sonnet | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/anthropic/claude-3-5-sonnet-20241022 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/anthropic/claude-3-7-sonnet | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/anthropic/claude-sonnet-4 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/anthropic/claude-sonnet-4.5 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/perplexity/sonar-pro | $3.00 | $15.00 | — | 200k | |
| vercel_ai_gateway/vercel/v0-1.0-md | $3.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/vercel/v0-1.5-md | $3.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/xai/grok-3 | $3.00 | $15.00 | — | 131k | 🔧 tools |
| vercel_ai_gateway/xai/grok-4 | $3.00 | $15.00 | — | 256k | 🔧 tools |
| vercel_ai_gateway/anthropic/claude-opus-4.5 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/anthropic/claude-opus-4.6 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/xai/grok-3-fast | $5.00 | $25.00 | — | 131k | 🔧 tools |
| vercel_ai_gateway/openai/gpt-4-turbo | $10.00 | $30.00 | — | 128k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-3-opus | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-4-opus | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| vercel_ai_gateway/anthropic/claude-opus-4 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/anthropic/claude-opus-4.1 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vercel_ai_gateway/openai/o1 | $15.00 | $60.00 | $7.50 | 200k | 👁️ vision · 🔧 tools |
openrouter (92)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| openrouter/openrouter/auto | $0.0000 | $0.0000 | — | 2000k | 👁️ vision · 🔧 tools |
| openrouter/openrouter/free | $0.0000 | $0.0000 | — | 200k | 👁️ vision · 🔧 tools |
| openrouter/openrouter/bodybuilder | $0.0000 | $0.0000 | — | 128k | |
| openrouter/openai/gpt-oss-20b | $0.02 | $0.10 | — | 131k | 🔧 tools |
| openrouter/openai/gpt-5-nano | $0.05 | $0.40 | $0.0050 | 272k | |
| openrouter/z-ai/glm-4.7-flash | $0.07 | $0.40 | $0.0000 | 200k | 👁️ vision · 🔧 tools |
| openrouter/qwen/qwen3-235b-a22b-2507 | $0.07 | $0.10 | — | 262k | 🔧 tools |
| openrouter/xiaomi/mimo-v2-flash | $0.09 | $0.29 | $0.0000 | 262k | 🔧 tools |
| openrouter/bytedance/ui-tars-1.5-7b | $0.10 | $0.20 | — | 131k | |
| openrouter/google/gemini-2.0-flash-001 | $0.10 | $0.40 | — | 1049k | 👁️ vision · 🔧 tools |
| openrouter/mistralai/ministral-3b-2512 | $0.10 | $0.10 | — | 131k | 👁️ vision · 🔧 tools |
| openrouter/mistralai/mistral-small-3.1-24b-instruct | $0.10 | $0.30 | — | 131k | |
| openrouter/mistralai/mistral-small-3.2-24b-instruct | $0.10 | $0.30 | — | 128k | |
| openrouter/openai/gpt-4.1-nano | $0.10 | $0.40 | $0.02 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/qwen/qwen3.5-flash-02-23 | $0.10 | $0.40 | — | 1000k | 👁️ vision · 🔧 tools |
| openrouter/qwen/qwen3-235b-a22b-thinking-2507 | $0.11 | $0.60 | — | 262k | 🔧 tools |
| openrouter/mistralai/mistral-7b-instruct | $0.13 | $0.13 | — | 33k | |
| openrouter/deepseek/deepseek-chat | $0.14 | $0.28 | — | 66k | 💾 cache |
| openrouter/deepseek/deepseek-chat-v3-0324 | $0.14 | $0.28 | — | 66k | 💾 cache |
| openrouter/mistralai/devstral-2512 | $0.15 | $0.60 | — | 262k | 🔧 tools |
| openrouter/mistralai/ministral-8b-2512 | $0.15 | $0.15 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/openai/gpt-oss-120b | $0.18 | $0.80 | — | 131k | 🔧 tools |
| openrouter/qwen/qwen-2.5-coder-32b-instruct | $0.18 | $0.18 | — | 34k | |
| openrouter/deepseek/deepseek-chat-v3.1 | $0.20 | $0.80 | — | 164k | 🔧 tools · 💾 cache |
| openrouter/deepseek/deepseek-v3.2-exp | $0.20 | $0.40 | — | 164k | 🔧 tools · 💾 cache |
| openrouter/mistralai/ministral-14b-2512 | $0.20 | $0.20 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/qwen/qwen-vl-plus | $0.21 | $0.63 | — | 8k | 👁️ vision |
| openrouter/qwen/qwen3-coder | $0.22 | $0.95 | — | 262k | 🔧 tools |
| openrouter/anthropic/claude-3-haiku | $0.25 | $1.25 | — | 200k | 👁️ vision · 🔧 tools |
| openrouter/google/gemini-3.1-flash-lite-preview | $0.25 | $1.50 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| openrouter/openai/gpt-5-mini | $0.25 | $2.00 | $0.02 | 272k | |
| openrouter/qwen/qwen3.5-35b-a3b | $0.25 | $2.00 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/minimax/minimax-m2 | $0.26 | $1.02 | — | 205k | 🔧 tools · 💾 cache |
| openrouter/minimax/minimax-m2.1 | $0.27 | $1.20 | $0.0000 | 204k | 👁️ vision · 🔧 tools |
| openrouter/deepseek/deepseek-v3.2 | $0.28 | $0.40 | — | 164k | 🔧 tools · 💾 cache |
| openrouter/google/gemini-2.5-flash | $0.30 | $2.50 | — | 1049k | 👁️ vision · 🔧 tools |
| openrouter/qwen/qwen3.5-27b | $0.30 | $2.40 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/minimax/minimax-m2.5 | $0.30 | $1.10 | $0.15 | 197k | 🔧 tools · 💾 cache |
| openrouter/qwen/qwen3.6-plus | $0.33 | $1.95 | — | 1000k | 👁️ vision · 🔧 tools |
| openrouter/openai/gpt-4.1-mini | $0.40 | $1.60 | $0.10 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/qwen/qwen3.5-122b-a10b | $0.40 | $2.00 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/qwen/qwen3.5-plus-02-15 | $0.40 | $2.40 | — | 1000k | 👁️ vision · 🔧 tools |
| openrouter/z-ai/glm-4.6 | $0.40 | $1.75 | — | 203k | 🔧 tools · 💾 cache |
| openrouter/z-ai/glm-4.7 | $0.40 | $1.50 | $0.0000 | 203k | 👁️ vision · 🔧 tools |
| openrouter/z-ai/glm-4.6:exacto | $0.45 | $1.90 | — | 203k | 🔧 tools · 💾 cache |
| openrouter/deepseek/deepseek-r1-0528 | $0.50 | $2.15 | — | 65k | 🔧 tools · 💾 cache |
| openrouter/google/gemini-3-flash-preview | $0.50 | $3.00 | $0.05 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| openrouter/mistralai/mistral-large-2512 | $0.50 | $1.50 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/deepseek/deepseek-r1 | $0.55 | $2.19 | — | 65k | 🔧 tools · 💾 cache |
| openrouter/meta-llama/llama-3-70b-instruct | $0.59 | $0.79 | — | 8k | |
| openrouter/moonshotai/kimi-k2.5 | $0.60 | $3.00 | $0.10 | 262k | 👁️ vision · 🔧 tools |
| openrouter/qwen/qwen3.5-397b-a17b | $0.60 | $3.60 | — | 262k | 👁️ vision · 🔧 tools |
| openrouter/mistralai/mixtral-8x22b-instruct | $0.65 | $0.65 | — | 66k | |
| openrouter/z-ai/glm-5 | $0.80 | $2.56 | — | 203k | 🔧 tools |
| openrouter/switchpoint/router | $0.85 | $3.40 | — | 131k | |
| openrouter/anthropic/claude-haiku-4.5 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/qwen/qwen3-coder-plus | $1.00 | $5.00 | — | 998k | 🔧 tools |
| openrouter/openai/o3-mini | $1.10 | $4.40 | — | 128k | 🔧 tools |
| openrouter/openai/o3-mini-high | $1.10 | $4.40 | — | 128k | 🔧 tools |
| openrouter/google/gemini-2.5-pro | $1.25 | $10.00 | — | 1049k | 👁️ vision · 🔧 tools |
| openrouter/openai/gpt-5-chat | $1.25 | $10.00 | $0.13 | 128k | |
| openrouter/openai/gpt-5-codex | $1.25 | $10.00 | $0.13 | 272k | |
| openrouter/openai/gpt-5 | $1.25 | $10.00 | $0.13 | 272k | |
| openrouter/openai/gpt-5.1-codex-max | $1.25 | $10.00 | $0.13 | 400k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-3.5-turbo | $1.50 | $2.00 | — | 16k | |
| openrouter/openai/gpt-5.2-codex | $1.75 | $14.00 | $0.17 | 272k | |
| openrouter/openai/gpt-5.2 | $1.75 | $14.00 | $0.17 | 272k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-5.2-chat | $1.75 | $14.00 | $0.17 | 128k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/gryphe/mythomax-l2-13b | $1.88 | $1.88 | — | 8k | |
| openrouter/undi95/remm-slerp-l2-13b | $1.88 | $1.88 | — | 6k | |
| openrouter/google/gemini-3-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| openrouter/google/gemini-3.1-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-4.1 | $2.00 | $8.00 | $0.50 | 1048k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-4o | $2.50 | $10.00 | — | 128k | 👁️ vision · 🔧 tools |
| openrouter/anthropic/claude-3.5-sonnet | $3.00 | $15.00 | — | 200k | 👁️ vision · 🔧 tools |
| openrouter/anthropic/claude-3.7-sonnet | $3.00 | $15.00 | — | 200k | 👁️ vision · 🔧 tools |
| openrouter/anthropic/claude-sonnet-4 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/anthropic/claude-sonnet-4.6 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/anthropic/claude-sonnet-4.5 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-3.5-turbo-16k | $3.00 | $4.00 | — | 16k | |
| openrouter/x-ai/grok-4 | $3.00 | $15.00 | — | 256k | 🔧 tools · 🌐 search |
| openrouter/anthropic/claude-opus-4.5 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/anthropic/claude-opus-4.6 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/anthropic/claude-opus-4.7 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-4o-2024-05-13 | $5.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools |
| openrouter/mancer/weaver | $5.63 | $5.63 | — | 8k | |
| openrouter/mistralai/mistral-large | $8.00 | $24.00 | — | 128k | |
| openrouter/anthropic/claude-opus-4 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/anthropic/claude-opus-4.1 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/o1 | $15.00 | $60.00 | $7.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| openrouter/openai/gpt-5.2-pro | $21.00 | $168.00 | — | 272k | 👁️ vision · 🔧 tools |
| openrouter/openai/gpt-4 | $30.00 | $60.00 | — | 8k |
novita (80)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| novita/paddlepaddle/paddleocr-vl | $0.02 | $0.02 | — | 16k | 👁️ vision |
| novita/meta-llama/llama-3.1-8b-instruct | $0.02 | $0.05 | — | 16k | |
| novita/deepseek/deepseek-ocr | $0.03 | $0.03 | — | 8k | 👁️ vision |
| novita/qwen/qwen3-4b-fp8 | $0.03 | $0.03 | — | 128k | |
| novita/meta-llama/llama-3.2-3b-instruct | $0.03 | $0.05 | — | 33k | 🔧 tools |
| novita/zai-org/autoglm-phone-9b-multilingual | $0.04 | $0.14 | — | 66k | 👁️ vision |
| novita/qwen/qwen3-8b-fp8 | $0.04 | $0.14 | — | 128k | |
| novita/openai/gpt-oss-20b | $0.04 | $0.15 | — | 131k | 👁️ vision |
| novita/mistralai/mistral-nemo | $0.04 | $0.17 | — | 60k | |
| novita/meta-llama/llama-3-8b-instruct | $0.04 | $0.04 | — | 8k | |
| novita/openai/gpt-oss-120b | $0.05 | $0.25 | — | 131k | 👁️ vision · 🔧 tools |
| novita/google/gemma-3-12b-it | $0.05 | $0.10 | — | 131k | 👁️ vision |
| novita/sao10k/l3-8b-lunaris | $0.05 | $0.05 | — | 8k | |
| novita/Sao10K/L3-8B-Stheno-v3.2 | $0.05 | $0.05 | — | 8k | 🔧 tools |
| novita/deepseek/deepseek-r1-0528-qwen3-8b | $0.06 | $0.09 | — | 128k | |
| novita/qwen/qwen3-coder-30b-a3b-instruct | $0.07 | $0.27 | — | 160k | 🔧 tools |
| novita/baidu/ernie-4.5-21B-a3b-thinking | $0.07 | $0.28 | — | 131k | |
| novita/baichuan/baichuan-m2-32b | $0.07 | $0.07 | — | 131k | |
| novita/baidu/ernie-4.5-21B-a3b | $0.07 | $0.28 | — | 120k | 🔧 tools |
| novita/qwen/qwen2.5-7b-instruct | $0.07 | $0.07 | — | 32k | 🔧 tools |
| novita/qwen/qwen3-vl-8b-instruct | $0.08 | $0.50 | — | 131k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen3-235b-a22b-instruct-2507 | $0.09 | $0.58 | — | 131k | 🔧 tools |
| novita/qwen/qwen3-30b-a3b-fp8 | $0.09 | $0.45 | — | 41k | |
| novita/gryphe/mythomax-l2-13b | $0.09 | $0.09 | — | 4k | |
| novita/xiaomimimo/mimo-v2-flash | $0.10 | $0.30 | $0.02 | 262k | 🔧 tools |
| novita/qwen/qwen3-32b-fp8 | $0.10 | $0.45 | — | 41k | |
| novita/google/gemma-3-27b-it | $0.12 | $0.20 | — | 98k | 👁️ vision |
| novita/zai-org/glm-4.5-air | $0.13 | $0.85 | — | 131k | 🔧 tools |
| novita/meta-llama/llama-3.3-70b-instruct | $0.14 | $0.40 | — | 131k | 🔧 tools |
| novita/nousresearch/hermes-2-pro-llama-3-8b | $0.14 | $0.14 | — | 8k | |
| novita/baidu/ernie-4.5-vl-28b-a3b | $0.14 | $0.56 | — | 30k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen3-next-80b-a3b-instruct | $0.15 | $1.50 | — | 131k | 🔧 tools |
| novita/qwen/qwen3-next-80b-a3b-thinking | $0.15 | $1.50 | — | 131k | 🔧 tools |
| novita/deepseek/deepseek-r1-distill-qwen-14b | $0.15 | $0.15 | — | 33k | |
| novita/meta-llama/llama-4-scout-17b-16e-instruct | $0.18 | $0.59 | — | 131k | 👁️ vision |
| novita/skywork/r1v4-lite | $0.20 | $0.60 | — | 262k | 👁️ vision |
| novita/qwen/qwen3-235b-a22b-fp8 | $0.20 | $0.80 | — | 41k | |
| novita/qwen/qwen3-vl-30b-a3b-instruct | $0.20 | $0.70 | — | 131k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen3-vl-30b-a3b-thinking | $0.20 | $1.00 | — | 131k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen3-omni-30b-a3b-thinking | $0.25 | $0.97 | — | 66k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen3-omni-30b-a3b-instruct | $0.25 | $0.97 | — | 66k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen-mt-plus | $0.25 | $0.75 | — | 16k | |
| novita/deepseek/deepseek-v3.2 | $0.27 | $0.40 | $0.13 | 164k | 🔧 tools |
| novita/deepseek/deepseek-v3.2-exp | $0.27 | $0.41 | — | 164k | 🔧 tools |
| novita/deepseek/deepseek-v3.1-terminus | $0.27 | $1.00 | $0.14 | 131k | 🔧 tools |
| novita/deepseek/deepseek-v3.1 | $0.27 | $1.00 | $0.14 | 131k | 🔧 tools |
| novita/deepseek/deepseek-v3-0324 | $0.27 | $1.12 | $0.14 | 164k | 🔧 tools |
| novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | $0.27 | $0.85 | — | 1049k | 👁️ vision |
| novita/baidu/ernie-4.5-300b-a47b-paddle | $0.28 | $1.10 | — | 123k | |
| novita/minimax/minimax-m2.1 | $0.30 | $1.20 | $0.03 | 205k | 🔧 tools |
| novita/minimax/minimax-m2 | $0.30 | $1.20 | $0.03 | 205k | 🔧 tools |
| novita/zai-org/glm-4.6v | $0.30 | $0.90 | $0.06 | 131k | 👁️ vision · 🔧 tools |
| novita/kwaipilot/kat-coder-pro | $0.30 | $1.20 | $0.06 | 256k | 🔧 tools |
| novita/qwen/qwen3-vl-235b-a22b-instruct | $0.30 | $1.50 | — | 131k | 👁️ vision · 🔧 tools |
| novita/qwen/qwen3-coder-480b-a35b-instruct | $0.30 | $1.30 | — | 262k | 🔧 tools |
| novita/qwen/qwen3-235b-a22b-thinking-2507 | $0.30 | $3.00 | — | 131k | 🔧 tools |
| novita/deepseek/deepseek-r1-distill-qwen-32b | $0.30 | $0.30 | — | 64k | |
| novita/qwen/qwen-2.5-72b-instruct | $0.38 | $0.40 | — | 32k | 🔧 tools |
| novita/baidu/ernie-4.5-vl-28b-a3b-thinking | $0.39 | $0.39 | — | 131k | 👁️ vision · 🔧 tools |
| novita/deepseek/deepseek-v3-turbo | $0.40 | $1.30 | — | 64k | 🔧 tools |
| novita/baidu/ernie-4.5-vl-424b-a47b | $0.42 | $1.25 | — | 123k | 👁️ vision |
| novita/meta-llama/llama-3-70b-instruct | $0.51 | $0.74 | — | 8k | |
| novita/zai-org/glm-4.6 | $0.55 | $2.20 | $0.11 | 205k | 🔧 tools |
| novita/minimaxai/minimax-m1-80k | $0.55 | $2.20 | — | 1000k | 🔧 tools |
| novita/moonshotai/kimi-k2-instruct | $0.57 | $2.30 | — | 131k | 🔧 tools |
| novita/zai-org/glm-4.7 | $0.60 | $2.20 | $0.11 | 205k | 🔧 tools |
| novita/moonshotai/kimi-k2-thinking | $0.60 | $2.50 | — | 262k | 🔧 tools |
| novita/moonshotai/kimi-k2-0905 | $0.60 | $2.50 | — | 262k | 🔧 tools |
| novita/zai-org/glm-4.5 | $0.60 | $2.20 | $0.11 | 131k | 🔧 tools |
| novita/zai-org/glm-4.5v | $0.60 | $1.80 | $0.11 | 66k | 👁️ vision · 🔧 tools |
| novita/microsoft/wizardlm-2-8x22b | $0.62 | $0.62 | — | 66k | |
| novita/deepseek/deepseek-r1-0528 | $0.70 | $2.50 | $0.35 | 164k | 🔧 tools |
| novita/deepseek/deepseek-prover-v2-671b | $0.70 | $2.50 | — | 160k | |
| novita/deepseek/deepseek-r1-turbo | $0.70 | $2.50 | — | 64k | 🔧 tools |
| novita/deepseek/deepseek-r1-distill-llama-70b | $0.80 | $0.80 | — | 8k | |
| novita/qwen/qwen2.5-vl-72b-instruct | $0.80 | $0.80 | — | 33k | 👁️ vision |
| novita/qwen/qwen3-vl-235b-a22b-thinking | $0.98 | $3.95 | — | 131k | 👁️ vision |
| novita/sao10k/l3-70b-euryale-v2.1 | $1.48 | $1.48 | — | 8k | 🔧 tools |
| novita/sao10k/l31-70b-euryale-v2.2 | $1.48 | $1.48 | — | 8k | 🔧 tools |
| novita/qwen/qwen3-max | $2.11 | $8.45 | — | 262k | 🔧 tools |
deepinfra (67)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| deepinfra/meta-llama/Llama-3.2-3B-Instruct | $0.02 | $0.02 | — | 131k | 🔧 tools |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.02 | $0.03 | — | 131k | 🔧 tools |
| deepinfra/mistralai/Mistral-Nemo-Instruct-2407 | $0.02 | $0.04 | — | 131k | 🔧 tools |
| deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | $0.03 | $0.06 | — | 8k | 🔧 tools |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.03 | $0.05 | — | 131k | 🔧 tools |
| deepinfra/Qwen/Qwen2.5-7B-Instruct | $0.04 | $0.10 | — | 33k | |
| deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo | $0.04 | $0.05 | — | 8k | |
| deepinfra/google/gemma-3-4b-it | $0.04 | $0.08 | — | 131k | 🔧 tools |
| deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2 | $0.04 | $0.16 | — | 131k | 🔧 tools |
| deepinfra/openai/gpt-oss-20b | $0.04 | $0.15 | — | 131k | 🔧 tools |
| deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct | $0.05 | $0.05 | — | 131k | |
| deepinfra/google/gemma-3-12b-it | $0.05 | $0.10 | — | 131k | 🔧 tools |
| deepinfra/mistralai/Mistral-Small-24B-Instruct-2501 | $0.05 | $0.08 | — | 33k | 🔧 tools |
| deepinfra/openai/gpt-oss-120b | $0.05 | $0.45 | — | 131k | 🔧 tools |
| deepinfra/meta-llama/Llama-Guard-3-8B | $0.06 | $0.06 | — | 131k | |
| deepinfra/Qwen/Qwen3-14B | $0.06 | $0.24 | — | 41k | 🔧 tools |
| deepinfra/microsoft/phi-4 | $0.07 | $0.14 | — | 16k | 🔧 tools |
| deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506 | $0.07 | $0.20 | — | 128k | 🔧 tools |
| deepinfra/Gryphe/MythoMax-L2-13b | $0.08 | $0.09 | — | 4k | 🔧 tools |
| deepinfra/Qwen/Qwen3-30B-A3B | $0.08 | $0.29 | — | 41k | 🔧 tools |
| deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.08 | $0.30 | — | 328k | 🔧 tools |
| deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507 | $0.09 | $0.60 | — | 262k | 🔧 tools |
| deepinfra/google/gemma-3-27b-it | $0.09 | $0.16 | — | 131k | 🔧 tools |
| deepinfra/Qwen/Qwen3-32B | $0.10 | $0.28 | — | 41k | 🔧 tools |
| deepinfra/google/gemini-2.0-flash-001 | $0.10 | $0.40 | — | 1000k | 🔧 tools |
| deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.10 | $0.28 | — | 131k | 🔧 tools |
| deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 | $0.10 | $0.40 | — | 131k | 🔧 tools |
| deepinfra/Qwen/Qwen2.5-72B-Instruct | $0.12 | $0.39 | — | 33k | 🔧 tools |
| deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.13 | $0.39 | — | 131k | 🔧 tools |
| deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct | $0.14 | $1.40 | — | 262k | 🔧 tools |
| deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking | $0.14 | $1.40 | — | 262k | 🔧 tools |
| deepinfra/Qwen/QwQ-32B | $0.15 | $0.40 | — | 131k | 🔧 tools |
| deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.15 | $0.60 | — | 1049k | 🔧 tools |
| deepinfra/Qwen/Qwen3-235B-A22B | $0.18 | $0.54 | — | 41k | 🔧 tools |
| deepinfra/meta-llama/Llama-Guard-4-12B | $0.18 | $0.18 | — | 164k | |
| deepinfra/Qwen/Qwen2.5-VL-32B-Instruct | $0.20 | $0.60 | — | 128k | 👁️ vision · 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.20 | $0.60 | — | 131k | |
| deepinfra/meta-llama/Llama-3.3-70B-Instruct | $0.23 | $0.40 | — | 131k | 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-V3-0324 | $0.25 | $0.88 | — | 164k | 🔧 tools |
| deepinfra/allenai/olmOCR-7B-0725-FP8 | $0.27 | $1.50 | — | 16k | |
| deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.27 | $0.27 | — | 131k | 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-V3.1 | $0.27 | $1.00 | $0.22 | 164k | 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus | $0.27 | $1.00 | $0.22 | 164k | 🔧 tools |
| deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | $0.29 | $1.20 | — | 262k | 🔧 tools |
| deepinfra/NousResearch/Hermes-3-Llama-3.1-70B | $0.30 | $0.30 | — | 131k | |
| deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.30 | $2.90 | — | 262k | 🔧 tools |
| deepinfra/google/gemini-2.5-flash | $0.30 | $2.50 | — | 1000k | 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-V3 | $0.38 | $0.89 | — | 164k | 🔧 tools |
| deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct | $0.40 | $1.60 | — | 262k | 🔧 tools |
| deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct | $0.40 | $0.40 | — | 131k | 🔧 tools |
| deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.40 | $0.40 | — | 33k | 🔧 tools |
| deepinfra/zai-org/GLM-4.5 | $0.40 | $1.60 | — | 131k | 🔧 tools |
| deepinfra/microsoft/WizardLM-2-8x22B | $0.48 | $0.48 | — | 66k | |
| deepinfra/deepseek-ai/DeepSeek-R1-0528 | $0.50 | $2.15 | $0.40 | 164k | 🔧 tools |
| deepinfra/moonshotai/Kimi-K2-Instruct | $0.50 | $2.00 | — | 131k | 🔧 tools |
| deepinfra/moonshotai/Kimi-K2-Instruct-0905 | $0.50 | $2.00 | $0.40 | 262k | 🔧 tools |
| deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct | $0.60 | $0.60 | — | 131k | 🔧 tools |
| deepinfra/Sao10K/L3.1-70B-Euryale-v2.2 | $0.65 | $0.75 | — | 131k | |
| deepinfra/Sao10K/L3.3-70B-Euryale-v2.3 | $0.65 | $0.75 | — | 131k | |
| deepinfra/deepseek-ai/DeepSeek-R1 | $0.70 | $2.40 | — | 164k | 🔧 tools |
| deepinfra/NousResearch/Hermes-3-Llama-3.1-405B | $1.00 | $1.00 | — | 131k | 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo | $1.00 | $3.00 | — | 33k | 🔧 tools |
| deepinfra/deepseek-ai/DeepSeek-R1-Turbo | $1.00 | $3.00 | — | 41k | 🔧 tools |
| deepinfra/google/gemini-2.5-pro | $1.25 | $10.00 | — | 1000k | 🔧 tools |
| deepinfra/anthropic/claude-3-7-sonnet-latest | $3.30 | $16.50 | $0.33 | 200k | 🔧 tools |
| deepinfra/anthropic/claude-4-sonnet | $3.30 | $16.50 | — | 200k | 🔧 tools |
| deepinfra/anthropic/claude-4-opus | $16.50 | $82.50 | — | 200k | 🔧 tools |
azure ai (66)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| azure_ai/ministral-3b | $0.04 | $0.04 | — | 128k | 🔧 tools |
| azure_ai/Phi-4-mini-instruct | $0.07 | $0.30 | — | 131k | 🔧 tools |
| azure_ai/Phi-4-multimodal-instruct | $0.08 | $0.32 | — | 131k | 👁️ vision · 🔧 tools |
| azure_ai/Phi-4-mini-reasoning | $0.08 | $0.32 | — | 131k | 🔧 tools |
| azure_ai/mistral-small-2503 | $0.10 | $0.30 | — | 128k | 👁️ vision · 🔧 tools |
| azure_ai/Phi-4 | $0.13 | $0.50 | — | 16k | 🔧 tools |
| azure_ai/Phi-4-reasoning | $0.13 | $0.50 | — | 33k | 🔧 tools |
| azure_ai/Phi-3-mini-128k-instruct | $0.13 | $0.52 | — | 128k | |
| azure_ai/Phi-3-mini-4k-instruct | $0.13 | $0.52 | — | 4k | |
| azure_ai/Phi-3.5-mini-instruct | $0.13 | $0.52 | — | 128k | |
| azure_ai/Phi-3.5-vision-instruct | $0.13 | $0.52 | — | 128k | 👁️ vision |
| azure_ai/model_router | $0.14 | $0.0000 | — | — | |
| azure_ai/gpt-oss-120b | $0.15 | $0.60 | — | 131k | 🔧 tools |
| azure_ai/Phi-3-small-128k-instruct | $0.15 | $0.60 | — | 128k | |
| azure_ai/Phi-3-small-8k-instruct | $0.15 | $0.60 | — | 8k | |
| azure_ai/mistral-nemo | $0.15 | $0.15 | — | 131k | 🔧 tools |
| azure_ai/Phi-3.5-MoE-instruct | $0.16 | $0.64 | — | 128k | |
| azure_ai/Phi-3-medium-128k-instruct | $0.17 | $0.68 | — | 128k | |
| azure_ai/Phi-3-medium-4k-instruct | $0.17 | $0.68 | — | 4k | |
| azure_ai/gpt-5.4-nano | $0.20 | $1.25 | $0.02 | 400k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure_ai/gpt-5.4-nano-2026-03-17 | $0.20 | $1.25 | $0.02 | 400k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure_ai/Llama-4-Scout-17B-16E-Instruct | $0.20 | $0.78 | — | 10000k | 👁️ vision · 🔧 tools |
| azure_ai/grok-4-fast-non-reasoning | $0.20 | $0.50 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-4-fast-reasoning | $0.20 | $0.50 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-4-1-fast-non-reasoning | $0.20 | $0.50 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-4-1-fast-reasoning | $0.20 | $0.50 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-code-fast-1 | $0.20 | $1.50 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/global/grok-3-mini | $0.25 | $1.27 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-3-mini | $0.25 | $1.27 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/Meta-Llama-3.1-8B-Instruct | $0.30 | $0.61 | — | 128k | |
| azure_ai/Llama-3.2-11B-Vision-Instruct | $0.37 | $0.37 | — | 128k | 👁️ vision · 🔧 tools |
| azure_ai/mistral-medium-2505 | $0.40 | $2.00 | — | 131k | 🔧 tools |
| azure_ai/jamba-instruct | $0.50 | $0.70 | — | 70k | |
| azure_ai/mistral-large-3 | $0.50 | $1.50 | — | 256k | 👁️ vision · 🔧 tools |
| azure_ai/deepseek-v3.2 | $0.58 | $1.68 | — | 164k | 🔧 tools · 💾 cache |
| azure_ai/deepseek-v3.2-speciale | $0.58 | $1.68 | — | 164k | 🔧 tools · 💾 cache |
| azure_ai/kimi-k2.5 | $0.60 | $3.00 | — | 262k | 👁️ vision · 🔧 tools |
| azure_ai/Llama-3.3-70B-Instruct | $0.71 | $0.71 | — | 128k | 🔧 tools |
| azure_ai/gpt-5.4-mini | $0.75 | $4.50 | $0.07 | 400k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure_ai/gpt-5.4-mini-2026-03-17 | $0.75 | $4.50 | $0.07 | 400k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure_ai/claude-haiku-4-5 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/mistral-small | $1.00 | $3.00 | — | 32k | 🔧 tools |
| azure_ai/Meta-Llama-3-70B-Instruct | $1.10 | $0.37 | — | 8k | |
| azure_ai/deepseek-v3 | $1.14 | $4.56 | — | 128k | |
| azure_ai/deepseek-v3-0324 | $1.14 | $4.56 | — | 128k | 🔧 tools |
| azure_ai/MAI-DS-R1 | $1.35 | $5.40 | — | 128k | |
| azure_ai/deepseek-r1 | $1.35 | $5.40 | — | 128k | |
| azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 | $1.41 | $0.35 | — | 1000k | 👁️ vision · 🔧 tools |
| azure_ai/mistral-large-2407 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| azure_ai/mistral-large-latest | $2.00 | $6.00 | — | 128k | 🔧 tools |
| azure_ai/Llama-3.2-90B-Vision-Instruct | $2.04 | $2.04 | — | 128k | 👁️ vision · 🔧 tools |
| azure_ai/gpt-5.4 | $2.50 | $15.00 | $0.25 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure_ai/gpt-5.4-2026-03-05 | $2.50 | $15.00 | $0.25 | 1050k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| azure_ai/Meta-Llama-3.1-70B-Instruct | $2.68 | $3.54 | — | 128k | |
| azure_ai/claude-sonnet-4-5 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/claude-sonnet-4-6 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/global/grok-3 | $3.00 | $15.00 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-3 | $3.00 | $15.00 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/grok-4 | $3.00 | $15.00 | — | 131k | 🔧 tools · 🌐 search |
| azure_ai/mistral-large | $4.00 | $12.00 | — | 32k | 🔧 tools |
| azure_ai/claude-opus-4-5 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/claude-opus-4-6 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/claude-opus-4-7 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/Meta-Llama-3.1-405B-Instruct | $5.33 | $16.00 | — | 128k | |
| azure_ai/claude-opus-4-1 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| azure_ai/jais-30b-chat | $3200.00 | $9710.00 | — | 8k |
mistral (46)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| mistral/mistral-small-latest | $0.06 | $0.18 | — | 131k | 👁️ vision · 🔧 tools |
| mistral/mistral-small-3-2-2506 | $0.06 | $0.18 | — | 131k | 👁️ vision · 🔧 tools |
| mistral/devstral-small-2505 | $0.10 | $0.30 | — | 128k | 🔧 tools |
| mistral/devstral-small-2507 | $0.10 | $0.30 | — | 128k | 🔧 tools |
| mistral/devstral-small-latest | $0.10 | $0.30 | — | 256k | 🔧 tools |
| mistral/labs-devstral-small-2512 | $0.10 | $0.30 | — | 256k | 🔧 tools |
| mistral/mistral-small | $0.10 | $0.30 | — | 32k | 🔧 tools |
| mistral/ministral-3-3b-2512 | $0.10 | $0.10 | — | 131k | 👁️ vision · 🔧 tools |
| mistral/ministral-3-8b-2512 | $0.15 | $0.15 | — | 262k | 👁️ vision · 🔧 tools |
| mistral/pixtral-12b-2409 | $0.15 | $0.15 | — | 128k | 👁️ vision · 🔧 tools |
| mistral/ministral-3-14b-2512 | $0.20 | $0.20 | — | 262k | 👁️ vision · 🔧 tools |
| mistral/codestral-mamba-latest | $0.25 | $0.25 | — | 256k | |
| mistral/mistral-tiny | $0.25 | $0.25 | — | 32k | |
| mistral/open-codestral-mamba | $0.25 | $0.25 | — | 256k | |
| mistral/open-mistral-7b | $0.25 | $0.25 | — | 32k | |
| mistral/codestral-2508 | $0.30 | $0.90 | — | 256k | 🔧 tools |
| mistral/open-mistral-nemo | $0.30 | $0.30 | — | 128k | |
| mistral/open-mistral-nemo-2407 | $0.30 | $0.30 | — | 128k | |
| mistral/devstral-medium-2507 | $0.40 | $2.00 | — | 128k | 🔧 tools |
| mistral/devstral-latest | $0.40 | $2.00 | — | 256k | 🔧 tools |
| mistral/devstral-medium-latest | $0.40 | $2.00 | — | 256k | 🔧 tools |
| mistral/devstral-2512 | $0.40 | $2.00 | — | 256k | 🔧 tools |
| mistral/mistral-medium-2505 | $0.40 | $2.00 | — | 131k | 🔧 tools |
| mistral/mistral-medium-latest | $0.40 | $2.00 | — | 131k | 👁️ vision · 🔧 tools |
| mistral/mistral-medium-3-1-2508 | $0.40 | $2.00 | — | 131k | 👁️ vision · 🔧 tools |
| mistral/magistral-small-2506 | $0.50 | $1.50 | — | 40k | 🔧 tools |
| mistral/magistral-small-latest | $0.50 | $1.50 | — | 40k | 🔧 tools |
| mistral/magistral-small-1-2-2509 | $0.50 | $1.50 | — | 40k | 🔧 tools |
| mistral/mistral-large-latest | $0.50 | $1.50 | — | 262k | 👁️ vision · 🔧 tools |
| mistral/mistral-large-3 | $0.50 | $1.50 | — | 262k | 👁️ vision · 🔧 tools |
| mistral/mistral-large-2512 | $0.50 | $1.50 | — | 262k | 👁️ vision · 🔧 tools |
| mistral/open-mixtral-8x7b | $0.70 | $0.70 | — | 32k | 🔧 tools |
| mistral/codestral-2405 | $1.00 | $3.00 | — | 32k | |
| mistral/codestral-latest | $1.00 | $3.00 | — | 32k | |
| mistral/magistral-medium-2506 | $2.00 | $5.00 | — | 40k | 🔧 tools |
| mistral/magistral-medium-2509 | $2.00 | $5.00 | — | 40k | 🔧 tools |
| mistral/magistral-medium-1-2-2509 | $2.00 | $5.00 | — | 40k | 🔧 tools |
| mistral/magistral-medium-latest | $2.00 | $5.00 | — | 40k | 🔧 tools |
| mistral/mistral-large-2411 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| mistral/open-mixtral-8x22b | $2.00 | $6.00 | — | 65k | 🔧 tools |
| mistral/pixtral-large-2411 | $2.00 | $6.00 | — | 128k | 👁️ vision · 🔧 tools |
| mistral/pixtral-large-latest | $2.00 | $6.00 | — | 128k | 👁️ vision · 🔧 tools |
| mistral/mistral-medium | $2.70 | $8.10 | — | 32k | |
| mistral/mistral-medium-2312 | $2.70 | $8.10 | — | 32k | |
| mistral/mistral-large-2407 | $3.00 | $9.00 | — | 128k | 🔧 tools |
| mistral/mistral-large-2402 | $4.00 | $12.00 | — | 32k | 🔧 tools |
gemini (41)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| gemini/gemini-exp-1114 | $0.0000 | $0.0000 | — | 1049k | 👁️ vision · 🔧 tools |
| gemini/gemini-exp-1206 | $0.0000 | $0.0000 | — | 2097k | 👁️ vision · 🔧 tools |
| gemini/gemma-3-27b-it | $0.0000 | $0.0000 | — | 131k | 👁️ vision · 🔧 tools |
| gemini/learnlm-1.5-pro-experimental | $0.0000 | $0.0000 | — | 33k | 👁️ vision · 🔧 tools |
| gemini/lyria-3-clip-preview | $0.0000 | $0.0000 | — | 131k | |
| gemini/lyria-3-pro-preview | $0.0000 | $0.0000 | — | 131k | |
| gemini/gemini-2.0-flash-lite | $0.07 | $0.30 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.0-flash-lite-001 | $0.07 | $0.30 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.0-flash | $0.10 | $0.40 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.0-flash-001 | $0.10 | $0.40 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.5-flash-lite | $0.10 | $0.40 | $0.01 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.5-flash-lite-preview-09-2025 | $0.10 | $0.40 | $0.01 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-flash-lite-latest | $0.10 | $0.40 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.5-flash-lite-preview-06-17 | $0.10 | $0.40 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-flash-lite-latest | $0.10 | $0.40 | $0.01 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-3.1-flash-lite-preview | $0.25 | $1.50 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-robotics-er-1.5-preview | $0.30 | $2.50 | $0.0000 | 1049k | 👁️ vision · 🔧 tools · 🌐 search |
| gemini/gemini-2.5-flash | $0.30 | $2.50 | $0.03 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.5-flash-preview-09-2025 | $0.30 | $2.50 | $0.07 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-flash-latest | $0.30 | $2.50 | $0.07 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-flash-native-audio-latest | $0.30 | $2.50 | — | 1049k | |
| gemini-2.5-flash-native-audio-preview-09-2025 | $0.30 | $2.50 | — | 1049k | |
| gemini-2.5-flash-native-audio-preview-12-2025 | $0.30 | $2.50 | — | 1049k | |
| gemini/gemini-2.5-flash-native-audio-latest | $0.30 | $2.50 | — | 1049k | |
| gemini/gemini-2.5-flash-native-audio-preview-09-2025 | $0.30 | $2.50 | — | 1049k | |
| gemini/gemini-2.5-flash-native-audio-preview-12-2025 | $0.30 | $2.50 | — | 1049k | |
| gemini-flash-latest | $0.30 | $2.50 | $0.03 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-exp-1206 | $0.30 | $2.50 | $0.03 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-gemma-2-27b-it | $0.35 | $1.05 | — | 8k | 👁️ vision · 🔧 tools |
| gemini/gemini-gemma-2-9b-it | $0.35 | $1.05 | — | 8k | 👁️ vision · 🔧 tools |
| gemini/gemini-3-flash-preview | $0.50 | $3.00 | $0.05 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-3.1-flash-live-preview | $0.75 | $4.50 | — | 131k | 👁️ vision · 🔧 tools · 🌐 search |
| gemini/gemini-3.1-flash-live-preview | $0.75 | $4.50 | — | 131k | 👁️ vision · 🔧 tools · 🌐 search |
| gemini/gemini-2.5-pro | $1.25 | $10.00 | $0.13 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-2.5-computer-use-preview-10-2025 | $1.25 | $10.00 | — | 128k | 👁️ vision · 🔧 tools |
| gemini/gemini-2.5-pro-preview-tts | $1.25 | $10.00 | $0.13 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-pro-latest | $1.25 | $10.00 | $0.13 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-pro-latest | $1.25 | $10.00 | $0.13 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-3-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-3.1-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini/gemini-3.1-pro-preview-customtools | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
replicate (40)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| replicate/ibm-granite/granite-3.3-8b-instruct | $0.03 | $0.25 | — | — | 🔧 tools |
| replicate/meta/llama-2-7b | $0.05 | $0.25 | — | 4k | |
| replicate/meta/llama-2-7b-chat | $0.05 | $0.25 | — | 4k | |
| replicate/meta/llama-3-8b | $0.05 | $0.25 | — | 8k | |
| replicate/meta/llama-3-8b-instruct | $0.05 | $0.25 | — | 8k | |
| replicate/mistralai/mistral-7b-instruct-v0.2 | $0.05 | $0.25 | — | 4k | |
| replicate/mistralai/mistral-7b-v0.1 | $0.05 | $0.25 | — | 4k | |
| replicate/openai/gpt-5-nano | $0.05 | $0.40 | — | — | 🔧 tools |
| replicateopenai/gpt-oss-20b | $0.09 | $0.36 | — | — | 🔧 tools |
| replicate/meta/llama-2-13b | $0.10 | $0.50 | — | 4k | |
| replicate/meta/llama-2-13b-chat | $0.10 | $0.50 | — | 4k | |
| replicate/openai/gpt-4.1-nano | $0.10 | $0.40 | — | — | 🔧 tools |
| replicate/openai/gpt-4o-mini | $0.15 | $0.60 | — | — | 👁️ vision · 🔧 tools |
| replicate/openai/gpt-oss-120b | $0.18 | $0.72 | — | — | 🔧 tools |
| replicate/openai/gpt-5-mini | $0.25 | $2.00 | — | — | 👁️ vision · 🔧 tools |
| replicate/qwen/qwen3-235b-a22b-instruct-2507 | $0.26 | $1.06 | — | — | 🔧 tools |
| replicate/mistralai/mixtral-8x7b-instruct-v0.1 | $0.30 | $1.00 | — | 4k | |
| replicate/openai/gpt-4.1-mini | $0.40 | $1.60 | — | — | 👁️ vision · 🔧 tools |
| replicate/meta/llama-2-70b | $0.65 | $2.75 | — | 4k | |
| replicate/meta/llama-2-70b-chat | $0.65 | $2.75 | — | 4k | |
| replicate/meta/llama-3-70b | $0.65 | $2.75 | — | 8k | |
| replicate/meta/llama-3-70b-instruct | $0.65 | $2.75 | — | 8k | |
| replicate/deepseek-ai/deepseek-v3.1 | $0.67 | $2.02 | — | 164k | 🔧 tools |
| replicate/anthropic/claude-4.5-haiku | $1.00 | $5.00 | — | — | 👁️ vision · 🔧 tools · 💾 cache |
| replicate/openai/o4-mini | $1.00 | $4.00 | — | — | |
| replicate/anthropic/claude-3.5-haiku | $1.00 | $5.00 | — | — | 👁️ vision · 🔧 tools · 💾 cache |
| replicate/openai/o1-mini | $1.10 | $4.40 | — | — | |
| replicate/openai/gpt-5 | $1.25 | $10.00 | — | — | 👁️ vision · 🔧 tools |
| replicate/deepseek-ai/deepseek-v3 | $1.45 | $1.45 | — | 66k | 🔧 tools |
| replicate/google/gemini-3-pro | $2.00 | $12.00 | — | — | 👁️ vision · 🔧 tools |
| replicate/openai/gpt-4.1 | $2.00 | $8.00 | — | — | 👁️ vision · 🔧 tools |
| replicate/openai/gpt-4o | $2.50 | $10.00 | — | — | 👁️ vision · 🔧 tools |
| replicate/google/gemini-2.5-flash | $2.50 | $2.50 | — | — | 👁️ vision · 🔧 tools |
| replicate/anthropic/claude-4-sonnet | $3.00 | $15.00 | — | — | 👁️ vision · 🔧 tools · 💾 cache |
| replicate/anthropic/claude-3.7-sonnet | $3.00 | $15.00 | — | — | 👁️ vision · 🔧 tools · 💾 cache |
| replicate/anthropic/claude-4.5-sonnet | $3.00 | $15.00 | — | — | 👁️ vision · 🔧 tools · 💾 cache |
| replicate/anthropic/claude-3.5-sonnet | $3.75 | $18.75 | — | — | 👁️ vision · 🔧 tools · 💾 cache |
| replicate/deepseek-ai/deepseek-r1 | $3.75 | $10.00 | — | 66k | |
| replicate/xai/grok-4 | $7.20 | $36.00 | — | — | 🔧 tools |
| replicate/openai/o1 | $15.00 | $60.00 | — | — |
xai (38)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| xai/grok-4-fast-reasoning | $0.20 | $0.50 | $0.05 | 2000k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-fast-non-reasoning | $0.20 | $0.50 | $0.05 | 2000k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-1-fast | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-1-fast-reasoning | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-1-fast-reasoning-latest | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-1-fast-non-reasoning | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-1-fast-non-reasoning-latest | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-code-fast | $0.20 | $1.50 | $0.02 | 256k | 🔧 tools · 💾 cache |
| xai/grok-code-fast-1 | $0.20 | $1.50 | $0.02 | 256k | 🔧 tools · 💾 cache |
| xai/grok-code-fast-1-0825 | $0.20 | $1.50 | $0.02 | 256k | 🔧 tools · 💾 cache |
| xai/grok-3-mini | $0.30 | $0.50 | $0.07 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-mini-beta | $0.30 | $0.50 | $0.07 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-mini-latest | $0.30 | $0.50 | $0.07 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-mini-fast | $0.60 | $4.00 | $0.15 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-mini-fast-beta | $0.60 | $4.00 | $0.15 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-mini-fast-latest | $0.60 | $4.00 | $0.15 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4.3 | $1.25 | $2.50 | $0.20 | 1000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4.3-latest | $1.25 | $2.50 | $0.20 | 1000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-2 | $2.00 | $10.00 | — | 131k | 🔧 tools · 🌐 search |
| xai/grok-2-1212 | $2.00 | $10.00 | — | 131k | 🔧 tools · 🌐 search |
| xai/grok-2-latest | $2.00 | $10.00 | — | 131k | 🔧 tools · 🌐 search |
| xai/grok-2-vision | $2.00 | $10.00 | — | 33k | 👁️ vision · 🔧 tools · 🌐 search |
| xai/grok-2-vision-1212 | $2.00 | $10.00 | — | 33k | 👁️ vision · 🔧 tools · 🌐 search |
| xai/grok-2-vision-latest | $2.00 | $10.00 | — | 33k | 👁️ vision · 🔧 tools · 🌐 search |
| xai/grok-4.20-multi-agent-beta-0309 | $2.00 | $6.00 | $0.20 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4.20-beta-0309-reasoning | $2.00 | $6.00 | $0.20 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4.20-0309-reasoning | $2.00 | $6.00 | $0.20 | 2000k | 👁️ vision · 🔧 tools · 🌐 search |
| xai/grok-4.20-beta-0309-non-reasoning | $2.00 | $6.00 | $0.20 | 2000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3 | $3.00 | $15.00 | $0.75 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-beta | $3.00 | $15.00 | $0.75 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-latest | $3.00 | $15.00 | $0.75 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4 | $3.00 | $15.00 | — | 256k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-0709 | $3.00 | $15.00 | — | 256k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-4-latest | $3.00 | $15.00 | — | 256k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-fast-beta | $5.00 | $25.00 | $1.25 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-3-fast-latest | $5.00 | $25.00 | $1.25 | 131k | 🔧 tools · 💾 cache · 🌐 search |
| xai/grok-beta | $5.00 | $15.00 | — | 131k | 👁️ vision · 🔧 tools · 🌐 search |
| xai/grok-vision-beta | $5.00 | $15.00 | — | 8k | 👁️ vision · 🔧 tools · 🌐 search |
together ai (33)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | $0.0000 | $0.0000 | — | — | 🔧 tools |
| together_ai/openai/gpt-oss-20b | $0.05 | $0.20 | — | 128k | 🔧 tools |
| together-ai-up-to-4b | $0.10 | $0.10 | — | — | |
| together_ai/openai/gpt-oss-120b | $0.15 | $0.60 | — | 131k | 🔧 tools |
| together_ai/Qwen/Qwen3-Next-80B-A3B-Instruct | $0.15 | $1.50 | — | 262k | 🔧 tools |
| together_ai/Qwen/Qwen3-Next-80B-A3B-Thinking | $0.15 | $1.50 | — | 262k | 🔧 tools |
| together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.18 | $0.59 | — | — | 🔧 tools |
| together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.18 | $0.18 | — | — | 🔧 tools |
| together-ai-4.1b-8b | $0.20 | $0.20 | — | — | |
| together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput | $0.20 | $6.00 | — | 262k | 🔧 tools |
| together_ai/Qwen/Qwen3-235B-A22B-fp8-tput | $0.20 | $0.60 | — | 40k | |
| together_ai/zai-org/GLM-4.5-Air-FP8 | $0.20 | $1.10 | — | 128k | 🔧 tools |
| together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.27 | $0.85 | — | — | 🔧 tools |
| together-ai-8.1b-21b | $0.30 | $0.30 | — | 1k | |
| together_ai/zai-org/GLM-4.7 | $0.45 | $2.00 | — | 200k | 🔧 tools |
| together_ai/moonshotai/Kimi-K2.5 | $0.50 | $2.80 | — | 256k | 👁️ vision · 🔧 tools |
| together_ai/deepseek-ai/DeepSeek-R1-0528-tput | $0.55 | $2.19 | — | 128k | 🔧 tools |
| together_ai/deepseek-ai/DeepSeek-V3.1 | $0.60 | $1.70 | — | 128k | 🔧 tools |
| together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.60 | $0.60 | — | — | 🔧 tools |
| together_ai/zai-org/GLM-4.6 | $0.60 | $2.20 | — | 200k | 🔧 tools |
| together_ai/Qwen/Qwen3.5-397B-A17B | $0.60 | $3.60 | — | 262k | 🔧 tools |
| together_ai/Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.65 | $3.00 | — | 256k | 🔧 tools |
| together-ai-21.1b-41b | $0.80 | $0.80 | — | — | |
| together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.88 | $0.88 | — | — | 🔧 tools |
| together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.88 | $0.88 | — | — | 🔧 tools |
| together-ai-41.1b-80b | $0.90 | $0.90 | — | — | |
| together_ai/moonshotai/Kimi-K2-Instruct | $1.00 | $3.00 | — | — | 🔧 tools |
| together_ai/moonshotai/Kimi-K2-Instruct-0905 | $1.00 | $3.00 | — | 262k | 🔧 tools |
| together_ai/deepseek-ai/DeepSeek-V3 | $1.25 | $1.25 | — | 66k | 🔧 tools |
| together-ai-81.1b-110b | $1.80 | $1.80 | — | — | |
| together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | $2.00 | $2.00 | — | 256k | 🔧 tools |
| together_ai/deepseek-ai/DeepSeek-R1 | $3.00 | $7.00 | — | 128k | 🔧 tools |
| together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | $3.50 | $3.50 | — | — | 🔧 tools |
oci (29)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| oci/google.gemini-2.5-flash-lite | $0.07 | $0.30 | — | 1049k | 👁️ vision · 🔧 tools |
| oci/cohere.command-a-translate-08-2025 | $0.09 | $0.09 | — | 256k | |
| oci/cohere.command-r-08-2024 | $0.15 | $0.15 | — | 128k | 🔧 tools |
| oci/google.gemini-2.5-flash | $0.15 | $0.60 | — | 1049k | 👁️ vision · 🔧 tools |
| oci/xai.grok-3-mini | $0.30 | $0.50 | — | 131k | 🔧 tools |
| oci/xai.grok-3-mini-fast | $0.60 | $4.00 | — | 131k | 🔧 tools |
| oci/meta.llama-3.3-70b-instruct | $0.72 | $0.72 | — | 128k | 🔧 tools |
| oci/meta.llama-4-maverick-17b-128e-instruct-fp8 | $0.72 | $0.72 | — | 512k | 🔧 tools |
| oci/meta.llama-4-scout-17b-16e-instruct | $0.72 | $0.72 | — | 192k | 🔧 tools |
| oci/meta.llama-3.1-70b-instruct | $0.72 | $0.72 | — | 128k | 🔧 tools |
| oci/meta.llama-3.3-70b-instruct-fp8-dynamic | $0.72 | $0.72 | — | 128k | 🔧 tools |
| oci/google.gemini-2.5-pro | $1.25 | $10.00 | — | 1049k | 👁️ vision · 🔧 tools |
| oci/cohere.command-latest | $1.56 | $1.56 | — | 128k | 🔧 tools |
| oci/cohere.command-a-03-2025 | $1.56 | $1.56 | — | 256k | 🔧 tools |
| oci/cohere.command-plus-latest | $1.56 | $1.56 | — | 128k | 🔧 tools |
| oci/cohere.command-a-reasoning-08-2025 | $1.56 | $1.56 | — | 256k | 🔧 tools |
| oci/cohere.command-a-vision-07-2025 | $1.56 | $1.56 | — | 128k | 👁️ vision · 🔧 tools |
| oci/cohere.command-r-plus-08-2024 | $1.56 | $1.56 | — | 128k | 🔧 tools |
| oci/meta.llama-3.2-90b-vision-instruct | $2.00 | $2.00 | — | 128k | 👁️ vision · 🔧 tools |
| oci/meta.llama-3.2-11b-vision-instruct | $2.00 | $2.00 | — | 128k | 👁️ vision · 🔧 tools |
| oci/xai.grok-3 | $3.00 | $15.00 | — | 131k | 🔧 tools |
| oci/xai.grok-4 | $3.00 | $15.00 | — | 128k | 🔧 tools |
| oci/xai.grok-4.20 | $3.00 | $15.00 | — | 131k | 🔧 tools |
| oci/xai.grok-4.20-multi-agent | $3.00 | $15.00 | — | 131k | 🔧 tools |
| oci/xai.grok-3-fast | $5.00 | $25.00 | — | 131k | 🔧 tools |
| oci/xai.grok-4-fast | $5.00 | $25.00 | — | 131k | 🔧 tools |
| oci/xai.grok-4.1-fast | $5.00 | $25.00 | — | 131k | 🔧 tools |
| oci/xai.grok-code-fast-1 | $5.00 | $25.00 | — | 131k | 🔧 tools |
| oci/meta.llama-3.1-405b-instruct | $10.68 | $10.68 | — | 128k | 🔧 tools |
ollama (29)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| ollama/codegeex4 | $0.0000 | $0.0000 | — | 33k | |
| ollama/codegemma | $0.0000 | $0.0000 | — | 8k | |
| ollama/codellama | $0.0000 | $0.0000 | — | 4k | |
| ollama/deepseek-coder-v2-base | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| ollama/deepseek-coder-v2-instruct | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| ollama/deepseek-coder-v2-lite-base | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| ollama/deepseek-coder-v2-lite-instruct | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| ollama/deepseek-v3.1:671b-cloud | $0.0000 | $0.0000 | — | 164k | 🔧 tools |
| ollama/gpt-oss:120b-cloud | $0.0000 | $0.0000 | — | 131k | 🔧 tools |
| ollama/gpt-oss:20b-cloud | $0.0000 | $0.0000 | — | 131k | 🔧 tools |
| ollama/internlm2_5-20b-chat | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| ollama/llama2 | $0.0000 | $0.0000 | — | 4k | |
| ollama/llama2-uncensored | $0.0000 | $0.0000 | — | 4k | |
| ollama/llama2:13b | $0.0000 | $0.0000 | — | 4k | |
| ollama/llama2:70b | $0.0000 | $0.0000 | — | 4k | |
| ollama/llama2:7b | $0.0000 | $0.0000 | — | 4k | |
| ollama/llama3 | $0.0000 | $0.0000 | — | 8k | |
| ollama/llama3.1 | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| ollama/llama3:70b | $0.0000 | $0.0000 | — | 8k | |
| ollama/llama3:8b | $0.0000 | $0.0000 | — | 8k | |
| ollama/mistral | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| ollama/mistral-7B-Instruct-v0.1 | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| ollama/mistral-7B-Instruct-v0.2 | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| ollama/mistral-large-instruct-2407 | $0.0000 | $0.0000 | — | 66k | 🔧 tools |
| ollama/mixtral-8x22B-Instruct-v0.1 | $0.0000 | $0.0000 | — | 66k | 🔧 tools |
| ollama/mixtral-8x7B-Instruct-v0.1 | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| ollama/orca-mini | $0.0000 | $0.0000 | — | 4k | |
| ollama/qwen3-coder:480b-cloud | $0.0000 | $0.0000 | — | 262k | 🔧 tools |
| ollama/vicuna | $0.0000 | $0.0000 | — | 2k |
vertex ai-anthropic models (29)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/claude-3-haiku | $0.25 | $1.25 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-3-haiku@20240307 | $0.25 | $1.25 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-3-5-haiku | $1.00 | $5.00 | — | 200k | 🔧 tools |
| vertex_ai/claude-3-5-haiku@20241022 | $1.00 | $5.00 | — | 200k | 🔧 tools |
| vertex_ai/claude-haiku-4-5 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-haiku-4-5@20251001 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-3-5-sonnet | $3.00 | $15.00 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-3-5-sonnet@20240620 | $3.00 | $15.00 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-3-7-sonnet@20250219 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-3-sonnet | $3.00 | $15.00 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-3-sonnet@20240229 | $3.00 | $15.00 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-sonnet-4-5 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-sonnet-4-6 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-sonnet-4-5@20250929 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-sonnet-4 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-sonnet-4@20250514 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-sonnet-4-6@default | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-5 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-5@20251101 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-6 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-6@default | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-7 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-7@default | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-3-opus | $15.00 | $75.00 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-3-opus@20240229 | $15.00 | $75.00 | — | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-opus-4 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| vertex_ai/claude-opus-4-1 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-opus-4-1@20250805 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools |
| vertex_ai/claude-opus-4@20250514 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
watsonx (28)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| watsonx/ibm/granite-4-h-small | $0.06 | $0.25 | — | 20k | 🔧 tools |
| watsonx/ibm/granite-guardian-3-2-2b | $0.10 | $0.10 | — | 8k | |
| watsonx/ibm/granite-vision-3-2-2b | $0.10 | $0.10 | — | 8k | 👁️ vision |
| watsonx/meta-llama/llama-3-2-1b-instruct | $0.10 | $0.10 | — | 128k | 🔧 tools |
| watsonx/mistralai/mistral-small-2503 | $0.10 | $0.30 | — | 32k | 🔧 tools |
| watsonx/mistralai/mistral-small-3-1-24b-instruct-2503 | $0.10 | $0.30 | — | 32k | 🔧 tools |
| watsonx/meta-llama/llama-3-2-3b-instruct | $0.15 | $0.15 | — | 128k | 🔧 tools |
| watsonx/openai/gpt-oss-120b | $0.15 | $0.60 | — | 8k | |
| watsonx/ibm/granite-3-8b-instruct | $0.20 | $0.20 | — | 8k | 🔧 tools · 💾 cache |
| watsonx/ibm/granite-3-3-8b-instruct | $0.20 | $0.20 | — | 8k | 🔧 tools |
| watsonx/ibm/granite-guardian-3-3-8b | $0.20 | $0.20 | — | 8k | |
| watsonx/meta-llama/llama-3-2-11b-vision-instruct | $0.35 | $0.35 | — | 128k | 👁️ vision · 🔧 tools |
| watsonx/meta-llama/llama-4-maverick-17b | $0.35 | $1.40 | — | 128k | 🔧 tools |
| watsonx/meta-llama/llama-guard-3-11b-vision | $0.35 | $0.35 | — | 128k | 👁️ vision |
| watsonx/mistralai/pixtral-12b-2409 | $0.35 | $0.35 | — | 128k | 👁️ vision |
| watsonx/ibm/granite-ttm-1024-96-r2 | $0.38 | $0.38 | — | 1k | |
| watsonx/ibm/granite-ttm-1536-96-r2 | $0.38 | $0.38 | — | 1k | |
| watsonx/ibm/granite-ttm-512-96-r2 | $0.38 | $0.38 | — | 1k | |
| watsonx/google/flan-t5-xl-3b | $0.60 | $0.60 | — | 8k | |
| watsonx/ibm/granite-13b-chat-v2 | $0.60 | $0.60 | — | 8k | |
| watsonx/ibm/granite-13b-instruct-v2 | $0.60 | $0.60 | — | 8k | |
| watsonx/meta-llama/llama-3-3-70b-instruct | $0.71 | $0.71 | — | 128k | 🔧 tools |
| watsonx/sdaia/allam-1-13b-instruct | $1.80 | $1.80 | — | 8k | |
| watsonx/meta-llama/llama-3-2-90b-vision-instruct | $2.00 | $2.00 | — | 128k | 👁️ vision · 🔧 tools |
| watsonx/mistralai/mistral-large | $3.00 | $10.00 | — | 131k | 🔧 tools · 💾 cache |
| watsonx/mistralai/mistral-medium-2505 | $3.00 | $10.00 | — | 128k | 🔧 tools |
| watsonx/bigscience/mt0-xxl-13b | $500.00 | $2000.00 | — | 8k | |
| watsonx/core42/jais-13b-chat | $500.00 | $2000.00 | — | 8k |
nebius (27)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| nebius/Qwen/Qwen2.5-Coder-7B | $0.01 | $0.03 | — | 33k | 🔧 tools |
| nebius/meta-llama/Llama-Guard-3-8B | $0.02 | $0.06 | — | 128k | |
| nebius/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.02 | $0.06 | — | 128k | 🔧 tools |
| nebius/Qwen/Qwen2-VL-7B-Instruct | $0.02 | $0.06 | — | 131k | 👁️ vision |
| nebius/mistralai/Mistral-Nemo-Instruct-2407 | $0.04 | $0.12 | — | 128k | 🔧 tools |
| nebius/google/gemma-3-27b-it | $0.06 | $0.20 | — | 128k | 👁️ vision · 🔧 tools |
| nebius/Qwen/Qwen2.5-32B-Instruct | $0.06 | $0.20 | — | 128k | 🔧 tools |
| nebius/Qwen/Qwen3-14B | $0.08 | $0.24 | — | 33k | 🔧 tools |
| nebius/Qwen/Qwen3-4B | $0.08 | $0.24 | — | 33k | 🔧 tools |
| nebius/nvidia/Llama-3.3-Nemotron-Super-49B-v1 | $0.10 | $0.40 | — | 131k | 🔧 tools |
| nebius/Qwen/Qwen3-32B | $0.10 | $0.30 | — | 33k | 🔧 tools |
| nebius/Qwen/Qwen3-30B-A3B | $0.10 | $0.30 | — | 33k | 🔧 tools |
| nebius/meta-llama/Llama-3.3-70B-Instruct | $0.13 | $0.40 | — | 128k | 🔧 tools |
| nebius/meta-llama/Meta-Llama-3.1-70B-Instruct | $0.13 | $0.40 | — | 128k | 🔧 tools |
| nebius/Qwen/Qwen2.5-72B-Instruct | $0.13 | $0.40 | — | 128k | 🔧 tools |
| nebius/Qwen/Qwen2.5-VL-72B-Instruct | $0.13 | $0.40 | — | 131k | 👁️ vision · 🔧 tools |
| nebius/Qwen/Qwen2-VL-72B-Instruct | $0.13 | $0.40 | — | 131k | 👁️ vision · 🔧 tools |
| nebius/Qwen/QwQ-32B | $0.15 | $0.45 | — | 33k | 🔧 tools |
| nebius/Qwen/Qwen3-235B-A22B | $0.20 | $0.60 | — | 262k | 🔧 tools |
| nebius/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.25 | $0.75 | — | 128k | 🔧 tools |
| nebius/deepseek-ai/DeepSeek-V3 | $0.50 | $1.50 | — | 128k | 🔧 tools |
| nebius/deepseek-ai/DeepSeek-V3-0324 | $0.50 | $1.50 | — | 128k | 🔧 tools |
| nebius/nvidia/Llama-3.1-Nemotron-Ultra-253B-v1 | $0.60 | $1.80 | — | 128k | 🔧 tools |
| nebius/deepseek-ai/DeepSeek-R1 | $0.80 | $2.40 | — | 128k | 🔧 tools |
| nebius/deepseek-ai/DeepSeek-R1-0528 | $0.80 | $2.40 | — | 164k | 🔧 tools |
| nebius/meta-llama/Meta-Llama-3.1-405B-Instruct | $1.00 | $3.00 | — | 128k | 🔧 tools |
| nebius/NousResearch/Hermes-3-Llama-3.1-405B | $1.00 | $3.00 | — | 128k | 🔧 tools |
databricks (26)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| databricks/databricks-gpt-5-nano | $0.05 | $0.40 | — | 272k | |
| databricks/databricks-gpt-oss-20b | $0.07 | $0.30 | — | 131k | |
| databricks/databricks-gemma-3-12b | $0.15 | $0.50 | — | 128k | |
| databricks/databricks-gpt-oss-120b | $0.15 | $0.60 | — | 131k | |
| databricks/databricks-meta-llama-3-1-8b-instruct | $0.15 | $0.45 | — | 200k | |
| databricks/databricks-gpt-5-mini | $0.25 | $2.00 | — | 272k | |
| databricks/databricks-gemini-2-5-flash | $0.30 | $2.50 | — | 1049k | 🔧 tools |
| databricks/databricks-llama-2-70b-chat | $0.50 | $1.50 | — | 4k | |
| databricks/databricks-llama-4-maverick | $0.50 | $1.50 | — | 128k | |
| databricks/databricks-meta-llama-3-3-70b-instruct | $0.50 | $1.50 | — | 128k | |
| databricks/databricks-mixtral-8x7b-instruct | $0.50 | $1.00 | — | 4k | |
| databricks/databricks-mpt-7b-instruct | $0.50 | $0.0000 | — | 8k | |
| databricks/databricks-claude-haiku-4-5 | $1.00 | $5.00 | — | 200k | 🔧 tools |
| databricks/databricks-meta-llama-3-70b-instruct | $1.00 | $3.00 | — | 128k | |
| databricks/databricks-mpt-30b-instruct | $1.00 | $1.00 | — | 8k | |
| databricks/databricks-gemini-2-5-pro | $1.25 | $10.00 | — | 1049k | 🔧 tools |
| databricks/databricks-gpt-5 | $1.25 | $10.00 | — | 272k | |
| databricks/databricks-gpt-5-1 | $1.25 | $10.00 | — | 272k | |
| databricks/databricks-claude-3-7-sonnet | $3.00 | $15.00 | — | 200k | 🔧 tools |
| databricks/databricks-claude-sonnet-4 | $3.00 | $15.00 | — | 200k | 🔧 tools |
| databricks/databricks-claude-sonnet-4-1 | $3.00 | $15.00 | — | 200k | 🔧 tools |
| databricks/databricks-claude-sonnet-4-5 | $3.00 | $15.00 | — | 200k | 🔧 tools |
| databricks/databricks-claude-opus-4-5 | $5.00 | $25.00 | — | 200k | 🔧 tools |
| databricks/databricks-meta-llama-3-1-405b-instruct | $5.00 | $15.00 | — | 128k | |
| databricks/databricks-claude-opus-4 | $15.00 | $75.00 | — | 200k | 🔧 tools |
| databricks/databricks-claude-opus-4-1 | $15.00 | $75.00 | — | 200k | 🔧 tools |
moonshot (22)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| moonshot/kimi-latest-8k | $0.20 | $2.00 | $0.15 | 8k | 👁️ vision · 🔧 tools |
| moonshot/moonshot-v1-8k | $0.20 | $2.00 | — | 8k | 🔧 tools |
| moonshot/moonshot-v1-8k-0430 | $0.20 | $2.00 | — | 8k | 🔧 tools |
| moonshot/moonshot-v1-8k-vision-preview | $0.20 | $2.00 | — | 8k | 👁️ vision · 🔧 tools |
| moonshot/kimi-k2-0711-preview | $0.60 | $2.50 | $0.15 | 131k | 🔧 tools · 🌐 search |
| moonshot/kimi-k2-0905-preview | $0.60 | $2.50 | $0.15 | 262k | 🔧 tools · 🌐 search |
| moonshot/kimi-k2.5 | $0.60 | $3.00 | $0.10 | 262k | 👁️ vision · 🔧 tools |
| moonshot/kimi-thinking-preview | $0.60 | $2.50 | $0.15 | 131k | 👁️ vision |
| moonshot/kimi-k2-thinking | $0.60 | $2.50 | $0.15 | 262k | 🔧 tools · 🌐 search |
| moonshot/kimi-k2.6 | $0.95 | $4.00 | $0.16 | 262k | 👁️ vision · 🔧 tools |
| moonshot/kimi-latest-32k | $1.00 | $3.00 | $0.15 | 33k | 👁️ vision · 🔧 tools |
| moonshot/moonshot-v1-32k | $1.00 | $3.00 | — | 33k | 🔧 tools |
| moonshot/moonshot-v1-32k-0430 | $1.00 | $3.00 | — | 33k | 🔧 tools |
| moonshot/moonshot-v1-32k-vision-preview | $1.00 | $3.00 | — | 33k | 👁️ vision · 🔧 tools |
| moonshot/kimi-k2-turbo-preview | $1.15 | $8.00 | $0.15 | 262k | 🔧 tools · 🌐 search |
| moonshot/kimi-k2-thinking-turbo | $1.15 | $8.00 | $0.15 | 262k | 🔧 tools · 🌐 search |
| moonshot/kimi-latest | $2.00 | $5.00 | $0.15 | 131k | 👁️ vision · 🔧 tools |
| moonshot/kimi-latest-128k | $2.00 | $5.00 | $0.15 | 131k | 👁️ vision · 🔧 tools |
| moonshot/moonshot-v1-128k | $2.00 | $5.00 | — | 131k | 🔧 tools |
| moonshot/moonshot-v1-128k-0430 | $2.00 | $5.00 | — | 131k | 🔧 tools |
| moonshot/moonshot-v1-128k-vision-preview | $2.00 | $5.00 | — | 131k | 👁️ vision · 🔧 tools |
| moonshot/moonshot-v1-auto | $2.00 | $5.00 | — | 131k | 🔧 tools |
anthropic (20)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| claude-3-haiku-20240307 | $0.25 | $1.25 | $0.03 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-haiku-4-5-20251001 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-haiku-4-5 | $1.00 | $5.00 | $0.10 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-3-7-sonnet-20250219 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| claude-4-sonnet-20250514 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| claude-sonnet-4-5 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-sonnet-4-5-20250929 | $3.00 | $15.00 | $0.30 | 200k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| claude-sonnet-4-6 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-sonnet-4-20250514 | $3.00 | $15.00 | $0.30 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-5-20251101 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-5 | $5.00 | $25.00 | $0.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-6 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-6-20260205 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-7 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-7-20260416 | $5.00 | $25.00 | $0.50 | 1000k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-3-opus-20240229 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-4-opus-20250514 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-1 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-1-20250805 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
| claude-opus-4-20250514 | $15.00 | $75.00 | $1.50 | 200k | 👁️ vision · 🔧 tools · 💾 cache |
lambda ai (20)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| lambda_ai/llama3.2-11b-vision-instruct | $0.01 | $0.02 | — | 131k | 👁️ vision · 🔧 tools |
| lambda_ai/llama3.2-3b-instruct | $0.01 | $0.02 | — | 131k | 🔧 tools |
| lambda_ai/hermes3-8b | $0.02 | $0.04 | — | 131k | 🔧 tools |
| lambda_ai/lfm-7b | $0.02 | $0.04 | — | 131k | 🔧 tools |
| lambda_ai/llama3.1-8b-instruct | $0.02 | $0.04 | — | 131k | 🔧 tools |
| lambda_ai/llama-4-maverick-17b-128e-instruct-fp8 | $0.05 | $0.10 | — | 131k | 🔧 tools |
| lambda_ai/llama-4-scout-17b-16e-instruct | $0.05 | $0.10 | — | 16k | 🔧 tools |
| lambda_ai/qwen25-coder-32b-instruct | $0.05 | $0.10 | — | 131k | 🔧 tools |
| lambda_ai/qwen3-32b-fp8 | $0.05 | $0.10 | — | 131k | 🔧 tools |
| lambda_ai/lfm-40b | $0.10 | $0.20 | — | 131k | 🔧 tools |
| lambda_ai/hermes3-70b | $0.12 | $0.30 | — | 131k | 🔧 tools |
| lambda_ai/llama3.1-70b-instruct-fp8 | $0.12 | $0.30 | — | 131k | 🔧 tools |
| lambda_ai/llama3.1-nemotron-70b-instruct-fp8 | $0.12 | $0.30 | — | 131k | 🔧 tools |
| lambda_ai/llama3.3-70b-instruct-fp8 | $0.12 | $0.30 | — | 131k | 🔧 tools |
| lambda_ai/deepseek-llama3.3-70b | $0.20 | $0.60 | — | 131k | 🔧 tools |
| lambda_ai/deepseek-r1-0528 | $0.20 | $0.60 | — | 131k | 🔧 tools |
| lambda_ai/deepseek-v3-0324 | $0.20 | $0.60 | — | 131k | 🔧 tools |
| lambda_ai/deepseek-r1-671b | $0.80 | $0.80 | — | 131k | 🔧 tools |
| lambda_ai/hermes3-405b | $0.80 | $0.80 | — | 131k | 🔧 tools |
| lambda_ai/llama3.1-405b-instruct-fp8 | $0.80 | $0.80 | — | 131k | 🔧 tools |
perplexity (20)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| perplexity/pplx-70b-online | $0.0000 | $2.80 | — | 4k | |
| perplexity/pplx-7b-online | $0.0000 | $0.28 | — | 4k | |
| perplexity/sonar-medium-online | $0.0000 | $1.80 | — | 12k | |
| perplexity/sonar-small-online | $0.0000 | $0.28 | — | 12k | |
| perplexity/mistral-7b-instruct | $0.07 | $0.28 | — | 4k | |
| perplexity/mixtral-8x7b-instruct | $0.07 | $0.28 | — | 4k | |
| perplexity/pplx-7b-chat | $0.07 | $0.28 | — | 8k | |
| perplexity/sonar-small-chat | $0.07 | $0.28 | — | 16k | |
| perplexity/llama-3.1-8b-instruct | $0.20 | $0.20 | — | 131k | |
| perplexity/codellama-34b-instruct | $0.35 | $1.40 | — | 16k | |
| perplexity/sonar-medium-chat | $0.60 | $1.80 | — | 16k | |
| perplexity/codellama-70b-instruct | $0.70 | $2.80 | — | 16k | |
| perplexity/llama-2-70b-chat | $0.70 | $2.80 | — | 4k | |
| perplexity/pplx-70b-chat | $0.70 | $2.80 | — | 4k | |
| perplexity/llama-3.1-70b-instruct | $1.00 | $1.00 | — | 131k | |
| perplexity/sonar | $1.00 | $1.00 | — | 128k | 🌐 search |
| perplexity/sonar-reasoning | $1.00 | $5.00 | — | 128k | 🌐 search |
| perplexity/sonar-deep-research | $2.00 | $8.00 | — | 128k | 🌐 search |
| perplexity/sonar-reasoning-pro | $2.00 | $8.00 | — | 128k | 🌐 search |
| perplexity/sonar-pro | $3.00 | $15.00 | — | 200k | 🌐 search |
vertex ai-language-models (19)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| gemini-2.0-flash-lite | $0.07 | $0.30 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.0-flash-lite-001 | $0.07 | $0.30 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.0-flash | $0.10 | $0.40 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-flash-lite | $0.10 | $0.40 | $0.01 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-flash-lite-preview-09-2025 | $0.10 | $0.40 | $0.01 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-flash-lite-preview-06-17 | $0.10 | $0.40 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.0-flash-001 | $0.15 | $0.60 | $0.04 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-3.1-flash-lite-preview | $0.25 | $1.50 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| vertex_ai/gemini-3.1-flash-lite-preview | $0.25 | $1.50 | $0.02 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-flash | $0.30 | $2.50 | $0.03 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-flash-preview-09-2025 | $0.30 | $2.50 | $0.07 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-robotics-er-1.5-preview | $0.30 | $2.50 | $0.0000 | 1049k | 👁️ vision · 🔧 tools |
| gemini-3-flash-preview | $0.50 | $3.00 | $0.05 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-pro | $1.25 | $10.00 | $0.13 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-pro-preview-tts | $1.25 | $10.00 | $0.13 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-2.5-computer-use-preview-10-2025 | $1.25 | $10.00 | — | 128k | 👁️ vision · 🔧 tools |
| gemini-3-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-3.1-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| gemini-3.1-pro-preview-customtools | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
vertex ai-mistral models (19)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/mistral-nemo@latest | $0.15 | $0.15 | — | 128k | 🔧 tools |
| vertex_ai/codestral-2501 | $0.20 | $0.60 | — | 128k | 🔧 tools |
| vertex_ai/codestral@2405 | $0.20 | $0.60 | — | 128k | 🔧 tools |
| vertex_ai/codestral@latest | $0.20 | $0.60 | — | 128k | 🔧 tools |
| vertex_ai/mistralai/codestral-2@001 | $0.30 | $0.90 | — | 128k | 🔧 tools |
| vertex_ai/codestral-2 | $0.30 | $0.90 | — | 128k | 🔧 tools |
| vertex_ai/codestral-2@001 | $0.30 | $0.90 | — | 128k | 🔧 tools |
| vertex_ai/mistralai/codestral-2 | $0.30 | $0.90 | — | 128k | 🔧 tools |
| vertex_ai/mistral-medium-3 | $0.40 | $2.00 | — | 128k | 🔧 tools |
| vertex_ai/mistral-medium-3@001 | $0.40 | $2.00 | — | 128k | 🔧 tools |
| vertex_ai/mistralai/mistral-medium-3 | $0.40 | $2.00 | — | 128k | 🔧 tools |
| vertex_ai/mistralai/mistral-medium-3@001 | $0.40 | $2.00 | — | 128k | 🔧 tools |
| vertex_ai/mistral-small-2503 | $1.00 | $3.00 | — | 128k | 👁️ vision · 🔧 tools |
| vertex_ai/mistral-small-2503@001 | $1.00 | $3.00 | — | 32k | 🔧 tools |
| vertex_ai/mistral-large-2411 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| vertex_ai/mistral-large@2407 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| vertex_ai/mistral-large@2411-001 | $2.00 | $6.00 | — | 128k | 🔧 tools |
| vertex_ai/mistral-large@latest | $2.00 | $6.00 | — | 128k | 🔧 tools |
| vertex_ai/mistral-nemo@2407 | $3.00 | $3.00 | — | 128k | 🔧 tools |
dashscope (17)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| dashscope/qwen-turbo | $0.05 | $0.20 | — | 129k | 🔧 tools |
| dashscope/qwen-turbo-2024-11-01 | $0.05 | $0.20 | — | 1000k | 🔧 tools |
| dashscope/qwen-turbo-2025-04-28 | $0.05 | $0.20 | — | 1000k | 🔧 tools |
| dashscope/qwen-turbo-latest | $0.05 | $0.20 | — | 1000k | 🔧 tools |
| dashscope/qwen3-next-80b-a3b-instruct | $0.15 | $1.20 | — | 262k | 🔧 tools |
| dashscope/qwen3-next-80b-a3b-thinking | $0.15 | $1.20 | — | 262k | 🔧 tools |
| dashscope/qwen3-vl-32b-instruct | $0.16 | $0.64 | — | 131k | 👁️ vision · 🔧 tools |
| dashscope/qwen3-vl-32b-thinking | $0.16 | $2.87 | — | 131k | 👁️ vision · 🔧 tools |
| dashscope/qwen-coder | $0.30 | $1.50 | — | 1000k | 🔧 tools |
| dashscope/qwen-plus | $0.40 | $1.20 | — | 129k | 🔧 tools |
| dashscope/qwen-plus-2025-01-25 | $0.40 | $1.20 | — | 129k | 🔧 tools |
| dashscope/qwen-plus-2025-04-28 | $0.40 | $1.20 | — | 129k | 🔧 tools |
| dashscope/qwen-plus-2025-07-14 | $0.40 | $1.20 | — | 129k | 🔧 tools |
| dashscope/qwen3-vl-235b-a22b-instruct | $0.40 | $1.60 | — | 131k | 👁️ vision · 🔧 tools |
| dashscope/qwen3-vl-235b-a22b-thinking | $0.40 | $4.00 | — | 131k | 👁️ vision · 🔧 tools |
| dashscope/qwq-plus | $0.80 | $2.40 | — | 98k | 🔧 tools |
| dashscope/qwen-max | $1.60 | $6.40 | — | 31k | 🔧 tools |
gmi (17)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| gmi/openai/gpt-4o-mini | $0.15 | $0.60 | — | 131k | 👁️ vision · 🔧 tools |
| gmi/deepseek-ai/DeepSeek-V3.2 | $0.28 | $0.40 | — | 164k | 🔧 tools |
| gmi/deepseek-ai/DeepSeek-V3-0324 | $0.28 | $0.88 | — | 164k | 🔧 tools |
| gmi/MiniMaxAI/MiniMax-M2.1 | $0.30 | $1.20 | — | 197k | |
| gmi/Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 | $0.30 | $1.40 | — | 262k | 👁️ vision |
| gmi/zai-org/GLM-4.7-FP8 | $0.40 | $2.00 | — | 203k | |
| gmi/google/gemini-3-flash-preview | $0.50 | $3.00 | — | 1049k | 👁️ vision · 🔧 tools |
| gmi/moonshotai/Kimi-K2-Thinking | $0.80 | $1.20 | — | 262k | |
| gmi/openai/gpt-5.1 | $1.25 | $10.00 | — | 410k | 🔧 tools |
| gmi/openai/gpt-5 | $1.25 | $10.00 | — | 410k | 🔧 tools |
| gmi/openai/gpt-5.2 | $1.75 | $14.00 | — | 410k | 🔧 tools |
| gmi/google/gemini-3-pro-preview | $2.00 | $12.00 | — | 1049k | 👁️ vision · 🔧 tools |
| gmi/openai/gpt-4o | $2.50 | $10.00 | — | 131k | 👁️ vision · 🔧 tools |
| gmi/anthropic/claude-sonnet-4.5 | $3.00 | $15.00 | — | 410k | 👁️ vision · 🔧 tools |
| gmi/anthropic/claude-sonnet-4 | $3.00 | $15.00 | — | 410k | 👁️ vision · 🔧 tools |
| gmi/anthropic/claude-opus-4.5 | $5.00 | $25.00 | — | 410k | 👁️ vision · 🔧 tools |
| gmi/anthropic/claude-opus-4 | $15.00 | $75.00 | — | 410k | 👁️ vision · 🔧 tools |
sambanova (17)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| sambanova/Meta-Llama-3.2-1B-Instruct | $0.04 | $0.08 | — | 16k | |
| sambanova/Meta-Llama-3.2-3B-Instruct | $0.08 | $0.16 | — | 4k | |
| sambanova/Meta-Llama-3.1-8B-Instruct | $0.10 | $0.20 | — | 16k | 🔧 tools |
| sambanova/MiniMax-M2.7 | $0.30 | $1.20 | — | 205k | 🔧 tools |
| sambanova/Meta-Llama-Guard-3-8B | $0.30 | $0.30 | — | 16k | |
| sambanova/Llama-4-Scout-17B-16E-Instruct | $0.40 | $0.70 | — | 8k | 🔧 tools |
| sambanova/Qwen3-32B | $0.40 | $0.80 | — | 8k | 🔧 tools |
| sambanova/QwQ-32B | $0.50 | $1.00 | — | 16k | |
| sambanova/Qwen2-Audio-7B-Instruct | $0.50 | $100.00 | — | 4k | |
| sambanova/Meta-Llama-3.3-70B-Instruct | $0.60 | $1.20 | — | 131k | 🔧 tools |
| sambanova/Llama-4-Maverick-17B-128E-Instruct | $0.63 | $1.80 | — | 131k | 👁️ vision · 🔧 tools |
| sambanova/DeepSeek-R1-Distill-Llama-70B | $0.70 | $1.40 | — | 131k | |
| sambanova/DeepSeek-V3-0324 | $3.00 | $4.50 | — | 33k | 🔧 tools |
| sambanova/DeepSeek-V3.1 | $3.00 | $4.50 | — | 33k | 🔧 tools |
| sambanova/gpt-oss-120b | $3.00 | $4.50 | — | 131k | 🔧 tools |
| sambanova/DeepSeek-R1 | $5.00 | $7.00 | — | 33k | |
| sambanova/Meta-Llama-3.1-405B-Instruct | $5.00 | $10.00 | — | 16k | 🔧 tools |
hyperbolic (16)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| hyperbolic/NousResearch/Hermes-3-Llama-3.1-70B | $0.12 | $0.30 | — | 33k | 🔧 tools |
| hyperbolic/Qwen/Qwen2.5-72B-Instruct | $0.12 | $0.30 | — | 131k | 🔧 tools |
| hyperbolic/Qwen/Qwen2.5-Coder-32B-Instruct | $0.12 | $0.30 | — | 33k | 🔧 tools |
| hyperbolic/meta-llama/Llama-3.2-3B-Instruct | $0.12 | $0.30 | — | 33k | 🔧 tools |
| hyperbolic/meta-llama/Llama-3.3-70B-Instruct | $0.12 | $0.30 | — | 131k | 🔧 tools |
| hyperbolic/meta-llama/Meta-Llama-3-70B-Instruct | $0.12 | $0.30 | — | 131k | 🔧 tools |
| hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct | $0.12 | $0.30 | — | 33k | 🔧 tools |
| hyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct | $0.12 | $0.30 | — | 33k | 🔧 tools |
| hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.12 | $0.30 | — | 33k | 🔧 tools |
| hyperbolic/Qwen/QwQ-32B | $0.20 | $0.20 | — | 131k | 🔧 tools |
| hyperbolic/deepseek-ai/DeepSeek-V3 | $0.20 | $0.20 | — | 33k | 🔧 tools |
| hyperbolic/deepseek-ai/DeepSeek-R1-0528 | $0.25 | $0.25 | — | 131k | 🔧 tools |
| hyperbolic/deepseek-ai/DeepSeek-R1 | $0.40 | $0.40 | — | 33k | 🔧 tools |
| hyperbolic/deepseek-ai/DeepSeek-V3-0324 | $0.40 | $0.40 | — | 33k | 🔧 tools |
| hyperbolic/Qwen/Qwen3-235B-A22B | $2.00 | $2.00 | — | 131k | 🔧 tools |
| hyperbolic/moonshotai/Kimi-K2-Instruct | $2.00 | $2.00 | — | 131k | 🔧 tools |
wandb (16)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| wandb/MiniMaxAI/MiniMax-M2.5 | $0.30 | $1.20 | — | 197k | 🔧 tools |
| wandb/moonshotai/Kimi-K2-Instruct | $0.60 | $2.50 | — | 128k | |
| wandb/moonshotai/Kimi-K2.5 | $0.60 | $3.00 | $0.10 | 262k | 👁️ vision · 🔧 tools |
| wandb/openai/gpt-oss-20b | $5000.00 | $20000.00 | — | 131k | |
| wandb/microsoft/Phi-4-mini-instruct | $8000.00 | $35000.00 | — | 128k | |
| wandb/Qwen/Qwen3-235B-A22B-Instruct-2507 | $10000.00 | $10000.00 | — | 262k | |
| wandb/Qwen/Qwen3-235B-A22B-Thinking-2507 | $10000.00 | $10000.00 | — | 262k | |
| wandb/openai/gpt-oss-120b | $15000.00 | $60000.00 | — | 131k | |
| wandb/meta-llama/Llama-4-Scout-17B-16E-Instruct | $17000.00 | $66000.00 | — | 64k | |
| wandb/meta-llama/Llama-3.1-8B-Instruct | $22000.00 | $22000.00 | — | 128k | |
| wandb/zai-org/GLM-4.5 | $55000.00 | $200000.00 | — | 131k | |
| wandb/deepseek-ai/DeepSeek-V3.1 | $55000.00 | $165000.00 | — | 128k | |
| wandb/meta-llama/Llama-3.3-70B-Instruct | $71000.00 | $71000.00 | — | 128k | |
| wandb/Qwen/Qwen3-Coder-480B-A35B-Instruct | $100000.00 | $150000.00 | — | 262k | |
| wandb/deepseek-ai/DeepSeek-V3-0324 | $114000.00 | $275000.00 | — | 161k | |
| wandb/deepseek-ai/DeepSeek-R1-0528 | $135000.00 | $540000.00 | — | 161k |
ovhcloud (15)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| ovhcloud/gpt-oss-20b | $0.04 | $0.15 | — | 131k | |
| ovhcloud/Qwen3-32B | $0.08 | $0.23 | — | 32k | 🔧 tools |
| ovhcloud/gpt-oss-120b | $0.08 | $0.40 | — | 131k | |
| ovhcloud/Mistral-Small-3.2-24B-Instruct-2506 | $0.09 | $0.28 | — | 128k | 👁️ vision · 🔧 tools |
| ovhcloud/Llama-3.1-8B-Instruct | $0.10 | $0.10 | — | 131k | 🔧 tools |
| ovhcloud/Mistral-7B-Instruct-v0.3 | $0.10 | $0.10 | — | 127k | 🔧 tools |
| ovhcloud/Mistral-Nemo-Instruct-2407 | $0.13 | $0.13 | — | 118k | 🔧 tools |
| ovhcloud/mamba-codestral-7B-v0.1 | $0.19 | $0.19 | — | 256k | |
| ovhcloud/llava-v1.6-mistral-7b-hf | $0.29 | $0.29 | — | 32k | 👁️ vision |
| ovhcloud/Mixtral-8x7B-Instruct-v0.1 | $0.63 | $0.63 | — | 32k | |
| ovhcloud/DeepSeek-R1-Distill-Llama-70B | $0.67 | $0.67 | — | 131k | 🔧 tools |
| ovhcloud/Meta-Llama-3_1-70B-Instruct | $0.67 | $0.67 | — | 131k | |
| ovhcloud/Meta-Llama-3_3-70B-Instruct | $0.67 | $0.67 | — | 131k | 🔧 tools |
| ovhcloud/Qwen2.5-Coder-32B-Instruct | $0.87 | $0.87 | — | 32k | |
| ovhcloud/Qwen2.5-VL-72B-Instruct | $0.91 | $0.91 | — | 32k | 👁️ vision |
nscale (14)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| nscale/Qwen/Qwen2.5-Coder-3B-Instruct | $0.01 | $0.03 | — | — | |
| nscale/Qwen/Qwen2.5-Coder-7B-Instruct | $0.01 | $0.03 | — | — | |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B | $0.02 | $0.02 | — | — | |
| nscale/meta-llama/Llama-3.1-8B-Instruct | $0.03 | $0.03 | — | — | |
| nscale/Qwen/Qwen2.5-Coder-32B-Instruct | $0.06 | $0.20 | — | — | |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.07 | $0.07 | — | — | |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.09 | $0.09 | — | — | |
| nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.09 | $0.29 | — | — | |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.15 | $0.15 | — | — | |
| nscale/Qwen/QwQ-32B | $0.18 | $0.20 | — | — | |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.20 | $0.20 | — | — | |
| nscale/meta-llama/Llama-3.3-70B-Instruct | $0.20 | $0.20 | — | — | |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.38 | $0.38 | — | — | |
| nscale/mistralai/mixtral-8x22b-instruct-v0.1 | $0.60 | $0.60 | — | — |
llamagate (14)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| llamagate/llama-3.1-8b | $0.03 | $0.05 | — | 131k | 🔧 tools |
| llamagate/gemma3-4b | $0.03 | $0.08 | — | 128k | 👁️ vision · 🔧 tools |
| llamagate/llama-3.2-3b | $0.04 | $0.08 | — | 131k | 🔧 tools |
| llamagate/qwen3-8b | $0.04 | $0.14 | — | 33k | 🔧 tools |
| llamagate/qwen2.5-coder-7b | $0.06 | $0.12 | — | 33k | 🔧 tools |
| llamagate/deepseek-coder-6.7b | $0.06 | $0.12 | — | 16k | 🔧 tools |
| llamagate/codellama-7b | $0.06 | $0.12 | — | 16k | 🔧 tools |
| llamagate/dolphin3-8b | $0.08 | $0.15 | — | 128k | 🔧 tools |
| llamagate/deepseek-r1-7b-qwen | $0.08 | $0.15 | — | 131k | 🔧 tools |
| llamagate/openthinker-7b | $0.08 | $0.15 | — | 33k | 🔧 tools |
| llamagate/mistral-7b-v0.3 | $0.10 | $0.15 | — | 33k | 🔧 tools |
| llamagate/deepseek-r1-8b | $0.10 | $0.20 | — | 66k | 🔧 tools |
| llamagate/llava-7b | $0.10 | $0.20 | — | 4k | 👁️ vision |
| llamagate/qwen3-vl-8b | $0.15 | $0.55 | — | 33k | 👁️ vision · 🔧 tools |
anyscale (12)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| anyscale/HuggingFaceH4/zephyr-7b-beta | $0.15 | $0.15 | — | 16k | |
| anyscale/google/gemma-7b-it | $0.15 | $0.15 | — | 8k | |
| anyscale/meta-llama/Llama-2-7b-chat-hf | $0.15 | $0.15 | — | 4k | |
| anyscale/meta-llama/Meta-Llama-3-8B-Instruct | $0.15 | $0.15 | — | 8k | |
| anyscale/mistralai/Mistral-7B-Instruct-v0.1 | $0.15 | $0.15 | — | 16k | 🔧 tools |
| anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.15 | $0.15 | — | 16k | 🔧 tools |
| anyscale/meta-llama/Llama-2-13b-chat-hf | $0.25 | $0.25 | — | 4k | |
| anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 | $0.90 | $0.90 | — | 66k | 🔧 tools |
| anyscale/codellama/CodeLlama-34b-Instruct-hf | $1.00 | $1.00 | — | 4k | |
| anyscale/codellama/CodeLlama-70b-Instruct-hf | $1.00 | $1.00 | — | 4k | |
| anyscale/meta-llama/Llama-2-70b-chat-hf | $1.00 | $1.00 | — | 4k | |
| anyscale/meta-llama/Meta-Llama-3-70B-Instruct | $1.00 | $1.00 | — | 8k |
ai21 (12)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| jamba-1.5 | $0.20 | $0.40 | — | 256k | |
| jamba-1.5-mini | $0.20 | $0.40 | — | 256k | |
| jamba-1.5-mini@001 | $0.20 | $0.40 | — | 256k | |
| jamba-mini-1.6 | $0.20 | $0.40 | — | 256k | |
| jamba-mini-1.7 | $0.20 | $0.40 | — | 256k | |
| jamba-1.5-large | $2.00 | $8.00 | — | 256k | |
| jamba-1.5-large@001 | $2.00 | $8.00 | — | 256k | |
| jamba-large-1.6 | $2.00 | $8.00 | — | 256k | |
| jamba-large-1.7 | $2.00 | $8.00 | — | 256k | |
| j2-light | $3.00 | $3.00 | — | 8k | |
| j2-mid | $10.00 | $10.00 | — | 8k | |
| j2-ultra | $15.00 | $15.00 | — | 8k |
baseten (11)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| baseten/openai/gpt-oss-120b | $0.10 | $0.50 | — | — | |
| baseten/MiniMaxAI/MiniMax-M2.5 | $0.30 | $1.20 | — | — | |
| baseten/nvidia/Nemotron-120B-A12B | $0.30 | $0.75 | — | — | |
| baseten/deepseek-ai/DeepSeek-V3.1 | $0.50 | $1.50 | — | — | |
| baseten/zai-org/GLM-4.7 | $0.60 | $2.20 | — | — | |
| baseten/zai-org/GLM-4.6 | $0.60 | $2.20 | — | — | |
| baseten/moonshotai/Kimi-K2.5 | $0.60 | $3.00 | — | — | |
| baseten/moonshotai/Kimi-K2-Thinking | $0.60 | $2.50 | — | — | |
| baseten/moonshotai/Kimi-K2-Instruct-0905 | $0.60 | $2.50 | — | — | |
| baseten/deepseek-ai/DeepSeek-V3-0324 | $0.77 | $0.77 | — | — | |
| baseten/zai-org/GLM-5 | $0.95 | $3.15 | — | — |
groq (11)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| groq/llama-3.1-8b-instant | $0.05 | $0.08 | — | 128k | 🔧 tools |
| groq/gemma-7b-it | $0.05 | $0.08 | — | 8k | 🔧 tools |
| groq/openai/gpt-oss-20b | $0.07 | $0.30 | $0.04 | 131k | 🔧 tools · 🌐 search |
| groq/openai/gpt-oss-safeguard-20b | $0.07 | $0.30 | $0.04 | 131k | 🔧 tools · 🌐 search |
| groq/meta-llama/llama-4-scout-17b-16e-instruct | $0.11 | $0.34 | — | 131k | 👁️ vision · 🔧 tools |
| groq/openai/gpt-oss-120b | $0.15 | $0.60 | $0.07 | 131k | 🔧 tools · 🌐 search |
| groq/meta-llama/llama-guard-4-12b | $0.20 | $0.20 | — | 8k | |
| groq/meta-llama/llama-4-maverick-17b-128e-instruct | $0.20 | $0.60 | — | 131k | 👁️ vision · 🔧 tools |
| groq/qwen/qwen3-32b | $0.29 | $0.59 | — | 131k | 🔧 tools |
| groq/llama-3.3-70b-versatile | $0.59 | $0.79 | — | 128k | 🔧 tools |
| groq/moonshotai/kimi-k2-instruct-0905 | $1.00 | $3.00 | $0.50 | 262k | 🔧 tools |
vertex ai-llama models (11)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/meta/llama-3.1-70b-instruct-maas | $0.0000 | $0.0000 | — | 128k | 👁️ vision |
| vertex_ai/meta/llama-3.1-8b-instruct-maas | $0.0000 | $0.0000 | — | 128k | 👁️ vision |
| vertex_ai/meta/llama-3.2-90b-vision-instruct-maas | $0.0000 | $0.0000 | — | 128k | 👁️ vision |
| vertex_ai/meta/llama3-405b-instruct-maas | $0.0000 | $0.0000 | — | 32k | |
| vertex_ai/meta/llama3-70b-instruct-maas | $0.0000 | $0.0000 | — | 32k | |
| vertex_ai/meta/llama3-8b-instruct-maas | $0.0000 | $0.0000 | — | 32k | |
| vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas | $0.25 | $0.70 | — | 10000k | 🔧 tools |
| vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas | $0.25 | $0.70 | — | 10000k | 🔧 tools |
| vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | $0.35 | $1.15 | — | 1000k | 🔧 tools |
| vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas | $0.35 | $1.15 | — | 1000k | 🔧 tools |
| vertex_ai/meta/llama-3.1-405b-instruct-maas | $5.00 | $16.00 | — | 128k | 👁️ vision |
zai (11)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| zai/glm-4.5-flash | $0.0000 | $0.0000 | — | 128k | 🔧 tools |
| zai/glm-4-32b-0414-128k | $0.10 | $0.10 | — | 128k | 🔧 tools |
| zai/glm-4.5-air | $0.20 | $1.10 | — | 128k | 🔧 tools |
| zai/glm-4.7 | $0.60 | $2.20 | $0.11 | 200k | 🔧 tools · 💾 cache |
| zai/glm-4.6 | $0.60 | $2.20 | $0.11 | 200k | 🔧 tools · 💾 cache |
| zai/glm-4.5 | $0.60 | $2.20 | — | 128k | 🔧 tools |
| zai/glm-4.5v | $0.60 | $1.80 | — | 128k | 👁️ vision · 🔧 tools |
| zai/glm-5 | $1.00 | $3.20 | $0.20 | 200k | 🔧 tools · 💾 cache |
| zai/glm-4.5-airx | $1.10 | $4.50 | — | 128k | 🔧 tools |
| zai/glm-5-code | $1.20 | $5.00 | $0.30 | 200k | 🔧 tools · 💾 cache |
| zai/glm-4.5-x | $2.20 | $8.90 | — | 128k | 🔧 tools |
gradient ai (10)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| gradient_ai/llama3-8b-instruct | $0.20 | $0.20 | — | 8k | |
| gradient_ai/mistral-nemo-instruct-2407 | $0.30 | $0.30 | — | 128k | |
| gradient_ai/llama3.3-70b-instruct | $0.65 | $0.65 | — | 128k | |
| gradient_ai/anthropic-claude-3.5-haiku | $0.80 | $4.00 | — | 200k | |
| gradient_ai/deepseek-r1-distill-llama-70b | $0.99 | $0.99 | — | 33k | |
| gradient_ai/openai-o3-mini | $1.10 | $4.40 | — | 200k | |
| gradient_ai/openai-o3 | $2.00 | $8.00 | — | 200k | |
| gradient_ai/anthropic-claude-3.5-sonnet | $3.00 | $15.00 | — | 200k | |
| gradient_ai/anthropic-claude-3.7-sonnet | $3.00 | $15.00 | — | 200k | |
| gradient_ai/anthropic-claude-3-opus | $15.00 | $75.00 | — | 200k |
publicai (9)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| publicai/swiss-ai/apertus-8b-instruct | $0.0000 | $0.0000 | — | 8k | |
| publicai/swiss-ai/apertus-70b-instruct | $0.0000 | $0.0000 | — | 8k | |
| publicai/aisingapore/Gemma-SEA-LION-v4-27B-IT | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| publicai/BSC-LT/salamandra-7b-instruct-tools-16k | $0.0000 | $0.0000 | — | 16k | 🔧 tools |
| publicai/BSC-LT/ALIA-40b-instruct_Q8_0 | $0.0000 | $0.0000 | — | 8k | 🔧 tools |
| publicai/allenai/Olmo-3-7B-Instruct | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| publicai/aisingapore/Qwen-SEA-LION-v4-32B-IT | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| publicai/allenai/Olmo-3-7B-Think | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
| publicai/allenai/Olmo-3-32B-Think | $0.0000 | $0.0000 | — | 33k | 🔧 tools |
deepseek (8)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| deepseek/deepseek-coder | $0.14 | $0.28 | — | 128k | 🔧 tools · 💾 cache |
| deepseek/deepseek-v3 | $0.27 | $1.10 | $0.07 | 66k | 🔧 tools · 💾 cache |
| deepseek-chat | $0.28 | $0.42 | $0.03 | 131k | 🔧 tools · 💾 cache |
| deepseek-reasoner | $0.28 | $0.42 | $0.03 | 131k | 💾 cache |
| deepseek/deepseek-chat | $0.28 | $0.42 | $0.03 | 131k | 🔧 tools · 💾 cache |
| deepseek/deepseek-reasoner | $0.28 | $0.42 | $0.03 | 131k | 💾 cache |
| deepseek/deepseek-v3.2 | $0.28 | $0.40 | — | 164k | 🔧 tools · 💾 cache |
| deepseek/deepseek-r1 | $0.55 | $2.19 | — | 66k | 🔧 tools · 💾 cache |
vertex ai (8)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/xai/grok-4.1-fast-non-reasoning | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 🌐 search |
| vertex_ai/xai/grok-4.1-fast-reasoning | $0.20 | $0.50 | $0.05 | 2000k | 👁️ vision · 🔧 tools · 🌐 search |
| vertex_ai/gemini-3-flash-preview | $0.50 | $3.00 | $0.05 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| vertex_ai/gemini-3-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| vertex_ai/gemini-3.1-pro-preview | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| vertex_ai/gemini-3.1-pro-preview-customtools | $2.00 | $12.00 | $0.20 | 1049k | 👁️ vision · 🔧 tools · 💾 cache · 🌐 search |
| vertex_ai/xai/grok-4.20-non-reasoning | $2.00 | $6.00 | $0.20 | 2000k | 👁️ vision · 🔧 tools · 🌐 search |
| vertex_ai/xai/grok-4.20-reasoning | $2.00 | $6.00 | $0.20 | 2000k | 👁️ vision · 🔧 tools · 🌐 search |
cerebras (7)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| cerebras/llama3.1-8b | $0.10 | $0.10 | — | 128k | 🔧 tools |
| cerebras/gpt-oss-120b | $0.35 | $0.75 | — | 131k | 🔧 tools |
| cerebras/qwen-3-32b | $0.40 | $0.80 | — | 128k | 🔧 tools |
| cerebras/llama3.1-70b | $0.60 | $0.60 | — | 128k | 🔧 tools |
| cerebras/llama-3.3-70b | $0.85 | $1.20 | — | 128k | 🔧 tools |
| cerebras/zai-glm-4.6 | $2.25 | $2.75 | — | 128k | 🔧 tools |
| cerebras/zai-glm-4.7 | $2.25 | $2.75 | — | 128k | 🔧 tools |
cohere chat (7)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| command-r | $0.15 | $0.60 | — | 128k | 🔧 tools |
| command-r-08-2024 | $0.15 | $0.60 | — | 128k | 🔧 tools |
| command-r7b-12-2024 | $0.15 | $0.04 | — | 128k | 🔧 tools |
| command-light | $0.30 | $0.60 | — | 4k | |
| command-a-03-2025 | $2.50 | $10.00 | — | 256k | 🔧 tools |
| command-r-plus | $2.50 | $10.00 | — | 128k | 🔧 tools |
| command-r-plus-08-2024 | $2.50 | $10.00 | — | 128k | 🔧 tools |
crusoe (7)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| crusoe/google/gemma-3-12b-it | $0.10 | $0.10 | — | 131k | 👁️ vision · 🔧 tools |
| crusoe/meta-llama/Llama-3.3-70B-Instruct | $0.20 | $0.20 | — | 131k | 🔧 tools |
| crusoe/openai/gpt-oss-120b | $0.80 | $0.80 | — | 131k | 🔧 tools |
| crusoe/deepseek-ai/DeepSeek-V3-0324 | $1.50 | $1.50 | — | 164k | 🔧 tools |
| crusoe/moonshotai/Kimi-K2-Thinking | $2.50 | $2.50 | — | 262k | |
| crusoe/deepseek-ai/DeepSeek-R1-0528 | $3.00 | $7.00 | — | 164k | |
| crusoe/Qwen/Qwen3-235B-A22B-Instruct-2507 | $3.00 | $3.00 | — | 262k | 🔧 tools |
text-completion-openai (6)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| babbage-002 | $0.40 | $0.40 | — | 16k | |
| gpt-3.5-turbo-instruct | $1.50 | $2.00 | — | 8k | |
| gpt-3.5-turbo-instruct-0914 | $1.50 | $2.00 | — | 8k | |
| ft:babbage-002 | $1.60 | $1.60 | — | 16k | |
| davinci-002 | $2.00 | $2.00 | — | 16k | |
| ft:davinci-002 | $12.00 | $12.00 | — | 16k |
palm (6)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| palm/chat-bison | $0.13 | $0.13 | — | 8k | |
| palm/chat-bison-001 | $0.13 | $0.13 | — | 8k | |
| palm/text-bison | $0.13 | $0.13 | — | 8k | |
| palm/text-bison-001 | $0.13 | $0.13 | — | 8k | |
| palm/text-bison-safety-off | $0.13 | $0.13 | — | 8k | |
| palm/text-bison-safety-recitation-off | $0.13 | $0.13 | — | 8k |
sagemaker (6)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| sagemaker/meta-textgeneration-llama-2-13b | $0.0000 | $0.0000 | — | 4k | |
| sagemaker/meta-textgeneration-llama-2-13b-f | $0.0000 | $0.0000 | — | 4k | |
| sagemaker/meta-textgeneration-llama-2-70b | $0.0000 | $0.0000 | — | 4k | |
| sagemaker/meta-textgeneration-llama-2-70b-b-f | $0.0000 | $0.0000 | — | 4k | |
| sagemaker/meta-textgeneration-llama-2-7b | $0.0000 | $0.0000 | — | 4k | |
| sagemaker/meta-textgeneration-llama-2-7b-f | $0.0000 | $0.0000 | — | 4k |
lemonade (5)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| lemonade/Qwen3-Coder-30B-A3B-Instruct-GGUF | $0.0000 | $0.0000 | — | 262k | 🔧 tools |
| lemonade/gpt-oss-20b-mxfp4-GGUF | $0.0000 | $0.0000 | — | 131k | 🔧 tools |
| lemonade/gpt-oss-120b-mxfp-GGUF | $0.0000 | $0.0000 | — | 131k | 🔧 tools |
| lemonade/Gemma-3-4b-it-GGUF | $0.0000 | $0.0000 | — | 128k | 🔧 tools |
| lemonade/Qwen3-4B-Instruct-2507-GGUF | $0.0000 | $0.0000 | — | 262k | 🔧 tools |
minimax (5)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| minimax/MiniMax-M2.1 | $0.30 | $1.20 | $0.03 | 1000k | 🔧 tools · 💾 cache |
| minimax/MiniMax-M2.1-lightning | $0.30 | $2.40 | $0.03 | 1000k | 🔧 tools · 💾 cache |
| minimax/MiniMax-M2.5 | $0.30 | $1.20 | $0.03 | 1000k | 🔧 tools · 💾 cache |
| minimax/MiniMax-M2.5-lightning | $0.30 | $2.40 | $0.03 | 1000k | 🔧 tools · 💾 cache |
| minimax/MiniMax-M2 | $0.30 | $1.20 | $0.03 | 200k | 🔧 tools · 💾 cache |
vertex ai-ai21 models (5)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/jamba-1.5 | $0.20 | $0.40 | — | 256k | |
| vertex_ai/jamba-1.5-mini | $0.20 | $0.40 | — | 256k | |
| vertex_ai/jamba-1.5-mini@001 | $0.20 | $0.40 | — | 256k | |
| vertex_ai/jamba-1.5-large | $2.00 | $8.00 | — | 256k | |
| vertex_ai/jamba-1.5-large@001 | $2.00 | $8.00 | — | 256k |
cloudflare (4)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| cloudflare/@cf/meta/llama-2-7b-chat-fp16 | $1.92 | $1.92 | — | 3k | |
| cloudflare/@cf/meta/llama-2-7b-chat-int8 | $1.92 | $1.92 | — | 2k | |
| cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 | $1.92 | $1.92 | — | 8k | |
| cloudflare/@hf/thebloke/codellama-7b-instruct-awq | $1.92 | $1.92 | — | 4k |
amazon nova (4)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| amazon-nova/nova-micro-v1 | $0.04 | $0.14 | — | 128k | 🔧 tools · 💾 cache |
| amazon-nova/nova-lite-v1 | $0.06 | $0.24 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| amazon-nova/nova-pro-v1 | $0.80 | $3.20 | — | 300k | 👁️ vision · 🔧 tools · 💾 cache |
| amazon-nova/nova-premier-v1 | $2.50 | $12.50 | — | 1000k | 👁️ vision · 🔧 tools |
vertex ai-qwen models (4)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/qwen/qwen3-next-80b-a3b-instruct-maas | $0.15 | $1.20 | — | 262k | 🔧 tools |
| vertex_ai/qwen/qwen3-next-80b-a3b-thinking-maas | $0.15 | $1.20 | — | 262k | 🔧 tools |
| vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas | $0.25 | $1.00 | — | 262k | 🔧 tools |
| vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas | $1.00 | $4.00 | — | 262k | 🔧 tools |
bedrock mantle (4)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| bedrock_mantle/openai.gpt-oss-20b | $0.07 | $0.30 | — | 131k | 🔧 tools |
| bedrock_mantle/openai.gpt-oss-safeguard-20b | $0.07 | $0.30 | — | 131k | 🔧 tools |
| bedrock_mantle/openai.gpt-oss-120b | $0.15 | $0.60 | — | 131k | 🔧 tools |
| bedrock_mantle/openai.gpt-oss-safeguard-120b | $0.15 | $0.60 | — | 131k | 🔧 tools |
azure text (3)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| azure/gpt-3.5-turbo-instruct-0914 | $1.50 | $2.00 | — | 4k | |
| azure/gpt-35-turbo-instruct | $1.50 | $2.00 | — | 4k | |
| azure/gpt-35-turbo-instruct-0914 | $1.50 | $2.00 | — | 4k |
volcengine (3)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| deepseek-v3-2-251201 | $0.0000 | $0.0000 | — | 98k | 🔧 tools · 💾 cache |
| glm-4-7-251222 | $0.0000 | $0.0000 | — | 205k | 🔧 tools · 💾 cache |
| kimi-k2-thinking-251104 | $0.0000 | $0.0000 | — | 229k | 🔧 tools · 💾 cache |
gigachat (3)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| gigachat/GigaChat-2-Lite | $0.0000 | $0.0000 | — | 128k | 🔧 tools |
| gigachat/GigaChat-2-Max | $0.0000 | $0.0000 | — | 128k | 👁️ vision · 🔧 tools |
| gigachat/GigaChat-2-Pro | $0.0000 | $0.0000 | — | 128k | 👁️ vision · 🔧 tools |
v0 (3)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| v0/v0-1.0-md | $3.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools |
| v0/v0-1.5-md | $3.00 | $15.00 | — | 128k | 👁️ vision · 🔧 tools |
| v0/v0-1.5-lg | $15.00 | $75.00 | — | 512k | 👁️ vision · 🔧 tools |
vertex ai-deepseek models (3)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/deepseek-ai/deepseek-v3.2-maas | $0.56 | $1.68 | — | 164k | 🔧 tools · 💾 cache |
| vertex_ai/deepseek-ai/deepseek-v3.1-maas | $1.35 | $5.40 | — | 164k | 🔧 tools · 💾 cache |
| vertex_ai/deepseek-ai/deepseek-r1-0528-maas | $1.35 | $5.40 | — | 65k | 🔧 tools · 💾 cache |
nlp cloud (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| chatdolphin | $0.50 | $0.50 | — | 16k | |
| dolphin | $0.50 | $0.50 | — | 16k |
codestral (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| codestral/codestral-2405 | $0.0000 | $0.0000 | — | 32k | |
| codestral/codestral-latest | $0.0000 | $0.0000 | — | 32k |
cohere (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| command | $1.00 | $2.00 | — | 4k | |
| command-nightly | $1.00 | $2.00 | — | 4k |
fireworks ai-embedding-models (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| fireworks-ai-embedding-up-to-150m | $0.0080 | $0.0000 | — | — | |
| fireworks-ai-embedding-150m-to-350m | $0.02 | $0.0000 | — | — |
friendliai (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| friendliai/meta-llama-3.1-8b-instruct | $0.10 | $0.10 | — | 8k | 🔧 tools |
| friendliai/meta-llama-3.1-70b-instruct | $0.60 | $0.60 | — | 8k | 🔧 tools |
morph (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| morph/morph-v3-fast | $0.80 | $1.20 | — | 16k | |
| morph/morph-v3-large | $0.90 | $1.90 | — | 16k |
text-completion-codestral (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| text-completion-codestral/codestral-2405 | $0.0000 | $0.0000 | — | 32k | |
| text-completion-codestral/codestral-latest | $0.0000 | $0.0000 | — | 32k |
vertex ai-text-models (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| text-unicorn | $10.00 | $28.00 | — | 8k | |
| text-unicorn@001 | $10.00 | $28.00 | — | 8k |
vertex ai-zai models (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/zai-org/glm-4.7-maas | $0.60 | $2.20 | — | 200k | 🔧 tools |
| vertex_ai/zai-org/glm-5-maas | $1.00 | $3.20 | $0.10 | 200k | 🔧 tools · 💾 cache |
vertex ai-openai models (2)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/openai/gpt-oss-20b-maas | $0.07 | $0.30 | — | 131k | |
| vertex_ai/openai/gpt-oss-120b-maas | $0.15 | $0.60 | — | 131k |
vertex ai-minimax models (1)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/minimaxai/minimax-m2-maas | $0.30 | $1.20 | — | 197k | 🔧 tools |
vertex ai-moonshot models (1)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| vertex_ai/moonshotai/kimi-k2-thinking-maas | $0.60 | $2.50 | — | 256k | 🔧 tools · 🌐 search |
sarvam (1)
| Model | Input $/M | Output $/M | Cached $/M | Context | Features |
|---|---|---|---|---|---|
| sarvam/sarvam-m | $0.0000 | $0.0000 | $0.0000 | 8k |