All Models

Pricing for 2003 chat-capable models across 76 providers. Prices are in US dollars per million tokens ($/M). Last updated May 20, 2026.
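
As a rough sketch of how the per-million rates in the tables below turn into a per-request cost, the Python helper below assumes that $/M means US dollars per million tokens and that cached input tokens are billed at the Cached $/M rate in place of the Input $/M rate. The function name `estimate_cost` and its parameters are hypothetical, not part of any provider SDK, and the example rates are copied from the gpt-oss-20b row. Actual provider billing may differ.

```python
def estimate_cost(input_tokens, output_tokens, input_per_m, output_per_m,
                  cached_tokens=0, cached_per_m=None):
    """Estimate the USD cost of one request from per-million-token rates.

    Assumption: cached tokens are a subset of input_tokens and are billed
    at cached_per_m instead of input_per_m. If a model lists no cached
    rate, cached tokens fall back to the normal input rate.
    """
    if cached_tokens < 0 or cached_tokens > input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if cached_per_m is None:
        cached_per_m = input_per_m

    uncached_tokens = input_tokens - cached_tokens
    total_per_million = (
        uncached_tokens * input_per_m
        + cached_tokens * cached_per_m
        + output_tokens * output_per_m
    )
    # Rates are quoted per one million tokens, so scale back down.
    return total_per_million / 1_000_000


# Example: 10,000 input tokens and 2,000 output tokens at
# $0.05 input and $0.20 output per million tokens.
cost = estimate_cost(10_000, 2_000, input_per_m=0.05, output_per_m=0.20)
print(f"${cost:.6f}")
```

Passing the rates in explicitly keeps the helper independent of any particular row format, so the same function can be applied to any provider section on this page.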

fireworks ai (250)

Model Input $/M Output $/M Cached $/M Context Features
fireworks-ai-default $0.0000 $0.0000
fireworks_ai/accounts/fireworks/models/flux-1-dev-controlnet-union $0.0010 $0.0010 4k
fireworks_ai/accounts/fireworks/models/gpt-oss-20b $0.05 $0.20 131k 🔧 tools
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct $0.10 $0.10 16k
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct $0.10 $0.10 16k
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct $0.10 $0.10 16k
fireworks_ai/accounts/fireworks/models/codegemma-2b $0.10 $0.10 8k
fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-3b $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/deepseek-coder-1b-base $0.10 $0.10 16k
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-1p5b $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/ernie-4p5-21b-a3b-pt $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/ernie-4p5-300b-a47b-pt $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/flux-1-dev $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/flux-1-schnell $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/gemma-2b-it $0.10 $0.10 8k
fireworks_ai/accounts/fireworks/models/llama-guard-3-1b $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/llama-v2-70b $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct-1b $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/llama-v3p2-1b $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/llama-v3p2-3b $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/minimax-m1-80k $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/ministral-3-3b-instruct-2512 $0.10 $0.10 256k
fireworks_ai/accounts/fireworks/models/nemotron-nano-v2-12b-vl $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/phi-2-3b $0.10 $0.10 2k
fireworks_ai/accounts/fireworks/models/phi-3-mini-128k-instruct $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/qwen2-vl-2b-instruct $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-0p5b-instruct $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-1p5b-instruct $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-0p5b $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-0p5b-instruct $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-1p5b $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-1p5b-instruct $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-3b $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-3b-instruct $0.10 $0.10 33k
fireworks_ai/accounts/fireworks/models/qwen3-0p6b $0.10 $0.10 41k
fireworks_ai/accounts/fireworks/models/qwen3-1p7b $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft $0.10 $0.10 262k
fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-131072 $0.10 $0.10 131k
fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-40960 $0.10 $0.10 41k
fireworks_ai/accounts/fireworks/models/stablecode-3b $0.10 $0.10 4k
fireworks_ai/accounts/fireworks/models/starcoder2-3b $0.10 $0.10 16k
fireworks_ai/accounts/fireworks/models/gpt-oss-120b $0.15 $0.60 131k 🔧 tools
fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic $0.15 $0.60 131k
fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b $0.15 $0.60 131k
fireworks_ai/accounts/fireworks/models/qwen3-coder-30b-a3b-instruct $0.15 $0.60 262k
fireworks_ai/accounts/fireworks/models/qwen3-vl-30b-a3b-instruct $0.15 $0.60 262k
fireworks_ai/accounts/fireworks/models/qwen3-vl-30b-a3b-thinking $0.15 $0.60 262k
fireworks-ai-4.1b-to-16b $0.20 $0.20
fireworks-ai-up-to-4b $0.20 $0.20
fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct $0.20 $0.20 16k 👁️ vision
fireworks_ai/accounts/fireworks/models/chronos-hermes-13b-v2 $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/code-llama-13b $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/code-llama-13b-instruct $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/code-llama-13b-python $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/code-llama-7b $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/code-llama-7b-instruct $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/code-llama-7b-python $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/code-qwen-1p5-7b $0.20 $0.20 66k
fireworks_ai/accounts/fireworks/models/codegemma-7b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-8b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/cogito-v1-preview-qwen-14b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-base $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-base-v1p5 $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-instruct-v1p5 $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/deepseek-r1-0528-distill-qwen3-8b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-8b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-14b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-7b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/dobby-mini-unhinged-plus-llama-3-1-8b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/firellava-13b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/firesearch-ocr-v6 $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/gemma-7b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/gemma-7b-it $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/gemma2-9b-it $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/hermes-2-pro-mistral-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/internvl3-8b $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/llama-guard-2-8b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/llama-guard-3-8b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/llama-v2-13b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/llama-v2-13b-chat $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/llama-v2-7b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/llama-v2-7b-chat $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/llama-v3-8b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/llama-v3-8b-instruct-hf $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/llamaguard-7b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/ministral-3-14b-instruct-2512 $0.20 $0.20 256k
fireworks_ai/accounts/fireworks/models/ministral-3-8b-instruct-2512 $0.20 $0.20 256k
fireworks_ai/accounts/fireworks/models/mistral-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-4k $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-v0p2 $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-v3 $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/mistral-7b-v0p2 $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/mistral-nemo-base-2407 $0.20 $0.20 128k
fireworks_ai/accounts/fireworks/models/mistral-nemo-instruct-2407 $0.20 $0.20 128k
fireworks_ai/accounts/fireworks/models/mythomax-l2-13b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/nous-capybara-7b-v1p9 $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-13b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-7b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-12b-v2 $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v2 $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/openchat-3p5-0106-7b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/openhermes-2-mistral-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/openhermes-2p5-mistral-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/openorca-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/phi-3-vision-128k-instruct $0.20 $0.20 32k
fireworks_ai/accounts/fireworks/models/pythia-12b $0.20 $0.20 2k
fireworks_ai/accounts/fireworks/models/qwen-v2p5-14b-instruct $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen-v2p5-7b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/qwen2-7b-instruct $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2-vl-7b-instruct $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-14b $0.20 $0.20 131k
fireworks_ai/accounts/fireworks/models/qwen2p5-7b-instruct $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-14b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-14b-instruct $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-7b-instruct $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-vl-3b-instruct $0.20 $0.20 128k
fireworks_ai/accounts/fireworks/models/qwen2p5-vl-7b-instruct $0.20 $0.20 128k
fireworks_ai/accounts/fireworks/models/qwen3-14b $0.20 $0.20 41k
fireworks_ai/accounts/fireworks/models/qwen3-4b $0.20 $0.20 41k
fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507 $0.20 $0.20 262k
fireworks_ai/accounts/fireworks/models/qwen3-8b $0.20 $0.20 41k
fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/rolm-ocr $0.20 $0.20 128k
fireworks_ai/accounts/fireworks/models/snorkel-mistral-7b-pairrm-dpo $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/starcoder-16b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/starcoder-7b $0.20 $0.20 8k
fireworks_ai/accounts/fireworks/models/starcoder2-15b $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/starcoder2-7b $0.20 $0.20 16k
fireworks_ai/accounts/fireworks/models/toppy-m-7b $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/yi-6b $0.20 $0.20 4k
fireworks_ai/accounts/fireworks/models/zephyr-7b-beta $0.20 $0.20 33k
fireworks_ai/accounts/fireworks/models/glm-4p5-air $0.22 $0.88 128k 🔧 tools
fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic $0.22 $0.88 131k
fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b $0.22 $0.88 131k
fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b-instruct-2507 $0.22 $0.88 262k
fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b-thinking-2507 $0.22 $0.88 262k
fireworks_ai/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct $0.22 $0.88 262k
fireworks_ai/accounts/fireworks/models/qwen3-vl-235b-a22b-thinking $0.22 $0.88 262k
fireworks_ai/accounts/fireworks/models/minimax-m2p1 $0.30 $1.20 $0.03 205k 🔧 tools
fireworks_ai/minimax-m2p1 $0.30 $1.20 $0.03 205k 🔧 tools
fireworks_ai/accounts/fireworks/models/minimax-m2 $0.30 $1.20 4k
fireworks_ai/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct $0.45 $1.80 262k
fireworks-ai-moe-up-to-56b $0.50 $0.50
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-lite-base $0.50 $0.50 164k
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-lite-instruct $0.50 $0.50 164k
fireworks_ai/accounts/fireworks/models/deepseek-v2-lite-chat $0.50 $0.50 164k
fireworks_ai/accounts/fireworks/models/dolphin-2p6-mixtral-8x7b $0.50 $0.50 33k
fireworks_ai/accounts/fireworks/models/firefunction-v1 $0.50 $0.50 33k
fireworks_ai/accounts/fireworks/models/gpt-oss-safeguard-20b $0.50 $0.50 131k
fireworks_ai/accounts/fireworks/models/mixtral-8x7b $0.50 $0.50 33k
fireworks_ai/accounts/fireworks/models/mixtral-8x7b-instruct $0.50 $0.50 33k
fireworks_ai/accounts/fireworks/models/mixtral-8x7b-instruct-hf $0.50 $0.50 33k
fireworks_ai/accounts/fireworks/models/nous-hermes-2-mixtral-8x7b-dpo $0.50 $0.50 33k
fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b-instruct-2507 $0.50 $0.50 262k
fireworks_ai/accounts/fireworks/models/deepseek-r1-basic $0.55 $2.19 128k
fireworks_ai/accounts/fireworks/models/glm-4p5 $0.55 $2.19 128k 🔧 tools
fireworks_ai/accounts/fireworks/models/glm-4p6 $0.55 $2.19 203k 🔧 tools
fireworks_ai/accounts/fireworks/models/deepseek-v3p1 $0.56 $1.68 128k
fireworks_ai/accounts/fireworks/models/deepseek-v3p1-terminus $0.56 $1.68 128k
fireworks_ai/accounts/fireworks/models/deepseek-v3p2 $0.56 $1.68 164k 🔧 tools
fireworks_ai/accounts/fireworks/models/glm-4p7 $0.60 $2.20 $0.30 203k 🔧 tools
fireworks_ai/accounts/fireworks/models/kimi-k2-instruct $0.60 $2.50 131k 🔧 tools
fireworks_ai/accounts/fireworks/models/kimi-k2-instruct-0905 $0.60 $2.50 262k 🔧 tools
fireworks_ai/accounts/fireworks/models/kimi-k2-thinking $0.60 $2.50 262k 🔧 tools · 🌐 search
fireworks_ai/accounts/fireworks/models/kimi-k2p5 $0.60 $3.00 $0.10 262k 🔧 tools
fireworks_ai/glm-4p7 $0.60 $2.20 $0.30 203k 🔧 tools
fireworks_ai/kimi-k2p5 $0.60 $3.00 $0.10 262k 🔧 tools
fireworks-ai-above-16b $0.90 $0.90
fireworks_ai/accounts/fireworks/models/deepseek-v3 $0.90 $0.90 128k
fireworks_ai/accounts/fireworks/models/deepseek-v3-0324 $0.90 $0.90 164k
fireworks_ai/accounts/fireworks/models/firefunction-v2 $0.90 $0.90 8k 🔧 tools
fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct $0.90 $0.90 16k 👁️ vision
fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/code-llama-34b $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/code-llama-34b-instruct $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/code-llama-34b-python $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/code-llama-70b $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/code-llama-70b-instruct $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/code-llama-70b-python $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-70b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/cogito-v1-preview-qwen-32b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/deepseek-coder-33b-instruct $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-70b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-32b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/devstral-small-2505 $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/dobby-unhinged-llama-3-3-70b-new $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/dolphin-2-9-2-qwen2-72b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/fare-20b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/gemma-3-27b-it $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/internvl3-38b $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/internvl3-78b $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/kat-coder $0.90 $0.90 262k
fireworks_ai/accounts/fireworks/models/kat-dev-32b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/kat-dev-72b-exp $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/llama-v2-70b-chat $0.90 $0.90 2k
fireworks_ai/accounts/fireworks/models/llama-v3-70b-instruct $0.90 $0.90 8k
fireworks_ai/accounts/fireworks/models/llama-v3-70b-instruct-hf $0.90 $0.90 8k
fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/llama-v3p1-nemotron-70b-instruct $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/llama-v3p3-70b-instruct $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/llava-yi-34b $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/mistral-small-24b-instruct-2501 $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/nous-hermes-2-yi-34b $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-70b $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-python-v1 $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-v1 $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-v2 $0.90 $0.90 16k
fireworks_ai/accounts/fireworks/models/qwen-qwq-32b-preview $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen1p5-72b-chat $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2-vl-72b-instruct $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-32b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/qwen2p5-32b-instruct $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-72b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/qwen2p5-72b-instruct $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-128k $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-32k-rope $0.90 $0.90 33k
fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-64k $0.90 $0.90 66k
fireworks_ai/accounts/fireworks/models/qwen2p5-math-72b-instruct $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/qwen2p5-vl-32b-instruct $0.90 $0.90 128k
fireworks_ai/accounts/fireworks/models/qwen2p5-vl-72b-instruct $0.90 $0.90 128k
fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b-thinking-2507 $0.90 $0.90 262k
fireworks_ai/accounts/fireworks/models/qwen3-32b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/qwen3-coder-480b-instruct-bf16 $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/qwen3-next-80b-a3b-instruct $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/qwen3-next-80b-a3b-thinking $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/qwen3-vl-32b-instruct $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/qwq-32b $0.90 $0.90 131k
fireworks_ai/accounts/fireworks/models/yi-34b $0.90 $0.90 4k
fireworks_ai/accounts/fireworks/models/yi-34b-200k-capybara $0.90 $0.90 200k
fireworks_ai/accounts/fireworks/models/yi-34b-chat $0.90 $0.90 4k
fireworks-ai-56b-to-176b $1.20 $1.20
fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct $1.20 $1.20 66k
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf $1.20 $1.20 66k 🔧 tools
fireworks_ai/accounts/fireworks/models/cogito-671b-v2-p1 $1.20 $1.20 164k
fireworks_ai/accounts/fireworks/models/dbrx-instruct $1.20 $1.20 33k
fireworks_ai/accounts/fireworks/models/deepseek-prover-v2 $1.20 $1.20 164k
fireworks_ai/accounts/fireworks/models/deepseek-v2p5 $1.20 $1.20 33k
fireworks_ai/accounts/fireworks/models/glm-4p5v $1.20 $1.20 131k
fireworks_ai/accounts/fireworks/models/gpt-oss-safeguard-120b $1.20 $1.20 131k
fireworks_ai/accounts/fireworks/models/mistral-large-3-fp8 $1.20 $1.20 256k
fireworks_ai/accounts/fireworks/models/mixtral-8x22b $1.20 $1.20 66k
fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct $1.20 $1.20 66k
fireworks_ai/accounts/fireworks/models/deepseek-r1 $3.00 $8.00 128k
fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 $3.00 $8.00 160k
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct $3.00 $3.00 128k 🔧 tools
fireworks_ai/accounts/fireworks/models/yi-large $3.00 $3.00 33k

bedrock (188)

Model Input $/M Output $/M Cached $/M Context Features
anthropic.claude-mythos-preview $0.0000 $0.0000 1000k 👁️ vision · 🔧 tools
meta.llama3-2-1b-instruct-v1:0 $0.10 $0.10 128k 🔧 tools
us.meta.llama3-2-1b-instruct-v1:0 $0.10 $0.10 128k 🔧 tools
eu.meta.llama3-2-1b-instruct-v1:0 $0.13 $0.13 128k 🔧 tools
bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 $0.15 $0.20 32k
bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 $0.15 $0.20 32k
meta.llama3-2-3b-instruct-v1:0 $0.15 $0.15 128k 🔧 tools
mistral.mistral-7b-instruct-v0:2 $0.15 $0.20 32k
us.meta.llama3-2-3b-instruct-v1:0 $0.15 $0.15 128k 🔧 tools
eu.meta.llama3-2-3b-instruct-v1:0 $0.19 $0.19 128k 🔧 tools
ai21.jamba-1-5-mini-v1:0 $0.20 $0.40 256k
bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 $0.20 $0.26 32k
meta.llama3-1-8b-instruct-v1:0 $0.22 $0.22 128k 🔧 tools
us.meta.llama3-1-8b-instruct-v1:0 $0.22 $0.22 128k 🔧 tools
anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 $0.02 200k 👁️ vision · 🔧 tools
apac.anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 $0.02 200k 👁️ vision · 🔧 tools
eu.anthropic.claude-3-5-haiku-20241022-v1:0 $0.25 $1.25 $0.02 200k 🔧 tools · 💾 cache
eu.anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 $0.02 200k 👁️ vision · 🔧 tools
us.anthropic.claude-3-haiku-20240307-v1:0 $0.25 $1.25 $0.02 200k 👁️ vision · 🔧 tools
amazon.titan-text-lite-v1 $0.30 $0.40 42k
bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 $0.30 $0.60 8k
bedrock/us-east-1/minimax.minimax-m2.1 $0.30 $1.20 196k 🔧 tools
bedrock/us-east-1/minimax.minimax-m2.5 $0.30 $1.20 1000k 🔧 tools
bedrock/us-east-2/minimax.minimax-m2.1 $0.30 $1.20 196k 🔧 tools
bedrock/us-east-2/minimax.minimax-m2.5 $0.30 $1.20 1000k 🔧 tools
bedrock/us-gov-east-1/amazon.titan-text-lite-v1 $0.30 $0.40 42k
bedrock/us-gov-east-1/anthropic.claude-3-haiku-20240307-v1:0 $0.30 $1.50 $0.03 200k 👁️ vision · 🔧 tools
bedrock/us-gov-east-1/meta.llama3-8b-instruct-v1:0 $0.30 $2.65 8k
bedrock/us-gov-west-1/amazon.titan-text-lite-v1 $0.30 $0.40 42k
bedrock/us-gov-west-1/anthropic.claude-3-haiku-20240307-v1:0 $0.30 $1.50 $0.03 200k 👁️ vision · 🔧 tools
bedrock/us-gov-west-1/meta.llama3-8b-instruct-v1:0 $0.30 $2.65 8k
bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 $0.30 $0.60 8k
bedrock/us-west-2/minimax.minimax-m2.1 $0.30 $1.20 196k 🔧 tools
bedrock/us-west-2/minimax.minimax-m2.5 $0.30 $1.20 1000k 🔧 tools
cohere.command-light-text-v14 $0.30 $0.60 4k
meta.llama3-8b-instruct-v1:0 $0.30 $0.60 8k
bedrock/ap-southeast-2/minimax.minimax-m2.5 $0.31 $1.24 1000k 🔧 tools
bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 $0.32 $0.65 8k
bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 $0.35 $0.69 8k
meta.llama3-2-11b-instruct-v1:0 $0.35 $0.35 128k 👁️ vision · 🔧 tools
us.meta.llama3-2-11b-instruct-v1:0 $0.35 $0.35 128k 👁️ vision · 🔧 tools
bedrock/ap-northeast-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/ap-northeast-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 $0.36 $0.72 8k
bedrock/ap-south-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/ap-south-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/ap-southeast-3/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/ap-southeast-3/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/eu-north-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/eu-north-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/eu-central-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/eu-central-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/eu-west-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/eu-west-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/eu-south-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/eu-south-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/sa-east-1/minimax.minimax-m2.1 $0.36 $1.44 196k 🔧 tools
bedrock/sa-east-1/minimax.minimax-m2.5 $0.36 $1.44 1000k 🔧 tools
bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 $0.39 $0.78 8k
bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 $0.45 $0.70 32k
bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 $0.45 $0.70 32k
mistral.mixtral-8x7b-instruct-v0:1 $0.45 $0.70 32k
bedrock/eu-west-2/minimax.minimax-m2.1 $0.47 $1.86 196k 🔧 tools
bedrock/eu-west-2/minimax.minimax-m2.5 $0.47 $1.86 1000k 🔧 tools
ai21.jamba-instruct-v1:0 $0.50 $0.70 70k
amazon.titan-text-premier-v1:0 $0.50 $1.50 42k
bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 $0.50 $1.01 8k
bedrock/us-east-1/qwen.qwen3-coder-next $0.50 $1.20 262k 🔧 tools
bedrock/us-east-2/qwen.qwen3-coder-next $0.50 $1.20 262k 🔧 tools
bedrock/us-gov-east-1/amazon.titan-text-premier-v1:0 $0.50 $1.50 42k
bedrock/us-gov-west-1/amazon.titan-text-premier-v1:0 $0.50 $1.50 42k
bedrock/us-west-2/qwen.qwen3-coder-next $0.50 $1.20 262k 🔧 tools
cohere.command-r-v1:0 $0.50 $1.50 128k
bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 $0.59 $0.91 32k
bedrock/ap-northeast-1/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/moonshotai.kimi-k2.5 $0.60 $3.03 262k 👁️ vision · 🔧 tools
bedrock/ap-south-1/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/ap-southeast-3/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/eu-central-1/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/eu-west-1/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/eu-south-1/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/sa-east-1/qwen.qwen3-coder-next $0.60 $1.44 262k 🔧 tools
bedrock/us-east-1/moonshotai.kimi-k2-thinking $0.60 $2.50 262k 🔧 tools
bedrock/us-east-1/moonshotai.kimi-k2.5 $0.60 $3.00 262k 👁️ vision · 🔧 tools
bedrock/us-east-2/moonshotai.kimi-k2-thinking $0.60 $2.50 262k 🔧 tools
bedrock/us-east-2/moonshotai.kimi-k2.5 $0.60 $3.00 262k 👁️ vision · 🔧 tools
bedrock/us-west-2/moonshotai.kimi-k2-thinking $0.60 $2.50 262k 🔧 tools
bedrock/us-west-2/moonshotai.kimi-k2.5 $0.60 $3.00 262k 👁️ vision · 🔧 tools
bedrock/us-east-1/deepseek.v3.2 $0.62 $1.85 164k 🔧 tools
bedrock/us-east-2/deepseek.v3.2 $0.62 $1.85 164k 🔧 tools
bedrock/us-west-2/deepseek.v3.2 $0.62 $1.85 164k 🔧 tools
bedrock/ap-south-1/moonshotai.kimi-k2-thinking $0.71 $2.94 262k 🔧 tools
bedrock/ap-northeast-1/moonshotai.kimi-k2.5 $0.72 $3.60 262k 👁️ vision · 🔧 tools
bedrock/ap-south-1/moonshotai.kimi-k2.5 $0.72 $3.60 262k 👁️ vision · 🔧 tools
bedrock/ap-southeast-3/moonshotai.kimi-k2.5 $0.72 $3.60 262k 👁️ vision · 🔧 tools
bedrock/eu-north-1/moonshotai.kimi-k2.5 $0.72 $3.60 262k 👁️ vision · 🔧 tools
bedrock/sa-east-1/moonshotai.kimi-k2.5 $0.72 $3.60 262k 👁️ vision · 🔧 tools
bedrock/ap-northeast-1/moonshotai.kimi-k2-thinking $0.73 $3.03 262k 🔧 tools
bedrock/moonshotai.kimi-k2-thinking $0.73 $3.03 262k 🔧 tools
bedrock/sa-east-1/moonshotai.kimi-k2-thinking $0.73 $3.03 262k 🔧 tools
bedrock/ap-northeast-1/deepseek.v3.2 $0.74 $2.22 164k 🔧 tools
bedrock/ap-south-1/deepseek.v3.2 $0.74 $2.22 164k 🔧 tools
bedrock/ap-southeast-3/deepseek.v3.2 $0.74 $2.22 164k 🔧 tools
bedrock/eu-north-1/deepseek.v3.2 $0.74 $2.22 164k 🔧 tools
bedrock/sa-east-1/deepseek.v3.2 $0.74 $2.22 164k 🔧 tools
meta.llama2-13b-chat-v1 $0.75 $1.00 4k
bedrock/eu-west-2/qwen.qwen3-coder-next $0.78 $1.86 262k 🔧 tools
anthropic.claude-3-5-haiku-20241022-v1:0 $0.80 $4.00 $0.08 200k 🔧 tools · 💾 cache
anthropic.claude-instant-v1 $0.80 $2.40 100k
bedrock/us-east-1/anthropic.claude-instant-v1 $0.80 $2.40 100k
bedrock/us-west-2/anthropic.claude-instant-v1 $0.80 $2.40 100k
bedrock/us.anthropic.claude-3-5-haiku-20241022-v1:0 $0.80 $4.00 $0.08 200k 🔧 tools · 💾 cache
us.anthropic.claude-3-5-haiku-20241022-v1:0 $0.80 $4.00 $0.08 200k 🔧 tools · 💾 cache
bedrock/us-gov-east-1/amazon.nova-pro-v1:0 $0.96 $3.84 300k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-west-1/amazon.nova-pro-v1:0 $0.96 $3.84 300k 👁️ vision · 🔧 tools · 💾 cache
meta.llama3-1-70b-instruct-v1:0 $0.99 $0.99 128k 🔧 tools
us.meta.llama3-1-70b-instruct-v1:0 $0.99 $0.99 128k 🔧 tools
mistral.mistral-small-2402-v1:0 $1.00 $3.00 32k 🔧 tools
bedrock/us-east-1/zai.glm-5 $1.00 $3.20 200k 🔧 tools
bedrock/us-west-2/zai.glm-5 $1.00 $3.20 200k 🔧 tools
bedrock/us-gov-east-1/anthropic.claude-haiku-4-5-20251001-v1:0 $1.20 $6.00 $0.12 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-west-1/anthropic.claude-haiku-4-5-20251001-v1:0 $1.20 $6.00 $0.12 200k 👁️ vision · 🔧 tools · 💾 cache
amazon.titan-text-express-v1 $1.30 $1.70 42k
bedrock/us-gov-east-1/amazon.titan-text-express-v1 $1.30 $1.70 42k
bedrock/us-gov-west-1/amazon.titan-text-express-v1 $1.30 $1.70 42k
cohere.command-text-v14 $1.50 $2.00 4k
meta.llama2-70b-chat-v1 $1.95 $2.56 4k
ai21.jamba-1-5-large-v1:0 $2.00 $8.00 256k
meta.llama3-2-90b-instruct-v1:0 $2.00 $2.00 128k 👁️ vision · 🔧 tools
us.meta.llama3-2-90b-instruct-v1:0 $2.00 $2.00 128k 👁️ vision · 🔧 tools
bedrock/ap-northeast-1/anthropic.claude-instant-v1 $2.23 $7.55 100k
bedrock/eu-central-1/anthropic.claude-instant-v1 $2.48 $8.38 100k
bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 $2.65 $3.50 8k
bedrock/us-gov-east-1/meta.llama3-70b-instruct-v1:0 $2.65 $3.50 8k
bedrock/us-gov-west-1/meta.llama3-70b-instruct-v1:0 $2.65 $3.50 8k
bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 $2.65 $3.50 8k
meta.llama3-70b-instruct-v1:0 $2.65 $3.50 8k
bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 $2.86 $3.78 8k
anthropic.claude-3-5-sonnet-20240620-v1:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools
anthropic.claude-3-5-sonnet-20241022-v2:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-3-sonnet-20240229-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
apac.anthropic.claude-3-5-sonnet-20240620-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
apac.anthropic.claude-3-5-sonnet-20241022-v2:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
apac.anthropic.claude-3-sonnet-20240229-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
claude-sonnet-4-5-20250929-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
cohere.command-r-plus-v1:0 $3.00 $15.00 128k
eu.anthropic.claude-3-5-sonnet-20240620-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
eu.anthropic.claude-3-5-sonnet-20241022-v2:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-3-7-sonnet-20250219-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-3-sonnet-20240229-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
mistral.mistral-large-2407-v1:0 $3.00 $9.00 128k 🔧 tools
us.anthropic.claude-3-5-sonnet-20240620-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
us.anthropic.claude-3-5-sonnet-20241022-v2:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-3-sonnet-20240229-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 $3.05 $4.03 8k
bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 $3.18 $4.20 8k
bedrock/us-gov-east-1/anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-east-1/claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-west-1/anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-west-1/claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 $3.45 $4.55 8k
anthropic.claude-3-7-sonnet-20240620-v1:0 $3.60 $18.00 $0.36 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-east-1/anthropic.claude-3-5-sonnet-20240620-v1:0 $3.60 $18.00 $0.36 200k 👁️ vision · 🔧 tools
bedrock/us-gov-west-1/anthropic.claude-3-7-sonnet-20250219-v1:0 $3.60 $18.00 $0.36 200k 👁️ vision · 🔧 tools · 💾 cache
bedrock/us-gov-west-1/anthropic.claude-3-5-sonnet-20240620-v1:0 $3.60 $18.00 $0.36 200k 👁️ vision · 🔧 tools
bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 $4.45 $5.88 8k
meta.llama3-1-405b-instruct-v1:0 $5.32 $16.00 128k 🔧 tools
us.meta.llama3-1-405b-instruct-v1:0 $5.32 $16.00 128k 🔧 tools
anthropic.claude-v1 $8.00 $24.00 100k
anthropic.claude-v2:1 $8.00 $24.00 100k
bedrock/ap-northeast-1/anthropic.claude-v1 $8.00 $24.00 100k
bedrock/ap-northeast-1/anthropic.claude-v2:1 $8.00 $24.00 100k
bedrock/eu-central-1/anthropic.claude-v1 $8.00 $24.00 100k
bedrock/eu-central-1/anthropic.claude-v2:1 $8.00 $24.00 100k
bedrock/us-east-1/anthropic.claude-v1 $8.00 $24.00 100k
bedrock/us-east-1/anthropic.claude-v2:1 $8.00 $24.00 100k
bedrock/us-east-1/mistral.mistral-large-2402-v1:0 $8.00 $24.00 32k 🔧 tools
bedrock/us-west-2/anthropic.claude-v1 $8.00 $24.00 100k
bedrock/us-west-2/anthropic.claude-v2:1 $8.00 $24.00 100k
bedrock/us-west-2/mistral.mistral-large-2402-v1:0 $8.00 $24.00 32k 🔧 tools
mistral.mistral-large-2402-v1:0 $8.00 $24.00 32k 🔧 tools
bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 $10.40 $31.20 32k 🔧 tools
ai21.j2-mid-v1 $12.50 $12.50 8k
anthropic.claude-3-opus-20240229-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
eu.anthropic.claude-3-opus-20240229-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
us.anthropic.claude-3-opus-20240229-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
ai21.j2-ultra-v1 $18.80 $18.80 8k

azure (124)

Model Input $/M Output $/M Cached $/M Context Features
azure/gpt-5-nano $0.05 $0.40 $0.0050 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5-nano-2025-08-07 $0.05 $0.40 $0.0050 272k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-5-nano-2025-08-07 $0.06 $0.44 $0.0055 272k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-5-nano-2025-08-07 $0.06 $0.44 $0.0055 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4.1-nano $0.10 $0.40 $0.02 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4.1-nano-2025-04-14 $0.10 $0.40 $0.02 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-4.1-nano-2025-04-14 $0.11 $0.44 $0.02 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/global-standard/gpt-4o-mini $0.15 $0.60 128k 👁️ vision · 🔧 tools
azure/eu/gpt-4o-mini-2024-07-18 $0.17 $0.66 $0.08 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4o-mini $0.17 $0.66 $0.07 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4o-mini-2024-07-18 $0.17 $0.66 $0.07 128k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-4o-mini-2024-07-18 $0.17 $0.66 $0.08 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.4-nano $0.20 $1.25 $0.02 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure/gpt-5.4-nano-2026-03-17 $0.20 $1.25 $0.02 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure/gpt-5-mini $0.25 $2.00 $0.02 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5-mini-2025-08-07 $0.25 $2.00 $0.02 272k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-5-mini-2025-08-07 $0.28 $2.20 $0.03 272k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-5-mini-2025-08-07 $0.28 $2.20 $0.03 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4.1-mini $0.40 $1.60 $0.10 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4.1-mini-2025-04-14 $0.40 $1.60 $0.10 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-4.1-mini-2025-04-14 $0.44 $1.76 $0.11 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-3.5-turbo $0.50 $1.50 4k 🔧 tools
azure/gpt-3.5-turbo-0125 $0.50 $1.50 16k 🔧 tools
azure/gpt-35-turbo $0.50 $1.50 4k 🔧 tools
azure/gpt-35-turbo-0125 $0.50 $1.50 16k 🔧 tools
azure/gpt-audio-mini-2025-10-06 $0.60 $2.40 128k 🔧 tools
azure/gpt-4o-mini-realtime-preview-2024-12-17 $0.60 $2.40 $0.30 128k 🔧 tools
azure/gpt-realtime-mini-2025-10-06 $0.60 $2.40 $0.06 32k 🔧 tools
azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 $0.66 $2.64 $0.33 128k 🔧 tools
azure/us/gpt-4o-mini-realtime-preview-2024-12-17 $0.66 $2.64 $0.33 128k 🔧 tools
azure/gpt-5.4-mini $0.75 $4.50 $0.07 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure/gpt-5.4-mini-2026-03-17 $0.75 $4.50 $0.07 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure/gpt-35-turbo-1106 $1.00 $2.00 16k 🔧 tools
azure/o1-mini-2024-09-12 $1.10 $4.40 $0.55 128k 🔧 tools · 💾 cache
azure/o3-mini $1.10 $4.40 $0.55 200k 💾 cache
azure/o3-mini-2025-01-31 $1.10 $4.40 $0.55 200k 💾 cache
azure/o4-mini $1.10 $4.40 $0.28 200k 👁️ vision · 🔧 tools · 💾 cache
azure/o4-mini-2025-04-16 $1.10 $4.40 $0.28 200k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/o1-mini-2024-09-12 $1.21 $4.84 $0.60 128k 🔧 tools · 💾 cache
azure/eu/o3-mini-2025-01-31 $1.21 $4.84 $0.60 200k 💾 cache
azure/o1-mini $1.21 $4.84 $0.60 128k 🔧 tools · 💾 cache
azure/us/o1-mini-2024-09-12 $1.21 $4.84 $0.60 128k 🔧 tools · 💾 cache
azure/us/o3-mini-2025-01-31 $1.21 $4.84 $0.60 200k 💾 cache
azure/us/o4-mini-2025-04-16 $1.21 $4.84 $0.31 200k 👁️ vision · 🔧 tools · 💾 cache
azure/global/gpt-5.1 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache
azure/global/gpt-5.1-chat $1.25 $10.00 $0.13 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.1-2025-11-13 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.1-chat-2025-11-13 $1.25 $10.00 $0.13 128k 👁️ vision · 💾 cache
azure/gpt-5 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5-2025-08-07 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5-chat $1.25 $10.00 $0.13 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5-chat-latest $1.25 $10.00 $0.13 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.1 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.1-chat $1.25 $10.00 $0.13 128k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-5-2025-08-07 $1.38 $11.00 $0.14 272k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-5-2025-08-07 $1.38 $11.00 $0.14 272k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-5.1 $1.38 $11.00 $0.14 272k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-5.1-chat $1.38 $11.00 $0.14 128k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-5.1 $1.38 $11.00 $0.14 272k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-5.1-chat $1.38 $11.00 $0.14 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.2 $1.75 $14.00 $0.17 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.2-2025-12-11 $1.75 $14.00 $0.17 272k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.2-chat $1.75 $14.00 $0.17 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.2-chat-2025-12-11 $1.75 $14.00 $0.17 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.3-chat $1.75 $14.00 $0.17 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4.1 $2.00 $8.00 $0.50 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4.1-2025-04-14 $2.00 $8.00 $0.50 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/o3 $2.00 $8.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure/o3-2025-04-16 $2.00 $8.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-4.1-2025-04-14 $2.20 $8.80 $0.55 1048k 👁️ vision · 🔧 tools · 💾 cache
azure/us/o3-2025-04-16 $2.20 $8.80 $0.55 200k 👁️ vision · 🔧 tools · 💾 cache
azure/global-standard/gpt-4o-2024-08-06 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
azure/global-standard/gpt-4o-2024-11-20 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools
azure/global/gpt-4o-2024-08-06 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
azure/global/gpt-4o-2024-11-20 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4o $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4o-2024-08-06 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-audio-2025-08-28 $2.50 $10.00 128k 🔧 tools
azure/gpt-audio-1.5-2026-02-23 $2.50 $10.00 128k 🔧 tools
azure/gpt-4o-audio-preview-2024-12-17 $2.50 $10.00 128k 🔧 tools
azure/gpt-4o-mini-audio-preview-2024-12-17 $2.50 $10.00 128k 🔧 tools
azure/gpt-5.4 $2.50 $15.00 $0.25 1050k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-5.4-2026-03-05 $2.50 $15.00 $0.25 1050k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-4o-2024-08-06 $2.75 $11.00 $1.38 128k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/gpt-4o-2024-11-20 $2.75 $11.00 128k 👁️ vision · 🔧 tools
azure/gpt-4o-2024-11-20 $2.75 $11.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-4o-2024-08-06 $2.75 $11.00 $1.38 128k 👁️ vision · 🔧 tools · 💾 cache
azure/us/gpt-4o-2024-11-20 $2.75 $11.00 128k 👁️ vision · 🔧 tools
azure/command-r-plus $3.00 $15.00 128k 🔧 tools
azure/computer-use-preview $3.00 $12.00 8k 👁️ vision · 🔧 tools
azure/gpt-35-turbo-16k $3.00 $4.00 16k
azure/gpt-35-turbo-16k-0613 $3.00 $4.00 16k 🔧 tools
computer-use-preview $3.00 $12.00 8k 👁️ vision · 🔧 tools
azure/gpt-realtime-2025-08-28 $4.00 $16.00 $4.00 32k 🔧 tools
azure/gpt-realtime-1.5-2026-02-23 $4.00 $16.00 $4.00 32k 🔧 tools
azure/gpt-4o-2024-05-13 $5.00 $15.00 128k 👁️ vision · 🔧 tools · 💾 cache
azure/gpt-4o-realtime-preview-2024-10-01 $5.00 $20.00 $2.50 128k 🔧 tools
azure/gpt-4o-realtime-preview-2024-12-17 $5.00 $20.00 $2.50 128k 🔧 tools
azure/gpt-5.5 $5.00 $30.00 $0.50 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure/gpt-5.5-2026-04-23 $5.00 $30.00 $0.50 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure/eu/gpt-4o-realtime-preview-2024-10-01 $5.50 $22.00 $2.75 128k 🔧 tools
azure/eu/gpt-4o-realtime-preview-2024-12-17 $5.50 $22.00 $2.75 128k 🔧 tools
azure/us/gpt-4o-realtime-preview-2024-10-01 $5.50 $22.00 $2.75 128k 🔧 tools
azure/us/gpt-4o-realtime-preview-2024-12-17 $5.50 $22.00 $2.75 128k 🔧 tools
azure/mistral-large-2402 $8.00 $24.00 32k 🔧 tools
azure/mistral-large-latest $8.00 $24.00 32k 🔧 tools
azure/gpt-4-0125-preview $10.00 $30.00 128k 🔧 tools
azure/gpt-4-1106-preview $10.00 $30.00 128k 🔧 tools
azure/gpt-4-turbo $10.00 $30.00 128k 🔧 tools
azure/gpt-4-turbo-2024-04-09 $10.00 $30.00 128k 👁️ vision · 🔧 tools
azure/gpt-4-turbo-vision-preview $10.00 $30.00 128k 👁️ vision
azure/o1 $15.00 $60.00 $7.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure/o1-2024-12-17 $15.00 $60.00 $7.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure/o1-preview $15.00 $60.00 $7.50 128k 🔧 tools · 💾 cache
azure/o1-preview-2024-09-12 $15.00 $60.00 $7.50 128k 🔧 tools · 💾 cache
azure/eu/o1-2024-12-17 $16.50 $66.00 $8.25 200k 👁️ vision · 🔧 tools · 💾 cache
azure/eu/o1-preview-2024-09-12 $16.50 $66.00 $8.25 128k 🔧 tools · 💾 cache
azure/us/o1-2024-12-17 $16.50 $66.00 $8.25 200k 👁️ vision · 🔧 tools · 💾 cache
azure/us/o1-preview-2024-09-12 $16.50 $66.00 $8.25 128k 🔧 tools · 💾 cache
azure/gpt-4 $30.00 $60.00 8k 🔧 tools
azure/gpt-4-0613 $30.00 $60.00 8k 🔧 tools
azure/gpt-4-32k $60.00 $120.00 33k
azure/gpt-4-32k-0613 $60.00 $120.00 33k
azure/gpt-4.5-preview $75.00 $150.00 $37.50 128k 👁️ vision · 🔧 tools · 💾 cache
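
The Input, Output, and Cached columns are all quoted in US dollars per million tokens, so a per-request estimate is a weighted sum of token counts. The sketch below is illustrative only: `Price` and `estimate_cost` are hypothetical names, not part of any provider SDK. It assumes cached tokens are a subset of the input tokens and are billed at the Cached rate instead of the Input rate. Actual billing rules vary by provider, and the figures are copied as displayed in the table, which may be rounded.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Price:
    """Per-million-token prices in USD, as listed in the table (hypothetical helper)."""
    input_per_m: float
    output_per_m: float
    cached_per_m: Optional[float] = None  # None when the Cached column is empty


def estimate_cost(price: Price, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached_tokens counts input tokens served from cache, billed at
    the cached rate; if no cached rate is listed, they are billed as normal input.
    """
    if cached_tokens < 0 or cached_tokens > input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    cached_rate = price.cached_per_m if price.cached_per_m is not None else price.input_per_m
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * price.input_per_m
             + cached_tokens * cached_rate
             + output_tokens * price.output_per_m)
    return total / 1_000_000


# Row "azure/gpt-4o $2.50 $10.00 $1.25" from the table above.
azure_gpt_4o = Price(input_per_m=2.50, output_per_m=10.00, cached_per_m=1.25)

# 1M input tokens (400k of them cached) plus 100k output tokens.
cost = estimate_cost(azure_gpt_4o, input_tokens=1_000_000,
                     output_tokens=100_000, cached_tokens=400_000)
print(f"${cost:.2f}")
```

For rows with an empty Cached column, leave `cached_per_m` unset so that every input token is charged at the Input rate.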

bedrock converse (121)

Model Input $/M Output $/M Cached $/M Context Features
amazon.nova-micro-v1:0 $0.04 $0.14 128k 🔧 tools · 💾 cache
us.amazon.nova-micro-v1:0 $0.04 $0.14 128k 🔧 tools · 💾 cache
apac.amazon.nova-micro-v1:0 $0.04 $0.15 128k 🔧 tools · 💾 cache
google.gemma-3-4b-it $0.04 $0.08 128k 👁️ vision
mistral.voxtral-mini-3b-2507 $0.04 $0.04 128k
eu.amazon.nova-micro-v1:0 $0.05 $0.18 128k 🔧 tools · 💾 cache
amazon.nova-lite-v1:0 $0.06 $0.24 300k 👁️ vision · 🔧 tools · 💾 cache
nvidia.nemotron-nano-9b-v2 $0.06 $0.23 128k
nvidia.nemotron-nano-3-30b $0.06 $0.24 262k 🔧 tools
us.amazon.nova-lite-v1:0 $0.06 $0.24 300k 👁️ vision · 🔧 tools · 💾 cache
apac.amazon.nova-lite-v1:0 $0.06 $0.25 300k 👁️ vision · 🔧 tools · 💾 cache
openai.gpt-oss-20b-1:0 $0.07 $0.30 128k 🔧 tools
openai.gpt-oss-safeguard-20b $0.07 $0.20 128k
zai.glm-4.7-flash $0.07 $0.40 200k 🔧 tools
eu.amazon.nova-lite-v1:0 $0.08 $0.31 300k 👁️ vision · 🔧 tools · 💾 cache
google.gemma-3-12b-it $0.09 $0.29 128k 👁️ vision
mistral.ministral-3-3b-instruct $0.10 $0.10 128k 🔧 tools
mistral.voxtral-small-24b-2507 $0.10 $0.30 128k
mistral.ministral-3-8b-instruct $0.15 $0.15 128k 🔧 tools
nvidia.nemotron-super-3-120b $0.15 $0.65 256k 🔧 tools
openai.gpt-oss-120b-1:0 $0.15 $0.60 128k 🔧 tools
openai.gpt-oss-safeguard-120b $0.15 $0.60 128k
qwen.qwen3-coder-30b-a3b-v1:0 $0.15 $0.60 262k 🔧 tools
qwen.qwen3-32b-v1:0 $0.15 $0.60 131k 🔧 tools
qwen.qwen3-next-80b-a3b $0.15 $1.20 128k 🔧 tools
meta.llama4-scout-17b-instruct-v1:0 $0.17 $0.66 128k 🔧 tools
us.meta.llama4-scout-17b-instruct-v1:0 $0.17 $0.66 128k 🔧 tools
mistral.ministral-3-14b-instruct $0.20 $0.20 128k 🔧 tools
nvidia.nemotron-nano-12b-v2 $0.20 $0.60 128k 👁️ vision
qwen.qwen3-coder-480b-a35b-v1:0 $0.22 $1.80 262k 🔧 tools
qwen.qwen3-235b-a22b-2507-v1:0 $0.22 $0.88 262k 🔧 tools
google.gemma-3-27b-it $0.23 $0.38 128k 👁️ vision
meta.llama4-maverick-17b-instruct-v1:0 $0.24 $0.97 128k 🔧 tools
us.meta.llama4-maverick-17b-instruct-v1:0 $0.24 $0.97 128k 🔧 tools
amazon.nova-2-lite-v1:0 $0.30 $2.50 $0.07 1000k 👁️ vision · 🔧 tools · 💾 cache
global.amazon.nova-2-lite-v1:0 $0.30 $2.50 $0.07 1000k 👁️ vision · 🔧 tools · 💾 cache
minimax.minimax-m2 $0.30 $1.20 128k
minimax.minimax-m2.1 $0.30 $1.20 196k 🔧 tools
minimax.minimax-m2.5 $0.30 $1.20 1000k 🔧 tools
apac.amazon.nova-2-lite-v1:0 $0.33 $2.75 $0.08 1000k 👁️ vision · 🔧 tools · 💾 cache
eu.amazon.nova-2-lite-v1:0 $0.33 $2.75 $0.08 1000k 👁️ vision · 🔧 tools · 💾 cache
us.amazon.nova-2-lite-v1:0 $0.33 $2.75 $0.08 1000k 👁️ vision · 🔧 tools · 💾 cache
mistral.devstral-2-123b $0.40 $2.00 256k 🔧 tools
mistral.magistral-small-2509 $0.50 $1.50 128k 🔧 tools
mistral.mistral-large-3-675b-instruct $0.50 $1.50 128k 🔧 tools
qwen.qwen3-coder-next $0.50 $1.20 262k 🔧 tools
qwen.qwen3-vl-235b-a22b $0.53 $2.66 128k 👁️ vision · 🔧 tools
deepseek.v3-v1:0 $0.58 $1.68 164k 🔧 tools
us.writer.palmyra-x5-v1:0 $0.60 $6.00 1000k 🔧 tools
writer.palmyra-x5-v1:0 $0.60 $6.00 1000k 🔧 tools
moonshot.kimi-k2-thinking $0.60 $2.50 128k
moonshotai.kimi-k2.5 $0.60 $3.00 262k 👁️ vision · 🔧 tools
zai.glm-4.7 $0.60 $2.20 200k 🔧 tools
deepseek.v3.2 $0.62 $1.85 164k 🔧 tools
us.deepseek.v3.2 $0.62 $1.85 164k 🔧 tools
meta.llama3-3-70b-instruct-v1:0 $0.72 $0.72 128k 🔧 tools
us.meta.llama3-3-70b-instruct-v1:0 $0.72 $0.72 128k 🔧 tools
eu.deepseek.v3.2 $0.74 $2.22 164k 🔧 tools
amazon.nova-pro-v1:0 $0.80 $3.20 300k 👁️ vision · 🔧 tools · 💾 cache
us.amazon.nova-pro-v1:0 $0.80 $3.20 300k 👁️ vision · 🔧 tools · 💾 cache
apac.amazon.nova-pro-v1:0 $0.84 $3.36 300k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-haiku-4-5-20251001-v1:0 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-haiku-4-5@20251001 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-haiku-4-5-20251001-v1:0 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
zai.glm-5 $1.00 $3.20 200k 🔧 tools
eu.amazon.nova-pro-v1:0 $1.05 $4.20 300k 👁️ vision · 🔧 tools · 💾 cache
apac.anthropic.claude-haiku-4-5-20251001-v1:0 $1.10 $5.50 $0.11 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-haiku-4-5-20251001-v1:0 $1.10 $5.50 $0.11 200k 👁️ vision · 🔧 tools · 💾 cache
jp.anthropic.claude-haiku-4-5-20251001-v1:0 $1.10 $5.50 $0.11 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-haiku-4-5-20251001-v1:0 $1.10 $5.50 $0.11 200k 👁️ vision · 🔧 tools · 💾 cache
au.anthropic.claude-haiku-4-5-20251001-v1:0 $1.10 $5.50 $0.11 200k 👁️ vision · 🔧 tools · 💾 cache
us.deepseek.r1-v1:0 $1.35 $5.40 128k
eu.mistral.pixtral-large-2502-v1:0 $2.00 $6.00 128k 🔧 tools
us.mistral.pixtral-large-2502-v1:0 $2.00 $6.00 128k 🔧 tools
amazon.nova-2-pro-preview-20251202-v1:0 $2.19 $17.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
apac.amazon.nova-2-pro-preview-20251202-v1:0 $2.19 $17.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
eu.amazon.nova-2-pro-preview-20251202-v1:0 $2.19 $17.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
us.amazon.nova-2-pro-preview-20251202-v1:0 $2.19 $17.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
us.writer.palmyra-x4-v1:0 $2.50 $10.00 128k 🔧 tools
writer.palmyra-x4-v1:0 $2.50 $10.00 128k 🔧 tools
us.amazon.nova-premier-v1:0 $2.50 $12.50 1000k 👁️ vision · 🔧 tools
anthropic.claude-3-7-sonnet-20250219-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-sonnet-4-6 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-sonnet-4-6 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-sonnet-4-20250514-v1:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-sonnet-4-5-20250929-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
apac.anthropic.claude-sonnet-4-20250514-v1:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-sonnet-4-20250514-v1:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-sonnet-4-5-20250929-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-sonnet-4-20250514-v1:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-3-7-sonnet-20250219-v1:0 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-sonnet-4-20250514-v1:0 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-sonnet-4-6 $3.30 $16.50 $0.33 1000k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-sonnet-4-6 $3.30 $16.50 $0.33 1000k 👁️ vision · 🔧 tools · 💾 cache
au.anthropic.claude-sonnet-4-6 $3.30 $16.50 $0.33 1000k 👁️ vision · 🔧 tools · 💾 cache
jp.anthropic.claude-sonnet-4-6 $3.30 $16.50 $0.33 1000k 👁️ vision · 🔧 tools · 💾 cache
au.anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
jp.anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
us-gov.anthropic.claude-sonnet-4-5-20250929-v1:0 $3.30 $16.50 $0.33 200k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-opus-4-5-20251101-v1:0 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-opus-4-6-v1 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-opus-4-6-v1 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-opus-4-7 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-opus-4-7 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
global.anthropic.claude-opus-4-5-20251101-v1:0 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-opus-4-5-20251101-v1:0 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-opus-4-6-v1 $5.50 $27.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-opus-4-6-v1 $5.50 $27.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
au.anthropic.claude-opus-4-6-v1 $5.50 $27.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-opus-4-7 $5.50 $27.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-opus-4-7 $5.50 $27.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
au.anthropic.claude-opus-4-7 $5.50 $27.50 $0.55 1000k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-opus-4-5-20251101-v1:0 $5.50 $27.50 $0.55 200k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-opus-4-1-20250805-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
anthropic.claude-opus-4-20250514-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-opus-4-1-20250805-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
eu.anthropic.claude-opus-4-20250514-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-opus-4-1-20250805-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
us.anthropic.claude-opus-4-20250514-v1:0 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache

openai (97)

Model Input $/M Output $/M Cached $/M Context Features
gpt-5-nano $0.05 $0.40 $0.0050 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5-nano-2025-08-07 $0.05 $0.40 $0.0050 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4.1-nano $0.10 $0.40 $0.02 1048k 👁️ vision · 🔧 tools · 💾 cache
gpt-4.1-nano-2025-04-14 $0.10 $0.40 $0.02 1048k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-mini $0.15 $0.60 $0.07 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-mini-2024-07-18 $0.15 $0.60 $0.07 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-mini-audio-preview $0.15 $0.60 128k 🔧 tools
gpt-4o-mini-audio-preview-2024-12-17 $0.15 $0.60 128k 🔧 tools
gpt-4o-mini-search-preview $0.15 $0.60 $0.07 128k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4o-mini-search-preview-2025-03-11 $0.15 $0.60 $0.07 128k 👁️ vision · 🔧 tools · 💾 cache
ft:gpt-4.1-nano-2025-04-14 $0.20 $0.80 $0.05 1048k 🔧 tools · 💾 cache
gpt-5.4-nano $0.20 $1.25 $0.02 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.4-nano-2026-03-17 $0.20 $1.25 $0.02 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5-mini $0.25 $2.00 $0.02 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5-mini-2025-08-07 $0.25 $2.00 $0.02 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
ft:gpt-4o-mini-2024-07-18 $0.30 $1.20 $0.15 128k 🔧 tools · 💾 cache
gpt-4.1-mini $0.40 $1.60 $0.10 1048k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4.1-mini-2025-04-14 $0.40 $1.60 $0.10 1048k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-3.5-turbo $0.50 $1.50 16k 🔧 tools · 💾 cache
gpt-3.5-turbo-0125 $0.50 $1.50 16k 🔧 tools · 💾 cache
gpt-audio-mini $0.60 $2.40 128k 🔧 tools
gpt-audio-mini-2025-10-06 $0.60 $2.40 128k 🔧 tools
gpt-audio-mini-2025-12-15 $0.60 $2.40 128k 🔧 tools
gpt-4o-mini-realtime-preview $0.60 $2.40 $0.30 128k 🔧 tools
gpt-4o-mini-realtime-preview-2024-12-17 $0.60 $2.40 $0.30 128k 🔧 tools
gpt-realtime-mini $0.60 $2.40 128k 🔧 tools
gpt-realtime-mini-2025-10-06 $0.60 $2.40 $0.06 128k 🔧 tools
gpt-realtime-mini-2025-12-15 $0.60 $2.40 $0.06 128k 🔧 tools
gpt-5.4-mini $0.75 $4.50 $0.07 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.4-mini-2026-03-17 $0.75 $4.50 $0.07 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
ft:gpt-4.1-mini-2025-04-14 $0.80 $3.20 $0.20 1048k 🔧 tools · 💾 cache
gpt-3.5-turbo-1106 $1.00 $2.00 16k 🔧 tools · 💾 cache
o3-mini $1.10 $4.40 $0.55 200k 🔧 tools · 💾 cache
o3-mini-2025-01-31 $1.10 $4.40 $0.55 200k 🔧 tools · 💾 cache
o4-mini $1.10 $4.40 $0.28 200k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
o4-mini-2025-04-16 $1.10 $4.40 $0.28 200k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.1 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.1-2025-11-13 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.1-chat-latest $1.25 $10.00 $0.13 128k 👁️ vision · 💾 cache · 🌐 search
gpt-5-2025-08-07 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5-chat $1.25 $10.00 $0.13 128k 👁️ vision · 💾 cache
gpt-5-chat-latest $1.25 $10.00 $0.13 128k 👁️ vision · 💾 cache
gpt-5-search-api $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5-search-api-2025-10-14 $1.25 $10.00 $0.13 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.2 $1.75 $14.00 $0.17 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.2-2025-12-11 $1.75 $14.00 $0.17 272k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.2-chat-latest $1.75 $14.00 $0.17 128k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.3-chat-latest $1.75 $14.00 $0.17 128k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4.1 $2.00 $8.00 $0.50 1048k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4.1-2025-04-14 $2.00 $8.00 $0.50 1048k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
o3 $2.00 $8.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
o3-2025-04-16 $2.00 $8.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4o $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-2024-08-06 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-2024-11-20 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-audio-preview $2.50 $10.00 128k 🔧 tools
gpt-4o-audio-preview-2024-12-17 $2.50 $10.00 128k 🔧 tools
gpt-4o-audio-preview-2025-06-03 $2.50 $10.00 128k 🔧 tools
gpt-audio $2.50 $10.00 128k 🔧 tools
gpt-audio-1.5 $2.50 $10.00 128k 🔧 tools
gpt-audio-2025-08-28 $2.50 $10.00 128k 🔧 tools
gpt-4o-search-preview $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4o-search-preview-2025-03-11 $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-5.4 $2.50 $15.00 $0.25 1050k 👁️ vision · 🔧 tools · 💾 cache
gpt-5.4-2026-03-05 $2.50 $15.00 $0.25 1050k 👁️ vision · 🔧 tools · 💾 cache
ft:gpt-3.5-turbo $3.00 $6.00 16k
ft:gpt-3.5-turbo-0125 $3.00 $6.00 16k
ft:gpt-3.5-turbo-0613 $3.00 $6.00 4k
ft:gpt-3.5-turbo-1106 $3.00 $6.00 16k
ft:gpt-4.1-2025-04-14 $3.00 $12.00 $0.75 1048k 🔧 tools · 💾 cache
gpt-3.5-turbo-16k $3.00 $4.00 16k 💾 cache
ft:gpt-4o-2024-08-06 $3.75 $15.00 $1.88 128k 👁️ vision · 🔧 tools · 💾 cache
ft:gpt-4o-2024-11-20 $3.75 $15.00 128k 🔧 tools · 💾 cache
ft:o4-mini-2025-04-16 $4.00 $16.00 $1.00 200k 🔧 tools · 💾 cache
gpt-realtime $4.00 $16.00 $0.40 32k 🔧 tools
gpt-realtime-1.5 $4.00 $16.00 $0.40 32k 🔧 tools
gpt-realtime-2 $4.00 $16.00 $0.40 32k 🔧 tools
gpt-realtime-2025-08-28 $4.00 $16.00 $0.40 32k 🔧 tools
chatgpt-4o-latest $5.00 $15.00 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-2024-05-13 $5.00 $15.00 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4o-realtime-preview $5.00 $20.00 $2.50 128k 🔧 tools
gpt-4o-realtime-preview-2024-12-17 $5.00 $20.00 $2.50 128k 🔧 tools
gpt-4o-realtime-preview-2025-06-03 $5.00 $20.00 $2.50 128k 🔧 tools
gpt-5.5 $5.00 $30.00 $0.50 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-5.5-2026-04-23 $5.00 $30.00 $0.50 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gpt-4-0125-preview $10.00 $30.00 128k 🔧 tools · 💾 cache
gpt-4-1106-preview $10.00 $30.00 128k 🔧 tools · 💾 cache
gpt-4-turbo $10.00 $30.00 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4-turbo-2024-04-09 $10.00 $30.00 128k 👁️ vision · 🔧 tools · 💾 cache
gpt-4-turbo-preview $10.00 $30.00 128k 🔧 tools · 💾 cache
o1 $15.00 $60.00 $7.50 200k 👁️ vision · 🔧 tools · 💾 cache
o1-2024-12-17 $15.00 $60.00 $7.50 200k 👁️ vision · 🔧 tools · 💾 cache
ft:gpt-4-0613 $30.00 $60.00 8k 🔧 tools
gpt-4 $30.00 $60.00 8k 🔧 tools · 💾 cache
gpt-4-0314 $30.00 $60.00 8k
gpt-4-0613 $30.00 $60.00 8k 🔧 tools · 💾 cache

vercel ai gateway (95)

Model Input $/M Output $/M Cached $/M Context Features
vercel_ai_gateway/amazon/titan-embed-text-v2 $0.02 $0.0000
vercel_ai_gateway/amazon/nova-micro $0.04 $0.14 128k 🔧 tools
vercel_ai_gateway/mistral/ministral-3b $0.04 $0.04 128k 🔧 tools
vercel_ai_gateway/meta/llama-3-8b $0.05 $0.08 8k
vercel_ai_gateway/meta/llama-3.1-8b $0.05 $0.08 131k 🔧 tools
vercel_ai_gateway/amazon/nova-lite $0.06 $0.24 300k 👁️ vision · 🔧 tools
vercel_ai_gateway/mistral/devstral-small $0.07 $0.28 128k 🔧 tools
vercel_ai_gateway/google/gemini-2.0-flash-lite $0.07 $0.30 1049k 👁️ vision · 🔧 tools
vercel_ai_gateway/alibaba/qwen-3-14b $0.08 $0.24 41k
vercel_ai_gateway/alibaba/qwen-3-30b $0.10 $0.30 41k
vercel_ai_gateway/alibaba/qwen-3-32b $0.10 $0.30 41k 🔧 tools
vercel_ai_gateway/meta/llama-3.2-1b $0.10 $0.10 128k
vercel_ai_gateway/meta/llama-4-scout $0.10 $0.30 131k 👁️ vision · 🔧 tools
vercel_ai_gateway/mistral/ministral-8b $0.10 $0.10 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/mistral/mistral-embed $0.10 $0.0000
vercel_ai_gateway/mistral/mistral-small $0.10 $0.30 32k 🔧 tools
vercel_ai_gateway/openai/gpt-4.1-nano $0.10 $0.40 $0.02 1048k 👁️ vision · 🔧 tools
vercel_ai_gateway/cohere/embed-v4.0 $0.12 $0.0000
vercel_ai_gateway/cohere/command-r $0.15 $0.60 128k 🔧 tools
vercel_ai_gateway/google/gemini-2.0-flash $0.15 $0.60 1049k 👁️ vision · 🔧 tools
vercel_ai_gateway/meta/llama-3.2-3b $0.15 $0.15 128k 🔧 tools
vercel_ai_gateway/mistral/codestral-embed $0.15 $0.0000
vercel_ai_gateway/mistral/pixtral-12b $0.15 $0.15 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/openai/gpt-4o-mini $0.15 $0.60 $0.07 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/meta/llama-3.2-11b $0.16 $0.16 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/alibaba/qwen-3-235b $0.20 $0.60 41k
vercel_ai_gateway/google/gemma-2-9b $0.20 $0.20 8k 👁️ vision · 🔧 tools
vercel_ai_gateway/meta/llama-4-maverick $0.20 $0.60 131k
vercel_ai_gateway/zai/glm-4.5-air $0.20 $1.10 128k 🔧 tools
vercel_ai_gateway/anthropic/claude-3-haiku $0.25 $1.25 $0.03 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/inception/mercury-coder-small $0.25 $1.00 32k
vercel_ai_gateway/google/gemini-2.5-flash $0.30 $2.50 1000k 👁️ vision · 🔧 tools
vercel_ai_gateway/mistral/codestral $0.30 $0.90 256k 🔧 tools
vercel_ai_gateway/xai/grok-3-mini $0.30 $0.50 131k 🔧 tools
vercel_ai_gateway/alibaba/qwen3-coder $0.40 $1.60 262k 🔧 tools
vercel_ai_gateway/openai/gpt-4.1-mini $0.40 $1.60 $0.10 1048k 👁️ vision · 🔧 tools
vercel_ai_gateway/zai/glm-4.6 $0.45 $1.80 $0.11 200k 🔧 tools
vercel_ai_gateway/mistral/magistral-small $0.50 $1.50 128k 🔧 tools
vercel_ai_gateway/openai/gpt-3.5-turbo $0.50 $1.50 16k 🔧 tools
vercel_ai_gateway/deepseek/deepseek-r1 $0.55 $2.19 128k
vercel_ai_gateway/moonshotai/kimi-k2 $0.55 $2.20 131k 🔧 tools
vercel_ai_gateway/meta/llama-3-70b $0.59 $0.79 8k
vercel_ai_gateway/xai/grok-3-mini-fast $0.60 $4.00 131k 🔧 tools
vercel_ai_gateway/zai/glm-4.5 $0.60 $2.20 131k 🔧 tools
vercel_ai_gateway/meta/llama-3.1-70b $0.72 $0.72 128k
vercel_ai_gateway/meta/llama-3.2-90b $0.72 $0.72 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/meta/llama-3.3-70b $0.72 $0.72 128k 🔧 tools
vercel_ai_gateway/deepseek/deepseek-r1-distill-llama-70b $0.75 $0.99 131k 🔧 tools
vercel_ai_gateway/mistral/mistral-saba-24b $0.79 $0.79 33k
vercel_ai_gateway/amazon/nova-pro $0.80 $3.20 300k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-3.5-haiku $0.80 $4.00 $0.08 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/morph/morph-v3-fast $0.80 $1.20 33k
vercel_ai_gateway/deepseek/deepseek-v3 $0.90 $0.90 128k
vercel_ai_gateway/morph/morph-v3-large $0.90 $1.90 33k
vercel_ai_gateway/anthropic/claude-haiku-4.5 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/perplexity/sonar $1.00 $1.00 127k
vercel_ai_gateway/perplexity/sonar-reasoning $1.00 $5.00 127k
vercel_ai_gateway/openai/o3-mini $1.10 $4.40 $0.55 200k 🔧 tools
vercel_ai_gateway/openai/o4-mini $1.10 $4.40 $0.28 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/mistral/mixtral-8x22b-instruct $1.20 $1.20 66k 🔧 tools
vercel_ai_gateway/openai/gpt-3.5-turbo-instruct $1.50 $2.00 8k
vercel_ai_gateway/mistral/magistral-medium $2.00 $5.00 128k 🔧 tools
vercel_ai_gateway/mistral/mistral-large $2.00 $6.00 32k 🔧 tools
vercel_ai_gateway/mistral/pixtral-large $2.00 $6.00 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/openai/gpt-4.1 $2.00 $8.00 $0.50 1048k 👁️ vision · 🔧 tools
vercel_ai_gateway/openai/o3 $2.00 $8.00 $0.50 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/perplexity/sonar-reasoning-pro $2.00 $8.00 127k
vercel_ai_gateway/xai/grok-2 $2.00 $10.00 131k 🔧 tools
vercel_ai_gateway/xai/grok-2-vision $2.00 $10.00 33k 👁️ vision · 🔧 tools
vercel_ai_gateway/cohere/command-a $2.50 $10.00 256k 🔧 tools
vercel_ai_gateway/cohere/command-r-plus $2.50 $10.00 128k 🔧 tools
vercel_ai_gateway/google/gemini-2.5-pro $2.50 $10.00 1049k 👁️ vision · 🔧 tools
vercel_ai_gateway/openai/gpt-4o $2.50 $10.00 $1.25 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-3.5-sonnet $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-3.7-sonnet $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-4-sonnet $3.00 $15.00 $0.30 200k 🔧 tools
vercel_ai_gateway/anthropic/claude-3-5-sonnet $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/anthropic/claude-3-5-sonnet-20241022 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/anthropic/claude-3-7-sonnet $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/anthropic/claude-sonnet-4 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/anthropic/claude-sonnet-4.5 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/perplexity/sonar-pro $3.00 $15.00 200k
vercel_ai_gateway/vercel/v0-1.0-md $3.00 $15.00 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/vercel/v0-1.5-md $3.00 $15.00 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/xai/grok-3 $3.00 $15.00 131k 🔧 tools
vercel_ai_gateway/xai/grok-4 $3.00 $15.00 256k 🔧 tools
vercel_ai_gateway/anthropic/claude-opus-4.5 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/anthropic/claude-opus-4.6 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/xai/grok-3-fast $5.00 $25.00 131k 🔧 tools
vercel_ai_gateway/openai/gpt-4-turbo $10.00 $30.00 128k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-3-opus $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-4-opus $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
vercel_ai_gateway/anthropic/claude-opus-4 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/anthropic/claude-opus-4.1 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
vercel_ai_gateway/openai/o1 $15.00 $60.00 $7.50 200k 👁️ vision · 🔧 tools

openrouter (92)

Model Input $/M Output $/M Cached $/M Context Features
openrouter/openrouter/auto $0.0000 $0.0000 2000k 👁️ vision · 🔧 tools
openrouter/openrouter/free $0.0000 $0.0000 200k 👁️ vision · 🔧 tools
openrouter/openrouter/bodybuilder $0.0000 $0.0000 128k
openrouter/openai/gpt-oss-20b $0.02 $0.10 131k 🔧 tools
openrouter/openai/gpt-5-nano $0.05 $0.40 $0.0050 272k
openrouter/z-ai/glm-4.7-flash $0.07 $0.40 $0.0000 200k 👁️ vision · 🔧 tools
openrouter/qwen/qwen3-235b-a22b-2507 $0.07 $0.10 262k 🔧 tools
openrouter/xiaomi/mimo-v2-flash $0.09 $0.29 $0.0000 262k 🔧 tools
openrouter/bytedance/ui-tars-1.5-7b $0.10 $0.20 131k
openrouter/google/gemini-2.0-flash-001 $0.10 $0.40 1049k 👁️ vision · 🔧 tools
openrouter/mistralai/ministral-3b-2512 $0.10 $0.10 131k 👁️ vision · 🔧 tools
openrouter/mistralai/mistral-small-3.1-24b-instruct $0.10 $0.30 131k
openrouter/mistralai/mistral-small-3.2-24b-instruct $0.10 $0.30 128k
openrouter/openai/gpt-4.1-nano $0.10 $0.40 $0.02 1048k 👁️ vision · 🔧 tools · 💾 cache
openrouter/qwen/qwen3.5-flash-02-23 $0.10 $0.40 1000k 👁️ vision · 🔧 tools
openrouter/qwen/qwen3-235b-a22b-thinking-2507 $0.11 $0.60 262k 🔧 tools
openrouter/mistralai/mistral-7b-instruct $0.13 $0.13 33k
openrouter/deepseek/deepseek-chat $0.14 $0.28 66k 💾 cache
openrouter/deepseek/deepseek-chat-v3-0324 $0.14 $0.28 66k 💾 cache
openrouter/mistralai/devstral-2512 $0.15 $0.60 262k 🔧 tools
openrouter/mistralai/ministral-8b-2512 $0.15 $0.15 262k 👁️ vision · 🔧 tools
openrouter/openai/gpt-oss-120b $0.18 $0.80 131k 🔧 tools
openrouter/qwen/qwen-2.5-coder-32b-instruct $0.18 $0.18 34k
openrouter/deepseek/deepseek-chat-v3.1 $0.20 $0.80 164k 🔧 tools · 💾 cache
openrouter/deepseek/deepseek-v3.2-exp $0.20 $0.40 164k 🔧 tools · 💾 cache
openrouter/mistralai/ministral-14b-2512 $0.20 $0.20 262k 👁️ vision · 🔧 tools
openrouter/qwen/qwen-vl-plus $0.21 $0.63 8k 👁️ vision
openrouter/qwen/qwen3-coder $0.22 $0.95 262k 🔧 tools
openrouter/anthropic/claude-3-haiku $0.25 $1.25 200k 👁️ vision · 🔧 tools
openrouter/google/gemini-3.1-flash-lite-preview $0.25 $1.50 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
openrouter/openai/gpt-5-mini $0.25 $2.00 $0.02 272k
openrouter/qwen/qwen3.5-35b-a3b $0.25 $2.00 262k 👁️ vision · 🔧 tools
openrouter/minimax/minimax-m2 $0.26 $1.02 205k 🔧 tools · 💾 cache
openrouter/minimax/minimax-m2.1 $0.27 $1.20 $0.0000 204k 👁️ vision · 🔧 tools
openrouter/deepseek/deepseek-v3.2 $0.28 $0.40 164k 🔧 tools · 💾 cache
openrouter/google/gemini-2.5-flash $0.30 $2.50 1049k 👁️ vision · 🔧 tools
openrouter/qwen/qwen3.5-27b $0.30 $2.40 262k 👁️ vision · 🔧 tools
openrouter/minimax/minimax-m2.5 $0.30 $1.10 $0.15 197k 🔧 tools · 💾 cache
openrouter/qwen/qwen3.6-plus $0.33 $1.95 1000k 👁️ vision · 🔧 tools
openrouter/openai/gpt-4.1-mini $0.40 $1.60 $0.10 1048k 👁️ vision · 🔧 tools · 💾 cache
openrouter/qwen/qwen3.5-122b-a10b $0.40 $2.00 262k 👁️ vision · 🔧 tools
openrouter/qwen/qwen3.5-plus-02-15 $0.40 $2.40 1000k 👁️ vision · 🔧 tools
openrouter/z-ai/glm-4.6 $0.40 $1.75 203k 🔧 tools · 💾 cache
openrouter/z-ai/glm-4.7 $0.40 $1.50 $0.0000 203k 👁️ vision · 🔧 tools
openrouter/z-ai/glm-4.6:exacto $0.45 $1.90 203k 🔧 tools · 💾 cache
openrouter/deepseek/deepseek-r1-0528 $0.50 $2.15 65k 🔧 tools · 💾 cache
openrouter/google/gemini-3-flash-preview $0.50 $3.00 $0.05 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
openrouter/mistralai/mistral-large-2512 $0.50 $1.50 262k 👁️ vision · 🔧 tools
openrouter/deepseek/deepseek-r1 $0.55 $2.19 65k 🔧 tools · 💾 cache
openrouter/meta-llama/llama-3-70b-instruct $0.59 $0.79 8k
openrouter/moonshotai/kimi-k2.5 $0.60 $3.00 $0.10 262k 👁️ vision · 🔧 tools
openrouter/qwen/qwen3.5-397b-a17b $0.60 $3.60 262k 👁️ vision · 🔧 tools
openrouter/mistralai/mixtral-8x22b-instruct $0.65 $0.65 66k
openrouter/z-ai/glm-5 $0.80 $2.56 203k 🔧 tools
openrouter/switchpoint/router $0.85 $3.40 131k
openrouter/anthropic/claude-haiku-4.5 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
openrouter/qwen/qwen3-coder-plus $1.00 $5.00 998k 🔧 tools
openrouter/openai/o3-mini $1.10 $4.40 128k 🔧 tools
openrouter/openai/o3-mini-high $1.10 $4.40 128k 🔧 tools
openrouter/google/gemini-2.5-pro $1.25 $10.00 1049k 👁️ vision · 🔧 tools
openrouter/openai/gpt-5-chat $1.25 $10.00 $0.13 128k
openrouter/openai/gpt-5-codex $1.25 $10.00 $0.13 272k
openrouter/openai/gpt-5 $1.25 $10.00 $0.13 272k
openrouter/openai/gpt-5.1-codex-max $1.25 $10.00 $0.13 400k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-3.5-turbo $1.50 $2.00 16k
openrouter/openai/gpt-5.2-codex $1.75 $14.00 $0.17 272k
openrouter/openai/gpt-5.2 $1.75 $14.00 $0.17 272k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-5.2-chat $1.75 $14.00 $0.17 128k 👁️ vision · 🔧 tools · 💾 cache
openrouter/gryphe/mythomax-l2-13b $1.88 $1.88 8k
openrouter/undi95/remm-slerp-l2-13b $1.88 $1.88 6k
openrouter/google/gemini-3-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
openrouter/google/gemini-3.1-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-4.1 $2.00 $8.00 $0.50 1048k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-4o $2.50 $10.00 128k 👁️ vision · 🔧 tools
openrouter/anthropic/claude-3.5-sonnet $3.00 $15.00 200k 👁️ vision · 🔧 tools
openrouter/anthropic/claude-3.7-sonnet $3.00 $15.00 200k 👁️ vision · 🔧 tools
openrouter/anthropic/claude-sonnet-4 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
openrouter/anthropic/claude-sonnet-4.6 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
openrouter/anthropic/claude-sonnet-4.5 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-3.5-turbo-16k $3.00 $4.00 16k
openrouter/x-ai/grok-4 $3.00 $15.00 256k 🔧 tools · 🌐 search
openrouter/anthropic/claude-opus-4.5 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
openrouter/anthropic/claude-opus-4.6 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
openrouter/anthropic/claude-opus-4.7 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-4o-2024-05-13 $5.00 $15.00 128k 👁️ vision · 🔧 tools
openrouter/mancer/weaver $5.63 $5.63 8k
openrouter/mistralai/mistral-large $8.00 $24.00 128k
openrouter/anthropic/claude-opus-4 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
openrouter/anthropic/claude-opus-4.1 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/o1 $15.00 $60.00 $7.50 200k 👁️ vision · 🔧 tools · 💾 cache
openrouter/openai/gpt-5.2-pro $21.00 $168.00 272k 👁️ vision · 🔧 tools
openrouter/openai/gpt-4 $30.00 $60.00 8k

novita (80)

Model Input $/M Output $/M Cached $/M Context Features
novita/paddlepaddle/paddleocr-vl $0.02 $0.02 16k 👁️ vision
novita/meta-llama/llama-3.1-8b-instruct $0.02 $0.05 16k
novita/deepseek/deepseek-ocr $0.03 $0.03 8k 👁️ vision
novita/qwen/qwen3-4b-fp8 $0.03 $0.03 128k
novita/meta-llama/llama-3.2-3b-instruct $0.03 $0.05 33k 🔧 tools
novita/zai-org/autoglm-phone-9b-multilingual $0.04 $0.14 66k 👁️ vision
novita/qwen/qwen3-8b-fp8 $0.04 $0.14 128k
novita/openai/gpt-oss-20b $0.04 $0.15 131k 👁️ vision
novita/mistralai/mistral-nemo $0.04 $0.17 60k
novita/meta-llama/llama-3-8b-instruct $0.04 $0.04 8k
novita/openai/gpt-oss-120b $0.05 $0.25 131k 👁️ vision · 🔧 tools
novita/google/gemma-3-12b-it $0.05 $0.10 131k 👁️ vision
novita/sao10k/l3-8b-lunaris $0.05 $0.05 8k
novita/Sao10K/L3-8B-Stheno-v3.2 $0.05 $0.05 8k 🔧 tools
novita/deepseek/deepseek-r1-0528-qwen3-8b $0.06 $0.09 128k
novita/qwen/qwen3-coder-30b-a3b-instruct $0.07 $0.27 160k 🔧 tools
novita/baidu/ernie-4.5-21B-a3b-thinking $0.07 $0.28 131k
novita/baichuan/baichuan-m2-32b $0.07 $0.07 131k
novita/baidu/ernie-4.5-21B-a3b $0.07 $0.28 120k 🔧 tools
novita/qwen/qwen2.5-7b-instruct $0.07 $0.07 32k 🔧 tools
novita/qwen/qwen3-vl-8b-instruct $0.08 $0.50 131k 👁️ vision · 🔧 tools
novita/qwen/qwen3-235b-a22b-instruct-2507 $0.09 $0.58 131k 🔧 tools
novita/qwen/qwen3-30b-a3b-fp8 $0.09 $0.45 41k
novita/gryphe/mythomax-l2-13b $0.09 $0.09 4k
novita/xiaomimimo/mimo-v2-flash $0.10 $0.30 $0.02 262k 🔧 tools
novita/qwen/qwen3-32b-fp8 $0.10 $0.45 41k
novita/google/gemma-3-27b-it $0.12 $0.20 98k 👁️ vision
novita/zai-org/glm-4.5-air $0.13 $0.85 131k 🔧 tools
novita/meta-llama/llama-3.3-70b-instruct $0.14 $0.40 131k 🔧 tools
novita/nousresearch/hermes-2-pro-llama-3-8b $0.14 $0.14 8k
novita/baidu/ernie-4.5-vl-28b-a3b $0.14 $0.56 30k 👁️ vision · 🔧 tools
novita/qwen/qwen3-next-80b-a3b-instruct $0.15 $1.50 131k 🔧 tools
novita/qwen/qwen3-next-80b-a3b-thinking $0.15 $1.50 131k 🔧 tools
novita/deepseek/deepseek-r1-distill-qwen-14b $0.15 $0.15 33k
novita/meta-llama/llama-4-scout-17b-16e-instruct $0.18 $0.59 131k 👁️ vision
novita/skywork/r1v4-lite $0.20 $0.60 262k 👁️ vision
novita/qwen/qwen3-235b-a22b-fp8 $0.20 $0.80 41k
novita/qwen/qwen3-vl-30b-a3b-instruct $0.20 $0.70 131k 👁️ vision · 🔧 tools
novita/qwen/qwen3-vl-30b-a3b-thinking $0.20 $1.00 131k 👁️ vision · 🔧 tools
novita/qwen/qwen3-omni-30b-a3b-thinking $0.25 $0.97 66k 👁️ vision · 🔧 tools
novita/qwen/qwen3-omni-30b-a3b-instruct $0.25 $0.97 66k 👁️ vision · 🔧 tools
novita/qwen/qwen-mt-plus $0.25 $0.75 16k
novita/deepseek/deepseek-v3.2 $0.27 $0.40 $0.13 164k 🔧 tools
novita/deepseek/deepseek-v3.2-exp $0.27 $0.41 164k 🔧 tools
novita/deepseek/deepseek-v3.1-terminus $0.27 $1.00 $0.14 131k 🔧 tools
novita/deepseek/deepseek-v3.1 $0.27 $1.00 $0.14 131k 🔧 tools
novita/deepseek/deepseek-v3-0324 $0.27 $1.12 $0.14 164k 🔧 tools
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 $0.27 $0.85 1049k 👁️ vision
novita/baidu/ernie-4.5-300b-a47b-paddle $0.28 $1.10 123k
novita/minimax/minimax-m2.1 $0.30 $1.20 $0.03 205k 🔧 tools
novita/minimax/minimax-m2 $0.30 $1.20 $0.03 205k 🔧 tools
novita/zai-org/glm-4.6v $0.30 $0.90 $0.06 131k 👁️ vision · 🔧 tools
novita/kwaipilot/kat-coder-pro $0.30 $1.20 $0.06 256k 🔧 tools
novita/qwen/qwen3-vl-235b-a22b-instruct $0.30 $1.50 131k 👁️ vision · 🔧 tools
novita/qwen/qwen3-coder-480b-a35b-instruct $0.30 $1.30 262k 🔧 tools
novita/qwen/qwen3-235b-a22b-thinking-2507 $0.30 $3.00 131k 🔧 tools
novita/deepseek/deepseek-r1-distill-qwen-32b $0.30 $0.30 64k
novita/qwen/qwen-2.5-72b-instruct $0.38 $0.40 32k 🔧 tools
novita/baidu/ernie-4.5-vl-28b-a3b-thinking $0.39 $0.39 131k 👁️ vision · 🔧 tools
novita/deepseek/deepseek-v3-turbo $0.40 $1.30 64k 🔧 tools
novita/baidu/ernie-4.5-vl-424b-a47b $0.42 $1.25 123k 👁️ vision
novita/meta-llama/llama-3-70b-instruct $0.51 $0.74 8k
novita/zai-org/glm-4.6 $0.55 $2.20 $0.11 205k 🔧 tools
novita/minimaxai/minimax-m1-80k $0.55 $2.20 1000k 🔧 tools
novita/moonshotai/kimi-k2-instruct $0.57 $2.30 131k 🔧 tools
novita/zai-org/glm-4.7 $0.60 $2.20 $0.11 205k 🔧 tools
novita/moonshotai/kimi-k2-thinking $0.60 $2.50 262k 🔧 tools
novita/moonshotai/kimi-k2-0905 $0.60 $2.50 262k 🔧 tools
novita/zai-org/glm-4.5 $0.60 $2.20 $0.11 131k 🔧 tools
novita/zai-org/glm-4.5v $0.60 $1.80 $0.11 66k 👁️ vision · 🔧 tools
novita/microsoft/wizardlm-2-8x22b $0.62 $0.62 66k
novita/deepseek/deepseek-r1-0528 $0.70 $2.50 $0.35 164k 🔧 tools
novita/deepseek/deepseek-prover-v2-671b $0.70 $2.50 160k
novita/deepseek/deepseek-r1-turbo $0.70 $2.50 64k 🔧 tools
novita/deepseek/deepseek-r1-distill-llama-70b $0.80 $0.80 8k
novita/qwen/qwen2.5-vl-72b-instruct $0.80 $0.80 33k 👁️ vision
novita/qwen/qwen3-vl-235b-a22b-thinking $0.98 $3.95 131k 👁️ vision
novita/sao10k/l3-70b-euryale-v2.1 $1.48 $1.48 8k 🔧 tools
novita/sao10k/l31-70b-euryale-v2.2 $1.48 $1.48 8k 🔧 tools
novita/qwen/qwen3-max $2.11 $8.45 262k 🔧 tools

deepinfra (67)

Model Input $/M Output $/M Cached $/M Context Features
deepinfra/meta-llama/Llama-3.2-3B-Instruct $0.02 $0.02 131k 🔧 tools
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo $0.02 $0.03 131k 🔧 tools
deepinfra/mistralai/Mistral-Nemo-Instruct-2407 $0.02 $0.04 131k 🔧 tools
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct $0.03 $0.06 8k 🔧 tools
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct $0.03 $0.05 131k 🔧 tools
deepinfra/Qwen/Qwen2.5-7B-Instruct $0.04 $0.10 33k
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo $0.04 $0.05 8k
deepinfra/google/gemma-3-4b-it $0.04 $0.08 131k 🔧 tools
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2 $0.04 $0.16 131k 🔧 tools
deepinfra/openai/gpt-oss-20b $0.04 $0.15 131k 🔧 tools
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct $0.05 $0.05 131k
deepinfra/google/gemma-3-12b-it $0.05 $0.10 131k 🔧 tools
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501 $0.05 $0.08 33k 🔧 tools
deepinfra/openai/gpt-oss-120b $0.05 $0.45 131k 🔧 tools
deepinfra/meta-llama/Llama-Guard-3-8B $0.06 $0.06 131k
deepinfra/Qwen/Qwen3-14B $0.06 $0.24 41k 🔧 tools
deepinfra/microsoft/phi-4 $0.07 $0.14 16k 🔧 tools
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506 $0.07 $0.20 128k 🔧 tools
deepinfra/Gryphe/MythoMax-L2-13b $0.08 $0.09 4k 🔧 tools
deepinfra/Qwen/Qwen3-30B-A3B $0.08 $0.29 41k 🔧 tools
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct $0.08 $0.30 328k 🔧 tools
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507 $0.09 $0.60 262k 🔧 tools
deepinfra/google/gemma-3-27b-it $0.09 $0.16 131k 🔧 tools
deepinfra/Qwen/Qwen3-32B $0.10 $0.28 41k 🔧 tools
deepinfra/google/gemini-2.0-flash-001 $0.10 $0.40 1000k 🔧 tools
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo $0.10 $0.28 131k 🔧 tools
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 $0.10 $0.40 131k 🔧 tools
deepinfra/Qwen/Qwen2.5-72B-Instruct $0.12 $0.39 33k 🔧 tools
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo $0.13 $0.39 131k 🔧 tools
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct $0.14 $1.40 262k 🔧 tools
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking $0.14 $1.40 262k 🔧 tools
deepinfra/Qwen/QwQ-32B $0.15 $0.40 131k 🔧 tools
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 $0.15 $0.60 1049k 🔧 tools
deepinfra/Qwen/Qwen3-235B-A22B $0.18 $0.54 41k 🔧 tools
deepinfra/meta-llama/Llama-Guard-4-12B $0.18 $0.18 164k
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct $0.20 $0.60 128k 👁️ vision · 🔧 tools
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.20 $0.60 131k
deepinfra/meta-llama/Llama-3.3-70B-Instruct $0.23 $0.40 131k 🔧 tools
deepinfra/deepseek-ai/DeepSeek-V3-0324 $0.25 $0.88 164k 🔧 tools
deepinfra/allenai/olmOCR-7B-0725-FP8 $0.27 $1.50 16k
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B $0.27 $0.27 131k 🔧 tools
deepinfra/deepseek-ai/DeepSeek-V3.1 $0.27 $1.00 $0.22 164k 🔧 tools
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus $0.27 $1.00 $0.22 164k 🔧 tools
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo $0.29 $1.20 262k 🔧 tools
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B $0.30 $0.30 131k
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507 $0.30 $2.90 262k 🔧 tools
deepinfra/google/gemini-2.5-flash $0.30 $2.50 1000k 🔧 tools
deepinfra/deepseek-ai/DeepSeek-V3 $0.38 $0.89 164k 🔧 tools
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct $0.40 $1.60 262k 🔧 tools
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct $0.40 $0.40 131k 🔧 tools
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 $0.40 $0.40 33k 🔧 tools
deepinfra/zai-org/GLM-4.5 $0.40 $1.60 131k 🔧 tools
deepinfra/microsoft/WizardLM-2-8x22B $0.48 $0.48 66k
deepinfra/deepseek-ai/DeepSeek-R1-0528 $0.50 $2.15 $0.40 164k 🔧 tools
deepinfra/moonshotai/Kimi-K2-Instruct $0.50 $2.00 131k 🔧 tools
deepinfra/moonshotai/Kimi-K2-Instruct-0905 $0.50 $2.00 $0.40 262k 🔧 tools
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct $0.60 $0.60 131k 🔧 tools
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2 $0.65 $0.75 131k
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3 $0.65 $0.75 131k
deepinfra/deepseek-ai/DeepSeek-R1 $0.70 $2.40 164k 🔧 tools
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B $1.00 $1.00 131k 🔧 tools
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo $1.00 $3.00 33k 🔧 tools
deepinfra/deepseek-ai/DeepSeek-R1-Turbo $1.00 $3.00 41k 🔧 tools
deepinfra/google/gemini-2.5-pro $1.25 $10.00 1000k 🔧 tools
deepinfra/anthropic/claude-3-7-sonnet-latest $3.30 $16.50 $0.33 200k 🔧 tools
deepinfra/anthropic/claude-4-sonnet $3.30 $16.50 200k 🔧 tools
deepinfra/anthropic/claude-4-opus $16.50 $82.50 200k 🔧 tools

azure ai (66)

Model Input $/M Output $/M Cached $/M Context Features
azure_ai/ministral-3b $0.04 $0.04 128k 🔧 tools
azure_ai/Phi-4-mini-instruct $0.07 $0.30 131k 🔧 tools
azure_ai/Phi-4-multimodal-instruct $0.08 $0.32 131k 👁️ vision · 🔧 tools
azure_ai/Phi-4-mini-reasoning $0.08 $0.32 131k 🔧 tools
azure_ai/mistral-small-2503 $0.10 $0.30 128k 👁️ vision · 🔧 tools
azure_ai/Phi-4 $0.13 $0.50 16k 🔧 tools
azure_ai/Phi-4-reasoning $0.13 $0.50 33k 🔧 tools
azure_ai/Phi-3-mini-128k-instruct $0.13 $0.52 128k
azure_ai/Phi-3-mini-4k-instruct $0.13 $0.52 4k
azure_ai/Phi-3.5-mini-instruct $0.13 $0.52 128k
azure_ai/Phi-3.5-vision-instruct $0.13 $0.52 128k 👁️ vision
azure_ai/model_router $0.14 $0.0000
azure_ai/gpt-oss-120b $0.15 $0.60 131k 🔧 tools
azure_ai/Phi-3-small-128k-instruct $0.15 $0.60 128k
azure_ai/Phi-3-small-8k-instruct $0.15 $0.60 8k
azure_ai/mistral-nemo $0.15 $0.15 131k 🔧 tools
azure_ai/Phi-3.5-MoE-instruct $0.16 $0.64 128k
azure_ai/Phi-3-medium-128k-instruct $0.17 $0.68 128k
azure_ai/Phi-3-medium-4k-instruct $0.17 $0.68 4k
azure_ai/gpt-5.4-nano $0.20 $1.25 $0.02 400k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure_ai/gpt-5.4-nano-2026-03-17 $0.20 $1.25 $0.02 400k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure_ai/Llama-4-Scout-17B-16E-Instruct $0.20 $0.78 10000k 👁️ vision · 🔧 tools
azure_ai/grok-4-fast-non-reasoning $0.20 $0.50 131k 🔧 tools · 🌐 search
azure_ai/grok-4-fast-reasoning $0.20 $0.50 131k 🔧 tools · 🌐 search
azure_ai/grok-4-1-fast-non-reasoning $0.20 $0.50 131k 🔧 tools · 🌐 search
azure_ai/grok-4-1-fast-reasoning $0.20 $0.50 131k 🔧 tools · 🌐 search
azure_ai/grok-code-fast-1 $0.20 $1.50 131k 🔧 tools · 🌐 search
azure_ai/global/grok-3-mini $0.25 $1.27 131k 🔧 tools · 🌐 search
azure_ai/grok-3-mini $0.25 $1.27 131k 🔧 tools · 🌐 search
azure_ai/Meta-Llama-3.1-8B-Instruct $0.30 $0.61 128k
azure_ai/Llama-3.2-11B-Vision-Instruct $0.37 $0.37 128k 👁️ vision · 🔧 tools
azure_ai/mistral-medium-2505 $0.40 $2.00 131k 🔧 tools
azure_ai/jamba-instruct $0.50 $0.70 70k
azure_ai/mistral-large-3 $0.50 $1.50 256k 👁️ vision · 🔧 tools
azure_ai/deepseek-v3.2 $0.58 $1.68 164k 🔧 tools · 💾 cache
azure_ai/deepseek-v3.2-speciale $0.58 $1.68 164k 🔧 tools · 💾 cache
azure_ai/kimi-k2.5 $0.60 $3.00 262k 👁️ vision · 🔧 tools
azure_ai/Llama-3.3-70B-Instruct $0.71 $0.71 128k 🔧 tools
azure_ai/gpt-5.4-mini $0.75 $4.50 $0.07 400k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure_ai/gpt-5.4-mini-2026-03-17 $0.75 $4.50 $0.07 400k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure_ai/claude-haiku-4-5 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/mistral-small $1.00 $3.00 32k 🔧 tools
azure_ai/Meta-Llama-3-70B-Instruct $1.10 $0.37 8k
azure_ai/deepseek-v3 $1.14 $4.56 128k
azure_ai/deepseek-v3-0324 $1.14 $4.56 128k 🔧 tools
azure_ai/MAI-DS-R1 $1.35 $5.40 128k
azure_ai/deepseek-r1 $1.35 $5.40 128k
azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 $1.41 $0.35 1000k 👁️ vision · 🔧 tools
azure_ai/mistral-large-2407 $2.00 $6.00 128k 🔧 tools
azure_ai/mistral-large-latest $2.00 $6.00 128k 🔧 tools
azure_ai/Llama-3.2-90B-Vision-Instruct $2.04 $2.04 128k 👁️ vision · 🔧 tools
azure_ai/gpt-5.4 $2.50 $15.00 $0.25 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure_ai/gpt-5.4-2026-03-05 $2.50 $15.00 $0.25 1050k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
azure_ai/Meta-Llama-3.1-70B-Instruct $2.68 $3.54 128k
azure_ai/claude-sonnet-4-5 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/claude-sonnet-4-6 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/global/grok-3 $3.00 $15.00 131k 🔧 tools · 🌐 search
azure_ai/grok-3 $3.00 $15.00 131k 🔧 tools · 🌐 search
azure_ai/grok-4 $3.00 $15.00 131k 🔧 tools · 🌐 search
azure_ai/mistral-large $4.00 $12.00 32k 🔧 tools
azure_ai/claude-opus-4-5 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/claude-opus-4-6 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/claude-opus-4-7 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/Meta-Llama-3.1-405B-Instruct $5.33 $16.00 128k
azure_ai/claude-opus-4-1 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
azure_ai/jais-30b-chat $3200.00 $9710.00 8k

mistral (46)

Model Input $/M Output $/M Cached $/M Context Features
mistral/mistral-small-latest $0.06 $0.18 131k 👁️ vision · 🔧 tools
mistral/mistral-small-3-2-2506 $0.06 $0.18 131k 👁️ vision · 🔧 tools
mistral/devstral-small-2505 $0.10 $0.30 128k 🔧 tools
mistral/devstral-small-2507 $0.10 $0.30 128k 🔧 tools
mistral/devstral-small-latest $0.10 $0.30 256k 🔧 tools
mistral/labs-devstral-small-2512 $0.10 $0.30 256k 🔧 tools
mistral/mistral-small $0.10 $0.30 32k 🔧 tools
mistral/ministral-3-3b-2512 $0.10 $0.10 131k 👁️ vision · 🔧 tools
mistral/ministral-3-8b-2512 $0.15 $0.15 262k 👁️ vision · 🔧 tools
mistral/pixtral-12b-2409 $0.15 $0.15 128k 👁️ vision · 🔧 tools
mistral/ministral-3-14b-2512 $0.20 $0.20 262k 👁️ vision · 🔧 tools
mistral/codestral-mamba-latest $0.25 $0.25 256k
mistral/mistral-tiny $0.25 $0.25 32k
mistral/open-codestral-mamba $0.25 $0.25 256k
mistral/open-mistral-7b $0.25 $0.25 32k
mistral/codestral-2508 $0.30 $0.90 256k 🔧 tools
mistral/open-mistral-nemo $0.30 $0.30 128k
mistral/open-mistral-nemo-2407 $0.30 $0.30 128k
mistral/devstral-medium-2507 $0.40 $2.00 128k 🔧 tools
mistral/devstral-latest $0.40 $2.00 256k 🔧 tools
mistral/devstral-medium-latest $0.40 $2.00 256k 🔧 tools
mistral/devstral-2512 $0.40 $2.00 256k 🔧 tools
mistral/mistral-medium-2505 $0.40 $2.00 131k 🔧 tools
mistral/mistral-medium-latest $0.40 $2.00 131k 👁️ vision · 🔧 tools
mistral/mistral-medium-3-1-2508 $0.40 $2.00 131k 👁️ vision · 🔧 tools
mistral/magistral-small-2506 $0.50 $1.50 40k 🔧 tools
mistral/magistral-small-latest $0.50 $1.50 40k 🔧 tools
mistral/magistral-small-1-2-2509 $0.50 $1.50 40k 🔧 tools
mistral/mistral-large-latest $0.50 $1.50 262k 👁️ vision · 🔧 tools
mistral/mistral-large-3 $0.50 $1.50 262k 👁️ vision · 🔧 tools
mistral/mistral-large-2512 $0.50 $1.50 262k 👁️ vision · 🔧 tools
mistral/open-mixtral-8x7b $0.70 $0.70 32k 🔧 tools
mistral/codestral-2405 $1.00 $3.00 32k
mistral/codestral-latest $1.00 $3.00 32k
mistral/magistral-medium-2506 $2.00 $5.00 40k 🔧 tools
mistral/magistral-medium-2509 $2.00 $5.00 40k 🔧 tools
mistral/magistral-medium-1-2-2509 $2.00 $5.00 40k 🔧 tools
mistral/magistral-medium-latest $2.00 $5.00 40k 🔧 tools
mistral/mistral-large-2411 $2.00 $6.00 128k 🔧 tools
mistral/open-mixtral-8x22b $2.00 $6.00 65k 🔧 tools
mistral/pixtral-large-2411 $2.00 $6.00 128k 👁️ vision · 🔧 tools
mistral/pixtral-large-latest $2.00 $6.00 128k 👁️ vision · 🔧 tools
mistral/mistral-medium $2.70 $8.10 32k
mistral/mistral-medium-2312 $2.70 $8.10 32k
mistral/mistral-large-2407 $3.00 $9.00 128k 🔧 tools
mistral/mistral-large-2402 $4.00 $12.00 32k 🔧 tools

gemini (41)

Model Input $/M Output $/M Cached $/M Context Features
gemini/gemini-exp-1114 $0.0000 $0.0000 1049k 👁️ vision · 🔧 tools
gemini/gemini-exp-1206 $0.0000 $0.0000 2097k 👁️ vision · 🔧 tools
gemini/gemma-3-27b-it $0.0000 $0.0000 131k 👁️ vision · 🔧 tools
gemini/learnlm-1.5-pro-experimental $0.0000 $0.0000 33k 👁️ vision · 🔧 tools
gemini/lyria-3-clip-preview $0.0000 $0.0000 131k
gemini/lyria-3-pro-preview $0.0000 $0.0000 131k
gemini/gemini-2.0-flash-lite $0.07 $0.30 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.0-flash-lite-001 $0.07 $0.30 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.0-flash $0.10 $0.40 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.0-flash-001 $0.10 $0.40 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.5-flash-lite $0.10 $0.40 $0.01 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.5-flash-lite-preview-09-2025 $0.10 $0.40 $0.01 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-flash-lite-latest $0.10 $0.40 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.5-flash-lite-preview-06-17 $0.10 $0.40 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-flash-lite-latest $0.10 $0.40 $0.01 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-3.1-flash-lite-preview $0.25 $1.50 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-robotics-er-1.5-preview $0.30 $2.50 $0.0000 1049k 👁️ vision · 🔧 tools · 🌐 search
gemini/gemini-2.5-flash $0.30 $2.50 $0.03 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.5-flash-preview-09-2025 $0.30 $2.50 $0.07 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-flash-latest $0.30 $2.50 $0.07 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-flash-native-audio-latest $0.30 $2.50 1049k
gemini-2.5-flash-native-audio-preview-09-2025 $0.30 $2.50 1049k
gemini-2.5-flash-native-audio-preview-12-2025 $0.30 $2.50 1049k
gemini/gemini-2.5-flash-native-audio-latest $0.30 $2.50 1049k
gemini/gemini-2.5-flash-native-audio-preview-09-2025 $0.30 $2.50 1049k
gemini/gemini-2.5-flash-native-audio-preview-12-2025 $0.30 $2.50 1049k
gemini-flash-latest $0.30 $2.50 $0.03 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-exp-1206 $0.30 $2.50 $0.03 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-gemma-2-27b-it $0.35 $1.05 8k 👁️ vision · 🔧 tools
gemini/gemini-gemma-2-9b-it $0.35 $1.05 8k 👁️ vision · 🔧 tools
gemini/gemini-3-flash-preview $0.50 $3.00 $0.05 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-3.1-flash-live-preview $0.75 $4.50 131k 👁️ vision · 🔧 tools · 🌐 search
gemini/gemini-3.1-flash-live-preview $0.75 $4.50 131k 👁️ vision · 🔧 tools · 🌐 search
gemini/gemini-2.5-pro $1.25 $10.00 $0.13 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-2.5-computer-use-preview-10-2025 $1.25 $10.00 128k 👁️ vision · 🔧 tools
gemini/gemini-2.5-pro-preview-tts $1.25 $10.00 $0.13 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-pro-latest $1.25 $10.00 $0.13 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-pro-latest $1.25 $10.00 $0.13 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-3-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-3.1-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini/gemini-3.1-pro-preview-customtools $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search

replicate (40)

Model Input $/M Output $/M Cached $/M Context Features
replicate/ibm-granite/granite-3.3-8b-instruct $0.03 $0.25 🔧 tools
replicate/meta/llama-2-7b $0.05 $0.25 4k
replicate/meta/llama-2-7b-chat $0.05 $0.25 4k
replicate/meta/llama-3-8b $0.05 $0.25 8k
replicate/meta/llama-3-8b-instruct $0.05 $0.25 8k
replicate/mistralai/mistral-7b-instruct-v0.2 $0.05 $0.25 4k
replicate/mistralai/mistral-7b-v0.1 $0.05 $0.25 4k
replicate/openai/gpt-5-nano $0.05 $0.40 🔧 tools
replicate/openai/gpt-oss-20b $0.09 $0.36 🔧 tools
replicate/meta/llama-2-13b $0.10 $0.50 4k
replicate/meta/llama-2-13b-chat $0.10 $0.50 4k
replicate/openai/gpt-4.1-nano $0.10 $0.40 🔧 tools
replicate/openai/gpt-4o-mini $0.15 $0.60 👁️ vision · 🔧 tools
replicate/openai/gpt-oss-120b $0.18 $0.72 🔧 tools
replicate/openai/gpt-5-mini $0.25 $2.00 👁️ vision · 🔧 tools
replicate/qwen/qwen3-235b-a22b-instruct-2507 $0.26 $1.06 🔧 tools
replicate/mistralai/mixtral-8x7b-instruct-v0.1 $0.30 $1.00 4k
replicate/openai/gpt-4.1-mini $0.40 $1.60 👁️ vision · 🔧 tools
replicate/meta/llama-2-70b $0.65 $2.75 4k
replicate/meta/llama-2-70b-chat $0.65 $2.75 4k
replicate/meta/llama-3-70b $0.65 $2.75 8k
replicate/meta/llama-3-70b-instruct $0.65 $2.75 8k
replicate/deepseek-ai/deepseek-v3.1 $0.67 $2.02 164k 🔧 tools
replicate/anthropic/claude-4.5-haiku $1.00 $5.00 👁️ vision · 🔧 tools · 💾 cache
replicate/openai/o4-mini $1.00 $4.00
replicate/anthropic/claude-3.5-haiku $1.00 $5.00 👁️ vision · 🔧 tools · 💾 cache
replicate/openai/o1-mini $1.10 $4.40
replicate/openai/gpt-5 $1.25 $10.00 👁️ vision · 🔧 tools
replicate/deepseek-ai/deepseek-v3 $1.45 $1.45 66k 🔧 tools
replicate/google/gemini-3-pro $2.00 $12.00 👁️ vision · 🔧 tools
replicate/openai/gpt-4.1 $2.00 $8.00 👁️ vision · 🔧 tools
replicate/openai/gpt-4o $2.50 $10.00 👁️ vision · 🔧 tools
replicate/google/gemini-2.5-flash $2.50 $2.50 👁️ vision · 🔧 tools
replicate/anthropic/claude-4-sonnet $3.00 $15.00 👁️ vision · 🔧 tools · 💾 cache
replicate/anthropic/claude-3.7-sonnet $3.00 $15.00 👁️ vision · 🔧 tools · 💾 cache
replicate/anthropic/claude-4.5-sonnet $3.00 $15.00 👁️ vision · 🔧 tools · 💾 cache
replicate/anthropic/claude-3.5-sonnet $3.75 $18.75 👁️ vision · 🔧 tools · 💾 cache
replicate/deepseek-ai/deepseek-r1 $3.75 $10.00 66k
replicate/xai/grok-4 $7.20 $36.00 🔧 tools
replicate/openai/o1 $15.00 $60.00

xai (38)

Model Input $/M Output $/M Cached $/M Context Features
xai/grok-4-fast-reasoning $0.20 $0.50 $0.05 2000k 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-fast-non-reasoning $0.20 $0.50 $0.05 2000k 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-1-fast $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-1-fast-reasoning $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-1-fast-reasoning-latest $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-1-fast-non-reasoning $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-1-fast-non-reasoning-latest $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-code-fast $0.20 $1.50 $0.02 256k 🔧 tools · 💾 cache
xai/grok-code-fast-1 $0.20 $1.50 $0.02 256k 🔧 tools · 💾 cache
xai/grok-code-fast-1-0825 $0.20 $1.50 $0.02 256k 🔧 tools · 💾 cache
xai/grok-3-mini $0.30 $0.50 $0.07 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-mini-beta $0.30 $0.50 $0.07 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-mini-latest $0.30 $0.50 $0.07 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-mini-fast $0.60 $4.00 $0.15 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-mini-fast-beta $0.60 $4.00 $0.15 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-mini-fast-latest $0.60 $4.00 $0.15 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-4.3 $1.25 $2.50 $0.20 1000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4.3-latest $1.25 $2.50 $0.20 1000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-2 $2.00 $10.00 131k 🔧 tools · 🌐 search
xai/grok-2-1212 $2.00 $10.00 131k 🔧 tools · 🌐 search
xai/grok-2-latest $2.00 $10.00 131k 🔧 tools · 🌐 search
xai/grok-2-vision $2.00 $10.00 33k 👁️ vision · 🔧 tools · 🌐 search
xai/grok-2-vision-1212 $2.00 $10.00 33k 👁️ vision · 🔧 tools · 🌐 search
xai/grok-2-vision-latest $2.00 $10.00 33k 👁️ vision · 🔧 tools · 🌐 search
xai/grok-4.20-multi-agent-beta-0309 $2.00 $6.00 $0.20 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4.20-beta-0309-reasoning $2.00 $6.00 $0.20 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-4.20-0309-reasoning $2.00 $6.00 $0.20 2000k 👁️ vision · 🔧 tools · 🌐 search
xai/grok-4.20-beta-0309-non-reasoning $2.00 $6.00 $0.20 2000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
xai/grok-3 $3.00 $15.00 $0.75 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-beta $3.00 $15.00 $0.75 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-latest $3.00 $15.00 $0.75 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-4 $3.00 $15.00 256k 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-0709 $3.00 $15.00 256k 🔧 tools · 💾 cache · 🌐 search
xai/grok-4-latest $3.00 $15.00 256k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-fast-beta $5.00 $25.00 $1.25 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-3-fast-latest $5.00 $25.00 $1.25 131k 🔧 tools · 💾 cache · 🌐 search
xai/grok-beta $5.00 $15.00 131k 👁️ vision · 🔧 tools · 🌐 search
xai/grok-vision-beta $5.00 $15.00 8k 👁️ vision · 🔧 tools · 🌐 search

together ai (33)

Model Input $/M Output $/M Cached $/M Context Features
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free $0.0000 $0.0000 🔧 tools
together_ai/openai/gpt-oss-20b $0.05 $0.20 128k 🔧 tools
together-ai-up-to-4b $0.10 $0.10
together_ai/openai/gpt-oss-120b $0.15 $0.60 131k 🔧 tools
together_ai/Qwen/Qwen3-Next-80B-A3B-Instruct $0.15 $1.50 262k 🔧 tools
together_ai/Qwen/Qwen3-Next-80B-A3B-Thinking $0.15 $1.50 262k 🔧 tools
together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct $0.18 $0.59 🔧 tools
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo $0.18 $0.18 🔧 tools
together-ai-4.1b-8b $0.20 $0.20
together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput $0.20 $6.00 262k 🔧 tools
together_ai/Qwen/Qwen3-235B-A22B-fp8-tput $0.20 $0.60 40k
together_ai/zai-org/GLM-4.5-Air-FP8 $0.20 $1.10 128k 🔧 tools
together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 $0.27 $0.85 🔧 tools
together-ai-8.1b-21b $0.30 $0.30 1k
together_ai/zai-org/GLM-4.7 $0.45 $2.00 200k 🔧 tools
together_ai/moonshotai/Kimi-K2.5 $0.50 $2.80 256k 👁️ vision · 🔧 tools
together_ai/deepseek-ai/DeepSeek-R1-0528-tput $0.55 $2.19 128k 🔧 tools
together_ai/deepseek-ai/DeepSeek-V3.1 $0.60 $1.70 128k 🔧 tools
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 $0.60 $0.60 🔧 tools
together_ai/zai-org/GLM-4.6 $0.60 $2.20 200k 🔧 tools
together_ai/Qwen/Qwen3.5-397B-A17B $0.60 $3.60 262k 🔧 tools
together_ai/Qwen/Qwen3-235B-A22B-Thinking-2507 $0.65 $3.00 256k 🔧 tools
together-ai-21.1b-41b $0.80 $0.80
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo $0.88 $0.88 🔧 tools
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo $0.88 $0.88 🔧 tools
together-ai-41.1b-80b $0.90 $0.90
together_ai/moonshotai/Kimi-K2-Instruct $1.00 $3.00 🔧 tools
together_ai/moonshotai/Kimi-K2-Instruct-0905 $1.00 $3.00 262k 🔧 tools
together_ai/deepseek-ai/DeepSeek-V3 $1.25 $1.25 66k 🔧 tools
together-ai-81.1b-110b $1.80 $1.80
together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 $2.00 $2.00 256k 🔧 tools
together_ai/deepseek-ai/DeepSeek-R1 $3.00 $7.00 128k 🔧 tools
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo $3.50 $3.50 🔧 tools

oci (29)

Model Input $/M Output $/M Cached $/M Context Features
oci/google.gemini-2.5-flash-lite $0.07 $0.30 1049k 👁️ vision · 🔧 tools
oci/cohere.command-a-translate-08-2025 $0.09 $0.09 256k
oci/cohere.command-r-08-2024 $0.15 $0.15 128k 🔧 tools
oci/google.gemini-2.5-flash $0.15 $0.60 1049k 👁️ vision · 🔧 tools
oci/xai.grok-3-mini $0.30 $0.50 131k 🔧 tools
oci/xai.grok-3-mini-fast $0.60 $4.00 131k 🔧 tools
oci/meta.llama-3.3-70b-instruct $0.72 $0.72 128k 🔧 tools
oci/meta.llama-4-maverick-17b-128e-instruct-fp8 $0.72 $0.72 512k 🔧 tools
oci/meta.llama-4-scout-17b-16e-instruct $0.72 $0.72 192k 🔧 tools
oci/meta.llama-3.1-70b-instruct $0.72 $0.72 128k 🔧 tools
oci/meta.llama-3.3-70b-instruct-fp8-dynamic $0.72 $0.72 128k 🔧 tools
oci/google.gemini-2.5-pro $1.25 $10.00 1049k 👁️ vision · 🔧 tools
oci/cohere.command-latest $1.56 $1.56 128k 🔧 tools
oci/cohere.command-a-03-2025 $1.56 $1.56 256k 🔧 tools
oci/cohere.command-plus-latest $1.56 $1.56 128k 🔧 tools
oci/cohere.command-a-reasoning-08-2025 $1.56 $1.56 256k 🔧 tools
oci/cohere.command-a-vision-07-2025 $1.56 $1.56 128k 👁️ vision · 🔧 tools
oci/cohere.command-r-plus-08-2024 $1.56 $1.56 128k 🔧 tools
oci/meta.llama-3.2-90b-vision-instruct $2.00 $2.00 128k 👁️ vision · 🔧 tools
oci/meta.llama-3.2-11b-vision-instruct $2.00 $2.00 128k 👁️ vision · 🔧 tools
oci/xai.grok-3 $3.00 $15.00 131k 🔧 tools
oci/xai.grok-4 $3.00 $15.00 128k 🔧 tools
oci/xai.grok-4.20 $3.00 $15.00 131k 🔧 tools
oci/xai.grok-4.20-multi-agent $3.00 $15.00 131k 🔧 tools
oci/xai.grok-3-fast $5.00 $25.00 131k 🔧 tools
oci/xai.grok-4-fast $5.00 $25.00 131k 🔧 tools
oci/xai.grok-4.1-fast $5.00 $25.00 131k 🔧 tools
oci/xai.grok-code-fast-1 $5.00 $25.00 131k 🔧 tools
oci/meta.llama-3.1-405b-instruct $10.68 $10.68 128k 🔧 tools

ollama (29)

Model Input $/M Output $/M Cached $/M Context Features
ollama/codegeex4 $0.0000 $0.0000 33k
ollama/codegemma $0.0000 $0.0000 8k
ollama/codellama $0.0000 $0.0000 4k
ollama/deepseek-coder-v2-base $0.0000 $0.0000 8k 🔧 tools
ollama/deepseek-coder-v2-instruct $0.0000 $0.0000 33k 🔧 tools
ollama/deepseek-coder-v2-lite-base $0.0000 $0.0000 8k 🔧 tools
ollama/deepseek-coder-v2-lite-instruct $0.0000 $0.0000 33k 🔧 tools
ollama/deepseek-v3.1:671b-cloud $0.0000 $0.0000 164k 🔧 tools
ollama/gpt-oss:120b-cloud $0.0000 $0.0000 131k 🔧 tools
ollama/gpt-oss:20b-cloud $0.0000 $0.0000 131k 🔧 tools
ollama/internlm2_5-20b-chat $0.0000 $0.0000 33k 🔧 tools
ollama/llama2 $0.0000 $0.0000 4k
ollama/llama2-uncensored $0.0000 $0.0000 4k
ollama/llama2:13b $0.0000 $0.0000 4k
ollama/llama2:70b $0.0000 $0.0000 4k
ollama/llama2:7b $0.0000 $0.0000 4k
ollama/llama3 $0.0000 $0.0000 8k
ollama/llama3.1 $0.0000 $0.0000 8k 🔧 tools
ollama/llama3:70b $0.0000 $0.0000 8k
ollama/llama3:8b $0.0000 $0.0000 8k
ollama/mistral $0.0000 $0.0000 8k 🔧 tools
ollama/mistral-7B-Instruct-v0.1 $0.0000 $0.0000 8k 🔧 tools
ollama/mistral-7B-Instruct-v0.2 $0.0000 $0.0000 33k 🔧 tools
ollama/mistral-large-instruct-2407 $0.0000 $0.0000 66k 🔧 tools
ollama/mixtral-8x22B-Instruct-v0.1 $0.0000 $0.0000 66k 🔧 tools
ollama/mixtral-8x7B-Instruct-v0.1 $0.0000 $0.0000 33k 🔧 tools
ollama/orca-mini $0.0000 $0.0000 4k
ollama/qwen3-coder:480b-cloud $0.0000 $0.0000 262k 🔧 tools
ollama/vicuna $0.0000 $0.0000 2k

vertex ai-anthropic models (29)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/claude-3-haiku $0.25 $1.25 200k 👁️ vision · 🔧 tools
vertex_ai/claude-3-haiku@20240307 $0.25 $1.25 200k 👁️ vision · 🔧 tools
vertex_ai/claude-3-5-haiku $1.00 $5.00 200k 🔧 tools
vertex_ai/claude-3-5-haiku@20241022 $1.00 $5.00 200k 🔧 tools
vertex_ai/claude-haiku-4-5 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-haiku-4-5@20251001 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-3-5-sonnet $3.00 $15.00 200k 👁️ vision · 🔧 tools
vertex_ai/claude-3-5-sonnet@20240620 $3.00 $15.00 200k 👁️ vision · 🔧 tools
vertex_ai/claude-3-7-sonnet@20250219 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-3-sonnet $3.00 $15.00 200k 👁️ vision · 🔧 tools
vertex_ai/claude-3-sonnet@20240229 $3.00 $15.00 200k 👁️ vision · 🔧 tools
vertex_ai/claude-sonnet-4-5 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-sonnet-4-6 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-sonnet-4-5@20250929 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-sonnet-4 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-sonnet-4@20250514 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-sonnet-4-6@default $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-5 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-5@20251101 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-6 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-6@default $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-7 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-7@default $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-3-opus $15.00 $75.00 200k 👁️ vision · 🔧 tools
vertex_ai/claude-3-opus@20240229 $15.00 $75.00 200k 👁️ vision · 🔧 tools
vertex_ai/claude-opus-4 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
vertex_ai/claude-opus-4-1 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
vertex_ai/claude-opus-4-1@20250805 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools
vertex_ai/claude-opus-4@20250514 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache

watsonx (28)

Model Input $/M Output $/M Cached $/M Context Features
watsonx/ibm/granite-4-h-small $0.06 $0.25 20k 🔧 tools
watsonx/ibm/granite-guardian-3-2-2b $0.10 $0.10 8k
watsonx/ibm/granite-vision-3-2-2b $0.10 $0.10 8k 👁️ vision
watsonx/meta-llama/llama-3-2-1b-instruct $0.10 $0.10 128k 🔧 tools
watsonx/mistralai/mistral-small-2503 $0.10 $0.30 32k 🔧 tools
watsonx/mistralai/mistral-small-3-1-24b-instruct-2503 $0.10 $0.30 32k 🔧 tools
watsonx/meta-llama/llama-3-2-3b-instruct $0.15 $0.15 128k 🔧 tools
watsonx/openai/gpt-oss-120b $0.15 $0.60 8k
watsonx/ibm/granite-3-8b-instruct $0.20 $0.20 8k 🔧 tools · 💾 cache
watsonx/ibm/granite-3-3-8b-instruct $0.20 $0.20 8k 🔧 tools
watsonx/ibm/granite-guardian-3-3-8b $0.20 $0.20 8k
watsonx/meta-llama/llama-3-2-11b-vision-instruct $0.35 $0.35 128k 👁️ vision · 🔧 tools
watsonx/meta-llama/llama-4-maverick-17b $0.35 $1.40 128k 🔧 tools
watsonx/meta-llama/llama-guard-3-11b-vision $0.35 $0.35 128k 👁️ vision
watsonx/mistralai/pixtral-12b-2409 $0.35 $0.35 128k 👁️ vision
watsonx/ibm/granite-ttm-1024-96-r2 $0.38 $0.38 1k
watsonx/ibm/granite-ttm-1536-96-r2 $0.38 $0.38 1k
watsonx/ibm/granite-ttm-512-96-r2 $0.38 $0.38 1k
watsonx/google/flan-t5-xl-3b $0.60 $0.60 8k
watsonx/ibm/granite-13b-chat-v2 $0.60 $0.60 8k
watsonx/ibm/granite-13b-instruct-v2 $0.60 $0.60 8k
watsonx/meta-llama/llama-3-3-70b-instruct $0.71 $0.71 128k 🔧 tools
watsonx/sdaia/allam-1-13b-instruct $1.80 $1.80 8k
watsonx/meta-llama/llama-3-2-90b-vision-instruct $2.00 $2.00 128k 👁️ vision · 🔧 tools
watsonx/mistralai/mistral-large $3.00 $10.00 131k 🔧 tools · 💾 cache
watsonx/mistralai/mistral-medium-2505 $3.00 $10.00 128k 🔧 tools
watsonx/bigscience/mt0-xxl-13b $500.00 $2000.00 8k
watsonx/core42/jais-13b-chat $500.00 $2000.00 8k

nebius (27)

Model Input $/M Output $/M Cached $/M Context Features
nebius/Qwen/Qwen2.5-Coder-7B $0.01 $0.03 33k 🔧 tools
nebius/meta-llama/Llama-Guard-3-8B $0.02 $0.06 128k
nebius/meta-llama/Meta-Llama-3.1-8B-Instruct $0.02 $0.06 128k 🔧 tools
nebius/Qwen/Qwen2-VL-7B-Instruct $0.02 $0.06 131k 👁️ vision
nebius/mistralai/Mistral-Nemo-Instruct-2407 $0.04 $0.12 128k 🔧 tools
nebius/google/gemma-3-27b-it $0.06 $0.20 128k 👁️ vision · 🔧 tools
nebius/Qwen/Qwen2.5-32B-Instruct $0.06 $0.20 128k 🔧 tools
nebius/Qwen/Qwen3-14B $0.08 $0.24 33k 🔧 tools
nebius/Qwen/Qwen3-4B $0.08 $0.24 33k 🔧 tools
nebius/nvidia/Llama-3.3-Nemotron-Super-49B-v1 $0.10 $0.40 131k 🔧 tools
nebius/Qwen/Qwen3-32B $0.10 $0.30 33k 🔧 tools
nebius/Qwen/Qwen3-30B-A3B $0.10 $0.30 33k 🔧 tools
nebius/meta-llama/Llama-3.3-70B-Instruct $0.13 $0.40 128k 🔧 tools
nebius/meta-llama/Meta-Llama-3.1-70B-Instruct $0.13 $0.40 128k 🔧 tools
nebius/Qwen/Qwen2.5-72B-Instruct $0.13 $0.40 128k 🔧 tools
nebius/Qwen/Qwen2.5-VL-72B-Instruct $0.13 $0.40 131k 👁️ vision · 🔧 tools
nebius/Qwen/Qwen2-VL-72B-Instruct $0.13 $0.40 131k 👁️ vision · 🔧 tools
nebius/Qwen/QwQ-32B $0.15 $0.45 33k 🔧 tools
nebius/Qwen/Qwen3-235B-A22B $0.20 $0.60 262k 🔧 tools
nebius/deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.25 $0.75 128k 🔧 tools
nebius/deepseek-ai/DeepSeek-V3 $0.50 $1.50 128k 🔧 tools
nebius/deepseek-ai/DeepSeek-V3-0324 $0.50 $1.50 128k 🔧 tools
nebius/nvidia/Llama-3.1-Nemotron-Ultra-253B-v1 $0.60 $1.80 128k 🔧 tools
nebius/deepseek-ai/DeepSeek-R1 $0.80 $2.40 128k 🔧 tools
nebius/deepseek-ai/DeepSeek-R1-0528 $0.80 $2.40 164k 🔧 tools
nebius/meta-llama/Meta-Llama-3.1-405B-Instruct $1.00 $3.00 128k 🔧 tools
nebius/NousResearch/Hermes-3-Llama-3.1-405B $1.00 $3.00 128k 🔧 tools

databricks (26)

Model Input $/M Output $/M Cached $/M Context Features
databricks/databricks-gpt-5-nano $0.05 $0.40 272k
databricks/databricks-gpt-oss-20b $0.07 $0.30 131k
databricks/databricks-gemma-3-12b $0.15 $0.50 128k
databricks/databricks-gpt-oss-120b $0.15 $0.60 131k
databricks/databricks-meta-llama-3-1-8b-instruct $0.15 $0.45 200k
databricks/databricks-gpt-5-mini $0.25 $2.00 272k
databricks/databricks-gemini-2-5-flash $0.30 $2.50 1049k 🔧 tools
databricks/databricks-llama-2-70b-chat $0.50 $1.50 4k
databricks/databricks-llama-4-maverick $0.50 $1.50 128k
databricks/databricks-meta-llama-3-3-70b-instruct $0.50 $1.50 128k
databricks/databricks-mixtral-8x7b-instruct $0.50 $1.00 4k
databricks/databricks-mpt-7b-instruct $0.50 $0.0000 8k
databricks/databricks-claude-haiku-4-5 $1.00 $5.00 200k 🔧 tools
databricks/databricks-meta-llama-3-70b-instruct $1.00 $3.00 128k
databricks/databricks-mpt-30b-instruct $1.00 $1.00 8k
databricks/databricks-gemini-2-5-pro $1.25 $10.00 1049k 🔧 tools
databricks/databricks-gpt-5 $1.25 $10.00 272k
databricks/databricks-gpt-5-1 $1.25 $10.00 272k
databricks/databricks-claude-3-7-sonnet $3.00 $15.00 200k 🔧 tools
databricks/databricks-claude-sonnet-4 $3.00 $15.00 200k 🔧 tools
databricks/databricks-claude-sonnet-4-1 $3.00 $15.00 200k 🔧 tools
databricks/databricks-claude-sonnet-4-5 $3.00 $15.00 200k 🔧 tools
databricks/databricks-claude-opus-4-5 $5.00 $25.00 200k 🔧 tools
databricks/databricks-meta-llama-3-1-405b-instruct $5.00 $15.00 128k
databricks/databricks-claude-opus-4 $15.00 $75.00 200k 🔧 tools
databricks/databricks-claude-opus-4-1 $15.00 $75.00 200k 🔧 tools

moonshot (22)

Model Input $/M Output $/M Cached $/M Context Features
moonshot/kimi-latest-8k $0.20 $2.00 $0.15 8k 👁️ vision · 🔧 tools
moonshot/moonshot-v1-8k $0.20 $2.00 8k 🔧 tools
moonshot/moonshot-v1-8k-0430 $0.20 $2.00 8k 🔧 tools
moonshot/moonshot-v1-8k-vision-preview $0.20 $2.00 8k 👁️ vision · 🔧 tools
moonshot/kimi-k2-0711-preview $0.60 $2.50 $0.15 131k 🔧 tools · 🌐 search
moonshot/kimi-k2-0905-preview $0.60 $2.50 $0.15 262k 🔧 tools · 🌐 search
moonshot/kimi-k2.5 $0.60 $3.00 $0.10 262k 👁️ vision · 🔧 tools
moonshot/kimi-thinking-preview $0.60 $2.50 $0.15 131k 👁️ vision
moonshot/kimi-k2-thinking $0.60 $2.50 $0.15 262k 🔧 tools · 🌐 search
moonshot/kimi-k2.6 $0.95 $4.00 $0.16 262k 👁️ vision · 🔧 tools
moonshot/kimi-latest-32k $1.00 $3.00 $0.15 33k 👁️ vision · 🔧 tools
moonshot/moonshot-v1-32k $1.00 $3.00 33k 🔧 tools
moonshot/moonshot-v1-32k-0430 $1.00 $3.00 33k 🔧 tools
moonshot/moonshot-v1-32k-vision-preview $1.00 $3.00 33k 👁️ vision · 🔧 tools
moonshot/kimi-k2-turbo-preview $1.15 $8.00 $0.15 262k 🔧 tools · 🌐 search
moonshot/kimi-k2-thinking-turbo $1.15 $8.00 $0.15 262k 🔧 tools · 🌐 search
moonshot/kimi-latest $2.00 $5.00 $0.15 131k 👁️ vision · 🔧 tools
moonshot/kimi-latest-128k $2.00 $5.00 $0.15 131k 👁️ vision · 🔧 tools
moonshot/moonshot-v1-128k $2.00 $5.00 131k 🔧 tools
moonshot/moonshot-v1-128k-0430 $2.00 $5.00 131k 🔧 tools
moonshot/moonshot-v1-128k-vision-preview $2.00 $5.00 131k 👁️ vision · 🔧 tools
moonshot/moonshot-v1-auto $2.00 $5.00 131k 🔧 tools

anthropic (20)

Model Input $/M Output $/M Cached $/M Context Features
claude-3-haiku-20240307 $0.25 $1.25 $0.03 200k 👁️ vision · 🔧 tools · 💾 cache
claude-haiku-4-5-20251001 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
claude-haiku-4-5 $1.00 $5.00 $0.10 200k 👁️ vision · 🔧 tools · 💾 cache
claude-3-7-sonnet-20250219 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
claude-4-sonnet-20250514 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
claude-sonnet-4-5 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache
claude-sonnet-4-5-20250929 $3.00 $15.00 $0.30 200k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
claude-sonnet-4-6 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
claude-sonnet-4-20250514 $3.00 $15.00 $0.30 1000k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-5-20251101 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-5 $5.00 $25.00 $0.50 200k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-6 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-6-20260205 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-7 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-7-20260416 $5.00 $25.00 $0.50 1000k 👁️ vision · 🔧 tools · 💾 cache
claude-3-opus-20240229 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
claude-4-opus-20250514 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-1 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-1-20250805 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache
claude-opus-4-20250514 $15.00 $75.00 $1.50 200k 👁️ vision · 🔧 tools · 💾 cache

lambda ai (20)

Model Input $/M Output $/M Cached $/M Context Features
lambda_ai/llama3.2-11b-vision-instruct $0.01 $0.02 131k 👁️ vision · 🔧 tools
lambda_ai/llama3.2-3b-instruct $0.01 $0.02 131k 🔧 tools
lambda_ai/hermes3-8b $0.02 $0.04 131k 🔧 tools
lambda_ai/lfm-7b $0.02 $0.04 131k 🔧 tools
lambda_ai/llama3.1-8b-instruct $0.02 $0.04 131k 🔧 tools
lambda_ai/llama-4-maverick-17b-128e-instruct-fp8 $0.05 $0.10 131k 🔧 tools
lambda_ai/llama-4-scout-17b-16e-instruct $0.05 $0.10 16k 🔧 tools
lambda_ai/qwen25-coder-32b-instruct $0.05 $0.10 131k 🔧 tools
lambda_ai/qwen3-32b-fp8 $0.05 $0.10 131k 🔧 tools
lambda_ai/lfm-40b $0.10 $0.20 131k 🔧 tools
lambda_ai/hermes3-70b $0.12 $0.30 131k 🔧 tools
lambda_ai/llama3.1-70b-instruct-fp8 $0.12 $0.30 131k 🔧 tools
lambda_ai/llama3.1-nemotron-70b-instruct-fp8 $0.12 $0.30 131k 🔧 tools
lambda_ai/llama3.3-70b-instruct-fp8 $0.12 $0.30 131k 🔧 tools
lambda_ai/deepseek-llama3.3-70b $0.20 $0.60 131k 🔧 tools
lambda_ai/deepseek-r1-0528 $0.20 $0.60 131k 🔧 tools
lambda_ai/deepseek-v3-0324 $0.20 $0.60 131k 🔧 tools
lambda_ai/deepseek-r1-671b $0.80 $0.80 131k 🔧 tools
lambda_ai/hermes3-405b $0.80 $0.80 131k 🔧 tools
lambda_ai/llama3.1-405b-instruct-fp8 $0.80 $0.80 131k 🔧 tools

perplexity (20)

Model Input $/M Output $/M Cached $/M Context Features
perplexity/pplx-70b-online $0.0000 $2.80 4k
perplexity/pplx-7b-online $0.0000 $0.28 4k
perplexity/sonar-medium-online $0.0000 $1.80 12k
perplexity/sonar-small-online $0.0000 $0.28 12k
perplexity/mistral-7b-instruct $0.07 $0.28 4k
perplexity/mixtral-8x7b-instruct $0.07 $0.28 4k
perplexity/pplx-7b-chat $0.07 $0.28 8k
perplexity/sonar-small-chat $0.07 $0.28 16k
perplexity/llama-3.1-8b-instruct $0.20 $0.20 131k
perplexity/codellama-34b-instruct $0.35 $1.40 16k
perplexity/sonar-medium-chat $0.60 $1.80 16k
perplexity/codellama-70b-instruct $0.70 $2.80 16k
perplexity/llama-2-70b-chat $0.70 $2.80 4k
perplexity/pplx-70b-chat $0.70 $2.80 4k
perplexity/llama-3.1-70b-instruct $1.00 $1.00 131k
perplexity/sonar $1.00 $1.00 128k 🌐 search
perplexity/sonar-reasoning $1.00 $5.00 128k 🌐 search
perplexity/sonar-deep-research $2.00 $8.00 128k 🌐 search
perplexity/sonar-reasoning-pro $2.00 $8.00 128k 🌐 search
perplexity/sonar-pro $3.00 $15.00 200k 🌐 search

vertex ai-language-models (19)

Model Input $/M Output $/M Cached $/M Context Features
gemini-2.0-flash-lite $0.07 $0.30 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.0-flash-lite-001 $0.07 $0.30 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.0-flash $0.10 $0.40 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-flash-lite $0.10 $0.40 $0.01 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-flash-lite-preview-09-2025 $0.10 $0.40 $0.01 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-flash-lite-preview-06-17 $0.10 $0.40 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.0-flash-001 $0.15 $0.60 $0.04 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-3.1-flash-lite-preview $0.25 $1.50 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
vertex_ai/gemini-3.1-flash-lite-preview $0.25 $1.50 $0.02 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-flash $0.30 $2.50 $0.03 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-flash-preview-09-2025 $0.30 $2.50 $0.07 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-robotics-er-1.5-preview $0.30 $2.50 $0.0000 1049k 👁️ vision · 🔧 tools
gemini-3-flash-preview $0.50 $3.00 $0.05 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-pro $1.25 $10.00 $0.13 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-pro-preview-tts $1.25 $10.00 $0.13 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-2.5-computer-use-preview-10-2025 $1.25 $10.00 128k 👁️ vision · 🔧 tools
gemini-3-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-3.1-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
gemini-3.1-pro-preview-customtools $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search

vertex ai-mistral models (19)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/mistral-nemo@latest $0.15 $0.15 128k 🔧 tools
vertex_ai/codestral-2501 $0.20 $0.60 128k 🔧 tools
vertex_ai/codestral@2405 $0.20 $0.60 128k 🔧 tools
vertex_ai/codestral@latest $0.20 $0.60 128k 🔧 tools
vertex_ai/mistralai/codestral-2@001 $0.30 $0.90 128k 🔧 tools
vertex_ai/codestral-2 $0.30 $0.90 128k 🔧 tools
vertex_ai/codestral-2@001 $0.30 $0.90 128k 🔧 tools
vertex_ai/mistralai/codestral-2 $0.30 $0.90 128k 🔧 tools
vertex_ai/mistral-medium-3 $0.40 $2.00 128k 🔧 tools
vertex_ai/mistral-medium-3@001 $0.40 $2.00 128k 🔧 tools
vertex_ai/mistralai/mistral-medium-3 $0.40 $2.00 128k 🔧 tools
vertex_ai/mistralai/mistral-medium-3@001 $0.40 $2.00 128k 🔧 tools
vertex_ai/mistral-small-2503 $1.00 $3.00 128k 👁️ vision · 🔧 tools
vertex_ai/mistral-small-2503@001 $1.00 $3.00 32k 🔧 tools
vertex_ai/mistral-large-2411 $2.00 $6.00 128k 🔧 tools
vertex_ai/mistral-large@2407 $2.00 $6.00 128k 🔧 tools
vertex_ai/mistral-large@2411-001 $2.00 $6.00 128k 🔧 tools
vertex_ai/mistral-large@latest $2.00 $6.00 128k 🔧 tools
vertex_ai/mistral-nemo@2407 $3.00 $3.00 128k 🔧 tools

dashscope (17)

Model Input $/M Output $/M Cached $/M Context Features
dashscope/qwen-turbo $0.05 $0.20 129k 🔧 tools
dashscope/qwen-turbo-2024-11-01 $0.05 $0.20 1000k 🔧 tools
dashscope/qwen-turbo-2025-04-28 $0.05 $0.20 1000k 🔧 tools
dashscope/qwen-turbo-latest $0.05 $0.20 1000k 🔧 tools
dashscope/qwen3-next-80b-a3b-instruct $0.15 $1.20 262k 🔧 tools
dashscope/qwen3-next-80b-a3b-thinking $0.15 $1.20 262k 🔧 tools
dashscope/qwen3-vl-32b-instruct $0.16 $0.64 131k 👁️ vision · 🔧 tools
dashscope/qwen3-vl-32b-thinking $0.16 $2.87 131k 👁️ vision · 🔧 tools
dashscope/qwen-coder $0.30 $1.50 1000k 🔧 tools
dashscope/qwen-plus $0.40 $1.20 129k 🔧 tools
dashscope/qwen-plus-2025-01-25 $0.40 $1.20 129k 🔧 tools
dashscope/qwen-plus-2025-04-28 $0.40 $1.20 129k 🔧 tools
dashscope/qwen-plus-2025-07-14 $0.40 $1.20 129k 🔧 tools
dashscope/qwen3-vl-235b-a22b-instruct $0.40 $1.60 131k 👁️ vision · 🔧 tools
dashscope/qwen3-vl-235b-a22b-thinking $0.40 $4.00 131k 👁️ vision · 🔧 tools
dashscope/qwq-plus $0.80 $2.40 98k 🔧 tools
dashscope/qwen-max $1.60 $6.40 31k 🔧 tools

gmi (17)

Model Input $/M Output $/M Cached $/M Context Features
gmi/openai/gpt-4o-mini $0.15 $0.60 131k 👁️ vision · 🔧 tools
gmi/deepseek-ai/DeepSeek-V3.2 $0.28 $0.40 164k 🔧 tools
gmi/deepseek-ai/DeepSeek-V3-0324 $0.28 $0.88 164k 🔧 tools
gmi/MiniMaxAI/MiniMax-M2.1 $0.30 $1.20 197k
gmi/Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 $0.30 $1.40 262k 👁️ vision
gmi/zai-org/GLM-4.7-FP8 $0.40 $2.00 203k
gmi/google/gemini-3-flash-preview $0.50 $3.00 1049k 👁️ vision · 🔧 tools
gmi/moonshotai/Kimi-K2-Thinking $0.80 $1.20 262k
gmi/openai/gpt-5.1 $1.25 $10.00 410k 🔧 tools
gmi/openai/gpt-5 $1.25 $10.00 410k 🔧 tools
gmi/openai/gpt-5.2 $1.75 $14.00 410k 🔧 tools
gmi/google/gemini-3-pro-preview $2.00 $12.00 1049k 👁️ vision · 🔧 tools
gmi/openai/gpt-4o $2.50 $10.00 131k 👁️ vision · 🔧 tools
gmi/anthropic/claude-sonnet-4.5 $3.00 $15.00 410k 👁️ vision · 🔧 tools
gmi/anthropic/claude-sonnet-4 $3.00 $15.00 410k 👁️ vision · 🔧 tools
gmi/anthropic/claude-opus-4.5 $5.00 $25.00 410k 👁️ vision · 🔧 tools
gmi/anthropic/claude-opus-4 $15.00 $75.00 410k 👁️ vision · 🔧 tools

sambanova (17)

Model Input $/M Output $/M Cached $/M Context Features
sambanova/Meta-Llama-3.2-1B-Instruct $0.04 $0.08 16k
sambanova/Meta-Llama-3.2-3B-Instruct $0.08 $0.16 4k
sambanova/Meta-Llama-3.1-8B-Instruct $0.10 $0.20 16k 🔧 tools
sambanova/MiniMax-M2.7 $0.30 $1.20 205k 🔧 tools
sambanova/Meta-Llama-Guard-3-8B $0.30 $0.30 16k
sambanova/Llama-4-Scout-17B-16E-Instruct $0.40 $0.70 8k 🔧 tools
sambanova/Qwen3-32B $0.40 $0.80 8k 🔧 tools
sambanova/QwQ-32B $0.50 $1.00 16k
sambanova/Qwen2-Audio-7B-Instruct $0.50 $100.00 4k
sambanova/Meta-Llama-3.3-70B-Instruct $0.60 $1.20 131k 🔧 tools
sambanova/Llama-4-Maverick-17B-128E-Instruct $0.63 $1.80 131k 👁️ vision · 🔧 tools
sambanova/DeepSeek-R1-Distill-Llama-70B $0.70 $1.40 131k
sambanova/DeepSeek-V3-0324 $3.00 $4.50 33k 🔧 tools
sambanova/DeepSeek-V3.1 $3.00 $4.50 33k 🔧 tools
sambanova/gpt-oss-120b $3.00 $4.50 131k 🔧 tools
sambanova/DeepSeek-R1 $5.00 $7.00 33k
sambanova/Meta-Llama-3.1-405B-Instruct $5.00 $10.00 16k 🔧 tools

hyperbolic (16)

Model Input $/M Output $/M Cached $/M Context Features
hyperbolic/NousResearch/Hermes-3-Llama-3.1-70B $0.12 $0.30 33k 🔧 tools
hyperbolic/Qwen/Qwen2.5-72B-Instruct $0.12 $0.30 131k 🔧 tools
hyperbolic/Qwen/Qwen2.5-Coder-32B-Instruct $0.12 $0.30 33k 🔧 tools
hyperbolic/meta-llama/Llama-3.2-3B-Instruct $0.12 $0.30 33k 🔧 tools
hyperbolic/meta-llama/Llama-3.3-70B-Instruct $0.12 $0.30 131k 🔧 tools
hyperbolic/meta-llama/Meta-Llama-3-70B-Instruct $0.12 $0.30 131k 🔧 tools
hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct $0.12 $0.30 33k 🔧 tools
hyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct $0.12 $0.30 33k 🔧 tools
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct $0.12 $0.30 33k 🔧 tools
hyperbolic/Qwen/QwQ-32B $0.20 $0.20 131k 🔧 tools
hyperbolic/deepseek-ai/DeepSeek-V3 $0.20 $0.20 33k 🔧 tools
hyperbolic/deepseek-ai/DeepSeek-R1-0528 $0.25 $0.25 131k 🔧 tools
hyperbolic/deepseek-ai/DeepSeek-R1 $0.40 $0.40 33k 🔧 tools
hyperbolic/deepseek-ai/DeepSeek-V3-0324 $0.40 $0.40 33k 🔧 tools
hyperbolic/Qwen/Qwen3-235B-A22B $2.00 $2.00 131k 🔧 tools
hyperbolic/moonshotai/Kimi-K2-Instruct $2.00 $2.00 131k 🔧 tools

wandb (16)

Model Input $/M Output $/M Cached $/M Context Features
wandb/openai/gpt-oss-20b $0.05 $0.20 131k
wandb/microsoft/Phi-4-mini-instruct $0.08 $0.35 128k
wandb/Qwen/Qwen3-235B-A22B-Instruct-2507 $0.10 $0.10 262k
wandb/Qwen/Qwen3-235B-A22B-Thinking-2507 $0.10 $0.10 262k
wandb/openai/gpt-oss-120b $0.15 $0.60 131k
wandb/meta-llama/Llama-4-Scout-17B-16E-Instruct $0.17 $0.66 64k
wandb/meta-llama/Llama-3.1-8B-Instruct $0.22 $0.22 128k
wandb/MiniMaxAI/MiniMax-M2.5 $0.30 $1.20 197k 🔧 tools
wandb/zai-org/GLM-4.5 $0.55 $2.00 131k
wandb/deepseek-ai/DeepSeek-V3.1 $0.55 $1.65 128k
wandb/moonshotai/Kimi-K2-Instruct $0.60 $2.50 128k
wandb/moonshotai/Kimi-K2.5 $0.60 $3.00 $0.10 262k 👁️ vision · 🔧 tools
wandb/meta-llama/Llama-3.3-70B-Instruct $0.71 $0.71 128k
wandb/Qwen/Qwen3-Coder-480B-A35B-Instruct $1.00 $1.50 262k
wandb/deepseek-ai/DeepSeek-V3-0324 $1.14 $2.75 161k
wandb/deepseek-ai/DeepSeek-R1-0528 $1.35 $5.40 161k

ovhcloud (15)

Model Input $/M Output $/M Cached $/M Context Features
ovhcloud/gpt-oss-20b $0.04 $0.15 131k
ovhcloud/Qwen3-32B $0.08 $0.23 32k 🔧 tools
ovhcloud/gpt-oss-120b $0.08 $0.40 131k
ovhcloud/Mistral-Small-3.2-24B-Instruct-2506 $0.09 $0.28 128k 👁️ vision · 🔧 tools
ovhcloud/Llama-3.1-8B-Instruct $0.10 $0.10 131k 🔧 tools
ovhcloud/Mistral-7B-Instruct-v0.3 $0.10 $0.10 127k 🔧 tools
ovhcloud/Mistral-Nemo-Instruct-2407 $0.13 $0.13 118k 🔧 tools
ovhcloud/mamba-codestral-7B-v0.1 $0.19 $0.19 256k
ovhcloud/llava-v1.6-mistral-7b-hf $0.29 $0.29 32k 👁️ vision
ovhcloud/Mixtral-8x7B-Instruct-v0.1 $0.63 $0.63 32k
ovhcloud/DeepSeek-R1-Distill-Llama-70B $0.67 $0.67 131k 🔧 tools
ovhcloud/Meta-Llama-3_1-70B-Instruct $0.67 $0.67 131k
ovhcloud/Meta-Llama-3_3-70B-Instruct $0.67 $0.67 131k 🔧 tools
ovhcloud/Qwen2.5-Coder-32B-Instruct $0.87 $0.87 32k
ovhcloud/Qwen2.5-VL-72B-Instruct $0.91 $0.91 32k 👁️ vision

nscale (14)

Model Input $/M Output $/M Cached $/M Context Features
nscale/Qwen/Qwen2.5-Coder-3B-Instruct $0.01 $0.03
nscale/Qwen/Qwen2.5-Coder-7B-Instruct $0.01 $0.03
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B $0.02 $0.02
nscale/meta-llama/Llama-3.1-8B-Instruct $0.03 $0.03
nscale/Qwen/Qwen2.5-Coder-32B-Instruct $0.06 $0.20
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B $0.07 $0.07
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B $0.09 $0.09
nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct $0.09 $0.29
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B $0.15 $0.15
nscale/Qwen/QwQ-32B $0.18 $0.20
nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B $0.20 $0.20
nscale/meta-llama/Llama-3.3-70B-Instruct $0.20 $0.20
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B $0.38 $0.38
nscale/mistralai/mixtral-8x22b-instruct-v0.1 $0.60 $0.60

llamagate (14)

Model Input $/M Output $/M Cached $/M Context Features
llamagate/llama-3.1-8b $0.03 $0.05 131k 🔧 tools
llamagate/gemma3-4b $0.03 $0.08 128k 👁️ vision · 🔧 tools
llamagate/llama-3.2-3b $0.04 $0.08 131k 🔧 tools
llamagate/qwen3-8b $0.04 $0.14 33k 🔧 tools
llamagate/qwen2.5-coder-7b $0.06 $0.12 33k 🔧 tools
llamagate/deepseek-coder-6.7b $0.06 $0.12 16k 🔧 tools
llamagate/codellama-7b $0.06 $0.12 16k 🔧 tools
llamagate/dolphin3-8b $0.08 $0.15 128k 🔧 tools
llamagate/deepseek-r1-7b-qwen $0.08 $0.15 131k 🔧 tools
llamagate/openthinker-7b $0.08 $0.15 33k 🔧 tools
llamagate/mistral-7b-v0.3 $0.10 $0.15 33k 🔧 tools
llamagate/deepseek-r1-8b $0.10 $0.20 66k 🔧 tools
llamagate/llava-7b $0.10 $0.20 4k 👁️ vision
llamagate/qwen3-vl-8b $0.15 $0.55 33k 👁️ vision · 🔧 tools

anyscale (12)

Model Input $/M Output $/M Cached $/M Context Features
anyscale/HuggingFaceH4/zephyr-7b-beta $0.15 $0.15 16k
anyscale/google/gemma-7b-it $0.15 $0.15 8k
anyscale/meta-llama/Llama-2-7b-chat-hf $0.15 $0.15 4k
anyscale/meta-llama/Meta-Llama-3-8B-Instruct $0.15 $0.15 8k
anyscale/mistralai/Mistral-7B-Instruct-v0.1 $0.15 $0.15 16k 🔧 tools
anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 $0.15 $0.15 16k 🔧 tools
anyscale/meta-llama/Llama-2-13b-chat-hf $0.25 $0.25 4k
anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 $0.90 $0.90 66k 🔧 tools
anyscale/codellama/CodeLlama-34b-Instruct-hf $1.00 $1.00 4k
anyscale/codellama/CodeLlama-70b-Instruct-hf $1.00 $1.00 4k
anyscale/meta-llama/Llama-2-70b-chat-hf $1.00 $1.00 4k
anyscale/meta-llama/Meta-Llama-3-70B-Instruct $1.00 $1.00 8k

ai21 (12)

Model Input $/M Output $/M Cached $/M Context Features
jamba-1.5 $0.20 $0.40 256k
jamba-1.5-mini $0.20 $0.40 256k
jamba-1.5-mini@001 $0.20 $0.40 256k
jamba-mini-1.6 $0.20 $0.40 256k
jamba-mini-1.7 $0.20 $0.40 256k
jamba-1.5-large $2.00 $8.00 256k
jamba-1.5-large@001 $2.00 $8.00 256k
jamba-large-1.6 $2.00 $8.00 256k
jamba-large-1.7 $2.00 $8.00 256k
j2-light $3.00 $3.00 8k
j2-mid $10.00 $10.00 8k
j2-ultra $15.00 $15.00 8k

baseten (11)

Model Input $/M Output $/M Cached $/M Context Features
baseten/openai/gpt-oss-120b $0.10 $0.50
baseten/MiniMaxAI/MiniMax-M2.5 $0.30 $1.20
baseten/nvidia/Nemotron-120B-A12B $0.30 $0.75
baseten/deepseek-ai/DeepSeek-V3.1 $0.50 $1.50
baseten/zai-org/GLM-4.7 $0.60 $2.20
baseten/zai-org/GLM-4.6 $0.60 $2.20
baseten/moonshotai/Kimi-K2.5 $0.60 $3.00
baseten/moonshotai/Kimi-K2-Thinking $0.60 $2.50
baseten/moonshotai/Kimi-K2-Instruct-0905 $0.60 $2.50
baseten/deepseek-ai/DeepSeek-V3-0324 $0.77 $0.77
baseten/zai-org/GLM-5 $0.95 $3.15

groq (11)

Model Input $/M Output $/M Cached $/M Context Features
groq/llama-3.1-8b-instant $0.05 $0.08 128k 🔧 tools
groq/gemma-7b-it $0.05 $0.08 8k 🔧 tools
groq/openai/gpt-oss-20b $0.07 $0.30 $0.04 131k 🔧 tools · 🌐 search
groq/openai/gpt-oss-safeguard-20b $0.07 $0.30 $0.04 131k 🔧 tools · 🌐 search
groq/meta-llama/llama-4-scout-17b-16e-instruct $0.11 $0.34 131k 👁️ vision · 🔧 tools
groq/openai/gpt-oss-120b $0.15 $0.60 $0.07 131k 🔧 tools · 🌐 search
groq/meta-llama/llama-guard-4-12b $0.20 $0.20 8k
groq/meta-llama/llama-4-maverick-17b-128e-instruct $0.20 $0.60 131k 👁️ vision · 🔧 tools
groq/qwen/qwen3-32b $0.29 $0.59 131k 🔧 tools
groq/llama-3.3-70b-versatile $0.59 $0.79 128k 🔧 tools
groq/moonshotai/kimi-k2-instruct-0905 $1.00 $3.00 $0.50 262k 🔧 tools

vertex ai-llama models (11)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/meta/llama-3.1-70b-instruct-maas $0.0000 $0.0000 128k 👁️ vision
vertex_ai/meta/llama-3.1-8b-instruct-maas $0.0000 $0.0000 128k 👁️ vision
vertex_ai/meta/llama-3.2-90b-vision-instruct-maas $0.0000 $0.0000 128k 👁️ vision
vertex_ai/meta/llama3-405b-instruct-maas $0.0000 $0.0000 32k
vertex_ai/meta/llama3-70b-instruct-maas $0.0000 $0.0000 32k
vertex_ai/meta/llama3-8b-instruct-maas $0.0000 $0.0000 32k
vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas $0.25 $0.70 10000k 🔧 tools
vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas $0.25 $0.70 10000k 🔧 tools
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas $0.35 $1.15 1000k 🔧 tools
vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas $0.35 $1.15 1000k 🔧 tools
vertex_ai/meta/llama-3.1-405b-instruct-maas $5.00 $16.00 128k 👁️ vision

zai (11)

Model Input $/M Output $/M Cached $/M Context Features
zai/glm-4.5-flash $0.0000 $0.0000 128k 🔧 tools
zai/glm-4-32b-0414-128k $0.10 $0.10 128k 🔧 tools
zai/glm-4.5-air $0.20 $1.10 128k 🔧 tools
zai/glm-4.7 $0.60 $2.20 $0.11 200k 🔧 tools · 💾 cache
zai/glm-4.6 $0.60 $2.20 $0.11 200k 🔧 tools · 💾 cache
zai/glm-4.5 $0.60 $2.20 128k 🔧 tools
zai/glm-4.5v $0.60 $1.80 128k 👁️ vision · 🔧 tools
zai/glm-5 $1.00 $3.20 $0.20 200k 🔧 tools · 💾 cache
zai/glm-4.5-airx $1.10 $4.50 128k 🔧 tools
zai/glm-5-code $1.20 $5.00 $0.30 200k 🔧 tools · 💾 cache
zai/glm-4.5-x $2.20 $8.90 128k 🔧 tools

gradient ai (10)

Model Input $/M Output $/M Cached $/M Context Features
gradient_ai/llama3-8b-instruct $0.20 $0.20 8k
gradient_ai/mistral-nemo-instruct-2407 $0.30 $0.30 128k
gradient_ai/llama3.3-70b-instruct $0.65 $0.65 128k
gradient_ai/anthropic-claude-3.5-haiku $0.80 $4.00 200k
gradient_ai/deepseek-r1-distill-llama-70b $0.99 $0.99 33k
gradient_ai/openai-o3-mini $1.10 $4.40 200k
gradient_ai/openai-o3 $2.00 $8.00 200k
gradient_ai/anthropic-claude-3.5-sonnet $3.00 $15.00 200k
gradient_ai/anthropic-claude-3.7-sonnet $3.00 $15.00 200k
gradient_ai/anthropic-claude-3-opus $15.00 $75.00 200k

publicai (9)

Model Input $/M Output $/M Cached $/M Context Features
publicai/swiss-ai/apertus-8b-instruct $0.0000 $0.0000 8k
publicai/swiss-ai/apertus-70b-instruct $0.0000 $0.0000 8k
publicai/aisingapore/Gemma-SEA-LION-v4-27B-IT $0.0000 $0.0000 8k 🔧 tools
publicai/BSC-LT/salamandra-7b-instruct-tools-16k $0.0000 $0.0000 16k 🔧 tools
publicai/BSC-LT/ALIA-40b-instruct_Q8_0 $0.0000 $0.0000 8k 🔧 tools
publicai/allenai/Olmo-3-7B-Instruct $0.0000 $0.0000 33k 🔧 tools
publicai/aisingapore/Qwen-SEA-LION-v4-32B-IT $0.0000 $0.0000 33k 🔧 tools
publicai/allenai/Olmo-3-7B-Think $0.0000 $0.0000 33k 🔧 tools
publicai/allenai/Olmo-3-32B-Think $0.0000 $0.0000 33k 🔧 tools

deepseek (8)

Model Input $/M Output $/M Cached $/M Context Features
deepseek/deepseek-coder $0.14 $0.28 128k 🔧 tools · 💾 cache
deepseek/deepseek-v3 $0.27 $1.10 $0.07 66k 🔧 tools · 💾 cache
deepseek-chat $0.28 $0.42 $0.03 131k 🔧 tools · 💾 cache
deepseek-reasoner $0.28 $0.42 $0.03 131k 💾 cache
deepseek/deepseek-chat $0.28 $0.42 $0.03 131k 🔧 tools · 💾 cache
deepseek/deepseek-reasoner $0.28 $0.42 $0.03 131k 💾 cache
deepseek/deepseek-v3.2 $0.28 $0.40 164k 🔧 tools · 💾 cache
deepseek/deepseek-r1 $0.55 $2.19 66k 🔧 tools · 💾 cache

vertex ai (8)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/xai/grok-4.1-fast-non-reasoning $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 🌐 search
vertex_ai/xai/grok-4.1-fast-reasoning $0.20 $0.50 $0.05 2000k 👁️ vision · 🔧 tools · 🌐 search
vertex_ai/gemini-3-flash-preview $0.50 $3.00 $0.05 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
vertex_ai/gemini-3-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
vertex_ai/gemini-3.1-pro-preview $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
vertex_ai/gemini-3.1-pro-preview-customtools $2.00 $12.00 $0.20 1049k 👁️ vision · 🔧 tools · 💾 cache · 🌐 search
vertex_ai/xai/grok-4.20-non-reasoning $2.00 $6.00 $0.20 2000k 👁️ vision · 🔧 tools · 🌐 search
vertex_ai/xai/grok-4.20-reasoning $2.00 $6.00 $0.20 2000k 👁️ vision · 🔧 tools · 🌐 search

cerebras (7)

Model Input $/M Output $/M Cached $/M Context Features
cerebras/llama3.1-8b $0.10 $0.10 128k 🔧 tools
cerebras/gpt-oss-120b $0.35 $0.75 131k 🔧 tools
cerebras/qwen-3-32b $0.40 $0.80 128k 🔧 tools
cerebras/llama3.1-70b $0.60 $0.60 128k 🔧 tools
cerebras/llama-3.3-70b $0.85 $1.20 128k 🔧 tools
cerebras/zai-glm-4.6 $2.25 $2.75 128k 🔧 tools
cerebras/zai-glm-4.7 $2.25 $2.75 128k 🔧 tools

cohere chat (7)

Model Input $/M Output $/M Cached $/M Context Features
command-r $0.15 $0.60 128k 🔧 tools
command-r-08-2024 $0.15 $0.60 128k 🔧 tools
command-r7b-12-2024 $0.15 $0.04 128k 🔧 tools
command-light $0.30 $0.60 4k
command-a-03-2025 $2.50 $10.00 256k 🔧 tools
command-r-plus $2.50 $10.00 128k 🔧 tools
command-r-plus-08-2024 $2.50 $10.00 128k 🔧 tools

crusoe (7)

Model Input $/M Output $/M Cached $/M Context Features
crusoe/google/gemma-3-12b-it $0.10 $0.10 131k 👁️ vision · 🔧 tools
crusoe/meta-llama/Llama-3.3-70B-Instruct $0.20 $0.20 131k 🔧 tools
crusoe/openai/gpt-oss-120b $0.80 $0.80 131k 🔧 tools
crusoe/deepseek-ai/DeepSeek-V3-0324 $1.50 $1.50 164k 🔧 tools
crusoe/moonshotai/Kimi-K2-Thinking $2.50 $2.50 262k
crusoe/deepseek-ai/DeepSeek-R1-0528 $3.00 $7.00 164k
crusoe/Qwen/Qwen3-235B-A22B-Instruct-2507 $3.00 $3.00 262k 🔧 tools

text-completion-openai (6)

Model Input $/M Output $/M Cached $/M Context Features
babbage-002 $0.40 $0.40 16k
gpt-3.5-turbo-instruct $1.50 $2.00 8k
gpt-3.5-turbo-instruct-0914 $1.50 $2.00 8k
ft:babbage-002 $1.60 $1.60 16k
davinci-002 $2.00 $2.00 16k
ft:davinci-002 $12.00 $12.00 16k

palm (6)

Model Input $/M Output $/M Cached $/M Context Features
palm/chat-bison $0.13 $0.13 8k
palm/chat-bison-001 $0.13 $0.13 8k
palm/text-bison $0.13 $0.13 8k
palm/text-bison-001 $0.13 $0.13 8k
palm/text-bison-safety-off $0.13 $0.13 8k
palm/text-bison-safety-recitation-off $0.13 $0.13 8k

sagemaker (6)

Model Input $/M Output $/M Cached $/M Context Features
sagemaker/meta-textgeneration-llama-2-13b $0.0000 $0.0000 4k
sagemaker/meta-textgeneration-llama-2-13b-f $0.0000 $0.0000 4k
sagemaker/meta-textgeneration-llama-2-70b $0.0000 $0.0000 4k
sagemaker/meta-textgeneration-llama-2-70b-b-f $0.0000 $0.0000 4k
sagemaker/meta-textgeneration-llama-2-7b $0.0000 $0.0000 4k
sagemaker/meta-textgeneration-llama-2-7b-f $0.0000 $0.0000 4k

lemonade (5)

Model Input $/M Output $/M Cached $/M Context Features
lemonade/Qwen3-Coder-30B-A3B-Instruct-GGUF $0.0000 $0.0000 262k 🔧 tools
lemonade/gpt-oss-20b-mxfp4-GGUF $0.0000 $0.0000 131k 🔧 tools
lemonade/gpt-oss-120b-mxfp-GGUF $0.0000 $0.0000 131k 🔧 tools
lemonade/Gemma-3-4b-it-GGUF $0.0000 $0.0000 128k 🔧 tools
lemonade/Qwen3-4B-Instruct-2507-GGUF $0.0000 $0.0000 262k 🔧 tools

minimax (5)

Model Input $/M Output $/M Cached $/M Context Features
minimax/MiniMax-M2.1 $0.30 $1.20 $0.03 1000k 🔧 tools · 💾 cache
minimax/MiniMax-M2.1-lightning $0.30 $2.40 $0.03 1000k 🔧 tools · 💾 cache
minimax/MiniMax-M2.5 $0.30 $1.20 $0.03 1000k 🔧 tools · 💾 cache
minimax/MiniMax-M2.5-lightning $0.30 $2.40 $0.03 1000k 🔧 tools · 💾 cache
minimax/MiniMax-M2 $0.30 $1.20 $0.03 200k 🔧 tools · 💾 cache

vertex ai-ai21 models (5)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/jamba-1.5 $0.20 $0.40 256k
vertex_ai/jamba-1.5-mini $0.20 $0.40 256k
vertex_ai/jamba-1.5-mini@001 $0.20 $0.40 256k
vertex_ai/jamba-1.5-large $2.00 $8.00 256k
vertex_ai/jamba-1.5-large@001 $2.00 $8.00 256k

cloudflare (4)

Model Input $/M Output $/M Cached $/M Context Features
cloudflare/@cf/meta/llama-2-7b-chat-fp16 $1.92 $1.92 3k
cloudflare/@cf/meta/llama-2-7b-chat-int8 $1.92 $1.92 2k
cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 $1.92 $1.92 8k
cloudflare/@hf/thebloke/codellama-7b-instruct-awq $1.92 $1.92 4k

amazon nova (4)

Model Input $/M Output $/M Cached $/M Context Features
amazon-nova/nova-micro-v1 $0.04 $0.14 128k 🔧 tools · 💾 cache
amazon-nova/nova-lite-v1 $0.06 $0.24 300k 👁️ vision · 🔧 tools · 💾 cache
amazon-nova/nova-pro-v1 $0.80 $3.20 300k 👁️ vision · 🔧 tools · 💾 cache
amazon-nova/nova-premier-v1 $2.50 $12.50 1000k 👁️ vision · 🔧 tools

vertex ai-qwen models (4)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/qwen/qwen3-next-80b-a3b-instruct-maas $0.15 $1.20 262k 🔧 tools
vertex_ai/qwen/qwen3-next-80b-a3b-thinking-maas $0.15 $1.20 262k 🔧 tools
vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas $0.25 $1.00 262k 🔧 tools
vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas $1.00 $4.00 262k 🔧 tools

bedrock mantle (4)

Model Input $/M Output $/M Cached $/M Context Features
bedrock_mantle/openai.gpt-oss-20b $0.07 $0.30 131k 🔧 tools
bedrock_mantle/openai.gpt-oss-safeguard-20b $0.07 $0.30 131k 🔧 tools
bedrock_mantle/openai.gpt-oss-120b $0.15 $0.60 131k 🔧 tools
bedrock_mantle/openai.gpt-oss-safeguard-120b $0.15 $0.60 131k 🔧 tools

azure text (3)

Model Input $/M Output $/M Cached $/M Context Features
azure/gpt-3.5-turbo-instruct-0914 $1.50 $2.00 4k
azure/gpt-35-turbo-instruct $1.50 $2.00 4k
azure/gpt-35-turbo-instruct-0914 $1.50 $2.00 4k

volcengine (3)

Model Input $/M Output $/M Cached $/M Context Features
deepseek-v3-2-251201 $0.0000 $0.0000 98k 🔧 tools · 💾 cache
glm-4-7-251222 $0.0000 $0.0000 205k 🔧 tools · 💾 cache
kimi-k2-thinking-251104 $0.0000 $0.0000 229k 🔧 tools · 💾 cache

gigachat (3)

Model Input $/M Output $/M Cached $/M Context Features
gigachat/GigaChat-2-Lite $0.0000 $0.0000 128k 🔧 tools
gigachat/GigaChat-2-Max $0.0000 $0.0000 128k 👁️ vision · 🔧 tools
gigachat/GigaChat-2-Pro $0.0000 $0.0000 128k 👁️ vision · 🔧 tools

v0 (3)

Model Input $/M Output $/M Cached $/M Context Features
v0/v0-1.0-md $3.00 $15.00 128k 👁️ vision · 🔧 tools
v0/v0-1.5-md $3.00 $15.00 128k 👁️ vision · 🔧 tools
v0/v0-1.5-lg $15.00 $75.00 512k 👁️ vision · 🔧 tools

vertex ai-deepseek models (3)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/deepseek-ai/deepseek-v3.2-maas $0.56 $1.68 164k 🔧 tools · 💾 cache
vertex_ai/deepseek-ai/deepseek-v3.1-maas $1.35 $5.40 164k 🔧 tools · 💾 cache
vertex_ai/deepseek-ai/deepseek-r1-0528-maas $1.35 $5.40 65k 🔧 tools · 💾 cache

nlp cloud (2)

Model Input $/M Output $/M Cached $/M Context Features
chatdolphin $0.50 $0.50 16k
dolphin $0.50 $0.50 16k

codestral (2)

Model Input $/M Output $/M Cached $/M Context Features
codestral/codestral-2405 $0.0000 $0.0000 32k
codestral/codestral-latest $0.0000 $0.0000 32k

cohere (2)

Model Input $/M Output $/M Cached $/M Context Features
command $1.00 $2.00 4k
command-nightly $1.00 $2.00 4k

fireworks ai-embedding-models (2)

Model Input $/M Output $/M Cached $/M Context Features
fireworks-ai-embedding-up-to-150m $0.0080 $0.0000
fireworks-ai-embedding-150m-to-350m $0.02 $0.0000

friendliai (2)

Model Input $/M Output $/M Cached $/M Context Features
friendliai/meta-llama-3.1-8b-instruct $0.10 $0.10 8k 🔧 tools
friendliai/meta-llama-3.1-70b-instruct $0.60 $0.60 8k 🔧 tools

morph (2)

Model Input $/M Output $/M Cached $/M Context Features
morph/morph-v3-fast $0.80 $1.20 16k
morph/morph-v3-large $0.90 $1.90 16k

text-completion-codestral (2)

Model Input $/M Output $/M Cached $/M Context Features
text-completion-codestral/codestral-2405 $0.0000 $0.0000 32k
text-completion-codestral/codestral-latest $0.0000 $0.0000 32k

vertex ai-text-models (2)

Model Input $/M Output $/M Cached $/M Context Features
text-unicorn $10.00 $28.00 8k
text-unicorn@001 $10.00 $28.00 8k

vertex ai-zai models (2)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/zai-org/glm-4.7-maas $0.60 $2.20 200k 🔧 tools
vertex_ai/zai-org/glm-5-maas $1.00 $3.20 $0.10 200k 🔧 tools · 💾 cache

vertex ai-openai models (2)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/openai/gpt-oss-20b-maas $0.07 $0.30 131k
vertex_ai/openai/gpt-oss-120b-maas $0.15 $0.60 131k

vertex ai-minimax models (1)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/minimaxai/minimax-m2-maas $0.30 $1.20 197k 🔧 tools

vertex ai-moonshot models (1)

Model Input $/M Output $/M Cached $/M Context Features
vertex_ai/moonshotai/kimi-k2-thinking-maas $0.60 $2.50 256k 🔧 tools · 🌐 search

sarvam (1)

Model Input $/M Output $/M Cached $/M Context Features
sarvam/sarvam-m $0.0000 $0.0000 $0.0000 8k