Llama 4 Scout 17B 16E Instruct

Provider: nscale · nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.09	$0.000090
Output	$0.29	$0.000290

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$0.15

per month

Medium

10M in / 2000k out

$1.48

per month

Large

100M in / 20000k out

$14.80

per month

Context

Input window: —

Capabilities

⬜ Vision (image input)
⬜ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Llama 4 Scout 17B 16E Instruct with similar models

$0.18 in / $0.20 out

Qwen2.5 Coder 32B Instruct

$0.06 in / $0.20 out

DeepSeek R1 Distill Qwen 1.5B

$0.09 in / $0.09 out

DeepSeek R1 Distill Qwen 14B

$0.07 in / $0.07 out

DeepSeek R1 Distill Qwen 32B

$0.15 in / $0.15 out

DeepSeek R1 Distill Llama 8B

$0.02 in / $0.02 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.