Llama 4 Scout 17B 16E Instruct
Provider: nscale ·
nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.09 |
| Output | $0.29 |
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$0.15
per month
Medium
10M in / 2000k out
$1.48
per month
Large
100M in / 20000k out
$14.80
per month
Context
- Input window
- —
Capabilities
- ⬜ Vision (image input)
- ⬜ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Llama 4 Scout 17B 16E Instruct with similar models
QwQ 32B
nscale
$0.18 in
/ $0.20 out
Qwen2.5 Coder 32B Instruct
nscale
$0.06 in
/ $0.20 out
DeepSeek R1 Distill Qwen 1.5B
nscale
$0.09 in
/ $0.09 out
DeepSeek R1 Distill Qwen 14B
nscale
$0.07 in
/ $0.07 out
DeepSeek R1 Distill Qwen 32B
nscale
$0.15 in
/ $0.15 out
DeepSeek R1 Distill Llama 8B
nscale
$0.02 in
/ $0.02 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.