Meta Llama 3.1 8B Instruct Turbo
Provider: deepinfra · deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.02 |
| Output | $0.03 |
Monthly cost estimates
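Estimates assume a 30% prompt-cache hit rate where available. This model does not support prompt caching, so the figures below use the full input rate. Adjust for your actual usage.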
| Tier | Input tokens | Output tokens | Estimated cost per month |
|---|---|---|---|
| Small | 1M | 200k | $0.03 |
| Medium | 10M | 2M | $0.26 |
| Large | 100M | 20M | $2.60 |
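The monthly figures follow directly from the per-token prices. Here is a minimal Python sketch of that arithmetic, assuming no prompt-cache discount, which is consistent with the listed totals. The function name `monthly_cost` and the `TIERS` mapping are illustrative and not part of any provider API.

```python
# Prices copied from the pricing table (USD per 1M tokens).
INPUT_USD_PER_M = 0.02
OUTPUT_USD_PER_M = 0.03


def monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated monthly spend in USD, assuming no prompt-cache discount."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Illustrative usage tiers: (input tokens, output tokens) per month.
TIERS = {
    "Small": (1_000_000, 200_000),
    "Medium": (10_000_000, 2_000_000),
    "Large": (100_000_000, 20_000_000),
}

for name, (tokens_in, tokens_out) in TIERS.items():
    print(f"{name}: ${monthly_cost(tokens_in, tokens_out):.2f} per month")
```

To estimate your own workload, call `monthly_cost` with your measured monthly token counts.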
Context
- Input window: 131,072 tokens
- Max output: 131,072 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Meta Llama 3.1 8B Instruct Turbo with similar models
| Model | Provider | Input (USD per 1M tokens) | Output (USD per 1M tokens) |
|---|---|---|---|
| Gemma 3 4b It | deepinfra | $0.04 | $0.08 |
| Llama 3.2 3B Instruct | deepinfra | $0.02 | $0.02 |
| Meta Llama 3.1 8B Instruct | deepinfra | $0.03 | $0.05 |
| Mistral Nemo Instruct 2407 | deepinfra | $0.02 | $0.04 |
| NVIDIA Nemotron Nano 9B V2 | deepinfra | $0.04 | $0.16 |
| Gpt Oss 20b | deepinfra | $0.04 | $0.15 |
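Per-token prices are easier to compare when applied to a concrete workload. The sketch below ranks the listed models by cost for a given monthly volume, using the prices shown on this page. The names `PRICES`, `workload_cost`, and `rank_by_cost` are illustrative, and the ranking reflects price only, not quality or latency.

```python
# (input, output) prices in USD per 1M tokens, copied from this page.
PRICES = {
    "Meta Llama 3.1 8B Instruct Turbo": (0.02, 0.03),
    "Gemma 3 4b It": (0.04, 0.08),
    "Llama 3.2 3B Instruct": (0.02, 0.02),
    "Meta Llama 3.1 8B Instruct": (0.03, 0.05),
    "Mistral Nemo Instruct 2407": (0.02, 0.04),
    "NVIDIA Nemotron Nano 9B V2": (0.04, 0.16),
    "Gpt Oss 20b": (0.04, 0.15),
}


def workload_cost(price_in: float, price_out: float,
                  millions_in: float, millions_out: float) -> float:
    """Cost in USD for a workload expressed in millions of tokens."""
    return price_in * millions_in + price_out * millions_out


def rank_by_cost(millions_in: float, millions_out: float) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost(p_in, p_out, millions_in, millions_out)
        for name, (p_in, p_out) in PRICES.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Example: 10M input tokens and 2M output tokens per month.
for model, cost in rank_by_cost(10, 2):
    print(f"{model}: ${cost:.2f}")
```

Because output prices vary more than input prices across these models, the ranking can shift for output-heavy workloads, so it is worth rerunning with your own input and output volumes.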
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.