Llama 3.1 8B Instruct
Provider: ovhcloud ·
ovhcloud/Llama-3.1-8B-Instruct Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.10 |
| Output | $0.10 |
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$0.12
per month
Medium
10M in / 2000k out
$1.20
per month
Large
100M in / 20000k out
$12.00
per month
Context
- Input window
- 131,000 tokens
- Max output
- 131,000 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ✅ JSON / response schema
Compare Llama 3.1 8B Instruct with similar models
Mistral 7B Instruct V0.3
ovhcloud
$0.10 in
/ $0.10 out
Mistral Nemo Instruct 2407
ovhcloud
$0.13 in
/ $0.13 out
Mistral Small 3.2 24B Instruct 2506
ovhcloud
$0.09 in
/ $0.28 out
Gpt Oss 120b
ovhcloud
$0.08 in
/ $0.40 out
Mamba Codestral 7B V0.1
ovhcloud
$0.19 in
/ $0.19 out
Qwen3 32B
ovhcloud
$0.08 in
/ $0.23 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.