Llama 3.1 8B Instruct

Provider: ovhcloud · ovhcloud/Llama-3.1-8B-Instruct

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.10	$0.000100
Output	$0.10	$0.000100

Monthly cost estimates

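Estimates assume a 30% prompt-cache hit rate where caching is available. This model does not list prompt caching, so the figures below are at full price. Adjust for your actual usage.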

Small

1M in / 200k out

$0.12

per month

Medium

10M in / 2M out

$1.20

per month

Large

100M in / 20M out

$12.00

per month
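
The monthly figures above can be reproduced from the per-million prices. The following is a minimal sketch, assuming no prompt-cache discount applies (the capability list marks prompt caching as unsupported for this model). The function name `monthly_cost` and the `TIERS` dictionary are illustrative and not part of any provider API.

```python
from decimal import Decimal

# Prices taken from the pricing table, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.10")
OUTPUT_USD_PER_M = Decimal("0.10")
TOKENS_PER_M = Decimal(1_000_000)


def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated monthly cost in USD at full (uncached) price."""
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    ) / TOKENS_PER_M
    # Round to whole cents for display.
    return total.quantize(Decimal("0.01"))


# Hypothetical usage tiers matching the Small / Medium / Large cards.
TIERS = {
    "Small": (1_000_000, 200_000),
    "Medium": (10_000_000, 2_000_000),
    "Large": (100_000_000, 20_000_000),
}

for name, (tokens_in, tokens_out) in TIERS.items():
    print(f"{name}: ${monthly_cost(tokens_in, tokens_out)} per month")
```

Decimal is used instead of float so that the cent rounding is exact. To estimate your own workload, pass your monthly input and output token counts to `monthly_cost`.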

Context

Input window: 131,000 tokens
Max output: 131,000 tokens

Capabilities

⬜ Vision (image input)
✅ Function / tool calling
⬜ Prompt caching
⬜ Web search
✅ JSON / response schema

Compare Llama 3.1 8B Instruct with similar models

Mistral 7B Instruct V0.3

$0.10 in / $0.10 out

Mistral Nemo Instruct 2407

$0.13 in / $0.13 out

Mistral Small 3.2 24B Instruct 2506

$0.09 in / $0.28 out

$0.08 in / $0.40 out

Mamba Codestral 7B V0.1

$0.19 in / $0.19 out

$0.08 in / $0.23 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.