Meta Llama 3.1 8B Instruct Turbo

Provider: deepinfra · deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.02	$0.000020
Output	$0.03	$0.000030

Monthly cost estimates

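Estimates assume a 30% prompt-cache hit rate where caching is available. This model is not listed as supporting prompt caching, so the figures below reflect the uncached rates. Adjust for your actual usage.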

Tier	Monthly volume	Estimated cost
Small	1M in / 200k out	$0.03 per month
Medium	10M in / 2M out	$0.26 per month
Large	100M in / 20M out	$2.60 per month
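The monthly figures above can be reproduced from the per-million-token rates. The following is a minimal sketch, not the page's own calculator: the function name `monthly_cost` is hypothetical, and it applies no cache discount because this model is listed without prompt caching.

```python
# Listed rates for this model, in USD per 1M tokens.
INPUT_USD_PER_M = 0.02
OUTPUT_USD_PER_M = 0.03


def monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate monthly cost in USD at the listed rates (no cache discount)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# The three usage tiers from the estimates above.
tiers = [
    ("Small", 1_000_000, 200_000),
    ("Medium", 10_000_000, 2_000_000),
    ("Large", 100_000_000, 20_000_000),
]

for name, tokens_in, tokens_out in tiers:
    print(f"{name}: ${monthly_cost(tokens_in, tokens_out):.2f} per month")
```

The Small tier works out to $0.026 before rounding, which is why it displays as $0.03.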

Context

Input window: 131,072 tokens
Max output: 131,072 tokens
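One way to use these limits is a pre-flight check before sending a request. The sketch below is illustrative only: `plan_request` is a hypothetical helper, token counts are assumed to come from your own tokenizer, and the page does not say whether prompt and output share a single window, so the two limits are checked independently.

```python
# Limits as listed above, in tokens.
INPUT_WINDOW = 131_072
MAX_OUTPUT = 131_072


def plan_request(prompt_tokens: int, requested_output_tokens: int) -> int:
    """Return an output-token cap that respects the listed limits.

    Raises ValueError if the prompt alone exceeds the input window.
    """
    if prompt_tokens > INPUT_WINDOW:
        raise ValueError(
            f"prompt has {prompt_tokens} tokens; input window is {INPUT_WINDOW}"
        )
    # Clamp the requested output length to the listed maximum.
    return min(requested_output_tokens, MAX_OUTPUT)
```

If the provider enforces a combined limit on prompt plus output, the cap returned here would need to be reduced by the prompt length.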

Capabilities

⬜ Vision (image input)
✅ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Meta Llama 3.1 8B Instruct Turbo with similar models

$0.04 in / $0.08 out

Llama 3.2 3B Instruct

$0.02 in / $0.02 out

Meta Llama 3.1 8B Instruct

$0.03 in / $0.05 out

Mistral Nemo Instruct 2407

$0.02 in / $0.04 out

NVIDIA Nemotron Nano 9B V2

$0.04 in / $0.16 out

$0.04 in / $0.15 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.