Llama 2 70b Chat Hf

Provider: anyscale · anyscale/meta-llama/Llama-2-70b-chat-hf

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$1.00	$0.0010
Output	$1.00	$0.0010

Monthly cost estimates

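Estimates assume a 30% prompt-cache hit rate where caching is available. This model does not list prompt caching as a capability, so the figures below reflect the full input price. Adjust for your actual usage.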

Tier	Monthly volume	Estimated cost per month
Small	1M in / 200k out	$1.20
Medium	10M in / 2M out	$12.00
Large	100M in / 20M out	$120.00
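
The monthly estimates above follow directly from the per-million-token prices. The following is a minimal sketch of that arithmetic, assuming no prompt-cache discount applies (the page lists prompt caching as unsupported for this model). The names `monthly_cost` and `TIERS` are hypothetical and used only for illustration.

```python
from decimal import Decimal

# Prices taken from the pricing table, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("1.00")
OUTPUT_USD_PER_M = Decimal("1.00")
TOKENS_PER_M = Decimal(1_000_000)


def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly cost in USD, rounded to cents.

    Assumption: no caching discount, so every input token is billed
    at the full input price.
    """
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    ) / TOKENS_PER_M
    return total.quantize(Decimal("0.01"))


# Usage tiers as listed on the page: (input tokens, output tokens) per month.
TIERS = {
    "Small": (1_000_000, 200_000),
    "Medium": (10_000_000, 2_000_000),
    "Large": (100_000_000, 20_000_000),
}

for name, (tokens_in, tokens_out) in TIERS.items():
    print(f"{name}: ${monthly_cost(tokens_in, tokens_out)} per month")
```

Swap in your own token volumes to estimate a workload that falls between the listed tiers.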

Context

Input window: 4,096 tokens
Max output: 4,096 tokens
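
A request can be checked against these limits before it is sent. The sketch below treats the two figures as independent limits, exactly as listed here; whether the provider counts prompt and output tokens against a single shared window is not stated on this page, so verify that before relying on it. The function name `check_request` is hypothetical, and token counts are assumed to come from your own tokenizer.

```python
from typing import List

# Limits as listed on this page.
INPUT_WINDOW_TOKENS = 4096
MAX_OUTPUT_TOKENS = 4096


def check_request(prompt_tokens: int, requested_output_tokens: int) -> List[str]:
    """Return a list of limit violations; an empty list means the request fits.

    Assumption: the input and output limits are checked independently.
    """
    problems = []
    if prompt_tokens > INPUT_WINDOW_TOKENS:
        problems.append(
            f"prompt has {prompt_tokens} tokens, limit is {INPUT_WINDOW_TOKENS}"
        )
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        problems.append(
            f"requested {requested_output_tokens} output tokens, "
            f"limit is {MAX_OUTPUT_TOKENS}"
        )
    return problems


for issue in check_request(prompt_tokens=5000, requested_output_tokens=512):
    print(issue)
```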

Capabilities

⬜ Vision (image input)
⬜ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Llama 2 70b Chat Hf with similar models

Model	Input (USD per 1M tokens)	Output (USD per 1M tokens)
CodeLlama 34b Instruct Hf	$1.00	$1.00
CodeLlama 70b Instruct Hf	$1.00	$1.00
Meta Llama 3 70B Instruct	$1.00	$1.00
Llama 2 13b Chat Hf	$0.25	$0.25
Mixtral 8x22B Instruct V0.1	$0.90	$0.90

$0.15 in / $0.15 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.