Llama V2 70b Chat

Provider: fireworks ai · fireworks_ai/accounts/fireworks/models/llama-v2-70b-chat

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.90	$0.000900
Output	$0.90	$0.000900

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$1.08

per month

Medium

10M in / 2000k out

$10.80

per month

Large

100M in / 20000k out

$108

per month

Context

Input window: 2,048 tokens
Max output: 2,048 tokens

Capabilities

⬜ Vision (image input)
⬜ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Llama V2 70b Chat with similar models

Qwen2p5 Coder 32b Instruct

$0.90 in / $0.90 out

$0.90 in / $0.90 out

Code Llama 70b Instruct

$0.90 in / $0.90 out

Code Llama 70b Python

$0.90 in / $0.90 out

$0.90 in / $0.90 out

Nous Hermes 2 Yi 34b

$0.90 in / $0.90 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.