Llama 2 7b Chat Int8

Provider: cloudflare · cloudflare/@cf/meta/llama-2-7b-chat-int8

Pricing per million tokens

Component USD per 1M tokens
Input $1.92
Output $1.92

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small
1M in / 200k out
$2.31
per month
Medium
10M in / 2000k out
$23.08
per month
Large
100M in / 20000k out
$231
per month

Context

Input window
2,048 tokens
Max output
2,048 tokens

Capabilities

  • ⬜ Vision (image input)
  • ⬜ Function / tool calling
  • ⬜ Prompt caching
  • ⬜ Web search
  • ⬜ JSON / response schema

Compare Llama 2 7b Chat Int8 with similar models

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.