Codellama 7b Instruct Awq
Provider: cloudflare ·
cloudflare/@hf/thebloke/codellama-7b-instruct-awq Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $1.92 |
| Output | $1.92 |
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$2.31
per month
Medium
10M in / 2000k out
$23.08
per month
Large
100M in / 20000k out
$231
per month
Context
- Input window
- 4,096 tokens
- Max output
- 4,096 tokens
Capabilities
- ⬜ Vision (image input)
- ⬜ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Codellama 7b Instruct Awq with similar models
Llama 2 7b Chat Fp16
cloudflare
$1.92 in
/ $1.92 out
Llama 2 7b Chat Int8
cloudflare
$1.92 in
/ $1.92 out
Mistral 7b Instruct V0.1
cloudflare
$1.92 in
/ $1.92 out
CodeLlama 34b Instruct Hf
anyscale
$1.00 in
/ $1.00 out
CodeLlama 70b Instruct Hf
anyscale
$1.00 in
/ $1.00 out
Llama 2 70b Chat Hf
anyscale
$1.00 in
/ $1.00 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.