Llama3 8b Instruct
Provider: gradient ai ·
gradient_ai/llama3-8b-instruct Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.20 |
| Output | $0.20 |
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$0.24
per month
Medium
10M in / 2000k out
$2.40
per month
Large
100M in / 20000k out
$24.00
per month
Context
- Input window
- 8,192 tokens
- Max output
- 512 tokens
Capabilities
- ⬜ Vision (image input)
- ⬜ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Llama3 8b Instruct with similar models
Mistral Nemo Instruct 2407
gradient ai
$0.30 in
/ $0.30 out
Anthropic Claude 3.5 Haiku
gradient ai
$0.80 in
/ $4.00 out
Llama3.3 70b Instruct
gradient ai
$0.65 in
/ $0.65 out
Zephyr 7b Beta
anyscale
$0.15 in
/ $0.15 out
Gemma 7b It
anyscale
$0.15 in
/ $0.15 out
Llama 2 13b Chat Hf
anyscale
$0.25 in
/ $0.25 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.