Llama 3.3 70B Instruct
Provider: azure ai ·
azure_ai/Llama-3.3-70B-Instruct Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.71 |
| Output | $0.71 |
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$0.85
per month
Medium
10M in / 2000k out
$8.52
per month
Large
100M in / 20000k out
$85.20
per month
Context
- Input window
- 128,000 tokens
- Max output
- 2,048 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Llama 3.3 70B Instruct with similar models
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.