Grok 4 Fast Reasoning
Provider: azure ai · azure_ai/grok-4-fast-reasoning
Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.20 |
| Output | $0.50 |
Monthly cost estimates
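Estimates assume a 30% prompt-cache hit rate where caching is available. Prompt caching is not listed for this model, so the figures below are computed at the full input rate. Adjust for your actual usage.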
| Tier | Monthly volume | Estimated cost |
|---|---|---|
| Small | 1M in / 200k out | $0.30 per month |
| Medium | 10M in / 2M out | $3.00 per month |
| Large | 100M in / 20M out | $30.00 per month |
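The tier estimates above follow directly from the per-token rates. Here is a minimal sketch of that arithmetic, assuming no cache discount applies (prompt caching is not listed for this model). The helper name `monthly_cost` and the `TIERS` mapping are illustrative, not part of any pricing API.

```python
from decimal import Decimal

# Rates from the pricing table, in USD per 1M tokens.
INPUT_RATE = Decimal("0.20")
OUTPUT_RATE = Decimal("0.50")
TOKENS_PER_MILLION = Decimal(1_000_000)


def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: USD cost for one month's token volume.

    Assumes every input token is billed at the full input rate
    (no prompt-cache discount).
    """
    total = Decimal(input_tokens) * INPUT_RATE + Decimal(output_tokens) * OUTPUT_RATE
    return (total / TOKENS_PER_MILLION).quantize(Decimal("0.01"))


# Token volumes for the three tiers listed on this page: (input, output).
TIERS = {
    "Small": (1_000_000, 200_000),
    "Medium": (10_000_000, 2_000_000),
    "Large": (100_000_000, 20_000_000),
}

for name, (tokens_in, tokens_out) in TIERS.items():
    print(f"{name}: ${monthly_cost(tokens_in, tokens_out)} per month")
```

To estimate your own workload, replace the tuples in `TIERS` with your measured monthly input and output token counts.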
Context
- Input window: 131,072 tokens
- Max output: 131,072 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ✅ Web search
- ✅ JSON / response schema
Compare Grok 4 Fast Reasoning with similar models
| Model | Provider | Input (USD per 1M tokens) | Output (USD per 1M tokens) |
|---|---|---|---|
| Gpt Oss 120b | azure ai | $0.15 | $0.60 |
| Llama 3.2 11B Vision Instruct | azure ai | $0.37 | $0.37 |
| Meta Llama 3.1 8B Instruct | azure ai | $0.30 | $0.61 |
| Phi 3 Medium 128k Instruct | azure ai | $0.17 | $0.68 |
| Phi 3 Mini 128k Instruct | azure ai | $0.13 | $0.52 |
| Phi 3 Small 128k Instruct | azure ai | $0.15 | $0.60 |
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.