Gpt 5.4 Mini
Provider: azure ·
azure/gpt-5.4-mini Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.75 |
| Output | $4.50 |
| Cached input (read) | $0.07 |
💡 With prompt caching, you save up to 90% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$1.45
per month
Saves $0.20 via caching
Medium
10M in / 2000k out
$14.47
per month
Saves $2.02 via caching
Large
100M in / 20000k out
$145
per month
Saves $20.25 via caching
Context
- Input window
- 1,050,000 tokens
- Max output
- 128,000 tokens
Capabilities
- ✅ Vision (image input)
- ✅ Function / tool calling
- ✅ Prompt caching
- ✅ Web search
- ✅ JSON / response schema
Compare Gpt 5.4 Mini with similar models
Gpt 4.1 Mini
azure
$0.40 in
/ $1.60 out
Gpt 4.1 Mini 2025 04 14
azure
$0.40 in
/ $1.60 out
Gpt 5.4 Mini 2026 03 17
azure
$0.75 in
/ $4.50 out
Gpt 4.1 Mini 2025 04 14
azure
$0.44 in
/ $1.76 out
Gpt 4o Mini Realtime Preview 2024 12 17
azure
$0.66 in
/ $2.64 out
Gpt 5 2025 08 07
azure
$1.38 in
/ $11.00 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.