Gpt 4o Mini

Provider: azure · azure/gpt-4o-mini

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.17	$0.000165
Output	$0.66	$0.000660
Cached input (read)	$0.07	$0.000075

💡 With prompt caching, you save up to 55% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$0.27

per month

Saves $0.03 via caching

Medium

10M in / 2000k out

$2.70

per month

Saves $0.27 via caching

Large

100M in / 20000k out

$27.00

per month

Saves $2.70 via caching

Context

Input window: 128,000 tokens
Max output: 16,384 tokens

Capabilities

✅ Vision (image input)
✅ Function / tool calling
✅ Prompt caching
⬜ Web search
✅ JSON / response schema

Compare Gpt 4o Mini with similar models

Gpt 4o Mini 2024 07 18

Gpt 4o Mini 2024 07 18

azure

$0.17 in / $0.66 out

Gpt 4o Mini 2024 07 18

azure

$0.17 in / $0.66 out

Gpt 4o Mini Realtime Preview 2024 12 17

azure

$0.66 in / $2.64 out

Gpt 5 Mini 2025 08 07

azure

$0.28 in / $2.20 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.