O4 Mini

Provider: openai · o4-mini

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$1.10	$0.0011
Output	$4.40	$0.0044
Cached input (read)	$0.28	$0.000275

💡 With prompt caching, you save up to 75% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$1.73

per month

Saves $0.25 via caching

Medium

10M in / 2000k out

$17.32

per month

Saves $2.48 via caching

Large

100M in / 20000k out

$173

per month

Saves $24.75 via caching

Context

Input window: 200,000 tokens
Max output: 100,000 tokens

Capabilities

✅ Vision (image input)
✅ Function / tool calling
✅ Prompt caching
✅ Web search
✅ JSON / response schema

Compare O4 Mini with similar models

Gpt Audio Mini

openai

$0.60 in / $2.40 out

Gpt Audio Mini 2025 10 06

openai

$0.60 in / $2.40 out

Gpt Audio Mini 2025 12 15

openai

$0.60 in / $2.40 out

Gpt 4o Mini Realtime Preview

openai

$0.60 in / $2.40 out

Gpt 4o Mini Realtime Preview 2024 12 17

$1.25 in / $10.00 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.