GPT-4o Realtime Preview 2024-12-17
Provider: azure
azure/us/gpt-4o-realtime-preview-2024-12-17
Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $5.50 |
| Output | $22.00 |
| Cached input (read) | $2.75 |
💡 With prompt caching, cached input tokens cost $2.75 per 1M instead of $5.50, a saving of 50%. This matters most for repeated context such as system prompts, RAG retrieval, or long conversations.
Monthly cost estimates
Estimates assume a 30% prompt-cache hit rate on input tokens, where caching is available. Adjust for your actual usage.
| Tier | Monthly volume | Cost per month | Saved via caching |
|---|---|---|---|
| Small | 1M in / 200k out | $9.08 | $0.82 |
| Medium | 10M in / 2M out | $90.75 | $8.25 |
| Large | 100M in / 20M out | $908 | $82.50 |
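The estimates above can be reproduced with a short calculation. The following is a minimal sketch, assuming the cache hit rate is the fraction of input tokens billed at the cached rate and that output tokens are never cached. The function name `monthly_cost` and its parameters are illustrative and not part of any provider API.

```python
# Prices in USD per 1M tokens, copied from the pricing table on this page.
INPUT_PRICE = 5.50
OUTPUT_PRICE = 22.00
CACHED_INPUT_PRICE = 2.75


def monthly_cost(input_tokens, output_tokens, cache_hit_rate=0.30):
    """Return (cost, savings) in USD for one month of usage.

    Assumption: cache_hit_rate is the share of input tokens billed at the
    cached rate; the remaining input tokens and all output tokens are
    billed at their normal rates.
    """
    millions_in = input_tokens / 1_000_000
    millions_out = output_tokens / 1_000_000

    cached = millions_in * cache_hit_rate
    uncached = millions_in - cached

    cost = (
        uncached * INPUT_PRICE
        + cached * CACHED_INPUT_PRICE
        + millions_out * OUTPUT_PRICE
    )
    # Savings are the discount applied to the cached share of the input.
    savings = cached * (INPUT_PRICE - CACHED_INPUT_PRICE)
    return cost, savings


if __name__ == "__main__":
    tiers = [
        ("Small", 1_000_000, 200_000),
        ("Medium", 10_000_000, 2_000_000),
        ("Large", 100_000_000, 20_000_000),
    ]
    for name, tokens_in, tokens_out in tiers:
        cost, savings = monthly_cost(tokens_in, tokens_out)
        print(f"{name}: ${cost:,.2f} per month, saves ${savings:,.2f} via caching")
```

To model your own workload, pass your measured token counts and a `cache_hit_rate` between 0 and 1. A rate of 0 gives the uncached cost.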
Context
- Input window: 128,000 tokens
- Max output: 4,096 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare GPT-4o Realtime Preview 2024-12-17 with similar models
| Model | Provider | Input (USD per 1M tokens) | Output (USD per 1M tokens) |
|---|---|---|---|
| Command R Plus | azure | $3.00 | $15.00 |
| GPT-4o 2024-08-06 | azure | $2.75 | $11.00 |
| GPT-4o 2024-11-20 | azure | $2.75 | $11.00 |
| GPT-4o Realtime Preview 2024-10-01 | azure | $5.50 | $22.00 |
| GPT-4o Realtime Preview 2024-12-17 | azure | $5.50 | $22.00 |
| GPT-4 0125 Preview | azure | $10.00 | $30.00 |
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.