Gpt 4o Realtime Preview 2024 12 17

Provider: azure · azure/us/gpt-4o-realtime-preview-2024-12-17

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$5.50	$0.0055
Output	$22.00	$0.022
Cached input (read)	$2.75	$0.00275

💡 With prompt caching, cached input tokens cost $2.75 per 1M instead of $5.50, a 50% saving. This matters most for repeated context such as system prompts, RAG retrieval, or long conversations.
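The per-1k figures and the caching discount follow directly from the per-1M prices. A minimal sketch of that arithmetic, using the prices listed above (the names `PRICE_PER_1M`, `per_1k`, and `cached_discount` are illustrative, not part of any provider API):

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table.
PRICE_PER_1M = {
    "input": Decimal("5.50"),
    "output": Decimal("22.00"),
    "cached_input": Decimal("2.75"),
}


def per_1k(price_per_1m: Decimal) -> Decimal:
    """Convert a per-1M-token price to a per-1k-token price."""
    return price_per_1m / Decimal(1000)


def cached_discount(input_price: Decimal, cached_price: Decimal) -> Decimal:
    """Fraction saved on each cached input token versus the full input rate."""
    return (input_price - cached_price) / input_price


if __name__ == "__main__":
    for name, price in PRICE_PER_1M.items():
        print(f"{name}: ${per_1k(price)} per 1k tokens")
    discount = cached_discount(PRICE_PER_1M["input"], PRICE_PER_1M["cached_input"])
    print(f"cached input discount: {discount:.0%}")
```

Decimal arithmetic is used here so that small per-1k values are not distorted by binary floating-point rounding.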

Monthly cost estimates

Estimates assume a 30% prompt-cache hit rate on input tokens. Adjust for your actual usage.

Tier	Monthly volume	Cost per month	Saved via caching
Small	1M in / 200k out	$9.08	$0.82
Medium	10M in / 2M out	$90.75	$8.25
Large	100M in / 20M out	$907.50	$82.50
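To adapt these estimates to a different volume or cache hit rate, the calculation can be sketched as follows. This is a hedged reconstruction, not the page's actual calculator: it assumes the hit rate applies to input tokens only and that cached tokens are billed at the cached-read rate. The function name `monthly_cost` and its parameters are illustrative.

```python
def monthly_cost(
    input_tokens: float,
    output_tokens: float,
    cache_hit_rate: float = 0.30,   # assumed share of input tokens served from cache
    input_price: float = 5.50,      # USD per 1M input tokens
    output_price: float = 22.00,    # USD per 1M output tokens
    cached_price: float = 2.75,     # USD per 1M cached input tokens
) -> tuple[float, float]:
    """Return (total_cost, caching_savings) in USD for one month of usage."""
    cached = input_tokens * cache_hit_rate
    uncached = input_tokens - cached
    input_cost = (uncached * input_price + cached * cached_price) / 1_000_000
    output_cost = output_tokens * output_price / 1_000_000
    # Savings are the difference between full and cached rates on cached tokens.
    savings = cached * (input_price - cached_price) / 1_000_000
    return input_cost + output_cost, savings


if __name__ == "__main__":
    tiers = {
        "Small": (1_000_000, 200_000),
        "Medium": (10_000_000, 2_000_000),
        "Large": (100_000_000, 20_000_000),
    }
    for tier, (tokens_in, tokens_out) in tiers.items():
        total, saved = monthly_cost(tokens_in, tokens_out)
        print(f"{tier}: ${total:.2f} per month, ${saved:.2f} saved via caching")
```

Under these assumptions the small tier comes to $9.075 and the large tier to $907.50 before rounding, with $0.825 and $82.50 saved respectively.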

Context

Input window: 128,000 tokens
Max output: 4,096 tokens
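These limits also put a ceiling on what a single request can cost. The sketch below is illustrative only: it treats the input window and the output cap as independent limits, ignores caching, and uses the listed per-1M prices. `request_cost` is a hypothetical helper, not a provider API.

```python
INPUT_WINDOW = 128_000  # tokens, from the context limits listed here
MAX_OUTPUT = 4_096      # tokens, from the context limits listed here


def request_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: float = 5.50,    # USD per 1M input tokens
    output_price: float = 22.00,  # USD per 1M output tokens
) -> float:
    """Return the uncached cost in USD of one request, enforcing both limits."""
    if not 0 <= input_tokens <= INPUT_WINDOW:
        raise ValueError(f"input_tokens must be between 0 and {INPUT_WINDOW}")
    if not 0 <= output_tokens <= MAX_OUTPUT:
        raise ValueError(f"output_tokens must be between 0 and {MAX_OUTPUT}")
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    worst_case = request_cost(INPUT_WINDOW, MAX_OUTPUT)
    print(f"Most expensive single request: ${worst_case:.4f}")
```

A request that fills the whole input window and produces the maximum output costs about $0.79 at these rates ($0.704 for input plus roughly $0.09 for output).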

Capabilities

⬜ Vision (image input)
✅ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Gpt 4o Realtime Preview 2024 12 17 with similar models

Model	Provider	Input (USD per 1M tokens)	Output (USD per 1M tokens)
Command R Plus	azure	$3.00	$15.00
Gpt 4o 2024 08 06	azure	$2.75	$11.00
Gpt 4o 2024 11 20	azure	$2.75	$11.00
Gpt 4o Realtime Preview 2024 10 01	azure	$5.50	$22.00
Gpt 4o Realtime Preview 2024 12 17	azure	$5.50	$22.00
Gpt 4 0125 Preview	azure	$10.00	$30.00
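Headline prices are easier to compare once applied to a concrete workload. The sketch below ranks the models listed above for 1M input and 200k output tokens per month. It ignores caching, since cached rates for the other models are not given here, and the names `MODELS` and `workload_cost` are illustrative.

```python
# (input, output) prices in USD per 1M tokens, copied from the comparison list.
MODELS = {
    "Command R Plus": (3.00, 15.00),
    "Gpt 4o 2024 08 06": (2.75, 11.00),
    "Gpt 4o 2024 11 20": (2.75, 11.00),
    "Gpt 4o Realtime Preview 2024 10 01": (5.50, 22.00),
    "Gpt 4o Realtime Preview 2024 12 17": (5.50, 22.00),
    "Gpt 4 0125 Preview": (10.00, 30.00),
}


def workload_cost(prices: tuple[float, float], input_tokens: int, output_tokens: int) -> float:
    """Return the uncached cost in USD of a workload at the given per-1M prices."""
    input_price, output_price = prices
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_models(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [
        (name, workload_cost(prices, input_tokens, output_tokens))
        for name, prices in MODELS.items()
    ]
    # sorted() is stable, so equally priced models keep their listed order.
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, cost in rank_models(1_000_000, 200_000):
        print(f"{name}: ${cost:.2f}")
```

For this workload the two Gpt 4o snapshots are cheapest at $4.95, the realtime previews cost $9.90, and Gpt 4 0125 Preview is the most expensive at $16.00.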

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.