Gpt Realtime

Provider: openai · gpt-realtime

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$4.00	$0.0040
Output	$16.00	$0.02
Cached input (read)	$0.40	$0.000400

💡 With prompt caching, you save up to 90% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$6.12

per month

Saves $1.08 via caching

Medium

10M in / 2000k out

$61.20

per month

Saves $10.80 via caching

Large

100M in / 20000k out

$612

per month

Saves $108 via caching

Context

Input window: 32,000 tokens
Max output: 4,096 tokens

Capabilities

⬜ Vision (image input)
✅ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Gpt Realtime with similar models

Ft:Gpt 3.5 Turbo

openai

$3.00 in / $6.00 out

Ft:Gpt 3.5 Turbo 0125

openai

$3.00 in / $6.00 out

Ft:Gpt 3.5 Turbo 1106

$4.00 in / $16.00 out

Gpt Realtime 2

openai

$4.00 in / $16.00 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.