Gpt 5 2025 08 07

Provider: azure · azure/gpt-5-2025-08-07

Pricing per million tokens

Component USD per 1M tokens
Input $1.25
Output $10.00
Cached input (read) $0.13

💡 With prompt caching, you save up to 90% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small
1M in / 200k out
$2.91
per month
Saves $0.34 via caching
Medium
10M in / 2000k out
$29.13
per month
Saves $3.38 via caching
Large
100M in / 20000k out
$291
per month
Saves $33.75 via caching

Context

Input window
272,000 tokens
Max output
128,000 tokens

Capabilities

  • ✅ Vision (image input)
  • ✅ Function / tool calling
  • ✅ Prompt caching
  • ⬜ Web search
  • ✅ JSON / response schema

Compare Gpt 5 2025 08 07 with similar models

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.