Claude 3.7 Sonnet Latest

Provider: deepinfra · deepinfra/anthropic/claude-3-7-sonnet-latest

Pricing per million tokens

Component            USD per 1M tokens
Input                $3.30
Output               $16.50
Cached input (read)  $0.33

💡 With prompt caching, cached input tokens are billed at $0.33 instead of $3.30 per 1M, a 90% saving. This matters most for repeated context such as system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Estimates assume a 30% prompt-cache hit rate where caching is available. Adjust for your actual usage.

Small: 1M in / 200k out, $5.71 per month (saves $0.89 via caching)
Medium: 10M in / 2M out, $57.09 per month (saves $8.91 via caching)
Large: 100M in / 20M out, $571 per month (saves $89.10 via caching)
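
The estimates above can be reproduced with a short calculation. The sketch below is illustrative only: the function name `monthly_cost` is hypothetical, and it assumes the cache hit rate is the fraction of input tokens billed at the cached-read price, with no separate cache-write charge (this page does not list one).

```python
# Prices in USD per 1M tokens, copied from the pricing table on this page.
INPUT_PRICE = 3.30
OUTPUT_PRICE = 16.50
CACHED_INPUT_PRICE = 0.33


def monthly_cost(input_tokens, output_tokens, cache_hit_rate=0.30):
    """Return (total_cost, caching_savings) in USD.

    Assumption: cache_hit_rate is the share of input tokens billed at the
    cached-read price; any cache-write surcharge is ignored.
    """
    cached = input_tokens * cache_hit_rate
    uncached = input_tokens - cached
    total = (
        uncached * INPUT_PRICE
        + cached * CACHED_INPUT_PRICE
        + output_tokens * OUTPUT_PRICE
    ) / 1_000_000
    # Savings relative to paying the full input price for the cached tokens.
    savings = cached * (INPUT_PRICE - CACHED_INPUT_PRICE) / 1_000_000
    return total, savings


if __name__ == "__main__":
    for name, tokens_in, tokens_out in [
        ("Small", 1_000_000, 200_000),
        ("Medium", 10_000_000, 2_000_000),
        ("Large", 100_000_000, 20_000_000),
    ]:
        total, savings = monthly_cost(tokens_in, tokens_out)
        print(f"{name}: ${total:.2f} per month, saves ${savings:.2f} via caching")
```

To model a different workload, pass your own token counts and hit rate, for example `monthly_cost(5_000_000, 1_000_000, cache_hit_rate=0.5)`.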

Context

Input window: 200,000 tokens
Max output: 200,000 tokens

Capabilities

  • ⬜ Vision (image input)
  • ✅ Function / tool calling
  • ⬜ Prompt caching
  • ⬜ Web search
  • ⬜ JSON / response schema

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.