Gemini 3 Flash Preview

Provider: Vertex AI · vertex_ai/gemini-3-flash-preview

Pricing per million tokens

Component              USD per 1M tokens
Input                  $0.50
Output                 $3.00
Cached input (read)    $0.05

💡 With prompt caching, cached input tokens cost 90% less than uncached ones ($0.05 vs. $0.50 per 1M). The savings are largest for repeated context such as system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Estimates assume that 30% of input tokens are served from the prompt cache. Adjust for your actual usage.

Small (1M in / 200k out): $0.97 per month; saves $0.14 via caching
Medium (10M in / 2M out): $9.65 per month; saves $1.35 via caching
Large (100M in / 20M out): $96.50 per month; saves $13.50 via caching
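
The estimates above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, not an official calculator: the function name `monthly_cost` and its parameters are hypothetical, and it assumes the cache hit rate applies to input tokens only and that cache storage or write fees (not listed on this page) are zero.

```python
# Prices in USD per 1M tokens, copied from the pricing table on this page.
INPUT_PRICE = 0.50
OUTPUT_PRICE = 3.00
CACHED_INPUT_PRICE = 0.05


def monthly_cost(input_tokens, output_tokens, cache_hit_rate=0.30):
    """Return (cost_with_caching, savings_from_caching) in USD.

    Assumption: cache_hit_rate is the fraction of input tokens billed
    at the cached-read price; output tokens are never cached.
    """
    cached = input_tokens * cache_hit_rate
    uncached = input_tokens - cached
    with_cache = (
        uncached * INPUT_PRICE
        + cached * CACHED_INPUT_PRICE
        + output_tokens * OUTPUT_PRICE
    ) / 1_000_000
    without_cache = (
        input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE
    ) / 1_000_000
    return with_cache, without_cache - with_cache


if __name__ == "__main__":
    tiers = {
        "Small": (1_000_000, 200_000),
        "Medium": (10_000_000, 2_000_000),
        "Large": (100_000_000, 20_000_000),
    }
    for name, (tokens_in, tokens_out) in tiers.items():
        cost, saved = monthly_cost(tokens_in, tokens_out)
        print(f"{name}: ${cost:.3f} per month, saves ${saved:.3f}")
```

For the Small tier this works out to roughly $0.965 per month with about $0.135 saved, which the page rounds to $0.97 and $0.14. Passing a different `cache_hit_rate` shows how sensitive the total is to your real cache usage.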

Context

Input window: 1,048,576 tokens
Max output: 65,535 tokens
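
One way to use these limits is a pre-flight check before sending a request. This is a rough sketch under stated assumptions: `fits_limits` is a hypothetical helper, the token counts are assumed to come from whatever tokenizer or count-tokens facility you already use, and the two limits are treated as independent (whether output tokens also count against the input window should be confirmed with the provider).

```python
# Limits copied from the Context section of this page.
INPUT_WINDOW = 1_048_576
MAX_OUTPUT = 65_535


def fits_limits(prompt_tokens, requested_output_tokens):
    """Return True if both counts are within the listed limits.

    Assumption: the input window and the output cap are checked
    separately; this page does not say how they interact.
    """
    return (
        0 <= prompt_tokens <= INPUT_WINDOW
        and 0 <= requested_output_tokens <= MAX_OUTPUT
    )


if __name__ == "__main__":
    print(fits_limits(900_000, 8_192))    # within both limits
    print(fits_limits(1_200_000, 8_192))  # prompt exceeds the input window
```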

Capabilities

  • ✅ Vision (image input)
  • ✅ Function / tool calling
  • ✅ Prompt caching
  • ✅ Web search
  • ✅ JSON / response schema


Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.