Gemini 3 Flash Preview

Provider: vertex ai · vertex_ai/gemini-3-flash-preview

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.50	$0.000500
Output	$3.00	$0.0030
Cached input (read)	$0.05	$0.000050

💡 With prompt caching, you save up to 90% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$0.97

per month

Saves $0.14 via caching

Medium

10M in / 2000k out

$9.65

per month

Saves $1.35 via caching

Large

100M in / 20000k out

$96.50

per month

Saves $13.50 via caching

Context

Input window: 1,048,576 tokens
Max output: 65,535 tokens

Capabilities

✅ Vision (image input)
✅ Function / tool calling
✅ Prompt caching
✅ Web search
✅ JSON / response schema

Compare Gemini 3 Flash Preview with similar models

Gemini 3 Pro Preview

vertex ai

$2.00 in / $12.00 out

Gemini 3.1 Pro Preview

vertex ai

$2.00 in / $12.00 out

Gemini 3.1 Pro Preview Customtools

vertex ai

$2.00 in / $12.00 out

Grok 4.1 Fast Non Reasoning

vertex ai

$0.20 in / $0.50 out

Grok 4.1 Fast Reasoning

vertex ai

$0.20 in / $0.50 out

Grok 4.20 Non Reasoning

vertex ai

$2.00 in / $6.00 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.