Qwen2.5 VL 32B Instruct

Provider: deepinfra · deepinfra/Qwen/Qwen2.5-VL-32B-Instruct

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.20	$0.000200
Output	$0.60	$0.000600

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$0.32

per month

Medium

10M in / 2000k out

$3.20

per month

Large

100M in / 20000k out

$32.00

per month

Context

Input window: 128,000 tokens
Max output: 128,000 tokens

Capabilities

✅ Vision (image input)
✅ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Qwen2.5 VL 32B Instruct with similar models

Hermes 3 Llama 3.1 70B

$0.30 in / $0.30 out

$0.15 in / $0.40 out

DeepSeek R1 Distill Llama 70B

$0.20 in / $0.60 out

DeepSeek R1 Distill Qwen 32B

$0.27 in / $0.27 out

$0.38 in / $0.89 out

DeepSeek V3 0324

$0.25 in / $0.88 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.