Glm 4.6
Provider: vercel ai gateway ·
vercel_ai_gateway/zai/glm-4.6 Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.45 |
| Output | $1.80 |
| Cached input (read) | $0.11 |
💡 With prompt caching, you save up to 76% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$0.71
per month
Saves $0.10 via caching
Medium
10M in / 2000k out
$7.08
per month
Saves $1.02 via caching
Large
100M in / 20000k out
$70.80
per month
Saves $10.20 via caching
Context
- Input window
- 200,000 tokens
- Max output
- 200,000 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Glm 4.6 with similar models
Qwen3 Coder
vercel ai gateway
$0.40 in
/ $1.60 out
Nova Pro
vercel ai gateway
$0.80 in
/ $3.20 out
Claude 3 Haiku
vercel ai gateway
$0.25 in
/ $1.25 out
Claude 3.5 Haiku
vercel ai gateway
$0.80 in
/ $4.00 out
Deepseek R1
vercel ai gateway
$0.55 in
/ $2.19 out
Deepseek R1 Distill Llama 70b
vercel ai gateway
$0.75 in
/ $0.99 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.