Glm 4p7
Provider: fireworks ai ·
fireworks_ai/glm-4p7 Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.60 |
| Output | $2.20 |
| Cached input (read) | $0.30 |
💡 With prompt caching, you save up to 50% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$0.95
per month
Saves $0.09 via caching
Medium
10M in / 2000k out
$9.50
per month
Saves $0.90 via caching
Large
100M in / 20000k out
$95.00
per month
Saves $9.00 via caching
Context
- Input window
- 202,800 tokens
- Max output
- 202,800 tokens
Capabilities
- ⬜ Vision (image input)
- ✅ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ✅ JSON / response schema
Compare Glm 4p7 with similar models
Deepseek R1 Basic
fireworks ai
$0.55 in
/ $2.19 out
Deepseek V3
fireworks ai
$0.90 in
/ $0.90 out
Deepseek V3 0324
fireworks ai
$0.90 in
/ $0.90 out
Deepseek V3p1
fireworks ai
$0.56 in
/ $1.68 out
Deepseek V3p1 Terminus
fireworks ai
$0.56 in
/ $1.68 out
Deepseek V3p2
fireworks ai
$0.56 in
/ $1.68 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.