Glm 4p7

Provider: fireworks ai · fireworks_ai/glm-4p7

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.60	$0.000600
Output	$2.20	$0.0022
Cached input (read)	$0.30	$0.000300

💡 With prompt caching, you save up to 50% on cached input tokens — massive for repeated context like system prompts, RAG retrieval, or long conversations.

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$0.95

per month

Saves $0.09 via caching

Medium

10M in / 2000k out

$9.50

per month

Saves $0.90 via caching

Large

100M in / 20000k out

$95.00

per month

Saves $9.00 via caching

Context

Input window: 202,800 tokens
Max output: 202,800 tokens

Capabilities

⬜ Vision (image input)
✅ Function / tool calling
⬜ Prompt caching
⬜ Web search
✅ JSON / response schema

Compare Glm 4p7 with similar models

Deepseek V3p1 Terminus

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.