Qwen3 1p7b Fp8 Draft 40960
Provider: Fireworks AI
Model ID: fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-40960
Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $0.10 |
| Output | $0.10 |
Monthly cost estimates
Estimates assume a 30% prompt-cache hit rate where caching is available. Adjust for your actual usage.
| Tier | Input tokens | Output tokens | Estimated cost per month |
|---|---|---|---|
| Small | 1M | 200k | $0.12 |
| Medium | 10M | 2M | $1.20 |
| Large | 100M | 20M | $12.00 |
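The monthly estimates above can be reproduced from the listed per-token prices. The following is a minimal sketch, not the site's actual calculator: the function name `estimate_monthly_cost` and the `TIERS` dictionary are hypothetical, and it assumes no prompt-cache discount is applied, which matches the listed figures for this model.

```python
from decimal import Decimal

# Listed prices for this model, in USD per 1M tokens (from the pricing table).
INPUT_USD_PER_M = Decimal("0.10")
OUTPUT_USD_PER_M = Decimal("0.10")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly cost in USD, rounded to cents.

    Assumption: no prompt-cache discount is applied.
    """
    cost = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    ) / TOKENS_PER_M
    return cost.quantize(Decimal("0.01"))


# Hypothetical tier table mirroring the Small / Medium / Large examples.
TIERS = {
    "Small": (1_000_000, 200_000),
    "Medium": (10_000_000, 2_000_000),
    "Large": (100_000_000, 20_000_000),
}

for name, (tokens_in, tokens_out) in TIERS.items():
    print(f"{name}: ${estimate_monthly_cost(tokens_in, tokens_out)}")
# Small: $0.12
# Medium: $1.20
# Large: $12.00
```

To estimate your own workload, call `estimate_monthly_cost` with your expected monthly input and output token counts. `Decimal` is used instead of floats so that the rounding to cents is exact.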
Context
- Input window: 40,960 tokens
- Max output: 40,960 tokens
Capabilities
- ⬜ Vision (image input)
- ⬜ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Qwen3 1p7b Fp8 Draft 40960 with similar models
| Model | Provider | Input (USD per 1M tokens) | Output (USD per 1M tokens) |
|---|---|---|---|
| Code Qwen 1p5 7b | Fireworks AI | $0.20 | $0.20 |
| Hermes 2 Pro Mistral 7b | Fireworks AI | $0.20 | $0.20 |
| Mistral 7b | Fireworks AI | $0.20 | $0.20 |
| Mistral 7b Instruct 4k | Fireworks AI | $0.20 | $0.20 |
| Mistral 7b Instruct V0p2 | Fireworks AI | $0.20 | $0.20 |
| Mistral 7b Instruct V3 | Fireworks AI | $0.20 | $0.20 |
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.