Qwen3 1p7b Fp8 Draft 40960

Provider: fireworks ai · fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-40960

Pricing per million tokens

Component	USD per 1M tokens	USD per 1k tokens
Input	$0.10	$0.000100
Output	$0.10	$0.000100

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small

1M in / 200k out

$0.12

per month

Medium

10M in / 2000k out

$1.20

per month

Large

100M in / 20000k out

$12.00

per month

Context

Input window: 40,960 tokens
Max output: 40,960 tokens

Capabilities

⬜ Vision (image input)
⬜ Function / tool calling
⬜ Prompt caching
⬜ Web search
⬜ JSON / response schema

Compare Qwen3 1p7b Fp8 Draft 40960 with similar models

Code Qwen 1p5 7b

$0.20 in / $0.20 out

Hermes 2 Pro Mistral 7b

$0.20 in / $0.20 out

$0.20 in / $0.20 out

Mistral 7b Instruct 4k

$0.20 in / $0.20 out

Mistral 7b Instruct V0p2

$0.20 in / $0.20 out

Mistral 7b Instruct V3

$0.20 in / $0.20 out

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.