Llama V3p1 405b Instruct Long

Provider: fireworks ai · fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long

Pricing per million tokens

Component USD per 1M tokens
Input $0.10
Output $0.10

Monthly cost estimates

Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.

Small
1M in / 200k out
$0.12
per month
Medium
10M in / 2000k out
$1.20
per month
Large
100M in / 20000k out
$12.00
per month

Context

Input window
4,096 tokens
Max output
4,096 tokens

Capabilities

  • ⬜ Vision (image input)
  • ⬜ Function / tool calling
  • ⬜ Prompt caching
  • ⬜ Web search
  • ⬜ JSON / response schema

Compare Llama V3p1 405b Instruct Long with similar models

Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.