Llama 3.1 405b Instruct Maas
Provider: vertex ai-llama models ·
vertex_ai/meta/llama-3.1-405b-instruct-maas Pricing per million tokens
| Component | USD per 1M tokens |
|---|---|
| Input | $5.00 |
| Output | $16.00 |
Monthly cost estimates
Assuming 30% prompt-cache hit rate where available. Adjust for your actual usage.
Small
1M in / 200k out
$8.20
per month
Medium
10M in / 2000k out
$82.00
per month
Large
100M in / 20000k out
$820
per month
Context
- Input window
- 128,000 tokens
- Max output
- 2,048 tokens
Capabilities
- ✅ Vision (image input)
- ⬜ Function / tool calling
- ⬜ Prompt caching
- ⬜ Web search
- ⬜ JSON / response schema
Compare Llama 3.1 405b Instruct Maas with similar models
Us.Writer.Palmyra X4 V1:0
bedrock converse
$2.50 in
/ $10.00 out
Writer.Palmyra X4 V1:0
bedrock converse
$2.50 in
/ $10.00 out
Anthropic.Claude 3 7 Sonnet 20240620 V1:0
bedrock
$3.60 in
/ $18.00 out
Anthropic.Claude 3 7 Sonnet 20250219 V1:0
bedrock converse
$3.00 in
/ $15.00 out
Anthropic.Claude 3 Sonnet 20240229 V1:0
bedrock
$3.00 in
/ $15.00 out
Anthropic.Claude Opus 4 5 20251101 V1:0
bedrock converse
$5.00 in
/ $25.00 out
Pricing data sourced from LiteLLM and refreshed regularly. Last updated May 20, 2026. Always verify with the provider's official pricing page before making business decisions.