← All models

Llama 4 Maverick

Available

meta/llama-4-maverick · by Meta · mixture-of-experts

Pricing — 1 offering(s)

Together AI

Input tokens

$0.15 / 1M tokens (input) Current 2025-04-05 → present

Output tokens

$0.60 / 1M tokens (output) Current 2025-04-05 → present

Showing the active price and any recorded history. Full pricing history is available via the paid API — see API docs.

Capability profile

reasoning moderate

coding moderate

tool use moderate

instruction following strong

context window strong

multilingual moderate

speed strong

Operator guidance

Route here for the longest context available at this price point, or for vision tasks where GPT-4o cost is prohibitive. At the same $0.15/$0.60 price as GPT-4o mini, Maverick's open weights and 1M-token context are the differentiators. Step up to frontier closed models for the hardest reasoning tasks.

Use cases

Ultra-long-context workloads (entire codebases, books, legal documents)
Multimodal tasks (image + text) at budget cost
Open-weights self-hosting where per-token cost must be minimised

Limitations

Newer model; production stability still being established relative to GPT-4o mini
Open weights — quality/price depends on the serving provider
Capability ratings are qualitative, not a single cited benchmark run