Llama 4 Maverick
Available meta/llama-4-maverick · by Meta · mixture-of-experts
Pricing — 1 offering(s)
Together AI
Input tokens
- $0.15 / 1M tokens (input) Current 2025-04-05 → present
Output tokens
- $0.60 / 1M tokens (output) Current 2025-04-05 → present
Showing the active price and any recorded history. Full pricing history is available via the paid API — see API docs.
Capability profile
reasoning moderate
coding moderate
tool use moderate
instruction following strong
context window strong
multilingual moderate
speed strong
Operator guidance
Route here for the longest context available at this price point, or for vision tasks where GPT-4o cost is prohibitive. At the same $0.15/$0.60 price as GPT-4o mini, Maverick's open weights and 1M-token context are the differentiators. Step up to frontier closed models for the hardest reasoning tasks.
Use cases
- Ultra-long-context workloads (entire codebases, books, legal documents)
- Multimodal tasks (image + text) at budget cost
- Open-weights self-hosting where per-token cost must be minimised
Limitations
- Newer model; production stability still being established relative to GPT-4o mini
- Open weights — quality/price depends on the serving provider
- Capability ratings are qualitative, not a single cited benchmark run