MusicGen Large
Available meta/musicgen-large · by Meta AI · transformer-decoder
Pricing — 1 offering(s)
Meta AI
Per generation (Replicate-hosted)
- $0.042 / generation Current 2023-08-01 → present
Showing the active price and any recorded history. Full pricing history is available via the paid API — see API docs.
Capability profile
audio quality moderate
style range strong
vocal support weak
duration ceiling weak
stem export weak
Benchmarks
| Benchmark | Score | Config | Source |
|---|---|---|---|
| FAD (Fréchet Audio Distance, published in paper) | — | Copet et al. report FAD and KL-divergence scores on MusicCaps dataset. MusicGen Large outperforms prior open models (MusicLM, Riffusion). Exact scores in the paper. | source ↗ |
Operator guidance
Choose MusicGen Large when open-weights ownership matters (CC BY-NC 4.0 allows commercial use with attribution), or when self-hosting eliminates per-call costs at scale. For production quality with vocals, use Suno v4. For longer-form high-fidelity instrumentals, use Stable Audio 2.0. For melody conditioning specifically, MusicGen's melody-conditioning feature is unique among major models.
Use cases
- Background music generation in applications where open-weights licensing matters
- Research and experimentation on music generation pipelines
- Self-hosted music generation to eliminate per-call API costs at scale
- Melody-conditioned generation: constraining output to follow a reference melody
Limitations
- Instrumental only — no vocal generation
- 30-second generation ceiling per call; longer clips require stitching
- CC BY-NC 4.0 licence — commercial use requires attribution; no-commercial clause applies to the model weights
- Requires GPU infrastructure for reasonable generation speed (no first-party hosted API)