← All models

Stable Audio 2.0

Available

stability-ai/stable-audio-2 · by Stability AI · Diffusion transformer (DiT)

Pricing — 1 offering(s)

Stability AI

Per generation (Professional plan)

  • $0.20 / generation Current 2024-04-01 → present

Showing the active price and any recorded history. Full pricing history is available via the paid API — see API docs.

Capability profile

audio quality strong
style range strong
vocal support weak
duration ceiling strong
stem export weak

Benchmarks

Benchmark Score Config Source
CLAP score and FAD (Stability AI internal) Stability AI reports improved CLAP and FAD scores vs Stable Audio 1.0. No independent third-party benchmark published comparing against Suno v4 or Udio on a standardised dataset.

Operator guidance

The best option for long-form, high-fidelity instrumental audio. At $0.20/generation (Professional plan) it is more expensive than MusicGen on Replicate (~$0.042) but produces significantly longer and higher-quality output. For vocals and lyrics, Suno v4 is the better choice. For open- weights self-hosting, MusicGen Large allows cost elimination at scale.

Use cases

Limitations

Citations