ElevenLabs Multilingual v2
Available elevenlabs/multilingual-v2 · by ElevenLabs · neural-tts
Pricing: 1 offering
ElevenLabs
Text characters
- $120.00 / 1M characters (current; 2023-10-01 → present)
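As a rough illustration of what the listed rate means in practice, the sketch below converts a character count into an estimated cost. The function name `estimate_cost_usd` is hypothetical, and the $120.00 / 1M characters figure is this page's estimate rather than a contractual price, so treat the output as a ballpark only.

```python
from decimal import Decimal

# Listed rate from this page; an estimate derived from plan effective rates,
# not a guaranteed per-character API price.
USD_PER_MILLION_CHARS = Decimal("120.00")


def estimate_cost_usd(
    char_count: int,
    rate_per_million: Decimal = USD_PER_MILLION_CHARS,
) -> Decimal:
    """Return the estimated synthesis cost in USD, rounded to cents."""
    if char_count < 0:
        raise ValueError("char_count must be non-negative")
    cost = Decimal(char_count) * rate_per_million / Decimal(1_000_000)
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    # A hypothetical 500,000-character audiobook manuscript.
    print(estimate_cost_usd(500_000))
```

Passing a different `rate_per_million` makes it easy to compare against another provider's listed rate with the same character count.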
Capability profile
- Voice naturalness: strong
- Language support: strong
- Voice variety: strong
- Streaming latency: moderate
- Cloning support: strong
Benchmarks
| Benchmark | Score | Config | Source |
|---|---|---|---|
| First-byte latency (vendor-reported) | ~400 ms | Approximate and not independently verified. Significantly higher than Flash v2.5's ~75 ms. | source ↗ |
| MOS (naturalness) | — | Consistently rated best-in-class in community comparisons. No formally published MOS score. | — |
Operator guidance
The reference choice for maximum voice quality. At roughly $120 per 1M characters it is expensive, but the cost is justified for content where audio quality drives user perception. Route latency-sensitive applications to Flash v2.5 instead. For simple utility TTS at scale, OpenAI TTS-1 is about 8× cheaper.
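The routing advice above could be encoded as a small selection rule. This is a minimal sketch, not a recommended implementation: the `TtsRequest` type, the `choose_model` function, and the returned labels are all hypothetical (they are not API model identifiers), and the latency constants are the approximate vendor-reported figures quoted on this page.

```python
from dataclasses import dataclass
from typing import Optional

# Approximate, vendor-reported first-byte latencies quoted on this page.
MULTILINGUAL_V2_FIRST_BYTE_MS = 400
FLASH_V2_5_FIRST_BYTE_MS = 75


@dataclass(frozen=True)
class TtsRequest:
    """Hypothetical description of a synthesis job."""
    max_first_byte_ms: Optional[int] = None  # None means no latency constraint
    quality_critical: bool = False           # audio quality drives user perception


def choose_model(req: TtsRequest) -> str:
    """Pick a model label following the operator guidance on this page."""
    # Latency-sensitive work goes to Flash v2.5. This sketch does not handle
    # budgets tighter than Flash's own ~75 ms figure.
    if (
        req.max_first_byte_ms is not None
        and req.max_first_byte_ms < MULTILINGUAL_V2_FIRST_BYTE_MS
    ):
        return "flash-v2.5"
    # Quality-critical content justifies the higher per-character price.
    if req.quality_critical:
        return "multilingual-v2"
    # Simple utility TTS at scale: the cheaper option named in the guidance.
    return "openai-tts-1"


if __name__ == "__main__":
    print(choose_model(TtsRequest(max_first_byte_ms=200)))
    print(choose_model(TtsRequest(quality_critical=True)))
    print(choose_model(TtsRequest()))
```

The latency check runs first because a missed latency budget is a hard failure for interactive use, whereas quality and cost are trade-offs.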
Use cases
- Premium audiobooks, podcasts, and narration
- Voice cloning for consistent brand voice across content
- Dubbing and localisation workflows
- High-quality IVR systems where naturalness matters
Limitations
- Credit-based pricing with no transparent per-character API rate; actual cost depends on the plan
- Higher latency than Flash v2.5 (~400 ms vs. ~75 ms first byte, vendor-reported); not suitable for real-time streaming
- The listed price is estimated from plan effective rates and may not reflect enterprise pricing