Google Cloud TTS
Available google-cloud/tts · by Google · neural-tts
Pricing — 1 offering(s)
Google Cloud
Standard voices
- $4.00 / 1M characters Current 2018-03-01 → present
WaveNet / Neural2 voices
- $16.00 / 1M characters Current 2018-03-01 → present
Studio voices
- $160.00 / 1M characters Current 2022-01-01 → present
Showing the active price and any recorded history. Full pricing history is available via the paid API — see API docs.
Capability profile
voice naturalness strong
language support strong
voice variety strong
streaming latency moderate
cloning support weak
Benchmarks
| Benchmark | Score | Config | Source |
|---|---|---|---|
| MOS (naturalness) | — | Neural2 voices score highly in Google's own published evaluations. No independent MOS published against competitors. | — |
| WaveNet original MOS (published) | 4.21 MOS (1–5) | Original WaveNet paper MOS on English, not directly comparable to current Neural2 quality. | source ↗ |
Operator guidance
Best for teams already on Google Cloud who want tight ecosystem integration and a large per-month free tier (1M WaveNet/Neural2 chars/month). At $16/1M chars (WaveNet/Neural2) it is price-identical to Azure and Polly Neural. Choose ElevenLabs for naturalness, Cartesia Sonic for latency- critical conversational AI, or OpenAI TTS-1 for the lowest price.
Use cases
- Application TTS for users already in the Google Cloud ecosystem
- Multi-language content production across 40+ languages
- Professional narration and content with Studio voices
- SSML-heavy workflows requiring pitch, speed, and break control
Limitations
- No voice cloning (Custom Voice is a contracted enterprise offering)
- REST API; not purpose-built for real-time streaming TTS
- Studio voices at $160/1M chars are significantly more expensive than Neural2