Google Cloud STT Chirp 2
Available google-cloud/chirp-2 · by Google · end-to-end-asr
Pricing — 1 offering(s)
Google Cloud
Chirp 2 (enhanced accuracy)
- $0.012 / minute Current 2024-03-01 → present
Standard (previous-gen)
- $0.0060 / minute Current 2016-03-01 → present
Showing the active price and any recorded history. Full pricing history is available via the paid API — see API docs.
Capability profile
transcription accuracy strong
language support strong
speaker diarisation strong
real time streaming strong
custom vocabulary moderate
Benchmarks
| Benchmark | Score | Config | Source |
|---|---|---|---|
| WER (LibriSpeech) | — | Google does not publish LibriSpeech WER for Chirp 2 specifically. Internal benchmarks show improvements over Chirp 1 on diverse audio. | — |
Operator guidance
Best for teams in the Google Cloud ecosystem needing broad language coverage (100+ languages). At $0.012/min ($0.72/hr) it is cheaper than Azure ($1.00/hr) and Amazon Transcribe ($1.44/hr) but more expensive than Deepgram Nova-3 ($0.26/hr batch) and Universal-2 ($0.15/hr). The free tier (60 minutes/month) is useful for development. Choose Deepgram for the best streaming latency; Universal-2 for the lowest batch cost.
Use cases
- Multi-language transcription at scale via Google Cloud Platform
- Real-time transcription for 100+ language coverage
- Meeting and broadcast transcription with speaker diarisation
- Workflows already in the Google Cloud ecosystem
Limitations
- More expensive per minute than Deepgram and AssemblyAI for batch workloads
- Speech adaptation (custom vocabulary) is less flexible than Deepgram Keywords API
- gRPC streaming API has more setup complexity than REST