Gemini 2.5 Flash
Available google-deepmind/gemini-2.5-flash · by Google DeepMind · decoder-only-transformer
Pricing — 1 offering(s)
Google DeepMind
Input tokens
- $0.30 / 1M tokens (input) Current 2026-06-12 → present
Output tokens
- $2.50 / 1M tokens (output) Current 2026-06-12 → present
Showing the active price and any recorded history. Full pricing history is available via the paid API; see the API docs.
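As a rough illustration of how the listed rates turn into a per-request cost, the following is a minimal sketch, not an official billing calculator. It assumes every input token is billed at the flat $0.30 / 1M rate shown above, so it does not cover audio input, which this listing says is billed higher. The function and constant names are hypothetical and chosen only for this example.

```python
from decimal import Decimal

# Rates copied from the pricing listing above (USD per 1M tokens).
# Assumption: all input is billed at the flat listed input rate.
INPUT_USD_PER_MTOK = Decimal("0.30")
OUTPUT_USD_PER_MTOK = Decimal("2.50")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MTOK
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    # Rates are quoted per 1M tokens, so scale the total down once at the end.
    return (input_cost + output_cost) / TOKENS_PER_MTOK


if __name__ == "__main__":
    # A request with 200,000 input tokens and 4,000 output tokens.
    print(estimate_cost_usd(200_000, 4_000))
```

Decimal is used rather than float so that small per-request amounts add up without binary rounding drift when summed over many requests.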
Capability profile
- Reasoning: moderate
- Coding: moderate
- Tool use: strong
- Instruction following: strong
- Context window: strong
- Multilingual: strong
- Speed: strong
Operator guidance
The cost- and latency-optimized option in the Gemini 2.5 line, with a 1M-token context window. Step up to Gemini 2.5 Pro for the hardest reasoning and coding tasks.
Use cases
- High-throughput, low-cost tasks needing a large context window
- Multimodal input at scale where latency/cost matter
Limitations
- Lower reasoning/coding ceiling than Gemini 2.5 Pro
- Audio input billed higher than text/image/video input
- Capability ratings are qualitative, not a single cited benchmark run