EfficientGoogle

Gemini 2.0 Flash

Context

1M tokens

Pricing

$0.10/M input, $0.40/M output

Modalities

text, image, audio, video, code

Released

Dec 2024

Overview: Google's fast, cost-efficient multimodal model from the Gemini 2.0 generation. Gemini 2.0 Flash processes text, images, audio, and video at high speed, optimized for latency-sensitive applications.
Why it matters: Flash 2.0 occupies the critical 'good enough and fast' tier that captures the majority of real-world API calls. Its native multimodal support — including video understanding — at efficient pricing makes it compelling for building features that would require multiple specialized models from other vendors. Google's distribution advantage through Vertex AI and Android integration means Flash often wins on total cost of ownership when teams are already in the Google Cloud ecosystem. Speed-sensitive applications like real-time agents and live video analysis are its sweet spot.

Key strengths