Gemini 2.0 Flash

Google DeepMind🇺🇸 United States

active

Google's fastest production model with 1M context and native multimodal I/O.

Compare with other models →

Context window1000K tokens

Input / 1M tokens$0.1

Output / 1M tokens$0.4

Version History

gemini-2.0-flash-001majorFebruary 5, 2025

GA release with native image generation, live audio, and 1M token context.

Benchmark Scores

Full leaderboard →

228.0 tokens_per_sec

Speed (tok/s)