Gemini 2.0 Flash
Google DeepMind, 🇺🇸 United States
Google's fastest production model with 1M context and native multimodal I/O.
Context window: 1000K tokens
Input / 1M tokens: $0.1
Output / 1M tokens: $0.4
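As a rough illustration of how the per-token prices listed here translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost` and the example token counts are hypothetical, chosen only for illustration; actual billing may differ (for example by modality or tier), so treat this as back-of-the-envelope arithmetic rather than a billing reference.

```python
# Prices copied from the listing (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.40


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts.

    Hypothetical helper: assumes flat per-token pricing with no
    discounts, caching, or modality-specific rates.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 200K-token prompt with a 2K-token reply (made-up sizes).
cost = estimate_cost(200_000, 2_000)
print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, output tokens cost four times as much as input tokens, so long generations dominate the bill more quickly than long prompts.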
Version History
gemini-2.0-flash-001 (major)
GA release with native image generation, live audio, and 1M token context.
Benchmark Scores
Speed (tok/s): 228.0