Gemini 2.5 Flash

Google DeepMind, 🇺🇸 United States
Status: active

Google's fastest thinking model, with a configurable reasoning budget.

Context window: 1000K tokens
Input / 1M tokens: $0.15
Output / 1M tokens: $0.60
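
As a rough illustration of how the per-million-token prices above turn into a per-request cost, here is a minimal sketch. The function name and the example token counts are hypothetical. The sketch does not model thinking tokens separately, since this page does not say how they are billed.

```python
# Prices as listed on this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Assumption: cost is linear in tokens, with no tiering, caching
    discounts, or separate thinking-token rate.
    """
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 20,000 input tokens, 2,000 output tokens.
    print(f"${estimate_cost_usd(20_000, 2_000):.4f}")
```

Under these assumptions, a request with 1M input tokens and 1M output tokens would cost about $0.75.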

Version History

gemini-2.5-flash-preview-11-05 (minor)

Gemini 2.5 Flash November preview with configurable thinking budget improvements and better cost efficiency at higher thinking token counts.

gemini-2.5-flash-preview-05-20 (major)

Gemini 2.5 Flash enters preview with a configurable thinking budget. Fastest Google model with reasoning, at an input cost under $1 per 1M tokens.

Benchmark Scores

Speed: 198.0 tok/s
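
To give a feel for what the listed speed means in practice, the following sketch converts an output length into an approximate generation time. The function name is hypothetical. It assumes the listed figure is a steady output rate and ignores network latency, time to first token, and any time spent on thinking tokens.

```python
# Output speed as listed on this page, in tokens per second.
TOKENS_PER_SECOND = 198.0


def estimate_generation_seconds(output_tokens: int) -> float:
    """Approximate streaming time for a response of the given length.

    Assumption: constant throughput at the listed rate, with no
    startup latency.
    """
    return output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Hypothetical 1,000-token response.
    print(f"{estimate_generation_seconds(1_000):.1f} s")
```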