Gemini 2.5 Flash: Pricing, Context Window & Benchmarks
by Google
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).
What you can do with Gemini 2.5 Flash
Everyday Q&A and clear explanations
Writing help (emails, posts, summaries)
Idea generation and brainstorming
Learning support with step-by-step guidance
Composite Indices
Intelligence, Coding, Math
Standard Benchmarks
Academic and industry benchmarks
Benchmark Highlights
| Metric | Value |
|---|---|
| Provider | |
| Context Window | 1,048,576 tokens |
| Input Price | $0.30/1M tokens |
| Output Price | $2.50/1M tokens |
| Release Date | May 20, 2025 |
| Modalities | file, image, text, audio, video |
| Capabilities | Vision, Audio Input |
Compare Gemini 2.5 Flash to other models
See how it stacks up on price, quality, and overall performance.
Frequently asked questions
What is Gemini 2.5 Flash good for?
Use Gemini 2.5 Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Gemini 2.5 Flash cost?
Pricing is based on usage. Current rates are $0.30/1M tokens for input and $2.50/1M tokens for output.
Can I try Gemini 2.5 Flash for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Gemini 2.5 Flash support images or audio?
Gemini 2.5 Flash can understand images.
Suggested comparisons
Similar models
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.
Compare Models
Select a model to compare with Gemini 2.5 Flash