Gemini 3 Flash is Google DeepMind's fast, balanced mid-tier model, released December 17, 2025 — delivering frontier-level reasoning (90.4% on GPQA Diamond, 33.7% on Humanity's Last Exam without tools) at Flash-tier speed and cost with a 1M+ token context window.
Gemini 3 Flash is Google DeepMind's fast, balanced model — released December 17, 2025, it replaced Gemini 2.5 Flash as the recommended mid-tier option. What makes Gemini 3 Flash notable is its benchmark performance: it achieves frontier-level results on PhD-level reasoning tests (GPQA Diamond: 90.4%) and Humanity's Last Exam (33.7% without tools) while maintaining the low latency and cost efficiency expected of a Flash-tier model. This positions it as arguably the best price/performance option in the Gemini lineup for most real-world applications.
If Gemini 3 Pro is Google's answer to the hardest reasoning problems, Gemini 3 Flash is Google's answer for everything else — capable enough for genuine intellectual work, fast enough for production-scale deployment, and priced accordingly.
gemini-3-flash (check Google AI docs for versioned string)PhD-Level Reasoning at Flash Speed: 90.4% on GPQA Diamond — a benchmark testing graduate-level reasoning across biology, chemistry, and physics — while retaining the latency profile of a Flash-tier model. This is a striking combination.
Humanity's Last Exam: 33.7% without tools on HLE, a benchmark designed to be extremely difficult for AI systems. Scores here compete with larger frontier models.
1M+ Context: Same full context window as Gemini 3 Pro, enabling large-document and codebase analysis without the cost premium.
Native Multimodality: Inherits full Gemini 3 multimodal capability — reasoning across text, images, video, and audio at Flash-tier cost and latency.
Agentic Tasks: Strong performance in tool use and multi-step agentic workflows, making it a practical choice for production AI pipelines where Gemini 3 Pro's cost isn't warranted.
For the most demanding reasoning tasks — particularly those requiring the depth of ARC-AGI-2 level novel reasoning — Gemini 3 Pro or 3.1 Pro will outperform it. Flash models inherently trade some depth for speed.
February 26, 2026