Gemini 2.5 Flash-Lite is now ready for scaled production use
Gemini 2.5 Flash-Lite, previously in preview, is now stable and generally available. This cost-efficient model provides high quality in a small size, and includes 2.5 family features like a 1 million-token context window and multimodality.
Explore Gemini 2.5 Flash-Lite, Google's stable and generally available model offering incredible speed, cost-efficiency, high quality, and 2.5 family features.
Today, we’re releasing the stable version of Gemini 2.5 Flash-Lite, our fastest and lowest cost ($0.10 input per 1M, $0.40 output per 1M) model in the Gemini 2.5 model family. We built 2.5 Flash-Lite to push the frontier of intelligence per dollar, with native reasoning capabilities that can be optionally toggled on for more demanding use cases. Building on the momentum of 2.5 Pro and 2.5 Flash, this model rounds out our set of 2.5 models that are ready for scaled production use.
Our most cost-efficient and fastest 2.5 model yet
Disclaimer: This content has been automatically aggregated from GOOGLE DEEPMIND for informational purposes. To read the original article, please visit GOOGLE DEEPMIND.
Home

