Gemini 2.5 Flash-Lite is now ready for scaled production use

Gemini 2.5 Flash-Lite, previously in preview, is now stable and generally available. This cost-efficient model delivers high quality for its small size and includes 2.5 family features such as a 1 million-token context window and multimodality.

15 April 2026 07:37 AM IST

Today, we’re releasing the stable version of Gemini 2.5 Flash-Lite, our fastest and lowest-cost model in the Gemini 2.5 model family ($0.10 per 1M input tokens, $0.40 per 1M output tokens). We built 2.5 Flash-Lite to push the frontier of intelligence per dollar, with native reasoning capabilities that can optionally be toggled on for more demanding use cases. Building on the momentum of 2.5 Pro and 2.5 Flash, this model rounds out our set of 2.5 models that are ready for scaled production use.
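To make the quoted pricing concrete, here is a minimal sketch of a per-request cost estimate. It assumes only the two list prices stated above; the helper name `estimate_cost` and the sample token counts are hypothetical, and the sketch ignores any other billing details that may apply in practice.

```python
from decimal import Decimal

# Prices quoted in the announcement, in USD per 1 million tokens.
INPUT_PRICE_PER_M = Decimal("0.10")
OUTPUT_PRICE_PER_M = Decimal("0.40")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Uses Decimal rather than float so that small per-token amounts
    do not pick up binary rounding error.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Illustrative request: a 200,000-token prompt and a 5,000-token reply.
    cost = estimate_cost(200_000, 5_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, a 200,000-token prompt with a 5,000-token reply comes to about $0.022, with input accounting for $0.020 of that.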

Our most cost-efficient and fastest 2.5 model yet

Disclaimer: This content has been automatically aggregated from GOOGLE DEEPMIND for informational purposes. To read the original article, please visit GOOGLE DEEPMIND.
