Just a week after publicizing the news of Gemini 2.5, Google has launched the preliminary version of Gemini 2.5 Flashmarking a milestone in its line of artificial intelligence models. This new model, announced during the recent Cloud Next event and already accessible through Gemini’s API in Google Ai Studio and VerTex AI is distinguished by being the first completely hybrid reasoning model developed by the company.
The central innovation presented by Gemini 2.5 Flash is what Google calls a “thought budget”, a functionality that allows developers to exercise granular control over the amount of computational capacity that the model dedicates to processing and reasoning on complex problems before generating an answer. The purpose behind this mechanism is to offer a solution that balances the quality of the responses with efficiency in terms of cost and latencyfundamental aspects for the implementation of AI in business environments.
According to Google, Gemini 2.5 Flash not only improves reasoning capabilities compared to its predecessor, Gemini 2.0 Flash, but does it without incurring limitations that usually associate with greater models. The company thus emphasizes its excellent performance-cost relationship, since the flexibility to adjust the level of “thought” allows Optimize the model for a wide range of tasksfrom simple consultations that require minimal deliberation to complex problems that demand a deep analysis. The idea is that users pay only for the level of processing they need.
Although it is a test phase version, Google positions Gemini 2.5 Flash as a relevant competitor in the saturated AI market. The first evaluations suggest that it offers notable value and speed compared to other available options, although its launch is part of a broader Google strategy to strengthen its position in the field of artificial intelligence, seeking to capture both the community of developers and companies and end users through the offer of more adaptable and economically efficient tools.
The availability of Gemini 2.5 Flash in the application of consumption, labeled as “experimental”, also suggests an interest in collecting feedback On a large scale. Yet, Gemini 2.0 Flash remains in force as a default use option And recommended for most users, and possibly continue to be for a while, since together with version 2.5 Pro, also in an experimental phase, it is the only stable without advanced reasoning capacity, which also has its advantage.