With Google I/O 2026 set for May 19 and 20 at Shoreline Amphitheater, the cadence of Gemini signals has picked up sharply, and the latest cluster all points back to the Flash tier. Three developments are converging at once. On LM Arena, an anonymous Gemini Flash candidate has surfaced for evaluation, and early head-to-head impressions suggest it trades blows with Gemini 3.1 Pro, the company’s current frontier. If those readings hold up, Google would be on the verge of folding flagship-grade reasoning into a class built for cost-efficient, high-volume traffic, a meaningful step up for developers who previously had to choose between speed and depth.
Google may be testing a new Gemini 3 Flash model on the Arena right now
— AiBattle (@AiBattle_) May 2, 2026
The model seems to be showing up on Codearena unusually often, which is usually a good indication that they are testing a new model
When DeepSeek tested V4, the model showed up very frequently in the text… https://t.co/B4ZRBUzdpA pic.twitter.com/jR5IGWkUmw
The second signal comes from Vertex AI, where customers still on Gemini 2 Flash have begun receiving deprecation notices nudging them to migrate to Gemini 3 Flash or 3.1 Flash-Lite. The same notice references a forthcoming GA release, language consistent with Google’s habit of clearing the runway for a stable successor before announcing it.
GOOGLE 🚨: A new Gemini Flash model has been spotted on LM Arena.
— TestingCatalog News 🗞 (@testingcatalog) May 2, 2026
Besides that, Vertex AI customers who still use Gemini Flash 2 received an email that it will be distributed soon.
> Transition to Gemini 3.1 Flash Lite - Generaly Available soon!
Soon 🔜
h/t @hishtadlut https://t.co/aKSrdOVTxG pic.twitter.com/025gv13kLG
Rounding out the picture, a handful of Gemini app users briefly saw a “Flash 3.2” entry appear in the model selector before it was pulled, a surfacing that typically precedes a controlled rollout by days or weeks rather than months.
Gemini 3.2 Flash Spotted pic.twitter.com/aPXYHX2zrN
— Just a dragon (@Waguri_Kaoruko8) May 5, 2026
For the Gemini app audience, this would translate into faster, sharper default responses without requiring a Pro tier. For Vertex and AI Studio developers, it sets up a clean migration path off the 2.x family ahead of the formal retirement window. Whether the GA arrives quietly via a Vertex release note in the days before the keynote, or gets folded into the I/O stage moment alongside a possible Gemini 3.5 reveal, mirroring last year’s 2.5 pattern, remains the open question.
Google has not commented publicly on the Arena listing or the model selector flicker, but the pattern is familiar: Vertex AI notices, feature-flag breadcrumbs, and Arena testing tend to converge into an announcement, and the window between now and I/O is narrow.