The Information Machine

Gemini 3.5 Flash: Release, Pricing, and Tooling · history

Version 3

2026-05-22 20:07 UTC · 69 items

What

Google launched Gemini 3.5 Flash at Google I/O 2026 on May 19, priced at $1.50/million input tokens and $9/million output tokens — 3x the cost of Gemini 3 Flash Preview and 6x Gemini 3.1 Flash-Lite [10][11]. The model replaced Gemini 3 as the default on Google's web interface and now powers Google Search AI Mode [7], while Google's official launch blog positioned it under the tagline 'frontier intelligence with action' [1]. Community and developer reaction to the pricing has been sharply negative, with one observer characterizing it as the worst negative feedback they had ever seen for a software release [20], while early benchmark data rates it a clear capability leader on the intelligence-vs-speed Pareto frontier [14].

Why it matters

Gemini 3.5 Flash breaks the developer expectation that 'Flash' means affordable — its API price nearly matches Gemini 3.1 Pro while being deployed for free in consumer products, creating a structurally confusing cost signal. The breadth and speed of the pricing backlash suggests this repricing is not being absorbed quietly, and continued migration-focused coverage [13] indicates developers are actively weighing alternatives rather than simply accepting the new tier.

Open questions

  • Do the capability improvements over Gemini 3 Flash justify a 3x price increase, or are API customers now paying Pro prices for historically Flash-tier workloads? [15][14]

  • With Gemini 3.5 Flash now the default on Google's free web interface [7] and featured in updated Google AI subscription tiers [8], how is Google accounting for the cost differential between its consumer and API revenue lines?

  • Will the strongly negative community response to pricing [20][17][18] prompt Google to introduce lower-cost variants, promotional pricing, or context-window-based tiers?

  • When will Gemini 3.5 Pro arrive? It was rumored ahead of I/O and remains unannounced [23][24].

Narrative

Google announced Gemini 3.5 Flash at Google I/O 2026 on May 19, marketing it under the tagline 'frontier intelligence with action' [1][2]. The model was spotted in the Google Cloud Console hours before the official announcement [3][4][5], alongside a companion model called Gemini Omni Flash [6]. Upon launch, Google confirmed that Gemini 3.5 Flash had replaced Gemini 3 as the default model on its web interface, that it was powering AI Mode in Google Search [7], and that it featured in updated Google AI subscription offerings announced at I/O [8]. The Gemini API changelog [9] and Google's official model blog [1] formalized its release documentation.

The pricing landed at $1.50 per million input tokens and $9 per million output tokens — 3x the cost of Gemini 3 Flash Preview and 6x Gemini 3.1 Flash-Lite [10][11]. One analysis noted the output token price is nearly 30x more expensive than the previous cheapest Flash-tier option [12]. An independent benchmark run illustrated real-world cost impact: the Artificial Analysis benchmark cost $1,551.60 on Gemini 3.5 Flash (high), compared to $892.28 for Gemini 3.1 Pro Preview [10]. A dedicated pricing migration comparison [13] confirmed developers are actively researching the cost jump and their options. These figures nearly erase the price gap between Google's mid-tier and pro offerings, fundamentally undermining the traditional cost/capability segmentation that the 'Flash' label had previously signaled to developers.

On the capability side, early signals are more favorable. Artificial Analysis rated Gemini 3.5 Flash as the 'clear leader on the Intelligence vs Speed Pareto frontier' with large gains on its benchmarks [14]. An early-access user described first impressions as feeling 'on par with Gemini 3.1 Pro Preview' [15], and a French-language commentator noted it was 'more performant and less expensive than the current Pro version' [16]. These assessments suggest the model does represent a genuine capability step up from prior Flash offerings — but the question of whether that justifies prices matching the Pro tier remains contested.

The developer community's pricing reaction was swift and unusually intense. Across social media and technical forums, posts described the pricing as 'painful' [17], announced 'the good times are over' [18], and declared outright that 'Gemini 3.5 Flash is EXPENSIVE' [19]. One observer stated they had 'never seen such negative feedback in a software release before' when characterizing the response [20]. Analyst Simon Willison contextualized this as part of a broader industry repricing trend, noting that Google, OpenAI, and Anthropic all appear to be simultaneously testing API customer price tolerance — suggesting the aggressive discounting of early LLM competition may be giving way to margin recapture across the industry [10]. On the tooling side, the llm CLI plugin for Gemini added gemini-3.5-flash support in version 0.32 [21], and a concurrent alpha (0.32a0) introduced streaming reasoning token support [22].

Timeline

  • 2026-05-17: Gemini 3.2 Flash-Lite-Live spotted in Google Cloud Console, signaling imminent I/O launches [28]
  • 2026-05-19: Gemini 3.5 Flash and Gemini Omni Flash spotted in Google Cloud Console hours before official Google I/O announcement [29][3][4][6]
  • 2026-05-19: Gemini 3.5 Flash officially launched at Google I/O 2026; priced at $1.50/M input, $9/M output — 3x cost of Gemini 3 Flash Preview; deployed as default on web interface and Google Search AI Mode; featured in updated Google AI subscription tiers [10][11][2][7][30][1][8][9][31]
  • 2026-05-19: llm-gemini 0.32 released, adding the gemini-3.5-flash model identifier to the llm CLI plugin [21]
  • 2026-05-19: llm-gemini 0.32a0 alpha released, adding streaming reasoning token support for Gemini models [22]
  • 2026-05-19: Artificial Analysis benchmarks rate Gemini 3.5 Flash as 'clear leader on the Intelligence vs Speed Pareto frontier'; early-access user impressions put it on par with Gemini 3.1 Pro Preview [14][15]
  • 2026-05-19: Developer community pricing backlash erupts across social media; posts describe pricing as 'painful', 'expensive', and mark the end of cheap Flash-tier access; migration and alternatives comparisons begin circulating [26][17][18][27][12][19][13]
  • 2026-05-21: Community observer characterizes pricing reaction as worst negative feedback ever seen for a software release [20]

Perspectives

Simon Willison

Analytically skeptical of the price increases; documents the cost jump with benchmark data and frames it as part of an industry-wide repricing trend affecting all three major labs. Notes the irony of Google deploying a pricier model in free consumer products while charging API customers near-Pro rates.

Evolution: Consistent across all posts; the pricing analysis carries the analytical weight while the llm-gemini release notes are minimal changelog entries.

Artificial Analysis

Positive on capability: rates Gemini 3.5 Flash as the clear leader on the intelligence-vs-speed Pareto frontier with large benchmark gains, though their own cost data ($1,551.60 per benchmark run) illustrates the price burden.

Evolution: Consistent; provides the most structured capability endorsement of any observer.

Early-access user (Prompt/@engineerrprompt)

Positive on capability: first impressions describe Gemini 3.5 Flash as feeling on par with Gemini 3.1 Pro Preview, implicitly validating the capability-tier collapse.

Evolution: Consistent.

Google

Positions Gemini 3.5 Flash as a flagship efficiency model with frontier-level intelligence, deploying it as the default in consumer products, Google Search AI Mode, and updated subscription tiers — with no public acknowledgment of developer pricing concerns.

Evolution: Official launch materials [1][2][8] add subscription context not in the prior synthesis, but Google's posture remains promotional with no pricing concessions announced.

Developer community (broad)

Strongly negative on pricing; characterize the 3x cost increase as a betrayal of the Flash-tier value proposition, with phrases like 'painful', 'the good times are over', and 'EXPENSIVE' dominating reaction posts. Migration and alternative routing research is actively underway.

Evolution: Consistent with prior pass; follow-up coverage and dedicated pricing migration comparisons [13] confirm the reaction is not dissipating.

Tensions

  • Artificial Analysis and early-access users rate Gemini 3.5 Flash as a genuine capability leader matching Gemini 3.1 Pro [14][15], while the developer community broadly rejects the pricing as unjustifiable for a Flash-tier model [17][18][27] — a standoff between benchmark performance and cost tolerance. [14][15][17][18][27]
  • Google positions Gemini 3.5 Flash as a Flash-tier efficiency model, deploys it free in consumer products and updated subscription tiers [7][8], and promotes it as frontier intelligence [1] — while its API pricing nearly matches Gemini 3.1 Pro, creating an unresolved tension between the 'Flash' branding promise and the actual cost burden placed on API customers [10][11]. [10][11][7][1][8]

Sources

  1. [1] Gemini 3.5: frontier intelligence with action - Google Blog — reactive:google-io-gemini-launch
  2. [2] Google shipped Gemini 3.5 Flash at I/O 2026. Marketing says "frontier intelligence with action." — reactive:gemini-35-flash-release (2026-05-20)
  3. [3] Gemini 3.5 Flash is now visible within the Google Cloud Console, preceding its official announcement at Google I/O. It a... — reactive:google-io-2026-launch-blitz (2026-05-19)
  4. [4] 🧵 Gemini 3.5 Flash Appears in Google Cloud Console (May 19, 2026) — reactive:gemini-35-flash-release (2026-05-19)
  5. [5] Gemini 3.5 Flash is now in the Google cloud console. It’s shows pricing currently at 3x the cost of the Gemini 3 Flash m... — reactive:gemini-35-flash-release (2026-05-19)
  6. [6] Google DeepMind have released Gemini Omni Flash and Gemini 3.5 Flash — reactive:google-io-2026-launch-blitz (2026-05-19)
  7. [7] Gemini 3.5 Flash is officially live, replacing Gemini 3 as the default option on the web interface and powering Google S... — reactive:gemini-35-flash-release (2026-05-20)
  8. [8] Everything new in our Google AI subscriptions, fresh from I/O 2026 — reactive:google-io-gemini-launch
  9. [9] Release notes | Gemini API - Google AI for Developers — reactive:google-io-gemini-launch
  10. [10] Gemini 3.5 Flash: more expensive, but Google plan to use it for everything — Simon Willison (2026-05-19)
  11. [11] 3.5 Flash pricing: $1.50/million input tokens, $9/million output. That's 3x Gemini 3 Flash Preview and 6x the 3.1 Flash-... — reactive:gemini-35-flash-release (2026-05-20)
  12. [12] Gemini 3.5 Flash now costs $9 per million output tokens , 3x the price of Gemini 3 Flash and nearly 30x more than Gemini... — reactive:gemini-35-flash-release (2026-05-19)
  13. [13] Gemini 3.5 Flash vs Gemini 3 Flash Preview: Pricing & Migration ... — reactive:gemini-35-flash-release
  14. [14] Google’s new Gemini 3.5 Flash is the clear leader on the Intelligence vs Speed Pareto frontier and makes large gains on ... — reactive:google-io-2026-launch-blitz (2026-05-19)
  15. [15] Gemini 3.5 Flash, first impressions. I had early access to the model, and it feels on par with Gemini 3.1 Pro Preview. T... — reactive:google-io-2026-launch-blitz (2026-05-19)
  16. [16] Release de Gemini 3.5 Flash : plus performant et moins cher que la version Pro actuelle. Maintenant, on attend surtout G... — reactive:google-io-2026-launch-blitz (2026-05-19)
  17. [17] gemini 3.5 flash pricing is painful. — reactive:gemini-35-flash-release (2026-05-19)
  18. [18] Gemini 3.5 Flash pricing has changed - the good times are over! It’s now 3x more expensive than Gemini 3 Flash. https://... — reactive:gemini-35-flash-release (2026-05-19)
  19. [19] Gemini 3.5 Flash is EXPENSIVE. — reactive:gemini-35-flash-release (2026-05-19)
  20. [20] @andyzhang @antigravity I never saw such a negative feedback in a software release before. Replacing Gemini 3 Flash quot... — reactive:google-io-2026-launch-blitz (2026-05-21)
  21. [21] llm-gemini 0.32 — Simon Willison (2026-05-19)
  22. [22] llm-gemini 0.32a0 — Simon Willison (2026-05-19)
  23. [23] Gemini 3.5 Pro rumor just got spicy 🔥 — reactive:gemini-35-flash-release (2026-05-19)
  24. [24] Gemini 3.5 Pro watch: Google's official Gemini API model page lists Gemini 3.1 Pro Preview and Gemini 3 Flash, but no ge... — reactive:gemini-35-flash-release (2026-05-19)
  25. [25] Gemini 3.5 Flash: more expensive, but Google plan to use it for everything — reactive:gemini-35-flash-release
  26. [26] Gemini 3.5 Flash Pricing: 😢 — reactive:gemini-35-flash-release (2026-05-19)
  27. [27] GEMINI 3.5 FLASH MIGHT NOT BE CHEAP AT ALL — reactive:gemini-35-flash-release (2026-05-19)
  28. [28] UPDATE: Gemini 3.2 Flash-Lite-Live just spotted in the Google Cloud Console. — reactive:google-io-2026-launch-blitz (2026-05-17)
  29. [29] Gemini 3.5 Flash (In Console) and Gemini Omni are nearing release, with only two hours remaining until Google I/O. https... — reactive:google-io-2026-launch-blitz (2026-05-19)
  30. [30] Google’s official pricing page has been updated to include Gemini 3.5 Flash. — reactive:gemini-35-flash-release (2026-05-19)
  31. [31] Google launches Gemini 3.5 Flash. How to try it for free. | Mashable — reactive:google-io-gemini-launch