The Information Machine

Gemini 3.5 Flash: Release, Pricing, and Tooling · history

Version 2

2026-05-21 09:05 UTC · 54 items

What

Google launched Gemini 3.5 Flash at Google I/O 2026 on May 19, priced at $1.50/million input tokens and $9/million output tokens — 3x the cost of Gemini 3 Flash Preview and 6x Gemini 3.1 Flash-Lite [7][8]. The model replaced Gemini 3 as the default on Google's web interface and now powers Google Search AI Mode [6]. Community reaction to the pricing has been sharply and broadly negative, with one developer characterizing it as the worst negative feedback they had ever seen for a software release [16]. Early benchmark data and user impressions suggest the model's performance is roughly on par with Gemini 3.1 Pro, which Artificial Analysis called it a "clear leader on the Intelligence vs Speed Pareto frontier" [11][10].

Why it matters

Gemini 3.5 Flash breaks the developer expectation that 'Flash' means affordable — its API price nearly matches Gemini 3.1 Pro while being deployed for free in consumer products, creating a structurally confusing cost signal. The breadth and speed of the pricing backlash, spanning dozens of developers within hours of release, suggests this repricing is not being absorbed quietly and may force Google to respond with tiered pricing or promotional rates.

Open questions

  • Do the capability improvements over Gemini 3 Flash justify a 3x price increase, or are API customers now paying Pro prices for historically Flash-tier workloads? [11][10]

  • With Gemini 3.5 Flash now the default on Google's free web interface [6], how is Google accounting for the cost differential between its consumer and API revenue lines?

  • Will the strongly negative community response to pricing [16][13][14] prompt Google to introduce lower-cost variants, promotional pricing, or context-window-based tiers?

  • When will Gemini 3.5 Pro arrive? It was rumored ahead of I/O and remains unannounced [19][20].

Narrative

Google announced Gemini 3.5 Flash at Google I/O 2026 on May 19, marketing it under the tagline "frontier intelligence with action" [1]. The model was spotted in the Google Cloud Console hours before the official announcement [2][3][4], alongside a companion model called Gemini Omni Flash [5]. Upon launch, Google confirmed that Gemini 3.5 Flash had replaced Gemini 3 as the default model on its web interface and that it was powering AI Mode in Google Search [6].

The pricing landed at $1.50 per million input tokens and $9 per million output tokens — 3x the cost of Gemini 3 Flash Preview and 6x Gemini 3.1 Flash-Lite [7][8]. One analysis noted the output token price is nearly 30x more expensive than the previous cheapest Flash-tier option [9]. An independent benchmark run illustrated real-world cost impact: the Artificial Analysis benchmark cost $1,551.60 on Gemini 3.5 Flash (high), compared to $892.28 for Gemini 3.1 Pro Preview [7]. These figures nearly erase the price gap between Google's mid-tier and pro offerings, fundamentally undermining the traditional cost/capability segmentation that the 'Flash' label had previously signaled to developers.

On the capability side, early signals are more favorable. Artificial Analysis rated Gemini 3.5 Flash as the "clear leader on the Intelligence vs Speed Pareto frontier" with large gains on its benchmarks [10]. An early-access user described first impressions as feeling "on par with Gemini 3.1 Pro Preview" [11], and a French-language commentator noted it was "more performant and less expensive than the current Pro version" [12]. These assessments suggest the model does represent a genuine capability step up from prior Flash offerings — but the question of whether that justifies prices matching the Pro tier remains contested.

The developer community's pricing reaction was swift and unusually intense. Across social media, posts described the pricing as "painful" [13], announced "the good times are over" [14], and declared outright that "Gemini 3.5 Flash is EXPENSIVE" [15]. One observer stated they had "never seen such negative feedback in a software release before" when characterizing the response to Gemini 3.5 Flash replacing Gemini 3 as the default [16]. Analyst Simon Willison contextualized this as part of a broader industry repricing trend, noting that Google, OpenAI, and Anthropic all appear to be simultaneously testing API customer price tolerance — suggesting the aggressive discounting of early LLM competition may be giving way to margin recapture across the industry [7]. On the tooling side, the llm CLI plugin for Gemini added gemini-3.5-flash support in version 0.32 [17], and a concurrent alpha (0.32a0) introduced streaming reasoning token support [18].

Timeline

  • 2026-05-17: Gemini 3.2 Flash-Lite-Live spotted in Google Cloud Console, signaling imminent I/O launches [23]
  • 2026-05-19: Gemini 3.5 Flash and Gemini Omni Flash spotted in Google Cloud Console hours before official Google I/O announcement [24][2][3][5]
  • 2026-05-19: Gemini 3.5 Flash officially launched at Google I/O 2026; priced at $1.50/M input, $9/M output — 3x cost of Gemini 3 Flash Preview; deployed as default on web interface and Google Search AI Mode [7][8][1][6][25]
  • 2026-05-19: llm-gemini 0.32 released, adding the gemini-3.5-flash model identifier to the llm CLI plugin [17]
  • 2026-05-19: llm-gemini 0.32a0 alpha released, adding streaming reasoning token support for Gemini models [18]
  • 2026-05-19: Artificial Analysis benchmarks rate Gemini 3.5 Flash as 'clear leader on the Intelligence vs Speed Pareto frontier'; early-access user impressions put it on par with Gemini 3.1 Pro Preview [10][11]
  • 2026-05-19: Developer community pricing backlash erupts across social media; posts describe pricing as 'painful', 'expensive', and mark the end of cheap Flash-tier access [21][13][14][22][9][15]
  • 2026-05-21: Community observer characterizes pricing reaction as worst negative feedback ever seen for a software release [16]

Perspectives

Simon Willison

Analytically skeptical of the price increases; documents the cost jump with benchmark data and frames it as part of an industry-wide repricing trend affecting all three major labs. Notes the irony of Google deploying a pricier model in free consumer products.

Evolution: Consistent across all posts; the pricing analysis carries the analytical weight while the llm-gemini release notes are minimal changelog entries.

Artificial Analysis

Positive on capability: rates Gemini 3.5 Flash as the clear leader on the intelligence-vs-speed Pareto frontier with large benchmark gains, though their own cost data ($1,551.60 per benchmark run) illustrates the price burden.

Evolution: New voice this pass; provides the most structured capability endorsement of any observer.

Early-access user (Prompt/@engineerrprompt)

Positive on capability: first impressions describe Gemini 3.5 Flash as feeling on par with Gemini 3.1 Pro Preview, implicitly validating the capability-tier collapse.

Evolution: New voice this pass.

Developer community (broad)

Strongly negative on pricing; characterize the 3x cost increase as a betrayal of the Flash-tier value proposition, with phrases like 'painful', 'the good times are over', and 'EXPENSIVE' dominating reaction posts.

Evolution: New collective voice this pass, emerging post-launch. Represents the largest volume of new items.

Tensions

  • Artificial Analysis and early-access users rate Gemini 3.5 Flash as a genuine capability leader matching Gemini 3.1 Pro [10][11], while the developer community broadly rejects the pricing as unjustifiable for a Flash-tier model [13][14][22] — a standoff between benchmark performance and cost tolerance. [10][11][13][14][22]
  • Google positions Gemini 3.5 Flash as a Flash-tier (efficiency) model and deploys it free in consumer products [6], while its API pricing nearly matches Gemini 3.1 Pro — creating an unresolved tension between the 'Flash' branding promise and the actual cost burden placed on API customers [7][8]. [7][8][6]

Sources

  1. [1] Google shipped Gemini 3.5 Flash at I/O 2026. Marketing says "frontier intelligence with action." — reactive:gemini-35-flash-release (2026-05-20)
  2. [2] Gemini 3.5 Flash is now visible within the Google Cloud Console, preceding its official announcement at Google I/O. It a... — reactive:google-io-2026-launch-blitz (2026-05-19)
  3. [3] 🧵 Gemini 3.5 Flash Appears in Google Cloud Console (May 19, 2026) — reactive:gemini-35-flash-release (2026-05-19)
  4. [4] Gemini 3.5 Flash is now in the Google cloud console. It’s shows pricing currently at 3x the cost of the Gemini 3 Flash m... — reactive:gemini-35-flash-release (2026-05-19)
  5. [5] Google DeepMind have released Gemini Omni Flash and Gemini 3.5 Flash — reactive:google-io-2026-launch-blitz (2026-05-19)
  6. [6] Gemini 3.5 Flash is officially live, replacing Gemini 3 as the default option on the web interface and powering Google S... — reactive:gemini-35-flash-release (2026-05-20)
  7. [7] Gemini 3.5 Flash: more expensive, but Google plan to use it for everything — Simon Willison (2026-05-19)
  8. [8] 3.5 Flash pricing: $1.50/million input tokens, $9/million output. That's 3x Gemini 3 Flash Preview and 6x the 3.1 Flash-... — reactive:gemini-35-flash-release (2026-05-20)
  9. [9] Gemini 3.5 Flash now costs $9 per million output tokens , 3x the price of Gemini 3 Flash and nearly 30x more than Gemini... — reactive:gemini-35-flash-release (2026-05-19)
  10. [10] Google’s new Gemini 3.5 Flash is the clear leader on the Intelligence vs Speed Pareto frontier and makes large gains on ... — reactive:google-io-2026-launch-blitz (2026-05-19)
  11. [11] Gemini 3.5 Flash, first impressions. I had early access to the model, and it feels on par with Gemini 3.1 Pro Preview. T... — reactive:google-io-2026-launch-blitz (2026-05-19)
  12. [12] Release de Gemini 3.5 Flash : plus performant et moins cher que la version Pro actuelle. Maintenant, on attend surtout G... — reactive:google-io-2026-launch-blitz (2026-05-19)
  13. [13] gemini 3.5 flash pricing is painful. — reactive:gemini-35-flash-release (2026-05-19)
  14. [14] Gemini 3.5 Flash pricing has changed - the good times are over! It’s now 3x more expensive than Gemini 3 Flash. https://... — reactive:gemini-35-flash-release (2026-05-19)
  15. [15] Gemini 3.5 Flash is EXPENSIVE. — reactive:gemini-35-flash-release (2026-05-19)
  16. [16] @andyzhang @antigravity I never saw such a negative feedback in a software release before. Replacing Gemini 3 Flash quot... — reactive:google-io-2026-launch-blitz (2026-05-21)
  17. [17] llm-gemini 0.32 — Simon Willison (2026-05-19)
  18. [18] llm-gemini 0.32a0 — Simon Willison (2026-05-19)
  19. [19] Gemini 3.5 Pro rumor just got spicy 🔥 — reactive:gemini-35-flash-release (2026-05-19)
  20. [20] Gemini 3.5 Pro watch: Google's official Gemini API model page lists Gemini 3.1 Pro Preview and Gemini 3 Flash, but no ge... — reactive:gemini-35-flash-release (2026-05-19)
  21. [21] Gemini 3.5 Flash Pricing: 😢 — reactive:gemini-35-flash-release (2026-05-19)
  22. [22] GEMINI 3.5 FLASH MIGHT NOT BE CHEAP AT ALL — reactive:gemini-35-flash-release (2026-05-19)
  23. [23] UPDATE: Gemini 3.2 Flash-Lite-Live just spotted in the Google Cloud Console. — reactive:google-io-2026-launch-blitz (2026-05-17)
  24. [24] Gemini 3.5 Flash (In Console) and Gemini Omni are nearing release, with only two hours remaining until Google I/O. https... — reactive:google-io-2026-launch-blitz (2026-05-19)
  25. [25] Google’s official pricing page has been updated to include Gemini 3.5 Flash. — reactive:gemini-35-flash-release (2026-05-19)