The Information Machine

AWS CEO: AI Compute Demand So Strong No A100 Server Has Ever Been Retired · history

Version 6

2026-04-30 22:33 UTC · 268 items

Narrative

Four synthesis cycles after AWS CEO Matt Garman's April 26 claim that no A100 server has ever been retired[1][2], the thread's most significant factual expansion this cycle is a Reuters exclusive reporting that Amazon's chips business (Trainium/Inferentia) already generates an annual revenue run rate exceeding $20 billion[3] — transforming what was previously framed as a hypothetical $50 billion externalization upside into a present-tense operational claim. Jassy's shareholder letter, now extensively documented across more than a dozen outlets, added granular supply-exhaustion data absent from earlier coverage: Trainium2 is largely sold out, Trainium3 (which began shipping at the start of 2026) is nearly fully-subscribed, and a significant portion of Trainium4 capacity — still 18 months from broad availability — has already been reserved[4]. Jassy stated that 'a new shift has started' away from Nvidia's chip dominance[5], defended spending with 'We're not going to be conservative'[6], and was characterized in TechCrunch as 'taking aim at Nvidia, Intel, Starlink' in the letter[7]. A TechCrunch exclusive tour of Amazon's Trainium lab in March documented wins with Anthropic, OpenAI, and Apple[8] — the first lab-level product validation to appear in the thread — while Digitimes published a retrospective framing it as the culmination of Amazon's 11-year custom chip journey[9].

A materially new demand vector has entered with no precedent in prior synthesis cycles: agentic AI is consuming CPUs at similar intensity to GPUs. WCCFtech reports Amazon tripled its CPU server count and still ran out of capacity as agentic AI workloads consumed all available cloud processors[10], and a Threads post from late April asserts AI chips have sold out well into 2030[11]. AWS is now described as expected to sell out all 2026 capacity, not merely legacy GPU hardware[12]. This expands the demand signal beyond the GPU-specific framing of Garman's original A100 claim into a total-compute shortage thesis. Separately, SemiAnalysis published a new analysis on how much GPU clusters really cost[13], and Yahoo Finance reported that AI compute costs have for the first time surpassed human labor costs in enterprise budgets[14] — a threshold that simultaneously strengthens the demand/ROI argument and heightens enterprise cost-structure concerns.

The GPU pricing narrative has gained critical historical context and new internal contradictions. Introl documents a GPU cloud price collapse in December 2025[15], establishing that the SemiAnalysis 40% rental price surge started from a trough, not a stable baseline. Within Silicon Data's own data corpus, two divergent findings now coexist: the April 2026 H100 Hyperscaler Index characterizes reservation-level pricing as 'in flat mode'[16], while a separate Silicon Data analysis documents a 10% H100 spot/retail price spike[17] — suggesting different dynamics between large hyperscaler reservation blocks and spot or retail-tier markets. Tomasz Tunguz independently documented GPU spot prices surging 114% over six weeks[18], confirming extreme spot-market volatility that reservation indices would not capture. On April 30 — the exact date of the Polymarket H100 prediction market deadline — Grok's AI fact-checker assessed Garman's A100 retirement claim as 'accurate on the core claim'[19], providing automated institutional validation on the day the contract resolved.

The overbuilding debate has attracted both its most heavyweight institutional bull and its sharpest mainstream skeptic voices this cycle. KKR published 'Beyond the Bubble: Why AI Infrastructure Will Compound Long after...' arguing the AI infrastructure investment case remains sound well past any near-term bubble concerns[20], representing the first major private equity firm to explicitly address the bubble thesis in this thread. Princeton's CITP blog argues GenAI may structurally break historical infrastructure mania patterns due to software monetization characteristics absent from railways or telecom[21], while Uncover Alpha argues market skepticism of cloud spending is contradicted by underlying data[22]. On the opposing side, Jeff Sica warned via Fox Business and Yahoo Finance of a 'breaking point' in hyperscaler spending[23][24], Benzinga framed the AI boom as resembling past infrastructure manias[25], and new overbuilding analogies extend beyond telecom to railways and dark fiber[26]. Three vendor surveys now provide repatriation figures — Cloudian (93%)[27], Tasrie IT Services (86%)[28], and Data Canopy (83%)[29] — lending the repatriation narrative statistical weight from multiple sources, though all three come from vendors with commercial interests in on-premise or private cloud solutions.

Timeline

  • 2025-11-03: Amazon closes a $38 billion cloud deal with OpenAI on AWS, locking a major AI lab into AWS infrastructure at multi-year scale; AWS and OpenAI announce a multi-year strategic partnership. [153][159]
  • 2025-12-01: Introl documents a GPU cloud price collapse in December 2025, establishing the trough from which the SemiAnalysis-documented 40% H100 rental surge subsequently began. [15]
  • 2026-01-05: AWS raises EC2 Capacity Block prices 15% in a uniform ML pricing adjustment; subsequent analysis notes variable real-world impact depending on reservation structure. [83][150][151][85][86][160][161]
  • 2026-01-10: NeuralRack AI publishes analysis characterizing cloud GPU rental costs as 'unsustainable' in 2026, establishing early documentation of the cost pressure thesis. [100]
  • 2026-02-01: AWS announces EC2 Capacity Blocks can now be shared across multiple accounts, easing enterprise multi-account ML infrastructure management. [162]
  • 2026-03-17: Reuters publishes exclusive: Amazon CEO Andy Jassy projects AI will double prior AWS sales projections to $600 billion by 2036. [33][34][163]
  • 2026-03-22: TechCrunch publishes exclusive tour of Amazon's Trainium lab, reporting the chip has won over Anthropic, OpenAI, and Apple — the first lab-level product validation of Trainium adoption to appear in the thread. [8]
  • 2026-04-09: Jassy's 2025 shareholder letter published. Reuters reports Amazon's chips business already has an annual revenue run rate exceeding $20 billion. Jassy states 'our chips business is on fire,' documents Trainium2 as largely sold out, Trainium3 as nearly fully-subscribed, and significant Trainium4 capacity already reserved 18 months ahead of availability. Jassy says 'a new shift has started' away from Nvidia's chip dominance and defends AI spending as 'not going to be conservative.' Amazon is also considering selling Trainium to external customers, valued at potentially $50 billion if realized. Story propagates across TechCrunch, Business Insider, Slashdot, Motley Fool, Yahoo Finance, Seeking Alpha, MSN, Financial Post, and multiple social platforms. [3][4][5][41][6][7][32][36][37][38][39][40][75][76][154][164][165][9][155][156][79][78][77][166]
  • 2026-04-23: Next Platform publishes 'Stop Measuring AI Training Costs In GPU Hours,' signaling a methodological shift in enterprise AI compute cost evaluation. [89]
  • 2026-04-26: AWS CEO Matt Garman publicly states AWS has never retired a single Nvidia A100 server and is completely sold out of A100 capacity, citing persistent demand exceeding supply even for older GPU generations. [1][2][55][167][146]
  • 2026-04-26: Garman statement rapidly amplified across X, LinkedIn, Reddit, SemiWiki, and Threads; investment commentary frames it as the definitive AI infrastructure demand signal. [56][168][169][170][171][61][62][67][172]
  • 2026-04-27: Azure VM retirement documentation consolidates: NVv4 (AMD Radeon MI25) and NVv3 (older Nvidia Tesla) series confirmed for September 30, 2026 retirement; separate 2028 retirement track documented. A100-based NDasrA100_v4 series remains active. Migration guidance published for GPU compute workloads. [42][43][44][45][46][47][48][49][50][51][52][53][54]
  • 2026-04-27: SemiAnalysis launches H100 one-year rental price index documenting nearly 40% surge over six months; data propagates to MSN, YouTube, Reddit r/NVDA_Stock, Seeking Alpha, and Polymarket prediction markets tracking H100 prices by April 30. [113][114][115][116][65][66][117][118][120][122][152][173]
  • 2026-04-27: Cloud repatriation achieves survey-level quantification: Cloudian (93%), Tasrie IT Services (86%), and Data Canopy (83%) figures circulate. SoftwareSeni counter-argues repatriation won't work specifically for AI workloads. Cloudian 93% figure propagates across StorageNewsletter, AIThority, Knox News, LinkedIn, and multiple local news outlets. [87][88][90][91][92][27][93][110][28][94][95][96][97][98][99][100][101][102][103][104][105][106][107][108][174][175][176][29][177]
  • 2026-04-27: NVIDIA Blackwell-generation GPU pricing (B200, B300, DGX systems) documented, establishing next-generation cost context. [119]
  • 2026-04-28: Silicon Data publishes April 2026 H100 Hyperscaler Index characterizing reservation-level pricing as 'in flat mode.' A separate Silicon Data analysis documents a 10% H100 spot/retail price spike — two divergent signals from the same source suggesting different dynamics at hyperscaler reservation vs. spot market tiers. Tomasz Tunguz documents GPU spot prices surging 114% over six weeks. [16][17][126][127][128][18]
  • 2026-04-28: Polymarket extends GPU price prediction contracts from H100 (April 30 deadline) to H200 prices by May 31. KKR publishes institutional bull case for AI infrastructure. MSN covers AI CAPEX ROI as 'key 2026 test for hyperscalers.' NVIDIA State of AI 2026 report published. Jeff Sica warns of a 'breaking point' over hyperscaler spending. Yahoo Finance reports AI compute costs have surpassed human labor costs in enterprise budgets. [68][69][70][71][72][73][20][74][145][23][24][14][143]
  • 2026-04-30: On the Polymarket H100 prediction market deadline date, Grok's AI fact-checker assesses Garman's A100 retirement claim as 'accurate on the core claim.' WCCFtech reports Amazon tripled its CPU server count and still ran out of capacity as agentic AI consumed all available cloud processors — extending the compute shortage narrative beyond GPUs to total infrastructure. AWS is reported as expected to sell out all 2026 capacity. [19][10][12]

Perspectives

Matt Garman, CEO of AWS

AI compute demand structurally exceeds supply across all GPU generations, including legacy hardware. AWS is completely sold out of A100 capacity and has never retired one. Demand is 'almost insatiable.' AWS is expected to sell out all 2026 capacity across compute types.

Evolution: Core stance consistent. Grok's April 30 fact-check of the A100 claim as 'accurate on the core claim' provides the first AI-system-level institutional validation of the statement on the Polymarket deadline date. The WCCFtech CPU shortage report extends the demand signal beyond GPUs without contradicting Garman's framing.

Andy Jassy, CEO of Amazon

Amazon's chips business is 'on fire,' generates an annual revenue run rate now exceeding $20 billion, and will be 'much larger than most think.' Trainium2 is largely sold out, Trainium3 nearly fully-subscribed, significant Trainium4 capacity already reserved 18 months ahead. 'A new shift has started' away from Nvidia's chip dominance. Amazon is considering external Trainium sales potentially valued at $50 billion. 'We're not going to be conservative' on AI spending.

Evolution: Most significantly upgraded perspective across the thread's full history. Previously framed as '$50B if externalized'; now anchored by Reuters' report of a $20B+ current annual revenue run rate, plus granular sold-out data across Trainium generations, a direct confrontational posture toward Nvidia, and a defense of spending against capex scrutiny.

Microsoft Azure

Azure is retiring NVv4 (AMD MI25-based) and NVv3 (older Nvidia Tesla-based) VM series by September 30, 2026. A separate 2028 retirement track covers additional older VM families. The A100-based NDasrA100_v4 series remains active. Migration guidance for GPU compute workloads has been published.

Evolution: Unchanged in substance. Migration guide documentation adds operational detail for customers but does not alter the retirement scope or the AWS-Azure A100 retirement contrast.

Investment and financial commentary (Milk Road AI, The AI Investor, LEAPTRADER, SpecialSitsNews, Barclays, InvestorPlace, Seeking Alpha, Polymarket, Motley Fool, Yahoo Finance)

The Amazon chips story has been upgraded from exploratory to bull-case framing: Motley Fool describes it as a 'hidden $50 billion business,' Yahoo Finance frames it as worth more than 82% of S&P companies, and Seeking Alpha characterizes Jassy's letter as 'a bull's dream.' GPU prediction market infrastructure has resolved one cycle (H100 April 30) and extended to H200 May 31.

Evolution: Expanded with retail-investor-targeted framing. The prior synthesis noted the $50B projection as a new named investment thesis; this cycle adds Motley Fool, Yahoo Finance, and Seeking Alpha providing accessible bull-case framing to a broader audience, and the H100 prediction market has now reached its deadline date.

Enterprise practitioners, architects, and CIOs

Three vendor surveys now quantify repatriation: Cloudian (93%), Tasrie IT Services (86%), and Data Canopy (83%). AI compute costs have reportedly surpassed human labor costs in enterprise budgets. CIOs are specifically documented moving AI workloads to colocation — a hybrid path between full cloud and on-premise — rather than executing binary repatriation.

Evolution: Amplification has intensified and diversified. The prior synthesis had two vendor surveys; this cycle adds a third (Data Canopy). The DataBank colocation documentation introduces a nuanced middle-path that the binary repatriation-vs-cloud framing does not capture and may represent the dominant enterprise response in practice.

Cloud repatriation skeptics (SoftwareSeni, Lenovo)

Cloud repatriation specifically will not work for AI workloads due to capital intensity and technical complexity. Lenovo's 2026 TCO analysis provides on-premise vs. cloud cost comparisons that qualify the repatriation economics for specific workload types.

Evolution: Marginally expanded with Lenovo's on-premise vs. cloud TCO documentation providing more detailed economic analysis than SoftwareSeni's structural argument.

GPU pricing analysts (SemiAnalysis, Silicon Data, Cast AI, Spheron, Fusion Worldwide, GMI Cloud, Introl, Tomasz Tunguz, Thunder Compute)

The GPU rental market now has three distinct historical and current price signals from different measurement approaches: SemiAnalysis's ~40% six-month H100 rental surge starting from Introl-documented December 2025 trough; Silicon Data's April 2026 hyperscaler-reservation 'flat mode' contrasted with a separate Silicon Data 10% spot/retail spike; and Tunguz's 114% spot price surge over six weeks. SemiAnalysis has also published new analysis on GPU cluster total costs. Thunder Compute provides a comprehensive April 2026 GPU rental market industry analysis.

Evolution: Most substantively updated perspective this cycle. Prior synthesis had SemiAnalysis surge vs. Silicon Data flat-mode as the key divergence. This cycle adds: Introl's December 2025 collapse establishing the surge's origin trough; Silicon Data's own internal divergence between hyperscaler-flat and spot-spike; Tunguz's independent 114% spot surge; and SemiAnalysis's GPU cluster cost analysis. The pricing picture is now substantially more complex and internally contradictory.

AI infrastructure bulls — institutional (KKR, Princeton CITP, Uncover Alpha)

KKR argues AI infrastructure will 'compound long after' any bubble concerns, framing it as a generational buildout rather than a speculative boom. Princeton's CITP argues GenAI may structurally break historical infrastructure mania patterns due to software monetization characteristics absent from railways or telecom. Uncover Alpha argues market skepticism of cloud spending is contradicted by the underlying economic data.

Evolution: New voice cluster this cycle. Previously the institutional bull case was led by NVIDIA's State of AI report and Jassy's projections. KKR's private equity framing and Princeton's academic counter-argument to the historical-analogy skeptics represent a new tier of institutional engagement with the bubble question.

AI bubble skeptics and value investors (Hacker News, Reddit, Futuriom, Latticework/MOI Global, Jeff Sica, Benzinga, Substack, LinkedIn, MSN)

Jeff Sica warns of a 'breaking point' in hyperscaler spending via Fox Business and Yahoo Finance. Benzinga frames the 2025 AI boom as resembling past infrastructure manias. New overbuilding analogies extend beyond telecom to railways and dark fiber. The AI-compute-surpasses-human-costs data point from Yahoo Finance provides a new structural cost concern.

Evolution: Skeptic framing has diversified and escalated into mainstream financial media. Prior synthesis had three telecom-bust articulations in investment commentary; this cycle adds Jeff Sica's mainstream warning, Benzinga's explicit infrastructure-mania comparison, new historical analogies (railways, dark fiber), and AI compute cost trajectory as a simultaneous bull and bear data point.

NVIDIA (institutional)

AI is driving measurable revenue growth, cost reduction, and productivity improvement across industries in 2026. The demand for GPU compute is supported by real enterprise ROI, not speculative adoption.

Evolution: Unchanged from prior synthesis.

Trade press (Data Center Dynamics, The Register, Network World, InfoQ, IT Pro, MSN, Next Platform, WCCFtech, TechCrunch, Digitimes, Reuters)

The GPU capacity story has expanded to cover total compute: WCCFtech's CPU shortage report and TechCrunch's Trainium lab tour extend coverage beyond GPU markets. Reuters' hard revenue figure ($20B+ chips run rate) elevates Amazon's chip narrative from projection to operational claim. Digitimes published a retrospective on Amazon's 11-year chip journey culminating in Trainium wins with Anthropic, OpenAI, and Apple.

Evolution: Expanded. WCCFtech, Digitimes, and Reuters are new outlets providing substantive new reporting this cycle. The CPU shortage angle (WCCFtech) and the $20B revenue run rate (Reuters) are the two most analytically significant new contributions from trade and wire press.

Tensions

  • Is the A100 demand signal evidence of durable structural AI enterprise adoption? The $20B+ current chips revenue run rate (Reuters), Jassy's $600B AWS projection, Trainium generation sell-through, TechCrunch's Trainium lab adoption tour, NVIDIA's State of AI 2026, KKR's institutional bull case, and Princeton's historical-pattern-break argument are the strongest multi-layered bull position yet. Against this: three vendor repatriation surveys (93%, 86%, 83%), Jeff Sica's 'breaking point' warnings, the compute-cost-surpasses-human-cost threshold, and multiple historical-analogy skeptic frameworks form the bear case. [2][130][131][148][57][58][132][134][33][153][36][40][3][4][5][6][27][28][145][74][20][21][8][23][14][29]
  • H100 pricing: three divergent signals now coexist. SemiAnalysis documented a ~40% surge starting from Introl's December 2025 documented price trough. Silicon Data's April 2026 Hyperscaler Index shows 'flat mode' at reservation level while a separate Silicon Data piece documents a 10% spot/retail spike from the same source. Tunguz independently documented a 114% spot surge over six weeks. Whether the surge has plateaued at hyperscaler reservation level while continuing at spot level — and which signal matters for enterprise pricing decisions — is the most empirically complex unresolved question in the thread. [118][120][16][17][15][18][122][152][129][125]
  • Cloud repatriation: three surveys, convergent numbers, compromised sources. Cloudian (93%), Tasrie IT Services (86%), and Data Canopy (83%) provide a convergence of high figures, but all three come from vendors with commercial interests in on-premise or private cloud solutions. The Cloudian 93% figure has propagated to 10+ outlets, lending media breadth without methodological validation. DataBank documents a hybrid CIO path (cloud to colocation rather than full on-prem) suggesting the binary repatriation framing may mischaracterize actual enterprise behavior. SoftwareSeni's structural argument that AI workloads specifically cannot be economically repatriated remains the primary unaddressed counter-argument. [27][28][29][110][88][90][102][103][104][105][106][107][108][109][111]
  • Amazon's chip business revenue trajectory: the $20B+ current annual run rate (Reuters) and the $50B externalization valuation represent distinct claims. The former is reported current revenue from internal AWS consumption; the latter is a projection contingent on third-party sales being executed at scale. Whether the $20B includes any external revenue, and whether the externalization can be operationalized at a scale that closes the gap to $50B, is unresolved — though Motley Fool, Yahoo Finance, and Seeking Alpha have all framed the $50B upside as an investable thesis. [3][36][37][38][39][40][4][75][76][77][78][154][155][156][79]
  • The agentic AI CPU shortage (Amazon tripled CPU servers and still ran out) is the most significant new demand claim this cycle but rests on a single WCCFtech report without multi-source corroboration. If accurate, it means the compute shortage is a total-infrastructure phenomenon rather than GPU-specific — materially strengthening the Garman demand thesis beyond its original GPU framing. [10][12][11]
  • Prediction markets: the Polymarket H100 April 30 deadline has arrived, and Grok has validated the core Garman claim on the same date. But the actual resolution outcome of the H100 price prediction contract — whether it resolved above or below the tracked benchmarks — is not documented in thread items. The H200 contract (May 31) is now the active forward-looking pricing instrument. [64][66][68][69][70][71][72][73][19]
  • GPU price escalation and enterprise cost sustainability: the compute-surpasses-human-costs threshold cuts both ways. As a bull argument, it means AI ROI justifies the spend; as a bear argument, it means enterprise cost structures are being fundamentally reshaped in ways that may not be sustainable. GPU cloud providers are reportedly losing money even as prices rise, suggesting the economics are broken at both the enterprise customer and cloud provider levels simultaneously. [118][120][157][87][88][90][27][28][145][100][14][158][142]
  • The historical analogy debate has become a three-way contest: (1) repeaters arguing AI is telecom, railways, or dark fiber redux; (2) pattern-breakers arguing GenAI's software monetization characteristics exempt it from prior mania dynamics; (3) institutional compounders (KKR) arguing the question is irrelevant because infrastructure will compound even through a bubble correction. These three positions are now all represented by named institutional or academic voices, not just anonymous analysts. [135][136][137][138][139][26][140][20][21][141][25][22]
  • As A100 and H100 demand matures and Blackwell pricing is documented, Amazon's Trainium is positioning as a third chip trajectory outside the Nvidia stack. If Amazon externalizes Trainium at scale (aided by the $20B existing run rate as proof of manufacturing maturity), the GPU supply/demand dynamic would shift: more non-Nvidia compute supply could relieve hyperscaler scarcity arguments while disrupting Nvidia's pricing power — a dynamic that Jassy's 'new shift has started' framing explicitly signals. [119][121][113][88][16][3][5][8][9][13]

Sources

  1. [1] AWS CEO Matt Garman: "Because there is so much more demand than supply, there typically still is demand for the older ch… — Rohan Paul Twitter (2026-04-26)
  2. [2] Matt Garman, CEO of AWS, Amazon's $100+ billion cloud division and what he just said is the single most important data p… — Milk Road AI Twitter (2026-04-26)
  3. [3] Amazon CEO reveals AI revenue, dismisses spending doubts in ... — reactive:aws-garman-a100-demand
  4. [4] @ajassy in Amazon's Shareholder Letter: "Our chips business is on fire, changes the economics for AWS, and will be much larger than most think." 2 AWS customers asked if they could buy ALL of Graviton instance capacity in 2026! This gives you an idea of the demand! Trainium2 has largely sold out. Trainium3, which just started shipping at the start of 2026 is nearly fully-subscribed. A significant chunk of Trainium4, which is about 18 months from broad availability, has already been reserved. — reactive:aws-garman-a100-demand
  5. [5] Andy Jassy on Nvidia's Chip Dominance: 'a New Shift Has Started' — reactive:aws-garman-a100-demand
  6. [6] Amazon CEO defends AI spend: "We're not going to be conservative" — reactive:aws-garman-a100-demand
  7. [7] Amazon CEO takes aim at Nvidia, Intel, Starlink, more in annual ... — reactive:aws-garman-a100-demand
  8. [8] An exclusive tour of Amazon's Trainium lab, the chip that's won over ... — reactive:aws-garman-a100-demand
  9. [9] Analysis: Amazon's 11-year chip journey crowns Anthropic and ... — reactive:aws-garman-a100-demand
  10. [10] Amazon Tripled Its CPU Servers and Still Ran Out as Agentic AI Gobbles Up Every Available Processor in the Cloud — reactive:aws-garman-a100-demand
  11. [11] AI chips have sold out well into 2030. We are still in the early days of ... — reactive:aws-garman-a100-demand
  12. [12] Amazon AWS CEO: AWS is expected to be to sold out of all 2026 ... — reactive:aws-garman-a100-demand
  13. [13] How Much Do GPU Clusters Really Cost? — reactive:aws-garman-a100-demand
  14. [14] AI Compute Surpasses Human Costs: Enterprise Budgets Shift — reactive:aws-garman-a100-demand
  15. [15] GPU Cloud Prices Collapse | Introl Blog — reactive:aws-garman-a100-demand
  16. [16] H100 Hyperscaler Index, April 2026: A Benchmark in Flat Mode - Silicon Data — GPU Performance Data for Companies — reactive:aws-garman-a100-demand
  17. [17] H100 Price Spike: Understanding the 10% Surge in GPU Rental Costs — reactive:aws-garman-a100-demand
  18. [18] GPU Spot Prices Surge 114% in Six Weeks - LinkedIn — reactive:aws-garman-a100-demand
  19. [19] @indiaesh @r0ck3t23 **Fact check: Accurate on the core claim.** — reactive:aws-garman-a100-demand (2026-04-30)
  20. [20] Beyond the Bubble: Why AI Infrastructure Will Compound Long after ... — reactive:aws-garman-a100-demand
  21. [21] Why the GenAI Infrastructure Boom May Break Historical Patterns - CITP Blog — reactive:aws-garman-a100-demand
  22. [22] The Market Hates Big Cloud Spending. The Data Says The Market Is Wrong. — reactive:aws-garman-a100-demand
  23. [23] Jeff Sica warns of 'breaking point' over hyperscaler spending — reactive:aws-garman-a100-demand
  24. [24] Jeff Sica warns of 'breaking point' over hyperscaler spending — reactive:aws-garman-a100-demand
  25. [25] The AI boom in 2025 looks a lot like past infrastructure manias ... - X — reactive:aws-garman-a100-demand
  26. [26] Overbuilding the Future: Why AI Is Repeating the Railways and Dark Fiber — reactive:aws-garman-a100-demand
  27. [27] Enterprise Survey Finds 93% Are Repatriating AI Workloads — reactive:aws-garman-a100-demand
  28. [28] Cloud Repatriation 2026: Why 86% of CIOs Are Moving Workloads Back | Tasrie IT Services — reactive:aws-garman-a100-demand
  29. [29] 83% of Enterprises Plan to Move Workloads Back to Private Cloud - Data Canopy — reactive:aws-garman-a100-demand
  30. [30] AWS CEO Says Compute Demand 'Almost Insatiable' — reactive:aws-garman-a100-demand
  31. [31] AWS CEO Garman said space data centers likely to take longer — reactive:aws-garman-a100-demand
  32. [32] Amazon considers selling in-house AI chips to firms amid demand surge | Company News - Business Standard — reactive:aws-garman-a100-demand
  33. [33] Exclusive: Amazon CEO sees AI doubling prior AWS sales ... - Reuters — reactive:aws-garman-a100-demand
  34. [34] Exclusive-Amazon CEO sees AI doubling prior AWS sales ... — reactive:aws-garman-a100-demand
  35. [35] Amazon's Hidden AI Business Could Change Everything for Investors — reactive:aws-garman-a100-demand
  36. [36] Amazon Trainium Chips for Sale? Jassy Hints at Third-Party Sales as Demand Soars | Knowledge Hub Media — reactive:aws-garman-a100-demand
  37. [37] Amazon toys with selling Trainium chips outside AWS – Fudzilla.com — reactive:aws-garman-a100-demand
  38. [38] Amazon may sell Trainium AI chips to third parties in shot at Nvidia - Quartz — reactive:aws-garman-a100-demand
  39. [39] MLQ.ai | AI for investors — reactive:aws-garman-a100-demand
  40. [40] Amazon's chip business could be worth $50 billion, Jassy says ... - TNW — reactive:aws-garman-a100-demand
  41. [41] CEO Andy Jassy's 2025 Letter to Shareholders - About Amazon — reactive:aws-garman-a100-demand
  42. [42] NVv4 series retirement - Azure Virtual Machines | Azure Docs — reactive:aws-garman-a100-demand
  43. [43] Is the NVIDIA A100 VM Series Being Retired in Azure? - Microsoft Q&A — reactive:aws-garman-a100-demand
  44. [44] NDasrA100_v4 size series - Azure Virtual Machines | Microsoft Learn — reactive:aws-garman-a100-demand
  45. [45] Migrate your NVv3-series virtual machines by September 30, 2026 — reactive:aws-garman-a100-demand
  46. [46] Migrate your retiring Azure Virtual Machines (VMs) to latest ... - Reddit — reactive:aws-garman-a100-demand
  47. [47] Azure to retire NVv4-series VMs on September 30, 2026 - LinkedIn — reactive:aws-garman-a100-demand
  48. [48] NVv4 series retirement - Azure Virtual Machines | Microsoft Learn — reactive:aws-garman-a100-demand
  49. [49] Retired Azure VM size series - Virtual Machines - Microsoft Learn — reactive:aws-garman-a100-demand
  50. [50] Retirement: NVv4-series Azure Virtual Machines will be ... - Azalio — reactive:aws-garman-a100-demand
  51. [51] Azure NV Series EOL Announcement - Dizzion Support Center — reactive:aws-garman-a100-demand
  52. [52] Migrate your retiring Azure Virtual Machines (VMs) to latest-generation VMs before 15 November 2028 - Microsoft Q&A — reactive:aws-garman-a100-demand
  53. [53] Azure VM Size Series Retirement 2028: Timeline and Migration ... — reactive:aws-garman-a100-demand
  54. [54] Migration Guide for GPU Compute Workloads in Azure - Azure Virtual Machines | Microsoft Learn — reactive:aws-garman-a100-demand
  55. [55] AWS CEO Matt Garman said they have never retired an A100 server. — reactive:aws-garman-a100-demand (2026-04-26)
  56. [56] Amazon Web Services CEO Matt Garman said today there is so ... — reactive:aws-garman-a100-demand
  57. [57] AI capex ROI becomes key 2026 test for hyperscalers - Seeking Alpha — reactive:aws-garman-a100-demand
  58. [58] The AI Capex Debate: Misallocation or Generational ROIC? | InvestorPlace — reactive:aws-garman-a100-demand
  59. [59] The Flip Side podcast - Episode 82 | Barclays Investment Bank — reactive:aws-garman-a100-demand
  60. [60] Hyperscaler Capex Snowballs Toward $700B as Firms Stage AI Builds — reactive:aws-garman-a100-demand
  61. [61] AWS CEO Matt Garman said they have never retired an A100 server ... — reactive:aws-garman-a100-demand
  62. [62] Milk Road AI — reactive:aws-garman-a100-demand
  63. [63] $AMZN AWS CEO Matt Garman revealed that the company has never retired a single A100 server. — reactive:aws-garman-a100-demand (2026-04-27)
  64. [64] GPU rental prices (H100) hit___ by April 30? - Yahoo Finance — reactive:aws-garman-a100-demand
  65. [65] The Great GPU Shortage – Rental Capacity [Semi Analysis] - Reddit — reactive:aws-garman-a100-demand
  66. [66] GPU rental prices (H100) hit___ by April 30? Trading Odds & Predictions 2026 | Polymarket — reactive:aws-garman-a100-demand
  67. [67] And AWS has never once taken an A100 offline because demand is ... — reactive:aws-garman-a100-demand
  68. [68] GPU rental prices (H100) hit___ by April 30? | Prediction - Jupiter — reactive:aws-garman-a100-demand
  69. [69] GPU rental prices (H100) hit___ by April 30? - MLQ.ai | AI for investors — reactive:aws-garman-a100-demand
  70. [70] SDH100RT Predictions & Real-Time Odds - Polymarket — reactive:aws-garman-a100-demand
  71. [71] Silicon Data Predictions & Real-Time Odds | Polymarket — reactive:aws-garman-a100-demand
  72. [72] Prices Predictions & Real-Time Odds | Polymarket — reactive:aws-garman-a100-demand
  73. [73] GPU rental prices (H200) hit___ by May 31? Trading Odds & Predictions 2026 | Polymarket — reactive:aws-garman-a100-demand
  74. [74] AI CAPEX ROI becomes key 2026 test for hyperscalers - MSN — reactive:aws-garman-a100-demand
  75. [75] Amazon Just Revealed a Hidden $50 Billion Business — reactive:aws-garman-a100-demand
  76. [76] Amazon Just Revealed This 1 Business Is Worth More Than 82% of ... — reactive:aws-garman-a100-demand
  77. [77] Amazon Just Revealed a Hidden $50 Billion Business -- Is This the Buy Signal Investors Needed? — reactive:aws-garman-a100-demand
  78. [78] Amazon: Andy Jassy's Shareholder Letter Is A Bull's Dream — reactive:aws-garman-a100-demand
  79. [79] Amazon rallies after highlighting massive upside for the Trainium ... — reactive:aws-garman-a100-demand
  80. [80] Launching GPU Instances on AWS: Understanding Capacity, Quotas, and Reservations — reactive:aws-garman-a100-demand
  81. [81] What AWS’s GPU Pricing Shift Reveals About Cloud Cost Risk - Amplix — reactive:aws-garman-a100-demand
  82. [82] How do you handle on-demand GPU instances for AI ... — reactive:aws-garman-a100-demand
  83. [83] AWS raises GPU prices 15% on a Saturday • The Register — reactive:aws-garman-a100-demand
  84. [84] The GPU Capacity Crisis: Why Enterprises Are Rethinking Where AI ... — reactive:aws-garman-a100-demand
  85. [85] AWS Raised Prices 15%? No, It's More Complicated Than That | LCMH - Digital Services — reactive:aws-garman-a100-demand
  86. [86] EC2 Capacity Blocks : r/aws — reactive:aws-garman-a100-demand
  87. [87] Enterprise AI Workstation vs Cloud Cost Analysis 2026 — What the Numbers Actually Say | Clanker Cloud Blog — reactive:aws-garman-a100-demand
  88. [88] Cloud Repatriation in 2026: Moving AI & Heavy Workloads On-Prem — reactive:aws-garman-a100-demand
  89. [89] Stop Measuring AI Training Costs In GPU Hours - The Next Platform — reactive:aws-garman-a100-demand
  90. [90] Cloud Bill Shock in 2026: How AI Workloads Are Draining IT Budgets — reactive:aws-garman-a100-demand
  91. [91] How Platform Engineering Impacts Cloud Costs in 2026 - Everyday IT — reactive:aws-garman-a100-demand
  92. [92] Cloud Repatriation in 2026: Moving AI & Heavy Workloads On-Prem — reactive:aws-garman-a100-demand
  93. [93] What is cloud repatriation and why are companies doing it in 2026? | Blog — Northflank — reactive:aws-garman-a100-demand
  94. [94] Why Businesses Are Bringing Workloads Back On-Premises in 2026 — reactive:aws-garman-a100-demand
  95. [95] Cloud Repatriation Trends: Cost, AI and the Push Towards Hybrid - HBS — reactive:aws-garman-a100-demand
  96. [96] Cloud Repatriation Case Study - Petronella Technology Group — reactive:aws-garman-a100-demand
  97. [97] How AI is reshaping cloud repatriation strategies - OpenText Blogs — reactive:aws-garman-a100-demand
  98. [98] Cloud Repatriation in 2026: Costs, Compliance, and Control | Volico — reactive:aws-garman-a100-demand
  99. [99] Enterprise AI Workstation vs Cloud Cost Analysis 2026: Inference Breakeven by Team Size | Clanker Cloud Blog — reactive:aws-garman-a100-demand
  100. [100] Unsustainable Cloud GPU Rental Costs in 2026 - NeuralRack AI — reactive:aws-garman-a100-demand
  101. [101] Cloud Repatriation in 2026: Moving AI & Heavy Workloads On-Prem — reactive:aws-garman-a100-demand
  102. [102] Enterprise Survey Finds 93% are Repatriating AI Workloads or Evaluating a Move Away from Public Cloud - StorageNewsletter — reactive:aws-garman-a100-demand
  103. [103] Enterprise Survey Finds 93% Are Repatriating AI Workloads or Evaluating a Move Away from Public Cloud — reactive:aws-garman-a100-demand
  104. [104] 93% of Enterprises Repatriating AI Workloads from Public Cloud - San Mateo Today — reactive:aws-garman-a100-demand
  105. [105] Cloud Repatriation Strategy: What the 93% Signal Actually Means — reactive:aws-garman-a100-demand
  106. [106] Enterprises shift AI workloads from public cloud due to cost and ... — reactive:aws-garman-a100-demand
  107. [107] Enterprises shift AI workloads from public cloud to on-prem due to ... — reactive:aws-garman-a100-demand
  108. [108] Enterprise Survey Finds 93% Are Repatriating AI Workloads or ... — reactive:aws-garman-a100-demand
  109. [109] Why CIOs Are Moving AI Workloads from Public Cloud to Colocation ... — reactive:aws-garman-a100-demand
  110. [110] Cloud Repatriation During Price Increases: Why It Won't Work for AI Workloads - SoftwareSeni — reactive:aws-garman-a100-demand
  111. [111] On-Premise vs Cloud: Generative AI Total Cost of Ownership (2026 ... — reactive:aws-garman-a100-demand
  112. [112] AI Demand Boosts GPU Prices, Says Silicon Data CEO Carmen Li — reactive:aws-garman-a100-demand
  113. [113] Launching our H100 1 Year Rental Price Index - SemiAnalysis — reactive:aws-garman-a100-demand
  114. [114] Global GPU Shortage Crisis | H100 Rental Prices Surge 40% — reactive:aws-garman-a100-demand
  115. [115] Launching our H100 1 Year Rental Price Index | Sheng-Che Huang — reactive:aws-garman-a100-demand
  116. [116] Nvidia H100 GPU rental price increase in 2026 - YouTube — reactive:aws-garman-a100-demand
  117. [117] The Great GPU Shortage – Rental Capacity — reactive:aws-garman-a100-demand
  118. [118] Nvidia's H100 GPU rental prices surge nearly 40% in 6 months - MSN — reactive:aws-garman-a100-demand
  119. [119] NVIDIA Blackwell GPU Pricing: B200, B300 and DGX Cost Breakdown — reactive:aws-garman-a100-demand
  120. [120] Launching our H100 1 Year Rental Price Index - SemiAnalysis — reactive:aws-garman-a100-demand
  121. [121] Cast AI Data Shows GPU Pricing Will See a Foundational Shift in 2026 — reactive:aws-garman-a100-demand
  122. [122] GPU Pricing Index  – SemiAnalysis — reactive:aws-garman-a100-demand
  123. [123] NVIDIA H100 GPU Pricing: 2025 Rent vs. Buy Cost Analysis — reactive:aws-garman-a100-demand
  124. [124] Cast AI Data Shows GPU Pricing Will See a Foundational Shift in 2026 - Cast AI — reactive:aws-garman-a100-demand
  125. [125] Why Nvidia GPU prices are 'going nuts,' according to one CEO | Carmen Li — reactive:aws-garman-a100-demand
  126. [126] The Illusion of Stability: Unpacking H100 GPU Market Value Trends - Silicon Data — GPU Performance Data for Companies — reactive:aws-garman-a100-demand
  127. [127] Silicon H100 - GPU Rental Price Tracker — reactive:aws-garman-a100-demand
  128. [128] H100 Rental Price Over Time (2023–2025): A Complete Market Analysis - Silicon Data — GPU Performance Data for Companies — reactive:aws-garman-a100-demand
  129. [129] AI GPU Rental Market Trends (April 2026): Complete Industry Analysis — reactive:aws-garman-a100-demand
  130. [130] Amazon plunge continues $1T wipeout as AI bubble fears ignite sell ... — reactive:aws-garman-a100-demand
  131. [131] AWS CEO Matt Garman is pushing back hard on the idea ... - Reddit — reactive:aws-garman-a100-demand
  132. [132] The Real AI CapEx Problem No One Wants to Talk About — reactive:aws-garman-a100-demand
  133. [133] Concerns about the ai bubble and overbuilding capacity - Facebook — reactive:aws-garman-a100-demand
  134. [134] Hyperscaler AI Spending Doubts Rising - Futuriom — reactive:aws-garman-a100-demand
  135. [135] Are the Hyperscalers Turning Themselves into the Telecom ... — reactive:aws-garman-a100-demand
  136. [136] Parallels Between the Hyperscalers and the Telecom Firms of the 1990s | MOI Global — reactive:aws-garman-a100-demand
  137. [137] "This Time is Different": Will AI end up like the telecom bust? — reactive:aws-garman-a100-demand
  138. [138] The Treadmill: Why the AI Infrastructure Bet Breaks Every ... — reactive:aws-garman-a100-demand
  139. [139] 🔗: https://bit.ly/41P2wP4 The hyperscalers building the ... — reactive:aws-garman-a100-demand
  140. [140] Accelerate Monthly - The AI Capex Boom: Are the Hyperscalers ... — reactive:aws-garman-a100-demand
  141. [141] Are we repeating the telecoms crash with AI datacenters? | Hacker News — reactive:aws-garman-a100-demand
  142. [142] The Challenging ROI Trajectory for GenAI - LinkedIn — reactive:aws-garman-a100-demand
  143. [143] The AI Capex Boom: Bubble or Infrastructure Supercycle? - LongYield — reactive:aws-garman-a100-demand
  144. [144] AI boom risks bust; capex overkill could backfire, warns analyst — reactive:aws-garman-a100-demand
  145. [145] How AI Is Driving Revenue, Cutting Costs and Boosting Productivity for Every Industry in 2026 — reactive:aws-garman-a100-demand
  146. [146] AWS has “never retired” an Nvidia A100 server, CEO Matt Garman ... — reactive:aws-garman-a100-demand
  147. [147] AI demand is so high, AWS customers are trying to buy out its entire ... — reactive:aws-garman-a100-demand
  148. [148] We’re Using So Much AI That Computing Firepower Is Running Out — reactive:aws-garman-a100-demand
  149. [149] Amazon's retail business resolves internal GPU capacity shortage - DCD — reactive:aws-garman-a100-demand
  150. [150] AWS Hikes EC2 Capacity Block Rates by 15% in Uniform ML Pricing Adjustment - InfoQ — reactive:aws-garman-a100-demand
  151. [151] AWS just quietly increased EC2 Capacity Block prices – here's what you need to know | IT Pro — reactive:aws-garman-a100-demand
  152. [152] Nvidia's H100 GPU rental prices surge nearly 40% in 6 months - MSN — reactive:aws-garman-a100-demand
  153. [153] Amazon closes at record after $38 billion OpenAI deal with AWS — reactive:aws-garman-a100-demand
  154. [154] Amazon May Sell Trainium AI Chips To Third Parties In Shot At Nvidia — reactive:aws-garman-a100-demand
  155. [155] Amazon Considers Selling Trainium AI Chip Racks | Let's Data Science — reactive:aws-garman-a100-demand
  156. [156] Amazon the Chip Company? Tech Giant Wants to Sell Its AI Chips ... — reactive:aws-garman-a100-demand
  157. [157] AWS, Microsoft, and Google Are Pricing Themselves Out of AI — reactive:aws-garman-a100-demand
  158. [158] Why GPU Cloud Providers Are Losing Money and How to ... - YouTube — reactive:aws-garman-a100-demand
  159. [159] AWS and OpenAI announce multi-year strategic partnership — reactive:aws-garman-a100-demand
  160. [160] AI Compute Costs Rise: AWS GPU Prices Increase 15% | Mutha Nagavamsi posted on the topic | LinkedIn — reactive:aws-garman-a100-demand
  161. [161] AWS Quietly Increased EC2 Capacity Block Prices: What To Know — reactive:aws-garman-a100-demand
  162. [162] Amazon EC2 capacity blocks for ML can be shared across multiple ... — reactive:aws-garman-a100-demand
  163. [163] Exclusive-Amazon CEO sees AI doubling prior AWS sales ... — reactive:aws-garman-a100-demand
  164. [164] Amazon CEO Andy Jassy says the company may eventually sell its ... — reactive:aws-garman-a100-demand
  165. [165] Amazon is considering selling its AI chips to other companies — reactive:aws-garman-a100-demand
  166. [166] Amazon considering selling its AI chips to other companies — reactive:aws-garman-a100-demand
  167. [167] AWS has “never retired” an Nvidia A100 server, CEO Matt Garman ... — reactive:aws-garman-a100-demand
  168. [168] Amazon Web Services CEO Matt Garman said today there is so ... — reactive:aws-garman-a100-demand
  169. [169] AWS CEO Matt Garman said today there is so much demand they ... — reactive:aws-garman-a100-demand
  170. [170] Amazon's Matt Garman: there is so much more demand than supply ... — reactive:aws-garman-a100-demand
  171. [171] SemiWiki.com's Post - LinkedIn — reactive:aws-garman-a100-demand
  172. [172] Amazon Web Services CEO Matt Garman said today there is so ... — reactive:aws-garman-a100-demand
  173. [173] Nvidia's H100 GPU rental prices surge nearly 40% in 6 months — reactive:aws-garman-a100-demand
  174. [174] Nearly Nine in Ten Enterprises Plan Cloud Data Repatriation — reactive:aws-garman-a100-demand
  175. [175] NGN ADVISORY's Post - LinkedIn — reactive:aws-garman-a100-demand
  176. [176] it's about moving from Cloud-First to Cloud-Smart. SDxCentral writes ... — reactive:aws-garman-a100-demand
  177. [177] From repatriation to replatforming: The cloud story no one wants to tell | CIO — reactive:aws-garman-a100-demand