The Information Machine

AWS CEO: AI Compute Demand So Strong No A100 Server Has Ever Been Retired · history

Version 7

2026-05-01 04:18 UTC · 287 items

Narrative

Five synthesis cycles after AWS CEO Matt Garman's April 26 claim that no A100 server has ever been retired[1][2], the thread's defining development this cycle is the CPU shortage narrative graduating from a single WCCFtech report to a multi-source, institutionally-validated story spanning trade press, market research, and financial analysis. Tom's Hardware reports that AI workload CPU requirements are multiplying, Intel is actively shifting production from consumer chips to Xeon server processors, and inference workloads are driving server CPU-to-GPU ratios back toward parity[3]. TrendForce published dedicated market research titled '2026 Agentic AI Wave: CPU Shortage and GPU Ratio Structural Changes'[4], elevating what was previously a single data point into a named structural market thesis. Next Platform reports that this AI-driven CPU shortage is 'saving Intel's financial cookies'[5] — introducing Intel as a specific named beneficiary of the agentic AI demand wave, a company previously absent from the demand narrative. Uncover Alpha frames CPUs as 'the forgotten chip' and 'new bottleneck of the agentic AI era'[6], while ChipStrat analyzes whose CPUs specifically stand to benefit[7]. The WCCFtech report from the prior cycle that Amazon tripled CPU servers and still ran out is now corroborated by this multi-outlet convergence, transforming the compute shortage from a GPU-specific to a total-infrastructure thesis.

The Amazon $20 billion chip business run rate story — reported by Reuters last cycle — continues its propagation arc. The Register published its own piece titled 'Amazon's chips become a $20B business'[8], Yahoo Finance covered 'Amazon's Chip Business Tops $20 Billion Run Rate'[9], Bulios framed it as chips 'quietly' becoming a $20B business that could reach $50B[10], Let's Data Science syndicated the news[11], and a YouTube video titled 'BREAKING: Amazon's Chip Business Hits $20B Run Rate' extended reach to video audiences[12]. LinkedIn posts specifically discuss the talent implications of Amazon's $20B chip business[13], and Facebook posts amplify the Jassy Trainium third-party sales angle[14]. This propagation establishes the $20B figure as the consensus anchor for Amazon's chip business valuation across mainstream and investor-targeted outlets — no longer a Reuters exclusive but a broadly circulated data point. The Perplexity finance page tracking the Polymarket H100 April 30 GPU price prediction market appeared[15], though no resolution outcome is documented in thread items.

The discourse has evolved along two parallel tracks this cycle. The first is the CPU shortage becoming the thread's most corroborated new claim: TrendForce, Tom's Hardware, Next Platform, Uncover Alpha, ChipStrat, and WCCFtech (via Facebook amplification[16]) have all published independent analyses arriving at the same conclusion — agentic AI is consuming CPUs at intensities that are reshaping the semiconductor supply landscape. Intel's production pivot to Xeon specifically[3] is the most operationally concrete evidence that the shortage is affecting real manufacturing decisions, not just cloud allocation queues. The second track is Amazon's Trainium externalization story transitioning from projection to investment narrative: the $20B run rate is now widely cited, the $50B externalization upside is being framed by Bulios and others as the next chapter[10], and LinkedIn commentary is analyzing the talent and organizational implications[13]. The Reddit r/amd_fundamentals thread analyzing Intel's CPU positioning in the agentic AI context[17] signals that retail-level investor communities are now treating the CPU shortage as an equity thesis, not just a supply-chain observation.

Timeline

  • 2025-11-03: Amazon closes a $38 billion cloud deal with OpenAI on AWS, locking a major AI lab into AWS infrastructure at multi-year scale; AWS and OpenAI announce a multi-year strategic partnership. [166][175]
  • 2025-12-01: Introl documents a GPU cloud price collapse in December 2025, establishing the trough from which the SemiAnalysis-documented 40% H100 rental surge subsequently began. [133]
  • 2026-01-05: AWS raises EC2 Capacity Block prices 15% in a uniform ML pricing adjustment; subsequent analysis notes variable real-world impact depending on reservation structure. [83][163][164][85][86][176][177]
  • 2026-01-10: NeuralRack AI publishes analysis characterizing cloud GPU rental costs as 'unsustainable' in 2026, establishing early documentation of the cost pressure thesis. [102]
  • 2026-02-01: AWS announces EC2 Capacity Blocks can now be shared across multiple accounts, easing enterprise multi-account ML infrastructure management. [178]
  • 2026-03-17: Reuters publishes exclusive: Amazon CEO Andy Jassy projects AI will double prior AWS sales projections to $600 billion by 2036. [24][25][179]
  • 2026-03-22: TechCrunch publishes exclusive tour of Amazon's Trainium lab, reporting the chip has won over Anthropic, OpenAI, and Apple — the first lab-level product validation of Trainium adoption to appear in the thread. [38]
  • 2026-04-09: Jassy's 2025 shareholder letter published. Reuters reports Amazon's chips business already has an annual revenue run rate exceeding $20 billion. Jassy states 'our chips business is on fire,' documents Trainium2 as largely sold out, Trainium3 as nearly fully-subscribed, and significant Trainium4 capacity already reserved 18 months ahead of availability. Jassy says 'a new shift has started' away from Nvidia's chip dominance and defends AI spending as 'not going to be conservative.' Amazon is also considering selling Trainium to external customers, valued at potentially $50 billion if realized. Story propagates across TechCrunch, Business Insider, Slashdot, Motley Fool, Yahoo Finance, Seeking Alpha, MSN, Financial Post, and multiple social platforms. [32][33][34][35][36][37][23][27][28][29][30][31][75][76][170][180][181][39][171][172][79][78][77][182]
  • 2026-04-23: Next Platform publishes 'Stop Measuring AI Training Costs In GPU Hours,' signaling a methodological shift in enterprise AI compute cost evaluation. [89]
  • 2026-04-26: AWS CEO Matt Garman publicly states AWS has never retired a single Nvidia A100 server and is completely sold out of A100 capacity, citing persistent demand exceeding supply even for older GPU generations. [1][2][55][183][159]
  • 2026-04-26: Garman statement rapidly amplified across X, LinkedIn, Reddit, SemiWiki, and Threads; investment commentary frames it as the definitive AI infrastructure demand signal. [56][184][185][186][187][61][62][67][188]
  • 2026-04-27: Azure VM retirement documentation consolidates: NVv4 (AMD Radeon MI25) and NVv3 (older Nvidia Tesla) series confirmed for September 30, 2026 retirement; separate 2028 retirement track documented. A100-based NDasrA100_v4 series remains active. Migration guidance published for GPU compute workloads. [40][41][42][43][44][45][46][47][48][49][50][51][52]
  • 2026-04-27: SemiAnalysis launches H100 one-year rental price index documenting nearly 40% surge over six months; data propagates to MSN, YouTube, Reddit r/NVDA_Stock, Seeking Alpha, and Polymarket prediction markets tracking H100 prices by April 30. [115][116][117][118][65][66][119][120][122][125][165][189]
  • 2026-04-27: Cloud repatriation achieves survey-level quantification: Cloudian (93%), Tasrie IT Services (86%), and Data Canopy (83%) figures circulate. SoftwareSeni counter-argues repatriation won't work specifically for AI workloads. [87][88][90][91][92][93][94][168][95][96][97][98][99][100][101][102][103][104][105][106][107][108][109][110][190][191][192][111][193]
  • 2026-04-27: NVIDIA Blackwell-generation GPU pricing (B200, B300, DGX systems) documented, establishing next-generation cost context. [121]
  • 2026-04-27: Next Platform reports 'AI-Driven CPU Shortage Saves Intel's Financial Cookies,' naming Intel as a direct beneficiary of agentic AI compute demand. Intel is documented shifting production from consumer chips to Xeon server processors as inference workloads drive CPU-to-GPU ratios back toward parity. [5][3]
  • 2026-04-28: Silicon Data publishes April 2026 H100 Hyperscaler Index characterizing reservation-level pricing as 'in flat mode.' A separate Silicon Data analysis documents a 10% H100 spot/retail price spike. Tomasz Tunguz documents GPU spot prices surging 114% over six weeks. [124][127][130][131][132][134]
  • 2026-04-28: KKR publishes institutional bull case for AI infrastructure. Jeff Sica warns of a 'breaking point' over hyperscaler spending. Yahoo Finance reports AI compute costs have surpassed human labor costs in enterprise budgets. [137][153][155][113]
  • 2026-04-29: Amazon's $20B chip business run rate propagates to The Register, Yahoo Finance, Bulios, Let's Data Science, and YouTube — transitioning from Reuters wire exclusive to broadly circulated mainstream and investor-targeted narrative. LinkedIn commentary examines talent and organizational implications of the $20B chip business. [8][9][10][11][12][13][14]
  • 2026-04-30: On the Polymarket H100 prediction market deadline date, Grok's AI fact-checker assesses Garman's A100 retirement claim as 'accurate on the core claim.' WCCFtech reports Amazon tripled its CPU server count and still ran out of capacity as agentic AI consumed all available cloud processors. AWS is reported as expected to sell out all 2026 capacity. TrendForce publishes '2026 Agentic AI Wave: CPU Shortage and GPU Ratio Structural Changes' market research. Uncover Alpha frames CPUs as 'the forgotten chip' and 'new bottleneck of the agentic AI era.' ChipStrat analyzes which CPU vendors stand to benefit. Reddit r/amd_fundamentals carries investment-focused analysis linking agentic AI to Intel CPU demand. [20][22][21][4][6][7][17][15]

Perspectives

Matt Garman, CEO of AWS

AI compute demand structurally exceeds supply across all GPU generations, including legacy hardware. AWS is completely sold out of A100 capacity and has never retired one. Demand is 'almost insatiable.' AWS is expected to sell out all 2026 capacity across compute types.

Evolution: Core stance consistent. The multi-source CPU shortage corroboration this cycle (TrendForce, Tom's Hardware, Next Platform, Uncover Alpha, ChipStrat) provides indirect structural support for Garman's total-compute shortage framing without contradicting or expanding his stated position.

Andy Jassy, CEO of Amazon

Amazon's chips business generates an annual revenue run rate exceeding $20 billion, is 'on fire,' and will be 'much larger than most think.' Trainium2 is largely sold out, Trainium3 nearly fully-subscribed, significant Trainium4 capacity already reserved 18 months ahead. 'A new shift has started' away from Nvidia's chip dominance. Amazon is considering external Trainium sales potentially valued at $50 billion. 'We're not going to be conservative' on AI spending.

Evolution: The $20B run rate has now propagated to The Register, Yahoo Finance, Bulios, and YouTube, cementing it as the consensus anchor figure for Amazon's chip business. The externalization thesis ($50B) is being framed by Bulios and others as the next chapter, shifting the narrative from 'could be worth' to 'here's the path.'

Intel

A new named beneficiary of the agentic AI compute demand wave. Intel is actively shifting production from consumer chips to Xeon server processors as inference workloads drive CPU-to-GPU ratios back toward parity. The AI-driven CPU shortage is described as materially improving Intel's financial position.

Evolution: New voice this cycle with no prior presence in the thread. Intel's production pivot to Xeon is the most operationally concrete evidence that the CPU shortage is affecting real manufacturing decisions, not just cloud allocation queues.

Microsoft Azure

Azure is retiring NVv4 (AMD MI25-based) and NVv3 (older Nvidia Tesla-based) VM series by September 30, 2026. A separate 2028 retirement track covers additional older VM families. The A100-based NDasrA100_v4 series remains active. Migration guidance for GPU compute workloads has been published.

Evolution: Unchanged in substance.

Market research and trade press (TrendForce, Tom's Hardware, Next Platform, Uncover Alpha, ChipStrat)

The agentic AI wave is creating a structural CPU shortage that is reshaping semiconductor supply dynamics. Server CPU-to-GPU ratios are shifting back toward parity as inference workloads consume CPUs at intensities not anticipated in GPU-centric AI infrastructure planning. This is a named, researched market thesis, not a single anecdote.

Evolution: New voice cluster this cycle, catalyzed by the prior cycle's single WCCFtech CPU shortage report. TrendForce's market research and Tom's Hardware's technical coverage elevate the CPU shortage from a cloud-operations data point to an institutionally validated structural market shift.

Investment and financial commentary (Milk Road AI, The AI Investor, LEAPTRADER, SpecialSitsNews, Barclays, InvestorPlace, Seeking Alpha, Polymarket, Motley Fool, Yahoo Finance, Bulios, Reddit r/amd_fundamentals)

Amazon's $20B chip run rate is the new consensus anchor for investment analysis, with the $50B externalization upside now widely framed as the investable next chapter. The CPU shortage is entering retail investor discourse via Reddit threads analyzing Intel positioning. GPU prediction market infrastructure has now passed the H100 April 30 deadline; H200 May 31 is the active forward-looking contract.

Evolution: The $20B figure's propagation to mainstream outlets (Yahoo Finance, The Register) and video (YouTube) marks its graduation from wire-exclusive to broadly accessible investment narrative. Reddit r/amd_fundamentals analyzing CPU shortage equity implications shows retail investor uptake of the agentic AI CPU thesis.

Enterprise practitioners, architects, and CIOs

Three vendor surveys quantify repatriation: Cloudian (93%), Tasrie IT Services (86%), and Data Canopy (83%). AI compute costs have reportedly surpassed human labor costs in enterprise budgets. CIOs are documented moving AI workloads to colocation rather than executing binary repatriation.

Evolution: Unchanged from prior synthesis. No new enterprise practitioner sources this cycle.

GPU pricing analysts (SemiAnalysis, Silicon Data, Cast AI, Spheron, Fusion Worldwide, GMI Cloud, Introl, Tomasz Tunguz, Thunder Compute)

Three divergent H100 price signals coexist: SemiAnalysis's ~40% six-month rental surge starting from Introl's December 2025 trough; Silicon Data's April 2026 hyperscaler-reservation 'flat mode' contrasted with a separate Silicon Data 10% spot/retail spike; and Tunguz's 114% spot price surge over six weeks. SemiAnalysis has also published new analysis on GPU cluster total costs.

Evolution: Unchanged from prior synthesis. No new GPU pricing data this cycle.

AI infrastructure bulls — institutional (KKR, Princeton CITP, Uncover Alpha)

KKR argues AI infrastructure will 'compound long after' any bubble concerns. Princeton's CITP argues GenAI may structurally break historical infrastructure mania patterns. Uncover Alpha now bridges the institutional bull and CPU-shortage narratives, framing CPUs as the new demand frontier the market has underappreciated.

Evolution: Uncover Alpha's CPU piece[6] expands its role from cloud-spending data defender to CPU shortage analyst, adding a new analytical dimension to its prior work.

AI bubble skeptics and value investors (Hacker News, Reddit, Futuriom, Latticework/MOI Global, Jeff Sica, Benzinga, Substack, LinkedIn, MSN)

Jeff Sica warns of a 'breaking point' in hyperscaler spending. Benzinga frames the AI boom as resembling past infrastructure manias. Historical analogies extend to railways, dark fiber, and telecom. AI compute costs surpassing human labor costs is simultaneously a bull and bear data point.

Evolution: Unchanged from prior synthesis. No new skeptic voices this cycle.

Trade press (Data Center Dynamics, The Register, Network World, InfoQ, IT Pro, MSN, Next Platform, WCCFtech, TechCrunch, Digitimes, Reuters, Tom's Hardware, TrendForce)

The compute shortage story has definitively expanded beyond GPUs. Tom's Hardware's Intel production pivot coverage[3] and TrendForce's CPU shortage research[4] add institutional trade-press weight to what was a single WCCFtech report. The Register independently covered the $20B chip business story[8], providing a second major trade publication anchor for the Amazon chips narrative.

Evolution: Tom's Hardware and TrendForce are new outlets this cycle. Their independent coverage of the CPU shortage is the most significant expansion of the trade press voice.

Tensions

  • Is the A100 demand signal evidence of durable structural AI enterprise adoption? The $20B+ current chips revenue run rate (Reuters/The Register/Yahoo Finance), Jassy's $600B AWS projection, Trainium generation sell-through, TechCrunch's Trainium lab adoption tour, NVIDIA's State of AI 2026, KKR's institutional bull case, Princeton's historical-pattern-break argument, and now TrendForce's CPU shortage market research represent the most multi-layered bull position yet. Against this: three vendor repatriation surveys (93%, 86%, 83%), Jeff Sica's 'breaking point' warnings, the compute-cost-surpasses-human-cost threshold, and multiple historical-analogy skeptic frameworks form the bear case. [2][140][141][161][57][58][142][144][24][166][27][31][32][33][34][36][93][95][167][74][137][138][38][153][113][111][8][9][4]
  • The CPU shortage has multi-source institutional validation but the competitive beneficiary question remains open. Intel is named by Next Platform and Tom's Hardware as the primary CPU beneficiary[5][3], while ChipStrat specifically asks 'whose CPUs?' without resolving the question[7]. AMD also appears in Reddit analysis[17]. Whether Intel, AMD, or custom silicon (including Amazon's own Graviton) captures the agentic AI CPU demand is not established. [5][3][7][17][53]
  • H100 pricing: three divergent signals coexist. SemiAnalysis documented a ~40% surge starting from Introl's December 2025 documented price trough. Silicon Data's April 2026 Hyperscaler Index shows 'flat mode' at reservation level while a separate Silicon Data piece documents a 10% spot/retail spike. Tunguz independently documented a 114% spot surge over six weeks. The Polymarket H100 April 30 deadline has passed[15] but the resolution outcome is not documented — whether the prediction contract resolved above or below benchmark is the most specific unresolved empirical question in the thread. [120][122][124][127][133][134][125][165][135][129][15]
  • Cloud repatriation: three surveys, convergent numbers, compromised sources. Cloudian (93%), Tasrie IT Services (86%), and Data Canopy (83%) provide convergent high figures, but all three come from vendors with commercial interests in on-premise or private cloud solutions. The Cloudian 93% figure has propagated to 10+ outlets, lending media breadth without methodological validation. DataBank documents a hybrid CIO path (cloud to colocation) suggesting the binary repatriation framing may mischaracterize actual enterprise behavior. [93][95][111][168][88][90][104][105][106][107][108][109][110][112][169]
  • Amazon's chip business revenue trajectory: the $20B+ current annual run rate and the $50B externalization valuation represent distinct claims now being conflated across mainstream outlets. The Register, Yahoo Finance, and Bulios cover both figures, but the $20B is reported current revenue from internal AWS consumption while the $50B is a projection contingent on third-party sales at scale. Whether the $20B includes any external revenue, and whether externalization can close the gap to $50B, remains unresolved. [32][27][28][29][30][31][33][75][76][77][78][170][171][172][79][8][9][10][13][14]
  • GPU price escalation and enterprise cost sustainability: the compute-surpasses-human-costs threshold cuts both ways. As a bull argument, it means AI ROI justifies the spend; as a bear argument, it means enterprise cost structures are being fundamentally reshaped in ways that may not be sustainable. GPU cloud providers are reportedly losing money even as prices rise, suggesting the economics are broken at both the enterprise customer and cloud provider levels simultaneously. [120][122][173][87][88][90][93][95][167][102][113][174][156]
  • The historical analogy debate has become a three-way contest: (1) repeaters arguing AI is telecom, railways, or dark fiber redux; (2) pattern-breakers arguing GenAI's software monetization characteristics exempt it from prior mania dynamics; (3) institutional compounders (KKR) arguing the question is irrelevant because infrastructure will compound even through a bubble correction. These three positions are now all represented by named institutional or academic voices. [145][146][147][148][149][150][151][137][138][152][154][139]
  • As A100 and H100 demand matures and Blackwell pricing is documented, Amazon's Trainium is positioning as a third chip trajectory outside the Nvidia stack. If Amazon externalizes Trainium at scale (aided by the $20B existing run rate as proof of manufacturing maturity), the GPU supply/demand dynamic would shift — a dynamic that Jassy's 'new shift has started' framing explicitly signals and that The Register's independent coverage[8] is beginning to mainstream. [121][123][115][88][124][32][34][38][39][136][8]

Sources

  1. [1] AWS CEO Matt Garman: "Because there is so much more demand than supply, there typically still is demand for the older ch… — Rohan Paul Twitter (2026-04-26)
  2. [2] Matt Garman, CEO of AWS, Amazon's $100+ billion cloud division and what he just said is the single most important data p… — Milk Road AI Twitter (2026-04-26)
  3. [3] CPU requirements for AI workloads are multiplying, driving intensifying shortages and price hikes — Intel already shifting production from consumer chips to Xeon as inference workloads drive server CPU ratios back toward parity with GPUs | Tom's Hardware — reactive:aws-garman-a100-demand
  4. [4] 2026 Agentic AI Wave: CPU Shortage and GPU Ratio Structural Changes | TrendForce — reactive:aws-garman-a100-demand
  5. [5] AI-Driven CPU Shortage Saves Intel’s Financial Cookies — reactive:aws-garman-a100-demand
  6. [6] The Forgotten Chip: CPUs the New Bottleneck of the Agentic AI Era — reactive:aws-garman-a100-demand
  7. [7] Agentic AI Needs CPUs. Whose CPUs? - by Austin Lyons — reactive:aws-garman-a100-demand
  8. [8] Amazon's chips become a $20B business • The Register — reactive:aws-garman-a100-demand
  9. [9] Amazon's Chip Business Tops $20 Billion Run Rate - Yahoo Finance — reactive:aws-garman-a100-demand
  10. [10] Amazon’s in‑house AI chips quietly become a $20+ billion business – and could be a $50 billion product line | Bulios — reactive:aws-garman-a100-demand
  11. [11] Amazon's custom chips reach a $20B run rate | Let's Data Science — reactive:aws-garman-a100-demand
  12. [12] BREAKING: Amazon's Chip Business Hits $20B Run Rate - YouTube — reactive:aws-garman-a100-demand
  13. [13] Amazon's $20B Chip Business: Talent Shift for AI Silicon - LinkedIn — reactive:aws-garman-a100-demand
  14. [14] Amazon's CEO Andy Jassy said the company could start selling its ... — reactive:aws-garman-a100-demand
  15. [15] GPU rental prices (H100) hit___ by April 30? - Perplexity — reactive:aws-garman-a100-demand
  16. [16] We have reported how the Agentic AI surge is resulting in demand ... — reactive:aws-garman-a100-demand
  17. [17] Deep|INTC: Agentic AI and Supply Bottlenecks Drive Server CPU ... — reactive:aws-garman-a100-demand
  18. [18] AWS CEO Says Compute Demand 'Almost Insatiable' — reactive:aws-garman-a100-demand
  19. [19] AWS CEO Garman said space data centers likely to take longer — reactive:aws-garman-a100-demand
  20. [20] @indiaesh @r0ck3t23 **Fact check: Accurate on the core claim.** — reactive:aws-garman-a100-demand (2026-04-30)
  21. [21] Amazon AWS CEO: AWS is expected to be to sold out of all 2026 ... — reactive:aws-garman-a100-demand
  22. [22] Amazon Tripled Its CPU Servers and Still Ran Out as Agentic AI Gobbles Up Every Available Processor in the Cloud — reactive:aws-garman-a100-demand
  23. [23] Amazon considers selling in-house AI chips to firms amid demand surge | Company News - Business Standard — reactive:aws-garman-a100-demand
  24. [24] Exclusive: Amazon CEO sees AI doubling prior AWS sales ... - Reuters — reactive:aws-garman-a100-demand
  25. [25] Exclusive-Amazon CEO sees AI doubling prior AWS sales ... — reactive:aws-garman-a100-demand
  26. [26] Amazon's Hidden AI Business Could Change Everything for Investors — reactive:aws-garman-a100-demand
  27. [27] Amazon Trainium Chips for Sale? Jassy Hints at Third-Party Sales as Demand Soars | Knowledge Hub Media — reactive:aws-garman-a100-demand
  28. [28] Amazon toys with selling Trainium chips outside AWS – Fudzilla.com — reactive:aws-garman-a100-demand
  29. [29] Amazon may sell Trainium AI chips to third parties in shot at Nvidia - Quartz — reactive:aws-garman-a100-demand
  30. [30] MLQ.ai | AI for investors — reactive:aws-garman-a100-demand
  31. [31] Amazon's chip business could be worth $50 billion, Jassy says ... - TNW — reactive:aws-garman-a100-demand
  32. [32] Amazon CEO reveals AI revenue, dismisses spending doubts in ... — reactive:aws-garman-a100-demand
  33. [33] @ajassy in Amazon's Shareholder Letter: "Our chips business is on fire, changes the economics for AWS, and will be much larger than most think." 2 AWS customers asked if they could buy ALL of Graviton instance capacity in 2026! This gives you an idea of the demand! Trainium2 has largely sold out. Trainium3, which just started shipping at the start of 2026 is nearly fully-subscribed. A significant chunk of Trainium4, which is about 18 months from broad availability, has already been reserved. — reactive:aws-garman-a100-demand
  34. [34] Andy Jassy on Nvidia's Chip Dominance: 'a New Shift Has Started' — reactive:aws-garman-a100-demand
  35. [35] CEO Andy Jassy's 2025 Letter to Shareholders - About Amazon — reactive:aws-garman-a100-demand
  36. [36] Amazon CEO defends AI spend: "We're not going to be conservative" — reactive:aws-garman-a100-demand
  37. [37] Amazon CEO takes aim at Nvidia, Intel, Starlink, more in annual ... — reactive:aws-garman-a100-demand
  38. [38] An exclusive tour of Amazon's Trainium lab, the chip that's won over ... — reactive:aws-garman-a100-demand
  39. [39] Analysis: Amazon's 11-year chip journey crowns Anthropic and ... — reactive:aws-garman-a100-demand
  40. [40] NVv4 series retirement - Azure Virtual Machines | Azure Docs — reactive:aws-garman-a100-demand
  41. [41] Is the NVIDIA A100 VM Series Being Retired in Azure? - Microsoft Q&A — reactive:aws-garman-a100-demand
  42. [42] NDasrA100_v4 size series - Azure Virtual Machines | Microsoft Learn — reactive:aws-garman-a100-demand
  43. [43] Migrate your NVv3-series virtual machines by September 30, 2026 — reactive:aws-garman-a100-demand
  44. [44] Migrate your retiring Azure Virtual Machines (VMs) to latest ... - Reddit — reactive:aws-garman-a100-demand
  45. [45] Azure to retire NVv4-series VMs on September 30, 2026 - LinkedIn — reactive:aws-garman-a100-demand
  46. [46] NVv4 series retirement - Azure Virtual Machines | Microsoft Learn — reactive:aws-garman-a100-demand
  47. [47] Retired Azure VM size series - Virtual Machines - Microsoft Learn — reactive:aws-garman-a100-demand
  48. [48] Retirement: NVv4-series Azure Virtual Machines will be ... - Azalio — reactive:aws-garman-a100-demand
  49. [49] Azure NV Series EOL Announcement - Dizzion Support Center — reactive:aws-garman-a100-demand
  50. [50] Migrate your retiring Azure Virtual Machines (VMs) to latest-generation VMs before 15 November 2028 - Microsoft Q&A — reactive:aws-garman-a100-demand
  51. [51] Azure VM Size Series Retirement 2028: Timeline and Migration ... — reactive:aws-garman-a100-demand
  52. [52] Migration Guide for GPU Compute Workloads in Azure - Azure Virtual Machines | Microsoft Learn — reactive:aws-garman-a100-demand
  53. [53] CPU Squeeze: Agentic AI, Intel, AMD, AWS, H200 - Wukong — reactive:aws-garman-a100-demand
  54. [54] CPU Demand Surges as Agentic AI Shifts Workloads | Adnan Khaleel posted on the topic | LinkedIn — reactive:aws-garman-a100-demand
  55. [55] AWS CEO Matt Garman said they have never retired an A100 server. — reactive:aws-garman-a100-demand (2026-04-26)
  56. [56] Amazon Web Services CEO Matt Garman said today there is so ... — reactive:aws-garman-a100-demand
  57. [57] AI capex ROI becomes key 2026 test for hyperscalers - Seeking Alpha — reactive:aws-garman-a100-demand
  58. [58] The AI Capex Debate: Misallocation or Generational ROIC? | InvestorPlace — reactive:aws-garman-a100-demand
  59. [59] The Flip Side podcast - Episode 82 | Barclays Investment Bank — reactive:aws-garman-a100-demand
  60. [60] Hyperscaler Capex Snowballs Toward $700B as Firms Stage AI Builds — reactive:aws-garman-a100-demand
  61. [61] AWS CEO Matt Garman said they have never retired an A100 server ... — reactive:aws-garman-a100-demand
  62. [62] Milk Road AI — reactive:aws-garman-a100-demand
  63. [63] $AMZN AWS CEO Matt Garman revealed that the company has never retired a single A100 server. — reactive:aws-garman-a100-demand (2026-04-27)
  64. [64] GPU rental prices (H100) hit___ by April 30? - Yahoo Finance — reactive:aws-garman-a100-demand
  65. [65] The Great GPU Shortage – Rental Capacity [Semi Analysis] - Reddit — reactive:aws-garman-a100-demand
  66. [66] GPU rental prices (H100) hit___ by April 30? Trading Odds & Predictions 2026 | Polymarket — reactive:aws-garman-a100-demand
  67. [67] And AWS has never once taken an A100 offline because demand is ... — reactive:aws-garman-a100-demand
  68. [68] GPU rental prices (H100) hit___ by April 30? | Prediction - Jupiter — reactive:aws-garman-a100-demand
  69. [69] GPU rental prices (H100) hit___ by April 30? - MLQ.ai | AI for investors — reactive:aws-garman-a100-demand
  70. [70] SDH100RT Predictions & Real-Time Odds - Polymarket — reactive:aws-garman-a100-demand
  71. [71] Silicon Data Predictions & Real-Time Odds | Polymarket — reactive:aws-garman-a100-demand
  72. [72] Prices Predictions & Real-Time Odds | Polymarket — reactive:aws-garman-a100-demand
  73. [73] GPU rental prices (H200) hit___ by May 31? Trading Odds & Predictions 2026 | Polymarket — reactive:aws-garman-a100-demand
  74. [74] AI CAPEX ROI becomes key 2026 test for hyperscalers - MSN — reactive:aws-garman-a100-demand
  75. [75] Amazon Just Revealed a Hidden $50 Billion Business — reactive:aws-garman-a100-demand
  76. [76] Amazon Just Revealed This 1 Business Is Worth More Than 82% of ... — reactive:aws-garman-a100-demand
  77. [77] Amazon Just Revealed a Hidden $50 Billion Business -- Is This the Buy Signal Investors Needed? — reactive:aws-garman-a100-demand
  78. [78] Amazon: Andy Jassy's Shareholder Letter Is A Bull's Dream — reactive:aws-garman-a100-demand
  79. [79] Amazon rallies after highlighting massive upside for the Trainium ... — reactive:aws-garman-a100-demand
  80. [80] Launching GPU Instances on AWS: Understanding Capacity, Quotas, and Reservations — reactive:aws-garman-a100-demand
  81. [81] What AWS’s GPU Pricing Shift Reveals About Cloud Cost Risk - Amplix — reactive:aws-garman-a100-demand
  82. [82] How do you handle on-demand GPU instances for AI ... — reactive:aws-garman-a100-demand
  83. [83] AWS raises GPU prices 15% on a Saturday • The Register — reactive:aws-garman-a100-demand
  84. [84] The GPU Capacity Crisis: Why Enterprises Are Rethinking Where AI ... — reactive:aws-garman-a100-demand
  85. [85] AWS Raised Prices 15%? No, It's More Complicated Than That | LCMH - Digital Services — reactive:aws-garman-a100-demand
  86. [86] EC2 Capacity Blocks : r/aws — reactive:aws-garman-a100-demand
  87. [87] Enterprise AI Workstation vs Cloud Cost Analysis 2026 — What the Numbers Actually Say | Clanker Cloud Blog — reactive:aws-garman-a100-demand
  88. [88] Cloud Repatriation in 2026: Moving AI & Heavy Workloads On-Prem — reactive:aws-garman-a100-demand
  89. [89] Stop Measuring AI Training Costs In GPU Hours - The Next Platform — reactive:aws-garman-a100-demand
  90. [90] Cloud Bill Shock in 2026: How AI Workloads Are Draining IT Budgets — reactive:aws-garman-a100-demand
  91. [91] How Platform Engineering Impacts Cloud Costs in 2026 - Everyday IT — reactive:aws-garman-a100-demand
  92. [92] Cloud Repatriation in 2026: Moving AI & Heavy Workloads On-Prem — reactive:aws-garman-a100-demand
  93. [93] Enterprise Survey Finds 93% Are Repatriating AI Workloads — reactive:aws-garman-a100-demand
  94. [94] What is cloud repatriation and why are companies doing it in 2026? | Blog — Northflank — reactive:aws-garman-a100-demand
  95. [95] Cloud Repatriation 2026: Why 86% of CIOs Are Moving Workloads Back | Tasrie IT Services — reactive:aws-garman-a100-demand
  96. [96] Why Businesses Are Bringing Workloads Back On-Premises in 2026 — reactive:aws-garman-a100-demand
  97. [97] Cloud Repatriation Trends: Cost, AI and the Push Towards Hybrid - HBS — reactive:aws-garman-a100-demand
  98. [98] Cloud Repatriation Case Study - Petronella Technology Group — reactive:aws-garman-a100-demand
  99. [99] How AI is reshaping cloud repatriation strategies - OpenText Blogs — reactive:aws-garman-a100-demand
  100. [100] Cloud Repatriation in 2026: Costs, Compliance, and Control | Volico — reactive:aws-garman-a100-demand
  101. [101] Enterprise AI Workstation vs Cloud Cost Analysis 2026: Inference Breakeven by Team Size | Clanker Cloud Blog — reactive:aws-garman-a100-demand
  102. [102] Unsustainable Cloud GPU Rental Costs in 2026 - NeuralRack AI — reactive:aws-garman-a100-demand
  103. [103] Cloud Repatriation in 2026: Moving AI & Heavy Workloads On-Prem — reactive:aws-garman-a100-demand
  104. [104] Enterprise Survey Finds 93% are Repatriating AI Workloads or Evaluating a Move Away from Public Cloud - StorageNewsletter — reactive:aws-garman-a100-demand
  105. [105] Enterprise Survey Finds 93% Are Repatriating AI Workloads or Evaluating a Move Away from Public Cloud — reactive:aws-garman-a100-demand
  106. [106] 93% of Enterprises Repatriating AI Workloads from Public Cloud - San Mateo Today — reactive:aws-garman-a100-demand
  107. [107] Cloud Repatriation Strategy: What the 93% Signal Actually Means — reactive:aws-garman-a100-demand
  108. [108] Enterprises shift AI workloads from public cloud due to cost and ... — reactive:aws-garman-a100-demand
  109. [109] Enterprises shift AI workloads from public cloud to on-prem due to ... — reactive:aws-garman-a100-demand
  110. [110] Enterprise Survey Finds 93% Are Repatriating AI Workloads or ... — reactive:aws-garman-a100-demand
  111. [111] 83% of Enterprises Plan to Move Workloads Back to Private Cloud - Data Canopy — reactive:aws-garman-a100-demand
  112. [112] Why CIOs Are Moving AI Workloads from Public Cloud to Colocation ... — reactive:aws-garman-a100-demand
  113. [113] AI Compute Surpasses Human Costs: Enterprise Budgets Shift — reactive:aws-garman-a100-demand
  114. [114] AI Demand Boosts GPU Prices, Says Silicon Data CEO Carmen Li — reactive:aws-garman-a100-demand
  115. [115] Launching our H100 1 Year Rental Price Index - SemiAnalysis — reactive:aws-garman-a100-demand
  116. [116] Global GPU Shortage Crisis | H100 Rental Prices Surge 40% — reactive:aws-garman-a100-demand
  117. [117] Launching our H100 1 Year Rental Price Index | Sheng-Che Huang — reactive:aws-garman-a100-demand
  118. [118] Nvidia H100 GPU rental price increase in 2026 - YouTube — reactive:aws-garman-a100-demand
  119. [119] The Great GPU Shortage – Rental Capacity — reactive:aws-garman-a100-demand
  120. [120] Nvidia's H100 GPU rental prices surge nearly 40% in 6 months - MSN — reactive:aws-garman-a100-demand
  121. [121] NVIDIA Blackwell GPU Pricing: B200, B300 and DGX Cost Breakdown — reactive:aws-garman-a100-demand
  122. [122] Launching our H100 1 Year Rental Price Index - SemiAnalysis — reactive:aws-garman-a100-demand
  123. [123] Cast AI Data Shows GPU Pricing Will See a Foundational Shift in 2026 — reactive:aws-garman-a100-demand
  124. [124] H100 Hyperscaler Index, April 2026: A Benchmark in Flat Mode - Silicon Data — GPU Performance Data for Companies — reactive:aws-garman-a100-demand
  125. [125] GPU Pricing Index  – SemiAnalysis — reactive:aws-garman-a100-demand
  126. [126] NVIDIA H100 GPU Pricing: 2025 Rent vs. Buy Cost Analysis — reactive:aws-garman-a100-demand
  127. [127] H100 Price Spike: Understanding the 10% Surge in GPU Rental Costs — reactive:aws-garman-a100-demand
  128. [128] Cast AI Data Shows GPU Pricing Will See a Foundational Shift in 2026 - Cast AI — reactive:aws-garman-a100-demand
  129. [129] Why Nvidia GPU prices are 'going nuts,' according to one CEO | Carmen Li — reactive:aws-garman-a100-demand
  130. [130] The Illusion of Stability: Unpacking H100 GPU Market Value Trends - Silicon Data — GPU Performance Data for Companies — reactive:aws-garman-a100-demand
  131. [131] Silicon H100 - GPU Rental Price Tracker — reactive:aws-garman-a100-demand
  132. [132] H100 Rental Price Over Time (2023–2025): A Complete Market Analysis - Silicon Data — GPU Performance Data for Companies — reactive:aws-garman-a100-demand
  133. [133] GPU Cloud Prices Collapse | Introl Blog — reactive:aws-garman-a100-demand
  134. [134] GPU Spot Prices Surge 114% in Six Weeks - LinkedIn — reactive:aws-garman-a100-demand
  135. [135] AI GPU Rental Market Trends (April 2026): Complete Industry Analysis — reactive:aws-garman-a100-demand
  136. [136] How Much Do GPU Clusters Really Cost? — reactive:aws-garman-a100-demand
  137. [137] Beyond the Bubble: Why AI Infrastructure Will Compound Long after ... — reactive:aws-garman-a100-demand
  138. [138] Why the GenAI Infrastructure Boom May Break Historical Patterns - CITP Blog — reactive:aws-garman-a100-demand
  139. [139] The Market Hates Big Cloud Spending. The Data Says The Market Is Wrong. — reactive:aws-garman-a100-demand
  140. [140] Amazon plunge continues $1T wipeout as AI bubble fears ignite sell ... — reactive:aws-garman-a100-demand
  141. [141] AWS CEO Matt Garman is pushing back hard on the idea ... - Reddit — reactive:aws-garman-a100-demand
  142. [142] The Real AI CapEx Problem No One Wants to Talk About — reactive:aws-garman-a100-demand
  143. [143] Concerns about the ai bubble and overbuilding capacity - Facebook — reactive:aws-garman-a100-demand
  144. [144] Hyperscaler AI Spending Doubts Rising - Futuriom — reactive:aws-garman-a100-demand
  145. [145] Are the Hyperscalers Turning Themselves into the Telecom ... — reactive:aws-garman-a100-demand
  146. [146] Parallels Between the Hyperscalers and the Telecom Firms of the 1990s | MOI Global — reactive:aws-garman-a100-demand
  147. [147] "This Time is Different": Will AI end up like the telecom bust? — reactive:aws-garman-a100-demand
  148. [148] The Treadmill: Why the AI Infrastructure Bet Breaks Every ... — reactive:aws-garman-a100-demand
  149. [149] 🔗: https://bit.ly/41P2wP4 The hyperscalers building the ... — reactive:aws-garman-a100-demand
  150. [150] Overbuilding the Future: Why AI Is Repeating the Railways and Dark Fiber — reactive:aws-garman-a100-demand
  151. [151] Accelerate Monthly - The AI Capex Boom: Are the Hyperscalers ... — reactive:aws-garman-a100-demand
  152. [152] Are we repeating the telecoms crash with AI datacenters? | Hacker News — reactive:aws-garman-a100-demand
  153. [153] Jeff Sica warns of 'breaking point' over hyperscaler spending — reactive:aws-garman-a100-demand
  154. [154] The AI boom in 2025 looks a lot like past infrastructure manias ... - X — reactive:aws-garman-a100-demand
  155. [155] Jeff Sica warns of 'breaking point' over hyperscaler spending — reactive:aws-garman-a100-demand
  156. [156] The Challenging ROI Trajectory for GenAI - LinkedIn — reactive:aws-garman-a100-demand
  157. [157] The AI Capex Boom: Bubble or Infrastructure Supercycle? - LongYield — reactive:aws-garman-a100-demand
  158. [158] AI boom risks bust; capex overkill could backfire, warns analyst — reactive:aws-garman-a100-demand
  159. [159] AWS has “never retired” an Nvidia A100 server, CEO Matt Garman ... — reactive:aws-garman-a100-demand
  160. [160] AI demand is so high, AWS customers are trying to buy out its entire ... — reactive:aws-garman-a100-demand
  161. [161] We’re Using So Much AI That Computing Firepower Is Running Out — reactive:aws-garman-a100-demand
  162. [162] Amazon's retail business resolves internal GPU capacity shortage - DCD — reactive:aws-garman-a100-demand
  163. [163] AWS Hikes EC2 Capacity Block Rates by 15% in Uniform ML Pricing Adjustment - InfoQ — reactive:aws-garman-a100-demand
  164. [164] AWS just quietly increased EC2 Capacity Block prices – here's what you need to know | IT Pro — reactive:aws-garman-a100-demand
  165. [165] Nvidia's H100 GPU rental prices surge nearly 40% in 6 months - MSN — reactive:aws-garman-a100-demand
  166. [166] Amazon closes at record after $38 billion OpenAI deal with AWS — reactive:aws-garman-a100-demand
  167. [167] How AI Is Driving Revenue, Cutting Costs and Boosting Productivity for Every Industry in 2026 — reactive:aws-garman-a100-demand
  168. [168] Cloud Repatriation During Price Increases: Why It Won't Work for AI Workloads - SoftwareSeni — reactive:aws-garman-a100-demand
  169. [169] On-Premise vs Cloud: Generative AI Total Cost of Ownership (2026 ... — reactive:aws-garman-a100-demand
  170. [170] Amazon May Sell Trainium AI Chips To Third Parties In Shot At Nvidia — reactive:aws-garman-a100-demand
  171. [171] Amazon Considers Selling Trainium AI Chip Racks | Let's Data Science — reactive:aws-garman-a100-demand
  172. [172] Amazon the Chip Company? Tech Giant Wants to Sell Its AI Chips ... — reactive:aws-garman-a100-demand
  173. [173] AWS, Microsoft, and Google Are Pricing Themselves Out of AI — reactive:aws-garman-a100-demand
  174. [174] Why GPU Cloud Providers Are Losing Money and How to ... - YouTube — reactive:aws-garman-a100-demand
  175. [175] AWS and OpenAI announce multi-year strategic partnership — reactive:aws-garman-a100-demand
  176. [176] AI Compute Costs Rise: AWS GPU Prices Increase 15% | Mutha Nagavamsi posted on the topic | LinkedIn — reactive:aws-garman-a100-demand
  177. [177] AWS Quietly Increased EC2 Capacity Block Prices: What To Know — reactive:aws-garman-a100-demand
  178. [178] Amazon EC2 capacity blocks for ML can be shared across multiple ... — reactive:aws-garman-a100-demand
  179. [179] Exclusive-Amazon CEO sees AI doubling prior AWS sales ... — reactive:aws-garman-a100-demand
  180. [180] Amazon CEO Andy Jassy says the company may eventually sell its ... — reactive:aws-garman-a100-demand
  181. [181] Amazon is considering selling its AI chips to other companies — reactive:aws-garman-a100-demand
  182. [182] Amazon considering selling its AI chips to other companies — reactive:aws-garman-a100-demand
  183. [183] AWS has “never retired” an Nvidia A100 server, CEO Matt Garman ... — reactive:aws-garman-a100-demand
  184. [184] Amazon Web Services CEO Matt Garman said today there is so ... — reactive:aws-garman-a100-demand
  185. [185] AWS CEO Matt Garman said today there is so much demand they ... — reactive:aws-garman-a100-demand
  186. [186] Amazon's Matt Garman: there is so much more demand than supply ... — reactive:aws-garman-a100-demand
  187. [187] SemiWiki.com's Post - LinkedIn — reactive:aws-garman-a100-demand
  188. [188] Amazon Web Services CEO Matt Garman said today there is so ... — reactive:aws-garman-a100-demand
  189. [189] Nvidia's H100 GPU rental prices surge nearly 40% in 6 months — reactive:aws-garman-a100-demand
  190. [190] Nearly Nine in Ten Enterprises Plan Cloud Data Repatriation — reactive:aws-garman-a100-demand
  191. [191] NGN ADVISORY's Post - LinkedIn — reactive:aws-garman-a100-demand
  192. [192] it's about moving from Cloud-First to Cloud-Smart. SDxCentral writes ... — reactive:aws-garman-a100-demand
  193. [193] From repatriation to replatforming: The cloud story no one wants to tell | CIO — reactive:aws-garman-a100-demand