The Information Machine

NVIDIA Launches Vera CPU and Vera Rubin NVL72 at COMPUTEX / GTC Taipei · history

Version 4

2026-05-24 10:11 UTC · 135 items

What

NVIDIA's Vera CPU (88-core, 1.2 TB/s bandwidth [1]) and Vera Rubin NVL72 are entering commercial deployment — Vera CPUs were hand-delivered to OpenAI and Anthropic on May 18 [2], Meta confirmed as a Vera Rubin customer [10], and Nebius formalized a full-stack AI cloud partnership via SEC filing [11] — but the memory supply crisis has become measurable: Samsung has sold out its entire 2026 HBM4 supply [20], SK Hynix holds roughly 70% of NVIDIA's HBM4 orders [15], Micron's inclusion in Vera Rubin's HBM4 supply chain is contested across sources [16][17][18], and a 435% surge in memory costs pushes the per-rack memory bill to $2M out of a $7.8M total [14]. NVIDIA's Q1 2026 results — $81.6B in revenue, up 85% year-over-year [8][9] — confirm parabolic demand, while a new analyst forecast projects NVIDIA will ship 4 million Vera CPUs in FY2027 and capture two-thirds of the x86 server CPU market [28].

Why it matters

The HBM4 shortage is now projected to persist until 2028 [23], transforming what was a near-term supply caveat into a multi-year structural constraint on AI infrastructure expansion — and the 435% memory price surge [14] directly challenges NVIDIA's headline cost-per-token claims, which remain unverified by independent benchmarks from production hardware. If analyst projections of two-thirds x86 server CPU market capture materialize [28], NVIDIA's CPU entry would be the most disruptive shift in the datacenter processor market in a decade, with AMD now publicly contesting the framing [30] and EnerTuition arguing NVIDIA may be losing its GPU lead entirely [31].

Open questions

  • Multiple reports indicate Micron has been excluded from NVIDIA's Vera Rubin HBM4 supply chain in favor of Samsung and SK Hynix only [16][17], while other analyses describe all three suppliers competing for Vera Rubin allocation [35][18] — is Micron's status a qualification failure, a temporary condition, or an artifact of conflicting sourcing [19]?

  • Memory costs for the Vera Rubin rack have surged 435%, putting HBM4 and LPDDR5x at $2M of a $7.8M total rack cost [14], with the shortage projected through 2028 [23] — do NVIDIA's headline 10x cost-per-token claims hold at real-world rack economics, and will this affect Nebius's H2 2026 commercial rollout pricing [12]?

  • An analyst projects NVIDIA will ship 4 million Vera CPUs in FY2027 and capture two-thirds of the x86 server CPU market [28]; AMD has published a direct agentic AI counter-narrative favoring EPYC [30] and EnerTuition argues NVIDIA may lose its GPU lead for multiple generations [31] — what specific architectural or market dynamics underlie each of these conflicting theses?

  • All headline performance claims — 10x cost-per-token reduction, 50% faster agentic workloads — still originate from NVIDIA's own materials [33][34]; when will independent benchmark validation of production Vera Rubin hardware appear?

Narrative

NVIDIA's Vera CPU and Vera Rubin NVL72 represent a dual-front expansion into agentic AI infrastructure: a purpose-built agentic processor entering the datacenter CPU market alongside a flagship AI inference platform. The Vera CPU is an 88-core design with 1.2 TB/s of memory bandwidth, deployed in 256-chip liquid-cooled rack configurations that NVIDIA claims deliver up to 6x CPU throughput over prior generations [1]. It was hand-delivered to OpenAI, Anthropic, and other leading AI labs on May 18, 2026 [2]. The paired Vera Rubin NVL72 — 336 billion transistors, 50 petaflops of AI performance [3] — was announced as being in full production at Jensen Huang's CES 2026 keynote [4][5] and won the Computex 2026 Best Choice Golden Award [6][7]. NVIDIA's Q1 2026 financial results — $81.6B in revenue, up 85% year-over-year [8][9] — validate that AI infrastructure demand is tracking the trajectory Huang describes as 'parabolic.' Meta has been confirmed as a Vera Rubin customer through a broad NVIDIA-Meta partnership [10], and cloud provider Nebius has formalized a full-stack AI cloud partnership via SEC filing, with Vera Rubin NVL72 deployment planned for the US and Europe beginning H2 2026 [11][12]. NVIDIA GTC Taipei at COMPUTEX is scheduled for June 1–5, 2026 [13], providing the next major public venue for additional platform announcements.

The dominant complication is a memory supply crisis that is now measurable in cost and allocation terms. A 435% surge in HBM4 and LPDDR5x prices has pushed the per-rack memory bill for a Vera Rubin system to approximately $2M out of a $7.8M total rack cost [14]. The HBM4 supply chain structure has crystallized around two dominant suppliers: SK Hynix holds roughly 70% of NVIDIA's HBM4 orders [15], and multiple reports indicate NVIDIA has designated Samsung and SK Hynix as Vera Rubin's HBM4 suppliers, with Micron excluded from this generation [16][17]. This picture is contested, however — separate TechPowerUp forum coverage and other analyses describe all three major HBM suppliers competing for Vera Rubin allocation, with HBM4 customer validation expected during Q2 2026 [18], and Micron itself has announced an early HBM4 ramp [19]. What is uncontested: Samsung has sold out its entire 2026 HBM4 supply [20], HBM4 mass production was delayed into late Q1 2026 due to spec upgrades and strategy adjustments [21][22], and TradingKey projects HBM shortage conditions persisting until 2028 [23]. Enterprise data center planning advice has shifted accordingly, with Arc Compute advising customers to prepare for an extended 'HBM crunch' as they migrate from Blackwell to Rubin architecture [24]. A broader chipmaking supply chain analysis found that 'nobody's scaling up' fabrication capacity, suggesting the constraint is structural [25].

The CPU market competition is escalating from narrative positioning into specific commercial projections. NVIDIA has formally offered the Vera CPU as a standalone competitor to Intel Xeon and AMD EPYC [26][27], and an analyst forecast published by Tom's Hardware projects NVIDIA is already on track to ship 4 million Vera CPUs in FY2027 and capture approximately two-thirds of the x86 server CPU market, representing roughly $20 billion in revenue [28]. TrendForce analysis provides structural context: agentic AI workloads require substantially more CPU capacity per GPU than traditional inference, creating a shift in datacenter CPU demand that plays to NVIDIA's positioning [29]. AMD has published a direct corporate counter-narrative, arguing in a company blog that agentic AI changes the CPU/GPU equation in ways that favor its EPYC architecture [30]. EnerTuition, which previously characterized the Vera versus EPYC contest as zero-sum, escalated its contrarian position with a piece arguing that NVIDIA may be 'on the verge of losing its GPU lead for a couple of generations' [31] — a thesis that stands in direct tension with the analyst CPU market projections and NVIDIA's earnings trajectory, and which has not yet been independently corroborated. The NVIDIA-Meta Vera Rubin deployment has separately prompted analysis of whether hyperscaler adoption at scale could trigger a broader CPU supercycle [32].

Across all fronts, the gap between NVIDIA's promotional claims and independently verified hardware performance remains open. All headline figures — 10x cost-per-token reduction, 50% performance advantage over comparable x86 CPUs on agentic workloads, 35x throughput per watt with Groq 3 LPX — originate exclusively from NVIDIA's own materials [33][34]. The 435% memory price surge and the projected multi-year HBM shortage create real-world rack economics materially different from launch specifications. No independent, reproducible benchmark from production Vera Rubin hardware has yet appeared.

Timeline

  • 2026-01-05: NVIDIA debuts Rubin chip at CES: 336 billion transistors, 50 petaflops AI performance [3]
  • 2026-01: Jensen Huang announces at CES 2026 keynote that Vera Rubin NVL72 is in full production [4][5]
  • 2026-01: HBM4 mass production reported as delayed to end of Q1 2026 due to spec upgrades and NVIDIA strategy adjustments [21][22]
  • 2026-02: SK Hynix begins early mass production of HBM4 and sets up shipments to NVIDIA for Vera Rubin; HBM4 supply competition identified as dominated by SK Hynix (~70% of NVIDIA orders) and Samsung [47][48][15][35]
  • 2026-05-18: NVIDIA hand-delivers first Vera CPUs (88-core, 1.2 TB/s bandwidth, 6x throughput in 256-chip liquid-cooled rack) to OpenAI, Anthropic, and other leading AI labs [36][2][1]
  • 2026-05-18: Jensen Huang keynotes at Dell Technologies World: announces Vera Rubin NVL72 specs, projects $3–4 trillion AI infrastructure buildout by 2030, endorses Dell with 'Buy Dell' statement, and flags memory supply chain constraint [33][39][55][37][38]
  • 2026-05-20: Jensen Huang signs Dell PowerRack server on stage at Dell Technologies World [44][56]
  • 2026-05-21: NVIDIA reports Q1 2026 earnings: $81.6B revenue, up 85% year-over-year [8][9]
  • 2026-05-21: NVIDIA GTC Taipei at COMPUTEX: Vera Rubin NVL72, Jetson Thor, and Alpamayo autonomous driving platform detailed; Vera Rubin NVL72 wins Computex 2026 Best Choice Golden Award [34][6][7]
  • 2026-05-21: NVIDIA and Meta announce broad partnership; Meta confirmed as Vera Rubin customer [10][32]
  • 2026-05: Samsung sells out its entire 2026 HBM4 supply; memory costs for Vera Rubin rack surge 435%, with HBM4 and LPDDR5x representing $2M of $7.8M total rack cost [20][14]
  • 2026-05: Analyst projects NVIDIA on track to ship 4 million Vera CPUs in FY2027, potentially capturing two-thirds of x86 server CPU market (~$20B revenue) [28]
  • 2026-05: Nebius formalizes full-stack AI cloud partnership with NVIDIA via SEC filing; plans Vera Rubin NVL72 deployment in US and Europe from H2 2026 [11][12]
  • 2026-05: Reports emerge that Rubin GPU mass production targets have been lowered, attributed to memory supply chain constraints [57][58][59]
  • 2026-06-01: NVIDIA GTC Taipei at COMPUTEX 2026 scheduled (June 1–5) [13]

Perspectives

NVIDIA / Jensen Huang

Maximally bullish: agentic AI demand is 'parabolic,' the Vera CPU and Vera Rubin NVL72 are generational leaps in inference economics, and Q1 2026 earnings ($81.6B, +85% YoY) validate the demand trajectory. Huang acknowledged that memory supply chains cannot keep pace with demand. Meta's confirmation as a Vera Rubin customer and the Nebius SEC filing deepen commercial validation. NVIDIA has not publicly addressed third-party reports that Rubin mass production targets have been lowered.

Evolution: Consistent bullish framing, now reinforced by the strongest earnings in company history, a confirmed hyperscaler customer (Meta), and a formalized cloud provider partnership (Nebius SEC filing).

Michael Dell / Dell Technologies

Aligned with NVIDIA's agentic AI vision; Dell AI Factory positioned as the primary enterprise on-premises channel for Vera Rubin NVL72. Co-presented with Jensen Huang at Dell Technologies World.

Evolution: Consistent endorsement, visibly deepened by Huang's public 'Buy Dell' statement and the hardware-signing moment on stage.

Nebius

Committed to deploying Vera Rubin NVL72 commercially in the US and Europe from H2 2026, with the partnership now formalized via SEC filing as a full-stack AI cloud arrangement with NVIDIA.

Evolution: The SEC filing materializes the partnership into a formal document beyond a press commitment, strengthening the commercial credibility of the H2 2026 timeline.

Meta

Confirmed as a Vera Rubin customer through a broad NVIDIA-Meta partnership announced in May 2026, signaling hyperscaler-level adoption of the platform at scale.

Evolution: First appearance as a named customer voice; represents a significant expansion of Vera Rubin's customer roster beyond cloud providers to the largest hyperscalers.

Memory and supply chain analysts (SK Hynix reporting, TrendForce, TradingKey, Arc Compute, multiple)

Converge on HBM shortage as the binding structural constraint on Vera Rubin production. SK Hynix holds approximately 70% of NVIDIA's HBM4 orders. Samsung has sold out its 2026 HBM4 supply. Memory prices for the rack have surged 435%. The shortage is projected to persist until 2028. Whether Micron is fully excluded or in a qualifying position is contested across sources.

Evolution: Substantially more specific than prior passes: supplier allocation percentages, per-rack price impacts, and a 2028 shortage horizon are now quantified across multiple third-party sources, moving well beyond CEO-level supply caveats.

EnerTuition

Escalated contrarian position: argues not only that the Vera CPU vs. AMD EPYC contest is zero-sum, but that NVIDIA may be 'on the verge of losing its GPU lead for a couple of generations' — a significantly broader bearish thesis that extends beyond CPU market competition to NVIDIA's core GPU franchise.

Evolution: Previously characterized the CPU contest as zero-sum; now extends the bearish thesis to NVIDIA's GPU leadership itself — an escalation with no independent corroboration yet published.

CPU market analysts (Tom's Hardware / industry forecasters)

Project NVIDIA is already on track to ship 4 million Vera CPUs in FY2027 and capture approximately two-thirds of the x86 server CPU market (~$20B revenue) — framing NVIDIA's CPU entry as a historic disruption of Intel and AMD's datacenter franchise.

Evolution: Significantly more concrete than prior synthesis, which noted CPU competitive positioning without specific shipment or market share projections.

AMD

Published a corporate blog arguing that agentic AI changes the CPU/GPU equation in ways favorable to EPYC architecture, positioning AMD as a natural beneficiary of the agentic AI transition rather than a passive defender against NVIDIA's entry.

Evolution: First appearance as a direct named voice in this thread; previously AMD was discussed only in the framing of other analysts. AMD's public counter-narrative signals the company is actively contesting the agentic AI CPU story.

TrendForce

Agentic AI is structurally reshaping the CPU-to-GPU ratio in datacenter deployments, with agentic workloads requiring substantially more CPU capacity per GPU than traditional inference — providing an independent market research framing for the demand shift NVIDIA's Vera CPU is targeting.

Evolution: First substantive appearance in this thread as an independent market research voice on the CPU demand structural shift.

Niraj Yagnik (market observer)

Notes CPU supply competition: two AWS customers attempted to lock up all of Graviton's 2026 CPU production capacity as NVIDIA entered the server CPU market — suggesting Vera CPU is already reshaping enterprise procurement decisions before broad availability.

Evolution: Consistent with prior synthesis; no new development from this voice.

Tensions

  • NVIDIA markets Vera Rubin on a 10x cost-per-token reduction and 50% agentic workload advantage [33][34] — claims originating exclusively from NVIDIA's promotional materials — while real-world rack economics show a 435% surge in memory costs, with HBM4 and LPDDR5x representing $2M of a $7.8M total rack cost [14]. The gap between headline efficiency claims and current hardware pricing remains unresolved by any independent benchmark. [33][34][14]
  • Analyst projections place NVIDIA on track for 4 million Vera CPU shipments in FY2027 and two-thirds x86 server market capture [28], while EnerTuition argues NVIDIA may be 'on the verge of losing its GPU lead for a couple of generations' [31] — two assessments of NVIDIA's hardware franchise trajectory that are irreconcilable, with no independent corroboration yet for the bearish GPU thesis. [31][28][8][9]
  • Multiple reports indicate Micron has been excluded from NVIDIA's Vera Rubin HBM4 supply chain in favor of Samsung and SK Hynix only [16][17], while other analysis describes all three suppliers competing for Vera Rubin allocation with HBM4 validation expected in Q2 2026 [18][49] — Micron's actual qualification status for Vera Rubin is contested across sources. [16][17][18][49][19]
  • NVIDIA's CPU market projections target two-thirds of x86 server CPU share with 4 million units in FY2027 [28], while AMD has published a direct counter-narrative claiming agentic AI favors EPYC architecture [30] — the two positions frame the same technological shift as advantaging opposite architectures, with enterprise procurement decisions over the next 12–18 months as the decisive test. [28][30][29]

Sources

  1. [1] Nvidia unveils details of new 88-core Vera CPUs positioned to compete with AMD and Intel – new Vera CPU rack features 256 liquid-cooled chips that deliver up to a 6X gain in CPU throughput | Tom's Hardware — reactive:nvidia-vera-computex-launch
  2. [2] NVIDIA hand-delivers first 1.2 TB/s Vera CPUs to OpenAI, Anthropic ... — reactive:nvidia-vera-computex-launch
  3. [3] Nvidia debuts Rubin chip with 336B transistors and 50 petaflops of AI performance - SiliconANGLE — reactive:nvidia-vera-computex-launch
  4. [4] Nvidia CEO confirms Vera Rubin NVL72 is now in production — reactive:nvidia-vera-computex-launch
  5. [5] NVIDIA Vera Rubin AI Platform Hits Full Production CES 2026 ... — reactive:nvidia-vera-computex-launch
  6. [6] NVIDIA Vera Rubin NVL72 wins Computex 2026 awards for AI ... — reactive:nvidia-vera-computex-launch
  7. [7] 2026 Best Choice Award-Golden Award: NVIDIA Vera Rubin NVL72 - The Peak of AI Supercomputing — reactive:nvidia-vera-computex-launch
  8. [8] NVIDIA just dropped $81.6B in Q1 revenue up 85% YoY 🤯 — reactive:nvidia-vera-computex-launch (2026-05-21)
  9. [9] "Demand has gone parabolic. The reason is simple: Agentic AI has arrived." — reactive:nvidia-vera-computex-launch (2026-05-21)
  10. [10] As part of a broad partnership announced today, Nvidia says Meta ... — reactive:nvidia-vera-computex-launch
  11. [11] NVIDIA and Nebius Partner to Scale Full-Stack AI Cloud - SEC.gov — reactive:nvidia-vera-computex-launch
  12. [12] Nebius to offer NVIDIA Vera Rubin NVL72 in US and Europe from H2 2026 | Corporate - EQS News — reactive:nvidia-vera-computex-launch
  13. [13] NVIDIA GTC Taipei at COMPUTEX 2026 | June 1-5 — reactive:nvidia-vera-computex-launch
  14. [14] NVIDIA's Vera Rubin Rack Hit With 435% Memory Price Surge ... — reactive:nvidia-vera-computex-launch
  15. [15] SK Hynix Secures 70% of Nvidia's HBM4 Orders - Semicon — reactive:nvidia-vera-computex-launch
  16. [16] NVIDIA to Use SK hynix and Samsung HBM4 for "Vera ... — reactive:nvidia-vera-computex-launch
  17. [17] NVIDIA's Vera Rubin to Use Only Samsung and SK Hynix HBM4 ... — reactive:nvidia-vera-computex-launch
  18. [18] HBM4 Validation Expected in 2Q26; Three Major Suppliers Poised ... — reactive:nvidia-vera-computex-launch
  19. [19] Micron’s Early HBM4 Ramp Tests Durability Of AI Memory Boom — reactive:nvidia-vera-computex-launch
  20. [20] Samsung sells out of 2026 HBM4 supply as memory resurgence ... — reactive:aws-garman-a100-demand
  21. [21] HBM4 Mass Production Delayed to End of 1Q26 By Spec Upgrades ... — reactive:nvidia-vera-computex-launch
  22. [22] HBM4 Mass Production Delayed According to TrendForce ... — reactive:nvidia-vera-computex-launch
  23. [23] SK Hynix Surges 15% to New High: HBM Shortage Until 2028, How Much Longer Can AI Memory King Rise? — reactive:nvidia-vera-computex-launch
  24. [24] Beyond Blackwell: Preparing Enterprise Data Centers for the NVIDIA ... — reactive:nvidia-vera-computex-launch
  25. [25] A deeper look at the tightened chipmaking supply chain, and where it may be headed in 2026 — "nobody's scaling up,” says analyst as industry remains conservative on capacity — reactive:nvidia-vera-computex-launch
  26. [26] NVIDIA Offers "Vera" CPU as a Standalone Competitor to Intel's Xeon and AMD's EPYC Processors | TechPowerUp — reactive:nvidia-vera-computex-launch
  27. [27] NVIDIA's new Vera CPU will be a competitor to AMD EPYC and Intel Xeon CPUs — reactive:nvidia-vera-computex-launch
  28. [28] 'Nvidia is already on track' to deliver 4 million Vera CPUs in FY2027 — reactive:nvidia-vera-computex-launch
  29. [29] The Great Rebalance: How Agentic AI Is Reshaping the CPU:GPU Ratio — reactive:aws-garman-a100-demand
  30. [30] Agentic AI Changes the CPU/GPU Equation - AMD — reactive:agentic-compute-cpu-gpu
  31. [31] Nvidia On The Verge Of Losing GPU Lead For A Couple Of Generations — reactive:nvidia-vera-computex-launch
  32. [32] Will NVIDIA's Meta Deal Ignite a CPU Supercycle? - Futurum — reactive:nvidia-vera-computex-launch
  33. [33] NVIDIA CEO Jensen Huang at Dell Technologies World: ‘Demand Is Going Parabolic, Utterly Parabolic’ — NVIDIA Blog (2026-05-18)
  34. [34] NVIDIA GTC Taipei at COMPUTEX: Live Updates on What’s Next in AI — NVIDIA Blog (2026-05-21)
  35. [35] NVIDIA HBM4 Supply Becomes Three-Way Race — reactive:nvidia-vera-computex-launch
  36. [36] Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs — NVIDIA Blog (2026-05-18)
  37. [37] Jensen Huang today: Memory demand >> supply chain capacity. “Supply chain needs to be ready.” AI memory supercycle... — reactive:nvidia-vera-computex-launch (2026-05-18)
  38. [38] 🚨 Jensen Huang on Memory Today: — reactive:nvidia-vera-computex-launch (2026-05-18)
  39. [39] Jensen Huang Says “Buy Dell” | Dell Tech World 2026 | BuilderBase — reactive:nvidia-vera-computex-launch
  40. [40] NVIDIA Kicks Off the Next Generation of AI With Rubin — Six New ... — reactive:nvidia-vera-computex-launch
  41. [41] Michael Dell, Jensen Huang: Boldest Statements From Dell Technologies World 2026 — reactive:nvidia-vera-computex-launch
  42. [42] “Now we have, for the very first time, useful AI” – Jensen Huang and Michael Dell talk up the power of agentic AI at Dell Technologies World 2026 | IT Pro — reactive:nvidia-vera-computex-launch
  43. [43] Featured Sessions | Dell Technologies World 2026 | Dell USA — reactive:nvidia-vera-computex-launch
  44. [44] Jensen Huang showed up at Dell Technologies World 2026 and signed a PowerRack server on stage. — reactive:nvidia-vera-computex-launch (2026-05-20)
  45. [45] Dell Technologies World 2026 — reactive:nvidia-vera-computex-launch
  46. [46] Samsung and SK Hynix Trigger Mass Production for Next-Gen AI — reactive:nvidia-vera-computex-launch
  47. [47] SK Hynix set to ship HBM4 for Nvidia's Vera Rubin this month — reactive:nvidia-vera-computex-launch
  48. [48] SK Hynix to begin early mass production of HBM4 ... — reactive:nvidia-vera-computex-launch
  49. [49] Samsung and Micron confirm HBM4 enters mass ... — reactive:nvidia-vera-computex-launch
  50. [50] NVIDIA's HBM4 Supply Chain Rush: Samsung, SK hynix, Micron ... — reactive:nvidia-vera-computex-launch
  51. [51] Nvidia Vera Vs AMD EPYC: Only One Is Going To Succeed — reactive:nvidia-vera-computex-launch
  52. [52] NVIDIA Offers "Vera" CPU as a Standalone Competitor to Intel's ... — reactive:nvidia-vera-computex-launch
  53. [53] NVIDIA Offers Vera CPU as a Standalone Competitor to Intels Xeon ... — reactive:nvidia-vera-computex-launch
  54. [54] 4/ the CPU story is well documented now. two AWS customers tried to buy all of graviton's 2026 capacity. nvidia launched... — reactive:nvidia-vera-computex-launch (2026-05-18)
  55. [55] NVIDIA AI - Jensen Huang Says “Buy Dell” - LinkedIn — reactive:nvidia-vera-computex-launch
  56. [56] Nvidia CEO Jensen Huang signed Dell’s PowerRack server at Dell Technologies World 2026, turning a light moment on the ev... — reactive:nvidia-vera-computex-launch (2026-05-19)
  57. [57] Nvidia's Rubin GPU Mass Production Target Reportedly Lowered ... — reactive:nvidia-vera-computex-launch
  58. [58] Nvidia's AI Chip Production Delayed by Memory Supply Chain ... — reactive:nvidia-vera-computex-launch
  59. [59] The Rubin Protocol : Supply Chain, Bottlenecks, and the ... - FPX AI — reactive:nvidia-vera-computex-launch