NVIDIA Launches Vera CPU and Vera Rubin NVL72 at COMPUTEX / GTC Taipei · history

Version 4

2026-05-24 10:11 UTC · 135 items

Changes since v3

The dominant new developments are the crystallization of HBM4 supply chain specifics — SK Hynix securing ~70% of NVIDIA's orders [^13384], Samsung selling out its entire 2026 HBM4 supply [^12210], a 435% memory price surge pushing per-rack memory costs to $2M of a $7.8M total [^16713], and a projected shortage through 2028 [^13400] — moving from CEO-level caveats to quantified third-party data. A second significant shift is the emergence of a specific analyst forecast of 4 million Vera CPU units and two-thirds x86 server market share in FY2027 [^13397], alongside AMD's first direct corporate counter-narrative [^16115] and EnerTuition's escalation from zero-sum CPU framing to a broader thesis that NVIDIA may lose its GPU lead entirely [^13393]. Meta is newly confirmed as a Vera Rubin customer [^17107] and Nebius's commitment is formalized via SEC filing [^16669].

What

NVIDIA's Vera CPU (88-core, 1.2 TB/s bandwidth [1]) and Vera Rubin NVL72 are entering commercial deployment — Vera CPUs were hand-delivered to OpenAI and Anthropic on May 18 [2], Meta confirmed as a Vera Rubin customer [10], and Nebius formalized a full-stack AI cloud partnership via SEC filing [11] — but the memory supply crisis has become measurable: Samsung has sold out its entire 2026 HBM4 supply [20], SK Hynix holds roughly 70% of NVIDIA's HBM4 orders [15], Micron's inclusion in Vera Rubin's HBM4 supply chain is contested across sources [16][17][18], and a 435% surge in memory costs pushes the per-rack memory bill to $2M out of a $7.8M total [14]. NVIDIA's Q1 2026 results — $81.6B in revenue, up 85% year-over-year [8][9] — confirm parabolic demand, while a new analyst forecast projects NVIDIA will ship 4 million Vera CPUs in FY2027 and capture two-thirds of the x86 server CPU market [28].

Why it matters

The HBM4 shortage is now projected to persist until 2028 [23], transforming what was a near-term supply caveat into a multi-year structural constraint on AI infrastructure expansion — and the 435% memory price surge [14] directly challenges NVIDIA's headline cost-per-token claims, which remain unverified by independent benchmarks from production hardware. If analyst projections of two-thirds x86 server CPU market capture materialize [28], NVIDIA's CPU entry would be the most disruptive shift in the datacenter processor market in a decade, with AMD now publicly contesting the framing [30] and EnerTuition arguing NVIDIA may be losing its GPU lead entirely [31].

Open questions

Multiple reports indicate Micron has been excluded from NVIDIA's Vera Rubin HBM4 supply chain in favor of Samsung and SK Hynix only [16][17], while other analyses describe all three suppliers competing for Vera Rubin allocation [35][18] — is Micron's status a qualification failure, a temporary condition, or an artifact of conflicting sourcing [19]?
Memory costs for the Vera Rubin rack have surged 435%, putting HBM4 and LPDDR5x at $2M of a $7.8M total rack cost [14], with the shortage projected through 2028 [23] — do NVIDIA's headline 10x cost-per-token claims hold at real-world rack economics, and will this affect Nebius's H2 2026 commercial rollout pricing [12]?
An analyst projects NVIDIA will ship 4 million Vera CPUs in FY2027 and capture two-thirds of the x86 server CPU market [28]; AMD has published a direct agentic AI counter-narrative favoring EPYC [30] and EnerTuition argues NVIDIA may lose its GPU lead for multiple generations [31] — what specific architectural or market dynamics underlie each of these conflicting theses?
All headline performance claims — 10x cost-per-token reduction, 50% faster agentic workloads — still originate from NVIDIA's own materials [33][34]; when will independent benchmark validation of production Vera Rubin hardware appear?

Narrative

NVIDIA's Vera CPU and Vera Rubin NVL72 represent a dual-front expansion into agentic AI infrastructure: a purpose-built agentic processor entering the datacenter CPU market alongside a flagship AI inference platform. The Vera CPU is an 88-core design with 1.2 TB/s of memory bandwidth, deployed in 256-chip liquid-cooled rack configurations that NVIDIA claims deliver up to 6x CPU throughput over prior generations [1]. It was hand-delivered to OpenAI, Anthropic, and other leading AI labs on May 18, 2026 [2]. The paired Vera Rubin NVL72 — 336 billion transistors, 50 petaflops of AI performance [3] — was announced as being in full production at Jensen Huang's CES 2026 keynote [4][5] and won the Computex 2026 Best Choice Golden Award [6][7]. NVIDIA's Q1 2026 financial results — $81.6B in revenue, up 85% year-over-year [8][9] — validate that AI infrastructure demand is tracking the trajectory Huang describes as 'parabolic.' Meta has been confirmed as a Vera Rubin customer through a broad NVIDIA-Meta partnership [10], and cloud provider Nebius has formalized a full-stack AI cloud partnership via SEC filing, with Vera Rubin NVL72 deployment planned for the US and Europe beginning H2 2026 [11][12]. NVIDIA GTC Taipei at COMPUTEX is scheduled for June 1–5, 2026 [13], providing the next major public venue for additional platform announcements.

The dominant complication is a memory supply crisis that is now measurable in cost and allocation terms. A 435% surge in HBM4 and LPDDR5x prices has pushed the per-rack memory bill for a Vera Rubin system to approximately $2M out of a $7.8M total rack cost [14]. The HBM4 supply chain structure has crystallized around two dominant suppliers: SK Hynix holds roughly 70% of NVIDIA's HBM4 orders [15], and multiple reports indicate NVIDIA has designated Samsung and SK Hynix as Vera Rubin's HBM4 suppliers, with Micron excluded from this generation [16][17]. This picture is contested, however — separate TechPowerUp forum coverage and other analyses describe all three major HBM suppliers competing for Vera Rubin allocation, with HBM4 customer validation expected during Q2 2026 [18], and Micron itself has announced an early HBM4 ramp [19]. What is uncontested: Samsung has sold out its entire 2026 HBM4 supply [20], HBM4 mass production was delayed into late Q1 2026 due to spec upgrades and strategy adjustments [21][22], and TradingKey projects HBM shortage conditions persisting until 2028 [23]. Enterprise data center planning advice has shifted accordingly, with Arc Compute advising customers to prepare for an extended 'HBM crunch' as they migrate from Blackwell to Rubin architecture [24]. A broader chipmaking supply chain analysis found that 'nobody's scaling up' fabrication capacity, suggesting the constraint is structural [25].

The CPU market competition is escalating from narrative positioning into specific commercial projections. NVIDIA has formally offered the Vera CPU as a standalone competitor to Intel Xeon and AMD EPYC [26][27], and an analyst forecast published by Tom's Hardware projects NVIDIA is already on track to ship 4 million Vera CPUs in FY2027 and capture approximately two-thirds of the x86 server CPU market, representing roughly $20 billion in revenue [28]. TrendForce analysis provides structural context: agentic AI workloads require substantially more CPU capacity per GPU than traditional inference, creating a shift in datacenter CPU demand that plays to NVIDIA's positioning [29]. AMD has published a direct corporate counter-narrative, arguing in a company blog that agentic AI changes the CPU/GPU equation in ways that favor its EPYC architecture [30]. EnerTuition, which previously characterized the Vera versus EPYC contest as zero-sum, escalated its contrarian position with a piece arguing that NVIDIA may be 'on the verge of losing its GPU lead for a couple of generations' [31] — a thesis that stands in direct tension with the analyst CPU market projections and NVIDIA's earnings trajectory, and which has not yet been independently corroborated. The NVIDIA-Meta Vera Rubin deployment has separately prompted analysis of whether hyperscaler adoption at scale could trigger a broader CPU supercycle [32].

Across all fronts, the gap between NVIDIA's promotional claims and independently verified hardware performance remains open. All headline figures — 10x cost-per-token reduction, 50% performance advantage over comparable x86 CPUs on agentic workloads, 35x throughput per watt with Groq 3 LPX — originate exclusively from NVIDIA's own materials [33][34]. The 435% memory price surge and the projected multi-year HBM shortage create real-world rack economics materially different from launch specifications. No independent, reproducible benchmark from production Vera Rubin hardware has yet appeared.

Timeline

2026-01-05: NVIDIA debuts Rubin chip at CES: 336 billion transistors, 50 petaflops AI performance [3]
2026-01: Jensen Huang announces at CES 2026 keynote that Vera Rubin NVL72 is in full production [4][5]
2026-01: HBM4 mass production reported as delayed to end of Q1 2026 due to spec upgrades and NVIDIA strategy adjustments [21][22]
2026-02: SK Hynix begins early mass production of HBM4 and sets up shipments to NVIDIA for Vera Rubin; HBM4 supply competition identified as dominated by SK Hynix (~70% of NVIDIA orders) and Samsung [47][48][15][35]
2026-05-18: NVIDIA hand-delivers first Vera CPUs (88-core, 1.2 TB/s bandwidth, 6x throughput in 256-chip liquid-cooled rack) to OpenAI, Anthropic, and other leading AI labs [36][2][1]
2026-05-18: Jensen Huang keynotes at Dell Technologies World: announces Vera Rubin NVL72 specs, projects $3–4 trillion AI infrastructure buildout by 2030, endorses Dell with 'Buy Dell' statement, and flags memory supply chain constraint [33][39][55][37][38]
2026-05-20: Jensen Huang signs Dell PowerRack server on stage at Dell Technologies World [44][56]
2026-05-21: NVIDIA reports Q1 2026 earnings: $81.6B revenue, up 85% year-over-year [8][9]
2026-05-21: NVIDIA GTC Taipei at COMPUTEX: Vera Rubin NVL72, Jetson Thor, and Alpamayo autonomous driving platform detailed; Vera Rubin NVL72 wins Computex 2026 Best Choice Golden Award [34][6][7]
2026-05-21: NVIDIA and Meta announce broad partnership; Meta confirmed as Vera Rubin customer [10][32]
2026-05: Samsung sells out its entire 2026 HBM4 supply; memory costs for Vera Rubin rack surge 435%, with HBM4 and LPDDR5x representing $2M of $7.8M total rack cost [20][14]
2026-05: Analyst projects NVIDIA on track to ship 4 million Vera CPUs in FY2027, potentially capturing two-thirds of x86 server CPU market (~$20B revenue) [28]
2026-05: Nebius formalizes full-stack AI cloud partnership with NVIDIA via SEC filing; plans Vera Rubin NVL72 deployment in US and Europe from H2 2026 [11][12]
2026-05: Reports emerge that Rubin GPU mass production targets have been lowered, attributed to memory supply chain constraints [57][58][59]
2026-06-01: NVIDIA GTC Taipei at COMPUTEX 2026 scheduled (June 1–5) [13]

Perspectives

NVIDIA / Jensen Huang

Maximally bullish: agentic AI demand is 'parabolic,' the Vera CPU and Vera Rubin NVL72 are generational leaps in inference economics, and Q1 2026 earnings ($81.6B, +85% YoY) validate the demand trajectory. Huang acknowledged that memory supply chains cannot keep pace with demand. Meta's confirmation as a Vera Rubin customer and the Nebius SEC filing deepen commercial validation. NVIDIA has not publicly addressed third-party reports that Rubin mass production targets have been lowered.

Evolution: Consistent bullish framing, now reinforced by the strongest earnings in company history, a confirmed hyperscaler customer (Meta), and a formalized cloud provider partnership (Nebius SEC filing).

[33][36][34][8][37][38][39][40][2][10][11]

Michael Dell / Dell Technologies

Aligned with NVIDIA's agentic AI vision; Dell AI Factory positioned as the primary enterprise on-premises channel for Vera Rubin NVL72. Co-presented with Jensen Huang at Dell Technologies World.

Evolution: Consistent endorsement, visibly deepened by Huang's public 'Buy Dell' statement and the hardware-signing moment on stage.

[41][42][43][44][45]

Nebius

Committed to deploying Vera Rubin NVL72 commercially in the US and Europe from H2 2026, with the partnership now formalized via SEC filing as a full-stack AI cloud arrangement with NVIDIA.

Evolution: The SEC filing materializes the partnership into a formal document beyond a press commitment, strengthening the commercial credibility of the H2 2026 timeline.

[12][11]

Tensions

NVIDIA markets Vera Rubin on a 10x cost-per-token reduction and 50% agentic workload advantage [33][34] — claims originating exclusively from NVIDIA's promotional materials — while real-world rack economics show a 435% surge in memory costs, with HBM4 and LPDDR5x representing $2M of a $7.8M total rack cost [14]. The gap between headline efficiency claims and current hardware pricing remains unresolved by any independent benchmark. [33][34][14]
Analyst projections place NVIDIA on track for 4 million Vera CPU shipments in FY2027 and two-thirds x86 server market capture [28], while EnerTuition argues NVIDIA may be 'on the verge of losing its GPU lead for a couple of generations' [31] — two assessments of NVIDIA's hardware franchise trajectory that are irreconcilable, with no independent corroboration yet for the bearish GPU thesis. [31][28][8][9]
Multiple reports indicate Micron has been excluded from NVIDIA's Vera Rubin HBM4 supply chain in favor of Samsung and SK Hynix only [16][17], while other analysis describes all three suppliers competing for Vera Rubin allocation with HBM4 validation expected in Q2 2026 [18][49] — Micron's actual qualification status for Vera Rubin is contested across sources. [16][17][18][49][19]
NVIDIA's CPU market projections target two-thirds of x86 server CPU share with 4 million units in FY2027 [28], while AMD has published a direct counter-narrative claiming agentic AI favors EPYC architecture [30] — the two positions frame the same technological shift as advantaging opposite architectures, with enterprise procurement decisions over the next 12–18 months as the decisive test. [28][30][29]

Sources

[1] Nvidia unveils details of new 88-core Vera CPUs positioned to compete with AMD and Intel – new Vera CPU rack features 256 liquid-cooled chips that deliver up to a 6X gain in CPU throughput | Tom's Hardware — reactive:nvidia-vera-computex-launch
[2] NVIDIA hand-delivers first 1.2 TB/s Vera CPUs to OpenAI, Anthropic ... — reactive:nvidia-vera-computex-launch
[3] Nvidia debuts Rubin chip with 336B transistors and 50 petaflops of AI performance - SiliconANGLE — reactive:nvidia-vera-computex-launch
[4] Nvidia CEO confirms Vera Rubin NVL72 is now in production — reactive:nvidia-vera-computex-launch
[5] NVIDIA Vera Rubin AI Platform Hits Full Production CES 2026 ... — reactive:nvidia-vera-computex-launch
[6] NVIDIA Vera Rubin NVL72 wins Computex 2026 awards for AI ... — reactive:nvidia-vera-computex-launch
[7] 2026 Best Choice Award-Golden Award: NVIDIA Vera Rubin NVL72 - The Peak of AI Supercomputing — reactive:nvidia-vera-computex-launch
[8] NVIDIA just dropped $81.6B in Q1 revenue up 85% YoY 🤯 — reactive:nvidia-vera-computex-launch (2026-05-21)
[9] "Demand has gone parabolic. The reason is simple: Agentic AI has arrived." — reactive:nvidia-vera-computex-launch (2026-05-21)
[10] As part of a broad partnership announced today, Nvidia says Meta ... — reactive:nvidia-vera-computex-launch
[11] NVIDIA and Nebius Partner to Scale Full-Stack AI Cloud - SEC.gov — reactive:nvidia-vera-computex-launch
[12] Nebius to offer NVIDIA Vera Rubin NVL72 in US and Europe from H2 2026 | Corporate - EQS News — reactive:nvidia-vera-computex-launch
[13] NVIDIA GTC Taipei at COMPUTEX 2026 | June 1-5 — reactive:nvidia-vera-computex-launch
[14] NVIDIA's Vera Rubin Rack Hit With 435% Memory Price Surge ... — reactive:nvidia-vera-computex-launch
[15] SK Hynix Secures 70% of Nvidia's HBM4 Orders - Semicon — reactive:nvidia-vera-computex-launch
[16] NVIDIA to Use SK hynix and Samsung HBM4 for "Vera ... — reactive:nvidia-vera-computex-launch
[17] NVIDIA's Vera Rubin to Use Only Samsung and SK Hynix HBM4 ... — reactive:nvidia-vera-computex-launch
[18] HBM4 Validation Expected in 2Q26; Three Major Suppliers Poised ... — reactive:nvidia-vera-computex-launch
[19] Micron’s Early HBM4 Ramp Tests Durability Of AI Memory Boom — reactive:nvidia-vera-computex-launch
[20] Samsung sells out of 2026 HBM4 supply as memory resurgence ... — reactive:aws-garman-a100-demand
[21] HBM4 Mass Production Delayed to End of 1Q26 By Spec Upgrades ... — reactive:nvidia-vera-computex-launch
[22] HBM4 Mass Production Delayed According to TrendForce ... — reactive:nvidia-vera-computex-launch
[23] SK Hynix Surges 15% to New High: HBM Shortage Until 2028, How Much Longer Can AI Memory King Rise? — reactive:nvidia-vera-computex-launch
[24] Beyond Blackwell: Preparing Enterprise Data Centers for the NVIDIA ... — reactive:nvidia-vera-computex-launch
[25] A deeper look at the tightened chipmaking supply chain, and where it may be headed in 2026 — "nobody's scaling up,” says analyst as industry remains conservative on capacity — reactive:nvidia-vera-computex-launch
[26] NVIDIA Offers "Vera" CPU as a Standalone Competitor to Intel's Xeon and AMD's EPYC Processors | TechPowerUp — reactive:nvidia-vera-computex-launch
[27] NVIDIA's new Vera CPU will be a competitor to AMD EPYC and Intel Xeon CPUs — reactive:nvidia-vera-computex-launch
[28] 'Nvidia is already on track' to deliver 4 million Vera CPUs in FY2027 — reactive:nvidia-vera-computex-launch
[29] The Great Rebalance: How Agentic AI Is Reshaping the CPU:GPU Ratio — reactive:aws-garman-a100-demand
[30] Agentic AI Changes the CPU/GPU Equation - AMD — reactive:agentic-compute-cpu-gpu
[31] Nvidia On The Verge Of Losing GPU Lead For A Couple Of Generations — reactive:nvidia-vera-computex-launch
[32] Will NVIDIA's Meta Deal Ignite a CPU Supercycle? - Futurum — reactive:nvidia-vera-computex-launch
[33] NVIDIA CEO Jensen Huang at Dell Technologies World: ‘Demand Is Going Parabolic, Utterly Parabolic’ — NVIDIA Blog (2026-05-18)
[34] NVIDIA GTC Taipei at COMPUTEX: Live Updates on What’s Next in AI — NVIDIA Blog (2026-05-21)
[35] NVIDIA HBM4 Supply Becomes Three-Way Race — reactive:nvidia-vera-computex-launch
[36] Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs — NVIDIA Blog (2026-05-18)
[37] Jensen Huang today: Memory demand >> supply chain capacity. “Supply chain needs to be ready.” AI memory supercycle... — reactive:nvidia-vera-computex-launch (2026-05-18)
[38] 🚨 Jensen Huang on Memory Today: — reactive:nvidia-vera-computex-launch (2026-05-18)
[39] Jensen Huang Says “Buy Dell” | Dell Tech World 2026 | BuilderBase — reactive:nvidia-vera-computex-launch
[40] NVIDIA Kicks Off the Next Generation of AI With Rubin — Six New ... — reactive:nvidia-vera-computex-launch
[41] Michael Dell, Jensen Huang: Boldest Statements From Dell Technologies World 2026 — reactive:nvidia-vera-computex-launch
[42] “Now we have, for the very first time, useful AI” – Jensen Huang and Michael Dell talk up the power of agentic AI at Dell Technologies World 2026 | IT Pro — reactive:nvidia-vera-computex-launch
[43] Featured Sessions | Dell Technologies World 2026 | Dell USA — reactive:nvidia-vera-computex-launch
[44] Jensen Huang showed up at Dell Technologies World 2026 and signed a PowerRack server on stage. — reactive:nvidia-vera-computex-launch (2026-05-20)
[45] Dell Technologies World 2026 — reactive:nvidia-vera-computex-launch
[46] Samsung and SK Hynix Trigger Mass Production for Next-Gen AI — reactive:nvidia-vera-computex-launch
[47] SK Hynix set to ship HBM4 for Nvidia's Vera Rubin this month — reactive:nvidia-vera-computex-launch
[48] SK Hynix to begin early mass production of HBM4 ... — reactive:nvidia-vera-computex-launch
[49] Samsung and Micron confirm HBM4 enters mass ... — reactive:nvidia-vera-computex-launch
[50] NVIDIA's HBM4 Supply Chain Rush: Samsung, SK hynix, Micron ... — reactive:nvidia-vera-computex-launch
[51] Nvidia Vera Vs AMD EPYC: Only One Is Going To Succeed — reactive:nvidia-vera-computex-launch
[52] NVIDIA Offers "Vera" CPU as a Standalone Competitor to Intel's ... — reactive:nvidia-vera-computex-launch
[53] NVIDIA Offers Vera CPU as a Standalone Competitor to Intels Xeon ... — reactive:nvidia-vera-computex-launch
[54] 4/ the CPU story is well documented now. two AWS customers tried to buy all of graviton's 2026 capacity. nvidia launched... — reactive:nvidia-vera-computex-launch (2026-05-18)
[55] NVIDIA AI - Jensen Huang Says “Buy Dell” - LinkedIn — reactive:nvidia-vera-computex-launch
[56] Nvidia CEO Jensen Huang signed Dell’s PowerRack server at Dell Technologies World 2026, turning a light moment on the ev... — reactive:nvidia-vera-computex-launch (2026-05-19)
[57] Nvidia's Rubin GPU Mass Production Target Reportedly Lowered ... — reactive:nvidia-vera-computex-launch
[58] Nvidia's AI Chip Production Delayed by Memory Supply Chain ... — reactive:nvidia-vera-computex-launch
[59] The Rubin Protocol : Supply Chain, Bottlenecks, and the ... - FPX AI — reactive:nvidia-vera-computex-launch