The Information Machine

NVIDIA Launches Vera CPU and Vera Rubin NVL72 at COMPUTEX / GTC Taipei

cooling · v13 · 2026-06-05 · 260 items

What's new in v13

One substantive new angle: Jensen Huang articulated a market thesis that agentic AI shifts CPUs from traffic-cop schedulers to active orchestration layers, framing Vera CPU as a $200B market opportunity [16]. This extends NVIDIA's narrative from GPU-centric infrastructure to a broader CPU-plus-GPU platform story. The remaining new items (additional NVLink Fusion coverage [?][?][?], further confirmation of the CoreWeave validation [33], and TheValueist COMPUTEX read-throughs [?][?]) are either confirmatory or lack extractable claims. NVLink Fusion background is retired from active search because the angle is now well grounded in official press releases from both NVIDIA and Marvell.

What

NVIDIA's Vera Rubin NVL72 cleared two independent rack validation milestones on consecutive days: Dell delivered the world's first fully validated NVL72 rack to CoreWeave on May 31 [1][3], and Jensen Huang announced at COMPUTEX on June 1 that Microsoft had completed bring-up of its first Rubin VR200 NVL72 rack with Foxconn as ODM [5]. Wafer-level mass production of Rubin has started, but rack-level mass production has not [5]. The two binding supply constraints are an HBM4 shortage projected to persist until 2028 [7][8] and the NVL72's 600kW per-rack power requirement, which mandates greenfield construction [9]. Jensen also reframed the Vera CPU's strategic role: in the agentic AI era, CPUs shift from traffic-cop schedulers to active orchestration layers, which he framed as a $200B market opportunity independent of the Rubin GPU ramp [16].

Why it matters

The gap between wafer-level production start and rack-level mass production is the concrete leading indicator for when announced commitment volumes from Microsoft, CoreWeave, and others can ship at scale. Jensen's agentic AI CPU argument, if it holds, means the Vera CPU carries its own demand driver rather than being bundled with Rubin GPU purchases.

Open questions

  • Rack-level mass production of Rubin had not started as of the COMPUTEX keynote [5] — when does it begin, and does the wafer-to-rack lag create delivery gaps against the large commitment volumes announced by Microsoft, CoreWeave, and others?

  • CoreWeave has passed single-rack L11 diagnostics and Microsoft has completed single-rack bring-up [3][5], but full-cluster L12 validation with scale-out networking has not been demonstrated by either [4]. When does the first L12 milestone occur?

  • NVIDIA's official NVLink Fusion materials describe it as enabling 'semi-custom AI infrastructure' [14], but the structural terms require third-party chips to adopt NVIDIA's NVLink interconnect — does this represent genuine architectural openness or a mechanism for retaining ecosystem control?

  • Micron's IR press release asserts high-volume HBM4 production for Vera Rubin [21], while multiple independent sources conclude NVIDIA designated only Samsung and SK Hynix as suppliers [22][23][24] — has Micron secured an actual production allocation?

Narrative

NVIDIA's Vera Rubin NVL72 completed two rack validation milestones through separate ODM chains in rapid succession. On May 31, Dell delivered the world's first fully validated Vera Rubin NVL72 rack to CoreWeave, with L11 diagnostics confirming that the rack's internal NVLink/IMEX scale-up domain is operational [1][2][3]. In NVIDIA's three-stage validation hierarchy, L10 certifies single-server firmware, L11 certifies a single rack's scale-up domain, and L12 certifies a full compute cluster with scale-out networking [4]. The following day, at COMPUTEX 2026, Jensen Huang announced that Microsoft had completed bring-up of its first Rubin VR200 NVL72 rack with Foxconn as ODM [5], putting a second independent validation chain in operation one day after the first. Jensen disclosed that wafer-level mass production of Rubin has started while rack-level mass production has not [5], a distinction that matters for assessing when the large announced commitment volumes can actually ship.
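The three-stage hierarchy described above can be sketched as an ordered enumeration. This is an illustrative model only: the class and function names are hypothetical, and the rule that a stage counts only when every lower stage has also passed is an assumption rather than something the sources state.

```python
from enum import IntEnum
from typing import Optional, Set


class Stage(IntEnum):
    """Validation stages as summarized in the text (names are illustrative)."""
    L10 = 10  # firmware/BIOS and OS work on a single server
    L11 = 11  # a single rack, i.e. one scale-up domain, works
    L12 = 12  # a full cluster with scale-out networking works


def highest_cleared(passed: Set[Stage]) -> Optional[Stage]:
    """Return the highest stage reached without skipping a lower one.

    Assumption: stages are cumulative, so a gap at a lower stage
    invalidates any higher stage recorded as passed.
    """
    cleared = None
    for stage in sorted(Stage):
        if stage not in passed:
            break
        cleared = stage
    return cleared


# The state the text reports for the first CoreWeave rack: L11 passed, L12 not yet.
rack = {Stage.L10, Stage.L11}
print(highest_cleared(rack).name)
```

Under this model, the open question about L12 amounts to asking when any operator's set of passed stages first includes `Stage.L12`.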

Two hardware constraints set binding ceilings on deployment pace. HBM4 supply is dominated by SK Hynix (approximately 70% of NVIDIA's orders) and Samsung, whose entire 2026 allocation has sold out; rack prices have reached as much as $8.8M, and the shortage is projected to last until 2028 [6][7][8]. The NVL72's 600kW per-rack power requirement is incompatible with most existing data center infrastructure, requiring greenfield construction rather than a retrofit [9][10]. Separately, NVIDIA holds approximately $674M in equity in Nscale [11], so the large deployments that both companies co-publicize are not arm's-length commercial transactions [12].

The ecosystem story around COMPUTEX extended across interconnect, software, and CPU architecture. Official press releases from NVIDIA and Marvell frame the $2B NVLink Fusion investment as enabling 'semi-custom AI infrastructure' by allowing hyperscalers to integrate Marvell's custom silicon into NVIDIA's NVLink interconnect fabric [13][14]. NVIDIA's NemoClaw blueprint for industrial AI agents, which provides a secure runtime, a model router, and NeMo customization libraries, was adopted by Cadence, Dassault Systèmes, Siemens, and Synopsys; according to NVIDIA, it compresses RTL verification and thermal simulation from weeks to hours [15]. Jensen also articulated a market thesis for the Vera CPU specifically: in the training and inference era, GPUs dominated compute while CPUs served only as schedulers, whereas agentic AI shifts CPUs into active orchestration roles, making CPU architecture newly relevant as its own demand driver worth $200B [16].

Critical assessments temper the deployment narrative. SemiAnalysis rated the COMPUTEX keynote 'F tier,' finding no new AI datacenter products and arguing that the Windows-on-NVIDIA-ARM transition is structurally unlike Apple's x86-to-M1 switch [17]. SemiAnalysis also documents that Rubin FP4/FP8 FLOPs scale approximately 3.5x over GB200 while FP16 gains are only ~1.6x and HBM capacity is flat [18], which makes the efficiency claim workload-specific rather than uniform. On the CPU side, Phoronix benchmarks published by NVIDIA show the Vera CPU with a 1.5x overall advantage over 128-core x86 and a 1.6x geometric mean improvement over NVIDIA's Grace CPU [19][20]. The Micron HBM4 question remains open: Micron's IR press release asserts high-volume HBM4 production for Vera Rubin [21], while multiple independent sources conclude NVIDIA designated only Samsung and SK Hynix as HBM4 suppliers [22][23][24].
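To see why the FLOPs figures above make the efficiency claim workload-specific, a back-of-the-envelope calculation helps. The sketch below is an illustration, not SemiAnalysis's method: it assumes a purely compute-bound workload, treats `low_precision_share` as the fraction of baseline (GB200) runtime spent in FP4/FP8 math, and ignores memory capacity and bandwidth entirely. The function name and parameters are hypothetical; only the ~3.5x and ~1.6x gains come from the text.

```python
def blended_speedup(low_precision_share: float,
                    low_precision_gain: float = 3.5,
                    fp16_gain: float = 1.6) -> float:
    """Overall speedup when each portion of the runtime speeds up by its own factor.

    low_precision_share: fraction of baseline runtime spent in FP4/FP8 (0..1).
    The remainder is assumed to run in FP16. Compute-bound only (assumption).
    """
    if not 0.0 <= low_precision_share <= 1.0:
        raise ValueError("low_precision_share must be between 0 and 1")
    # New runtime, normalized so that the baseline runtime is 1.0.
    new_time = (low_precision_share / low_precision_gain
                + (1.0 - low_precision_share) / fp16_gain)
    return 1.0 / new_time


for share in (0.0, 0.5, 1.0):
    print(f"{share:.0%} FP4/FP8 share -> {blended_speedup(share):.2f}x")
```

Under these assumptions the overall gain ranges from the FP16 figure (no low-precision work) up to the FP4/FP8 figure (all low-precision work), with mixed workloads falling in between.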

Timeline

  • 2026-01-05: NVIDIA debuts Rubin chip at CES: 336 billion transistors, 50 petaflops AI performance. [44]
  • 2026-01: Jensen Huang announces at CES 2026 that Vera Rubin NVL72 is in full production. [45][46]
  • 2026-02: SK Hynix begins HBM4 mass production shipments to NVIDIA, holding approximately 70% of NVIDIA's HBM4 orders. [47][6]
  • 2026-03-17: Nscale acquires 8GW Monarch Compute Campus in West Virginia; Microsoft signs 1.35GW LOI co-announced with NVIDIA and Caterpillar. [48][49][12][50]
  • 2026-05: Samsung sells out entire 2026 HBM4 supply; rack prices reach as much as $8.8M; HBM4 shortage projected to persist until 2028. [39][8][40][7]
  • 2026-05: NVL72's 600kW per-rack power requirement documented as incompatible with existing data centers, requiring greenfield construction. [9][10][41]
  • 2026-05: NVIDIA equity stake in Nscale confirmed at approximately $674M; Microsoft's Rubin GPU deployment via Nscale revised upward to 130,000 units. [43][11][29]
  • 2026-05-18: First Vera CPUs hand-delivered to OpenAI, Anthropic, and other leading AI labs. [51][52][53]
  • 2026-05-18: Jensen Huang keynotes Dell Technologies World: projects $3–4T AI infrastructure buildout by 2030 and flags memory supply as primary bottleneck. [25][54][55]
  • 2026-05-21: NVIDIA reports Q1 2026 earnings: $81.6B revenue, up 85% year-over-year. [26][56]
  • 2026-05-21: NVIDIA GTC Taipei: Vera Rubin NVL72 wins Computex Best Choice Golden Award; Meta, Google Cloud, and Microsoft formalize partnerships; NVIDIA announces $2B NVLink Fusion investment in Marvell. [57][58][59][60][28][27]
  • 2026-05-26: Phoronix benchmarks of Vera CPU published: 1.5x overall x86 advantage, 1.6x geometric mean over Grace CPU, 90% peak bandwidth utilization. [19][20]
  • 2026-05-31: Dell delivers world's first fully validated Vera Rubin NVL72 rack to CoreWeave; L11 diagnostics confirm scale-up domain operational. [1][2][3][4][30][31][32]
  • 2026-06-01: Jensen Huang at COMPUTEX announces Microsoft completed bring-up of its first Rubin VR200 NVL72 rack via Foxconn; wafer-level production started, rack-level mass production not yet begun. [5]
  • 2026-06-01: SemiAnalysis rates Jensen's COMPUTEX keynote 'F tier': no new AI datacenter products announced; Windows-on-NVIDIA-ARM transition unlikely to succeed. [17]
  • 2026-06-01: Official NVIDIA and Marvell press releases frame NVLink Fusion as enabling 'semi-custom AI infrastructure' for third-party silicon integration. [13][14]
  • 2026-06-02: NVIDIA NemoClaw industrial AI agent blueprint announced with adoptions from Cadence, Siemens, Dassault Systèmes, and Synopsys. [15]
  • 2026-06-04: Jensen Huang identifies agentic AI as a $200B market opportunity, arguing CPUs shift from traffic-cop schedulers to active orchestration layers in the agentic era, giving Vera CPU an independent demand driver. [16]

Perspectives

NVIDIA / Jensen Huang

Q1 2026 earnings ($81.6B, +85% YoY) validate AI demand; COMPUTEX positions NVIDIA as platform provider for training, inference, and agentic workloads; $2B NVLink Fusion and NemoClaw extend the platform to third-party silicon and industrial software; Jensen frames agentic AI as a $200B market where Vera CPU gains architectural importance as an orchestration layer [16].

Evolution: Expanded. The agentic AI CPU thesis [16] adds a demand driver for Vera CPU independent of Rubin GPU bundling, extending Jensen's narrative from GPU-centric infrastructure to a CPU-plus-GPU platform story.

SemiAnalysis

Rubin FP4/FP8 FLOPs scale ~3.5x over GB200 while FP16 gains are only ~1.6x and HBM capacity is flat — headline efficiency is workload-specific. COMPUTEX keynote rated F tier for delivering no new AI datacenter products and an ARM transition framing unlikely to replicate Apple's success.

Evolution: Consistent; remains the primary independent critical voice against NVIDIA's deployment momentum narrative.

Microsoft / Foxconn

First hyperscaler to complete Rubin VR200 NVL72 bring-up, with Foxconn as ODM partner; also anchor customer for Vera Rubin via Nscale (130,000 GPUs at Start Campus Portugal, 1.35GW LOI for West Virginia).

Evolution: Consistent; confirmed as the second actor pair to complete NVL72 bring-up, one day after Dell and CoreWeave.

Dell / CoreWeave

First-mover operators: Dell delivered the world's first fully validated Vera Rubin NVL72 rack to CoreWeave on May 31, clearing L11 diagnostics with the scale-up domain confirmed operational.

Evolution: Consistent; milestone confirmed by multiple independent outlets without new technical detail.

Marvell / NVLink Fusion partners

Official Marvell press release frames NVLink Fusion as enabling customers to build 'semi-custom AI infrastructure' by integrating Marvell custom silicon with NVIDIA's NVLink interconnect fabric.

Evolution: Consistent; Marvell's own press release provides direct sourcing for the partnership framing, independent of Jensen's COMPUTEX statements.

Micron

Officially asserts high-volume HBM4 production specifically designed for NVIDIA Vera Rubin, directly contradicting industry reports concluding NVIDIA designated only Samsung and SK Hynix as HBM4 suppliers.

Evolution: Consistent; the contradiction with multiple independent sources remains unresolved.

Memory and supply chain analysts

The HBM4 shortage is the binding structural constraint: SK Hynix holds ~70% of NVIDIA's orders, Samsung has sold out its 2026 supply, rack prices have reached as much as $8.8M, and the shortage is projected to last until 2028.

Evolution: Consistent; the fact that rack-level mass production has not yet started adds a further constraint while large-scale commitments remain outstanding.

Data center infrastructure analysts

Vera Rubin NVL72's 600kW per-rack power requirement is a fundamental incompatibility with existing data center infrastructure, establishing greenfield construction as a second binding structural bottleneck alongside HBM4 supply.

Evolution: Consistent.

Tensions

  • Micron's official IR press release states high-volume HBM4 production for NVIDIA Vera Rubin [21], while TechPowerUp and multiple independent analyses conclude NVIDIA designated only Samsung and SK Hynix as HBM4 suppliers [22][23][24]; the two claims are mutually incompatible unless they refer to different allocation tiers. [21][22][23][24]
  • NVIDIA markets Vera Rubin on a 10x cost-per-token reduction, but SemiAnalysis documents FP4/FP8 gains of ~3.5x over GB200 while FP16 gains are only ~1.6x and HBM capacity is flat [18] — the efficiency claim is workload-specific, not uniform across training or high-precision inference. [18][8][9]
  • Official NVLink Fusion press materials frame the partnership as enabling 'semi-custom AI infrastructure' for third-party silicon [14], but the structural terms require those chips to adopt NVIDIA's NVLink interconnect — leaving unresolved whether this is genuine openness or an ecosystem control mechanism. [13][14][27]
  • NVIDIA holds approximately $674M in Nscale equity [11] while publicly describing Nscale only as a commercial partner, so the large deployments co-publicized by both companies are not arm's-length transactions [12][42]. [42][43][11][12]
  • Wafer-level mass production of Rubin has started but rack-level mass production has not yet begun [5], creating a gap between NVIDIA's production narrative and actual rack availability at a moment when large-scale deployment commitments from Microsoft and others are outstanding. [5]
  • SemiAnalysis rated Jensen's COMPUTEX keynote 'F tier' for announcing no new AI datacenter products [17], while NVIDIA's promotional coverage treats the same event as a major milestone featuring Microsoft's NVL72 bring-up, NVLink Fusion official launch, and NemoClaw ecosystem expansion [5][14][15]. [17][5][14][15]

Status: active and growing

Sources

  1. [1] BREAKING NEWS: COREWEAVE & DELL IS THE FIRST CLOUD TO ANNOUNCE THAT THEY HAVE RUBIN VR200 NVL72 WITH FULLY PASSING L… — SemiAnalysis Twitter (2026-05-31)
  2. [2] Dell just made history this weekend and it is the culmination of an execution streak that no other company in enterprise… — Milk Road AI Twitter (2026-05-31)
  3. [3] Notably, passing L11 diags means that this rack is up and running, including the IMEX channels on the NVL72 scale-up dom… — SemiAnalysis Twitter (2026-05-31)
  4. [4] At L10 your Firmware/BIOS and OS works on a single server, at L11 a single rack or scale-up domain works, and then at L1… — SemiAnalysis Twitter (2026-05-31)
  5. [5] BREAKING NEWS: JENSEN JUST ANNOUNCED MICROSOFT HAS FINISHED BRING UP ON THEIR FIRST RUBIN VR200 NVL72 RACK with their OD… — SemiAnalysis Twitter (2026-06-01)
  6. [6] SK Hynix Secures 70% of Nvidia's HBM4 Orders - Semicon — reactive:nvidia-vera-computex-launch
  7. [7] SK Hynix Surges 15% to New High: HBM Shortage Until 2028, How Much Longer Can AI Memory King Rise? — reactive:nvidia-vera-computex-launch
  8. [8] Nvidia's memory costs soar 485%, latest AI systems now cost $7.8 ... — reactive:nvidia-vera-computex-launch
  9. [9] The Data Center Isn't Ready. NVIDIA's Vera Rubin platform ships in… — reactive:nvidia-vera-computex-launch
  10. [10] NVIDIA Vera Rubin: 600kW Racks by 2027 | Introl Blog — reactive:nvidia-vera-computex-launch
  11. [11] UK AI Infrastructure Startup Nscale Receives $674 Million (£500 ... — reactive:nvidia-vera-computex-launch
  12. [12] Nscale acquires 8GW Monarch Compute Campus, Microsoft signs on for 1.35GW of compute - DCD — reactive:nvidia-vera-computex-launch
  13. [13] Marvell and NVIDIA to Provide Custom Solutions for Advanced AI Infrastructure — reactive:nvidia-vera-computex-launch
  14. [14] NVIDIA Unveils NVLink Fusion for Industry to Build Semi-Custom AI Infrastructure With NVIDIA Partner Ecosystem | NVIDIA Newsroom — reactive:nvidia-vera-computex-launch
  15. [15] Industrial Software Leaders Build Secure, Autonomous AI Engineers With NVIDIA NemoClaw — NVIDIA Blog (2026-06-02)
  16. [16] Jensen Huang just identified the next $200 billion market (Save this). — Milk Road AI Twitter (2026-06-04)
  17. [17] F TIER KEYNOTEMAX: Jensen ComputeX presentation was one of the worst keynotes he has done. He announced nothing new on t… — SemiAnalysis Twitter (2026-06-01)
  18. [18] for more details on Nvidia's VR NVL72 Oberon and future roadmap, check out our article from February: — SemiAnalysis Twitter (2026-05-31)
  19. [19] NVIDIA Vera CPU Is ‘Packing a Heavy-Hitting Punch’ Against Competition — NVIDIA Blog (2026-05-26)
  20. [20] NVIDIA published a report on Vera CPU benchmarks, done by Phoronix. — Rohan Paul Twitter (2026-05-28)
  21. [21] Micron in High-Volume Production of HBM4 Designed for NVIDIA ... — reactive:nvidia-vera-computex-launch
  22. [22] Micron Is Locked Out of HBM4 in NVIDIA's Vera Rubin Systems — reactive:nvidia-vera-computex-launch
  23. [23] NVIDIA to Use SK hynix and Samsung HBM4 for "Vera Rubin" Without Micron | TechPowerUp — reactive:nvidia-vera-computex-launch
  24. [24] Why Nvidia Snubbed Micron For Samsung, SK Hynix - Dailymotion — reactive:hbm-memory-supply-squeeze
  25. [25] NVIDIA CEO Jensen Huang at Dell Technologies World: ‘Demand Is Going Parabolic, Utterly Parabolic’ — NVIDIA Blog (2026-05-18)
  26. [26] NVIDIA just dropped $81.6B in Q1 revenue up 85% YoY 🤯 — reactive:nvidia-vera-computex-launch (2026-05-21)
  27. [27] The CEO of NVIDIA, looked at Matt Murphy and said "The next trillion dollar company, ladies and gentlemen." (Save this). — Milk Road AI Twitter (2026-06-02)
  28. [28] Microsoft's strategic AI datacenter planning enables seamless, large ... — reactive:nvidia-vera-computex-launch
  29. [29] 130,000 Rubin GPUs Are Being Deployed at Nscale For Microsoft, Further Showing Massive Interest In NVIDIA's Next-Gen AI Chips — reactive:nvidia-vera-computex-launch
  30. [30] CoreWeave Completes Industry-First Bring-Up And Validation Of NVIDIA Vera Rubin NVL72 — reactive:nvidia-vera-computex-launch
  31. [31] HPCwire - Since 1987 – Covering the Fastest Computers in the World and the People Who Run Them — reactive:nvidia-vera-computex-launch
  32. [32] CoreWeave Completes Industry-First Bring-Up and Validation of NVIDIA Vera Rubin NVL72 - Las Vegas Sun News — reactive:nvidia-vera-computex-launch
  33. [33] CoreWeave completes validation of Nvidia Vera Rubin NVL72 — reactive:nvidia-vera-computex-launch
  34. [34] Marvell and NVIDIA partner on NVLink Fusion | Marvell Technology posted on the topic | LinkedIn — reactive:nvidia-vera-computex-launch
  35. [35] Marvell today announced it is teaming with NVIDIA to offer NVLink ... — reactive:nvidia-vera-computex-launch
  36. [36] NVIDIA Unveils NVLink Fusion for Industry to Build Semi-Custom AI ... — reactive:nvidia-vera-computex-launch
  37. [37] NVIDIA invests $ 2 billion in Marvell Technology in silicon photonics ... — reactive:nvidia-vera-computex-launch
  38. [38] Micron Singapore - Facebook — reactive:nvidia-vera-computex-launch
  39. [39] Samsung sells out of 2026 HBM4 supply as memory resurgence ... — reactive:aws-garman-a100-demand
  40. [40] Price of Nvidia's Vera Rubin NVL72 racks skyrockets to as much as $8.8 million apiece, but server makers' margins will be tight — Nvidia is moving closer to shipping entire full-scale systems — reactive:nvidia-vera-computex-launch
  41. [41] Nvidia's Vera Rubin GPU: Redesigning Data Centres for 600kW Racks — reactive:nvidia-vera-computex-launch
  42. [42] Nvidia-Backed Nscale Plans Huge Data Center Cluster in West ... — reactive:nvidia-vera-computex-launch
  43. [43] Nvidia-backed UK AI firm Nscale raises $1.1 billion funding round — reactive:nvidia-vera-computex-launch
  44. [44] Nvidia debuts Rubin chip with 336B transistors and 50 petaflops of AI performance - SiliconANGLE — reactive:nvidia-vera-computex-launch
  45. [45] Nvidia CEO confirms Vera Rubin NVL72 is now in production — reactive:nvidia-vera-computex-launch
  46. [46] NVIDIA Vera Rubin AI Platform Hits Full Production CES 2026 ... — reactive:nvidia-vera-computex-launch
  47. [47] SK Hynix set to ship HBM4 for Nvidia's Vera Rubin this month — reactive:nvidia-vera-computex-launch
  48. [48] Nscale and Microsoft Announce Collaboration with NVIDIA and Caterpillar to Deliver 1.35GW of NVIDIA Vera Rubin NVL72 GPUs at Flagship AI Factory Campus in West Virginia — reactive:nvidia-vera-computex-launch
  49. [49] Nscale and Microsoft Announce Collaboration with NVIDIA and Caterpillar to Deliver 1.35GW of NVIDIA Vera Rubin NVL72 GPUs at Flagship AI Factory Campus in West Virginia — reactive:nvidia-vera-computex-launch
  50. [50] Nscale acquisition includes plan to build AI facility in Mason County — reactive:nvidia-vera-computex-launch
  51. [51] Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs — NVIDIA Blog (2026-05-18)
  52. [52] NVIDIA hand-delivers first 1.2 TB/s Vera CPUs to OpenAI, Anthropic ... — reactive:nvidia-vera-computex-launch
  53. [53] Nvidia unveils details of new 88-core Vera CPUs positioned to compete with AMD and Intel – new Vera CPU rack features 256 liquid-cooled chips that deliver up to a 6X gain in CPU throughput | Tom's Hardware — reactive:nvidia-vera-computex-launch
  54. [54] NVIDIA AI - Jensen Huang Says “Buy Dell” - LinkedIn — reactive:nvidia-vera-computex-launch
  55. [55] Jensen Huang today: Memory demand >> supply chain capacity. “Supply chain needs to be ready.” AI memory supercycle... — reactive:nvidia-vera-computex-launch (2026-05-18)
  56. [56] "Demand has gone parabolic. The reason is simple: Agentic AI has arrived." — reactive:nvidia-vera-computex-launch (2026-05-21)
  57. [57] NVIDIA GTC Taipei at COMPUTEX: Live Updates on What’s Next in AI — NVIDIA Blog (2026-05-21)
  58. [58] NVIDIA Vera Rubin NVL72 wins Computex 2026 awards for AI ... — reactive:nvidia-vera-computex-launch
  59. [59] Meta Builds AI Infrastructure With NVIDIA — reactive:nvidia-vera-computex-launch
  60. [60] NVIDIA GTC 2026: Google Cloud Deepens Partnership for AI ... — reactive:nvidia-vera-computex-launch