NVIDIA Launches Vera CPU and Vera Rubin NVL72 at COMPUTEX / GTC Taipei
Version 2
2026-05-22 19:23 UTC · 69 items
What
NVIDIA's Vera CPU and Vera Rubin NVL72 platform have moved from announcement to shipping hardware, and the company's Q1 2026 earnings give the launch a financial backdrop. The Vera CPU, NVIDIA's first processor purpose-built for agentic AI, began shipping to leading labs on May 18 [1]. The Vera Rubin NVL72 was confirmed in full production at CES 2026 [3][4] and won the Computex 2026 Best Choice Golden Award [7][8]. NVIDIA's Q1 2026 results showed $81.6B in revenue, up 85% year-over-year, giving financial support to Jensen Huang's 'parabolic demand' claims [13][14]. Cloud provider Nebius publicly committed to deploying the Vera Rubin NVL72 in the US and Europe from H2 2026, the first concrete commercial availability timeline disclosed by any provider [15].
Why it matters
The Q1 earnings result gives NVIDIA's AI infrastructure demand narrative measurable financial backing, where it previously rested largely on promotional framing. At the same time, Huang's acknowledgment that memory demand is currently outpacing supply chain capacity [18][19] introduces the first credible structural constraint on the trajectory NVIDIA is projecting, raising the question of whether supply bottlenecks will moderate the growth rate even as demand accelerates.
Open questions
- All performance claims (10x cost-per-token reduction, 50% faster agentic workloads, 35x throughput per watt with Groq 3 LPX) still originate exclusively from NVIDIA's own materials [2][5]. SemiAnalysis published a technical architecture breakdown [21], but independent, reproducible benchmark validation of production hardware has not appeared. When will third-party performance data arrive?
- Huang stated at Dell Technologies World that memory demand exceeds supply chain capacity and that 'the supply chain needs to be ready' [18][19]. Which memory suppliers and HBM configurations face the most acute bottleneck, and how will this affect the Vera Rubin NVL72 production ramp into H2 2026?
- A May 18 report noted that two AWS customers attempted to purchase all of Graviton's 2026 CPU production capacity as NVIDIA entered the server CPU market [20]. How will the Vera CPU reshape enterprise procurement dynamics in the ARM ecosystem, and what competitive response will AWS, AMD, and Intel mount?
- Nebius has committed to H2 2026 deployment [15], but pricing and total cost of ownership versus the Blackwell generation remain undisclosed. What will enterprise buyers pay per token at commercial scale on the Vera Rubin NVL72?
Narrative
With the Vera CPU and Vera Rubin NVL72 platform, NVIDIA concentrated a product launch, an ecosystem partnership, and a landmark earnings result into a few days in late May 2026 (the dated events fall between May 18 and May 21), providing the most complete picture yet of where the company is positioning itself in the agentic AI infrastructure market.
The Vera CPU, NVIDIA's first processor designed from scratch for the memory-bandwidth demands of autonomous agent workloads, began shipping to leading AI labs on May 18, 2026 [1]. NVIDIA claims 1.2 TB/s of memory bandwidth and a 50% performance advantage over comparable x86 CPUs on agentic tasks [2]. Jensen Huang had already confirmed at his CES 2026 keynote that the Vera Rubin NVL72, the platform's flagship inference system, was in full production [3][4], which indicates that manufacturing was underway before the Computex announcements. At GTC Taipei, NVIDIA detailed the platform's full scope: 10x lower cost-per-token than the Blackwell generation; a cable-free, fanless modular design that reduces rack assembly time from two hours to five minutes per tray; optional Groq 3 LPX co-processor integration claimed to reach 35x higher throughput per watt for trillion-parameter models; and, as NVIDIA's investor press release framed it, six new chips forming what the company calls 'one incredible AI supercomputer' [5][6]. The Vera Rubin NVL72 received the Computex 2026 Best Choice Golden Award at the show [7][8].
At Dell Technologies World, Huang endorsed the Dell AI Factory partnership with a publicly circulated 'Buy Dell' statement [9][10] and signed a Dell PowerRack server on stage [11][12]. His keynote framed the enterprise on-premises channel as the primary commercial route for Vera Rubin NVL72 deployment.
NVIDIA's Q1 2026 earnings, reported May 21, showed $81.6B in revenue — up 85% year-over-year [13][14] — the strongest direct financial evidence yet that AI infrastructure demand is tracking the trajectory Huang has forecast. Cloud provider Nebius publicly committed to offering the Vera Rubin NVL72 in the US and Europe beginning H2 2026 [15], the first announced commercial availability window from any provider. Wiwynn and Hon Hai (Foxconn) showcased Vera Rubin NVL72 infrastructure at NVIDIA GTC 2026 [16][17], indicating a broadening manufacturing and integration ecosystem.
Two new supply-side tensions surfaced alongside the demand narrative. Huang acknowledged at Dell Technologies World that memory demand is currently exceeding supply chain capacity, stating explicitly that 'the supply chain needs to be ready' [18][19]. That structural constraint sits in tension with the smooth scaling his projections require. Separately, a May 18 market observation noted that two AWS customers had attempted to purchase all of Graviton's 2026 CPU production capacity before NVIDIA's Vera CPU reached the market [20], which the observer reads as a sign that NVIDIA's entry into the CPU tier is already reshaping procurement dynamics in the ARM server ecosystem.
SemiAnalysis published a technical architecture analysis positioning Vera Rubin as an evolution from the Grace Blackwell Oberon design [21]. It is one of the first substantive third-party engineering assessments of the platform, though fully reproducible, independent performance benchmarks have not yet appeared.
Timeline
- 2026-01: Jensen Huang announces at CES 2026 keynote that Vera Rubin NVL72 is in full production [3][4]
- 2026-05-18: NVIDIA begins shipping Vera CPUs to top AI labs [1]
- 2026-05-18: Jensen Huang keynotes at Dell Technologies World: announces Vera Rubin NVL72 specs, projects $3–4 trillion AI infrastructure buildout by 2030, endorses Dell with 'Buy Dell' statement, and flags memory supply chain constraint [2][9][10][18][19]
- 2026-05-19: Jensen Huang's on-stage signing of a Dell PowerRack server at Dell Technologies World is reported (posts dated 2026-05-19 and 2026-05-20) [11][12]
- 2026-05-21: NVIDIA reports Q1 2026 earnings: $81.6B revenue, up 85% year-over-year [13][14]
- 2026-05-21: NVIDIA GTC Taipei at COMPUTEX: Vera Rubin NVL72, Jetson Thor, and Alpamayo autonomous driving platform detailed; Vera Rubin NVL72 wins Computex 2026 Best Choice Golden Award [5][7][8]
- 2026-05: Nebius announces plan to offer Vera Rubin NVL72 in US and Europe from H2 2026 [15]
Perspectives
NVIDIA / Jensen Huang
Maximally bullish: the agentic AI era has definitively arrived, demand is 'parabolic,' and the Vera CPU and Vera Rubin NVL72 are generational leaps in inference economics. Q1 earnings ($81.6B, +85% YoY) validate the demand trajectory. Huang simultaneously acknowledged that memory supply chains cannot keep pace with demand — the first public admission of a structural bottleneck in the current cycle.
Evolution: Consistent bullish framing now reinforced by the strongest earnings result in company history, but with the addition of a supply chain caveat absent from earlier keynotes — representing a notable modulation in an otherwise unbroken optimism narrative.
Michael Dell / Dell Technologies
Aligned with NVIDIA's agentic AI vision; Dell AI Factory positioned as the primary enterprise on-premises channel for Vera Rubin NVL72. Michael Dell co-presented with Huang, framing the partnership as central to enterprise AI adoption.
Evolution: Consistent endorsement, visibly deepened by Huang's public 'Buy Dell' statement and the hardware-signing moment on stage — signaling the partnership has moved beyond contractual alignment into active co-marketing.
Nebius
Committed to deploying Vera Rubin NVL72 commercially in the US and Europe from H2 2026, signaling confidence in the platform's production readiness and anticipated customer demand.
Evolution: First appearance in this thread; represents the earliest public commercial availability commitment from any cloud or infrastructure provider.
Niraj Yagnik (market observer)
Notes CPU supply competition: two AWS customers attempted to lock up all of Graviton's 2026 CPU production capacity as NVIDIA entered the server CPU market — suggesting Vera CPU is already reshaping enterprise procurement decisions before broad availability.
Evolution: First appearance; introduces a competitive dynamic not present in NVIDIA's own materials and not previously surfaced in this thread.
Tensions
- NVIDIA's performance claims (10x cost-per-token, 50% faster agentic workloads, 35x throughput per watt with Groq 3 LPX) originate exclusively from the company's own promotional materials [2][5]. Q1 earnings validate demand strength but do not verify the specific technical specifications — the gap between financial success and independently verified hardware performance remains unresolved. [2][5][13][21]
- Huang's parabolic demand narrative and $3–4 trillion buildout projection are now complicated by his own acknowledgment that memory supply chains cannot keep pace with demand [18][19] — the growth thesis and the supply constraint coexist in tension within NVIDIA's own public statements. [2][18][19]
Sources
- [1] Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs — NVIDIA Blog (2026-05-18)
- [2] NVIDIA CEO Jensen Huang at Dell Technologies World: ‘Demand Is Going Parabolic, Utterly Parabolic’ — NVIDIA Blog (2026-05-18)
- [3] Nvidia CEO confirms Vera Rubin NVL72 is now in production — reactive:nvidia-vera-computex-launch
- [4] NVIDIA Vera Rubin AI Platform Hits Full Production CES 2026 ... — reactive:nvidia-vera-computex-launch
- [5] NVIDIA GTC Taipei at COMPUTEX: Live Updates on What’s Next in AI — NVIDIA Blog (2026-05-21)
- [6] NVIDIA Kicks Off the Next Generation of AI With Rubin — Six New ... — reactive:nvidia-vera-computex-launch
- [7] NVIDIA Vera Rubin NVL72 wins Computex 2026 awards for AI ... — reactive:nvidia-vera-computex-launch
- [8] 2026 Best Choice Award-Golden Award: NVIDIA Vera Rubin NVL72 - The Peak of AI Supercomputing — reactive:nvidia-vera-computex-launch
- [9] Jensen Huang Says “Buy Dell” | Dell Tech World 2026 | BuilderBase — reactive:nvidia-vera-computex-launch
- [10] NVIDIA AI - Jensen Huang Says “Buy Dell” - LinkedIn — reactive:nvidia-vera-computex-launch
- [11] Jensen Huang showed up at Dell Technologies World 2026 and signed a PowerRack server on stage. — reactive:nvidia-vera-computex-launch (2026-05-20)
- [12] Nvidia CEO Jensen Huang signed Dell’s PowerRack server at Dell Technologies World 2026, turning a light moment on the ev... — reactive:nvidia-vera-computex-launch (2026-05-19)
- [13] NVIDIA just dropped $81.6B in Q1 revenue up 85% YoY 🤯 — reactive:nvidia-vera-computex-launch (2026-05-21)
- [14] "Demand has gone parabolic. The reason is simple: Agentic AI has arrived." — reactive:nvidia-vera-computex-launch (2026-05-21)
- [15] Nebius to offer NVIDIA Vera Rubin NVL72 in US and Europe from H2 2026 | Corporate - EQS News — reactive:nvidia-vera-computex-launch
- [16] Wiwynn Showcases NVIDIA Vera Rubin NVL72 AI Factory Infrastructure at NVIDIA GTC 2026 — reactive:nvidia-vera-computex-launch
- [17] We’re excited to... - 鴻海科技集團Hon Hai Technology Group — reactive:nvidia-vera-computex-launch
- [18] Jensen Huang today: Memory demand >> supply chain capacity. “Supply chain needs to be ready.” AI memory supercycle... — reactive:nvidia-vera-computex-launch (2026-05-18)
- [19] 🚨 Jensen Huang on Memory Today: — reactive:nvidia-vera-computex-launch (2026-05-18)
- [20] 4/ the CPU story is well documented now. two AWS customers tried to buy all of graviton's 2026 capacity. nvidia launched... — reactive:nvidia-vera-computex-launch (2026-05-18)
- [21] Vera Rubin – Extreme Co-Design: An Evolution from Grace Blackwell Oberon — reactive:nvidia-vera-computex-launch
- [22] Michael Dell, Jensen Huang: Boldest Statements From Dell Technologies World 2026 — reactive:nvidia-vera-computex-launch
- [23] “Now we have, for the very first time, useful AI” – Jensen Huang and Michael Dell talk up the power of agentic AI at Dell Technologies World 2026 | IT Pro — reactive:nvidia-vera-computex-launch
- [24] Featured Sessions | Dell Technologies World 2026 | Dell USA — reactive:nvidia-vera-computex-launch