SemiAnalysis: AI Silicon Shortage — HBM Bottleneck and N3 Wafer Dominance

closed · v17 · 2026-06-16 · 216 items · history

What's new in v17

The main addition this pass is institutional corroboration of the power constraint theme: the IEA [15], WEF [16], Data Center Knowledge [17], and KTS Law [18] have each independently documented grid connectivity as a binding constraint, adding a new voice (institutional energy analysts) to the perspectives. A report title indicates nearly half of US data centers planned for 2026 face power challenges [14]. The HBM, TSMC N3, Intel foundry, and GPU rental themes are unchanged from the prior pass. Most new items this pass were low-quality social media posts with no extractable content.

What

HBM memory, TSMC N3 logic, and power/energy infrastructure are three concurrent binding constraints on AI accelerator production. GPU rental prices rose 40% between October 2025 and March 2026 [9][10], Micron reported Q2 FY2026 revenue of $23.86B (+196% YoY) with 75% gross margins [3], and Goldman Sachs projects 2027 hyperscaler capex at $1.4T against a $920B Wall Street consensus [21]. Institutional sources including the IEA [15] and WEF [16] have now independently documented the power constraint, with reports indicating nearly half of US data centers planned for 2026 face power challenges [14]. Intel's foundry role for NVIDIA's Feynman GPU remains unsettled [26][27].

Why it matters

Three simultaneous supply constraints — silicon, memory, and power — compound each other: adding compute capacity requires all three to expand together. The shift from training to agentic demand suggests token consumption is moving from episodic to continuous, sustaining GPU scarcity at a higher structural floor than the 2023 cycle implied and making the squeeze harder to relieve through incremental capacity additions.

Open questions

Has Intel finalized its foundry commitment from NVIDIA for the Feynman GPU, or is the relationship still in early evaluation as multiple sources characterize it [27][28]?
Will power and energy infrastructure become the binding constraint that throttles AI compute expansion before HBM and TSMC N3 capacity can be expanded, given grid delivery timelines of 3–5 years and reports that nearly half of planned 2026 US data centers face power challenges [13][14]?
Does the SK Hynix–NVIDIA co-development partnership give NVIDIA preferential HBM4/HBM5 allocation, structurally disadvantaging AMD and other accelerator makers competing for the same supply [7]?
Does Goldman Sachs' $1.4T capex projection for 2027 [21] reflect a durable buildout or an investment peak that will face correction once power and site constraints become binding?

Narrative

HBM memory and TSMC's N3 process node are the two established binding constraints on AI accelerator production. SemiAnalysis identified HBM wafer supply — not the earlier CoWoS packaging bottleneck — as the primary scarce resource [1]. SK Hynix has sold out its HBM capacity into 2026 [2], and Micron's Q2 FY2026 results confirmed the demand picture: $23.86 billion in revenue, up 196% year-over-year, with 75% non-GAAP gross margins and the most bullish forward guidance in the company's history [3]. Bernstein projects HBM4 pricing will nearly double from $16.6/GB to $37/GB by 2027 [4]. On the logic side, AI is projected to consume approximately 60% of TSMC's N3 output in 2026, rising to 86% in 2027 [5], with TSMC posting +41% year-over-year Q1 2026 revenue at all-time-high margins [6]. SK Hynix and NVIDIA formalized a multi-year HBM co-development partnership in June 2026 covering NVIDIA's Vera Rubin platform [7], and Jensen Huang stated NVIDIA consumes essentially all available HBM output [8].

The GPU compute rental market has tightened sharply. SemiAnalysis launched an H100 1-Click Rental Index documenting a 40% price increase in one-year H100 rentals from $1.70/hr in October 2025 to $2.35/hr in March 2026, with a 15–20% step occurring in January–February alone [9][10]. SemiAnalysis argues this squeeze differs structurally from 2023: demand has shifted from training runs to agentic workloads, with enterprise token spend moving from a curiosity to a real cost line [11]. Public neocloud equities are priced as if the cycle is rolling over, while SemiAnalysis's read is that GPU scarcity is real and the long-dated rental floor is materially higher than equity valuations imply [12].

Power and energy infrastructure have emerged as a third constraint layer, now documented across multiple institutional sources. GPU racks are reaching 400kW power density, exceeding what legacy data centers can support, and public grid delivery timelines run 3–5 years [13]. Reports indicate nearly half of US data centers planned for 2026 face power challenges [14], and the IEA [15], WEF [16], and industry publications [17][18] have each identified grid connectivity as a binding constraint on AI expansion. Bloom Energy's on-site fuel cells can deliver power in approximately 90 days, which hyperscalers are paying a premium for rather than waiting on grid timelines [19]; Radiant completed an AI-ready data center from groundbreaking to production in 12 months by bypassing the grid entirely [13]. At the supply chain floor, multi-layer ceramic capacitors — required in tens of thousands per rack — have experienced price hikes and extended lead times as AI server demand accelerates [20].

The investment signals are large and widening. Goldman Sachs projects 2027 hyperscaler capex at $1.4 trillion against a $920 billion Wall Street consensus [21]. Jensen Huang has framed AI factory economics as a $50 billion build cost producing $300–400 billion in intelligence output over the facility's life, and stated the buildout is accelerating with H2 2026 expected to exceed H1 and 2027 larger still [22][23]. On the foundry side, Intel's relationship with NVIDIA around the Feynman GPU and Intel 18A remains unsettled: earlier reports framed the deal as confirmed [24][25], while more recent sources describe NVIDIA as in early testing with Intel working to finalize commitments, and some frame Intel as a backup rather than co-primary supplier [26][27][28]. Google's order of 3M+ TPUs from Intel for 2028 delivery remains confirmed [29].

Timeline

2026-03-01: SemiAnalysis identifies HBM wafer supply — not CoWoS packaging — as the binding constraint on AI accelerator production. [1][49]
2026-05-30: SemiAnalysis projects AI consuming 60% of TSMC N3 output in 2026, rising to 86% in 2027. [30][50][5][51]
2026-05-31: SK Hynix confirmed sold out of DRAM, NAND, and HBM into 2026; Micron confirms 2026 HBM sold out and commits roughly $200 billion to long-term memory capacity. [2][38][52][39]
2026-06-01: TSMC posts +41% year-over-year Q1 2026 revenue with all-time-high margins; Arizona Phase 1 fab profitable ahead of schedule. [44][6][45]
2026-06-01: Intel announces Crescent Island GPU targeting AI inference by end of 2026, without HBM, competing on cost and thermal efficiency. [41][42]
2026-06-05: SemiAnalysis flags MLCCs as an overlooked AI server supply constraint; multiple publications corroborate AI-driven shortages of the sub-$1 passive components required in tens of thousands per rack. [20][53][54][55][56]
2026-06-06: Micron crosses $1 trillion in market cap as investor conviction in the HBM supercycle thesis strengthens. [40]
2026-06-08: SK Hynix and NVIDIA formalize a multi-year HBM co-development partnership covering NVIDIA's Vera Rubin platform; Jensen Huang states NVIDIA consumes essentially all available HBM supply. [7][8]
2026-06-08: Bernstein projects HBM4 pricing to nearly double to $37/GB by 2027; Google's 3M+ TPU order from Intel foundry for 2028 delivery confirmed across multiple publications. [4][43][29]
2026-06-10: Multiple publications report NVIDIA's Feynman GPU will use Intel Foundry for some components in a multi-die chiplet design targeting 2028. [24][32][25][33]
2026-06-10: Jensen Huang states the AI buildout is accelerating — H2 2026 expected to exceed H1, and 2027 projected to be very large; frames AI factory economics as 6–8x return on capital. [22][23]
2026-06-11: Additional reporting characterizes the NVIDIA-Intel 18A arrangement as early testing/evaluation with Intel working to finalize commitments; some sources frame Intel as a backup AI foundry. [26][27][28][57]
2026-06-11: SemiAnalysis flags 400kW GPU racks as exceeding legacy data center capacity and warns grid throttling will constrain AI compute; Radiant completed an AI-ready data center in 12 months by bypassing the grid. [13]
2026-06-12: SemiAnalysis launches H100 1-Click Rental Index documenting a 40% price increase from October 2025 to March 2026; GPU spot market moved from cooling to hard squeeze in approximately five months. [9][10]
2026-06-12: SemiAnalysis argues the current GPU squeeze is structurally driven by agentic workloads rather than training, and that neocloud equities are mispriced relative to real GPU scarcity. [12][11]
2026-06-12: Goldman Sachs projects 2027 hyperscaler capex at $1.4 trillion, more than 50% above the $920 billion Wall Street consensus. [21]
2026-06-13: Micron CEO issues most bullish forward guidance in company history; Q2 FY2026 revenue $23.86B (+196% YoY) with 75% non-GAAP gross margins. [3]
2026-06-14: IEA, WEF, Data Center Knowledge, and KTS Law independently document power grid connectivity as a binding constraint on AI data center expansion; reports indicate nearly half of US data centers planned for 2026 face power challenges. [15][16][14][17][18]

Perspectives

SemiAnalysis

AI's dominance of leading-edge semiconductor capacity is structural; the GPU rental market moved from cooling to hard squeeze in five months, with H100 rentals up 40%; agentic workloads — not training — are driving the current demand surge; neocloud equities are mispriced relative to GPU scarcity and long-dated rental floors; MLCCs and 400kW power density are overlooked infrastructure constraints.

Evolution: Consistent; the H100 rental index and agentic-demand framing established last pass remain the core analytical contributions.

[30][5][1][6][31][20][12][11][9][10][13]

NVIDIA / Jensen Huang

AI buildout is accelerating — H2 2026 will exceed H1 and 2027 will be very large; AI factory economics yield $300–400B in output from a $50B build; NVIDIA consumes essentially all available HBM supply; the Feynman GPU reportedly targets Intel Foundry for some components in a 2028 multi-die design, though that commitment remains in evaluation.

Evolution: Consistent; Jensen Huang's acceleration outlook and AI factory economics framing are unchanged.

[7][8][24][32][25][33][22][23][27][28]

SK Hynix

Committed to HBM leadership through a multi-year co-development partnership with NVIDIA; targets DRAM capacity doubling by 2031; expects memory supply to remain tight until at least 2030.

Evolution: Consistent.

[34][2][35][7][36][37]

Micron

Q2 FY2026 revenue of $23.86B (+196% YoY) with 75% gross margins and the most bullish CEO guidance in company history; 2026 HBM sold out with roughly $200B committed to long-term capacity.

Evolution: Blowout Q2 earnings and record guidance materially strengthened this stance last pass; unchanged this pass.

[38][39][40][3]

Intel

On two foundry tracks targeting 2028 — Google's confirmed 3M+ TPU order and the NVIDIA Feynman GPU multi-die design, though the NVIDIA commitment is still in evaluation — and separately targeting AI inference with HBM-free Crescent Island by end of 2026.

Evolution: More recent items frame Intel as working to finalize the NVIDIA commitment and as a potential backup foundry, adding negotiation uncertainty to earlier framing that treated the deal as more settled.

[41][42][43][29][24][32][25][33][26][28]

TSMC

AI demand is structurally robust through at least 2027–2028; Q1 2026 delivered all-time-high margins on +41% growth; TSMC Arizona is profitable ahead of schedule; AI chips projected to consume 86% of N3 output by 2027.

Evolution: Consistent; no new disclosures this pass.

[5][44][6][45]

Goldman Sachs and independent analysts

Goldman projects 2027 hyperscaler capex at $1.4T vs. $920B consensus; Bernstein projects HBM4 nearly doubles to $37/GB by 2027; the early-evaluation characterization of the Intel-NVIDIA relationship adds modest uncertainty to the Intel foundry investment thesis.

Evolution: Consistent; Goldman's $1.4T projection established last pass remains the defining bull-case signal.

[4][46][21]

IEA, WEF, and institutional energy analysts

Power grid connectivity is a strategic bottleneck for AI expansion; nearly half of US data centers planned for 2026 face power challenges; energy constraints are now documented at the institutional-research level alongside industry-specific reporting.

Evolution: New voice this pass, corroborating the power constraint theme that SemiAnalysis surfaced earlier but now with institutional research backing.

[15][16][18][14][17]

Tensions

SemiAnalysis argues GPU scarcity is real and neocloud equity valuations are too low relative to the long-dated rental floor [12][10]; public neocloud equity markets are priced as if the AI infrastructure cycle is rolling over — the two positions assign opposite meanings to the same data. [12][10]
Earlier reports state NVIDIA's Feynman GPU 'will use Intel Foundry for some components' [24][25], while newer sources describe NVIDIA as in 'early testing/evaluation stages' with Intel still 'working to finalize commitments' [27][28] — the two framings assign different maturity levels to the same deal. [24][25][27][28]
Some sources frame Intel as a 'backup AI foundry' for NVIDIA [26] while TSMC is projected to supply 86% of N3 wafers to AI by 2027 [5] — the scope of Intel's intended role (limited chiplet versus meaningful volume) is unspecified and unresolved. [26][5]
The NVIDIA–SK Hynix co-development partnership concentrates leading-edge HBM access around NVIDIA [7], while AMD's MI455/VR200 remain engineering samples [31] and Intel's Crescent Island deliberately avoids HBM [42] — neither competitor is positioned to access HBM4 at scale in the near term. [7][31][42]
Goldman Sachs projects 2027 hyperscaler capex at $1.4T [21] vs. the $920B Wall Street consensus — the gap between bull and base cases for AI infrastructure spend has widened to over 50%. [21]
Investor framing positions Micron as an 'AI gatekeeper' [47][48], but SK Hynix holds dominant HBM market share, leads HBM4 development, and has formalized a co-development partnership with NVIDIA [7] — the two narratives assign structural primacy to different memory suppliers. [47][48][2][7]

Status: active and growing

Sources

[1] The Great AI Silicon Shortage - SemiAnalysis — reactive:great-ai-silicon-shortage
[2] SK Hynix sells out DRAM, NAND, and HBM capacity into 2026 amid ... — reactive:great-ai-silicon-shortage
[3] Micron's CEO just dropped the most bullish forward guidance in the company's history and the earnings report is 8 tradin… — Milk Road AI Twitter (2026-06-13)
[4] Most investors think memory stocks have peaked but they are completely wrong. (Save this). — Milk Road AI Twitter (2026-06-08)
[5] Our work shows AI taking roughly 60% of N3 family wafers in 2026 and stepping up to about 86% in 2027, which is a regime… — SemiAnalysis Twitter (2026-05-30)
[6] After posting +41% y/y growth with ATH GM and OM in 1Q26, TSMC is tracking to high-30s growth in CY26. We raised our TSM… — SemiAnalysis Twitter (2026-06-01)
[7] SK hynix and NVIDIA just formed a multi-year memory partnership to build the chips behind the next wave of AI factories. — Rohan Paul Twitter (2026-06-08)
[8] In Seoul, Nvidia CEO Jensen Huang handed out SK Hynix x 7-Eleven HBM Chips snack bags while addressing the crowd. — Rohan Paul Twitter (2026-06-08)
[9] The index has H100 one-year rentals running from $1.70 per hour per GPU in October 2025 to about $2.35 in March 2026, wh… — SemiAnalysis Twitter (2026-06-12)
[10] Alongside the launch of our H100 1-Click Rental Index, we wrote up what the GPU rental market actually looks like in ear… — SemiAnalysis Twitter (2026-06-12)
[11] What we walk through in the article is why this isnt a repeat of the 2023 squeeze. The demand side is no longer training… — SemiAnalysis Twitter (2026-06-12)
[12] Interestingly, the public market is positioned in the opposite direction, with neocloud names trading like the cycle is … — SemiAnalysis Twitter (2026-06-12)
[13] GPU Racks hitting 400kW? Legacy data centers wont be able to handle it and the grid WILL get throttled. — SemiAnalysis Twitter (2026-06-11)
[14] Nearly half of the US data centers planned for 2026 are facing ... — reactive:great-ai-silicon-shortage
[15] Executive summary – Key Questions on Energy and AI – Analysis - IEA — reactive:ai-energy-infrastructure
[16] Is power grid connectivity the strategic bottleneck for AI? — reactive:ai-datacenter-power-crisis
[17] Gridlocked: Power Constraints Shape the Future of Data Centers — reactive:great-ai-silicon-shortage
[18] AI Data Centers and the Looming Energy Crisis in the United States — reactive:great-ai-silicon-shortage
[19] AI data centers need huge amounts of electricity, and the public grid takes 3 to 5 years to deliver it. — Milk Road AI Twitter (2026-06-12)
[20] Nobody is asking who makes the <$1 multi-layer ceramic capacitor (MLCC) that keeps voltage stable across every chip i… — SemiAnalysis Twitter (2026-06-05)
[21] This is WILD! — Milk Road AI Twitter (2026-06-11)
[22] Jensen Huang just explained AI in a way that makes the investment thesis for Nvidia almost impossible to argue with (Sav… — Milk Road AI Twitter (2026-06-11)
[23] Jensen Huang just made a statement that every investor in AI infrastructure needs to hear (Save this). — Milk Road AI Twitter (2026-06-10)
[24] NVIDIA Feynman and Intel Foundry: New Report, Old Core – But With an Important Packaging Clue|igor´sLAB — reactive:great-ai-silicon-shortage
[25] Nvidia Feynman GPUs to use Intel Foundry for some components — reactive:great-ai-silicon-shortage
[26] Key facts: Intel tests 18A multi‑die; backup AI foundry; Cadence 14A — TradingView News — reactive:great-ai-silicon-shortage
[27] $NVDA Nvidia is in early testing/evaluation stages with $INTC Intel's ... — reactive:great-ai-silicon-shortage
[28] Intel is reportedly 'working to finalize commitments from Nvidia' as a foundry partner, suggesting gaming potential for the 18A node : r/hardware — reactive:great-ai-silicon-shortage
[29] Google orders 3 million TPUs from Intel as TSMC strains - Quartz — reactive:great-ai-silicon-shortage
[30] It also explains why the bottleneck conversation is migrating away from CoWoS, which is finally easing, and onto memory,… — SemiAnalysis Twitter (2026-05-30)
[31] IMPORTANT: it is important to understand that the CoreWeave & Microsoft photos are still Engineering/Quality Samples… — SemiAnalysis Twitter (2026-06-03)
[32] Nvidia's Next-Gen GPU Could be Coming to Intel Foundry — reactive:great-ai-silicon-shortage
[33] NVIDIA to Build GPUs on Intel Foundry from 2028: Report - Reddit — reactive:great-ai-silicon-shortage
[34] SK hynix Delays HBM4 Mass Production and Capacity Expansion — reactive:aws-garman-a100-demand
[35] SK hynix just said AI memory demand is now so large that it will double wafer capacity within 5 years, yet still expects… — Rohan Paul Twitter (2026-06-02)
[36] SK hynix said to be planning to double DRAM capacity by 2031 - New Electronics — reactive:great-ai-silicon-shortage
[37] 2026 Market Outlook: SK hynix's HBM to Fuel AI Memory Boom — reactive:great-ai-silicon-shortage
[38] Micron's Sold Out 2026 HBM And US$200b Bet On AI Demand — reactive:micron-hbm-bull-case
[39] Micron's AI Supercycle Accelerates (NASDAQ:MU) | Seeking Alpha — reactive:great-ai-silicon-shortage
[40] Micron crossed $1 trillion in market cap and it is still undervalued (Save this). — Milk Road AI Twitter (2026-06-06)
[41] Intel: Our upcoming AI chip will be cheaper, run cooler than Nvidia, AMD options — Ars Technica AI (2026-06-01)
[42] Intel's new inference chip, Crescent Island, doesn't use HBM. — reactive:great-ai-silicon-shortage (2026-06-05)
[43] The Information reports that Google has picked Intel to manufacture 3M+ Google TPUs in 2028. — Rohan Paul Twitter (2026-06-08)
[44] TSMC Arizona surprised. After ramping up strongly in CY25 ($2B+ revenue), Phase 1 net profit in 1Q26 alone exceeded the … — SemiAnalysis Twitter (2026-06-01)
[45] The foundry industry hit a record $48.8B in 1Q26, +32% y/y and +3% q/q in seasonally soft Q1, marking the 9th consecutiv… — SemiAnalysis Twitter (2026-06-01)
[46] 🟢 Intel Surges After Report Google May Use Its Foundry for AI Chips — reactive:great-ai-silicon-shortage (2026-06-08)
[47] FinancialContent - The Memory Supercycle: Why Micron Technology is the New AI Gatekeeper — reactive:great-ai-silicon-shortage
[48] Micron Stock Up 100%: What the HBM Leader Plans for 2026 — reactive:great-ai-silicon-shortage
[49] The Great AI Silicon Shortage — reactive:great-ai-silicon-shortage
[50] The broader implication, which we work through in detail in the piece, is that the supply curve for frontier accelerator… — SemiAnalysis Twitter (2026-05-30)
[51] One of the throughlines in our Great AI Silicon Shortage piece is that the conversation about leading-edge capacity has … — SemiAnalysis Twitter (2026-05-30)
[52] Sold-Out HBM Supply and AI Tailwinds Point to Strong 2026 Growth — reactive:great-ai-silicon-shortage
[53] MLCC Shortages Return as AI Server Demand Strains Capacity - Astute Group — reactive:great-ai-silicon-shortage
[54] MLCC Consider Price Increase as AI Demand Outpaces Supply — reactive:great-ai-silicon-shortage
[55] AI server boom strains tantalum capacitors; MLCC substitution falls ... — reactive:great-ai-silicon-shortage
[56] AI drives MLCC shortage ... — reactive:great-ai-silicon-shortage
[57] NVIDIA to Build GPUs on Intel Foundry from 2028: Report — reactive:great-ai-silicon-shortage