The Information Machine

SemiAnalysis: AI Silicon Shortage — HBM Bottleneck and N3 Wafer Dominance

open · v7 · 2026-06-05 · 100 items · history

What's new in v7

New items this pass are thin. The one substantive addition is confirmation that Intel's Crescent Island inference chip does not use HBM [14], which deepens the existing observation that Intel is competing entirely outside the HBM supply chain rather than within it — this is reflected in the Intel/AMD perspective and the corresponding timeline entry. No new themes, voices, or disagreements emerged; core HBM and N3 shortage findings and the 2030 tightness timeline are unchanged.

What

HBM memory and TSMC N3 logic remain simultaneously sold out, with the shortage projected to persist until at least 2030: SK Hynix plans to double wafer capacity within five years yet still expects supply to stay tight through the end of the decade [7]. TSMC reported +41% year-over-year revenue growth with all-time-high margins in Q1 2026 [6], and its market share exceeds 95% of global foundry revenue [8]. AMD MI455 and VR200 racks visible at CoreWeave and Microsoft are engineering samples with incomplete software stacks and no production tokens generated [12]. Intel's Crescent Island inference chip deliberately avoids HBM, competing on cost and thermal grounds entirely outside the HBM supply chain [13][14].

Why it matters

SK Hynix's 2030 tightness forecast [7] means the AI hardware shortage is a decade-scale structural constraint, not a supply lag that resolves in the next product cycle. With TSMC controlling more than 95% of foundry revenue [8] and memory supply projected tight through 2030, allocation decisions inside two or three companies effectively set the pace of global AI deployment.

Open questions

  • If SK Hynix expects supply tightness until 2030 even after doubling wafer capacity [7], how quickly can AI labs and cloud providers actually scale compute infrastructure?

  • How long until AMD's MI455 and VR200 — currently engineering samples with incomplete software stacks and no production tokens [12] — reach production readiness, and does that timeline shift hyperscaler purchasing decisions toward Nvidia?

  • Does the 146-day stability of H100 spot pricing in the $2.70–$3.01 band [16] reflect a genuine supply-demand equilibrium, or is it masking a future repricing event as Blackwell-generation accelerators absorb incremental demand?

  • With AI projected to absorb 86% of TSMC N3 wafers by 2027 [5] and TSMC concentration exceeding 95% [8], how does Apple's structural wafer priority get renegotiated as AI's share approaches the ceiling?

Narrative

The AI hardware supply crisis has two simultaneous binding constraints. SemiAnalysis's research identified HBM wafer supply — not earlier CoWoS packaging bottlenecks — as the scarce resource limiting AI accelerator production [1]. Order books confirm the thesis: SK Hynix has sold out its DRAM, NAND, and HBM capacity into 2026 [2], and Micron has similarly sold out its 2026 HBM allocation while committing roughly $200 billion to long-term memory capacity expansion [3][4]. On the logic side, TSMC's N3 process node faces equivalent pressure, with AI projected to absorb approximately 60% of N3-family output in 2026, rising to roughly 86% in 2027 [5]. SemiAnalysis explicitly names N3 the main supply bottleneck of the current AI cycle [6].

The shortage is extending further than previously estimated. SK Hynix announced plans to double wafer capacity within five years to meet AI memory demand — yet still expects supply to remain tight until at least 2030 [7], well beyond prior estimates of 2026–2027 relief. TSMC's Q1 2026 financials put demand pressure in quantitative terms: +41% year-over-year revenue growth with all-time-high margins, with global foundry revenue hitting a record $48.8 billion and TSMC's market share exceeding 95% [6][8]. TSMC Arizona Phase 1 turned profitable ahead of schedule, with Q1 2026 net profit alone exceeding the facility's total profit for all of calendar year 2025 [9]. Supply investment is scaling at historic levels — SK Hynix is spending $13.3 billion on equipment in 2026 alone [10], with an additional $15 billion committed to longer-term expansion [11] — but the 2030 tightness forecast shows that even this investment rate does not resolve the shortage within the decade.

Alternative accelerators are further from production readiness than their public presence suggests. SemiAnalysis flagged that rack photos of AMD's VR200 and MI455 systems circulating from CoreWeave and Microsoft are engineering and quality samples: the software stack bring-up is not complete and no production tokens have been generated [12]. Intel is targeting the AI inference market with its Crescent Island GPU by end of 2026, competing on cost and thermal efficiency rather than training performance — and the chip is designed without HBM entirely [13][14], positioning Intel outside the HBM supply crunch rather than competing within it. Samsung faces HBM4 yield challenges and rollout delays that reinforce SK Hynix's structural lead in AI memory [15].

At the market data level, SemiAnalysis disputes how AI hardware conditions are measured. Its transaction-based H100 pricing data held in a $2.70–$3.01 band for 146 consecutive days, while the Ornn index swung from $1.80 to $3.22 to $2.63 within a single month [16]. SemiAnalysis attributes the Ornn index's volatility to its reliance on publicly listed transactions only, arguing its broader dataset reveals a more stable underlying market. On the demand side, hyperscalers are deploying a diverse mix of GPU, XPU, and CPU chips, which requires ODMs to build larger design teams and expand assembly and testing capacity — creating supply chain complexity while opening new market opportunities for capable ODMs [17].

Timeline

  • 2026-03-01: SemiAnalysis publishes 'The Great AI Silicon Shortage,' identifying HBM wafer supply — not CoWoS packaging — as the binding constraint on AI accelerator production. [1][40]
  • 2026-03-05: TrendForce reports SK Hynix commits an additional $15 billion to fab expansion. [11]
  • 2026-04-27: TrendForce reports TSMC 3nm monthly capacity on track to reach 180,000 wafers by 2026, up over 40% year-over-year. [23]
  • 2026-05-30: SemiAnalysis publishes: HBM as the new bottleneck, AI taking 60%/86% of N3 wafers in 2026/2027, and market concentration reframed as a policy question. [18][19][5][20]
  • 2026-05-30: SemiAnalysis publishes analysis of the Apple-TSMC partnership, detailing how leading-edge wafer allocation priorities are structurally negotiated. [21]
  • 2026-05-31: SK Hynix confirmed sold out of DRAM, NAND, and HBM capacity into 2026. [2]
  • 2026-05-31: Micron confirmed sold out of its 2026 HBM allocation; commits roughly $200 billion to long-term memory capacity. [3][30][4]
  • 2026-05-31: SK Hynix reports delays to HBM4 mass production while simultaneously ramping 1c DRAM 8-fold and building new HBM4 packaging plants. [25][26][41][28]
  • 2026-06-01: TSMC shortages at leading-edge nodes projected to extend beyond 2027; SK Hynix spending $13.3 billion on equipment in 2026. [24][10]
  • 2026-06-01: TSMC posts +41% year-over-year Q1 2026 growth with all-time-high margins; SemiAnalysis names N3 as the main supply bottleneck of the AI cycle. [6][8]
  • 2026-06-01: TSMC Arizona Phase 1 Q1 2026 profit exceeds all of calendar year 2025; TSMC board authorizes record single-tranche Arizona capital commitment. [9]
  • 2026-06-01: Intel announces Crescent Island GPU targeting AI inference by end of 2026; the chip does not use HBM, competing on cost and thermal efficiency rather than training performance. [13][14]
  • 2026-06-01: SemiAnalysis identifies Tesla-SpaceX Terafab targeting one million wafers per month across logic and memory. [42]
  • 2026-06-02: SK Hynix discloses plans to double wafer capacity within five years yet still expects memory supply to stay tight until at least 2030. [7]
  • 2026-06-02: SemiAnalysis rejects the Ornn H100 index as methodologically misleading; its own data shows H100 spot pricing stable in a $2.70–$3.01 band for 146 consecutive days. [16]
  • 2026-06-02: Hyperscaler chip diversification across GPU, XPU, and CPU types creates new design and manufacturing complexity for ODMs. [17]
  • 2026-06-03: SemiAnalysis clarifies AMD VR200 and MI455 racks at CoreWeave and Microsoft are engineering samples; software stack incomplete, no production tokens generated. [12]

Perspectives

SemiAnalysis

AI's dominance of leading-edge semiconductor capacity is a structural regime change; N3 is the main supply bottleneck of the current AI cycle, TSMC capex will exceed Street consensus, H100 spot pricing is stable, and the Ornn index is methodologically misleading. AMD MI455/VR200 racks are engineering samples, not production hardware, and hyperscaler chip diversification is reshaping the ODM supply chain.

Evolution: Consistent; the AMD engineering sample clarification [12] and ODM complexity observation [17] extended the analysis beyond memory and logic into server design.

TSMC

AI demand is structurally robust through at least 2027–2028; Q1 2026 delivered all-time-high margins on +41% growth; TSMC Arizona is profitable ahead of schedule; expanding capacity simultaneously across greenfield and existing sites but cannot keep pace with AI chip demand.

Evolution: Consistent; the Arizona profitability milestone [9] remains the most significant data point in TSMC's narrative.

SK Hynix

Committed to HBM leadership through massive investment — including plans to double wafer capacity within five years — yet expects memory supply to remain tight until at least 2030 despite that expansion [7], while navigating near-term HBM4 mass production delays.

Evolution: The 2030 tightness forecast [7] significantly extended SK Hynix's prior shortage timeline and is the sharpest escalation of the structural shortage narrative in this thread.

Micron

2026 HBM is sold out; betting roughly $200 billion on an AI memory supercycle and positioned by both the company and analysts as a decisive gatekeeper in AI memory supply.

Evolution: Consistent; the 'AI gatekeeper' analyst framing has not changed.

Samsung

Scaling HBM production alongside SK Hynix to meet AI demand, while facing HBM4 yield challenges and rollout delays that reinforce SK Hynix's structural lead in AI memory.

Evolution: Consistent; HBM4 delay confirmation [15] sharpens Samsung's secondary position.

Intel and AMD

Both are at the margins of Nvidia's dominance: Intel targets AI inference with Crescent Island by end of 2026 on cost and thermal grounds, explicitly avoiding HBM and conceding training to Nvidia [13][14]; AMD's MI455/VR200 systems are circulating in customer environments but are engineering samples with incomplete software stacks [12].

Evolution: The confirmation that Crescent Island does not use HBM [14] clarifies Intel's deliberate positioning outside the HBM supply chain rather than competing within it.

Independent market analysts and investors

The bottleneck shift from GPUs to HBM is confirmed and investable; memory suppliers — particularly Micron — are positioned as multi-year beneficiaries, with Wall Street deploying 'AI gatekeeper' framing to describe Micron's strategic position.

Evolution: SK Hynix's 2030 tightness disclosure [7] further reinforces the bullish memory investment thesis; conviction is at its highest point in the thread.

Tensions

  • SK Hynix is delaying HBM4 mass production [25] while announcing plans to double wafer capacity [7] and spending $13.3 billion on equipment this year [10] — near-term execution risk coexists with the most aggressive long-term investment posture in the industry. [25][10][7][11]
  • SemiAnalysis argues AI accelerator supply is now a policy decision inside TSMC, Apple, and Samsung [19], but TSMC's active 3nm capacity expansion and record capital commitments [6] show supply does respond to market signals — just on a 12–24 month construction lag. [19][23][6]
  • The Apple-TSMC partnership gives Apple structural priority at leading-edge nodes [21], but AI's projected 86% share of N3 wafers by 2027 [5] and TSMC's >95% market concentration [8] imply either Apple's share contracts or TSMC's expansion pace must outrun current projections. [21][5][8]
  • Investor framing positions Micron as the 'AI gatekeeper' [31][32], but SK Hynix holds dominant HBM market share, leads HBM4 development, and has committed to doubling wafer capacity [7] — one of these narratives is materially overstated. [31][32][2][25][7]
  • SemiAnalysis's transaction-based H100 pricing shows a stable 146-day band [16], while the Ornn index swung from $1.80 to $3.22 to $2.63 in a single month — the two methodologies imply fundamentally different reads on current AI compute market conditions. [16]
  • AMD and Intel are publicly visible in AI accelerator deployments, but AMD's MI455/VR200 racks are engineering samples with incomplete software stacks [12] and Intel's Crescent Island deliberately avoids HBM and concedes training to Nvidia [13][14] — neither is a near-term substitute. [12][13][14]

Status: active but slowing

Sources

  1. [1] The Great AI Silicon Shortage - SemiAnalysis — reactive:great-ai-silicon-shortage
  2. [2] SK Hynix sells out DRAM, NAND, and HBM capacity into 2026 amid ... — reactive:great-ai-silicon-shortage
  3. [3] Micron's Sold Out 2026 HBM And US$200b Bet On AI Demand — reactive:micron-hbm-bull-case
  4. [4] Micron's AI Supercycle Accelerates (NASDAQ:MU) | Seeking Alpha — reactive:great-ai-silicon-shortage
  5. [5] Our work shows AI taking roughly 60% of N3 family wafers in 2026 and stepping up to about 86% in 2027, which is a regime… — SemiAnalysis Twitter (2026-05-30)
  6. [6] After posting +41% y/y growth with ATH GM and OM in 1Q26, TSMC is tracking to high-30s growth in CY26. We raised our TSM… — SemiAnalysis Twitter (2026-06-01)
  7. [7] SK hynix just said AI memory demand is now so large that it will double wafer capacity within 5 years, yet still expects… — Rohan Paul Twitter (2026-06-02)
  8. [8] The foundry industry hit a record $48.8B in 1Q26, +32% y/y and +3% q/q in seasonally soft Q1, marking the 9th consecutiv… — SemiAnalysis Twitter (2026-06-01)
  9. [9] TSMC Arizona surprised. After ramping up strongly in CY25 ($2B+ revenue), Phase 1 net profit in 1Q26 alone exceeded the … — SemiAnalysis Twitter (2026-06-01)
  10. [10] SK Hynix to Spend $13.3 Billion on Equipment Alone This Year as ... — reactive:great-ai-silicon-shortage
  11. [11] [News] SK Hynix Commits Additional USD 15 Billion, Escalating Fab Expansion Race among Memory Giants — reactive:great-ai-silicon-shortage
  12. [12] IMPORTANT: it is important to understand that the CoreWeave & Microsoft photos are still Engineering/Quality Samples… — SemiAnalysis Twitter (2026-06-03)
  13. [13] Intel: Our upcoming AI chip will be cheaper, run cooler than Nvidia, AMD options — Ars Technica AI (2026-06-01)
  14. [14] Intel's new inference chip, Crescent Island, doesn't use HBM. — reactive:great-ai-silicon-shortage (2026-06-05)
  15. [15] Samsung delays HBM4 rollout to 2026 due to yield challenges, all ... — reactive:aws-garman-a100-demand
  16. [16] The recent Ornn H100 index drop to $2.63 (-7.72%) is confusing. But mostly because it's a misleading index. Our H100 hou… — SemiAnalysis Twitter (2026-06-02)
  17. [17] Major hyperscalers introduce a variety of GPU, XPU, and CPU chips, leading to more diverse server rack and board designs… — SemiAnalysis Twitter (2026-06-02)
  18. [18] It also explains why the bottleneck conversation is migrating away from CoWoS, which is finally easing, and onto memory,… — SemiAnalysis Twitter (2026-05-30)
  19. [19] The broader implication, which we work through in detail in the piece, is that the supply curve for frontier accelerator… — SemiAnalysis Twitter (2026-05-30)
  20. [20] One of the throughlines in our Great AI Silicon Shortage piece is that the conversation about leading-edge capacity has … — SemiAnalysis Twitter (2026-05-30)
  21. [21] Apple-TSMC: The Partnership That Built Modern Semiconductors — reactive:great-ai-silicon-shortage
  22. [22] TSMC is stating that AI demand is good for both 2027 and 2028. — reactive:great-ai-silicon-shortage
  23. [23] [News] TSMC 3nm Monthly Capacity May Hit 180K Wafers by 2026 ... — reactive:great-ai-silicon-shortage
  24. [24] TSMC can't keep up with AI chip demand, with shortages projected to last beyond 2027 — reactive:great-ai-silicon-shortage
  25. [25] SK hynix Delays HBM4 Mass Production and Capacity Expansion — reactive:aws-garman-a100-demand
  26. [26] HBM4 race accelerates: SK hynix builds new packaging plant and ... — reactive:aws-garman-a100-demand
  27. [27] SK Hynix presses ahead on HBM4 despite tightening AI memory supply — reactive:great-ai-silicon-shortage
  28. [28] SK hynix Begins Expanding HBM4 Production Capacity with New ... — reactive:great-ai-silicon-shortage
  29. [29] Micron sets 1γ as mainstream node for 2026 as HBM and ... - digitimes — reactive:great-ai-silicon-shortage
  30. [30] Sold-Out HBM Supply and AI Tailwinds Point to Strong 2026 Growth — reactive:great-ai-silicon-shortage
  31. [31] FinancialContent - The Memory Supercycle: Why Micron Technology is the New AI Gatekeeper — reactive:great-ai-silicon-shortage
  32. [32] Micron Stock Up 100%: What the HBM Leader Plans for 2026 — reactive:great-ai-silicon-shortage
  33. [33] Samsung and SK Hynix to scale up memory production capacity in ... — reactive:aws-garman-a100-demand
  34. [34] Major tech firms shift to Samsung as TSMC capacity falls short | Jeffrey Cooper — reactive:great-ai-silicon-shortage
  35. [35] Samsung Breaks TSMC Monopoly, Supplies Tesla AI Chips | DBR — reactive:great-ai-silicon-shortage
  36. [36] Samsung, SK hynix dramatically scaling up HBM production to meet ... — reactive:great-ai-silicon-shortage
  37. [37] 1/ The bottleneck moved. For two years the scarce resource was Nvidia's GPUs. Now it's the high-bandwidth memory that si... — reactive:great-ai-silicon-shortage (2026-05-27)
  38. [38] AI Demand Locks Up Advanced Memory Supply Through 2026 — reactive:hbm-memory-supply-squeeze
  39. [39] Why Wall Street thinks this AI stock could be 2026’s biggest surprise — TradingView News — reactive:great-ai-silicon-shortage
  40. [40] The Great AI Silicon Shortage — reactive:great-ai-silicon-shortage
  41. [41] SK Hynix to ramp up 1c DRAM production 8-fold in 2026 : r/hardware — reactive:great-ai-silicon-shortage
  42. [42] Intel Foundry's EMIB plus 18A-P/T is positioned as a multi-billion-dollar external franchise into 2028. New entrants sti… — SemiAnalysis Twitter (2026-06-01)