The Information Machine

Inference Production is likely a more scale oriented game than even steel manufacturing. Bigger models push capability f…

SemiAnalysis Twitter · SemiAnalysis (@SemiAnalysis_) · 2026-06-10

SemiAnalysis argues that AI inference production is more scale-dependent than even steel manufacturing, citing Claude Opus 4.5 as proof that frontier model scale unlocks new capabilities like agentic AI that local LLMs cannot viably replicate.

Open original ↗

Appears in

Extraction

Topics: ai-inferencelarge-language-modelseconomies-of-scalecommercial-viability

Claims

  • Inference production has greater economies of scale than steel manufacturing.
  • Frontier model scale unlocks new capability tiers — Opus 4.5 specifically made agentic AI commercially possible.
  • Local LLMs generate tokens at a scale that is unlikely to be commercially viable.

Key quotes

Inference Production is likely a more scale oriented game than even steel manufacturing.
Bigger models push capability frontiers (Opus 4.5 made Agentic possible), and local LLMs create and serve tokens at a scale that is unlikely to be commercially viable.