The Information Machine

Quick Paper Review: "There Will Be a Scientific Theory of Deep Learning"

Alignment Forum · LawrenceC · 2026-04-25

(No summary yet for this item — extraction summaries are still backfilling.)

Open original ↗

Appears in

Extraction

Topics: deep-learning-theorylearning-mechanicsmechanistic-interpretabilityml-theory-researchtraining-dynamics

Claims

  • Simon et al. propose 'learning mechanics' as an emerging theory of deep learning focused on training dynamics, coarse aggregate statistics, and average-case predictions rather than worst-case bounds.
  • The main practical output of learning mechanics research to date has been hyperparameter scaling techniques like mu-parameterization, with little demonstrated direct utility for LLM engineering.
  • The paper makes a stronger case that some theory will exist than that such a theory will be broadly useful or comprehensive enough to qualify as a complete 'theory of deep learning.'
  • Learning mechanics explicitly does not aim to explain the specific algorithms learned by particular networks, making it insufficient as a comprehensive theory of deep learning.
  • The paper serves primarily as a manifesto and entry point for researchers new to deep learning theory rather than a neutral assessment of evidence.

Key quotes

Learning mechanics is a theory that concerns itself with 'the dynamics of the training process', studies them using 'coarse aggregate statistics of learning', and has the goal to generate 'accurate average-case predictions'.
The main use of learning mechanics research so far has been in producing new learning mechanics research to retrodict known empirical phenomena; learning dynamics as a field has yielded little practical fruit.
Maybe there will be a scientific theory of deep learning. Maybe learning mechanics will become a theory covering some important aspects of deep learning... But I don't think the paper has convinced me about these claims.