Quick Paper Review: "There Will Be a Scientific Theory of Deep Learning"

Alignment Forum · LawrenceC · 2026-04-25

(No summary yet for this item — extraction summaries are still backfilling.)

Open original ↗

Appears in

Deep Learning Theory Is Broken — And Maybe Unfixable

Extraction

Topics: deep-learning-theorylearning-mechanicsmechanistic-interpretabilityml-theory-researchtraining-dynamics

Claims

Simon et al. propose 'learning mechanics' as an emerging theory of deep learning focused on training dynamics, coarse aggregate statistics, and average-case predictions rather than worst-case bounds.
The main practical output of learning mechanics research to date has been hyperparameter scaling techniques like mu-parameterization, with little demonstrated direct utility for LLM engineering.
The paper makes a stronger case that some theory will exist than that such a theory will be broadly useful or comprehensive enough to qualify as a complete 'theory of deep learning.'
Learning mechanics explicitly does not aim to explain the specific algorithms learned by particular networks, making it insufficient as a comprehensive theory of deep learning.
The paper serves primarily as a manifesto and entry point for researchers new to deep learning theory rather than a neutral assessment of evidence.

Key quotes

Learning mechanics is a theory that concerns itself with 'the dynamics of the training process', studies them using 'coarse aggregate statistics of learning', and has the goal to generate 'accurate average-case predictions'.

The main use of learning mechanics research so far has been in producing new learning mechanics research to retrodict known empirical phenomena; learning dynamics as a field has yielded little practical fruit.

Maybe there will be a scientific theory of deep learning. Maybe learning mechanics will become a theory covering some important aspects of deep learning... But I don't think the paper has convinced me about these claims.