Deep Learning Theory Is Broken — And Maybe Unfixable · history

Version 9

2026-05-02 05:19 UTC · 280 items

Narrative

The most consequential update this cycle is a status upgrade: 'Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training' is now confirmed as a NeurIPS 2025 Best Paper[1], not the poster tracked in the prior cycle[2][3]. Best Paper designation at NeurIPS represents explicit program-committee endorsement as among the conference's most important contributions — elevating the implicit dynamical regularization strand above the NeurIPS 2025 Oral status of Montanari-Urbani in raw institutional recognition. A field supposedly incapable of explaining generalization has now produced a Best Paper-level mechanistic account of memorization suppression through training dynamics. An updated arxiv preprint (2505.17638)[4] marks the camera-ready version, with coverage appearing on YouTube[5], Substack[6], and Medium[7], and an ENS-Lyon seminar[8] documenting the result reaching European physics venues — paralleling how the Montanari-Urbani result reached the Simons Institute[9] and Stanford Mathematics[10].

The ICML 2026 Mechanistic Interpretability Workshop has acquired a named organizational identity: Neel Nanda (DeepMind) publicly announced the workshop on LinkedIn[11], with supporting infrastructure confirmed across a Buttondown newsletter[12], an official schedule page[13], an OpenReview submission portal[14], and the ICML Blog[15]. Neel Nanda's public advocacy matters structurally: he straddles safety/alignment communities and academic ML, ensuring cross-community amplification that the generalization theory program has not cultivated. The workshop infrastructure is now sufficiently developed that mechanistic interpretability is operating with the organizational apparatus of an established ML sub-field rather than a satellite of alignment research, closing the distance between its institutional standing and that of classical theory venues.

Three new theoretical entries extend existing strands. 'Grokking Beyond the Euclidean Norm of Model Parameters' (ICML 2025 poster)[16] challenges the L2-norm-based account of the memorization-to-generalization transition, implying the phase transition involves dynamics not fully captured by weight norm trajectories — sitting in unresolved tension with the dimensional phase transition framing[17] and the effective theory account[18]. 'Generative Modeling of Weights: Generalization or Memorization?'[19] opens a new subdomain: whether generative models of neural network weight distributions can generalize or merely memorize training architectures, extending the memorization debate to a meta-level Feldman's original formulation did not anticipate. Two NTK-focused papers now enter: 'Disentangling Feature and Lazy Training in Deep Neural Networks'[20] and 'Reactivation: Empirical NTK Dynamics Under Task Shifts'[21] both probe the NTK-regime vs. feature-learning boundary, directly engaging the axis that learning mechanics treats as its central diagnostic. The u-μP paper[22], now indexed in this thread, refines μP with unit scaling, partially answering LawrenceC's critique that learning mechanics has delivered only μP as a practical artifact.

The thread's arc is sharpening into a contrast between programs with divergent institutional trajectories. The dynamical regularization strand has now accumulated a NeurIPS 2025 Best Paper (diffusion models)[1], a NeurIPS 2025 Oral (Montanari-Urbani)[23], and an ICML 2023 paper (Wu et al.)[24] — a concentration of tier-1 recognition with no parallel in the generalization bounds program over the same period. Mechanistic interpretability has institutional infrastructure but addresses a different question. The April 2026 preprint 'There Will Be a Scientific Theory of Deep Learning'[25], now actively discussed in LessWrong[26] and Reddit[27], argues the dynamical approach can become a complete theory — a claim that the Best Paper result now gives additional empirical warrant. Meanwhile, grokking's fragmentation into norm-based[16], dimensional[17], effective-theory[18], and compositional[28] accounts suggests that 'delayed generalization' may itself name multiple distinct phenomena rather than a single transition, complicating any unified theoretical treatment and implying that the grokking sub-debate is approaching the same terminological crisis that the memorization debate reached when 'memorization' began naming a family of distinct phenomena.

Timeline

2016-01-01: Zhang et al. demonstrate that standard neural networks can memorize completely random labels on CIFAR-10 and ImageNet, invalidating data-independent generalization bounds. [30]
2019-01-01: Nagarajan and Kolter show empirically that spectral-norm bounds scale in the wrong direction, and prove formally in an overparameterized linear setting that uniform convergence is provably insufficient to explain gradient descent generalization. [29][123][124][133][134]
2019-06-01: Feldman publishes 'Does Learning Require Memorization? A Short Tale about a Long Tail,' arguing memorization of tail examples is causally necessary for learning from long-tailed distributions. [60][68][135][69][70][71][72][81]
2020-07-01: Negrea et al. publish 'In Defense of Uniform Convergence' at ICML 2020, arguing that uniform convergence can be partially recovered through derandomization applied to interpolating classifiers — a counterpoint to Nagarajan-Kolter previously untracked in this synthesis. [85][86][87]
2020-11-01: 'Disentangling Feature and Lazy Training in Deep Neural Networks' provides a formal framework for distinguishing when networks operate in the NTK (lazy) regime versus learning new representations (feature learning), directly engaging the axis that learning mechanics treats as the central diagnostic for whether classical theory applies. [20]
2021-01-01: Yang et al. establish an exact algebraic gap between generalization error and the tightest possible uniform convergence bound in random feature models, giving Nagarajan-Kolter a precise formal complement. [88][89]
2021-01-01: ACM Communications publishes the canonical journal version of Zhang et al.'s rethinking-generalization result, cementing it as a textbook-permanent finding rather than a contested empirical claim. [136]
2022-01-01: A NeurIPS 2022 paper claims PAC-Bayes compression bounds can be made tight enough to actually explain generalization in neural networks, directly challenging the narrative that all known bounds are vacuous. [96][97][98]
2022-01-01: NeurIPS 2022 paper 'Towards Understanding Grokking: An Effective Theory of Representation Learning' provides an effective-theory account of delayed generalization, framing grokking as a consequence of representation learning dynamics. [18]
2023-07-01: Wu et al. publish 'The Implicit Regularization of Dynamical Stability in SGD' at ICML 2023, showing that SGD's dynamical stability provides an implicit regularizer that suppresses memorization — bridging the algorithmic stability and implicit regularization research strands. [24][90]
2024-01-01: NeurIPS 2024 paper on symmetries in overparameterized neural networks using a mean-field view offers a new structural lens on why overparameterization does not prevent generalization. [137][138]
2024-07-01: u-μP: The Unit-Scaled Maximal Update Parametrization (arxiv 2407.17465) refines the μP framework by incorporating unit scaling, representing continued active development of learning mechanics' main practical engineering artifact. [22]
2025-01-01: CMU PhD blog post argues classical generalization theory is more predictive for foundation models than for conventional deep networks, implying the theory-failure narrative may not apply uniformly across architectures and scales. [139]
2025-01-01: Urbani delivers a seminar at the Stanford Mathematics department on generalization in two-layer neural networks, confirming the Montanari-Urbani result is circulating in pure-math venues beyond ML conferences. [10]
2025-01-01: ICLR 2025 paper 'When Memorization Hurts Generalization' argues memorization can actively damage generalization performance — a stronger claim than merely saying memorization is unnecessary. [64][82][83][84]
2025-01-01: NeurIPS 2025 Oral 'Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks' (Montanari and Urbani) presents results separating generalization dynamics from overfitting dynamics in the large two-layer regime; oral status confirmed, placing it in the top ~1-2% of NeurIPS 2025 submissions. [23][49][50][51][52][53][54][55][56][57][58][59]
2025-01-01: NeurIPS 2025 Best Paper 'Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training' attributes diffusion models' resistance to memorization to implicit dynamical regularization; Best Paper status (previously tracked as a poster) makes this the most institutionally recognized result in the training-dynamics cluster and the top-distinction result at NeurIPS 2025. [2][91][3][92][5][6][7][8][4][1][93][94][95]
2025-01-01: ICML 2025 poster 'Rethinking Benign Overfitting in Two-Layer Neural Networks' revisits the conditions under which interpolating classifiers can generalize in the two-layer setting. [125]
2025-01-01: ICLR 2025 poster 'Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data' extends the memorization-generalization debate to LLMs, examining how specific pretraining data drives capability versus memorization. [116][117][118][120]
2025-01-01: 'Necessary Memorization in Overparameterized Learning under Long-Tailed Mixture Models: Theory and Privacy Implications' provides formal theoretical support for Feldman's necessity claim in a specific mathematical setting, and introduces differential privacy leakage as a downstream consequence of memorization. [73]
2025-01-01: NeurIPS 2025 poster 'Understanding the Evolution of the Neural Tangent Kernel' tracks how the NTK changes during training rather than treating it as a fixed linearization, engaging the 'lazy training vs. feature learning' distinction central to the learning mechanics critique of classical theory. [40]
2025-01-01: 'Reactivation: Empirical NTK Dynamics Under Task Shifts' empirically tracks NTK dynamics as tasks change, connecting the NTK evolution program to the continual learning setting and probing whether the feature/lazy training distinction persists across task boundaries. [21]
2025-01-01: 'Generative Modeling of Weights: Generalization or Memorization?' opens a new subdomain within the memorization debate: whether generative models of neural network weight distributions can generalize to unseen architectures or merely memorize training-set weight configurations. [19]
2025-02-19: Pierfrancesco Urbani (CNRS) delivers an invited talk at the Simons Institute (Berkeley) on 'Generalization and overfitting in two-layer neural networks,' extending the dynamical decoupling result's reach to a major theory venue. [9]
2025-07-01: 'Grokking Beyond the Euclidean Norm of Model Parameters' (ICML 2025 poster) challenges the L2-norm-based account of the memorization-to-generalization transition in grokking, suggesting the phase transition involves dynamics not fully captured by weight norm trajectories and leaving the norm-based, dimensional, and effective-theory accounts of grokking unreconciled. [16]
2025-10-01: 'Memorizing Long-tail Data Can Help Generalization Through Composition' (arxiv 2510.16322) proposes that the mechanism by which tail memorization aids generalization is compositional rather than coverage-based, offering a mechanistic refinement of Feldman's thesis. [28][74]
2026-01-01: MIT names mechanistic interpretability as its 2026 Breakthrough of the Year, institutionally recognizing circuit-level understanding as a viable alternative program to statistical generalization theory for understanding deep learning. [107][111][112][140]
2026-01-01: A dedicated ICML 2026 workshop on mechanistic interpretability is announced by Neel Nanda (DeepMind) on LinkedIn, with infrastructure confirmed via Buttondown newsletter, official schedule page, OpenReview submission portal, and ICML Blog; and an 'Open problems in mechanistic interpretability: 2026 status report' is published as a GitHub gist — signaling the field's transition to self-organizing research infrastructure with named organizational advocates. [113][114][12][13][11][14][141][142][15][143][144]
2026-03-01: MIT News covers research on improving AI models' ability to explain their predictions, marking science-communication investment in interpretability research that the statistical generalization theory program has not received. [115]
2026-04-01: 'Grokking as Dimensional Phase Transition in Neural Networks' (arxiv 2604.04655) introduces a new theoretical framework for delayed generalization, framing the memorization-to-generalization transition as a phase transition in the dimensionality of learned representations. [17]
2026-04-01: 'Training Data Pruning Improves Memorization of Facts' (arxiv 2604.08519) presents a counterintuitive April 2026 finding: selective reduction of training data can improve how well models memorize individual facts, suggesting data redundancy may suppress rather than reinforce fact memorization. [75][145][146][147]
2026-04-24: Jamie Simon and Daniel Kunin (UC Berkeley) appear on Imbue's podcast arguing that a scientific theory of deep learning is achievable, marking the first direct public advocacy by the learning mechanics authors. [36][41]
2026-04-25: LawrenceC publishes a critical review of Simon et al.'s 'learning mechanics' manifesto on the Alignment Forum, welcoming its ambition while doubting it will deliver a comprehensive or broadly useful theory. [31]
2026-04-26: LawrenceC publishes 'The paper that killed deep learning theory,' providing detailed technical and historical context for why Zhang et al. 2016 was so devastating to the classical generalization-bound paradigm. [30][33]
2026-04-27: LawrenceC publishes 'The other paper that killed deep learning theory,' narrating Nagarajan and Kolter 2019 as the definitive proof that uniform convergence cannot explain neural network generalization; the post is crossposted to LessWrong and drives renewed interest in the original paper. [29][32][34]
2026-04-28: 'There Will Be a Scientific Theory of Deep Learning' (arxiv 2604.21691) appears as a late-April 2026 preprint offering the first formal academic-paper-level optimistic response to the thread's founding pessimism, its title directly inverting LawrenceC's framing. [25][43]
2026-04-30: At least three distinct OpenReview submissions titled 'Is Memorization Actually Necessary for Generalization?' appear, representing independent formal challenges to Feldman's affirmative claim. [61][62][63][65][66]
2026-05-01: A LessWrong 'Quick Paper Review' of 'There Will Be a Scientific Theory of Deep Learning' appears on the same platform as LawrenceC's original critique posts, and a Reddit r/MachineLearning thread opens discussion of the preprint — marking its transition from indexed to actively engaged within the communities that originally amplified the pessimistic framing. [26][27]

Perspectives

LawrenceC (Alignment Forum / LessWrong) — confirmed as Lawrence Chan

Classical deep learning theory was irreparably broken by two landmark papers (Zhang et al. 2016; Nagarajan & Kolter 2019). The proposed replacement, learning mechanics, is a promising manifesto but has so far produced little practical fruit beyond hyperparameter scaling (μP), explicitly does not aim to explain the specific algorithms learned by networks, and has not yet earned the title of a comprehensive theory of deep learning.

Evolution: Consistent across all three posts. The NeurIPS 2025 Best Paper for the diffusion models result and u-μP's active development partially address his specific critique about μP being learning mechanics' only practical output, but his broader skepticism about a comprehensive theory remains unanswered.

[29][30][31][32][33][34][35]

Jamie Simon and Daniel Kunin (UC Berkeley / learning mechanics)

A scientific theory of deep learning is achievable; their 'learning mechanics' framework, grounded in average-case training dynamics and aggregate statistics, is the right approach. Publicly promoted via Imbue podcast (April 24, 2026), YouTube presentation, and ongoing Twitter activity.

Evolution: Consistent; no direct public response to LawrenceC's critique yet apparent. The NeurIPS 2025 Best Paper for implicit dynamical regularization and the NTK evolution/disentangling papers both converge on the feature-learning vs. lazy-training axis that learning mechanics treats as central, providing external validation of the framework's diagnostic focus.

[36][37][38][39][40][41]

'There Will Be a Scientific Theory of Deep Learning' authors (April 2026 preprint)

A scientific theory of deep learning is achievable — the title directly inverts the pessimistic framing of LawrenceC's series. The formal academic-paper format marks this as the first non-blog-post optimistic response in this cycle.

Evolution: Previously not yet receiving indexed commentary. Prior cycle: a LessWrong 'Quick Paper Review' and a Reddit r/MachineLearning thread confirmed active community engagement. This cycle: no new commentary, but the NeurIPS 2025 Best Paper for the dynamical regularization result provides indirect empirical support for their thesis that a scientific theory is achievable.

[25][42][43][44][45][46][47][26][27]

Independent Medium commentary ('second formation' framing)

Learning mechanics represents a genuine paradigm transition — a 'second formation' of deep learning theory analogous to statistical mechanics succeeding classical mechanics — not merely an incremental research program.

Evolution: Consistent from prior cycle. More optimistic than Simon and Kunin's own public claims and directly contradicts LawrenceC's skepticism about practical utility.

[48]

Montanari and Urbani (dynamical decoupling)

In the large two-layer network regime, generalization dynamics and overfitting dynamics are separable — a result that, if it extends to deeper networks, would provide an algorithm-dependent structural account of why networks generalize despite interpolating training data.

Evolution: Consistent from prior cycles. NeurIPS 2025 Oral status and cross-disciplinary reach (Simons Institute, Stanford Mathematics) confirmed in prior cycle; no new developments this cycle.

[23][49][50][51][52][53][54][9][55][56][57][58][59][10]

Vitaly Feldman (memorization is necessary)

Memorization of tail examples is causally necessary for learning from long-tailed distributions. This reframes Zhang et al.'s result: memorization is part of learning, not evidence that theory is broken.

Evolution: Previously under challenge from multiple directions. This cycle: 'Generative Modeling of Weights: Generalization or Memorization?' extends the memorization debate to a meta-level he did not anticipate, while his Google Scholar profile and homepage are now indexed in this thread — indicating his work is being actively resurveyed as a reference point rather than treated as settled.

[60][61][62][63][64][65][66][67][68][69][70][71][72][73][28][74][75][76][77][78][79][80][81]

ICLR 2025 'When Memorization Hurts Generalization' authors

Memorization is not merely unnecessary for generalization but can actively damage it — the strongest anti-Feldman position yet to appear in a peer-reviewed venue.

Evolution: Consistent from prior cycle.

[64][82][83][84]

Negrea et al. (defense of uniform convergence via derandomization)

Uniform convergence arguments can be partially recovered through derandomization techniques applied to interpolating classifiers, suggesting the Nagarajan-Kolter impossibility is not a blanket closure of the uniform convergence program.

Evolution: Consistent from prior cycle. Whether this result engages or sidesteps Yang et al.'s exact gap result remains unresolved.

[85][86][87]

Yang et al. 2021 (exact gap, uniform convergence)

The failure of uniform convergence in random feature models is not an artifact of loose bounds but is provably exact — there is a measurable algebraic gap between any uniform convergence bound and the true generalization error, even in principle.

Evolution: Consistent from prior cycle. The Negrea et al. defense creates an unresolved question about whether the exact gap forecloses derandomized arguments.

[88][89]

Wu et al. ICML 2023 (implicit regularization of dynamical stability, SGD)

SGD's dynamical stability provides an implicit regularizer that suppresses memorization and promotes generalization — a mechanistic account connecting training-dynamics arguments to the diffusion models result and the dynamical decoupling program.

Evolution: Consistent from prior cycle. Now sits as the middle tier of a three-tier training-dynamics cluster (ICML 2023 paper < NeurIPS 2025 Oral < NeurIPS 2025 Best Paper) that has collectively accumulated more top-venue distinction than the generalization bounds program in 2023–2025.

[24][90]

NeurIPS 2025 diffusion model memorization researchers (Best Paper)

Implicit dynamical regularization during training prevents memorization in diffusion models — a training-dynamics explanation for a phenomenon that would otherwise require architectural or data-geometric accounts.

Evolution: Previously tracked as a NeurIPS 2025 poster. This cycle: confirmed as NeurIPS 2025 Best Paper[1], the most significant status upgrade in this synthesis round. ENS-Lyon seminar[8] extends its geographic reach to European physics venues. This is now the highest-distinction result in the thread's training-dynamics cluster.

[2][91][3][92][5][6][7][8][4][1][93][94][95]

PAC-Bayes compression bounds researchers (NeurIPS 2022)

PAC-Bayes bounds can be made sufficiently tight to actually explain generalization in neural networks — the vacuousness of prior bounds was not a fundamental limit of the framework but an artifact of loose construction.

Evolution: Consistent; no direct engagement with Yang et al.'s exact gap result or Negrea et al.'s derandomization argument yet apparent.

[96][97][98]

Algorithmic stability / SGD stability researchers

Generalization in deep learning can be explained through the stability properties of SGD. This approach is inherently algorithm- and data-dependent, directly addressing the Nagarajan-Kolter critique that uniform convergence ignores gradient descent's inductive bias.

Evolution: Consistent; Wu et al. ICML 2023 bridges this cluster to the implicit regularization and diffusion memorization strands, providing a shared mechanistic account.

[99][100][101][102][103][104][105][106][24]

Mechanistic interpretability community (MIT 2026 Breakthrough, Neel Nanda)

Understanding deep learning through circuit-level mechanistic analysis of what algorithms individual networks implement is a viable — and now institutionally recognized — alternative to statistical generalization theory.

Evolution: Previously anchored by MIT's Breakthrough designation, an ACM survey, and an announced workshop. This cycle: Neel Nanda (DeepMind) publicly named as organizer[11], giving the workshop a prominent advocate who spans safety, alignment, and academic ML communities. The workshop now has a complete organizational apparatus (Buttondown newsletter, schedule page, OpenReview portal) that signals a transition from an announced event to an active community institution.

[107][108][109][110][111][112][113][114][115][12][13][11][14][15]

ICLR 2025 LLM memorization researchers

Language models' generalization capabilities can be traced back to specific pretraining data, implying a causal data-memorization link that extends Feldman's long-tail thesis to the LLM regime.

Evolution: Consistent from prior cycle.

[116][117][118][119][120]

NTK evolution researchers

The Neural Tangent Kernel is not static during training — understanding how it evolves during training connects the classical NTK linearization program to the feature learning regime that learning mechanics treats as operative in practice.

Evolution: Previously entered via a NeurIPS 2025 poster tracking NTK evolution (item 4121). This cycle: 'Disentangling Feature and Lazy Training in Deep Neural Networks'[20] provides a formal framework for the feature/lazy boundary, and 'Reactivation: Empirical NTK Dynamics Under Task Shifts'[21] extends NTK dynamics empirically to task-shift settings — connecting this strand to both the continual learning memorization debate and the learning mechanics feature-learning diagnostic.

[40][20][21][121][122]

ICML 2025 grokking norm-critique researchers

The memorization-to-generalization transition in grokking is not fully explained by Euclidean weight norm dynamics, suggesting existing norm-based theoretical accounts are incomplete.

Evolution: New voice this cycle, entering via the ICML 2025 poster 'Grokking Beyond the Euclidean Norm.' Challenges the norm-based account without yet proposing an alternative, leaving the grokking sub-debate with at least four competing but unreconciled frameworks.

[16]

Tensions

Can learning mechanics, which focuses on average-case training dynamics and coarse aggregate statistics, ever constitute a comprehensive theory of deep learning — or is it structurally limited to explaining some aspects while leaving others permanently outside its scope? The April 2026 preprint 'There Will Be a Scientific Theory of Deep Learning' argues the question is not closed and now has active community engagement in the LessWrong venue where LawrenceC's pessimism originated. The NeurIPS 2025 Best Paper for implicit dynamical regularization provides indirect empirical warrant for the learning mechanics framing, while u-μP's active development partially addresses the 'only produced μP' critique. Whether the NTK evolution and disentangling programs constitute supporting evidence for learning mechanics' feature-learning framing or constitute separate theoretical lines remains unresolved. [31][36][37][38][39][48][25][26][27][40][20][21][22]
Is memorization causally necessary, causally harmful, or merely correlated with generalization — and does the answer change across training epochs, architectures, task settings, and data regimes? Grokking research shows networks can memorize first and generalize later. Feldman's necessity claim is being formalized under long-tailed mixture models with a compositional mechanism account, while 'When Memorization Hurts Generalization' claims the opposite. Training data pruning can improve memorization of individual facts, suggesting data redundancy may suppress rather than reinforce it. The continual learning extension raises whether necessity holds across task boundaries. 'Generative Modeling of Weights: Generalization or Memorization?' extends the debate to weight-distribution generative models. These positions may not be contradictory if memorization's effects depend on training epoch, regime, and task structure. [60][61][62][63][64][65][66][82][83][84][73][28][74][18][17][75][76][19]
Yang et al. 2021 establish an exact algebraic gap between generalization error and uniform convergence in random feature models. Negrea et al.'s ICML 2020 'In Defense of Uniform Convergence' argues the framework can be salvaged through derandomization applied to interpolating classifiers. Do derandomized uniform convergence arguments fall within the class foreclosed by the exact gap result, or do they constitute a genuine escape hatch that keeps the classical program viable? [29][88][89][123][124][85][86][87]
The NeurIPS 2025 dynamical decoupling result establishes that generalization and overfitting are separable phenomena in large two-layer networks. Does this separation persist in deeper networks and in transformer architectures, and if so, does it support or complicate the learning mechanics program's focus on aggregate training dynamics? [23][49][51][54][9][55][56][57][58][10]
If implicit dynamical regularization explains why diffusion models don't memorize (NeurIPS 2025 Best Paper), and Wu et al.'s SGD dynamical stability result provides a parallel mechanism, is there a unified dynamical account of memorization suppression across architectures? Or does the memorization debate remain architecture-regime-specific, with separate mechanisms governing transformers, diffusion models, and two-layer networks? The 'Reactivation' paper's empirical tracking of NTK dynamics under task shifts raises whether the same dynamical regularization account holds when tasks shift. [2][91][3][92][4][1][24][119][116][125][21]
PAC-Bayes compression bounds are claimed to be tight enough to explain generalization, while Yang et al.'s exact gap result says uniform convergence-style arguments are provably incapable of capturing the correct quantity. Are PAC-Bayes compression bounds a genuine escape hatch from the Nagarajan-Kolter impossibility, or do they fall within the class of arguments the exact gap result forecloses? [96][97][126][98][88][89]
Benign overfitting results show that interpolating classifiers can generalize under certain geometric conditions. ICML 2025's 'Rethinking Benign Overfitting in Two-Layer Neural Networks' revisits these conditions — do they hold broadly enough to constitute a useful theory, or do they require fine-tuned assumptions that fail in realistic multi-layer, non-linear settings? [127][128][129][130][125][131][132]
Is mechanistic interpretability a practical alternative to statistical generalization theory for understanding deep learning, or does its circuit-level focus simply answer a different question? Neel Nanda's public announcement and the workshop's full organizational apparatus have given mechanistic interpretability a named advocate and institutional momentum that statistical generalization theory lacks — but institutional success is not the same as theoretical displacement, and the two programs have not yet engaged each other directly. [107][108][109][110][31][36][113][114][111][11][15]
The grokking sub-debate now has at least four competing accounts — norm-based, dimensional phase transition, effective theory of representation learning, and compositional mechanism — that have not been formally reconciled. 'Grokking Beyond the Euclidean Norm' challenges the norm account without proposing a replacement. Is grokking a single phenomenon admitting a unified theory, or a family of distinct delayed-generalization transitions that happen to share a surface phenomenology? [16][17][18][28][74]

Sources

[1] Best paper NeurIPS 2025: 𝗪𝗵𝘆 𝗗𝗶𝗳𝗳𝘂𝘀𝗶𝗼𝗻 𝗠𝗼𝗱𝗲𝗹𝘀 𝗗𝗼𝗻’𝘁 𝗠𝗲𝗺𝗼𝗿𝗶𝘇𝗲: 𝗧𝗵𝗲 𝗥𝗼𝗹𝗲 𝗼𝗳 𝗜𝗺𝗽𝗹𝗶𝗰𝗶𝘁 𝗗𝘆𝗻𝗮𝗺𝗶𝗰𝗮𝗹 𝗥𝗲𝗴𝘂𝗹𝗮𝗿𝗶𝘇𝗮𝘁𝗶𝗼𝗻 𝗶𝗻… | Charles H. Martin, PhD — reactive:deep-learning-theory-limits
[2] Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training | OpenReview — reactive:deep-learning-theory-limits
[3] Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training | OpenReview — reactive:deep-learning-theory-limits
[4] [2505.17638] Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training — reactive:deep-learning-theory-limits
[5] The Role of Implicit Dynamical Regularization in Training - YouTube — reactive:deep-learning-theory-limits
[6] [NeurIPS 2025] Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training — reactive:deep-learning-theory-limits
[7] Why Diffusion Models Generalize Instead of Just Memorizing - Medium — reactive:deep-learning-theory-limits
[8] Why Diffusion Models Don't Memorize: The Role of Implicit ... — reactive:deep-learning-theory-limits
[9] Generalization and overfitting in two-layer neural networks — reactive:deep-learning-theory-limits
[10] Generalization and overfitting in two-layer neural networks — reactive:deep-learning-theory-limits
[11] Neel Nanda 's Post - LinkedIn — reactive:deep-learning-theory-limits
[12] Mech Interp Workshop @ ICML 2026 - Buttondown — reactive:deep-learning-theory-limits
[13] Schedule — reactive:deep-learning-theory-limits
[14] ICML 2026 Workshop Mech Interp - OpenReview — reactive:deep-learning-theory-limits
[15] Announcing the ICML 2026 Workshops and Affinity Workshops – ICML Blog — reactive:deep-learning-theory-limits
[16] Grokking Beyond the Euclidean Norm of Model Parameters — reactive:deep-learning-theory-limits
[17] Grokking as Dimensional Phase Transition in Neural Networks — reactive:deep-learning-theory-limits
[18] [PDF] Towards Understanding Grokking: An Effective Theory of ... — reactive:deep-learning-theory-limits
[19] Generative Modeling of Weights: Generalization or Memorization? — reactive:deep-learning-theory-limits
[20] Disentangling feature and lazy training in deep neural networks — reactive:deep-learning-theory-limits
[21] Reactivation: Empirical NTK Dynamics Under Task Shifts — reactive:deep-learning-theory-limits
[22] u-$μ$P: The Unit-Scaled Maximal Update Parametrization - arXiv — reactive:deep-learning-theory-limits
[23] NeurIPS Poster Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks — reactive:deep-learning-theory-limits
[24] The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent — reactive:deep-learning-theory-limits
[25] There Will Be a Scientific Theory of Deep Learning — reactive:deep-learning-theory-limits
[26] Quick Paper Review: "There Will Be a Scientific Theory of Deep Learning" — LessWrong — reactive:deep-learning-theory-limits
[27] There Will Be a Scientific Theory of Deep Learning [R] - Reddit — reactive:deep-learning-theory-limits
[28] Memorizing Long-tail Data Can Help Generalization Through ... — reactive:deep-learning-theory-limits
[29] The other paper that killed deep learning theory — Alignment Forum (2026-04-27)
[30] The paper that killed deep learning theory — Alignment Forum (2026-04-26)
[31] Quick Paper Review: "There Will Be a Scientific Theory of Deep Learning" — Alignment Forum (2026-04-25)
[32] The other paper that killed deep learning theory — LessWrong — reactive:deep-learning-theory-limits
[33] The paper that killed deep learning theory — AI Alignment Forum — reactive:deep-learning-theory-limits
[34] The other paper that killed deep learning theory — AI Alignment Forum — reactive:deep-learning-theory-limits
[35] Lawrence Chan — reactive:deep-learning-theory-limits
[36] Jamie Simon and Daniel Kunin, UC Berkeley: There Will Be a Scientific Theory of Deep Learning - imbue — reactive:deep-learning-theory-limits
[37] There Will Be a Scientific Theory of Deep Learning (Apr 2026) — reactive:deep-learning-theory-limits
[38] Quick Paper Review: "There Will Be a Scientific Theory of Deep ... — reactive:deep-learning-theory-limits
[39] Daniel Kunin — reactive:deep-learning-theory-limits
[40] Understanding the Evolution of the Neural Tangent Kernel at the ... — reactive:deep-learning-theory-limits
[41] There Will Be a Scientific Theory of Deep Learning - Imbue — reactive:deep-learning-theory-limits
[42] There Will Be a Scientific Theory of Deep Learning | Cool Papers — reactive:deep-learning-theory-limits
[43] [2604.21691] There Will Be a Scientific Theory of Deep Learning — reactive:deep-learning-theory-limits
[44] There Will Be a Scientific Theory of Deep Learning | alphaXiv — reactive:deep-learning-theory-limits
[45] There Will Be a Scientific Theory of Deep Learning | Takara TLDR — reactive:deep-learning-theory-limits
[46] There Will Be a Scientific Theory of Deep Learning - YouTube — reactive:deep-learning-theory-limits
[47] There Will Be a Scientific Theory of Deep Learning — reactive:deep-learning-theory-limits
[48] Learning Mechanics and the Second Formation of Deep Learning ... — reactive:deep-learning-theory-limits
[49] Dynamical Decoupling of Generalization and Overfitting in Large ... — reactive:deep-learning-theory-limits
[50] Dynamical Decoupling of Generalization and Overfitting in Large ... — reactive:deep-learning-theory-limits
[51] Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks | OpenReview — reactive:deep-learning-theory-limits
[52] Dynamical Decoupling of Generalization and Overfitting in Large ... — reactive:deep-learning-theory-limits
[53] [PDF] Dynamical Decoupling of Generalization and Overfitting in Large ... — reactive:deep-learning-theory-limits
[54] Generalization and overfitting in two-layer neural networks - YouTube — reactive:deep-learning-theory-limits
[55] NeurIPS Oral Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks — reactive:deep-learning-theory-limits
[56] A Dynamical Theory of Overfitting and Generalization in Large Two-Layer Networks — reactive:deep-learning-theory-limits
[57] Dynamical Decoupling of Generalization and Overfitting in Large ... — reactive:deep-learning-theory-limits
[58] [PDF] Dynamical decoupling of generalization and overfitting in large two ... — reactive:deep-learning-theory-limits
[59] Dynamical Decoupling of Generalization and Overfitting in Large... — reactive:deep-learning-theory-limits
[60] Does learning require memorization? a short tale about a long tail — reactive:deep-learning-theory-limits
[61] Is Memorization Actually Necessary for Generalization? - OpenReview — reactive:deep-learning-theory-limits
[62] Is Memorization Actually Necessary for Generalization - OpenReview — reactive:deep-learning-theory-limits
[63] Is Memorization Actually Necessary for Generalization? - OpenReview — reactive:deep-learning-theory-limits
[64] [PDF] WHEN MEMORIZATION HURTS GENERALIZATION — reactive:deep-learning-theory-limits
[65] [PDF] IS MEMORIZATION Actually NECESSARY FOR GENER- ALIZATION? — reactive:deep-learning-theory-limits
[66] [PDF] IS MEMORIZATION Actually NECESSARY FOR GENER- ALIZATION? — reactive:deep-learning-theory-limits
[67] Does learning require memorization? A short tale about a long tail — reactive:deep-learning-theory-limits
[68] Does Learning Require Memorization? A Short Tale about a Long Tail — reactive:deep-learning-theory-limits
[69] [PDF] Chasing the Long Tail: What Neural Networks Memorize and Why — reactive:deep-learning-theory-limits
[70] What Neural Networks Memorize and Why - Chiyuan Zhang — reactive:deep-learning-theory-limits
[71] What Neural Networks Memorize and Why: Discovering the Long ... — reactive:deep-learning-theory-limits
[72] Does Learning Require Memorization? A Short Tale About A Long Tail — reactive:deep-learning-theory-limits
[73] (PDF) Necessary Memorization in Overparameterized Learning ... — reactive:deep-learning-theory-limits
[74] Memorizing Long-tail Data Can Help Generalization Through Composition — reactive:deep-learning-theory-limits
[75] [PDF] Training Data Pruning Improves Memorization of Facts - arXiv — reactive:deep-learning-theory-limits
[76] What is the role of memorization in Continual Learning? | OpenReview — reactive:deep-learning-theory-limits
[77] ‪Vitaly Feldman‬ - ‪Google Scholar‬ — reactive:deep-learning-theory-limits
[78] What Neural Networks Memorize and Why (Vitaly Feldman) - NeurIPS — reactive:deep-learning-theory-limits
[79] Vitaly Feldman's personal homepage — reactive:deep-learning-theory-limits
[80] ‪Vitaly Feldman‬ - ‪Google Scholar‬ — reactive:deep-learning-theory-limits
[81] [PDF] Does Learning Require Memorization? A Short Tale about a Long Tail — reactive:deep-learning-theory-limits
[82] The Pitfalls of Memorization: When Memorization Hurts Generalization — reactive:deep-learning-theory-limits
[83] The Pitfalls of Memorization: When Memorization Hurts Generalization — reactive:deep-learning-theory-limits
[84] The Pitfalls of Memorization: When Memorization Hurts Generalization — reactive:deep-learning-theory-limits
[85] [PDF] In Defense of Uniform Convergence: Generalization via ... — reactive:deep-learning-theory-limits
[86] [1912.04265] In Defense of Uniform Convergence - arXiv — reactive:deep-learning-theory-limits
[87] In Defense of Uniform Convergence: Generalization via Derandomization with an Application to Interpolating Predictors — reactive:deep-learning-theory-limits
[88] [PDF] Exact Gap between Generalization Error and Uniform Convergence ... — reactive:deep-learning-theory-limits
[89] [2103.04554] Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models — reactive:deep-learning-theory-limits
[90] [Quick Review] The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent — reactive:deep-learning-theory-limits
[91] NeurIPS Poster Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training — reactive:deep-learning-theory-limits
[92] Why diffusion models in generative AI don’t memorize: The role of implicit dynamical regularization in training | LPENS — reactive:deep-learning-theory-limits
[93] (PDF) Why Diffusion Models Don't Memorize: The Role of Implicit ... — reactive:deep-learning-theory-limits
[94] [PDF] Why Diffusion Models Don't Memorize: The Role of Implicit ... — reactive:deep-learning-theory-limits
[95] [PDF] Why Diffusion Models Don't Memorize: The Role of Implicit ... — reactive:deep-learning-theory-limits
[96] PAC-Bayes Compression Bounds So Tight That They Can Explain... — reactive:deep-learning-theory-limits
[97] PAC-Bayes Compression Bounds So Tight That They Can Explain ... — reactive:deep-learning-theory-limits
[98] [PDF] PAC-Bayes Compression Bounds So Tight That They Can Explain ... — reactive:deep-learning-theory-limits
[99] [PDF] Fine-Grained Analysis of Stability and Generalization for Stochastic ... — reactive:deep-learning-theory-limits
[100] Data-Dependent Stability of Stochastic Gradient Descent — reactive:deep-learning-theory-limits
[101] Fine-Grained Analysis of Stability and Generalization for Stochastic ... — reactive:deep-learning-theory-limits
[102] [PDF] Stability and Generalization for Markov Chain Stochastic Gradient ... — reactive:deep-learning-theory-limits
[103] [2502.00885] Algorithmic Stability of Stochastic Gradient Descent with Momentum under Heavy-Tailed Noise — reactive:deep-learning-theory-limits
[104] Stability of Stochastic Gradient Descent on Nonsmooth Convex Losses — reactive:deep-learning-theory-limits
[105] [2602.22936] Generalization Bounds of Stochastic Gradient Descent ... — reactive:deep-learning-theory-limits
[106] Stability (learning theory) — reactive:deep-learning-theory-limits
[107] Mechanistic Interpretability Named MIT's 2026 Breakthrough for ... — reactive:deep-learning-theory-limits
[108] Bridging the Black Box: A Survey on Mechanistic Interpretability in AI — reactive:deep-learning-theory-limits
[109] Understanding Mechanistic Interpretability in AI Models - IntuitionLabs — reactive:deep-learning-theory-limits
[110] AI Safety, Alignment, and Interpretability in 2026 | Zylos Research — reactive:deep-learning-theory-limits
[111] Mechanistic interpretability: 10 Breakthrough Technologies 2026 | MIT Technology Review — reactive:deep-learning-theory-limits
[112] MIT Technology Review's Post - Mechanistic interpretability - LinkedIn — reactive:deep-learning-theory-limits
[113] Mechanistic Interpretability Workshop at ICML 2026 — reactive:deep-learning-theory-limits
[114] Open problems in mechanistic interpretability: 2026 status report - Gist — reactive:deep-learning-theory-limits
[115] Improving AI models' ability to explain their predictions | MIT News — reactive:deep-learning-theory-limits
[116] ICLR Poster Generalization v.s. Memorization: Tracing Language Models’ Capabilities Back to Pretraining Data — reactive:deep-learning-theory-limits
[117] Generalization v.s. Memorization: Tracing Language Models'... — reactive:deep-learning-theory-limits
[118] [PDF] generalization v.s. memorization - arXiv — reactive:deep-learning-theory-limits
[119] For Better or for Worse, Transformers Seek Patterns for Memorization | OpenReview — reactive:deep-learning-theory-limits
[120] [PDF] generalization v.s. memorization: tracing language models ... — reactive:deep-learning-theory-limits
[121] [PDF] Neural Tangent Kernel - Washington — reactive:deep-learning-theory-limits
[122] [1811.04918] Learning and Generalization in Overparameterized Neural Networks, Going Beyond Two Layers — reactive:deep-learning-theory-limits
[123] [1902.04742v2] Uniform convergence may be unable to explain generalization in deep learning — reactive:deep-learning-theory-limits
[124] [1902.04742] Uniform convergence may be unable to explain generalization in deep learning — reactive:deep-learning-theory-limits
[125] Rethinking Benign Overfitting in Two-Layer Neural Networks — reactive:deep-learning-theory-limits
[126] Still No Free Lunches: The Price to Pay for Tighter PAC-Bayes Bounds — reactive:deep-learning-theory-limits
[127] Towards an Understanding of Benign Overfitting in Neural Networks — reactive:deep-learning-theory-limits
[128] Rethinking Benign Overfitting in Two-Layer Neural Networks — reactive:deep-learning-theory-limits
[129] Benign Overfitting without Linearity: Neural Network Classifiers ... — reactive:deep-learning-theory-limits
[130] Benign Overfitting without Linearity: Neural Network Classifiers ... — reactive:deep-learning-theory-limits
[131] NeurIPS Benign Overfitting in Out-of-Distribution Generalization of Linear Models — reactive:deep-learning-theory-limits
[132] Towards an Understanding of Benign Overfitting in Neural Networks — reactive:deep-learning-theory-limits
[133] Uniform convergence may be unable to explain generalization in ... — reactive:deep-learning-theory-limits
[134] Uniform convergence may be unable to explain generalization in deep learning — reactive:deep-learning-theory-limits
[135] [1906.05271] Does Learning Require Memorization? A Short Tale about a Long Tail — reactive:deep-learning-theory-limits
[136] Understanding Deep Learning (Still) Requires Rethinking ... — reactive:deep-learning-theory-limits
[137] Symmetries in Overparametrized Neural Networks: A Mean Field View — reactive:deep-learning-theory-limits
[138] Symmetries in Overparametrized Neural Networks: A Mean Field View — reactive:deep-learning-theory-limits
[139] CMU CSD PhD Blog - Classical generalization theory is more predictive in foundation models than in conventional deep networks — reactive:deep-learning-theory-limits
[140] Mechanistic interpretability: 10 Breakthrough Technologies 2026 | MIT Technology Review — reactive:deep-learning-theory-limits
[141] Workshop on Mechanistic Interpretability - ICML 2026 — reactive:deep-learning-theory-limits
[142] ICML 2026 Schedule — reactive:deep-learning-theory-limits
[143] 2026 Conference — reactive:deep-learning-theory-limits
[144] ICML 2026 Workshops — reactive:deep-learning-theory-limits
[145] Prune Training Data to Maximize LLM Factual Memorization | Changecast — reactive:deep-learning-theory-limits
[146] Training Data Pruning Improves Memorization of Facts — reactive:deep-learning-theory-limits
[147] [PDF] TRAINING DATA PRUNING IMPROVES MEMORIZATION OF FACTS — reactive:deep-learning-theory-limits