Neural Networks Provably Learn Spectral Representations for Group Composition
Abstract
Neural network training on group composition tasks exhibits convergence to irreducible representations and rotational rank-one alignment through Riemannian gradient ascent on representation-theoretic energy functionals.
Understanding how structured internal structure emerges during neural network training is central to the study of deep learning. We investigate this phenomenon through the group composition task, where a two-layer neural network is trained to predict g_1 star g_2 for elements of a finite group G. By lifting the projected gradient flow to the Fourier domain, we demonstrate that the training dynamics are governed by a Riemannian gradient ascent on a representation-theoretic energy functional. We prove that, under random initialization, this flow drives each neuron to converge almost surely toward a single irreducible representation, while the cross-layer Fourier coefficients achieve a rotational rank-one alignment. This framework provides a representation-theoretic account of feature learning and characterizes a novel low-rank compression phenomenon for matrix-valued group representations. Moreover, for Abelian groups, we provide a complete population-level description: random initialization promotes uniform diversification across nontrivial representations and induces Haar-uniform phases, jointly approximating the indicator via a majority-vote mechanism. We further prove that both phase alignment and representation competition emerge with exponential convergence rates.
Community
Demystify how neural network learn group composition from a representation-theoretical perspective.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Deep Learning as Neural Low-Degree Filtering: A Spectral Theory of Hierarchical Feature Learning (2026)
- Pointwise Generalization in Deep Neural Networks (2026)
- How does feature learning reshape the function space? (2026)
- When Both Layers Learn: Training Dynamics of Representing Linear Models via ReLU Networks (2026)
- The Weight Gram Matrix Captures Sequential Feature Linearization in Deep Networks (2026)
- Flag Varieties: A Geometric Framework for Deep Network Alignment (2026)
- Mildly Overparameterized ReLU Networks on Orthogonal Data: Incremental Learning and Implicit Bias (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper