Feature Learning and Generalization in Deep Networks with Orthogonal Weights Paper • 2310.07765 • Published Oct 11, 2023
The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26, 2024 • 82