Spring Seminar Series 2024 – Dr. René Vidal
April 19 • 11:00 am - 12:00 pm
Learning Dynamics and Implicit Bias of Gradient Flow in Overparameterized Linear Models
René Vidal, Ph.D.
Professor
University of Pennsylvania
Philadelphia, PA
Date:
04/19/2024
Time:
11:00 am – 12:00 pm CST
Location:
UTSA Main Campus, Student Union, Mesquite Room 2.01.24 (Second Floor)
Zoom: https://utsa.zoom.us/j/94807623288
Abstract:
Contrary to the common belief that overparameterization may hurt generalization and optimization, recent work suggests that overparameterization may bias the optimization algorithm towards solutions that generalize well, a phenomenon known as implicit regularization or implicit bias, and may also accelerate convergence, a phenomenon known as implicit acceleration. This talk will provide a detailed analysis of the dynamics of gradient flow in overparameterized two-layer linear models, showing that convergence to equilibrium depends on two quantities: the imbalance between input and output weights (which is fixed at initialization) and the margin of the initial solution. The talk will also provide an analysis of the implicit bias, showing that large hidden-layer width, together with (properly scaled) random initialization, constrains the network parameters to converge to a solution that is close to the min-norm solution.
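As a rough numerical illustration of the setting described in the abstract (not the speaker's code or analysis), the sketch below uses small-step gradient descent as a discrete stand-in for gradient flow on a two-layer linear model with squared loss. It tracks how far the imbalance `W1.T @ W1 - W2 @ W2.T` drifts from its initial value and how far the learned end-to-end map ends up from the min-norm least-squares solution. The function name `train_two_layer_linear`, the squared loss, the 1/sqrt(width) initialization scale, the problem sizes, and the step size are all assumptions chosen for illustration.

```python
import numpy as np


def train_two_layer_linear(X, Y, width, lr=1e-3, steps=5000, seed=0):
    """Gradient descent on 0.5 * ||X @ W1 @ W2 - Y||_F^2 (hypothetical helper).

    Small-step gradient descent is used here as an approximation of gradient flow.
    """
    rng = np.random.default_rng(seed)
    d, m = X.shape[1], Y.shape[1]
    # Assumed initialization scale: i.i.d. Gaussian entries divided by sqrt(width).
    W1 = rng.normal(size=(d, width)) / np.sqrt(width)
    W2 = rng.normal(size=(width, m)) / np.sqrt(width)

    # Imbalance between input and output weights at initialization.
    imbalance0 = W1.T @ W1 - W2 @ W2.T

    losses = []
    for _ in range(steps):
        E = X @ W1 @ W2 - Y              # residual of the end-to-end linear map
        losses.append(0.5 * np.sum(E ** 2))
        g1 = X.T @ E @ W2.T              # gradient with respect to W1
        g2 = W1.T @ X.T @ E              # gradient with respect to W2
        W1 = W1 - lr * g1
        W2 = W2 - lr * g2

    # Under exact gradient flow the imbalance stays constant; with a finite
    # step size it drifts only at second order in lr.
    drift = np.linalg.norm(W1.T @ W1 - W2 @ W2.T - imbalance0)
    return W1, W2, losses, drift


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(5, 10))         # fewer samples than features
    Y = rng.normal(size=(5, 1))
    W1, W2, losses, drift = train_two_layer_linear(X, Y, width=50)

    min_norm = np.linalg.pinv(X) @ Y     # min-norm least-squares solution
    print("initial loss:", losses[0])
    print("final loss:", losses[-1])
    print("imbalance drift:", drift)
    print("distance to min-norm solution:", np.linalg.norm(W1 @ W2 - min_norm))
```

Rerunning the script with different `width` values is one way to explore how the distance to the min-norm solution changes with hidden-layer width under this particular initialization; the sketch makes no claim about matching the rates or bounds presented in the talk.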