Computational/Statistical Gaps for Learning Neural Networks

Parent Program

Probability, Geometry, and Computation in High Dimensions

Location

Zoom link will be sent out to program participants.

Speaker(s)

Adam Klivans (University of Texas, Austin)

Date

Tuesday, Dec. 1, 2020

Time

11 a.m. – 12 p.m. PT

Back to calendar

Description

It has been known for decades that a polynomial-size training sample suffices for learning neural networks. Most theoretical results, however, indicate that these learning tasks are computationally intractable. Where exactly is the dividing line? In this talk we consider arguably the most well-studied scenario, namely when the marginal distribution on inputs is Gaussian, and show unconditionally that gradient descent cannot learn even one-layer neural networks. We then point to a potential way forward and sketch the first fixed-parameter tractable algorithm for learning deep ReLU networks. Its running time is polynomial in the ambient dimension and exponential in only the network's parameters. This is the first nontrivial positive result for networks of depth more than two.

All scheduled dates:

Upcoming

No Upcoming activities yet

Computational/Statistical Gaps for Learning Neural Networks

All scheduled dates:

Upcoming

Past