Multilevel Method and Theory for Deep Learning
Tuesday, February 25, 2020
3:30PM – 5PM
In this talk, I will first give an elementary introduction to models and algorithms from two different fields: (1) machine learning, including logistic regression, support vector machine and deep neural networks, and (2) numerical PDEs, including finite element and multigrid methods. I will then explore mathematical relationships between these models and algorithms and demonstrate how such relationships can be used to understand, study and improve the model structures, mathematical properties and relevant training algorithms for deep neural networks. In particular, I will demonstrate how a new convolutional neural network (CNN), known as MgNet, can be derived by making very minor modifications of a classic geometric multigrid method for the Poisson equation and then discuss the theoretical and practical potentials of MgNet.