Have a personal or library account? Click to login
Continuous limits of residual neural networks in case of large input data Cover

Continuous limits of residual neural networks in case of large input data

Open Access
|Dec 2022

Abstract

Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean–field limit and show well–posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.

Language: English
Page range: 96 - 120
Submitted on: Jul 11, 2022
|
Accepted on: Nov 12, 2022
|
Published on: Dec 24, 2022
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2022 Michael Herty, Anna Thünen, Torsten Trimborn, Giuseppe Visconti, published by Italian Society for Applied and Industrial Mathemathics
This work is licensed under the Creative Commons Attribution 4.0 License.