ScienceOpen: research and publishing network

For Researchers

Search
Advanced search

4

views

    

0

recommends

0

shares

Record: found
Abstract: found
Article: found

Is Open Access

Deep Relaxation: partial differential equations for optimizing deep neural networks

Preprint

Author(s): Pratik Chaudhari , Adam Oberman , Stanley Osher , Stefano Soatto , Guillame Carlier

Publication date Created: 2017-04-17

Read this article at

ScienceOpen ArXiv

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

We establish connections between non-convex optimization methods for training deep neural networks (DNNs) and the theory of partial differential equations (PDEs). In particular, we focus on relaxation techniques initially developed in statistical physics, which we show to be solutions of a nonlinear Hamilton-Jacobi-Bellman equation. We employ the underlying stochastic control problem to analyze the geometry of the relaxed energy landscape and its convergence properties, thereby confirming empirical evidence. This paper opens non-convex optimization problems arising in deep learning to ideas from the PDE literature. In particular, we show that the non-viscous Hamilton-Jacobi equation leads to an elegant algorithm based on the Hopf-Lax formula that outperforms state-of-the-art methods. Furthermore, we show that these algorithms scale well in practice and can effectively tackle the high dimensionality of modern neural networks.

Related collections

Most cited references 17

Record: found
Abstract: not found
Article: not found

Monotone Operators and the Proximal Point Algorithm

R. Rockafellar (1976)

0 comments Cited 560 times – based on 0 reviews      Review now

Record: found
Abstract: not found
Article: not found

Mean field games

Jean-Michel Lasry, Pierre-Louis Lions (2007)

0 comments Cited 403 times – based on 0 reviews      Review now

Record: found
Abstract: not found
Article: not found

The Variational Formulation of the Fokker--Planck Equation

Richard F. Jordan, David Kinderlehrer, Felix Otto (1998)

0 comments Cited 400 times – based on 0 reviews      Review now

Author and article information

Journal

Publication date Created: 2017-04-17

Article

ArXiV ID: 1704.04932

SO-VID: 2081a334-c930-43e5-a8e7-6747f6eaa262

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Categories cs.LG math.AP math.OC

ScienceOpen disciplines: Analysis,Numerical methods,Artificial intelligence

Data availability:

ScienceOpen disciplines: Analysis, Numerical methods, Artificial intelligence

Comments

Comment on this article