4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Deep Relaxation: partial differential equations for optimizing deep neural networks

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We establish connections between non-convex optimization methods for training deep neural networks (DNNs) and the theory of partial differential equations (PDEs). In particular, we focus on relaxation techniques initially developed in statistical physics, which we show to be solutions of a nonlinear Hamilton-Jacobi-Bellman equation. We employ the underlying stochastic control problem to analyze the geometry of the relaxed energy landscape and its convergence properties, thereby confirming empirical evidence. This paper opens non-convex optimization problems arising in deep learning to ideas from the PDE literature. In particular, we show that the non-viscous Hamilton-Jacobi equation leads to an elegant algorithm based on the Hopf-Lax formula that outperforms state-of-the-art methods. Furthermore, we show that these algorithms scale well in practice and can effectively tackle the high dimensionality of modern neural networks.

          Related collections

          Most cited references17

          • Record: found
          • Abstract: not found
          • Article: not found

          Monotone Operators and the Proximal Point Algorithm

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Mean field games

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              The Variational Formulation of the Fokker--Planck Equation

                Bookmark

                Author and article information

                Journal
                2017-04-17
                Article
                1704.04932
                2081a334-c930-43e5-a8e7-6747f6eaa262

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.LG math.AP math.OC

                Analysis,Numerical methods,Artificial intelligence
                Analysis, Numerical methods, Artificial intelligence

                Comments

                Comment on this article