Practical Bayesian Optimization of Machine Learning Algorithms


Abstract

Machine learning algorithms frequently require careful tuning of model hyperparameters, regularization terms, and optimization parameters. Unfortunately, this tuning is often a "black art" that requires expert experience, unwritten rules of thumb, or sometimes brute-force search. Much more appealing is the idea of developing automatic approaches which can optimize the performance of a given learning algorithm to the task at hand. In this work, we consider the automatic tuning problem within the framework of Bayesian optimization, in which a learning algorithm's generalization performance is modeled as a sample from a Gaussian process (GP). The tractable posterior distribution induced by the GP leads to efficient use of the information gathered by previous experiments, enabling optimal choices about what parameters to try next. Here we show how the effects of the Gaussian process prior and the associated inference procedure can have a large impact on the success or failure of Bayesian optimization. We show that thoughtful choices can lead to results that exceed expert-level performance in tuning machine learning algorithms. We also describe new algorithms that take into account the variable cost (duration) of learning experiments and that can leverage the presence of multiple cores for parallel experimentation. We show that these proposed algorithms improve on previous automatic procedures and can reach or surpass human expert-level optimization on a diverse set of contemporary algorithms including latent Dirichlet allocation, structured SVMs and convolutional neural networks.
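The loop the abstract describes (fit a GP to past experiments, then use its posterior to pick the next setting) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the squared-exponential kernel with fixed hyperparameters, the expected-improvement acquisition function, the grid of candidates, and the hypothetical `toy_loss` standing in for a real validation loss.

```python
import math
import numpy as np

def rbf_kernel(a, b, length_scale=0.2, variance=1.0):
    """Squared-exponential covariance between 1-D arrays a and b (assumed kernel)."""
    sq_dist = (a[:, None] - b[None, :]) ** 2
    return variance * np.exp(-0.5 * sq_dist / length_scale ** 2)

def gp_posterior(x_obs, y_obs, x_new, noise=1e-6):
    """Posterior mean and standard deviation of a zero-mean GP at x_new."""
    k_oo = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_on = rbf_kernel(x_obs, x_new)
    chol = np.linalg.cholesky(k_oo)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_obs))
    mean = k_on.T @ alpha
    v = np.linalg.solve(chol, k_on)
    var = rbf_kernel(x_new, x_new).diagonal() - np.sum(v ** 2, axis=0)
    # Clip tiny negative variances caused by round-off before taking the root.
    return mean, np.sqrt(np.maximum(var, 1e-12))

def expected_improvement(mean, std, best):
    """Expected improvement for minimization, given the best loss seen so far."""
    z = (best - mean) / std
    cdf = 0.5 * (1.0 + np.vectorize(math.erf)(z / math.sqrt(2.0)))
    pdf = np.exp(-0.5 * z ** 2) / math.sqrt(2.0 * math.pi)
    return (best - mean) * cdf + std * pdf

def toy_loss(x):
    """Hypothetical stand-in for a validation loss over one hyperparameter."""
    return (x - 0.3) ** 2

def optimize(n_iter=10):
    """Run a few rounds of GP-based search over a fixed candidate grid."""
    grid = np.linspace(0.0, 1.0, 201)
    x_obs = np.array([0.0, 1.0])          # two initial experiments
    y_obs = toy_loss(x_obs)
    for _ in range(n_iter):
        mean, std = gp_posterior(x_obs, y_obs, grid)
        ei = expected_improvement(mean, std, y_obs.min())
        x_next = grid[np.argmax(ei)]      # most promising candidate
        x_obs = np.append(x_obs, x_next)
        y_obs = np.append(y_obs, toy_loss(x_next))
    best = np.argmin(y_obs)
    return x_obs[best], y_obs[best]
```

Calling `optimize()` returns the best input and loss found on the toy problem. The sketch evaluates one candidate at a time with a fixed kernel; it does not attempt the cost-aware or parallel variants the abstract mentions.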

Author and article information

Journal

Publisher: arXiv

Publication date (Electronic): 2012

Publication date Submitted: 13 June 2012

Publication date Updated: 15 June 2012

Publication date Submitted: 29 August 2012

Publication date Updated: 30 August 2012

Publication date Available: June 2012

Article

DOI: 10.48550/ARXIV.1206.2944

SO-VID: 99061cac-0e0a-48ea-89b7-abedad2cd574

License:

arXiv.org perpetual, non-exclusive license

History

Keywords: Machine Learning (cs.LG), Machine Learning (stat.ML), FOS: Computer and information sciences
