Scalable Hyperparameter Optimization with Lazy Gaussian Processes

Ram, Raju; Müller, Sabine; Pfreundt, Franz-Josef; Gauger, Nicolas R.; Keuper, Janis

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2001

Computer Science > Machine Learning

Title: Scalable Hyperparameter Optimization with Lazy Gaussian Processes

Authors: Raju Ram, Sabine Müller, Franz-Josef Pfreundt, Nicolas R. Gauger, Janis Keuper

(Submitted on 16 Jan 2020)

Abstract: Most machine learning methods require careful selection of hyper-parameters in order to train a high performing model with good generalization abilities. Hence, several automatic selection algorithms have been introduced to overcome tedious manual (try and error) tuning of these parameters. Due to its very high sample efficiency, Bayesian Optimization over a Gaussian Processes modeling of the parameter space has become the method of choice. Unfortunately, this approach suffers from a cubic compute complexity due to underlying Cholesky factorization, which makes it very hard to be scaled beyond a small number of sampling steps. In this paper, we present a novel, highly accurate approximation of the underlying Gaussian Process. Reducing its computational complexity from cubic to quadratic allows an efficient strong scaling of Bayesian Optimization while outperforming the previous approach regarding optimization accuracy. The first experiments show speedups of a factor of 162 in single node and further speed up by a factor of 5 in a parallel environment.

Comments:	14 pages; 6 figures; 4 tables; Accepted in proceedings of MLHPC 2019: Fifth International Workshop on Machine Learning in High Performance Computing Environments, Super Computing Conference 2019, Denver, Colorado
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2001.05726 [cs.LG]
	(or arXiv:2001.05726v1 [cs.LG] for this version)

Submission history

From: Raju Ram [view email]
[v1] Thu, 16 Jan 2020 10:15:55 GMT (479kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.05726

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Scalable Hyperparameter Optimization with Lazy Gaussian Processes

Submission history