Efficient Sampling for Predictor-Based Neural Architecture Search

Mauch, Lukas; Tiedemann, Stephen; Garcia, Javier Alonso; Cong, Bac Nguyen; Yoshiyama, Kazuki; Cardinaux, Fabien; Kemp, Thomas

(Submitted on 24 Nov 2020)

Abstract: Recently, predictor-based algorithms have emerged as a promising approach for neural architecture search (NAS). For NAS, we typically have to calculate the validation accuracy of a large number of Deep Neural Networks (DNNs), which is computationally expensive. Predictor-based NAS algorithms address this problem. They train a proxy model that can infer the validation accuracy of DNNs directly from their network structure. During optimization, the proxy can be used to narrow down the number of architectures for which the true validation accuracy must be computed, which makes predictor-based algorithms sample efficient. Usually, we compute the proxy for all DNNs in the network search space and pick those that maximize the proxy as candidates for optimization. However, that is intractable in practice, because the search spaces are often very large and contain billions of network architectures. The contributions of this paper are threefold: 1) We define a sample efficiency gain to compare different predictor-based NAS algorithms. 2) We conduct experiments on the NASBench-101 dataset and show that the sample efficiency of predictor-based algorithms decreases dramatically if the proxy is only computed for a subset of the search space. 3) We show that if we choose the subset of the search space on which the proxy is evaluated in a smart way, the sample efficiency of the original predictor-based algorithm that has access to the full search space can be regained. This is an important step toward making predictor-based NAS algorithms useful in practice.
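The loop described in the abstract (fit a proxy, rank candidates with it, run the expensive evaluation only for the top-ranked ones) can be illustrated with a small sketch. Everything here is an assumption made for illustration and is not taken from the paper: the 10-bit architecture encoding, the synthetic `true_accuracy` function, the nearest-neighbour `predict` proxy, and the `pool_size` parameter that restricts the proxy to a uniformly random subset of the search space. It does not use NASBench-101 and does not implement the authors' sampling strategy.

```python
import random
from itertools import product

N_BITS = 10  # hypothetical toy encoding: one architecture = 10 binary choices
WEIGHTS = [0.01 * (i + 1) for i in range(N_BITS)]  # hypothetical, drives the synthetic accuracy


def true_accuracy(arch):
    """Stand-in for an expensive train-and-validate run (synthetic, not NASBench-101)."""
    return 0.4 + sum(w * b for w, b in zip(WEIGHTS, arch))


def predict(arch, evaluated, k=3):
    """Proxy: mean accuracy of the k evaluated architectures closest in Hamming distance."""
    nearest = sorted(evaluated, key=lambda a: sum(x != y for x, y in zip(a, arch)))[:k]
    return sum(evaluated[a] for a in nearest) / len(nearest)


def predictor_search(space, rounds=5, per_round=4, pool_size=None, seed=0):
    """Predictor-guided search; pool_size=None scores the whole space in every round."""
    rng = random.Random(seed)
    # Start from a few randomly chosen architectures with known (true) accuracy.
    evaluated = {a: true_accuracy(a) for a in rng.sample(space, per_round)}
    for _ in range(rounds):
        # Either score every architecture or only a uniformly random subset.
        pool = space if pool_size is None else rng.sample(space, pool_size)
        unseen = [a for a in pool if a not in evaluated]
        # Rank unseen candidates by the proxy and evaluate only the top few for real.
        unseen.sort(key=lambda a: predict(a, evaluated), reverse=True)
        for arch in unseen[:per_round]:
            evaluated[arch] = true_accuracy(arch)
    return max(evaluated.values()), len(evaluated)


space = list(product((0, 1), repeat=N_BITS))
best_full, n_full = predictor_search(space)              # proxy on the full space
best_sub, n_sub = predictor_search(space, pool_size=64)  # proxy on a random subset
print(f"full space:   best={best_full:.3f} after {n_full} true evaluations")
print(f"subset of 64: best={best_sub:.3f} after {n_sub} true evaluations")
```

Both calls spend the same number of true evaluations, so the best accuracy found is the only thing that differs between them. The toy is not meant to reproduce the paper's results; it only shows where the candidate subset enters the loop and why its choice can matter.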

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2011.12043 [cs.LG]
	(or arXiv:2011.12043v1 [cs.LG] for this version)

Submission history

From: Lukas Mauch
[v1] Tue, 24 Nov 2020 11:36:36 GMT (551kb,D)
