Computer Science > Computer Vision and Pattern Recognition
Title: AutoHAS: Efficient Hyperparameter and Architecture Search
(Submitted on 5 Jun 2020 (v1), last revised 7 Apr 2021 (this version, v3))
Abstract: Efficient hyperparameter search methods and efficient architecture search methods have both shown remarkable results, but each is applicable to only one of the two problems: searching for hyperparameters (HPs) or searching for architectures. In this work, we propose a unified pipeline, AutoHAS, to efficiently search for both architectures and hyperparameters. AutoHAS learns to alternately update the shared network weights and a reinforcement learning (RL) controller, which learns the probability distribution over the architecture candidates and HP candidates. A temporary weight is introduced to store the weight updated with the HPs selected by the controller, and the validation accuracy computed with this temporary weight serves as the reward for updating the controller. In experiments, we show that AutoHAS is efficient and generalizes to different search spaces, baselines, and datasets. In particular, AutoHAS improves the accuracy of popular network architectures, such as ResNet and EfficientNet, on CIFAR-10/100, ImageNet, and four other datasets.
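The alternating update described in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the one-parameter regression task, the two "architecture" candidates (feature maps), the learning-rate candidates, the REINFORCE update with a moving-average baseline, the use of negative validation loss in place of validation accuracy, and the choice to commit the temporary weight as the new shared weight are all assumptions made for illustration, as are the names `toy_search`, `reinforce_step`, and `loss_and_grad`.

```python
import math
import random


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample(probs, rng):
    # Draw one candidate index from the controller's distribution.
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def reinforce_step(logits, choice, advantage, lr):
    # d log p(choice) / d logit_i = onehot_i - p_i for a softmax policy.
    probs = softmax(logits)
    return [
        l + lr * advantage * ((1.0 if i == choice else 0.0) - p)
        for i, (l, p) in enumerate(zip(logits, probs))
    ]


def loss_and_grad(w, feature, data):
    # Mean squared error of the model y = w * feature(x), and its gradient in w.
    n = len(data)
    loss = sum((w * feature(x) - y) ** 2 for x, y in data) / n
    grad = sum(2 * (w * feature(x) - y) * feature(x) for x, y in data) / n
    return loss, grad


def toy_search(steps=300, seed=0):
    rng = random.Random(seed)
    train = [(x, 3.0 * x) for x in (-1.0, -0.5, 0.5, 1.0)]
    valid = [(x, 3.0 * x) for x in (-0.8, 0.3, 0.9)]

    # Hypothetical search space: two feature maps and three learning rates.
    arch_candidates = [lambda x: x, lambda x: x * x]
    lr_candidates = [0.0, 0.01, 0.3]

    shared_w = [0.0, 0.0]          # one shared weight per architecture candidate
    arch_logits = [0.0, 0.0]       # controller parameters for architectures
    lr_logits = [0.0, 0.0, 0.0]    # controller parameters for HPs
    baseline = 0.0                 # moving-average reward baseline (assumption)

    for _ in range(steps):
        # 1. The controller samples an architecture and an HP candidate.
        a = sample(softmax(arch_logits), rng)
        h = sample(softmax(lr_logits), rng)
        feature, lr = arch_candidates[a], lr_candidates[h]

        # 2. Temporary weight: one training step using the sampled HP.
        _, grad = loss_and_grad(shared_w[a], feature, train)
        temp_w = shared_w[a] - lr * grad

        # 3. Reward from validation data, evaluated at the temporary weight.
        val_loss, _ = loss_and_grad(temp_w, feature, valid)
        reward = -val_loss

        # 4. Update the controller with the reward signal.
        advantage = reward - baseline
        baseline = 0.9 * baseline + 0.1 * reward
        arch_logits = reinforce_step(arch_logits, a, advantage, 0.1)
        lr_logits = reinforce_step(lr_logits, h, advantage, 0.1)

        # 5. Update the shared weight (here: commit the temporary weight).
        shared_w[a] = temp_w

    return softmax(arch_logits), softmax(lr_logits), shared_w


if __name__ == "__main__":
    arch_probs, lr_probs, weights = toy_search()
    print("architecture probabilities:", arch_probs)
    print("learning-rate probabilities:", lr_probs)
    print("shared weights:", weights)
```

Running the script prints the controller's learned distributions over both candidate sets. Keeping the trial update in `temp_w` before scoring it is what lets a single training step supply both a reward for the controller and a candidate update for the shared weights.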
Submission history
From: Xuanyi Dong
[v1] Fri, 5 Jun 2020 19:57:24 GMT (286kb,D)
[v2] Tue, 6 Oct 2020 05:01:34 GMT (113kb,D)
[v3] Wed, 7 Apr 2021 06:55:00 GMT (145kb,D)