Computer Science > Machine Learning
Title: SCARLET-NAS: Bridging the Gap between Stability and Scalability in Weight-sharing Neural Architecture Search
(Submitted on 16 Aug 2019 (v1), last revised 14 Aug 2021 (this version, v6))
Abstract: Discovering powerful yet compact models is an important goal of neural architecture search. Previous two-stage one-shot approaches are limited to search spaces with a fixed depth. It seems convenient to add a skip connection to the search space to make depth variable. However, doing so creates a large range of perturbation during supernet training, and the supernet then has difficulty giving a confident ranking of subnetworks. In this paper, we discover that skip connections bring about significant feature inconsistency compared with other operations, which potentially degrades the supernet performance. Based on this observation, we tackle the problem by imposing an equivariant learnable stabilizer to homogenize such disparities. Experiments show that our proposed stabilizer helps to improve the supernet's convergence as well as its ranking performance. With an evolutionary search backend that incorporates the stabilized supernet as an evaluator, we derive a family of state-of-the-art architectures, the SCARLET series, at several depths; in particular, SCARLET-A obtains 76.9% top-1 accuracy on ImageNet. Code is available at this https URL
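The abstract does not say what form the stabilizer takes. As a rough illustration only, the sketch below assumes it is a learnable 1x1 convolution with no bias and no activation, used in place of a plain skip connection while the supernet trains. Under that assumption, the stabilizer is a linear channel-mixing map, so it reproduces the skip connection exactly when its weights are the identity, and it can be folded into a following linear layer once the search is done. All function and variable names here are hypothetical and are not taken from the authors' code.

```python
import random

def matvec(w, x):
    """Apply a channel-mixing matrix w (C_out x C_in) to one pixel's channel vector x."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in w]

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(p * q for p, q in zip(row, col)) for col in cols] for row in a]

def conv1x1(w, feature_map):
    """1x1 convolution without bias or activation: the same matrix at every position."""
    return [[matvec(w, pixel) for pixel in row] for row in feature_map]

def identity(c):
    """Identity matrix of size c x c."""
    return [[1.0 if i == j else 0.0 for j in range(c)] for i in range(c)]

def random_matrix(rows, cols, rng):
    """Stand-in for trained weights (assumption: values are arbitrary)."""
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

def max_abs_diff(a, b):
    """Largest element-wise gap between two H x W x C feature maps."""
    return max(
        abs(p - q)
        for row_a, row_b in zip(a, b)
        for pix_a, pix_b in zip(row_a, row_b)
        for p, q in zip(pix_a, pix_b)
    )

rng = random.Random(0)
C, H, W = 3, 2, 2
# Toy feature map stored as H x W x C nested lists.
x = [[[rng.uniform(-1, 1) for _ in range(C)] for _ in range(W)] for _ in range(H)]

# A plain skip connection passes the features through unchanged.
skip_out = x

# With identity weights, the assumed stabilizer behaves exactly like the skip.
stab_identity = conv1x1(identity(C), x)

# With arbitrary weights, stabilizer followed by another linear 1x1 layer
# equals a single 1x1 layer whose weights are the matrix product.
w_stab = random_matrix(C, C, rng)
w_next = random_matrix(4, C, rng)
two_step = conv1x1(w_next, conv1x1(w_stab, x))
folded = conv1x1(matmul(w_next, w_stab), x)

print(max_abs_diff(stab_identity, skip_out))
print(max_abs_diff(two_step, folded))
```

The folding step holds only when the layer that follows is itself linear at that point; a real network with normalization or nonlinearities between the two layers would need a more careful argument than this toy case provides.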
Submission history
From: Bo Zhang
[v1] Fri, 16 Aug 2019 15:31:08 GMT (1774kb,D)
[v2] Mon, 19 Aug 2019 10:42:54 GMT (1774kb,D)
[v3] Fri, 13 Sep 2019 14:57:13 GMT (2114kb,D)
[v4] Thu, 28 Nov 2019 09:04:13 GMT (833kb,D)
[v5] Thu, 2 Apr 2020 03:54:03 GMT (2159kb,D)
[v6] Sat, 14 Aug 2021 13:37:59 GMT (2087kb,D)