We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: BINAS: Bilinear Interpretable Neural Architecture Search

Abstract: Practical use of neural networks often involves requirements on latency, energy and memory among others. A popular approach to find networks under such requirements is through constrained Neural Architecture Search (NAS). However, previous methods use complicated predictors for the accuracy of the network. Those predictors are hard to interpret and sensitive to many hyperparameters to be tuned, hence, the resulting accuracy of the generated models is often harmed. In this work we resolve this by introducing Bilinear Interpretable Neural Architecture Search (BINAS), that is based on an accurate and simple bilinear formulation of both an accuracy estimator and the expected resource requirement, together with a scalable search method with theoretical guarantees. The simplicity of our proposed estimator together with the intuitive way it is constructed bring interpretability through many insights about the contribution of different design choices. For example, we find that in the examined search space, adding depth and width is more effective at deeper stages of the network and at the beginning of each resolution stage. Our experiments show that BINAS generates comparable to or better architectures than other state-of-the-art NAS methods within a reduced marginal search cost, while strictly satisfying the resource constraints.
Comments: The full code is released at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC); Machine Learning (stat.ML)
MSC classes: 68T09, 68T45
ACM classes: G.1.6; G.3; I.2.8; I.2.10; I.5.1
Cite as: arXiv:2110.12399 [cs.LG]
  (or arXiv:2110.12399v3 [cs.LG] for this version)

Submission history

From: Niv Nayman [view email]
[v1] Sun, 24 Oct 2021 09:45:00 GMT (12444kb,D)
[v2] Thu, 20 Jan 2022 14:32:33 GMT (11538kb,D)
[v3] Wed, 27 Apr 2022 12:55:23 GMT (12213kb,D)

Link back to: arXiv, form interface, contact.