BINAS: Bilinear Interpretable Neural Architecture Search

Nayman, Niv; Aflalo, Yonathan; Noy, Asaf; Jin, Rong; Zelnik-Manor, Lihi

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2110

Computer Science > Machine Learning

Title: BINAS: Bilinear Interpretable Neural Architecture Search

Authors: Niv Nayman, Yonathan Aflalo, Asaf Noy, Rong Jin, Lihi Zelnik-Manor

(Submitted on 24 Oct 2021 (v1), last revised 27 Apr 2022 (this version, v3))

Abstract: Practical use of neural networks often involves requirements on latency, energy and memory among others. A popular approach to find networks under such requirements is through constrained Neural Architecture Search (NAS). However, previous methods use complicated predictors for the accuracy of the network. Those predictors are hard to interpret and sensitive to many hyperparameters to be tuned, hence, the resulting accuracy of the generated models is often harmed. In this work we resolve this by introducing Bilinear Interpretable Neural Architecture Search (BINAS), that is based on an accurate and simple bilinear formulation of both an accuracy estimator and the expected resource requirement, together with a scalable search method with theoretical guarantees. The simplicity of our proposed estimator together with the intuitive way it is constructed bring interpretability through many insights about the contribution of different design choices. For example, we find that in the examined search space, adding depth and width is more effective at deeper stages of the network and at the beginning of each resolution stage. Our experiments show that BINAS generates comparable to or better architectures than other state-of-the-art NAS methods within a reduced marginal search cost, while strictly satisfying the resource constraints.

Comments:	The full code is released at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC); Machine Learning (stat.ML)
MSC classes:	68T09, 68T45
ACM classes:	G.1.6; G.3; I.2.8; I.2.10; I.5.1
Cite as:	arXiv:2110.12399 [cs.LG]
	(or arXiv:2110.12399v3 [cs.LG] for this version)

Submission history

From: Niv Nayman [view email]
[v1] Sun, 24 Oct 2021 09:45:00 GMT (12444kb,D)
[v2] Thu, 20 Jan 2022 14:32:33 GMT (11538kb,D)
[v3] Wed, 27 Apr 2022 12:55:23 GMT (12213kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.12399

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: BINAS: Bilinear Interpretable Neural Architecture Search

Submission history