We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Machine Learning

Title: GroSS: Group-Size Series Decomposition for Whole Search-Space Training

Abstract: We present Group-size Series (GroSS) decomposition, a mathematical formulation of tensor factorisation into a series of approximations of increasing rank terms. GroSS allows for dynamic and differentiable selection of factorisation rank, which is analogous to a grouped convolution. Therefore, to the best of our knowledge, GroSS is the first method to simultaneously train differing numbers of groups within a single layer, as well as all possible combinations between layers. In doing so, GroSS trains an entire grouped convolution architecture search-space concurrently. We demonstrate this through proof-of-concept architecture searches with performance objectives. GroSS represents a significant step towards liberating network architecture search from the burden of training and fine-tuning.
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1912.00673 [cs.LG]
  (or arXiv:1912.00673v1 [cs.LG] for this version)

Submission history

From: Henry Howard-Jenkins [view email]
[v1] Mon, 2 Dec 2019 10:32:50 GMT (119kb,D)
[v2] Mon, 23 Mar 2020 12:26:25 GMT (4399kb,D)
[v3] Thu, 16 Jul 2020 16:28:12 GMT (4659kb,D)

Link back to: arXiv, form interface, contact.