We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Differentiable Pruning Method for Neural Networks

Abstract: Architecture optimization is a promising technique to find an efficient neural network to meet certain requirements, which is usually a problem of selections. This paper introduces a concept of a trainable gate function and proposes a channel pruning method which finds automatically the optimal combination of channels using a simple gradient descent training procedure. The trainable gate function, which confers a differentiable property to discrete-valued variables, allows us to directly optimize loss functions that include discrete values such as the number of parameters or FLOPs that are generally non-differentiable. Channel pruning can be applied simply by appending trainable gate functions to each intermediate output tensor followed by fine-tuning the overall model, using any gradient-based training methods. Our experiments show that the proposed method can achieve better compression results on various models. For instance, our proposed method compresses ResNet-56 on CIFAR-10 dataset by half in terms of the number of FLOPs without accuracy drop.
Comments: Submitted to ICML Workshop
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1904.10921 [cs.LG]
  (or arXiv:1904.10921v1 [cs.LG] for this version)

Submission history

From: Jaedeok Kim [view email]
[v1] Wed, 24 Apr 2019 16:57:20 GMT (173kb,D)
[v2] Thu, 14 Nov 2019 15:11:45 GMT (3895kb,D)

Link back to: arXiv, form interface, contact.