Low-rank Gradient Approximation For Memory-Efficient On-device Training of Deep Neural Network

Gooneratne, Mary; Sim, Khe Chai; Zadrazil, Petr; Kabel, Andreas; Beaufays, Françoise; Motta, Giovanni

(Submitted on 24 Jan 2020)

Abstract: Training machine learning models on mobile devices has the potential to improve both the privacy and the accuracy of the models. However, a major obstacle to achieving this goal is the limited memory of mobile devices. Reducing training memory enables models with high-dimensional weight matrices, such as automatic speech recognition (ASR) models, to be trained on-device. In this paper, we propose approximating the gradient matrices of deep neural networks using a low-rank parameterization to save training memory. The low-rank gradient approximation enables more advanced, memory-intensive optimization techniques to be run on-device. Our experimental results show that we can reduce the training memory by about 33.0% for Adam optimization. With this approximation, Adam uses memory comparable to momentum optimization and achieves a 4.5% relative reduction in word error rate on an ASR personalization task.
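
To make the memory argument in the abstract concrete, here is a rough back-of-the-envelope sketch, not the paper's method or its measurements. It assumes a gradient for a rows x cols weight matrix is either stored in full or represented by two hypothetical factors of shapes (rows, rank) and (rank, cols), and that the optimizer keeps its per-parameter accumulators in the same shape as that representation (two for Adam, one for momentum). The function name and parameters are illustrative only, and the numbers it produces do not attempt to reproduce the 33.0% figure reported above, which covers total training memory.

```python
def optimizer_state_floats(rows, cols, rank=None, moments=2):
    """Count floats held for one weight matrix's gradient and optimizer state.

    Assumptions of this sketch (not taken from the paper):
      - rank=None means the full rows x cols gradient is stored.
      - Otherwise the gradient is represented by two factors of shapes
        (rows, rank) and (rank, cols).
      - The optimizer keeps `moments` accumulators with the same number of
        entries as the gradient representation (Adam: 2, momentum: 1).
    """
    if rank is None:
        grad_entries = rows * cols
    else:
        grad_entries = rank * (rows + cols)
    # One copy for the gradient itself plus one per optimizer accumulator.
    return grad_entries * (1 + moments)


if __name__ == "__main__":
    rows, cols, rank = 1024, 1024, 8  # illustrative sizes only
    adam_full = optimizer_state_floats(rows, cols, moments=2)
    adam_low_rank = optimizer_state_floats(rows, cols, rank=rank, moments=2)
    momentum_full = optimizer_state_floats(rows, cols, moments=1)
    print("Adam, full gradient:     ", adam_full)
    print("Adam, low-rank gradient: ", adam_low_rank)
    print("Momentum, full gradient: ", momentum_full)
```

Under these assumptions the saving grows with the matrix dimensions, because the factored form scales with rank * (rows + cols) rather than rows * cols. The weights and activations, which this count ignores, still take memory during training.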

Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
Cite as:	arXiv:2001.08885 [eess.AS]
	(or arXiv:2001.08885v1 [eess.AS] for this version)

Submission history

From: Khe Chai Sim
[v1] Fri, 24 Jan 2020 05:12:18 GMT (1422kb,D)
