We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Dictionary Learning for Massive Matrix Factorization

Authors: Arthur Mensch (PARIETAL), Julien Mairal (LEAR), Bertrand Thirion (PARIETAL), Gaël Varoquaux (PARIETAL)
Abstract: Sparse matrix factorization is a popular tool to obtain interpretable data decompositions, which are also effective to perform data completion or denoising. Its applicability to large datasets has been addressed with online and randomized methods, that reduce the complexity in one of the matrix dimension, but not in both of them. In this paper, we tackle very large matrices in both dimensions. We propose a new factoriza-tion method that scales gracefully to terabyte-scale datasets, that could not be processed by previous algorithms in a reasonable amount of time. We demonstrate the efficiency of our approach on massive functional Magnetic Resonance Imaging (fMRI) data, and on matrix completion problems for recommender systems, where we obtain significant speed-ups compared to state-of-the art coordinate descent methods.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Journal reference: Proceedings of the International Conference on Machine Learning, 2016, pp 1737-1746
Cite as: arXiv:1605.00937 [stat.ML]
  (or arXiv:1605.00937v2 [stat.ML] for this version)

Submission history

From: Arthur Mensch [view email]
[v1] Tue, 3 May 2016 15:05:32 GMT (1739kb,D)
[v2] Thu, 26 May 2016 06:33:22 GMT (1367kb,D)

Link back to: arXiv, form interface, contact.