A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces

Lan, Charline Le; Greaves, Joshua; Farebrother, Jesse; Rowland, Mark; Pedregosa, Fabian; Agarwal, Rishabh; Bellemare, Marc G.

(Submitted on 8 Dec 2022)

Abstract: Many machine learning problems encode their data as a matrix with a possibly very large number of rows and columns. In several applications, such as neuroscience, image compression, or deep reinforcement learning, the principal subspace of such a matrix provides a useful, low-dimensional representation of individual data. Here, we are interested in determining the $d$-dimensional principal subspace of a given matrix from sample entries, i.e., from small random submatrices. Although a number of sample-based methods exist for this problem (e.g., Oja's rule \citep{oja1982simplified}), these assume access to full columns of the matrix or particular matrix structure such as symmetry, and cannot be combined as-is with neural networks \citep{baldi1989neural}. In this paper, we derive an algorithm that learns a principal subspace from sample entries, can be applied when the approximate subspace is represented by a neural network, and hence can be scaled to datasets with an effectively infinite number of rows and columns. Our method consists of defining a loss function whose minimizer is the desired principal subspace and constructing a gradient estimate of this loss whose bias can be controlled. We complement our theoretical analysis with a series of experiments on synthetic matrices, the MNIST dataset \citep{lecun2010mnist}, and the reinforcement learning domain PuddleWorld \citep{sutton1995generalization}, demonstrating the usefulness of our approach.
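
For orientation, the target object in the abstract can be illustrated in the easy setting where the whole matrix is available. The sketch below is not the paper's sample-based algorithm: it assumes full access to a small dense matrix, computes the $d$-dimensional principal subspace with a truncated SVD, and checks that this subspace minimizes a standard projection loss. The function names `principal_subspace` and `projection_loss`, the matrix size, and the choice of loss are assumptions made for illustration only.

```python
import numpy as np


def principal_subspace(matrix, d):
    """Orthonormal basis (as columns) of the d-dimensional principal subspace."""
    # Left singular vectors are returned in order of decreasing singular value.
    u, _, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :d]


def projection_loss(matrix, basis):
    """Squared Frobenius error after projecting the columns onto span(basis)."""
    # Orthonormalize first, so any full-rank basis of the subspace is accepted.
    q, _ = np.linalg.qr(basis)
    residual = matrix - q @ (q.T @ matrix)
    return float(np.sum(residual ** 2))


# Hypothetical synthetic data: a 50 x 30 Gaussian matrix and d = 3.
rng = np.random.default_rng(0)
matrix = rng.normal(size=(50, 30))
d = 3

best = principal_subspace(matrix, d)
random_basis = rng.normal(size=(50, d))

singular_values = np.linalg.svd(matrix, compute_uv=False)
best_loss = projection_loss(matrix, best)
random_loss = projection_loss(matrix, random_basis)

# Energy in the discarded singular values; by the Eckart-Young theorem this
# is the smallest loss any d-dimensional subspace can achieve.
tail_energy = float(np.sum(singular_values[d:] ** 2))
```

Here `best_loss` matches `tail_energy` up to floating-point error, and no other $d$-dimensional subspace, such as the one spanned by `random_basis`, achieves a lower loss. The setting described in the abstract is harder because only small random submatrices are observed, so neither the SVD nor the exact loss gradient is directly available.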

Comments:	8 pages in main content, 2 pages of bibliography and 5 pages in Appendix
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2212.04025 [cs.LG]
	(or arXiv:2212.04025v1 [cs.LG] for this version)

Submission history

From: Charline Le Lan
[v1] Thu, 8 Dec 2022 01:26:47 GMT (999kb,D)
