We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Riemannian Adaptive Optimization Algorithm and Its Application to Natural Language Processing

Abstract: This paper proposes a Riemannian adaptive optimization algorithm to optimize the parameters of deep neural networks. The algorithm is an extension of both AMSGrad in Euclidean space and RAMSGrad on a Riemannian manifold. The algorithm helps to resolve two issues affecting RAMSGrad. The first is that it can solve the Riemannian stochastic optimization problem directly, in contrast to RAMSGrad which only achieves a low regret. The other is that it can use constant learning rates, which makes it implementable in practice. Additionally, we apply the proposed algorithm to Poincar{\'e} embeddings, which embed the transitive closure of the WordNet nouns into the Poincar{\'e} ball model of hyperbolic space. Numerical experiments show that regardless of the initial value of the learning rate, our algorithm stably converges to the optimal solution and converges faster than RSGD, the most basic Riemannian stochastic optimization algorithm.
Comments: 22 pages, 8 figures
Subjects: Optimization and Control (math.OC)
MSC classes: 65k05, 90C25, 57R25
ACM classes: G.1.6
Cite as: arXiv:2004.00897 [math.OC]
  (or arXiv:2004.00897v4 [math.OC] for this version)

Submission history

From: Hiroyuki Sakai [view email]
[v1] Thu, 2 Apr 2020 09:33:14 GMT (242kb)
[v2] Mon, 1 Jun 2020 04:54:58 GMT (242kb)
[v3] Fri, 2 Oct 2020 05:09:33 GMT (590kb,D)
[v4] Tue, 8 Dec 2020 16:13:22 GMT (590kb,D)

Link back to: arXiv, form interface, contact.