References & Citations
Mathematics > Optimization and Control
Title: Riemannian Adaptive Optimization Algorithm and Its Application to Natural Language Processing
(Submitted on 2 Apr 2020 (v1), last revised 8 Dec 2020 (this version, v4))
Abstract: This paper proposes a Riemannian adaptive optimization algorithm to optimize the parameters of deep neural networks. The algorithm is an extension of both AMSGrad in Euclidean space and RAMSGrad on a Riemannian manifold. The algorithm helps to resolve two issues affecting RAMSGrad. The first is that it can solve the Riemannian stochastic optimization problem directly, in contrast to RAMSGrad which only achieves a low regret. The other is that it can use constant learning rates, which makes it implementable in practice. Additionally, we apply the proposed algorithm to Poincar{\'e} embeddings, which embed the transitive closure of the WordNet nouns into the Poincar{\'e} ball model of hyperbolic space. Numerical experiments show that regardless of the initial value of the learning rate, our algorithm stably converges to the optimal solution and converges faster than RSGD, the most basic Riemannian stochastic optimization algorithm.
Submission history
From: Hiroyuki Sakai [view email][v1] Thu, 2 Apr 2020 09:33:14 GMT (242kb)
[v2] Mon, 1 Jun 2020 04:54:58 GMT (242kb)
[v3] Fri, 2 Oct 2020 05:09:33 GMT (590kb,D)
[v4] Tue, 8 Dec 2020 16:13:22 GMT (590kb,D)
Link back to: arXiv, form interface, contact.