Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: No-regret Non-convex Online Meta-Learning
(Submitted on 22 Oct 2019 (v1), last revised 19 Feb 2020 (this version, v4))
Abstract: The online meta-learning framework is designed for the continual lifelong learning setting. It bridges two fields: meta-learning which tries to extract prior knowledge from past tasks for fast learning of future tasks, and online-learning which deals with the sequential setting where problems are revealed one by one. In this paper, we generalize the original framework from convex to non-convex setting, and introduce the local regret as the alternative performance measure. We then apply this framework to stochastic settings, and show theoretically that it enjoys a logarithmic local regret, and is robust to any hyperparameter initialization. The empirical test on a real-world task demonstrates its superiority compared with traditional methods.
Submission history
From: Zhenxun Zhuang [view email][v1] Tue, 22 Oct 2019 18:45:15 GMT (205kb,D)
[v2] Tue, 10 Dec 2019 16:08:21 GMT (226kb,D)
[v3] Tue, 11 Feb 2020 01:10:11 GMT (233kb,D)
[v4] Wed, 19 Feb 2020 02:52:52 GMT (112kb,D)
Link back to: arXiv, form interface, contact.