Decoupling Learning Rates Using Empirical Bayes Priors

Nabi, Sareh; Nassif, Houssam; Hong, Joseph; Mamani, Hamed; Imbens, Guido

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2002

Computer Science > Machine Learning

Title: Decoupling Learning Rates Using Empirical Bayes Priors

Authors: Sareh Nabi, Houssam Nassif, Joseph Hong, Hamed Mamani, Guido Imbens

(Submitted on 4 Feb 2020 (this version), latest version 12 Jul 2021 (v3))

Abstract: In this work, we propose an Empirical Bayes approach to decouple the learning rates of first order and second order features (or any other feature grouping) in a Generalized Linear Model. Such needs arise in small-batch or low-traffic use-cases. As the first order features are likely to have a more pronounced effect on the outcome, focusing on learning first order weights first is likely to improve performance and convergence time. Our Empirical Bayes method clamps features in each group together and uses the observed data for the deployed model to empirically compute a hierarchical prior in hindsight. We apply our method to a standard classification setting, as well as a contextual bandit setting in an Amazon production system. Both during simulations and live experiments, our method shows marked improvements, especially in cases of small traffic. Our findings are promising, as optimizing over sparse data is often a challenge. Furthermore, our approach can be applied to any problem instance modeled as a Bayesian framework.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2002.01129 [cs.LG]
	(or arXiv:2002.01129v1 [cs.LG] for this version)

Submission history

From: Sareh Nabi [view email]
[v1] Tue, 4 Feb 2020 05:08:17 GMT (98kb,D)
[v2] Sun, 1 Nov 2020 23:00:22 GMT (408kb,D)
[v3] Mon, 12 Jul 2021 21:18:32 GMT (114kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.01129v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Decoupling Learning Rates Using Empirical Bayes Priors

Submission history