Computer Science > Machine Learning
Title: The Bayesian Linear Information Filtering Problem
(Submitted on 30 May 2016 (v1), last revised 22 Oct 2016 (this version, v2))
Abstract: We present a Bayesian sequential decision-making formulation of the information filtering problem, in which an algorithm presents items (news articles, scientific papers, tweets) arriving in a stream, and learns relevance from user feedback on presented items. We model user preferences using a Bayesian linear model, similar in spirit to a Bayesian linear bandit. We derive a computational upper bound on the value of the optimal policy, which allows computing an optimality gap for implementable policies. We then use this analysis to motivate a pair of new Decompose-Then-Decide (DTD) heuristic policies, DTD-Dynamic-Programming (DTD-DP) and DTD-Upper-Confidence-Bound (DTD-UCB). We compare DTD-DP and DTD-UCB against several benchmarks on real and simulated data, demonstrating significant improvement, and show that the achieved performance is close to the upper bound.
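To make the modeling idea concrete, the following is a minimal sketch of a generic Bayesian linear model with an upper-confidence-bound forwarding rule. It is not the paper's DTD-UCB or DTD-DP policy. The Gaussian prior, the known noise variance, the exploration weight `alpha`, the per-item `cost`, and all function names are assumptions introduced here for illustration only.

```python
import math

def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(ai * bi for ai, bi in zip(a, b))

def mat_vec(m, v):
    """Multiply a square matrix (list of rows) by a vector."""
    return [dot(row, v) for row in m]

def posterior_update(mu, sigma, x, y, noise_var):
    """Conjugate Gaussian update for y = x . theta + noise.

    Assumes theta ~ N(mu, sigma) and noise ~ N(0, noise_var).
    Returns the posterior mean and covariance after observing (x, y).
    """
    v = mat_vec(sigma, x)              # sigma @ x
    denom = dot(x, v) + noise_var      # predictive variance of y
    resid = y - dot(x, mu)             # prediction error
    new_mu = [m + vi * resid / denom for m, vi in zip(mu, v)]
    n = len(x)
    new_sigma = [[sigma[i][j] - v[i] * v[j] / denom for j in range(n)]
                 for i in range(n)]
    return new_mu, new_sigma

def ucb_index(mu, sigma, x, alpha):
    """Posterior mean relevance plus alpha posterior standard deviations."""
    var = max(dot(x, mat_vec(sigma, x)), 0.0)
    return dot(x, mu) + alpha * math.sqrt(var)

def should_forward(mu, sigma, x, alpha, cost):
    """Hypothetical rule: present the item if its index exceeds the cost."""
    return ucb_index(mu, sigma, x, alpha) > cost
```

In this sketch, feedback is only observed for items that are actually presented, so `posterior_update` would be called only when `should_forward` returns `True`. That coupling between presenting an item and learning from it is the exploration-exploitation trade-off the abstract refers to.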
Submission history
From: Bangrui Chen
[v1] Mon, 30 May 2016 02:35:07 GMT (1127kb,D)
[v2] Sat, 22 Oct 2016 18:48:14 GMT (1147kb,D)