We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Machine Learning

Title: Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms

Abstract: On typical modern platforms, users are only able to try a small fraction of the available items. This makes it difficult to model the exploration behavior of platform users as typical online learners who explore all the items. Towards addressing this issue, we propose to interpret a recommender system as a bandit exploration coordinator that provides counterfactual information updates. In particular, we introduce a novel algorithm called Counterfactual UCB (CFUCB) which is guarantees user exploration coordination with bounded regret under the presence of linear representations. Our results show that sharing information is a Subgame Perfect Nash Equilibrium for agents in terms of regret, leading to each agent achieving bounded regret. This approach has potential applications in personalized recommender systems and adaptive experimentation.
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Information Retrieval (cs.IR); General Economics (econ.GN)
Cite as: arXiv:2301.12571 [cs.LG]
  (or arXiv:2301.12571v1 [cs.LG] for this version)

Submission history

From: Hyunwook Kang [view email]
[v1] Sun, 29 Jan 2023 22:39:50 GMT (63kb,D)

Link back to: arXiv, form interface, contact.