Computer Science > Machine Learning
Title: Competing Bandits in Matching Markets
(Submitted on 12 Jun 2019 (v1), last revised 12 Jul 2020 (this version, v2))
Abstract: Stable matching, a classical model for two-sided markets, has long been studied with little consideration for how each side's preferences are learned. With the advent of massive online markets powered by data-driven matching platforms, it has become necessary to better understand the interplay between learning and market objectives. We propose a statistical learning model in which one side of the market does not have a priori knowledge about its preferences for the other side and is required to learn these from stochastic rewards. Our model extends the standard multi-armed bandits framework to multiple players, with the added feature that arms have preferences over players. We study both centralized and decentralized approaches to this problem and show surprising exploration-exploitation trade-offs compared to the single player multi-armed bandits setting.
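The setting described in the abstract can be illustrated with a small sketch: players rank arms by their current reward estimates, arms hold fixed preferences over players, and a platform computes a stable matching with the classical player-proposing deferred acceptance (Gale-Shapley) procedure. This is a minimal illustration under stated assumptions, not the algorithm from the paper; the function names (`prefs_from_estimates`, `deferred_acceptance`) and the example numbers are hypothetical.

```python
from typing import Dict, List


def prefs_from_estimates(estimates: Dict[str, Dict[str, float]]) -> Dict[str, List[str]]:
    """Turn each player's estimated mean rewards into a ranking of arms (best first)."""
    return {
        player: sorted(means, key=means.get, reverse=True)
        for player, means in estimates.items()
    }


def deferred_acceptance(
    player_prefs: Dict[str, List[str]],
    arm_prefs: Dict[str, List[str]],
) -> Dict[str, str]:
    """Player-proposing Gale-Shapley; returns a stable matching as player -> arm."""
    # rank[arm][player]: position of the player in the arm's list (lower is better).
    rank = {arm: {p: i for i, p in enumerate(prefs)} for arm, prefs in arm_prefs.items()}
    next_choice = {player: 0 for player in player_prefs}  # next arm index to propose to
    held: Dict[str, str] = {}  # arm -> player it currently holds
    free = list(player_prefs)

    while free:
        player = free.pop()
        if next_choice[player] >= len(player_prefs[player]):
            continue  # player has exhausted its list and stays unmatched
        arm = player_prefs[player][next_choice[player]]
        next_choice[player] += 1
        current = held.get(arm)
        if current is None:
            held[arm] = player  # arm was free, so it tentatively accepts
        elif rank[arm][player] < rank[arm][current]:
            held[arm] = player  # arm prefers the new proposer
            free.append(current)
        else:
            free.append(player)  # arm rejects the proposer

    return {player: arm for arm, player in held.items()}


# Hypothetical example: both players currently estimate arm a1 to be best,
# but a1 prefers p2, so p1 is matched to a2.
estimates = {"p1": {"a1": 0.9, "a2": 0.5}, "p2": {"a1": 0.8, "a2": 0.3}}
arm_prefs = {"a1": ["p2", "p1"], "a2": ["p1", "p2"]}
matching = deferred_acceptance(prefs_from_estimates(estimates), arm_prefs)
print(matching)
```

In a learning loop the estimates would be updated from observed stochastic rewards after each round and the matching recomputed; that update rule is deliberately left out here because the abstract does not specify it.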
Submission history
From: Lydia T. Liu
[v1] Wed, 12 Jun 2019 20:04:25 GMT (711kb,D)
[v2] Sun, 12 Jul 2020 21:48:30 GMT (540kb,D)