Incentivising Exploration and Recommendations for Contextual Bandits with Payments

Agrawal, Priyank; Tulabandhula, Theja

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2001

Computer Science > Machine Learning

Title: Incentivising Exploration and Recommendations for Contextual Bandits with Payments

Authors: Priyank Agrawal, Theja Tulabandhula

(Submitted on 22 Jan 2020)

Abstract: We propose a contextual bandit based model to capture the learning and social welfare goals of a web platform in the presence of myopic users. By using payments to incentivize these agents to explore different items/recommendations, we show how the platform can learn the inherent attributes of items and achieve a sublinear regret while maximizing cumulative social welfare. We also calculate theoretical bounds on the cumulative costs of incentivization to the platform. Unlike previous works in this domain, we consider contexts to be completely adversarial, and the behavior of the adversary is unknown to the platform. Our approach can improve various engagement metrics of users on e-commerce stores, recommendation engines and matching platforms.

Comments:	11 pages, 4 figures
Subjects:	Machine Learning (cs.LG); Information Retrieval (cs.IR); Machine Learning (stat.ML)
Cite as:	arXiv:2001.07853 [cs.LG]
	(or arXiv:2001.07853v1 [cs.LG] for this version)

Submission history

From: Priyank Agrawal [view email]
[v1] Wed, 22 Jan 2020 02:26:22 GMT (99kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.07853

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Incentivising Exploration and Recommendations for Contextual Bandits with Payments

Submission history