Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems

Zhou, Chang; Ma, Jianxin; Zhang, Jianwei; Zhou, Jingren; Yang, Hongxia

(Submitted on 20 May 2020 (v1), last revised 4 Jun 2021 (this version, v9))

Abstract: Deep candidate generation (DCG), which narrows down the collection of relevant items from billions to hundreds via representation learning, has become prevalent in industrial recommender systems. Standard approaches approximate maximum likelihood estimation (MLE) through sampling for better scalability and address the problem of DCG in a way similar to language modeling. However, live recommender systems face severe exposure bias and have a vocabulary several orders of magnitude larger than that of natural language, implying that MLE will preserve and even exacerbate the exposure bias in the long run in order to faithfully fit the observed samples. In this paper, we theoretically prove that a popular choice of contrastive loss is equivalent to reducing the exposure bias via inverse propensity weighting, which provides a new perspective for understanding the effectiveness of contrastive learning. Based on this theoretical discovery, we design CLRec, a contrastive learning method to improve DCG in terms of fairness, effectiveness and efficiency in recommender systems with extremely large candidate sizes. We further improve upon CLRec and propose Multi-CLRec, for accurate multi-intention aware bias reduction. Our methods have been successfully deployed in Taobao, where online A/B tests running for at least four months and offline analyses demonstrate their substantial improvements, including a dramatic reduction in the Matthew effect.
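
The abstract does not say which contrastive loss it analyses, so the following is only an illustrative sketch of one common form: an InfoNCE-style loss over a batch of (user, clicked item) pairs, where the other items in the batch serve as negatives. The function name, the temperature value and the use of cosine similarity are assumptions made for this example and are not taken from the paper. Because in-batch negatives come from logged interactions, frequently exposed items appear as negatives more often, which gives some intuition for why such a loss might counteract exposure bias; the formal link to inverse propensity weighting is the paper's claim and is not demonstrated here.

```python
import numpy as np

def in_batch_contrastive_loss(user_emb, item_emb, temperature=0.1):
    """Mean InfoNCE-style loss (hypothetical helper, not the paper's code).

    Row i of item_emb is the positive item for row i of user_emb;
    every other row in the batch is treated as a negative.
    """
    # L2-normalise so that scores are cosine similarities.
    u = user_emb / np.linalg.norm(user_emb, axis=1, keepdims=True)
    v = item_emb / np.linalg.norm(item_emb, axis=1, keepdims=True)

    # Pairwise scores, shape (batch, batch); temperature is an assumed value.
    logits = (u @ v.T) / temperature

    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # The positive for each user sits on the diagonal.
    return float(-np.mean(np.diag(log_prob)))

# Toy check: matched pairs give a low loss, mismatched pairs a high one.
users = np.eye(4)
loss_aligned = in_batch_contrastive_loss(users, np.eye(4))
loss_shuffled = in_batch_contrastive_loss(users, np.roll(np.eye(4), 1, axis=0))
```

In this toy setup each user embedding is orthogonal to the others, so pairing every user with its own item yields a loss close to zero, while shifting the items by one row makes every positive score lower than one of the negatives and the loss becomes large.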

Comments:	Accepted by the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)
Subjects:	Information Retrieval (cs.IR); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
Cite as:	arXiv:2005.12964 [cs.IR]
	(or arXiv:2005.12964v9 [cs.IR] for this version)

Submission history

From: Jianxin Ma [view email]
[v1] Wed, 20 May 2020 08:15:23 GMT (4421kb,D)
[v2] Sun, 31 May 2020 17:46:41 GMT (4392kb,D)
[v3] Tue, 2 Jun 2020 09:21:25 GMT (4374kb,D)
[v4] Fri, 5 Jun 2020 17:15:04 GMT (4401kb,D)
[v5] Wed, 10 Jun 2020 14:32:52 GMT (4416kb,D)
[v6] Thu, 11 Jun 2020 12:29:48 GMT (4416kb,D)
[v7] Thu, 18 Feb 2021 07:41:38 GMT (4667kb,D)
[v8] Wed, 19 May 2021 08:14:17 GMT (4661kb,D)
[v9] Fri, 4 Jun 2021 16:34:46 GMT (4796kb,D)
