CAN: Feature Co-Action for Click-Through Rate Prediction

Bian, Weijie; Wu, Kailun; Ren, Lejian; Pi, Qi; Zhang, Yujing; Xiao, Can; Sheng, Xiang-Rong; Zhu, Yong-Nan; Chan, Zhangming; Mou, Na; Luo, Xinchen; Xiang, Shiming; Zhou, Guorui; Zhu, Xiaoqiang; Deng, Hongbo

(Submitted on 11 Nov 2020 (v1), last revised 7 Dec 2021 (this version, v3))

Abstract: Feature interaction is a long-recognized problem in machine learning and is essential for click-through rate (CTR) prediction. In recent years, Deep Neural Networks (DNNs), which can automatically learn implicit nonlinear interactions from raw sparse features, have been widely used in industrial CTR prediction tasks. However, the implicit feature interactions learned by DNNs cannot fully retain the representation capacity of the original, empirically effective feature interactions (e.g., the cartesian product). For example, simply learning the combination of feature A and feature B, <A, B>, as a new feature through an explicit cartesian product representation can outperform previous implicit feature interaction models, including factorization machine (FM)-based models and their variants. In this paper, we propose a Co-Action Network (CAN) to approximate explicit pairwise feature interactions without introducing too many additional parameters. More specifically, given feature A and its associated feature B, their interaction is modeled by learning two sets of parameters: 1) the embedding of feature A, and 2) a Multi-Layer Perceptron (MLP) that represents feature B. The approximated feature interaction is obtained by passing the embedding of feature A through the MLP of feature B. We refer to such a pairwise feature interaction as feature co-action; a Co-Action Network unit of this kind provides a powerful capacity for fitting complex feature interactions. Experimental results on public and industrial datasets show that CAN outperforms state-of-the-art CTR models and the cartesian product method. Moreover, CAN has been deployed in the display advertisement system at Alibaba, obtaining a 12% improvement in CTR and 8% in Revenue Per Mille (RPM), a substantial improvement for the business.
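
The co-action idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `co_action`, the vocabulary sizes, the layer shapes, and the `tanh` activation are all assumptions chosen for illustration. Each value of feature B owns a flat parameter vector that is reshaped into the weights and biases of a small MLP, and the embedding of feature A is passed through that MLP. The last lines compare the parameter count with a worst-case cartesian product table that stores one embedding per (A, B) pair.

```python
import numpy as np

def co_action(embedding_a, mlp_params_b, layer_dims):
    """Pass feature A's embedding through a small MLP whose weights belong to feature B.

    layer_dims is a list of (in_dim, out_dim) pairs (hypothetical shapes).
    mlp_params_b is a flat vector holding every layer's weights and biases.
    """
    h = embedding_a
    offset = 0
    for in_dim, out_dim in layer_dims:
        # Slice this layer's weight matrix out of B's flat parameter vector.
        w = mlp_params_b[offset:offset + in_dim * out_dim].reshape(in_dim, out_dim)
        offset += in_dim * out_dim
        # Slice this layer's bias vector.
        b = mlp_params_b[offset:offset + out_dim]
        offset += out_dim
        h = np.tanh(h @ w + b)  # activation choice is an assumption
    return h

rng = np.random.default_rng(0)

# Hypothetical vocabulary sizes and dimensions.
num_a, num_b = 1000, 500
emb_dim = 4
layer_dims = [(4, 4), (4, 2)]
params_per_b = sum(i * o + o for i, o in layer_dims)  # weights + biases per B value

table_a = rng.normal(size=(num_a, emb_dim))       # one embedding per value of A
table_b = rng.normal(size=(num_b, params_per_b))  # one MLP per value of B

# Co-action output for the pair (A = 17, B = 42).
out = co_action(table_a[17], table_b[42], layer_dims)

# Parameter counts: co-action tables vs. one embedding per (A, B) combination.
co_action_params = table_a.size + table_b.size
cartesian_params = num_a * num_b * emb_dim
```

Under these assumed sizes the two co-action tables grow additively with the vocabularies of A and B, whereas a full cartesian product table grows with their product, which is the sense in which the abstract speaks of avoiding too many additional parameters.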

Comments:	WSDM 2022
Subjects:	Information Retrieval (cs.IR); Machine Learning (stat.ML)
MSC classes:	Machine Learning (stat.ML), Information Retrieval (cs.IR), Machine Learning (cs.LG)
ACM classes:	I.2.6
Cite as:	arXiv:2011.05625 [cs.IR]
	(or arXiv:2011.05625v3 [cs.IR] for this version)

Submission history

From: Guorui Zhou
[v1] Wed, 11 Nov 2020 08:33:07 GMT (256kb,D)
[v2] Mon, 6 Dec 2021 08:21:04 GMT (1243kb,D)
[v3] Tue, 7 Dec 2021 06:16:07 GMT (1243kb,D)
