Automatic Expert Selection for Multi-Scenario and Multi-Task Search

Zou, Xinyu; Hu, Zhi; Zhao, Yiming; Ding, Xuchu; Liu, Zhongyi; Li, Chenliang; Sun, Aixin

doi:10.1145/3477495.3531942

(Submitted on 28 May 2022 (v1), last revised 6 Jun 2022 (this version, v2))

Abstract: Multi-scenario learning (MSL) enables a service provider to cater for users' fine-grained demands by separating services for different user sectors, e.g., by users' geographical region. Under each scenario, there is a need to optimize multiple task-specific targets, e.g., click-through rate and conversion rate; this is known as multi-task learning (MTL). Recent solutions for MSL and MTL are mostly based on the multi-gate mixture-of-experts (MMoE) architecture. The MMoE structure is typically static and its design requires domain-specific knowledge, making it less effective in handling both MSL and MTL. In this paper, we propose a novel Automatic Expert Selection framework for Multi-scenario and Multi-task search, named AESM^{2}. AESM^{2} integrates both MSL and MTL into a unified framework with automatic structure learning. Specifically, AESM^{2} stacks multi-task layers over multi-scenario layers. This hierarchical design makes it possible to flexibly establish intrinsic connections between different scenarios, while also supporting high-level feature extraction for different tasks. At each multi-scenario/multi-task layer, a novel expert selection algorithm automatically identifies scenario-/task-specific and shared experts for each input. Experiments on two real-world large-scale datasets demonstrate the effectiveness of AESM^{2} over a battery of strong baselines. An online A/B test also shows substantial performance gains on multiple metrics. AESM^{2} is currently deployed online to serve major traffic.
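For readers unfamiliar with the MMoE architecture that the abstract names as the common baseline, the following is a minimal sketch of a generic MMoE layer: a set of shared experts, with one softmax gate per task that mixes the expert outputs. It is not the AESM^{2} expert selection algorithm, whose details the abstract does not give. All function names, layer sizes, the ReLU activation, and the random initialization are illustrative assumptions.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Multiply a weight matrix (list of rows) by a vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def random_matrix(rows, cols, rng):
    """Hypothetical initializer: small uniform random weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def mmoe_forward(x, expert_weights, gate_weights):
    """Generic MMoE layer: shared experts, one softmax gate per task.

    Returns (task_outputs, gate_probs), where task_outputs[t] is the
    gate-weighted sum of all expert outputs for task t.
    """
    # Every expert sees the same input; ReLU is an assumed activation.
    expert_outs = [[max(0.0, h) for h in linear(w, x)] for w in expert_weights]
    task_outputs, gate_probs = [], []
    for gate in gate_weights:
        probs = softmax(linear(gate, x))  # one mixing weight per expert
        mixed = [
            sum(p * out[d] for p, out in zip(probs, expert_outs))
            for d in range(len(expert_outs[0]))
        ]
        task_outputs.append(mixed)
        gate_probs.append(probs)
    return task_outputs, gate_probs


rng = random.Random(0)
n_experts, in_dim, hidden_dim, n_tasks = 3, 4, 2, 2  # illustrative sizes
experts = [random_matrix(hidden_dim, in_dim, rng) for _ in range(n_experts)]
gates = [random_matrix(n_experts, in_dim, rng) for _ in range(n_tasks)]
x = [0.2, -0.1, 0.4, 0.3]  # made-up input features
outputs, probs = mmoe_forward(x, experts, gates)
```

In this sketch every task draws on every expert with a nonzero weight, and the number of experts and gates is fixed by hand. That fixed, hand-designed structure is what the abstract describes as static.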

Comments:	Accepted by SIGIR 2022; 10 pages, 8 figures
Subjects:	Machine Learning (cs.LG); Information Retrieval (cs.IR)
DOI:	10.1145/3477495.3531942
Cite as:	arXiv:2205.14321 [cs.LG]
	(or arXiv:2205.14321v2 [cs.LG] for this version)

Submission history

From: Zou Xinyu
[v1] Sat, 28 May 2022 03:41:25 GMT (5286kb,D)
[v2] Mon, 6 Jun 2022 09:13:41 GMT (5284kb,D)
