Efficient Neural Query Auto Completion

Wang, Sida; Guo, Weiwei; Gao, Huiji; Long, Bo

doi:10.1145/3340531.3412701

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2008

Change to browse by:

Computer Science > Computation and Language

Title: Efficient Neural Query Auto Completion

Authors: Sida Wang, Weiwei Guo, Huiji Gao, Bo Long

(Submitted on 6 Aug 2020)

Abstract: Query Auto Completion (QAC), as the starting point of information retrieval tasks, is critical to user experience. Generally it has two steps: generating completed query candidates according to query prefixes, and ranking them based on extracted features. Three major challenges are observed for a query auto completion system: (1) QAC has a strict online latency requirement. For each keystroke, results must be returned within tens of milliseconds, which poses a significant challenge in designing sophisticated language models for it. (2) For unseen queries, generated candidates are of poor quality as contextual information is not fully utilized. (3) Traditional QAC systems heavily rely on handcrafted features such as the query candidate frequency in search logs, lacking sufficient semantic understanding of the candidate.
In this paper, we propose an efficient neural QAC system with effective context modeling to overcome these challenges. On the candidate generation side, this system uses as much information as possible in unseen prefixes to generate relevant candidates, increasing the recall by a large margin. On the candidate ranking side, an unnormalized language model is proposed, which effectively captures deep semantics of queries. This approach presents better ranking performance over state-of-the-art neural ranking methods and reduces $\sim$95\% latency compared to neural language modeling methods. The empirical results on public datasets show that our model achieves a good balance between accuracy and efficiency. This system is served in LinkedIn job search with significant product impact observed.

Comments:	Accepted at CIKM 2020
Subjects:	Computation and Language (cs.CL)
DOI:	10.1145/3340531.3412701
Cite as:	arXiv:2008.02879 [cs.CL]
	(or arXiv:2008.02879v1 [cs.CL] for this version)

Submission history

From: Sida Wang [view email]
[v1] Thu, 6 Aug 2020 21:28:36 GMT (101kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2008.02879

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Efficient Neural Query Auto Completion

Submission history