We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: Composite Re-Ranking for Efficient Document Search with BERT

Abstract: Although considerable efforts have been devoted to transformer-based ranking models for document search, the relevance-efficiency tradeoff remains a critical problem for ad-hoc ranking. To overcome this challenge, this paper presents BECR (BERT-based Composite Re-Ranking), a composite re-ranking scheme that combines deep contextual token interactions and traditional lexical term-matching features. In particular, BECR exploits a token encoding mechanism to decompose the query representations into pre-computable uni-grams and skip-n-grams. By applying token encoding on top of a dual-encoder architecture, BECR separates the attentions between a query and a document while capturing the contextual semantics of a query. In contrast to previous approaches, this framework does not perform expensive BERT computations during online inference. Thus, it is significantly faster, yet still able to achieve high competitiveness in ad-hoc ranking relevance. Finally, an in-depth comparison between BECR and other start-of-the-art neural ranking baselines is described using the TREC datasets, thereby further demonstrating the enhanced relevance and efficiency of BECR.
Comments: to be published in WSDM'22
Subjects: Information Retrieval (cs.IR)
DOI: 10.1145/3488560.3498495
Cite as: arXiv:2103.06499 [cs.IR]
  (or arXiv:2103.06499v4 [cs.IR] for this version)

Submission history

From: Yifan Qiao [view email]
[v1] Thu, 11 Mar 2021 06:52:29 GMT (4748kb,D)
[v2] Fri, 12 Mar 2021 04:54:15 GMT (0kb,I)
[v3] Sat, 17 Apr 2021 04:51:03 GMT (4749kb,D)
[v4] Thu, 6 Jan 2022 01:01:29 GMT (1196kb,D)

Link back to: arXiv, form interface, contact.