MS-Shift: An Analysis of MS MARCO Distribution Shifts on Neural Retrieval

Lupart, Simon; Formal, Thibault; Clinchant, Stéphane

Full-text links:

Download:

Current browse context:

cs.IR

< prev | next >

new | recent | 2205

Change to browse by:

Computer Science > Information Retrieval

Title: MS-Shift: An Analysis of MS MARCO Distribution Shifts on Neural Retrieval

Authors: Simon Lupart, Thibault Formal, Stéphane Clinchant

(Submitted on 5 May 2022 (v1), last revised 25 Jan 2023 (this version, v2))

Abstract: Pre-trained Language Models have recently emerged in Information Retrieval as providing the backbone of a new generation of neural systems that outperform traditional methods on a variety of tasks. However, it is still unclear to what extent such approaches generalize in zero-shot conditions. The recent BEIR benchmark provides partial answers to this question by comparing models on datasets and tasks that differ from the training conditions. We aim to address the same question by comparing models under more explicit distribution shifts. To this end, we build three query-based distribution shifts within MS MARCO (query-semantic, query-intent, query-length), which are used to evaluate the three main families of neural retrievers based on BERT: sparse, dense, and late-interaction -- as well as a monoBERT re-ranker. We further analyse the performance drops between the train and test query distributions. In particular, we experiment with two generalization indicators: the first one based on train/test query vocabulary overlap, and the second based on representations of a trained bi-encoder. Intuitively, those indicators verify that the further away the test set is from the train one, the worse the drop in performance. We also show that models respond differently to the shifts -- dense approaches being the most impacted. Overall, our study demonstrates that it is possible to design more controllable distribution shifts as a tool to better understand generalization of IR models. Finally, we release the MS MARCO query subsets, which provide an additional resource to benchmark zero-shot transfer in Information Retrieval.

Comments:	Accepted at ECIR 2023
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2205.02870 [cs.IR]
	(or arXiv:2205.02870v2 [cs.IR] for this version)

Submission history

From: Simon Lupart [view email]
[v1] Thu, 5 May 2022 18:13:06 GMT (1029kb,D)
[v2] Wed, 25 Jan 2023 13:00:52 GMT (746kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.02870

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Information Retrieval

Title: MS-Shift: An Analysis of MS MARCO Distribution Shifts on Neural Retrieval

Submission history