

Title: Task-aware Retrieval with Instructions

Abstract: We study the problem of retrieval with instructions, where users of a retrieval system explicitly describe their intent along with their queries. We aim to develop a general-purpose task-aware retrieval system using multi-task instruction tuning, which can follow human-written instructions to find the best documents for a given query. We introduce the first large-scale collection of approximately 40 retrieval datasets with instructions, BERRI, and present TART, a multi-task retrieval system trained on BERRI with instructions. TART shows strong capabilities to adapt to a new retrieval task via instructions and advances the state of the art on two zero-shot retrieval benchmarks, BEIR and LOTTE, outperforming models up to three times larger. We further introduce a new evaluation setup, X^2-Retrieval, to better reflect real-world scenarios, where diverse domains and tasks are pooled and a system needs to find documents that align with users' intents. In this setup, TART significantly outperforms competitive baselines, further demonstrating the effectiveness of guiding retrieval with instructions.
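
To make the idea of retrieval with instructions concrete, here is a minimal sketch in Python. It is not the TART model: a toy bag-of-words cosine scorer stands in for a trained retriever, and the `[SEP]` separator, the function names (`build_input`, `retrieve`), and the two-document pool are all assumptions made for illustration. It only shows how the same query, paired with different instructions, can surface different documents from a pooled corpus.

```python
import math
from collections import Counter

SEP = " [SEP] "  # assumed separator between instruction and query (illustrative)

def build_input(instruction: str, query: str) -> str:
    """Combine the user's stated intent with the query text."""
    return instruction + SEP + query

def bow(text: str) -> Counter:
    """Toy bag-of-words representation: lowercase, whitespace tokens."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(instruction: str, query: str, docs: list[str]) -> list[str]:
    """Rank documents by similarity to the instruction-augmented query."""
    q = bow(build_input(instruction, query))
    return sorted(docs, key=lambda d: cosine(q, bow(d)), reverse=True)

# A pooled corpus mixing two kinds of documents (hypothetical examples).
code_doc = "python code snippet that sorts the list with sorted"
wiki_doc = "wikipedia paragraph about the history of sorting algorithms"
pool = [code_doc, wiki_doc]

query = "sorting a list"
top_for_code = retrieve("retrieve a python code snippet", query, pool)[0]
top_for_wiki = retrieve("retrieve a wikipedia paragraph", query, pool)[0]
```

In this sketch the instruction changes the ranking only through lexical overlap; a trained system would instead learn to interpret the instruction, which is what the abstract attributes to multi-task instruction tuning.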
Comments: Code, data and pretrained model checkpoints are available at this https URL
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2211.09260 [cs.CL]
  (or arXiv:2211.09260v2 [cs.CL] for this version)

Submission history

From: Akari Asai
[v1] Wed, 16 Nov 2022 23:13:22 GMT (1586kb,D)
[v2] Mon, 19 Dec 2022 20:50:09 GMT (1272kb,D)
