We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: Contextualized Query Embeddings for Conversational Search

Abstract: This paper describes a compact and effective model for low-latency passage retrieval in conversational search based on learned dense representations. Prior to our work, the state-of-the-art approach uses a multi-stage pipeline comprising conversational query reformulation and information retrieval modules. Despite its effectiveness, such a pipeline often includes multiple neural models that require long inference times. In addition, independently optimizing each module ignores dependencies among them. To address these shortcomings, we propose to integrate conversational query reformulation directly into a dense retrieval model. To aid in this goal, we create a dataset with pseudo-relevance labels for conversational search to overcome the lack of training data and to explore different training strategies. We demonstrate that our model effectively rewrites conversational queries as dense representations in conversational search and open-domain question answering datasets. Finally, after observing that our model learns to adjust the $L_2$ norm of query token embeddings, we leverage this property for hybrid retrieval and to support error analysis.
Comments: Published in EMNLP 2021
Subjects: Information Retrieval (cs.IR)
Cite as: arXiv:2104.08707 [cs.IR]
  (or arXiv:2104.08707v2 [cs.IR] for this version)

Submission history

From: Sheng-Chieh Lin [view email]
[v1] Sun, 18 Apr 2021 04:29:01 GMT (1408kb,D)
[v2] Fri, 26 Nov 2021 23:34:23 GMT (551kb,D)

Link back to: arXiv, form interface, contact.