Title: Sequence-to-Sequence Learning on Keywords for Efficient FAQ Retrieval

Abstract: Frequently-Asked-Question (FAQ) retrieval provides an effective procedure for responding to users' natural-language queries. Such platforms are becoming common in enterprise chatbots, product question answering, and preliminary technical support for customers. However, the challenge in such scenarios lies in bridging the lexical and semantic gap between varied query formulations and the corresponding answers, both of which are typically very short. This paper proposes TI-S2S, a novel learning framework that combines TF-IDF-based keyword extraction and Word2Vec embeddings to train a Sequence-to-Sequence (Seq2Seq) architecture. It achieves high precision for FAQ retrieval by better capturing the underlying intent of a user question through its representative keywords. We further propose a variant with an additional neural network module that guides retrieval by identifying relevant candidates based on similarity features. Experiments on a publicly available dataset show that our approaches achieve around 92% precision-at-rank-5, a nearly 13% improvement over existing approaches.
Comments: 6 pages
Subjects: Information Retrieval (cs.IR)
Journal reference: Published at the IJCAI 2021 Workshop on Applied Semantics Extraction and Analytics (ASEA)
Cite as: arXiv:2108.10019 [cs.IR]
  (or arXiv:2108.10019v1 [cs.IR] for this version)
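
The abstract mentions TF-IDF based keyword extraction as the first stage of TI-S2S. As a rough illustration only, the sketch below ranks the tokens of each question by a smoothed TF-IDF score and keeps the top few as keywords. It is not the authors' implementation: the function names `tokenize` and `tfidf_keywords`, the `top_k` parameter, the regex tokenizer, and the exact smoothing formula are all assumptions made for this example, and the Word2Vec and Seq2Seq stages are not shown.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase, keep alphanumeric runs only.
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_keywords(questions, top_k=3):
    """Return the top_k highest-scoring TF-IDF tokens for each question.

    Assumed scoring (not taken from the paper):
    tf = count / question length, idf = ln((1 + N) / (1 + df)) + 1.
    """
    docs = [tokenize(q) for q in questions]
    n_docs = len(docs)

    # Document frequency: number of questions containing each token.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    keywords = []
    for tokens in docs:
        tf = Counter(tokens)
        scores = {
            tok: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + df[tok])) + 1)
            for tok, count in tf.items()
        }
        # Sort by descending score; break ties alphabetically for determinism.
        ranked = sorted(scores, key=lambda tok: (-scores[tok], tok))
        keywords.append(ranked[:top_k])
    return keywords


if __name__ == "__main__":
    faq = [
        "How do I reset my password?",
        "How do I change my email address?",
        "How do I delete my account?",
    ]
    for question, kws in zip(faq, tfidf_keywords(faq, top_k=2)):
        print(question, "->", kws)
```

Tokens shared by every question ("how", "do", "i", "my") receive the lowest scores, so the surviving keywords are the words that distinguish one question's intent from another. That is the general property a keyword-driven retrieval model would rely on.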

Submission history

From: Sourav Dutta
[v1] Mon, 23 Aug 2021 09:11:33 GMT (36kb,D)
