Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning

Chen, Jerry Zikun; Yu, Shi; Wang, Haoran

Authors: Jerry Zikun Chen, Shi Yu, Haoran Wang

(Submitted on 18 Dec 2020 (v1), last revised 4 Jul 2021 (this version, v2))

Abstract: Query reformulation aims to alter noisy or ambiguous text sequences into coherent ones that are closer to natural language questions. This prevents errors from propagating in a client-facing pipeline and promotes better communication with users. It is also crucial to maintain performance in downstream environments such as question answering (QA) when rephrased queries are given as input. We show that under the previous framework (AQA), attempts to alter RL algorithms do not bring significant benefits to either reward acquisition or sequence fluency. Instead, we leverage a query-reformulating text-to-text transformer (QRT5) and apply policy-based RL algorithms to further nudge this reformulator and obtain better answers downstream by generating reward-acquiring query trajectories. QRT5 shows better sample efficiency in RL when reaching the same level of QA performance as the previous approach. According to query well-formedness evaluations, it generates more readable reformulations, and it can generalize to out-of-sample data. Our framework is demonstrated to be flexible, allowing reward signals to be sourced from different downstream environments such as intent classification.

Comments:	Workshop on the 9th Dialog System Technology Challenge (DSTC-9), AAAI 2021
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2012.10033 [cs.CL]
	(or arXiv:2012.10033v2 [cs.CL] for this version)

Submission history

From: Jerry Zikun Chen
[v1] Fri, 18 Dec 2020 03:16:37 GMT (204kb,D)
[v2] Sun, 4 Jul 2021 01:08:13 GMT (210kb,D)
