References & Citations
Computer Science > Computation and Language
Title: PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
(Submitted on 30 Jun 2020 (v1), last revised 28 May 2021 (this version, v4))
Abstract: To build a high-quality open-domain chatbot, we introduce the effective training process of PLATO-2 via curriculum learning. There are two stages involved in the learning process. In the first stage, a coarse-grained generation model is trained to learn response generation under the simplified framework of one-to-one mapping. In the second stage, a fine-grained generative model augmented with latent variables and an evaluation model are further trained to generate diverse responses and to select the best response, respectively. PLATO-2 was trained on both Chinese and English data, whose effectiveness and superiority are verified through comprehensive evaluations, achieving new state-of-the-art results.
Submission history
From: Siqi Bao [view email][v1] Tue, 30 Jun 2020 13:36:10 GMT (905kb,D)
[v2] Mon, 6 Jul 2020 11:39:49 GMT (904kb,D)
[v3] Mon, 13 Jul 2020 11:24:03 GMT (906kb,D)
[v4] Fri, 28 May 2021 11:20:24 GMT (1557kb,D)
Link back to: arXiv, form interface, contact.