We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning

Abstract: To build a high-quality open-domain chatbot, we introduce the effective training process of PLATO-2 via curriculum learning. There are two stages involved in the learning process. In the first stage, a coarse-grained generation model is trained to learn response generation under the simplified framework of one-to-one mapping. In the second stage, a fine-grained generative model augmented with latent variables and an evaluation model are further trained to generate diverse responses and to select the best response, respectively. PLATO-2 was trained on both Chinese and English data, whose effectiveness and superiority are verified through comprehensive evaluations, achieving new state-of-the-art results.
Comments: Findings of ACL 2021. First four authors contributed equally to this work
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2006.16779 [cs.CL]
  (or arXiv:2006.16779v4 [cs.CL] for this version)

Submission history

From: Siqi Bao [view email]
[v1] Tue, 30 Jun 2020 13:36:10 GMT (905kb,D)
[v2] Mon, 6 Jul 2020 11:39:49 GMT (904kb,D)
[v3] Mon, 13 Jul 2020 11:24:03 GMT (906kb,D)
[v4] Fri, 28 May 2021 11:20:24 GMT (1557kb,D)

Link back to: arXiv, form interface, contact.