Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: MedDialog: Two Large-scale Medical Dialogue Datasets
(Submitted on 7 Apr 2020 (v1), last revised 7 Jul 2020 (this version, v2))
Abstract: Medical dialogue systems are promising in assisting in telemedicine to increase access to healthcare services, improve the quality of patient care, and reduce medical costs. To facilitate the research and development of medical dialogue systems, we build two large-scale medical dialogue datasets: MedDialog-EN and MedDialog-CN. MedDialog-EN is an English dataset containing 0.3 million conversations between patients and doctors and 0.5 million utterances. MedDialog-CN is an Chinese dataset containing 1.1 million conversations and 4 million utterances. To our best knowledge, MedDialog-(EN,CN) are the largest medical dialogue datasets to date. The dataset is available at this https URL
Submission history
From: Pengtao Xie [view email][v1] Tue, 7 Apr 2020 13:07:09 GMT (7712kb,D)
[v2] Tue, 7 Jul 2020 22:15:10 GMT (3912kb,D)
Link back to: arXiv, form interface, contact.