Benchmarking Automated Clinical Language Simplification: Dataset, Algorithm, and Evaluation

Luo, Junyu; Zheng, Zifei; Ye, Hanzhong; Ye, Muchao; Wang, Yaqing; You, Quanzeng; Xiao, Cao; Ma, Fenglong

(Submitted on 4 Dec 2020 (v1), last revised 21 Sep 2023 (this version, v2))

Abstract: Patients with low health literacy usually have difficulty understanding medical jargon and the complex structure of professional medical language. Although several studies have proposed methods to automatically translate expert language into layperson-understandable language, only a few of them address both accuracy and readability in the clinical domain. Simplification of clinical language therefore remains a challenging task that previous work has not fully addressed. To benchmark this task, we construct a new dataset named MedLane to support the development and evaluation of automated clinical language simplification approaches. In addition, we propose a new model called DECLARE that follows the human annotation procedure and achieves state-of-the-art performance compared with eight strong baselines. To evaluate performance fairly, we also propose three specific evaluation metrics. Experimental results demonstrate the utility of the annotated MedLane dataset and the effectiveness of the proposed DECLARE model.

Comments:	COLING 2022
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Journal reference:	2022.coling-1.313
Cite as:	arXiv:2012.02420 [cs.CL]
	(or arXiv:2012.02420v2 [cs.CL] for this version)

Submission history

From: Junyu Luo [view email]
[v1] Fri, 4 Dec 2020 06:09:02 GMT (1454kb,D)
[v2] Thu, 21 Sep 2023 20:53:33 GMT (10237kb,D)
