Revisiting Automatic Question Summarization Evaluation in the Biomedical Domain

Yuan, Hongyi; Zhang, Yaoyun; Huang, Fei; Huang, Songfang

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2303

Change to browse by:

Computer Science > Computation and Language

Title: Revisiting Automatic Question Summarization Evaluation in the Biomedical Domain

Authors: Hongyi Yuan, Yaoyun Zhang, Fei Huang, Songfang Huang

(Submitted on 18 Mar 2023)

Abstract: Automatic evaluation metrics have been facilitating the rapid development of automatic summarization methods by providing instant and fair assessments of the quality of summaries. Most metrics have been developed for the general domain, especially news and meeting notes, or other language-generation tasks. However, these metrics are applied to evaluate summarization systems in different domains, such as biomedical question summarization. To better understand whether commonly used evaluation metrics are capable of evaluating automatic summarization in the biomedical domain, we conduct human evaluations of summarization quality from four different aspects of a biomedical question summarization task. Based on human judgments, we identify different noteworthy features for current automatic metrics and summarization systems as well. We also release a dataset of our human annotations to aid the research of summarization evaluation metrics in the biomedical domain.

Comments:	8 pages, 1 figure
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2303.10328 [cs.CL]
	(or arXiv:2303.10328v1 [cs.CL] for this version)

Submission history

From: Hongyi Yuan [view email]
[v1] Sat, 18 Mar 2023 04:28:01 GMT (159kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2303.10328

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Revisiting Automatic Question Summarization Evaluation in the Biomedical Domain

Submission history