References & Citations
Computer Science > Computation and Language
Title: DialogSum Challenge: Results of the Dialogue Summarization Shared Task
(Submitted on 8 Aug 2022 (v1), revised 9 Aug 2022 (this version, v2), latest version 3 Sep 2022 (v3))
Abstract: We report the results of DialogSum Challenge, the shared task on summarizing real-life scenario dialogues at INLG 2022. Four teams participate in this shared task and three submit their system reports, exploring different methods to improve the performance of dialogue summarization. Although there is a great improvement over the baseline models regarding automatic evaluation metrics, such as Rouge scores, we find that there is a salient gap between model generated outputs and human annotated summaries by human evaluation from multiple aspects. These findings demonstrate the difficulty of dialogue summarization and suggest that more fine-grained evaluatuion metrics are in need.
Submission history
From: Naihao Deng [view email][v1] Mon, 8 Aug 2022 03:39:42 GMT (109kb,D)
[v2] Tue, 9 Aug 2022 02:28:02 GMT (109kb,D)
[v3] Sat, 3 Sep 2022 05:08:20 GMT (109kb,D)
Link back to: arXiv, form interface, contact.