We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: Importance Estimation from Multiple Perspectives for Keyphrase Extraction

Abstract: Keyphrase extraction is a fundamental task in Natural Language Processing, which usually contains two main parts: candidate keyphrase extraction and keyphrase importance estimation. From the view of human understanding documents, we typically measure the importance of phrase according to its syntactic accuracy, information saliency, and concept consistency simultaneously. However, most existing keyphrase extraction approaches only focus on the part of them, which leads to biased results. In this paper, we propose a new approach to estimate the importance of keyphrase from multiple perspectives (called as \textit{KIEMP}) and further improve the performance of keyphrase extraction. Specifically, \textit{KIEMP} estimates the importance of phrase with three modules: a chunking module to measure its syntactic accuracy, a ranking module to check its information saliency, and a matching module to judge the concept (i.e., topic) consistency between phrase and the whole document. These three modules are seamlessly jointed together via an end-to-end multi-task learning model, which is helpful for three parts to enhance each other and balance the effects of three perspectives. Experimental results on six benchmark datasets show that \textit{KIEMP} outperforms the existing state-of-the-art keyphrase extraction approaches in most cases.
Comments: 11 pages, 2 figures, Accepted by EMNLP 2021 (main conference)
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
Cite as: arXiv:2110.09749 [cs.CL]
  (or arXiv:2110.09749v4 [cs.CL] for this version)

Submission history

From: Mingyang Song [view email]
[v1] Tue, 19 Oct 2021 05:48:22 GMT (409kb,D)
[v2] Fri, 22 Oct 2021 14:41:35 GMT (406kb,D)
[v3] Tue, 9 Nov 2021 03:16:07 GMT (404kb,D)
[v4] Thu, 11 Nov 2021 03:35:01 GMT (406kb,D)

Link back to: arXiv, form interface, contact.