An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition

Yan, Hang; Sun, Yu; Li, Xiaonan; Qiu, Xipeng

(Submitted on 9 Aug 2022 (v1), revised 19 Aug 2022 (this version, v2), latest version 15 Sep 2022 (v3))

Abstract: Named entity recognition (NER) is the task of detecting and classifying entity spans in text. When entity spans overlap with each other, the problem is called nested NER. Span-based methods have been widely used to tackle nested NER. Most of these methods produce an $n \times n$ score matrix, where $n$ is the sentence length and each entry corresponds to a span. However, previous work ignores spatial relations in the score matrix. In this paper, we propose using a Convolutional Neural Network (CNN) to model these spatial relations. Despite its simplicity, experiments on three commonly used nested NER datasets show that our model surpasses several recently proposed methods that use the same pre-trained encoders. Further analysis shows that the CNN helps the model find nested entities more accurately. In addition, we found that different papers use different sentence tokenizations for the three nested NER datasets, which affects the comparison. We therefore release a pre-processing script to facilitate future comparisons.
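
The idea of treating span scores as a two-dimensional grid can be illustrated with a small sketch. This is not the authors' implementation: the toy token vectors, the dot-product span scorer, the single scalar score per span, and the fixed 3x3 averaging kernel (standing in for a learned CNN layer) are all assumptions chosen for illustration, and the function names `span_score_matrix` and `conv2d_same` are hypothetical.

```python
# Illustrative sketch only; not the authors' implementation.
# Assumptions: toy 2-d token vectors, dot-product span scoring, and a
# fixed 3x3 averaging kernel standing in for a learned CNN layer.

def span_score_matrix(token_vecs):
    """Entry [i][j] scores the span that starts at token i and ends at token j."""
    n = len(token_vecs)
    scores = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):  # only spans with start <= end are filled in
            scores[i][j] = sum(a * b for a, b in zip(token_vecs[i], token_vecs[j]))
    return scores

def conv2d_same(matrix, kernel):
    """2-D cross-correlation with zero padding, so the output stays n x n."""
    n = len(matrix)
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            total = 0.0
            for di in range(k):
                for dj in range(k):
                    r, c = i + di - pad, j + dj - pad
                    if 0 <= r < n and 0 <= c < n:  # cells outside the grid count as zero
                        total += kernel[di][dj] * matrix[r][c]
            out[i][j] = total
    return out

# Four toy tokens give a 4 x 4 span score matrix.
tokens = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0], [1.0, 1.0]]
scores = span_score_matrix(tokens)

# Each refined score now mixes in the scores of neighbouring spans,
# i.e. spans whose start or end differs by at most one token.
kernel = [[1.0 / 9.0] * 3 for _ in range(3)]
refined = conv2d_same(scores, kernel)
```

In this sketch, neighbouring cells of the matrix correspond to spans that share or nearly share boundaries, which is one way to read the "spatial relations" the abstract refers to. A trained model would learn the kernel weights rather than fix them.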

Comments:	Updates for Genia dataset
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2208.04534 [cs.CL]
	(or arXiv:2208.04534v2 [cs.CL] for this version)

Submission history

From: Hang Yan
[v1] Tue, 9 Aug 2022 04:33:46 GMT (213kb,D)
[v2] Fri, 19 Aug 2022 15:53:24 GMT (344kb,D)
[v3] Thu, 15 Sep 2022 12:28:55 GMT (344kb,D)
