Title: A More Efficient Chinese Named Entity Recognition base on BERT and Syntactic Analysis

Abstract: We propose a new named entity recognition (NER) method that makes effective use of the results of part-of-speech (POS) tagging, Chinese word segmentation (CWS), and parsing while avoiding NER errors caused by POS tagging errors. The paper first uses the Stanford natural language processing (NLP) tool to annotate large-scale untagged data, reducing the dependence on tagged data. It then designs a new NLP model, g-BERT, which compresses the Bidirectional Encoder Representations from Transformers (BERT) model to reduce the amount of computation. Finally, the model is evaluated on a Chinese NER dataset. The experimental results show that, compared with the BERT model, g-BERT reduces the amount of computation by 60% and improves performance by 2%, reaching a test F1 of 96.5.
Comments: 11 pages, 3 figures, 3 tables
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2101.11423 [cs.CL]
  (or arXiv:2101.11423v1 [cs.CL] for this version)

Submission history

From: Xiao Fu
[v1] Mon, 11 Jan 2021 15:33:39 GMT (945kb)
