Title: Does constituency analysis enhance domain-specific pre-trained BERT models for relation extraction?

Abstract: Relation extraction has recently been the subject of many studies. The DrugProt track at BioCreative VII provides a manually annotated corpus for developing and evaluating relation extraction systems that target interactions between chemicals and genes. We describe the ensemble system used for our submission, which combines the predictions of fine-tuned bioBERT, sciBERT and const-bioBERT models by majority voting. We specifically tested the contribution of syntactic information to relation extraction with BERT. We observed that adding constituent-based syntactic information to BERT improved precision but decreased recall, since relations rarely seen in the training set were less likely to be predicted by the BERT models infused with syntactic information. Our code is available online [this https URL].
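
The majority-voting step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `majority_vote`, the `fallback` label, the rule that a label must be chosen by more than half of the models, and the example labels are all assumptions made for illustration. The abstract does not say how ties or disagreements were resolved.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[str]], fallback: str = "NONE") -> List[str]:
    """Combine per-model label sequences by strict majority vote.

    predictions[m][i] is the label that model m assigns to candidate pair i.
    A label is kept only if more than half of the models agree on it;
    otherwise the (hypothetical) fallback label is returned for that pair.
    """
    n_models = len(predictions)
    combined = []
    # zip(*predictions) groups the labels that all models gave to the same pair.
    for labels in zip(*predictions):
        label, count = Counter(labels).most_common(1)[0]
        combined.append(label if count > n_models / 2 else fallback)
    return combined


# Illustrative placeholder outputs for three models on three candidate pairs.
bio_preds = ["INHIBITOR", "NONE", "ACTIVATOR"]
sci_preds = ["INHIBITOR", "SUBSTRATE", "NONE"]
const_preds = ["NONE", "SUBSTRATE", "INHIBITOR"]

ensemble = majority_vote([bio_preds, sci_preds, const_preds])
print(ensemble)
```

With three models, a label needs at least two votes to be kept. In this sketch, a pair on which all three models disagree falls back to the no-relation label, which is one possible design choice among several.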
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Journal reference: BioCreative VII Challenge Evaluation Workshop, Nov 2021, on-line, Spain
Cite as: arXiv:2112.02955 [cs.CL]
  (or arXiv:2112.02955v1 [cs.CL] for this version)

Submission history

From: Claire Nedellec
[v1] Thu, 25 Nov 2021 10:27:10 GMT (214kb)