Title: CU-UD: text-mining drug and chemical-protein interactions with ensembles of BERT-based models

Abstract: Identifying relations between chemicals and proteins is an important text-mining task. The BioCreative VII Track 1 DrugProt task aims to promote the development and evaluation of systems that automatically detect relations between chemical compounds/drugs and genes/proteins in PubMed abstracts. In this paper, we describe our submission, an ensemble system that includes multiple BERT-based language models. We combine the outputs of the individual models using majority voting and a multilayer perceptron. Our system obtained a precision of 0.7708 and a recall of 0.7770, for an F1 score of 0.7739, demonstrating the effectiveness of ensembles of BERT-based language models for automatically detecting relations between chemicals and proteins. Our code is available at this https URL
Comments: Proceedings of the BioCreative VII Challenge Evaluation Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as: arXiv:2112.03004 [cs.CL]
  (or arXiv:2112.03004v1 [cs.CL] for this version)
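
The abstract mentions combining the outputs of individual models by majority voting and reports an F1 score derived from precision and recall. The following is a minimal sketch of both ideas, not the authors' implementation: the function names `majority_vote` and `f1_score`, the placeholder label strings, and the tie-breaking rule (a tie goes to the label predicted by the earliest-listed model) are all assumptions made for illustration. The multilayer perceptron combiner is not shown.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists into one list by majority vote.

    `predictions` holds one list of labels per model; all lists must
    have the same length (one label per candidate relation).
    Assumption: on a tie, the label seen first (i.e. from the
    earliest-listed model) wins, because Counter.most_common keeps
    first-encountered order among equal counts.
    """
    voted = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        voted.append(counts.most_common(1)[0][0])
    return voted


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical outputs from three models over three candidate pairs.
# The label strings are placeholders, not taken from the paper.
model_outputs = [
    ["INHIBITOR", "ACTIVATOR", "NONE"],
    ["INHIBITOR", "NONE", "NONE"],
    ["ACTIVATOR", "ACTIVATOR", "NONE"],
]
ensemble_labels = majority_vote(model_outputs)

# Precision and recall as reported in the abstract.
reported_f1 = round(f1_score(0.7708, 0.7770), 4)
```

With the precision and recall reported in the abstract, the harmonic mean rounds to the stated F1 score of 0.7739.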

Submission history

From: Yifan Peng
[v1] Thu, 11 Nov 2021 13:55:21 GMT (249kb,D)
