Title: COVID-19: Comparative Analysis of Methods for Identifying Articles Related to Therapeutics and Vaccines without Using Labeled Data

Abstract: We propose an approach for analyzing text classification methods based on the presence or absence of task-specific terms (and their synonyms) in the text. We apply this approach to six transfer-learning and unsupervised methods for screening articles relevant to COVID-19 vaccines and therapeutics. The analysis reveals that although a BERT model trained on search-engine results generally performs well, it misclassifies relevant abstracts that do not contain task-specific terms. We use this insight to create a more effective unsupervised ensemble.
Comments: 6 pages, 3 Tables, Appendix
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as: arXiv:2101.02017 [cs.IR]
  (or arXiv:2101.02017v1 [cs.IR] for this version)
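
The term-based analysis described in the abstract can be illustrated with a minimal Python sketch. It assumes a small labeled evaluation set is available for measuring performance; the term list, the function names, and the accuracy metric are hypothetical choices made for illustration and are not taken from the paper. The sketch splits abstracts by whether they contain a task-specific term and reports a classifier's accuracy on each group separately.

```python
import re
from collections import defaultdict

# Hypothetical task-specific terms; the paper's actual terms and synonyms
# are not listed on this page.
TASK_TERMS = ["vaccine", "vaccination", "therapeutic", "therapy", "treatment", "drug"]

# Match any term as a whole word, optionally followed by a plural "s".
_TERM_RE = re.compile(
    r"\b(?:" + "|".join(map(re.escape, TASK_TERMS)) + r")s?\b",
    re.IGNORECASE,
)


def has_task_term(text):
    """Return True if the text contains at least one task-specific term."""
    return _TERM_RE.search(text) is not None


def stratified_accuracy(texts, labels, predictions):
    """Compute accuracy separately for texts with and without task terms."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for text, y_true, y_pred in zip(texts, labels, predictions):
        group = "with_terms" if has_task_term(text) else "without_terms"
        total[group] += 1
        correct[group] += int(y_true == y_pred)
    # Only groups that actually occur in the data appear in the result.
    return {group: correct[group] / total[group] for group in total}
```

A large gap between the two groups would suggest that a method leans on surface term matching, which is the kind of weakness the abstract reports for the BERT model on relevant abstracts lacking task-specific terms.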

Submission history

From: Mihir Parmar
[v1] Tue, 5 Jan 2021 08:40:04 GMT (205kb)