MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification

Wu, Te-Lin; Singh, Shikhar; Paul, Sayan; Burns, Gully; Peng, Nanyun

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2012

Computer Science > Computation and Language

Title: MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification

Authors: Te-Lin Wu, Shikhar Singh, Sayan Paul, Gully Burns, Nanyun Peng

(Submitted on 16 Dec 2020)

Abstract: We introduce a new dataset, MELINDA, for Multimodal biomEdicaL experImeNt methoD clAssification. The dataset is collected in a fully automated distant supervision manner, where the labels are obtained from an existing curated database, and the actual contents are extracted from papers associated with each of the records in the database. We benchmark various state-of-the-art NLP and computer vision models, including unimodal models which only take either caption texts or images as inputs, and multimodal models. Extensive experiments and analysis show that multimodal models, despite outperforming unimodal ones, still need improvements especially on a less-supervised way of grounding visual concepts with languages, and better transferability to low resource domains. We release our dataset and the benchmarks to facilitate future research in multimodal learning, especially to motivate targeted improvements for applications in scientific domains.

Comments:	In The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), 2021
Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2012.09216 [cs.CL]
	(or arXiv:2012.09216v1 [cs.CL] for this version)

Submission history

From: Te-Lin Wu [view email]
[v1] Wed, 16 Dec 2020 19:11:36 GMT (9226kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.09216

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification

Submission history