We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: Co-training for Extraction of Adverse Drug Reaction Mentions from Tweets

Abstract: Adverse drug reactions (ADRs) are one of the leading causes of mortality in health care. Current ADR surveillance systems are often associated with a substantial time lag before such events are officially published. On the other hand, online social media such as Twitter contain information about ADR events in real-time, much before any official reporting. Current state-of-the-art methods in ADR mention extraction use Recurrent Neural Networks (RNN), which typically need large labeled corpora. Towards this end, we propose a semi-supervised method based on co-training which can exploit a large pool of unlabeled tweets to augment the limited supervised training data, and as a result enhance the performance. Experiments with 0.1M tweets show that the proposed approach outperforms the state-of-the-art methods for the ADR mention extraction task by 5% in terms of F1 score.
Comments: Accepted at ECIR18 as short paper (6 pages)
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as: arXiv:1802.05121 [cs.IR]
  (or arXiv:1802.05121v1 [cs.IR] for this version)

Submission history

From: Shashank Gupta [view email]
[v1] Wed, 14 Feb 2018 14:47:56 GMT (28kb)

Link back to: arXiv, form interface, contact.