We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.BM

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Quantitative Biology > Biomolecules

Title: Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery

Abstract: Text-based representations of chemicals and proteins can be thought of as unstructured languages codified by humans to describe domain-specific knowledge. Advances in natural language processing (NLP) methodologies in the processing of spoken languages accelerated the application of NLP to elucidate hidden knowledge in textual representations of these biochemical entities and then use it to construct models to predict molecular properties or to design novel molecules. This review outlines the impact made by these advances on drug discovery and aims to further the dialogue between medicinal chemists and computer scientists.
Subjects: Biomolecules (q-bio.BM); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
DOI: 10.1016/j.drudis.2020.01.020
Cite as: arXiv:2002.06053 [q-bio.BM]
  (or arXiv:2002.06053v1 [q-bio.BM] for this version)

Submission history

From: Hakime Öztürk [view email]
[v1] Mon, 10 Feb 2020 21:02:05 GMT (708kb,D)

Link back to: arXiv, form interface, contact.