Title: AI-driven Hypergraph Network of Organic Chemistry: Network Statistics and Applications in Reaction Classification

Abstract: The rapid discovery of new reactions and molecules in recent years has been facilitated by advances in high-throughput screening, access to a much more complex chemical design space, and the development of accurate molecular modeling frameworks. A holistic study of the growing chemistry literature is therefore needed, one that focuses on understanding recent trends and extrapolating them into possible future trajectories. To this end, several network-theory-based studies have been reported that use a directed graph representation of chemical reactions. Here, we perform a study based on a hypergraph representation in which the hyperedges represent chemical reactions and the nodes represent the participating molecules. We construct a hypernetwork from a standard reaction dataset and report its statistics, including degree distributions, average path length, assortativity (degree correlations), PageRank centrality, and graph-based clusters (communities). We also compute each statistic for an equivalent directed graph representation of the reactions to draw parallels and highlight differences between the two. To demonstrate the applicability of the hypergraph reaction representation to AI, we generate dense hypergraph embeddings and use them for reaction classification. We conclude that the hypernetwork representation is flexible, preserves reaction context, and uncovers insights that are not apparent in a traditional directed graph representation of chemical reactions.
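
To make the two representations in the abstract concrete, the following is a minimal sketch rather than the authors' implementation. It assumes a toy set of three hypothetical reactions (the labels `r1` to `r3` and molecules `A` to `G` are invented for illustration), and it assumes that the "equivalent directed graph" connects every reactant to every product of a reaction, which the abstract does not specify. The function names are likewise hypothetical.

```python
from collections import defaultdict

# Toy reactions (hypothetical, not taken from the paper's dataset).
# Each reaction maps a set of reactants to a set of products.
reactions = {
    "r1": ({"A", "B"}, {"C"}),
    "r2": ({"C", "D"}, {"E", "F"}),
    "r3": ({"A", "E"}, {"G"}),
}


def hypergraph_degrees(reactions):
    """Hypergraph view: each reaction is one hyperedge over all of its molecules.

    A node's degree is the number of hyperedges (reactions) it takes part in.
    """
    degree = defaultdict(int)
    for reactants, products in reactions.values():
        for molecule in reactants | products:
            degree[molecule] += 1
    return dict(degree)


def directed_graph_edges(reactions):
    """Directed-graph view (assumed construction): reactant -> product pairs.

    The resulting edge set no longer records which edges came from the
    same reaction, so the reaction context is lost.
    """
    edges = set()
    for reactants, products in reactions.values():
        for reactant in reactants:
            for product in products:
                edges.add((reactant, product))
    return edges


def directed_degrees(edges):
    """Out-degree and in-degree of every node in a directed edge set."""
    out_deg, in_deg = defaultdict(int), defaultdict(int)
    for source, target in edges:
        out_deg[source] += 1
        in_deg[target] += 1
    return dict(out_deg), dict(in_deg)


if __name__ == "__main__":
    print(sorted(hypergraph_degrees(reactions).items()))
    edges = directed_graph_edges(reactions)
    print(sorted(edges))
    print(directed_degrees(edges))
```

In this toy example, reaction `r2` stays a single hyperedge over four molecules in the hypergraph view, while the directed view splits it into four pairwise edges. That difference is one way to read the abstract's claim that the hypernetwork preserves reaction context.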
Subjects: Molecular Networks (q-bio.MN); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM); Computation (stat.CO)
DOI: 10.1039/D2RE00309K
Cite as: arXiv:2208.01647 [q-bio.MN]
  (or arXiv:2208.01647v2 [q-bio.MN] for this version)

Submission history

From: Vipul Mann
[v1] Tue, 2 Aug 2022 14:12:03 GMT (10960kb,D)
[v2] Mon, 27 Mar 2023 15:43:43 GMT (14509kb,D)