We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

physics

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Machine Learning

Title: Shared Data and Algorithms for Deep Learning in Fundamental Physics

Abstract: We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level histories. While public datasets from multiple fundamental physics disciplines already exist, the common interface and provided reference models simplify future work on cross-disciplinary machine learning and transfer learning in fundamental physics. We discuss the design and structure and line out how additional datasets can be submitted for inclusion.
As showcase application, we present a simple yet flexible graph-based neural network architecture that can easily be applied to a wide range of supervised learning tasks. We show that our approach reaches performance close to dedicated methods on all datasets. To simplify adaptation for various problems, we provide easy-to-follow instructions on how graph-based representations of data structures, relevant for fundamental physics, can be constructed and provide code implementations for several of them. Implementations are also provided for our proposed method and all reference algorithms.
Comments: 14 pages, 3 figures, 5 tables - Version accepted by Computing and Software for Big Science
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); High Energy Physics - Phenomenology (hep-ph); Nuclear Theory (nucl-th); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
Journal reference: Comput Softw Big Sci 6, 9 (2022)
DOI: 10.1007/s41781-022-00082-6
Cite as: arXiv:2107.00656 [cs.LG]
  (or arXiv:2107.00656v2 [cs.LG] for this version)

Submission history

From: Erik Buhmann [view email]
[v1] Thu, 1 Jul 2021 18:00:00 GMT (1130kb,D)
[v2] Thu, 24 Mar 2022 15:26:04 GMT (853kb,D)

Link back to: arXiv, form interface, contact.