Title: Unifying supervised learning and VAEs -- automating statistical inference in (astro-)particle physics with amortized conditional normalizing flows

Abstract: A KL-divergence objective on the joint distribution of data and labels unifies supervised learning and variational autoencoders (VAEs) under one umbrella of stochastic variational inference. The unification motivates an extended supervised scheme that allows one to calculate a goodness-of-fit p-value for the neural network model. Conditional normalizing flows amortized with a neural network are crucial in this construction. We discuss how they allow one to rigorously define coverage for posteriors defined jointly on a product space, e.g. $\mathbb{R}^n \times \mathcal{S}^m$, which encompasses posteriors over directions. Finally, systematic uncertainties are naturally included in the variational viewpoint. In classical likelihood approaches or other machine learning models, the ingredients of (1) systematics, (2) coverage and (3) goodness-of-fit are typically not all available, or at least one of them is strongly constrained. In contrast, the proposed extended supervised training with amortized normalizing flows accommodates all three of them for variational inference of arbitrary statistical distributions defined on product spaces like $\mathbb{R}^n \times \ldots \times \mathcal{S}^m$, with no fundamental barrier in terms of complexity of the underlying data. It therefore has great potential for the statistical toolbox of the contemporary (astro-)particle physicist.
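
As a brief illustration of the first claim, the following sketch shows why a joint KL-divergence objective reduces to the usual supervised loss. The notation is an assumption made here for illustration and is not taken from the paper: $p(x, y)$ denotes the true joint distribution of data $x$ and labels $y$, and $q_\phi(y \mid x)$ denotes the network-parametrized conditional density (for example a conditional normalizing flow).

$$
\begin{aligned}
D_{\mathrm{KL}}\big(p(x, y) \,\|\, q_\phi(y \mid x)\, p(x)\big)
&= \mathbb{E}_{p(x, y)}\left[\ln \frac{p(y \mid x)\, p(x)}{q_\phi(y \mid x)\, p(x)}\right] \\
&= \mathbb{E}_{p(x, y)}\big[-\ln q_\phi(y \mid x)\big] - H(y \mid x),
\end{aligned}
$$

where $H(y \mid x) = \mathbb{E}_{p(x, y)}[-\ln p(y \mid x)]$ is the conditional entropy of the true distribution. Since $H(y \mid x)$ does not depend on $\phi$, minimizing this divergence over $\phi$ is equivalent to minimizing the expected negative log-likelihood of the labels, which is the standard supervised training objective.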
Subjects: Machine Learning (cs.LG); High Energy Astrophysical Phenomena (astro-ph.HE); Instrumentation and Methods for Astrophysics (astro-ph.IM); High Energy Physics - Experiment (hep-ex); Machine Learning (stat.ML)
Cite as: arXiv:2008.05825 [cs.LG]
  (or arXiv:2008.05825v3 [cs.LG] for this version)

Submission history

From: Thorsten Glüsenkamp
[v1] Thu, 13 Aug 2020 11:28:57 GMT (1473kb,D)
[v2] Fri, 28 Aug 2020 18:27:11 GMT (1463kb,D)
[v3] Sun, 14 Aug 2022 20:47:10 GMT (1916kb,D)
[v4] Tue, 3 Oct 2023 17:00:23 GMT (1938kb,D)
[v5] Sun, 14 Jan 2024 14:45:36 GMT (3384kb,D)