Title: Informative Bayesian Neural Network Priors for Weak Signals

Abstract: Encoding domain knowledge into the prior over the high-dimensional weight space of a neural network is challenging but essential in applications with limited data and weak signals. Two types of domain knowledge are commonly available in scientific applications: 1. feature sparsity (fraction of features deemed relevant); 2. signal-to-noise ratio, quantified, for instance, as the proportion of variance explained (PVE). We show how to encode both types of domain knowledge into the widely used Gaussian scale mixture priors with Automatic Relevance Determination. Specifically, we propose a new joint prior over the local (i.e., feature-specific) scale parameters that encodes knowledge about feature sparsity, and a Stein gradient optimization to tune the hyperparameters in such a way that the distribution induced on the model's PVE matches the prior distribution. We show empirically that the new prior improves prediction accuracy, compared to existing neural network priors, on several publicly available datasets and in a genetics application where signals are weak and sparse, often outperforming even computationally intensive cross-validation for hyperparameter tuning.
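As an illustration of what "the distribution induced on the model's PVE" means, the following is a minimal sketch under stated assumptions, not the authors' method. It replaces the neural network with a toy linear model, uses half-Cauchy local scales as a stand-in Gaussian scale mixture prior, and estimates the induced PVE by plain Monte Carlo rather than Stein gradient optimization. The function name `sample_prior_pve` and all parameter values are hypothetical.

```python
import numpy as np


def sample_prior_pve(X, global_scale, noise_std=1.0, n_draws=2000, seed=0):
    """Monte Carlo draws of the PVE induced by a Gaussian scale mixture prior.

    Assumed toy model (a linear stand-in, not the paper's network):
        y = X @ w + eps,  eps ~ N(0, noise_std^2)
        w_j ~ N(0, (global_scale * local_j)^2),  local_j ~ half-Cauchy(0, 1)
    PVE is taken as Var(f) / (Var(f) + noise_std^2) with f = X @ w.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    # Feature-specific (local) scales: absolute value of a standard Cauchy.
    local = np.abs(rng.standard_cauchy(size=(n_draws, d)))
    # One weight vector per prior draw.
    w = rng.standard_normal(size=(n_draws, d)) * global_scale * local
    # Noise-free predictions for every draw, shape (n_draws, n).
    f = w @ X.T
    signal_var = f.var(axis=1)
    return signal_var / (signal_var + noise_std ** 2)


# Synthetic inputs, used only for illustration.
X = np.random.default_rng(1).standard_normal((200, 50))
pve_small = sample_prior_pve(X, global_scale=0.01)
pve_large = sample_prior_pve(X, global_scale=1.0)
print("median PVE, global_scale=0.01:", np.median(pve_small))
print("median PVE, global_scale=1.0: ", np.median(pve_large))
```

In this sketch, changing the global scale shifts the prior mass of the PVE, which is the kind of dependence that hyperparameter tuning would exploit to match a target PVE distribution.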
Comments: 25 pages, 8 figures, 4 tables
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
DOI: 10.1214/21-BA1291
Cite as: arXiv:2002.10243 [stat.ML]
  (or arXiv:2002.10243v2 [stat.ML] for this version)

Submission history

From: Tianyu Cui
[v1] Mon, 24 Feb 2020 13:43:44 GMT (811kb,D)
[v2] Thu, 7 Jan 2021 14:55:29 GMT (622kb,D)
