Title: Detecting confounding in multivariate linear models via spectral analysis

Abstract: We study a model in which a target variable Y is correlated with a vector X := (X_1, ..., X_d) of predictor variables that are potential causes of Y. We describe a method that infers to what extent the statistical dependences between X and Y are due to the influence of X on Y and to what extent they are due to a hidden common cause (confounder) of X and Y. The method relies on concentration-of-measure results for large dimension d and on an independence assumption stating that, in the absence of confounding, the vector of regression coefficients describing the influence of each X_j on Y typically has a 'generic orientation' relative to the eigenspaces of the covariance matrix of X. For the special case of a scalar confounder, we show that confounding typically spoils this generic orientation in a characteristic way that can be used to quantitatively estimate the amount of confounding.
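
To make the phrase "orientation relative to the eigenspaces of the covariance matrix of X" concrete, the following is a minimal sketch of how one might inspect that orientation numerically. It is not the estimator from the paper: it only computes the ordinary least squares coefficient vector and the squared projections of its normalized version onto the eigenvectors of the empirical covariance matrix. The function name `spectral_weights`, the simulated data, and all parameter values are hypothetical and chosen for illustration only.

```python
import numpy as np


def spectral_weights(X, y):
    """Return the eigenvalues of cov(X) and the squared projections of the
    normalized OLS coefficient vector onto the corresponding eigenvectors.

    Illustrative helper (hypothetical name), not the paper's method.
    """
    Xc = X - X.mean(axis=0)            # center the predictors
    yc = y - y.mean()                  # center the target
    cov = Xc.T @ Xc / len(Xc)          # empirical covariance matrix of X
    a_hat, *_ = np.linalg.lstsq(Xc, yc, rcond=None)  # OLS regression vector
    eigvals, eigvecs = np.linalg.eigh(cov)           # ascending eigenvalues
    unit = a_hat / np.linalg.norm(a_hat)             # direction of a_hat
    weights = (eigvecs.T @ unit) ** 2                # weight per eigenvector
    return eigvals, weights


# Simulated unconfounded data (assumed setup, for illustration only).
rng = np.random.default_rng(0)
n, d = 5000, 10
mixing = rng.normal(size=(d, d))
X = rng.normal(size=(n, d)) @ mixing   # correlated predictors
a = rng.normal(size=d)                 # true coefficient vector
y = X @ a + rng.normal(size=n)         # Y depends on X plus independent noise

eigvals, weights = spectral_weights(X, y)
for lam, w in zip(eigvals, weights):
    print(f"eigenvalue {lam:10.4f}   weight {w:.4f}")
```

The weights are non-negative and sum to one, so they can be read as a distribution of the regression vector over the eigenvectors of the covariance matrix. How this distribution behaves with and without confounding is the subject of the paper itself; the sketch only shows how the quantity could be computed.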
Comments: 27 pages, 16 figures
Subjects: Machine Learning (stat.ML)
Journal reference: Journal of Causal Inference, 2017
Cite as: arXiv:1704.01430 [stat.ML]
  (or arXiv:1704.01430v1 [stat.ML] for this version)

Submission history

From: Dominik Janzing
[v1] Wed, 5 Apr 2017 13:54:29 GMT (153kb,D)
