Title: A Causal Direction Test for Heterogeneous Populations

Abstract: A probabilistic expert system emulates the decision-making ability of a human expert through a directional graphical model. The first step in building such a system is to understand the data generation mechanism. To this end, one may try to decompose a multivariate distribution into a product of several conditionals, evolving black-box machine learning predictive models towards transparent cause-and-effect discovery. Most causal models assume a single homogeneous population, an assumption that may fail to hold in many applications. We show that when the homogeneity assumption is violated, causal models developed under that assumption can fail to identify the correct causal direction. We propose an adjustment to a commonly used causal direction test statistic that uses a $k$-means type clustering algorithm in which both the labels and the number of components are estimated from the collected data. Our simulation results show that the proposed adjustment significantly improves the performance of the causal direction test statistic for heterogeneous data. We study the large-sample behaviour of our proposed test statistic and demonstrate the application of the proposed method using real data.
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Computation (stat.CO)
MSC classes: 62D20, 62H30
Cite as: arXiv:2006.04877 [stat.ME]
  (or arXiv:2006.04877v2 [stat.ME] for this version)

Submission history

From: Vahid Partovi Nia
[v1] Mon, 8 Jun 2020 18:59:14 GMT (148kb,D)
[v2] Mon, 27 Sep 2021 20:51:37 GMT (1751kb,D)