We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Non-linear Mediation Analysis with High-dimensional Mediators whose Causal Structure is Unknown

Abstract: With multiple potential mediators on the causal pathway from a treatment to an outcome, we consider the problem of decomposing the effects along multiple possible causal path(s) through each distinct mediator. Under Pearl's path-specific effects framework (Pearl, 2001; Avin et al., 2005), such fine-grained decompositions necessitate stringent assumptions, such as correctly specifying the causal structure among the mediators, and there being no unobserved confounding among the mediators. In contrast, interventional direct and indirect effects for multiple mediators (Vansteelandt and Daniel, 2017) can be identified under much weaker conditions, while providing scientifically relevant causal interpretations. Nonetheless, current estimation approaches require (correctly) specifying a model for the joint mediator distribution, which can be difficult when there is a high-dimensional set of possibly continuous and non-continuous mediators. In this article, we avoid the need to model this distribution, by developing a definition of interventional effects previously suggested by VanderWeele and Tchetgen Tchetgen (2017) for longitudinal mediation. We propose a novel estimation strategy that uses non-parametric estimates of the (counterfactual) mediator distributions. Non-continuous outcomes can be accommodated using non-linear outcome models. Estimation proceeds via Monte Carlo integration. The procedure is illustrated using publicly available genomic data (Huang and Pan, 2016) to assess the causal effect of a microRNA expression on the three-month mortality of brain cancer patients that is potentially mediated by expression values of multiple genes.
Comments: 30 pages, 2 figures, 3 tables
Subjects: Methodology (stat.ME)
DOI: 10.1111/biom.13402
Cite as: arXiv:2001.07147 [stat.ME]
  (or arXiv:2001.07147v2 [stat.ME] for this version)

Submission history

From: Wen Wei Loh [view email]
[v1] Mon, 20 Jan 2020 15:37:49 GMT (4329kb,D)
[v2] Sun, 14 Jun 2020 14:52:26 GMT (302kb,D)

Link back to: arXiv, form interface, contact.