Statistics > Methodology
Title: Meta-analysis of heterogeneous data: integrative sparse regression in high-dimensions
(Submitted on 26 Dec 2019 (v1), last revised 30 Jun 2022 (this version, v2))
Abstract: We consider the task of meta-analysis in high-dimensional settings in which the data sources are similar but non-identical. To borrow strength across such heterogeneous datasets, we introduce a global parameter that emphasizes interpretability and statistical efficiency in the presence of heterogeneity. We also propose a one-shot estimator of the global parameter that preserves the anonymity of the data sources and converges at a rate that depends on the size of the combined dataset. For high-dimensional linear model settings, we demonstrate the superiority of our identification restrictions in adapting to a previously seen data distribution as well as predicting for a new/unseen data distribution. Finally, we demonstrate the benefits of our approach on a large-scale drug treatment dataset involving several different cancer cell-lines.
Submission history
From: Subha Maity
[v1] Thu, 26 Dec 2019 20:30:57 GMT (3188kb,D)
[v2] Thu, 30 Jun 2022 15:21:24 GMT (5092kb,D)