Current browse context:
stat.ME
Change to browse by:
References & Citations
Statistics > Methodology
Title: Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models
(Submitted on 17 May 2018 (v1), revised 13 Nov 2019 (this version, v4), latest version 19 Nov 2020 (v5))
Abstract: High-dimensional logistic regression is widely used in analyzing data with binary outcomes. In this paper, global testing and large-scale multiple testing for the regression coefficients are considered in both single- and two-regression settings. A test statistic for testing the global null hypothesis is constructed using a generalized low-dimensional projection for bias correction and its asymptotic null distribution is derived. A lower bound for the global testing is established, which shows that the proposed test is asymptotically minimax optimal over some sparsity range. For testing the individual coefficients simultaneously, multiple testing procedures are proposed and shown to control the false discovery rate (FDR) and falsely discovered variables (FDV) asymptotically. Simulation studies are carried out to examine the numerical performance of the proposed tests and their superiority over existing methods. The testing procedures are also illustrated by analyzing a data set of a metabolomics study that investigates the association between fecal metabolites and pediatric Crohn's disease and the effects of treatment on such associations.
Submission history
From: Rong Ma [view email][v1] Thu, 17 May 2018 21:11:34 GMT (86kb,D)
[v2] Fri, 24 Aug 2018 19:32:51 GMT (88kb,D)
[v3] Wed, 7 Aug 2019 23:22:57 GMT (134kb,D)
[v4] Wed, 13 Nov 2019 22:05:12 GMT (647kb,D)
[v5] Thu, 19 Nov 2020 19:08:45 GMT (647kb,D)
Link back to: arXiv, form interface, contact.