Title: The Chi-Square Test of Distance Correlation

Abstract: Distance correlation has gained much recent attention in the data science community: the sample statistic is straightforward to compute and asymptotically equals zero if and only if the underlying random variables are independent, making it an ideal choice for discovering any type of dependency structure given a sufficient sample size. One major bottleneck is the testing process: because the null distribution of distance correlation depends on the underlying random variables and the choice of metric, it typically requires a permutation test to estimate the null distribution and compute the p-value, which is very costly for large amounts of data. To overcome this difficulty, in this paper we propose a chi-square test for distance correlation. Method-wise, the chi-square test is non-parametric, extremely fast, and applicable to bias-corrected distance correlation using any strong negative type metric or characteristic kernel. The test exhibits testing power similar to that of the standard permutation test, and can be utilized for K-sample and partial testing. Theory-wise, we show that the underlying chi-square distribution well approximates and dominates the limiting null distribution in the upper tail, prove that the chi-square test can be valid and universally consistent for testing independence, and establish a testing power inequality with respect to the permutation test.
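The idea in the abstract can be illustrated with a minimal sketch. It is not taken from the paper's code: it assumes univariate samples with the absolute-difference metric, uses the standard U-centered (bias-corrected) estimator, and assumes the p-value is the upper-tail probability of a chi-square distribution with one degree of freedom evaluated at n * dcor + 1. That formula is not stated in the abstract, so treat it as an assumption to check against the paper. The function names `u_centered`, `unbiased_dcor`, and `chi_square_pvalue` are hypothetical.

```python
import math
import random


def u_centered(sample):
    """U-centered pairwise distance matrix of a univariate sample."""
    n = len(sample)
    dist = [[abs(a - b) for b in sample] for a in sample]
    row = [sum(r) for r in dist]  # equals the column sums, by symmetry
    total = sum(row)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:  # the diagonal stays at zero
                out[i][j] = (dist[i][j]
                             - row[i] / (n - 2)
                             - row[j] / (n - 2)
                             + total / ((n - 1) * (n - 2)))
    return out


def inner(a, b):
    """Bias-corrected inner product of two U-centered matrices."""
    n = len(a)
    s = sum(a[i][j] * b[i][j] for i in range(n) for j in range(n))
    return s / (n * (n - 3))


def unbiased_dcor(x, y):
    """Bias-corrected sample distance correlation (can be negative)."""
    if len(x) != len(y) or len(x) < 4:
        raise ValueError("need two samples of equal length, at least 4")
    a, b = u_centered(x), u_centered(y)
    var = inner(a, a) * inner(b, b)
    if var <= 0:
        return 0.0
    return inner(a, b) / math.sqrt(var)


def chi_square_pvalue(x, y):
    """Assumed form of the test: P(chi2_1 > n * dcor + 1), no permutations."""
    stat = len(x) * unbiased_dcor(x, y) + 1.0
    if stat <= 0:
        return 1.0
    # Survival function of chi-square with 1 degree of freedom.
    return math.erfc(math.sqrt(stat / 2.0))


if __name__ == "__main__":
    rng = random.Random(0)
    x = [rng.gauss(0, 1) for _ in range(100)]
    y_dep = [v * v for v in x]                   # nonlinear dependence
    y_ind = [rng.gauss(0, 1) for _ in range(100)]  # independent of x
    print("dependent p-value:  ", chi_square_pvalue(x, y_dep))
    print("independent p-value:", chi_square_pvalue(x, y_ind))
```

The p-value here costs one evaluation of the statistic, whereas a permutation test would recompute the statistic once per permutation. This pure-Python version takes quadratic time and memory in the sample size and is meant only to show the structure of the computation.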
Comments: 21 pages, 4 figures, 1 table
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
Journal reference: Journal of Computational and Graphical Statistics, 31(1):254-262, 2022
DOI: 10.1080/10618600.2021.1938585
Cite as: arXiv:1912.12150 [stat.ML]
  (or arXiv:1912.12150v5 [stat.ML] for this version)

Submission history

From: Cencheng Shen
[v1] Fri, 27 Dec 2019 15:16:40 GMT (144kb,D)
[v2] Tue, 7 Jan 2020 15:08:41 GMT (142kb,D)
[v3] Wed, 22 Jan 2020 21:53:47 GMT (143kb,D)
[v4] Fri, 21 Feb 2020 21:35:39 GMT (143kb,D)
[v5] Fri, 14 May 2021 18:09:51 GMT (154kb,D)
