Title: Generalized R-squared for Detecting Non-independence

Abstract: Detecting whether two random variables are non-independent is a fundamental problem in statistics and machine learning. Although the celebrated Pearson correlation is effective for capturing linear dependency, it can be entirely powerless for detecting nonlinear and heteroscedastic patterns. To cope with these shortcomings, we introduce a new statistic, G-squared, as a generalization of the classic R-squared to test whether two univariate random variables are independent and to measure the strength of their relationship. The G-squared is almost identical to the R-squared if the underlying relationship is indeed linear, and it is powerful at capturing nonlinear relationships. Another attractive feature is that the G-squared can be interpreted intuitively as the piecewise R-squared between the two variables. Through an intensive simulation study, we observed that the G-squared is always among the most powerful performers when compared with several state-of-the-art methods.
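
The abstract does not define the G-squared itself, so the following is only a rough illustration of the "piecewise R-squared" intuition, not the authors' estimator. It assumes a fixed number of equal-count pieces along x and an ordinary least-squares line in each piece; the function names `r_squared` and `piecewise_r_squared` and the parameter `n_pieces` are hypothetical. On the noiseless relationship y = x^2 over a symmetric range, the ordinary R-squared is essentially zero, while the piecewise version is close to one. For a truly linear relationship the two agree.

```python
import numpy as np


def r_squared(x, y):
    """Classic R-squared: squared Pearson correlation between x and y."""
    r = np.corrcoef(x, y)[0, 1]
    return r ** 2


def piecewise_r_squared(x, y, n_pieces=4):
    """Illustrative piecewise R-squared (an assumption, not the paper's G-squared).

    Sorts the data by x, splits it into n_pieces chunks of nearly equal
    size, fits a least-squares line in each chunk, and returns
    1 - (pooled residual sum of squares) / (total sum of squares of y).
    """
    order = np.argsort(x)
    x_sorted, y_sorted = x[order], y[order]
    total_ss = np.sum((y_sorted - y_sorted.mean()) ** 2)

    resid_ss = 0.0
    x_chunks = np.array_split(x_sorted, n_pieces)
    y_chunks = np.array_split(y_sorted, n_pieces)
    for xs, ys in zip(x_chunks, y_chunks):
        slope, intercept = np.polyfit(xs, ys, 1)  # degree-1 least-squares fit
        resid_ss += np.sum((ys - (slope * xs + intercept)) ** 2)

    return 1.0 - resid_ss / total_ss


# Deterministic toy data: a symmetric parabola, where linear correlation vanishes.
x = np.linspace(-1.0, 1.0, 201)
y = x ** 2

plain = r_squared(x, y)
piecewise = piecewise_r_squared(x, y, n_pieces=4)
print(f"R-squared: {plain:.4f}, piecewise R-squared: {piecewise:.4f}")
```

A fixed partition is the simplest choice for a sketch. It captures only changes in the conditional mean, so it says nothing about the heteroscedastic patterns the abstract also mentions, and it should not be read as a description of how the paper selects or weights the pieces.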
Subjects: Methodology (stat.ME)
Cite as: arXiv:1604.02736 [stat.ME]
  (or arXiv:1604.02736v1 [stat.ME] for this version)

Submission history

From: Xufei Wang
[v1] Sun, 10 Apr 2016 21:03:53 GMT (1275kb,D)
[v2] Tue, 11 Oct 2016 03:15:08 GMT (686kb,D)
[v3] Fri, 18 Nov 2016 03:07:49 GMT (373kb,D)
