We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: A Normality Test for High-dimensional Data based on a Nearest Neighbor Approach

Authors: Hao Chen, Yin Xia
Abstract: Many statistical methodologies for high-dimensional data assume the population is normal. Although a few multivariate normality tests have been proposed, to the best of our knowledge, none of them can properly control the type I error when the dimension is larger than the number of observations. In this work, we propose a novel nonparametric test that utilizes the nearest neighbor information. The proposed method guarantees the asymptotic type I error control under the high-dimensional setting. Simulation studies verify the empirical size performance of the proposed test when the dimension grows with the sample size and at the same time exhibit a superior power performance of the new test compared with alternative methods. We also illustrate our approach through two popularly used data sets in high-dimensional classification and clustering literatures where deviation from the normality assumption may lead to invalid conclusions.
Subjects: Methodology (stat.ME); Statistics Theory (math.ST)
Cite as: arXiv:1904.05289 [stat.ME]
  (or arXiv:1904.05289v3 [stat.ME] for this version)

Submission history

From: Hao Chen [view email]
[v1] Wed, 10 Apr 2019 16:47:12 GMT (56kb)
[v2] Wed, 8 May 2019 19:59:53 GMT (56kb)
[v3] Sun, 2 May 2021 04:15:39 GMT (871kb,D)

Link back to: arXiv, form interface, contact.