We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Functional L-Optimality Subsampling for Massive Data

Abstract: Massive data bring the big challenges of memory and computation for analysis. These challenges can be tackled by taking subsamples from the full data as a surrogate. For functional data, it is common to collect multiple measurements over their domains, which require even more memory and computation time when the sample size is large. The computation would be much more intensive when statistical inference is required through bootstrap samples. To the best of our knowledge, this article is the first attempt to study the subsampling method for the functional linear model. We propose an optimal subsampling method based on the functional L-optimality criterion. When the response is a discrete or categorical variable, we further extend our proposed functional L-optimality subsampling (FLoS) method to the functional generalized linear model. We establish the asymptotic properties of the estimators by the FLoS method. The finite sample performance of our proposed FLoS method is investigated by extensive simulation studies. The FLoS method is further demonstrated by analyzing two large-scale datasets: the global climate data and the kidney transplant data. The analysis results on these data show that the FLoS method is much better than the uniform subsampling approach and can well approximate the results based on the full data while dramatically reducing the computation time and memory.
Comments: 37 pages and 15 figures
Subjects: Methodology (stat.ME); Computation (stat.CO)
Cite as: arXiv:2104.03446 [stat.ME]
  (or arXiv:2104.03446v2 [stat.ME] for this version)

Submission history

From: Jiguo Cao [view email]
[v1] Thu, 8 Apr 2021 00:41:35 GMT (12458kb,D)
[v2] Tue, 6 Jul 2021 04:20:48 GMT (47253kb,D)

Link back to: arXiv, form interface, contact.