Title: Distributional data analysis with accelerometer data in a NHANES database with nonparametric survey regression models

Abstract: Accelerometers enable objective measurement of physical activity levels among groups of individuals in free-living environments, providing high-resolution detail about changes in physical activity at different time scales. Current approaches in the literature for analyzing such data typically employ summary measures such as total inactivity time or compositional metrics. At the conceptual level, however, these methods risk discarding important information from the recorded data, because the summaries and metrics typically depend on cut-offs for exercise intensity zones that are chosen subjectively or even arbitrarily. Much of the data collected in these studies follows complex survey designs, so standard statistical tools such as non-parametric regression models cannot be applied directly, and estimation procedures specific to the sampling design are required. For functional data and other complex objects, very little literature exists that handles complex sampling designs in the statistical analysis. The aim of this paper is two-fold. First, we introduce a new functional representation of accelerometer data, distributional in nature, to build a complete individualized profile of each subject's physical activity levels. Second, using the NHANES accelerometer data (2003-2006), we show the potential advantages of this new representation for predicting outcomes of patients over 68 years of age. A critical component of our statistical modeling is that we extend the non-parametric functional models used, the kernel smoother and kernel ridge regression, to account for the effect of the complex sampling design, in order to provide reliable conclusions about the influence of physical activity in the different analyses performed.
Subjects: Methodology (stat.ME); Applications (stat.AP); Other Statistics (stat.OT)
Cite as: arXiv:2104.01165 [stat.ME]
  (or arXiv:2104.01165v1 [stat.ME] for this version)

Submission history

From: Marcos Matabuena
[v1] Fri, 2 Apr 2021 17:30:39 GMT (811kb,D)
[v2] Thu, 20 Jan 2022 18:33:33 GMT (1819kb,D)
