Title: Multiple imputation for multilevel data with continuous and binary variables

Abstract: We present and compare multiple imputation methods for multilevel continuous and binary data where variables are systematically and sporadically missing.
The methods are compared from a theoretical point of view and through an extensive simulation study motivated by a real dataset comprising multiple studies. Simulations are reproducible. The comparisons show why these multiple imputation methods are the most appropriate to handle missing values in a multilevel setting and why their relative performances can vary according to the missing data pattern, the multilevel structure and the type of missing variables.
This study shows that valid inferences can only be obtained if the dataset gathers a large number of clusters. In addition, it highlights that heteroscedastic MI methods provide more accurate inferences than homoscedastic methods, which should be reserved for data with few individuals per cluster. Finally, the method of Quartagno and Carpenter (2016a) appears generally accurate for binary variables, the method of Resche-Rigon and White (2016) appears accurate with large clusters, and the approach of Jolani et al. (2015) appears accurate with small clusters.
Subjects: Methodology (stat.ME)
DOI: 10.1214/18-STS646
Cite as: arXiv:1702.00971 [stat.ME]
  (or arXiv:1702.00971v2 [stat.ME] for this version)

Submission history

From: Vincent Audigier
[v1] Fri, 3 Feb 2017 11:16:36 GMT (530kb,AD)
[v2] Mon, 27 Nov 2017 16:55:13 GMT (613kb,AD)
