We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Clustering and Prediction with Variable Dimension Covariates

Abstract: In many applied fields incomplete covariate vectors are commonly encountered. It is well known that this can be problematic when making inference on model parameters, but its impact on prediction performance is less understood. We develop a method based on covariate dependent partition models that seamlessly handles missing covariates while completely avoiding any type of imputation. The method we develop allows in-sample predictions as well as out-of-sample prediction, even if the missing pattern in the new subjects' incomplete covariate vector was not seen in the training data. Any data type, including categorical or continuous covariates are permitted. In simulation studies the proposed method compares favorably. We illustrate the method in two application examples.
Subjects: Methodology (stat.ME)
Cite as: arXiv:1912.13119 [stat.ME]
  (or arXiv:1912.13119v2 [stat.ME] for this version)

Submission history

From: Garritt Page [view email]
[v1] Tue, 31 Dec 2019 00:00:41 GMT (578kb,D)
[v2] Sun, 12 Jul 2020 18:48:38 GMT (831kb,D)

Link back to: arXiv, form interface, contact.