Title: MIDA: Multiple Imputation using Denoising Autoencoders

Abstract: Missing data is a significant problem impacting all domains. The state-of-the-art framework for minimizing missing-data bias is multiple imputation, for which the choice of an imputation model remains nontrivial. We propose a multiple imputation model based on overcomplete deep denoising autoencoders. Our proposed model is capable of handling different data types, missingness patterns, missingness proportions, and distributions. Evaluation on several real-life datasets shows that our proposed model significantly outperforms current state-of-the-art methods under varying conditions while simultaneously improving end-of-the-line analytics.
Comments: To appear in the proceedings of the 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1705.02737 [cs.LG]
  (or arXiv:1705.02737v3 [cs.LG] for this version)

Submission history

From: Lovedeep Gondara
[v1] Mon, 8 May 2017 04:00:25 GMT (282kb,D)
[v2] Fri, 19 May 2017 21:15:44 GMT (144kb,D)
[v3] Sat, 17 Feb 2018 16:05:32 GMT (136kb,D)
