Title: Learning from Incomplete Features by Simultaneous Training of Neural Networks and Sparse Coding

Abstract: This paper addresses the problem of training a classifier on a dataset with incomplete features. We assume that a different subset of features (random or structured) is available for each data instance. This situation typically arises in applications where not all features are collected for every data sample. A new supervised learning method is developed to train a general classifier, such as a logistic regression or a deep neural network, using only a subset of features per sample, while assuming that the data vectors admit sparse representations on an unknown dictionary. Sufficient conditions are identified under which, if a classifier can be trained on incomplete observations so that their reconstructions are well separated by a hyperplane, the same classifier also correctly separates the original (unobserved) data samples. Extensive simulation results on synthetic and well-known datasets validate our theoretical findings and demonstrate the effectiveness of the proposed method compared to traditional data imputation approaches and one state-of-the-art algorithm.
Comments: 11 pages, 7 figures, paper accepted for presentation at L2ID Workshop at CVPR 2021 (19-25 June, 2021)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2011.14047 [cs.LG]
  (or arXiv:2011.14047v2 [cs.LG] for this version)

Submission history

From: Cesar F. Caiafa
[v1] Sat, 28 Nov 2020 02:20:39 GMT (1522kb,D)
[v2] Sat, 17 Apr 2021 20:09:10 GMT (4386kb,D)
