Computer Science > Machine Learning
Title: Improved Training for Self-Training by Confidence Assessments
(Submitted on 30 Sep 2017 (v1), last revised 5 Apr 2018 (this version, v2))
Abstract: It is well known that for some tasks, labeled data sets may be hard to gather. We therefore tackle the problem of insufficient training data. We examine methods for learning from unlabeled data after an initial training on a limited labeled data set. The suggested approach can be used as an online learning method on the unlabeled test set. In the general classification task, whenever we predict a label with high enough confidence, we treat it as a true label and train on the data accordingly. For the semantic segmentation task, a classic example of an expensive data labeling process, we do so pixel-wise. Our suggested approaches were applied to the MNIST data set as a proof of concept for a vision classification task and to the ADE20K data set in order to tackle the semi-supervised semantic segmentation problem.
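The selection step described in the abstract (keep a prediction as a pseudo-label only when its confidence is high enough, per sample for classification and per pixel for segmentation) can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function name `select_confident`, the use of the maximum class probability as the confidence score, and the threshold value 0.9 are all assumptions made for the example.

```python
import numpy as np

def select_confident(probs, threshold=0.9):
    """Pick pseudo-labels whose confidence reaches a threshold.

    probs: array of shape (..., num_classes) holding predicted class
    probabilities. The leading dimensions can be samples (classification)
    or image height and width (pixel-wise segmentation).
    The confidence score (max probability) and the default threshold
    are illustrative assumptions, not taken from the paper.
    """
    probs = np.asarray(probs, dtype=float)
    pseudo_labels = probs.argmax(axis=-1)   # most likely class
    confidence = probs.max(axis=-1)         # its predicted probability
    mask = confidence >= threshold          # True where we trust the label
    return pseudo_labels, mask

# Classification: three unlabeled samples, three classes.
cls_probs = np.array([[0.95, 0.03, 0.02],
                      [0.40, 0.35, 0.25],
                      [0.05, 0.92, 0.03]])
cls_labels, cls_mask = select_confident(cls_probs, threshold=0.9)

# Segmentation: a 2x2 image, two classes, selected pixel by pixel.
seg_probs = np.array([[[0.97, 0.03], [0.55, 0.45]],
                      [[0.08, 0.92], [0.50, 0.50]]])
seg_labels, seg_mask = select_confident(seg_probs, threshold=0.9)
```

Only the entries where the mask is true would then contribute to the training loss, with the corresponding pseudo-labels treated as ground truth; the low-confidence samples or pixels are simply ignored in that update.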
Submission history
From: Gal Hyams
[v1] Sat, 30 Sep 2017 14:47:06 GMT (20kb)
[v2] Thu, 5 Apr 2018 14:42:52 GMT (67kb)