Computer Science > Machine Learning
Title: Cautious Active Clustering
(Submitted on 3 Aug 2020 (v1), last revised 8 Dec 2020 (this version, v2))
Abstract: We consider the problem of classification of points sampled from an unknown probability measure on a Euclidean space. We study the question of querying the class label at a very small number of judiciously chosen points so as to be able to attach the appropriate class label to every point in the set. Our approach is to consider the unknown probability measure as a convex combination of the conditional probabilities for each class. Our technique involves the use of a highly localized kernel constructed from Hermite polynomials, in order to create a hierarchical estimate of the supports of the constituent probability measures. We do not need to make any assumptions on the nature of any of the probability measures nor know in advance the number of classes involved. We give theoretical guarantees measured by the $F$-score for our classification scheme. Examples include classification in hyper-spectral images and MNIST classification.
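As a rough sketch of the setup described in the abstract, the unknown measure can be written as a mixture of class-conditional measures. The symbols $\mu$, $\mu_k$, $\pi_k$, and $K$ are assumed notation for illustration and are not taken from the paper; the $F$-score is shown in its standard form in terms of precision $P$ and recall $R$, which may differ in detail from the variant the paper uses.

```latex
% Assumed notation: \mu is the unknown sampling measure, \mu_k the
% conditional measure of class k, \pi_k its mixing weight, and K the
% (unknown) number of classes.
\[
  \mu \;=\; \sum_{k=1}^{K} \pi_k \, \mu_k ,
  \qquad \pi_k \ge 0 , \qquad \sum_{k=1}^{K} \pi_k = 1 .
\]
% Standard F-score (harmonic mean of precision P and recall R).
\[
  F \;=\; \frac{2\,P\,R}{P + R} .
\]
```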
Submission history
From: Alexander Cloninger
[v1] Mon, 3 Aug 2020 23:47:31 GMT (10046kb,D)
[v2] Tue, 8 Dec 2020 03:41:49 GMT (12360kb,D)