We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computational Geometry

Title: Social Distancing is Good for Points too!

Abstract: The nearest-neighbor rule is a well-known classification technique that, given a training set P of labeled points, classifies any unlabeled query point with the label of its closest point in P. The nearest-neighbor condensation problem aims to reduce the training set without harming the accuracy of the nearest-neighbor rule.
FCNN is the most popular algorithm for condensation. It is heuristic in nature, and theoretical results for it are scarce. In this paper, we settle the question of whether reasonable upper-bounds can be proven for the size of the subset selected by FCNN. First, we show that the algorithm can behave poorly when points are too close to each other, forcing it to select many more points than necessary. We then successfully modify the algorithm to avoid such cases, thus imposing that selected points should "keep some distance". This modification is sufficient to prove useful upper-bounds, along with approximation guarantees for the algorithm.
Comments: To appear in CCCG 2020
Subjects: Computational Geometry (cs.CG); Machine Learning (cs.LG)
Cite as: arXiv:2006.15650 [cs.CG]
  (or arXiv:2006.15650v1 [cs.CG] for this version)

Submission history

From: Alejandro Flores Velazco [view email]
[v1] Sun, 28 Jun 2020 16:49:59 GMT (6831kb,D)

Link back to: arXiv, form interface, contact.