We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computational Geometry

Title: Star Discrepancy Subset Selection: Problem Formulation and Efficient Approaches for Low Dimensions

Abstract: Motivated by applications in instance selection, we introduce the star discrepancy subset selection problem, which consists of finding a subset of m out of n points that minimizes the star discrepancy. First, we show that this problem is NP-hard. Then, we introduce a mixed integer linear formulation (MILP) and a combinatorial branch-and-bound (BB) algorithm for the star discrepancy subset selection problem and we evaluate both approaches against random subset selection and a greedy construction on different use-cases in dimension two and three. Our results show that the MILP and BB are efficient in dimension two for large and small $m/n$ ratio, respectively, and for not too large n. However, the performance of both approaches decays strongly for larger dimensions and set sizes.
As a side effect of our empirical comparisons we obtain point sets of discrepancy values that are much smaller than those of common low-discrepancy sequences, random point sets, and of Latin Hypercube Sampling. This suggests that subset selection could be an interesting approach for generating point sets of small discrepancy value.
Subjects: Computational Geometry (cs.CG); Data Structures and Algorithms (cs.DS); Numerical Analysis (math.NA)
Cite as: arXiv:2101.07881 [cs.CG]
  (or arXiv:2101.07881v2 [cs.CG] for this version)

Submission history

From: Carola Doerr [view email]
[v1] Tue, 19 Jan 2021 22:24:41 GMT (844kb,D)
[v2] Tue, 4 Jan 2022 10:40:17 GMT (463kb,D)

Link back to: arXiv, form interface, contact.