We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DS

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Data Structures and Algorithms

Title: Sliding Window Algorithms for k-Clustering Problems

Abstract: The sliding window model of computation captures scenarios in which data is arriving continuously, but only the latest $w$ elements should be used for analysis. The goal is to design algorithms that update the solution efficiently with each arrival rather than recomputing it from scratch. In this work, we focus on $k$-clustering problems such as $k$-means and $k$-median. In this setting, we provide simple and practical algorithms that offer stronger performance guarantees than previous results. Empirically, we show that our methods store only a small fraction of the data, are orders of magnitude faster, and find solutions with costs only slightly higher than those returned by algorithms with access to the full dataset.
Comments: 43 pages, 7 figures
Subjects: Data Structures and Algorithms (cs.DS)
Journal reference: In Proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020)
Cite as: arXiv:2006.05850 [cs.DS]
  (or arXiv:2006.05850v2 [cs.DS] for this version)

Submission history

From: Alessandro Epasto [view email]
[v1] Wed, 10 Jun 2020 14:26:57 GMT (240kb,D)
[v2] Fri, 23 Oct 2020 14:20:27 GMT (245kb,D)

Link back to: arXiv, form interface, contact.