Title: Efficient stream-based Max-Min diversification with minimal failure rate

Abstract: The stream-based Max-Min diversification problem concerns selecting a limited number of diverse instances from a data stream. The nature of the problem demands immediate and irrevocable decisions. The set-wise diversity to be maximized is the minimum distance between any pair of selected instances. Standard algorithmic approaches to sequential selection disregard the possibility of selection failures, that is, situations where the last instances of the stream are picked by default to avoid an incomplete selection. Such failures can be catastrophic for the Max-Min diversification objective. In this paper we present the Failure Rate Minimization (FRM) algorithm, which selects a set of disparate instances while significantly reducing the probability of failure. This is achieved by means of both analytical and empirical techniques. FRM is compared with relevant algorithms from the literature through simulations on real datasets, where we demonstrate its efficiency and low time complexity.
Subjects: Data Structures and Algorithms (cs.DS)
Cite as: arXiv:2011.10659 [cs.DS]
  (or arXiv:2011.10659v1 [cs.DS] for this version)
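
The Max-Min objective and the selection failures described in the abstract can be illustrated with a small sketch. This is not the FRM algorithm: it is a hypothetical fixed-threshold baseline, and the names `min_pairwise_distance`, `threshold_select`, `k`, and `threshold` are assumptions introduced here for illustration. It also assumes a finite stream of known length and Euclidean distance, neither of which is stated in the abstract.

```python
import math
from itertools import combinations


def min_pairwise_distance(points):
    """Max-Min diversity of a set: the smallest distance between any two members."""
    if len(points) < 2:
        return math.inf
    return min(math.dist(p, q) for p, q in combinations(points, 2))


def threshold_select(stream, k, threshold):
    """Hypothetical baseline (not FRM): irrevocably accept an instance if it lies
    at least `threshold` away from every instance selected so far. If the stream
    is about to run out, accept the trailing instances by default (a failure)."""
    stream = list(stream)  # assumption: finite stream of known length
    n = len(stream)
    selected = []
    forced = 0  # number of instances picked by default
    for i, x in enumerate(stream):
        slots_left = k - len(selected)
        if slots_left == 0:
            break
        remaining = n - i  # instances not yet decided, including x
        if all(math.dist(x, s) >= threshold for s in selected):
            selected.append(x)  # accepted on merit
        elif remaining <= slots_left:
            selected.append(x)  # forced pick to avoid an incomplete selection
            forced += 1
    return selected, forced


stream = [(0, 0), (0.1, 0), (3, 0), (0.2, 0), (6, 0), (6.1, 0)]

# A moderate threshold fills all k slots on merit.
good, good_forced = threshold_select(stream, k=3, threshold=2.0)

# An overly strict threshold leaves slots empty until the end of the stream,
# so the last instances are taken by default and the diversity collapses.
bad, bad_forced = threshold_select(stream, k=3, threshold=10.0)

print(good, good_forced, min_pairwise_distance(good))
print(bad, bad_forced, min_pairwise_distance(bad))
```

In this toy stream the strict threshold forces the two final, nearly identical points into the selection, which shows why a single default pick can dominate a minimum-distance objective.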

Submission history

From: Mathilde Fekom
[v1] Tue, 17 Nov 2020 14:20:16 GMT (257kb,D)