Title: A Hybrid Approach To Hierarchical Density-based Cluster Selection

Abstract: HDBSCAN is a density-based clustering algorithm that constructs a cluster hierarchy tree and then uses a specific stability measure to extract flat clusters from the tree. We show how the application of an additional threshold value can result in a combination of DBSCAN* and HDBSCAN clusters, and demonstrate potential benefits of this hybrid approach when clustering data of variable densities. In particular, our approach is useful in scenarios where we require a low minimum cluster size but want to avoid an abundance of micro-clusters in high-density regions. The method can directly be applied to HDBSCAN's tree of cluster candidates and does not require any modifications to the hierarchy itself. It can easily be integrated as an addition to existing HDBSCAN implementations.
Comments: 6 pages. Conference: 2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI)
Subjects: Databases (cs.DB)
Journal reference: 2020 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI), Karlsruhe, Germany, 2020, pp. 223-228
DOI: 10.1109/MFI49285.2020.9235263
Cite as: arXiv:1911.02282 [cs.DB]
  (or arXiv:1911.02282v4 [cs.DB] for this version)
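
Illustrative sketch (not taken from the paper's text): the abstract says the threshold is applied directly to HDBSCAN's tree of cluster candidates, without changing the hierarchy. One way to picture this is shown below. The abstract does not spell out the selection rule, so the sketch assumes that the threshold is a distance (eps_hat) compared against the distance level at which each candidate splits off from its parent, and that any selected candidate born below the threshold is replaced by its nearest ancestor born at or above it. The names Candidate, birth_eps, apply_threshold and TOY_TREE are hypothetical, and the toy tree and the starting selection are made up for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Set


@dataclass(frozen=True)
class Candidate:
    """One node of a toy cluster-candidate tree (hypothetical structure)."""
    parent: Optional[int]  # id of the parent candidate, None for the root
    birth_eps: float       # assumed: distance level at which it splits off from its parent


def apply_threshold(tree: Dict[int, Candidate],
                    selected: Set[int],
                    eps_hat: float) -> Set[int]:
    """Replace selected candidates born below eps_hat by a coarser ancestor.

    Assumes birth_eps never increases from parent to child and that
    siblings share the same birth_eps, so the result stays a flat selection.
    The root is never selected in this sketch.
    """
    result: Set[int] = set()
    for cid in selected:
        node = cid
        while tree[node].birth_eps < eps_hat:
            parent = tree[node].parent
            # Stop below the root: the root itself is not a valid cluster here.
            if parent is None or tree[parent].parent is None:
                break
            node = parent
        result.add(node)
    return result


# Made-up hierarchy: candidate 1 lies in a sparse region, candidate 2 is a
# dense region that keeps splitting into smaller candidates (3, 4, 5, 6).
TOY_TREE: Dict[int, Candidate] = {
    0: Candidate(parent=None, birth_eps=float("inf")),
    1: Candidate(parent=0, birth_eps=5.0),
    2: Candidate(parent=0, birth_eps=5.0),
    3: Candidate(parent=2, birth_eps=0.8),
    4: Candidate(parent=2, birth_eps=0.8),
    5: Candidate(parent=4, birth_eps=0.3),
    6: Candidate(parent=4, birth_eps=0.3),
}

# Assumed output of the usual stability-based selection on this tree.
STABILITY_SELECTION: Set[int] = {1, 3, 5, 6}
```

With eps_hat = 0.0 the selection is left unchanged, which corresponds to plain HDBSCAN output. With eps_hat = 0.5 the two smallest candidates (5 and 6) collapse into candidate 4, and with eps_hat = 1.0 the whole dense region collapses into candidate 2 while the sparse candidate 1 is kept as selected.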

Submission history

From: Claudia Malzer
[v1] Wed, 6 Nov 2019 09:59:56 GMT (1877kb,D)
[v2] Sun, 8 Dec 2019 09:47:23 GMT (2065kb,D)
[v3] Thu, 10 Dec 2020 09:25:13 GMT (3100kb,D)
[v4] Thu, 21 Jan 2021 13:39:38 GMT (1568kb,D)
