We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: CCP: Correlated Clustering and Projection for Dimensionality Reduction

Abstract: Most dimensionality reduction methods employ frequency domain representations obtained from matrix diagonalization and may not be efficient for large datasets with relatively high intrinsic dimensions. To address this challenge, Correlated Clustering and Projection (CCP) offers a novel data domain strategy that does not need to solve any matrix. CCP partitions high-dimensional features into correlated clusters and then projects correlated features in each cluster into a one-dimensional representation based on sample correlations. Residue-Similarity (R-S) scores and indexes, the shape of data in Riemannian manifolds, and algebraic topology-based persistent Laplacian are introduced for visualization and analysis. Proposed methods are validated with benchmark datasets associated with various machine learning algorithms.
Subjects: Machine Learning (stat.ML); Computational Geometry (cs.CG); Machine Learning (cs.LG)
Cite as: arXiv:2206.04189 [stat.ML]
  (or arXiv:2206.04189v1 [stat.ML] for this version)

Submission history

From: Yuta Hozumi [view email]
[v1] Wed, 8 Jun 2022 23:14:44 GMT (11552kb,D)

Link back to: arXiv, form interface, contact.