We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: An Affine-Invariant Bayesian Cluster Process

Abstract: In order to identify clusters of objects with features transformed by unknown affine transformations, we develop a Bayesian cluster process which is invariant with respect to certain linear transformations of the feature space and able to cluster data without knowing the number of clusters in advance. Specifically, our proposed method can identify clusters invariant to orthogonal transformations under model I, invariant to scaling-coordinate orthogonal transformations under model II, or invariant to arbitrary non-singular linear transformations under model III. The proposed split-merge algorithm leads to an irreducible and aperiodic Markov chain, which is also efficient at identifying clusters reasonably well for various applications. We illustrate the applications of our approach to both synthetic and real data such as leukemia gene expression data for model I; wine data and two half-moons benchmark data for model II; three-dimensional Denmark road geographic coordinate system data and an arbitrary non-singular transformed two half-moons data for model III. These examples show that the proposed method could be widely applied in many fields, especially for finding the number of clusters and identifying clusters of samples of interest in aerial photography and genomic data.
Comments: 19 pages, 9 figures
Subjects: Methodology (stat.ME)
Cite as: arXiv:1611.09890 [stat.ME]
  (or arXiv:1611.09890v1 [stat.ME] for this version)

Submission history

From: Hsin-Hsiung Huang [view email]
[v1] Tue, 29 Nov 2016 21:25:54 GMT (3770kb)

Link back to: arXiv, form interface, contact.