We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: CDPA: Common and Distinctive Pattern Analysis between High-dimensional Datasets

Authors: Hai Shu, Zhe Qu
Abstract: A representative model in integrative analysis of two high-dimensional correlated datasets is to decompose each data matrix into a low-rank common matrix generated by latent factors shared across datasets, a low-rank distinctive matrix corresponding to each dataset, and an additive noise matrix. Existing decomposition methods claim that their common matrices capture the common pattern of the two datasets. However, their so-called common pattern only denotes the common latent factors but ignores the common pattern between the two coefficient matrices of these common latent factors. We propose a new unsupervised learning method, called the common and distinctive pattern analysis (CDPA), which appropriately defines the two types of data patterns by further incorporating the common and distinctive patterns of the coefficient matrices. A consistent estimation approach is developed for high-dimensional settings, and shows reasonably good finite-sample performance in simulations. Our simulation studies and real data analysis corroborate that the proposed CDPA can provide better characterization of common and distinctive patterns and thereby benefit data mining.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference: Electronic Journal of Statistics, 2022, 16 (1), 2475-2517
DOI: 10.1214/22-EJS2008
Cite as: arXiv:1912.09989 [stat.ML]
  (or arXiv:1912.09989v4 [stat.ML] for this version)

Submission history

From: Hai Shu [view email]
[v1] Fri, 20 Dec 2019 18:21:19 GMT (4096kb,D)
[v2] Fri, 20 Mar 2020 07:01:37 GMT (5356kb,D)
[v3] Mon, 17 May 2021 16:44:44 GMT (11227kb,D)
[v4] Tue, 5 Apr 2022 14:37:18 GMT (11174kb,D)

Link back to: arXiv, form interface, contact.