Title: Mixture of Conditional Gaussian Graphical Models for unlabelled heterogeneous populations in the presence of co-factors

Authors: Thomas Lartigue (ARAMIS, CMAP), Stanley Durrleman (ARAMIS), Stéphanie Allassonnière (CRC (UMR_S_1138 / U1138))
Abstract: Conditional correlation networks, within Gaussian Graphical Models (GGM), are widely used to describe the direct interactions between the components of a random vector. In the case of an unlabelled heterogeneous population, Expectation Maximisation (EM) algorithms for Mixtures of GGM have been proposed to estimate both each sub-population's graph and the class labels. However, we argue that, with most real data, class affiliation cannot be described with a Mixture of Gaussians, which mostly groups data points according to their geometrical proximity. In particular, there often exist external co-features whose values affect the features' average value, scattering data points that belong to the same sub-population across the feature space. Additionally, if the co-features' effect on the features is heterogeneous, then the estimation of this effect cannot be separated from the sub-population identification. In this article, we propose a Mixture of Conditional GGM (CGGM) that subtracts the heterogeneous effects of the co-features to regroup the data points into clusters corresponding to the sub-populations. We develop a penalised EM algorithm to estimate graph-sparse model parameters. We demonstrate on synthetic and real data how this method fulfils its goal and succeeds in identifying the sub-populations where the Mixtures of GGM are disrupted by the effect of the co-features.
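
As a rough illustration of the idea in the abstract, and not the authors' implementation, the sketch below computes the class-membership probabilities (the E-step of an EM algorithm) for a mixture of conditional Gaussians in which each class has its own linear co-feature effect on the feature mean. The parametrisation E[y | x, k] = x @ betas[k], the names `responsibilities`, `betas` and `precisions`, and the synthetic data are all assumptions made for this example; the paper's own parametrisation and its penalised M-step are not reproduced here.

```python
import numpy as np

def log_gauss(y, mean, precision):
    """Row-wise log-density of N(mean, precision^{-1}) at the rows of y."""
    p = y.shape[1]
    diff = y - mean
    _, logdet = np.linalg.slogdet(precision)
    quad = np.einsum("ni,ij,nj->n", diff, precision, diff)
    return 0.5 * (logdet - quad - p * np.log(2 * np.pi))

def responsibilities(y, x, weights, betas, precisions):
    """E-step: posterior class probabilities given features y and co-features x.

    Assumed (hypothetical) model: y | x, class k ~ N(x @ betas[k], precisions[k]^{-1}).
    """
    log_post = np.stack(
        [np.log(w) + log_gauss(y, x @ b, lam)
         for w, b, lam in zip(weights, betas, precisions)],
        axis=1,
    )
    # Subtract the row-wise maximum before exponentiating for numerical stability.
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    return post / post.sum(axis=1, keepdims=True)

# Synthetic data: two classes whose features respond in opposite directions
# to a single co-feature, so each class is scattered across the feature space.
rng = np.random.default_rng(0)
n = 400
labels = rng.integers(0, 2, size=n)
x = rng.uniform(-3, 3, size=(n, 1))
betas = [np.array([[2.0, 2.0]]), np.array([[-2.0, -2.0]])]
precisions = [np.eye(2), np.eye(2)]
weights = [0.5, 0.5]
means = np.where(labels[:, None] == 0, x @ betas[0], x @ betas[1])
y = means + rng.standard_normal((n, 2))

# Responsibilities evaluated at the true parameters of the synthetic model.
post = responsibilities(y, x, weights, betas, precisions)
accuracy = (post.argmax(axis=1) == labels).mean()
```

In this toy setting the two classes share the same overall feature mean, so grouping by geometrical proximity alone has little to work with, whereas conditioning on the co-feature separates them except where the co-feature is close to zero. In a full EM loop the parameters would be re-estimated from these responsibilities instead of being fixed at their true values.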
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
Journal reference: SN Computer Science, Springer, 2021, 2 (466), ⟨10.1007/s42979-021-00865-5⟩
Cite as: arXiv:2006.11094 [math.ST]
  (or arXiv:2006.11094v4 [math.ST] for this version)

Submission history

From: Thomas Lartigue
[v1] Fri, 19 Jun 2020 11:57:30 GMT (827kb,D)
[v2] Tue, 24 Nov 2020 14:24:29 GMT (1013kb,D)
[v3] Fri, 2 Apr 2021 12:33:49 GMT (1483kb,D)
[v4] Tue, 8 Mar 2022 10:58:55 GMT (1519kb,D)
