Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Clustering with feature selection using alternating minimization, Application to computational biology
(Submitted on 8 Nov 2017 (v1), last revised 24 May 2019 (this version, v4))
Abstract: This paper deals with unsupervised clustering with feature selection. The problem is to estimate both labels and a sparse projection matrix of weights. To address this combinatorial non-convex problem maintaining a strict control on the sparsity of the matrix of weights, we propose an alternating minimization of the Frobenius norm criterion. We provide a new efficient algorithm named K-sparse which alternates k-means with projection-gradient minimization. The projection-gradient step is a method of splitting type, with exact projection on the $\ell^1$ ball to promote sparsity. The convergence of the gradient-projection step is addressed, and a preliminary analysis of the alternating minimization is made. The Frobenius norm criterion converges as the number of iterates in Algorithm K-sparse goes to infinity. Experiments on Single Cell RNA sequencing datasets show that our method significantly improves the results of PCA k-means, spectral clustering, SIMLR, and Sparcl methods, and achieves a relevant selection of genes. The complexity of K-sparse is linear in the number of samples (cells), so that the method scales up to large datasets.
Submission history
From: Michel Barlaud [view email][v1] Wed, 8 Nov 2017 14:42:55 GMT (4586kb,D)
[v2] Tue, 5 Dec 2017 09:45:42 GMT (4678kb,D)
[v3] Mon, 29 Oct 2018 14:29:53 GMT (3556kb,D)
[v4] Fri, 24 May 2019 12:04:34 GMT (3556kb,D)
Link back to: arXiv, form interface, contact.