
Title: Variable selection in model-based clustering and discriminant analysis with a regularization approach

Abstract: Several variable selection methods have been proposed for model-based clustering and classification. These methods use backward or forward stepwise procedures to determine the roles of the variables. Unfortunately, stepwise procedures are very slow, which makes these variable selection algorithms impractical for large data sets. In this paper, an alternative regularization approach to variable selection is proposed for model-based clustering and classification. In this approach, the variables are first ranked with a lasso-like procedure, which avoids the slow stepwise algorithms. As a result, the variable selection methodology of Maugis et al. (2009b) can be applied efficiently to high-dimensional data sets.
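The abstract does not spell out the lasso-like ranking step, so the following is only a rough illustration of the general idea, not the authors' procedure. It assumes a supervised setting with known class labels and ranks each variable by the largest L1 penalty at which its soft-thresholded, standardized class means remain nonzero. The names `soft_threshold`, `rank_variables`, and the penalty grid `lambdas` are hypothetical and chosen for this sketch.

```python
import numpy as np

def soft_threshold(x, lam):
    """Elementwise soft-thresholding (proximal operator of the L1 penalty)."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)

def rank_variables(X, labels, lambdas):
    """Rank variables by the largest penalty at which they stay active.

    Assumption for this sketch: a variable is 'active' at penalty lam if at
    least one of its soft-thresholded standardized class means is nonzero.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    # Standardize so a single penalty scale is comparable across variables.
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    classes = np.unique(labels)
    means = np.stack([Z[labels == k].mean(axis=0) for k in classes])  # (K, p)
    last_active = np.zeros(Z.shape[1])
    for lam in sorted(lambdas):
        active = np.any(soft_threshold(means, lam) != 0.0, axis=0)
        last_active[active] = lam
    # Variables that survive larger penalties are ranked first.
    return np.argsort(-last_active, kind="stable")

# Toy data: variable 0 separates the two classes, variables 1 and 2 are noise.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 100)
X = rng.normal(size=(200, 3))
X[labels == 1, 0] += 4.0
ranking = rank_variables(X, labels, np.linspace(0.05, 1.0, 20))
print(ranking)
```

Once such a ranking is available, candidate variables can be examined in a single ordered pass rather than through repeated forward or backward sweeps, which is the efficiency gain the abstract refers to.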
Comments: Submitted to Advances in Data Analysis and Classification
Subjects: Computation (stat.CO)
Cite as: arXiv:1705.00946 [stat.CO]
  (or arXiv:1705.00946v1 [stat.CO] for this version)

Submission history

From: Mohammed Sedki
[v1] Tue, 2 May 2017 12:59:38 GMT (96kb,D)
