We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Orthogonal symmetric non-negative matrix factorization under the stochastic block model

Abstract: We present a method based on the orthogonal symmetric non-negative matrix tri-factorization of the normalized Laplacian matrix for community detection in complex networks. While the exact factorization of a given order may not exist and is NP hard to compute, we obtain an approximate factorization by solving an optimization problem. We establish the connection of the factors obtained through the factorization to a non-negative basis of an invariant subspace of the estimated matrix, drawing parallel with the spectral clustering. Using such factorization for clustering in networks is motivated by analyzing a block-diagonal Laplacian matrix with the blocks representing the connected components of a graph. The method is shown to be consistent for community detection in graphs generated from the stochastic block model and the degree corrected stochastic block model. Simulation results and real data analysis show the effectiveness of these methods under a wide variety of situations, including sparse and highly heterogeneous graphs where the usual spectral clustering is known to fail. Our method also performs better than the state of the art in popular benchmark network datasets, e.g., the political web blogs and the karate club data.
Comments: 35 pages, 3 figures
Subjects: Machine Learning (stat.ML)
MSC classes: 62F12, 62H30, 90B15, 15A23
Cite as: arXiv:1605.05349 [stat.ML]
  (or arXiv:1605.05349v1 [stat.ML] for this version)

Submission history

From: Subhadeep Paul [view email]
[v1] Tue, 17 May 2016 20:22:12 GMT (350kb,D)

Link back to: arXiv, form interface, contact.