We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

physics

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Physics > Data Analysis, Statistics and Probability

Title: Mixture models and exploratory data analysis in networks

Abstract: Networks are widely used in the biological, physical, and social sciences as a concise mathematical representation of the topology of systems of interacting components. Understanding the structure of these networks is one of the outstanding challenges in the study of complex systems. Here we describe a technique for detecting structural features in large-scale network data which works by dividing the nodes of a network into classes such that the members of each class have similar patterns of connection to other nodes. Some previously studied network features such as community structure and bipartite structure can be regarded as examples of such divisions, but the structures we consider are substantially more general than this. Using the machinery of probabilistic mixture models and the expectation-maximization algorithm, we show that it is possible to detect, without prior knowledge of what we are looking for, a very broad range of types of structure in networks, including types that have not been considered explicitly in the past.
Comments: 8 pages, 4 figures
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Statistical Mechanics (cond-mat.stat-mech); Physics and Society (physics.soc-ph)
Cite as: arXiv:physics/0611158 [physics.data-an]
  (or arXiv:physics/0611158v1 [physics.data-an] for this version)

Submission history

From: Mark Newman [view email]
[v1] Wed, 15 Nov 2006 21:44:13 GMT (140kb)
[v2] Thu, 30 Nov 2006 17:29:49 GMT (136kb)
[v3] Thu, 31 May 2007 21:18:00 GMT (354kb)

Link back to: arXiv, form interface, contact.