We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

physics.data-an

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Physics > Data Analysis, Statistics and Probability

Title: PAFit: an R Package for the Non-Parametric Estimation of Preferential Attachment and Node Fitness in Temporal Complex Networks

Abstract: Many real-world systems are profitably described as complex networks that grow over time. Preferential attachment and node fitness are two simple growth mechanisms that not only explain certain structural properties commonly observed in real-world systems, but are also tied to a number of applications in modeling and inference. While there are statistical packages for estimating various parametric forms of the preferential attachment function, there is no such package implementing non-parametric estimation procedures. The non-parametric approach to the estimation of the preferential attachment function allows for comparatively finer-grained investigations of the `rich-get-richer' phenomenon that could lead to novel insights in the search to explain certain nonstandard structural properties observed in real-world networks. This paper introduces the R package PAFit, which implements non-parametric procedures for estimating the preferential attachment function and node fitnesses in a growing network, as well as a number of functions for generating complex networks from these two mechanisms. The main computational part of the package is implemented in C++ with OpenMP to ensure scalability to large-scale networks. We first introduce the main functionalities of PAFit through simulated examples, and then use the package to analyze a collaboration network between scientists in the field of complex networks. The results indicate the joint presence of `rich-get-richer' and `fit-get-richer' phenomena in the collaboration network. The estimated attachment function is observed to be near-linear, which we interpret as meaning that the chance an author gets a new collaborator is proportional to their current number of collaborators. Furthermore, the estimated author fitnesses reveal a host of familiar faces from the complex networks community among the field's topmost fittest network scientists.
Comments: Conditionally accepted to Journal of Statistical Software
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph); Computation (stat.CO)
DOI: 10.18637/jss.v092.i03
Cite as: arXiv:1704.06017 [physics.data-an]
  (or arXiv:1704.06017v5 [physics.data-an] for this version)

Submission history

From: Thong The Pham [view email]
[v1] Thu, 20 Apr 2017 05:19:13 GMT (93kb,D)
[v2] Thu, 15 Jun 2017 09:14:52 GMT (157kb,D)
[v3] Fri, 30 Jun 2017 02:14:46 GMT (157kb,D)
[v4] Thu, 26 Apr 2018 15:52:38 GMT (269kb,D)
[v5] Wed, 24 Oct 2018 04:26:11 GMT (243kb,D)

Link back to: arXiv, form interface, contact.