We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SI

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Social and Information Networks

Title: Network Sampling Based on NN Representatives

Abstract: The amount of large-scale real data around us increase in size very quickly and so does the necessity to reduce its size by obtaining a representative sample. Such sample allows us to use a great variety of analytical methods, whose direct application on original data would be infeasible. There are many methods used for different purposes and with different results. In this paper we outline a simple and straightforward approach based on analyzing the nearest neighbors (NN) that is generally applicable. This feature is illustrated on experiments with weighted networks and vector data. The properties of the representative sample show that the presented approach maintains very well internal data structures (e.g. clusters and density). Key technical parameters of the approach is low complexity and high scalability. This allows the application of this approach to the area of big data.
Subjects: Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
Cite as: arXiv:1402.1661 [cs.SI]
  (or arXiv:1402.1661v1 [cs.SI] for this version)

Submission history

From: Jan Platoš [view email]
[v1] Fri, 7 Feb 2014 15:09:18 GMT (4511kb)

Link back to: arXiv, form interface, contact.