We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SI

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Social and Information Networks

Title: Cross-Domain Entity Resolution in Social Media

Abstract: The challenge of associating entities across multiple domains is a key problem in social media understanding. Successful cross-domain entity resolution provides integration of information from multiple sites to create a complete picture of user and community activities, characteristics, and trends. In this work, we examine the problem of entity resolution across Twitter and Instagram using general techniques. Our methods fall into three categories: profile, content, and graph based. For the profile-based methods, we consider techniques based on approximate string matching. For content-based methods, we perform author identification. Finally, for graph-based methods, we apply novel cross-domain community detection methods and generate neighborhood-based features. The three categories of methods are applied to a large graph of users in Twitter and Instagram to understand challenges, determine performance, and understand fusion of multiple methods. Final results demonstrate an equal error rate less than 1%.
Subjects: Social and Information Networks (cs.SI)
Journal reference: The 4th International Workshop on Natural Language Processing for Social Media, 2016
Cite as: arXiv:1608.01386 [cs.SI]
  (or arXiv:1608.01386v1 [cs.SI] for this version)

Submission history

From: Lin Li [view email]
[v1] Wed, 3 Aug 2016 22:38:53 GMT (270kb,D)

Link back to: arXiv, form interface, contact.