Wikipedia Vandal Early Detection: from User Behavior to User Embedding

Yuan, Shuhan; Zheng, Panpan; Wu, Xintao; Xiang, Yang

(Submitted on 3 Jun 2017)

Abstract: Wikipedia is the largest online encyclopedia that allows anyone to edit articles. In this paper, we propose using deep learning to detect vandals based on their edit history. In particular, we develop a multi-source long short-term memory network (M-LSTM) to model user behavior, taking a variety of user edit aspects as inputs, including the history of edit reversion information, edit page titles, and categories. With M-LSTM, we can encode each user into a low-dimensional real vector, called a user embedding. As a sequential model, M-LSTM updates the user embedding each time the user commits a new edit. Thus, we can dynamically predict whether a user is benign or a vandal based on the up-to-date user embedding. Furthermore, these user embeddings are crucial for discovering collaborative vandals.
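
The per-edit update described in the abstract can be illustrated with a minimal sketch. This is not the authors' M-LSTM: it assumes the sources (reversion flag, title vector, category vector) are simply concatenated into one input, uses untrained random weights, and all names (`TinyLSTM`, `score_user`, the feature keys) are hypothetical. It only shows how a recurrent hidden state can serve as a user embedding that is refreshed and re-scored after every edit.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_matrix(rows, cols, rng):
    # Small random weights; a real model would learn these from labeled users.
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]


class TinyLSTM:
    """A single LSTM cell with untrained weights (illustration only)."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        joint = input_size + hidden_size
        # One weight matrix per gate: input, forget, output, candidate.
        self.w = {g: make_matrix(hidden_size, joint, rng) for g in "ifog"}
        self.w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]

    def step(self, x, h, c):
        z = x + h  # concatenate current input with previous hidden state
        i = [sigmoid(a) for a in matvec(self.w["i"], z)]
        f = [sigmoid(a) for a in matvec(self.w["f"], z)]
        o = [sigmoid(a) for a in matvec(self.w["o"], z)]
        g = [math.tanh(a) for a in matvec(self.w["g"], z)]
        c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
        return h, c

    def vandal_score(self, h):
        # Logistic read-out on the current user embedding.
        return sigmoid(sum(w * x for w, x in zip(self.w_out, h)))


def score_user(model, edits):
    """Return the final embedding and the score after each edit."""
    h = [0.0] * model.hidden_size
    c = [0.0] * model.hidden_size
    scores = []
    for edit in edits:
        # Assumption: the sources are fused by plain concatenation.
        x = edit["reverted"] + edit["title"] + edit["category"]
        h, c = model.step(x, h, c)
        scores.append(model.vandal_score(h))
    return h, scores


# Made-up feature vectors for three consecutive edits by one user.
edits = [
    {"reverted": [1.0], "title": [0.2, -0.1], "category": [0.5, 0.0]},
    {"reverted": [0.0], "title": [0.4, 0.3], "category": [0.1, 0.9]},
    {"reverted": [1.0], "title": [-0.3, 0.8], "category": [0.0, 0.2]},
]
model = TinyLSTM(input_size=5, hidden_size=4)
embedding, scores = score_user(model, edits)
print(len(embedding), [round(s, 3) for s in scores])
```

Because a score is available after every edit, a detector built this way could flag a user as soon as the score crosses a chosen threshold instead of waiting for the full edit history, which is the sense of "early detection" in the title.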

Comments:	14 pages, 3 figures
Subjects:	Cryptography and Security (cs.CR); Computation and Language (cs.CL); Computers and Society (cs.CY)
Cite as:	arXiv:1706.00887 [cs.CR]
	(or arXiv:1706.00887v1 [cs.CR] for this version)

Submission history

From: Xintao Wu
[v1] Sat, 3 Jun 2017 02:42:40 GMT
