Title: Predicting Sequences of Traversed Nodes in Graphs using Network Models with Multiple Higher Orders

Abstract: We propose a novel sequence prediction method for sequential data capturing node traversals in graphs. Our method builds on a statistical modelling framework that combines multiple higher-order network models into a single multi-order model. We develop a technique to fit such multi-order models to empirical sequential data and to select the optimal maximum order. Our framework facilitates both next-element and full-sequence prediction given a sequence prefix of any length. We evaluate our model on six empirical data sets containing sequences from website navigation as well as public transport systems. The results show that our method outperforms state-of-the-art algorithms for next-element prediction. We further demonstrate the accuracy of our method in out-of-sample sequence prediction and validate that it scales to data sets with millions of sequences.
Comments: 18 pages, 5 figures, 2 tables; changes with v2: updated broken figure and MSNBC data
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Social and Information Networks (cs.SI); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
DOI: 10.1007/s41109-023-00596-x
Cite as: arXiv:2007.06662 [cs.LG]
  (or arXiv:2007.06662v2 [cs.LG] for this version)
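
To illustrate why combining several orders helps next-element prediction, the following is a minimal sketch, not the authors' method. It replaces the paper's statistical multi-order model and its order selection with a plain longest-context back-off over transition counts. The function names `fit_counts` and `predict_next`, the parameter `max_order`, and the toy paths are all hypothetical and chosen only for this example.

```python
from collections import Counter, defaultdict


def fit_counts(sequences, max_order):
    """Count next-node frequencies for every context of length 1..max_order.

    Assumption: a simple count table stands in for the fitted multi-order model.
    """
    counts = defaultdict(Counter)
    for seq in sequences:
        for i in range(1, len(seq)):
            for k in range(1, max_order + 1):
                if i - k < 0:
                    break  # not enough history for a context of length k
                context = tuple(seq[i - k:i])
                counts[context][seq[i]] += 1
    return counts


def predict_next(counts, prefix, max_order):
    """Return the most frequent successor of the longest observed context."""
    for k in range(min(max_order, len(prefix)), 0, -1):
        context = tuple(prefix[-k:])
        if context in counts:
            return counts[context].most_common(1)[0][0]
    return None  # no suffix of the prefix was seen during fitting


# Made-up paths through a graph with nodes a..e (illustrative data only).
paths = [
    ["a", "c", "d"],
    ["a", "c", "d"],
    ["b", "c", "e"],
    ["b", "c", "e"],
    ["b", "c", "e"],
]
counts = fit_counts(paths, max_order=2)
```

On this toy data, `predict_next(counts, ["a", "c"], max_order=2)` returns `"d"`, because every path that reached `c` from `a` continued to `d`. Restricting the same call to `max_order=1` returns `"e"`, the most common successor of `c` overall. The difference shows the kind of path dependence that a first-order network model cannot capture.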

Submission history

From: Christoph Gote
[v1] Mon, 13 Jul 2020 20:08:14 GMT (116kb)
[v2] Wed, 25 Aug 2021 15:08:07 GMT (116kb,D)
