Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Online Influence Maximization under Independent Cascade Model with Semi-Bandit Feedback
(Submitted on 21 May 2016 (v1), revised 22 May 2017 (this version, v2), latest version 19 Jun 2018 (v3))
Abstract: We study the stochastic online problem of learning to influence in a social network with semi-bandit feedback, where we observe how users influence each other. The problem combines challenges of limited feedback, because the learning agent only observes the influenced portion of the network, and combinatorial number of actions, because the cardinality of the feasible set is exponential in the maximum number of influencers. We propose a computationally efficient UCB-like algorithm, IMLinUCB, and analyze it. Our regret bounds are polynomial in all quantities of interest; reflect the structure of the network and the probabilities of influence. Moreover, they do not depend on inherently large quantities, such as the cardinality of the action set. To the best of our knowledge, these are the first such results. IMLinUCB permits linear generalization and therefore is suitable for large-scale problems. Our experiments show that the regret of IMLinUCB scales as suggested by our upper bounds in several representative graph topologies; and based on linear generalization, IMLinUCB can significantly reduce regret of real-world influence maximization semi-bandits.
Submission history
From: Zheng Wen [view email][v1] Sat, 21 May 2016 06:07:53 GMT (115kb,D)
[v2] Mon, 22 May 2017 23:36:42 GMT (184kb,D)
[v3] Tue, 19 Jun 2018 05:51:52 GMT (185kb,D)
Link back to: arXiv, form interface, contact.