We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Decentralized Online Learning: Take Benefits from Others' Data without Sharing Your Own to Track Global Trend

Abstract: Decentralized Online Learning (online learning in decentralized networks) attracts more and more attention, since it is believed that Decentralized Online Learning can help the data providers cooperatively better solve their online problems without sharing their private data to a third party or other providers. Typically, the cooperation is achieved by letting the data providers exchange their models between neighbors, e.g., recommendation model. However, the best regret bound for a decentralized online learning algorithm is $\Ocal{n\sqrt{T}}$, where $n$ is the number of nodes (or users) and $T$ is the number of iterations. This is clearly insignificant since this bound can be achieved \emph{without} any communication in the networks. This reminds us to ask a fundamental question: \emph{Can people really get benefit from the decentralized online learning by exchanging information?} In this paper, we studied when and why the communication can help the decentralized online learning to reduce the regret. Specifically, each loss function is characterized by two components: the adversarial component and the stochastic component. Under this characterization, we show that decentralized online gradient (DOG) enjoys a regret bound $\Ocal{n\sqrt{T}G + \sqrt{nT}\sigma}$, where $G$ measures the magnitude of the adversarial component in the private data (or equivalently the local loss function) and $\sigma$ measures the randomness within the private data. This regret suggests that people can get benefits from the randomness in the private data by exchanging private information. Another important contribution of this paper is to consider the dynamic regret -- a more practical regret to track users' interest dynamics. Empirical studies are also conducted to validate our analysis.
Comments: Second version: revise Assumption 1 (there is a typo in the first version); add experiments (see Figure 2); revise Algorithm 1 in a more clear way
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1901.10593 [cs.LG]
  (or arXiv:1901.10593v4 [cs.LG] for this version)

Submission history

From: Yawei Zhao [view email]
[v1] Tue, 29 Jan 2019 22:29:27 GMT (251kb,D)
[v2] Fri, 15 Mar 2019 21:47:01 GMT (283kb,D)
[v3] Thu, 28 Mar 2019 00:37:37 GMT (284kb,D)
[v4] Tue, 28 May 2019 19:54:56 GMT (367kb,D)

Link back to: arXiv, form interface, contact.