On Random Subset Generalization Error Bounds and the Stochastic Gradient Langevin Dynamics Algorithm

Rodríguez-Gálvez, Borja; Bassi, Germán; Thobaben, Ragnar; Skoglund, Mikael

doi:10.1109/ITW46852.2021.9457578

Full-text links:

Download:

Current browse context:

cs.IT

< prev | next >

new | recent | 2010

Computer Science > Information Theory

Title: On Random Subset Generalization Error Bounds and the Stochastic Gradient Langevin Dynamics Algorithm

Authors: Borja Rodríguez-Gálvez, Germán Bassi, Ragnar Thobaben, Mikael Skoglund

(Submitted on 21 Oct 2020 (v1), last revised 16 Jan 2021 (this version, v2))

Abstract: In this work, we unify several expected generalization error bounds based on random subsets using the framework developed by Hellstr\"om and Durisi [1]. First, we recover the bounds based on the individual sample mutual information from Bu et al. [2] and on a random subset of the dataset from Negrea et al. [3]. Then, we introduce their new, analogous bounds in the randomized subsample setting from Steinke and Zakynthinou [4], and we identify some limitations of the framework. Finally, we extend the bounds from Haghifam et al. [5] for Langevin dynamics to stochastic gradient Langevin dynamics and we refine them for loss functions with potentially large gradient norms.

Comments:	To appear in the Information Theory Workshop (ITW 2020) conference. 10 pages, 5 of the main text, and 5 of appendices
Subjects:	Information Theory (cs.IT); Machine Learning (stat.ML)
DOI:	10.1109/ITW46852.2021.9457578
Cite as:	arXiv:2010.10994 [cs.IT]
	(or arXiv:2010.10994v2 [cs.IT] for this version)

Submission history

From: Borja Rodríguez Gálvez [view email]
[v1] Wed, 21 Oct 2020 13:36:01 GMT (18kb)
[v2] Sat, 16 Jan 2021 10:58:19 GMT (244kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.10994

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Information Theory

Title: On Random Subset Generalization Error Bounds and the Stochastic Gradient Langevin Dynamics Algorithm

Submission history