We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits

Abstract: This paper develops a Hoeffding inequality for the partial sums $\sum_{k=1}^n f (X_k)$, where $\{X_k\}_{k \in \mathbb{Z}_{> 0}}$ is an irreducible Markov chain on a finite state space $S$, and $f : S \to [a, b]$ is a real-valued function. Our bound is simple, general, since it only assumes irreducibility and finiteness of the state space, and powerful. In order to demonstrate its usefulness we provide two applications in multi-armed bandit problems. The first is about identifying an approximately best Markovian arm, while the second is concerned with regret minimization in the context of Markovian bandits.
Comments: International Symposium on Information Theory (ISIT), 2020
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2001.01199 [math.ST]
  (or arXiv:2001.01199v2 [math.ST] for this version)

Submission history

From: Vrettos Moulos [view email]
[v1] Sun, 5 Jan 2020 09:28:10 GMT (12kb,D)
[v2] Fri, 10 Jul 2020 16:56:28 GMT (21kb,D)

Link back to: arXiv, form interface, contact.