References & Citations
Statistics > Machine Learning
Title: Convergence of Contrastive Divergence Algorithm in Exponential Family
(Submitted on 17 Mar 2016 (v1), revised 6 May 2016 (this version, v2), latest version 27 Feb 2018 (v3))
Abstract: This paper studies the convergence properties of contrastive divergence algorithm for parameter inference in exponential family, by relating it to Markov chain theory and stochastic stability literature. We prove that, under mild conditions and given a finite data sample $X_1,\dots,X_n \sim p_{\theta^*}$ i.i.d. in an event with probability approaching to 1, the sequence $\{\theta_t\}_{t \ge 0}$ generated by CD algorithm is a positive Harris recurrent chain, and thus processes an unique invariant distribution $\pi_n$. The invariant distribution concentrates around the Maximum Likelihood Estimate at a speed arbitrarily slower than $\sqrt{n}$, and the number of steps in Markov Chain Monte Carlo only affects the coefficient factor of the concentration rate. Finally we conclude that as $n \to \infty$, $$\limsup_{t \to \infty} \left\Vert \frac{1}{t} \sum_{s=1}^t \theta_s - \theta^*\right\Vert \overset{p}{\to} 0.$$
Submission history
From: Bai Jiang [view email][v1] Thu, 17 Mar 2016 23:48:15 GMT (2521kb,D)
[v2] Fri, 6 May 2016 07:23:25 GMT (2524kb,D)
[v3] Tue, 27 Feb 2018 22:25:20 GMT (1424kb,D)
Link back to: arXiv, form interface, contact.