Current browse context:
cs.SY
Change to browse by:
References & Citations
Computer Science > Systems and Control
Title: Gradient-based learning algorithms with constant-error estimators: stability and convergence
(Submitted on 1 Apr 2016 (this version), latest version 18 Sep 2017 (v3))
Abstract: Implementations of stochastic gradient search algorithms such as back propagation typically rely on finite difference ($FD$) approximation methods. These methods are used to approximate the objective function gradient in steepest descent algorithms as well as the gradient and Hessian inverse in Newton based schemes. The convergence analyses of such schemes critically require that perturbation parameters in the estimators of the gradient/Hessian approach zero. However, in practice, the perturbation parameter is often held fixed to a `small' constant resulting in constant-error estimates. We present in this paper a theoretical framework based on set-valued dynamical systems to analyze the aforementioned. Easily verifiable conditions are presented for stability and convergence when using such $FD$ estimators for the gradient/Hessian. In addition, our framework dispenses with a critical restriction on the step-sizes (learning rate) when using FD estimators.
Submission history
From: Arunselvan Ramaswamy [view email][v1] Fri, 1 Apr 2016 07:03:46 GMT (17kb)
[v2] Tue, 27 Sep 2016 14:36:07 GMT (27kb,D)
[v3] Mon, 18 Sep 2017 08:56:56 GMT (30kb,D)
Link back to: arXiv, form interface, contact.