Computer Science > Artificial Intelligence
Title: Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations
(Submitted on 18 Jan 2021 (v1), last revised 24 Sep 2021 (this version, v4))
Abstract: Deep-predictive-coding networks (DPCNs) are hierarchical, generative models. They rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial element of DPCNs is a forward-backward inference procedure to uncover sparse, invariant features. However, this inference is a major computational bottleneck. It severely limits the network depth due to learning stagnation. Here, we prove why this bottleneck occurs. We then propose a new forward-inference strategy based on accelerated proximal gradients. This strategy has faster theoretical convergence guarantees than the one used for DPCNs. It overcomes learning stagnation. We also demonstrate that it permits constructing deep and wide predictive-coding networks. Such convolutional networks implement receptive fields that capture well the entire classes of objects on which the networks are trained. This improves the feature representations compared with our lab's previous non-convolutional and convolutional DPCNs. It yields unsupervised object recognition that surpasses convolutional autoencoders and is on par with convolutional networks trained in a supervised manner.
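For readers unfamiliar with the term, the following is a minimal sketch of a generic accelerated proximal gradient method (the textbook FISTA scheme) applied to a small sparse-coding problem, min_x 0.5*||Ax - b||^2 + lam*||x||_1. It is not the paper's inference procedure, which the abstract does not spell out; the function names, the dense list-of-lists matrix, and the fixed step size are all assumptions made for illustration. Methods of this kind are known to reduce the objective gap at a rate of O(1/k^2), compared with O(1/k) for plain proximal gradient descent.

```python
import math

def soft_threshold(v, t):
    """Elementwise proximal operator of t * ||.||_1 (hypothetical helper)."""
    return [math.copysign(max(abs(x) - t, 0.0), x) for x in v]

def matvec(A, x):
    """Compute A x for a list-of-rows matrix A."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def rmatvec(A, r):
    """Compute A^T r for a list-of-rows matrix A."""
    return [sum(A[i][j] * r[i] for i in range(len(A))) for j in range(len(A[0]))]

def objective(A, b, x, lam):
    """0.5 * ||A x - b||^2 + lam * ||x||_1."""
    r = [p - q for p, q in zip(matvec(A, x), b)]
    return 0.5 * sum(v * v for v in r) + lam * sum(abs(v) for v in x)

def fista(A, b, lam, step, iters=200):
    """Accelerated proximal gradient (FISTA) sketch.

    Assumes step <= 1 / L, where L is the largest eigenvalue of A^T A.
    """
    x = [0.0] * len(A[0])
    y = list(x)  # extrapolated point at which the gradient is evaluated
    t = 1.0      # momentum parameter
    for _ in range(iters):
        residual = [p - q for p, q in zip(matvec(A, y), b)]
        grad = rmatvec(A, residual)
        # Gradient step on the smooth term, then the l1 proximal step.
        x_new = soft_threshold(
            [yi - step * gi for yi, gi in zip(y, grad)], step * lam
        )
        # Momentum update that distinguishes the accelerated variant.
        t_new = 0.5 * (1.0 + math.sqrt(1.0 + 4.0 * t * t))
        mom = (t - 1.0) / t_new
        y = [xn + mom * (xn - xo) for xn, xo in zip(x_new, x)]
        x, t = x_new, t_new
    return x

# With A equal to the identity, the minimizer is soft_threshold(b, lam).
codes = fista([[1.0, 0.0], [0.0, 1.0]], [3.0, 0.5], lam=1.0, step=1.0)
```

In this toy call the second coefficient is driven exactly to zero, which is the sparsity-inducing behaviour that makes proximal methods a natural fit for sparse feature inference.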
Submission history
From: Isaac Sledge
[v1] Mon, 18 Jan 2021 02:30:13 GMT (4089kb,D)
[v2] Fri, 5 Feb 2021 07:03:20 GMT (4302kb,D)
[v3] Sat, 15 May 2021 21:52:47 GMT (28693kb,D)
[v4] Fri, 24 Sep 2021 03:50:09 GMT (28693kb,D)