The Curious Case of Convex Neural Networks

Sivaprasad, Sarath; Singh, Ankur; Manwani, Naresh; Gandhi, Vineet

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2006

Computer Science > Machine Learning

Title: The Curious Case of Convex Neural Networks

Authors: Sarath Sivaprasad, Ankur Singh, Naresh Manwani, Vineet Gandhi

(Submitted on 9 Jun 2020 (v1), last revised 10 Jul 2021 (this version, v3))

Abstract: In this paper, we investigate a constrained formulation of neural networks where the output is a convex function of the input. We show that the convexity constraints can be enforced on both fully connected and convolutional layers, making them applicable to most architectures. The convexity constraints include restricting the weights (for all but the first layer) to be non-negative and using a non-decreasing convex activation function. Albeit simple, these constraints have profound implications on the generalization abilities of the network. We draw three valuable insights: (a) Input Output Convex Neural Networks (IOC-NNs) self regularize and reduce the problem of overfitting; (b) Although heavily constrained, they outperform the base multi layer perceptrons and achieve similar performance as compared to base convolutional architectures and (c) IOC-NNs show robustness to noise in train labels. We demonstrate the efficacy of the proposed idea using thorough experiments and ablation studies on standard image classification datasets with three different neural network architectures.

Comments:	20 pages, accepted at ECML-PKDD
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2006.05103 [cs.LG]
	(or arXiv:2006.05103v3 [cs.LG] for this version)

Submission history

From: Sarath Sivaprasad [view email]
[v1] Tue, 9 Jun 2020 08:16:38 GMT (594kb,D)
[v2] Sat, 12 Dec 2020 05:57:08 GMT (943kb,D)
[v3] Sat, 10 Jul 2021 10:51:29 GMT (1308kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2006.05103

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: The Curious Case of Convex Neural Networks

Submission history