Tensor Switching Networks

Tsai, Chuan-Yung; Saxe, Andrew; Cox, David

(Submitted on 31 Oct 2016)

Abstract: We present a novel neural network algorithm, the Tensor Switching (TS) network, which generalizes the Rectified Linear Unit (ReLU) nonlinearity to tensor-valued hidden units. The TS network copies its entire input vector to different locations in an expanded representation, with the location determined by its hidden unit activity. In this way, even a simple linear readout from the TS representation can implement a highly expressive deep-network-like function. The TS network hence avoids the vanishing gradient problem by construction, at the cost of larger representation size. We develop several methods to train the TS network, including equivalent kernels for infinitely wide and deep TS networks, a one-pass linear learning algorithm, and two backpropagation-inspired representation learning algorithms. Our experimental results demonstrate that the TS network is indeed more expressive and consistently learns faster than standard ReLU networks.
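
The copying mechanism described in the abstract can be illustrated with a small sketch. This is one reading of the abstract, not the authors' implementation: it assumes each hidden unit owns one slot of the expanded representation, and that the slot holds a copy of the input when the unit's pre-activation is positive and zeros otherwise. The names `preactivations`, `relu_layer`, `ts_expand`, and `contract` are hypothetical, as are the example weights.

```python
# Illustrative sketch only. Function names, shapes, and the example values
# are assumptions made for this example, not taken from the paper's code.

def preactivations(x, W):
    """Pre-activation of each hidden unit: one dot product per row of W."""
    return [sum(w_i * x_i for w_i, x_i in zip(w, x)) for w in W]

def relu_layer(x, W):
    """Standard ReLU layer: one scalar per hidden unit."""
    return [max(0.0, a) for a in preactivations(x, W)]

def ts_expand(x, W):
    """Assumed TS-style expansion: one slot per hidden unit.

    An active unit (positive pre-activation) gets a full copy of x;
    an inactive unit gets a zero vector of the same length.
    """
    return [list(x) if a > 0 else [0.0] * len(x) for a in preactivations(x, W)]

def contract(z, W):
    """Linear readout of the expanded representation using W itself as
    the readout weights (row-wise dot product of each slot with its row)."""
    return [sum(w_i * z_i for w_i, z_i in zip(w, slot)) for w, slot in zip(W, z)]

x = [1.0, -2.0, 0.5]
W = [
    [0.5, 0.1, 1.0],   # active for this x
    [-1.0, 0.2, 0.0],  # inactive for this x
    [0.0, -1.0, 2.0],  # active for this x
]

z = ts_expand(x, W)
print(z)                  # 3 slots, each of length 3
print(relu_layer(x, W))   # scalar activations
print(contract(z, W))     # same values, recovered by a linear readout
```

Under these assumptions, the expanded representation has size (hidden units × input dimension) rather than one scalar per hidden unit, which is the larger representation size the abstract mentions. Reading it out with the hidden weights reproduces the ReLU outputs, while a readout with freely chosen weights per slot can express more than that.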

Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1610.10087 [cs.NE] (or arXiv:1610.10087v1 [cs.NE] for this version)

Submission history

From: Chuan-Yung Tsai
[v1] Mon, 31 Oct 2016 19:44:50 GMT (415kb,D)
