Learning a Single Neuron with Gradient Methods

Yehudai, Gilad; Shamir, Ohad

Computer Science > Machine Learning

(Submitted on 15 Jan 2020 (v1), last revised 27 Feb 2022 (this version, v3))

Abstract: We consider the fundamental problem of learning a single neuron $x \mapsto\sigma(w^\top x)$ using standard gradient methods. As opposed to previous works, which considered specific (and not always realistic) input distributions and activation functions $\sigma(\cdot)$, we ask whether a more general result is attainable, under milder assumptions. On the one hand, we show that some assumptions on the distribution and the activation function are necessary. On the other hand, we prove positive guarantees under mild assumptions, which go beyond those studied in the literature so far. We also point out and study the challenges in further strengthening and generalizing our results.
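The setting described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm or their set of assumptions: the ReLU activation, standard Gaussian inputs, squared loss, realizable target vector, and all hyperparameters (`steps`, `batch`, `lr`, `seed`) are assumptions chosen only to make the example concrete, and `train_single_neuron` is a hypothetical helper name.

```python
import math
import random


def relu(z):
    """Assumed activation sigma: ReLU."""
    return z if z > 0.0 else 0.0


def relu_grad(z):
    """Derivative of ReLU, taking the value 0 at z = 0."""
    return 1.0 if z > 0.0 else 0.0


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def distance(a, b):
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def train_single_neuron(target_w, steps=400, batch=64, lr=0.2, seed=0):
    """Minibatch SGD on the loss 0.5 * (sigma(w.x) - sigma(v.x))^2.

    Assumptions (not taken from the paper): inputs x are standard
    Gaussian, labels come from a target neuron v = target_w, and w
    starts at a small random point.
    """
    rng = random.Random(seed)
    d = len(target_w)
    # Small random initialization; w = 0 would give a zero ReLU gradient.
    w = [rng.gauss(0.0, 0.1) for _ in range(d)]
    for _ in range(steps):
        grad = [0.0] * d
        for _ in range(batch):
            x = [rng.gauss(0.0, 1.0) for _ in range(d)]
            pre = dot(w, x)
            residual = relu(pre) - relu(dot(target_w, x))
            # Chain rule: d/dw of 0.5 * residual^2 is residual * sigma'(w.x) * x.
            scale = residual * relu_grad(pre) / batch
            for i in range(d):
                grad[i] += scale * x[i]
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w


if __name__ == "__main__":
    v = [1.0, -2.0, 0.5]
    w = train_single_neuron(v)
    print("learned weights:", w)
    print("distance to target:", distance(w, v))
```

Swapping `relu` and `relu_grad` for another activation, or replacing the Gaussian sampler with a different input distribution, is a simple way to probe how much the behavior depends on those two choices, which is the question the abstract raises.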

Comments:	Fixed a small bug in the proof of Theorem 4.2
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:2001.05205 [cs.LG]
	(or arXiv:2001.05205v3 [cs.LG] for this version)

Submission history

From: Gilad Yehudai
[v1] Wed, 15 Jan 2020 10:02:45 GMT (220kb,D)
[v2] Tue, 11 Feb 2020 10:46:34 GMT (222kb,D)
[v3] Sun, 27 Feb 2022 11:59:15 GMT (222kb,D)
