Risk and parameter convergence of logistic regression

Ji, Ziwei; Telgarsky, Matus

(Submitted on 20 Mar 2018 (v1), last revised 8 Jun 2019 (this version, v3))

Abstract: Gradient descent, when applied to the task of logistic regression, outputs iterates which are biased to follow a unique ray defined by the data. The direction of this ray is the maximum margin predictor of a maximal linearly separable subset of the data; the gradient descent iterates converge to this ray in direction at the rate $\mathcal{O}(\ln\ln t / \ln t)$. The ray does not pass through the origin in general, and its offset is the bounded global optimum of the risk over the remaining data; gradient descent recovers this offset at a rate $\mathcal{O}((\ln t)^2 / \sqrt{t})$.
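
The directional convergence described in the abstract can be illustrated with a minimal sketch. It covers only the simplest special case, where the whole dataset is linearly separable, so the ray passes through the origin and there is no offset to recover. The three data points, the step size, and the helper names `logistic_risk_gradient` and `run_gradient_descent` are assumptions chosen for illustration and do not come from the paper. The maximum margin direction was worked out by hand for this toy dataset.

```python
import math

# Toy separable dataset (an assumption for illustration, not from the paper).
# Each row is z_i = y_i * x_i, i.e. the label is already folded into the point.
Z = [(1.0, 0.0), (0.0, 1.0), (3.0, 1.0)]

# Maximum margin direction for this dataset, worked out by hand:
# the first two points are the support vectors, giving (1, 1) / sqrt(2).
MAX_MARGIN_DIR = (1.0 / math.sqrt(2.0), 1.0 / math.sqrt(2.0))


def logistic_risk_gradient(w):
    """Gradient of R(w) = (1/n) * sum_i ln(1 + exp(-<w, z_i>))."""
    g0 = g1 = 0.0
    for z0, z1 in Z:
        margin = w[0] * z0 + w[1] * z1
        weight = 1.0 / (1.0 + math.exp(margin))  # equals sigmoid(-margin)
        g0 -= weight * z0
        g1 -= weight * z1
    n = len(Z)
    return (g0 / n, g1 / n)


def run_gradient_descent(steps, step_size=0.5):
    """Run plain gradient descent from w = 0 for steps >= 1 iterations.

    Returns (norm of the final iterate, cosine between it and MAX_MARGIN_DIR).
    """
    w = (0.0, 0.0)
    for _ in range(steps):
        g = logistic_risk_gradient(w)
        w = (w[0] - step_size * g[0], w[1] - step_size * g[1])
    norm = math.hypot(w[0], w[1])
    cosine = (w[0] * MAX_MARGIN_DIR[0] + w[1] * MAX_MARGIN_DIR[1]) / norm
    return norm, cosine


if __name__ == "__main__":
    for t in (10, 100, 1000, 10000):
        norm, cosine = run_gradient_descent(t)
        print(f"t={t:6d}  ||w_t||={norm:7.3f}  cosine={cosine:.6f}")
```

On this toy problem the norm of the iterate should keep growing while its cosine with the maximum margin direction approaches 1, which is the qualitative behavior the abstract describes. The sketch does not attempt to verify the stated rates.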

Comments:	Appears in COLT 2019 with the title "The implicit bias of gradient descent on nonseparable data" (and no other changes)
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1803.07300 [cs.LG]
	(or arXiv:1803.07300v3 [cs.LG] for this version)

Submission history

From: Matus Telgarsky
[v1] Tue, 20 Mar 2018 08:47:27 GMT (37kb,D)
[v2] Fri, 1 Jun 2018 07:53:44 GMT (36kb,D)
[v3] Sat, 8 Jun 2019 13:57:05 GMT (93kb,D)
