On Expected Accuracy

İrsoy, Ozan

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1905

Computer Science > Machine Learning

Title: On Expected Accuracy

Authors: Ozan İrsoy

(Submitted on 1 May 2019)

Abstract: We empirically investigate the (negative) expected accuracy as an alternative loss function to cross entropy (negative log likelihood) for classification tasks. Coupled with softmax activation, it has small derivatives over most of its domain, and is therefore hard to optimize. A modified, leaky version is evaluated on a variety of classification tasks, including digit recognition, image classification, sequence tagging and tree tagging, using a variety of neural architectures such as logistic regression, multilayer perceptron, CNN, LSTM and Tree-LSTM. We show that it yields comparable or better accuracy compared to cross entropy. Furthermore, the proposed objective is shown to be more robust to label noise.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1905.00448 [cs.LG]
	(or arXiv:1905.00448v1 [cs.LG] for this version)

Submission history

From: Ozan İrsoy [view email]
[v1] Wed, 1 May 2019 18:53:48 GMT (1052kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1905.00448

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: On Expected Accuracy

Submission history