Is a Data-Driven Approach still Better than Random Choice with Naive Bayes classifiers?

Szymański, Piotr; Kajdanowicz, Tomasz

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1702

Computer Science > Machine Learning

Title: Is a Data-Driven Approach still Better than Random Choice with Naive Bayes classifiers?

Authors: Piotr Szymański, Tomasz Kajdanowicz

(Submitted on 13 Feb 2017)

Abstract: We study the performance of data-driven, a priori and random approaches to label space partitioning for multi-label classification with a Gaussian Naive Bayes classifier. Experiments were performed on 12 benchmark data sets and evaluated on 5 established measures of classification quality: micro and macro averaged F1 score, Subset Accuracy and Hamming loss. Data-driven methods are significantly better than an average run of the random baseline. In case of F1 scores and Subset Accuracy - data driven approaches were more likely to perform better than random approaches than otherwise in the worst case. There always exists a method that performs better than a priori methods in the worst case. The advantage of data-driven methods against a priori methods with a weak classifier is lesser than when tree classifiers are used.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1702.04013 [cs.LG]
	(or arXiv:1702.04013v1 [cs.LG] for this version)

Submission history

From: Piotr Szymański [view email]
[v1] Mon, 13 Feb 2017 23:04:31 GMT (1045kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1702.04013

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Is a Data-Driven Approach still Better than Random Choice with Naive Bayes classifiers?

Submission history