We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: F-measure Maximization in Multi-Label Classification with Conditionally Independent Label Subsets

Abstract: We discuss a method to improve the exact F-measure maximization algorithm called GFM, proposed in (Dembczynski et al. 2011) for multi-label classification, assuming the label set can be can partitioned into conditionally independent subsets given the input features. If the labels were all independent, the estimation of only $m$ parameters ($m$ denoting the number of labels) would suffice to derive Bayes-optimal predictions in $O(m^2)$ operations. In the general case, $m^2+1$ parameters are required by GFM, to solve the problem in $O(m^3)$ operations. In this work, we show that the number of parameters can be reduced further to $m^2/n$, in the best case, assuming the label set can be partitioned into $n$ conditionally independent subsets. As this label partition needs to be estimated from the data beforehand, we use first the procedure proposed in (Gasse et al. 2015) that finds such partition and then infer the required parameters locally in each label subset. The latter are aggregated and serve as input to GFM to form the Bayes-optimal prediction. We show on a synthetic experiment that the reduction in the number of parameters brings about significant benefits in terms of performance.
Subjects: Machine Learning (cs.LG)
Cite as: arXiv:1604.07759 [cs.LG]
  (or arXiv:1604.07759v3 [cs.LG] for this version)

Submission history

From: Maxime Gasse [view email]
[v1] Tue, 26 Apr 2016 17:18:24 GMT (28kb)
[v2] Mon, 6 Jun 2016 12:26:27 GMT (28kb)
[v3] Fri, 1 Jul 2016 13:17:01 GMT (28kb)

Link back to: arXiv, form interface, contact.