We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: A Method for Discovering Novel Classes in Tabular Data

Abstract: In Novel Class Discovery (NCD), the goal is to find new classes in an unlabeled set given a labeled set of known but different classes. While NCD has recently gained attention from the community, no framework has yet been proposed for heterogeneous tabular data, despite being a very common representation of data. In this paper, we propose TabularNCD, a new method for discovering novel classes in tabular data. We show a way to extract knowledge from already known classes to guide the discovery process of novel classes in the context of tabular data which contains heterogeneous variables. A part of this process is done by a new method for defining pseudo labels, and we follow recent findings in Multi-Task Learning to optimize a joint objective function. Our method demonstrates that NCD is not only applicable to images but also to heterogeneous tabular data. Extensive experiments are conducted to evaluate our method and demonstrate its effectiveness against 3 competitors on 7 diverse public classification datasets.
Comments: 10 pages
Subjects: Machine Learning (cs.LG)
Cite as: arXiv:2209.01217 [cs.LG]
  (or arXiv:2209.01217v2 [cs.LG] for this version)

Submission history

From: Colin Troisemaine [view email]
[v1] Fri, 2 Sep 2022 11:45:24 GMT (567kb,D)
[v2] Mon, 3 Oct 2022 12:24:44 GMT (669kb,D)
[v3] Mon, 14 Nov 2022 14:10:39 GMT (912kb,D)

Link back to: arXiv, form interface, contact.