RoNID: New Intent Discovery with Generated-Reliable Labels and Cluster-friendly Representations

Zhang, Shun; Yan, Chaoran; Yang, Jian; Ren, Changyu; Bai, Jiaqi; Li, Tongliang; Li, Zhoujun

(Submitted on 13 Apr 2024 (v1), last revised 18 Apr 2024 (this version, v2))

Abstract: New Intent Discovery (NID) aims to identify known intents and to infer novel intent groups in an open-world scenario. Current methods suffer from inaccurate pseudo-labels and poor representation learning, which reinforce each other in a negative feedback loop and degrade overall model performance, including accuracy and the Adjusted Rand Index. To address these challenges, we propose a Robust New Intent Discovery (RoNID) framework, optimized by an EM-style method, that focuses on constructing reliable pseudo-labels and obtaining cluster-friendly discriminative representations. RoNID comprises two main modules: a reliable pseudo-label generation module and a cluster-friendly representation learning module. In the E-step, the pseudo-label generation module assigns reliable synthetic labels by solving an optimal transport problem, which provides high-quality supervision signals as input to the representation learning module. In the M-step, the representation learning module combines intra-cluster and inter-cluster contrastive learning to obtain representations with strong intra-cluster compactness and large inter-cluster separation, and feeds these more discriminative features back into the generation module. The two steps are iterated to yield a robust model with reliable pseudo-labels and cluster-friendly representations. Experimental results on multiple benchmarks show that our method improves on previous state-of-the-art methods by 1 to 4 points.
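
The abstract does not spell out how the optimal transport problem in the E-step is solved. As a rough illustration only, the sketch below uses Sinkhorn-Knopp iterations with an equal-size cluster constraint, a common choice for this kind of pseudo-labeling; whether RoNID uses this exact formulation is an assumption. The function names, the `epsilon` and `n_iters` parameters, and the toy scores are hypothetical and not taken from the paper.

```python
import math


def sinkhorn_pseudo_labels(scores, epsilon=0.05, n_iters=50):
    """Balanced soft assignment of N samples to K clusters (Sinkhorn-Knopp).

    scores: N rows of K sample-to-cluster similarity scores (hypothetical input).
    Returns an N x K matrix whose rows sum to 1 and whose columns each sum to
    roughly N / K, i.e. every cluster receives about the same total mass.
    Assumes scores are on a small scale so that exp() does not underflow to 0.
    """
    n, k = len(scores), len(scores[0])
    # Subtract the global maximum before exponentiating, for numerical stability.
    top = max(max(row) for row in scores)
    q = [[math.exp((v - top) / epsilon) for v in row] for row in scores]
    for _ in range(n_iters):
        # Column step: rescale each cluster so it holds total mass 1 / K.
        col_sums = [sum(q[i][j] for i in range(n)) for j in range(k)]
        q = [[q[i][j] / (col_sums[j] * k) for j in range(k)] for i in range(n)]
        # Row step: rescale each sample so it holds total mass 1 / N.
        row_sums = [sum(row) for row in q]
        q = [[v / (row_sums[i] * n) for v in row] for i, row in enumerate(q)]
    # Multiply by N so that each row is a probability distribution over clusters.
    return [[v * n for v in row] for row in q]


def hard_labels(q):
    """Pick the most probable cluster index for each sample."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    # Toy scores: every sample prefers cluster 0, with decreasing confidence.
    toy_scores = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.3], [0.6, 0.4]]
    print(hard_labels(sinkhorn_pseudo_labels(toy_scores)))
```

On the toy scores, a plain argmax would put all four samples in cluster 0. The balance constraint instead splits them two and two, keeping the two most confident samples in cluster 0. This is the kind of behavior that stops pseudo-labels from collapsing into a single cluster.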

Comments:	DASFAA 2024
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2404.08977 [cs.CL]
	(or arXiv:2404.08977v2 [cs.CL] for this version)

Submission history

From: Jian Yang
[v1] Sat, 13 Apr 2024 11:58:28 GMT (840kb,D)
[v2] Thu, 18 Apr 2024 06:54:55 GMT (840kb,D)
