Medical Speech Symptoms Classification via Disentangled Representation

Wang, Jianzong; Li, Pengcheng; Zhang, Xulong; Cheng, Ning; Xiao, Jing

(Submitted on 8 Mar 2024 (v1), last revised 30 Apr 2024 (this version, v3))

Abstract: Existing work defines intent for the purpose of understanding spoken language. In medical speech, both the textual and the acoustic features carry intent, which is important for symptomatic diagnosis. In this paper, we propose DRSC, a medical speech classification model that automatically learns to disentangle intent and content representations from textual-acoustic data for classification. Intent encoders extract the intent representations of the text domain and the Mel-spectrogram domain, and the reconstructed text and Mel-spectrogram features are then obtained through two exchanges. The intents from the two domains are combined into a joint representation, which is fed into a decision layer for classification. Experimental results show that our model obtains an average accuracy of 95% in detecting 25 different medical symptoms.
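
The data flow described in the abstract can be illustrated with a small forward-pass sketch. This is not the DRSC implementation: the abstract gives no layer types, dimensions, or losses, so the random linear maps, the feature sizes, the per-modality content encoders and decoders, and the reading of "exchange" as swapping intent codes between the two domains are all assumptions made for illustration. Only the two domains, the intent/content split, the joint intent representation, the decision layer, and the 25 classes come from the abstract.

```python
import math
import random

random.seed(0)

NUM_CLASSES = 25   # from the abstract: 25 medical symptoms
FEAT_DIM = 8       # assumed input feature size for both domains
INTENT_DIM = 4     # assumed size of the intent code
CONTENT_DIM = 4    # assumed size of the content code


def linear(in_dim, out_dim):
    """Random (out_dim x in_dim) weights standing in for a trained layer."""
    return [[random.uniform(-0.5, 0.5) for _ in range(in_dim)]
            for _ in range(out_dim)]


def apply(weights, vec):
    """Matrix-vector product."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical encoders and decoder for each domain (text, Mel-spectrogram).
params = {
    domain: {
        "intent_enc": linear(FEAT_DIM, INTENT_DIM),
        "content_enc": linear(FEAT_DIM, CONTENT_DIM),
        "decoder": linear(INTENT_DIM + CONTENT_DIM, FEAT_DIM),
    }
    for domain in ("text", "mel")
}
decision_layer = linear(2 * INTENT_DIM, NUM_CLASSES)


def forward(text_feat, mel_feat):
    feats = {"text": text_feat, "mel": mel_feat}
    intent = {d: apply(params[d]["intent_enc"], feats[d]) for d in feats}
    content = {d: apply(params[d]["content_enc"], feats[d]) for d in feats}

    # Assumed reading of "exchange": each domain is reconstructed from its
    # own content code and the intent code of the other domain.
    recon = {
        "text": apply(params["text"]["decoder"], intent["mel"] + content["text"]),
        "mel": apply(params["mel"]["decoder"], intent["text"] + content["mel"]),
    }

    # Joint intent representation (concatenation is an assumption),
    # followed by the decision layer.
    joint_intent = intent["text"] + intent["mel"]
    probs = softmax(apply(decision_layer, joint_intent))
    return recon, probs


text_feat = [random.random() for _ in range(FEAT_DIM)]
mel_feat = [random.random() for _ in range(FEAT_DIM)]
recon, probs = forward(text_feat, mel_feat)
predicted = max(range(NUM_CLASSES), key=probs.__getitem__)
print("predicted class index:", predicted)
```

In a trained model, the reconstruction outputs would feed a reconstruction loss that pushes the intent codes of the two domains to be interchangeable, while the class probabilities would feed a classification loss. Both losses are left out here because the abstract does not specify them.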

Comments:	Accepted by the 27th International Conference on Computer Supported Cooperative Work in Design (CSCWD 2024)
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.05000 [cs.AI]
	(or arXiv:2403.05000v3 [cs.AI] for this version)

Submission history

From: Pengcheng Li
[v1] Fri, 8 Mar 2024 02:42:34 GMT (1401kb,D)
[v2] Tue, 26 Mar 2024 01:51:37 GMT (1402kb,D)
[v3] Tue, 30 Apr 2024 01:47:37 GMT (1414kb,D)
