Computer Science > Sound
Title: AHD ConvNet for Speech Emotion Classification
(Submitted on 10 Jun 2022 (v1), last revised 21 Jun 2022 (this version, v2))
Abstract: Advances in artificial intelligence are used to further computing and to build intelligent machines that assist people and improve user experience. Emotions are fundamental to people, affecting thinking and everyday activities such as communication, learning, and decision-making. Speech emotion recognition is a domain of interest in this regard. In this work, we propose a novel mel-spectrogram learning approach in which our model learns emotions from the WAV-format voice recordings in the popular CREMA-D dataset. Our model uses the log mel-spectrogram as its feature, with number of mels = 64. It required less training time than other approaches used to address the problem of speech emotion recognition.
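The feature named in the abstract, a log mel-spectrogram with 64 mel bands, can be illustrated with a minimal NumPy-only sketch. This is not the authors' pipeline: the function names, the 16 kHz sample rate, the FFT size of 1024, the hop of 256 samples, the Hann window, and the HTK-style mel formula are all assumptions chosen for illustration. Only the number of mel bands (64) comes from the abstract.

```python
import numpy as np

def hz_to_mel(hz):
    # HTK-style mel scale (an assumption; the abstract does not say which variant is used).
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=float) / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels=64):
    """Triangular mel filters, shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    # n_mels + 2 edges, equally spaced on the mel scale from 0 Hz to Nyquist.
    hz_edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, fft_freqs.size))
    for m in range(n_mels):
        lo, mid, hi = hz_edges[m], hz_edges[m + 1], hz_edges[m + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr, n_fft=1024, hop=256, n_mels=64, eps=1e-10):
    """Return a log-power mel spectrogram in dB, shape (n_mels, n_frames)."""
    signal = np.asarray(signal, dtype=float)
    if signal.size < n_fft:
        signal = np.pad(signal, (0, n_fft - signal.size))
    n_frames = 1 + (signal.size - n_fft) // hop
    # Slice the signal into overlapping frames and apply a Hann window.
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2       # (n_frames, n_fft // 2 + 1)
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T      # (n_frames, n_mels)
    return (10.0 * np.log10(mel + eps)).T                  # eps avoids log(0)

# Usage on a synthetic one-second 440 Hz tone standing in for a voice recording.
sr = 16000  # assumed sample rate
t = np.arange(sr) / sr
tone = np.sin(2.0 * np.pi * 440.0 * t)
features = log_mel_spectrogram(tone, sr)
print(features.shape)  # (64, 59)
```

The resulting 2-D array (mel bands by time frames) is the kind of image-like input a convolutional network can consume. In practice a library routine such as a mel-spectrogram function from an audio toolkit would likely replace the hand-written filterbank.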
Submission history
From: Danial Nasir
[v1] Fri, 10 Jun 2022 11:57:28 GMT (117kb,D)
[v2] Tue, 21 Jun 2022 12:25:51 GMT (0kb,I)