AHD ConvNet for Speech Emotion Classification

Ali, Asfand; Nasir, Danial; Jawad, Mohammad Hassan

Full-text links:

Download:

Source

Current browse context:

cs.CL

< prev | next >

new | recent | 2206

Computer Science > Sound

Title: AHD ConvNet for Speech Emotion Classification

Authors: Asfand Ali, Danial Nasir, Mohammad Hassan Jawad

(Submitted on 10 Jun 2022 (v1), last revised 21 Jun 2022 (this version, v2))

Abstract: Accomplishments in the field of artificial intelligence are utilized in the advancement of computing and making of intelligent machines for facilitating mankind and improving user experience. Emotions are rudimentary for people, affecting thinking and ordinary exercises like correspondence, learning and direction. Speech emotion recognition is domain of interest in this regard and in this work, we propose a novel mel spectrogram learning approach in which our model uses the datapoints to learn emotions from the given wav form voice notes in the popular CREMA-D dataset. Our model uses log mel-spectrogram as feature with number of mels = 64. It took less training time compared to other approaches used to address the problem of emotion speech recognition.

Comments:	Wrong authors quoted
Subjects:	Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2206.05286 [cs.SD]
	(or arXiv:2206.05286v2 [cs.SD] for this version)

Submission history

From: Danial Nasir [view email]
[v1] Fri, 10 Jun 2022 11:57:28 GMT (117kb,D)
[v2] Tue, 21 Jun 2022 12:25:51 GMT (0kb,I)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.05286

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: AHD ConvNet for Speech Emotion Classification

Submission history