A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

Kumar, Anurag; Ithapu, Vamsi Krishna

(Submitted on 30 Jun 2020)

Abstract: An important problem in machine auditory perception is to recognize and detect sound events. In this paper, we propose a sequential self-teaching approach to learning sounds. Our main proposition is that it is harder to learn sounds in adverse situations, such as from weakly labeled and/or noisily labeled data, and that in these situations a single stage of learning is not sufficient. Our proposal is a sequential stage-wise learning process that improves the generalization capabilities of a given modeling system. We justify this method via technical results. On Audioset, the largest sound events dataset, our sequential learning approach can lead to up to 9% improvement in performance. A comprehensive evaluation also shows that the method improves the transferability of knowledge from previously trained models, thereby improving generalization on transfer learning tasks.

Comments:	Accepted at the International Conference on Machine Learning (ICML) 2020. 14 pages
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2007.00144 [cs.SD]
	(or arXiv:2007.00144v1 [cs.SD] for this version)

Submission history

From: Anurag Kumar
[v1] Tue, 30 Jun 2020 22:53:43 GMT (321kb,D)
