Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

Kim, Taejun; Lee, Jongpil; Nam, Juhan

(Submitted on 28 Oct 2017 (v1), last revised 14 Feb 2018 (this version, v2))

Abstract: Recent work has shown that the end-to-end approach using convolutional neural networks (CNNs) is effective in various types of machine learning tasks. For audio signals, the approach takes raw waveforms as input using a 1-D convolution layer. In this paper, we improve the 1-D CNN architecture for music auto-tagging by adopting building blocks from state-of-the-art image classification models, ResNets and SENets, and adding multi-level feature aggregation to it. We compare different combinations of the modules in building CNN architectures. The results show that they achieve significant improvements over previous state-of-the-art models on the MagnaTagATune dataset and comparable results on the Million Song Dataset. Furthermore, we analyze and visualize our model to show how the 1-D CNN operates.
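
The abstract names SENets as one source of building blocks for the 1-D CNN. As a rough illustration only, and not the authors' implementation, the generic squeeze-and-excitation idea applied to a 1-D feature map might look like the sketch below. The function name `squeeze_excite_1d`, the layer sizes, the reduction ratio, and the random weights are all assumptions made for this example; none of them come from the paper.

```python
import math
import random

def squeeze_excite_1d(x, w1, b1, w2, b2):
    """Generic SE-style channel recalibration for a 1-D feature map.

    x is a list of channels, each a list of time samples.
    w1/b1 and w2/b2 are the weights of two small fully connected layers
    (hypothetical values here, normally learned during training).
    """
    # Squeeze: global average pooling over time gives one value per channel.
    z = [sum(ch) / len(ch) for ch in x]
    # Excitation, step 1: bottleneck fully connected layer followed by ReLU.
    h = [max(0.0, sum(w * v for w, v in zip(row, z)) + b)
         for row, b in zip(w1, b1)]
    # Excitation, step 2: fully connected layer followed by a sigmoid,
    # producing one gate in (0, 1) per channel.
    s = [1.0 / (1.0 + math.exp(-(sum(w * v for w, v in zip(row, h)) + b)))
         for row, b in zip(w2, b2)]
    # Scale: every time step of a channel is multiplied by that channel's gate.
    y = [[g * v for v in ch] for g, ch in zip(s, x)]
    return y, s

# Illustrative sizes only (assumptions, not taken from the paper).
random.seed(0)
channels, time_steps, reduction = 8, 32, 4
hidden = channels // reduction

x = [[random.gauss(0.0, 1.0) for _ in range(time_steps)] for _ in range(channels)]
w1 = [[random.gauss(0.0, 0.1) for _ in range(channels)] for _ in range(hidden)]
b1 = [0.0] * hidden
w2 = [[random.gauss(0.0, 0.1) for _ in range(hidden)] for _ in range(channels)]
b2 = [0.0] * channels

y, gates = squeeze_excite_1d(x, w1, b1, w2, b2)
```

The output `y` has the same shape as `x`; only the per-channel scaling changes. In a trained network the two weight matrices would be learned, so the gates would emphasize informative channels rather than being near-arbitrary as they are here.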

Comments:	Accepted for publication at ICASSP 2018
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1710.10451 [cs.SD]
	(or arXiv:1710.10451v2 [cs.SD] for this version)

Submission history

From: Taejun Kim
[v1] Sat, 28 Oct 2017 11:55:50 GMT (286kb,D)
[v2] Wed, 14 Feb 2018 04:39:50 GMT (287kb,D)
