We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for recent submissions

[ total of 21 entries: 1-21 ]
[ showing up to 50 entries per page: fewer | more ]

Tue, 25 Feb 2020

[1]  arXiv:2002.10266 [pdf, other]
Title: Rhythm, Chord and Melody Generation for Lead Sheets using Recurrent Neural Networks
Comments: 8 pages, 2 figures, 3 tables, 2 appendices
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[2]  arXiv:2002.09748 [pdf, other]
Title: DECIBEL: Improving Audio Chord Estimation for Popular Music by Alignment and Integration of Crowd-Sourced Symbolic Representations
Comments: 81 pages, 47 figures
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[3]  arXiv:2002.09821 (cross-list from eess.AS) [pdf, other]
Title: A Multi-view CNN-based Acoustic Classification System for Automatic Animal Species Identification
Journal-ref: Ad Hoc Networks 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[4]  arXiv:2002.09607 (cross-list from cs.MM) [pdf, other]
Title: Multi-Representation Knowledge Distillation For Audio Classification
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Mon, 24 Feb 2020

[5]  arXiv:2002.09021 [pdf]
Title: A Comparative Study of Western and Chinese Classical Music based on Soundscape Models
Comments: Paper accepted for 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[6]  arXiv:2002.09286 (cross-list from eess.AS) [pdf, other]
Title: Efficient Trainable Front-Ends for Neural Speech Enhancement
Comments: 5 pages, 5 figures, ICASSP 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Machine Learning (stat.ML)
[7]  arXiv:2002.09143 (cross-list from cs.LG) [pdf, other]
Title: Few-shot acoustic event detection via meta-learning
Comments: ICASSP 2020
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[8]  arXiv:2002.09026 (cross-list from eess.AS) [pdf]
Title: Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix
Comments: Paper accepted for 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Subjects: Audio and Speech Processing (eess.AS); Information Retrieval (cs.IR); Machine Learning (cs.LG); Sound (cs.SD)

Fri, 21 Feb 2020

[9]  arXiv:2002.08582 [pdf, ps, other]
Title: Convergence-guaranteed Independent Positive Semidefinite Tensor Analysis Based on Student's t Distribution
Comments: 5 pages, 3 figures, to appear in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[10]  arXiv:2002.08933 (cross-list from eess.AS) [pdf, other]
Title: Wavesplit: End-to-End Speech Separation by Speaker Clustering
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[11]  arXiv:2002.08926 (cross-list from eess.AS) [pdf, ps, other]
Title: Imputer: Sequence Modelling via Imputation and Dynamic Programming
Comments: preprint
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[12]  arXiv:2002.08796 (cross-list from eess.AS) [pdf, ps, other]
Title: iSEGAN: Improved Speech Enhancement Generative Adversarial Networks
Authors: Deepak Baby
Comments: A short report on improving SEGAN performance
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[13]  arXiv:2002.08742 (cross-list from eess.AS) [pdf, other]
Title: Disentangled Speech Embeddings using Cross-modal Self-supervision
Comments: To appear in ICASSP 2020. The first three authors contributed equally to this work
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[14]  arXiv:2002.08688 (cross-list from eess.AS) [pdf, other]
Title: An empirical study of Conv-TasNet
Comments: In proceedings of ICASSP2020
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)

Thu, 20 Feb 2020

[15]  arXiv:2002.08267 (cross-list from cs.CL) [pdf]
Title: Multilogue-Net: A Context Aware RNN for Multi-modal Emotion Detection and Sentiment Analysis in Conversation
Comments: 10 pages, 4 figures, 6 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[16]  arXiv:2002.08249 (cross-list from eess.AS) [pdf, other]
Title: Workshop Report: Detection and Classification in Marine Bioacoustics with Deep Learning
Comments: 13 pages, 1 figure, 1 table
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[17]  arXiv:2002.08126 (cross-list from cs.CL) [pdf, ps, other]
Title: Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[18]  arXiv:2002.08125 (cross-list from cs.LG) [pdf, other]
Title: Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)

Wed, 19 Feb 2020

[19]  arXiv:2002.07677 [pdf]
Title: Performance Analysis of Adaptive Noise Cancellation for Speech Signal
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[20]  arXiv:2002.07629 (cross-list from eess.AS) [pdf, other]
Title: Multi-Task Siamese Neural Network for Improving Replay Attack Detection
Comments: Submit to INTERSPEECH2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[21]  arXiv:2002.07590 (cross-list from eess.AS) [pdf]
Title: Speech Emotion Recognition using Support Vector Machine
Subjects: Audio and Speech Processing (eess.AS); Information Retrieval (cs.IR); Sound (cs.SD)
[ total of 21 entries: 1-21 ]
[ showing up to 50 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2002, contact, help  (Access key information)