We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Nov 2020

[ total of 196 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 176-196 ]
[ showing 25 entries per page: fewer | more | all ]
[1]  arXiv:2011.00196 [pdf, other]
Title: RespireNet: A Deep Neural Network for Accurately Detecting Abnormal Lung Sounds in Limited Data Setting
Comments: Code visible at this https URL
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[2]  arXiv:2011.00200 [pdf, other]
Title: The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020
Authors: Xu Xiang
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[3]  arXiv:2011.00695 [pdf, other]
Title: Learning generic feature representation with synthetic data for weakly-supervised sound event detection by inter-frame distance loss
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[4]  arXiv:2011.00773 [pdf, other]
Title: Using a Bi-directional LSTM Model with Attention Mechanism trained on MIDI Data for Generating Unique Music
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[5]  arXiv:2011.00782 [pdf, other]
Title: CVC: Contrastive Learning for Non-parallel Voice Conversion
Comments: Submitted Interspeech 2021, Project Page: this https URL
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[6]  arXiv:2011.00801 [pdf, other]
Title: Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[7]  arXiv:2011.00803 [pdf, other]
Title: What's All the FUSS About Free Universal Sound Separation Data?
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[8]  arXiv:2011.01143 [pdf, other]
Title: Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds
Comments: ICLR 2021, 27 pages
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[9]  arXiv:2011.01151 [pdf, other]
Title: Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End Metric
Comments: Accepted at ICASSP 2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[10]  arXiv:2011.01447 [pdf, other]
Title: A Two-Stage Approach to Device-Robust Acoustic Scene Classification
Comments: Submitted to ICASSP 2021. Code available: this https URL
Journal-ref: ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[11]  arXiv:2011.01518 [pdf, other]
Title: ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 2020
Authors: Shen Chen
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[12]  arXiv:2011.01561 [pdf, other]
Title: Two Heads Are Better Than One: A Two-Stage Approach for Monaural Noise Reduction in the Complex Domain
Comments: Submitted to ICASSP 2021, 5 pages
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[13]  arXiv:2011.01637 [pdf, other]
Title: Shift If You Can: Counting and Visualising Correction Operations for Beat Tracking Evaluation
Comments: ISMIR 2020 Late Breaking/Demo
Subjects: Sound (cs.SD); Information Retrieval (cs.IR)
[14]  arXiv:2011.01709 [pdf, other]
Title: Small footprint Text-Independent Speaker Verification for Embedded Systems
Journal-ref: Acoustics, Speech and Signal Processing (ICASSP), 2021 IEEE International Conference
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[15]  arXiv:2011.02110 [pdf, other]
Title: Can We Trust Deep Speech Prior?
Comments: To be published in IEEE SLT 2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[16]  arXiv:2011.02131 [pdf, other]
Title: DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
Comments: Accepted at IEEE SLT 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[17]  arXiv:2011.02198 [pdf, other]
Title: IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
Comments: Accepted at IEEE SLT 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[18]  arXiv:2011.02314 [pdf, other]
Title: VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech
Comments: Accepted by IEEE SLT 2021. arXiv admin note: text overlap with arXiv:2005.07025
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[19]  arXiv:2011.02329 [pdf, other]
Title: Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[20]  arXiv:2011.02678 [pdf, other]
Title: BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers
Journal-ref: Proc. IEEE ICASSP, June 2021, pp. 7193-7197
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[21]  arXiv:2011.02809 [pdf, other]
Title: Semi-supervised Learning for Singing Synthesis Timbre
Comments: 5 pages, 1 figure, submitted to ICASSP 2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[22]  arXiv:2011.02874 [pdf, ps, other]
Title: Influence of Event Duration on Automatic Wheeze Classification
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[23]  arXiv:2011.02882 [pdf, ps, other]
Title: Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[24]  arXiv:2011.03028 [pdf, other]
Title: From Note-Level to Chord-Level Neural Network Models for Voice Separation in Symbolic Music
Comments: Paper submitted for publication in August 2018
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[25]  arXiv:2011.03414 [pdf, ps, other]
Title: Robust ENF Estimation Based on Harmonic Enhancement and Maximum Weight Clique
Journal-ref: IEEE Transactions on Information Forensics and Security, 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[ total of 196 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 176-196 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help  (Access key information)