Sound

Authors and titles for cs.SD in Oct 2021, skipping first 50

[ total of 324 entries; showing entries 51-60 ]

[51] arXiv:2110.05087 [pdf, ps, other]: Title: A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing

Authors: Wei Liu, Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas Fang Zheng

Comments: submitted to ICASSP 2022

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[52] arXiv:2110.05580 [pdf, other]: Title: vocadito: A dataset of solo vocals with $f_0$, note, and lyric annotations

Authors: Rachel M. Bittner, Katherine Pasalo, Juan José Bosch, Gabriel Meseguer-Brocal, David Rubinstein

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[53] arXiv:2110.05587 [pdf, other]: Title: Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes

Authors: Karn N. Watcharasupat, Alexander Lerch

Comments: Submitted to the Late-Breaking Demo Session of the 22nd International Society for Music Information Retrieval Conference

Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Information Theory (cs.IT); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[54] arXiv:2110.05713 [pdf, other]: Title: Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning

Authors: Wenxin Tai, Jiajia Li, Yixiang Wang, Tian Lan, Qiao Liu

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[55] arXiv:2110.05765 [pdf, other]: Title: Music Sentiment Transfer

Authors: Miles Sigel, Michael Zhou, Jiebo Luo

Comments: NSF REU: Computational Methods for Understanding Music, Media, and Minds, University of Rochester

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[56] arXiv:2110.05777 [pdf, other]: Title: Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification

Authors: Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng

Comments: Accepted by ICASSP 2022

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[57] arXiv:2110.05798 [pdf, other]: Title: Adapting TTS models For New Speakers using Transfer Learning

Authors: Paarth Neekhara, Jason Li, Boris Ginsburg

Comments: Submitted to Interspeech 2022

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[58] arXiv:2110.05866 [pdf, ps, other]: Title: MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech

Authors: Szu-Wei Fu, Cheng Yu, Kuo-Hsuan Hung, Mirco Ravanelli, Yu Tsao

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[59] arXiv:2110.05966 [pdf, other]: Title: Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training

Authors: Changsheng Quan, Xiaofei Li

Comments: accepted by ICASSP 2022

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[60] arXiv:2110.05975 [pdf, other]: Title: Multi-Channel Far-Field Speaker Verification with Large-Scale Ad-hoc Microphone Arrays

Authors: Chengdong Liang, Yijiang Chen, Jiadi Yao, Xiao-Lei Zhang

Comments: 5 pages, 3 figures

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
