We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Oct 2021, skipping first 50

[ total of 324 entries: 1-10 | ... | 21-30 | 31-40 | 41-50 | 51-60 | 61-70 | 71-80 | 81-90 | ... | 321-324 ]
[ showing 10 entries per page: fewer | more | all ]
[51]  arXiv:2110.05087 [pdf, ps, other]
Title: A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing
Comments: submitted to ICASSP 2022
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[52]  arXiv:2110.05580 [pdf, other]
Title: vocadito: A dataset of solo vocals with $f_0$, note, and lyric annotations
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[53]  arXiv:2110.05587 [pdf, other]
Title: Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes
Comments: Submitted to the Late-Breaking Demo Session of the 22nd International Society for Music Information Retrieval Conference
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Information Theory (cs.IT); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[54]  arXiv:2110.05713 [pdf, other]
Title: Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[55]  arXiv:2110.05765 [pdf, other]
Title: Music Sentiment Transfer
Comments: NSF REU: Computational Methods for Understanding Music, Media, and Minds, University of Rochester
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[56]  arXiv:2110.05777 [pdf, other]
Title: Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification
Comments: Accepted by ICASSP 2022
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[57]  arXiv:2110.05798 [pdf, other]
Title: Adapting TTS models For New Speakers using Transfer Learning
Comments: Submitted to Interspeech 2022
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[58]  arXiv:2110.05866 [pdf, ps, other]
Title: MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[59]  arXiv:2110.05966 [pdf, other]
Title: Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training
Comments: accepted by ICASSP 2022
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[60]  arXiv:2110.05975 [pdf, other]
Title: Multi-Channel Far-Field Speaker Verification with Large-Scale Ad-hoc Microphone Arrays
Comments: 5 pages, 3 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 324 entries: 1-10 | ... | 21-30 | 31-40 | 41-50 | 51-60 | 61-70 | 71-80 | 81-90 | ... | 321-324 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2405, contact, help  (Access key information)