We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Jun 2022, skipping first 200

[ total of 221 entries: 1-10 | ... | 171-180 | 181-190 | 191-200 | 201-210 | 211-220 | 221 ]
[ showing 10 entries per page: fewer | more | all ]
[201]  arXiv:2206.13232 (cross-list from eess.AS) [pdf, other]
Title: Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection
Comments: 5 pages, 1 figure, accepted by INTERSPEECH 2022
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[202]  arXiv:2206.13240 (cross-list from eess.AS) [pdf, other]
Title: A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Comments: Accepted at ECNLP @ACL 2022
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[203]  arXiv:2206.13272 (cross-list from eess.AS) [pdf, other]
Title: Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Comments: This work has been accepted to the journal IEEE Access
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[204]  arXiv:2206.13310 (cross-list from eess.AS) [pdf, other]
Title: Insights Into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement
Comments: Accepted version
Journal-ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 563-575, 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[205]  arXiv:2206.13365 (cross-list from eess.AS) [pdf, other]
Title: Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[206]  arXiv:2206.13404 (cross-list from eess.AS) [pdf, other]
Title: Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Comments: Accepted for publication in the 37th AAAI conference on artificial intelligence (AAAI 2023)
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD)
[207]  arXiv:2206.13411 (cross-list from eess.AS) [pdf, other]
Title: Audio Similarity is Unreliable as a Proxy for Audio Quality
Comments: To Appear, Interspeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[208]  arXiv:2206.13443 (cross-list from eess.AS) [pdf, other]
Title: CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer
Comments: Accepted to be published in the Proceedings of InterSpeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[209]  arXiv:2206.13762 (cross-list from eess.AS) [pdf, other]
Title: A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Comments: Accepted to INTERSPEECH 2022; Made some motifications in Fig.1 so that the system architecture will be more clear
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[210]  arXiv:2206.13768 (cross-list from eess.AS) [pdf, ps, other]
Title: Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[ total of 221 entries: 1-10 | ... | 171-180 | 181-190 | 191-200 | 201-210 | 211-220 | 221 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help  (Access key information)