We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Jun 2022, skipping first 200

[ total of 221 entries: 1-25 | ... | 126-150 | 151-175 | 176-200 | 201-221 ]
[ showing 25 entries per page: fewer | more | all ]
[201]  arXiv:2206.13232 (cross-list from eess.AS) [pdf, other]
Title: Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection
Comments: 5 pages, 1 figure, accepted by INTERSPEECH 2022
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[202]  arXiv:2206.13240 (cross-list from eess.AS) [pdf, other]
Title: A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Comments: Accepted at ECNLP @ACL 2022
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[203]  arXiv:2206.13272 (cross-list from eess.AS) [pdf, other]
Title: Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[204]  arXiv:2206.13310 (cross-list from eess.AS) [pdf, other]
Title: Insights into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. Changes: added reference
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[205]  arXiv:2206.13365 (cross-list from eess.AS) [pdf, other]
Title: Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[206]  arXiv:2206.13404 (cross-list from eess.AS) [pdf, other]
Title: Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD)
[207]  arXiv:2206.13411 (cross-list from eess.AS) [pdf, other]
Title: Audio Similarity is Unreliable as a Proxy for Audio Quality
Comments: To Appear, Interspeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[208]  arXiv:2206.13443 (cross-list from eess.AS) [pdf, other]
Title: CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer
Comments: Accepted to be published in the Proceedings of InterSpeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[209]  arXiv:2206.13762 (cross-list from eess.AS) [pdf, other]
Title: A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Comments: Accepted to INTERSPEECH 2022; Made some motifications in Fig.1 so that the system architecture will be more clear
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[210]  arXiv:2206.13768 (cross-list from eess.AS) [pdf, ps, other]
Title: Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[211]  arXiv:2206.13807 (cross-list from eess.AS) [pdf, other]
Title: Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector
Comments: 5 pages, 4 figures, 5 tables, accepted to 2022 Interspeech as a conference paper
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[212]  arXiv:2206.13808 (cross-list from eess.AS) [pdf, other]
Title: Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion
Comments: To be presented at EUSIPCO 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[213]  arXiv:2206.13865 (cross-list from eess.AS) [pdf, other]
Title: RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
Comments: 5 pages, 1 figure, 3 tables. Accepted by Interspeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[214]  arXiv:2206.14165 (cross-list from eess.AS) [pdf, other]
Title: Expressive, Variable, and Controllable Duration Modelling in TTS
Comments: Accepted to be published in the Proceedings of InterSpeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[215]  arXiv:2206.14357 (cross-list from eess.AS) [pdf, other]
Title: Comparing Conventional Pitch Detection Algorithms with a Neural Network Approach
Authors: Anja Kroon (McGill University)
Comments: 6 pages, 11 figures
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[216]  arXiv:2206.14524 (cross-list from eess.AS) [pdf]
Title: A light-weight full-band speech enhancement model
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[217]  arXiv:2206.14639 (cross-list from eess.AS) [pdf, other]
Title: DDKtor: Automatic Diadochokinetic Speech Analysis
Comments: Accepted to Interspeech 2022
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[218]  arXiv:2206.14962 (cross-list from eess.AS) [pdf, other]
Title: GLD-Net: Improving Monaural Speech Enhancement by Learning Global and Local Dependency Features with GLD Block
Comments: Accepted by Interspeech 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[219]  arXiv:2206.14964 (cross-list from eess.AS) [pdf, other]
Title: Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention
Comments: Accepted by Interspeech 2022. arXiv admin note: substantial text overlap with arXiv:2101.06268
Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD)
[220]  arXiv:2206.14984 (cross-list from eess.AS) [pdf, other]
Title: TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Comments: Accepted to the conference of INTERSPEECH 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[221]  arXiv:2206.15356 (cross-list from eess.AS) [pdf, other]
Title: Acoustic Room Compensation Using Local PCA-based Room Average Power Response Estimation
Comments: 5 pages, 7 figures, to appear in IWAENC 2022
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[ total of 221 entries: 1-25 | ... | 126-150 | 151-175 | 176-200 | 201-221 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2212, contact, help  (Access key information)