We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for eess.AS in Apr 2021

[ total of 266 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 261-266 ]
[ showing 10 entries per page: fewer | more | all ]
[1]  arXiv:2104.00120 [pdf, ps, other]
Title: Multi-Encoder Learning and Stream Fusion for Transformer-Based End-to-End Automatic Speech Recognition
Comments: accepted at INTERSPEECH 2021
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[2]  arXiv:2104.00230 [pdf, other]
Title: Bidirectional Multiscale Feature Aggregation for Speaker Verification
Authors: Jiajun Qi, Wu Guo, Bin Gu
Subjects: Audio and Speech Processing (eess.AS)
[3]  arXiv:2104.00259 [pdf, other]
Title: Interactive spatial speech recognition maps based on simulated speech recognition experiments
Comments: 16 pages, 11 figures, related code this https URL
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[4]  arXiv:2104.00353 [pdf, other]
Title: CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Comments: 9 pages, 5 figures, submitted to IEEE Transactions on Multimedia, the authors contributed equally to this work
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[5]  arXiv:2104.00436 [pdf, other]
Title: Expressive Text-to-Speech using Style Tag
Comments: Submitted to Interspeech 2021
Subjects: Audio and Speech Processing (eess.AS)
[6]  arXiv:2104.00624 [pdf, ps, other]
Title: Fast DCTTS: Efficient Deep Convolutional Text-to-Speech
Comments: 5 pages, 1 figure, to be published in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[7]  arXiv:2104.00769 [pdf, other]
Title: Keyword Transformer: A Self-Attention Model for Keyword Spotting
Comments: Proceedings of INTERSPEECH
Journal-ref: Proc. Interspeech 2021, 4249-4253
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[8]  arXiv:2104.00931 [pdf, other]
Title: Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[9]  arXiv:2104.00960 [pdf, other]
Title: INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Comments: 5 pages, submitted to INTERSPEECH 2021
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[10]  arXiv:2104.00994 [pdf, other]
Title: Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Comments: Accepted for publication in INTERSPEECH 2021
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[ total of 266 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 261-266 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, 2208, contact, help  (Access key information)