We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for recent submissions

[ total of 61 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 61 ]
[ showing 10 entries per page: fewer | more | all ]

Thu, 6 Aug 2020

[1]  arXiv:2008.02098 [pdf, other]
Title: Speaker dependent acoustic-to-articulatory inversion using real-time MRI of the vocal tract
Comments: 5 pages, accepted for publication at Interspeech 2020. arXiv admin note: substantial text overlap with arXiv:2008.00889
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[2]  arXiv:2008.02070 [pdf, other]
Title: Content based singing voice source separation via strong conditioning using aligned phonemes
Comments: 21st International Society for Music Information Retrieval Conference 11-15 October 2020, Montreal, Canada
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[3]  arXiv:2008.02027 [pdf, other]
Title: Learning to Denoise Historical Music
Comments: ISMIR 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[4]  arXiv:2008.01832 [pdf, other]
Title: Future Vector Enhanced LSTM Language Model for LVCSR
Comments: Accepted by ASRU-2017
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[5]  arXiv:2008.02194 (cross-list from cs.SD) [pdf, other]
Title: On the Characterization of Expressive Performance in Classical Music: First Results of the Con Espressione Game
Comments: 8 pages, 2 figures, accepted for the 21st International Society for Music Information Retrieval Conference (ISMIR 2020)
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Audio and Speech Processing (eess.AS)
[6]  arXiv:2008.02069 (cross-list from cs.LG) [pdf, other]
Title: Data Cleansing with Contrastive Learning for Vocal Note Event Annotations
Comments: 21st International Society for Music Information Retrieval Conference 11-15 October 2020, Montreal, Canada
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[7]  arXiv:2008.02063 (cross-list from cs.CV) [pdf, other]
Title: Compact Graph Architecture for Speech Emotion Recognition
Authors: A. Shirian, T. Guha
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[8]  arXiv:2008.02011 (cross-list from cs.SD) [pdf, other]
Title: Neural Loop Combiner: Neural Network Models for Assessing the Compatibility of Loops
Comments: Accepted to the 21st International Society for Music Information Retrieval Conference (ISMIR 2020)
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[9]  arXiv:2008.01951 (cross-list from cs.SD) [pdf, other]
Title: MusPy: A Toolkit for Symbolic Music Generation
Comments: Accepted by International Society for Music Information Retrieval Conference (ISMIR), 2020
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)

Wed, 5 Aug 2020 (showing first 1 of 14 entries)

[10]  arXiv:2008.01698 [pdf, other]
Title: MIRNet: Learning Multiple Identity Representations in Overlapped Speech
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[ total of 61 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 61 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2008, contact, help  (Access key information)