Sound

Authors and titles for cs.SD in Dec 2019, skipping first 40

[ total of 90 entries; showing entries 41-50 ]
[41]  arXiv:1912.07050 (cross-list from cs.CL) [pdf, ps, other]
Title: Computational Induction of Prosodic Structure
Authors: Dafydd Gibbon
Comments: 29 pages, 10 figures, code appendix, to appear in "Studies in Prosodic Grammar"
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[42]  arXiv:1912.07756 (cross-list from cs.LG) [pdf, ps, other]
Title: Data augmentation approaches for improving animal audio classification
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[43]  arXiv:1912.07875 (cross-list from cs.CL) [pdf, ps, other]
Title: Libri-Light: A Benchmark for ASR with Limited or No Supervision
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[44]  arXiv:1912.08639 (cross-list from cs.CV) [pdf, other]
Title: Detecting Adversarial Attacks On Audiovisual Speech Recognition
Comments: Accepted to ICASSP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[45]  arXiv:1912.09261 (cross-list from cs.LG) [pdf, ps, other]
Title: Practical applicability of deep neural networks for overlapping speaker separation
Comments: Interspeech 2019
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[46]  arXiv:1912.10131 (cross-list from cs.MM) [pdf, other]
Title: Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog
Comments: Presented at the 3rd Visually Grounded Interaction and Language (ViGIL) Workshop, NeurIPS 2019, Vancouver, Canada. arXiv admin note: substantial text overlap with arXiv:1812.08407, arXiv:1912.10132
Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[47]  arXiv:1912.10915 (cross-list from cs.CL) [pdf, other]
Title: Probing the phonetic and phonological knowledge of tones in Mandarin TTS models
Authors: Jian Zhu
Comments: Submitted to Speech Prosody 2020
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[48]  arXiv:1912.11474 (cross-list from cs.CV) [pdf, other]
Title: SoundSpaces: Audio-Visual Navigation in 3D Environments
Comments: Accepted to ECCV 2020 (Spotlight)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[49]  arXiv:1912.11684 (cross-list from cs.CV) [pdf, other]
Title: Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
Comments: Accepted by ICRA 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[50]  arXiv:1912.12362 (cross-list from cs.MM) [pdf, other]
Title: Structural characterization of musical harmonies
Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)