Sound

Authors and titles for cs.SD in Jun 2022, skipping first 15

[ total of 221 entries; showing entries 16-25 ]
[16]  arXiv:2206.03393 [pdf, other]
Title: Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[17]  arXiv:2206.04006 [pdf, other]
Title: Few-Shot Audio-Visual Learning of Environment Acoustics
Comments: Accepted to NeurIPS 2022
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[18]  arXiv:2206.04658 [pdf, other]
Title: BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Comments: To appear at ICLR 2023. Listen to audio samples from BigVGAN at: this https URL
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[19]  arXiv:2206.04769 [pdf, other]
Title: CLAP: Learning Audio Concepts From Natural Language Supervision
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[20]  arXiv:2206.04780 [pdf, other]
Title: Speak Like a Dog: Human to Non-human creature Voice Conversion
Comments: 5 pages, 4 figures
Journal-ref: 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (pp. 1388-1393)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[21]  arXiv:2206.04805 [pdf, other]
Title: Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022
Comments: Submitted to CEUR-WS under LifeCLEF for the BirdCLEF 2022 challenge as a working note
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[22]  arXiv:2206.04962 [pdf, other]
Title: Feature Learning and Ensemble Pre-Tasks Based Self-Supervised Speech Denoising and Dereverberation
Comments: arXiv admin note: text overlap with arXiv:2112.11142
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[23]  arXiv:2206.04984 [pdf, other]
Title: Zero-Shot Audio Classification using Image Embeddings
Comments: Accepted to the European Signal Processing Conference (EUSIPCO) 2022
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[24]  arXiv:2206.05018 [pdf, ps, other]
Title: Going Beyond the Cookie Theft Picture Test: Detecting Cognitive Impairments using Acoustic Features
Comments: Accepted at the 25th International Conference on Text, Speech and Dialogue (TSD 2022)
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[25]  arXiv:2206.05286 [src]
Title: AHD ConvNet for Speech Emotion Classification
Comments: Wrong authors quoted
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)