We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for recent submissions, skipping first 41

[ total of 48 entries: 1-25 | 17-41 | 42-48 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 19 Apr 2024 (continued, showing last 1 of 9 entries)

[42]  arXiv:2404.11938 (cross-list from cs.MM) [pdf, other]
Title: HyDiscGAN: A Hybrid Distributed cGAN for Audio-Visual Privacy Preservation in Multimodal Sentiment Analysis
Comments: 13 pages, IJCAI-2024
Subjects: Multimedia (cs.MM); Distributed, Parallel, and Cluster Computing (cs.DC); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Thu, 18 Apr 2024

[43]  arXiv:2404.11399 [pdf, other]
Title: In situ sound absorption estimation with the discrete complex image source method
Comments: 37 pages, 12 figures, original manuscript to be submitted to the Journal of Sound and Vibration
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Classical Physics (physics.class-ph)
[44]  arXiv:2404.11275 (cross-list from cs.SD) [pdf, other]
Title: Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation
Comments: Accepted by ICME 2024
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[45]  arXiv:2404.11116 (cross-list from cs.SD) [pdf, other]
Title: Music Enhancement with Deep Filters: A Technical Report for The ICASSP 2024 Cadenza Challenge
Comments: 2 pages, 2 figures, 1 tables, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[46]  arXiv:2404.10989 (cross-list from cs.CV) [pdf, other]
Title: FairSSD: Understanding Bias in Synthetic Speech Detectors
Comments: Accepted at CVPR 2024 (WMF)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[47]  arXiv:2404.10922 (cross-list from cs.CL) [pdf, other]
Title: Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
Comments: NAACL Findings 2024
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[48]  arXiv:2404.10842 (cross-list from cs.SD) [pdf, ps, other]
Title: Unsupervised Speaker Diarization in Distributed IoT Networks Using Federated Learning
Comments: 11 pages, 7 figures, 1 table
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[ total of 48 entries: 1-25 | 17-41 | 42-48 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2404, contact, help  (Access key information)