Audio and Speech Processing

Authors and titles for recent submissions, skipping first 38

[ total of 44 entries: 1-25 | 14-38 | 39-44 ]

Wed, 17 Apr 2024 (continued, showing last 6 of 9 entries)

[39]  arXiv:2404.10440 (cross-list from cs.CL) [pdf, other]
Title: Language Proficiency and F0 Entrainment: A Study of L2 English Imitation in Italian, French, and Slovak Speakers
Comments: Accepted at Speech Prosody 2024
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[40]  arXiv:2404.10316 (cross-list from cs.SD) [pdf, ps, other]
Title: Multiple Mobile Target Detection and Tracking in Active Sonar Array Using a Track-Before-Detect Approach
Comments: 10 pages, 10 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[41]  arXiv:2404.10301 (cross-list from cs.SD) [pdf, other]
Title: Long-form music generation with latent diffusion
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[42]  arXiv:2404.10299 (cross-list from cs.LG) [pdf, other]
Title: Clustering and Data Augmentation to Improve Accuracy of Sleep Assessment and Sleep Individuality Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[43]  arXiv:2404.10180 (cross-list from cs.CL) [pdf, other]
Title: Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
Comments: 9 pages, 3 figures, accepted by NAACL 2024 - Industry Track
Journal-ref: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics - Industry Track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[44]  arXiv:2404.10112 (cross-list from cs.CL) [pdf, other]
Title: PRODIS -- a speech database and a phoneme-based language model for the study of predictability effects in Polish
Comments: To appear in the proceedings of LREC2024: Language Resources and Evaluation Conference 2024, Turin, Italy
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)