Audio and Speech Processing

Authors and titles for recent submissions, skipping first 23

[ total of 46 entries: 1-10 | 4-13 | 14-23 | 24-33 | 34-43 | 44-46 ]

Tue, 23 Apr 2024 (continued, showing 10 of 17 entries)

[24]  arXiv:2404.13568 (cross-list from cs.SD) [pdf, ps, other]
Title: Sparse Direction of Arrival Estimation Method Based on Vector Signal Reconstruction with a Single Vector Sensor
Authors: Jiabin Guo
Comments: 20 pages
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[25]  arXiv:2404.13551 (cross-list from cs.SD) [pdf, other]
Title: AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[26]  arXiv:2404.13509 (cross-list from cs.SD) [pdf, ps, other]
Title: MFHCA: Enhancing Speech Emotion Recognition Via Multi-Spatial Fusion and Hierarchical Cooperative Attention
Comments: Main paper (5 pages). Accepted for publication by ICME 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[27]  arXiv:2404.13428 (cross-list from cs.SD) [pdf, ps, other]
Title: Text-dependent Speaker Verification (TdSV) Challenge 2024: Challenge Evaluation Plan
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[28]  arXiv:2404.13418 (cross-list from cs.HC) [pdf, ps, other]
Title: Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education
Comments: 5 pages, 7 figures, submitted to Acoustical Science and Technology of Acoustical Society of Japan
Subjects: Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS)
[29]  arXiv:2404.13362 (cross-list from cs.CL) [pdf, other]
Title: Semantically Corrected Amharic Automatic Speech Recognition
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[30]  arXiv:2404.13358 (cross-list from cs.SD) [pdf, other]
Title: Music Consistency Models
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[31]  arXiv:2404.13289 (cross-list from cs.CL) [pdf, other]
Title: Double Mixture: Towards Continual Event Detection from Speech
Comments: The first two authors contributed equally to this work
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[32]  arXiv:2404.13286 (cross-list from cs.SD) [pdf, other]
Title: Track Role Prediction of Single-Instrumental Sequences
Comments: ISMIR LBD 2023
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Audio and Speech Processing (eess.AS)
[33]  arXiv:2404.13140 (cross-list from quant-ph) [pdf, ps, other]
Title: Intro to Quantum Harmony: Chords in Superposition
Subjects: Quantum Physics (quant-ph); Emerging Technologies (cs.ET); Sound (cs.SD); Audio and Speech Processing (eess.AS)
