We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for recent submissions, skipping first 30

[ total of 42 entries: 1-10 | 11-20 | 21-30 | 31-40 | 41-42 ]
[ showing 10 entries per page: fewer | more | all ]

Tue, 30 Apr 2024 (continued, showing last 4 of 13 entries)

[31]  arXiv:2404.17821 (cross-list from cs.SD) [pdf, ps, other]
Title: An automatic mixing speech enhancement system for multi-track audio
Comments: 5 pages
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[32]  arXiv:2404.17806 (cross-list from cs.SD) [pdf, other]
Title: T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
Comments: Preprint submitted to IEEE MLSP 2024
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[33]  arXiv:2404.17721 (cross-list from cs.SD) [pdf, ps, other]
Title: An RFP dataset for Real, Fake, and Partially fake audio detection
Subjects: Sound (cs.SD); Cryptography and Security (cs.CR); Audio and Speech Processing (eess.AS)
[34]  arXiv:2404.17608 (cross-list from cs.SD) [pdf, ps, other]
Title: Synthesizing Audio from Silent Video using Sequence to Sequence Modeling
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

Mon, 29 Apr 2024 (showing first 6 of 8 entries)

[35]  arXiv:2404.17552 [pdf, other]
Title: A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification
Comments: Keywords:, semi-automatic processing, corpus creation, diarization, speaker identification, gender-balanced, age-balanced, speaker corpus, diachrony
Journal-ref: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 3271-3280, Marseille, 20-25 June 2022. European Language Resources Association (ELRA)
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG); Sound (cs.SD)
[36]  arXiv:2404.17490 [pdf, other]
Title: The CARFAC v2 Cochlear Model in Matlab, NumPy, and JAX
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[37]  arXiv:2404.17107 [pdf, other]
Title: Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection
Comments: 4 pages, 1 figure, and 4 tables. Accepted by IEEE EMBC 2024
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[38]  arXiv:2404.17280 (cross-list from cs.SD) [pdf, other]
Title: Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[39]  arXiv:2404.17161 (cross-list from cs.SD) [pdf, other]
Title: An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder
Comments: arXiv admin note: text overlap with arXiv:2311.14957
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[40]  arXiv:2404.17022 (cross-list from cs.SD) [pdf, ps, other]
Title: Investigating differences in lab-quality and remote recording methods with dynamic acoustic measures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 42 entries: 1-10 | 11-20 | 21-30 | 31-40 | 41-42 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2405, contact, help  (Access key information)