Audio and Speech Processing

Authors and titles for recent submissions, skipping first 30

[ total of 42 entries: 1-10 | 11-20 | 21-30 | 31-40 | 41-42 ]

Tue, 30 Apr 2024 (continued, showing last 4 of 13 entries)

[31] arXiv:2404.17821 (cross-list from cs.SD)

Title: An automatic mixing speech enhancement system for multi-track audio

Authors: Xiaojing Liu, Angeliki Mourgela, Hongwei Ai, Joshua D. Reiss

Comments: 5 pages

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)

[32] arXiv:2404.17806 (cross-list from cs.SD)

Title: T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Authors: Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang

Comments: Preprint submitted to IEEE MLSP 2024

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

[33] arXiv:2404.17721 (cross-list from cs.SD)

Title: An RFP dataset for Real, Fake, and Partially fake audio detection

Authors: Abdulazeez AlAli, George Theodorakopoulos

Subjects: Sound (cs.SD); Cryptography and Security (cs.CR); Audio and Speech Processing (eess.AS)

[34] arXiv:2404.17608 (cross-list from cs.SD)

Title: Synthesizing Audio from Silent Video using Sequence to Sequence Modeling

Authors: Hugo Garrido-Lestache Belinchon, Helina Mulugeta, Adam Haile

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

Mon, 29 Apr 2024 (showing first 6 of 8 entries)

[35] arXiv:2404.17552

Title: A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification

Authors: Rémi Uro, David Doukhan, Albert Rilliard, Laëtitia Larcher, Anissa-Claire Adgharouamane, Marie Tahon, Antoine Laurent

Comments: Keywords: semi-automatic processing, corpus creation, diarization, speaker identification, gender-balanced, age-balanced, speaker corpus, diachrony

Journal-ref: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 3271-3280, Marseille, 20-25 June 2022. European Language Resources Association (ELRA)

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG); Sound (cs.SD)

[36] arXiv:2404.17490

Title: The CARFAC v2 Cochlear Model in Matlab, NumPy, and JAX

Authors: Richard F. Lyon, Rob Schonberger, Malcolm Slaney, Mihajlo Velimirović, Honglin Yu

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)

[37] arXiv:2404.17107

Title: Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Comments: 4 pages, 1 figure, and 4 tables. Accepted by IEEE EMBC 2024

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)

[38] arXiv:2404.17280 (cross-list from cs.SD)

Title: Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks

Authors: Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)

[39] arXiv:2404.17161 (cross-list from cs.SD)

Title: An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder

Authors: Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu

Comments: arXiv admin note: text overlap with arXiv:2311.14957

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)

[40] arXiv:2404.17022 (cross-list from cs.SD)

Title: Investigating differences in lab-quality and remote recording methods with dynamic acoustic measures

Authors: Cong Zhang, Kathleen Jepson, Yu-Ying Chuang

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
