We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for recent submissions, skipping first 26

[ total of 96 entries: 1-10 | 7-16 | 17-26 | 27-36 | 37-46 | 47-56 | 57-66 | ... | 87-96 ]
[ showing 10 entries per page: fewer | more | all ]

Wed, 31 May 2023 (continued, showing 10 of 23 entries)

[27]  arXiv:2305.19228 (cross-list from cs.CL) [pdf, other]
Title: Unsupervised Melody-to-Lyric Generation
Comments: Accepted to ACL 23. arXiv admin note: substantial text overlap with arXiv:2305.07760
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[28]  arXiv:2305.19184 (cross-list from eess.AS) [pdf, other]
Title: Leveraging Semantic Information for Efficient Self-Supervised Emotion Recognition with Audio-Textual Distilled Models
Comments: Accepted at Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[29]  arXiv:2305.19100 (cross-list from eess.AS) [pdf, other]
Title: Predicting Preferred Dialogue-to-Background Loudness Difference in Dialogue-Separated Audio
Comments: Paper accepted at the 15th International Conference on Quality of Multimedia Experience (QoMEX), 4 pages, 2 figures
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[30]  arXiv:2305.19090 (cross-list from eess.AS) [pdf]
Title: Prospective Validation of Motor-Based Intervention with Automated Mispronunciation Detection of Rhotics in Residual Speech Sound Disorders
Comments: To appear in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[31]  arXiv:2305.19051 (cross-list from eess.AS) [pdf, other]
Title: Towards single integrated spoofing-aware speaker verification embeddings
Comments: Accepted by INTERSPEECH 2023. Code and models are available in this https URL
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD)
[32]  arXiv:2305.18975 (cross-list from eess.AS) [pdf, other]
Title: Voice Conversion With Just Nearest Neighbors
Comments: 5 page, 1 table, 2 figures. Accepted at Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[33]  arXiv:2305.18925 (cross-list from eess.AS) [pdf, other]
Title: Investigating model performance in language identification: beyond simple error statistics
Comments: Accepted to Interspeech 2023, 5 pages, 5 figures
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[34]  arXiv:2305.18802 (cross-list from eess.AS) [pdf, other]
Title: LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
Comments: Accepted to Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[35]  arXiv:2305.18753 (cross-list from eess.AS) [pdf, other]
Title: Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Comments: INTERSPEECH 2023. arXiv admin note: substantial text overlap with arXiv:2210.05037
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[36]  arXiv:2305.18551 (cross-list from astro-ph.IM) [pdf]
Title: Multi-Band Acoustic Monitoring of Aerial Signatures
Journal-ref: Journal of Astronomical Instrumentation, 12(1), 2340005 (2023)
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 96 entries: 1-10 | 7-16 | 17-26 | 27-36 | 37-46 | 47-56 | 57-66 | ... | 87-96 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2305, contact, help  (Access key information)