We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Jan 2023

[ total of 104 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 101-104 ]
[ showing 10 entries per page: fewer | more | all ]
[1]  arXiv:2301.00508 [pdf, other]
Title: EmoGator: A New Open Source Vocal Burst Dataset with Baseline Machine Learning Classification Methodologies
Authors: Fred W. Buhl
Comments: 12 pages, 4 tables, 2 figures
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[2]  arXiv:2301.01162 [pdf, other]
Title: Language Models are Drummers: Drum Composition with Natural Language Pre-Training
Comments: Accepted to the 1st workshop on Creative AI across Modalities in AAAI 2023
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[3]  arXiv:2301.01378 [pdf, other]
Title: An ensemble-based framework for mispronunciation detection of Arabic phonemes
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[4]  arXiv:2301.01578 [pdf, other]
Title: Validity in Music Information Research Experiments
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[5]  arXiv:2301.02385 [pdf, other]
Title: Multi-Genre Music Transformer -- Composing Full Length Musical Piece
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[6]  arXiv:2301.02732 [pdf, ps, other]
Title: Multimodal Lyrics-Rhythm Matching
Comments: Accepted by 2022 IEEE International Conference on Big Data (IEEE Big Data 2022)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[7]  arXiv:2301.02884 [pdf, other]
Title: TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching
Comments: 6 pages, 1 figure, 1 table, accepted by HCMIR 2023
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[8]  arXiv:2301.02886 [pdf, other]
Title: Perceptual-Neural-Physical Sound Matching
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[9]  arXiv:2301.03206 [pdf, other]
Title: Introducing Model Inversion Attacks on Automatic Speaker Recognition
Comments: for associated pdf, see this https URL
Journal-ref: Proc. 2nd Symposium on Security and Privacy in Speech Communication, 2022
Subjects: Sound (cs.SD); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[10]  arXiv:2301.03751 [pdf, other]
Title: Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation
Comments: Under review
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 104 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 101-104 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help  (Access key information)