We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Nov 2021, skipping first 50

[ total of 197 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | ... | 176-197 ]
[ showing 25 entries per page: fewer | more | all ]
[51]  arXiv:2111.09014 [pdf]
Title: Subject Enveloped Deep Sample Fuzzy Ensemble Learning Algorithm of Parkinson's Speech Data
Comments: 18 pages, 4 figures
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[52]  arXiv:2111.09052 [pdf, other]
Title: High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Comments: Proceedings of INTERSPEECH 2020
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[53]  arXiv:2111.09075 [pdf, ps, other]
Title: Cross-lingual Low Resource Speaker Adaptation Using Phonological Features
Comments: Proceedings of INTERSPEECH 2021
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[54]  arXiv:2111.09146 [pdf, other]
Title: Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Comments: Proceedings of 11th ISCA Speech Synthesis Workshop (SSW 11)
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[55]  arXiv:2111.09642 [pdf, other]
Title: Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Comments: 6 pages, 4 figures
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[56]  arXiv:2111.09931 [pdf, other]
Title: DawDreamer: Bridging the Gap Between Digital Audio Workstations and Python Interfaces
Authors: David Braun
Comments: 3 pages with 0 figures. Included in the Late-Breaking Demo Session of the 22nd International Society for Music Information Retrieval Conference
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[57]  arXiv:2111.10003 [pdf, other]
Title: Differentiable Wavetable Synthesis
Comments: Accepted by ICASSP 2022, Demo: this https URL
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[58]  arXiv:2111.10168 [pdf, other]
Title: Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Comments: Proceedings of SPECOM 2021
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[59]  arXiv:2111.10173 [pdf, other]
Title: Word-Level Style Control for Expressive, Non-attentive Speech Synthesis
Comments: Proceedings of SPECOM 2021
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[60]  arXiv:2111.10177 [pdf, other]
Title: Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
Comments: Proceedings of ICASSP 2021
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[61]  arXiv:2111.10235 [pdf, other]
Title: Interpreting deep urban sound classification using Layer-wise Relevance Propagation
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[62]  arXiv:2111.10592 [pdf, other]
Title: Deep Spoken Keyword Spotting: An Overview
Subjects: Sound (cs.SD); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[63]  arXiv:2111.10639 [pdf, other]
Title: Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection
Comments: Submitted to INTERSPEECH 2022
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[64]  arXiv:2111.10783 [pdf]
Title: Automatic Detection of Depression from Stratified Samples of Audio Data
Comments: 30 pages, 6 figures
Subjects: Sound (cs.SD); Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS)
[65]  arXiv:2111.10897 [pdf, other]
Title: Health Monitoring of Industrial machines using Scene-Aware Threshold Selection
Comments: 5 pages, 4 figures, 1 Table
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[66]  arXiv:2111.11023 [pdf, ps, other]
Title: Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[67]  arXiv:2111.11063 [pdf, other]
Title: Comparing the Accuracy of Deep Neural Networks (DNN) and Convolutional Neural Network (CNN) in Music Genre Recognition (MGR): Experiments on Kurdish Music
Comments: 8 pages, 5 figures, 3 tables
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[68]  arXiv:2111.11636 [pdf]
Title: Music Classification: Beyond Supervised Learning, Towards Real-world Applications
Comments: This is a web book written for a tutorial session of the 22nd International Society for Music Information Retrieval Conference, Nov 8-12, 2021. Please visit this https URL for the original, web book format
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Audio and Speech Processing (eess.AS)
[69]  arXiv:2111.11737 [pdf]
Title: ADTOF: A large dataset of non-synthetic music for automatic drum transcription
Comments: Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR, Online, pp. 818-824
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[70]  arXiv:2111.11755 [pdf, other]
Title: Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
Comments: 15 pages, 5 figures, ICML'2022
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[71]  arXiv:2111.11773 [pdf, other]
Title: Upsampling layers for music source separation
Comments: Demo page: this http URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[72]  arXiv:2111.11859 [pdf]
Title: Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
Journal-ref: Frontiers in Computer Science, 08 April 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Quantitative Methods (q-bio.QM)
[73]  arXiv:2111.12124 [pdf, ps, other]
Title: Towards Learning Universal Audio Representations
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[74]  arXiv:2111.12324 [pdf, other]
Title: How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[75]  arXiv:2111.12326 [pdf, other]
Title: A Study on Decoupled Probabilistic Linear Discriminant Analysis
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 197 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | ... | 176-197 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2206, contact, help  (Access key information)