We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for recent submissions

[ total of 18 entries: 1-10 | 11-18 ]
[ showing 10 entries per page: fewer | more | all ]

Mon, 6 Apr 2020

[1]  arXiv:2004.01546 (cross-list from eess.AS) [pdf, other]
Title: Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection
Journal-ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[2]  arXiv:2004.01525 (cross-list from eess.AS) [pdf, ps, other]
Title: Towards democratizing music production with AI-Design of Variational Autoencoder-based Rhythm Generator as a DAW plugin
Authors: Nao Tokui
Comments: 4 pages
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[3]  arXiv:2004.01495 (cross-list from eess.AS) [pdf, other]
Title: Can Machine Learning Be Used to Recognize and Diagnose Coughs?
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[4]  arXiv:2004.01275 (cross-list from eess.AS) [pdf, other]
Title: AI4COVID-19: AI Enabled Preliminary Diagnosis for COVID-19 from Cough Samples via an App
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[5]  arXiv:2004.01221 (cross-list from eess.AS) [pdf, other]
Title: Towards Relevance and Sequence Modeling in Language Recognition
Comments: this https URL Accepted to IEEE Transactions on Audio, Speech and Language Processing
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)

Fri, 3 Apr 2020

[6]  arXiv:2004.01023 (cross-list from cs.MM) [pdf, other]
Title: Multi-Modal Video Forensic Platform for Investigating Post-Terrorist Attack Scenarios
Journal-ref: In Proceedings of the 11th ACM Multimedia Systems Conference (MMSys2020), June 06-11, 2020, Istanbul, Turkey
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[7]  arXiv:2004.00967 (cross-list from eess.AS) [pdf, other]
Title: Full-Sum Decoding for Hybrid HMM based Speech Recognition using LSTM Language Model
Comments: accepted at ICASSP 2020
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[8]  arXiv:2004.00960 (cross-list from eess.AS) [pdf, other]
Title: The RWTH ASR System for TED-LIUM Release 2: Improving Hybrid HMM with SpecAugment
Comments: accepted at ICASSP 2020
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[9]  arXiv:2004.00932 (cross-list from eess.AS) [pdf, other]
Title: iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning
Comments: 5 pages, Submitted to INTERSPEECH 2020
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[10]  arXiv:2004.00910 (cross-list from eess.AS) [pdf, other]
Title: Improving auditory attention decoding performance of linear and non-linear methods using state-space model
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP)
[ total of 18 entries: 1-10 | 11-18 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2004, contact, help  (Access key information)