We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for eess.AS in Dec 2020, skipping first 100

[ total of 136 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-136 ]
[ showing 25 entries per page: fewer | more | all ]
[101]  arXiv:2012.07655 (cross-list from cs.CV) [pdf, other]
Title: Deep Neural Networks for COVID-19 Detection and Diagnosis using Images and Acoustic-based Techniques: A Recent Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[102]  arXiv:2012.08095 (cross-list from cs.LG) [pdf, other]
Title: Automatic Speech Verification Spoofing Detection
Subjects: Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[103]  arXiv:2012.08312 (cross-list from cs.LG) [pdf, other]
Title: QUARC: Quaternion Multi-Modal Fusion Architecture For Hate Speech Classification
Comments: Accepted in Proc. of the 4th International Workshop on Dialog Systems (IWDS2021) in conjunction with the IEEE BigComp2021
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[104]  arXiv:2012.09466 (cross-list from cs.CL) [pdf, other]
Title: CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition
Comments: Accepted by ICASSP 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[105]  arXiv:2012.09478 (cross-list from cs.SD) [pdf, other]
Title: The voice of COVID-19: Acoustic correlates of infection
Comments: 8 pages
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[106]  arXiv:2012.09643 (cross-list from cs.SD) [pdf, other]
Title: Automatic source localization and spectra generation from sparse beamforming maps
Comments: Preprint for JASA special issue on machine learning in acoustics, Revision 2
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[107]  arXiv:2012.10018 (cross-list from cs.CL) [pdf, ps, other]
Title: NeurST: Neural Speech Translation Toolkit
Comments: Accepted by ACL 2021 (system demonstration)
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[108]  arXiv:2012.10663 (cross-list from cs.SD) [pdf]
Title: Non-uniform FIR Digital Filter Bank for Hearing Aid Application Using Frequency Response Masking Technique: A Review
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[109]  arXiv:2012.10852 (cross-list from cs.CV) [pdf, other]
Title: Visual Speech Enhancement Without A Real Visual Stream
Comments: 10 pages, 4 figures, Accepted in WACV 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[110]  arXiv:2012.11058 (cross-list from cs.LG) [pdf, other]
Title: A Bayesian methodology for localising acoustic emission sources in complex structures
Comments: 17 pages, 7 figures
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[111]  arXiv:2012.11138 (cross-list from cs.SD) [pdf, other]
Title: Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition
Journal-ref: Artif Life Robotics (2021)
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[112]  arXiv:2012.11159 (cross-list from cs.SD) [pdf, other]
Title: Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
Comments: 12 pages, 11 figures, 8 tables
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[113]  arXiv:2012.11583 (cross-list from cs.CV) [pdf, other]
Title: Semantic Audio-Visual Navigation
Comments: Project page: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[114]  arXiv:2012.11759 (cross-list from cs.SD) [pdf, other]
Title: On the effectiveness of signal decomposition, feature extraction and selection on lung sound classification
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[115]  arXiv:2012.11896 (cross-list from cs.CL) [pdf, other]
Title: Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition
Comments: accepted in AAAI2021
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[116]  arXiv:2012.12311 (cross-list from cs.LG) [pdf]
Title: Video Influencers: Unboxing the Mystique
Comments: 61 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[117]  arXiv:2012.12468 (cross-list from cs.SD) [pdf, other]
Title: CN-Celeb: multi-genre speaker recognition
Comments: submitted to Speech Communication
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[118]  arXiv:2012.12471 (cross-list from cs.SD) [pdf, other]
Title: A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[119]  arXiv:2012.12543 (cross-list from cs.CL) [pdf]
Title: Code Switching Language Model Using Monolingual Training Data
Comments: submitted to ICASSP2021
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[120]  arXiv:2012.12612 (cross-list from cs.SD) [pdf, ps, other]
Title: Incremental Text-to-Speech Synthesis Using Pseudo Lookahead with Large Pretrained Language Model
Comments: Accepted for IEEE Signal Processing Letters
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[121]  arXiv:2012.13004 (cross-list from cs.CL) [pdf, ps, other]
Title: Speech Synthesis as Augmentation for Low-Resource ASR
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[122]  arXiv:2012.13152 (cross-list from cs.LG) [pdf, ps, other]
Title: Unsupervised neural adaptation model based on optimal transport for spoken language identification
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[123]  arXiv:2012.13341 (cross-list from cs.HC) [pdf, other]
Title: AudioViewer: Learning to Visualize Sounds
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[124]  arXiv:2012.13668 (cross-list from cs.LG) [pdf, other]
Title: Deep Learning Framework Applied for Predicting Anomaly of Respiratory Sounds
Comments: 5 pages, 2 figures, 8 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[125]  arXiv:2012.13699 (cross-list from cs.SD) [pdf, ps, other]
Title: Inception-Based Network and Multi-Spectrogram Ensemble Applied For Predicting Respiratory Anomalies and Lung Diseases
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[ total of 136 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-136 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, 2212, contact, help  (Access key information)