We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Jun 2019, skipping first 100

[ total of 133 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-133 ]
[ showing 25 entries per page: fewer | more | all ]
[101]  arXiv:1906.06909 (cross-list from eess.AS) [pdf, ps, other]
Title: Evaluation of post-processing algorithms for polyphonic sound event detection
Comments: 5 pages, 2 figures, 1 table 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[102]  arXiv:1906.07222 (cross-list from eess.AS) [pdf, ps, other]
Title: DigiVoice: Voice Biomarker Featurization and Analysis Pipeline
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[103]  arXiv:1906.07234 (cross-list from eess.AS) [pdf, other]
Title: Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling
Comments: 5 pages, 3 figures, accepted for publication in INTERSPEECH 2019, Graz, Austria
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[104]  arXiv:1906.07245 (cross-list from eess.AS) [pdf, other]
Title: Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation
Authors: Siyuan Feng, Tan Lee
Comments: 5 pages, 3 figures, accepted for publication in INTERSPEECH 2019, Graz, Austria
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[105]  arXiv:1906.07298 (cross-list from eess.AS) [pdf, ps, other]
Title: Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Image and Video Processing (eess.IV)
[106]  arXiv:1906.07299 (cross-list from eess.AS) [pdf, ps, other]
Title: On combining features for single-channel robust speech recognition in reverberant environments
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[107]  arXiv:1906.07317 (cross-list from eess.AS) [pdf, ps, other]
Title: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Comments: not accepted by INTERSPEECH 2019
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[108]  arXiv:1906.07319 (cross-list from eess.AS) [pdf, other]
Title: Deep Xi as a Front-End for Robust Automatic Speech Recognition
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[109]  arXiv:1906.07414 (cross-list from eess.AS) [pdf, other]
Title: A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation
Comments: 14 pages, 10 figures
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[110]  arXiv:1906.07493 (cross-list from eess.AS) [pdf, other]
Title: Square root-based multi-source early PSD estimation and recursive RETF update in reverberant environments by means of the orthogonal Procrustes problem
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[111]  arXiv:1906.07512 (cross-list from eess.AS) [pdf, other]
Title: Integrated sidelobe cancellation and linear prediction Kalman filter for joint multi-microphone speech dereverberation, interfering speech cancellation, and noise reduction
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[112]  arXiv:1906.07552 (cross-list from eess.AS) [pdf, other]
Title: Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
Comments: 7 pages. Accepted by IJCAI 2019
Journal-ref: International Joint Conference on Artificial Intelligence (IJCAI), 2019, pp. 2747-2753
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[113]  arXiv:1906.07769 (cross-list from eess.AS) [pdf, other]
Title: Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding
Comments: Accepted for publication in INTERSPEECH 2019
Journal-ref: Published in Interspeech 2019
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[114]  arXiv:1906.08041 (cross-list from eess.AS) [pdf, other]
Title: Multi-Stream End-to-End Speech Recognition
Comments: submitted to IEEE TASLP (In review). arXiv admin note: substantial text overlap with arXiv:1811.04897, arXiv:1811.04903
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[115]  arXiv:1906.08043 (cross-list from eess.AS) [pdf, other]
Title: Real to H-space Encoder for Speech Recognition
Comments: Accepted at INTERSPEECH 2019
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[116]  arXiv:1906.08044 (cross-list from eess.AS) [pdf, other]
Title: Robust End-to-End Speaker Verification Using EEG
Comments: Accepted for EUSIPCO 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP); Machine Learning (stat.ML)
[117]  arXiv:1906.08045 (cross-list from eess.AS) [pdf, other]
Title: Speech Recognition With No Speech Or With Noisy Speech Beyond English
Comments: arXiv admin note: text overlap with arXiv:1906.08871
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP); Machine Learning (stat.ML)
[118]  arXiv:1906.08333 (cross-list from eess.AS) [pdf, other]
Title: Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification
Comments: 5 pages, 2 figures, Interspeech 2019
Journal-ref: Proc. of Interspeech 2019, 2019, pp. 4030-4034
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[119]  arXiv:1906.08407 (cross-list from eess.AS) [pdf, other]
Title: Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment
Comments: Accepted to the conference of INTERSPEECH 2019
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[120]  arXiv:1906.08847 (cross-list from eess.AS) [pdf, ps, other]
Title: A Signal Subspace Rotation Method for Localization of Multiple Wideband Sound Sources
Comments: 5 pages, 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[121]  arXiv:1906.08871 (cross-list from eess.AS) [pdf, other]
Title: Advancing Speech Recognition With No Speech Or With Noisy Speech
Comments: Extended version of our accepted IEEE EUSIPCO 2019 paper with additional results for CTC model based recognition. arXiv admin note: substantial text overlap with arXiv:1906.08045, arXiv:1906.08044
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[122]  arXiv:1906.09426 (cross-list from eess.AS) [pdf, other]
Title: End-to-End ASR for Code-switched Hindi-English Speech
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[123]  arXiv:1906.10369 (cross-list from eess.AS) [pdf, other]
Title: Acoustic Modeling for Automatic Lyrics-to-Audio Alignment
Comments: Accepted for publication at Interspeech 2019
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[124]  arXiv:1906.10508 (cross-list from eess.AS) [pdf, other]
Title: Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Comments: Accepted by IEEE/ACM Transactions on Aduio, Speech and Language Processing
Journal-ref: IEEE/ACM Transactions on Audio, Speech and Language Processing vol 28 no 1 (2020) 540-552
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[125]  arXiv:1906.10606 (cross-list from eess.AS) [pdf, other]
Title: DALI: a large Dataset of synchronized Audio, LyrIcs and notes, automatically created using teacher-student machine learning paradigm
Journal-ref: Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR, Paris, France, pp. 431-437, 2018
Subjects: Audio and Speech Processing (eess.AS); Databases (cs.DB); Machine Learning (cs.LG); Sound (cs.SD)
[ total of 133 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-133 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2406, contact, help  (Access key information)