Sound

Authors and titles for cs.SD in Jun 2019, skipping first 100

[ total of 133 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-133 ]
[ showing 25 entries per page: fewer | more | all ]

[101] arXiv:1906.06909 (cross-list from eess.AS) [pdf, ps, other]: Title: Evaluation of post-processing algorithms for polyphonic sound event detection

Authors: Leo Cances, Patrice Guyot, Thomas Pellegrini

Comments: 5 pages, 2 figures, 1 table 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[102] arXiv:1906.07222 (cross-list from eess.AS) [pdf, ps, other]: Title: DigiVoice: Voice Biomarker Featurization and Analysis Pipeline

Authors: Larry Zhang, Xiaotong Chen, Abbad Vakil, Ali Byott, Reza Hosseini Ghomi

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[103] arXiv:1906.07234 (cross-list from eess.AS) [pdf, other]: Title: Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling

Authors: Siyuan Feng, Tan Lee, Zhiyuan Peng

Comments: 5 pages, 3 figures, accepted for publication in INTERSPEECH 2019, Graz, Austria

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[104] arXiv:1906.07245 (cross-list from eess.AS) [pdf, other]: Title: Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation

Authors: Siyuan Feng, Tan Lee

Comments: 5 pages, 3 figures, accepted for publication in INTERSPEECH 2019, Graz, Austria

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[105] arXiv:1906.07298 (cross-list from eess.AS) [pdf, ps, other]: Title: Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction

Authors: José Novoa, Rodrigo Mahu, Alejandro Díaz, Jorge Wuth, Richard Stern, Nestor Becerra Yoma

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Image and Video Processing (eess.IV)
[106] arXiv:1906.07299 (cross-list from eess.AS) [pdf, ps, other]: Title: On combining features for single-channel robust speech recognition in reverberant environments

Authors: José Novoa, Josué Fredes, Jorge Wuth, Fernando Huenupán, Richard M. Stern, Nestor Becerra Yoma

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[107] arXiv:1906.07317 (cross-list from eess.AS) [pdf, ps, other]: Title: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Authors: Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu

Comments: not accepted by INTERSPEECH 2019

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[108] arXiv:1906.07319 (cross-list from eess.AS) [pdf, other]: Title: Deep Xi as a Front-End for Robust Automatic Speech Recognition

Authors: Aaron Nicolson, Kuldip K. Paliwal

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[109] arXiv:1906.07414 (cross-list from eess.AS) [pdf, other]: Title: A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation

Authors: Hieu-Thi Luong, Junichi Yamagishi

Comments: 14 pages, 10 figures

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[110] arXiv:1906.07493 (cross-list from eess.AS) [pdf, other]: Title: Square root-based multi-source early PSD estimation and recursive RETF update in reverberant environments by means of the orthogonal Procrustes problem

Authors: T. Dietzen, S. Doclo, M. Moonen, T. van Waterschoot

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[111] arXiv:1906.07512 (cross-list from eess.AS) [pdf, other]: Title: Integrated sidelobe cancellation and linear prediction Kalman filter for joint multi-microphone speech dereverberation, interfering speech cancellation, and noise reduction

Authors: T. Dietzen, S. Doclo, M. Moonen, T. van Waterschoot

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[112] arXiv:1906.07552 (cross-list from eess.AS) [pdf, other]: Title: Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks

Authors: Qiuqiang Kong, Yong Xu, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley

Comments: 7 pages. Accepted by IJCAI 2019

Journal-ref: International Joint Conference on Artificial Intelligence (IJCAI), 2019, pp. 2747-2753

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[113] arXiv:1906.07769 (cross-list from eess.AS) [pdf, other]: Title: Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding

Authors: Kai Zhen, Jongmo Sung, Mi Suk Lee, Seungkwon Beack, Minje Kim

Comments: Accepted for publication in INTERSPEECH 2019

Journal-ref: Published in Interspeech 2019

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[114] arXiv:1906.08041 (cross-list from eess.AS) [pdf, other]: Title: Multi-Stream End-to-End Speech Recognition

Authors: Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky

Comments: submitted to IEEE TASLP (In review). arXiv admin note: substantial text overlap with arXiv:1811.04897, arXiv:1811.04903

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[115] arXiv:1906.08043 (cross-list from eess.AS) [pdf, other]: Title: Real to H-space Encoder for Speech Recognition

Authors: Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato De Mori

Comments: Accepted at INTERSPEECH 2019

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[116] arXiv:1906.08044 (cross-list from eess.AS) [pdf, other]: Title: Robust End-to-End Speaker Verification Using EEG

Authors: Yan Han, Gautam Krishna, Co Tran, Mason Carnahan, Ahmed H Tewfik

Comments: Accepted for EUSIPCO 2020

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP); Machine Learning (stat.ML)
[117] arXiv:1906.08045 (cross-list from eess.AS) [pdf, other]: Title: Speech Recognition With No Speech Or With Noisy Speech Beyond English

Authors: Gautam Krishna, Co Tran, Yan Han, Mason Carnahan, Ahmed H Tewfik

Comments: arXiv admin note: text overlap with arXiv:1906.08871

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP); Machine Learning (stat.ML)
[118] arXiv:1906.08333 (cross-list from eess.AS) [pdf, other]: Title: Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification

Authors: Youngmoon Jung, Younggwan Kim, Hyungjun Lim, Yeunju Choi, Hoirin Kim

Comments: 5 pages, 2 figures, Interspeech 2019

Journal-ref: Proc. of Interspeech 2019, 2019, pp. 4030-4034

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[119] arXiv:1906.08407 (cross-list from eess.AS) [pdf, other]: Title: Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment

Authors: Min-Jae Hwang, Hong-Goo Kang

Comments: Accepted to the conference of INTERSPEECH 2019

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[120] arXiv:1906.08847 (cross-list from eess.AS) [pdf, ps, other]: Title: A Signal Subspace Rotation Method for Localization of Multiple Wideband Sound Sources

Authors: Kainan Chen, Wenyu Jin, Bharadwaj Desikan

Comments: 5 pages, 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[121] arXiv:1906.08871 (cross-list from eess.AS) [pdf, other]: Title: Advancing Speech Recognition With No Speech Or With Noisy Speech

Authors: Gautam Krishna, Co Tran, Mason Carnahan, Ahmed H Tewfik

Comments: Extended version of our accepted IEEE EUSIPCO 2019 paper with additional results for CTC model based recognition. arXiv admin note: substantial text overlap with arXiv:1906.08045, arXiv:1906.08044

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[122] arXiv:1906.09426 (cross-list from eess.AS) [pdf, other]: Title: End-to-End ASR for Code-switched Hindi-English Speech

Authors: Brij Mohan Lal Srivastava, Basil Abraham, Sunayana Sitaram, Rupesh Mehta, Preethi Jyothi

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[123] arXiv:1906.10369 (cross-list from eess.AS) [pdf, other]: Title: Acoustic Modeling for Automatic Lyrics-to-Audio Alignment

Authors: Chitralekha Gupta, Emre Yılmaz, Haizhou Li

Comments: Accepted for publication at Interspeech 2019

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[124] arXiv:1906.10508 (cross-list from eess.AS) [pdf, other]: Title: Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations

Authors: Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai

Comments: Accepted by IEEE/ACM Transactions on Aduio, Speech and Language Processing

Journal-ref: IEEE/ACM Transactions on Audio, Speech and Language Processing vol 28 no 1 (2020) 540-552

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[125] arXiv:1906.10606 (cross-list from eess.AS) [pdf, other]: Title: DALI: a large Dataset of synchronized Audio, LyrIcs and notes, automatically created using teacher-student machine learning paradigm

Authors: Gabriel Meseguer-Brocal, Alice Cohen-Hadria, Geoffroy Peeters

Journal-ref: Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR, Paris, France, pp. 431-437, 2018

Subjects: Audio and Speech Processing (eess.AS); Databases (cs.DB); Machine Learning (cs.LG); Sound (cs.SD)

[ total of 133 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-133 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2406, contact, help (Access key information)

> cs > cs.SD

Sound

Authors and titles for cs.SD in Jun 2019, skipping first 100