Computation and Language

Authors and titles for cs.CL in Aug 2020, skipping first 350

[ total of 395 entries: 1-25 | ... | 276-300 | 301-325 | 326-350 | 351-375 | 376-395 ]

[351] arXiv:2008.00731 (cross-list from eess.AS) [pdf, ps, other]: Title: Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics

Authors: Okko Räsänen, María Andrea Cruz Blandón

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[352] arXiv:2008.00768 (cross-list from eess.AS) [pdf, other]: Title: One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech

Authors: Tomáš Nekvinda, Ondřej Dušek

Comments: Accepted to INTERSPEECH 2020; for the source files, see this https URL

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG)
[353] arXiv:2008.01300 (cross-list from eess.AS) [pdf, other]: Title: Weakly Supervised Construction of ASR Systems with Massive Video Data

Authors: Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[354] arXiv:2008.01504 (cross-list from eess.AS) [pdf, other]: Title: "This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II)

Authors: Arseniy Gorin, Daniil Kulko, Steven Grima, Alex Glasman

Comments: Accepted to Interspeech 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[355] arXiv:2008.01832 (cross-list from eess.AS) [pdf, other]: Title: Future Vector Enhanced LSTM Language Model for LVCSR

Authors: Qi Liu, Yanmin Qian, Kai Yu

Comments: Accepted by ASRU-2017

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[356] arXiv:2008.02516 (cross-list from eess.AS) [pdf, other]: Title: FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire

Authors: Jinglin Liu, Yi Ren, Zhou Zhao, Chen Zhang, Baoxing Huai, Nicholas Jing Yuan

Comments: Accepted by ACM MM 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[357] arXiv:2008.02603 (cross-list from eess.AS) [pdf, other]: Title: Data balancing for boosting performance of low-frequency classes in Spoken Language Understanding

Authors: Judith Gaspers, Quynh Do, Fabian Triefenbach

Comments: accepted at InterSpeech 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[358] arXiv:2008.02885 (cross-list from physics.soc-ph) [pdf, other]: Title: A general solution to the preferential selection model

Authors: Jake Ryland Williams, Diana Solano-Oropeza, Jacob R. Hunsberger

Subjects: Physics and Society (physics.soc-ph); Computation and Language (cs.CL); Computers and Society (cs.CY)
[359] arXiv:2008.03029 (cross-list from eess.AS) [pdf, other]: Title: Peking Opera Synthesis via Duration Informed Attention Network

Authors: Yusong Wu, Shengchen Li, Chengzhu Yu, Heng Lu, Chao Weng, Liqiang Zhang, Dong Yu

Comments: Accepted by INTERSPEECH 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[360] arXiv:2008.03088 (cross-list from eess.AS) [pdf, other]: Title: Pretraining Techniques for Sequence-to-Sequence Voice Conversion

Authors: Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda

Comments: Preprint. Under review

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[361] arXiv:2008.03183 (cross-list from eess.AS) [pdf, ps, other]: Title: Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks

Authors: Gábor Gosztolya, László Tóth

Comments: rejected from Interspeech, ComParE Challenge (Mask & Elderly Emotion Sub-Challenges)

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[362] arXiv:2008.03359 (cross-list from eess.AS) [pdf, other]: Title: A New Approach to Accent Recognition and Conversion for Mandarin Chinese

Authors: Lin Ai, Shih-Ying Jeng, Homayoon Beigi

Comments: 11 pages, 7 figures, and 10 tables

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[363] arXiv:2008.03403 (cross-list from eess.AS) [pdf, other]: Title: Word Error Rate Estimation Without ASR Output: e-WER2

Authors: Ahmed Ali, Steve Renals

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[364] arXiv:2008.03425 (cross-list from eess.AS) [pdf, other]: Title: Deep F-measure Maximization for End-to-End Speech Understanding

Authors: Leda Sarı, Mark Hasegawa-Johnson

Comments: Interspeech 2020 submission (Accepted)

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[365] arXiv:2008.03687 (cross-list from eess.AS) [pdf, other]: Title: LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition

Authors: Jin Xu, Xu Tan, Yi Ren, Tao Qin, Jian Li, Sheng Zhao, Tie-Yan Liu

Journal-ref: KDD 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG)
[366] arXiv:2008.03802 (cross-list from eess.AS) [pdf, other]: Title: SpeedySpeech: Efficient Neural Speech Synthesis

Authors: Jan Vainer, Ondřej Dušek

Comments: 5 pages, 3 figures, Interspeech 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[367] arXiv:2008.03992 (cross-list from eess.AS) [pdf, other]: Title: VAW-GAN for Singing Voice Conversion with Non-parallel Training Data

Authors: Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li

Comments: Accepted to APSIPA ASC 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[368] arXiv:2008.04481 (cross-list from eess.AS) [pdf, other]: Title: Transformer with Bidirectional Decoder for Speech Recognition

Authors: Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin

Comments: Accepted by InterSpeech 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[369] arXiv:2008.04527 (cross-list from eess.AS) [pdf, other]: Title: Neural PLDA Modeling for End-to-End Speaker Verification

Authors: Shreyas Ramoji, Prashant Krishnan, Sriram Ganapathy

Comments: Accepted in Interspeech 2020. GitHub Implementation Repos: this https URL and this https URL

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[370] arXiv:2008.04546 (cross-list from eess.AS) [pdf, other]: Title: Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

Authors: Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[371] arXiv:2008.04562 (cross-list from eess.AS) [pdf, other]: Title: Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN

Authors: Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li

Comments: Accepted to APSIPA ASC 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[372] arXiv:2008.05011 (cross-list from eess.AS) [pdf, other]: Title: Compact Speaker Embedding: lrx-vector

Authors: Munir Georges, Jonathan Huang, Tobias Bocklet

Comments: Accepted to INTERSPEECH 2020

Journal-ref: Proc. Interspeech 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[373] arXiv:2008.05086 (cross-list from eess.AS) [pdf, other]: Title: Transfer Learning Approaches for Streaming End-to-End Speech Recognition System

Authors: Vikas Joshi, Rui Zhao, Rupesh R. Mehta, Kshitiz Kumar, Jinyu Li

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[374] arXiv:2008.05284 (cross-list from eess.AS) [pdf, other]: Title: Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS

Authors: Rui Liu, Berrak Sisman, Feilong Bao, Guanglai Gao, Haizhou Li

Comments: To appear in IEEE Signal Processing Letters (SPL)

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[375] arXiv:2008.05514 (cross-list from eess.AS) [pdf, other]: Title: Online Automatic Speech Recognition with Listen, Attend and Spell Model

Authors: Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal

Comments: 5 pages, 4 figures, this version is submitted to IEEE Signal Processing Letters

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)


