We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for cs.CL in Aug 2020, skipping first 350

[ total of 395 entries: 1-25 | ... | 276-300 | 301-325 | 326-350 | 351-375 | 376-395 ]
[ showing 25 entries per page: fewer | more | all ]
[351]  arXiv:2008.00731 (cross-list from eess.AS) [pdf]
Title: Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[352]  arXiv:2008.00768 (cross-list from eess.AS) [pdf, other]
Title: One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
Comments: Accepted to INTERSPEECH 2020; for the source files, see this https URL
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG)
[353]  arXiv:2008.01300 (cross-list from eess.AS) [pdf, other]
Title: Weakly Supervised Construction of ASR Systems with Massive Video Data
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[354]  arXiv:2008.01504 (cross-list from eess.AS) [pdf, other]
Title: "This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II)
Comments: Accepted to Interspeech 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[355]  arXiv:2008.01832 (cross-list from eess.AS) [pdf, other]
Title: Future Vector Enhanced LSTM Language Model for LVCSR
Comments: Accepted by ASRU-2017
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[356]  arXiv:2008.02516 (cross-list from eess.AS) [pdf, other]
Title: FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire
Comments: Accepted by ACM MM 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[357]  arXiv:2008.02603 (cross-list from eess.AS) [pdf, other]
Title: Data balancing for boosting performance of low-frequency classes in Spoken Language Understanding
Comments: accepted at InterSpeech 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[358]  arXiv:2008.02885 (cross-list from physics.soc-ph) [pdf, other]
Title: A general solution to the preferential selection model
Subjects: Physics and Society (physics.soc-ph); Computation and Language (cs.CL); Computers and Society (cs.CY)
[359]  arXiv:2008.03029 (cross-list from eess.AS) [pdf, other]
Title: Peking Opera Synthesis via Duration Informed Attention Network
Comments: Accepted by INTERSPEECH 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[360]  arXiv:2008.03088 (cross-list from eess.AS) [pdf, other]
Title: Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Comments: Preprint. Under review
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[361]  arXiv:2008.03183 (cross-list from eess.AS) [pdf, ps, other]
Title: Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks
Comments: rejected from Interspeech, ComParE Challenge (Mask & Elderly Emotion Sub-Challenges)
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[362]  arXiv:2008.03359 (cross-list from eess.AS) [pdf, other]
Title: A New Approach to Accent Recognition and Conversion for Mandarin Chinese
Comments: 11 pages, 7 figures, and 10 tables
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[363]  arXiv:2008.03403 (cross-list from eess.AS) [pdf, other]
Title: Word Error Rate Estimation Without ASR Output: e-WER2
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[364]  arXiv:2008.03425 (cross-list from eess.AS) [pdf, other]
Title: Deep F-measure Maximization for End-to-End Speech Understanding
Comments: Interspeech 2020 submission (Accepted)
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[365]  arXiv:2008.03687 (cross-list from eess.AS) [pdf, other]
Title: LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition
Journal-ref: KDD 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG)
[366]  arXiv:2008.03802 (cross-list from eess.AS) [pdf, other]
Title: SpeedySpeech: Efficient Neural Speech Synthesis
Comments: 5 pages, 3 figures, Interspeech 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[367]  arXiv:2008.03992 (cross-list from eess.AS) [pdf, other]
Title: VAW-GAN for Singing Voice Conversion with Non-parallel Training Data
Comments: Accepted to APSIPA ASC 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[368]  arXiv:2008.04481 (cross-list from eess.AS) [pdf, other]
Title: Transformer with Bidirectional Decoder for Speech Recognition
Comments: Accepted by InterSpeech 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[369]  arXiv:2008.04527 (cross-list from eess.AS) [pdf, other]
Title: Neural PLDA Modeling for End-to-End Speaker Verification
Comments: Accepted in Interspeech 2020. GitHub Implementation Repos: this https URL and this https URL
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[370]  arXiv:2008.04546 (cross-list from eess.AS) [pdf, other]
Title: Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[371]  arXiv:2008.04562 (cross-list from eess.AS) [pdf, other]
Title: Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Comments: Accepted to APSIPA ASC 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[372]  arXiv:2008.05011 (cross-list from eess.AS) [pdf, other]
Title: Compact Speaker Embedding: lrx-vector
Comments: Accepted to INTERSPEECH 2020
Journal-ref: Proc. Interspeech 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[373]  arXiv:2008.05086 (cross-list from eess.AS) [pdf, other]
Title: Transfer Learning Approaches for Streaming End-to-End Speech Recognition System
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[374]  arXiv:2008.05284 (cross-list from eess.AS) [pdf, other]
Title: Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS
Comments: To appear in IEEE Signal Processing Letters (SPL)
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[375]  arXiv:2008.05514 (cross-list from eess.AS) [pdf, other]
Title: Online Automatic Speech Recognition with Listen, Attend and Spell Model
Comments: 5 pages, 4 figures, this version is submitted to IEEE Signal Processing Letters
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[ total of 395 entries: 1-25 | ... | 276-300 | 301-325 | 326-350 | 351-375 | 376-395 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2212, contact, help  (Access key information)