Sound

Authors and titles for cs.SD in Dec 2019, skipping first 60

[ total of 90 entries: 1-10 | ... | 31-40 | 41-50 | 51-60 | 61-70 | 71-80 | 81-90 ]

[61] arXiv:1912.02613 (cross-list from eess.AS) [pdf, other]: Title: Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders

Authors: Yin-Jyun Luo, Chin-Chen Hsu, Kat Agres, Dorien Herremans

Comments: Accepted to ICASSP 2020

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[62] arXiv:1912.02615 (cross-list from eess.AS) [pdf, other]: Title: Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events

Authors: Wim Boes, Hugo Van hamme

Journal-ref: Proceedings of the 27th ACM International Conference on Multimedia (MM '19). ACM, New York, NY, USA, 1961-1969

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[63] arXiv:1912.03363 (cross-list from eess.AS) [pdf, other]: Title: Audio-attention discriminative language model for ASR rescoring

Authors: Ankur Gandhe, Ariya Rastrow

Comments: 4 pages, 1 figure, Accepted at ICASSP 2020

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[64] arXiv:1912.03627 (cross-list from eess.AS) [pdf, ps, other]: Title: A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database

Authors: Hossein Zeinali, Lukáš Burget, Jan "Honza" Černocký

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[65] arXiv:1912.04067 (cross-list from eess.AS) [pdf, other]: Title: Visualizing Deep Neural Networks for Speech Recognition with Learned Topographic Filter Maps

Authors: Andreas Krug, Sebastian Stober

Comments: Accepted for 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[66] arXiv:1912.04370 (cross-list from eess.AS) [pdf, other]: Title: Cross-Language Aphasia Detection using Optimal Transport Domain Adaptation

Authors: Aparna Balagopalan, Jekaterina Novikova, Matthew B. A. McDermott, Bret Nestor, Tristan Naumann, Marzyeh Ghassemi

Comments: Accepted to ML4H at NeurIPS 2019

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[67] arXiv:1912.04381 (cross-list from eess.AS) [pdf, ps, other]: Title: A Dataset for measuring reading levels in India at scale

Authors: Dolly Agarwal, Jayant Gupchup, Nishant Baghel

Comments: 5 pages, 3 figures, 3 Tables, Paper accepted to ICASSP 2020

Subjects: Audio and Speech Processing (eess.AS); Computers and Society (cs.CY); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[68] arXiv:1912.04979 (cross-list from eess.AS) [pdf, other]: Title: Advances in Online Audio-Visual Meeting Transcription

Authors: Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou

Comments: To appear in Proc. IEEE ASRU Workshop 2019

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Image and Video Processing (eess.IV)
[69] arXiv:1912.05038 (cross-list from eess.AS) [pdf, other]: Title: Cooperative Audio Source Separation and Enhancement Using Distributed Microphone Arrays and Wearable Devices

Authors: Ryan M. Corey, Matthew D. Skarha, Andrew C. Singer

Comments: To appear at CAMSAP 2019

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[70] arXiv:1912.05043 (cross-list from eess.AS) [pdf, other]: Title: Motion-Tolerant Beamforming with Deformable Microphone Arrays

Authors: Ryan M. Corey, Andrew C. Singer

Comments: Presented at WASPAA 2019

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
