Audio and Speech Processing

Authors and titles for eess.AS in Apr 2019, skipping first 100

[ total of 167 entries; showing entries 101-125 ]
[101]  arXiv:1904.05259 (cross-list from cs.SD) [pdf, other]
Title: Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces
Comments: 8 pages, 6 figures, Accepted to IJCNN 2019
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[102]  arXiv:1904.05576 (cross-list from cs.SD) [pdf, other]
Title: STC Antispoofing Systems for the ASVspoof2019 Challenge
Comments: Submitted to Interspeech 2019, Graz, Austria
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[103]  arXiv:1904.05635 (cross-list from cs.SD) [src]
Title: Cross-task learning for audio tagging, sound event detection spatial localization: DCASE 2019 baseline systems
Comments: We wanted to replace an existing submission but created this one by mistake. See arXiv:1904.03476 instead
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[104]  arXiv:1904.05734 (cross-list from cs.CR) [pdf, other]
Title: Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems
Journal-ref: The Network and Distributed System Security Symposium (NDSS) 2019
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[105]  arXiv:1904.05742 (cross-list from cs.LG) [pdf, other]
Title: One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization
Comments: Interspeech 2019
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[106]  arXiv:1904.05746 (cross-list from cs.LG) [pdf, other]
Title: SPEAK YOUR MIND! Towards Imagined Speech Recognition With Hierarchical Deep Learning
Comments: Under review in INTERSPEECH 2019. arXiv admin note: text overlap with arXiv:1904.04358
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[107]  arXiv:1904.05876 (cross-list from cs.CV) [pdf, other]
Title: A Simple Baseline for Audio-Visual Scene-Aware Dialog
Comments: Accepted to CVPR 2019
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[108]  arXiv:1904.05979 (cross-list from cs.CV) [pdf, other]
Title: The Sound of Motions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[109]  arXiv:1904.06037 (cross-list from cs.CL) [pdf, other]
Title: Direct speech-to-speech translation with a sequence-to-sequence model
Comments: Accepted to Interspeech 2019
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[110]  arXiv:1904.06063 (cross-list from cs.CL) [pdf, other]
Title: Building a mixed-lingual neural TTS system with only monolingual data
Comments: To appear in INTERSPEECH 2019
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[111]  arXiv:1904.06075 (cross-list from cs.SD) [pdf, ps, other]
Title: RNN-based speech synthesis using a continuous sinusoidal model
Comments: 8 pages, 4 figures, Accepted to IJCNN 2019
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[112]  arXiv:1904.06083 (cross-list from cs.SD) [pdf, other]
Title: DNN-based Acoustic-to-Articulatory Inversion using Ultrasound Tongue Imaging
Comments: 8 pages, 5 figures, Accepted to IJCNN 2019
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Tissues and Organs (q-bio.TO)
[113]  arXiv:1904.06093 (cross-list from cs.SD) [pdf, other]
Title: STC Speaker Recognition Systems for the VOiCES From a Distance Challenge
Comments: Submitted to Interspeech 2019, Graz, Austria
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[114]  arXiv:1904.06215 (cross-list from cs.SD) [pdf, other]
Title: Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders
Comments: This article has been accepted for presentation at the 22nd International Conference on Digital Audio Effects (DAFx 2019); we provide additional content on this companion repository: this https URL
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[115]  arXiv:1904.06508 (cross-list from cs.CL) [pdf, other]
Title: End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning
Comments: Accepted to Interspeech 2019
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[116]  arXiv:1904.06590 (cross-list from cs.LG) [pdf, other]
Title: Unsupervised Singing Voice Conversion
Comments: Accepted to Interspeech 2019
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[117]  arXiv:1904.06591 (cross-list from cs.CR) [pdf, ps, other]
Title: Towards Vulnerability Analysis of Voice-Driven Interfaces and Countermeasures for Replay
Comments: 6 pages, IEEE 2nd International Conference on Multimedia Information Processing and Retrieval (IEEE MIPR 2019), March 28-30, 2019, San Jose, CA, USA
Subjects: Cryptography and Security (cs.CR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[118]  arXiv:1904.06851 (cross-list from cs.SD) [pdf, ps, other]
Title: Proximal binaural sound can induce subjective frisson
Comments: 21 pages, 3 figures, 3 tables, 3 supplemental figures, 3 supplemental tables
Journal-ref: Front Psychol. 2020 Mar 3;11:316
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[119]  arXiv:1904.07078 (cross-list from cs.CL) [pdf, other]
Title: Semantic query-by-example speech search using visual grounding
Comments: Accepted to ICASSP 2019
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[120]  arXiv:1904.07154 (cross-list from cs.LG) [pdf, other]
Title: Are Nearby Neighbors Relatives?: Testing Deep Music Embeddings
Comments: This work was accepted for publication in "Frontiers in Applied Mathematics and Statistics (Deep Learning: Status, Applications and Algorithms)"
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[121]  arXiv:1904.07556 (cross-list from cs.CL) [pdf, other]
Title: Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks
Comments: Interspeech 2019
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[122]  arXiv:1904.07612 (cross-list from cs.SD) [pdf, other]
Title: Speech Denoising by Accumulating Per-Frequency Modeling Fluctuations
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[123]  arXiv:1904.07750 (cross-list from cs.CV) [pdf, other]
Title: Co-Separating Sounds of Visual Objects
Comments: ICCV 2019, Project page: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[124]  arXiv:1904.07845 (cross-list from cs.SD) [pdf, other]
Title: Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Comments: Submitted to Interspeech 2019
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[125]  arXiv:1904.07933 (cross-list from cs.CV) [pdf, other]
Title: Audio-Visual Model Distillation Using Acoustic Images
Comments: Accepted at WACV 2020; supplementary material at page 11; code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)