We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Apr 2021, skipping first 75

[ total of 229 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | 151-175 | ... | 226-229 ]
[ showing 25 entries per page: fewer | more | all ]
[76]  arXiv:2104.11395 [pdf]
Title: Infant Vocal Tract Development Analysis and Diagnosis by Cry Signals with CNN Age Classification
Authors: Chunyan Ji, Yi Pan
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[77]  arXiv:2104.11532 [pdf, ps, other]
Title: 3D Convolutional Neural Networks for Ultrasound-Based Silent Speech Interfaces
Comments: 10 pages, 2 tables , 3 figures
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[78]  arXiv:2104.11587 [pdf, other]
Title: ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
Comments: submitted IJCNN 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[79]  arXiv:2104.11598 [pdf, ps, other]
Title: Reconstructing Speech from Real-Time Articulatory MRI Using Neural Vocoders
Comments: 6 pages. 4 tables, 3 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[80]  arXiv:2104.11601 [pdf, ps, other]
Title: Improving Neural Silent Speech Interface Models by Adversarial Training
Comments: 11 pages, 3 tables, 2 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[81]  arXiv:2104.11629 [pdf, other]
Title: DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing from Decentralised Data
Authors: Shahin Amiriparian (1), Tobias Hübner (1), Maurice Gerczuk (1), Sandra Ottl (1), Björn W. Schuller (1,2) ((1) EIHW -- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany, (2) GLAM -- Group on Language, Audio, and Music, Imperial College London, UK)
Comments: 5 pages, 1 figure
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[82]  arXiv:2104.11673 [pdf, other]
Title: Deep Learning Based Assessment of Synthetic Speech Naturalness
Comments: Late upload, presented at Interspeech 2020
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[83]  arXiv:2104.11710 [pdf, other]
Title: Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation
Comments: Accepted to ICNLSP 2021
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[84]  arXiv:2104.11880 [pdf, other]
Title: Music Embedding: A Tool for Incorporating Music Theory into Computational Music Applications
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[85]  arXiv:2104.11984 [pdf, other]
Title: MusCaps: Generating Captions for Music Audio
Comments: Accepted to IJCNN 2021 for the Special Session on Representation Learning for Audio, Speech, and Music Processing
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[86]  arXiv:2104.12159 [pdf, other]
Title: An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[87]  arXiv:2104.12292 [pdf, other]
Title: Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis
Comments: In the proceedings of ISCA Speech Synthesis Workshop 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[88]  arXiv:2104.12359 [pdf, other]
Title: Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain
Comments: 5 pages, 3 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[89]  arXiv:2104.12432 [pdf, ps, other]
Title: Generation of musical patterns through operads
Authors: Samuele Giraudo
Comments: 10 pages
Journal-ref: Journ\'ees d'informatique musicale, 2020
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Combinatorics (math.CO)
[90]  arXiv:2104.12462 [pdf, other]
Title: Points2Sound: From mono to binaural audio using 3D point cloud scenes
Comments: Code, data, and listening examples: this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[91]  arXiv:2104.12693 [pdf, other]
Title: Identifying Actions for Sound Event Classification
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[92]  arXiv:2104.12807 [pdf, other]
Title: Multimodal Self-Supervised Learning of General Audio Representations
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[93]  arXiv:2104.12922 [pdf, other]
Title: One Billion Audio Sounds from GPU-enabled Modular Synthesis
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[94]  arXiv:2104.13002 [pdf, other]
Title: DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement
Comments: Accepted by ICASSP 2022
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[95]  arXiv:2104.13040 [pdf, ps, other]
Title: The music box operad: Random generation of musical phrases from patterns
Authors: Samuele Giraudo
Comments: 31 pages. Extended version of arXiv:2104.12432
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Combinatorics (math.CO); Quantum Algebra (math.QA)
[96]  arXiv:2104.13056 [pdf, other]
Title: Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework
Comments: Accepted for the International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, 18-22 July 2021 (virtual)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[97]  arXiv:2104.13266 [pdf, other]
Title: Batebit Controller: Popularizing Digital Musical Instruments Development Process
Comments: 2 pages, 2 figures, 17th Brazilian Symposium on Computer Music
Subjects: Sound (cs.SD); Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS)
[98]  arXiv:2104.13276 [pdf, other]
Title: MULTIMODAL ANALYSIS: Informed content estimation and audio source separation
Comments: Ph.D. dissertation. Thesis supervisor: Geoffroy Peeters. Jury:Laurent Girin, Ga\"el Richard, Rachel Bittner, Elena Cabrio, Bruno Gas, Perfecto Herrera Boyer, Antoine Liutkus
Subjects: Sound (cs.SD); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[99]  arXiv:2104.14067 [pdf]
Title: Improving Fairness in Speaker Recognition
Comments: Accepted at the 2020 European Symposium on Software Engineering (ESSE 2020)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[100]  arXiv:2104.14297 [pdf, other]
Title: End-to-End Speech Recognition from Federated Acoustic Models
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[ total of 229 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | 151-175 | ... | 226-229 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2209, contact, help  (Access key information)