We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Nov 2018, skipping first 25

[ total of 152 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | ... | 151-152 ]
[ showing 25 entries per page: fewer | more | all ]
[26]  arXiv:1811.04133 [pdf, other]
Title: Integrating Recurrence Dynamics for Speech Emotion Recognition
Journal-ref: Proc. Interspeech 2018, pp. 927-931
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[27]  arXiv:1811.04139 [pdf, other]
Title: Audio Spectrogram Factorization for Classification of Telephony Signals below the Auditory Threshold
Comments: 7 pages, 4 figures. Marchex Technical Report on VoIP SPAM classification
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[28]  arXiv:1811.04357 [pdf, other]
Title: PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network
Comments: 8 pages, 6 figures, AAAI 2019 camera-ready version
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[29]  arXiv:1811.04419 [pdf, other]
Title: Multi-Temporal Resolution Convolutional Neural Networks for Acoustic Scene Classification
Comments: In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017), November 2017
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[30]  arXiv:1811.04448 [pdf, ps, other]
Title: A Multi-modal Deep Neural Network approach to Bird-song identification
Comments: LifeCLEF 2017 working notes, Dublin, Ireland
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[31]  arXiv:1811.04568 [pdf, ps, other]
Title: Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[32]  arXiv:1811.05550 [pdf, other]
Title: Neural Wavetable: a playable wavetable synthesizer using neural networks
Comments: 2 pages, Accepted by Conference on Neural Information Processing Systems (NIPS), Workshop on Machine Learning for Creativity and Design
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[33]  arXiv:1811.06016 [pdf, other]
Title: To bee or not to bee: Investigating machine learning approaches for beehive sound recognition
Comments: Presented at Detection and Classification of Acoustic Scenes and Events (DCASE) workshop 2018
Journal-ref: Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018)
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[34]  arXiv:1811.06330 [pdf, other]
Title: Audio-based identification of beehive states
Comments: Accepted for ICASSP 2019
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[35]  arXiv:1811.06633 [pdf, other]
Title: Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands
Comments: 3 pages
Journal-ref: Proceedings of the 6th International Workshop on Musical Metacreation (MUME 2018)
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[36]  arXiv:1811.06639 [pdf, ps, other]
Title: Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles
Comments: 3 pages
Journal-ref: NIPS Workshop on Machine Learning for Creativity and Design (2017)
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[37]  arXiv:1811.06669 [pdf, other]
Title: AclNet: efficient end-to-end audio classification CNN
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Machine Learning (stat.ML)
[38]  arXiv:1811.06713 [pdf, other]
Title: Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization
Comments: 5 pages, 2 figures, audio examples and code available online at this https URL
Journal-ref: IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), Brighton, UK, May 2019, pp. 101-105
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[39]  arXiv:1811.06756 [pdf, other]
Title: Direction of Arrival Estimation of Wide-band Signals with Planar Microphone Arrays
Comments: 10 pages
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[40]  arXiv:1811.07030 [pdf, other]
Title: Exploring Tradeoffs in Models for Low-latency Speech Enhancement
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[41]  arXiv:1811.07072 [pdf]
Title: Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units
Comments: DCASE2018 Workshop. arXiv admin note: text overlap with arXiv:1808.01935
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[42]  arXiv:1811.07082 [pdf, other]
Title: The Intrinsic Memorability of Everyday Sounds
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[43]  arXiv:1811.07426 [pdf, other]
Title: Harmonic Recomposition using Conditional Autoregressive Modeling
Comments: 3 pages, 2 figures. In Proceedings of The Joint Workshop on Machine Learning for Music, ICML 2018
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[44]  arXiv:1811.07435 [pdf, other]
Title: Limitations of Source-Filter Coupling In Phonation
Comments: 2 pages, 2 figures
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[45]  arXiv:1811.08029 [pdf, other]
Title: Sound-Stream II: Towards Real-Time Gesture Controlled Articulatory Sound Synthesis
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[46]  arXiv:1811.08045 [pdf, other]
Title: Coupled Recurrent Models for Polyphonic Music Composition
Comments: 13 pages; long version of the paper appearing in ISMIR 2019
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[47]  arXiv:1811.08111 [pdf, other]
Title: Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision
Comments: 5 pages, 4 figures, 2 tables. Submitted to IEEE ICASSP 2019
Journal-ref: IEEE International Conference on Acoustic, Speech and Signal Processing (2019) 6785-6789
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[48]  arXiv:1811.08380 [pdf, other]
Title: The Effect of Explicit Structure Encoding of Deep Neural Networks for Symbolic Music Generation
Comments: 8 pages, 13 figures
Journal-ref: 2019 International Workshop on Multilayer Music Representation and Processing (MMRP)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[49]  arXiv:1811.08521 [pdf, other]
Title: Differentiable Consistency Constraints for Improved Deep Speech Enhancement
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[50]  arXiv:1811.09010 [pdf]
Title: Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Comments: 5 pages, in submission to ICASSP-2019
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[ total of 152 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | ... | 151-152 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2208, contact, help  (Access key information)