We gratefully acknowledge support from
the Simons Foundation and member institutions.

Sound

Authors and titles for cs.SD in Feb 2016

[ total of 32 entries: 1-32 ]
[ showing up to 50 entries per page: fewer | more ]
[1]  arXiv:1602.00739 [pdf, other]
Title: Towards a topological fingerprint of music
Subjects: Sound (cs.SD); Computational Geometry (cs.CG); Algebraic Topology (math.AT)
[2]  arXiv:1602.02656 [pdf, ps, other]
Title: LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices
Comments: 5 pages, 5 figures
Subjects: Sound (cs.SD); Neural and Evolutionary Computing (cs.NE)
[3]  arXiv:1602.05526 [pdf, ps, other]
Title: A High-Quality Speech and Audio Codec With Less Than 10 ms Delay
Comments: 10 pages
Journal-ref: IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, No. 1, pp. 58-67, 2010
Subjects: Sound (cs.SD); Multimedia (cs.MM)
[4]  arXiv:1602.05682 [pdf, ps, other]
Title: Audio Recording Device Identification Based on Deep Learning
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[5]  arXiv:1602.05702 [pdf, other]
Title: EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses
Comments: This paper is published in IEEE Transactions on Biomedical Engineering (2016) and is under copyright. Please cite this paper as: S. Van Eyndhoven, T. Francart, and A. Bertrand, "EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses", IEEE Transactions on Biomedical Engineering, vol. 64, no. 5, pp. 1045-1056, 2017
Journal-ref: IEEE Transactions on Biomedical Engineering, vol. 64, no. 5, pp. 1045-1056, 2017
Subjects: Sound (cs.SD); Systems and Control (eess.SY); Machine Learning (stat.ML)
[6]  arXiv:1602.05900 [pdf, ps, other]
Title: An Iterative Linearised Solution to the Sinusoidal Parameter Estimation Problem
Comments: 23 pages
Journal-ref: Computers and Electrical Engineering (Elsevier), Vol. 36, No. 4, pp. 603-616, 2010
Subjects: Sound (cs.SD)
[7]  arXiv:1602.06582 [pdf, other]
Title: Near-field signal acquisition for smartglasses using two acoustic vector-sensors
Comments: The abstract displayed in the metadata field is slightly modified due to space limitations. Updated document includes a brief appendix providing background on acoustic vector-sensors (AVSs), some more detail in the discussion near-field effects, and other minor changes
Subjects: Sound (cs.SD)
[8]  arXiv:1602.06727 [pdf, other]
Title: Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Generation Error Training
Comments: submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing 2016 (AQ)
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[9]  arXiv:1602.07291 [pdf, other]
Title: The IBM 2016 Speaker Recognition System
Comments: Submitted to Odyssey 2016
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (stat.ML)
[10]  arXiv:1602.07394 [pdf, other]
Title: Improved Accent Classification Combining Phonetic Vowels with Acoustic Features
Authors: Zhenhao Ge
Comments: International Congress on Image and Signal Processing (CISP) 2015
Subjects: Sound (cs.SD); Computation and Language (cs.CL)
[11]  arXiv:1602.07767 [pdf, ps, other]
Title: Breath Activity Detection Algorithm
Subjects: Sound (cs.SD)
[12]  arXiv:1602.08044 [pdf, ps, other]
Title: On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk
Authors: Jean-Marc Valin
Comments: 5 pages
Journal-ref: IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No. 3, pp. 1030-1034, 2007
Subjects: Sound (cs.SD); Systems and Control (eess.SY)
[13]  arXiv:1602.08045 [pdf, other]
Title: PCA/LDA Approach for Text-Independent Speaker Recognition
Comments: Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[14]  arXiv:1602.08128 [pdf, ps, other]
Title: PCA Method for Automated Detection of Mispronounced Words
Comments: SPIE Defense, Security, and Sensing
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG)
[15]  arXiv:1602.08132 [pdf, ps, other]
Title: Adaptive Frequency Cepstral Coefficients for Word Mispronunciation Detection
Comments: 4th International Congress on Image and Signal Processing (CISP) 2011
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:1602.08185 [pdf, ps, other]
Title: Extension spectrale d'un signal de parole de la bande téléphonique à la bande AM
Authors: Jean-Marc Valin
Comments: 61 pages, in French, Master's thesis, University of Sherbrooke, 2001
Subjects: Sound (cs.SD); Multimedia (cs.MM)
[17]  arXiv:1602.08215 [pdf, ps, other]
Title: Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding
Comments: 3 pages
Journal-ref: Proc. IEEE Speech Coding Workshop (SCW), 2000, pp. 130-132
Subjects: Sound (cs.SD)
[18]  arXiv:1602.08507 [pdf, ps, other]
Title: Occupancy Estimation in Smart Buildings using Audio-Processing Techniques
Comments: International Conference on Computing in Civil and Building Engineering (ICCCBE) 2016
Subjects: Sound (cs.SD)
[19]  arXiv:1602.08609 [pdf, ps, other]
Title: A New Robust Frequency Domain Echo Canceller With Closed-Loop Learning Rate Adaptation
Comments: 4 pages, Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2007
Subjects: Sound (cs.SD); Systems and Control (eess.SY)
[20]  arXiv:1602.08633 [pdf, ps, other]
Title: Perceptually-Motivated Nonlinear Channel Decorrelation For Stereo Acoustic Echo Cancellation
Authors: Jean-Marc Valin
Comments: 4 pages
Journal-ref: Proceedings of Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), pp. 188-191, 2008
Subjects: Sound (cs.SD)
[21]  arXiv:1602.08668 [pdf, ps, other]
Title: Speex: A Free Codec For Free Speech
Authors: Jean-Marc Valin
Comments: Presented at linux.conf.au 2006, Dunedin. 8 pages
Subjects: Sound (cs.SD)
[22]  arXiv:1602.02950 (cross-list from cs.LG) [pdf, other]
Title: Spoofing detection under noisy conditions: a preliminary investigation and an initial database
Comments: Submitted to Odyssey: The Speaker and Language Recognition Workshop 2016
Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[23]  arXiv:1602.04845 (cross-list from cs.MM) [pdf, ps, other]
Title: High-Quality, Low-Delay Music Coding in the Opus Codec
Comments: 10 pages, 135th AES Convention. Proceedings of the 135th AES Convention, October 2013
Subjects: Multimedia (cs.MM); Sound (cs.SD)
[24]  arXiv:1602.05311 (cross-list from cs.MM) [pdf, ps, other]
Title: A Full-Bandwidth Audio Codec With Low Complexity And Very Low Delay
Comments: 5 pages, Proceedings of EUSIPCO 2009
Subjects: Multimedia (cs.MM); Sound (cs.SD)
[25]  arXiv:1602.06442 (cross-list from cs.RO) [pdf, ps, other]
Title: Robust Recognition of Simultaneous Speech By a Mobile Robot
Comments: 12 pages
Journal-ref: IEEE Transactions on Robotics, Vol. 23, No. 4, pp. 742-752, 2007
Subjects: Robotics (cs.RO); Sound (cs.SD)
[26]  arXiv:1602.06652 (cross-list from cs.RO) [pdf, ps, other]
Title: Auditory System for a Mobile Robot
Authors: Jean-Marc Valin
Comments: 120 pages, PhD thesis, University of Sherbrooke, 2005
Subjects: Robotics (cs.RO); Sound (cs.SD)
[27]  arXiv:1602.06967 (cross-list from cs.CL) [pdf, ps, other]
Title: Blind score normalization method for PLDA based speaker recognition
Comments: 4 pages, 1 figure, presented at the Interspeech 2015. In Sixteenth Annual Conference of the International Speech Communication Association 2015
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[28]  arXiv:1602.08116 (cross-list from cs.SY) [pdf, ps, other]
Title: Interference-Normalised Least Mean Square Algorithm
Comments: 4 pages
Journal-ref: IEEE Signal Processing Letters, Vol. 14, No 12, pp. 988-991, 2007
Subjects: Systems and Control (eess.SY); Sound (cs.SD)
[29]  arXiv:1602.08139 (cross-list from cs.RO) [pdf, ps, other]
Title: Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering
Comments: 26 pages
Journal-ref: Robotics and Autonomous Systems Journal (Elsevier), Vol. 55, No. 3, pp. 216-228, 2007
Subjects: Robotics (cs.RO); Sound (cs.SD)
[30]  arXiv:1602.08213 (cross-list from cs.RO) [pdf, ps, other]
Title: Robust Sound Source Localization Using a Microphone Array on a Mobile Robot
Comments: 6 pages
Journal-ref: Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1228-1233, 2003
Subjects: Robotics (cs.RO); Sound (cs.SD)
[31]  arXiv:1602.08629 (cross-list from cs.RO) [pdf, ps, other]
Title: Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach
Comments: 6 pages. arXiv admin note: substantial text overlap with arXiv:1602.08139
Journal-ref: Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 1033-1038, 2004
Subjects: Robotics (cs.RO); Sound (cs.SD)
[32]  arXiv:1602.08750 (cross-list from cs.HC) [pdf, other]
Title: Filtering Video Noise as Audio with Motion Detection to Form a Musical Instrument
Authors: Carl Thomé
Comments: Received the 2015 best paper award in the KTH Royal Institute of Technology course "Musical Communication and Music Technology"
Subjects: Human-Computer Interaction (cs.HC); Sound (cs.SD)
[ total of 32 entries: 1-32 ]
[ showing up to 50 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help  (Access key information)