Sound

Authors and titles for cs.SD in Feb 2016

[ total of 32 entries: 1-32 ]
[ showing up to 50 entries per page: fewer | more ]

[1] arXiv:1602.00739 [pdf, other]: Title: Towards a topological fingerprint of music

Authors: Mattia G. Bergomi, Adriano Baraté, Barbara Di Fabio

Subjects: Sound (cs.SD); Computational Geometry (cs.CG); Algebraic Topology (math.AT)
[2] arXiv:1602.02656 [pdf, ps, other]: Title: LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices

Authors: Marvin Coto-Jiménez, John Goddard-Close

Comments: 5 pages, 5 figures

Subjects: Sound (cs.SD); Neural and Evolutionary Computing (cs.NE)
[3] arXiv:1602.05526 [pdf, ps, other]: Title: A High-Quality Speech and Audio Codec With Less Than 10 ms Delay

Authors: Jean-Marc Valin, Timothy B. Terriberry, Christopher Montgomery, Gregory Maxwell

Comments: 10 pages

Journal-ref: IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, No. 1, pp. 58-67, 2010

Subjects: Sound (cs.SD); Multimedia (cs.MM)
[4] arXiv:1602.05682 [pdf, ps, other]: Title: Audio Recording Device Identification Based on Deep Learning

Authors: Simeng Qi, Zheng Huang, Yan Li, Shaopei Shi

Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[5] arXiv:1602.05702 [pdf, other]: Title: EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses

Authors: Simon Van Eyndhoven, Tom Francart, Alexander Bertrand

Comments: This paper is published in IEEE Transactions on Biomedical Engineering (2016) and is under copyright. Please cite this paper as: S. Van Eyndhoven, T. Francart, and A. Bertrand, "EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses", IEEE Transactions on Biomedical Engineering, vol. 64, no. 5, pp. 1045-1056, 2017

Journal-ref: IEEE Transactions on Biomedical Engineering, vol. 64, no. 5, pp. 1045-1056, 2017

Subjects: Sound (cs.SD); Systems and Control (eess.SY); Machine Learning (stat.ML)
[6] arXiv:1602.05900 [pdf, ps, other]: Title: An Iterative Linearised Solution to the Sinusoidal Parameter Estimation Problem

Authors: Jean-Marc Valin, Daniel V. Smith, Christopher Montgomery, Timothy B. Terriberry

Comments: 23 pages

Journal-ref: Computers and Electrical Engineering (Elsevier), Vol. 36, No. 4, pp. 603-616, 2010

Subjects: Sound (cs.SD)
[7] arXiv:1602.06582 [pdf, other]: Title: Near-field signal acquisition for smartglasses using two acoustic vector-sensors

Authors: Dovid Y. Levin, Emanuël A. P. Habets, Sharon Gannot

Comments: The abstract displayed in the metadata field is slightly modified due to space limitations. Updated document includes a brief appendix providing background on acoustic vector-sensors (AVSs), some more detail in the discussion near-field effects, and other minor changes

Subjects: Sound (cs.SD)
[8] arXiv:1602.06727 [pdf, other]: Title: Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Generation Error Training

Authors: Zhizheng Wu, Simon King

Comments: submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing 2016 (AQ)

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[9] arXiv:1602.07291 [pdf, other]: Title: The IBM 2016 Speaker Recognition System

Authors: Seyed Omid Sadjadi, Sriram Ganapathy, Jason W. Pelecanos

Comments: Submitted to Odyssey 2016

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (stat.ML)
[10] arXiv:1602.07394 [pdf, other]: Title: Improved Accent Classification Combining Phonetic Vowels with Acoustic Features

Authors: Zhenhao Ge

Comments: International Congress on Image and Signal Processing (CISP) 2015

Subjects: Sound (cs.SD); Computation and Language (cs.CL)
[11] arXiv:1602.07767 [pdf, ps, other]: Title: Breath Activity Detection Algorithm

Authors: Eric E. Hamke, Ramiro Jordan, Manel Ramon-Martinez

Subjects: Sound (cs.SD)
[12] arXiv:1602.08044 [pdf, ps, other]: Title: On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk

Authors: Jean-Marc Valin

Comments: 5 pages

Journal-ref: IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No. 3, pp. 1030-1034, 2007

Subjects: Sound (cs.SD); Systems and Control (eess.SY)
[13] arXiv:1602.08045 [pdf, other]: Title: PCA/LDA Approach for Text-Independent Speaker Recognition

Authors: Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

Comments: Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series

Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[14] arXiv:1602.08128 [pdf, ps, other]: Title: PCA Method for Automated Detection of Mispronounced Words

Authors: Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

Comments: SPIE Defense, Security, and Sensing

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG)
[15] arXiv:1602.08132 [pdf, ps, other]: Title: Adaptive Frequency Cepstral Coefficients for Word Mispronunciation Detection

Authors: Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

Comments: 4th International Congress on Image and Signal Processing (CISP) 2011

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:1602.08185 [pdf, ps, other]: Title: Extension spectrale d'un signal de parole de la bande téléphonique à la bande AM

Authors: Jean-Marc Valin

Comments: 61 pages, in French, Master's thesis, University of Sherbrooke, 2001

Subjects: Sound (cs.SD); Multimedia (cs.MM)
[17] arXiv:1602.08215 [pdf, ps, other]: Title: Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding

Authors: Jean-Marc Valin, Roch Lefebvre

Comments: 3 pages

Journal-ref: Proc. IEEE Speech Coding Workshop (SCW), 2000, pp. 130-132

Subjects: Sound (cs.SD)
[18] arXiv:1602.08507 [pdf, ps, other]: Title: Occupancy Estimation in Smart Buildings using Audio-Processing Techniques

Authors: Qian Huang, Zhenhao Ge, Chao Lu

Comments: International Conference on Computing in Civil and Building Engineering (ICCCBE) 2016

Subjects: Sound (cs.SD)
[19] arXiv:1602.08609 [pdf, ps, other]: Title: A New Robust Frequency Domain Echo Canceller With Closed-Loop Learning Rate Adaptation

Authors: Jean-Marc Valin, Iain B. Collings

Comments: 4 pages, Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2007

Subjects: Sound (cs.SD); Systems and Control (eess.SY)
[20] arXiv:1602.08633 [pdf, ps, other]: Title: Perceptually-Motivated Nonlinear Channel Decorrelation For Stereo Acoustic Echo Cancellation

Authors: Jean-Marc Valin

Comments: 4 pages

Journal-ref: Proceedings of Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), pp. 188-191, 2008

Subjects: Sound (cs.SD)
[21] arXiv:1602.08668 [pdf, ps, other]: Title: Speex: A Free Codec For Free Speech

Authors: Jean-Marc Valin

Comments: Presented at linux.conf.au 2006, Dunedin. 8 pages

Subjects: Sound (cs.SD)
[22] arXiv:1602.02950 (cross-list from cs.LG) [pdf, other]: Title: Spoofing detection under noisy conditions: a preliminary investigation and an initial database

Authors: Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li

Comments: Submitted to Odyssey: The Speaker and Language Recognition Workshop 2016

Subjects: Machine Learning (cs.LG); Sound (cs.SD)
[23] arXiv:1602.04845 (cross-list from cs.MM) [pdf, ps, other]: Title: High-Quality, Low-Delay Music Coding in the Opus Codec

Authors: Jean-Marc Valin, Gregory Maxwell, Timothy B. Terriberry, Koen Vos

Comments: 10 pages, 135th AES Convention. Proceedings of the 135th AES Convention, October 2013

Subjects: Multimedia (cs.MM); Sound (cs.SD)
[24] arXiv:1602.05311 (cross-list from cs.MM) [pdf, ps, other]: Title: A Full-Bandwidth Audio Codec With Low Complexity And Very Low Delay

Authors: Jean-Marc Valin, Timothy B. Terriberry, Gregory Maxwell

Comments: 5 pages, Proceedings of EUSIPCO 2009

Subjects: Multimedia (cs.MM); Sound (cs.SD)
[25] arXiv:1602.06442 (cross-list from cs.RO) [pdf, ps, other]: Title: Robust Recognition of Simultaneous Speech By a Mobile Robot

Authors: Jean-Marc Valin, Shun'ichi Yamamoto, Jean Rouat, Francois Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno

Comments: 12 pages

Journal-ref: IEEE Transactions on Robotics, Vol. 23, No. 4, pp. 742-752, 2007

Subjects: Robotics (cs.RO); Sound (cs.SD)
[26] arXiv:1602.06652 (cross-list from cs.RO) [pdf, ps, other]: Title: Auditory System for a Mobile Robot

Authors: Jean-Marc Valin

Comments: 120 pages, PhD thesis, University of Sherbrooke, 2005

Subjects: Robotics (cs.RO); Sound (cs.SD)
[27] arXiv:1602.06967 (cross-list from cs.CL) [pdf, ps, other]: Title: Blind score normalization method for PLDA based speaker recognition

Authors: Danila Doroshin, Nikolay Lubimov, Marina Nastasenko, Mikhail Kotov

Comments: 4 pages, 1 figure, presented at the Interspeech 2015. In Sixteenth Annual Conference of the International Speech Communication Association 2015

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[28] arXiv:1602.08116 (cross-list from cs.SY) [pdf, ps, other]: Title: Interference-Normalised Least Mean Square Algorithm

Authors: Jean-Marc Valin, Iain B. Collings

Comments: 4 pages

Journal-ref: IEEE Signal Processing Letters, Vol. 14, No 12, pp. 988-991, 2007

Subjects: Systems and Control (eess.SY); Sound (cs.SD)
[29] arXiv:1602.08139 (cross-list from cs.RO) [pdf, ps, other]: Title: Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering

Authors: Jean-Marc Valin, François Michaud, Jean Rouat

Comments: 26 pages

Journal-ref: Robotics and Autonomous Systems Journal (Elsevier), Vol. 55, No. 3, pp. 216-228, 2007

Subjects: Robotics (cs.RO); Sound (cs.SD)
[30] arXiv:1602.08213 (cross-list from cs.RO) [pdf, ps, other]: Title: Robust Sound Source Localization Using a Microphone Array on a Mobile Robot

Authors: Jean-Marc Valin, François Michaud, Jean Rouat, Dominic Létourneau

Comments: 6 pages

Journal-ref: Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1228-1233, 2003

Subjects: Robotics (cs.RO); Sound (cs.SD)
[31] arXiv:1602.08629 (cross-list from cs.RO) [pdf, ps, other]: Title: Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach

Authors: Jean-Marc Valin, François Michaud, Brahim Hadjou, Jean Rouat

Comments: 6 pages. arXiv admin note: substantial text overlap with arXiv:1602.08139

Journal-ref: Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 1033-1038, 2004

Subjects: Robotics (cs.RO); Sound (cs.SD)
[32] arXiv:1602.08750 (cross-list from cs.HC) [pdf, other]: Title: Filtering Video Noise as Audio with Motion Detection to Form a Musical Instrument

Authors: Carl Thomé

Comments: Received the 2015 best paper award in the KTH Royal Institute of Technology course "Musical Communication and Music Technology"

Subjects: Human-Computer Interaction (cs.HC); Sound (cs.SD)

[ total of 32 entries: 1-32 ]
[ showing up to 50 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2404, contact, help (Access key information)

> cs > cs.SD

Sound

Authors and titles for cs.SD in Feb 2016