Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

Cámbara, Guillermo; Luque, Jordi; Farrús, Mireia

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1911

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

Authors: Guillermo Cámbara, Jordi Luque, Mireia Farrús

(Submitted on 12 Nov 2019)

Abstract: The use of photoplethysmogram signal (PPG) for heart and sleep monitoring is commonly found nowadays in smartphones and wrist wearables. Besides common usages, it has been proposed and reported that person information can be extracted from PPG for other uses, like biometry tasks. In this work, we explore several end-to-end convolutional neural network architectures for detection of human's characteristics such as gender or person identity. In addition, we evaluate whether speech/non-speech events may be inferred from PPG signal, where speech might translate in fluctuations into the pulse signal. The obtained results are promising and clearly show the potential of fully end-to-end topologies for automatic extraction of meaningful biomarkers, even from a noisy signal sampled by a low-cost PPG sensor. The AUCs for best architectures put forward PPG wave as biological discriminant, reaching $79\%$ and $89.0\%$, respectively for gender and person verification tasks. Furthermore, speech detection experiments reporting AUCs around $69\%$ encourage us for further exploration about the feasibility of PPG for speech processing tasks.

Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
Cite as:	arXiv:1911.04808 [eess.AS]
	(or arXiv:1911.04808v1 [eess.AS] for this version)

Submission history

From: Guillermo Cámbara [view email]
[v1] Tue, 12 Nov 2019 11:58:42 GMT (202kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:1911.04808

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

Submission history