ARMAS: Active Reconstruction of Missing Audio Segments

Pokharel, Sachin; Ali, Muhammad; Cheddad, Zohra; Cheddad, Abbas

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2111

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: ARMAS: Active Reconstruction of Missing Audio Segments

Authors: Sachin Pokharel, Muhammad Ali, Zohra Cheddad, Abbas Cheddad

(Submitted on 21 Nov 2021 (v1), revised 23 Nov 2021 (this version, v2), latest version 18 Jan 2024 (v4))

Abstract: Digital audio signal reconstruction of lost or corrupt segment using deep learning algorithms has been explored intensively in the recent years. Nevertheless, prior traditional methods with linear interpolation, phase coding and tone insertion techniques are still in vogue. However, we found no research work on the reconstruction of audio signals with the fusion of dithering, steganography, and machine learning regressors. Therefore, this paper proposes the combination of steganography, halftoning (dithering), and state-of-the-art shallow (RF- Random Forest and SVR- Support Vector Regression) and deep learning (LSTM- Long Short-Term Memory) methods. The results (including comparison to the SPAIN and Autoregressive methods) are evaluated with four different metrics. The observations from the results show that the proposed solution is effective and can enhance the reconstruction of audio signals performed by the side information (noisy-latent representation) steganography provides. This work may trigger interest in the optimization of this approach and/or in transferring it to different domains (i.e., image reconstruction).

Comments:	5 pages, 3 Tables, ~5 Figures
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
Cite as:	arXiv:2111.10891 [eess.AS]
	(or arXiv:2111.10891v2 [eess.AS] for this version)

Submission history

From: Abbas Cheddad [view email]
[v1] Sun, 21 Nov 2021 20:11:33 GMT (3058kb,D)
[v2] Tue, 23 Nov 2021 07:19:34 GMT (3058kb,D)
[v3] Wed, 13 Jul 2022 10:34:08 GMT (3932kb,D)
[v4] Thu, 18 Jan 2024 22:43:56 GMT (3911kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2111.10891v2

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: ARMAS: Active Reconstruction of Missing Audio Segments

Submission history