Current browse context:
eess.AS
Change to browse by:
References & Citations
Electrical Engineering and Systems Science > Audio and Speech Processing
Title: ARMAS: Active Reconstruction of Missing Audio Segments
(Submitted on 21 Nov 2021 (v1), revised 23 Nov 2021 (this version, v2), latest version 18 Jan 2024 (v4))
Abstract: Digital audio signal reconstruction of lost or corrupt segment using deep learning algorithms has been explored intensively in the recent years. Nevertheless, prior traditional methods with linear interpolation, phase coding and tone insertion techniques are still in vogue. However, we found no research work on the reconstruction of audio signals with the fusion of dithering, steganography, and machine learning regressors. Therefore, this paper proposes the combination of steganography, halftoning (dithering), and state-of-the-art shallow (RF- Random Forest and SVR- Support Vector Regression) and deep learning (LSTM- Long Short-Term Memory) methods. The results (including comparison to the SPAIN and Autoregressive methods) are evaluated with four different metrics. The observations from the results show that the proposed solution is effective and can enhance the reconstruction of audio signals performed by the side information (noisy-latent representation) steganography provides. This work may trigger interest in the optimization of this approach and/or in transferring it to different domains (i.e., image reconstruction).
Submission history
From: Abbas Cheddad [view email][v1] Sun, 21 Nov 2021 20:11:33 GMT (3058kb,D)
[v2] Tue, 23 Nov 2021 07:19:34 GMT (3058kb,D)
[v3] Wed, 13 Jul 2022 10:34:08 GMT (3932kb,D)
[v4] Thu, 18 Jan 2024 22:43:56 GMT (3911kb,D)
Link back to: arXiv, form interface, contact.