We gratefully acknowledge support from
the Simons Foundation and member institutions.

Signal Processing

New submissions

[ total of 29 entries: 1-29 ]
[ showing up to 2000 entries per page: fewer | more ]

New submissions for Thu, 4 Mar 21

[1]  arXiv:2103.02087 [pdf, other]
Title: Deep J-Sense: Accelerated MRI Reconstruction via Unrolled Alternating Optimization
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)

Accelerated multi-coil magnetic resonance imaging reconstruction has seen a substantial recent improvement combining compressed sensing with deep learning. However, most of these methods rely on estimates of the coil sensitivity profiles, or on calibration data for estimating model parameters. Prior work has shown that these methods degrade in performance when the quality of these estimators are poor or when the scan parameters differ from the training conditions. Here we introduce Deep J-Sense as a deep learning approach that builds on unrolled alternating minimization and increases robustness: our algorithm refines both the magnetization (image) kernel and the coil sensitivity maps. Experimental results on a subset of the knee fastMRI dataset show that this increases reconstruction performance and provides a significant degree of robustness to varying acceleration factors and calibration region sizes.

[2]  arXiv:2103.02091 [pdf]
Title: Preliminaries on the Accurate Estimation of the Hurst Exponent Using Time Series
Comments: 8 pages, 6 figures, 2021 IEEE International Conference on Automation/XXIV Congress of the Chilean Association of Automatic Control (ICA-ACCA)
Subjects: Signal Processing (eess.SP); Discrete Mathematics (cs.DM)

This article explores the required amount of time series points from a high-speed computer network to accurately estimate the Hurst exponent. The methodology consists in designing an experiment using estimators that are applied to time series addresses resulting from the capture of high-speed network traffic, followed by addressing the minimum amount of point required to obtain in accurate estimates of the Hurst exponent. The methodology addresses the exhaustive analysis of the Hurst exponent considering bias behaviour, standard deviation, and Mean Squared Error using fractional Gaussian noise signals with stationary increases. Our results show that the Whittle estimator successfully estimates the Hurst exponent in series with few points. Based on the results obtained, a minimum length for the time series is empirically proposed. Finally, to validate the results, the methodology is applied to real traffic captures in a high-speed computer network.

[3]  arXiv:2103.02169 [pdf]
Title: Real Time Vigilance Detection using Frontal EEG
Journal-ref: International Journal of Computer Science & Information Technology (IJCSIT) Vol 13, No 1, February 2021
Subjects: Signal Processing (eess.SP)

Vigilance of an operator is compromised in performing many monotonous activities like workshop and manufacturing floor tasks, driving, night shift workers, flying, and in general any activity which requires keen attention of an individual over prolonged periods of time. Driver or operator fatigue in these situations leads to drowsiness and lowered vigilance which is one of the largest contributors to injuries and fatalities amongst road accidents or workshop floor accidents. Having a vigilance monitoring system to detect drop in vigilance in these situations becomes very important.
This paper presents a system which uses non-invasively recorded Frontal EEG from an easy-to-use commercially available Brain Computer Interface wearable device to determine the vigilance state of an individual. The change in the power spectrum in the Frontal Theta Band (4-8Hz) of an individual's brain wave predicts the changes in the attention level of an individual - providing an early detection and warning system. This method provides an accurate, yet cheap and practical system for vigilance monitoring across different environments.

[4]  arXiv:2103.02183 [pdf]
Title: Auditory Attention Decoding from EEG using Convolutional Recurrent Neural Network
Comments: 5 pages, 4 figures, submitted to EUSIPCO 2021
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)

The auditory attention decoding (AAD) approach was proposed to determine the identity of the attended talker in a multi-talker scenario by analyzing electroencephalography (EEG) data. Although the linear model-based method has been widely used in AAD, the linear assumption was considered oversimplified and the decoding accuracy remained lower for shorter decoding windows. Recently, nonlinear models based on deep neural networks (DNN) have been proposed to solve this problem. However, these models did not fully utilize both the spatial and temporal features of EEG, and the interpretability of DNN models was rarely investigated. In this paper, we proposed novel convolutional recurrent neural network (CRNN) based regression model and classification model, and compared them with both the linear model and the state-of-the-art DNN models. Results showed that, our proposed CRNN-based classification model outperformed others for shorter decoding windows (around 90% for 2 s and 5 s). Although worse than classification models, the decoding accuracy of the proposed CRNN-based regression model was about 5% greater than other regression models. The interpretability of DNN models was also investigated by visualizing layers' weight.

[5]  arXiv:2103.02186 [pdf]
Title: Eye-gaze Estimation with HEOG and Neck EMG using Deep Neural Networks
Comments: 5 pages, 5 figures, submitted to EUSIPCO 2021
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)

Hearing-impaired listeners usually have troubles attending target talker in multi-talker scenes, even with hearing aids (HAs). The problem can be solved with eye-gaze steering HAs, which requires listeners eye-gazing on the target. In a situation where head rotates, eye-gaze is subject to both behaviors of saccade and head rotation. However, existing methods of eye-gaze estimation did not work reliably, since the listener's strategy of eye-gaze varies and measurements of the two behaviors were not properly combined. Besides, existing methods were based on hand-craft features, which could overlook some important information. In this paper, a head-fixed and a head-free experiments were conducted. We used horizontal electrooculography (HEOG) and neck electromyography (NEMG), which separately measured saccade and head rotation to commonly estimate eye-gaze. Besides traditional classifier and hand-craft features, deep neural networks (DNN) were introduced to automatically extract features from intact waveforms. Evaluation results showed that when the input was HEOG with inertial measurement unit, the best performance of our proposed DNN classifiers achieved 93.3%; and when HEOG was with NEMG together, the accuracy reached 72.6%, higher than that with HEOG (about 71.0%) or NEMG (about 35.7%) alone. These results indicated the feasibility to estimate eye-gaze with HEOG and NEMG.

[6]  arXiv:2103.02215 [pdf, other]
Title: The generalized method of moments for multi-reference alignment
Subjects: Signal Processing (eess.SP)

This paper studies the application of the generalized method of moments (GMM) to multi-reference alignment (MRA): the problem of estimating a signal from its circularly-translated and noisy copies. We begin by proving that the GMM estimator maintains its asymptotic optimality for statistical models with group symmetry, including MRA. Then, we conduct a comprehensive numerical study and show that the GMM substantially outperforms the classical method of moments, whose application to MRA has been studied thoroughly in the literature. We also formulate the GMM to estimate a three-dimensional molecular structure using cryo-electron microscopy and present numerical results on simulated data.

[7]  arXiv:2103.02299 [pdf, other]
Title: Scaling Laws for Unamplified Coherent Transmission in Next-generation Short-Reach and Access Networks
Subjects: Signal Processing (eess.SP); Networking and Internet Architecture (cs.NI)

International standardization bodies (IEEE and ITU-T) working on the evolution of transmission technologies are still considering traditional direct detection solutions for the most relevant short reach optical link applications, that are Passive Optical Networks (PON) and intra-data center interconnects. Anyway, future jumps towards even higher bit rates per wavelength will require a complete paradigm shift, moving towards coherent technologies. In this paper, we thus study both analytically and experimentally the scaling laws of unamplified coherent transmission in the short-reach communications ecosystems. We believe that, given the extremely tight techno-economic constraints, such a revolutionary transition towards coherent in short-reach first requires a very detailed study of its intrinsic capabilities in largely extending the limitation currently imposed by direct detection systems. To this end, this paper focuses on the ultimate physical layer limitations of unamplified coherent systems in terms of bit rate and power budget. The main parameters of our performance estimation model are extracted through fitting with a set of experimental characterizations and later used as the starting point of a scaling laws study regarding local oscillator power, modulator-induced attenuation, bit rate, and maximum achievable power budget. The analytically predicted performance is then verified through transmission experiments, including a demonstration on a 37-km installed metropolitan dark fiber in the city of Turin.

[8]  arXiv:2103.02306 [pdf, ps, other]
Title: Rate Analysis and Deep Neural Network Detectors for SEFDM FTN Systems
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT)

In this work we compare the capacity and achievable rate of uncoded faster than Nyquist (FTN) signalling in the frequency domain, also referred to as spectrally efficient FDM (SEFDM). We propose a deep residual convolutional neural network detector for SEFDM signals in additive white Gaussian noise channels, that allows to approach the Mazo limit in systems with up to 60 subcarriers. Notably, the deep detectors achieve a loss less than 0.4-0.7 dB for uncoded QPSK SEFDM systems of 12 to 60 subcarriers at a 15% spectral compression.

Cross-lists for Thu, 4 Mar 21

[9]  arXiv:2103.02134 (cross-list from cs.IT) [pdf, ps, other]
Title: QoS-Driven Resource Optimization for Intelligent Fog Radio Access Network: A Dynamic Power Allocation Perspective
Authors: Jun Yu, Rui Wang, Jun Wu
Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)

The fog radio access network (Fog-RAN) has been considered a promising wireless access architecture to help shorten the communication delay and relieve the large data delivery burden over the backhaul links. However, limited by conventional inflexible communication design, Fog-RAN cannot be used in some complex communication scenarios. In this study, we focus on investigating a more intelligent Fog-RAN to assist the communication in a high-speed railway environment. Due to the train's continuously moving, the communication should be designed intelligently to adapt to channel variation. Specifically, we dynamically optimize the power allocation in the remote radio heads (RRHs) to minimize the total network power cost considering multiple quality-of-service (QoS) requirements and channel variation. The impact of caching on the power allocation is considered. The dynamic power optimization is analyzed to obtain a closed-form solution in certain cases. The inherent tradeoff among the total network cost, delay and delivery content size is further discussed. To evaluate the performance of the proposed dynamic power allocation, we present an invariant power allocation counterpart as a performance comparison benchmark. The result of our simulation reveals that dynamic power allocation can significantly outperform the invariant power allocation scheme, especially with a random caching strategy or limited caching resources at the RRHs.

[10]  arXiv:2103.02162 (cross-list from cs.LG) [pdf, other]
Title: Predicting Driver Fatigue in Automated Driving with Explainability
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)

Research indicates that monotonous automated driving increases the incidence of fatigued driving. Although many prediction models based on advanced machine learning techniques were proposed to monitor driver fatigue, especially in manual driving, little is known about how these black-box machine learning models work. In this paper, we proposed a combination of eXtreme Gradient Boosting (XGBoost) and SHAP (SHapley Additive exPlanations) to predict driver fatigue with explanations due to their efficiency and accuracy. First, in order to obtain the ground truth of driver fatigue, PERCLOS (percentage of eyelid closure over the pupil over time) between 0 and 100 was used as the response variable. Second, we built a driver fatigue regression model using both physiological and behavioral measures with XGBoost and it outperformed other selected machine learning models with 3.847 root-mean-squared error (RMSE), 1.768 mean absolute error (MAE) and 0.996 adjusted $R^2$. Third, we employed SHAP to identify the most important predictor variables and uncovered the black-box XGBoost model by showing the main effects of most important predictor variables globally and explaining individual predictions locally. Such an explainable driver fatigue prediction model offered insights into how to intervene in automated driving when necessary, such as during the takeover transition period from automated driving to manual driving.

[11]  arXiv:2103.02286 (cross-list from cs.IT) [pdf]
Title: The Ultimate Weapon for Ultra-Broadband 6G: Digital Beamforming and Doubly Massive mmWave MIMO
Comments: Submitted to IEEE Communications Magazine, special issue on 6G Communications for 2030 and Beyond
Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)

The use of millimeter waves for wireless communications is one of the main technological innovations of 5G systems with respect to previous generations of cellular systems. Their consideration, however, has been up to now mainly restricted to the case in which analog or, at most, hybrid analog-digital beamforming structures were used, thus posing a limitation on the multiplexing capabilities and peak data rates that could be theoretically achieved at these frequencies. Recent progress in the field of electronics, however, has made the energy consumption of digital beamforming structures at least on par with that of analog beamforming, thus redeeming them from the ghetto they had been placed in over the last year. Digital beamforming, coupled with the use of large antenna arrays at both sides of the communication link, promises thus to be one of the secret weapons of future 6G networks, capable of unleashing unprecedented values of spectral and energy efficiency for ultra-broadband connectivity.

[12]  arXiv:2103.02313 (cross-list from eess.AS) [pdf, other]
Title: Open community platform for hearing aid algorithm research: open Master Hearing Aid (openMHA)
Comments: 25 pages, 4 figures, submitted to SoftwareX
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)

open Master Hearing Aid (openMHA) was developed and provided to the hearing aid research community as an open-source software platform with the aim to support sustainable and reproducible research towards improvement and new types of assistive hearing systems not limited by proprietary software. The software offers a flexible framework that allows the users to conduct hearing aid research using tools and a number of signal processing plugins provided with the software as well as the implementation of own methods. The openMHA software is independent of a specific hardware and supports Linux, MacOS and Windows operating systems as well as 32- bit and 64-bit ARM-based architectures such as used in small portable integrated systems. www.openmha.org

[13]  arXiv:2103.02334 (cross-list from cs.IT) [pdf, ps, other]
Title: Application of NOMA in 6G Networks: Future Vision and Research Opportunities for Next Generation Multiple Access
Comments: 14 pages, 5 figures, 1 table
Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)

As a prominent member of the next generation multiple access (NGMA) family, non-orthogonal multiple access (NOMA) has been recognized as a promising multiple access candidate for the sixth-generation (6G) networks. This article focuses on applying NOMA in 6G networks, with an emphasis on proposing the so-called "One Basic Principle plus Four New" concept. Starting with the basic NOMA principle, the importance of successive interference cancellation (SIC) becomes evident. In particular, the advantages and drawbacks of both the channel state information based SIC and quality-of-service based SIC are discussed. Then, the application of NOMA to meet the new 6G performance requirements, especially for massive connectivity, is explored. Furthermore, the integration of NOMA with new physical layer techniques is considered, followed by introducing new application scenarios for NOMA towards 6G. Finally, the application of machine learning in NOMA networks is investigated, ushering in the machine learning empowered NGMA era.

[14]  arXiv:2103.02348 (cross-list from cs.IT) [pdf, other]
Title: Terahertz-Band MIMO-NOMA: Adaptive Superposition Coding and Subspace Detection
Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)

We consider the problem of efficient ultra-massive multiple-input multiple-output (UM-MIMO) data detection in terahertz (THz)-band non-orthogonal multiple access (NOMA) systems. We argue that the most common THz NOMA configuration is power-domain superposition coding over quasi-optical doubly-massive MIMO channels. We propose spatial tuning techniques that modify antenna subarray arrangements to enhance channel conditions. Towards recovering the superposed data at the receiver side, we propose a family of data detectors based on low-complexity channel matrix puncturing, in which higher-order detectors are dynamically formed from lower-order component detectors. We first detail the proposed solutions for the case of superposition coding of multiple streams in point-to-point THz MIMO links. We then extend the study to multi-user NOMA, in which randomly distributed users get grouped into narrow cell sectors and are allocated different power levels depending on their proximity to the base station. We show that successive interference cancellation is carried with minimal performance and complexity costs under spatial tuning. We derive approximate bit error rate (BER) equations, and we propose an architectural design to illustrate complexity reductions. Under typical THz conditions, channel puncturing introduces more than an order of magnitude reduction in BER at high signal-to-noise ratios while reducing complexity by approximately 90%.

[15]  arXiv:2103.02378 (cross-list from cs.SD) [pdf, other]
Title: Continuous Speech Separation with Ad Hoc Microphone Arrays
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)

Speech separation has been shown effective for multi-talker speech recognition. Under the ad hoc microphone array setup where the array consists of spatially distributed asynchronous microphones, additional challenges must be overcome as the geometry and number of microphones are unknown beforehand. Prior studies show, with a spatial-temporalinterleaving structure, neural networks can efficiently utilize the multi-channel signals of the ad hoc array. In this paper, we further extend this approach to continuous speech separation. Several techniques are introduced to enable speech separation for real continuous recordings. First, we apply a transformer-based network for spatio-temporal modeling of the ad hoc array signals. In addition, two methods are proposed to mitigate a speech duplication problem during single talker segments, which seems more severe in the ad hoc array scenarios. One method is device distortion simulation for reducing the acoustic mismatch between simulated training data and real recordings. The other is speaker counting to detect the single speaker segments and merge the output signal channels. Experimental results for AdHoc-LibiCSS, a new dataset consisting of continuous recordings of concatenated LibriSpeech utterances obtained by multiple different devices, show the proposed separation method can significantly improve the ASR accuracy for overlapped speech with little performance degradation for single talker segments.

[16]  arXiv:2103.02421 (cross-list from eess.AS) [pdf]
Title: The effect of speech and noise levels on the quality perceived by cochlear implant and normal hearing listeners
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)

Electrical hearing by cochlear implants (CIs) may be fundamentally different from acoustic hearing by normal-hearing (NH) listeners, presumably showing unequal speech quality perception in various noise environments. Noise reduction (NR) algorithms used in CI reduce the noise in favor of signal-to-noise ratio (SNR), regardless of plausible accompanying distortions that may degrade the speech quality perception. To gain better understanding of CI speech quality perception, the present work aimed investigating speech quality perception in a diverse noise conditions, including factors of speech/noise levels, type of noise, and distortions caused by NR models. Fifteen NH and seven CI subjects participated in this study. Speech sentences were set to two different levels (65 and 75 dB SPL). Two types of noise (Cafeteria and Babble) at three levels (55, 65, and 75 dB SPL) were used. Sentences were processed using two NR algorithms to investigate the perceptual sensitivity of CI and NH listeners to the distortion. All sentences processed with the combinations of these sets were presented to CI and NH listeners, and they were asked to rate the sound quality of speech as they perceived. The effect of each factor on the perceived speech quality was investigated based on the group averaged quality rated by CI and NH listeners. Consistent with previous studies, CI listeners were not as sensitive as NH to the distortion made by NR algorithms. Statistical analysis showed that the speech level has significant effect on quality perception. At the same SNR, the quality of 65 dB speech was rated higher than that of 75 dB for CI users, but vice versa for NH listeners. Therefore, the present study showed that the perceived speech quality patterns were different between CI and NH listeners in terms of their sensitivity to distortion and speech level in complex listening environment.

Replacements for Thu, 4 Mar 21

[17]  arXiv:2007.13418 (replaced) [pdf, other]
Title: Intelligent Trajectory Planning in UAV-mounted Wireless Networks: A Quantum-Inspired Reinforcement Learning Perspective
Comments: Double-column 6-page paper
Subjects: Signal Processing (eess.SP); Systems and Control (eess.SY)
[18]  arXiv:2010.06981 (replaced) [pdf, ps, other]
Title: Passive RIS vs.Hybrid RIS: A Comparative Study on Channel Estimation
Comments: 7 pages, 5 figures
Subjects: Signal Processing (eess.SP)
[19]  arXiv:2012.01749 (replaced) [pdf, other]
Title: Cross-Correlation Based Discriminant Criterion for Channel Selection in Motor Imagery BCI Systems
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[20]  arXiv:2012.04074 (replaced) [pdf, other]
Title: SCUBA: An In-Device Multiplexed Protocol for Sidelink Communication on Unlicensed Bands
Comments: 15 pages, 16 figures. Submitted to an IEEE journal
Subjects: Signal Processing (eess.SP)
[21]  arXiv:2102.01421 (replaced) [pdf, other]
Title: A short overview of adaptive multichannel filters SNR loss analysis
Authors: Olivier Besson
Subjects: Signal Processing (eess.SP)
[22]  arXiv:2004.10715 (replaced) [pdf, other]
Title: Redefining Wireless Communication for 6G: Signal Processing Meets Deep Learning
Comments: An updated version is currently under review
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Signal Processing (eess.SP)
[23]  arXiv:2007.11393 (replaced) [pdf, other]
Title: Optimal Pacing of a Cyclist in a Time Trial Based on Experimentally Calibrated Models of Fatigue and Recovery
Comments: 14 pages, 10 figures
Subjects: Systems and Control (eess.SY); Signal Processing (eess.SP)
[24]  arXiv:2010.08641 (replaced) [pdf, other]
Title: Deep Neural Dynamic Bayesian Networks applied to EEG sleep spindles modeling
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[25]  arXiv:2010.12301 (replaced) [pdf, other]
Title: Learning Multi-layer Graphs and a Common Representation for Clustering
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[26]  arXiv:2012.08615 (replaced) [pdf, other]
Title: Nonlinear Schrödinger Kernel Computing for Ultrafast Single-shot Data Acquisition and Inference
Comments: Update the system diagram, include more results and methodology
Subjects: Optics (physics.optics); Signal Processing (eess.SP)
[27]  arXiv:2101.10609 (replaced) [pdf, other]
Title: On the distributions of some statistics related to adaptive filters trained with $t$-distributed samples
Authors: Olivier Besson
Subjects: Statistics Theory (math.ST); Signal Processing (eess.SP)
[28]  arXiv:2103.00675 (replaced) [pdf, other]
Title: Bayesian filtering for nonlinear stochastic systems using holonomic gradient method with integral transform
Comments: 6 pages, 2 figures, submitted to IEEE Control Systems Letters and the 60th IEEE Conference on Decision and Control
Subjects: Numerical Analysis (math.NA); Symbolic Computation (cs.SC); Signal Processing (eess.SP); Systems and Control (eess.SY)
[29]  arXiv:2103.01829 (replaced) [pdf, ps, other]
Title: Terahertz Ultra-Massive MIMO-Based Aeronautical Communications in Space-Air-Ground Integrated Networks
Comments: 26 pages, 20 figures, accepted by IEEE Journal on Selected Areas in Communications
Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)
[ total of 29 entries: 1-29 ]
[ showing up to 2000 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, recent, 2103, contact, help  (Access key information)