We gratefully acknowledge support from
the Simons Foundation and member institutions.

Electrical Engineering and Systems Science

Authors and titles for recent submissions, skipping first 53

[ total of 465 entries: 1-25 | 4-28 | 29-53 | 54-78 | 79-103 | 104-128 | 129-153 | ... | 454-465 ]
[ showing 25 entries per page: fewer | more | all ]

Wed, 12 Jun 2024 (continued, showing 25 of 97 entries)

[54]  arXiv:2406.07435 (cross-list from cs.CV) [pdf, other]
Title: Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration
Comments: Tags: Adversarial attack, image restoration, image deblurring, frequency sampling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[55]  arXiv:2406.07421 (cross-list from cs.SD) [pdf, other]
Title: A Comprehensive Investigation on Speaker Augmentation for Speaker Recognition
Comments: to be published in INTERSPEECH 2024
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[56]  arXiv:2406.07409 (cross-list from stat.ML) [pdf, other]
Title: Accelerating Ill-conditioned Hankel Matrix Recovery via Structured Newton-like Descent
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC)
[57]  arXiv:2406.07399 (cross-list from cs.LG) [pdf, other]
Title: Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[58]  arXiv:2406.07387 (cross-list from cs.IT) [pdf, ps, other]
Title: Machine Learning-Based Channel Prediction for RIS-assisted MIMO Systems With Channel Aging
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[59]  arXiv:2406.07361 (cross-list from cs.CV) [pdf, other]
Title: Deep Implicit Optimization for Robust and Flexible Image Registration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[60]  arXiv:2406.07330 (cross-list from cs.CL) [pdf, other]
Title: CTC-based Non-autoregressive Textless Speech-to-Speech Translation
Comments: ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[61]  arXiv:2406.07329 (cross-list from cs.CV) [pdf, other]
Title: Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[62]  arXiv:2406.07318 (cross-list from cs.CV) [pdf, other]
Title: Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs
Comments: Submitted to the IEEE Transactions on Circuits and System for Video Technology. This manuscript was first submitted for publication on March 31, 2024. It has since been revised twice: on May 22, 2024 and June 10, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Hardware Architecture (cs.AR); Image and Video Processing (eess.IV)
[63]  arXiv:2406.07298 (cross-list from cs.ET) [pdf, other]
Title: Enhanced In-Flight Connectivity for Urban Air Mobility via LEO Satellite Networks
Authors: Karnika Biswas (1), Hakim Ghazzai (1), Abdullah Khanfor (2), Lokman Sboui (3) ((1) CEMSE Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia, (2) Computer Science Department, College of Computer Science \& Information Systems, Najran University, Najran, Saudi Arabia, (3) Systems Engineering Department, École de technologie supérieure (ÉTS), University of Québec, Montréal, Canada)
Comments: 6 pages, 6 figures, conference
Subjects: Emerging Technologies (cs.ET); Signal Processing (eess.SP)
[64]  arXiv:2406.07289 (cross-list from cs.CL) [pdf, other]
Title: Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
Comments: ACL 2024 main conference. Project Page: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[65]  arXiv:2406.07280 (cross-list from cs.SD) [pdf, ps, other]
Title: Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
Comments: 5 pages, accepted for INTERSPEECH 2024, audio samples: this http URL
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[66]  arXiv:2406.07256 (cross-list from cs.SD) [pdf, ps, other]
Title: AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection
Comments: Accepted by Interspeech 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[67]  arXiv:2406.07255 (cross-list from cs.CV) [pdf, other]
Title: Towards Realistic Data Generation for Real-World Super-Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[68]  arXiv:2406.07254 (cross-list from cs.SD) [pdf, ps, other]
Title: SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
Comments: Accepted for INTERSPEECH 2024, corpus project page: this https URL
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[69]  arXiv:2406.07248 (cross-list from math.OC) [pdf, ps, other]
Title: Infinite-Horizon Distributionally Robust Regret-Optimal Control
Comments: Accepted for presentation at the 41st International Conference for Machine Learning (ICML 2024)
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
[70]  arXiv:2406.07203 (cross-list from cs.SD) [pdf, other]
Title: ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks
Comments: Accepted by Interspeech 2024
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[71]  arXiv:2406.07187 (cross-list from quant-ph) [pdf, other]
Title: Quantum Speedup of the Dispersion and Codebook Design Problems
Comments: 13 pages, 7 figures
Subjects: Quantum Physics (quant-ph); Signal Processing (eess.SP)
[72]  arXiv:2406.07165 (cross-list from cs.ET) [pdf, other]
Title: Realizing RF Wavefront Copying with RIS for Future Extended Reality Applications
Comments: This paper was presented in the Seventh International Balkan Conference on Communications and Networking (BalkanCom'24)
Subjects: Emerging Technologies (cs.ET); Signal Processing (eess.SP)
[73]  arXiv:2406.07162 (cross-list from cs.SD) [pdf, other]
Title: EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
Comments: Accepted by INTERSPEECH 2024. GitHub Repository: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[74]  arXiv:2406.07135 (cross-list from cs.IT) [pdf, other]
Title: Smart Wireless Environment Enhanced Telecommunications: A Network Stabilisation Paradigm for Mobile Operators
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)
[75]  arXiv:2406.07131 (cross-list from cs.SD) [pdf, other]
Title: ICGAN: An implicit conditioning method for interpretable feature control of neural audio synthesis
Authors: Yunyi Liu, Craig Jin
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[76]  arXiv:2406.07069 (cross-list from cs.RO) [pdf, other]
Title: Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning
Subjects: Robotics (cs.RO); Systems and Control (eess.SY)
[77]  arXiv:2406.07065 (cross-list from cs.RO) [pdf, other]
Title: Optimal Gait Design for a Soft Quadruped Robot via Multi-fidelity Bayesian Optimization
Subjects: Robotics (cs.RO); Systems and Control (eess.SY)
[78]  arXiv:2406.07012 (cross-list from cs.SD) [pdf, other]
Title: Bridging Language Gaps in Audio-Text Retrieval
Comments: interspeech2024
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[ total of 465 entries: 1-25 | 4-28 | 29-53 | 54-78 | 79-103 | 104-128 | 129-153 | ... | 454-465 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2406, contact, help  (Access key information)