We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions

[ total of 656 entries: 1-558 | 559-656 ]
[ showing 558 entries per page: fewer | more | all ]

Mon, 6 May 2024

[1]  arXiv:2405.02267 [pdf, other]
Title: Structural Pruning of Pre-trained Language Models via Neural Architecture Search
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[2]  arXiv:2405.02240 [pdf, other]
Title: Subgraph2vec: A random walk-based algorithm for embedding knowledge graphs
Subjects: Machine Learning (cs.LG)
[3]  arXiv:2405.02235 [pdf, other]
Title: Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG)
[4]  arXiv:2405.02200 [pdf, other]
Title: Position Paper: Rethinking Empirical Research in Machine Learning: Addressing Epistemic and Methodological Challenges of Experimentation
Comments: Accepted for publication at ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[5]  arXiv:2405.02183 [pdf, other]
Title: Metalearners for Ranking Treatment Effects
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[6]  arXiv:2405.02181 [pdf, other]
Title: Imitation Learning in Discounted Linear MDPs without exploration assumptions
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG)
[7]  arXiv:2405.02180 [pdf, other]
Title: A Flow-Based Model for Conditional and Probabilistic Electricity Consumption Profile Generation and Prediction
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[8]  arXiv:2405.02161 [pdf, other]
Title: Simulating the economic impact of rationality through reinforcement learning and agent-based modelling
Comments: 8 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Multiagent Systems (cs.MA); General Economics (econ.GN)
[9]  arXiv:2405.02154 [pdf, other]
Title: Neural Context Flows for Learning Generalizable Dynamical Systems
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS)
[10]  arXiv:2405.02140 [pdf, other]
Title: An Information Theoretic Perspective on Conformal Prediction
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[11]  arXiv:2405.02098 [pdf, other]
Title: Forecasting Ferry Passenger Flow Using Long-Short Term Memory Neural Networks
Authors: Daniel Fesalbon
Subjects: Machine Learning (cs.LG)
[12]  arXiv:2405.02086 [pdf, other]
Title: Multi-level projection with exponential parallel speedup; Application to sparse auto-encoders neural networks
Subjects: Machine Learning (cs.LG)
[13]  arXiv:2405.02081 [pdf, other]
Title: A Mutual Information Perspective on Federated Contrastive Learning
Comments: Published as a conference paper at ICLR 2024
Subjects: Machine Learning (cs.LG)
[14]  arXiv:2405.02074 [pdf, other]
Title: A Federated Learning Benchmark on Tabular Data: Comparing Tree-Based Models and Neural Networks
Comments: 8 pages, 6 figures, 6 tables, FMEC 2023 (best paper)
Subjects: Machine Learning (cs.LG)
[15]  arXiv:2405.02067 [pdf, other]
Title: Histogram-Based Federated XGBoost using Minimal Variance Sampling for Federated Tabular Data
Comments: 6 figures, 5 tables, 8 pages, FLTA 2023 (together with FMEC 2023)
Subjects: Machine Learning (cs.LG)
[16]  arXiv:2405.02063 [pdf, other]
Title: Few-sample Variational Inference of Bayesian Neural Networks with Arbitrary Nonlinearities
Authors: David J. Schodt
Subjects: Machine Learning (cs.LG)
[17]  arXiv:2405.02062 [pdf, other]
Title: Dyna-Style Learning with A Macroscopic Model for Vehicle Platooning in Mixed-Autonomy Traffic
Subjects: Machine Learning (cs.LG)
[18]  arXiv:2405.02060 [pdf, other]
Title: Federated Learning for Tabular Data using TabNet: A Vehicular Use-Case
Comments: 7 pages, 9 figures, 1 table, ICCP Conference 2022
Subjects: Machine Learning (cs.LG)
[19]  arXiv:2405.02044 [pdf, other]
Title: Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Systems and Control (eess.SY); Optimization and Control (math.OC)
[20]  arXiv:2405.02041 [pdf, other]
Title: Stabilizing Backpropagation Through Time to Learn Complex Physics
Comments: Published at ICLR 2024, code available at this https URL
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[21]  arXiv:2405.01995 [pdf, other]
Title: Cooperation and Federation in Distributed Radar Point Cloud Processing
Journal-ref: 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[22]  arXiv:2405.01990 [pdf, other]
Title: Soft Label PU Learning
Subjects: Machine Learning (cs.LG)
[23]  arXiv:2405.01978 [pdf, other]
Title: Quantifying Distribution Shifts and Uncertainties for Enhanced Model Robustness in Machine Learning Applications
Authors: Vegard Flovik
Comments: Working paper
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[24]  arXiv:2405.01974 [pdf, other]
Title: Multitask Extension of Geometrically Aligned Transfer Encoder
Comments: 7 pages, 3 figures, 2 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[25]  arXiv:2405.01927 [pdf, other]
Title: SlotGAT: Slot-based Message Passing for Heterogeneous Graph Neural Network
Comments: Published as a conference paper at ICML 2023
Subjects: Machine Learning (cs.LG)
[26]  arXiv:2405.01851 [pdf, other]
Title: Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[27]  arXiv:2405.01843 [pdf, ps, other]
Title: Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization
Comments: arXiv admin note: text overlap with arXiv:2306.10486
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[28]  arXiv:2405.01838 [pdf, other]
Title: A Novel Approach to Guard from Adversarial Attacks using Stable Diffusion
Subjects: Machine Learning (cs.LG)
[29]  arXiv:2405.01817 [pdf, other]
Title: Uniformly Stable Algorithms for Adversarial Training and Beyond
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[30]  arXiv:2405.01814 [pdf, other]
Title: Efficient and Economic Large Language Model Inference with Attention Offloading
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[31]  arXiv:2405.01778 [pdf, other]
Title: Hierarchical mixture of discriminative Generalized Dirichlet classifiers
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[32]  arXiv:2405.01762 [pdf, ps, other]
Title: EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear Time
Comments: 19 pages
Journal-ref: ICML 2024
Subjects: Machine Learning (cs.LG)
[33]  arXiv:2405.01760 [pdf, other]
Title: Reinforcement Learning-Guided Semi-Supervised Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[34]  arXiv:2405.01744 [pdf, other]
Title: ALCM: Autonomous LLM-Augmented Causal Discovery Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Methodology (stat.ME)
[35]  arXiv:2405.01739 [pdf, other]
Title: Enhancing User Experience in On-Device Machine Learning with Gated Compression Layers
Comments: Initial Submission
Subjects: Machine Learning (cs.LG)
[36]  arXiv:2405.01731 [pdf, other]
Title: Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
Comments: Accepted to ICML2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[37]  arXiv:2405.01719 [pdf, other]
Title: Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmark
Comments: To be published in ICML 2024
Subjects: Machine Learning (cs.LG)
[38]  arXiv:2405.01718 [pdf, other]
Title: Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk
Authors: Xinyi Ni, Lifeng Lai
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[39]  arXiv:2405.01714 [pdf, other]
Title: Interpretable Vital Sign Forecasting with Model Agnostic Attention Maps
Comments: 8 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[40]  arXiv:2405.01711 [pdf, ps, other]
Title: Individual Fairness Through Reweighting and Tuning
Comments: 14 pages, 1 figure, and 2 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[41]  arXiv:2405.01708 [pdf, other]
Title: A deep causal inference model for fully-interpretable travel behaviour analysis
Subjects: Machine Learning (cs.LG)
[42]  arXiv:2405.01704 [pdf, other]
Title: Privacy-aware Berrut Approximated Coded Computing for Federated Learning
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[43]  arXiv:2405.01702 [pdf, other]
Title: Optimization without retraction on the random generalized Stiefel manifold
Comments: 21 pages, 10 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[44]  arXiv:2405.01684 [pdf, other]
Title: Intelligent Switching for Reset-Free RL
Comments: Published at ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[45]  arXiv:2405.01680 [pdf, other]
Title: Physics-Informed Neural Networks: Minimizing Residual Loss with Wide Networks and Effective Activations
Comments: Accepted at IJCAI 2024
Subjects: Machine Learning (cs.LG)
[46]  arXiv:2405.01677 [pdf, other]
Title: Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[47]  arXiv:2405.01663 [pdf, ps, other]
Title: ATNPA: A Unified View of Oversmoothing Alleviation in Graph Neural Networks
Comments: 16 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[48]  arXiv:2405.01661 [pdf, other]
Title: When a Relation Tells More Than a Concept: Exploring and Evaluating Classifier Decisions with CoReX
Comments: preliminary version, submitted to Machine Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2405.01617 [pdf, other]
Title: An Explainable and Conformal AI Model to Detect Temporomandibular Joint Involvement in Children Suffering from Juvenile Idiopathic Arthritis
Comments: Accepted at EMBC 2024
Subjects: Machine Learning (cs.LG)
[50]  arXiv:2405.01614 [pdf, other]
Title: A probabilistic estimation of remaining useful life from censored time-to-event data
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[51]  arXiv:2405.01611 [pdf, other]
Title: Unifying and extending Precision Recall metrics for assessing generative models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[52]  arXiv:2405.01607 [pdf, other]
Title: Wildfire Risk Prediction: A Review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2405.01603 [pdf, other]
Title: KITE: A Kernel-based Improved Transferability Estimation Method
Authors: Yunhui Guo
Comments: 14 pages
Subjects: Machine Learning (cs.LG)
[54]  arXiv:2405.01563 [pdf, other]
Title: Mitigating LLM Hallucinations via Conformal Abstention
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[55]  arXiv:2405.01557 [pdf, other]
Title: An Experimental Study on the Rashomon Effect of Balancing Methods in Imbalanced Classification
Comments: 16 pages, 6 figures
Subjects: Machine Learning (cs.LG)
[56]  arXiv:2405.01554 [pdf, other]
Title: Early-stage detection of cognitive impairment by hybrid quantum-classical algorithm using resting-state functional MRI time-series
Comments: 28 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[57]  arXiv:2405.02225 (cross-list from stat.ML) [pdf, other]
Title: Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks
Comments: 28 pages, 8 figures, accepted by ICML2024
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Methodology (stat.ME)
[58]  arXiv:2405.02221 (cross-list from math.NA) [pdf, other]
Title: Discretization Error of Fourier Neural Operators
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[59]  arXiv:2405.02220 (cross-list from cs.CV) [pdf, other]
Title: Designed Dithering Sign Activation for Binary Neural Networks
Comments: 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[60]  arXiv:2405.02213 (cross-list from cs.SE) [pdf, other]
Title: Automatic Programming: Large Language Models and Beyond
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[61]  arXiv:2405.02201 (cross-list from math.OC) [pdf, other]
Title: Regularized Q-learning through Robust Averaging
Comments: 26 pages, 5 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[62]  arXiv:2405.02195 (cross-list from cs.CL) [pdf, ps, other]
Title: Impact of emoji exclusion on the performance of Arabic sarcasm detection models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[63]  arXiv:2405.02191 (cross-list from cs.CV) [pdf, ps, other]
Title: Non-Destructive Peat Analysis using Hyperspectral Imaging and Machine Learning
Comments: 4 pages,4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[64]  arXiv:2405.02188 (cross-list from stat.ML) [pdf, other]
Title: Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[65]  arXiv:2405.02175 (cross-list from cs.CL) [pdf, other]
Title: Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Comments: Short paper
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[66]  arXiv:2405.02148 (cross-list from cs.AI) [pdf, ps, other]
Title: Towards a Formal Creativity Theory: Preliminary results in Novelty and Transformativeness
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[67]  arXiv:2405.02141 (cross-list from cs.IR) [pdf, other]
Title: Multi-Objective Recommendation via Multivariate Policy Learning
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[68]  arXiv:2405.02124 (cross-list from eess.AS) [pdf, other]
Title: TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[69]  arXiv:2405.02119 (cross-list from cs.SD) [pdf, other]
Title: Can We Identify Unknown Audio Recording Environments in Forensic Scenarios?
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[70]  arXiv:2405.02101 (cross-list from eess.SP) [pdf, other]
Title: Discrete Aware Matrix Completion via Convexized $\ell_0$-Norm Approximation
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[71]  arXiv:2405.02082 (cross-list from stat.ML) [pdf, ps, other]
Title: A comparative study of conformal prediction methods for valid uncertainty quantification in machine learning
Authors: Nicolas Dewolf
Comments: At 339 pages, this document is a live/working version of my PhD dissertation published in 2024 by the University of Ghent (UGent)
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Statistics Theory (math.ST)
[72]  arXiv:2405.01994 (cross-list from stat.ML) [pdf, ps, other]
Title: Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery
Authors: Patrick Saux
Comments: Doctoral thesis. Some pdf readers (e.g. Firefox) have trouble rendering the theorems/definitions environment. When reading online, please prefer e.g. Chrome
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[73]  arXiv:2405.01988 (cross-list from cs.SD) [pdf, other]
Title: Joint sentiment analysis of lyrics and audio in music
Comments: published at DAGA 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[74]  arXiv:2405.01976 (cross-list from cs.CL) [pdf, other]
Title: Conformal Prediction for Natural Language Processing: A Survey
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[75]  arXiv:2405.01975 (cross-list from cs.CE) [pdf, other]
Title: Introducing a microstructure-embedded autoencoder approach for reconstructing high-resolution solution field from reduced parametric space
Subjects: Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[76]  arXiv:2405.01964 (cross-list from stat.ML) [pdf, other]
Title: Understanding LLMs Requires More Than Statistical Generalization
Comments: Accepted at ICML2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[77]  arXiv:2405.01963 (cross-list from cs.CR) [pdf, other]
Title: From Attack to Defense: Insights into Deep Learning Security Measures in Black-Box Settings
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[78]  arXiv:2405.01952 (cross-list from stat.ML) [pdf, other]
Title: Three Quantization Regimes for ReLU Networks
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[79]  arXiv:2405.01943 (cross-list from cs.CL) [pdf, other]
Title: Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[80]  arXiv:2405.01934 (cross-list from cs.CV) [pdf, other]
Title: Impact of Architectural Modifications on Deep Learning Adversarial Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[81]  arXiv:2405.01906 (cross-list from cs.AI) [pdf, other]
Title: Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization
Comments: 17 pages, 6 figures
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[82]  arXiv:2405.01883 (cross-list from cs.CL) [pdf, other]
Title: DALLMi: Domain Adaption for LLM-based Multi-label Classifier
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[83]  arXiv:2405.01881 (cross-list from q-fin.RM) [pdf, ps, other]
Title: Explainable Risk Classification in Financial Reports
Subjects: Risk Management (q-fin.RM); Machine Learning (cs.LG)
[84]  arXiv:2405.01873 (cross-list from cs.CL) [pdf, other]
Title: Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language
Comments: This paper contains 6 pages, 8 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[85]  arXiv:2405.01859 (cross-list from cs.CY) [pdf, other]
Title: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research
Comments: 9 pages, in ICML 2024
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[86]  arXiv:2405.01855 (cross-list from cs.IR) [pdf, ps, other]
Title: Robust Explainable Recommendation
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[87]  arXiv:2405.01849 (cross-list from cs.IR) [pdf, ps, other]
Title: Stability of Explainable Recommendation
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[88]  arXiv:2405.01848 (cross-list from cs.IR) [pdf, other]
Title: RankSHAP: a Gold Standard Feature Attribution Method for the Ranking Task
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[89]  arXiv:2405.01810 (cross-list from cs.AI) [pdf, other]
Title: Non-linear Welfare-Aware Strategic Learning
Authors: Tian Xie, Xueru Zhang
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[90]  arXiv:2405.01792 (cross-list from cs.RO) [pdf, other]
Title: Learning Robust Autonomous Navigation and Locomotion for Wheeled-Legged Robots
Journal-ref: Science Robotics, 2024, Vol 9, Issue 89
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
[91]  arXiv:2405.01776 (cross-list from cs.RO) [pdf, other]
Title: An Approach to Systematic Data Acquisition and Data-Driven Simulation for the Safety Testing of Automated Driving Functions
Comments: 8 pages, 5 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[92]  arXiv:2405.01775 (cross-list from cs.AR) [pdf, other]
Title: Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design
Comments: Accepted for publication at MLSys 2024
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[93]  arXiv:2405.01761 (cross-list from stat.ML) [pdf, other]
Title: Multivariate Bayesian Last Layer for Regression: Uncertainty Quantification and Disentanglement
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[94]  arXiv:2405.01758 (cross-list from cs.RO) [pdf, other]
Title: CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning
Comments: 8 pages, 3 figures
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
[95]  arXiv:2405.01745 (cross-list from cs.AI) [pdf, other]
Title: Large Language Models for UAVs: Current State and Pathways to the Future
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[96]  arXiv:2405.01741 (cross-list from cs.CR) [pdf, other]
Title: PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[97]  arXiv:2405.01737 (cross-list from stat.ML) [pdf, other]
Title: Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[98]  arXiv:2405.01726 (cross-list from eess.IV) [pdf, ps, other]
Title: SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[99]  arXiv:2405.01725 (cross-list from eess.IV) [pdf, other]
Title: Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[100]  arXiv:2405.01691 (cross-list from cs.CV) [pdf, other]
Title: Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
Comments: Presented at the Robot Trust for Symbiotic Societies (RTSS) Workshop, co-located with ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[101]  arXiv:2405.01656 (cross-list from cs.CV) [pdf, other]
Title: S4: Self-Supervised Sensing Across the Spectrum
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[102]  arXiv:2405.01616 (cross-list from q-bio.BM) [pdf, other]
[103]  arXiv:2405.01615 (cross-list from cs.NE) [pdf, other]
Title: Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
Comments: 16 pages, including proofs in the appendix
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[104]  arXiv:2405.01606 (cross-list from quant-ph) [pdf, other]
Title: Improving Trainability of Variational Quantum Circuits via Regularization Strategies
Comments: preprint, under review. TL;DR: we propose a regularization strategy to improve the trainability of VQCs
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[105]  arXiv:2405.01604 (cross-list from q-fin.PM) [pdf, other]
Title: Portfolio Management using Deep Reinforcement Learning
Comments: 7 pages, 9 figures
Subjects: Portfolio Management (q-fin.PM); Machine Learning (cs.LG)
[106]  arXiv:2405.01601 (cross-list from cs.CL) [pdf, other]
Title: Efficient Sample-Specific Encoder Perturbations
Comments: To appear in NAACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[107]  arXiv:2405.01600 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification
Comments: 7 Pages double column, 5 figures, and 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[108]  arXiv:2405.01587 (cross-list from cs.CL) [pdf, ps, other]
Title: Improve Academic Query Resolution through BERT-based Question Extraction from Images
Journal-ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[109]  arXiv:2405.01584 (cross-list from cs.CL) [pdf, other]
Title: Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression
Comments: 12 pages, TKDE format
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Signal Processing (eess.SP)
[110]  arXiv:2405.01583 (cross-list from cs.CL) [pdf, other]
Title: MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
Authors: Nadia Saeed
Comments: 7 pages, 3 figures, Clinical NLP 2024 workshop proceedings in Shared Task
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[111]  arXiv:2405.01582 (cross-list from cs.CL) [pdf, other]
Title: Text Quality-Based Pruning for Efficient Training of Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[112]  arXiv:2405.01579 (cross-list from cs.SE) [pdf, other]
Title: Mining patterns in syntax trees to automate code reviews of student solutions for programming exercises
Subjects: Software Engineering (cs.SE); Computers and Society (cs.CY); Machine Learning (cs.LG)
[113]  arXiv:2405.01577 (cross-list from cs.CL) [pdf, other]
Title: HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[114]  arXiv:2405.01576 (cross-list from cs.CL) [pdf, other]
Title: Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI Assistant
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115]  arXiv:2405.01559 (cross-list from cs.SE) [pdf, other]
Title: Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks
Comments: accepted at 1st ACM CHI Workshop on Human-Notebook Interactions
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[116]  arXiv:2405.01558 (cross-list from cs.CV) [pdf, other]
Title: Configurable Learned Holography
Comments: 14 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optics (physics.optics)
[117]  arXiv:2405.01540 (cross-list from cs.AI) [pdf, other]
Title: Universal Imitation Games
Comments: 98 pages. arXiv admin note: substantial text overlap with arXiv:2402.18732
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Fri, 3 May 2024

[118]  arXiv:2405.01534 [pdf, other]
Title: Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks
Comments: Published at ICLR 2024. Website at this https URL 9 pages, 3 figures, 3 tables; 14 pages appendix (7 additional figures)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[119]  arXiv:2405.01531 [pdf, other]
Title: Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2405.01524 [pdf, other]
Title: A separability-based approach to quantifying generalization: which layer is best?
Comments: 6, pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2405.01507 [pdf, other]
Title: Accelerating Convergence in Bayesian Few-Shot Classification
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[122]  arXiv:2405.01488 [pdf, other]
[123]  arXiv:2405.01480 [pdf, other]
Title: Common pitfalls to avoid while using multiobjective optimization in machine learning
Comments: 21 pages, 12 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[124]  arXiv:2405.01468 [pdf, other]
Title: Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
Authors: Yifei Ming, Yixuan Li
Comments: The paper is accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2405.01462 [pdf, other]
Title: Uncertainty for Active Learning on Graphs
Subjects: Machine Learning (cs.LG)
[126]  arXiv:2405.01451 [pdf, other]
Title: Test-time Assessment of a Model's Performance on Unseen Domains via Optimal Transport
Subjects: Machine Learning (cs.LG)
[127]  arXiv:2405.01389 [pdf, other]
Title: Invariant Risk Minimization Is A Total Variation Model
Subjects: Machine Learning (cs.LG)
[128]  arXiv:2405.01365 [pdf, other]
Title: Dynamic Online Ensembles of Basis Expansions
Comments: 34 pages, 14 figures. Accepted to Transactions on Machine Learning Research (TMLR)
Journal-ref: Transactions on Machine Learning Research (TMLR), 2024
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[129]  arXiv:2405.01350 [pdf, other]
Title: Community-Invariant Graph Contrastive Learning
Comments: This paper is accepted by ICML-2024
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[130]  arXiv:2405.01349 [pdf, other]
Title: Position Paper: Beyond Robustness Against Single Attack Types
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[131]  arXiv:2405.01327 [pdf, other]
Title: Constrained Reinforcement Learning Under Model Mismatch
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[132]  arXiv:2405.01319 [pdf, other]
Title: Data Scoping: Effectively Learning the Evolution of Generic Transport PDEs
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[133]  arXiv:2405.01306 [pdf, other]
Title: Graph is all you need? Lightweight data-agnostic neural architecture search without training
Subjects: Machine Learning (cs.LG)
[134]  arXiv:2405.01270 [pdf, other]
Title: The Importance of Model Inspection for Better Understanding Performance Characteristics of Graph Neural Networks
Comments: International Symposium on Biomedical Imaging (ISBI)
Subjects: Machine Learning (cs.LG)
[135]  arXiv:2405.01263 [pdf, other]
Title: An Online Gradient-Based Caching Policy with Logarithmic Complexity and Regret Guarantees
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI); Operating Systems (cs.OS)
[136]  arXiv:2405.01261 [pdf, other]
Title: Continuously evolving rewards in an open-ended environment
Comments: 30 pages, 8 figures
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[137]  arXiv:2405.01260 [pdf, other]
Title: Causal Influence in Federated Edge Inference
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Signal Processing (eess.SP); Systems and Control (eess.SY)
[138]  arXiv:2405.01251 [pdf, other]
Title: Revisiting semi-supervised training objectives for differentiable particle filters
Comments: 5 pages, 2 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[139]  arXiv:2405.01247 [pdf, other]
Title: Lying Graph Convolution: Learning to Lie for Node Classification Tasks
Comments: Accepted to IJCNN2024
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[140]  arXiv:2405.01229 [pdf, ps, other]
Title: Boosting Jailbreak Attack with Momentum
Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
[141]  arXiv:2405.01207 [pdf, ps, other]
Title: Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features
Comments: Trustworthy Speech Processing, Satellite Workshop at ICASSP 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142]  arXiv:2405.01205 [pdf, other]
Title: Error-Driven Uncertainty Aware Training
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2405.01198 [pdf, other]
Title: Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[144]  arXiv:2405.01196 [pdf, other]
Title: Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[145]  arXiv:2405.01189 [pdf, other]
Title: Gradient-Congruity Guided Federated Sparse Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[146]  arXiv:2405.01186 [pdf, other]
Title: Potential Energy based Mixture Model for Noisy Label Learning
Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[147]  arXiv:2405.01158 [pdf, other]
Title: Interpretable Data-driven Anomaly Detection in Industrial Processes with ExIFFI
Comments: 6 pages, submitted to IEEE RTSI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[148]  arXiv:2405.01157 [pdf, other]
Title: Tabular and Deep Reinforcement Learning for Gittins Index
Subjects: Machine Learning (cs.LG); Performance (cs.PF); Machine Learning (stat.ML)
[149]  arXiv:2405.01155 [pdf, other]
Title: SynFlowNet: Towards Molecule Design with Guaranteed Synthesis Pathways
Comments: Presented at ICLR 2024 GEM Workshop
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[150]  arXiv:2405.01147 [pdf, other]
Title: Why Tabular Foundation Models Should Be a Research Priority
Comments: Accepted at International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (cs.LG)
[151]  arXiv:2405.01142 [pdf, other]
Title: Sharp Bounds for Sequential Federated Learning on Heterogeneous Data
Comments: arXiv admin note: text overlap with arXiv:2311.03154
Subjects: Machine Learning (cs.LG)
[152]  arXiv:2405.01125 [pdf, other]
Title: Lipschitz constant estimation for general neural network architectures using control tools
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
[153]  arXiv:2405.01114 [pdf, other]
Title: Continual Imitation Learning for Prosthetic Limbs
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[154]  arXiv:2405.01102 [pdf, other]
Title: Less is More: on the Over-Globalizing Problem in Graph Transformers
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[155]  arXiv:2405.01073 [pdf, other]
Title: Poisoning Attacks on Federated Learning for Autonomous Driving
Comments: Accepted to SCAI2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2405.01067 [pdf, other]
Title: AB-Training: A Communication-Efficient Approach for Distributed Low-Rank Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[157]  arXiv:2405.01060 [pdf, other]
Title: A text-based, generative deep learning model for soil reflectance spectrum simulation in the VIS-NIR (400-2499 nm) bands
Comments: The paper has been submitted to Remote sensing of Environment and revised
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[158]  arXiv:2405.01055 [pdf, ps, other]
Title: Leverage Multi-source Traffic Demand Data Fusion with Transformer Model for Urban Parking Prediction
Comments: 7 pages, 5 figures, under review by the 27th IEEE International Conference on Intelligent Transportation Systems (IEEE ITSC 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[159]  arXiv:2405.01053 [pdf, other]
Title: Explicitly Modeling Generality into Self-Supervised Learning
Comments: 28 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[160]  arXiv:2405.01052 [pdf, ps, other]
Title: Polynomial Chaos Expanded Gaussian Process
Comments: Manuscript: 20 pages, 4 figures, 7 tables
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[161]  arXiv:2405.01041 [pdf, other]
Title: Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation
Subjects: Machine Learning (cs.LG)
[162]  arXiv:2405.01033 [pdf, other]
Title: CrossMPT: Cross-attention Message-Passing Transformer for Error Correcting Codes
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT)
[163]  arXiv:2405.01031 [pdf, other]
Title: The Privacy Power of Correlated Noise in Decentralized Learning
Comments: Accepted as conference paper at ICML 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC); Machine Learning (stat.ML)
[164]  arXiv:2405.01013 [pdf, other]
Title: Non-clairvoyant Scheduling with Partial Predictions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS)
[165]  arXiv:2405.01010 [pdf, other]
Title: Efficient and Adaptive Posterior Sampling Algorithms for Bandits
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[166]  arXiv:2405.01009 [pdf, other]
Title: Tackling Graph Oversquashing by Global and Local Non-Dissipativity
Subjects: Machine Learning (cs.LG)
[167]  arXiv:2405.00987 [pdf, other]
Title: S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
Comments: Accepted for publication at ICLR 2024
Subjects: Machine Learning (cs.LG)
[168]  arXiv:2405.00985 [pdf, other]
Title: Progressive Feedforward Collapse of ResNet Training
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Statistics Theory (math.ST)
[169]  arXiv:2405.00984 [pdf, other]
Title: FREE: Faster and Better Data-Free Meta-Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[170]  arXiv:2405.00965 [pdf, other]
Title: Robust Decentralized Learning with Local Updates and Gradient Tracking
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[171]  arXiv:2405.00958 [pdf, other]
Title: Generative manufacturing systems using diffusion models and ChatGPT
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Systems and Control (eess.SY)
[172]  arXiv:2405.00957 [pdf, other]
Title: IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[173]  arXiv:2405.00955 [pdf, other]
Title: Recovering Labels from Local Updates in Federated Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[174]  arXiv:2405.00950 [pdf, other]
Title: Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Authors: Guojun Xiong, Jian Li
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[175]  arXiv:2405.00949 [pdf, other]
Title: The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Chemical Physics (physics.chem-ph); Biomolecules (q-bio.BM)
[176]  arXiv:2405.00946 [pdf, other]
Title: SparseTSF: Modeling Long-term Time Series Forecasting with 1k Parameters
Subjects: Machine Learning (cs.LG)
[177]  arXiv:2405.00937 [pdf, ps, other]
Title: New bounds on the cohesion of complete-link and other linkage methods for agglomeration clustering
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[178]  arXiv:2405.00922 [pdf, other]
Title: MTDT: A Multi-Task Deep Learning Digital Twin
Comments: 8 pages, 2 figures, 4 tables
Subjects: Machine Learning (cs.LG)
[179]  arXiv:2405.00910 [pdf, other]
Title: De-Biasing Models of Biased Decisions: A Comparison of Methods Using Mortgage Application Data
Authors: Nicholas Tenev
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Econometrics (econ.EM)
[180]  arXiv:2405.00909 [pdf, other]
Title: Quantum Federated Learning Experiments in the Cloud with Data Encoding
Comments: SIGCOMM 2024, Quantum Computing, Federated Learning, Qiskit
Subjects: Machine Learning (cs.LG); Emerging Technologies (cs.ET); Quantum Physics (quant-ph)
[181]  arXiv:2405.00902 [pdf, ps, other]
Title: MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Comments: Accepted to AAMAS 2024. 15 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[182]  arXiv:2405.00885 [pdf, other]
Title: WHALE-FL: Wireless and Heterogeneity Aware Latency Efficient Federated Learning over Mobile Devices via Adaptive Subnetwork Scheduling
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI); Image and Video Processing (eess.IV)
[183]  arXiv:2405.00879 [pdf, other]
Title: Machine Learning Techniques for Data Reduction of Climate Applications
Comments: 7 pages. arXiv admin note: text overlap with arXiv:2404.18063
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[184]  arXiv:2405.00877 [pdf, other]
Title: Markov flow policy -- deep MC
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[185]  arXiv:2405.00853 [pdf, ps, other]
Title: Efficient Algorithms for Learning Monophonic Halfspaces in Graphs
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[186]  arXiv:2405.00839 [pdf, other]
Title: Communication-Efficient Training Workload Balancing for Decentralized Multi-Agent Learning
Comments: This paper has been accepted for presentation at ICDCS (44th IEEE International Conference on Distributed Computing Systems). Keywords: decentralized multi-agent learning, federated learning, edge computing, heterogeneous agents, workload balancing, and communication-efficient training )
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Performance (cs.PF)
[187]  arXiv:2405.00837 [pdf, other]
Title: Locality Regularized Reconstruction: Structured Sparsity and Delaunay Triangulations
Comments: 26 pages, 8 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC); Machine Learning (stat.ML)
[188]  arXiv:2405.00819 [pdf, other]
Title: ICU Bloodstream Infection Prediction: A Transformer-Based Approach for EHR Analysis
Subjects: Machine Learning (cs.LG)
[189]  arXiv:2405.00792 [pdf, other]
Title: Error Exponent in Agnostic PAC Learning
Comments: paper with appendix to accepted ISIT2024 paper with the same name
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[190]  arXiv:2405.00747 [pdf, other]
Title: Soft Preference Optimization: Aligning Language Models to Expert Distributions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[191]  arXiv:2405.00746 [pdf, other]
Title: Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[192]  arXiv:2405.00743 [pdf, other]
Title: On the weight dynamics of learning networks
Subjects: Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD)
[193]  arXiv:2405.00739 [pdf, other]
Title: Why does Knowledge Distillation Work? Rethink its Attention and Fidelity Mechanism
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[194]  arXiv:2405.01538 (cross-list from cs.CV) [pdf, other]
Title: Multi-Space Alignments Towards Universal LiDAR Segmentation
Comments: CVPR 2024; 33 pages, 14 figures, 14 tables; Code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[195]  arXiv:2405.01536 (cross-list from cs.CV) [pdf, other]
Title: Customizing Text-to-Image Models with a Single Image Pair
Comments: project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[196]  arXiv:2405.01521 (cross-list from cs.CV) [pdf, other]
Title: Transformer-Aided Semantic Communications
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)
[197]  arXiv:2405.01502 (cross-list from cs.CL) [pdf, other]
Title: Analyzing the Role of Semantic Representations in the Era of Large Language Models
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[198]  arXiv:2405.01494 (cross-list from cs.CV) [pdf, other]
Title: Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[199]  arXiv:2405.01491 (cross-list from physics.chem-ph) [pdf, other]
Title: FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network Potentials
Subjects: Chemical Physics (physics.chem-ph); Machine Learning (cs.LG)
[200]  arXiv:2405.01484 (cross-list from cs.HC) [pdf, other]
Title: Designing Algorithmic Recommendations to Achieve Human-AI Complementarity
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Econometrics (econ.EM); Machine Learning (stat.ML)
[201]  arXiv:2405.01481 (cross-list from cs.CL) [pdf, other]
Title: NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
Comments: 13 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[202]  arXiv:2405.01463 (cross-list from econ.EM) [pdf, ps, other]
Title: Dynamic Local Average Treatment Effects
Subjects: Econometrics (econ.EM); Machine Learning (cs.LG); Methodology (stat.ME)
[203]  arXiv:2405.01460 (cross-list from cs.CR) [pdf, other]
Title: Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders
Comments: Accepted by ICML 2024
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[204]  arXiv:2405.01458 (cross-list from cs.CL) [pdf, other]
Title: UQA: Corpus for Urdu Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[205]  arXiv:2405.01453 (cross-list from cs.AI) [pdf, other]
Title: Creative Problem Solving in Large Language and Vision Models -- What Would it Take?
Comments: 9 pages, 7 figures, 2 tables
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[206]  arXiv:2405.01440 (cross-list from cs.RO) [pdf, other]
Title: A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving
Comments: Accepted at "Interaction-driven Behavior Prediction and Planning for Autonomous Vehicles" workshop in 35th IEEE Intelligent Vehicles Symposium (IV 2024)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[207]  arXiv:2405.01435 (cross-list from cs.NI) [pdf, other]
Title: Closed-form congestion control via deep symbolic regression
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[208]  arXiv:2405.01425 (cross-list from cs.DS) [pdf, other]
Title: In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies
Comments: 32 pages
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[209]  arXiv:2405.01413 (cross-list from cs.CV) [pdf, other]
Title: MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
Comments: 17 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[210]  arXiv:2405.01404 (cross-list from stat.ML) [pdf, other]
Title: Random Pareto front surfaces
Comments: The code is available at: this https URL
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC); Methodology (stat.ME)
[211]  arXiv:2405.01402 (cross-list from cs.RO) [pdf, other]
Title: Learning Force Control for Legged Manipulation
Comments: This work has been accepted to ICRA24, as well as the Loco-manipulation workshop at ICRA24
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[212]  arXiv:2405.01392 (cross-list from cs.RO) [pdf, other]
Title: LLMSat: A Large Language Model-Based Goal-Oriented Agent for Autonomous Space Exploration
Authors: David Maranto
Comments: B.A.Sc thesis
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Space Physics (physics.space-ph)
[213]  arXiv:2405.01314 (cross-list from eess.SY) [pdf, other]
Title: Non-iterative Optimization of Trajectory and Radio Resource for Aerial Network
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[214]  arXiv:2405.01299 (cross-list from cs.CL) [pdf, other]
Title: The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
Comments: LREC-COLING NLPerspectives workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[215]  arXiv:2405.01292 (cross-list from math.OC) [pdf, ps, other]
Title: Koopman Data-Driven Predictive Control with Robust Stability and Recursive Feasibility Guarantees
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
[216]  arXiv:2405.01284 (cross-list from cs.RO) [pdf, other]
Title: Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning
Authors: Liu Qiyuan
Comments: 50 pages, 30 figures, Final Year Project Report at Nanyang Technological University, Singapore This article is an NTU FYP report. The formal paper is still in the preparation process
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[217]  arXiv:2405.01277 (cross-list from cs.HC) [pdf, other]
Title: Quantifying Spatial Domain Explanations in BCI using Earth Mover's Distance
Comments: 8 pages, 3 figures, 3 tables, draft of the accepted work at IJCNN, WCCI 2024
Subjects: Human-Computer Interaction (cs.HC); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[218]  arXiv:2405.01249 (cross-list from cs.CL) [pdf, ps, other]
Title: Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[219]  arXiv:2405.01242 (cross-list from cs.SD) [pdf, other]
Title: TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[220]  arXiv:2405.01233 (cross-list from q-fin.MF) [pdf, other]
Title: Mathematics of Differential Machine Learning in Derivative Pricing and Hedging
Subjects: Mathematical Finance (q-fin.MF); Machine Learning (cs.LG); Computational Finance (q-fin.CP)
[221]  arXiv:2405.01200 (cross-list from eess.SY) [pdf, other]
Title: Learning-to-solve unit commitment based on few-shot physics-guided spatial-temporal graph convolution network
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[222]  arXiv:2405.01134 (cross-list from cs.RO) [pdf, other]
Title: Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space
Comments: Accepted for publication at the 2024 International Conference on Space Robotics (iSpaRo) | The source code is available at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[223]  arXiv:2405.01124 (cross-list from stat.ML) [pdf, other]
Title: Investigating Self-Supervised Image Denoising with Denaturation
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Statistics Theory (math.ST)
[224]  arXiv:2405.01109 (cross-list from math.NA) [pdf, other]
Title: Hypergraph $p$-Laplacian regularization on point clouds for data interpolation
Comments: 33 pages
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Analysis of PDEs (math.AP)
[225]  arXiv:2405.01108 (cross-list from cs.CV) [pdf, other]
Title: Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[226]  arXiv:2405.01098 (cross-list from quant-ph) [pdf, ps, other]
Title: Multivariate trace estimation using quantum state space linear algebra
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[227]  arXiv:2405.01063 (cross-list from cs.IR) [pdf, other]
Title: Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach
Comments: 8 pages, 5 figures
Subjects: Information Retrieval (cs.IR); Computers and Society (cs.CY); Machine Learning (cs.LG)
[228]  arXiv:2405.01054 (cross-list from cs.RO) [pdf, other]
Title: Continual Learning for Robust Gate Detection under Dynamic Lighting in Autonomous Drone Racing
Comments: 8 pages, 6 figures, in 2024 International Joint Conference on Neural Networks (IJCNN)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[229]  arXiv:2405.01035 (cross-list from cs.GT) [pdf, other]
Title: LOQA: Learning with Opponent Q-Learning Awareness
Comments: accepted to ICLR but still not in proceedings this https URL
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[230]  arXiv:2405.01029 (cross-list from cs.AI) [pdf, other]
Title: MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts
Comments: Accepted at ICML 2024
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[231]  arXiv:2405.01015 (cross-list from stat.ML) [pdf, other]
Title: Network reconstruction via the minimum description length principle
Authors: Tiago P. Peixoto
Comments: 17 pages, 10 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Data Analysis, Statistics and Probability (physics.data-an); Populations and Evolution (q-bio.PE)
[232]  arXiv:2405.01004 (cross-list from cs.SD) [pdf, ps, other]
Title: Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[233]  arXiv:2405.01002 (cross-list from cs.CV) [pdf, other]
Title: Spider: A Unified Framework for Context-dependent Concept Understanding
Comments: Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[234]  arXiv:2405.00989 (cross-list from cs.CV) [pdf, ps, other]
Title: Estimate the building height at a 10-meter resolution based on Sentinel data
Authors: Xin Yan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[235]  arXiv:2405.00988 (cross-list from cs.CL) [pdf, other]
Title: Context-Aware Clustering using Large Language Models
Comments: 16 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[236]  arXiv:2405.00972 (cross-list from cs.CL) [pdf, other]
Title: CACTUS: Chemistry Agent Connecting Tool-Usage to Science
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph); Quantitative Methods (q-bio.QM)
[237]  arXiv:2405.00964 (cross-list from math.ST) [pdf, other]
Title: Deriving Lehmer and Hölder means as maximum weighted likelihood estimates for the multivariate exponential family
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG)
[238]  arXiv:2405.00934 (cross-list from eess.AS) [pdf, ps, other]
Title: Benchmarking Representations for Speech, Music, and Acoustic Events
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[239]  arXiv:2405.00915 (cross-list from cs.CV) [pdf, other]
Title: EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion
Comments: 25 pages. 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[240]  arXiv:2405.00914 (cross-list from math.OC) [pdf, other]
Title: Accelerated Fully First-Order Methods for Bilevel and Minimax Optimization
Authors: Chris Junchi Li
Comments: arXiv admin note: text overlap with arXiv:2307.00126
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[241]  arXiv:2405.00908 (cross-list from cs.CV) [pdf, ps, other]
Title: Transformer-Based Self-Supervised Learning for Histopathological Classification of Ischemic Stroke Clot Origin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[242]  arXiv:2405.00906 (cross-list from cs.CV) [pdf, other]
Title: LOTUS: Improving Transformer Efficiency with Sparsity Pruning and Data Lottery Tickets
Authors: Ojasw Upadhyay
Comments: 3 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[243]  arXiv:2405.00876 (cross-list from cs.CV) [pdf, other]
Title: Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[244]  arXiv:2405.00871 (cross-list from eess.SY) [pdf, other]
Title: Learning to Boost the Performance of Stable Nonlinear Systems
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[245]  arXiv:2405.00846 (cross-list from cs.RO) [pdf, other]
Title: Gameplay Filters: Safe Robot Walking through Adversarial Imagination
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[246]  arXiv:2405.00842 (cross-list from math.ST) [pdf, other]
Title: Quickest Change Detection with Confusing Change
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP); Optimization and Control (math.OC)
[247]  arXiv:2405.00820 (cross-list from cs.AR) [pdf, other]
Title: HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[248]  arXiv:2405.00816 (cross-list from cs.SI) [pdf, ps, other]
Title: Sifting out communities in large sparse networks
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[249]  arXiv:2405.00790 (cross-list from cs.AR) [pdf, other]
Title: SCAR: Scheduling Multi-Model AI Workloads on Heterogeneous Multi-Chiplet Module Accelerators
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Performance (cs.PF)
[250]  arXiv:2405.00782 (cross-list from math.DS) [pdf, other]
Title: Rigged Dynamic Mode Decomposition: Data-Driven Generalized Eigenfunction Decompositions for Koopman Operators
Subjects: Dynamical Systems (math.DS); Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC); Spectral Theory (math.SP)
[251]  arXiv:2405.00781 (cross-list from quant-ph) [pdf, other]
Title: A Review of Barren Plateaus in Variational Quantum Computing
Comments: 21 pages, 10 boxes
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG); Machine Learning (stat.ML)
[252]  arXiv:2405.00770 (cross-list from quant-ph) [pdf, other]
Title: Quantum-Classical Separations in Shallow-Circuit-Based Learning with and without Noises
Comments: 14 pages, 3 figures
Subjects: Quantum Physics (quant-ph); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[253]  arXiv:2405.00755 (cross-list from cs.ET) [pdf, other]
Title: Quantum AI for Alzheimer's disease early screening
Comments: 18 pages, 6 figures
Subjects: Emerging Technologies (cs.ET); Machine Learning (cs.LG); Quantum Physics (quant-ph)
[254]  arXiv:2405.00754 (cross-list from cs.CV) [pdf, other]
Title: CLIPArTT: Light-weight Adaptation of CLIP to New Domains at Test Time
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[255]  arXiv:2405.00751 (cross-list from q-bio.QM) [pdf, other]
Title: F$^3$low: Frame-to-Frame Coarse-grained Molecular Dynamics with SE(3) Guided Flow Matching
Comments: Accepted by ICLR 2024 GEM workshop
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[256]  arXiv:2405.00749 (cross-list from cs.CV) [pdf, other]
Title: More is Better: Deep Domain Adaptation with Multiple Sources
Comments: Accepted by IJCAI 2024. arXiv admin note: text overlap with arXiv:2002.12169
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[257]  arXiv:2405.00742 (cross-list from cs.CR) [pdf, other]
Title: Federated Graph Learning for EV Charging Demand Forecasting with Personalization Against Cyberattacks
Comments: 11 pages,4 figures
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[258]  arXiv:2405.00740 (cross-list from cs.CV) [pdf, other]
Title: Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Comments: 14 pages, 8 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[259]  arXiv:2405.00738 (cross-list from cs.AR) [pdf, other]
Title: HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis
Comments: 7 pages, 2 figures
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[260]  arXiv:2405.00736 (cross-list from eess.SP) [pdf, other]
Title: Joint Signal Detection and Automatic Modulation Classification via Deep Learning
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[261]  arXiv:2405.00734 (cross-list from eess.SP) [pdf, other]
Title: EEG-MACS: Manifold Attention and Confidence Stratification for EEG-based Cross-Center Brain Disease Diagnosis under Unreliable Annotations
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[262]  arXiv:2405.00732 (cross-list from cs.CL) [pdf, other]
Title: LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[263]  arXiv:2405.00727 (cross-list from eess.SP) [pdf, other]
Title: Generalised envelope spectrum-based signal-to-noise objectives: Formulation, optimisation and application for gear fault detection under time-varying speed conditions
Comments: 27 pages, 15 figures, tables 1, submitted MSSP review
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Methodology (stat.ME)
[264]  arXiv:2405.00725 (cross-list from eess.SP) [pdf, other]
Title: Federated Learning and Differential Privacy Techniques on Multi-hospital Population-scale Electrocardiogram Data
Comments: Accepted for ICMHI 2024
Subjects: Signal Processing (eess.SP); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[265]  arXiv:2405.00724 (cross-list from eess.SP) [pdf, other]
Title: Baseline Drift Tolerant Signal Encoding for ECG Classification with Deep Learning
Comments: 4 pages, 3 figures. Submitted to 46th Annual International Conference of the IEEE Engineering in Medicine and Biology 2024
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[266]  arXiv:2405.00723 (cross-list from eess.SP) [pdf, other]
Title: EEG_RL-Net: Enhancing EEG MI Classification through Reinforcement Learning-Optimised Graph Neural Networks
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[267]  arXiv:2405.00721 (cross-list from eess.SP) [pdf, other]
Title: Optimizing Brain-Computer Interface Performance: Advancing EEG Signals Channel Selection through Regularized CSP and SPEA II Multi-Objective Optimization
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[268]  arXiv:2405.00720 (cross-list from eess.SP) [pdf, other]
Title: A Novel Machine Learning-based Equalizer for a Downstream 100G PAM-4 PON
Comments: 3 pages, 6 figures, accepted by Optical Fiber Communications Conference and Exhibition 2024
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[269]  arXiv:2405.00719 (cross-list from eess.SP) [pdf, other]
Title: EEG-Deformer: A Dense Convolutional Transformer for Brain-computer Interfaces
Comments: 10 pages, 9 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[270]  arXiv:2405.00715 (cross-list from cs.CL) [pdf, other]
Title: Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[271]  arXiv:2405.00712 (cross-list from eess.SP) [pdf, other]
Title: SoK: Behind the Accuracy of Complex Human Activity Recognition Using Deep Learning
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[272]  arXiv:2405.00710 (cross-list from cs.CL) [pdf, ps, other]
Title: Homonym Sense Disambiguation in the Georgian Language
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[273]  arXiv:2405.00709 (cross-list from cs.CL) [pdf, other]
Title: Evaluating Tool-Augmented Agents in Remote Sensing Platforms
Comments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[274]  arXiv:2405.00708 (cross-list from cs.CL) [pdf, other]
Title: Interactive Analysis of LLMs using Meaningful Counterfactuals
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[275]  arXiv:2405.00705 (cross-list from cs.CL) [pdf, other]
Title: SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[276]  arXiv:2405.00699 (cross-list from cs.NE) [pdf, other]
Title: Direct Training Needs Regularisation: Anytime Optimal Inference Spiking Neural Network
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[277]  arXiv:2405.00697 (cross-list from q-fin.CP) [pdf, other]
Title: Pricing Catastrophe Bonds -- A Probabilistic Machine Learning Approach
Subjects: Computational Finance (q-fin.CP); Machine Learning (cs.LG); Pricing of Securities (q-fin.PR); Applications (stat.AP)
[278]  arXiv:2405.00695 (cross-list from cs.RO) [pdf, other]
Title: Joint torques prediction of a robotic arm using neural networks
Comments: 6 pages, 5 figures, submitted to CASE 2024
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[279]  arXiv:2405.00688 (cross-list from cs.RO) [pdf, ps, other]
Title: Understanding Social Perception, Interactions, and Safety Aspects of Sidewalk Delivery Robots Using Sentiment Analysis
Authors: Yuchen Du, Tho V. Le
Comments: 34 pages, 7 figures, 2 tables
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[280]  arXiv:2405.00678 (cross-list from eess.SP) [pdf, ps, other]
Title: Low-cost modular devices for on-road vehicle detection and characterisation
Comments: 17 pages
Journal-ref: Poza Lujan, JL., Uribe Chavert, P., Posadas-Yag\"ue, JL. Lowcost modular devices for onroad vehicle detection and characterisation. Des Autom Embed Syst 27, 85.102 (2023)
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)

Thu, 2 May 2024

[281]  arXiv:2405.00675 [pdf, other]
Title: Self-Play Preference Optimization for Language Model Alignment
Comments: 25 pages, 4 figures, 5 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[282]  arXiv:2405.00662 [pdf, other]
Title: No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Comments: Code and run histories are available at this https URL
Subjects: Machine Learning (cs.LG)
[283]  arXiv:2405.00645 [pdf, other]
Title: Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip
Subjects: Machine Learning (cs.LG); Instrumentation and Detectors (physics.ins-det)
[284]  arXiv:2405.00629 [pdf, other]
Title: HUGO -- Highlighting Unseen Grid Options: Combining Deep Reinforcement Learning with a Heuristic Target Topology Approach
Comments: 12 pages + 2 pages references, 9 Figures, submission planed in Sustainable Energy, Grids and Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[285]  arXiv:2405.00625 [pdf, other]
Title: Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[286]  arXiv:2405.00614 [pdf, other]
Title: Multigroup Robustness
Subjects: Machine Learning (cs.LG)
[287]  arXiv:2405.00577 [pdf, ps, other]
Title: Discovering robust biomarkers of neurological disorders from functional MRI using graph neural networks: A Review
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Neurons and Cognition (q-bio.NC)
[288]  arXiv:2405.00570 [pdf, other]
Title: WEST GCN-LSTM: Weighted Stacked Spatio-Temporal Graph Neural Networks for Regional Traffic Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[289]  arXiv:2405.00556 [pdf, other]
Title: Swarm Learning: A Survey of Concepts, Applications, and Trends
Comments: 31 pages
Subjects: Machine Learning (cs.LG)
[290]  arXiv:2405.00555 [pdf, other]
Title: Derivative-based regularization for regression
Subjects: Machine Learning (cs.LG)
[291]  arXiv:2405.00524 [pdf, ps, other]
Title: FMLFS: A federated multi-label feature selection based on information theory in IoT environment
Comments: This paper has been accepted by IEEE SmartComp 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Networking and Internet Architecture (cs.NI)
[292]  arXiv:2405.00516 [pdf, other]
Title: Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
Comments: ACM 2024, Avila Spain. 9 pages
Journal-ref: ACM SAC Conference 2024, Avila, Spain, Article 4, 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[293]  arXiv:2405.00491 [pdf, ps, other]
Title: On the Relevance of Byzantine Robust Optimization Against Data Poisoning
Comments: 38 pages
Subjects: Machine Learning (cs.LG)
[294]  arXiv:2405.00489 [pdf, other]
Title: Explainable Automatic Grading with Neural Additive Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Applications (stat.AP)
[295]  arXiv:2405.00476 [pdf, other]
Title: A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges
Comments: Under review of PVLDB2025
Subjects: Machine Learning (cs.LG)
[296]  arXiv:2405.00456 [pdf, other]
Title: Counterfactual Explanations for Deep Learning-Based Traffic Forecasting
Comments: 24 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[297]  arXiv:2405.00454 [pdf, ps, other]
Title: Robust Semi-supervised Learning via $f$-Divergence and $α$-Rényi Divergence
Comments: Accepted in ISIT 2024
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[298]  arXiv:2405.00449 [pdf, other]
Title: RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Neural and Evolutionary Computing (cs.NE)
[299]  arXiv:2405.00438 [pdf, other]
Title: MetaRM: Shifted Distributions Alignment via Meta-Learning
Comments: 11 pages, 6 figures. arXiv admin note: text overlap with arXiv:2401.06080
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[300]  arXiv:2405.00433 [pdf, other]
Title: Weight Sparsity Complements Activity Sparsity in Neuromorphic Language Models
Comments: arXiv admin note: text overlap with arXiv:2311.07625
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[301]  arXiv:2405.00417 [pdf, other]
Title: Conformal Risk Control for Ordinal Classification
Comments: 17 pages, 8 figures, 2 table; 1 supplementary page
Journal-ref: In UAI 2023: The 39th Conference on Uncertainty in Artificial Intelligence
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[302]  arXiv:2405.00410 [pdf, other]
Title: UCB-driven Utility Function Search for Multi-objective Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[303]  arXiv:2405.00349 [pdf, other]
Title: A Self-explaining Neural Architecture for Generalizable Concept Learning
Comments: IJCAI 2024
Subjects: Machine Learning (cs.LG)
[304]  arXiv:2405.00348 [pdf, other]
Title: Practical Dataset Distillation Based on Deep Support Vectors
Subjects: Machine Learning (cs.LG)
[305]  arXiv:2405.00334 [pdf, other]
Title: A Survey on Deep Active Learning: Recent Advances and New Frontiers
Comments: This paper is accepted by IEEE Transactions on Neural Networks and Learning Systems
Subjects: Machine Learning (cs.LG)
[306]  arXiv:2405.00319 [pdf, other]
Title: Data Augmentation Policy Search for Long-Term Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[307]  arXiv:2405.00314 [pdf, other]
Title: Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Performance (cs.PF)
[308]  arXiv:2405.00311 [pdf, ps, other]
Title: Three-layer deep learning network random trees for fault diagnosis in chemical production process
Subjects: Machine Learning (cs.LG)
[309]  arXiv:2405.00303 [pdf, other]
Title: Joint Optimization of Piecewise Linear Ensembles
Comments: 7 pages, 4 figures, submitted to IEEE MLSP 2024
Subjects: Machine Learning (cs.LG)
[310]  arXiv:2405.00220 [pdf, other]
Title: Context-Aware Mobile Network Performance Prediction Using Network & Remote Sensing Data
Comments: Accepted at the 17th International Workshop on AI-ML-Powered Autonomous Telco Networks - IEEE International Conference on Communications (ICC) 2024
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[311]  arXiv:2405.00219 [pdf, ps, other]
Title: Machine Learning-based Estimation of Respiratory Fluctuations in a Healthy Adult Population using BOLD fMRI and Head Motion Parameters
Comments: 6 pages, 5 figure, conference abstract
Subjects: Machine Learning (cs.LG)
[312]  arXiv:2405.00217 [pdf, other]
Title: GMC-PINNs: A new general Monte Carlo PINNs method for solving fractional partial differential equations on irregular domains
Subjects: Machine Learning (cs.LG)
[313]  arXiv:2405.00213 [pdf, other]
Title: Block-As-Domain Adaptation for Workload Prediction from fNIRS Data
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Signal Processing (eess.SP)
[314]  arXiv:2405.00202 [pdf, other]
Title: Leveraging Active Subspaces to Capture Epistemic Model Uncertainty in Deep Generative Models for Molecular Design
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[315]  arXiv:2405.00184 [pdf, other]
Title: Semi-Supervised Hierarchical Multi-Label Classifier Based on Local Information
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[316]  arXiv:2405.00182 [pdf, other]
Title: M-DEW: Extending Dynamic Ensemble Weighting to Handle Missing Values
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[317]  arXiv:2405.00172 [pdf, other]
Title: Re-visiting Skip-Gram Negative Sampling: Dimension Regularization for More Efficient Dissimilarity Preservation in Graph Embeddings
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[318]  arXiv:2405.00166 [pdf, other]
Title: Discovering intrinsic multi-compartment pharmacometric models using Physics Informed Neural Networks
Comments: Accepted into the International conference on Scientific Computation and Machine Learning 2024 (SCML 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[319]  arXiv:2405.00142 [pdf, other]
Title: Utilizing Machine Learning and 3D Neuroimaging to Predict Hearing Loss: A Comparative Analysis of Dimensionality Reduction and Regression Techniques
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[320]  arXiv:2405.00136 [pdf, other]
Title: Data-Driven Permissible Safe Control with Barrier Certificates
Subjects: Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
[321]  arXiv:2405.00123 [pdf, other]
Title: Graph Neural Network Approach to Semantic Type Detection in Tables
Journal-ref: In Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 121-133. Singapore: Springer Nature Singapore, 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[322]  arXiv:2405.00080 [pdf, other]
Title: Recommenadation aided Caching using Combinatorial Multi-armed Bandits
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Networking and Internet Architecture (cs.NI)
[323]  arXiv:2405.00077 [pdf, other]
Title: BrainODE: Dynamic Brain Signal Analysis via Graph-Aided Neural Ordinary Differential Equations
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[324]  arXiv:2405.00076 [pdf, ps, other]
Title: On Correcting SHAP Scores
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[325]  arXiv:2405.00074 [pdf, other]
Title: PAODING: A High-fidelity Data-free Pruning Toolkit for Debloating Pre-trained Neural Networks
Comments: 3 pages
Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE)
[326]  arXiv:2405.00664 (cross-list from cs.CL) [pdf, other]
Title: Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[327]  arXiv:2405.00657 (cross-list from cs.CL) [pdf, other]
Title: RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization
Comments: NAACL 2024 Main & Long Conference Paper (Oral Presentation)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[328]  arXiv:2405.00647 (cross-list from physics.med-ph) [pdf, other]
Title: Screening of BindingDB database ligands against EGFR, HER2, Estrogen, Progesterone and NF-kB receptors based on machine learning and molecular docking
Subjects: Medical Physics (physics.med-ph); Machine Learning (cs.LG)
[329]  arXiv:2405.00646 (cross-list from cs.CV) [pdf, other]
Title: Learning to Compose: Improving Object Centric Learning by Injecting Compositionality
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[330]  arXiv:2405.00642 (cross-list from stat.ML) [pdf, other]
Title: From Empirical Observations to Universality: Dynamics of Deep Learning with Inputs Built on Gaussian mixture
Comments: 19 pages, 9 figures
Subjects: Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG)
[331]  arXiv:2405.00636 (cross-list from physics.soc-ph) [pdf, other]
Title: Robustness of graph embedding methods for community detection
Comments: 17 pages, 26 figures, 3 tables. Comments are welcome
Subjects: Physics and Society (physics.soc-ph); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Data Analysis, Statistics and Probability (physics.data-an)
[332]  arXiv:2405.00627 (cross-list from eess.SY) [pdf, other]
Title: Koopman-based Deep Learning for Nonlinear System Estimation
Comments: 11 pages
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[333]  arXiv:2405.00622 (cross-list from cs.CL) [pdf, other]
Title: Causal Evaluation of Language Models
Comments: 315 pages, 230 figures, 21 tables. Project website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[334]  arXiv:2405.00602 (cross-list from cs.CL) [pdf, other]
Title: Investigating Automatic Scoring and Feedback using Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[335]  arXiv:2405.00592 (cross-list from stat.ML) [pdf, other]
Title: Scaling and renormalization in high-dimensional regression
Comments: 64 pages, 16 figures
Subjects: Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[336]  arXiv:2405.00588 (cross-list from cs.CL) [pdf, other]
Title: Are Models Biased on Text without Gender-related Language?
Comments: In International Conference on Learning Representations 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[337]  arXiv:2405.00532 (cross-list from cs.AI) [pdf, other]
Title: ULLER: A Unified Language for Learning and Reasoning
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[338]  arXiv:2405.00505 (cross-list from cs.IR) [pdf, other]
Title: KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents
Comments: accepted ICDAR2024
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[339]  arXiv:2405.00482 (cross-list from cs.CR) [pdf, other]
Title: PackVFL: Efficient HE Packing for Vertical Federated Learning
Comments: 12 pages excluding references
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[340]  arXiv:2405.00451 (cross-list from cs.AI) [pdf, other]
Title: Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[341]  arXiv:2405.00442 (cross-list from stat.ML) [pdf, other]
Title: Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration
Comments: This paper is under consideration at Pattern Recognition Letters
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[342]  arXiv:2405.00420 (cross-list from cs.CV) [pdf, other]
Title: Self-supervised Pre-training of Text Recognizers
Comments: 18 pages, 6 figures, 4 tables, accepted to ICDAR24
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[343]  arXiv:2405.00394 (cross-list from cs.GT) [pdf, other]
Title: Enhancing Mutual Trustworthiness in Federated Learning for Data-Rich Smart Cities
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[344]  arXiv:2405.00389 (cross-list from math.OC) [pdf, other]
Title: Employing Federated Learning for Training Autonomous HVAC Systems
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
[345]  arXiv:2405.00387 (cross-list from cs.NI) [pdf, other]
Title: Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[346]  arXiv:2405.00385 (cross-list from stat.ML) [pdf, other]
Title: Variational Bayesian Methods for a Tree-Structured Stick-Breaking Process Mixture of Gaussians
Authors: Yuta Nakahara
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
[347]  arXiv:2405.00358 (cross-list from cs.AI) [pdf, other]
Title: Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
Comments: Accepted by LREC-COLING 2024 (long paper, camera-ready version)
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[348]  arXiv:2405.00332 (cross-list from cs.CL) [pdf, other]
Title: A Careful Examination of Large Language Model Performance on Grade School Arithmetic
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[349]  arXiv:2405.00318 (cross-list from cs.NE) [pdf, other]
Title: Covariant spatio-temporal receptive fields for neuromorphic computing
Comments: Code available at this https URL
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[350]  arXiv:2405.00304 (cross-list from quant-ph) [pdf, other]
Title: QUACK: Quantum Aligned Centroid Kernel
Comments: Submitted to IEEE International Conference on Quantum Computing and Engineering (QCE) 2024
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[351]  arXiv:2405.00287 (cross-list from cs.IR) [pdf, other]
Title: Stochastic Sampling for Contrastive Views and Hard Negative Samples in Graph-based Collaborative Filtering
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[352]  arXiv:2405.00285 (cross-list from cs.AI) [pdf, other]
Title: iMTSP: Solving Min-Max Multiple Traveling Salesman Problem with Imperative Learning
Comments: 8 pages, 3 figures, 3 tables
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[353]  arXiv:2405.00282 (cross-list from math.OC) [pdf, ps, other]
Title: MF-OML: Online Mean-Field Reinforcement Learning with Occupation Measures for Large Population Games
Authors: Anran Hu, Junzi Zhang
Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[354]  arXiv:2405.00263 (cross-list from cs.CL) [pdf, other]
Title: Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[355]  arXiv:2405.00254 (cross-list from cs.AI) [pdf, other]
Title: Principled RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[356]  arXiv:2405.00252 (cross-list from quant-ph) [pdf, other]
Title: Hybrid Quantum-Classical Scheduling for Accelerating Neural Network Training with Newton's Gradient Descent
Comments: Our code is provided at this https URL
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[357]  arXiv:2405.00251 (cross-list from cs.CV) [pdf, other]
Title: Semantically Consistent Video Inpainting with Conditional Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[358]  arXiv:2405.00239 (cross-list from eess.IV) [pdf, other]
Title: IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images
Comments: 12 pages, 6 figures, 1 table
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[359]  arXiv:2405.00236 (cross-list from cs.RO) [pdf, other]
Title: STT: Stateful Tracking with Transformers for Autonomous Driving
Comments: ICRA 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[360]  arXiv:2405.00218 (cross-list from cs.CR) [pdf, other]
Title: Constrained Decoding for Secure Code Generation
Comments: 17 pages, 8 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Software Engineering (cs.SE)
[361]  arXiv:2405.00216 (cross-list from cs.CL) [pdf, other]
Title: Graphical Reasoning: LLM-based Semi-Open Relation Extraction
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[362]  arXiv:2405.00205 (cross-list from cs.AI) [pdf, ps, other]
Title: A Logic for Reasoning About Aggregate-Combine Graph Neural Networks
Comments: arXiv admin note: text overlap with arXiv:2307.05150
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[363]  arXiv:2405.00158 (cross-list from stat.ME) [pdf, other]
Title: BayesBlend: Easy Model Blending using Pseudo-Bayesian Model Averaging, Stacking and Hierarchical Stacking in Python
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[364]  arXiv:2405.00156 (cross-list from cs.CV) [pdf, other]
Title: Expanding the Horizon: Enabling Hybrid Quantum Transfer Learning for Long-Tailed Chest X-Ray Classification
Comments: 11 pages, 13 figures, 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantum Physics (quant-ph)
[365]  arXiv:2405.00130 (cross-list from eess.IV) [pdf, other]
Title: A Flexible 2.5D Medical Image Segmentation Approach with In-Slice and Cross-Slice Attention
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[366]  arXiv:2405.00099 (cross-list from cs.AI) [pdf, other]
Title: Creative Beam Search
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[367]  arXiv:2405.00082 (cross-list from quant-ph) [pdf, other]
Title: Structure learning of Hamiltonians from real-time evolution
Comments: 50 pages
Subjects: Quantum Physics (quant-ph); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[368]  arXiv:2405.00065 (cross-list from math.OC) [pdf, other]
Title: From Linear to Linearizable Optimization: A Novel Framework with Applications to Stationary and Non-stationary DR-submodular Optimization
Subjects: Optimization and Control (math.OC); Computational Complexity (cs.CC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[369]  arXiv:2405.00055 (cross-list from eess.SY) [pdf, other]
Title: A Hybrid Probabilistic Battery Health Management Approach for Robust Inspection Drone Operations
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[370]  arXiv:2405.00027 (cross-list from cs.CV) [pdf, other]
Title: Multidimensional Compressed Sensing for Spectral Light Field Imaging
Comments: 8 pages, published of VISAPP 2024
Journal-ref: In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP 2024, ISBN 978-989-758-679-8, ISSN 2184-4321, pages 349-356
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[371]  arXiv:2405.00025 (cross-list from cs.CV) [pdf, other]
Title: Leveraging Pre-trained CNNs for Efficient Feature Extraction in Rice Leaf Disease Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[372]  arXiv:2405.00017 (cross-list from cs.DC) [pdf, other]
Title: Queuing dynamics of asynchronous Federated Learning
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[373]  arXiv:2404.17123 (cross-list from cs.CL) [pdf, ps, other]
Title: Text Sentiment Analysis and Classification Based on Bidirectional Gated Recurrent Units (GRUs) Model
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)

Wed, 1 May 2024

[374]  arXiv:2404.19756 [pdf, other]
Title: KAN: Kolmogorov-Arnold Networks
Comments: 48 pages, 20 figures. Codes are available at this https URL
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[375]  arXiv:2404.19725 [pdf, other]
Title: Fairness Without Demographics in Human-Centered Federated Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[376]  arXiv:2404.19719 [pdf, other]
Title: The lazy (NTK) and rich ($μ$P) regimes: a gentle tutorial
Authors: Dhruva Karkada
Comments: 22 pages, 7 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[377]  arXiv:2404.19710 [pdf, other]
Title: A rank decomposition for the topological classification of neural representations
Subjects: Machine Learning (cs.LG); Algebraic Topology (math.AT); Neurons and Cognition (q-bio.NC)
[378]  arXiv:2404.19708 [pdf, other]
Title: Harmonic LLMs are Trustworthy
Comments: 15 pages, 4 figures, 14 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[379]  arXiv:2404.19673 [pdf, ps, other]
Title: Neural Controlled Differential Equations with Quantum Hidden Evolutions
Comments: Code available at: this https URL
Subjects: Machine Learning (cs.LG)
[380]  arXiv:2404.19669 [pdf, other]
Title: Enhancing Predictive Accuracy in Pharmaceutical Sales Through An Ensemble Kernel Gaussian Process Regression Approach
Comments: 6 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[381]  arXiv:2404.19660 [pdf, other]
Title: Decoder Decomposition for the Analysis of the Latent Space of Nonlinear Autoencoders With Wind-Tunnel Experimental Data
Subjects: Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn)
[382]  arXiv:2404.19651 [pdf, other]
Title: Provably Robust Conformal Prediction with Improved Efficiency
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[383]  arXiv:2404.19649 [pdf, other]
Title: Landmark Alternating Diffusion
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an); Machine Learning (stat.ML)
[384]  arXiv:2404.19640 [pdf, other]
Title: Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Methodology (stat.ME); Machine Learning (stat.ML)
[385]  arXiv:2404.19631 [pdf, other]
Title: On Training a Neural Network to Explain Binaries
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Software Engineering (cs.SE)
[386]  arXiv:2404.19630 [pdf, other]
Title: Analyzing and Exploring Training Recipes for Large-Scale Transformer-Based Weather Prediction
Comments: 9 pages, 6 figures
Journal-ref: 23rd Conference on Artificial Intelligence for Environmental Science. Jan 2024. Abstract #437874
Subjects: Machine Learning (cs.LG)
[387]  arXiv:2404.19620 [pdf, other]
Title: Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference
Comments: ICLR 24
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR); Machine Learning (stat.ML)
[388]  arXiv:2404.19605 [pdf, other]
Title: Data-Driven Invertible Neural Surrogates of Atmospheric Transmission
Comments: Manuscript accepted for presentation and publication at the 2024 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Atmospheric and Oceanic Physics (physics.ao-ph)
[389]  arXiv:2404.19582 [pdf, other]
Title: Leveraging Label Information for Stealthy Data Stealing in Vertical Federated Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[390]  arXiv:2404.19536 [pdf, other]
Title: Physics-Informed Machine Learning On Polar Ice: A Survey
Subjects: Machine Learning (cs.LG)
[391]  arXiv:2404.19519 [pdf, ps, other]
Title: Generating Robust Counterfactual Witnesses for Graph Neural Networks
Comments: This paper has been accepted by ICDE 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[392]  arXiv:2404.19508 [pdf, other]
Title: Temporal Graph ODEs for Irregularly-Sampled Time Series
Comments: Preprint. Accepted at IJCAI 2024
Subjects: Machine Learning (cs.LG)
[393]  arXiv:2404.19501 [pdf, other]
Title: A Unified Theory of Exact Inference and Learning in Exponential Family Latent Variable Models
Authors: Sacha Sokoloski
Subjects: Machine Learning (cs.LG)
[394]  arXiv:2404.19487 [pdf, ps, other]
Title: Finetuning greedy kernel models by exchange algorithms
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[395]  arXiv:2404.19484 [pdf, other]
Title: More Compute Is What You Need
Authors: Zhen Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[396]  arXiv:2404.19467 [pdf, ps, other]
Title: Bayesian Functional Connectivity and Graph Convolutional Network for Working Memory Load Classification
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Neurons and Cognition (q-bio.NC)
[397]  arXiv:2404.19462 [pdf, other]
Title: Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation
Comments: Published at ECML 2023
Subjects: Machine Learning (cs.LG)
[398]  arXiv:2404.19460 [pdf, other]
Title: AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples
Comments: this https URL
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[399]  arXiv:2404.19456 [pdf, other]
Title: Imitation Learning: A Survey of Learning Methods, Environments and Metrics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[400]  arXiv:2404.19452 [pdf, other]
Title: How to Sustainably Monitor ML-Enabled Systems? Accuracy and Energy Efficiency Tradeoffs in Concept Drift Detection
Comments: Accepted for publication at the International Conference on Information and Communications Technology for Sustainability 2024 (ICT4S'24)
Subjects: Machine Learning (cs.LG); Software Engineering (cs.SE)
[401]  arXiv:2404.19420 [pdf, other]
Title: Let's Focus: Focused Backdoor Attack against Federated Transfer Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[402]  arXiv:2404.19346 [pdf, other]
Title: Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Comments: Accepted by Artificial Intelligence (AIJ)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[403]  arXiv:2404.19306 [pdf, ps, other]
Title: Comprehensive Forecasting-Based Analysis of Hybrid and Stacked Stateful/ Stateless Models
Authors: Swayamjit Saha
Comments: 8 pages, 14 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[404]  arXiv:2404.19288 [pdf, other]
Title: Training-free Graph Neural Networks and the Power of Labels as Features
Authors: Ryoma Sato
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[405]  arXiv:2404.19284 [pdf, other]
Title: Approximate Nearest Neighbour Search on Dynamic Datasets: An Investigation
Subjects: Machine Learning (cs.LG)
[406]  arXiv:2404.19283 [pdf, other]
Title: MAP-Former: Multi-Agent-Pair Gaussian Joint Prediction
Comments: Accepted for publication in Proceedings of the IEEE Intelligent Vehicles Symposium (IV), Jeju Island - Korea, 2-5 June 2024
Subjects: Machine Learning (cs.LG)
[407]  arXiv:2404.19261 [pdf, other]
Title: High dimensional analysis reveals conservative sharpening and a stochastic edge of stability
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an)
[408]  arXiv:2404.19247 [pdf, ps, other]
Title: Improved AutoEncoder with LSTM module and KL divergence
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[409]  arXiv:2404.19228 [pdf, other]
Title: Understanding Multimodal Contrastive Learning Through Pointwise Mutual Information
Subjects: Machine Learning (cs.LG)
[410]  arXiv:2404.19218 [pdf, ps, other]
Title: Flight Trajectory Prediction Using an Enhanced CNN-LSTM Network
Subjects: Machine Learning (cs.LG)
[411]  arXiv:2404.19141 [pdf, other]
Title: Micro-Macro Spatial-Temporal Graph-based Encoder-Decoder for Map-Constrained Trajectory Recovery
Comments: This paper has been accepted as a regular paper at IEEE TKDE
Subjects: Machine Learning (cs.LG)
[412]  arXiv:2404.19132 [pdf, other]
Title: Integrating Present and Past in Unsupervised Continual Learning
Comments: CoLLAs 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[413]  arXiv:2404.19112 [pdf, other]
Title: Hidden Synergy: $L_1$ Weight Normalization and 1-Path-Norm Regularization
Authors: Aditya Biswas
Comments: 8 pages body, 2 tables, 1 figure, 3 appendices
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[414]  arXiv:2404.19109 [pdf, other]
Title: The Shape of Money Laundering: Subgraph Representation Learning on the Blockchain with the Elliptic2 Dataset
Subjects: Machine Learning (cs.LG); General Finance (q-fin.GN)
[415]  arXiv:2404.18978 [pdf, other]
Title: Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs
Comments: Accepted as a full paper at EDM 2024: The 17th International Conference on Educational Data Mining, 14-17 of July 2024, Atlanta
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[416]  arXiv:2404.18976 [pdf, other]
Title: Foundations of Multisensory Artificial Intelligence
Authors: Paul Pu Liang
Comments: CMU Machine Learning Department PhD Thesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[417]  arXiv:2404.18975 [pdf, ps, other]
Title: M3H: Multimodal Multitask Machine Learning for Healthcare
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[418]  arXiv:2404.18963 [pdf, other]
Title: RE-GrievanceAssist: Enhancing Customer Experience through ML-Powered Complaint Management
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[419]  arXiv:2404.18961 [pdf, other]
Title: Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras
Comments: 60 figures, 116 pages, 500+ references
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[420]  arXiv:2404.18949 [pdf, other]
Title: The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth
Comments: arXiv admin note: text overlap with arXiv:2404.16890
Subjects: Machine Learning (cs.LG)
[421]  arXiv:2404.18948 [pdf, other]
Title: Sub-Adjacent Transformer: Improving Time Series Anomaly Detection with Reconstruction Error from Sub-Adjacent Neighborhoods
Comments: IJCAI 2024
Subjects: Machine Learning (cs.LG)
[422]  arXiv:2404.18947 [pdf, other]
Title: Multimodal Fusion on Low-quality Data: A Comprehensive Survey
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[423]  arXiv:2404.18932 [pdf, ps, other]
Title: Dynamic Model Switching for Improved Accuracy in Machine Learning
Subjects: Machine Learning (cs.LG)
[424]  arXiv:2404.19753 (cross-list from cs.CV) [pdf, other]
Title: DOCCI: Descriptions of Connected and Contrasting Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425]  arXiv:2404.19749 (cross-list from cs.IT) [pdf, other]
Title: Scale-Robust Timely Asynchronous Decentralized Learning
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Networking and Internet Architecture (cs.NI); Signal Processing (eess.SP)
[426]  arXiv:2404.19739 (cross-list from q-bio.BM) [pdf, other]
Title: Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
[427]  arXiv:2404.19696 (cross-list from cs.CV) [pdf, other]
Title: Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners
Comments: CVPR 2024. The first two authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[428]  arXiv:2404.19689 (cross-list from math.AP) [pdf, ps, other]
Title: Continuum limit of $p$-biharmonic equations on graphs
Comments: 20 pages
Subjects: Analysis of PDEs (math.AP); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[429]  arXiv:2404.19675 (cross-list from cs.CY) [pdf, other]
Title: Deep Learning for Educational Data Science
Comments: 18 pages. To be published in Trust and Inclusion in AI-Mediated Education: Where Human Learning Meets Learning Machines by Springer International
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[430]  arXiv:2404.19671 (cross-list from cs.NI) [pdf, other]
Title: ML-based handover prediction over a real O-RAN deployment using RAN Intelligent controller
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[431]  arXiv:2404.19668 (cross-list from cs.NE) [pdf, other]
Title: SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks
Comments: 10 pages, 4 figures, accepted at NICE 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[432]  arXiv:2404.19664 (cross-list from cs.RO) [pdf, other]
Title: Towards Generalist Robot Learning from Internet Video: A Survey
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[433]  arXiv:2404.19654 (cross-list from cs.CV) [pdf, other]
Title: Masked Multi-Query Slot Attention for Unsupervised Object Discovery
Comments: Paper accepted for presentation at IJCNN 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[434]  arXiv:2404.19596 (cross-list from cs.IR) [pdf, other]
Title: Debiased Collaborative Filtering with Kernel-Based Causal Balancing
Comments: ICLR 24 Spotlight
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[435]  arXiv:2404.19591 (cross-list from cs.DB) [pdf, other]
Title: Towards Interactively Improving ML Data Preparation Code via "Shadow Pipelines"
Subjects: Databases (cs.DB); Machine Learning (cs.LG); Software Engineering (cs.SE)
[436]  arXiv:2404.19579 (cross-list from eess.IV) [pdf, ps, other]
Title: Automatic Cardiac Pathology Recognition in Echocardiography Images Using Higher Order Dynamic Mode Decomposition and a Vision Transformer for Small Datasets
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[437]  arXiv:2404.19557 (cross-list from stat.ML) [pdf, other]
Title: Neural Dynamic Data Valuation
Comments: 43 pages, 19 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[438]  arXiv:2404.19486 (cross-list from cs.CL) [pdf, other]
Title: Safe Training with Sensitive In-domain Data: Leveraging Data Fragmentation To Mitigate Linkage Attacks
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[439]  arXiv:2404.19429 (cross-list from cs.DC) [pdf, other]
Title: Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping
Comments: 11 pages, 16 figures. Published in MLSys'24
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[440]  arXiv:2404.19397 (cross-list from cs.HC) [pdf, other]
Title: Can humans teach machines to code?
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[441]  arXiv:2404.19370 (cross-list from cs.AI) [pdf, other]
Title: Numeric Reward Machines
Comments: ICAPS 2024; Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[442]  arXiv:2404.19354 (cross-list from cs.AR) [pdf, other]
Title: PEFSL: A deployment Pipeline for Embedded Few-Shot Learning on a FPGA SoC
Authors: Lucas Grativol Ribeiro (IMT Atlantique - MEE, Lab\_STICC\_BRAIn, Lab-STICC\_2AI, LHC), Lubin Gauthier (Lab\_STICC\_BRAIn, IMT Atlantique - MEE), Mathieu Leonardon (IMT Atlantique - MEE, Lab\_STICC\_BRAIn), Jérémy Morlier (IMT Atlantique - MEE, Lab\_STICC\_BRAIn), Antoine Lavrard-Meyer (IMT Atlantique), Guillaume Muller (Mines Saint-Étienne MSE, FAYOL-ENSMSE, FAYOL-ENSMSE), Virginie Fresse (LHC, TSE), Matthieu Arzel (IMT Atlantique - MEE, Lab-STICC\_2AI)
Journal-ref: ISCAS 2024 : IEEE International Symposium on Circuits and Systems, May 2024, Singapore, Singapore
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[443]  arXiv:2404.19351 (cross-list from physics.geo-ph) [pdf, other]
Title: Deep Learning Forecasts Caldera Collapse Events at Kilauea Volcano
Subjects: Geophysics (physics.geo-ph); Machine Learning (cs.LG)
[444]  arXiv:2404.19349 (cross-list from cs.RO) [pdf, other]
Title: Human-AI Interaction in Industrial Robotics: Design and Empirical Evaluation of a User Interface for Explainable AI-Based Robot Program Optimization
Comments: 6 pages, 4 figures, accepted at the 2024 CIRP International Conference on Manufacturing Systems (CMS)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[445]  arXiv:2404.19301 (cross-list from stat.ML) [pdf, ps, other]
Title: Statistics and explainability: a fruitful alliance
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[446]  arXiv:2404.19292 (cross-list from cs.IT) [pdf, other]
Title: Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[447]  arXiv:2404.19289 (cross-list from cs.CV) [pdf, other]
Title: On Improving the Algorithm-, Model-, and Data- Efficiency of Self-Supervised Learning
Comments: 13 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[448]  arXiv:2404.19256 (cross-list from cs.AI) [pdf, other]
Title: Bias Mitigation via Compensation: A Reinforcement Learning Perspective
Comments: 8 pages, 5 diagrams
Subjects: Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Computer Science and Game Theory (cs.GT); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[449]  arXiv:2404.19238 (cross-list from cs.IT) [pdf, other]
Title: Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects
Comments: Accepted At IWCMC 2024 Comm & SP Symposium
Subjects: Information Theory (cs.IT); Distributed, Parallel, and Cluster Computing (cs.DC); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[450]  arXiv:2404.19220 (cross-list from stat.ML) [pdf, other]
Title: Regression for matrix-valued data via Kronecker products factorization
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[451]  arXiv:2404.19165 (cross-list from cs.NE) [pdf, other]
Title: DelGrad: Exact gradients in spiking networks for learning transmission delays and weights
Comments: 15 pages, 7 figures
Subjects: Neural and Evolutionary Computing (cs.NE); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[452]  arXiv:2404.19157 (cross-list from stat.ML) [pdf, other]
Title: Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks
Authors: Javier Antoran
Comments: PhD Thesis, University of Cambridge
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[453]  arXiv:2404.19145 (cross-list from stat.ME) [pdf, other]
Title: Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Econometrics (econ.EM); Statistics Theory (math.ST); Machine Learning (stat.ML)
[454]  arXiv:2404.19130 (cross-list from cs.IR) [pdf, other]
Title: SpherE: Expressive and Interpretable Knowledge Graph Embedding for Set Retrieval
Comments: Accepted by SIGIR 2024, Camera Ready Version
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[455]  arXiv:2404.19128 (cross-list from cs.CV) [pdf, other]
Title: Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM
Comments: Accepted to CVPR 2024, Second Workshop on Foundation Models (WFM)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[456]  arXiv:2404.19114 (cross-list from cs.CR) [pdf, other]
Title: Enhancing IoT Security: A Novel Feature Engineering Approach for ML-Based Intrusion Detection Systems
Comments: This paper has been accepted by DCOSS-IoT 2024
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[457]  arXiv:2404.19113 (cross-list from cs.CV) [pdf, other]
Title: Source-Free Domain Adaptation of Weakly-Supervised Object Localization Models for Histology
Comments: 16 pages, 21 figures, 5 tables, CVPRw 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[458]  arXiv:2404.19100 (cross-list from cs.SE) [pdf, other]
Title: Predicting Fairness of ML Software Configuration
Comments: To Appear in the 20th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE'24)
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[459]  arXiv:2404.19095 (cross-list from cs.HC) [pdf, ps, other]
Title: Catalyzing Social Interactions in Mixed Reality using ML Recommendation Systems
Subjects: Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[460]  arXiv:2404.19094 (cross-list from cs.CL) [pdf, other]
Title: In-Context Symbolic Regression: Leveraging Language Models for Function Discovery
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[461]  arXiv:2404.19087 (cross-list from cs.RO) [pdf, other]
Title: Deep Reinforcement Learning for Advanced Longitudinal Control and Collision Avoidance in High-Risk Driving Scenarios
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[462]  arXiv:2404.19075 (cross-list from eess.IV) [pdf, other]
Title: Distributed Stochastic Optimization of a Neural Representation Network for Time-Space Tomography Reconstruction
Comments: submitted to Nature Machine Intelligence
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[463]  arXiv:2404.19073 (cross-list from stat.ML) [pdf, other]
Title: Learning Sparse High-Dimensional Matrix-Valued Graphical Models From Dependent Data
Comments: 16 pages, 2 figures, 1 table
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)
[464]  arXiv:2404.19065 (cross-list from cs.AI) [pdf, other]
Title: HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Comments: Videos and code this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[465]  arXiv:2404.18962 (cross-list from cs.CV) [pdf, other]
Title: An Aggregation-Free Federated Learning for Tackling Data Heterogeneity
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[466]  arXiv:2404.18960 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Leak Proof CMap; a framework for training and evaluation of cell line agnostic L1000 similarity methods
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG)
[467]  arXiv:2404.18952 (cross-list from cs.CV) [pdf, other]
Title: CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention
Comments: To be published in the proceedings of 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[468]  arXiv:2404.18944 (cross-list from cs.SI) [pdf, ps, other]
Title: Investigating the dissemination of STEM content on social media with computational tools
Comments: 17 pages, 3 figures, 3 supplemental figures
Subjects: Social and Information Networks (cs.SI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[469]  arXiv:2404.18942 (cross-list from cs.CL) [pdf, other]
Title: GuideWalk -- Heterogeneous Data Fusion for Enhanced Learning -- A Multiclass Document Classification Case
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[470]  arXiv:2404.18933 (cross-list from cs.CV) [pdf, other]
Title: Learning Low-Rank Feature for Thorax Disease Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[471]  arXiv:2404.18247 (cross-list from hep-th) [pdf, other]
Title: Classical integrability in the presence of a cosmological constant: analytic and machine learning results
Comments: 32 pages, 7 figures
Subjects: High Energy Physics - Theory (hep-th); Machine Learning (cs.LG); Mathematical Physics (math-ph)
[472]  arXiv:2404.10188 (cross-list from cs.NI) [pdf, other]
Title: Smart Pilot Assignment for IoT in Massive MIMO Systems: A Path Towards Scalable IoT Infrastructure
Comments: Accepted At ICC-2024
Subjects: Networking and Internet Architecture (cs.NI); Computer Science and Game Theory (cs.GT); Information Theory (cs.IT); Machine Learning (cs.LG); Social and Information Networks (cs.SI)

Tue, 30 Apr 2024 (showing first 86 of 184 entries)

[473]  arXiv:2404.18922 [pdf, other]
Title: DPO Meets PPO: Reinforced Token Optimization for RLHF
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[474]  arXiv:2404.18909 [pdf, other]
Title: Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[475]  arXiv:2404.18896 [pdf, other]
Title: Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models
Comments: 19 pages, 7 figures
Subjects: Machine Learning (cs.LG)
[476]  arXiv:2404.18886 [pdf, other]
Title: A Survey on Diffusion Models for Time Series and Spatio-Temporal Data
Comments: Ongoing work; 27 pages, 8 figures, 2 tables; Github Repo: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[477]  arXiv:2404.18869 [pdf, ps, other]
Title: Learning Mixtures of Gaussians Using Diffusion Models
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Probability (math.PR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[478]  arXiv:2404.18848 [pdf, other]
Title: FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[479]  arXiv:2404.18825 [pdf, other]
Title: Harmonic Machine Learning Models are Robust
Comments: 18 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[480]  arXiv:2404.18780 [pdf, other]
Title: Optimal time sampling in physics-informed neural networks
Authors: Gabriel Turinici
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Computational Physics (physics.comp-ph)
[481]  arXiv:2404.18773 [pdf, other]
Title: A Universal Metric of Dataset Similarity for Cross-silo Federated Learning
Subjects: Machine Learning (cs.LG)
[482]  arXiv:2404.18736 [pdf, other]
Title: Mapping the Potential of Explainable Artificial Intelligence (XAI) for Fairness Along the AI Lifecycle
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[483]  arXiv:2404.18730 [pdf, other]
Title: CVTN: Cross Variable and Temporal Integration for Time Series Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[484]  arXiv:2404.18702 [pdf, other]
Title: Why You Should Not Trust Interpretations in Machine Learning: Adversarial Attacks on Partial Dependence Plots
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Applications (stat.AP); Machine Learning (stat.ML)
[485]  arXiv:2404.18699 [pdf, other]
Title: Convergence Properties of Score-Based Models using Graduated Optimisation for Linear Inverse Problems
Comments: 8 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[486]  arXiv:2404.18685 [pdf, other]
Title: FALE: Fairness-Aware ALE Plots for Auditing Bias in Subgroups
Comments: Presented in Uncertainty meets Explainability Workshop @ ECML/PKDD 2023
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[487]  arXiv:2404.18670 [pdf, other]
Title: Enhancing Uncertain Demand Prediction in Hospitals Using Simple and Advanced Machine Learning
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[488]  arXiv:2404.18631 [pdf, other]
Title: Feature importance to explain multimodal prediction models. A clinical use case
Comments: Accepted at World Conference on Explainable Artificial Intelligence; 19 pages, 2 figures, 7 tables
Subjects: Machine Learning (cs.LG)
[489]  arXiv:2404.18573 [pdf, other]
Title: Predicting Safety Misbehaviours in Autonomous Driving Systems using Uncertainty Quantification
Comments: In Proceedings of 17th IEEE International Conference on Software Testing, Verification and Validation 2024 (ICST '24)
Subjects: Machine Learning (cs.LG); Robotics (cs.RO); Software Engineering (cs.SE)
[490]  arXiv:2404.18572 [pdf, other]
Title: Learning Governing Equations of Unobserved States in Dynamical Systems
Subjects: Machine Learning (cs.LG)
[491]  arXiv:2404.18553 [pdf, other]
Title: Evaluating the effectiveness of predicting covariates in LSTM Networks for Time Series Forecasting
Authors: Gareth Davies
Comments: 9 content pages (22 total pages), 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[492]  arXiv:2404.18550 [pdf, other]
Title: IncidentResponseGPT: Generating Traffic Incident Response Plans with Generative Artificial Intelligence
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC)
[493]  arXiv:2404.18538 [pdf, ps, other]
Title: Symmetry group based domain decomposition to enhance physics-informed neural networks for solving partial differential equations
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[494]  arXiv:2404.18537 [pdf, other]
Title: Time Series Data Augmentation as an Imbalanced Learning Problem
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[495]  arXiv:2404.18530 [pdf, other]
Title: Predicting PDEs Fast and Efficiently with Equivariant Extreme Learning Machines
Subjects: Machine Learning (cs.LG)
[496]  arXiv:2404.18528 [pdf, other]
Title: Generation of Uncorrelated Residual Variables for Chemical Process Fault Diagnosis via Transfer Learning-based Input-Output Decoupled Network
Subjects: Machine Learning (cs.LG)
[497]  arXiv:2404.18527 [pdf, ps, other]
Title: Bridging Data Barriers among Participants: Assessing the Potential of Geoenergy through Federated Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Applications (stat.AP)
[498]  arXiv:2404.18525 [pdf, other]
Title: Enabling Efficient and Flexible Interpretability of Data-driven Anomaly Detection in Industrial Processes with AcME-AD
Subjects: Machine Learning (cs.LG)
[499]  arXiv:2404.18519 [pdf, other]
Title: On the Impact of Data Heterogeneity in Federated Learning Environments with Application to Healthcare Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[500]  arXiv:2404.18508 [pdf, other]
Title: Scalable Event-by-event Processing of Neuromorphic Sensory Signals With Deep State-Space Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[501]  arXiv:2404.18504 [pdf, other]
Title: Multisensor Data Fusion for Automatized Insect Monitoring (KInsecta)
Journal-ref: Remote Sensing for Agriculture, Ecosystems, and Hydrology XXV, SPIE 12727 (2023) 1272702
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[502]  arXiv:2404.18490 [pdf, other]
Title: Reduced-Rank Multi-objective Policy Learning and Optimization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[503]  arXiv:2404.18444 [pdf, other]
Title: U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models
Authors: Song Mei
Comments: v2 updated discussions of related literature
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Machine Learning (stat.ML)
[504]  arXiv:2404.18414 [pdf, other]
Title: Learning a Sparse Neural Network using IHT
Subjects: Machine Learning (cs.LG)
[505]  arXiv:2404.18400 [pdf, other]
Title: LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[506]  arXiv:2404.18326 [pdf, other]
Title: SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement Learning Policies
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[507]  arXiv:2404.18314 [pdf, other]
Title: DIRESA, a distance-preserving nonlinear dimension reduction technique based on regularized autoencoders
Comments: 16 pages, 10 figures, 4 tables; 7 pages of Supporting Information
Subjects: Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD); Atmospheric and Oceanic Physics (physics.ao-ph)
[508]  arXiv:2404.18311 [pdf, ps, other]
Title: Towards Real-time Learning in Large Language Models: A Critical Review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[509]  arXiv:2404.18287 [pdf, other]
Title: Joint Energy and Latency Optimization in Federated Learning over Cell-Free Massive MIMO Networks
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Networking and Internet Architecture (cs.NI)
[510]  arXiv:2404.18273 [pdf, other]
Title: Kernel Corrector LSTM
Comments: 12 pages, 4 figures, IDA 2024
Subjects: Machine Learning (cs.LG)
[511]  arXiv:2404.18246 [pdf, other]
Title: AdaFSNet: Time Series Classification Based on Convolutional Network with a Adaptive and Effective Kernel Size Configuration
Comments: Accepted by IJCNN 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[512]  arXiv:2404.18239 [pdf, other]
Title: SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[513]  arXiv:2404.18211 [pdf, other]
Title: A survey of dynamic graph neural networks
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[514]  arXiv:2404.18209 [pdf, other]
Title: 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs
Comments: Under review
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[515]  arXiv:2404.18190 [pdf, other]
Title: Naive Bayes Classifiers and One-hot Encoding of Categorical Variables
Comments: 7 pages, 3 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[516]  arXiv:2404.18161 [pdf, other]
Title: IMEX-Reg: Implicit-Explicit Regularization in the Function Space for Continual Learning
Comments: Published in Transactions on Machine Learning Research
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2404.18159 [pdf, other]
Title: Evaluating ROCKET and Catch22 features for calf behaviour classification from accelerometer data using Machine Learning models
Comments: 45 pages, 8 figures, 11 tables (3 in the Appendix), Journal paper
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[518]  arXiv:2404.18144 [pdf, other]
Title: Generative AI for Visualization: State of the Art and Future Directions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[519]  arXiv:2404.18134 [pdf, other]
Title: Enhancing Fairness in Neural Networks Using FairVIC
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (stat.ML)
[520]  arXiv:2404.18101 [pdf, other]
Title: Advancing Supervised Learning with the Wave Loss Function: A Robust and Smooth Approach
Subjects: Machine Learning (cs.LG)
[521]  arXiv:2404.18063 [pdf, other]
Title: Machine Learning Techniques for Data Reduction of CFD Applications
Comments: 10 pages, 8 figures
Subjects: Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn)
[522]  arXiv:2404.18008 [pdf, other]
Title: Implicit Generative Prior for Bayesian Neural Networks
Authors: Yijia Liu, Xiao Wang
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[523]  arXiv:2404.17997 [pdf, other]
Title: Optimal Initialization of Batch Bayesian Optimization
Comments: 10 pages, 8 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[524]  arXiv:2404.17990 [pdf, other]
Title: TabVFL: Improving Latent Representation in Vertical Federated Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[525]  arXiv:2404.17951 [pdf, other]
Title: Cauchy-Schwarz Divergence Information Bottleneck for Regression
Comments: accepted by ICLR-24, project page: \url{this https URL}
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[526]  arXiv:2404.17947 [pdf, other]
Title: Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks
Comments: Accepted at ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[527]  arXiv:2404.17943 [pdf, other]
Title: Interaction Event Forecasting in Multi-Relational Recursive HyperGraphs: A Temporal Point Process Approach
Comments: 10 pages, 4 figures, 5 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[528]  arXiv:2404.17940 [pdf, ps, other]
Title: CBMAP: Clustering-based manifold approximation and projection for dimensionality reduction
Authors: Berat Dogan
Subjects: Machine Learning (cs.LG)
[529]  arXiv:2404.17937 [pdf, other]
Title: DTization: A New Method for Supervised Feature Scaling
Authors: Niful Islam
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[530]  arXiv:2404.17931 [pdf, ps, other]
Title: Critical Review for One-class Classification: recent advances and the reality behind them
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[531]  arXiv:2404.17925 [pdf, other]
Title: Accurate and fast anomaly detection in industrial processes and IoT environments
Authors: Simone Tonini (1), Andrea Vandin (1), Francesca Chiaromonte (1 and 2), Daniele Licari (3), Fernando Barsacchi (4) ((1) L'EMbeDS and Institute of Economics, Sant'Anna School of Advanced Studies, Pisa, (2) Dept. of Statistics, The Pennsylvania State University, (3) L'EMbeDS, Sant'Anna School of Advanced Studies, (4) A. Celli Group, Lucca)
Subjects: Machine Learning (cs.LG); Applications (stat.AP)
[532]  arXiv:2404.17916 [pdf, other]
Title: FedCRL: Personalized Federated Learning with Contrastive Shared Representations for Label Heterogeneity in Non-IID Data
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[533]  arXiv:2404.17886 [pdf, other]
Title: Feature graphs for interpretable unsupervised tree ensembles: centrality, interaction, and application in disease subtyping
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[534]  arXiv:2404.17875 [pdf, other]
Title: Noisy Node Classification by Bi-level Optimization based Multi-teacher Distillation
Subjects: Machine Learning (cs.LG)
[535]  arXiv:2404.17847 [pdf, other]
Title: pFedAFM: Adaptive Feature Mixture for Batch-Level Personalization in Heterogeneous Federated Learning
Subjects: Machine Learning (cs.LG)
[536]  arXiv:2404.17830 [pdf, other]
Title: Dynamic Against Dynamic: An Open-set Self-learning Framework
Comments: The first two authors contributed equally to this work. Accepted at IJCAI2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[537]  arXiv:2404.17805 [pdf, other]
Title: From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client Sharpness Matching
Comments: This paper is accepted at IJCAI'24 (Main Track)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[538]  arXiv:2404.17801 [pdf, ps, other]
Title: Dynamical Mode Recognition of Coupled Flame Oscillators by Supervised and Unsupervised Learning Approaches
Comments: research paper (21 pages, 15 figures)
Subjects: Machine Learning (cs.LG)
[539]  arXiv:2404.17799 [pdf, other]
Title: Personalized Federated Learning via Sequential Layer Expansion in Representation Learning
Comments: 12 pages, 7 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[540]  arXiv:2404.17789 [pdf, other]
Title: BiLO: Bilevel Local Operator Learning for PDE inverse problems
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[541]  arXiv:2404.17773 [pdf, other]
Title: Compressing Latent Space via Least Volume
Authors: Qiuyi Chen, Mark Fuge
Comments: 24 pages, International Conference on Learning Representations 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[542]  arXiv:2404.17768 [pdf, other]
Title: Make the Most of Your Data: Changing the Training Data Distribution to Improve In-distribution Generalization Performance
Comments: 32 pages, 11 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[543]  arXiv:2404.17766 [pdf, other]
Title: Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
[544]  arXiv:2404.17746 [pdf, other]
Title: On the Rashomon ratio of infinite hypothesis sets
Subjects: Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[545]  arXiv:2404.17735 [pdf, other]
Title: Causal Diffusion Autoencoders: Toward Counterfactual Generation via Diffusion Probabilistic Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[546]  arXiv:2404.17699 [pdf, other]
Title: Deep Learning for Melt Pool Depth Contour Prediction From Surface Thermal Images via Vision Transformers
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[547]  arXiv:2404.17690 [pdf, other]
Title: A Biased Estimator for MinMax Sampling and Distributed Aggregation
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Applications (stat.AP)
[548]  arXiv:2404.17687 [pdf, other]
Title: Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[549]  arXiv:2404.17674 [pdf, other]
Title: Center-Based Relaxed Learning Against Membership Inference Attacks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[550]  arXiv:2404.17673 [pdf, other]
Title: Learning Manipulation Tasks in Dynamic and Shared 3D Spaces
Comments: 5 pages
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[551]  arXiv:2404.17651 [pdf, other]
Title: Hard ASH: Sparsity and the right optimizer make a continual learner
Authors: Santtu Keskinen
Comments: ICLR 2024 TinyPaper
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[552]  arXiv:2404.17626 [pdf, other]
Title: Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM); Applications (stat.AP); Computation (stat.CO)
[553]  arXiv:2404.17625 [pdf, other]
Title: Alice's Adventures in a Differentiable Wonderland -- Volume I, A Tour of the Land
Comments: Companion website for additional chapters: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[554]  arXiv:2404.17620 [pdf, other]
Title: Neural Modes: Self-supervised Learning of Nonlinear Modal Subspaces
Comments: Accepted to CVPR 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[555]  arXiv:2404.17609 [pdf, other]
Title: CoSD: Collaborative Stance Detection with Contrastive Heterogeneous Topic Graph Learning
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[556]  arXiv:2404.18928 (cross-list from cs.CV) [pdf, other]
Title: Stylus: Automatic Adapter Selection for Diffusion Models
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[557]  arXiv:2404.18926 (cross-list from cs.RO) [pdf, other]
Title: Point Cloud Models Improve Visual Robustness in Robotic Learners
Comments: Accepted at International Conference on Robotics and Automation, 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[558]  arXiv:2404.18911 (cross-list from cs.CL) [pdf, other]
Title: Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[ total of 656 entries: 1-558 | 559-656 ]
[ showing 558 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)