Machine Learning

Authors and titles for recent submissions, skipping first 365

Tue, 7 May 2024 (continued, showing last 184 of 231 entries)

[366]  arXiv:2405.03146 [pdf, other]
Title: Quantifying the Capabilities of LLMs across Scale and Precision
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[367]  arXiv:2405.03140 [pdf, other]
Title: TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance Learning
Comments: Accepted by ICML2024
Subjects: Machine Learning (cs.LG)
[368]  arXiv:2405.03103 [pdf, other]
Title: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[369]  arXiv:2405.03097 [pdf, other]
Title: To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models
Comments: Published as a conference paper at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[370]  arXiv:2405.03095 [pdf, other]
Title: Loss Jump During Loss Switch in Solving PDEs with Neural Networks
Subjects: Machine Learning (cs.LG); Mathematical Physics (math-ph)
[371]  arXiv:2405.03089 [pdf, other]
Title: Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition
Subjects: Machine Learning (cs.LG)
[372]  arXiv:2405.03082 [pdf, other]
Title: Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
Comments: Accepted in ICML 2024
Subjects: Machine Learning (cs.LG)
[373]  arXiv:2405.03075 [pdf, other]
Title: AnoGAN for Tabular Data: A Novel Approach to Anomaly Detection
Comments: 12 pages, 6 figures, accepted as Short paper at HCII 2024
Subjects: Machine Learning (cs.LG)
[374]  arXiv:2405.03064 [pdf, other]
Title: RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[375]  arXiv:2405.03060 [pdf, other]
Title: Tree-based Ensemble Learning for Out-of-distribution Detection
Subjects: Machine Learning (cs.LG)
[376]  arXiv:2405.03059 [pdf, other]
Title: Active Preference Learning for Ordering Items In- and Out-of-sample
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[377]  arXiv:2405.03056 [pdf, other]
Title: Convolutional Learning on Directed Acyclic Graphs
Subjects: Machine Learning (cs.LG)
[378]  arXiv:2405.03052 [pdf, other]
Title: A View on Out-of-Distribution Identification from a Statistical Testing Theory Perspective
Subjects: Machine Learning (cs.LG)
[379]  arXiv:2405.03005 [pdf, other]
Title: Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[380]  arXiv:2405.03003 [pdf, other]
Title: Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[381]  arXiv:2405.02969 [pdf, other]
Title: Towards a Flexible and High-Fidelity Approach to Distributed DNN Training Emulation
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[382]  arXiv:2405.02952 [pdf, other]
Title: Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[383]  arXiv:2405.02936 [pdf, ps, other]
Title: On the tractability of SHAP explanations under Markovian distributions
Comments: Accepted at ICML'24 (This version is a pre-print)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[384]  arXiv:2405.02881 [pdf, other]
Title: FedConPE: Efficient Federated Conversational Bandits with Heterogeneous Clients
Comments: Accepted in the 33rd International Joint Conference on Artificial Intelligence (IJCAI), 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[385]  arXiv:2405.02845 [pdf, other]
Title: Data-Efficient Molecular Generation with Hierarchical Textual Inversion
Subjects: Machine Learning (cs.LG); Molecular Networks (q-bio.MN)
[386]  arXiv:2405.02842 [pdf, other]
Title: IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs
Subjects: Machine Learning (cs.LG)
[387]  arXiv:2405.02807 [pdf, ps, other]
Title: Kinematic analysis of structural mechanics based on convolutional neural network
Comments: 9 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[388]  arXiv:2405.02805 [pdf, other]
Title: Verlet Flows: Exact-Likelihood Integrators for Flow-Based Generative Models
Comments: ICLR AI4DifferentialEquations In Science workshop 2024
Subjects: Machine Learning (cs.LG)
[389]  arXiv:2405.02803 [pdf, other]
Title: Is Flash Attention Stable?
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[390]  arXiv:2405.02795 [pdf, other]
Title: Graph as Point Set
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[391]  arXiv:2405.02774 [pdf, other]
Title: Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs
Comments: Published as a conference paper at ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[392]  arXiv:2405.02770 [pdf, other]
[393]  arXiv:2405.02769 [pdf, other]
Title: Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC)
[394]  arXiv:2405.02766 [pdf, other]
Title: Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning
Comments: Accepted at 3rd Conference on Lifelong Learning Agents (CoLLAs), 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[395]  arXiv:2405.02749 [pdf, other]
Title: Sub-goal Distillation: A Method to Improve Small Language Agents
Subjects: Machine Learning (cs.LG)
[396]  arXiv:2405.02745 [pdf, other]
Title: Understanding Server-Assisted Federated Learning in the Presence of Incomplete Client Participation
Comments: Accepted in ICML2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[397]  arXiv:2405.02731 [pdf, other]
Title: Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles
Comments: 17 pages, 2 tables, 5 figures
Subjects: Machine Learning (cs.LG)
[398]  arXiv:2405.02726 [pdf, other]
Title: A Mathematical Model of the Hidden Feedback Loop Effect in Machine Learning Systems
Comments: 21 pages, 15 figures
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[399]  arXiv:2405.02724 [pdf, ps, other]
Title: Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement Learning
Authors: Yingjie Fei, Ruitu Xu
Comments: 29 pages
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[400]  arXiv:2405.02700 [pdf, other]
Title: Towards a Scalable Identification of Novel Modes in Generative Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[401]  arXiv:2405.02698 [pdf, ps, other]
Title: Stable Diffusion Dataset Generation for Downstream Classification Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[402]  arXiv:2405.02688 [pdf, other]
Title: Semi-supervised Symmetric Matrix Factorization with Low-Rank Tensor Representation
Subjects: Machine Learning (cs.LG)
[403]  arXiv:2405.02685 [pdf, other]
Title: FedProK: Trustworthy Federated Class-Incremental Learning via Prototypical Feature Knowledge Transfer
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[404]  arXiv:2405.02678 [pdf, other]
Title: Position Paper: Quo Vadis, Unsupervised Time Series Anomaly Detection?
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2405.02670 [pdf, other]
Title: From Generalization Analysis to Optimization Designs for State Space Models
Subjects: Machine Learning (cs.LG)
[406]  arXiv:2405.02661 [pdf, other]
Title: DDE-Find: Learning Delay Differential Equations from Data
Authors: Robert Stephany
Comments: 42 pages, 19 tables, 8 figures
Subjects: Machine Learning (cs.LG)
[407]  arXiv:2405.02649 [pdf, other]
Title: Generic Multi-modal Representation Learning for Network Traffic Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[408]  arXiv:2405.02648 [pdf, other]
Title: A Conformal Prediction Score that is Robust to Label Noise
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[409]  arXiv:2405.02644 [pdf, other]
Title: Interpretable Multi-View Clustering
Comments: 12 pages, 6 figures
Subjects: Machine Learning (cs.LG)
[410]  arXiv:2405.02642 [pdf, other]
Title: Machine Learning in Space: Surveying the Robustness of on-board ML models to Radiation
Subjects: Machine Learning (cs.LG)
[411]  arXiv:2405.02638 [pdf, other]
Title: PrivSGP-VR: Differentially Private Variance-Reduced Stochastic Gradient Push with Tight Utility Bounds
Comments: This paper has been accepted by the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)
Subjects: Machine Learning (cs.LG)
[412]  arXiv:2405.02634 [pdf, other]
Title: Onboard Out-of-Calibration Detection of Deep Learning Models using Conformal Prediction
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[413]  arXiv:2405.02631 [pdf, other]
Title: Unsupervised machine learning for data-driven classification of rock mass using drilling data: How can a data-driven system handle limitations in existing rock mass classification systems?
Comments: 38 pages, 11 figures. Includes ancillary interactive versions of some figures
Subjects: Machine Learning (cs.LG); Emerging Technologies (cs.ET); Systems and Control (eess.SY)
[414]  arXiv:2405.02628 [pdf, other]
Title: Contrastive Dual-Interaction Graph Neural Network for Molecular Property Prediction
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[415]  arXiv:2405.02612 [pdf, other]
Title: Learning Linear Utility Functions From Pairwise Comparison Queries
Comments: Submitted to ECAI for review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (stat.ML)
[416]  arXiv:2405.02609 [pdf, other]
Title: Advanced Equalization in 112 Gb/s Upstream PON Using a Novel Fourier Convolution-based Network
Comments: 4 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[417]  arXiv:2405.02598 [pdf, other]
Title: UDUC: An Uncertainty-driven Approach for Learning-based Robust Control
Subjects: Machine Learning (cs.LG)
[418]  arXiv:2405.02596 [pdf, other]
Title: Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[419]  arXiv:2405.02594 [pdf, other]
Title: Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
Comments: 24 pages, 5 figures. Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[420]  arXiv:2405.02576 [pdf, other]
Title: CTD4 - A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[421]  arXiv:2405.02574 [pdf, ps, other]
Title: A Data Mining-Based Dynamical Anomaly Detection Method for Integrating with an Advance Metering System
Authors: Sarit Maitra
Subjects: Machine Learning (cs.LG)
[422]  arXiv:2405.02572 [pdf, other]
Title: Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Comments: 12 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[423]  arXiv:2405.02569 [pdf, other]
Title: Decoupling Exploration and Exploitation for Unsupervised Pre-training with Successor Features
Comments: IJCNN 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[424]  arXiv:2405.02561 [pdf, other]
Title: Understanding the Difficulty of Solving Cauchy Problems with PINNs
Comments: 13 pages and 18 figures
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[425]  arXiv:2405.02534 [pdf, other]
Title: A Multi-Domain Multi-Task Approach for Feature Selection from Bulk RNA Datasets
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN)
[426]  arXiv:2405.02485 [pdf, other]
Title: A Survey of Few-Shot Learning for Biomedical Time Series
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[427]  arXiv:2405.02481 [pdf, other]
Title: Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
Comments: IJCAI'24 paper (longer version)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[428]  arXiv:2405.02478 [pdf, other]
Title: Continuous Learned Primal Dual
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[429]  arXiv:2405.02475 [pdf, other]
Title: Generalizing Orthogonalization for Models with Non-linearities
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation (stat.CO); Methodology (stat.ME)
[430]  arXiv:2405.02441 [pdf, ps, other]
Title: Learning minimal volume uncertainty ellipsoids
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[431]  arXiv:2405.02413 [pdf, other]
Title: A Unified Framework for Human-Allied Learning of Probabilistic Circuits
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[432]  arXiv:2405.02412 [pdf, other]
Title: Deep Learning and Transfer Learning Architectures for English Premier League Player Performance Forecasting
Comments: 10 pages
Subjects: Machine Learning (cs.LG)
[433]  arXiv:2405.02385 [pdf, other]
Title: Efficient Deep Learning with Decorrelated Backpropagation
Subjects: Machine Learning (cs.LG)
[434]  arXiv:2405.02377 [pdf, other]
Title: Robustness of Decentralised Learning to Nodes and Data Disruption
Comments: Supported by the H2020 HumaneAI Net (952026), CHIST-ERA-19-XAI010 SAI, PNRR - M4C2 - Investimento 1.3, Partenariato Esteso PE00000013 FAIR, PNRR - M4C2 - Investimento 1.3, Partenariato Esteso PE00000001 RESTART
Subjects: Machine Learning (cs.LG)
[435]  arXiv:2405.02375 [pdf, other]
Title: The Sparse Tsetlin Machine: Sparse Representation with Active Literals
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Formal Languages and Automata Theory (cs.FL)
[436]  arXiv:2405.02367 [pdf, other]
Title: Enhancing Social Media Post Popularity Prediction with Visual Content
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[437]  arXiv:2405.02364 [pdf, other]
Title: A Survey on Contribution Evaluation in Vertical Federated Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[438]  arXiv:2405.02360 [pdf, other]
Title: Holistic Evaluation Metrics: Use Case Sensitive Evaluation Metrics for Federated Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[439]  arXiv:2405.02359 [pdf, other]
Title: CVTGAD: Simplified Transformer with Cross-View Attention for Unsupervised Graph-level Anomaly Detection
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[440]  arXiv:2405.02358 [pdf, other]
Title: A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model
Comments: 5 figures, 6 tables, 41 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[441]  arXiv:2405.02357 [pdf, other]
Title: Large Language Models for Mobility in Transportation Systems: A Survey on Forecasting Tasks
Comments: 9 pages
Subjects: Machine Learning (cs.LG)
[442]  arXiv:2405.02356 [pdf, other]
Title: Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[443]  arXiv:2405.02354 [pdf, ps, other]
Title: Heterogeneous network and graph attention auto-encoder for LncRNA-disease association prediction
Comments: 10 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[444]  arXiv:2405.02351 [pdf, other]
Title: Towards General Neural Surrogate Solvers with Specialized Neural Accelerators
Comments: 8 pages, 7 Figures, to be published in ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Optics (physics.optics)
[445]  arXiv:2405.02350 [pdf, ps, other]
Title: What makes Models Compositional? A Theoretical View: With Supplement
Comments: Extended version of the original IJCAI 2024 paper with detailed supplementary materials (27 pages, 7 figures)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[446]  arXiv:2405.02349 [pdf, ps, other]
Title: Explainable Multi-Label Classification of MBTI Types
Comments: 22 pages, 12 tables, 2 figures
Subjects: Machine Learning (cs.LG)
[447]  arXiv:2405.02347 [pdf, other]
Title: COPAL: Continual Pruning in Large Language Generative Models
Comments: Accepted to ICML2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[448]  arXiv:2405.03688 (cross-list from cs.CL) [pdf, other]
Title: Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames
Comments: 15 pages, 9 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[449]  arXiv:2405.03685 (cross-list from cs.CV) [pdf, other]
Title: Language-Image Models with 3D Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[450]  arXiv:2405.03672 (cross-list from cs.CR) [pdf, other]
Title: Cutting through buggy adversarial example defenses: fixing 1 line of code breaks Sabre
Authors: Nicholas Carlini
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[451]  arXiv:2405.03667 (cross-list from eess.SP) [pdf, other]
Title: Fault Detection and Monitoring using an Information-Driven Strategy: Method, Theory, and Application
Comments: 28 pages, 11 figures
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG)
[452]  arXiv:2405.03661 (cross-list from cs.DS) [pdf, ps, other]
Title: Competitive strategies to use "warm start" algorithms with predictions
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[453]  arXiv:2405.03658 (cross-list from cs.CE) [pdf, other]
Title: A review on data-driven constitutive laws for solids
Comments: 57 pages, 7 Figures
Subjects: Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Applied Physics (physics.app-ph)
[454]  arXiv:2405.03651 (cross-list from cs.IR) [pdf, other]
Title: Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders
Comments: ICLR 2024
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[455]  arXiv:2405.03650 (cross-list from cs.CV) [pdf, other]
Title: Generated Contents Enrichment
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[456]  arXiv:2405.03642 (cross-list from cs.CV) [pdf, other]
Title: Classification of Breast Cancer Histopathology Images using a Modified Supervised Contrastive Learning Method
Comments: 16 pages, 3 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[457]  arXiv:2405.03636 (cross-list from cs.CR) [pdf, other]
Title: Federated Learning Privacy: Attacks, Defenses, Applications, and Policy Landscape - A Survey
Comments: Submitted to ACM Computing Surveys
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[458]  arXiv:2405.03549 (cross-list from stat.ML) [pdf, other]
Title: Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Dynamical Systems (math.DS); Probability (math.PR)
[459]  arXiv:2405.03546 (cross-list from cs.CV) [pdf, other]
Title: CCDM: Continuous Conditional Diffusion Models for Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[460]  arXiv:2405.03542 (cross-list from eess.SP) [pdf, other]
Title: Enhancing Channel Estimation in Quantized Systems with a Generative Prior
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG)
[461]  arXiv:2405.03541 (cross-list from cs.CV) [pdf, other]
Title: RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[462]  arXiv:2405.03534 (cross-list from cs.RO) [pdf, other]
Title: Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer
Comments: ICLR 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[463]  arXiv:2405.03526 (cross-list from cs.NI) [pdf, other]
Title: ReinWiFi: A Reinforcement-Learning-Based Framework for the Application-Layer QoS Optimization of WiFi Networks
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[464]  arXiv:2405.03484 (cross-list from cs.SD) [pdf, other]
Title: Whispy: Adapting STT Whisper Models to Real-Time Environments
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[465]  arXiv:2405.03472 (cross-list from math.OC) [pdf, other]
Title: A Symplectic Analysis of Alternating Mirror Descent
Comments: 95 pages, 3 figures
Subjects: Optimization and Control (math.OC); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Dynamical Systems (math.DS); Numerical Analysis (math.NA)
[466]  arXiv:2405.03468 (cross-list from stat.ML) [pdf, other]
Title: Hierarchic Flows to Estimate and Sample High-dimensional Probabilities
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn)
[467]  arXiv:2405.03462 (cross-list from cs.CV) [pdf, ps, other]
Title: A Lightweight Neural Architecture Search Model for Medical Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[468]  arXiv:2405.03440 (cross-list from cs.RO) [pdf, other]
Title: Robotic Constrained Imitation Learning for the Peg Transfer Task in Fundamentals of Laparoscopic Surgery
Comments: Accepted at ICRA2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[469]  arXiv:2405.03435 (cross-list from cond-mat.dis-nn) [pdf, other]
Title: A method for quantifying the generalization capabilities of generative models for solving Ising models
Comments: 10 pages, 7 figures
Journal-ref: Mach. Learn.: Sci. Technol. 5 (2024) 025011
Subjects: Disordered Systems and Neural Networks (cond-mat.dis-nn); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[470]  arXiv:2405.03419 (cross-list from cs.NE) [pdf, other]
Title: Automated Metaheuristic Algorithm Design with Autoregressive Learning
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[471]  arXiv:2405.03381 (cross-list from cs.CV) [pdf, other]
Title: Statistical Edge Detection And UDF Learning For Shape Representation
Authors: Virgile Foy (IMT), Fabrice Gamboa (IMT), Reda Chhaibi (IMT)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Applications (stat.AP)
[472]  arXiv:2405.03363 (cross-list from cs.HC) [pdf, other]
Title: Telextiles: End-to-end Remote Transmission of Fabric Tactile Sensation
Comments: 10 pages, 8 figures, Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
Journal-ref: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (2023)
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[473]  arXiv:2405.03314 (cross-list from cs.CV) [pdf, other]
Title: Deep Learning-based Point Cloud Registration for Augmented Reality-guided Surgery
Comments: 5 pages, 4 figures; accepted at IEEE ISBI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[474]  arXiv:2405.03311 (cross-list from cs.CV) [pdf, other]
Title: Federated Learning for Drowsiness Detection in Connected Vehicles
Comments: 14 pages, 8 figures, 1 table, EAI INTSYS 2023 conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[475]  arXiv:2405.03298 (cross-list from cs.CR) [pdf, other]
Title: Online Clustering of Known and Emerging Malware Families
Comments: arXiv admin note: text overlap with arXiv:2305.00605
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[476]  arXiv:2405.03293 (cross-list from astro-ph.IM) [pdf, other]
Title: Deep Learning and genetic algorithms for cosmological Bayesian inference speed-up
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Cosmology and Nongalactic Astrophysics (astro-ph.CO); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[477]  arXiv:2405.03235 (cross-list from cs.CV) [pdf, ps, other]
Title: Cross-Modal Domain Adaptation in Brain Disease Diagnosis: Maximum Mean Discrepancy-based Convolutional Neural Networks
Authors: Xuran Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[478]  arXiv:2405.03234 (cross-list from cs.HC) [pdf, other]
Title: A Reliable Framework for Human-in-the-Loop Anomaly Detection in Time Series
Comments: The manuscript is currently under review
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[479]  arXiv:2405.03221 (cross-list from cs.CV) [pdf, other]
Title: Spatial and Surface Correspondence Field for Interaction Transfer
Comments: Accepted to SIGGRAPH 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[480]  arXiv:2405.03205 (cross-list from cs.CL) [pdf, other]
Title: Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions
Authors: Ruizhe Li, Yanjun Gao
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[481]  arXiv:2405.03198 (cross-list from stat.ML) [pdf, other]
Title: Stability Evaluation via Distributional Perturbation Analysis
Comments: Accepted by ICML 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[482]  arXiv:2405.03180 (cross-list from stat.ML) [pdf, other]
Title: Braced Fourier Continuation and Regression for Anomaly Detection
Authors: Josef Sabuda
Comments: 16 pages, 9 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[483]  arXiv:2405.03162 (cross-list from cs.CV) [pdf, other]
[484]  arXiv:2405.03153 (cross-list from cs.CL) [pdf, ps, other]
Title: Exploring the Potential of the Large Language Models (LLMs) in Identifying Misleading News Headlines
Comments: 5 pages, 2 tables, 1st HEAL Workshop at CHI Conference on Human Factors in Computing Systems, May 12, Honolulu, HI, USA 2024
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[485]  arXiv:2405.03150 (cross-list from cs.CV) [pdf, other]
Title: Video Diffusion Models: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[486]  arXiv:2405.03144 (cross-list from cs.CV) [pdf, other]
Title: PTQ4SAM: Post-Training Quantization for Segment Anything
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[487]  arXiv:2405.03133 (cross-list from cs.CL) [pdf, other]
Title: Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
Comments: 21 pages, 12 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[488]  arXiv:2405.03131 (cross-list from cs.IT) [pdf, other]
Title: WDMoE: Wireless Distributed Large Language Models with Mixture of Experts
Comments: submitted to IEEE conference
Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[489]  arXiv:2405.03130 (cross-list from stat.ML) [pdf, other]
Title: Deep Learning for Causal Inference: A Comparison of Architectures for Heterogeneous Treatment Effect Estimation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[490]  arXiv:2405.03092 (cross-list from cond-mat.mtrl-sci) [pdf, other]
Title: Bayesian optimization for stable properties amid processing fluctuations in sputter deposition
Journal-ref: J. Vac. Sci. Technol. A 1 May 2024; 42 (3): 033408
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Optimization and Control (math.OC)
[491]  arXiv:2405.03091 (cross-list from cs.CV) [pdf, ps, other]
Title: Research on Image Recognition Technology Based on Multimodal Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[492]  arXiv:2405.03084 (cross-list from cs.CL) [pdf, ps, other]
Title: Analyzing Emotional Trends from X platform using SenticNet: A Comparative Analysis with Cryptocurrency Price
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[493]  arXiv:2405.03083 (cross-list from stat.ME) [pdf, other]
Title: Causal K-Means Clustering
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Machine Learning (stat.ML)
[494]  arXiv:2405.03063 (cross-list from math.ST) [pdf, other]
Title: Stability of a Generalized Debiased Lasso with Applications to Resampling-Based Variable Selection
Authors: Jingbo Liu
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[495]  arXiv:2405.03008 (cross-list from eess.IV) [pdf, other]
Title: DVMSR: Distillated Vision Mamba for Efficient Super-Resolution
Comments: 8 pages, 8 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[496]  arXiv:2405.03004 (cross-list from cs.CL) [pdf, other]
Title: Exploring prompts to elicit memorization in masked language model-based named entity recognition
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[497]  arXiv:2405.02984 (cross-list from cs.CL) [pdf, other]
Title: E-TSL: A Continuous Educational Turkish Sign Language Dataset with Baseline Methods
Comments: 7 pages, 3 figures, 4 tables, submitted to IEEE conference
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[498]  arXiv:2405.02977 (cross-list from cs.CV) [pdf, other]
Title: SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint Sequences
Comments: 8 pages, 5 figures, 7 tables, submitted to IEEE conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[499]  arXiv:2405.02968 (cross-list from cs.RO) [pdf, other]
Title: CoverLib: Classifiers-equipped Experience Library by Iterative Problem Distribution Coverage Maximization for Domain-tuned Motion Planning
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[500]  arXiv:2405.02954 (cross-list from cs.CV) [pdf, other]
Title: Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-Training
Comments: Extension of ICCV paper arXiv:2212.07585, submitted to IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[501]  arXiv:2405.02953 (cross-list from eess.SY) [pdf, other]
Title: Analysis of the Identifying Regulation with Adversarial Surrogates Algorithm
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[502]  arXiv:2405.02917 (cross-list from cs.CV) [pdf, other]
Title: Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models
Comments: 8 pages, with appendix. To appear in TrustNLP workshop @ NAACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[503]  arXiv:2405.02903 (cross-list from cs.CE) [pdf, other]
Title: Predicting Open-Hole Laminates Failure Using Support Vector Machines With Classical and Quantum Kernels
Subjects: Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[504]  arXiv:2405.02876 (cross-list from cs.NE) [pdf, ps, other]
Title: Exploring the Improvement of Evolutionary Computation via Large Language Models
Comments: accepted by GECCO 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[505]  arXiv:2405.02861 (cross-list from cs.CL) [pdf, other]
Title: Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models
Comments: 24 pages, 17 figures, 10 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[506]  arXiv:2405.02828 (cross-list from cs.SE) [pdf, other]
Title: Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy
Comments: arXiv admin note: substantial text overlap with arXiv:2305.03803
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[507]  arXiv:2405.02821 (cross-list from cs.SD) [pdf, other]
Title: Sim2Real Transfer for Audio-Visual Navigation with Frequency-Adaptive Acoustic Field Prediction
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
[508]  arXiv:2405.02816 (cross-list from cs.CL) [pdf, other]
Title: Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
Comments: To appear in the proceedings of SIGIR 2024
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[509]  arXiv:2405.02797 (cross-list from cs.CV) [pdf, other]
Title: Adapting to Distribution Shift by Visual Domain Prompt Generation
Comments: ICLR2024, code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[510]  arXiv:2405.02790 (cross-list from cs.CR) [pdf, other]
Title: Confidential and Protected Disease Classifier using Fully Homomorphic Encryption
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[511]  arXiv:2405.02783 (cross-list from stat.ML) [pdf, other]
Title: Linear Noise Approximation Assisted Bayesian Inference on Mechanistic Model of Partially Observed Stochastic Reaction Network
Authors: Wandi Xu, Wei Xie
Comments: 11 pages, 2 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[512]  arXiv:2405.02771 (cross-list from cs.CV) [pdf, other]
Title: MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning
Comments: Data and code is available on the project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[513]  arXiv:2405.02764 (cross-list from cs.CL) [pdf, other]
Title: Assessing Adversarial Robustness of Large Language Models: An Empirical Study
Comments: 16 pages, 9 figures, 10 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[514]  arXiv:2405.02762 (cross-list from cs.CV) [pdf, other]
Title: TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes
Comments: 8 pages, submitted to IROS2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[515]  arXiv:2405.02754 (cross-list from cs.RO) [pdf, other]
Title: Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
Comments: submissions to Journal of Artificial Intelligence Research. arXiv admin note: text overlap with arXiv:2308.13140
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[516]  arXiv:2405.02679 (cross-list from physics.ao-ph) [pdf, other]
Title: Prévisions météorologiques basées sur l'intelligence artificielle : une révolution peut en cacher une autre [Artificial intelligence-based weather forecasts: one revolution may hide another]
Comments: 8 pages, in French
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (cs.LG)
[517]  arXiv:2405.02563 (cross-list from eess.SP) [pdf, other]
Title: Deep Representation Learning-Based Dynamic Trajectory Phenotyping for Acute Respiratory Failure in Medical Intensive Care Units
Comments: 9 pages
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[518]  arXiv:2405.02548 (cross-list from cs.CR) [pdf, other]
Title: CNN-LSTM and Transfer Learning Models for Malware Classification based on Opcodes and API Calls
Journal-ref: Bensaoud, A., & Kalita, J. (2024). CNN-LSTM and transfer learning models for malware classification based on opcodes and API calls. Knowledge-Based Systems, 111543
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[519]  arXiv:2405.02545 (cross-list from astro-ph.SR) [pdf, other]
Title: Prediction of Space Weather Events through Analysis of Active Region Magnetograms using Convolutional Neural Network
Authors: Shlesh Sakpal
Comments: 6 pages, 12 figures
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Earth and Planetary Astrophysics (astro-ph.EP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[520]  arXiv:2405.02488 (cross-list from stat.ML) [pdf, other]
Title: Modelling Sampling Distributions of Test Statistics with Autograd
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); Computation (stat.CO)
[521]  arXiv:2405.02466 (cross-list from cs.CR) [pdf, ps, other]
Title: ProFLingo: A Fingerprinting-based Copyright Protection Scheme for Large Language Models
Comments: This is the author's pre-print version of the work. It is posted here for your personal use. Not for redistribution
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[522]  arXiv:2405.02456 (cross-list from math.OC) [pdf, ps, other]
Title: Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[523]  arXiv:2405.02449 (cross-list from stat.ML) [pdf, other]
Title: Quality-Weighted Vendi Scores And Their Application To Diverse Experimental Design
Comments: Published in International Conference on Machine Learning, ICML 2024. Code can be found in the Vertaix GitHub: this https URL Paper dedicated to Kwame Nkrumah
Subjects: Machine Learning (stat.ML); Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[524]  arXiv:2405.02437 (cross-list from cs.CR) [pdf, other]
Title: FastLloyd: Federated, Accurate, Secure, and Tunable $k$-Means Clustering with Differential Privacy
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[525]  arXiv:2405.02429 (cross-list from cs.IR) [pdf, other]
Title: CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[526]  arXiv:2405.02384 (cross-list from cs.NE) [pdf, other]
Title: CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[527]  arXiv:2405.02383 (cross-list from stat.ML) [pdf, other]
Title: A Fresh Look at Sanity Checks for Saliency Maps
Comments: arXiv admin note: text overlap with arXiv:2401.06465
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[528]  arXiv:2405.02374 (cross-list from q-bio.QM) [pdf, other]
Title: Protein binding affinity prediction under multiple substitutions applying eGNNs on Residue and Atomic graphs combined with Language model information: eGRAL
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[529]  arXiv:2405.02373 (cross-list from math.OC) [pdf, other]
Title: Exponentially Weighted Algorithm for Online Network Resource Allocation with Long-Term Constraints
Comments: arXiv admin note: text overlap with arXiv:2305.15558
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[530]  arXiv:2405.02372 (cross-list from stat.ML) [pdf, ps, other]
Title: Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and Convergence
Comments: Accepted at ICML2024
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[531]  arXiv:2405.02371 (cross-list from cs.NE) [pdf, ps, other]
Title: Architecture of a Cortex Inspired Hierarchical Event Recaller
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[532]  arXiv:2405.02369 (cross-list from cs.NE) [pdf, other]
Title: No One-Size-Fits-All Neurons: Task-based Neurons for Artificial Neural Networks
Comments: 12 pages, 4 figures
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[533]  arXiv:2405.02366 (cross-list from astro-ph.IM) [pdf, other]
Title: Bayesian and Convolutional Networks for Hierarchical Morphological Classification of Galaxies
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA); Machine Learning (cs.LG)
[534]  arXiv:2405.02353 (cross-list from cs.CL) [pdf, other]
Title: Early Transformers: A study on Efficient Training of Transformer Models through Early-Bird Lottery Tickets
Authors: Shravan Cheekati
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[535]  arXiv:2405.02346 (cross-list from cs.CR) [pdf, other]
Title: Temporal assessment of malicious behaviors: application to turnout field data monitoring
Comments: To be published in the International Conference on Control, Automation and Diagnosis (ICCAD24)
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Systems and Control (eess.SY)
[536]  arXiv:2405.02344 (cross-list from cs.CR) [pdf, other]
Title: Backdoor-based Explainable AI Benchmark for High Fidelity Evaluation of Attribution Methods
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[537]  arXiv:2405.02341 (cross-list from cs.CR) [pdf, other]
Title: Improved Communication-Privacy Trade-offs in $L_2$ Mean Estimation under Streaming Differential Privacy
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[538]  arXiv:2405.02340 (cross-list from stat.AP) [pdf, other]
Title: A Comprehensive Approach to Carbon Dioxide Emission Analysis in High Human Development Index Countries using Statistical and Machine Learning Techniques
Subjects: Applications (stat.AP); Machine Learning (cs.LG)
[539]  arXiv:2405.02336 (cross-list from cs.AI) [pdf, other]
Title: Artificial General Intelligence (AGI)-Native Wireless Systems: A Journey Beyond 6G
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[540]  arXiv:2405.02335 (cross-list from cs.IT) [pdf, other]
Title: sDAC -- Semantic Digital Analog Converter for Semantic Communications
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG)
[541]  arXiv:2405.02334 (cross-list from cs.CV) [pdf, other]
Title: Rad4XCNN: a new agnostic method for post-hoc global explanation of CNN-derived features by means of radiomics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[542]  arXiv:2405.02330 (cross-list from cs.IT) [pdf, other]
Title: Adaptive Semantic Token Selection for AI-native Goal-oriented Communications
Comments: 5 pages
Subjects: Information Theory (cs.IT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[543]  arXiv:2405.02326 (cross-list from cs.AR) [pdf, other]
Title: Evaluating LLMs for Hardware Design and Test
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Programming Languages (cs.PL)
[544]  arXiv:2405.02323 (cross-list from cs.AR) [pdf, ps, other]
Title: CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture
Comments: The article was submitted to the International Journal of Parallel Programming (IJPP) and is currently under review
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG); Signal Processing (eess.SP)
[545]  arXiv:2405.02318 (cross-list from cs.CL) [pdf, other]
Title: NL2FOL: Translating Natural Language to First-Order Logic for Logical Fallacy Detection
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[546]  arXiv:2405.02316 (cross-list from eess.SY) [pdf, ps, other]
Title: A Cloud-Edge Framework for Energy-Efficient Event-Driven Control: An Integration of Online Supervised Learning, Spiking Neural Networks and Local Plasticity Rules
Comments: 13 pages, 19 figures
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[547]  arXiv:2405.02299 (cross-list from cs.CE) [pdf, other]
Title: Deep Reinforcement Learning for Modelling Protein Complexes
Comments: International Conference on Learning Representations (ICLR 2024)
Subjects: Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[548]  arXiv:2405.02295 (cross-list from cs.CV) [pdf, other]
Title: Neural Additive Image Model: Interpretation through Interpolation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[549]  arXiv:2405.02292 (cross-list from cs.RO) [pdf, other]
Title: ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation
Comments: Project website: aloha-2.github.io
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)

Mon, 6 May 2024

[550]  arXiv:2405.02267 [pdf, other]
Title: Structural Pruning of Pre-trained Language Models via Neural Architecture Search
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[551]  arXiv:2405.02240 [pdf, other]
Title: Subgraph2vec: A random walk-based algorithm for embedding knowledge graphs
Subjects: Machine Learning (cs.LG)
[552]  arXiv:2405.02235 [pdf, other]
Title: Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG)
[553]  arXiv:2405.02200 [pdf, other]
Title: Position Paper: Rethinking Empirical Research in Machine Learning: Addressing Epistemic and Methodological Challenges of Experimentation
Comments: Accepted for publication at ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[554]  arXiv:2405.02183 [pdf, other]
Title: Metalearners for Ranking Treatment Effects
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[555]  arXiv:2405.02181 [pdf, other]
Title: Imitation Learning in Discounted Linear MDPs without exploration assumptions
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG)
[556]  arXiv:2405.02180 [pdf, other]
Title: A Flow-Based Model for Conditional and Probabilistic Electricity Consumption Profile Generation and Prediction
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[557]  arXiv:2405.02161 [pdf, other]
Title: Simulating the economic impact of rationality through reinforcement learning and agent-based modelling
Comments: 8 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Multiagent Systems (cs.MA); General Economics (econ.GN)
[558]  arXiv:2405.02154 [pdf, other]
Title: Neural Context Flows for Learning Generalizable Dynamical Systems
Comments: 14 pages, 5 figures
Subjects: Machine Learning (cs.LG); Dynamical Systems (math.DS)
[559]  arXiv:2405.02140 [pdf, other]
Title: An Information Theoretic Perspective on Conformal Prediction
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[560]  arXiv:2405.02098 [pdf, other]
Title: Forecasting Ferry Passenger Flow Using Long-Short Term Memory Neural Networks
Authors: Daniel Fesalbon
Journal-ref: IJSRED - Volume 7, Issue 3 May-June 2024
Subjects: Machine Learning (cs.LG)
[561]  arXiv:2405.02086 [pdf, other]
Title: Multi-level projection with exponential parallel speedup; Application to sparse auto-encoders neural networks
Subjects: Machine Learning (cs.LG)
[562]  arXiv:2405.02081 [pdf, other]
Title: A Mutual Information Perspective on Federated Contrastive Learning
Comments: Published as a conference paper at ICLR 2024
Subjects: Machine Learning (cs.LG)
[563]  arXiv:2405.02074 [pdf, other]
Title: A Federated Learning Benchmark on Tabular Data: Comparing Tree-Based Models and Neural Networks
Comments: 8 pages, 6 figures, 6 tables, FMEC 2023 (best paper)
Subjects: Machine Learning (cs.LG)
[564]  arXiv:2405.02067 [pdf, other]
Title: Histogram-Based Federated XGBoost using Minimal Variance Sampling for Federated Tabular Data
Comments: 6 figures, 5 tables, 8 pages, FLTA 2023 (together with FMEC 2023)
Subjects: Machine Learning (cs.LG)
[565]  arXiv:2405.02063 [pdf, other]
Title: Few-sample Variational Inference of Bayesian Neural Networks with Arbitrary Nonlinearities
Authors: David J. Schodt
Subjects: Machine Learning (cs.LG)
[566]  arXiv:2405.02062 [pdf, other]
Title: Dyna-Style Learning with A Macroscopic Model for Vehicle Platooning in Mixed-Autonomy Traffic
Subjects: Machine Learning (cs.LG)
[567]  arXiv:2405.02060 [pdf, other]
Title: Federated Learning for Tabular Data using TabNet: A Vehicular Use-Case
Comments: 7 pages, 9 figures, 1 table, ICCP Conference 2022
Subjects: Machine Learning (cs.LG)
[568]  arXiv:2405.02044 [pdf, other]
Title: Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Systems and Control (eess.SY); Optimization and Control (math.OC)
[569]  arXiv:2405.02041 [pdf, other]
Title: Stabilizing Backpropagation Through Time to Learn Complex Physics
Comments: Published at ICLR 2024, code available at this https URL
Subjects: Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[570]  arXiv:2405.01995 [pdf, other]
Title: Cooperation and Federation in Distributed Radar Point Cloud Processing
Journal-ref: 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[571]  arXiv:2405.01990 [pdf, other]
Title: Soft Label PU Learning
Subjects: Machine Learning (cs.LG)
[572]  arXiv:2405.01978 [pdf, other]
Title: Quantifying Distribution Shifts and Uncertainties for Enhanced Model Robustness in Machine Learning Applications
Authors: Vegard Flovik
Comments: Working paper
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[573]  arXiv:2405.01974 [pdf, other]
Title: Multitask Extension of Geometrically Aligned Transfer Encoder
Comments: 7 pages, 3 figures, 2 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[574]  arXiv:2405.01927 [pdf, other]
Title: SlotGAT: Slot-based Message Passing for Heterogeneous Graph Neural Network
Comments: Published as a conference paper at ICML 2023
Subjects: Machine Learning (cs.LG)
[575]  arXiv:2405.01851 [pdf, other]
Title: Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[576]  arXiv:2405.01843 [pdf, ps, other]
Title: Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization
Comments: Accepted at ICML 2024. This is a revised version of arXiv:2306.10486, where we have gone from finite action space to continuous action space, from average iterate convergence to last iterate convergence and from $\epsilon^{-4}$ to $\epsilon^{-3}$ sample complexity
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[577]  arXiv:2405.01838 [pdf, other]
Title: A Novel Approach to Guard from Adversarial Attacks using Stable Diffusion
Subjects: Machine Learning (cs.LG)
[578]  arXiv:2405.01817 [pdf, other]
Title: Uniformly Stable Algorithms for Adversarial Training and Beyond
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[579]  arXiv:2405.01814 [pdf, other]
Title: Efficient and Economic Large Language Model Inference with Attention Offloading
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[580]  arXiv:2405.01778 [pdf, other]
Title: Hierarchical mixture of discriminative Generalized Dirichlet classifiers
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[581]  arXiv:2405.01762 [pdf, ps, other]
Title: EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear Time
Comments: 19 pages
Journal-ref: ICML 2024
Subjects: Machine Learning (cs.LG)
[582]  arXiv:2405.01760 [pdf, other]
Title: Reinforcement Learning-Guided Semi-Supervised Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[583]  arXiv:2405.01744 [pdf, other]
Title: ALCM: Autonomous LLM-Augmented Causal Discovery Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Methodology (stat.ME)
[584]  arXiv:2405.01739 [pdf, other]
Title: Enhancing User Experience in On-Device Machine Learning with Gated Compression Layers
Comments: Initial Submission
Subjects: Machine Learning (cs.LG)
[585]  arXiv:2405.01731 [pdf, other]
Title: Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
Comments: Accepted to ICML2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[586]  arXiv:2405.01719 [pdf, other]
Title: Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks
Comments: To be published in ICML 2024
Subjects: Machine Learning (cs.LG)
[587]  arXiv:2405.01718 [pdf, other]
Title: Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk
Authors: Xinyi Ni, Lifeng Lai
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[588]  arXiv:2405.01714 [pdf, other]
Title: Interpretable Vital Sign Forecasting with Model Agnostic Attention Maps
Comments: 8 pages, 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[589]  arXiv:2405.01711 [pdf, ps, other]
Title: Individual Fairness Through Reweighting and Tuning
Comments: 14 pages, 1 figure, and 2 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[590]  arXiv:2405.01708 [pdf, other]
Title: A deep causal inference model for fully-interpretable travel behaviour analysis
Subjects: Machine Learning (cs.LG)
[591]  arXiv:2405.01704 [pdf, other]
Title: Privacy-aware Berrut Approximated Coded Computing for Federated Learning
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[592]  arXiv:2405.01702 [pdf, other]
Title: Optimization without retraction on the random generalized Stiefel manifold
Comments: 21 pages, 10 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[593]  arXiv:2405.01684 [pdf, other]
Title: Intelligent Switching for Reset-Free RL
Comments: Published at ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[594]  arXiv:2405.01680 [pdf, other]
Title: Physics-Informed Neural Networks: Minimizing Residual Loss with Wide Networks and Effective Activations
Comments: Accepted at IJCAI 2024
Subjects: Machine Learning (cs.LG)
[595]  arXiv:2405.01677 [pdf, other]
Title: Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[596]  arXiv:2405.01663 [pdf, ps, other]
Title: ATNPA: A Unified View of Oversmoothing Alleviation in Graph Neural Networks
Comments: 16 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[597]  arXiv:2405.01661 [pdf, other]
Title: When a Relation Tells More Than a Concept: Exploring and Evaluating Classifier Decisions with CoReX
Comments: preliminary version, submitted to Machine Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[598]  arXiv:2405.01617 [pdf, other]
Title: An Explainable and Conformal AI Model to Detect Temporomandibular Joint Involvement in Children Suffering from Juvenile Idiopathic Arthritis
Comments: Accepted at EMBC 2024
Subjects: Machine Learning (cs.LG)
[599]  arXiv:2405.01614 [pdf, other]
Title: A probabilistic estimation of remaining useful life from censored time-to-event data
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[600]  arXiv:2405.01611 [pdf, other]
Title: Unifying and extending Precision Recall metrics for assessing generative models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[601]  arXiv:2405.01607 [pdf, other]
Title: Wildfire Risk Prediction: A Review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[602]  arXiv:2405.01603 [pdf, other]
Title: KITE: A Kernel-based Improved Transferability Estimation Method
Authors: Yunhui Guo
Comments: 14 pages
Subjects: Machine Learning (cs.LG)
[603]  arXiv:2405.01563 [pdf, other]
Title: Mitigating LLM Hallucinations via Conformal Abstention
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[604]  arXiv:2405.01557 [pdf, other]
Title: An Experimental Study on the Rashomon Effect of Balancing Methods in Imbalanced Classification
Comments: 16 pages, 6 figures
Subjects: Machine Learning (cs.LG)
[605]  arXiv:2405.01554 [pdf, other]
Title: Early-stage detection of cognitive impairment by hybrid quantum-classical algorithm using resting-state functional MRI time-series
Comments: 28 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[606]  arXiv:2405.02225 (cross-list from stat.ML) [pdf, other]
Title: Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks
Comments: 28 pages, 8 figures, accepted by ICML2024
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Methodology (stat.ME)
[607]  arXiv:2405.02221 (cross-list from math.NA) [pdf, other]
Title: Discretization Error of Fourier Neural Operators
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[608]  arXiv:2405.02220 (cross-list from cs.CV) [pdf, other]
Title: Designed Dithering Sign Activation for Binary Neural Networks
Comments: 7 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[609]  arXiv:2405.02213 (cross-list from cs.SE) [pdf, other]
Title: Automatic Programming: Large Language Models and Beyond
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[610]  arXiv:2405.02201 (cross-list from math.OC) [pdf, other]
Title: Regularized Q-learning through Robust Averaging
Comments: 26 pages, 5 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[611]  arXiv:2405.02195 (cross-list from cs.CL) [pdf, ps, other]
Title: Impact of emoji exclusion on the performance of Arabic sarcasm detection models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[612]  arXiv:2405.02191 (cross-list from cs.CV) [pdf, ps, other]
Title: Non-Destructive Peat Analysis using Hyperspectral Imaging and Machine Learning
Comments: 4 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[613]  arXiv:2405.02188 (cross-list from stat.ML) [pdf, other]
Title: Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[614]  arXiv:2405.02175 (cross-list from cs.CL) [pdf, other]
Title: Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Comments: Short paper
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[615]  arXiv:2405.02148 (cross-list from cs.AI) [pdf, ps, other]
Title: Towards a Formal Creativity Theory: Preliminary results in Novelty and Transformativeness
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[616]  arXiv:2405.02141 (cross-list from cs.IR) [pdf, other]
Title: Multi-Objective Recommendation via Multivariate Policy Learning
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[617]  arXiv:2405.02124 (cross-list from eess.AS) [pdf, other]
Title: TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[618]  arXiv:2405.02119 (cross-list from cs.SD) [pdf, other]
Title: Can We Identify Unknown Audio Recording Environments in Forensic Scenarios?
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[619]  arXiv:2405.02101 (cross-list from eess.SP) [pdf, other]
Title: Discrete Aware Matrix Completion via Convexized $\ell_0$-Norm Approximation
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[620]  arXiv:2405.02082 (cross-list from stat.ML) [pdf, ps, other]
Title: A comparative study of conformal prediction methods for valid uncertainty quantification in machine learning
Authors: Nicolas Dewolf
Comments: At 339 pages, this document is a live/working version of my PhD dissertation published in 2024 by the University of Ghent (UGent)
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Statistics Theory (math.ST)
[621]  arXiv:2405.01994 (cross-list from stat.ML) [pdf, ps, other]
Title: Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery
Authors: Patrick Saux
Comments: Doctoral thesis. Some pdf readers (e.g. Firefox) have trouble rendering the theorems/definitions environment. When reading online, please prefer e.g. Chrome
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[622]  arXiv:2405.01988 (cross-list from cs.SD) [pdf, other]
Title: Joint sentiment analysis of lyrics and audio in music
Comments: published at DAGA 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[623]  arXiv:2405.01976 (cross-list from cs.CL) [pdf, other]
Title: Conformal Prediction for Natural Language Processing: A Survey
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[624]  arXiv:2405.01975 (cross-list from cs.CE) [pdf, other]
Title: Introducing a microstructure-embedded autoencoder approach for reconstructing high-resolution solution field data from a reduced parametric space
Subjects: Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[625]  arXiv:2405.01964 (cross-list from stat.ML) [pdf, other]
Title: Understanding LLMs Requires More Than Statistical Generalization
Comments: Accepted at ICML2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[626]  arXiv:2405.01963 (cross-list from cs.CR) [pdf, other]
Title: From Attack to Defense: Insights into Deep Learning Security Measures in Black-Box Settings
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[627]  arXiv:2405.01952 (cross-list from stat.ML) [pdf, other]
Title: Three Quantization Regimes for ReLU Networks
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[628]  arXiv:2405.01943 (cross-list from cs.CL) [pdf, other]
Title: Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[629]  arXiv:2405.01934 (cross-list from cs.CV) [pdf, other]
Title: Impact of Architectural Modifications on Deep Learning Adversarial Robustness
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[630]  arXiv:2405.01906 (cross-list from cs.AI) [pdf, other]
Title: Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization
Comments: 17 pages, 6 figures
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[631]  arXiv:2405.01883 (cross-list from cs.CL) [pdf, other]
Title: DALLMi: Domain Adaption for LLM-based Multi-label Classifier
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[632]  arXiv:2405.01881 (cross-list from q-fin.RM) [pdf, ps, other]
Title: Explainable Risk Classification in Financial Reports
Comments: ICIS 2023 Proceedings. 3. this https URL
Subjects: Risk Management (q-fin.RM); Machine Learning (cs.LG)
[633]  arXiv:2405.01873 (cross-list from cs.CL) [pdf, other]
Title: Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language
Comments: This paper contains 6 pages, 8 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[634]  arXiv:2405.01859 (cross-list from cs.CY) [pdf, other]
Title: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research
Comments: 9 pages, in ICML 2024
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[635]  arXiv:2405.01855 (cross-list from cs.IR) [pdf, ps, other]
Title: Robust Explainable Recommendation
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[636]  arXiv:2405.01849 (cross-list from cs.IR) [pdf, ps, other]
Title: Stability of Explainable Recommendation
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[637]  arXiv:2405.01848 (cross-list from cs.IR) [pdf, other]
Title: RankSHAP: a Gold Standard Feature Attribution Method for the Ranking Task
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[638]  arXiv:2405.01810 (cross-list from cs.AI) [pdf, other]
Title: Non-linear Welfare-Aware Strategic Learning
Authors: Tian Xie, Xueru Zhang
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[639]  arXiv:2405.01792 (cross-list from cs.RO) [pdf, other]
Title: Learning Robust Autonomous Navigation and Locomotion for Wheeled-Legged Robots
Journal-ref: Science Robotics, 2024, Vol 9, Issue 89
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
[640]  arXiv:2405.01776 (cross-list from cs.RO) [pdf, other]
Title: An Approach to Systematic Data Acquisition and Data-Driven Simulation for the Safety Testing of Automated Driving Functions
Comments: 8 pages, 5 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[641]  arXiv:2405.01775 (cross-list from cs.AR) [pdf, other]
Title: Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design
Comments: Accepted for publication at MLSys 2024
Subjects: Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[642]  arXiv:2405.01761 (cross-list from stat.ML) [pdf, other]
Title: Multivariate Bayesian Last Layer for Regression: Uncertainty Quantification and Disentanglement
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[643]  arXiv:2405.01758 (cross-list from cs.RO) [pdf, other]
Title: CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning
Comments: 8 pages, 3 figures
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
[644]  arXiv:2405.01745 (cross-list from cs.AI) [pdf, other]
Title: Large Language Models for UAVs: Current State and Pathways to the Future
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[645]  arXiv:2405.01741 (cross-list from cs.CR) [pdf, other]
Title: PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Machine Learning (cs.LG)
[646]  arXiv:2405.01737 (cross-list from stat.ML) [pdf, other]
Title: Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[647]  arXiv:2405.01726 (cross-list from eess.IV) [pdf, ps, other]
Title: SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[648]  arXiv:2405.01725 (cross-list from eess.IV) [pdf, other]
Title: Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[649]  arXiv:2405.01691 (cross-list from cs.CV) [pdf, other]
Title: Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
Comments: Presented at the Robot Trust for Symbiotic Societies (RTSS) Workshop, co-located with ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[650]  arXiv:2405.01656 (cross-list from cs.CV) [pdf, other]
Title: S4: Self-Supervised Sensing Across the Spectrum
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[651]  arXiv:2405.01616 (cross-list from q-bio.BM) [pdf, other]
[652]  arXiv:2405.01615 (cross-list from cs.NE) [pdf, other]
Title: Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
Comments: 16 pages, including proofs in the appendix
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[653]  arXiv:2405.01606 (cross-list from quant-ph) [pdf, other]
Title: Improving Trainability of Variational Quantum Circuits via Regularization Strategies
Comments: preprint, under review. TL;DR: we propose a regularization strategy to improve the trainability of VQCs
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[654]  arXiv:2405.01604 (cross-list from q-fin.PM) [pdf, other]
Title: Portfolio Management using Deep Reinforcement Learning
Comments: 7 pages, 9 figures
Subjects: Portfolio Management (q-fin.PM); Machine Learning (cs.LG)
[655]  arXiv:2405.01601 (cross-list from cs.CL) [pdf, other]
Title: Efficient Sample-Specific Encoder Perturbations
Comments: To appear in NAACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[656]  arXiv:2405.01600 (cross-list from eess.IV) [pdf, other]
Title: Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification
Comments: 7 Pages double column, 5 figures, and 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[657]  arXiv:2405.01587 (cross-list from cs.CL) [pdf, ps, other]
Title: Improve Academic Query Resolution through BERT-based Question Extraction from Images
Journal-ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[658]  arXiv:2405.01584 (cross-list from cs.CL) [pdf, other]
Title: Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression
Comments: 12 pages, TKDE format
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Signal Processing (eess.SP)
[659]  arXiv:2405.01583 (cross-list from cs.CL) [pdf, other]
Title: MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning
Authors: Nadia Saeed
Comments: 7 pages, 3 figures, Clinical NLP 2024 workshop proceedings in Shared Task
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[660]  arXiv:2405.01582 (cross-list from cs.CL) [pdf, other]
Title: Text Quality-Based Pruning for Efficient Training of Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[661]  arXiv:2405.01579 (cross-list from cs.SE) [pdf, other]
Title: Mining patterns in syntax trees to automate code reviews of student solutions for programming exercises
Subjects: Software Engineering (cs.SE); Computers and Society (cs.CY); Machine Learning (cs.LG)
[662]  arXiv:2405.01577 (cross-list from cs.CL) [pdf, other]
Title: HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[663]  arXiv:2405.01576 (cross-list from cs.CL) [pdf, other]
Title: Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI Assistant
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[664]  arXiv:2405.01559 (cross-list from cs.SE) [pdf, other]
Title: Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks
Comments: accepted at 1st ACM CHI Workshop on Human-Notebook Interactions
Subjects: Software Engineering (cs.SE); Machine Learning (cs.LG)
[665]  arXiv:2405.01558 (cross-list from cs.CV) [pdf, other]
Title: Configurable Learned Holography
Comments: 14 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optics (physics.optics)
[666]  arXiv:2405.01540 (cross-list from cs.AI) [pdf, other]
Title: Universal Imitation Games
Comments: 98 pages. arXiv admin note: substantial text overlap with arXiv:2402.18732
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)