We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 236

[ total of 1090 entries: 1-1000 | 237-1090 ]
[ showing 1000 entries per page: fewer | more | all ]

Wed, 5 Jun 2024 (continued, showing last 205 of 208 entries)

[237]  arXiv:2406.02542 [pdf, other]
Title: Loki: Low-Rank Keys for Efficient Sparse Attention
Subjects: Machine Learning (cs.LG)
[238]  arXiv:2406.02515 [pdf, ps, other]
Title: Uncertainty of Joint Neural Contextual Bandit
Subjects: Machine Learning (cs.LG)
[239]  arXiv:2406.02510 [pdf, other]
Title: Fairness-Optimized Synthetic EHR Generation for Arbitrary Downstream Predictive Tasks
Subjects: Machine Learning (cs.LG)
[240]  arXiv:2406.02500 [pdf, other]
Title: Demystifying the Compression of Mixture-of-Experts Through a Unified Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[241]  arXiv:2406.02496 [pdf, other]
Title: Kolmogorov-Arnold Networks for Time Series: Bridging Predictive Power and Interpretability
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[242]  arXiv:2406.02490 [pdf, other]
Title: Ai-Sampler: Adversarial Learning of Markov kernels with involutive maps
Journal-ref: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[243]  arXiv:2406.02486 [pdf, other]
Title: A Temporal Kolmogorov-Arnold Transformer for Time Series Forecasting
Comments: arXiv admin note: text overlap with arXiv:2405.07344
Subjects: Machine Learning (cs.LG)
[244]  arXiv:2406.02479 [pdf, ps, other]
Title: Applying Fine-Tuned LLMs for Reducing Data Needs in Load Profile Analysis
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Systems and Control (eess.SY)
[245]  arXiv:2406.02469 [pdf, other]
Title: Landscape-Aware Growing: The Power of a Little LAG
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[246]  arXiv:2406.02465 [pdf, other]
Title: An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[247]  arXiv:2406.02464 [pdf, other]
Title: Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[248]  arXiv:2406.02456 [pdf, other]
Title: Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and Posterior Value Optimisation in Finite-State MDPs
Comments: 19 pages, 13 figures, 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)
Subjects: Machine Learning (cs.LG)
[249]  arXiv:2406.02450 [pdf, other]
Title: A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[250]  arXiv:2406.02447 [pdf, other]
Title: Reducing Bias in Federated Class-Incremental Learning with Hierarchical Generative Prototypes
Subjects: Machine Learning (cs.LG)
[251]  arXiv:2406.02428 [pdf, other]
Title: Harnessing Neural Unit Dynamics for Effective and Scalable Class-Incremental Learning
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG)
[252]  arXiv:2406.02424 [pdf, ps, other]
Title: Contextual Dynamic Pricing: Algorithms, Optimality, and Local Differential Privacy Constraints
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[253]  arXiv:2406.02416 [pdf, other]
Title: Improved Modelling of Federated Datasets using Mixtures-of-Dirichlet-Multinomials
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[254]  arXiv:2406.02395 [pdf, other]
Title: GrootVL: Tree Topology is All You Need in State Space Model
Comments: The code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[255]  arXiv:2406.02366 [pdf, other]
Title: Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[256]  arXiv:2406.02362 [pdf, other]
Title: Temporal Graph Rewiring with Expander Graphs
Comments: 10 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[257]  arXiv:2406.02361 [pdf, other]
Title: Using Self-supervised Learning Can Improve Model Fairness
Comments: arXiv admin note: text overlap with arXiv:2401.01640
Subjects: Machine Learning (cs.LG)
[258]  arXiv:2406.02356 [pdf, other]
Title: Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks
Comments: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[259]  arXiv:2406.02354 [pdf, other]
Title: Label-wise Aleatoric and Epistemic Uncertainty Quantification
Comments: Uncertainty in Artificial Intelligence. arXiv admin note: substantial text overlap with arXiv:2401.00276
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[260]  arXiv:2406.02352 [pdf, other]
Title: System-Aware Neural ODE Processes for Few-Shot Bayesian Optimization
Subjects: Machine Learning (cs.LG)
[261]  arXiv:2406.02348 [pdf, ps, other]
Title: AMOSL: Adaptive Modality-wise Structure Learning in Multi-view Graph Neural Networks For Enhanced Unified Representation
Journal-ref: 13th International Conference on Soft Computing, Artificial Intelligence and Applications (SAI 2024)
Subjects: Machine Learning (cs.LG)
[262]  arXiv:2406.02344 [pdf, other]
Title: Incorporating Navigation Context into Inland Vessel Trajectory Prediction: A Gaussian Mixture Model and Transformer Approach
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Machine Learning (cs.LG)
[263]  arXiv:2406.02343 [pdf, other]
Title: Cluster-Aware Similarity Diffusion for Instance Retrieval
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[264]  arXiv:2406.02336 [pdf, other]
Title: Polynomial-Augmented Neural Networks (PANNs) with Weak Orthogonality Constraints for Enhanced Function and PDE Approximation
Subjects: Machine Learning (cs.LG)
[265]  arXiv:2406.02332 [pdf, other]
Title: Extended Mind Transformers
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[266]  arXiv:2406.02322 [pdf, other]
Title: A Survey of Transformer Enabled Time Series Synthesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[267]  arXiv:2406.02318 [pdf, other]
Title: PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection
Comments: Accepted by SIGKDD 2024 (Research Track)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[268]  arXiv:2406.02317 [pdf, other]
Title: Generative Conditional Distributions by Neural (Entropic) Optimal Transport
Comments: 15 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[269]  arXiv:2406.02310 [pdf, other]
Title: Disentangled Representation via Variational AutoEncoder for Continuous Treatment Effect Estimation
Subjects: Machine Learning (cs.LG)
[270]  arXiv:2406.02309 [pdf, other]
Title: Effects of Exponential Gaussian Distribution on (Double Sampling) Randomized Smoothing
Comments: ICML 2024 Poster
Subjects: Machine Learning (cs.LG)
[271]  arXiv:2406.02296 [pdf, other]
Title: Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[272]  arXiv:2406.02295 [pdf, other]
Title: How to Explore with Belief: State Entropy Maximization in POMDPs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[273]  arXiv:2406.02294 [pdf, other]
Title: Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling
Comments: This paper was accepted at the ETFA 2024 conference
Subjects: Machine Learning (cs.LG)
[274]  arXiv:2406.02292 [pdf, other]
Title: An Axiomatic Approach to Loss Aggregation and an Adapted Aggregating Algorithm
Comments: 31 pages
Subjects: Machine Learning (cs.LG)
[275]  arXiv:2406.02290 [pdf, other]
Title: A Study of Optimizations for Fine-tuning Large Language Models
Comments: 10 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[276]  arXiv:2406.02282 [pdf, other]
Title: Test-Time Regret Minimization in Meta Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[277]  arXiv:2406.02268 [pdf, other]
Title: Analyzing the Benefits of Prototypes for Semi-Supervised Category Learning
Comments: 7 pages, 3 figures
Subjects: Machine Learning (cs.LG)
[278]  arXiv:2406.02258 [pdf, other]
Title: Reinforcement Learning with Lookahead Information
Authors: Nadav Merlis
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[279]  arXiv:2406.02234 [pdf, other]
Title: On the Limitations of Fractal Dimension as a Measure of Generalization
Comments: 17 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Dynamical Systems (math.DS); Machine Learning (stat.ML)
[280]  arXiv:2406.02214 [pdf, other]
Title: SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
Subjects: Machine Learning (cs.LG)
[281]  arXiv:2406.02213 [pdf, other]
Title: Rectifying Reinforcement Learning for Reward Matching
Subjects: Machine Learning (cs.LG)
[282]  arXiv:2406.02189 [pdf, other]
Title: Fast and Scalable Multi-Kernel Encoder Classifier
Authors: Cencheng Shen
Comments: 12 pages main + 3 pages appendix
Subjects: Machine Learning (cs.LG)
[283]  arXiv:2406.02187 [pdf, other]
Title: DNCs Require More Planning Steps
Subjects: Machine Learning (cs.LG)
[284]  arXiv:2406.02180 [pdf, other]
Title: On The Statistical Representation Properties Of The Perturb-Softmax And The Perturb-Argmax Probability Distributions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[285]  arXiv:2406.02177 [pdf, other]
Title: One-Shot Federated Learning with Bayesian Pseudocoresets
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[286]  arXiv:2406.02176 [pdf, other]
Title: AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields
Subjects: Machine Learning (cs.LG)
[287]  arXiv:2406.02175 [pdf, other]
Title: Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees
Comments: This preprint is currently under review
Subjects: Machine Learning (cs.LG)
[288]  arXiv:2406.02165 [pdf, other]
Title: SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subjects: Machine Learning (cs.LG)
[289]  arXiv:2406.02146 [pdf, other]
Title: Activation Bottleneck: Sigmoidal Neural Networks Cannot Forecast a Straight Line
Subjects: Machine Learning (cs.LG)
[290]  arXiv:2406.02131 [pdf, other]
Title: CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting
Comments: 23 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[291]  arXiv:2406.02128 [pdf, other]
Title: Iteration Head: A Mechanistic Study of Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[292]  arXiv:2406.02105 [pdf, other]
Title: Kernel vs. Kernel: Exploring How the Data Structure Affects Neural Collapse
Comments: 34 pages, 14 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (stat.ML)
[293]  arXiv:2406.02075 [pdf, other]
Title: ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[294]  arXiv:2406.02066 [pdf, other]
Title: Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models
Comments: Accepted by ICML 2024(Oral)
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[295]  arXiv:2406.02064 [pdf, other]
Title: Advancing Generalized Transfer Attack with Initialization Derived Bilevel Optimization and Dynamic Sequence Truncation
Comments: Accepted by IJCAI 2024. 10 pages
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[296]  arXiv:2406.02061 [pdf, other]
Title: Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
Comments: v1
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[297]  arXiv:2406.02059 [pdf, other]
Title: Graph Adversarial Diffusion Convolution
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG)
[298]  arXiv:2406.02056 [pdf, other]
Title: CAP: A Context-Aware Neural Predictor for NAS
Comments: Accepted by IJCAI24
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[299]  arXiv:2406.02052 [pdf, other]
Title: PETRA: Parallel End-to-end Training with Reversible Architectures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[300]  arXiv:2406.02040 [pdf, other]
Title: DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[301]  arXiv:2406.02035 [pdf, other]
Title: A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[302]  arXiv:2406.02027 [pdf, other]
Title: Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[303]  arXiv:2406.02024 [pdf, other]
Title: Verifying the Generalization of Deep Learning to Out-of-Distribution Domains
Comments: To appear in the Journal of Automated Reasoning (JAR), 2024. arXiv admin note: substantial text overlap with arXiv:2302.05745
Subjects: Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[304]  arXiv:2406.02017 [pdf, other]
Title: On the Mode-Seeking Properties of Langevin Dynamics
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[305]  arXiv:2406.02015 [pdf, other]
Title: Parameterizing Federated Continual Learning for Reproducible Research
Comments: Preprint: Accepted at the 1st WAFL (Workshop on Advancements in Federated Learning) workshop, ECML-PKDD 2023
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[306]  arXiv:2406.02013 [pdf, other]
Title: Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning
Comments: 16 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[307]  arXiv:2406.01996 [pdf, other]
Title: Bayesian Mesh Optimization for Graph Neural Networks to Enhance Engineering Performance Prediction
Comments: 17 pages, 8 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[308]  arXiv:2406.01977 [pdf, other]
Title: What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[309]  arXiv:2406.01975 [pdf, other]
Title: Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2406.01969 [pdf, other]
Title: Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training
Subjects: Machine Learning (cs.LG)
[311]  arXiv:2406.01960 [pdf, other]
Title: Certifiably Byzantine-Robust Federated Conformal Prediction
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[312]  arXiv:2406.01950 [pdf, ps, other]
Title: A Comparative Study of Sampling Methods with Cross-Validation in the FedHome Framework
Comments: 11 Figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[313]  arXiv:2406.01913 [pdf, other]
Title: Generating Synthetic Net Load Data with Physics-informed Diffusion Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[314]  arXiv:2406.01909 [pdf, other]
Title: A Global Geometric Analysis of Maximal Coding Rate Reduction
Comments: 43 pages, 9 figures. This work has been accepted for publication in the Proceedings of the 41st International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (cs.LG)
[315]  arXiv:2406.01908 [pdf, other]
Title: PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[316]  arXiv:2406.01901 [pdf, other]
Title: Bifurcated Generative Flow Networks
Subjects: Machine Learning (cs.LG)
[317]  arXiv:2406.01899 [pdf, other]
Title: Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models
Subjects: Machine Learning (cs.LG)
[318]  arXiv:2406.01895 [pdf, other]
Title: Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Comments: 32 pages, 16 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[319]  arXiv:2406.01870 [pdf, other]
Title: Understanding Stochastic Natural Gradient Variational Inference
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[320]  arXiv:2406.01857 [pdf, other]
Title: Neural Green's Operators for Parametric Partial Differential Equations
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[321]  arXiv:2406.01853 [pdf, other]
Title: Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[322]  arXiv:2406.01838 [pdf, other]
Title: Learning the Target Network in Function Space
Comments: Accepted to International Conference on Machine Learning (ICML24)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[323]  arXiv:2406.01833 [pdf, other]
Title: CAFO: Feature-Centric Explanation on Time Series Classification
Comments: Accepted to KDD 2024 Research Track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[324]  arXiv:2406.01825 [pdf, other]
Title: EMOE: Expansive Matching of Experts for Robust Uncertainty Based Rejection
Authors: Yunni Qu (1), James Wellnitz (2), Alexander Tropsha (2), Junier Oliva (1) ((1) Department of Computer Science, University of North Carolina at Chapel Hill, (2) Eshelman School of Pharmacy, University of North Carolina at Chapel Hill)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[325]  arXiv:2406.01823 [pdf, other]
Title: Causal Discovery with Fewer Conditional Independence Tests
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[326]  arXiv:2406.01808 [pdf, other]
Title: In-Context Learning of Physical Properties: Few-Shot Adaptation to Out-of-Distribution Molecular Graphs
Comments: 12 pages, 4 figures
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[327]  arXiv:2406.01805 [pdf, other]
Title: TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[328]  arXiv:2406.01799 [pdf, other]
Title: Online Control in Population Dynamics
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[329]  arXiv:2406.01793 [pdf, other]
Title: Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[330]  arXiv:2406.01789 [pdf, ps, other]
Title: AI-based Classification of Customer Support Tickets: State of the Art and Implementation with AutoML
Journal-ref: Proceedings of the IWEMB 2021/2022: Fifth and Sixth International Workshop on Entrepreneurship, Electronic and Mobile Business
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[331]  arXiv:2406.01781 [pdf, other]
Title: DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$-transform
Comments: arXiv admin note: text overlap with arXiv:2312.09236
Subjects: Machine Learning (cs.LG)
[332]  arXiv:2406.01766 [pdf, ps, other]
Title: How Does Gradient Descent Learn Features -- A Local Analysis for Regularized Two-Layer Neural Networks
Authors: Mo Zhou, Rong Ge
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[333]  arXiv:2406.01762 [pdf, other]
Title: Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[334]  arXiv:2406.01757 [pdf, other]
Title: Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
Comments: 14 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[335]  arXiv:2406.01755 [pdf, other]
Title: Sparser, Better, Deeper, Stronger: Improving Sparse Training with Exact Orthogonal Initialization
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[336]  arXiv:2406.01753 [pdf, other]
Title: Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification
Comments: Under review
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[337]  arXiv:2406.01733 [pdf, other]
Title: Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Comments: Code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2406.01727 [pdf, other]
Title: Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems
Comments: This is a preprint version submitted to IEEE Transactions on Machine learning in Communications and Networking. arXiv admin note: text overlap with arXiv:2308.05036
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Signal Processing (eess.SP)
[339]  arXiv:2406.01661 [pdf, other]
Title: A Diffusion Model Framework for Unsupervised Neural Combinatorial Optimization
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM); Machine Learning (stat.ML)
[340]  arXiv:2406.01660 [pdf, other]
Title: Self-Improving Robust Preference Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[341]  arXiv:2406.01649 [pdf, other]
Title: CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[342]  arXiv:2406.01647 [pdf, other]
Title: An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[343]  arXiv:2406.01646 [pdf, other]
Title: iKAN: Global Incremental Learning with KAN for Human Activity Recognition Across Heterogeneous Datasets
Comments: This work is submitted to Ubicomp/ISWC24 and is under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[344]  arXiv:2406.01645 [pdf, other]
Title: FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[345]  arXiv:2406.01638 [pdf, other]
Title: TimeCMA: Towards LLM-Empowered Time Series Forecasting via Cross-Modality Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[346]  arXiv:2406.02539 (cross-list from cs.CV) [pdf, other]
Title: Parrot: Multilingual Visual Instruction Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[347]  arXiv:2406.02537 (cross-list from cs.CL) [pdf, other]
Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[348]  arXiv:2406.02536 (cross-list from cs.CL) [pdf, other]
Title: Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[349]  arXiv:2406.02534 (cross-list from eess.IV) [pdf, other]
Title: Enhancing predictive imaging biomarker discovery through treatment effect analysis
Comments: 19 pages, 12 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[350]  arXiv:2406.02529 (cross-list from eess.IV) [pdf, other]
Title: ReLUs Are Sufficient for Learning Implicit Neural Representations
Comments: Accepted to ICML 2024
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[351]  arXiv:2406.02523 (cross-list from cs.RO) [pdf, other]
Title: RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Comments: RSS 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[352]  arXiv:2406.02507 (cross-list from cs.CV) [pdf, other]
Title: Guiding a Diffusion Model with a Bad Version of Itself
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[353]  arXiv:2406.02497 (cross-list from eess.SY) [pdf, ps, other]
Title: Dropout MPC: An Ensemble Neural MPC Approach for Systems with Learned Dynamics
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[354]  arXiv:2406.02477 (cross-list from eess.IV) [pdf, other]
Title: Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[355]  arXiv:2406.02470 (cross-list from quant-ph) [pdf, other]
Title: Meta-Designing Quantum Experiments with Language Models
Comments: 10+3 pages, 5 figures
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[356]  arXiv:2406.02457 (cross-list from cond-mat.mtrl-sci) [pdf, other]
Title: Machine learning Hubbard parameters with equivariant neural networks
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[357]  arXiv:2406.02432 (cross-list from cs.DS) [pdf, other]
Title: Coresets for Multiple $\ell_p$ Regression
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[358]  arXiv:2406.02431 (cross-list from cs.DS) [pdf, other]
Title: Reweighted Solutions for Weighted Low Rank Approximation
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[359]  arXiv:2406.02426 (cross-list from math.OC) [pdf, other]
Title: Contextual Optimization under Covariate Shift: A Robust Approach by Intersecting Wasserstein Balls
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[360]  arXiv:2406.02422 (cross-list from eess.IV) [pdf, other]
Title: IterMask2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[361]  arXiv:2406.02421 (cross-list from cs.DM) [pdf, other]
Title: Representing Piecewise-Linear Functions by Functions with Minimal Arity
Subjects: Discrete Mathematics (cs.DM); Machine Learning (cs.LG); Symbolic Computation (cs.SC)
[362]  arXiv:2406.02394 (cross-list from cs.CL) [pdf, other]
Title: Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[363]  arXiv:2406.02383 (cross-list from cs.CV) [pdf, other]
Title: Learning to Edit Visual Programs with Self-Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[364]  arXiv:2406.02357 (cross-list from cs.GT) [pdf, ps, other]
Title: The complexity of approximate (coarse) correlated equilibrium for incomplete information games
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[365]  arXiv:2406.02355 (cross-list from cs.CV) [pdf, other]
Title: FedDr+: Stabilizing Dot-regression with Global Feature Distillation for Federated Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[366]  arXiv:2406.02347 (cross-list from cs.CV) [pdf, other]
Title: Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Comments: 16 pages + 16 pages appendices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[367]  arXiv:2406.02345 (cross-list from cs.CV) [pdf, other]
Title: Progressive Confident Masking Attention Network for Audio-Visual Segmentation
Comments: 10 pages, 9 figures, submitted to IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[368]  arXiv:2406.02333 (cross-list from cs.NI) [pdf, other]
Title: Towards Neural Architecture Search for Transfer Learning in 6G Networks
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[369]  arXiv:2406.02329 (cross-list from cs.CL) [pdf, other]
Title: On Affine Homotopy between Language Encoders
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[370]  arXiv:2406.02327 (cross-list from cs.CV) [pdf, other]
Title: Continual Unsupervised Out-of-Distribution Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[371]  arXiv:2406.02315 (cross-list from cs.SD) [pdf, other]
Title: An Independence-promoting Loss for Music Generation with Language Models
Comments: Accepted to ICML 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[372]  arXiv:2406.02313 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models
Subjects: Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG)
[373]  arXiv:2406.02300 (cross-list from math.AT) [pdf, other]
Title: Node-Level Topological Representation Learning on Point Clouds
Comments: 30 pages, 10 figures, comments welcome
Subjects: Algebraic Topology (math.AT); Computational Geometry (cs.CG); Machine Learning (cs.LG)
[374]  arXiv:2406.02298 (cross-list from math-ph) [pdf, other]
Title: Solving Partial Differential Equations in Different Domains by Operator Learning method Based on Boundary Integral Equations
Subjects: Mathematical Physics (math-ph); Machine Learning (cs.LG)
[375]  arXiv:2406.02293 (cross-list from stat.ML) [pdf, other]
Title: Composite Quantile Regression With XGBoost Using the Novel Arctan Pinball Loss
Comments: 24 pages, 9 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[376]  arXiv:2406.02285 (cross-list from eess.AS) [pdf, other]
Title: Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models
Comments: accepted at INTERSPEECH 2024
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[377]  arXiv:2406.02273 (cross-list from math.OC) [pdf, ps, other]
Title: A KL-based Analysis Framework with Applications to Non-Descent Optimization Methods
Comments: 29 pages
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[378]  arXiv:2406.02269 (cross-list from stat.ML) [pdf, ps, other]
Title: Graph Neural Networks Do Not Always Oversmooth
Subjects: Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[379]  arXiv:2406.02255 (cross-list from eess.AS) [pdf, other]
Title: MidiCaps -- A large-scale MIDI dataset with text captions
Comments: Under review
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)
[380]  arXiv:2406.02245 (cross-list from cs.CL) [pdf, other]
Title: Description Boosting for Zero-Shot Entity and Relation Classification
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[381]  arXiv:2406.02225 (cross-list from math.OC) [pdf, other]
Title: Riemannian coordinate descent algorithms on matrix manifolds
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[382]  arXiv:2406.02223 (cross-list from cs.CV) [pdf, other]
Title: SMCL: Saliency Masked Contrastive Learning for Long-tailed Recognition
Comments: accepted at ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[383]  arXiv:2406.02204 (cross-list from cs.CE) [pdf, other]
Title: The Deep Latent Space Particle Filter for Real-Time Data Assimilation with Uncertainty Quantification
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[384]  arXiv:2406.02191 (cross-list from stat.ML) [pdf, other]
Title: On the Recoverability of Causal Relations from Temporally Aggregated I.I.D. Data
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[385]  arXiv:2406.02173 (cross-list from math.NA) [pdf, other]
Title: Learning the Hodgkin-Huxley Model with Operator Learning Techniques
Comments: 24 pages, 8 figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[386]  arXiv:2406.02158 (cross-list from cs.CV) [pdf, other]
Title: Radar Spectra-Language Model for Automotive Scene Parsing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[387]  arXiv:2406.02157 (cross-list from stat.ML) [pdf, other]
Title: Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[388]  arXiv:2406.02156 (cross-list from cs.CR) [pdf, ps, other]
Title: Almost linear time differentially private release of synthetic graphs
Subjects: Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[389]  arXiv:2406.02154 (cross-list from math-ph) [pdf, other]
Title: Learning Hamiltonian neural Koopman operator and simultaneously sustaining and discovering conservation law
Subjects: Mathematical Physics (math-ph); Machine Learning (cs.LG)
[390]  arXiv:2406.02140 (cross-list from cs.CR) [pdf, other]
Title: Optimality of Matrix Mechanism on $\ell_p^p$-metric
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[391]  arXiv:2406.02133 (cross-list from eess.AS) [pdf, other]
Title: SimulTron: On-Device Simultaneous Speech to Speech Translation
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[392]  arXiv:2406.02126 (cross-list from eess.SY) [pdf, other]
Title: CityLight: A Universal Model Towards Real-world City-scale Traffic Signal Control Coordination
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[393]  arXiv:2406.02092 (cross-list from cs.SD) [pdf, other]
Title: MaskSR: Masked Language Model for Full-band Speech Restoration
Comments: Accepted by INTERSPEECH 2024. Demo page: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[394]  arXiv:2406.02081 (cross-list from cs.MA) [pdf, other]
Title: FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Comments: ICML 2024
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[395]  arXiv:2406.02080 (cross-list from cs.CL) [pdf, other]
Title: LongSSM: On the Length Extension of State-space Models in Language Modelling
Authors: Shida Wang
Comments: 23 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[396]  arXiv:2406.02057 (cross-list from cs.AI) [pdf, other]
Title: Tabular and Deep Learning for the Whittle Index
Authors: Francisco Robledo Relaño (LMAP, UPPA, UPV / EHU), Vivek Borkar (EE-IIT), Urtzi Ayesta (IRIT-RMESS, UPV/EHU, CNRS), Konstantin Avrachenkov (Inria)
Comments: ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 2024
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[397]  arXiv:2406.02049 (cross-list from stat.ML) [pdf, other]
Title: Causal Effect Identification in LiNGAM Models with Latent Confounders
Comments: Accepted at International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[398]  arXiv:2406.02044 (cross-list from cs.CL) [pdf, ps, other]
Title: QROA: A Black-Box Query-Response Optimization Attack on LLMs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[399]  arXiv:2406.02021 (cross-list from cs.CV) [pdf, other]
Title: MetaMixer Is All You Need
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[400]  arXiv:2406.02016 (cross-list from math.OC) [pdf, other]
Title: Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization
Comments: 33 pages, 2 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[401]  arXiv:2406.02014 (cross-list from q-bio.NC) [pdf, other]
Title: Understanding Auditory Evoked Brain Signal via Physics-informed Embedding Network with Multi-Task Transformer
Subjects: Neurons and Cognition (q-bio.NC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[402]  arXiv:2406.01967 (cross-list from cs.RO) [pdf, other]
Title: DrEureka: Language Model Guided Sim-To-Real Transfer
Comments: Robotics: Science and Systems (RSS) 2024. Project website and open-source code: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[403]  arXiv:2406.01959 (cross-list from math.OC) [pdf, other]
Title: Adaptive Variance Reduction for Stochastic Optimization under Weaker Assumptions
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[404]  arXiv:2406.01947 (cross-list from cs.RO) [pdf, other]
Title: Data-Driven Approaches for Thrust Prediction in Underwater Flapping Fin Propulsion Systems
Comments: 9 pages, 11 figures, AAAI 2021 Fall Series Symposium on Science-Guided AI
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[405]  arXiv:2406.01940 (cross-list from cs.CL) [pdf, other]
Title: Process-Driven Autoformalization in Lean 4
Comments: 22 pages, 1 figures, 11 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[406]  arXiv:2406.01939 (cross-list from cs.AI) [pdf, other]
Title: Speeding up Policy Simulation in Supply Chain RL
Subjects: Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[407]  arXiv:2406.01933 (cross-list from stat.ML) [pdf, ps, other]
Title: Orthogonal Causal Calibration
Comments: 44 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[408]  arXiv:2406.01876 (cross-list from cs.DB) [pdf, other]
Title: GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
Comments: KDD 2024 Camera Ready; 11 pages, 8 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[409]  arXiv:2406.01873 (cross-list from cs.CL) [pdf, other]
Title: CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models
Comments: Accepted by ACL Findings 2024
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[410]  arXiv:2406.01852 (cross-list from cs.NI) [pdf, other]
Title: Non-uniformity is All You Need: Efficient and Timely Encrypted Traffic Classification With ECHO
Subjects: Networking and Internet Architecture (cs.NI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[411]  arXiv:2406.01829 (cross-list from cs.NE) [pdf, other]
Title: FacAID: A Transformer Model for Neuro-Symbolic Facade Reconstruction
Comments: 11 pages, 10 figures, preprint
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[412]  arXiv:2406.01813 (cross-list from stat.ML) [pdf, other]
Title: Diffusion Boosted Trees
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP); Methodology (stat.ME)
[413]  arXiv:2406.01801 (cross-list from stat.ML) [pdf, other]
Title: Fearless Stochasticity in Expectation Propagation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[414]  arXiv:2406.01782 (cross-list from eess.SY) [pdf, other]
Title: Multi-agent assignment via state augmented reinforcement learning
Comments: 12 pages, 3 figures, 6th Annual Conference on Learning for Dynamics and Control
Journal-ref: Proceedings of Machine Learning Research vol 242 1 12, 2024. 6th Annual Conference on Learning for Dynamics and Control
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[415]  arXiv:2406.01774 (cross-list from cs.DC) [pdf, other]
Title: Efficient Data Distribution Estimation for Accelerated Federated Learning
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[416]  arXiv:2406.01708 (cross-list from cs.CR) [pdf, other]
Title: Model for Peanuts: Hijacking ML Models without Training Access is Possible
Comments: 17 pages, 14 figures, 7 tables
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[417]  arXiv:2406.01698 (cross-list from cs.AR) [pdf, other]
Title: Demystifying Platform Requirements for Diverse LLM Inference Use Cases
Comments: 12 Pages, this https URL
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[418]  arXiv:2406.01663 (cross-list from stat.ML) [pdf, other]
Title: An efficient solution to Hidden Markov Models on trees with coupled branches
Comments: 24 + 6 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
[419]  arXiv:2406.01655 (cross-list from cs.SD) [pdf, other]
Title: TinySV: Speaker Verification in TinyML with On-device Learning
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[420]  arXiv:2406.01653 (cross-list from stat.ML) [pdf, other]
Title: An efficient Wasserstein-distance approach for reconstructing jump-diffusion processes using parameterized neural networks
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR); Applications (stat.AP); Methodology (stat.ME)
[421]  arXiv:2406.01652 (cross-list from stat.ME) [pdf, ps, other]
Title: Distributional bias compromises leave-one-out cross-validation
Comments: 20 pages, 5 figures, supplementary information
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[422]  arXiv:2406.01651 (cross-list from q-bio.QM) [pdf, other]
Title: FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction
Comments: 10 pages, 8 figures
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[423]  arXiv:2406.01650 (cross-list from q-bio.BM) [pdf, other]
Title: TAGMol: Target-Aware Gradient-guided Molecule Generation
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[424]  arXiv:2406.01633 (cross-list from cs.IR) [pdf, other]
Title: On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
Comments: Preprint of UAI'24 conference publication
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425]  arXiv:2406.01631 (cross-list from cs.IR) [pdf, other]
Title: An LLM-based Recommender System Environment
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[426]  arXiv:2406.01630 (cross-list from q-bio.QM) [pdf, other]
Title: Equivariant amortized inference of poses for cryo-EM
Comments: Published at the GEM workshop, ICLR 2024
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG)
[427]  arXiv:2406.01627 (cross-list from q-bio.GN) [pdf, other]
Title: GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models
Subjects: Genomics (q-bio.GN); Machine Learning (cs.LG)
[428]  arXiv:2406.01624 (cross-list from eess.AS) [pdf, other]
Title: Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition
Comments: Published in: Springer Nature International Journal of Applied Intelligence (2024)
Journal-ref: Applied Intelligence (2024)
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[429]  arXiv:2406.01622 (cross-list from q-bio.BM) [pdf, other]
Title: Sifting through the Noise: A Survey of Diffusion Probabilistic Models and Their Applications to Biomolecules
Comments: 31 pages, 6 figures
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[430]  arXiv:2406.01617 (cross-list from q-bio.BM) [pdf, ps, other]
Title: LightCPPgen: An Explainable Machine Learning Pipeline for Rational Design of Cell Penetrating Peptides
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[431]  arXiv:2406.01611 (cross-list from cs.IR) [pdf, other]
Title: System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes
Comments: Accepted at FAccT'24
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[432]  arXiv:2406.01609 (cross-list from cs.IR) [pdf, other]
Title: Judgement Citation Retrieval using Contextual Similarity
Comments: 14 pages, 16 images, Submitted to Multimedia Tools and Applications Springer journal
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[433]  arXiv:2406.01603 (cross-list from cs.IR) [pdf, other]
Title: Privacy-preserving recommender system using the data collaboration analysis for distributed datasets
Subjects: Information Retrieval (cs.IR); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[434]  arXiv:2406.01601 (cross-list from cs.DC) [pdf, other]
Title: Backpropogation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[435]  arXiv:2406.01599 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Markov Chain Monte Carlo with Gaussian Process Emulation for a 1D Hemodynamics Model of CTEPH
Subjects: Quantitative Methods (q-bio.QM); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Applications (stat.AP)
[436]  arXiv:2406.01157 (cross-list from quant-ph) [pdf, other]
Title: Quantum consistent neural/tensor networks for photonic circuits with strongly/weakly entangled states
Authors: Nicolas Allegra
Comments: 13 pages. Paper under review for Physical Review A
Subjects: Quantum Physics (quant-ph); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG)
[437]  arXiv:2406.00503 (cross-list from math.OC) [pdf, other]
Title: Schrödinger Bridge with Quadratic State Cost is Exactly Solvable
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY); Mathematical Physics (math-ph); Machine Learning (stat.ML)
[438]  arXiv:2405.14785 (cross-list from cs.CV) [pdf, other]
Title: EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Comments: Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[439]  arXiv:2402.12908 (cross-list from cs.CV) [pdf, other]
Title: RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Comments: Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[440]  arXiv:2401.11708 (cross-list from cs.CV) [pdf, other]
Title: Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Comments: ICML 2024. Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[441]  arXiv:2105.13287 (cross-list from cs.DS) [pdf, other]
Title: Differentially Private Densest Subgraph Detection
Comments: Accepted by ICML 2021
Subjects: Data Structures and Algorithms (cs.DS); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)

Tue, 4 Jun 2024

[442]  arXiv:2406.01588 [pdf, other]
Title: nn2poly: An R Package for Converting Neural Networks into Interpretable Polynomials
Authors: Pablo Morala (1 and 2), Jenny Alexandra Cifuentes (3), Rosa E. Lillo (1 and 2), Iñaki Ucar (1 and 2) ((1) uc3m-Santander Big Data Institute, Universidad Carlos III de Madrid. Spain., (2) Department of Statistics, Universidad Carlos III de Madrid. Spain., (3) ICADE, Department of Quantitative Methods, Faculty of Economics and Business Administration and the Institute for Research in Technology (IIT), ICAI School of Engineering, Universidad Pontificia Comillas. Spain.)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[443]  arXiv:2406.01581 [pdf, other]
Title: Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit
Comments: 34 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[444]  arXiv:2406.01577 [pdf, ps, other]
Title: An Equivalence Between Static and Dynamic Regret Minimization
Comments: 26 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[445]  arXiv:2406.01572 [pdf, other]
Title: Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
Subjects: Machine Learning (cs.LG)
[446]  arXiv:2406.01570 [pdf, ps, other]
Title: Single Trajectory Conformal Prediction
Comments: 16 pages
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
[447]  arXiv:2406.01562 [pdf, other]
Title: A New View on Planning in Online Reinforcement Learning
Comments: Published in the Planning and Reinforcement Learning Workshop at ICAPS 2024. arXiv admin note: text overlap with arXiv:2206.02902
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[448]  arXiv:2406.01539 [pdf, other]
Title: Physics-informed deep learning and compressive collocation for high-dimensional diffusion-reaction equations: practical existence theory and numerics
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Numerical Analysis (math.NA)
[449]  arXiv:2406.01529 [pdf, other]
Title: How to Count Coughs: An Event-Based Framework for Evaluating Automatic Cough Detection Algorithm Performance
Subjects: Machine Learning (cs.LG)
[450]  arXiv:2406.01528 [pdf, other]
Title: Physics-Informed Neural Networks for Dynamic Process Operations with Limited Physical Knowledge and Data
Comments: manuscript (31 pages, 8 figures, 7 tables), supporting materials (11 pages, 3 figures, 3 tables)
Subjects: Machine Learning (cs.LG)
[451]  arXiv:2406.01521 [pdf, other]
Title: MOSEAC: Streamlined Variable Time Step Reinforcement Learning
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[452]  arXiv:2406.01481 [pdf, other]
Title: Learning from Streaming Data when Users Choose
Authors: Jinyan Su, Sarah Dean
Comments: Accepted by ICML24
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[453]  arXiv:2406.01477 [pdf, other]
Title: Finding Optimally Robust Data Mixtures via Concave Maximization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[454]  arXiv:2406.01471 [pdf, ps, other]
Title: Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Optics (physics.optics)
[455]  arXiv:2406.01462 [pdf, other]
Title: Understanding Preference Fine-Tuning Through the Lens of Coverage
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[456]  arXiv:2406.01461 [pdf, other]
Title: Hardness of Learning Neural Networks under the Manifold Hypothesis
Subjects: Machine Learning (cs.LG); Differential Geometry (math.DG); Machine Learning (stat.ML)
[457]  arXiv:2406.01457 [pdf, other]
Title: Differentially Private Tabular Data Synthesis using Large Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[458]  arXiv:2406.01439 [pdf, other]
Title: Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[459]  arXiv:2406.01438 [pdf, other]
Title: Asynchronous Byzantine Federated Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[460]  arXiv:2406.01435 [pdf, other]
Title: Learning Analysis of Kernel Ridgeless Regression with Asymmetric Kernel Learning
Comments: arXiv admin note: text overlap with arXiv:2310.05236
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[461]  arXiv:2406.01424 [pdf, other]
Title: Universal In-Context Approximation By Prompting Fully Recurrent Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[462]  arXiv:2406.01423 [pdf, other]
Title: Value Improved Actor Critic Algorithms
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[463]  arXiv:2406.01417 [pdf, other]
Title: Mixup Augmentation with Multiple Interpolations
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[464]  arXiv:2406.01416 [pdf, other]
Title: Adapting Conformal Prediction to Distribution Shifts Without Labels
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[465]  arXiv:2406.01414 [pdf, other]
Title: CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework
Comments: arXiv admin note: text overlap with arXiv:2307.04131
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[466]  arXiv:2406.01411 [pdf, other]
Title: Using Constraints to Discover Sparse and Alternative Subgroup Descriptions
Authors: Jakob Bach
Subjects: Machine Learning (cs.LG)
[467]  arXiv:2406.01389 [pdf, other]
Title: RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[468]  arXiv:2406.01386 [pdf, ps, other]
Title: Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Subjects: Machine Learning (cs.LG)
[469]  arXiv:2406.01378 [pdf, ps, other]
Title: A Theory of Learnability for Offline Decision Making
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[470]  arXiv:2406.01361 [pdf, other]
Title: Learning to Play Atari in a World of Tokens
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG)
[471]  arXiv:2406.01345 [pdf, other]
Title: BMRS: Bayesian Model Reduction for Structured Pruning
Comments: 17 pages; 8 figures; 2 tables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[472]  arXiv:2406.01317 [pdf, other]
Title: The Intelligible and Effective Graph Neural Additive Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[473]  arXiv:2406.01290 [pdf, other]
Title: Resource-constrained Fairness
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[474]  arXiv:2406.01282 [pdf, other]
Title: Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE
Comments: The short version of this work will appear in the Proceedings of the 2024 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2024)
Subjects: Machine Learning (cs.LG)
[475]  arXiv:2406.01274 [pdf, other]
Title: Expected Grad-CAM: Towards gradient faithfulness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[476]  arXiv:2406.01257 [pdf, other]
Title: What makes unlearning hard and what to do about it
Subjects: Machine Learning (cs.LG)
[477]  arXiv:2406.01255 [pdf, other]
Title: On the Nonlinearity of Layer Normalization
Comments: 42 pages, accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[478]  arXiv:2406.01249 [pdf, other]
Title: Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[479]  arXiv:2406.01234 [pdf, other]
Title: Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
[480]  arXiv:2406.01229 [pdf, other]
Title: AGALE: A Graph-Aware Continual Learning Evaluation Framework
Subjects: Machine Learning (cs.LG)
[481]  arXiv:2406.01192 [pdf, other]
Title: Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Comments: 25 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[482]  arXiv:2406.01189 [pdf, other]
Title: MultiMax: Sparse and Multi-Modal Attention Learning
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[483]  arXiv:2406.01183 [pdf, other]
Title: Automatic Input Feature Relevance via Spectral Neural Networks
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI)
[484]  arXiv:2406.01178 [pdf, other]
Title: Deep Reinforcement Learning Behavioral Mode Switching Using Optimal Control Based on a Latent Space Objective
Comments: Published in the proceedings of the 32nd Mediterranean Conference on Control and Automation [MED2024]
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[485]  arXiv:2406.01175 [pdf, other]
Title: NeoRL: Efficient Exploration for Nonepisodic RL
Subjects: Machine Learning (cs.LG)
[486]  arXiv:2406.01163 [pdf, other]
Title: When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL
Subjects: Machine Learning (cs.LG)
[487]  arXiv:2406.01162 [pdf, other]
Title: Conditional Gumbel-Softmax for constrained feature selection with application to node selection in wireless sensor networks
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[488]  arXiv:2406.01150 [pdf, other]
Title: Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Subjects: Machine Learning (cs.LG)
[489]  arXiv:2406.01130 [pdf, other]
Title: SAVA: Scalable Learning-Agnostic Data Valuation
Comments: 21 pages, 12 figures
Subjects: Machine Learning (cs.LG)
[490]  arXiv:2406.01124 [pdf, other]
Title: Latent Logic Tree Extraction for Event Sequence Explanation from LLMs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[491]  arXiv:2406.01116 [pdf, other]
Title: Accelerating Heterogeneous Federated Learning with Closed-form Classifiers
Comments: Accepted at ICML 2024 - this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[492]  arXiv:2406.01115 [pdf, other]
Title: Cohort Squeeze: Beyond a Single Communication Round per Cohort in Cross-Device Federated Learning
Subjects: Machine Learning (cs.LG)
[493]  arXiv:2406.01114 [pdf, ps, other]
Title: Globally Interpretable Classifiers via Boolean Formulas with Dynamic Propositions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[494]  arXiv:2406.01099 [pdf, other]
Title: Deep reinforcement learning for weakly coupled MDP's with continuous actions
Authors: Francisco Robledo (LMAP, UPPA, UPV / EHU), Urtzi Ayesta (IRIT-RMESS, UPV/EHU, CNRS), Konstantin Avrachenkov (Inria)
Comments: ACM SIGMETRICS / ASMTA 2024, Jun 2024, Venise, Italy
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[495]  arXiv:2406.01098 [pdf, other]
Title: Learning Decision Trees and Forests with Algorithmic Recourse
Comments: 27 pages, 10 figures, to appear in the 41st International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[496]  arXiv:2406.01086 [pdf, other]
Title: Effective Subset Selection Through The Lens of Neural Network Pruning
Authors: Noga Bar, Raja Giryes
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[497]  arXiv:2406.01066 [pdf, other]
Title: Topology-Aware Dynamic Reweighting for Distribution Shifts on Graph
Subjects: Machine Learning (cs.LG)
[498]  arXiv:2406.01065 [pdf, other]
Title: Causal prompting model-based offline reinforcement learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[499]  arXiv:2406.01054 [pdf, other]
Title: Confidence-Based Task Prediction in Continual Disease Classification Using Probability Distribution
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[500]  arXiv:2406.01032 [pdf, other]
Title: LLM and GNN are Complementary: Distilling LLM for Multimodal Graph Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[501]  arXiv:2406.01013 [pdf, other]
Title: Scalable Ensembling For Mitigating Reward Overoptimisation
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[502]  arXiv:2406.01012 [pdf, other]
Title: Attention-based Iterative Decomposition for Tensor Product Representation
Comments: Published in ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[503]  arXiv:2406.00999 [pdf, other]
Title: Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients
Comments: 12 pages, 7 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[504]  arXiv:2406.00990 [pdf, other]
Title: Constraint-Aware Diffusion Models for Trajectory Optimization
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[505]  arXiv:2406.00987 [pdf, other]
Title: Enhancing Fairness in Unsupervised Graph Anomaly Detection through Disentanglement
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[506]  arXiv:2406.00958 [pdf, other]
Title: Navigating Conflicting Views: Harnessing Trust for Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[507]  arXiv:2406.00943 [pdf, other]
Title: State Space Models on Temporal Graphs: A First-Principles Study
Comments: Preprint; Code will be made available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[508]  arXiv:2406.00924 [pdf, ps, other]
Title: Faster Diffusion-based Sampling with Randomized Midpoints: Sequential and Parallel
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Statistics Theory (math.ST); Machine Learning (stat.ML)
[509]  arXiv:2406.00894 [pdf, other]
Title: Pretrained Hybrids with MAD Skills
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[510]  arXiv:2406.00889 [pdf, other]
Title: Reservoir History Matching of the Norne field with generative exotic priors and a coupled Mixture of Experts -- Physics Informed Neural Operator Forward Model
Comments: 30 pages. arXiv admin note: substantial text overlap with arXiv:2404.14447
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[511]  arXiv:2406.00877 [pdf, other]
Title: Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
Comments: Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[512]  arXiv:2406.00868 [pdf, other]
Title: Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems
Subjects: Machine Learning (cs.LG)
[513]  arXiv:2406.00855 [pdf, other]
Title: LinkLogic: A New Method and Benchmark for Explainable Knowledge Graph Predictions
Comments: 12 pages, 4 figures in main text. For code and data, see this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[514]  arXiv:2406.00846 [pdf, other]
Title: Local Methods with Adaptivity via Scaling
Comments: 42 pages, 2 algorithms, 6 figures, 1 table
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Optimization and Control (math.OC)
[515]  arXiv:2406.00826 [pdf, other]
Title: Learning-Based Verification of Stochastic Dynamical Systems with Neural Network Policies
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[516]  arXiv:2406.00816 [pdf, other]
Title: Invisible Backdoor Attacks on Diffusion Models
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[517]  arXiv:2406.00814 [pdf, ps, other]
Title: Expected Possession Value of Control and Duel Actions for Soccer Player's Skills Estimation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[518]  arXiv:2406.00806 [pdf, other]
Title: Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[519]  arXiv:2406.00805 [pdf, ps, other]
Title: Extrapolability Improvement of Machine Learning-Based Evapotranspiration Models via Domain-Adversarial Neural Networks
Authors: Haiyang Shi
Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[520]  arXiv:2406.00801 [pdf, other]
Title: Ensemble Deep Random Vector Functional Link Neural Network Based on Fuzzy Inference System
Journal-ref: IEEE Transactions on Fuzzy Systems, 2024
Subjects: Machine Learning (cs.LG)
[521]  arXiv:2406.00800 [pdf, other]
Title: MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[522]  arXiv:2406.00779 [pdf, other]
Title: Differentiation of Multi-objective Data-driven Decision Pipeline
Subjects: Machine Learning (cs.LG)
[523]  arXiv:2406.00775 [pdf, other]
Title: Constrained Adaptive Attack: Effective Adversarial Attack Against Deep Neural Networks for Tabular Data
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[524]  arXiv:2406.00773 [pdf, other]
Title: Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[525]  arXiv:2406.00766 [pdf, other]
Title: Scaling Tractable Probabilistic Circuits: A Systems Perspective
Subjects: Machine Learning (cs.LG)
[526]  arXiv:2406.00764 [pdf, other]
Title: IENE: Identifying and Extrapolating the Node Environment for Out-of-Distribution Generalization on Graphs
Subjects: Machine Learning (cs.LG)
[527]  arXiv:2406.00761 [pdf, other]
Title: Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning
Comments: The first two authors contribute equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[528]  arXiv:2406.00748 [pdf, ps, other]
Title: Augmenting the FedProx Algorithm by Minimizing Convergence
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[529]  arXiv:2406.00738 [pdf, other]
Title: Global Rewards in Restless Multi-Armed Bandits
Comments: 27 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[530]  arXiv:2406.00734 [pdf, other]
Title: GLADformer: A Mixed Perspective for Graph-level Anomaly Detection
Subjects: Machine Learning (cs.LG)
[531]  arXiv:2406.00681 [pdf, other]
Title: Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Subjects: Machine Learning (cs.LG)
[532]  arXiv:2406.00661 [pdf, other]
Title: Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[533]  arXiv:2406.00655 [pdf, other]
Title: Generalized Exponentiated Gradient Algorithms and Their Application to On-Line Portfolio Selection
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Portfolio Management (q-fin.PM)
[534]  arXiv:2406.00645 [pdf, other]
Title: FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[535]  arXiv:2406.00633 [pdf, other]
Title: Improving GFlowNets for Text-to-Image Diffusion Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[536]  arXiv:2406.00619 [pdf, ps, other]
Title: A Multi-Graph Convolutional Neural Network Model for Short-Term Prediction of Turning Movements at Signalized Intersections
Comments: 26 pages, 12 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[537]  arXiv:2406.00614 [pdf, other]
Title: Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction
Comments: UAI 2024 (Oral). The first two authors contributed equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[538]  arXiv:2406.00611 [pdf, other]
Title: DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
Comments: Accepted at ICML 2024. 22 pages, 5 figures
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[539]  arXiv:2406.00599 [pdf, other]
Title: Robust Fair Clustering with Group Membership Uncertainty Sets
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Data Structures and Algorithms (cs.DS)
[540]  arXiv:2406.00596 [pdf, other]
Title: Multi-variable Adversarial Time-Series Forecast Model
Authors: Xiaoqiao Chen
Comments: 14 pages. arXiv admin note: text overlap with arXiv:1701.00160 by other authors
Subjects: Machine Learning (cs.LG)
[541]  arXiv:2406.00588 [pdf, other]
Title: Generalization Bound and New Algorithm for Clean-Label Backdoor Attack
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Statistics Theory (math.ST)
[542]  arXiv:2406.00578 [pdf, other]
Title: ContextFlow++: Generalist-Specialist Flow-based Generative Models with Mixed-Variable Context Encoding
Comments: Accepted to UAI 2024. Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[543]  arXiv:2406.00573 [pdf, other]
Title: VOICE: Variance of Induced Contrastive Explanations to quantify Uncertainty in Neural Network Interpretability
Comments: Journal of Selected Topics in Signal Processing (J-STSP) Special Series on AI in Signal & Data Science
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[544]  arXiv:2406.00570 [pdf, other]
Title: A Gaussian Process-based Streaming Algorithm for Prediction of Time Series With Regimes and Outliers
Comments: 8 pages, 4 figures. Accepted to the International Conference on Information Fusion 2024 (FUSION 2024)
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[545]  arXiv:2406.00569 [pdf, other]
Title: Redefining Contributions: Shapley-Driven Federated Learning
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[546]  arXiv:2406.00566 [pdf, other]
Title: An Unsupervised Approach for Periodic Source Detection in Time Series
Comments: To appear at the International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[547]  arXiv:2406.00561 [pdf, other]
Title: Learning to Approximate Particle Smoothing Trajectories via Diffusion Generative Models
Subjects: Machine Learning (cs.LG)
[548]  arXiv:2406.00552 [pdf, other]
Title: Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch
Comments: 12 pages, 1 appendix, 8 Figures, 16 Tables, Graph Neural Network, Graph Neural Networks, Full-graph training, Mini-batch training, full-batch training, distributed training, performance, epoch time, time to accuracy, accuracy
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[549]  arXiv:2406.00551 [pdf, other]
Title: Strategic Linear Contextual Bandits
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[550]  arXiv:2406.00548 [pdf, ps, other]
Title: LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[551]  arXiv:2406.00544 [pdf, other]
Title: Leveraging Knowlegde Graphs for Interpretable Feature Generation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[552]  arXiv:2406.00539 [pdf, other]
Title: CONFINE: Conformal Prediction for Interpretable Neural Networks
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[553]  arXiv:2406.00535 [pdf, other]
Title: Causal Contrastive Learning for Counterfactual Regression Over Time
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[554]  arXiv:2406.00529 [pdf, other]
Title: On the Use of Anchoring for Training Vision Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[555]  arXiv:2406.00524 [pdf, ps, other]
Title: Adaptive boosting with dynamic weight adjustment
Subjects: Machine Learning (cs.LG)
[556]  arXiv:2406.00519 [pdf, other]
Title: Learning Discrete Concepts in Latent Hierarchical Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[557]  arXiv:2406.00509 [pdf, other]
Title: Empirical influence functions to understand the logic of fine-tuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[558]  arXiv:2406.00499 [pdf, ps, other]
Title: Conformal Transformation of Kernels: A Geometric Perspective on Text Classification
Comments: 30 pages
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Differential Geometry (math.DG)
[559]  arXiv:2406.00494 [pdf, other]
Title: Activation-Descent Regularization for Input Optimization of ReLU Networks
Comments: ICML'24 Proceedings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[560]  arXiv:2406.00489 [pdf, other]
Title: Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[561]  arXiv:2406.00488 [pdf, other]
Title: Federated Model Heterogeneous Matryoshka Representation Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[562]  arXiv:2406.00487 [pdf, other]
Title: Optimistic Rates for Learning from Label Proportions
Comments: Accepted to COLT 2024. Comments welcome
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[563]  arXiv:2406.00483 [pdf, other]
Title: Exploring the limits of Hierarchical World Models in Reinforcement Learning
Comments: 26 pages, 14 figures
Subjects: Machine Learning (cs.LG)
[564]  arXiv:2406.00469 [pdf, other]
Title: Learning to Solve Multiresolution Matrix Factorization by Manifold Optimization and Evolutionary Metaheuristics
Comments: arXiv admin note: substantial text overlap with arXiv:2111.01940
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[565]  arXiv:2406.00456 [pdf, other]
Title: Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
Comments: 17 pages, 6 figures and 8 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[566]  arXiv:2406.00452 [pdf, other]
Title: Towards a Unified Framework of Clustering-based Anomaly Detection
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[567]  arXiv:2406.00438 [pdf, other]
Title: Stein Random Feature Regression
Comments: To appear at UAI24
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[568]  arXiv:2406.00431 [pdf, ps, other]
Title: SpaFL: Communication-Efficient Federated Learning with Sparse Models and Low computational Overhead
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[569]  arXiv:2406.00426 [pdf, other]
Title: InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[570]  arXiv:2406.00418 [pdf, other]
Title: GATE: How to Keep Out Intrusive Neighbors
Comments: 26 pages. To be published at the International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG)
[571]  arXiv:2406.00410 [pdf, other]
Title: Posterior Label Smoothing for Node Classification
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[572]  arXiv:2406.00403 [pdf, other]
Title: Dual-perspective Cross Contrastive Learning in Graph Transformers
Comments: 12 pages, 5 figures, submitted to IEEE TKDE
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[573]  arXiv:2406.00396 [pdf, other]
Title: Stochastic Restarting to Overcome Overfitting in Neural Networks with Noisy Labels
Comments: 21 pages, 10 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[574]  arXiv:2406.00394 [pdf, other]
Title: Learning Causal Abstractions of Linear Structural Causal Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[575]  arXiv:2406.00371 [pdf, ps, other]
Title: Alternative Methods to SHAP Derived from Properties of Kernels: A Note on Theoretical Analysis
Subjects: Machine Learning (cs.LG)
[576]  arXiv:2406.00368 [pdf, other]
Title: Modeling Randomly Observed Spatiotemporal Dynamical Systems
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[577]  arXiv:2406.00335 [pdf, other]
Title: Benchmarking for Deep Uplift Modeling in Online Marketing
Subjects: Machine Learning (cs.LG)
[578]  arXiv:2406.00332 [pdf, ps, other]
Title: A Structured Review of Literature on Uncertainty in Machine Learning & Deep Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[579]  arXiv:2406.00324 [pdf, other]
Title: Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[580]  arXiv:2406.00318 [pdf, other]
Title: KGLink: A column type annotation method that combines knowledge graph and pre-trained language model
Comments: To be published in ICDE 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[581]  arXiv:2406.00302 [pdf, other]
Title: FedAST: Federated Asynchronous Simultaneous Training
Comments: Accepted to UAI 2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[582]  arXiv:2406.00300 [pdf, other]
Title: Coded Computing: A Learning-Theoretic Framework
Comments: 28 pages, 4 figures
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT)
[583]  arXiv:2406.00291 [pdf, other]
Title: Multi-objective Neural Architecture Search by Learning Search Space Partitions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[584]  arXiv:2406.00288 [pdf, other]
Title: Neural Optimal Transport with Lagrangian Costs
Comments: UAI 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[585]  arXiv:2406.00281 [pdf, other]
Title: Cross-Table Pretraining towards a Universal Function Space for Heterogeneous Tabular Data
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[586]  arXiv:2406.00276 [pdf, ps, other]
Title: Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Data Analysis, Statistics and Probability (physics.data-an)
[587]  arXiv:2406.00262 [pdf, other]
Title: Contrastive Learning Via Equivariant Representation
Comments: Preprint. Under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[588]  arXiv:2406.00249 [pdf, other]
Title: Privacy Challenges in Meta-Learning: An Investigation on Model-Agnostic Meta-Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[589]  arXiv:2406.00240 [pdf, other]
Title: Exploring Vulnerabilities and Protections in Large Language Models: A Survey
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[590]  arXiv:2406.00234 [pdf, other]
Title: Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[591]  arXiv:2406.00209 [pdf, other]
Title: Mamba State-Space Models Can Be Strong Downstream Learners
Comments: 16 pages, 4 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[592]  arXiv:2406.00177 [pdf, other]
Title: Flexible and Efficient Surrogate Gradient Modeling with Forward Gradient Injection
Authors: Sebastian Otte
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[593]  arXiv:2406.00153 [pdf, other]
Title: $μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers
Subjects: Machine Learning (cs.LG)
[594]  arXiv:2406.00150 [pdf, other]
Title: Non-Federated Multi-Task Split Learning for Heterogeneous Sources
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[595]  arXiv:2406.00144 [pdf, other]
Title: Query2CAD: Generating CAD models using natural language queries
Comments: 8 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
[596]  arXiv:2406.00134 [pdf, other]
Title: Anomaly Detection in Dynamic Graphs: A Comprehensive Survey
Comments: 32 pages (double column), 4 figures, and the manuscript has just been accepted in ACM Journals of Transactions on Knowledge Discovery from Data (TKDD)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[597]  arXiv:2406.00133 [pdf, other]
Title: Streamflow Prediction with Uncertainty Quantification for Water Management: A Constrained Reasoning and Learning Approach
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[598]  arXiv:2406.00132 [pdf, other]
Title: QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[599]  arXiv:2406.00131 [pdf, other]
Title: How In-Context Learning Emerges from Training on Unstructured Data: On the Role of Co-Occurrence, Positional Information, and Noise Structures
Comments: 33 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[600]  arXiv:2406.00120 [pdf, other]
Title: Reward Machines for Deep RL in Noisy and Uncertain Environments
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Formal Languages and Automata Theory (cs.FL)
[601]  arXiv:2406.00118 [pdf, other]
Title: ADEP: A Novel Approach Based on Discriminator-Enhanced Encoder-Decoder Architecture for Accurate Prediction of Adverse Effects in Polypharmacy
Comments: 13 pages, 1 figure
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[602]  arXiv:2406.00104 [pdf, other]
Title: Scalable Bayesian Learning with posteriors
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[603]  arXiv:2406.00081 [pdf, other]
Title: From Structured to Unstructured:A Comparative Analysis of Computer Vision and Graph Models in solving Mesh-based PDEs
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Computer Vision and Pattern Recognition (cs.CV)
[604]  arXiv:2406.00080 [pdf, other]
Title: An Efficient Multi Quantile Regression Network with Ad Hoc Prevention of Quantile Crossing
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[605]  arXiv:2406.00079 [pdf, other]
Title: Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
Comments: arXiv admin note: text overlap with arXiv:2405.20692. arXiv admin note: text overlap with arXiv:2405.20692; text overlap with arXiv:2305.16554, arXiv:2210.14215 by other authors
Subjects: Machine Learning (cs.LG)
[606]  arXiv:2406.00075 [pdf, other]
Title: Arbitrary Length Generalization for Addition
Comments: First draft (8 pages, 1 figure)
Subjects: Machine Learning (cs.LG); Applications (stat.AP); Machine Learning (stat.ML)
[607]  arXiv:2406.00073 [pdf, other]
Title: A Novel Review of Stability Techniques for Improved Privacy-Preserving Machine Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[608]  arXiv:2406.00061 [pdf, other]
Title: STAT: Shrinking Transformers After Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[609]  arXiv:2406.01594 (cross-list from cs.CV) [pdf, other]
Title: DiffUHaul: A Training-Free Method for Object Dragging in Images
Comments: Project page is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[610]  arXiv:2406.01592 (cross-list from cs.CV) [pdf, other]
Title: Text-guided Controllable Mesh Refinement for Interactive 3D Modeling
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[611]  arXiv:2406.01589 (cross-list from stat.ML) [pdf, other]
Title: Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks
Comments: Accepted to ICML 2024
Subjects: Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[612]  arXiv:2406.01583 (cross-list from cs.CV) [pdf, other]
Title: Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP
Comments: 22 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[613]  arXiv:2406.01575 (cross-list from math.OC) [pdf, other]
Title: Stochastic Bilevel Optimization with Lower-Level Contextual Markov Decision Processes
Comments: 54 pages, 18 Figures
Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[614]  arXiv:2406.01566 (cross-list from cs.DC) [pdf, other]
Title: Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[615]  arXiv:2406.01561 (cross-list from cs.CV) [pdf, other]
Title: Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[616]  arXiv:2406.01552 (cross-list from stat.ML) [pdf, ps, other]
Title: Learning equivariant tensor functions with applications to sparse vector recovery
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[617]  arXiv:2406.01544 (cross-list from cs.RO) [pdf, other]
Title: Learning from Mistakes: a Weakly-supervised Method for Mitigating the Distribution Shift in Autonomous Vehicle Planning
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[618]  arXiv:2406.01517 (cross-list from cs.SI) [pdf, other]
Title: Beyond symmetrization: effective adjacency matrices and renormalization for (un)singed directed graphs
Comments: This work was carried out during the author's PhD program at the University of S\~ao Paulo (USP), S\~ao Carlos, Brazil
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[619]  arXiv:2406.01506 (cross-list from cs.CL) [pdf, other]
Title: The Geometry of Categorical and Hierarchical Concepts in Large Language Models
Comments: Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[620]  arXiv:2406.01494 (cross-list from cs.CV) [pdf, other]
Title: Robust Classification by Coupling Data Mollification with Label Smoothing
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[621]  arXiv:2406.01484 (cross-list from math.OC) [pdf, other]
Title: Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization
Comments: To appear in ICML 2024
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY)
[622]  arXiv:2406.01478 (cross-list from math.OC) [pdf, other]
Title: Stochastic Newton Proximal Extragradient Method
Comments: 32 pages, 1 figure
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[623]  arXiv:2406.01468 (cross-list from cs.CL) [pdf, other]
Title: Understanding Token Probability Encoding in Output Embeddings
Comments: 15 pages, 17 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[624]  arXiv:2406.01455 (cross-list from cs.CV) [pdf, other]
Title: Automatic Fused Multimodal Deep Learning for Plant Identification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[625]  arXiv:2406.01446 (cross-list from cs.CL) [pdf, ps, other]
Title: Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach
Authors: Ara Yeroyan (Data Science Department, American University of Armenia), Nikolay Karpov (Nvidia, NeMo Conversational AI team)
Comments: 13 pages, 10 figures (including ablation studies), to be published in 2024 IEEE Spoken Language Technology Workshop. Additionally, the associated software package can be accessed at (this https URL) for practical applications and further development
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[626]  arXiv:2406.01421 (cross-list from cs.AI) [pdf, ps, other]
Title: Problematizing AI Omnipresence in Landscape Architecture
Journal-ref: Journal of Digital Landscape Architecture, 2024
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[627]  arXiv:2406.01402 (cross-list from cs.CV) [pdf, other]
Title: Mixture of Rationale: Multi-Modal Reasoning Mixture for Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[628]  arXiv:2406.01400 (cross-list from physics.comp-ph) [pdf, other]
Title: Efficient Computation Using Spatial-Photonic Ising Machines: Utilizing Low-Rank and Circulant Matrix Constraints
Comments: 15 pages, 7 figures
Subjects: Computational Physics (physics.comp-ph); Disordered Systems and Neural Networks (cond-mat.dis-nn); Emerging Technologies (cs.ET); Machine Learning (cs.LG); Optics (physics.optics)
[629]  arXiv:2406.01365 (cross-list from cs.CV) [pdf, other]
Title: From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation
Comments: Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[630]  arXiv:2406.01321 (cross-list from cs.SD) [pdf, other]
Title: Sequence-to-Sequence Multi-Modal Speech In-Painting
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[631]  arXiv:2406.01315 (cross-list from cs.CV) [pdf, other]
Title: Scale-Free Image Keypoints Using Differentiable Persistent Homology
Comments: Accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Algebraic Topology (math.AT)
[632]  arXiv:2406.01288 (cross-list from cs.CL) [pdf, other]
Title: Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[633]  arXiv:2406.01285 (cross-list from cs.IR) [pdf, other]
Title: Large Language Models as Recommender Systems: A Study of Popularity Bias
Comments: Accepted at Gen-IR@SIGIR24 workshop
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[634]  arXiv:2406.01278 (cross-list from cs.CV) [pdf, other]
Title: fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Machine Learning (cs.LG)
[635]  arXiv:2406.01275 (cross-list from cs.AI) [pdf, other]
Title: Lifting Factor Graphs with Some Unknown Factors
Comments: Accepted to the Proceedings of the 17th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (ECSQARU-23)
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[636]  arXiv:2406.01250 (cross-list from cs.DB) [pdf, other]
Title: DumpKV: Learning based lifetime aware garbage collection for key value separation in LSM-tree
Comments: Hi
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[637]  arXiv:2406.01205 (cross-list from eess.AS) [pdf, other]
Title: ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[638]  arXiv:2406.01203 (cross-list from cs.CV) [pdf, other]
Title: Scaling Up Deep Clustering Methods Beyond ImageNet-1K
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[639]  arXiv:2406.01191 (cross-list from eess.IV) [pdf, other]
Title: S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography
Comments: This paper is submitted to 2024 IEEE International Conference on Cyborg and Bionic Systems, and still under review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[640]  arXiv:2406.01187 (cross-list from eess.IV) [pdf, other]
Title: Patch-Based Encoder-Decoder Architecture for Automatic Transmitted Light to Fluorescence Imaging Transition: Contribution to the LightMyCells Challenge
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[641]  arXiv:2406.01149 (cross-list from stat.ML) [pdf, ps, other]
Title: Agnostic Learning of Mixed Linear Regressions with EM and AM Algorithms
Comments: To appear in ICML 2024
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[642]  arXiv:2406.01096 (cross-list from cs.CL) [pdf, ps, other]
Title: Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Journal-ref: International Journal of Innovative Science and Research Technology: Vol. 9 (2024): No. 5, 1499-1508
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[643]  arXiv:2406.01080 (cross-list from cs.CR) [pdf, other]
Title: No Vandalism: Privacy-Preserving and Byzantine-Robust Federated Learning
Subjects: Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[644]  arXiv:2406.01076 (cross-list from cs.CV) [pdf, other]
Title: Estimating Canopy Height at Scale
Comments: ICML Camera-Ready, 17 pages, 14 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[645]  arXiv:2406.01071 (cross-list from cs.CV) [pdf, other]
Title: Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[646]  arXiv:2406.01056 (cross-list from cs.CV) [pdf, other]
Title: Virtual avatar generation models as world navigators
Authors: Sai Mandava
Comments: 16 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Robotics (cs.RO)
[647]  arXiv:2406.01047 (cross-list from cs.DC) [pdf, other]
Title: An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[648]  arXiv:2406.01033 (cross-list from cs.CV) [pdf, ps, other]
Title: Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
Comments: 10 pages, 6 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[649]  arXiv:2406.01027 (cross-list from cs.DB) [pdf, other]
Title: PRICE: A Pretrained Model for Cross-Database Cardinality Estimation
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[650]  arXiv:2406.01018 (cross-list from eess.AS) [pdf, other]
Title: Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training
Comments: Under review
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[651]  arXiv:2406.00998 (cross-list from stat.ML) [pdf, other]
Title: Distributional Refinement Network: Distributional Forecasting via Deep Learning
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Risk Management (q-fin.RM); Methodology (stat.ME)
[652]  arXiv:2406.00973 (cross-list from cs.IR) [pdf, other]
Title: Cold-start Recommendation by Personalized Embedding Region Elicitation
Comments: Accepted at UAI 2024
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[653]  arXiv:2406.00956 (cross-list from cs.CV) [pdf, other]
Title: Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation
Comments: Project Link: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[654]  arXiv:2406.00920 (cross-list from stat.ML) [pdf, ps, other]
Title: Demystifying SGD with Doubly Stochastic Gradients
Comments: Accepted to ICML'24
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[655]  arXiv:2406.00918 (cross-list from cs.CR) [pdf, other]
Title: Assessing the Adversarial Security of Perceptual Hashing Algorithms
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[656]  arXiv:2406.00907 (cross-list from cs.CV) [pdf, other]
Title: DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery
Comments: 29 pages, 16 figures; MIDL 2024 - Medical Imaging with Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[657]  arXiv:2406.00901 (cross-list from cs.MM) [pdf, other]
Title: Robust Multi-Modal Speech In-Painting: A Sequence-to-Sequence Approach
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[658]  arXiv:2406.00879 (cross-list from quant-ph) [pdf, ps, other]
Title: Quantum Equilibrium Propagation: Gradient-Descent Training of Quantum Systems
Subjects: Quantum Physics (quant-ph); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[659]  arXiv:2406.00873 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Scaffold Splits Overestimate Virtual Screening Performance
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[660]  arXiv:2406.00856 (cross-list from cs.CV) [pdf, other]
Title: DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection
Comments: 6 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[661]  arXiv:2406.00853 (cross-list from stat.ML) [pdf, other]
Title: A Tutorial on Doubly Robust Learning for Causal Inference
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[662]  arXiv:2406.00843 (cross-list from quant-ph) [pdf, other]
Title: Diffusion-Inspired Quantum Noise Mitigation in Parameterized Quantum Circuits
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[663]  arXiv:2406.00832 (cross-list from cs.CL) [pdf, other]
Title: BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[664]  arXiv:2406.00823 (cross-list from stat.ML) [pdf, other]
Title: Lasso Bandit with Compatibility Condition on Optimal Arm
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[665]  arXiv:2406.00812 (cross-list from stat.ML) [pdf, other]
Title: Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[666]  arXiv:2406.00809 (cross-list from math.NA) [pdf, other]
Title: Graph Neural Preconditioners for Iterative Solutions of Sparse Linear Systems
Authors: Jie Chen
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[667]  arXiv:2406.00793 (cross-list from stat.ML) [pdf, other]
Title: Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective
Comments: Accepted at International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[668]  arXiv:2406.00778 (cross-list from stat.ML) [pdf, other]
Title: Bayesian Joint Additive Factor Models for Multiview Learning
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME)
[669]  arXiv:2406.00755 (cross-list from cs.CL) [pdf, other]
Title: Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Comments: ACL Findings 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[670]  arXiv:2406.00750 (cross-list from cs.CV) [pdf, other]
Title: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models
Comments: project can be found in: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[671]  arXiv:2406.00741 (cross-list from cs.AI) [pdf, other]
Title: Learning to Play 7 Wonders Duel Without Human Supervision
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[672]  arXiv:2406.00735 (cross-list from q-bio.BM) [pdf, other]
Title: Full-Atom Peptide Design based on Multi-modal Flow Matching
Comments: ICML 2024
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[673]  arXiv:2406.00713 (cross-list from stat.ML) [pdf, other]
Title: Logistic Variational Bayes Revisited
Comments: Accepted at the 41st International Conference on Machine Learning
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[674]  arXiv:2406.00704 (cross-list from cs.CV) [pdf, other]
Title: An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites
Comments: 8 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[675]  arXiv:2406.00695 (cross-list from physics.flu-dyn) [pdf, other]
Title: Discovering an interpretable mathematical expression for a full wind-turbine wake with artificial intelligence enhanced symbolic regression
Subjects: Fluid Dynamics (physics.flu-dyn); Machine Learning (cs.LG); Symbolic Computation (cs.SC); Applications (stat.AP)
[676]  arXiv:2406.00685 (cross-list from cs.CV) [pdf, other]
Title: Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[677]  arXiv:2406.00667 (cross-list from eess.IV) [pdf, other]
Title: An Early Investigation into the Utility of Multimodal Large Language Models in Medical Imaging
Comments: Accepted in Fifth IEEE Workshop on Artificial Intelligence for HealthCare, IEEE 25th International Conference on Information Reuse and Integration for Data Science
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[678]  arXiv:2406.00663 (cross-list from cs.CV) [pdf, other]
Title: SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction
Comments: Published at ISBI 2024. Awarded Top 12 Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[679]  arXiv:2406.00630 (cross-list from stat.ML) [pdf, other]
Title: On Non-asymptotic Theory of Recurrent Neural Networks in Temporal Point Processes
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[680]  arXiv:2406.00628 (cross-list from cs.CL) [pdf, other]
Title: Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models
Comments: A preprint, 17 pages. 11 images
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computers and Society (cs.CY); Machine Learning (cs.LG)
[681]  arXiv:2406.00615 (cross-list from cs.IR) [pdf, other]
Title: Making Recommender Systems More Knowledgeable: A Framework to Incorporate Side Information
Comments: 15 pages, 8 figures
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[682]  arXiv:2406.00532 (cross-list from cs.AI) [pdf, other]
Title: Breast Cancer Diagnosis: A Comprehensive Exploration of Explainable Artificial Intelligence (XAI) Techniques
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[683]  arXiv:2406.00518 (cross-list from cs.RO) [pdf, other]
Title: Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning
Authors: Andrej Orsula
Comments: Robot Air Hockey Challenge 2023 | The source code is available at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[684]  arXiv:2406.00502 (cross-list from math.OC) [pdf, other]
Title: Non-geodesically-convex optimization in the Wasserstein space
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[685]  arXiv:2406.00501 (cross-list from cs.CV) [pdf, other]
Title: Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection
Comments: Accepted at the 19th International Conference on Computer Vision Theory and Applications (VISAPP 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[686]  arXiv:2406.00492 (cross-list from eess.IV) [pdf, other]
Title: SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[687]  arXiv:2406.00447 (cross-list from cs.CV) [pdf, other]
Title: DroneVis: Versatile Computer Vision Library for Drones
Comments: 23 pages, 15 figure, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Robotics (cs.RO)
[688]  arXiv:2406.00441 (cross-list from physics.chem-ph) [pdf, other]
Title: Neural Polarization: Toward Electron Density for Molecules by Extending Equivariant Networks
Subjects: Chemical Physics (physics.chem-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[689]  arXiv:2406.00424 (cross-list from stat.ML) [pdf, other]
Title: A Batch Sequential Halving Algorithm without Performance Degradation
Comments: Accepted to RLC 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[690]  arXiv:2406.00423 (cross-list from cs.CV) [pdf, other]
Title: Multimodal Metadata Assignment for Cultural Heritage Artifacts
Journal-ref: Multimedia Systems 29 (2023) 847-869
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[691]  arXiv:2406.00416 (cross-list from stat.ML) [pdf, other]
Title: Representation and De-interleaving of Mixtures of Hidden Markov Processes
Comments: 13 pages, 9 figures, submitted to IEEE transactions on Signal Processing
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)
[692]  arXiv:2406.00409 (cross-list from cs.CV) [pdf, other]
Title: Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach
Comments: 6 pages, 11 figures, 4 tables, International IEEE Conference on the Intelligent Methods, Systems, and Applications (IMSA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Neural and Evolutionary Computing (cs.NE)
[693]  arXiv:2406.00389 (cross-list from cs.NE) [pdf, other]
Title: Understanding the Convergence in Balanced Resonate-and-Fire Neurons
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[694]  arXiv:2406.00345 (cross-list from cs.CV) [pdf, other]
Title: DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection
Comments: Accepted by ICML 2024. Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[695]  arXiv:2406.00339 (cross-list from cs.DS) [pdf, other]
Title: Turnstile $\ell_p$ leverage score sampling with applications
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[696]  arXiv:2406.00329 (cross-list from eess.IV) [pdf, other]
Title: Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[697]  arXiv:2406.00328 (cross-list from cs.DS) [pdf, other]
Title: Optimal bounds for $\ell_p$ sensitivity sampling via $\ell_2$ augmentation
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[698]  arXiv:2406.00317 (cross-list from stat.ML) [pdf, other]
Title: Combining Experimental and Historical Data for Policy Evaluation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[699]  arXiv:2406.00314 (cross-list from cs.CL) [pdf, other]
Title: CASE: Curricular Data Pre-training for Building Generative and Discriminative Assistive Psychology Expert Models
Comments: 19 pages (single column), 5 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[700]  arXiv:2406.00294 (cross-list from cs.SD) [pdf, other]
Title: Creative Text-to-Audio Generation via Synthesizer Programming
Comments: Accepted to ICML 2024
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[701]  arXiv:2406.00290 (cross-list from cs.CV) [pdf, other]
Title: Phasor-Driven Acceleration for FFT-based CNNs
Comments: Presented in the 21st Conference on Robots and Vision (CRV 2024) Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[702]  arXiv:2406.00275 (cross-list from cs.CV) [pdf, other]
Title: StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization
Comments: Accepted at ICML 2024; Work in 2022 spring
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[703]  arXiv:2406.00239 (cross-list from cs.CV) [pdf, other]
Title: A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing
Comments: The 25th International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV 2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[704]  arXiv:2406.00238 (cross-list from cs.GR) [pdf, other]
Title: Robust Biharmonic Skinning Using Geometric Fields
Subjects: Graphics (cs.GR); Machine Learning (cs.LG)
[705]  arXiv:2406.00237 (cross-list from eess.IV) [pdf, other]
Title: A Comparative Study of CNN, ResNet, and Vision Transformers for Multi-Classification of Chest Diseases
Comments: 8 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[706]  arXiv:2406.00222 (cross-list from cs.CL) [pdf, other]
Title: Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[707]  arXiv:2406.00198 (cross-list from cs.IR) [pdf, other]
Title: ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering
Comments: Published as a conference paper at ICLR 2024; authors' version
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[708]  arXiv:2406.00192 (cross-list from eess.IV) [pdf, other]
Title: Direct Cardiac Segmentation from Undersampled K-space Using Transformers
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[709]  arXiv:2406.00183 (cross-list from physics.chem-ph) [pdf, other]
Title: Predicting solvation free energies with an implicit solvent machine learning potential
Subjects: Chemical Physics (physics.chem-ph); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[710]  arXiv:2406.00147 (cross-list from cs.GT) [pdf, other]
Title: Fair Allocation in Dynamic Mechanism Design
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Theoretical Economics (econ.TH)
[711]  arXiv:2406.00146 (cross-list from cs.SD) [pdf, other]
Title: A Survey of Deep Learning Audio Generation Methods
Comments: 14 pages, 2 figures
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[712]  arXiv:2406.00135 (cross-list from cs.CV) [pdf, other]
Title: Advancing Ear Biometrics: Enhancing Accuracy and Robustness through Deep Learning
Comments: 6 pages, 8 figures, 3 tables, International IEEE Conference on the Intelligent Methods, Systems, and Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Multimedia (cs.MM)
[713]  arXiv:2406.00127 (cross-list from stat.ML) [pdf, ps, other]
Title: Training on the Edge of Stability Is Caused by Layerwise Jacobian Alignment
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[714]  arXiv:2406.00125 (cross-list from eess.IV) [pdf, ps, other]
Title: TotalVibeSegmentator: Full Torso Segmentation for the NAKO and UK Biobank in Volumetric Interpolated Breath-hold Examination Body Images
Comments: this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[715]  arXiv:2406.00116 (cross-list from cs.HC) [pdf, other]
Title: A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[716]  arXiv:2406.00093 (cross-list from cs.CV) [pdf, other]
Title: Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[717]  arXiv:2406.00092 (cross-list from cs.AI) [pdf, other]
Title: How Random is Random? Evaluating the Randomness and Humaness of LLMs' Coin Flips
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[718]  arXiv:2406.00085 (cross-list from eess.IV) [pdf, other]
Title: Augmentation-based Unsupervised Cross-Domain Functional MRI Adaptation for Major Depressive Disorder Identification
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[719]  arXiv:2406.00083 (cross-list from cs.CR) [pdf, other]
Title: BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[720]  arXiv:2406.00071 (cross-list from astro-ph.IM) [pdf, ps, other]
Title: Optimizing Photometric Light Curve Analysis: Evaluating Scipy's Minimize Function for Eclipse Mapping of Cataclysmic Variables
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Machine Learning (cs.LG)
[721]  arXiv:2406.00069 (cross-list from cs.CL) [pdf, other]
Title: Confidence-Aware Sub-Structure Beam Search (CABS): Mitigating Hallucination in Structured Data Generation with Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[722]  arXiv:2406.00062 (cross-list from cs.CL) [pdf, other]
Title: Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[723]  arXiv:2406.00060 (cross-list from cs.CL) [pdf, other]
Title: Cascade-Aware Training of Language Models
Comments: 22 pages, 13 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[724]  arXiv:2406.00059 (cross-list from cs.CL) [pdf, other]
Title: Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution
Comments: 11 pages, 8 figures
Subjects: Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[725]  arXiv:2406.00057 (cross-list from cs.CL) [pdf, other]
Title: Toward Conversational Agents with Context and Time Sensitive Long-term Memory
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[726]  arXiv:2406.00054 (cross-list from cs.GT) [pdf, ps, other]
Title: $ε$-Optimally Solving Zero-Sum POSGs
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[727]  arXiv:2406.00053 (cross-list from cs.CL) [pdf, other]
Title: Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Comments: 9 pages, 5 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[728]  arXiv:2406.00049 (cross-list from cs.CL) [pdf, other]
Title: QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[729]  arXiv:2406.00048 (cross-list from cs.CL) [pdf, other]
Title: Towards a theory of how the structure of language is acquired by deep neural networks
Comments: 9 pages, 4 figures (main)
Subjects: Computation and Language (cs.CL); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[730]  arXiv:2406.00047 (cross-list from physics.chem-ph) [pdf, ps, other]
Title: A Theoretical Framework for an Efficient Normalizing Flow-Based Solution to the Schrodinger Equation
Subjects: Chemical Physics (physics.chem-ph); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[731]  arXiv:2406.00046 (cross-list from cs.CL) [pdf, other]
Title: Hate Speech Detection with Generalizable Target-aware Fairness
Comments: To appear in KDD 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[732]  arXiv:2406.00045 (cross-list from cs.CL) [pdf, other]
Title: Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[733]  arXiv:2406.00044 (cross-list from cs.CL) [pdf, other]
Title: Stochastic Adversarial Networks for Multi-Domain Text Classification
Authors: Xu Wang, Yuan Wu
Comments: Technical report
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[734]  arXiv:2406.00036 (cross-list from cs.CL) [pdf, other]
Title: EMERGE: Integrating RAG for Improved Multimodal EHR Predictive Modeling
Comments: arXiv admin note: text overlap with arXiv:2402.07016
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[735]  arXiv:2406.00031 (cross-list from cs.CL) [pdf, other]
Title: AMGPT: a Large Language Model for Contextual Querying in Additive Manufacturing
Comments: 54 pages, 4 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[736]  arXiv:2406.00030 (cross-list from cs.CL) [pdf, other]
Title: Large Language Model Pruning
Authors: Hanjuan Huang (1) (2), Hao-Jia Song (1), Hsing-Kuo Pao (1) ((1) Dept. of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taipei, Taiwan, (2) College of Mechanical and Electrical Engineering, WUYI University, Wuyishan, China)
Comments: 17 pages, 7 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[737]  arXiv:2406.00028 (cross-list from cs.CL) [pdf, ps, other]
Title: Persian Homograph Disambiguation: Leveraging ParsBERT for Enhanced Sentence Understanding with a Novel Word Disambiguation Dataset
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[738]  arXiv:2406.00027 (cross-list from cs.CL) [pdf, other]
Title: Adapting PromptORE for Modern History: Information Extraction from Hispanic Monarchy Documents of the XVIth Century
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[739]  arXiv:2406.00024 (cross-list from cs.CL) [pdf, other]
Title: Embedding-Aligned Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[740]  arXiv:2406.00013 (cross-list from cs.IR) [pdf, ps, other]
Title: Thesis: Document Summarization with applications to Keyword extraction and Image Retrieval
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[741]  arXiv:2406.00004 (cross-list from cs.IR) [pdf, other]
Title: Navigating the Future of Federated Recommendation Systems with Foundation Models
Comments: 20 pages, position paper
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[742]  arXiv:2406.00001 (cross-list from cs.RO) [pdf, other]
Title: PhyPlan: Generalizable and Rapid Physical Task Planning with Physics Informed Skill Networks for Robot Manipulators
Comments: arXiv admin note: substantial text overlap with arXiv:2402.15767
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[743]  arXiv:2405.19815 (cross-list from cs.AI) [pdf, other]
Title: Efficient Stimuli Generation using Reinforcement Learning in Design Verification
Comments: Accepted for publication at the 20th International Conference on Synthesis, Modeling, Analysis and Simulation Methods, and Applications to Circuit Design (SMACD'24), Jul 2-5 2024, Volos, Greece
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Mon, 3 Jun 2024

[744]  arXiv:2405.21064 [pdf, other]
Title: Recurrent neural networks: vanishing and exploding gradients are not the end of the story
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[745]  arXiv:2405.21063 [pdf, other]
Title: Neural Network Verification with Branch-and-Bound for General Nonlinearities
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[746]  arXiv:2405.21061 [pdf, other]
Title: Graph External Attention Enhanced Transformer
Comments: In Proceedings of ICML 2024
Subjects: Machine Learning (cs.LG)
[747]  arXiv:2405.21060 [pdf, other]
Title: Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
Authors: Tri Dao, Albert Gu
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[748]  arXiv:2405.21046 [pdf, other]
Title: Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[749]  arXiv:2405.21045 [pdf, ps, other]
Title: An Attention-Based Multi-Context Convolutional Encoder-Decoder Neural Network for Work Zone Traffic Impact Prediction
Subjects: Machine Learning (cs.LG)
[750]  arXiv:2405.21043 [pdf, other]
Title: Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Journal-ref: Proceedings of the 41 st International Conference on Machine Learning, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[751]  arXiv:2405.21042 [pdf, other]
Title: Comparing information content of representation spaces for disentanglement with VAE ensembles
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG)
[752]  arXiv:2405.21036 [pdf, ps, other]
Title: A-PETE: Adaptive Prototype Explanations of Tree Ensembles
Subjects: Machine Learning (cs.LG)
[753]  arXiv:2405.21021 [pdf, other]
Title: Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Dynamical Systems (math.DS)
[754]  arXiv:2405.21018 [pdf, other]
Title: Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[755]  arXiv:2405.21012 [pdf, other]
Title: G-Transformer for Conditional Average Potential Outcome Estimation over Time
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[756]  arXiv:2405.21003 [pdf, other]
Title: Explaining Predictions by Characteristic Rules
Comments: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022
Journal-ref: In: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13713. Springer, Cham (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[757]  arXiv:2405.20988 [pdf, other]
Title: Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[758]  arXiv:2405.20986 [pdf, other]
Title: Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[759]  arXiv:2405.20984 [pdf, other]
Title: Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Comments: Forty-first International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG)
[760]  arXiv:2405.20973 [pdf, other]
Title: LCQ: Low-Rank Codebook based Quantization for Large Language Models
Authors: Wen-Pu Cai, Wu-Jun Li
Comments: 10 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[761]  arXiv:2405.20971 [pdf, other]
Title: Amortizing intractable inference in diffusion models for vision, language, and control
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[762]  arXiv:2405.20954 [pdf, other]
Title: Aligning Multiclass Neural Network Classifier Criterion with Task Performance via $F_β$-Score
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[763]  arXiv:2405.20935 [pdf, other]
Title: Effective Interplay between Sparsity and Quantization: From Theory to Practice
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[764]  arXiv:2405.20933 [pdf, ps, other]
Title: Concentration Bounds for Optimized Certainty Equivalent Risk Estimation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[765]  arXiv:2405.20915 [pdf, other]
Title: Fast yet Safe: Early-Exiting with Risk Control
Comments: 25 pages, 11 figures, 4 tables (incl. appendix)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[766]  arXiv:2405.20905 [pdf, other]
Title: VENI, VINDy, VICI: a variational reduced-order modeling framework with uncertainty quantification
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Dynamical Systems (math.DS)
[767]  arXiv:2405.20882 [pdf, other]
Title: Sheaf HyperNetworks for Personalized Federated Learning
Comments: 25 pages, 12 figures, 7 tables, pre-print under review
Subjects: Machine Learning (cs.LG)
[768]  arXiv:2405.20879 [pdf, other]
Title: Flow matching achieves minimax optimal convergence
Subjects: Machine Learning (cs.LG)
[769]  arXiv:2405.20860 [pdf, other]
Title: Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Subjects: Machine Learning (cs.LG)
[770]  arXiv:2405.20838 [pdf, other]
Title: einspace: Searching for Neural Architectures from Fundamental Operations
Comments: Project page at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[771]  arXiv:2405.20835 [pdf, other]
Title: Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[772]  arXiv:2405.20824 [pdf, ps, other]
Title: Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[773]  arXiv:2405.20821 [pdf, other]
Title: Pursuing Overall Welfare in Federated Learning through Sequential Decision Making
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[774]  arXiv:2405.20800 [pdf, other]
Title: Shape Constraints in Symbolic Regression using Penalized Least Squares
Subjects: Machine Learning (cs.LG); Symbolic Computation (cs.SC)
[775]  arXiv:2405.20794 [pdf, ps, other]
Title: Model Interpretation and Explainability: Towards Creating Transparency in Prediction Models
Subjects: Machine Learning (cs.LG)
[776]  arXiv:2405.20790 [pdf, other]
Title: Intersectional Unfairness Discovery
Comments: ICML-2024 camera-ready
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[777]  arXiv:2405.20772 [pdf, ps, other]
Title: Reinforcement Learning for Sociohydrology
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[778]  arXiv:2405.20763 [pdf, other]
Title: Improving Generalization and Convergence by Enhancing Implicit Regularization
Comments: 35 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[779]  arXiv:2405.20761 [pdf, other]
Title: Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning
Comments: Submitted to the 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2024)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[780]  arXiv:2405.20759 [pdf, other]
Title: Information Theoretic Text-to-Image Alignment
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[781]  arXiv:2405.20738 [pdf, other]
Title: Federated Random Forest for Partially Overlapping Clinical Data
Subjects: Machine Learning (cs.LG)
[782]  arXiv:2405.20724 [pdf, other]
Title: Learning on Large Graphs using Intersecting Communities
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[783]  arXiv:2405.20692 [pdf, other]
Title: In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[784]  arXiv:2405.20690 [pdf, other]
Title: Unleashing the Potential of Diffusion Models for Incomplete Data Imputation
Subjects: Machine Learning (cs.LG)
[785]  arXiv:2405.20685 [pdf, other]
Title: Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[786]  arXiv:2405.20678 [pdf, ps, other]
Title: No-Regret Learning for Fair Multi-Agent Social Welfare Optimization
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[787]  arXiv:2405.20677 [pdf, other]
Title: Provably Efficient Interactive-Grounded Learning with Personalized Reward
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[788]  arXiv:2405.20671 [pdf, other]
Title: Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers
Comments: 73 pages, 20 figures, 90 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[789]  arXiv:2405.20664 [pdf, other]
Title: Weak Robust Compatibility Between Learning Algorithms and Counterfactual Explanation Generation Algorithms
Authors: Ao Xu, Tieru Wu
Subjects: Machine Learning (cs.LG)
[790]  arXiv:2405.20652 [pdf, other]
Title: Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs
Comments: Published as a conference paper at ICML 2024
Subjects: Machine Learning (cs.LG)
[791]  arXiv:2405.20642 [pdf, other]
Title: Principal-Agent Multitasking: the Uniformity of Optimal Contracts and its Efficient Learning via Instrumental Regression
Authors: Shiliang Zuo
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[792]  arXiv:2405.20640 [pdf, other]
Title: Heterophilous Distribution Propagation for Graph Neural Networks
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[793]  arXiv:2405.20630 [pdf, other]
Title: Stochastic Optimal Control for Diffusion Bridges in Function Spaces
Subjects: Machine Learning (cs.LG)
[794]  arXiv:2405.20623 [pdf, other]
Title: Prune at the Clients, Not the Server: Accelerated Sparse Training in Federated Learning
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[795]  arXiv:2405.20622 [pdf, other]
Title: Superfast Selection for Decision Tree Algorithms
Subjects: Machine Learning (cs.LG)
[796]  arXiv:2405.20620 [pdf, other]
Title: "Forgetting" in Machine Learning and Beyond: A Survey
Subjects: Machine Learning (cs.LG)
[797]  arXiv:2405.20605 [pdf, other]
Title: Searching for internal symbols underlying deep learning
Comments: 10 pages, 7 figures, 3 tables and Appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[798]  arXiv:2405.20603 [pdf, ps, other]
Title: Advancing Financial Risk Prediction Through Optimized LSTM Model Performance and Comparative Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[799]  arXiv:2405.20602 [pdf, other]
Title: Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[800]  arXiv:2405.20594 [pdf, other]
Title: Deep Learning without Weight Symmetry
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[801]  arXiv:2405.20592 [pdf, other]
Title: LInK: Learning Joint Representations of Design and Performance Spaces through Contrastive Learning for Mechanism Synthesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[802]  arXiv:2405.20590 [pdf, other]
Title: Class-Based Time Series Data Augmentation to Mitigate Extreme Class Imbalance for Solar Flare Prediction
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Artificial Intelligence (cs.AI)
[803]  arXiv:2405.20589 [pdf, other]
Title: Selective Knowledge Sharing for Personalized Federated Learning Under Capacity Heterogeneity
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[804]  arXiv:2405.20573 [pdf, other]
Title: Enhancing Generative Molecular Design via Uncertainty-guided Fine-tuning of Variational Autoencoders
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[805]  arXiv:2405.20568 [pdf, other]
Title: Generative AI for Deep Reinforcement Learning: Framework, Analysis, and Use Cases
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[806]  arXiv:2405.20562 [pdf, other]
Title: Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A feasibility study
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[807]  arXiv:2405.20556 [pdf, other]
Title: Certifying Global Robustness for Deep Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[808]  arXiv:2405.20555 [pdf, other]
Title: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[809]  arXiv:2405.20550 [pdf, ps, other]
Title: Uncertainty Quantification for Deep Learning
Comments: 25 pages 4 figures, submitted to Environmental data Science
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[810]  arXiv:2405.20543 [pdf, other]
Title: Towards a General GNN Framework for Combinatorial Optimization
Comments: 15 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM)
[811]  arXiv:2405.20542 [pdf, ps, other]
Title: On the Connection Between Non-negative Matrix Factorization and Latent Dirichlet Allocation
Comments: 9 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[812]  arXiv:2405.20541 [pdf, other]
Title: Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[813]  arXiv:2405.20540 [pdf, ps, other]
Title: Fully Unconstrained Online Learning
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[814]  arXiv:2405.20539 [pdf, other]
Title: SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents
Comments: 23 pages, 14 figures, NeurIPS
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[815]  arXiv:2405.20538 [pdf, other]
Title: Q-learning as a monotone scheme
Authors: Lingyi Yang
Subjects: Machine Learning (cs.LG)
[816]  arXiv:2405.20534 [pdf, other]
Title: Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[817]  arXiv:2405.20531 [pdf, ps, other]
Title: Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation
Subjects: Machine Learning (cs.LG)
[818]  arXiv:2405.20516 [pdf, other]
Title: WaveCastNet: An AI-enabled Wavefield Forecasting Framework for Earthquake Early Warning
Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[819]  arXiv:2405.20513 [pdf, other]
Title: Deep Modeling of Non-Gaussian Aleatoric Uncertainty
Comments: 8 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[820]  arXiv:2405.20504 [pdf, other]
Title: FCOM: A Federated Collaborative Online Monitoring Framework via Representation Learning
Subjects: Machine Learning (cs.LG)
[821]  arXiv:2405.20503 [pdf, ps, other]
Title: Optimizing cnn-Bigru performance: Mish activation and comparative analysis with Relu
Journal-ref: International Journal of Computer Networks & Communications (IJCNC) Vol.16, No.3, May 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[822]  arXiv:2405.20486 [pdf, other]
Title: Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning
Comments: Submitted to JMLR on 5/30/2024
Subjects: Machine Learning (cs.LG)
[823]  arXiv:2405.20482 [pdf, other]
Title: Leveraging Structure Between Environments: Phylogenetic Regularization Incentivizes Disentangled Representations
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[824]  arXiv:2405.20467 [pdf, ps, other]
Title: Performance of NPG in Countable State-Space Average-Cost RL
Comments: 23 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[825]  arXiv:2405.20456 [pdf, other]
Title: Scaling Laws for the Value of Individual Data Points in Machine Learning
Comments: ICML 2024 camera-ready
Subjects: Machine Learning (cs.LG)
[826]  arXiv:2405.20452 [pdf, other]
Title: Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[827]  arXiv:2405.20448 [pdf, other]
Title: Knockout: A simple way to handle missing inputs
Subjects: Machine Learning (cs.LG)
[828]  arXiv:2405.20445 [pdf, other]
Title: GraphAny: A Foundation Model for Node Classification on Any Graph
Comments: Preprint. Work in progress
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[829]  arXiv:2405.20439 [pdf, other]
Title: Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Comments: 25 pages, 10 figures, 2 tables
Subjects: Machine Learning (cs.LG)
[830]  arXiv:2405.20435 [pdf, other]
Title: Deep Learning for Computing Convergence Rates of Markov Chains
Subjects: Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[831]  arXiv:2405.20431 [pdf, other]
Title: Exploring the Practicality of Federated Learning: A Survey Towards the Communication Perspective
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[832]  arXiv:2405.20430 [pdf, other]
Title: Enhancing Performance for Highly Imbalanced Medical Data via Data Regularization in a Federated Learning Setting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[833]  arXiv:2405.20420 [pdf, other]
Title: Back to the Basics on Predicting Transfer Performance
Comments: 15 pages, 3 figures, 2 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[834]  arXiv:2405.20419 [pdf, other]
Title: Enhancing Antibiotic Stewardship using a Natural Language Approach for Better Feature Representation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[835]  arXiv:2405.20414 [pdf, ps, other]
Title: The Impact of Ontology on the Prediction of Cardiovascular Disease Compared to Machine Learning Algorithms
Journal-ref: International journal of online and biomedical engineering, Volume 18, Issue 11, 2022, Pages 143 - 157
Subjects: Machine Learning (cs.LG)
[836]  arXiv:2405.20397 [pdf, other]
Title: Explainable Data-driven Modeling of Adsorption Energy in Heterogeneous Catalysis
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[837]  arXiv:2405.20390 [pdf, other]
Title: Quantitative Convergences of Lie Group Momentum Optimizers
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC); Machine Learning (stat.ML)
[838]  arXiv:2405.20358 [pdf, other]
Title: Medication Recommendation via Dual Molecular Modalities and Multi-Substructure Distillation
Comments: 14 pages, 9 figures
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[839]  arXiv:2405.20351 [pdf, other]
Title: ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[840]  arXiv:2405.20350 [pdf, other]
Title: Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges
Authors: Hari Srikanth
Subjects: Machine Learning (cs.LG)
[841]  arXiv:2405.21070 (cross-list from cs.CV) [pdf, other]
Title: Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[842]  arXiv:2405.21050 (cross-list from cs.CV) [pdf, other]
Title: Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[843]  arXiv:2405.21047 (cross-list from cs.AI) [pdf, other]
Title: Grammar-Aligned Decoding
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[844]  arXiv:2405.21027 (cross-list from cs.GT) [pdf, other]
Title: Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Comments: 20 pages, 5 figures
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[845]  arXiv:2405.20993 (cross-list from cs.IT) [pdf, other]
Title: Information limits and Thouless-Anderson-Palmer equations for spiked matrix models with structured noise
Subjects: Information Theory (cs.IT); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Statistics Theory (math.ST)
[846]  arXiv:2405.20991 (cross-list from cs.CV) [pdf, other]
Title: Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models
Comments: IEEE Intelligent Vehicles Symposium (IV) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[847]  arXiv:2405.20990 (cross-list from cs.CR) [pdf, other]
Title: Locking Machine Learning Models into Hardware
Comments: 10 pages, 2 figures of main text; 14 pages, 16 figures of appendices
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[848]  arXiv:2405.20987 (cross-list from cs.CV) [pdf, other]
Title: Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging
Comments: This paper is accepted at the 35th IEEE Irish Signals and Systems Conference (ISSC 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[849]  arXiv:2405.20980 (cross-list from cs.CV) [pdf, other]
Title: Neural Gaussian Scale-Space Fields
Comments: 15 pages; SIGGRAPH 2024; project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[850]  arXiv:2405.20975 (cross-list from cs.CR) [pdf, other]
Title: ACE: A Model Poisoning Attack on Contribution Evaluation Methods in Federated Learning
Comments: To appear in the 33rd USENIX Security Symposium, 2024
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[851]  arXiv:2405.20974 (cross-list from cs.CL) [pdf, other]
Title: SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Comments: The code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[852]  arXiv:2405.20970 (cross-list from stat.ML) [pdf, other]
Title: PUAL: A Classifier on Trifurcate Positive-Unlabeled Data
Comments: 24 pages, 6 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[853]  arXiv:2405.20917 (cross-list from cs.CL) [pdf, other]
Title: Learning to Estimate System Specifications in Linear Temporal Logic using Transformers and Mamba
Comments: 20 pages, 15 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[854]  arXiv:2405.20887 (cross-list from cs.SD) [pdf, other]
Title: On the Condition Monitoring of Bolted Joints through Acoustic Emission and Deep Transfer Learning: Generalization, Ordinal Loss and Super-Convergence
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[855]  arXiv:2405.20877 (cross-list from cs.IT) [pdf, other]
Title: Waveform Design for Over-the-Air Computing
Comments: 14 pages
Subjects: Information Theory (cs.IT); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Signal Processing (eess.SP); Statistics Theory (math.ST)
[856]  arXiv:2405.20848 (cross-list from cs.SE) [pdf, other]
Title: SLIM: a Scalable Light-weight Root Cause Analysis for Imbalanced Data in Microservice
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[857]  arXiv:2405.20836 (cross-list from math.NA) [pdf, other]
Title: Solving partial differential equations with sampled neural networks
Comments: 16 pages, 15 figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[858]  arXiv:2405.20830 (cross-list from cs.CL) [pdf, other]
Title: Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[859]  arXiv:2405.20829 (cross-list from cs.CV) [pdf, other]
Title: Rethinking Open-World Semi-Supervised Learning: Distribution Mismatch and Inductive Inference
Comments: CVPR Workshop on Computer Vision in the Wild (CVinW), 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[860]  arXiv:2405.20825 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Analysis of clinical, dosimetric and radiomic features for predicting local failure after stereotactic radiotherapy of brain metastases in malignant melanoma
Subjects: Medical Physics (physics.med-ph); Machine Learning (cs.LG)
[861]  arXiv:2405.20808 (cross-list from cs.DS) [pdf, other]
Title: Optimally Improving Cooperative Learning in a Social Setting
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[862]  arXiv:2405.20799 (cross-list from stat.ML) [pdf, other]
Title: Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures
Comments: Preprint. Under review. arXiv admin note: text overlap with arXiv:2403.10288
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[863]  arXiv:2405.20797 (cross-list from cs.CV) [pdf, other]
Title: Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[864]  arXiv:2405.20791 (cross-list from cs.CV) [pdf, other]
Title: GS-Phong: Meta-Learned 3D Gaussians for Relightable Novel View Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[865]  arXiv:2405.20778 (cross-list from cs.CR) [pdf, other]
Title: Improved Generation of Adversarial Examples Against Safety-aligned LLMs
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[866]  arXiv:2405.20777 (cross-list from cs.CR) [pdf, other]
Title: Black-Box Detection of Language Model Watermarks
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[867]  arXiv:2405.20776 (cross-list from cs.CR) [pdf, other]
Title: Federated Learning with Blockchain-Enhanced Machine Unlearning: A Trustworthy Approach
Comments: 13 pages, 25 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[868]  arXiv:2405.20771 (cross-list from cs.CR) [pdf, other]
Title: Towards Black-Box Membership Inference Attack for Diffusion Models
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[869]  arXiv:2405.20769 (cross-list from cs.CR) [pdf, other]
Title: Avoiding Pitfalls for Privacy Accounting of Subsampled Mechanisms under Composition
Subjects: Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[870]  arXiv:2405.20768 (cross-list from cs.NE) [pdf, other]
Title: Expanded Gating Ranges Improve Activation Functions
Authors: Allen Hao Huang
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[871]  arXiv:2405.20748 (cross-list from cs.AI) [pdf, other]
Title: OpenTensor: Reproducing Faster Matrix Multiplication Discovering Algorithms
Authors: Yiwen Sun, Wenye Li
Subjects: Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[872]  arXiv:2405.20743 (cross-list from cs.CV) [pdf, other]
Title: Trajectory Forecasting through Low-Rank Adaptation of Discrete Latent Codes
Comments: 15 pages, 3 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[873]  arXiv:2405.20731 (cross-list from cs.AI) [pdf, other]
Title: Maximum Temperature Prediction Using Remote Sensing Data Via Convolutional Neural Network
Comments: 4 pages, submitted to IEEE MetroLivEnv 2024 conference
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[874]  arXiv:2405.20717 (cross-list from cs.CV) [pdf, other]
Title: Cyclic image generation using chaotic dynamics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD)
[875]  arXiv:2405.20687 (cross-list from cs.CV) [pdf, other]
Title: Conditioning GAN Without Training Dataset
Comments: 5 pages, 2 figures, Part of my MSc project course, School Project Course 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[876]  arXiv:2405.20675 (cross-list from cs.CV) [pdf, other]
Title: Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling
Comments: 7 pages, 11 figures, ELLIS Doctoral Symposium 2023 in Helsinki, Finland
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[877]  arXiv:2405.20668 (cross-list from q-bio.BM) [pdf, other]
Title: Improving Paratope and Epitope Prediction by Multi-Modal Contrastive Learning and Interaction Informativeness Estimation
Comments: This paper is accepted by IJCAI 2024
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[878]  arXiv:2405.20649 (cross-list from cs.CL) [pdf, other]
Title: Reward-based Input Construction for Cross-document Relation Extraction
Comments: Accepted at ACL 2024 main conference
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[879]  arXiv:2405.20648 (cross-list from cs.CV) [pdf, other]
Title: Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[880]  arXiv:2405.20611 (cross-list from cs.CR) [pdf, ps, other]
Title: Bi-Directional Transformers vs. word2vec: Discovering Vulnerabilities in Lifted Compiled Code
Comments: 8 pages, 0 figures, IEEE 4th Cyber Awareness and Research Symposium 2024 (CARS'24)
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[881]  arXiv:2405.20606 (cross-list from cs.CV) [pdf, other]
Title: Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[882]  arXiv:2405.20596 (cross-list from cs.CV) [pdf, other]
Title: Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation
Comments: 10 pages; Accepted by NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[883]  arXiv:2405.20591 (cross-list from q-bio.PE) [pdf, other]
Title: Weak-Form Inference for Hybrid Dynamical Systems in Ecology
Subjects: Populations and Evolution (q-bio.PE); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[884]  arXiv:2405.20582 (cross-list from cs.CL) [pdf, ps, other]
Title: The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes
Comments: Oral presentation at NAACL 2024 Queer in AI Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[885]  arXiv:2405.20579 (cross-list from cs.RO) [pdf, other]
Title: HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios
Comments: 10 pages, 6 tables, 5 figures, 1 page appendix
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[886]  arXiv:2405.20551 (cross-list from cs.SE) [pdf, other]
Title: EM-Assist: Safe Automated ExtractMethod Refactoring with LLMs
Comments: This paper is accepted to the tool demonstration track of the 32nd ACM Symposium on the Foundations of Software Engineering (FSE 2024). This is an author copy
Subjects: Software Engineering (cs.SE); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Programming Languages (cs.PL)
[887]  arXiv:2405.20512 (cross-list from cs.CL) [pdf, other]
Title: How Multilingual Are Large Language Models Fine-Tuned for Translation?
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[888]  arXiv:2405.20505 (cross-list from cs.CL) [pdf, other]
Title: SPOT: Text Source Prediction from Originality Score Thresholding
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[889]  arXiv:2405.20501 (cross-list from cs.RO) [pdf, other]
Title: ShelfHelp: Empowering Humans to Perform Vision-Independent Manipulation Tasks with a Socially Assistive Robotic Cane
Comments: 8 pages, 14 figures and charts
Journal-ref: In AAMAS (pp. 1514-1523) 2023
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[890]  arXiv:2405.20500 (cross-list from math.OC) [pdf, other]
Title: Hybrid Reinforcement Learning Framework for Mixed-Variable Problems
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[891]  arXiv:2405.20495 (cross-list from cs.CL) [pdf, other]
Title: Transfer Q Star: Principled Decoding for LLM Alignment
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[892]  arXiv:2405.20494 (cross-list from cs.CV) [pdf, other]
Title: Slight Corruption in Pre-training Data Makes Better Diffusion Models
Comments: 50 pages, 33 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[893]  arXiv:2405.20485 (cross-list from cs.CR) [pdf, other]
Title: Phantom: General Trigger Attacks on Retrieval Augmented Language Generation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[894]  arXiv:2405.20468 (cross-list from cs.CL) [pdf, other]
Title: Extending the Massive Text Embedding Benchmark to French
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[895]  arXiv:2405.20465 (cross-list from cs.CV) [pdf, other]
Title: ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification
Comments: 5 pages, 2024 18th International Conference on Automatic Face and Gesture Recognition (FG)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[896]  arXiv:2405.20451 (cross-list from stat.ML) [pdf, other]
Title: Statistical Properties of Robust Satisficing
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[897]  arXiv:2405.20447 (cross-list from stat.ML) [pdf, other]
Title: Algorithmic Fairness in Performative Policy Learning: Escaping the Impossibility of Group Fairness
Subjects: Machine Learning (stat.ML); Computers and Society (cs.CY); Machine Learning (cs.LG)
[898]  arXiv:2405.20446 (cross-list from cs.CR) [pdf, other]
Title: Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
Comments: 7 pages, 3 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[899]  arXiv:2405.20413 (cross-list from cs.CR) [pdf, other]
Title: Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters
Comments: 20 pages
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[900]  arXiv:2405.20412 (cross-list from cs.GR) [pdf, other]
Title: Audio2Rig: Artist-oriented deep learning tool for facial animation
Comments: Video examples and description: this https URL&ab_channel=Golaem
Subjects: Graphics (cs.GR); Machine Learning (cs.LG)
[901]  arXiv:2405.20407 (cross-list from physics.ins-det) [pdf, other]
Title: Convolutional L2LFlows: Generating Accurate Showers in Highly Granular Calorimeters Using Convolutional Normalizing Flows
Subjects: Instrumentation and Detectors (physics.ins-det); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); High Energy Physics - Phenomenology (hep-ph); Data Analysis, Statistics and Probability (physics.data-an)
[902]  arXiv:2405.20405 (cross-list from cs.DS) [pdf, other]
Title: Private Mean Estimation with Person-Level Differential Privacy
Comments: 67 pages, 3 figures
Subjects: Data Structures and Algorithms (cs.DS); Cryptography and Security (cs.CR); Information Theory (cs.IT); Machine Learning (cs.LG); Machine Learning (stat.ML)
[903]  arXiv:2405.20404 (cross-list from cs.CL) [pdf, other]
Title: XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[904]  arXiv:2405.20400 (cross-list from stat.ME) [pdf, other]
Title: Fast leave-one-cluster-out cross-validation by clustered Network Information Criteria (NICc)
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[905]  arXiv:2405.20384 (cross-list from cond-mat.quant-gas) [pdf, other]
Title: Recurrent neural network wave functions for Rydberg atom arrays on kagome lattice
Comments: 13 pages, 5 figures, 3 tables. Link to GitHub repository: this https URL
Subjects: Quantum Gases (cond-mat.quant-gas); Disordered Systems and Neural Networks (cond-mat.dis-nn); Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (cs.LG); Quantum Physics (quant-ph)
[906]  arXiv:2405.20355 (cross-list from cs.NE) [pdf, other]
Title: Enhancing Adversarial Robustness in SNNs with Sparse Gradients
Comments: accepted by ICML 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[907]  arXiv:2405.20354 (cross-list from cs.DL) [pdf, other]
Title: Literature Filtering for Systematic Reviews with Transformers
Subjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[908]  arXiv:2405.20348 (cross-list from physics.ao-ph) [pdf, other]
Title: Personalized Adapter for Large Meteorology Model on Devices: Towards Weather Foundation Models
Comments: 42 pages, under review
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (cs.LG)
[909]  arXiv:2405.20347 (cross-list from cs.CL) [pdf, other]
Title: Small Language Models for Application Interactions: A Case Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Fri, 31 May 2024

[910]  arXiv:2405.20341 [pdf, other]
Title: From Zero to Hero: Cold-Start Anomaly Detection
Comments: ACL 2024. Our code is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[911]  arXiv:2405.20331 [pdf, other]
Title: CoSy: Evaluating Textual Explanations of Neurons
Comments: 10 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[912]  arXiv:2405.20313 [pdf, other]
Title: Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation
Comments: preprint
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[913]  arXiv:2405.20309 [pdf, other]
Title: Large Language Models Can Self-Improve At Web Agent Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[914]  arXiv:2405.20287 [pdf, other]
Title: Flexible SE(2) graph neural networks with applications to PDE surrogates
Comments: 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA); Fluid Dynamics (physics.flu-dyn)
[915]  arXiv:2405.20278 [pdf, ps, other]
Title: Length independent generalization bounds for deep SSM architectures with stability constraints
Comments: 25 pages, no figures, under submission
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[916]  arXiv:2405.20272 [pdf, other]
Title: Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[917]  arXiv:2405.20271 [pdf, other]
Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Comments: Accepted to ICML 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[918]  arXiv:2405.20233 [pdf, other]
Title: Grokfast: Accelerated Grokking by Amplifying Slow Gradients
Comments: 17 pages, 13 figures. Typo fixed. Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[919]  arXiv:2405.20231 [pdf, other]
Title: The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
Comments: 27 pages. Preparing code for release
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[920]  arXiv:2405.20200 [pdf, other]
Title: Unified Explanations in Machine Learning Models: A Perturbation Approach
Subjects: Machine Learning (cs.LG)
[921]  arXiv:2405.20194 [pdf, ps, other]
Title: Occam Gradient Descent
Authors: B.N. Kausik
Subjects: Machine Learning (cs.LG)
[922]  arXiv:2405.20180 [pdf, other]
Title: Transformers and Slot Encoding for Sample Efficient Physical World Modelling
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[923]  arXiv:2405.20174 [pdf, other]
Title: Tropical Expressivity of Neural Networks
Subjects: Machine Learning (cs.LG); Algebraic Geometry (math.AG)
[924]  arXiv:2405.20114 [pdf, other]
Title: Near Optimal Decentralized Optimization with Compression and Momentum Tracking
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[925]  arXiv:2405.20085 [pdf, other]
Title: Soft Partitioning of Latent Space for Semantic Channel Equalization
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Multiagent Systems (cs.MA)
[926]  arXiv:2405.20082 [pdf, other]
Title: Segment, Shuffle, and Stitch: A Simple Mechanism for Improving Time-Series Representations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[927]  arXiv:2405.20051 [pdf, other]
Title: Threshold-Independent Fair Matching through Score Calibration
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[928]  arXiv:2405.20045 [pdf, other]
Title: Iterative Learning Control of Fast, Nonlinear, Oscillatory Dynamics (Preprint)
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS)
[929]  arXiv:2405.20042 [pdf, other]
Title: CycleFormer : TSP Solver Based on Language Modeling
Subjects: Machine Learning (cs.LG)
[930]  arXiv:2405.20029 [pdf, ps, other]
Title: A Random Forest-based Prediction Model for Turning Points in Antagonistic Event-Group Competitions
Authors: Zishuo Zhu
Subjects: Machine Learning (cs.LG)
[931]  arXiv:2405.20028 [pdf, ps, other]
Title: A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $Θ(T^{2/3})$ and its Application to Best-of-Both-Worlds
Comments: 31 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[932]  arXiv:2405.20014 [pdf, other]
Title: subMFL: Compatiple subModel Generation for Federated Learning in Device Heterogenous Environment
Comments: 12 pages, 7 figures, European Conference on Parallel Processing, pp. between 52 and 64, Springer, 2023
Subjects: Machine Learning (cs.LG)
[933]  arXiv:2405.20012 [pdf, other]
Title: FlexiDrop: Theoretical Insights and Practical Advances in Random Dropout Method on GNNs
Subjects: Machine Learning (cs.LG)
[934]  arXiv:2405.20003 [pdf, other]
Title: Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[935]  arXiv:2405.19978 [pdf, other]
Title: Domain Adaptation with Cauchy-Schwarz Divergence
Comments: Accepted by UAI-24
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[936]  arXiv:2405.19961 [pdf, other]
Title: Collective Variable Free Transition Path Sampling with Generative Flow Network
Comments: 9 pages, 5 figures, 2 tables
Subjects: Machine Learning (cs.LG)
[937]  arXiv:2405.19950 [pdf, other]
Title: MM-Lego: Modular Biomedical Multimodal Models with Minimal Fine-Tuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[938]  arXiv:2405.19933 [pdf, other]
Title: Learning Latent Graph Structures and their Uncertainty
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[939]  arXiv:2405.19928 [pdf, other]
Title: BAN: Detecting Backdoors Activated by Adversarial Neuron Noise
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[940]  arXiv:2405.19919 [pdf, other]
Title: Unraveling the Impact of Heterophilic Structures on Graph Positive-Unlabeled Learning
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[941]  arXiv:2405.19909 [pdf, other]
Title: Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Comments: ICML 2024, 19 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[942]  arXiv:2405.19902 [pdf, other]
Title: Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection
Comments: Accepted to CVPR 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[943]  arXiv:2405.19901 [pdf, other]
Title: Urban Air Pollution Forecasting: a Machine Learning Approach leveraging Satellite Observations and Meteorological Forecasts
Comments: 5 pages, 2 figures, submitted to IEEE MetroLivEnv 2024
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[944]  arXiv:2405.19893 [pdf, other]
Title: Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Comments: 12 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[945]  arXiv:2405.19888 [pdf, other]
Title: Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Comments: To appear on USENIX OSDI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[946]  arXiv:2405.19885 [pdf, other]
Title: Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[947]  arXiv:2405.19883 [pdf, other]
Title: From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[948]  arXiv:2405.19878 [pdf, other]
Title: Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models
Authors: Zeyu Fang, Tian Lan
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[949]  arXiv:2405.19870 [pdf, other]
Title: On Vessel Location Forecasting and the Effect of Federated Learning
Journal-ref: 2024 IEEE International Conference on Mobile Data Management (MDM), June 24 - June 27, 2024, Brussels, Belgium
Subjects: Machine Learning (cs.LG)
[950]  arXiv:2405.19864 [pdf, ps, other]
Title: Out-of-distribution Reject Option Method for Dataset Shift Problem in Early Disease Onset Prediction
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[951]  arXiv:2405.19836 [pdf, other]
Title: The Merit of River Network Topology for Neural Flood Forecasting
Comments: this https URL
Journal-ref: ICML 2024
Subjects: Machine Learning (cs.LG)
[952]  arXiv:2405.19823 [pdf, other]
Title: Joint Selective State Space Model and Detrending for Robust Time Series Anomaly Detection
Comments: Submitted to IEEE Signal Processing Letters
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[953]  arXiv:2405.19811 [pdf, ps, other]
Title: Approximate Global Convergence of Independent Learning in Multi-Agent Systems
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[954]  arXiv:2405.19807 [pdf, ps, other]
Title: MetaCURL: Non-stationary Concave Utility Reinforcement Learning
Authors: Bianca Marin Moreno (UGA, Thoth, EDF R&D, FiME Lab), Margaux Brégère (LPSM, EDF R&D), Pierre Gaillard (UGA, Thoth), Nadia Oudjane (EDF R&D, FiME Lab)
Subjects: Machine Learning (cs.LG); Probability (math.PR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[955]  arXiv:2405.19806 [pdf, other]
Title: Preference Alignment with Flow Matching
Subjects: Machine Learning (cs.LG)
[956]  arXiv:2405.19804 [pdf, ps, other]
Title: Exploring Key Factors for Long-Term Vessel Incident Risk Prediction
Subjects: Machine Learning (cs.LG)
[957]  arXiv:2405.19789 [pdf, other]
Title: Estimating before Debiasing: A Bayesian Approach to Detaching Prior Bias in Federated Semi-Supervised Learning
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[958]  arXiv:2405.19785 [pdf, other]
Title: Recurrent Deep Kernel Learning of Dynamical Systems
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[959]  arXiv:2405.19757 [pdf, other]
Title: Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[960]  arXiv:2405.19752 [pdf, ps, other]
Title: Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[961]  arXiv:2405.19747 [pdf, other]
Title: Understanding and mitigating difficulties in posterior predictive evaluation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[962]  arXiv:2405.19729 [pdf, other]
Title: Dynamic feature selection in medical predictive monitoring by reinforcement learning
Comments: preview version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[963]  arXiv:2405.19705 [pdf, ps, other]
Title: Universal Online Convex Optimization with $1$ Projection per Round
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[964]  arXiv:2405.19703 [pdf, other]
Title: Towards a Better Evaluation of Out-of-Domain Generalization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[965]  arXiv:2405.19690 [pdf, other]
Title: Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[966]  arXiv:2405.19679 [pdf, other]
Title: Efficient Trajectory Inference in Wasserstein Space Using Consecutive Averaging
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[967]  arXiv:2405.19673 [pdf, other]
Title: Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Comments: Under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[968]  arXiv:2405.19667 [pdf, other]
Title: Reconciling Model Multiplicity for Downstream Decision Making
Comments: 16 pages main body, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[969]  arXiv:2405.19661 [pdf, other]
Title: MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
Subjects: Machine Learning (cs.LG)
[970]  arXiv:2405.19653 [pdf, other]
Title: SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems
Comments: 17 pages. Under review
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Systems and Control (eess.SY)
[971]  arXiv:2405.19650 [pdf, other]
Title: Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
[972]  arXiv:2405.19649 [pdf, ps, other]
Title: Towards Deeper Understanding of PPR-based Embedding Approaches: A Topological Perspective
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[973]  arXiv:2405.19647 [pdf, other]
Title: FTS: A Framework to Find a Faithful TimeSieve
Subjects: Machine Learning (cs.LG)
[974]  arXiv:2405.19600 [pdf, ps, other]
Title: Do spectral cues matter in contrast-based graph self-supervised learning?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[975]  arXiv:2405.19597 [pdf, other]
Title: SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors
Comments: 17 pages, 5 figures, 14 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[976]  arXiv:2405.19592 [pdf, other]
Title: Why Larger Language Models Do In-context Learning Differently?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[977]  arXiv:2405.19590 [pdf, other]
Title: Weights Augmentation: it has never ever ever ever let her model down
Subjects: Machine Learning (cs.LG)
[978]  arXiv:2405.19559 [pdf, ps, other]
Title: Clustering Mixtures of Discrete Distributions: A Note on Mitra's Algorithm
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[979]  arXiv:2405.19550 [pdf, other]
Title: Stress-Testing Capability Elicitation With Password-Locked Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[980]  arXiv:2405.19548 [pdf, other]
Title: RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Comments: 25 pages, 19 figures
Subjects: Machine Learning (cs.LG)
[981]  arXiv:2405.19547 [pdf, other]
Title: CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning
Comments: This paper supercedes our previous VAS paper (arXiv:2402.02055)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[982]  arXiv:2405.19534 [pdf, other]
Title: Preference Learning Algorithms Do Not Learn Preference Rankings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[983]  arXiv:2405.19532 [pdf, other]
Title: Contrasting Multiple Representations with the Multi-Marginal Matching Gap
Comments: To be presented at ICML 2024
Subjects: Machine Learning (cs.LG)
[984]  arXiv:2405.19521 [pdf, other]
Title: Crowdsourcing with Difficulty: A Bayesian Rating Model for Heterogeneous Items
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[985]  arXiv:2405.19513 [pdf, other]
Title: Decentralized Optimization in Time-Varying Networks with Arbitrary Delays
Comments: arXiv admin note: text overlap with arXiv:2401.11344
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
[986]  arXiv:2405.19499 [pdf, other]
Title: Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, 2024 Learning
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC)
[987]  arXiv:2405.19471 [pdf, other]
Title: The Data Minimization Principle in Machine Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[988]  arXiv:2405.19466 [pdf, other]
Title: Posterior Sampling via Autoregressive Generation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[989]  arXiv:2405.19461 [pdf, other]
Title: Clustering-Based Validation Splits for Domain Generalisation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[990]  arXiv:2405.19454 [pdf, other]
Title: Deep Grokking: Would Deep Neural Networks Generalize Better?
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[991]  arXiv:2405.19440 [pdf, other]
Title: On the Convergence of Multi-objective Optimization under Generalized Smoothness
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[992]  arXiv:2405.19420 [pdf, other]
Title: Using Contrastive Learning with Generative Similarity to Learn Spaces that Capture Human Inductive Biases
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[993]  arXiv:2405.19414 [pdf, other]
Title: Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
Comments: 9 pages, 3 figures
Subjects: Machine Learning (cs.LG)
[994]  arXiv:2405.19376 [pdf, other]
Title: PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models
Comments: arXiv admin note: substantial text overlap with arXiv:2405.18627
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[995]  arXiv:2405.20343 (cross-list from cs.CV) [pdf, other]
Title: Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[996]  arXiv:2405.20324 (cross-list from cs.CV) [pdf, other]
Title: Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Comments: Accepted at CVPR 2024 as a Highlight. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[997]  arXiv:2405.20321 (cross-list from cs.RO) [pdf, other]
Title: Vision-based Manipulation from Single Human Video with Open-World Object Graphs
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[998]  arXiv:2405.20320 (cross-list from cs.CV) [pdf, other]
Title: Improving the Training of Rectified Flows
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[999]  arXiv:2405.20318 (cross-list from cs.CL) [pdf, other]
Title: CausalQuest: Collecting Natural Causal Questions for AI Agents
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1000]  arXiv:2405.20304 (cross-list from cs.CL) [pdf, other]
Title: Group Robust Preference Optimization in Reward-free RLHF
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[1001]  arXiv:2405.20289 (cross-list from cs.SD) [pdf, other]
Title: DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1002]  arXiv:2405.20274 (cross-list from cs.CL) [pdf, other]
Title: ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection
Comments: arXiv admin note: text overlap with arXiv:2309.13297
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1003]  arXiv:2405.20250 (cross-list from math.OC) [pdf, ps, other]
Title: Entropy annealing for policy mirror descent in continuous time and space
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Probability (math.PR)
[1004]  arXiv:2405.20247 (cross-list from cs.AI) [pdf, other]
Title: KerasCV and KerasNLP: Vision and Language Power-Ups
Comments: Submitted to Journal of Machine Learning Open Source Software
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Software Engineering (cs.SE)
[1005]  arXiv:2405.20245 (cross-list from cs.CL) [pdf, other]
Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use
Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[1006]  arXiv:2405.20237 (cross-list from quant-ph) [pdf, other]
Title: Training-efficient density quantum machine learning
Comments: 17 pages main text, 9 pages appendices. 9 figures
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1007]  arXiv:2405.20236 (cross-list from stat.ML) [pdf, other]
Title: Disentangling and Mitigating the Impact of Task Similarity for Continual Learning
Authors: Naoki Hiratani
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1008]  arXiv:2405.20230 (cross-list from cs.CV) [pdf, other]
Title: Feature Fusion for Improved Classification: Combining Dempster-Shafer Theory and Multiple CNN Architectures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1009]  arXiv:2405.20216 (cross-list from cs.CV) [pdf, other]
Title: Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback
Comments: 28 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1010]  arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]
Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1011]  arXiv:2405.20178 (cross-list from eess.SY) [pdf, other]
Title: Non-intrusive data-driven model order reduction for circuits based on Hammerstein architectures
Comments: 13 pages, 13 figures; submitted to IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[1012]  arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]
Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition
Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)
Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1013]  arXiv:2405.20165 (cross-list from stat.ML) [pdf, other]
Title: Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1014]  arXiv:2405.20139 (cross-list from cs.CL) [pdf, other]
Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1015]  arXiv:2405.20127 (cross-list from math.OC) [pdf, other]
Title: SPAM: Stochastic Proximal Point Method with Momentum Variance Reduction for Non-convex Cross-Device Federated Learning
Comments: The main part of the paper is around 9 pages. It contains the proposed algorithms, the main theoretical results and the experimental setting. The proofs of the main results and other technicalities are deferred to the Appendix
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[1016]  arXiv:2405.20124 (cross-list from stat.ML) [pdf, other]
Title: A Geometric Unification of Distributionally Robust Covariance Estimators: Shrinking the Spectrum by Inflating the Ambiguity Set
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[1017]  arXiv:2405.20094 (cross-list from math.NA) [pdf, other]
Title: Low-dimensional approximations of the conditional law of Volterra processes: a non-positive curvature approach
Comments: Main body: 25 Pages, Appendices 29 Pages, 14 Tables, 6 Figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Differential Geometry (math.DG); Computational Finance (q-fin.CP)
[1018]  arXiv:2405.20091 (cross-list from cs.CV) [pdf, other]
Title: Visual Attention Analysis in Online Learning
Comments: Accepted in CEDI 2024 (VII Congreso Espa\~nol de Inform\'atica), A Coru\~na, Spain
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1019]  arXiv:2405.20086 (cross-list from math.ST) [pdf, other]
Title: Analysis of a multi-target linear shrinkage covariance estimator
Authors: Benoit Oriol
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[1020]  arXiv:2405.20079 (cross-list from cs.CL) [pdf, other]
Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1021]  arXiv:2405.20071 (cross-list from physics.med-ph) [pdf, ps, other]
Title: A Staged Approach using Machine Learning and Uncertainty Quantification to Predict the Risk of Hip Fracture
Comments: 29 pages, 5 figures, 6 tables
Subjects: Medical Physics (physics.med-ph); Machine Learning (cs.LG)
[1022]  arXiv:2405.20053 (cross-list from cs.CL) [pdf, other]
Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1023]  arXiv:2405.20052 (cross-list from eess.SP) [pdf, other]
Title: Hardware-Efficient EMG Decoding for Next-Generation Hand Prostheses
Comments: \{copyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[1024]  arXiv:2405.20039 (cross-list from stat.ML) [pdf, other]
Title: Task-Agnostic Machine Learning-Assisted Inference
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[1025]  arXiv:2405.20018 (cross-list from cs.MA) [pdf, other]
Title: Safe Multi-agent Reinforcement Learning with Natural Language Constraints
Comments: 23 pages, 6 figures
Subjects: Multiagent Systems (cs.MA); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1026]  arXiv:2405.19995 (cross-list from stat.ML) [pdf, other]
Title: Symmetries in Overparametrized Neural Networks: A Mean-Field View
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[1027]  arXiv:2405.19988 (cross-list from cs.RO) [pdf, other]
Title: Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
Comments: 10 pages in the main text, 16 pages including references and supplementary materials. 4 figures and 3 tables in the main text, 1 table in supplementary materials
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1028]  arXiv:2405.19985 (cross-list from stat.ME) [pdf, other]
Title: Targeted Sequential Indirect Experiment Design
Subjects: Methodology (stat.ME); Machine Learning (cs.LG)
[1029]  arXiv:2405.19977 (cross-list from cs.DS) [pdf, other]
Title: Consistent Submodular Maximization
Comments: To appear at ICML 24
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1030]  arXiv:2405.19971 (cross-list from cs.CR) [pdf, other]
Title: GasTrace: Detecting Sandwich Attack Malicious Accounts in Ethereum
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[1031]  arXiv:2405.19967 (cross-list from cs.CL) [pdf, other]
Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1032]  arXiv:2405.19954 (cross-list from cs.CR) [pdf, other]
Title: GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1033]  arXiv:2405.19931 (cross-list from cs.CV) [pdf, other]
Title: Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks
Comments: Preprint. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1034]  arXiv:2405.19912 (cross-list from stat.ML) [pdf, other]
Title: Robust Kernel Hypothesis Testing under Data Corruption
Comments: 26 pages, 2 figures, 2 algorithms
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1035]  arXiv:2405.19889 (cross-list from eess.SP) [pdf, other]
Title: Deep Joint Semantic Coding and Beamforming for Near-Space Airship-Borne Massive MIMO Network
Comments: Major Revision by IEEE JSAC
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[1036]  arXiv:2405.19886 (cross-list from cs.NI) [pdf, other]
Title: Federated Learning with Multi-resolution Model Broadcast
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[1037]  arXiv:2405.19874 (cross-list from cs.CL) [pdf, other]
Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?
Comments: Preprint. Code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1038]  arXiv:2405.19805 (cross-list from cs.CC) [pdf, ps, other]
Title: Complexity of Deciding Injectivity and Surjectivity of ReLU Neural Networks
Comments: 17 pages
Subjects: Computational Complexity (cs.CC); Discrete Mathematics (cs.DM); Machine Learning (cs.LG)
[1039]  arXiv:2405.19787 (cross-list from cs.CL) [pdf, other]
Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[1040]  arXiv:2405.19784 (cross-list from cs.DB) [pdf, ps, other]
Title: PixelsDB: Serverless and Natural-Language-Aided Data Analytics with Flexible Service Levels and Prices
Comments: 4 pages, 3 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1041]  arXiv:2405.19783 (cross-list from cs.CV) [pdf, other]
Title: Instruction-Guided Visual Masking
Comments: preprint, 21 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[1042]  arXiv:2405.19779 (cross-list from cs.NE) [pdf, other]
Title: Automatic Graph Topology-Aware Transformer
Comments: This work has been submitted to the IEEE (Under Second Review). Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Neural and Evolutionary Computing (cs.NE); Graphics (cs.GR); Machine Learning (cs.LG)
[1043]  arXiv:2405.19760 (cross-list from stat.ML) [pdf, ps, other]
Title: Identifiability of a statistical model with two latent vectors: Importance of the dimensionality relation and application to graph embedding
Authors: Hiroaki Sasaki
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1044]  arXiv:2405.19732 (cross-list from cs.CV) [pdf, other]
Title: Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1045]  arXiv:2405.19730 (cross-list from cs.AI) [pdf, ps, other]
Title: Research on Foundation Model for Spatial Data Intelligence: China's 2024 White Paper on Strategic Development of Spatial Data Intelligence
Authors: Shaohua Wang (1), Xing Xie (2), Yong Li (3), Danhuai Guo (4), Zhi Cai (5), Yu Liu (6), Yang Yue (7), Xiao Pan (8), Feng Lu (9), Huayi Wu (10), Zhipeng Gui (10), Zhiming Ding (11), Bolong Zheng (12), Fuzheng Zhang (13), Tao Qin (2), Jingyuan Wang (14), Chuang Tao (15), Zhengchao Chen (1), Hao Lu (16), Jiayi Li (10), Hongyang Chen (17), Peng Yue (10), Wenhao Yu (18), Yao Yao (18), Leilei Sun (14), Yong Zhang (5), Longbiao Chen (19), Xiaoping Du (20), Xiang Li (21), Xueying Zhang (22), Kun Qin (10), Zhaoya Gong (6), Weihua Dong (23), Xiaofeng Meng (24) ((1) Aerospace Information Research Institute, Chinese Academy of Sciences,(2) Microsoft Research Asia, (3) Tsinghua University, (4) Beijing University of Chemical Technology, (5) Beijing University of Technology, (6) Peking University, (7) Shenzhen University, (8) Shijiazhuang Tiedao University, (9) Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, (10) Wuhan University, (11) Institute of Software, Chinese Academy of Sciences, (12) Huazhong University of Science and Technology, (13) Kuaishou Natural Language Processing Center and Audio Center, (14) Beijing University of Aeronautics and Astronautics, (15) Shanghai Figure Interesting Information Technology Co., Ltd., (16) SuperMap Software Co., Ltd., (17) Zhejiang Lab, (18) China University of Geosciences (Wuhan), (19) Xiamen University, (20) Key Laboratory of Digital Earth, Chinese Academy of Sciences, (21) East China Normal University, (22) Nanjing Normal University, (23) Beijing Normal University, (24) Renmin University of China)
Comments: in Chinese language
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1046]  arXiv:2405.19715 (cross-list from cs.CL) [pdf, other]
Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1047]  arXiv:2405.19704 (cross-list from stat.ML) [pdf, other]
Title: Enhancing Sufficient Dimension Reduction via Hellinger Correlation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[1048]  arXiv:2405.19697 (cross-list from math.OC) [pdf, other]
Title: Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Comments: 43 pages, 1 figure, 1 table
Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1049]  arXiv:2405.19683 (cross-list from cs.CR) [pdf, other]
Title: Breaking Indistinguishability with Transfer Learning: A First Look at SPECK32/64 Lightweight Block Ciphers
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[1050]  arXiv:2405.19681 (cross-list from stat.ML) [pdf, other]
Title: Bayesian Online Natural Gradient (BONG)
Comments: 41 pages, 11 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[1051]  arXiv:2405.19672 (cross-list from eess.IV) [pdf, other]
Title: CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1052]  arXiv:2405.19665 (cross-list from eess.SY) [pdf, ps, other]
Title: A novel fault localization with data refinement for hydroelectric units
Comments: 6pages,4 figures,Conference on Decision and Control(CDC) conference
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1053]  arXiv:2405.19648 (cross-list from cs.CL) [pdf, other]
Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1054]  arXiv:2405.19644 (cross-list from cs.CV) [pdf, other]
Title: EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos
Comments: Early accepted by MICCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1055]  arXiv:2405.19616 (cross-list from cs.AI) [pdf, other]
Title: Easy Problems That LLMs Get Wrong
Comments: AutogenAI Ltd. GitHub Repo: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1056]  arXiv:2405.19610 (cross-list from stat.ML) [pdf, other]
Title: Factor Augmented Tensor-on-Tensor Neural Networks
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[1057]  arXiv:2405.19586 (cross-list from cs.CV) [pdf, other]
Title: SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Comments: ICML 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1058]  arXiv:2405.19567 (cross-list from cs.AI) [pdf, other]
Title: Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding
Comments: Code available at: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1059]  arXiv:2405.19562 (cross-list from cs.CY) [pdf, other]
Title: Selective Explanations
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1060]  arXiv:2405.19553 (cross-list from math.ST) [pdf, ps, other]
Title: Convergence Bounds for Sequential Monte Carlo on Multimodal Distributions using Soft Decomposition
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[1061]  arXiv:2405.19544 (cross-list from cs.AI) [pdf, other]
Title: One-Shot Safety Alignment for Large Language Models via Optimal Dualization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[1062]  arXiv:2405.19542 (cross-list from eess.SP) [pdf, other]
Title: Anatomical Region Recognition and Real-time Bone Tracking Methods by Dynamically Decoding A-Mode Ultrasound Signals
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Robotics (cs.RO)
[1063]  arXiv:2405.19538 (cross-list from cs.CL) [pdf, other]
Title: CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats
Comments: 13 pages Updated title
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1064]  arXiv:2405.19531 (cross-list from cs.RO) [pdf, other]
Title: Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives
Comments: 8 pages, 10 figures
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[1065]  arXiv:2405.19518 (cross-list from physics.ao-ph) [pdf, other]
Title: Exploring the Potential of Hybrid Machine-Learning/Physics-Based Modeling for Atmospheric/Oceanic Prediction Beyond the Medium Range
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD)
[1066]  arXiv:2405.19516 (cross-list from eess.SP) [pdf, other]
Title: Enabling Visual Recognition at Radio Frequency
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1067]  arXiv:2405.19497 (cross-list from eess.AS) [pdf, other]
Title: Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data
Comments: Submitted to IWAENC 2024
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[1068]  arXiv:2405.19486 (cross-list from stat.ML) [pdf, other]
Title: Online Nonparametric Supervised Learning for Massive Data
Comments: 24 pages, 10 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1069]  arXiv:2405.19479 (cross-list from cs.CY) [pdf, other]
Title: Participation in the age of foundation models
Comments: 13 pages, 2 figures. Appeared at FAccT '24
Journal-ref: In The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24), June 3-6, 2024, Rio de Janeiro, Brazil. ACM, New York, NY, USA, 13 pages
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1070]  arXiv:2405.19463 (cross-list from stat.ML) [pdf, other]
Title: Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming Data
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Econometrics (econ.EM); Optimization and Control (math.OC)
[1071]  arXiv:2405.19452 (cross-list from cs.RO) [pdf, other]
Title: Gaitor: Learning a Unified Representation Across Gaits for Real-World Quadruped Locomotion
Comments: 10 pages, 8 figures, 2 tables
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[1072]  arXiv:2405.19398 (cross-list from hep-th) [pdf, other]
Title: Neural Scaling Laws From Large-N Field Theory: Solvable Model Beyond the Ridgeless Limit
Authors: Zhengkang Zhang
Comments: 51 pages, 3 figures
Subjects: High Energy Physics - Theory (hep-th); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); High Energy Physics - Phenomenology (hep-ph)
[1073]  arXiv:2405.19397 (cross-list from cond-mat.str-el) [pdf, other]
Title: Ground state phases of the two-dimension electron gas with a unified variational approach
Subjects: Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (cs.LG); Computational Physics (physics.comp-ph); Quantum Physics (quant-ph)
[1074]  arXiv:2405.19384 (cross-list from astro-ph.EP) [pdf, other]
Title: NeuralODEs for VLEO simulations: Introducing thermoNET for Thermosphere Modeling
Comments: Paper presented and published in the 29th ISSFD Conference, Darmstadt, Germany
Subjects: Earth and Planetary Astrophysics (astro-ph.EP); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Space Physics (physics.space-ph)
[1075]  arXiv:2405.19383 (cross-list from cs.SI) [pdf, other]
Title: Network Analytics for Anti-Money Laundering -- A Systematic Literature Review and Experimental Evaluation
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[1076]  arXiv:2405.19380 (cross-list from stat.ML) [pdf, other]
Title: Approximate Thompson Sampling for Learning Linear Quadratic Regulators with $O(\sqrt{T})$ Regret
Comments: 61 pages, 6 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1077]  arXiv:2405.19375 (cross-list from cs.SI) [pdf, other]
Title: Improving global awareness of linkset predictions using Cross-Attentive Modulation tokens
Comments: 17 pages, 2 figures, not published nor submitted yet
Subjects: Social and Information Networks (cs.SI); Machine Learning (cs.LG)
[1078]  arXiv:2405.19374 (cross-list from stat.ML) [pdf, ps, other]
Title: Optimal Multiclass U-Calibration Error and Beyond
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1079]  arXiv:2405.19373 (cross-list from eess.SP) [pdf, other]
Title: Multi-modal Mood Reader: Pre-trained Model Empowers Cross-Subject Emotion Recognition
Comments: Accepted by International Conference on Neural Computing for Advanced Applications, 2024
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[1080]  arXiv:2405.19363 (cross-list from eess.SP) [pdf, other]
Title: Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification
Comments: 20pages (14 pages main paper + 6 pages supplementary materials)
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1081]  arXiv:2405.19359 (cross-list from eess.SP) [pdf, other]
Title: Modally Reduced Representation Learning of Multi-Lead ECG Signals through Simultaneous Alignment and Reconstruction
Comments: Accepted as a Workshop Paper at TS4H@ICLR2024
Journal-ref: ICLR 2024 Workshop on Learning from Time Series For Health
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[1082]  arXiv:2405.19356 (cross-list from eess.SP) [pdf, other]
Title: An LSTM Feature Imitation Network for Hand Movement Recognition from sEMG Signals
Comments: This work has been submitted to RA-L, and under review
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[1083]  arXiv:2405.19351 (cross-list from eess.SP) [pdf, other]
Title: Resonate-and-Fire Spiking Neurons for Target Detection and Hand Gesture Recognition: A Hybrid Approach
Journal-ref: 2024 Smart Systems Integration Conference and Exhibition (SSI)
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[1084]  arXiv:2405.19349 (cross-list from eess.SP) [pdf, other]
Title: Beyond Isolated Frames: Enhancing Sensor-Based Human Activity Recognition through Intra- and Inter-Frame Attention
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1085]  arXiv:2405.19348 (cross-list from eess.SP) [pdf, other]
Title: NERULA: A Dual-Pathway Self-Supervised Learning Framework for Electrocardiogram Signal Analysis
Comments: Paper in review
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[1086]  arXiv:2405.19347 (cross-list from eess.SP) [pdf, other]
Title: Near-Field Spot Beamfocusing: A Correlation-Aware Transfer Learning Approach
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1087]  arXiv:2405.19346 (cross-list from eess.SP) [pdf, other]
Title: Subject-Adaptive Transfer Learning Using Resting State EEG Signals for Cross-Subject EEG Motor Imagery Classification
Comments: Early Accepted at MICCAI 2024
Subjects: Signal Processing (eess.SP); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1088]  arXiv:2405.19345 (cross-list from eess.SP) [pdf, other]
Title: Review of Deep Representation Learning Techniques for Brain-Computer Interfaces and Recommendations
Comments: Submitted to: Journal of Neural Engineering (JNE)
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[1089]  arXiv:2405.19342 (cross-list from cs.SD) [pdf, other]
Title: Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1090]  arXiv:2405.19340 (cross-list from eess.SP) [pdf, ps, other]
Title: Obtaining physical layer data of latest generation networks for investigating adversary attacks
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[ total of 1090 entries: 1-1000 | 237-1090 ]
[ showing 1000 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)