Machine Learning

Authors and titles for recent submissions, skipping first 638

Tue, 4 Jun 2024 (continued, showing last 105 of 302 entries)

[639]  arXiv:2406.01191 (cross-list from eess.IV) [pdf, other]
Title: S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography
Comments: This paper is submitted to 2024 IEEE International Conference on Cyborg and Bionic Systems, and still under review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[640]  arXiv:2406.01187 (cross-list from eess.IV) [pdf, other]
Title: Patch-Based Encoder-Decoder Architecture for Automatic Transmitted Light to Fluorescence Imaging Transition: Contribution to the LightMyCells Challenge
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[641]  arXiv:2406.01149 (cross-list from stat.ML) [pdf, ps, other]
Title: Agnostic Learning of Mixed Linear Regressions with EM and AM Algorithms
Comments: To appear in ICML 2024
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
[642]  arXiv:2406.01096 (cross-list from cs.CL) [pdf, ps, other]
Title: Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Journal-ref: International Journal of Innovative Science and Research Technology: Vol. 9 (2024): No. 5, 1499-1508
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[643]  arXiv:2406.01080 (cross-list from cs.CR) [pdf, other]
Title: No Vandalism: Privacy-Preserving and Byzantine-Robust Federated Learning
Subjects: Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[644]  arXiv:2406.01076 (cross-list from cs.CV) [pdf, other]
Title: Estimating Canopy Height at Scale
Comments: ICML Camera-Ready, 17 pages, 14 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[645]  arXiv:2406.01071 (cross-list from cs.CV) [pdf, other]
Title: Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[646]  arXiv:2406.01056 (cross-list from cs.CV) [pdf, other]
Title: Virtual avatar generation models as world navigators
Authors: Sai Mandava
Comments: 16 pages, 15 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Robotics (cs.RO)
[647]  arXiv:2406.01047 (cross-list from cs.DC) [pdf, other]
Title: An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[648]  arXiv:2406.01033 (cross-list from cs.CV) [pdf, ps, other]
Title: Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
Comments: 10 pages, 6 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[649]  arXiv:2406.01027 (cross-list from cs.DB) [pdf, other]
Title: PRICE: A Pretrained Model for Cross-Database Cardinality Estimation
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[650]  arXiv:2406.01018 (cross-list from eess.AS) [pdf, other]
Title: Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training
Comments: Under review
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[651]  arXiv:2406.00998 (cross-list from stat.ML) [pdf, other]
Title: Distributional Refinement Network: Distributional Forecasting via Deep Learning
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Risk Management (q-fin.RM); Methodology (stat.ME)
[652]  arXiv:2406.00973 (cross-list from cs.IR) [pdf, other]
Title: Cold-start Recommendation by Personalized Embedding Region Elicitation
Comments: Accepted at UAI 2024
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[653]  arXiv:2406.00956 (cross-list from cs.CV) [pdf, other]
Title: Improving Segment Anything on the Fly: Auxiliary Online Learning and Adaptive Fusion for Medical Image Segmentation
Comments: Project Link: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[654]  arXiv:2406.00920 (cross-list from stat.ML) [pdf, ps, other]
Title: Demystifying SGD with Doubly Stochastic Gradients
Comments: Accepted to ICML'24
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[655]  arXiv:2406.00918 (cross-list from cs.CR) [pdf, other]
Title: Assessing the Adversarial Security of Perceptual Hashing Algorithms
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[656]  arXiv:2406.00907 (cross-list from cs.CV) [pdf, other]
Title: DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery
Comments: 29 pages, 16 figures; MIDL 2024 - Medical Imaging with Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[657]  arXiv:2406.00901 (cross-list from cs.MM) [pdf, other]
Title: Robust Multi-Modal Speech In-Painting: A Sequence-to-Sequence Approach
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[658]  arXiv:2406.00879 (cross-list from quant-ph) [pdf, ps, other]
Title: Quantum Equilibrium Propagation: Gradient-Descent Training of Quantum Systems
Subjects: Quantum Physics (quant-ph); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[659]  arXiv:2406.00873 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Scaffold Splits Overestimate Virtual Screening Performance
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[660]  arXiv:2406.00856 (cross-list from cs.CV) [pdf, other]
Title: DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection
Comments: 6 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[661]  arXiv:2406.00853 (cross-list from stat.ML) [pdf, other]
Title: A Tutorial on Doubly Robust Learning for Causal Inference
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[662]  arXiv:2406.00843 (cross-list from quant-ph) [pdf, other]
Title: Diffusion-Inspired Quantum Noise Mitigation in Parameterized Quantum Circuits
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[663]  arXiv:2406.00832 (cross-list from cs.CL) [pdf, other]
Title: BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[664]  arXiv:2406.00823 (cross-list from stat.ML) [pdf, other]
Title: Lasso Bandit with Compatibility Condition on Optimal Arm
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[665]  arXiv:2406.00812 (cross-list from stat.ML) [pdf, other]
Title: Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[666]  arXiv:2406.00809 (cross-list from math.NA) [pdf, other]
Title: Graph Neural Preconditioners for Iterative Solutions of Sparse Linear Systems
Authors: Jie Chen
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[667]  arXiv:2406.00793 (cross-list from stat.ML) [pdf, other]
Title: Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective
Comments: Accepted at International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[668]  arXiv:2406.00778 (cross-list from stat.ML) [pdf, other]
Title: Bayesian Joint Additive Factor Models for Multiview Learning
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Computation (stat.CO); Methodology (stat.ME)
[669]  arXiv:2406.00755 (cross-list from cs.CL) [pdf, other]
Title: Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Comments: ACL Findings 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[670]  arXiv:2406.00750 (cross-list from cs.CV) [pdf, other]
Title: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models
Comments: project can be found in: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[671]  arXiv:2406.00741 (cross-list from cs.AI) [pdf, other]
Title: Learning to Play 7 Wonders Duel Without Human Supervision
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[672]  arXiv:2406.00735 (cross-list from q-bio.BM) [pdf, other]
Title: Full-Atom Peptide Design based on Multi-modal Flow Matching
Comments: ICML 2024
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[673]  arXiv:2406.00713 (cross-list from stat.ML) [pdf, other]
Title: Logistic Variational Bayes Revisited
Comments: Accepted at the 41st International Conference on Machine Learning
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[674]  arXiv:2406.00704 (cross-list from cs.CV) [pdf, other]
Title: An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites
Comments: 8 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[675]  arXiv:2406.00695 (cross-list from physics.flu-dyn) [pdf, other]
Title: Discovering an interpretable mathematical expression for a full wind-turbine wake with artificial intelligence enhanced symbolic regression
Subjects: Fluid Dynamics (physics.flu-dyn); Machine Learning (cs.LG); Symbolic Computation (cs.SC); Applications (stat.AP)
[676]  arXiv:2406.00685 (cross-list from cs.CV) [pdf, other]
Title: Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[677]  arXiv:2406.00667 (cross-list from eess.IV) [pdf, other]
Title: An Early Investigation into the Utility of Multimodal Large Language Models in Medical Imaging
Comments: Accepted in Fifth IEEE Workshop on Artificial Intelligence for HealthCare, IEEE 25th International Conference on Information Reuse and Integration for Data Science
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[678]  arXiv:2406.00663 (cross-list from cs.CV) [pdf, other]
Title: SimSAM: Zero-shot Medical Image Segmentation via Simulated Interaction
Comments: Published at ISBI 2024. Awarded Top 12 Oral Presentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[679]  arXiv:2406.00630 (cross-list from stat.ML) [pdf, other]
Title: On Non-asymptotic Theory of Recurrent Neural Networks in Temporal Point Processes
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[680]  arXiv:2406.00628 (cross-list from cs.CL) [pdf, other]
Title: Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models
Comments: A preprint, 17 pages, 11 images
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computers and Society (cs.CY); Machine Learning (cs.LG)
[681]  arXiv:2406.00615 (cross-list from cs.IR) [pdf, other]
Title: Making Recommender Systems More Knowledgeable: A Framework to Incorporate Side Information
Comments: 15 pages, 8 figures
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[682]  arXiv:2406.00532 (cross-list from cs.AI) [pdf, other]
Title: Breast Cancer Diagnosis: A Comprehensive Exploration of Explainable Artificial Intelligence (XAI) Techniques
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[683]  arXiv:2406.00518 (cross-list from cs.RO) [pdf, other]
Title: Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning
Authors: Andrej Orsula
Comments: Robot Air Hockey Challenge 2023 | The source code is available at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[684]  arXiv:2406.00502 (cross-list from math.OC) [pdf, other]
Title: Non-geodesically-convex optimization in the Wasserstein space
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[685]  arXiv:2406.00501 (cross-list from cs.CV) [pdf, other]
Title: Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection
Comments: Accepted at the 19th International Conference on Computer Vision Theory and Applications (VISAPP 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[686]  arXiv:2406.00492 (cross-list from eess.IV) [pdf, other]
Title: SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[687]  arXiv:2406.00447 (cross-list from cs.CV) [pdf, other]
Title: DroneVis: Versatile Computer Vision Library for Drones
Comments: 23 pages, 15 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Robotics (cs.RO)
[688]  arXiv:2406.00441 (cross-list from physics.chem-ph) [pdf, other]
Title: Neural Polarization: Toward Electron Density for Molecules by Extending Equivariant Networks
Subjects: Chemical Physics (physics.chem-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[689]  arXiv:2406.00424 (cross-list from stat.ML) [pdf, other]
Title: A Batch Sequential Halving Algorithm without Performance Degradation
Comments: Accepted to RLC 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[690]  arXiv:2406.00423 (cross-list from cs.CV) [pdf, other]
Title: Multimodal Metadata Assignment for Cultural Heritage Artifacts
Journal-ref: Multimedia Systems 29 (2023) 847-869
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[691]  arXiv:2406.00416 (cross-list from stat.ML) [pdf, other]
Title: Representation and De-interleaving of Mixtures of Hidden Markov Processes
Comments: 13 pages, 9 figures, submitted to IEEE transactions on Signal Processing
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP)
[692]  arXiv:2406.00409 (cross-list from cs.CV) [pdf, other]
Title: Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach
Comments: 6 pages, 11 figures, 4 tables, International IEEE Conference on the Intelligent Methods, Systems, and Applications (IMSA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Neural and Evolutionary Computing (cs.NE)
[693]  arXiv:2406.00389 (cross-list from cs.NE) [pdf, other]
Title: Understanding the Convergence in Balanced Resonate-and-Fire Neurons
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[694]  arXiv:2406.00345 (cross-list from cs.CV) [pdf, other]
Title: DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection
Comments: Accepted by ICML 2024. Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[695]  arXiv:2406.00339 (cross-list from cs.DS) [pdf, other]
Title: Turnstile $\ell_p$ leverage score sampling with applications
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[696]  arXiv:2406.00329 (cross-list from eess.IV) [pdf, other]
Title: Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[697]  arXiv:2406.00328 (cross-list from cs.DS) [pdf, other]
Title: Optimal bounds for $\ell_p$ sensitivity sampling via $\ell_2$ augmentation
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[698]  arXiv:2406.00317 (cross-list from stat.ML) [pdf, other]
Title: Combining Experimental and Historical Data for Policy Evaluation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[699]  arXiv:2406.00314 (cross-list from cs.CL) [pdf, other]
Title: CASE: Curricular Data Pre-training for Building Generative and Discriminative Assistive Psychology Expert Models
Comments: 19 pages (single column), 5 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[700]  arXiv:2406.00294 (cross-list from cs.SD) [pdf, other]
Title: Creative Text-to-Audio Generation via Synthesizer Programming
Comments: Accepted to ICML 2024
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[701]  arXiv:2406.00290 (cross-list from cs.CV) [pdf, other]
Title: Phasor-Driven Acceleration for FFT-based CNNs
Comments: Presented in the 21st Conference on Robots and Vision (CRV 2024) Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[702]  arXiv:2406.00275 (cross-list from cs.CV) [pdf, other]
Title: StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization
Comments: Accepted at ICML 2024; Work in 2022 spring
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[703]  arXiv:2406.00239 (cross-list from cs.CV) [pdf, other]
Title: A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing
Comments: The 25th International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV 2021)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[704]  arXiv:2406.00238 (cross-list from cs.GR) [pdf, other]
Title: Robust Biharmonic Skinning Using Geometric Fields
Subjects: Graphics (cs.GR); Machine Learning (cs.LG)
[705]  arXiv:2406.00237 (cross-list from eess.IV) [pdf, other]
Title: A Comparative Study of CNN, ResNet, and Vision Transformers for Multi-Classification of Chest Diseases
Comments: 8 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[706]  arXiv:2406.00222 (cross-list from cs.CL) [pdf, other]
Title: Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[707]  arXiv:2406.00198 (cross-list from cs.IR) [pdf, other]
Title: ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering
Comments: Published as a conference paper at ICLR 2024; authors' version
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[708]  arXiv:2406.00192 (cross-list from eess.IV) [pdf, other]
Title: Direct Cardiac Segmentation from Undersampled K-space Using Transformers
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[709]  arXiv:2406.00183 (cross-list from physics.chem-ph) [pdf, other]
Title: Predicting solvation free energies with an implicit solvent machine learning potential
Subjects: Chemical Physics (physics.chem-ph); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[710]  arXiv:2406.00147 (cross-list from cs.GT) [pdf, other]
Title: Fair Allocation in Dynamic Mechanism Design
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Theoretical Economics (econ.TH)
[711]  arXiv:2406.00146 (cross-list from cs.SD) [pdf, other]
Title: A Survey of Deep Learning Audio Generation Methods
Comments: 14 pages, 2 figures
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[712]  arXiv:2406.00135 (cross-list from cs.CV) [pdf, other]
Title: Advancing Ear Biometrics: Enhancing Accuracy and Robustness through Deep Learning
Comments: 6 pages, 8 figures, 3 tables, International IEEE Conference on the Intelligent Methods, Systems, and Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Multimedia (cs.MM)
[713]  arXiv:2406.00127 (cross-list from stat.ML) [pdf, ps, other]
Title: Training on the Edge of Stability Is Caused by Layerwise Jacobian Alignment
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[714]  arXiv:2406.00125 (cross-list from eess.IV) [pdf, ps, other]
Title: TotalVibeSegmentator: Full Torso Segmentation for the NAKO and UK Biobank in Volumetric Interpolated Breath-hold Examination Body Images
Comments: this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[715]  arXiv:2406.00116 (cross-list from cs.HC) [pdf, other]
Title: A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[716]  arXiv:2406.00093 (cross-list from cs.CV) [pdf, other]
Title: Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[717]  arXiv:2406.00092 (cross-list from cs.AI) [pdf, other]
Title: How Random is Random? Evaluating the Randomness and Humaness of LLMs' Coin Flips
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[718]  arXiv:2406.00085 (cross-list from eess.IV) [pdf, other]
Title: Augmentation-based Unsupervised Cross-Domain Functional MRI Adaptation for Major Depressive Disorder Identification
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[719]  arXiv:2406.00083 (cross-list from cs.CR) [pdf, other]
Title: BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[720]  arXiv:2406.00071 (cross-list from astro-ph.IM) [pdf, ps, other]
Title: Optimizing Photometric Light Curve Analysis: Evaluating Scipy's Minimize Function for Eclipse Mapping of Cataclysmic Variables
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Machine Learning (cs.LG)
[721]  arXiv:2406.00069 (cross-list from cs.CL) [pdf, other]
Title: Confidence-Aware Sub-Structure Beam Search (CABS): Mitigating Hallucination in Structured Data Generation with Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[722]  arXiv:2406.00062 (cross-list from cs.CL) [pdf, other]
Title: Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[723]  arXiv:2406.00060 (cross-list from cs.CL) [pdf, other]
Title: Cascade-Aware Training of Language Models
Comments: 22 pages, 13 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[724]  arXiv:2406.00059 (cross-list from cs.CL) [pdf, other]
Title: Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution
Comments: 11 pages, 8 figures
Subjects: Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[725]  arXiv:2406.00057 (cross-list from cs.CL) [pdf, other]
Title: Toward Conversational Agents with Context and Time Sensitive Long-term Memory
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[726]  arXiv:2406.00054 (cross-list from cs.GT) [pdf, ps, other]
Title: $ε$-Optimally Solving Zero-Sum POSGs
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[727]  arXiv:2406.00053 (cross-list from cs.CL) [pdf, other]
Title: Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Comments: 9 pages, 5 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[728]  arXiv:2406.00049 (cross-list from cs.CL) [pdf, other]
Title: QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[729]  arXiv:2406.00048 (cross-list from cs.CL) [pdf, other]
Title: Towards a theory of how the structure of language is acquired by deep neural networks
Comments: 9 pages, 4 figures (main)
Subjects: Computation and Language (cs.CL); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[730]  arXiv:2406.00047 (cross-list from physics.chem-ph) [pdf, ps, other]
Title: A Theoretical Framework for an Efficient Normalizing Flow-Based Solution to the Schrodinger Equation
Subjects: Chemical Physics (physics.chem-ph); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[731]  arXiv:2406.00046 (cross-list from cs.CL) [pdf, other]
Title: Hate Speech Detection with Generalizable Target-aware Fairness
Comments: To appear in KDD 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[732]  arXiv:2406.00045 (cross-list from cs.CL) [pdf, other]
Title: Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[733]  arXiv:2406.00044 (cross-list from cs.CL) [pdf, other]
Title: Stochastic Adversarial Networks for Multi-Domain Text Classification
Authors: Xu Wang, Yuan Wu
Comments: Technical report
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[734]  arXiv:2406.00036 (cross-list from cs.CL) [pdf, other]
Title: EMERGE: Integrating RAG for Improved Multimodal EHR Predictive Modeling
Comments: arXiv admin note: text overlap with arXiv:2402.07016
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[735]  arXiv:2406.00031 (cross-list from cs.CL) [pdf, other]
Title: AMGPT: a Large Language Model for Contextual Querying in Additive Manufacturing
Comments: 54 pages, 4 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[736]  arXiv:2406.00030 (cross-list from cs.CL) [pdf, other]
Title: Large Language Model Pruning
Authors: Hanjuan Huang (1) (2), Hao-Jia Song (1), Hsing-Kuo Pao (1) ((1) Dept. of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taipei, Taiwan, (2) College of Mechanical and Electrical Engineering, WUYI University, Wuyishan, China)
Comments: 17 pages, 7 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[737]  arXiv:2406.00028 (cross-list from cs.CL) [pdf, ps, other]
Title: Persian Homograph Disambiguation: Leveraging ParsBERT for Enhanced Sentence Understanding with a Novel Word Disambiguation Dataset
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[738]  arXiv:2406.00027 (cross-list from cs.CL) [pdf, other]
Title: Adapting PromptORE for Modern History: Information Extraction from Hispanic Monarchy Documents of the XVIth Century
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[739]  arXiv:2406.00024 (cross-list from cs.CL) [pdf, other]
Title: Embedding-Aligned Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
[740]  arXiv:2406.00013 (cross-list from cs.IR) [pdf, ps, other]
Title: Thesis: Document Summarization with applications to Keyword extraction and Image Retrieval
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[741]  arXiv:2406.00004 (cross-list from cs.IR) [pdf, other]
Title: Navigating the Future of Federated Recommendation Systems with Foundation Models
Comments: 20 pages, position paper
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[742]  arXiv:2406.00001 (cross-list from cs.RO) [pdf, other]
Title: PhyPlan: Generalizable and Rapid Physical Task Planning with Physics Informed Skill Networks for Robot Manipulators
Comments: arXiv admin note: substantial text overlap with arXiv:2402.15767
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[743]  arXiv:2405.19815 (cross-list from cs.AI) [pdf, other]
Title: Efficient Stimuli Generation using Reinforcement Learning in Design Verification
Comments: Accepted for publication at the 20th International Conference on Synthesis, Modeling, Analysis and Simulation Methods, and Applications to Circuit Design (SMACD'24), Jul 2-5 2024, Volos, Greece
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Mon, 3 Jun 2024

[744]  arXiv:2405.21064 [pdf, other]
Title: Recurrent neural networks: vanishing and exploding gradients are not the end of the story
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[745]  arXiv:2405.21063 [pdf, other]
Title: Neural Network Verification with Branch-and-Bound for General Nonlinearities
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[746]  arXiv:2405.21061 [pdf, other]
Title: Graph External Attention Enhanced Transformer
Comments: In Proceedings of ICML 2024
Subjects: Machine Learning (cs.LG)
[747]  arXiv:2405.21060 [pdf, other]
Title: Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
Authors: Tri Dao, Albert Gu
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[748]  arXiv:2405.21046 [pdf, other]
Title: Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[749]  arXiv:2405.21045 [pdf, ps, other]
Title: An Attention-Based Multi-Context Convolutional Encoder-Decoder Neural Network for Work Zone Traffic Impact Prediction
Subjects: Machine Learning (cs.LG)
[750]  arXiv:2405.21043 [pdf, other]
Title: Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[751]  arXiv:2405.21042 [pdf, other]
Title: Comparing information content of representation spaces for disentanglement with VAE ensembles
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG)
[752]  arXiv:2405.21036 [pdf, ps, other]
Title: A-PETE: Adaptive Prototype Explanations of Tree Ensembles
Subjects: Machine Learning (cs.LG)
[753]  arXiv:2405.21021 [pdf, other]
Title: Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Dynamical Systems (math.DS)
[754]  arXiv:2405.21018 [pdf, other]
Title: Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[755]  arXiv:2405.21012 [pdf, other]
Title: G-Transformer for Conditional Average Potential Outcome Estimation over Time
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[756]  arXiv:2405.21003 [pdf, other]
Title: Explaining Predictions by Characteristic Rules
Comments: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022
Journal-ref: In: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science, vol 13713. Springer, Cham (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[757]  arXiv:2405.20988 [pdf, other]
Title: Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[758]  arXiv:2405.20986 [pdf, other]
Title: Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[759]  arXiv:2405.20984 [pdf, other]
Title: Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Comments: Forty-first International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG)
[760]  arXiv:2405.20973 [pdf, other]
Title: LCQ: Low-Rank Codebook based Quantization for Large Language Models
Authors: Wen-Pu Cai, Wu-Jun Li
Comments: 10 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[761]  arXiv:2405.20971 [pdf, other]
Title: Amortizing intractable inference in diffusion models for vision, language, and control
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[762]  arXiv:2405.20954 [pdf, other]
Title: Aligning Multiclass Neural Network Classifier Criterion with Task Performance via $F_β$-Score
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[763]  arXiv:2405.20935 [pdf, other]
Title: Effective Interplay between Sparsity and Quantization: From Theory to Practice
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[764]  arXiv:2405.20933 [pdf, ps, other]
Title: Concentration Bounds for Optimized Certainty Equivalent Risk Estimation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[765]  arXiv:2405.20915 [pdf, other]
Title: Fast yet Safe: Early-Exiting with Risk Control
Comments: 25 pages, 11 figures, 4 tables (incl. appendix)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[766]  arXiv:2405.20905 [pdf, other]
Title: VENI, VINDy, VICI: a variational reduced-order modeling framework with uncertainty quantification
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Dynamical Systems (math.DS)
[767]  arXiv:2405.20882 [pdf, other]
Title: Sheaf HyperNetworks for Personalized Federated Learning
Comments: 25 pages, 12 figures, 7 tables, pre-print under review
Subjects: Machine Learning (cs.LG)
[768]  arXiv:2405.20879 [pdf, other]
Title: Flow matching achieves minimax optimal convergence
Subjects: Machine Learning (cs.LG)
[769]  arXiv:2405.20860 [pdf, other]
Title: Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Subjects: Machine Learning (cs.LG)
[770]  arXiv:2405.20838 [pdf, other]
Title: einspace: Searching for Neural Architectures from Fundamental Operations
Comments: Project page at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[771]  arXiv:2405.20835 [pdf, other]
Title: Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[772]  arXiv:2405.20824 [pdf, ps, other]
Title: Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[773]  arXiv:2405.20821 [pdf, other]
Title: Pursuing Overall Welfare in Federated Learning through Sequential Decision Making
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[774]  arXiv:2405.20800 [pdf, other]
Title: Shape Constraints in Symbolic Regression using Penalized Least Squares
Subjects: Machine Learning (cs.LG); Symbolic Computation (cs.SC)
[775]  arXiv:2405.20794 [pdf, ps, other]
Title: Model Interpretation and Explainability: Towards Creating Transparency in Prediction Models
Subjects: Machine Learning (cs.LG)
[776]  arXiv:2405.20790 [pdf, other]
Title: Intersectional Unfairness Discovery
Comments: ICML-2024 camera-ready
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[777]  arXiv:2405.20772 [pdf, ps, other]
Title: Reinforcement Learning for Sociohydrology
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[778]  arXiv:2405.20763 [pdf, other]
Title: Improving Generalization and Convergence by Enhancing Implicit Regularization
Comments: 35 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[779]  arXiv:2405.20761 [pdf, other]
Title: Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning
Comments: Submitted to the 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2024)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[780]  arXiv:2405.20759 [pdf, other]
Title: Information Theoretic Text-to-Image Alignment
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[781]  arXiv:2405.20738 [pdf, other]
Title: Federated Random Forest for Partially Overlapping Clinical Data
Subjects: Machine Learning (cs.LG)
[782]  arXiv:2405.20724 [pdf, other]
Title: Learning on Large Graphs using Intersecting Communities
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[783]  arXiv:2405.20692 [pdf, other]
Title: In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[784]  arXiv:2405.20690 [pdf, other]
Title: Unleashing the Potential of Diffusion Models for Incomplete Data Imputation
Subjects: Machine Learning (cs.LG)
[785]  arXiv:2405.20685 [pdf, other]
Title: Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[786]  arXiv:2405.20678 [pdf, ps, other]
Title: No-Regret Learning for Fair Multi-Agent Social Welfare Optimization
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[787]  arXiv:2405.20677 [pdf, other]
Title: Provably Efficient Interactive-Grounded Learning with Personalized Reward
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[788]  arXiv:2405.20671 [pdf, other]
Title: Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers
Comments: 73 pages, 20 figures, 90 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[789]  arXiv:2405.20664 [pdf, other]
Title: Weak Robust Compatibility Between Learning Algorithms and Counterfactual Explanation Generation Algorithms
Authors: Ao Xu, Tieru Wu
Subjects: Machine Learning (cs.LG)
[790]  arXiv:2405.20652 [pdf, other]
Title: Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs
Comments: Published as a conference paper at ICML 2024
Subjects: Machine Learning (cs.LG)
[791]  arXiv:2405.20642 [pdf, other]
Title: Principal-Agent Multitasking: the Uniformity of Optimal Contracts and its Efficient Learning via Instrumental Regression
Authors: Shiliang Zuo
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[792]  arXiv:2405.20640 [pdf, other]
Title: Heterophilous Distribution Propagation for Graph Neural Networks
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[793]  arXiv:2405.20630 [pdf, other]
Title: Stochastic Optimal Control for Diffusion Bridges in Function Spaces
Subjects: Machine Learning (cs.LG)
[794]  arXiv:2405.20623 [pdf, other]
Title: Prune at the Clients, Not the Server: Accelerated Sparse Training in Federated Learning
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[795]  arXiv:2405.20622 [pdf, other]
Title: Superfast Selection for Decision Tree Algorithms
Subjects: Machine Learning (cs.LG)
[796]  arXiv:2405.20620 [pdf, other]
Title: "Forgetting" in Machine Learning and Beyond: A Survey
Subjects: Machine Learning (cs.LG)
[797]  arXiv:2405.20605 [pdf, other]
Title: Searching for internal symbols underlying deep learning
Comments: 10 pages, 7 figures, 3 tables and Appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[798]  arXiv:2405.20603 [pdf, ps, other]
Title: Advancing Financial Risk Prediction Through Optimized LSTM Model Performance and Comparative Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[799]  arXiv:2405.20602 [pdf, other]
Title: Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[800]  arXiv:2405.20594 [pdf, other]
Title: Deep Learning without Weight Symmetry
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[801]  arXiv:2405.20592 [pdf, other]
Title: LInK: Learning Joint Representations of Design and Performance Spaces through Contrastive Learning for Mechanism Synthesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[802]  arXiv:2405.20590 [pdf, other]
Title: Class-Based Time Series Data Augmentation to Mitigate Extreme Class Imbalance for Solar Flare Prediction
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Artificial Intelligence (cs.AI)
[803]  arXiv:2405.20589 [pdf, other]
Title: Selective Knowledge Sharing for Personalized Federated Learning Under Capacity Heterogeneity
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[804]  arXiv:2405.20573 [pdf, other]
Title: Enhancing Generative Molecular Design via Uncertainty-guided Fine-tuning of Variational Autoencoders
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
[805]  arXiv:2405.20568 [pdf, other]
Title: Generative AI for Deep Reinforcement Learning: Framework, Analysis, and Use Cases
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[806]  arXiv:2405.20562 [pdf, other]
Title: Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A feasibility study
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[807]  arXiv:2405.20556 [pdf, other]
Title: Certifying Global Robustness for Deep Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[808]  arXiv:2405.20555 [pdf, other]
Title: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[809]  arXiv:2405.20550 [pdf, ps, other]
Title: Uncertainty Quantification for Deep Learning
Comments: 25 pages, 4 figures, submitted to Environmental Data Science
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[810]  arXiv:2405.20543 [pdf, other]
Title: Towards a General GNN Framework for Combinatorial Optimization
Comments: 15 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM)
[811]  arXiv:2405.20542 [pdf, ps, other]
Title: On the Connection Between Non-negative Matrix Factorization and Latent Dirichlet Allocation
Comments: 9 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[812]  arXiv:2405.20541 [pdf, other]
Title: Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[813]  arXiv:2405.20540 [pdf, ps, other]
Title: Fully Unconstrained Online Learning
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[814]  arXiv:2405.20539 [pdf, other]
Title: SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents
Comments: 23 pages, 14 figures, NeurIPS
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[815]  arXiv:2405.20538 [pdf, other]
Title: Q-learning as a monotone scheme
Authors: Lingyi Yang
Subjects: Machine Learning (cs.LG)
[816]  arXiv:2405.20534 [pdf, other]
Title: Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[817]  arXiv:2405.20531 [pdf, ps, other]
Title: Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation
Subjects: Machine Learning (cs.LG)
[818]  arXiv:2405.20516 [pdf, other]
Title: WaveCastNet: An AI-enabled Wavefield Forecasting Framework for Earthquake Early Warning
Subjects: Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[819]  arXiv:2405.20513 [pdf, other]
Title: Deep Modeling of Non-Gaussian Aleatoric Uncertainty
Comments: 8 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[820]  arXiv:2405.20504 [pdf, other]
Title: FCOM: A Federated Collaborative Online Monitoring Framework via Representation Learning
Subjects: Machine Learning (cs.LG)
[821]  arXiv:2405.20503 [pdf, ps, other]
Title: Optimizing cnn-Bigru performance: Mish activation and comparative analysis with Relu
Journal-ref: International Journal of Computer Networks & Communications (IJCNC) Vol.16, No.3, May 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[822]  arXiv:2405.20486 [pdf, other]
Title: Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning
Comments: Submitted to JMLR on 5/30/2024
Subjects: Machine Learning (cs.LG)
[823]  arXiv:2405.20482 [pdf, other]
Title: Leveraging Structure Between Environments: Phylogenetic Regularization Incentivizes Disentangled Representations
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[824]  arXiv:2405.20467 [pdf, ps, other]
Title: Performance of NPG in Countable State-Space Average-Cost RL
Comments: 23 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[825]  arXiv:2405.20456 [pdf, other]
Title: Scaling Laws for the Value of Individual Data Points in Machine Learning
Comments: ICML 2024 camera-ready
Subjects: Machine Learning (cs.LG)
[826]  arXiv:2405.20452 [pdf, other]
Title: Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
[827]  arXiv:2405.20448 [pdf, other]
Title: Knockout: A simple way to handle missing inputs
Subjects: Machine Learning (cs.LG)
[828]  arXiv:2405.20445 [pdf, other]
Title: GraphAny: A Foundation Model for Node Classification on Any Graph
Comments: Preprint. Work in progress
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[829]  arXiv:2405.20439 [pdf, other]
Title: Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Comments: 25 pages, 10 figures, 2 tables
Subjects: Machine Learning (cs.LG)
[830]  arXiv:2405.20435 [pdf, other]
Title: Deep Learning for Computing Convergence Rates of Markov Chains
Subjects: Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[831]  arXiv:2405.20431 [pdf, other]
Title: Exploring the Practicality of Federated Learning: A Survey Towards the Communication Perspective
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[832]  arXiv:2405.20430 [pdf, other]
Title: Enhancing Performance for Highly Imbalanced Medical Data via Data Regularization in a Federated Learning Setting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[833]  arXiv:2405.20420 [pdf, other]
Title: Back to the Basics on Predicting Transfer Performance
Comments: 15 pages, 3 figures, 2 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[834]  arXiv:2405.20419 [pdf, other]
Title: Enhancing Antibiotic Stewardship using a Natural Language Approach for Better Feature Representation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[835]  arXiv:2405.20414 [pdf, ps, other]
Title: The Impact of Ontology on the Prediction of Cardiovascular Disease Compared to Machine Learning Algorithms
Journal-ref: International Journal of Online and Biomedical Engineering, Volume 18, Issue 11, 2022, Pages 143-157
Subjects: Machine Learning (cs.LG)
[836]  arXiv:2405.20397 [pdf, other]
Title: Explainable Data-driven Modeling of Adsorption Energy in Heterogeneous Catalysis
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[837]  arXiv:2405.20390 [pdf, other]
Title: Quantitative Convergences of Lie Group Momentum Optimizers
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC); Machine Learning (stat.ML)
[838]  arXiv:2405.20358 [pdf, other]
Title: Medication Recommendation via Dual Molecular Modalities and Multi-Substructure Distillation
Comments: 14 pages, 9 figures
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[839]  arXiv:2405.20351 [pdf, other]
Title: ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[840]  arXiv:2405.20350 [pdf, other]
Title: Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges
Authors: Hari Srikanth
Subjects: Machine Learning (cs.LG)
[841]  arXiv:2405.21070 (cross-list from cs.CV) [pdf, other]
Title: Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[842]  arXiv:2405.21050 (cross-list from cs.CV) [pdf, other]
Title: Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[843]  arXiv:2405.21047 (cross-list from cs.AI) [pdf, other]
Title: Grammar-Aligned Decoding
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[844]  arXiv:2405.21027 (cross-list from cs.GT) [pdf, other]
Title: Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Comments: 20 pages, 5 figures
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[845]  arXiv:2405.20993 (cross-list from cs.IT) [pdf, other]
Title: Information limits and Thouless-Anderson-Palmer equations for spiked matrix models with structured noise
Subjects: Information Theory (cs.IT); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Statistics Theory (math.ST)
[846]  arXiv:2405.20991 (cross-list from cs.CV) [pdf, other]
Title: Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models
Comments: IEEE Intelligent Vehicles Symposium (IV) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[847]  arXiv:2405.20990 (cross-list from cs.CR) [pdf, other]
Title: Locking Machine Learning Models into Hardware
Comments: 10 pages, 2 figures of main text; 14 pages, 16 figures of appendices
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[848]  arXiv:2405.20987 (cross-list from cs.CV) [pdf, other]
Title: Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging
Comments: This paper is accepted at the 35th IEEE Irish Signals and Systems Conference (ISSC 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[849]  arXiv:2405.20980 (cross-list from cs.CV) [pdf, other]
Title: Neural Gaussian Scale-Space Fields
Comments: 15 pages; SIGGRAPH 2024; project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[850]  arXiv:2405.20975 (cross-list from cs.CR) [pdf, other]
Title: ACE: A Model Poisoning Attack on Contribution Evaluation Methods in Federated Learning
Comments: To appear in the 33rd USENIX Security Symposium, 2024
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[851]  arXiv:2405.20974 (cross-list from cs.CL) [pdf, other]
Title: SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Comments: The code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[852]  arXiv:2405.20970 (cross-list from stat.ML) [pdf, other]
Title: PUAL: A Classifier on Trifurcate Positive-Unlabeled Data
Comments: 24 pages, 6 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[853]  arXiv:2405.20917 (cross-list from cs.CL) [pdf, other]
Title: Learning to Estimate System Specifications in Linear Temporal Logic using Transformers and Mamba
Comments: 20 pages, 15 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[854]  arXiv:2405.20887 (cross-list from cs.SD) [pdf, other]
Title: On the Condition Monitoring of Bolted Joints through Acoustic Emission and Deep Transfer Learning: Generalization, Ordinal Loss and Super-Convergence
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[855]  arXiv:2405.20877 (cross-list from cs.IT) [pdf, other]
Title: Waveform Design for Over-the-Air Computing
Comments: 14 pages
Subjects: Information Theory (cs.IT); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Signal Processing (eess.SP); Statistics Theory (math.ST)
[856]  arXiv:2405.20848 (cross-list from cs.SE) [pdf, other]
Title: SLIM: a Scalable Light-weight Root Cause Analysis for Imbalanced Data in Microservice
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[857]  arXiv:2405.20836 (cross-list from math.NA) [pdf, other]
Title: Solving partial differential equations with sampled neural networks
Comments: 16 pages, 15 figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[858]  arXiv:2405.20830 (cross-list from cs.CL) [pdf, other]
Title: Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[859]  arXiv:2405.20829 (cross-list from cs.CV) [pdf, other]
Title: Rethinking Open-World Semi-Supervised Learning: Distribution Mismatch and Inductive Inference
Comments: CVPR Workshop on Computer Vision in the Wild (CVinW), 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[860]  arXiv:2405.20825 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Analysis of clinical, dosimetric and radiomic features for predicting local failure after stereotactic radiotherapy of brain metastases in malignant melanoma
Subjects: Medical Physics (physics.med-ph); Machine Learning (cs.LG)
[861]  arXiv:2405.20808 (cross-list from cs.DS) [pdf, other]
Title: Optimally Improving Cooperative Learning in a Social Setting
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[862]  arXiv:2405.20799 (cross-list from stat.ML) [pdf, other]
Title: Rough Transformers: Lightweight Continuous-Time Sequence Modelling with Path Signatures
Comments: Preprint. Under review. arXiv admin note: text overlap with arXiv:2403.10288
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[863]  arXiv:2405.20797 (cross-list from cs.CV) [pdf, other]
Title: Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[864]  arXiv:2405.20791 (cross-list from cs.CV) [pdf, other]
Title: GS-Phong: Meta-Learned 3D Gaussians for Relightable Novel View Synthesis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[865]  arXiv:2405.20778 (cross-list from cs.CR) [pdf, other]
Title: Improved Generation of Adversarial Examples Against Safety-aligned LLMs
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[866]  arXiv:2405.20777 (cross-list from cs.CR) [pdf, other]
Title: Black-Box Detection of Language Model Watermarks
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[867]  arXiv:2405.20776 (cross-list from cs.CR) [pdf, other]
Title: Federated Learning with Blockchain-Enhanced Machine Unlearning: A Trustworthy Approach
Comments: 13 pages, 25 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[868]  arXiv:2405.20771 (cross-list from cs.CR) [pdf, other]
Title: Towards Black-Box Membership Inference Attack for Diffusion Models
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[869]  arXiv:2405.20769 (cross-list from cs.CR) [pdf, other]
Title: Avoiding Pitfalls for Privacy Accounting of Subsampled Mechanisms under Composition
Subjects: Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[870]  arXiv:2405.20768 (cross-list from cs.NE) [pdf, other]
Title: Expanded Gating Ranges Improve Activation Functions
Authors: Allen Hao Huang
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
[871]  arXiv:2405.20748 (cross-list from cs.AI) [pdf, other]
Title: OpenTensor: Reproducing Faster Matrix Multiplication Discovering Algorithms
Authors: Yiwen Sun, Wenye Li
Subjects: Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[872]  arXiv:2405.20743 (cross-list from cs.CV) [pdf, other]
Title: Trajectory Forecasting through Low-Rank Adaptation of Discrete Latent Codes
Comments: 15 pages, 3 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[873]  arXiv:2405.20731 (cross-list from cs.AI) [pdf, other]
Title: Maximum Temperature Prediction Using Remote Sensing Data Via Convolutional Neural Network
Comments: 4 pages, submitted to IEEE MetroLivEnv 2024 conference
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[874]  arXiv:2405.20717 (cross-list from cs.CV) [pdf, other]
Title: Cyclic image generation using chaotic dynamics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Chaotic Dynamics (nlin.CD)
[875]  arXiv:2405.20687 (cross-list from cs.CV) [pdf, other]
Title: Conditioning GAN Without Training Dataset
Comments: 5 pages, 2 figures, Part of my MSc project course, School Project Course 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[876]  arXiv:2405.20675 (cross-list from cs.CV) [pdf, other]
Title: Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling
Comments: 7 pages, 11 figures, ELLIS Doctoral Symposium 2023 in Helsinki, Finland
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[877]  arXiv:2405.20668 (cross-list from q-bio.BM) [pdf, other]
Title: Improving Paratope and Epitope Prediction by Multi-Modal Contrastive Learning and Interaction Informativeness Estimation
Comments: This paper is accepted by IJCAI 2024
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[878]  arXiv:2405.20649 (cross-list from cs.CL) [pdf, other]
Title: Reward-based Input Construction for Cross-document Relation Extraction
Comments: Accepted at ACL 2024 main conference
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[879]  arXiv:2405.20648 (cross-list from cs.CV) [pdf, other]
Title: Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[880]  arXiv:2405.20611 (cross-list from cs.CR) [pdf, ps, other]
Title: Bi-Directional Transformers vs. word2vec: Discovering Vulnerabilities in Lifted Compiled Code
Comments: 8 pages, 0 figures, IEEE 4th Cyber Awareness and Research Symposium 2024 (CARS'24)
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[881]  arXiv:2405.20606 (cross-list from cs.CV) [pdf, other]
Title: Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[882]  arXiv:2405.20596 (cross-list from cs.CV) [pdf, other]
Title: Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation
Comments: 10 pages; Accepted by NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[883]  arXiv:2405.20591 (cross-list from q-bio.PE) [pdf, other]
Title: Weak-Form Inference for Hybrid Dynamical Systems in Ecology
Subjects: Populations and Evolution (q-bio.PE); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[884]  arXiv:2405.20582 (cross-list from cs.CL) [pdf, ps, other]
Title: The Point of View of a Sentiment: Towards Clinician Bias Detection in Psychiatric Notes
Comments: Oral presentation at NAACL 2024 Queer in AI Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[885]  arXiv:2405.20579 (cross-list from cs.RO) [pdf, other]
Title: HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios
Comments: 10 pages, 6 tables, 5 figures, 1 page appendix
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[886]  arXiv:2405.20551 (cross-list from cs.SE) [pdf, other]
Title: EM-Assist: Safe Automated ExtractMethod Refactoring with LLMs
Comments: This paper is accepted to the tool demonstration track of the 32nd ACM Symposium on the Foundations of Software Engineering (FSE 2024). This is an author copy
Subjects: Software Engineering (cs.SE); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Programming Languages (cs.PL)
[887]  arXiv:2405.20512 (cross-list from cs.CL) [pdf, other]
Title: How Multilingual Are Large Language Models Fine-Tuned for Translation?
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[888]  arXiv:2405.20505 (cross-list from cs.CL) [pdf, other]
Title: SPOT: Text Source Prediction from Originality Score Thresholding
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[889]  arXiv:2405.20501 (cross-list from cs.RO) [pdf, other]
Title: ShelfHelp: Empowering Humans to Perform Vision-Independent Manipulation Tasks with a Socially Assistive Robotic Cane
Comments: 8 pages, 14 figures and charts
Journal-ref: In AAMAS (pp. 1514-1523) 2023
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[890]  arXiv:2405.20500 (cross-list from math.OC) [pdf, other]
Title: Hybrid Reinforcement Learning Framework for Mixed-Variable Problems
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[891]  arXiv:2405.20495 (cross-list from cs.CL) [pdf, other]
Title: Transfer Q Star: Principled Decoding for LLM Alignment
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[892]  arXiv:2405.20494 (cross-list from cs.CV) [pdf, other]
Title: Slight Corruption in Pre-training Data Makes Better Diffusion Models
Comments: 50 pages, 33 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[893]  arXiv:2405.20485 (cross-list from cs.CR) [pdf, other]
Title: Phantom: General Trigger Attacks on Retrieval Augmented Language Generation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[894]  arXiv:2405.20468 (cross-list from cs.CL) [pdf, other]
Title: Extending the Massive Text Embedding Benchmark to French
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[895]  arXiv:2405.20465 (cross-list from cs.CV) [pdf, other]
Title: ENTIRe-ID: An Extensive and Diverse Dataset for Person Re-Identification
Comments: 5 pages, 2024 18th International Conference on Automatic Face and Gesture Recognition (FG)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[896]  arXiv:2405.20451 (cross-list from stat.ML) [pdf, other]
Title: Statistical Properties of Robust Satisficing
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[897]  arXiv:2405.20447 (cross-list from stat.ML) [pdf, other]
Title: Algorithmic Fairness in Performative Policy Learning: Escaping the Impossibility of Group Fairness
Subjects: Machine Learning (stat.ML); Computers and Society (cs.CY); Machine Learning (cs.LG)
[898]  arXiv:2405.20446 (cross-list from cs.CR) [pdf, other]
Title: Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
Comments: 7 pages, 3 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[899]  arXiv:2405.20413 (cross-list from cs.CR) [pdf, other]
Title: Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters
Comments: 20 pages
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[900]  arXiv:2405.20412 (cross-list from cs.GR) [pdf, other]
Title: Audio2Rig: Artist-oriented deep learning tool for facial animation
Comments: Video examples and description: this https URL
Subjects: Graphics (cs.GR); Machine Learning (cs.LG)
[901]  arXiv:2405.20407 (cross-list from physics.ins-det) [pdf, other]
Title: Convolutional L2LFlows: Generating Accurate Showers in Highly Granular Calorimeters Using Convolutional Normalizing Flows
Subjects: Instrumentation and Detectors (physics.ins-det); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); High Energy Physics - Phenomenology (hep-ph); Data Analysis, Statistics and Probability (physics.data-an)
[902]  arXiv:2405.20405 (cross-list from cs.DS) [pdf, other]
Title: Private Mean Estimation with Person-Level Differential Privacy
Comments: 67 pages, 3 figures
Subjects: Data Structures and Algorithms (cs.DS); Cryptography and Security (cs.CR); Information Theory (cs.IT); Machine Learning (cs.LG); Machine Learning (stat.ML)
[903]  arXiv:2405.20404 (cross-list from cs.CL) [pdf, other]
Title: XPrompt: Explaining Large Language Model's Generation via Joint Prompt Attribution
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[904]  arXiv:2405.20400 (cross-list from stat.ME) [pdf, other]
Title: Fast leave-one-cluster-out cross-validation by clustered Network Information Criteria (NICc)
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
[905]  arXiv:2405.20384 (cross-list from cond-mat.quant-gas) [pdf, other]
Title: Recurrent neural network wave functions for Rydberg atom arrays on kagome lattice
Comments: 13 pages, 5 figures, 3 tables. Link to GitHub repository: this https URL
Subjects: Quantum Gases (cond-mat.quant-gas); Disordered Systems and Neural Networks (cond-mat.dis-nn); Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (cs.LG); Quantum Physics (quant-ph)
[906]  arXiv:2405.20355 (cross-list from cs.NE) [pdf, other]
Title: Enhancing Adversarial Robustness in SNNs with Sparse Gradients
Comments: accepted by ICML 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[907]  arXiv:2405.20354 (cross-list from cs.DL) [pdf, other]
Title: Literature Filtering for Systematic Reviews with Transformers
Subjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[908]  arXiv:2405.20348 (cross-list from physics.ao-ph) [pdf, other]
Title: Personalized Adapter for Large Meteorology Model on Devices: Towards Weather Foundation Models
Comments: 42 pages, under review
Subjects: Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (cs.LG)
[909]  arXiv:2405.20347 (cross-list from cs.CL) [pdf, other]
Title: Small Language Models for Application Interactions: A Case Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Fri, 31 May 2024 (showing first 131 of 181 entries)

[910]  arXiv:2405.20341 [pdf, other]
Title: From Zero to Hero: Cold-Start Anomaly Detection
Comments: ACL 2024. Our code is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[911]  arXiv:2405.20331 [pdf, other]
Title: CoSy: Evaluating Textual Explanations of Neurons
Comments: 10 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[912]  arXiv:2405.20313 [pdf, other]
Title: Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation
Comments: preprint
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[913]  arXiv:2405.20309 [pdf, other]
Title: Large Language Models Can Self-Improve At Web Agent Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[914]  arXiv:2405.20287 [pdf, other]
Title: Flexible SE(2) graph neural networks with applications to PDE surrogates
Comments: 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA); Fluid Dynamics (physics.flu-dyn)
[915]  arXiv:2405.20278 [pdf, ps, other]
Title: Length independent generalization bounds for deep SSM architectures with stability constraints
Comments: 25 pages, no figures, under submission
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[916]  arXiv:2405.20272 [pdf, other]
Title: Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[917]  arXiv:2405.20271 [pdf, other]
Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Comments: Accepted to ICML 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[918]  arXiv:2405.20233 [pdf, other]
Title: Grokfast: Accelerated Grokking by Amplifying Slow Gradients
Comments: 17 pages, 13 figures. Typo fixed. Project page: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[919]  arXiv:2405.20231 [pdf, other]
Title: The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof
Comments: 27 pages. Preparing code for release
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[920]  arXiv:2405.20200 [pdf, other]
Title: Unified Explanations in Machine Learning Models: A Perturbation Approach
Subjects: Machine Learning (cs.LG)
[921]  arXiv:2405.20194 [pdf, ps, other]
Title: Occam Gradient Descent
Authors: B.N. Kausik
Subjects: Machine Learning (cs.LG)
[922]  arXiv:2405.20180 [pdf, other]
Title: Transformers and Slot Encoding for Sample Efficient Physical World Modelling
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[923]  arXiv:2405.20174 [pdf, other]
Title: Tropical Expressivity of Neural Networks
Subjects: Machine Learning (cs.LG); Algebraic Geometry (math.AG)
[924]  arXiv:2405.20114 [pdf, other]
Title: Near Optimal Decentralized Optimization with Compression and Momentum Tracking
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
[925]  arXiv:2405.20085 [pdf, other]
Title: Soft Partitioning of Latent Space for Semantic Channel Equalization
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Multiagent Systems (cs.MA)
[926]  arXiv:2405.20082 [pdf, other]
Title: Segment, Shuffle, and Stitch: A Simple Mechanism for Improving Time-Series Representations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[927]  arXiv:2405.20051 [pdf, other]
Title: Threshold-Independent Fair Matching through Score Calibration
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[928]  arXiv:2405.20045 [pdf, other]
Title: Iterative Learning Control of Fast, Nonlinear, Oscillatory Dynamics (Preprint)
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Dynamical Systems (math.DS)
[929]  arXiv:2405.20042 [pdf, other]
Title: CycleFormer: TSP Solver Based on Language Modeling
Subjects: Machine Learning (cs.LG)
[930]  arXiv:2405.20029 [pdf, ps, other]
Title: A Random Forest-based Prediction Model for Turning Points in Antagonistic Event-Group Competitions
Authors: Zishuo Zhu
Subjects: Machine Learning (cs.LG)
[931]  arXiv:2405.20028 [pdf, ps, other]
Title: A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $\Theta(T^{2/3})$ and its Application to Best-of-Both-Worlds
Comments: 31 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[932]  arXiv:2405.20014 [pdf, other]
Title: subMFL: Compatible subModel Generation for Federated Learning in Device Heterogenous Environment
Comments: 12 pages, 7 figures, European Conference on Parallel Processing, pp. between 52 and 64, Springer, 2023
Subjects: Machine Learning (cs.LG)
[933]  arXiv:2405.20012 [pdf, other]
Title: FlexiDrop: Theoretical Insights and Practical Advances in Random Dropout Method on GNNs
Subjects: Machine Learning (cs.LG)
[934]  arXiv:2405.20003 [pdf, other]
Title: Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[935]  arXiv:2405.19978 [pdf, other]
Title: Domain Adaptation with Cauchy-Schwarz Divergence
Comments: Accepted by UAI-24
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[936]  arXiv:2405.19961 [pdf, other]
Title: Collective Variable Free Transition Path Sampling with Generative Flow Network
Comments: 9 pages, 5 figures, 2 tables
Subjects: Machine Learning (cs.LG)
[937]  arXiv:2405.19950 [pdf, other]
Title: MM-Lego: Modular Biomedical Multimodal Models with Minimal Fine-Tuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[938]  arXiv:2405.19933 [pdf, other]
Title: Learning Latent Graph Structures and their Uncertainty
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[939]  arXiv:2405.19928 [pdf, other]
Title: BAN: Detecting Backdoors Activated by Adversarial Neuron Noise
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[940]  arXiv:2405.19919 [pdf, other]
Title: Unraveling the Impact of Heterophilic Structures on Graph Positive-Unlabeled Learning
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[941]  arXiv:2405.19909 [pdf, other]
Title: Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Comments: ICML 2024, 19 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[942]  arXiv:2405.19902 [pdf, other]
Title: Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection
Comments: Accepted to CVPR 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[943]  arXiv:2405.19901 [pdf, other]
Title: Urban Air Pollution Forecasting: a Machine Learning Approach leveraging Satellite Observations and Meteorological Forecasts
Comments: 5 pages, 2 figures, submitted to IEEE MetroLivEnv 2024
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[944]  arXiv:2405.19893 [pdf, other]
Title: Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Comments: 12 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[945]  arXiv:2405.19888 [pdf, other]
Title: Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Comments: To appear at USENIX OSDI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[946]  arXiv:2405.19885 [pdf, other]
Title: Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[947]  arXiv:2405.19883 [pdf, other]
Title: From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[948]  arXiv:2405.19878 [pdf, other]
Title: Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models
Authors: Zeyu Fang, Tian Lan
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[949]  arXiv:2405.19870 [pdf, other]
Title: On Vessel Location Forecasting and the Effect of Federated Learning
Journal-ref: 2024 IEEE International Conference on Mobile Data Management (MDM), June 24 - June 27, 2024, Brussels, Belgium
Subjects: Machine Learning (cs.LG)
[950]  arXiv:2405.19864 [pdf, ps, other]
Title: Out-of-distribution Reject Option Method for Dataset Shift Problem in Early Disease Onset Prediction
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[951]  arXiv:2405.19836 [pdf, other]
Title: The Merit of River Network Topology for Neural Flood Forecasting
Comments: this https URL
Journal-ref: ICML 2024
Subjects: Machine Learning (cs.LG)
[952]  arXiv:2405.19823 [pdf, other]
Title: Joint Selective State Space Model and Detrending for Robust Time Series Anomaly Detection
Comments: Submitted to IEEE Signal Processing Letters
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[953]  arXiv:2405.19811 [pdf, ps, other]
Title: Approximate Global Convergence of Independent Learning in Multi-Agent Systems
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[954]  arXiv:2405.19807 [pdf, ps, other]
Title: MetaCURL: Non-stationary Concave Utility Reinforcement Learning
Authors: Bianca Marin Moreno (UGA, Thoth, EDF R&D, FiME Lab), Margaux Brégère (LPSM, EDF R&D), Pierre Gaillard (UGA, Thoth), Nadia Oudjane (EDF R&D, FiME Lab)
Subjects: Machine Learning (cs.LG); Probability (math.PR); Statistics Theory (math.ST); Machine Learning (stat.ML)
[955]  arXiv:2405.19806 [pdf, other]
Title: Preference Alignment with Flow Matching
Subjects: Machine Learning (cs.LG)
[956]  arXiv:2405.19804 [pdf, ps, other]
Title: Exploring Key Factors for Long-Term Vessel Incident Risk Prediction
Subjects: Machine Learning (cs.LG)
[957]  arXiv:2405.19789 [pdf, other]
Title: Estimating before Debiasing: A Bayesian Approach to Detaching Prior Bias in Federated Semi-Supervised Learning
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[958]  arXiv:2405.19785 [pdf, other]
Title: Recurrent Deep Kernel Learning of Dynamical Systems
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[959]  arXiv:2405.19757 [pdf, other]
Title: Improving SMOTE via Fusing Conditional VAE for Data-adaptive Noise Filtering
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[960]  arXiv:2405.19752 [pdf, ps, other]
Title: Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[961]  arXiv:2405.19747 [pdf, other]
Title: Understanding and mitigating difficulties in posterior predictive evaluation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[962]  arXiv:2405.19729 [pdf, other]
Title: Dynamic feature selection in medical predictive monitoring by reinforcement learning
Comments: preview version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[963]  arXiv:2405.19705 [pdf, ps, other]
Title: Universal Online Convex Optimization with $1$ Projection per Round
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[964]  arXiv:2405.19703 [pdf, other]
Title: Towards a Better Evaluation of Out-of-Domain Generalization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[965]  arXiv:2405.19690 [pdf, other]
Title: Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[966]  arXiv:2405.19679 [pdf, other]
Title: Efficient Trajectory Inference in Wasserstein Space Using Consecutive Averaging
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
[967]  arXiv:2405.19673 [pdf, other]
Title: Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Comments: Under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[968]  arXiv:2405.19667 [pdf, other]
Title: Reconciling Model Multiplicity for Downstream Decision Making
Comments: 16 pages main body, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[969]  arXiv:2405.19661 [pdf, other]
Title: MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
Subjects: Machine Learning (cs.LG)
[970]  arXiv:2405.19653 [pdf, other]
Title: SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems
Comments: 17 pages. Under review
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Systems and Control (eess.SY)
[971]  arXiv:2405.19650 [pdf, other]
Title: Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
[972]  arXiv:2405.19649 [pdf, ps, other]
Title: Towards Deeper Understanding of PPR-based Embedding Approaches: A Topological Perspective
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[973]  arXiv:2405.19647 [pdf, other]
Title: FTS: A Framework to Find a Faithful TimeSieve
Subjects: Machine Learning (cs.LG)
[974]  arXiv:2405.19600 [pdf, ps, other]
Title: Do spectral cues matter in contrast-based graph self-supervised learning?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[975]  arXiv:2405.19597 [pdf, other]
Title: SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors
Comments: 17 pages, 5 figures, 14 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[976]  arXiv:2405.19592 [pdf, other]
Title: Why Larger Language Models Do In-context Learning Differently?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[977]  arXiv:2405.19590 [pdf, other]
Title: Weights Augmentation: it has never ever ever ever let her model down
Subjects: Machine Learning (cs.LG)
[978]  arXiv:2405.19559 [pdf, ps, other]
Title: Clustering Mixtures of Discrete Distributions: A Note on Mitra's Algorithm
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[979]  arXiv:2405.19550 [pdf, other]
Title: Stress-Testing Capability Elicitation With Password-Locked Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[980]  arXiv:2405.19548 [pdf, other]
Title: RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Comments: 25 pages, 19 figures
Subjects: Machine Learning (cs.LG)
[981]  arXiv:2405.19547 [pdf, other]
Title: CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning
Comments: This paper supersedes our previous VAS paper (arXiv:2402.02055)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[982]  arXiv:2405.19534 [pdf, other]
Title: Preference Learning Algorithms Do Not Learn Preference Rankings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[983]  arXiv:2405.19532 [pdf, other]
Title: Contrasting Multiple Representations with the Multi-Marginal Matching Gap
Comments: To be presented at ICML 2024
Subjects: Machine Learning (cs.LG)
[984]  arXiv:2405.19521 [pdf, other]
Title: Crowdsourcing with Difficulty: A Bayesian Rating Model for Heterogeneous Items
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[985]  arXiv:2405.19513 [pdf, other]
Title: Decentralized Optimization in Time-Varying Networks with Arbitrary Delays
Comments: arXiv admin note: text overlap with arXiv:2401.11344
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
[986]  arXiv:2405.19499 [pdf, other]
Title: Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments
Journal-ref: Proceedings of the 41st International Conference on Machine Learning, 2024
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Optimization and Control (math.OC)
[987]  arXiv:2405.19471 [pdf, other]
Title: The Data Minimization Principle in Machine Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[988]  arXiv:2405.19466 [pdf, other]
Title: Posterior Sampling via Autoregressive Generation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[989]  arXiv:2405.19461 [pdf, other]
Title: Clustering-Based Validation Splits for Domain Generalisation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[990]  arXiv:2405.19454 [pdf, other]
Title: Deep Grokking: Would Deep Neural Networks Generalize Better?
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[991]  arXiv:2405.19440 [pdf, other]
Title: On the Convergence of Multi-objective Optimization under Generalized Smoothness
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[992]  arXiv:2405.19420 [pdf, other]
Title: Using Contrastive Learning with Generative Similarity to Learn Spaces that Capture Human Inductive Biases
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[993]  arXiv:2405.19414 [pdf, other]
Title: Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
Comments: 9 pages, 3 figures
Subjects: Machine Learning (cs.LG)
[994]  arXiv:2405.19376 [pdf, other]
Title: PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models
Comments: arXiv admin note: substantial text overlap with arXiv:2405.18627
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[995]  arXiv:2405.20343 (cross-list from cs.CV) [pdf, other]
Title: Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[996]  arXiv:2405.20324 (cross-list from cs.CV) [pdf, other]
Title: Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Comments: Accepted at CVPR 2024 as a Highlight. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[997]  arXiv:2405.20321 (cross-list from cs.RO) [pdf, other]
Title: Vision-based Manipulation from Single Human Video with Open-World Object Graphs
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[998]  arXiv:2405.20320 (cross-list from cs.CV) [pdf, other]
Title: Improving the Training of Rectified Flows
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[999]  arXiv:2405.20318 (cross-list from cs.CL) [pdf, other]
Title: CausalQuest: Collecting Natural Causal Questions for AI Agents
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1000]  arXiv:2405.20304 (cross-list from cs.CL) [pdf, other]
Title: Group Robust Preference Optimization in Reward-free RLHF
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[1001]  arXiv:2405.20289 (cross-list from cs.SD) [pdf, other]
Title: DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1002]  arXiv:2405.20274 (cross-list from cs.CL) [pdf, other]
Title: ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection
Comments: arXiv admin note: text overlap with arXiv:2309.13297
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1003]  arXiv:2405.20250 (cross-list from math.OC) [pdf, ps, other]
Title: Entropy annealing for policy mirror descent in continuous time and space
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Probability (math.PR)
[1004]  arXiv:2405.20247 (cross-list from cs.AI) [pdf, other]
Title: KerasCV and KerasNLP: Vision and Language Power-Ups
Comments: Submitted to Journal of Machine Learning Open Source Software
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Software Engineering (cs.SE)
[1005]  arXiv:2405.20245 (cross-list from cs.CL) [pdf, other]
Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use
Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[1006]  arXiv:2405.20237 (cross-list from quant-ph) [pdf, other]
Title: Training-efficient density quantum machine learning
Comments: 17 pages main text, 9 pages appendices. 9 figures
Subjects: Quantum Physics (quant-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1007]  arXiv:2405.20236 (cross-list from stat.ML) [pdf, other]
Title: Disentangling and Mitigating the Impact of Task Similarity for Continual Learning
Authors: Naoki Hiratani
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1008]  arXiv:2405.20230 (cross-list from cs.CV) [pdf, other]
Title: Feature Fusion for Improved Classification: Combining Dempster-Shafer Theory and Multiple CNN Architectures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1009]  arXiv:2405.20216 (cross-list from cs.CV) [pdf, other]
Title: Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback
Comments: 28 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1010]  arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]
Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1011]  arXiv:2405.20178 (cross-list from eess.SY) [pdf, other]
Title: Non-intrusive data-driven model order reduction for circuits based on Hammerstein architectures
Comments: 13 pages, 13 figures; submitted to IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[1012]  arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]
Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition
Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)
Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1013]  arXiv:2405.20165 (cross-list from stat.ML) [pdf, other]
Title: Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1014]  arXiv:2405.20139 (cross-list from cs.CL) [pdf, other]
Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1015]  arXiv:2405.20127 (cross-list from math.OC) [pdf, other]
Title: SPAM: Stochastic Proximal Point Method with Momentum Variance Reduction for Non-convex Cross-Device Federated Learning
Comments: The main part of the paper is around 9 pages. It contains the proposed algorithms, the main theoretical results and the experimental setting. The proofs of the main results and other technicalities are deferred to the Appendix
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[1016]  arXiv:2405.20124 (cross-list from stat.ML) [pdf, other]
Title: A Geometric Unification of Distributionally Robust Covariance Estimators: Shrinking the Spectrum by Inflating the Ambiguity Set
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[1017]  arXiv:2405.20094 (cross-list from math.NA) [pdf, other]
Title: Low-dimensional approximations of the conditional law of Volterra processes: a non-positive curvature approach
Comments: Main body: 25 Pages, Appendices 29 Pages, 14 Tables, 6 Figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Differential Geometry (math.DG); Computational Finance (q-fin.CP)
[1018]  arXiv:2405.20091 (cross-list from cs.CV) [pdf, other]
Title: Visual Attention Analysis in Online Learning
Comments: Accepted in CEDI 2024 (VII Congreso Español de Informática), A Coruña, Spain
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1019]  arXiv:2405.20086 (cross-list from math.ST) [pdf, other]
Title: Analysis of a multi-target linear shrinkage covariance estimator
Authors: Benoit Oriol
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[1020]  arXiv:2405.20079 (cross-list from cs.CL) [pdf, other]
Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1021]  arXiv:2405.20071 (cross-list from physics.med-ph) [pdf, ps, other]
Title: A Staged Approach using Machine Learning and Uncertainty Quantification to Predict the Risk of Hip Fracture
Comments: 29 pages, 5 figures, 6 tables
Subjects: Medical Physics (physics.med-ph); Machine Learning (cs.LG)
[1022]  arXiv:2405.20053 (cross-list from cs.CL) [pdf, other]
Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1023]  arXiv:2405.20052 (cross-list from eess.SP) [pdf, other]
Title: Hardware-Efficient EMG Decoding for Next-Generation Hand Prostheses
Comments: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[1024]  arXiv:2405.20039 (cross-list from stat.ML) [pdf, other]
Title: Task-Agnostic Machine Learning-Assisted Inference
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[1025]  arXiv:2405.20018 (cross-list from cs.MA) [pdf, other]
Title: Safe Multi-agent Reinforcement Learning with Natural Language Constraints
Comments: 23 pages, 6 figures
Subjects: Multiagent Systems (cs.MA); Computation and Language (cs.CL); Machine Learning (cs.LG)
[1026]  arXiv:2405.19995 (cross-list from stat.ML) [pdf, other]
Title: Symmetries in Overparametrized Neural Networks: A Mean-Field View
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[1027]  arXiv:2405.19988 (cross-list from cs.RO) [pdf, other]
Title: Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
Comments: 10 pages in the main text, 16 pages including references and supplementary materials. 4 figures and 3 tables in the main text, 1 table in supplementary materials
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1028]  arXiv:2405.19985 (cross-list from stat.ME) [pdf, other]
Title: Targeted Sequential Indirect Experiment Design
Subjects: Methodology (stat.ME); Machine Learning (cs.LG)
[1029]  arXiv:2405.19977 (cross-list from cs.DS) [pdf, other]
Title: Consistent Submodular Maximization
Comments: To appear at ICML 24
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1030]  arXiv:2405.19971 (cross-list from cs.CR) [pdf, other]
Title: GasTrace: Detecting Sandwich Attack Malicious Accounts in Ethereum
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[1031]  arXiv:2405.19967 (cross-list from cs.CL) [pdf, other]
Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1032]  arXiv:2405.19954 (cross-list from cs.CR) [pdf, other]
Title: GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[1033]  arXiv:2405.19931 (cross-list from cs.CV) [pdf, other]
Title: Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks
Comments: Preprint. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1034]  arXiv:2405.19912 (cross-list from stat.ML) [pdf, other]
Title: Robust Kernel Hypothesis Testing under Data Corruption
Comments: 26 pages, 2 figures, 2 algorithms
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[1035]  arXiv:2405.19889 (cross-list from eess.SP) [pdf, other]
Title: Deep Joint Semantic Coding and Beamforming for Near-Space Airship-Borne Massive MIMO Network
Comments: Major Revision by IEEE JSAC
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[1036]  arXiv:2405.19886 (cross-list from cs.NI) [pdf, other]
Title: Federated Learning with Multi-resolution Model Broadcast
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[1037]  arXiv:2405.19874 (cross-list from cs.CL) [pdf, other]
Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?
Comments: Preprint. Code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1038]  arXiv:2405.19805 (cross-list from cs.CC) [pdf, ps, other]
Title: Complexity of Deciding Injectivity and Surjectivity of ReLU Neural Networks
Comments: 17 pages
Subjects: Computational Complexity (cs.CC); Discrete Mathematics (cs.DM); Machine Learning (cs.LG)
[1039]  arXiv:2405.19787 (cross-list from cs.CL) [pdf, other]
Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[1040]  arXiv:2405.19784 (cross-list from cs.DB) [pdf, ps, other]
Title: PixelsDB: Serverless and Natural-Language-Aided Data Analytics with Flexible Service Levels and Prices
Comments: 4 pages, 3 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)