We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 236

[ total of 1090 entries: 1-250 | 237-486 | 487-736 | 737-986 | 987-1090 ]
[ showing 250 entries per page: fewer | more | all ]

Wed, 5 Jun 2024 (continued, showing last 205 of 208 entries)

[237]  arXiv:2406.02542 [pdf, other]
Title: Loki: Low-Rank Keys for Efficient Sparse Attention
Subjects: Machine Learning (cs.LG)
[238]  arXiv:2406.02515 [pdf, ps, other]
Title: Uncertainty of Joint Neural Contextual Bandit
Subjects: Machine Learning (cs.LG)
[239]  arXiv:2406.02510 [pdf, other]
Title: Fairness-Optimized Synthetic EHR Generation for Arbitrary Downstream Predictive Tasks
Subjects: Machine Learning (cs.LG)
[240]  arXiv:2406.02500 [pdf, other]
Title: Demystifying the Compression of Mixture-of-Experts Through a Unified Framework
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[241]  arXiv:2406.02496 [pdf, other]
Title: Kolmogorov-Arnold Networks for Time Series: Bridging Predictive Power and Interpretability
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[242]  arXiv:2406.02490 [pdf, other]
Title: Ai-Sampler: Adversarial Learning of Markov kernels with involutive maps
Journal-ref: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[243]  arXiv:2406.02486 [pdf, other]
Title: A Temporal Kolmogorov-Arnold Transformer for Time Series Forecasting
Comments: arXiv admin note: text overlap with arXiv:2405.07344
Subjects: Machine Learning (cs.LG)
[244]  arXiv:2406.02479 [pdf, ps, other]
Title: Applying Fine-Tuned LLMs for Reducing Data Needs in Load Profile Analysis
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Systems and Control (eess.SY)
[245]  arXiv:2406.02469 [pdf, other]
Title: Landscape-Aware Growing: The Power of a Little LAG
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[246]  arXiv:2406.02465 [pdf, other]
Title: An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[247]  arXiv:2406.02464 [pdf, other]
Title: Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[248]  arXiv:2406.02456 [pdf, other]
Title: Offline Bayesian Aleatoric and Epistemic Uncertainty Quantification and Posterior Value Optimisation in Finite-State MDPs
Comments: 19 pages, 13 figures, 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)
Subjects: Machine Learning (cs.LG)
[249]  arXiv:2406.02450 [pdf, other]
Title: A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[250]  arXiv:2406.02447 [pdf, other]
Title: Reducing Bias in Federated Class-Incremental Learning with Hierarchical Generative Prototypes
Subjects: Machine Learning (cs.LG)
[251]  arXiv:2406.02428 [pdf, other]
Title: Harnessing Neural Unit Dynamics for Effective and Scalable Class-Incremental Learning
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG)
[252]  arXiv:2406.02424 [pdf, ps, other]
Title: Contextual Dynamic Pricing: Algorithms, Optimality, and Local Differential Privacy Constraints
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[253]  arXiv:2406.02416 [pdf, other]
Title: Improved Modelling of Federated Datasets using Mixtures-of-Dirichlet-Multinomials
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[254]  arXiv:2406.02395 [pdf, other]
Title: GrootVL: Tree Topology is All You Need in State Space Model
Comments: The code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[255]  arXiv:2406.02366 [pdf, other]
Title: Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[256]  arXiv:2406.02362 [pdf, other]
Title: Temporal Graph Rewiring with Expander Graphs
Comments: 10 pages, 2 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[257]  arXiv:2406.02361 [pdf, other]
Title: Using Self-supervised Learning Can Improve Model Fairness
Comments: arXiv admin note: text overlap with arXiv:2401.01640
Subjects: Machine Learning (cs.LG)
[258]  arXiv:2406.02356 [pdf, other]
Title: Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks
Comments: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[259]  arXiv:2406.02354 [pdf, other]
Title: Label-wise Aleatoric and Epistemic Uncertainty Quantification
Comments: Uncertainty in Artificial Intelligence. arXiv admin note: substantial text overlap with arXiv:2401.00276
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[260]  arXiv:2406.02352 [pdf, other]
Title: System-Aware Neural ODE Processes for Few-Shot Bayesian Optimization
Subjects: Machine Learning (cs.LG)
[261]  arXiv:2406.02348 [pdf, ps, other]
Title: AMOSL: Adaptive Modality-wise Structure Learning in Multi-view Graph Neural Networks For Enhanced Unified Representation
Journal-ref: 13th International Conference on Soft Computing, Artificial Intelligence and Applications (SAI 2024)
Subjects: Machine Learning (cs.LG)
[262]  arXiv:2406.02344 [pdf, other]
Title: Incorporating Navigation Context into Inland Vessel Trajectory Prediction: A Gaussian Mixture Model and Transformer Approach
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Machine Learning (cs.LG)
[263]  arXiv:2406.02343 [pdf, other]
Title: Cluster-Aware Similarity Diffusion for Instance Retrieval
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[264]  arXiv:2406.02336 [pdf, other]
Title: Polynomial-Augmented Neural Networks (PANNs) with Weak Orthogonality Constraints for Enhanced Function and PDE Approximation
Subjects: Machine Learning (cs.LG)
[265]  arXiv:2406.02332 [pdf, other]
Title: Extended Mind Transformers
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[266]  arXiv:2406.02322 [pdf, other]
Title: A Survey of Transformer Enabled Time Series Synthesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[267]  arXiv:2406.02318 [pdf, other]
Title: PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection
Comments: Accepted by SIGKDD 2024 (Research Track)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[268]  arXiv:2406.02317 [pdf, other]
Title: Generative Conditional Distributions by Neural (Entropic) Optimal Transport
Comments: 15 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[269]  arXiv:2406.02310 [pdf, other]
Title: Disentangled Representation via Variational AutoEncoder for Continuous Treatment Effect Estimation
Subjects: Machine Learning (cs.LG)
[270]  arXiv:2406.02309 [pdf, other]
Title: Effects of Exponential Gaussian Distribution on (Double Sampling) Randomized Smoothing
Comments: ICML 2024 Poster
Subjects: Machine Learning (cs.LG)
[271]  arXiv:2406.02296 [pdf, other]
Title: Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[272]  arXiv:2406.02295 [pdf, other]
Title: How to Explore with Belief: State Entropy Maximization in POMDPs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[273]  arXiv:2406.02294 [pdf, other]
Title: Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling
Comments: This paper was accepted at the ETFA 2024 conference
Subjects: Machine Learning (cs.LG)
[274]  arXiv:2406.02292 [pdf, other]
Title: An Axiomatic Approach to Loss Aggregation and an Adapted Aggregating Algorithm
Comments: 31 pages
Subjects: Machine Learning (cs.LG)
[275]  arXiv:2406.02290 [pdf, other]
Title: A Study of Optimizations for Fine-tuning Large Language Models
Comments: 10 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[276]  arXiv:2406.02282 [pdf, other]
Title: Test-Time Regret Minimization in Meta Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[277]  arXiv:2406.02268 [pdf, other]
Title: Analyzing the Benefits of Prototypes for Semi-Supervised Category Learning
Comments: 7 pages, 3 figures
Subjects: Machine Learning (cs.LG)
[278]  arXiv:2406.02258 [pdf, other]
Title: Reinforcement Learning with Lookahead Information
Authors: Nadav Merlis
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[279]  arXiv:2406.02234 [pdf, other]
Title: On the Limitations of Fractal Dimension as a Measure of Generalization
Comments: 17 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Dynamical Systems (math.DS); Machine Learning (stat.ML)
[280]  arXiv:2406.02214 [pdf, other]
Title: SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
Subjects: Machine Learning (cs.LG)
[281]  arXiv:2406.02213 [pdf, other]
Title: Rectifying Reinforcement Learning for Reward Matching
Subjects: Machine Learning (cs.LG)
[282]  arXiv:2406.02189 [pdf, other]
Title: Fast and Scalable Multi-Kernel Encoder Classifier
Authors: Cencheng Shen
Comments: 12 pages main + 3 pages appendix
Subjects: Machine Learning (cs.LG)
[283]  arXiv:2406.02187 [pdf, other]
Title: DNCs Require More Planning Steps
Subjects: Machine Learning (cs.LG)
[284]  arXiv:2406.02180 [pdf, other]
Title: On The Statistical Representation Properties Of The Perturb-Softmax And The Perturb-Argmax Probability Distributions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[285]  arXiv:2406.02177 [pdf, other]
Title: One-Shot Federated Learning with Bayesian Pseudocoresets
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[286]  arXiv:2406.02176 [pdf, other]
Title: AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields
Subjects: Machine Learning (cs.LG)
[287]  arXiv:2406.02175 [pdf, other]
Title: Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees
Comments: This preprint is currently under review
Subjects: Machine Learning (cs.LG)
[288]  arXiv:2406.02165 [pdf, other]
Title: SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subjects: Machine Learning (cs.LG)
[289]  arXiv:2406.02146 [pdf, other]
Title: Activation Bottleneck: Sigmoidal Neural Networks Cannot Forecast a Straight Line
Subjects: Machine Learning (cs.LG)
[290]  arXiv:2406.02131 [pdf, other]
Title: CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting
Comments: 23 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[291]  arXiv:2406.02128 [pdf, other]
Title: Iteration Head: A Mechanistic Study of Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[292]  arXiv:2406.02105 [pdf, other]
Title: Kernel vs. Kernel: Exploring How the Data Structure Affects Neural Collapse
Comments: 34 pages, 14 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (stat.ML)
[293]  arXiv:2406.02075 [pdf, other]
Title: ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[294]  arXiv:2406.02066 [pdf, other]
Title: Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models
Comments: Accepted by ICML 2024(Oral)
Subjects: Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[295]  arXiv:2406.02064 [pdf, other]
Title: Advancing Generalized Transfer Attack with Initialization Derived Bilevel Optimization and Dynamic Sequence Truncation
Comments: Accepted by IJCAI 2024. 10 pages
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[296]  arXiv:2406.02061 [pdf, other]
Title: Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
Comments: v1
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[297]  arXiv:2406.02059 [pdf, other]
Title: Graph Adversarial Diffusion Convolution
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG)
[298]  arXiv:2406.02056 [pdf, other]
Title: CAP: A Context-Aware Neural Predictor for NAS
Comments: Accepted by IJCAI24
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[299]  arXiv:2406.02052 [pdf, other]
Title: PETRA: Parallel End-to-end Training with Reversible Architectures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[300]  arXiv:2406.02040 [pdf, other]
Title: DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[301]  arXiv:2406.02035 [pdf, other]
Title: A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[302]  arXiv:2406.02027 [pdf, other]
Title: Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[303]  arXiv:2406.02024 [pdf, other]
Title: Verifying the Generalization of Deep Learning to Out-of-Distribution Domains
Comments: To appear in the Journal of Automated Reasoning (JAR), 2024. arXiv admin note: substantial text overlap with arXiv:2302.05745
Subjects: Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[304]  arXiv:2406.02017 [pdf, other]
Title: On the Mode-Seeking Properties of Langevin Dynamics
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[305]  arXiv:2406.02015 [pdf, other]
Title: Parameterizing Federated Continual Learning for Reproducible Research
Comments: Preprint: Accepted at the 1st WAFL (Workshop on Advancements in Federated Learning) workshop, ECML-PKDD 2023
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[306]  arXiv:2406.02013 [pdf, other]
Title: Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning
Comments: 16 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[307]  arXiv:2406.01996 [pdf, other]
Title: Bayesian Mesh Optimization for Graph Neural Networks to Enhance Engineering Performance Prediction
Comments: 17 pages, 8 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[308]  arXiv:2406.01977 [pdf, other]
Title: What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[309]  arXiv:2406.01975 [pdf, other]
Title: Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[310]  arXiv:2406.01969 [pdf, other]
Title: Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training
Subjects: Machine Learning (cs.LG)
[311]  arXiv:2406.01960 [pdf, other]
Title: Certifiably Byzantine-Robust Federated Conformal Prediction
Comments: Accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[312]  arXiv:2406.01950 [pdf, ps, other]
Title: A Comparative Study of Sampling Methods with Cross-Validation in the FedHome Framework
Comments: 11 Figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[313]  arXiv:2406.01913 [pdf, other]
Title: Generating Synthetic Net Load Data with Physics-informed Diffusion Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[314]  arXiv:2406.01909 [pdf, other]
Title: A Global Geometric Analysis of Maximal Coding Rate Reduction
Comments: 43 pages, 9 figures. This work has been accepted for publication in the Proceedings of the 41st International Conference on Machine Learning (ICML 2024)
Subjects: Machine Learning (cs.LG)
[315]  arXiv:2406.01908 [pdf, other]
Title: PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[316]  arXiv:2406.01901 [pdf, other]
Title: Bifurcated Generative Flow Networks
Subjects: Machine Learning (cs.LG)
[317]  arXiv:2406.01899 [pdf, other]
Title: Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models
Subjects: Machine Learning (cs.LG)
[318]  arXiv:2406.01895 [pdf, other]
Title: Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Comments: 32 pages, 16 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[319]  arXiv:2406.01870 [pdf, other]
Title: Understanding Stochastic Natural Gradient Variational Inference
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[320]  arXiv:2406.01857 [pdf, other]
Title: Neural Green's Operators for Parametric Partial Differential Equations
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[321]  arXiv:2406.01853 [pdf, other]
Title: Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[322]  arXiv:2406.01838 [pdf, other]
Title: Learning the Target Network in Function Space
Comments: Accepted to International Conference on Machine Learning (ICML24)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[323]  arXiv:2406.01833 [pdf, other]
Title: CAFO: Feature-Centric Explanation on Time Series Classification
Comments: Accepted to KDD 2024 Research Track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[324]  arXiv:2406.01825 [pdf, other]
Title: EMOE: Expansive Matching of Experts for Robust Uncertainty Based Rejection
Authors: Yunni Qu (1), James Wellnitz (2), Alexander Tropsha (2), Junier Oliva (1) ((1) Department of Computer Science, University of North Carolina at Chapel Hill, (2) Eshelman School of Pharmacy, University of North Carolina at Chapel Hill)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[325]  arXiv:2406.01823 [pdf, other]
Title: Causal Discovery with Fewer Conditional Independence Tests
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME); Machine Learning (stat.ML)
[326]  arXiv:2406.01808 [pdf, other]
Title: In-Context Learning of Physical Properties: Few-Shot Adaptation to Out-of-Distribution Molecular Graphs
Comments: 12 pages, 4 figures
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[327]  arXiv:2406.01805 [pdf, other]
Title: TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[328]  arXiv:2406.01799 [pdf, other]
Title: Online Control in Population Dynamics
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[329]  arXiv:2406.01793 [pdf, other]
Title: Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[330]  arXiv:2406.01789 [pdf, ps, other]
Title: AI-based Classification of Customer Support Tickets: State of the Art and Implementation with AutoML
Journal-ref: Proceedings of the IWEMB 2021/2022: Fifth and Sixth International Workshop on Entrepreneurship, Electronic and Mobile Business
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[331]  arXiv:2406.01781 [pdf, other]
Title: DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$-transform
Comments: arXiv admin note: text overlap with arXiv:2312.09236
Subjects: Machine Learning (cs.LG)
[332]  arXiv:2406.01766 [pdf, ps, other]
Title: How Does Gradient Descent Learn Features -- A Local Analysis for Regularized Two-Layer Neural Networks
Authors: Mo Zhou, Rong Ge
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[333]  arXiv:2406.01762 [pdf, other]
Title: Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[334]  arXiv:2406.01757 [pdf, other]
Title: Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
Comments: 14 pages, 1 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[335]  arXiv:2406.01755 [pdf, other]
Title: Sparser, Better, Deeper, Stronger: Improving Sparse Training with Exact Orthogonal Initialization
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[336]  arXiv:2406.01753 [pdf, other]
Title: Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification
Comments: Under review
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[337]  arXiv:2406.01733 [pdf, other]
Title: Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Comments: Code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[338]  arXiv:2406.01727 [pdf, other]
Title: Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems
Comments: This is a preprint version submitted to IEEE Transactions on Machine learning in Communications and Networking. arXiv admin note: text overlap with arXiv:2308.05036
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA); Signal Processing (eess.SP)
[339]  arXiv:2406.01661 [pdf, other]
Title: A Diffusion Model Framework for Unsupervised Neural Combinatorial Optimization
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM); Machine Learning (stat.ML)
[340]  arXiv:2406.01660 [pdf, other]
Title: Self-Improving Robust Preference Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[341]  arXiv:2406.01649 [pdf, other]
Title: CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
[342]  arXiv:2406.01647 [pdf, other]
Title: An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[343]  arXiv:2406.01646 [pdf, other]
Title: iKAN: Global Incremental Learning with KAN for Human Activity Recognition Across Heterogeneous Datasets
Comments: This work is submitted to Ubicomp/ISWC24 and is under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
[344]  arXiv:2406.01645 [pdf, other]
Title: FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[345]  arXiv:2406.01638 [pdf, other]
Title: TimeCMA: Towards LLM-Empowered Time Series Forecasting via Cross-Modality Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[346]  arXiv:2406.02539 (cross-list from cs.CV) [pdf, other]
Title: Parrot: Multilingual Visual Instruction Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[347]  arXiv:2406.02537 (cross-list from cs.CL) [pdf, other]
Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[348]  arXiv:2406.02536 (cross-list from cs.CL) [pdf, other]
Title: Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[349]  arXiv:2406.02534 (cross-list from eess.IV) [pdf, other]
Title: Enhancing predictive imaging biomarker discovery through treatment effect analysis
Comments: 19 pages, 12 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[350]  arXiv:2406.02529 (cross-list from eess.IV) [pdf, other]
Title: ReLUs Are Sufficient for Learning Implicit Neural Representations
Comments: Accepted to ICML 2024
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[351]  arXiv:2406.02523 (cross-list from cs.RO) [pdf, other]
Title: RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Comments: RSS 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[352]  arXiv:2406.02507 (cross-list from cs.CV) [pdf, other]
Title: Guiding a Diffusion Model with a Bad Version of Itself
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[353]  arXiv:2406.02497 (cross-list from eess.SY) [pdf, ps, other]
Title: Dropout MPC: An Ensemble Neural MPC Approach for Systems with Learned Dynamics
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[354]  arXiv:2406.02477 (cross-list from eess.IV) [pdf, other]
Title: Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[355]  arXiv:2406.02470 (cross-list from quant-ph) [pdf, other]
Title: Meta-Designing Quantum Experiments with Language Models
Comments: 10+3 pages, 5 figures
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
[356]  arXiv:2406.02457 (cross-list from cond-mat.mtrl-sci) [pdf, other]
Title: Machine learning Hubbard parameters with equivariant neural networks
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[357]  arXiv:2406.02432 (cross-list from cs.DS) [pdf, other]
Title: Coresets for Multiple $\ell_p$ Regression
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[358]  arXiv:2406.02431 (cross-list from cs.DS) [pdf, other]
Title: Reweighted Solutions for Weighted Low Rank Approximation
Comments: ICML 2024
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[359]  arXiv:2406.02426 (cross-list from math.OC) [pdf, other]
Title: Contextual Optimization under Covariate Shift: A Robust Approach by Intersecting Wasserstein Balls
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[360]  arXiv:2406.02422 (cross-list from eess.IV) [pdf, other]
Title: IterMask2: Iterative Unsupervised Anomaly Segmentation via Spatial and Frequency Masking for Brain Lesions in MRI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[361]  arXiv:2406.02421 (cross-list from cs.DM) [pdf, other]
Title: Representing Piecewise-Linear Functions by Functions with Minimal Arity
Subjects: Discrete Mathematics (cs.DM); Machine Learning (cs.LG); Symbolic Computation (cs.SC)
[362]  arXiv:2406.02394 (cross-list from cs.CL) [pdf, other]
Title: Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[363]  arXiv:2406.02383 (cross-list from cs.CV) [pdf, other]
Title: Learning to Edit Visual Programs with Self-Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[364]  arXiv:2406.02357 (cross-list from cs.GT) [pdf, ps, other]
Title: The complexity of approximate (coarse) correlated equilibrium for incomplete information games
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[365]  arXiv:2406.02355 (cross-list from cs.CV) [pdf, other]
Title: FedDr+: Stabilizing Dot-regression with Global Feature Distillation for Federated Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[366]  arXiv:2406.02347 (cross-list from cs.CV) [pdf, other]
Title: Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Comments: 16 pages + 16 pages appendices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[367]  arXiv:2406.02345 (cross-list from cs.CV) [pdf, other]
Title: Progressive Confident Masking Attention Network for Audio-Visual Segmentation
Comments: 10 pages, 9 figures, submitted to IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[368]  arXiv:2406.02333 (cross-list from cs.NI) [pdf, other]
Title: Towards Neural Architecture Search for Transfer Learning in 6G Networks
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[369]  arXiv:2406.02329 (cross-list from cs.CL) [pdf, other]
Title: On Affine Homotopy between Language Encoders
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[370]  arXiv:2406.02327 (cross-list from cs.CV) [pdf, other]
Title: Continual Unsupervised Out-of-Distribution Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[371]  arXiv:2406.02315 (cross-list from cs.SD) [pdf, other]
Title: An Independence-promoting Loss for Music Generation with Language Models
Comments: Accepted to ICML 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[372]  arXiv:2406.02313 (cross-list from cond-mat.stat-mech) [pdf, other]
Title: Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models
Subjects: Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG)
[373]  arXiv:2406.02300 (cross-list from math.AT) [pdf, other]
Title: Node-Level Topological Representation Learning on Point Clouds
Comments: 30 pages, 10 figures, comments welcome
Subjects: Algebraic Topology (math.AT); Computational Geometry (cs.CG); Machine Learning (cs.LG)
[374]  arXiv:2406.02298 (cross-list from math-ph) [pdf, other]
Title: Solving Partial Differential Equations in Different Domains by Operator Learning method Based on Boundary Integral Equations
Subjects: Mathematical Physics (math-ph); Machine Learning (cs.LG)
[375]  arXiv:2406.02293 (cross-list from stat.ML) [pdf, other]
Title: Composite Quantile Regression With XGBoost Using the Novel Arctan Pinball Loss
Comments: 24 pages, 9 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[376]  arXiv:2406.02285 (cross-list from eess.AS) [pdf, other]
Title: Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models
Comments: accepted at INTERSPEECH 2024
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[377]  arXiv:2406.02273 (cross-list from math.OC) [pdf, ps, other]
Title: A KL-based Analysis Framework with Applications to Non-Descent Optimization Methods
Comments: 29 pages
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[378]  arXiv:2406.02269 (cross-list from stat.ML) [pdf, ps, other]
Title: Graph Neural Networks Do Not Always Oversmooth
Subjects: Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[379]  arXiv:2406.02255 (cross-list from eess.AS) [pdf, other]
Title: MidiCaps -- A large-scale MIDI dataset with text captions
Comments: Under review
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)
[380]  arXiv:2406.02245 (cross-list from cs.CL) [pdf, other]
Title: Description Boosting for Zero-Shot Entity and Relation Classification
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[381]  arXiv:2406.02225 (cross-list from math.OC) [pdf, other]
Title: Riemannian coordinate descent algorithms on matrix manifolds
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[382]  arXiv:2406.02223 (cross-list from cs.CV) [pdf, other]
Title: SMCL: Saliency Masked Contrastive Learning for Long-tailed Recognition
Comments: accepted at ICASSP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[383]  arXiv:2406.02204 (cross-list from cs.CE) [pdf, other]
Title: The Deep Latent Space Particle Filter for Real-Time Data Assimilation with Uncertainty Quantification
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[384]  arXiv:2406.02191 (cross-list from stat.ML) [pdf, other]
Title: On the Recoverability of Causal Relations from Temporally Aggregated I.I.D. Data
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[385]  arXiv:2406.02173 (cross-list from math.NA) [pdf, other]
Title: Learning the Hodgkin-Huxley Model with Operator Learning Techniques
Comments: 24 pages, 8 figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[386]  arXiv:2406.02158 (cross-list from cs.CV) [pdf, other]
Title: Radar Spectra-Language Model for Automotive Scene Parsing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[387]  arXiv:2406.02157 (cross-list from stat.ML) [pdf, other]
Title: Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[388]  arXiv:2406.02156 (cross-list from cs.CR) [pdf, ps, other]
Title: Almost linear time differentially private release of synthetic graphs
Subjects: Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[389]  arXiv:2406.02154 (cross-list from math-ph) [pdf, other]
Title: Learning Hamiltonian neural Koopman operator and simultaneously sustaining and discovering conservation law
Subjects: Mathematical Physics (math-ph); Machine Learning (cs.LG)
[390]  arXiv:2406.02140 (cross-list from cs.CR) [pdf, other]
Title: Optimality of Matrix Mechanism on $\ell_p^p$-metric
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[391]  arXiv:2406.02133 (cross-list from eess.AS) [pdf, other]
Title: SimulTron: On-Device Simultaneous Speech to Speech Translation
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[392]  arXiv:2406.02126 (cross-list from eess.SY) [pdf, other]
Title: CityLight: A Universal Model Towards Real-world City-scale Traffic Signal Control Coordination
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[393]  arXiv:2406.02092 (cross-list from cs.SD) [pdf, other]
Title: MaskSR: Masked Language Model for Full-band Speech Restoration
Comments: Accepted by INTERSPEECH 2024. Demo page: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[394]  arXiv:2406.02081 (cross-list from cs.MA) [pdf, other]
Title: FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Comments: ICML 2024
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[395]  arXiv:2406.02080 (cross-list from cs.CL) [pdf, other]
Title: LongSSM: On the Length Extension of State-space Models in Language Modelling
Authors: Shida Wang
Comments: 23 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[396]  arXiv:2406.02057 (cross-list from cs.AI) [pdf, other]
Title: Tabular and Deep Learning for the Whittle Index
Authors: Francisco Robledo Relaño (LMAP, UPPA, UPV / EHU), Vivek Borkar (EE-IIT), Urtzi Ayesta (IRIT-RMESS, UPV/EHU, CNRS), Konstantin Avrachenkov (Inria)
Comments: ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 2024
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[397]  arXiv:2406.02049 (cross-list from stat.ML) [pdf, other]
Title: Causal Effect Identification in LiNGAM Models with Latent Confounders
Comments: Accepted at International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[398]  arXiv:2406.02044 (cross-list from cs.CL) [pdf, ps, other]
Title: QROA: A Black-Box Query-Response Optimization Attack on LLMs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[399]  arXiv:2406.02021 (cross-list from cs.CV) [pdf, other]
Title: MetaMixer Is All You Need
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[400]  arXiv:2406.02016 (cross-list from math.OC) [pdf, other]
Title: Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization
Comments: 33 pages, 2 figures
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[401]  arXiv:2406.02014 (cross-list from q-bio.NC) [pdf, other]
Title: Understanding Auditory Evoked Brain Signal via Physics-informed Embedding Network with Multi-Task Transformer
Subjects: Neurons and Cognition (q-bio.NC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[402]  arXiv:2406.01967 (cross-list from cs.RO) [pdf, other]
Title: DrEureka: Language Model Guided Sim-To-Real Transfer
Comments: Robotics: Science and Systems (RSS) 2024. Project website and open-source code: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[403]  arXiv:2406.01959 (cross-list from math.OC) [pdf, other]
Title: Adaptive Variance Reduction for Stochastic Optimization under Weaker Assumptions
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[404]  arXiv:2406.01947 (cross-list from cs.RO) [pdf, other]
Title: Data-Driven Approaches for Thrust Prediction in Underwater Flapping Fin Propulsion Systems
Comments: 9 pages, 11 figures, AAAI 2021 Fall Series Symposium on Science-Guided AI
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[405]  arXiv:2406.01940 (cross-list from cs.CL) [pdf, other]
Title: Process-Driven Autoformalization in Lean 4
Comments: 22 pages, 1 figures, 11 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[406]  arXiv:2406.01939 (cross-list from cs.AI) [pdf, other]
Title: Speeding up Policy Simulation in Supply Chain RL
Subjects: Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[407]  arXiv:2406.01933 (cross-list from stat.ML) [pdf, ps, other]
Title: Orthogonal Causal Calibration
Comments: 44 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
[408]  arXiv:2406.01876 (cross-list from cs.DB) [pdf, other]
Title: GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
Comments: KDD 2024 Camera Ready; 11 pages, 8 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[409]  arXiv:2406.01873 (cross-list from cs.CL) [pdf, other]
Title: CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models
Comments: Accepted by ACL Findings 2024
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[410]  arXiv:2406.01852 (cross-list from cs.NI) [pdf, other]
Title: Non-uniformity is All You Need: Efficient and Timely Encrypted Traffic Classification With ECHO
Subjects: Networking and Internet Architecture (cs.NI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[411]  arXiv:2406.01829 (cross-list from cs.NE) [pdf, other]
Title: FacAID: A Transformer Model for Neuro-Symbolic Facade Reconstruction
Comments: 11 pages, 10 figures, preprint
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[412]  arXiv:2406.01813 (cross-list from stat.ML) [pdf, other]
Title: Diffusion Boosted Trees
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Applications (stat.AP); Methodology (stat.ME)
[413]  arXiv:2406.01801 (cross-list from stat.ML) [pdf, other]
Title: Fearless Stochasticity in Expectation Propagation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[414]  arXiv:2406.01782 (cross-list from eess.SY) [pdf, other]
Title: Multi-agent assignment via state augmented reinforcement learning
Comments: 12 pages, 3 figures, 6th Annual Conference on Learning for Dynamics and Control
Journal-ref: Proceedings of Machine Learning Research vol 242 1 12, 2024. 6th Annual Conference on Learning for Dynamics and Control
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[415]  arXiv:2406.01774 (cross-list from cs.DC) [pdf, other]
Title: Efficient Data Distribution Estimation for Accelerated Federated Learning
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[416]  arXiv:2406.01708 (cross-list from cs.CR) [pdf, other]
Title: Model for Peanuts: Hijacking ML Models without Training Access is Possible
Comments: 17 pages, 14 figures, 7 tables
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[417]  arXiv:2406.01698 (cross-list from cs.AR) [pdf, other]
Title: Demystifying Platform Requirements for Diverse LLM Inference Use Cases
Comments: 12 Pages, this https URL
Subjects: Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[418]  arXiv:2406.01663 (cross-list from stat.ML) [pdf, other]
Title: An efficient solution to Hidden Markov Models on trees with coupled branches
Comments: 24 + 6 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Signal Processing (eess.SP); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
[419]  arXiv:2406.01655 (cross-list from cs.SD) [pdf, other]
Title: TinySV: Speaker Verification in TinyML with On-device Learning
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[420]  arXiv:2406.01653 (cross-list from stat.ML) [pdf, other]
Title: An efficient Wasserstein-distance approach for reconstructing jump-diffusion processes using parameterized neural networks
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR); Applications (stat.AP); Methodology (stat.ME)
[421]  arXiv:2406.01652 (cross-list from stat.ME) [pdf, ps, other]
Title: Distributional bias compromises leave-one-out cross-validation
Comments: 20 pages, 5 figures, supplementary information
Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[422]  arXiv:2406.01651 (cross-list from q-bio.QM) [pdf, other]
Title: FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction
Comments: 10 pages, 8 figures
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[423]  arXiv:2406.01650 (cross-list from q-bio.BM) [pdf, other]
Title: TAGMol: Target-Aware Gradient-guided Molecule Generation
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[424]  arXiv:2406.01633 (cross-list from cs.IR) [pdf, other]
Title: On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
Comments: Preprint of UAI'24 conference publication
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[425]  arXiv:2406.01631 (cross-list from cs.IR) [pdf, other]
Title: An LLM-based Recommender System Environment
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[426]  arXiv:2406.01630 (cross-list from q-bio.QM) [pdf, other]
Title: Equivariant amortized inference of poses for cryo-EM
Comments: Published at the GEM workshop, ICLR 2024
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG)
[427]  arXiv:2406.01627 (cross-list from q-bio.GN) [pdf, other]
Title: GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models
Subjects: Genomics (q-bio.GN); Machine Learning (cs.LG)
[428]  arXiv:2406.01624 (cross-list from eess.AS) [pdf, other]
Title: Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition
Comments: Published in: Springer Nature International Journal of Applied Intelligence (2024)
Journal-ref: Applied Intelligence (2024)
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[429]  arXiv:2406.01622 (cross-list from q-bio.BM) [pdf, other]
Title: Sifting through the Noise: A Survey of Diffusion Probabilistic Models and Their Applications to Biomolecules
Comments: 31 pages, 6 figures
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[430]  arXiv:2406.01617 (cross-list from q-bio.BM) [pdf, ps, other]
Title: LightCPPgen: An Explainable Machine Learning Pipeline for Rational Design of Cell Penetrating Peptides
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[431]  arXiv:2406.01611 (cross-list from cs.IR) [pdf, other]
Title: System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes
Comments: Accepted at FAccT'24
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[432]  arXiv:2406.01609 (cross-list from cs.IR) [pdf, other]
Title: Judgement Citation Retrieval using Contextual Similarity
Comments: 14 pages, 16 images, Submitted to Multimedia Tools and Applications Springer journal
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[433]  arXiv:2406.01603 (cross-list from cs.IR) [pdf, other]
Title: Privacy-preserving recommender system using the data collaboration analysis for distributed datasets
Subjects: Information Retrieval (cs.IR); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[434]  arXiv:2406.01601 (cross-list from cs.DC) [pdf, other]
Title: Backpropogation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[435]  arXiv:2406.01599 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Markov Chain Monte Carlo with Gaussian Process Emulation for a 1D Hemodynamics Model of CTEPH
Subjects: Quantitative Methods (q-bio.QM); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Applications (stat.AP)
[436]  arXiv:2406.01157 (cross-list from quant-ph) [pdf, other]
Title: Quantum consistent neural/tensor networks for photonic circuits with strongly/weakly entangled states
Authors: Nicolas Allegra
Comments: 13 pages. Paper under review for Physical Review A
Subjects: Quantum Physics (quant-ph); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG)
[437]  arXiv:2406.00503 (cross-list from math.OC) [pdf, other]
Title: Schrödinger Bridge with Quadratic State Cost is Exactly Solvable
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Systems and Control (eess.SY); Mathematical Physics (math-ph); Machine Learning (stat.ML)
[438]  arXiv:2405.14785 (cross-list from cs.CV) [pdf, other]
Title: EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Comments: Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[439]  arXiv:2402.12908 (cross-list from cs.CV) [pdf, other]
Title: RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Comments: Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[440]  arXiv:2401.11708 (cross-list from cs.CV) [pdf, other]
Title: Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Comments: ICML 2024. Project: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[441]  arXiv:2105.13287 (cross-list from cs.DS) [pdf, other]
Title: Differentially Private Densest Subgraph Detection
Comments: Accepted by ICML 2021
Subjects: Data Structures and Algorithms (cs.DS); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)

Tue, 4 Jun 2024 (showing first 45 of 302 entries)

[442]  arXiv:2406.01588 [pdf, other]
Title: nn2poly: An R Package for Converting Neural Networks into Interpretable Polynomials
Authors: Pablo Morala (1 and 2), Jenny Alexandra Cifuentes (3), Rosa E. Lillo (1 and 2), Iñaki Ucar (1 and 2) ((1) uc3m-Santander Big Data Institute, Universidad Carlos III de Madrid. Spain., (2) Department of Statistics, Universidad Carlos III de Madrid. Spain., (3) ICADE, Department of Quantitative Methods, Faculty of Economics and Business Administration and the Institute for Research in Technology (IIT), ICAI School of Engineering, Universidad Pontificia Comillas. Spain.)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[443]  arXiv:2406.01581 [pdf, other]
Title: Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit
Comments: 34 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[444]  arXiv:2406.01577 [pdf, ps, other]
Title: An Equivalence Between Static and Dynamic Regret Minimization
Comments: 26 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[445]  arXiv:2406.01572 [pdf, other]
Title: Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
Subjects: Machine Learning (cs.LG)
[446]  arXiv:2406.01570 [pdf, ps, other]
Title: Single Trajectory Conformal Prediction
Comments: 16 pages
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Machine Learning (stat.ML)
[447]  arXiv:2406.01562 [pdf, other]
Title: A New View on Planning in Online Reinforcement Learning
Comments: Published in the Planning and Reinforcement Learning Workshop at ICAPS 2024. arXiv admin note: text overlap with arXiv:2206.02902
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[448]  arXiv:2406.01539 [pdf, other]
Title: Physics-informed deep learning and compressive collocation for high-dimensional diffusion-reaction equations: practical existence theory and numerics
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT); Numerical Analysis (math.NA)
[449]  arXiv:2406.01529 [pdf, other]
Title: How to Count Coughs: An Event-Based Framework for Evaluating Automatic Cough Detection Algorithm Performance
Subjects: Machine Learning (cs.LG)
[450]  arXiv:2406.01528 [pdf, other]
Title: Physics-Informed Neural Networks for Dynamic Process Operations with Limited Physical Knowledge and Data
Comments: manuscript (31 pages, 8 figures, 7 tables), supporting materials (11 pages, 3 figures, 3 tables)
Subjects: Machine Learning (cs.LG)
[451]  arXiv:2406.01521 [pdf, other]
Title: MOSEAC: Streamlined Variable Time Step Reinforcement Learning
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[452]  arXiv:2406.01481 [pdf, other]
Title: Learning from Streaming Data when Users Choose
Authors: Jinyan Su, Sarah Dean
Comments: Accepted by ICML24
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[453]  arXiv:2406.01477 [pdf, other]
Title: Finding Optimally Robust Data Mixtures via Concave Maximization
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[454]  arXiv:2406.01471 [pdf, ps, other]
Title: Inverse design of photonic surfaces on Inconel via multi-fidelity machine learning ensemble framework and high throughput femtosecond laser processing
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Optics (physics.optics)
[455]  arXiv:2406.01462 [pdf, other]
Title: Understanding Preference Fine-Tuning Through the Lens of Coverage
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[456]  arXiv:2406.01461 [pdf, other]
Title: Hardness of Learning Neural Networks under the Manifold Hypothesis
Subjects: Machine Learning (cs.LG); Differential Geometry (math.DG); Machine Learning (stat.ML)
[457]  arXiv:2406.01457 [pdf, other]
Title: Differentially Private Tabular Data Synthesis using Large Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[458]  arXiv:2406.01439 [pdf, other]
Title: Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[459]  arXiv:2406.01438 [pdf, other]
Title: Asynchronous Byzantine Federated Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[460]  arXiv:2406.01435 [pdf, other]
Title: Learning Analysis of Kernel Ridgeless Regression with Asymmetric Kernel Learning
Comments: arXiv admin note: text overlap with arXiv:2310.05236
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[461]  arXiv:2406.01424 [pdf, other]
Title: Universal In-Context Approximation By Prompting Fully Recurrent Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[462]  arXiv:2406.01423 [pdf, other]
Title: Value Improved Actor Critic Algorithms
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[463]  arXiv:2406.01417 [pdf, other]
Title: Mixup Augmentation with Multiple Interpolations
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[464]  arXiv:2406.01416 [pdf, other]
Title: Adapting Conformal Prediction to Distribution Shifts Without Labels
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[465]  arXiv:2406.01414 [pdf, other]
Title: CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework
Comments: arXiv admin note: text overlap with arXiv:2307.04131
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[466]  arXiv:2406.01411 [pdf, other]
Title: Using Constraints to Discover Sparse and Alternative Subgroup Descriptions
Authors: Jakob Bach
Subjects: Machine Learning (cs.LG)
[467]  arXiv:2406.01389 [pdf, other]
Title: RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[468]  arXiv:2406.01386 [pdf, ps, other]
Title: Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Subjects: Machine Learning (cs.LG)
[469]  arXiv:2406.01378 [pdf, ps, other]
Title: A Theory of Learnability for Offline Decision Making
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[470]  arXiv:2406.01361 [pdf, other]
Title: Learning to Play Atari in a World of Tokens
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG)
[471]  arXiv:2406.01345 [pdf, other]
Title: BMRS: Bayesian Model Reduction for Structured Pruning
Comments: 17 pages; 8 figures; 2 tables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[472]  arXiv:2406.01317 [pdf, other]
Title: The Intelligible and Effective Graph Neural Additive Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[473]  arXiv:2406.01290 [pdf, other]
Title: Resource-constrained Fairness
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[474]  arXiv:2406.01282 [pdf, other]
Title: Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE
Comments: The short version of this work will appear in the Proceedings of the 2024 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2024)
Subjects: Machine Learning (cs.LG)
[475]  arXiv:2406.01274 [pdf, other]
Title: Expected Grad-CAM: Towards gradient faithfulness
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[476]  arXiv:2406.01257 [pdf, other]
Title: What makes unlearning hard and what to do about it
Subjects: Machine Learning (cs.LG)
[477]  arXiv:2406.01255 [pdf, other]
Title: On the Nonlinearity of Layer Normalization
Comments: 42 pages, accepted to ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[478]  arXiv:2406.01249 [pdf, other]
Title: Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[479]  arXiv:2406.01234 [pdf, other]
Title: Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
[480]  arXiv:2406.01229 [pdf, other]
Title: AGALE: A Graph-Aware Continual Learning Evaluation Framework
Subjects: Machine Learning (cs.LG)
[481]  arXiv:2406.01192 [pdf, other]
Title: Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Comments: 25 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[482]  arXiv:2406.01189 [pdf, other]
Title: MultiMax: Sparse and Multi-Modal Attention Learning
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[483]  arXiv:2406.01183 [pdf, other]
Title: Automatic Input Feature Relevance via Spectral Neural Networks
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI)
[484]  arXiv:2406.01178 [pdf, other]
Title: Deep Reinforcement Learning Behavioral Mode Switching Using Optimal Control Based on a Latent Space Objective
Comments: Published in the proceedings of the 32nd Mediterranean Conference on Control and Automation [MED2024]
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[485]  arXiv:2406.01175 [pdf, other]
Title: NeoRL: Efficient Exploration for Nonepisodic RL
Subjects: Machine Learning (cs.LG)
[486]  arXiv:2406.01163 [pdf, other]
Title: When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL
Subjects: Machine Learning (cs.LG)
[ total of 1090 entries: 1-250 | 237-486 | 487-736 | 737-986 | 987-1090 ]
[ showing 250 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)