We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 106

[ total of 1410 entries: 1-396 | 107-502 | 503-898 | 899-1294 | 1295-1410 ]
[ showing 396 entries per page: fewer | more | all ]

Wed, 29 May 2024 (continued, showing last 150 of 256 entries)

[107]  arXiv:2405.17580 [pdf, other]
Title: Mixed Dynamics In Linear Networks: Unifying the Lazy and Active Regimes
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[108]  arXiv:2405.17575 [pdf, other]
Title: Interpretable Prognostics with Concept Bottleneck Models
Comments: Under review
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[109]  arXiv:2405.17569 [pdf, other]
Title: Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese
Comments: 5 pages, 2 figures, 1 table. Published in Artificial Intelligence in Medicine (AIME) 2023
Journal-ref: Artificial Intellingence in Medicine Proceedings 2023, page 271-275
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[110]  arXiv:2405.17556 [pdf, other]
Title: Probabilistic Verification of Neural Networks using Branch and Bound
Comments: 16 pages, 2 figures, 22 pages references and appendix, including 4 more figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[111]  arXiv:2405.17544 [pdf, other]
Title: Towards Human-AI Complementarity with Predictions Sets
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
[112]  arXiv:2405.17535 [pdf, other]
Title: Calibrated Dataset Condensation for Faster Hyperparameter Search
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[113]  arXiv:2405.17534 [pdf, other]
Title: SMR: State Memory Replay for Long Sequence Modeling
Subjects: Machine Learning (cs.LG)
[114]  arXiv:2405.17529 [pdf, other]
Title: Clip Body and Tail Separately: High Probability Guarantees for DPSGD with Heavy Tails
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[115]  arXiv:2405.17527 [pdf, other]
Title: Unisolver: PDE-Conditional Transformers Are Universal PDE Solvers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Numerical Analysis (math.NA)
[116]  arXiv:2405.17525 [pdf, ps, other]
Title: SmoothGNN: Smoothing-based GNN for Unsupervised Node Anomaly Detection
Subjects: Machine Learning (cs.LG)
[117]  arXiv:2405.17522 [pdf, other]
Title: Efficient Model Compression for Hierarchical Federated Learning
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[118]  arXiv:2405.17517 [pdf, other]
Title: WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average
Authors: Louis Fournier (MLIA), Adel Nabli (MLIA, Mila), Masih Aminbeidokhti (ETS), Marco Pedersoli (ETS), Eugene Belilovsky (Mila), Edouard Oyallon
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[119]  arXiv:2405.17512 [pdf, other]
Title: On Fairness of Low-Rank Adaptation of Large Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[120]  arXiv:2405.17509 [pdf, other]
Title: Reference Neural Operators: Learning the Smooth Dependence of Solutions of PDEs on Geometric Deformations
Subjects: Machine Learning (cs.LG)
[121]  arXiv:2405.17508 [pdf, other]
Title: Unveiling the Secrets: How Masking Strategies Shape Time Series Imputation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[122]  arXiv:2405.17507 [pdf, other]
Title: Enhancing Sustainable Urban Mobility Prediction with Telecom Data: A Spatio-Temporal Framework Approach
Comments: 8 Figures, 5 Tables. Just accepted by IJCAI (to appear)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI)
[123]  arXiv:2405.17506 [pdf, other]
Title: Subspace Node Pruning
Comments: 14 pages, 8 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[124]  arXiv:2405.17505 [pdf, other]
Title: Predicting Rental Price of Lane Houses in Shanghai with Machine Learning Methods and Large Language Models
Comments: 13 pages, 11 figures, 39 references
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[125]  arXiv:2405.17502 [pdf, other]
Title: Exploring Nutritional Impact on Alzheimer's Mortality: An Explainable AI Approach
Comments: 5 pages, 1 figure, 5 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[126]  arXiv:2405.17501 [pdf, other]
Title: Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[127]  arXiv:2405.17497 [pdf, other]
Title: Secure Hierarchical Federated Learning in Vehicular Networks Using Dynamic Client Selection and Anomaly Detection
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[128]  arXiv:2405.17495 [pdf, other]
Title: Vertical Federated Learning for Effectiveness, Security, Applicability: A Survey
Comments: 31 pages, 9 figures, 10 tables
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[129]  arXiv:2405.17494 [pdf, other]
Title: Transitional Uncertainty with Layered Intermediate Predictions
Subjects: Machine Learning (cs.LG)
[130]  arXiv:2405.17493 [pdf, other]
Title: Overcoming Negative Transfer by Online Selection: Distant Domain Adaptation for Fault Diagnosis
Comments: 8 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[131]  arXiv:2405.17490 [pdf, other]
Title: Revisit, Extend, and Enhance Hessian-Free Influence Functions
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[132]  arXiv:2405.17489 [pdf, other]
Title: On the Inflation of KNN-Shapley Value
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[133]  arXiv:2405.17488 [pdf, other]
Title: Pattern-Based Time-Series Risk Scoring for Anomaly Detection and Alert Filtering -- A Predictive Maintenance Case Study
Authors: Elad Liebman
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[134]  arXiv:2405.17485 [pdf, other]
Title: $\textit{Comet:}$ A $\underline{Com}$munication-$\underline{e}$fficient and Performant Approxima$\underline{t}$ion for Private Transformer Inference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[135]  arXiv:2405.17484 [pdf, other]
Title: Bridging The Gap between Low-rank and Orthogonal Adaptation via Householder Reflection Adaptation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[136]  arXiv:2405.17481 [pdf, ps, other]
Title: Improving Simulation Regression Efficiency using a Machine Learning-based Method in Design Verification
Comments: Published in DVCon Europe 2022
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[137]  arXiv:2405.17479 [pdf, other]
Title: A rationale from frequency perspective for grokking in training neural network
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[138]  arXiv:2405.17478 [pdf, other]
Title: ROSE: Register Assisted General Time Series Forecasting with Decomposed Frequency Learning
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[139]  arXiv:2405.17477 [pdf, other]
Title: OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[140]  arXiv:2405.17476 [pdf, other]
Title: How to Leverage Diverse Demonstrations in Offline Imitation Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[141]  arXiv:2405.17474 [pdf, other]
Title: Federated Offline Policy Optimization with Dual Regularization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[142]  arXiv:2405.17473 [pdf, other]
Title: Repeat-Aware Neighbor Sampling for Dynamic Graph Learning
Comments: Accepted by KDD 2024, Research Track
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[143]  arXiv:2405.17472 [pdf, other]
Title: FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing
Authors: Kai Huang, Wei Gao
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2405.17471 [pdf, other]
Title: Momentum-Based Federated Reinforcement Learning with Interaction and Communication Efficiency
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[145]  arXiv:2405.17470 [pdf, other]
Title: Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[146]  arXiv:2405.17469 [pdf, other]
Title: A Dataset for Research on Water Sustainability
Comments: Accepted by ACM e-Energy 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[147]  arXiv:2405.17468 [pdf, other]
Title: Deep Activity Model: A Generative Approach for Human Mobility Pattern Synthesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[148]  arXiv:2405.17467 [pdf, other]
Title: Sports center customer segmentation: a case study
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[149]  arXiv:2405.17466 [pdf, other]
Title: Distributed Continual Learning
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[150]  arXiv:2405.17465 [pdf, other]
Title: Application of Machine Learning in Agriculture: Recent Trends and Future Research Avenues
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[151]  arXiv:2405.17464 [pdf, other]
Title: Data Valuation by Leveraging Global and Local Statistical Information
Comments: 12 pages, 8 figures. arXiv admin note: text overlap with arXiv:2306.10577 by other authors
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[152]  arXiv:2405.17462 [pdf, other]
Title: Ferrari: Federated Feature Unlearning via Optimizing Feature Sensitivity
Comments: 9 pages of main paper
Subjects: Machine Learning (cs.LG)
[153]  arXiv:2405.17461 [pdf, other]
Title: EMR-Merging: Tuning-Free High-Performance Model Merging
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2405.17460 [pdf, ps, other]
Title: Investigation of Customized Medical Decision Algorithms Utilizing Graph Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[155]  arXiv:2405.17459 [pdf, ps, other]
Title: Integrating Medical Imaging and Clinical Reports Using Multimodal Deep Learning for Advanced Disease Analysis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2405.17458 [pdf, other]
Title: Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[157]  arXiv:2405.17451 [pdf, other]
Title: Green AI in Action: Strategic Model Selection for Ensembles in Production
Comments: 9 pages. Accepted at the 1st ACM International Conference on AI-powered Software (AIware), 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Software Engineering (cs.SE)
[158]  arXiv:2405.17445 [pdf, other]
Title: On margin-based generalization prediction in deep neural networks
Authors: Coenraad Mouton
Comments: PhD Thesis
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[159]  arXiv:2405.17440 [pdf, other]
Title: CataLM: Empowering Catalyst Design Through Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[160]  arXiv:2405.18427 (cross-list from stat.ML) [pdf, other]
Title: Classifying Overlapping Gaussian Mixtures in High Dimensions: From Optimal Classifiers to Neural Nets
Comments: 19 pages, 14 figures
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[161]  arXiv:2405.18415 (cross-list from cs.CV) [pdf, other]
Title: Why are Visually-Grounded Language Models Bad at Image Classification?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[162]  arXiv:2405.18414 (cross-list from cs.CL) [pdf, other]
Title: Don't Forget to Connect! Improving RAG with Graph-based Reranking
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[163]  arXiv:2405.18400 (cross-list from cs.CL) [pdf, other]
Title: Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
Comments: 22 pages, 15 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[164]  arXiv:2405.18386 (cross-list from cs.SD) [pdf, other]
Title: Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
Comments: Demo are available at: this https URL
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[165]  arXiv:2405.18383 (cross-list from cs.CV) [pdf, other]
[166]  arXiv:2405.18379 (cross-list from stat.ML) [pdf, other]
Title: A Note on the Prediction-Powered Bootstrap
Authors: Tijana Zrnic
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[167]  arXiv:2405.18373 (cross-list from stat.ML) [pdf, other]
Title: A Hessian-Aware Stochastic Differential Equation for Modelling SGD
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[168]  arXiv:2405.18369 (cross-list from cs.CL) [pdf, other]
Title: PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[169]  arXiv:2405.18359 (cross-list from cs.CL) [pdf, other]
Title: Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[170]  arXiv:2405.18358 (cross-list from cs.CL) [pdf, other]
Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[171]  arXiv:2405.18335 (cross-list from cs.CL) [pdf, other]
Title: Interpretable classification of wiki-review streams
Journal-ref: (2023) IEEE Access
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[172]  arXiv:2405.18334 (cross-list from cs.DB) [pdf, other]
Title: SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches
Journal-ref: Published on International Conference on Very Large Databases 2024
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[173]  arXiv:2405.18327 (cross-list from q-bio.QM) [pdf, ps, other]
Title: Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial
Comments: 19 pages, 4 Figures
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[174]  arXiv:2405.18306 (cross-list from stat.ML) [pdf, other]
Title: Learning Staged Trees from Incomplete Data
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[175]  arXiv:2405.18299 (cross-list from cs.CV) [pdf, other]
Title: Deep Learning Innovations for Underwater Waste Detection: An In-Depth Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[176]  arXiv:2405.18298 (cross-list from stat.ML) [pdf, other]
Title: Context-Specific Refinements of Bayesian Network Classifiers
Comments: arXiv admin note: text overlap with arXiv:2206.06970
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[177]  arXiv:2405.18284 (cross-list from stat.ML) [pdf, other]
Title: Adaptive debiased SGD in high-dimensional GLMs with steaming data
Comments: 37 pages, 4 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[178]  arXiv:2405.18278 (cross-list from astro-ph.EP) [pdf, other]
Title: NotPlaNET: Removing False Positives from Planet Hunters TESS with Machine Learning
Authors: Valentina Tardugno Poleo (NYU), Nora Eisner (CCA), David W. Hogg (NYU, CCA)
Comments: Under review at The Astronomical Journal
Subjects: Earth and Planetary Astrophysics (astro-ph.EP); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG)
[179]  arXiv:2405.18274 (cross-list from math.ST) [pdf, other]
Title: Signal-Plus-Noise Decomposition of Nonlinear Spiked Random Matrix Models
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
[180]  arXiv:2405.18273 (cross-list from math.OC) [pdf, other]
Title: Synchronization on circles and spheres with nonlinear interactions
Comments: 28 pages, 1 figure
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[181]  arXiv:2405.18267 (cross-list from eess.IV) [pdf, other]
Title: CT-based brain ventricle segmentation via diffusion Schrödinger Bridge without target domain ground truths
Comments: Early acceptance at MICCAI2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[182]  arXiv:2405.18236 (cross-list from cs.CR) [pdf, other]
Title: Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS
Comments: 8 pages, 7 figures, 8 tables. Accepted to STAST'24, 14th International Workshop on Socio-Technical Aspects in Security, Affiliated with the 9th IEEE European Symposium on Security and Privacy, this https URL
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[183]  arXiv:2405.18221 (cross-list from math.OC) [pdf, other]
Title: Recurrent Natural Policy Gradient for POMDPs
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[184]  arXiv:2405.18220 (cross-list from stat.ML) [pdf, other]
Title: Non-negative Tensor Mixture Learning for Discrete Density Estimation
Comments: 24 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[185]  arXiv:2405.18209 (cross-list from cs.RO) [pdf, other]
Title: Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[186]  arXiv:2405.18208 (cross-list from cs.AI) [pdf, other]
Title: A Human-Like Reasoning Framework for Multi-Phases Planning Task with Large Language Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[187]  arXiv:2405.18196 (cross-list from cs.RO) [pdf, other]
Title: Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning
Comments: Robotics: Science and Systems (RSS) 2024. Videos are available on our project webpage at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[188]  arXiv:2405.18180 (cross-list from cs.AI) [pdf, other]
Title: Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[189]  arXiv:2405.18176 (cross-list from stat.ML) [pdf, other]
Title: SEMF: Supervised Expectation-Maximization Framework for Predicting Intervals
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[190]  arXiv:2405.18172 (cross-list from cs.CV) [pdf, other]
Title: AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
Comments: Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[191]  arXiv:2405.18153 (cross-list from cs.SD) [pdf, other]
Title: Practical aspects for the creation of an audio dataset from field recordings with optimized labeling budget with AI-assisted strategy
Comments: Submitted to ICML 2024 Workshop on Data-Centric Machine Learning Research
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[192]  arXiv:2405.18146 (cross-list from cs.IR) [pdf, other]
Title: Unified Low-rank Compression Framework for Click-through Rate Prediction
Comments: Accepted by KDD2024 Applied Data Science (ADS) Track
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
[193]  arXiv:2405.18119 (cross-list from cs.CV) [pdf, other]
Title: Low-Resource Crop Classification from Multi-Spectral Time Series Using Lossless Compressors
Comments: 8 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[194]  arXiv:2405.18095 (cross-list from stat.ML) [pdf, other]
Title: Is machine learning good or bad for the natural sciences?
Authors: David W. Hogg (NYU, MPIA, Flatiron), Soledad Villar (JHU, Flatiron)
Comments: A Position Paper accepted for publication in the 2024 International Conference on Machine Learning
Subjects: Machine Learning (stat.ML); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an)
[195]  arXiv:2405.18093 (cross-list from cs.DC) [pdf, other]
Title: Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters
Comments: published at DATE 2024
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[196]  arXiv:2405.18091 (cross-list from math.ST) [pdf, ps, other]
Title: An adaptive transfer learning perspective on classification in non-stationary environments
Authors: Henry W J Reeve
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG)
[197]  arXiv:2405.18068 (cross-list from cs.IR) [pdf, other]
Title: A Survey of Latent Factor Models in Recommender Systems
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[198]  arXiv:2405.18042 (cross-list from cs.CV) [pdf, other]
Title: Visualizing the loss landscape of Self-supervised Vision Transformer
Comments: NeurIPS 2023 Workshop: Self-Supervised Learning - Theory and Practice
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[199]  arXiv:2405.18031 (cross-list from math.OC) [pdf, other]
Title: Lower Bounds and Optimal Algorithms for Non-Smooth Convex Decentralized Optimization over Time-Varying Networks
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[200]  arXiv:2405.18029 (cross-list from cs.CV) [pdf, other]
Title: Are Image Distributions Indistinguishable to Humans Indistinguishable to Classifiers?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[201]  arXiv:2405.18009 (cross-list from cs.CL) [pdf, other]
Title: Exploring Context Window of Large Language Models via Decomposed Positional Vectors
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[202]  arXiv:2405.17995 (cross-list from cs.CV) [pdf, other]
Title: DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[203]  arXiv:2405.17969 (cross-list from cs.CL) [pdf, other]
Title: Knowledge Circuits in Pretrained Transformers
Comments: Work in progress, 25 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[204]  arXiv:2405.17955 (cross-list from stat.ML) [pdf, other]
Title: Efficient Prior Calibration From Indirect Data
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[205]  arXiv:2405.17931 (cross-list from cs.CL) [pdf, other]
Title: Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[206]  arXiv:2405.17927 (cross-list from cs.AI) [pdf, other]
Title: The Evolution of Multimodal Model Architectures
Comments: 30 pages, 6 tables, 7 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[207]  arXiv:2405.17905 (cross-list from cs.CV) [pdf, other]
Title: Cycle-YOLO: A Efficient and Robust Framework for Pavement Damage Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[208]  arXiv:2405.17902 (cross-list from cs.AI) [pdf, other]
Title: Boosting Protein Language Models with Negative Sample Mining
Comments: 17 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[209]  arXiv:2405.17890 (cross-list from cs.IR) [pdf, other]
Title: SLMRec: Empowering Small Language Models for Sequential Recommendation
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[210]  arXiv:2405.17875 (cross-list from math.OC) [pdf, other]
Title: BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[211]  arXiv:2405.17842 (cross-list from cs.CV) [pdf, other]
Title: Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[212]  arXiv:2405.17836 (cross-list from eess.SP) [pdf, other]
Title: An Innovative Networks in Federated Learning
Comments: Work in progress
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Machine Learning (stat.ML)
[213]  arXiv:2405.17823 (cross-list from stat.ML) [pdf, other]
Title: Spectral Truncation Kernels: Noncommutativity in $C^*$-algebraic Kernel Machines
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Operator Algebras (math.OA)
[214]  arXiv:2405.17816 (cross-list from cs.CV) [pdf, other]
Title: Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[215]  arXiv:2405.17756 (cross-list from eess.IV) [pdf, ps, other]
Title: Motion-Informed Deep Learning for Brain MR Image Reconstruction Framework
Comments: 22 pages, 7 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[216]  arXiv:2405.17743 (cross-list from cs.CL) [pdf, other]
Title: ORLM: Training Large Language Models for Optimization Modeling
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[217]  arXiv:2405.17730 (cross-list from cs.CV) [pdf, other]
Title: MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance
Authors: Yake Wei, Di Hu
Comments: Accepted by ICML2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[218]  arXiv:2405.17720 (cross-list from cs.CV) [pdf, other]
Title: MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[219]  arXiv:2405.17718 (cross-list from cs.CV) [pdf, other]
Title: AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[220]  arXiv:2405.17713 (cross-list from cs.AI) [pdf, other]
Title: AI Alignment with Changing and Influenceable Reward Functions
Comments: Accepted to ICML 2024
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[221]  arXiv:2405.17712 (cross-list from cs.CL) [pdf, other]
Title: CLAIM Your Data: Enhancing Imputation Accuracy with Contextual Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[222]  arXiv:2405.17700 (cross-list from cs.GT) [pdf, other]
Title: Learning Social Welfare Functions
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[223]  arXiv:2405.17693 (cross-list from stat.ML) [pdf, other]
Title: Tamed Langevin sampling under weaker conditions
Comments: 32 pages, 2 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC); Probability (math.PR)
[224]  arXiv:2405.17691 (cross-list from cs.AI) [pdf, ps, other]
Title: Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments
Comments: PhD thesis
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[225]  arXiv:2405.17673 (cross-list from cs.CV) [pdf, other]
Title: Fast Samplers for Inverse Problems in Iterative Refinement Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[226]  arXiv:2405.17667 (cross-list from astro-ph.SR) [pdf, other]
Title: Hunting for Polluted White Dwarfs and Other Treasures with Gaia XP Spectra and Unsupervised Machine Learning
Comments: 16 pages, 10 figures, submitted to ApJ on May 20, 2024
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Earth and Planetary Astrophysics (astro-ph.EP); Machine Learning (cs.LG)
[227]  arXiv:2405.17666 (cross-list from stat.ML) [pdf, other]
Title: Structured Partial Stochasticity in Bayesian Neural Networks
Authors: Tommy Rochussen
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[228]  arXiv:2405.17615 (cross-list from cs.SD) [pdf, other]
Title: Listenable Maps for Zero-Shot Audio Classifiers
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[229]  arXiv:2405.17613 (cross-list from cs.CV) [pdf, other]
Title: A Framework for Multi-modal Learning: Jointly Modeling Inter- & Intra-Modality Dependencies
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[230]  arXiv:2405.17612 (cross-list from physics.flu-dyn) [pdf, ps, other]
Title: A note on the error analysis of data-driven closure models for large eddy simulations of turbulence
Subjects: Fluid Dynamics (physics.flu-dyn); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[231]  arXiv:2405.17610 (cross-list from cs.CL) [pdf, other]
Title: Explainable machine learning multi-label classification of Spanish legal judgements
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[232]  arXiv:2405.17607 (cross-list from cs.IR) [pdf, other]
Title: Advancing Cultural Inclusivity: Optimizing Embedding Spaces for Balanced Music Recommendations
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[233]  arXiv:2405.17587 (cross-list from cs.IR) [pdf, other]
Title: RAGSys: Item-Cold-Start Recommender as RAG System
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[234]  arXiv:2405.17573 (cross-list from stat.ML) [pdf, other]
Title: Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[235]  arXiv:2405.17566 (cross-list from astro-ph.CO) [pdf, other]
Title: A deep-learning algorithm to disentangle self-interacting dark matter and AGN feedback models
Authors: David Harvey
Comments: Accepted Nature Astronomy
Subjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Machine Learning (cs.LG); Computational Physics (physics.comp-ph)
[236]  arXiv:2405.17541 (cross-list from quant-ph) [pdf, other]
Title: Approximately-symmetric neural networks for quantum spin liquids
Comments: 5+10 pages
Subjects: Quantum Physics (quant-ph); Disordered Systems and Neural Networks (cond-mat.dis-nn); Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (cs.LG)
[237]  arXiv:2405.17538 (cross-list from hep-th) [pdf, other]
Title: Bayesian RG Flow in Neural Network Field Theories
Comments: 46 pages, 10 figures, 2 tables
Subjects: High Energy Physics - Theory (hep-th); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
[238]  arXiv:2405.17533 (cross-list from cs.AI) [pdf, other]
Title: PAE: LLM-based Product Attribute Extraction for E-Commerce Fashion Trends
Comments: Attribute Extraction, PDF files, Bert Embedding, Hashtag, Large Language Model (LLM), Text and Images
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[239]  arXiv:2405.17523 (cross-list from cs.CV) [pdf, other]
Title: Locally Testing Model Detections for Semantic Global Concepts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[240]  arXiv:2405.17516 (cross-list from cs.NE) [pdf, other]
Title: Time Elastic Neural Networks
Authors: Pierre-François Marteau (EXPRESSION)
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[241]  arXiv:2405.17486 (cross-list from quant-ph) [pdf, other]
Title: eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels
Comments: 19 pages, 8 figures
Subjects: Quantum Physics (quant-ph); Emerging Technologies (cs.ET); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[242]  arXiv:2405.17483 (cross-list from eess.IV) [pdf, other]
Title: Concept-based Explainable Malignancy Scoring on Pulmonary Nodules in CT Images
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[243]  arXiv:2405.17475 (cross-list from cs.CV) [pdf, other]
Title: How Culturally Aware are Vision-Language Models?
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[244]  arXiv:2405.17463 (cross-list from cs.GT) [pdf, other]
Title: No Algorithmic Collusion in Two-Player Blindfolded Game with Thompson Sampling
Subjects: Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[245]  arXiv:2405.17457 (cross-list from cs.CV) [pdf, other]
Title: Data-Free Federated Class Incremental Learning with Diffusion-Based Generative Memory
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[246]  arXiv:2405.17456 (cross-list from cs.CV) [pdf, other]
Title: Optimized Linear Measurements for Inverse Problems using Diffusion-Based Image Generation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[247]  arXiv:2405.17455 (cross-list from cs.CV) [pdf, other]
Title: WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Machine Learning (stat.ML)
[248]  arXiv:2405.17450 (cross-list from cs.CV) [pdf, other]
Title: The Power of Next-Frame Prediction for Learning Physical Laws
Comments: 7 Figures, 12 Pages, 1 Table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[249]  arXiv:2405.17449 (cross-list from cs.CV) [pdf, ps, other]
Title: Image Based Character Recognition, Documentation System To Decode Inscription From Temple
Comments: This research paper is a part of capstone project submitted to VIT Chennai, VIT University
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[250]  arXiv:2405.17447 (cross-list from cs.CV) [pdf, other]
Title: How to train your ViT for OOD Detection
Comments: arXiv admin note: text overlap with arXiv:2306.00826
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[251]  arXiv:2405.17444 (cross-list from cs.CV) [pdf, other]
Title: Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network
Authors: Min Hun Lee
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[252]  arXiv:2405.17442 (cross-list from cs.NI) [pdf, other]
Title: Leveraging Machine Learning for Accurate IoT Device Identification in Dynamic Wireless Contexts
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Operating Systems (cs.OS)
[253]  arXiv:2405.17439 (cross-list from cs.NI) [pdf, other]
Title: An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Systems and Control (eess.SY)
[254]  arXiv:2405.17438 (cross-list from cs.PL) [pdf, other]
Title: An LLM-Tool Compiler for Fused Parallel Function Calling
Subjects: Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[255]  arXiv:2405.17436 (cross-list from cs.NI) [pdf, other]
Title: Intelligent Hybrid Resource Allocation in MEC-assisted RAN Slicing Network
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[256]  arXiv:2405.16153 (cross-list from cs.CL) [pdf, other]
Title: DefSent+: Improving sentence embeddings of language models by projecting definition sentences into a quasi-isotropic or isotropic vector space of unlimited dictionary entries
Authors: Xiaodong Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Tue, 28 May 2024 (showing first 246 of 393 entries)

[257]  arXiv:2405.17425 [pdf, other]
Title: From Neurons to Neutrons: A Case Study in Interpretability
Comments: International Conference on Machine Learning (ICML) 2024
Subjects: Machine Learning (cs.LG); Nuclear Theory (nucl-th)
[258]  arXiv:2405.17420 [pdf, other]
Title: Survival of the Fittest Representation: A Case Study with Modular Addition
Subjects: Machine Learning (cs.LG)
[259]  arXiv:2405.17416 [pdf, other]
Title: A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning
Comments: Accepted at RLC 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[260]  arXiv:2405.17404 [pdf, other]
Title: Spectral Greedy Coresets for Graph Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[261]  arXiv:2405.17403 [pdf, other]
Title: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[262]  arXiv:2405.17401 [pdf, other]
Title: RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Comments: Preprint. Under review
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[263]  arXiv:2405.17399 [pdf, other]
Title: Transformers Can Do Arithmetic with the Right Embeddings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[264]  arXiv:2405.17391 [pdf, other]
Title: Dataset-learning duality and emergent criticality
Comments: 27 pages, 9 figures
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Neural and Evolutionary Computing (cs.NE)
[265]  arXiv:2405.17382 [pdf, other]
Title: ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Comments: 20 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[266]  arXiv:2405.17378 [pdf, ps, other]
Title: RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR)
[267]  arXiv:2405.17377 [pdf, other]
Title: How Does Perfect Fitting Affect Representation Learning? On the Training Dynamics of Representations in Deep Neural Networks
Subjects: Machine Learning (cs.LG)
[268]  arXiv:2405.17374 [pdf, other]
Title: Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models
Subjects: Machine Learning (cs.LG)
[269]  arXiv:2405.17366 [pdf, other]
Title: EM-GANSim: Real-time and Accurate EM Simulation Using Conditional GANs for 3D Indoor Scenes
Comments: 10 pages, 8 figures, 5 tables
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[270]  arXiv:2405.17358 [pdf, other]
Title: Rethinking Transformers in Solving POMDPs
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[271]  arXiv:2405.17352 [pdf, other]
Title: Assessing the significance of longitudinal data in Alzheimer's Disease forecasting
Subjects: Machine Learning (cs.LG)
[272]  arXiv:2405.17346 [pdf, other]
Title: Prompt Optimization with Human Feedback
Comments: Preprint, 18 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[273]  arXiv:2405.17339 [pdf, other]
Title: Physics-Informed Real NVP for Satellite Power System Fault Detection
Comments: Accepted at International Conference on Advanced Intelligent Mechatronics (AIM) 2024
Subjects: Machine Learning (cs.LG); Robotics (cs.RO)
[274]  arXiv:2405.17324 [pdf, other]
Title: Leveraging Offline Data in Linear Latent Bandits
Comments: 40 pages. 14 pages for main paper, 26 pages for references + appendix
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[275]  arXiv:2405.17311 [pdf, other]
Title: Probabilistic Graph Rewiring via Virtual Nodes
Comments: arXiv admin note: text overlap with arXiv:2310.02156
Subjects: Machine Learning (cs.LG)
[276]  arXiv:2405.17309 [pdf, other]
Title: Survey of Graph Neural Network for Internet of Things and NextG Networks
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[277]  arXiv:2405.17293 [pdf, other]
Title: Efficient Ensembles Improve Training Data Attribution
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[278]  arXiv:2405.17287 [pdf, other]
Title: Opinion-Guided Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[279]  arXiv:2405.17283 [pdf, other]
Title: Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery
Comments: minor typo fixed
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[280]  arXiv:2405.17277 [pdf, other]
Title: Gradients of Functions of Large Matrices
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[281]  arXiv:2405.17272 [pdf, other]
Title: DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[282]  arXiv:2405.17267 [pdf, other]
Title: FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation
Comments: 35 pages
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[283]  arXiv:2405.17260 [pdf, other]
Title: Accelerating Simulation of Two-Phase Flows with Neural PDE Surrogates
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Fluid Dynamics (physics.flu-dyn)
[284]  arXiv:2405.17258 [pdf, other]
Title: $\textit{Trans-LoRA}$: towards data-free Transferable Parameter Efficient Finetuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[285]  arXiv:2405.17253 [pdf, other]
Title: Gaussian Embedding of Temporal Networks
Journal-ref: IEEE Access ( Volume: 11, 2023) Page(s): 117971 - 117983
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[286]  arXiv:2405.17247 [pdf, other]
[287]  arXiv:2405.17243 [pdf, other]
Title: Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
Comments: Published at the Reinforcement Learning Conference 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[288]  arXiv:2405.17233 [pdf, other]
Title: CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs
Subjects: Machine Learning (cs.LG)
[289]  arXiv:2405.17222 [pdf, other]
Title: A Retrospective of the Tutorial on Opportunities and Challenges of Online Deep Learning
Comments: Accepted for publication on ECML-PKDD 2023 joint Post-Workshop Proceeding
Subjects: Machine Learning (cs.LG)
[290]  arXiv:2405.17216 [pdf, other]
Title: Autoformalizing Euclidean Geometry
Comments: Accepted to ICML 2024. The first two authors contributed equally
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO); Machine Learning (stat.ML)
[291]  arXiv:2405.17211 [pdf, other]
Title: Spectral-Refiner: Fine-Tuning of Accurate Spatiotemporal Neural Operator for Turbulent Flows
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Fluid Dynamics (physics.flu-dyn)
[292]  arXiv:2405.17209 [pdf, other]
Title: How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator
Comments: 9 pages, 9 figures
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Artificial Intelligence (cs.AI)
[293]  arXiv:2405.17198 [pdf, other]
Title: Convex Relaxation for Solving Large-Margin Classifiers in Hyperbolic Space
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[294]  arXiv:2405.17181 [pdf, other]
Title: Spectral regularization for adversarially-robust representation learning
Comments: 15 + 15 pages, 8 + 11 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[295]  arXiv:2405.17170 [pdf, other]
Title: Forecasting Four Business Cycle Phases Using Machine Learning: A Case Study of US and EuroZone
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[296]  arXiv:2405.17164 [pdf, other]
Title: WeiPer: OOD Detection using Weight Perturbations of Class Projections
Subjects: Machine Learning (cs.LG)
[297]  arXiv:2405.17163 [pdf, other]
Title: Injecting Hamiltonian Architectural Bias into Deep Graph Networks for Long-Range Propagation
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY)
[298]  arXiv:2405.17151 [pdf, other]
Title: Smoke and Mirrors in Causal Downstream Tasks
Subjects: Machine Learning (cs.LG)
[299]  arXiv:2405.17132 [pdf, other]
Title: Your decision path does matter in pre-training industrial recommenders with multi-source behaviors
Subjects: Machine Learning (cs.LG)
[300]  arXiv:2405.17130 [pdf, other]
Title: Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[301]  arXiv:2405.17111 [pdf, other]
Title: Diffusion Bridge AutoEncoders for Unsupervised Representation Learning
Subjects: Machine Learning (cs.LG)
[302]  arXiv:2405.17108 [pdf, ps, other]
Title: Finding good policies in average-reward Markov Decision Processes without prior knowledge
Subjects: Machine Learning (cs.LG)
[303]  arXiv:2405.17098 [pdf, other]
Title: Q-value Regularized Transformer for Offline Reinforcement Learning
Comments: Published at ICML 2024
Subjects: Machine Learning (cs.LG)
[304]  arXiv:2405.17088 [pdf, other]
Title: Phase Transitions in the Output Distribution of Large Language Models
Comments: 21 pages, 4 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[305]  arXiv:2405.17081 [pdf, other]
Title: Effective Layer Pruning Through Similarity Metric Perspective
Comments: Code available at: github.com/IanPons/CKA-Layer-Pruning
Subjects: Machine Learning (cs.LG)
[306]  arXiv:2405.17075 [pdf, other]
Title: Interaction-Force Transport Gradient Flows
Subjects: Machine Learning (cs.LG); Analysis of PDEs (math.AP); Machine Learning (stat.ML)
[307]  arXiv:2405.17068 [pdf, other]
Title: The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models
Comments: "One often meets his destiny on the road he takes to avoid it" - Master Oogway. My destiny seems to be to write triangle inequalities for the rest of my life
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[308]  arXiv:2405.17061 [pdf, ps, other]
Title: Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Subjects: Machine Learning (cs.LG)
[309]  arXiv:2405.17059 [pdf, ps, other]
Title: Comparative Study of Machine Learning Algorithms in Detecting Cardiovascular Diseases
Subjects: Machine Learning (cs.LG)
[310]  arXiv:2405.17054 [pdf, other]
Title: Improving Data-aware and Parameter-aware Robustness for Continual Learning
Authors: Hanxi Xiao, Fan Lyu
Subjects: Machine Learning (cs.LG)
[311]  arXiv:2405.17051 [pdf, other]
Title: BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[312]  arXiv:2405.17050 [pdf, other]
Title: HeNCler: Node Clustering in Heterophilous Graphs through Learned Asymmetric Similarity
Subjects: Machine Learning (cs.LG)
[313]  arXiv:2405.17049 [pdf, other]
Title: Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization
Comments: 22 pages, 2 figures, 7 tables
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[314]  arXiv:2405.17042 [pdf, other]
Title: LabObf: A Label Protection Scheme for Vertical Federated Learning Through Label Obfuscation
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[315]  arXiv:2405.17035 [pdf, other]
Title: Glauber Generative Model: Discrete Diffusion Models via Binary Classification
Subjects: Machine Learning (cs.LG)
[316]  arXiv:2405.17034 [pdf, other]
Title: FUGNN: Harmonizing Fairness and Utility in Graph Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[317]  arXiv:2405.17031 [pdf, other]
Title: Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
Subjects: Machine Learning (cs.LG)
[318]  arXiv:2405.17027 [pdf, other]
Title: Supervised Batch Normalization
Subjects: Machine Learning (cs.LG)
[319]  arXiv:2405.17003 [pdf, other]
Title: Graph Condensation for Open-World Graph Learning
Comments: Accepted by KDD 2024
Subjects: Machine Learning (cs.LG)
[320]  arXiv:2405.16978 [pdf, other]
Title: OSLO: One-Shot Label-Only Membership Inference Attacks
Subjects: Machine Learning (cs.LG)
[321]  arXiv:2405.16971 [pdf, other]
Title: A Correlation- and Mean-Aware Loss Function and Benchmarking Framework to Improve GAN-based Tabular Data Synthesis
Comments: n.a
Subjects: Machine Learning (cs.LG)
[322]  arXiv:2405.16966 [pdf, other]
Title: Dual-Delayed Asynchronous SGD for Arbitrarily Heterogeneous Data
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[323]  arXiv:2405.16956 [pdf, other]
Title: Functional Programming Paradigm of Python for Scientific Computation Pipeline Integration
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Programming Languages (cs.PL); Software Engineering (cs.SE)
[324]  arXiv:2405.16951 [pdf, other]
Title: Fast ML-driven Analog Circuit Layout using Reinforcement Learning and Steiner Trees
Comments: 4 pages, 3 figures, accepted by SMACD 2024 conference
Subjects: Machine Learning (cs.LG)
[325]  arXiv:2405.16924 [pdf, other]
Title: Demystifying amortized causal discovery with transformers
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[326]  arXiv:2405.16918 [pdf, other]
Title: The Uncanny Valley: Exploring Adversarial Robustness from a Flatness Perspective
Subjects: Machine Learning (cs.LG)
[327]  arXiv:2405.16902 [pdf, other]
Title: Predicting from a Different Perspective in Re-ranking Model for Inductive Knowledge Graph Completion
Comments: 12 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[328]  arXiv:2405.16901 [pdf, other]
Title: Recurrent and Convolutional Neural Networks in Classification of EEG Signal for Guided Imagery and Mental Workload Detection
Comments: In review
Subjects: Machine Learning (cs.LG)
[329]  arXiv:2405.16899 [pdf, other]
Title: Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
Comments: Published as a conference paper at CoLLAs 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[330]  arXiv:2405.16883 [pdf, other]
Title: Scorch: A Library for Sparse Deep Learning
Comments: 25 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Mathematical Software (cs.MS); Programming Languages (cs.PL)
[331]  arXiv:2405.16879 [pdf, other]
Title: Unsupervised Generative Feature Transformation via Graph Contrastive Pre-training and Multi-objective Fine-tuning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[332]  arXiv:2405.16877 [pdf, other]
Title: Are Self-Attentions Effective for Time Series Forecasting?
Comments: 20 pages, 14 figures, 13 tables. Submitted to NeurIPS 2024 (under review)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[333]  arXiv:2405.16876 [pdf, other]
Title: Transfer Learning for Diffusion Models
Comments: 24 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[334]  arXiv:2405.16852 [pdf, other]
Title: EM Distillation for One-step Diffusion Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[335]  arXiv:2405.16845 [pdf, other]
Title: On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
Comments: 37pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[336]  arXiv:2405.16843 [pdf, ps, other]
Title: Non-stochastic Bandits With Evolving Observations
Subjects: Machine Learning (cs.LG)
[337]  arXiv:2405.16836 [pdf, other]
Title: Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf Node
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[338]  arXiv:2405.16833 [pdf, other]
Title: Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models
Subjects: Machine Learning (cs.LG)
[339]  arXiv:2405.16828 [pdf, other]
Title: Kernel-based optimally weighted conformal prediction intervals
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[340]  arXiv:2405.16820 [pdf, other]
Title: Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings
Comments: Accepted at the ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
[341]  arXiv:2405.16819 [pdf, other]
Title: Automatic Domain Adaptation by Transformers in In-Context Learning
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[342]  arXiv:2405.16809 [pdf, ps, other]
Title: Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[343]  arXiv:2405.16805 [pdf, other]
Title: Gradient Compressed Sensing: A Query-Efficient Gradient Estimator for High-Dimensional Zeroth-Order Optimization
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[344]  arXiv:2405.16800 [pdf, other]
Title: TAGA: Text-Attributed Graph Self-Supervised Learning by Synergizing Graph and Text Mutual Transformations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[345]  arXiv:2405.16799 [pdf, other]
Title: Dual-State Personalized Knowledge Tracing with Emotional Incorporation
Subjects: Machine Learning (cs.LG)
[346]  arXiv:2405.16798 [pdf, other]
Title: Exploring Fairness in Educational Data Mining in the Context of the Right to be Forgotten
Subjects: Machine Learning (cs.LG)
[347]  arXiv:2405.16771 [pdf, other]
Title: ARC: A Generalist Graph Anomaly Detector with In-Context Learning
Comments: 25 pages, 10 figures
Subjects: Machine Learning (cs.LG)
[348]  arXiv:2405.16770 [pdf, other]
Title: Physics informed cell representations for variational formulation of multiscale problems
Subjects: Machine Learning (cs.LG)
[349]  arXiv:2405.16765 [pdf, ps, other]
Title: Study of Robust Direction Finding Based on Joint Sparse Representation
Comments: 6 pages, 4 figures
Subjects: Machine Learning (cs.LG); Signal Processing (eess.SP)
[350]  arXiv:2405.16763 [pdf, other]
Title: Transport of Algebraic Structure to Latent Embeddings
Comments: Proceedings of the 41st International Conference on Machine Learning (2024)
Subjects: Machine Learning (cs.LG)
[351]  arXiv:2405.16756 [pdf, other]
Title: Symmetry-Informed Governing Equation Discovery
Subjects: Machine Learning (cs.LG)
[352]  arXiv:2405.16755 [pdf, other]
Title: CHESS: Contextual Harnessing for Efficient SQL Synthesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB)
[353]  arXiv:2405.16752 [pdf, other]
Title: Model Ensembling for Constrained Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[354]  arXiv:2405.16749 [pdf, other]
Title: DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[355]  arXiv:2405.16747 [pdf, other]
Title: Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective
Subjects: Machine Learning (cs.LG)
[356]  arXiv:2405.16739 [pdf, other]
Title: Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[357]  arXiv:2405.16731 [pdf, other]
Title: Pretraining with Random Noise for Fast and Robust Learning without Weight Transport
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[358]  arXiv:2405.16730 [pdf, other]
Title: Latent Energy-Based Odyssey: Black-Box Optimization via Expanded Exploration in the Energy-Based Latent Space
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP)
[359]  arXiv:2405.16727 [pdf, other]
Title: Disentangling and Integrating Relational and Sensory Information in Transformer Architectures
Comments: 23 pages, 13 figures
Subjects: Machine Learning (cs.LG)
[360]  arXiv:2405.16726 [pdf, other]
Title: Exploring Edge Probability Graph Models Beyond Edge Independency: Concepts, Analyses, and Algorithms
Subjects: Machine Learning (cs.LG)
[361]  arXiv:2405.16718 [pdf, other]
Title: Amortized Active Causal Induction with Deep Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[362]  arXiv:2405.16712 [pdf, other]
Title: Zamba: A Compact 7B SSM Hybrid Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[363]  arXiv:2405.16697 [pdf, other]
Title: CNN Autoencoder Resizer: A Power-Efficient LoS/NLoS Detector in MIMO-enabled UAV Networks
Subjects: Machine Learning (cs.LG)
[364]  arXiv:2405.16682 [pdf, other]
Title: A Systematic Review of Federated Generative Models
Comments: 24 Pages, 3 Figures, 5 Tables
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[365]  arXiv:2405.16674 [pdf, other]
Title: Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory
Comments: 23 pages, 17 figures, 4 tables
Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Logic in Computer Science (cs.LO)
[366]  arXiv:2405.16671 [pdf, other]
Title: Mixture of Experts Using Tensor Products
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[367]  arXiv:2405.16668 [pdf, other]
Title: Provably Efficient Off-Policy Adversarial Imitation Learning with Convergence Guarantees
Subjects: Machine Learning (cs.LG)
[368]  arXiv:2405.16666 [pdf, other]
Title: Comments on Friedman's Method for Class Distribution Estimation
Authors: Dirk Tasche
Comments: 16 pages
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[369]  arXiv:2405.16658 [pdf, other]
Title: Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[370]  arXiv:2405.16646 [pdf, other]
Title: A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
Journal-ref: The 41st International Conference on Machine Learning, ICML 2024
Subjects: Machine Learning (cs.LG)
[371]  arXiv:2405.16642 [pdf, other]
Title: Pick up the PACE: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[372]  arXiv:2405.16639 [pdf, ps, other]
Title: A unified law of robustness for Bregman divergence losses
Comments: 16 pages
Subjects: Machine Learning (cs.LG)
[373]  arXiv:2405.16623 [pdf, other]
Title: Graph neural networks with configuration cross-attention for tensor compilers
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Performance (cs.PF)
[374]  arXiv:2405.16616 [pdf, other]
Title: DPHGNN: A Dual Perspective Hypergraph Neural Networks
Comments: Accepted in SIGKDD'24 -- Research Track
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[375]  arXiv:2405.16608 [pdf, other]
Title: Efficient Probabilistic Modeling of Crystallization at Mesoscopic Scale
Comments: Under review in AI for Science @ ICML 2024
Subjects: Machine Learning (cs.LG); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci)
[376]  arXiv:2405.16601 [pdf, other]
Title: A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Journal-ref: ICLR 2023
Subjects: Machine Learning (cs.LG)
[377]  arXiv:2405.16598 [pdf, other]
Title: Regularized Projection Matrix Approximation with Applications to Community Detection
Subjects: Machine Learning (cs.LG)
[378]  arXiv:2405.16587 [pdf, other]
Title: Cost-Effective Online Multi-LLM Selection with Versatile Reward Models
Comments: 29 pages, 12 figures, conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[379]  arXiv:2405.16585 [pdf, other]
Title: Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity
Comments: Accepted by CVPR2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[380]  arXiv:2405.16581 [pdf, other]
Title: On Bits and Bandits: Quantifying the Regret-Information Trade-off
Subjects: Machine Learning (cs.LG)
[381]  arXiv:2405.16563 [pdf, other]
Title: Reality Only Happens Once: Single-Path Generalization Bounds for Transformers
Comments: 11 pages (+30 appendix), 3 figures, 6 tables
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Numerical Analysis (math.NA); Probability (math.PR); Machine Learning (stat.ML)
[382]  arXiv:2405.16560 [pdf, other]
Title: Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models
Subjects: Machine Learning (cs.LG)
[383]  arXiv:2405.16557 [pdf, other]
Title: Scalable Numerical Embeddings for Multivariate Time Series: Enhancing Healthcare Data Representation Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[384]  arXiv:2405.16528 [pdf, other]
Title: LoQT: Low Rank Adapters for Quantized Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[385]  arXiv:2405.16522 [pdf, other]
Title: Multi-State TD Target for Model-Free Reinforcement Learning
Comments: 6 pages, 16 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[386]  arXiv:2405.16519 [pdf, other]
Title: Injective Sliced-Wasserstein embedding for weighted sets and point clouds
Authors: Tal Amir, Nadav Dym
Comments: 28 pages
Subjects: Machine Learning (cs.LG)
[387]  arXiv:2405.16511 [pdf, other]
Title: SE3Set: Harnessing equivariant hypergraph neural networks for molecular representation learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Physics (physics.comp-ph)
[388]  arXiv:2405.16508 [pdf, other]
Title: AnyCBMs: How to Turn Any Black Box into a Concept Bottleneck Model
Subjects: Machine Learning (cs.LG)
[389]  arXiv:2405.16507 [pdf, other]
Title: Causal Concept Embedding Models: Beyond Causal Opacity in Deep Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[390]  arXiv:2405.16506 [pdf, other]
Title: GRAG: Graph Retrieval-Augmented Generation
Comments: 14 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[391]  arXiv:2405.16504 [pdf, other]
Title: A Unified Implicit Attention Formulation for Gated-Linear Recurrent Sequence Models
Subjects: Machine Learning (cs.LG)
[392]  arXiv:2405.16498 [pdf, other]
Title: On Sequential Loss Approximation for Continual Learning
Subjects: Machine Learning (cs.LG)
[393]  arXiv:2405.16489 [pdf, other]
Title: Causal-Aware Graph Neural Architecture Search under Distribution Shifts
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[394]  arXiv:2405.16475 [pdf, other]
Title: Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[395]  arXiv:2405.16474 [pdf, other]
Title: Inaccurate Label Distribution Learning with Dependency Noise
Subjects: Machine Learning (cs.LG)
[396]  arXiv:2405.16472 [pdf, other]
Title: Multi-Level Additive Modeling for Structured Non-IID Federated Learning
Subjects: Machine Learning (cs.LG)
[397]  arXiv:2405.16460 [pdf, other]
Title: Probabilistic Contrastive Learning with Explicit Concentration on the Hypersphere
Comments: technical report
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[398]  arXiv:2405.16456 [pdf, other]
Title: Dominant Shuffle: A Simple Yet Powerful Data Augmentation for Time-series Prediction
Comments: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[399]  arXiv:2405.16450 [pdf, other]
Title: Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
[400]  arXiv:2405.16449 [pdf, other]
Title: Reinforcement Learning for Jump-Diffusions
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Mathematical Finance (q-fin.MF)
[401]  arXiv:2405.16447 [pdf, other]
Title: Fast Asymmetric Factorization for Large Scale Multiple Kernel Clustering
Subjects: Machine Learning (cs.LG)
[402]  arXiv:2405.16444 [pdf, other]
Title: CacheBlend: Fast Large Language Model Serving with Cached Knowledge Fusion
Subjects: Machine Learning (cs.LG)
[403]  arXiv:2405.16441 [pdf, other]
Title: Categorical Flow Matching on Statistical Manifolds
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[404]  arXiv:2405.16440 [pdf, other]
Title: MambaTS: Improved Selective State Space Models for Long-term Time Series Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[405]  arXiv:2405.16436 [pdf, other]
Title: Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
Comments: 27 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[406]  arXiv:2405.16435 [pdf, other]
Title: Structure-aware Semantic Node Identifiers for Learning on Graphs
Subjects: Machine Learning (cs.LG)
[407]  arXiv:2405.16418 [pdf, other]
Title: Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[408]  arXiv:2405.16411 [pdf, other]
Title: Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[409]  arXiv:2405.16406 [pdf, other]
Title: SpinQuant -- LLM quantization with learned rotations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[410]  arXiv:2405.16405 [pdf, other]
Title: Intruding with Words: Towards Understanding Graph Injection Attacks at the Text Level
Comments: 29 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[411]  arXiv:2405.16397 [pdf, other]
Title: AdaFisher: Adaptive Second Order Optimization via Fisher Information
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[412]  arXiv:2405.16396 [pdf, other]
Title: Machine learning in business process management: A systematic literature review
Subjects: Machine Learning (cs.LG)
[413]  arXiv:2405.16395 [pdf, other]
Title: Daily Physical Activity Monitoring -- Adaptive Learning from Multi-source Motion Sensor Data
Subjects: Machine Learning (cs.LG)
[414]  arXiv:2405.16391 [pdf, other]
Title: When does compositional structure yield compositional generalization? A kernel theory
Subjects: Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[415]  arXiv:2405.16386 [pdf, other]
Title: Variational Offline Multi-agent Skill Discovery
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[416]  arXiv:2405.16383 [pdf, other]
Title: Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Subjects: Machine Learning (cs.LG)
[417]  arXiv:2405.16381 [pdf, other]
Title: Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[418]  arXiv:2405.16380 [pdf, other]
Title: Dynamic Inhomogeneous Quantum Resource Scheduling with Reinforcement Learning
Subjects: Machine Learning (cs.LG); Quantum Physics (quant-ph)
[419]  arXiv:2405.16368 [pdf, other]
Title: Qsco: A Quantum Scoring Module for Open-set Supervised Anomaly Detection
Subjects: Machine Learning (cs.LG)
[420]  arXiv:2405.16361 [pdf, other]
Title: LDPKiT: Recovering Utility in LDP Schemes by Training with Noise^2
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computers and Society (cs.CY)
[421]  arXiv:2405.16325 [pdf, other]
Title: SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[422]  arXiv:2405.16312 [pdf, other]
Title: Time-SSM: Simplifying and Unifying State Space Models for Time Series Forecasting
Comments: arXiv admin note: text overlap with arXiv:2402.11463
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[423]  arXiv:2405.16305 [pdf, other]
Title: Efficiently Parameterized Neural Metriplectic Sysyems
Subjects: Machine Learning (cs.LG)
[424]  arXiv:2405.16304 [pdf, other]
Title: Federated Unsupervised Domain Generalization using Global and Local Alignment of Gradients
Comments: 23 pages, 4 figure
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[425]  arXiv:2405.16297 [pdf, other]
Title: LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensembles
Subjects: Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph); Computational Physics (physics.comp-ph)
[426]  arXiv:2405.16287 [pdf, other]
Title: LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters
Comments: 16 pages
Subjects: Machine Learning (cs.LG)
[427]  arXiv:2405.16286 [pdf, ps, other]
Title: Generation of synthetic data using breast cancer dataset and classification with resnet18
Comments: 17 pages, 4 figures
Subjects: Machine Learning (cs.LG)
[428]  arXiv:2405.16285 [pdf, other]
Title: ModelLock: Locking Your Model With a Spell
Subjects: Machine Learning (cs.LG)
[429]  arXiv:2405.16267 [pdf, other]
Title: A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning
Subjects: Machine Learning (cs.LG)
[430]  arXiv:2405.16265 [pdf, other]
Title: MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time
Subjects: Machine Learning (cs.LG)
[431]  arXiv:2405.16262 [pdf, other]
Title: Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency
Subjects: Machine Learning (cs.LG)
[432]  arXiv:2405.16258 [pdf, other]
Title: USD: Unsupervised Soft Contrastive Learning for Fault Detection in Multivariate Time Series
Comments: 19 pages, 7 figures, under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[433]  arXiv:2405.16255 [pdf, other]
Title: GeoAdaLer: Geometric Insights into Adaptive Stochastic Gradient Descent Algorithms
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[434]  arXiv:2405.16240 [pdf, other]
Title: Analytic Federated Learning
Subjects: Machine Learning (cs.LG)
[435]  arXiv:2405.16233 [pdf, other]
Title: Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing
Subjects: Machine Learning (cs.LG)
[436]  arXiv:2405.16225 [pdf, ps, other]
Title: Local Causal Structure Learning in the Presence of Latent Variables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[437]  arXiv:2405.16224 [pdf, other]
Title: Negative as Positive: Enhancing Out-of-distribution Generalization for Graph Contrastive Learning
Comments: 5 pages, 5 figures, In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '24), July 14-18, 2024, Washington, DC, USA
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[438]  arXiv:2405.16219 [pdf, other]
Title: Deep Causal Generative Models with Property Control
Comments: 13 pages, 6 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[439]  arXiv:2405.16206 [pdf, other]
Title: GlycanML: A Multi-Task and Multi-Structure Benchmark for Glycan Machine Learning
Comments: Research project paper. All code and data are released
Subjects: Machine Learning (cs.LG)
[440]  arXiv:2405.16203 [pdf, other]
Title: Evolutionary Large Language Model for Automated Feature Transformation
Subjects: Machine Learning (cs.LG)
[441]  arXiv:2405.16196 [pdf, other]
Title: Maintaining and Managing Road Quality:Using MLP and DNN
Subjects: Machine Learning (cs.LG)
[442]  arXiv:2405.16195 [pdf, other]
Title: Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[443]  arXiv:2405.16194 [pdf, other]
Title: Diffusion-Reward Adversarial Imitation Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[444]  arXiv:2405.16185 [pdf, other]
Title: Differentiable Cluster Graph Neural Network
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[445]  arXiv:2405.16183 [pdf, other]
Title: Graph Neural PDE Solvers with Conservation and Similarity-Equivariance
Comments: ICML2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computational Physics (physics.comp-ph)
[446]  arXiv:2405.16173 [pdf, other]
Title: Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Subjects: Machine Learning (cs.LG)
[447]  arXiv:2405.16168 [pdf, other]
Title: Multi-Player Approaches for Dueling Bandits
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[448]  arXiv:2405.16164 [pdf, other]
Title: Acquiring Better Load Estimates by Combining Anomaly and Change-point Detection in Power Grid Time-series Measurements
Comments: All code can be found at: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Applications (stat.AP); Machine Learning (stat.ML)
[449]  arXiv:2405.16159 [pdf, other]
Title: A Declarative Query Language for Scientific Machine Learning
Authors: Hasan M Jamil
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[450]  arXiv:2405.16158 [pdf, other]
Title: Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Comments: Preprint
Subjects: Machine Learning (cs.LG)
[451]  arXiv:2405.16156 [pdf, other]
Title: Mixture of In-Context Prompters for Tabular PFNs
Comments: 32 pages, 16 figures
Subjects: Machine Learning (cs.LG)
[452]  arXiv:2405.16148 [pdf, other]
Title: Accelerating Transformers with Spectrum-Preserving Token Merging
Comments: Version 1
Subjects: Machine Learning (cs.LG)
[453]  arXiv:2405.16141 [pdf, other]
Title: AIGB: Generative Auto-bidding via Diffusion Modeling
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
[454]  arXiv:2405.16130 [pdf, ps, other]
Title: Automating the Selection of Proxy Variables of Unmeasured Confounders
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[455]  arXiv:2405.16124 [pdf, other]
Title: Unsupervised Meta-Learning via In-Context Learning
Subjects: Machine Learning (cs.LG)
[456]  arXiv:2405.16119 [pdf, ps, other]
Title: Method and Software Tool for Generating Artificial Databases of Biomedical Images Based on Deep Neural Networks
Comments: CEUR Workshop Proceedings (CEUR-WS.org). IDDM'2023: 6th International Conference on Informatics & Data-Driven Medicine, November 17 - 19, 2023, Bratislava, Slovakia
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[457]  arXiv:2405.16118 [pdf, ps, other]
Title: Beyond Primal-Dual Methods in Bandits with Stochastic and Adversarial Constraints
Subjects: Machine Learning (cs.LG)
[458]  arXiv:2405.16113 [pdf, other]
Title: Enabling On-Device Learning via Experience Replay with Efficient Dataset Condensation
Comments: 9 pages, 10 figures
Subjects: Machine Learning (cs.LG)
[459]  arXiv:2405.16104 [pdf, other]
Title: Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates
Subjects: Machine Learning (cs.LG); Analysis of PDEs (math.AP)
[460]  arXiv:2405.16083 [pdf, other]
Title: From Orthogonality to Dependency: Learning Disentangled Representation for Multi-Modal Time-Series Sensing Signals
Subjects: Machine Learning (cs.LG)
[461]  arXiv:2405.16077 [pdf, ps, other]
Title: Finite-Time Analysis for Conflict-Avoidant Multi-Task Reinforcement Learning
Comments: Initial submission at the 41$^{st}$ International Conference on Machine Learning
Subjects: Machine Learning (cs.LG)
[462]  arXiv:2405.16075 [pdf, other]
Title: Continuous Temporal Domain Generalization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[463]  arXiv:2405.16069 [pdf, other]
Title: IncomeSCM: From tabular data set to time-series simulator and causal estimation benchmark
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[464]  arXiv:2405.16056 [pdf, other]
Title: FedSheafHN: Personalized Federated Learning on Graph-structured Data
Subjects: Machine Learning (cs.LG)
[465]  arXiv:2405.16053 [pdf, other]
Title: Pausing Policy Learning in Non-stationary Reinforcement Learning
Comments: conference
Subjects: Machine Learning (cs.LG)
[466]  arXiv:2405.16043 [pdf, other]
Title: Theoretical Analysis of Weak-to-Strong Generalization
Comments: 36 pages, 3 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[467]  arXiv:2405.16041 [pdf, other]
Title: Explainable Molecular Property Prediction: Aligning Chemical Concepts with Predictions via Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[468]  arXiv:2405.16039 [pdf, other]
Title: MoEUT: Mixture-of-Experts Universal Transformers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[469]  arXiv:2405.16036 [pdf, other]
Title: Certifying Adapters: Enabling and Enhancing the Certification of Classifier Adversarial Robustness
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[470]  arXiv:2405.16030 [pdf, other]
Title: Constrained Ensemble Exploration for Unsupervised Skill Discovery
Comments: Accepted by ICML 2024
Subjects: Machine Learning (cs.LG)
[471]  arXiv:2405.16029 [pdf, other]
Title: Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference
Comments: This paper has been accepted by the IEEE INFOCOM 2024 Main Conference
Subjects: Machine Learning (cs.LG)
[472]  arXiv:2405.16027 [pdf, other]
Title: Feature Protection For Out-of-distribution Generalization
Comments: arXiv admin note: substantial text overlap with arXiv:2309.06256
Subjects: Machine Learning (cs.LG)
[473]  arXiv:2405.16013 [pdf, other]
Title: Convergence Behavior of an Adversarial Weak Supervision Method
Authors: Steven An (1), Sanjoy Dasgupta (1) ((1) University of California, San Diego)
Comments: 49 pages, 16 figures, to be published in UAI 2024
Subjects: Machine Learning (cs.LG)
[474]  arXiv:2405.16012 [pdf, other]
Title: Pessimistic Backward Policy for GFlowNets
Subjects: Machine Learning (cs.LG)
[475]  arXiv:2405.16002 [pdf, other]
Title: Does SGD really happen in tiny subspaces?
Comments: 22 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[476]  arXiv:2405.15994 [pdf, ps, other]
Title: Verified Safe Reinforcement Learning for Neural Network Dynamic Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[477]  arXiv:2405.15992 [pdf, ps, other]
Title: Data Complexity Estimates for Operator Learning
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[478]  arXiv:2405.15991 [pdf, other]
Title: Rényi Neural Processes
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[479]  arXiv:2405.15988 [pdf, ps, other]
Title: Transductive Confidence Machine and its application to Medical Data Sets
Authors: David Lindsay
Comments: 160 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[480]  arXiv:2405.15986 [pdf, ps, other]
Title: Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Numerical Analysis (math.NA); Machine Learning (stat.ML)
[481]  arXiv:2405.15979 [pdf, other]
Title: BadGD: A unified data-centric framework to identify gradient descent vulnerabilities
Comments: 25 pages, 1 figure
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
[482]  arXiv:2405.15971 [pdf, other]
Title: Robust width: A lightweight and certifiable adversarial defense
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[483]  arXiv:2405.15943 [pdf, other]
Title: Transformers represent belief state geometry in their residual stream
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[484]  arXiv:2405.15942 [pdf, other]
Title: Can Implicit Bias Imply Adversarial Robustness?
Comments: icml 2024
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[485]  arXiv:2405.15934 [pdf, other]
Title: Clustering Survival Data using a Mixture of Non-parametric Experts
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[486]  arXiv:2405.15926 [pdf, other]
Title: Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Machine Learning (stat.ML)
[487]  arXiv:2405.15920 [pdf, other]
Title: SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
Comments: arXiv admin note: text overlap with arXiv:2310.16173
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[488]  arXiv:2405.15913 [pdf, other]
Title: Scaling up the Banded Matrix Factorization Mechanism for Differentially Private ML
Authors: Ryan McKenna
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS)
[489]  arXiv:2405.15911 [pdf, other]
Title: Learning accurate and interpretable decision trees
Comments: 26 pages, UAI 2024
Subjects: Machine Learning (cs.LG)
[490]  arXiv:2405.15903 [pdf, other]
Title: UnitNorm: Rethinking Normalization for Transformers in Time Series
Subjects: Machine Learning (cs.LG)
[491]  arXiv:2405.15895 [pdf, other]
Title: Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Subjects: Machine Learning (cs.LG)
[492]  arXiv:2405.15885 [pdf, other]
Title: Diffusion Bridge Implicit Models
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[493]  arXiv:2405.15882 [pdf, other]
Title: Risk Factor Identification In Osteoporosis Using Unsupervised Machine Learning Techniques
Authors: Mikayla Calitis
Comments: 24 pages, 10 figures, 4 algorithms
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[494]  arXiv:2405.15877 [pdf, other]
Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[495]  arXiv:2405.15871 [pdf, other]
Title: CausalConceptTS: Causal Attributions for Time Series Classification using High Fidelity Diffusion Models
Comments: 17 pages, 8 figures. Source code under this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[496]  arXiv:2405.15861 [pdf, other]
Title: Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[497]  arXiv:2405.15829 [pdf, other]
Title: Spatio-temporal Value Semantics-based Abstraction for Dense Deep Reinforcement Learning
Comments: 24 pages, 7 figures, conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[498]  arXiv:2405.15824 [pdf, other]
Title: Efficient Mitigation of Bus Bunching through Setter-Based Curriculum Learning
Comments: 9 pages, preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[499]  arXiv:2405.17430 (cross-list from cs.CV) [pdf, other]
Title: Matryoshka Multimodal Models
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[500]  arXiv:2405.17428 (cross-list from cs.CL) [pdf, other]
Title: NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[501]  arXiv:2405.17422 (cross-list from cs.CV) [pdf, other]
Title: Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[502]  arXiv:2405.17419 (cross-list from cs.CV) [pdf, other]
Title: MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities
Comments: Code and MultiOOD benchmark: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[ total of 1410 entries: 1-396 | 107-502 | 503-898 | 899-1294 | 1295-1410 ]
[ showing 396 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)