We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions

[ total of 632 entries: 1-250 | 251-500 | 501-632 ]
[ showing 250 entries per page: fewer | more | all ]

Wed, 24 Apr 2024

[1]  arXiv:2404.15274 [pdf, other]
Title: Metric-guided Image Reconstruction Bounds via Conformal Prediction
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[2]  arXiv:2404.15255 [pdf, other]
Title: How to use and interpret activation patching
Comments: A tutorial on activation patching. 13 pages, 2 figures
Subjects: Machine Learning (cs.LG)
[3]  arXiv:2404.15242 [pdf, other]
Title: A Hybrid Kernel-Free Boundary Integral Method with Operator Learning for Solving Parametric Partial Differential Equations In Complex Domains
Comments: 30 pages,6 figures
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[4]  arXiv:2404.15225 [pdf, other]
Title: PHLP: Sole Persistent Homology for Link Prediction -- Interpretable Feature Extraction
Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG); Algebraic Topology (math.AT); Machine Learning (stat.ML)
[5]  arXiv:2404.15209 [pdf, other]
Title: Data-Driven Knowledge Transfer in Batch $Q^*$ Learning
Subjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
[6]  arXiv:2404.15201 [pdf, other]
Title: CORE-BEHRT: A Carefully Optimized and Rigorously Evaluated BEHRT
Comments: 11 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[7]  arXiv:2404.15199 [pdf, other]
Title: Reinforcement Learning with Adaptive Control Regularization for Safe Control of Critical Systems
Subjects: Machine Learning (cs.LG)
[8]  arXiv:2404.15198 [pdf, other]
Title: Lossless and Near-Lossless Compression for Foundation Models
Subjects: Machine Learning (cs.LG); Information Theory (cs.IT)
[9]  arXiv:2404.15182 [pdf, other]
Title: FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning
Comments: 10 pages, 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[10]  arXiv:2404.15146 [pdf, other]
Title: Rethinking LLM Memorization through the Lens of Adversarial Compression
Comments: this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[11]  arXiv:2404.15109 [pdf, other]
Title: Compete and Compose: Learning Independent Mechanisms for Modular World Models
Subjects: Machine Learning (cs.LG)
[12]  arXiv:2404.15095 [pdf, ps, other]
Title: Using ARIMA to Predict the Expansion of Subscriber Data Consumption
Authors: Mike Wa Nkongolo
Subjects: Machine Learning (cs.LG)
[13]  arXiv:2404.15084 [pdf, other]
Title: Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It
Comments: IJCAI'24
Subjects: Machine Learning (cs.LG)
[14]  arXiv:2404.15065 [pdf, other]
Title: Formal Verification of Graph Convolutional Networks with Uncertain Node Features and Uncertain Graph Structure
Comments: under review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[15]  arXiv:2404.15034 [pdf, other]
Title: Deep Multi-View Channel-Wise Spatio-Temporal Network for Traffic Flow Prediction
Comments: Accepted by AAAI2020 workshop
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[16]  arXiv:2404.15029 [pdf, other]
Title: Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality
Comments: This article has been accepted at the 2023 International Conference on Computational Science and Computational Intelligence (CSCI 23)
Subjects: Machine Learning (cs.LG)
[17]  arXiv:2404.15018 [pdf, other]
Title: Conformal Predictive Systems Under Covariate Shift
Comments: 13 pages, 4 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[18]  arXiv:2404.14986 [pdf, other]
Title: $\texttt{MiniMol}$: A Parameter-Efficient Foundation Model for Molecular Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[19]  arXiv:2404.14973 [pdf, ps, other]
Title: Symbolic Integration Algorithm Selection with Machine Learning: LSTMs vs Tree LSTMs
Subjects: Machine Learning (cs.LG); Mathematical Software (cs.MS); Symbolic Computation (cs.SC)
[20]  arXiv:2404.14970 [pdf, other]
Title: Integrating Heterogeneous Gene Expression Data through Knowledge Graphs for Improving Diabetes Prediction
Comments: 11 pages, 4 figures, 7th Workshop on Semantic Web Solutions for Large-scale Biomedical Data Analytics at ESWC2024
Subjects: Machine Learning (cs.LG)
[21]  arXiv:2404.14961 [pdf, other]
Title: Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems
Comments: 8 pages, 8 figures
Subjects: Machine Learning (cs.LG)
[22]  arXiv:2404.14953 [pdf, other]
Title: Dynamic pricing with Bayesian updates from online reviews
Subjects: Machine Learning (cs.LG)
[23]  arXiv:2404.14941 [pdf, other]
Title: Delayed Bottlenecking: Alleviating Forgetting in Pre-trained Graph Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[24]  arXiv:2404.14933 [pdf, other]
Title: Fin-Fed-OD: Federated Outlier Detection on Financial Tabular Data
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[25]  arXiv:2404.14928 [pdf, other]
Title: Graph Machine Learning in the Era of Large Language Models (LLMs)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[26]  arXiv:2404.14909 [pdf, other]
Title: MultiSTOP: Solving Functional Equations with Reinforcement Learning
Comments: ICLR 2024 Workshop on AI4DifferentialEquations In Science
Subjects: Machine Learning (cs.LG); High Energy Physics - Theory (hep-th)
[27]  arXiv:2404.14886 [pdf, other]
Title: GCEPNet: Graph Convolution-Enhanced Expectation Propagation for Massive MIMO Detection
Subjects: Machine Learning (cs.LG)
[28]  arXiv:2404.14875 [pdf, other]
Title: Regularized Gauss-Newton for Optimizing Overparameterized Neural Networks
Comments: 27 pages, 9 figures, 2 tables
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[29]  arXiv:2404.14855 [pdf, other]
Title: The Geometry of the Set of Equivalent Linear Neural Networks
Comments: 99 pages, 14 figures
Subjects: Machine Learning (cs.LG); Computational Geometry (cs.CG)
[30]  arXiv:2404.14829 [pdf, other]
Title: Revisiting Neural Networks for Continual Learning: An Architectural Perspective
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[31]  arXiv:2404.14815 [pdf, other]
Title: Time-aware Heterogeneous Graph Transformer with Adaptive Attention Merging for Health Event Prediction
Comments: 38 pages, 7 figures, 5 tables
Subjects: Machine Learning (cs.LG)
[32]  arXiv:2404.14757 [pdf, other]
Title: Integrating Mamba and Transformer for Long-Short Range Time Series Forecasting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[33]  arXiv:2404.14754 [pdf, other]
Title: Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine Learning
Comments: Accepted at Great Lakes Symposium on VLSI 2024 (GLSVLSI 24)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
[34]  arXiv:2404.14749 [pdf, ps, other]
Title: Semantic Cells: Evolutional Process to Acquire Sense Diversity of Items
Comments: 18 pages, 3 figures, 1 table
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[35]  arXiv:2404.14746 [pdf, ps, other]
Title: A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation
Comments: 12 pages, 3 figures, 1 table
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[36]  arXiv:2404.14728 [pdf, ps, other]
Title: Novel Topological Machine Learning Methodology for Stream-of-Quality Modeling in Smart Manufacturing
Comments: The paper has been submitted to Manufacturing Letters (Under Review)
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[37]  arXiv:2404.14721 [pdf, other]
Title: Dynamically Anchored Prompting for Task-Imbalanced Continual Learning
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG)
[38]  arXiv:2404.14701 [pdf, other]
Title: Deep neural networks for choice analysis: Enhancing behavioral regularity with gradient regularization
Subjects: Machine Learning (cs.LG)
[39]  arXiv:2404.14689 [pdf, other]
Title: Interpretable Prediction and Feature Selection for Survival Analysis
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[40]  arXiv:2404.14688 [pdf, other]
Title: FMint: Bridging Human Designed and Data Pretrained Models for Differential Equation Foundation Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Dynamical Systems (math.DS); Numerical Analysis (math.NA)
[41]  arXiv:2404.14674 [pdf, other]
Title: HOIN: High-Order Implicit Neural Representations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[42]  arXiv:2404.14664 [pdf, ps, other]
Title: Employing Layerwised Unsupervised Learning to Lessen Data and Loss Requirements in Forward-Forward Algorithms
Comments: 8 pages, 6 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[43]  arXiv:2404.14662 [pdf, other]
Title: NExT: Teaching Large Language Models to Reason about Code Execution
Comments: 35 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Programming Languages (cs.PL); Software Engineering (cs.SE)
[44]  arXiv:2404.14642 [pdf, other]
Title: Uncertainty Quantification on Graph Learning: A Survey
Subjects: Machine Learning (cs.LG)
[45]  arXiv:2404.14635 [pdf, ps, other]
Title: Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment
Comments: A bit thin, but an interesting application of ML methods for decision making
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[46]  arXiv:2404.14620 [pdf, other]
Title: Fairness Incentives in Response to Unfair Dynamic Pricing
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[47]  arXiv:2404.14618 [pdf, other]
Title: Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Comments: Accepted to ICLR 2024 (main conference)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[48]  arXiv:2404.14588 [pdf, ps, other]
Title: Brain-Inspired Continual Learning-Robust Feature Distillation and Re-Consolidation for Class Incremental Learning
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2404.14552 [pdf, other]
Title: Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[50]  arXiv:2404.14523 [pdf, ps, other]
Title: Edge-Assisted ML-Aided Uncertainty-Aware Vehicle Collision Avoidance at Urban Intersections
Comments: Accepted in IEEE Transactions on Intelligent Vehicles
Subjects: Machine Learning (cs.LG)
[51]  arXiv:2404.14462 [pdf, other]
Title: Towards smaller, faster decoder-only transformers: Architectural variants and their implications
Comments: 8 pages, 6 figures
Subjects: Machine Learning (cs.LG)
[52]  arXiv:2404.14457 [pdf, ps, other]
Title: Graph Coloring Using Heat Diffusion
Authors: Vivek Chaudhary
Comments: 5 Pages, 3 Figures
Subjects: Machine Learning (cs.LG)
[53]  arXiv:2404.14456 [pdf, ps, other]
Title: Multifidelity Surrogate Models: A New Data Fusion Perspective
Authors: Daniel N Wilke
Comments: 8 pages, 4 figures, SACAM2024 Conference, 22-23 January 2024
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[54]  arXiv:2404.14455 [pdf, other]
Title: A Neuro-Symbolic Explainer for Rare Events: A Case Study on Predictive Maintenance
Comments: 26 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[55]  arXiv:2404.14451 [pdf, other]
Title: Generative Subspace Adversarial Active Learning for Outlier Detection in Multiple Views of High-dimensional Data
Comments: 16 pages, Pre-print
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[56]  arXiv:2404.14447 [pdf, ps, other]
Title: A Novel A.I Enhanced Reservoir Characterization with a Combined Mixture of Experts -- NVIDIA Modulus based Physics Informed Neural Operator Forward Model
Comments: 55 pages, 46 figures
Subjects: Machine Learning (cs.LG)
[57]  arXiv:2404.14445 [pdf, other]
Title: A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language Models
Comments: 10 pages, 1 figure, 4 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[58]  arXiv:2404.14444 [pdf, other]
Title: Practical Battery Health Monitoring using Uncertainty-Aware Bayesian Neural Network
Comments: 6 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[59]  arXiv:2404.14442 [pdf, ps, other]
Title: Unified ODE Analysis of Smooth Q-Learning Algorithms
Authors: Donghwan Lee
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[60]  arXiv:2404.14436 [pdf, other]
Title: Investigating Resource-efficient Neutron/Gamma Classification ML Models Targeting eFPGAs
Subjects: Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); Nuclear Experiment (nucl-ex); Instrumentation and Detectors (physics.ins-det)
[61]  arXiv:2404.14433 [pdf, other]
Title: KATO: Knowledge Alignment and Transfer for Transistor Sizing of Different Design and Technology
Comments: 6 pages, received by DAC2024
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE)
[62]  arXiv:2404.15276 (cross-list from cs.CV) [pdf, other]
Title: SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation
Comments: Published at TPAMI 2024
Journal-ref: https://www.computer.org/csdl/journal/tp/2024/05/10354384/1SP2qWh8Fq0
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[63]  arXiv:2404.15273 (cross-list from math.OC) [pdf, other]
Title: Estimation Network Design framework for efficient distributed optimization
Comments: 8 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2208.11377
Subjects: Optimization and Control (math.OC); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[64]  arXiv:2404.15269 (cross-list from cs.CL) [pdf, other]
Title: Aligning LLM Agents by Learning Latent Preference from User Edits
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[65]  arXiv:2404.15261 (cross-list from math.OC) [pdf, other]
Title: All You Need is Resistance: On the Equivalence of Effective Resistance and Certain Optimal Transport Problems on Graphs
Comments: 35 pages, 7 figures
Subjects: Optimization and Control (math.OC); Discrete Mathematics (cs.DM); Machine Learning (cs.LG); Probability (math.PR)
[66]  arXiv:2404.15258 (cross-list from math.PR) [pdf, other]
Title: Score matching for sub-Riemannian bridge sampling
Comments: 33 pages, 4 figures
Subjects: Probability (math.PR); Machine Learning (cs.LG); Differential Geometry (math.DG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[67]  arXiv:2404.15247 (cross-list from cs.CL) [pdf, other]
Title: XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Software Engineering (cs.SE)
[68]  arXiv:2404.15245 (cross-list from stat.ME) [pdf, other]
Title: Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification
Comments: Accepted to the 2024 International Symposium on Information Theory (ISIT)
Subjects: Methodology (stat.ME); Machine Learning (cs.LG)
[69]  arXiv:2404.15244 (cross-list from cs.CV) [pdf, other]
Title: Efficient Transformer Encoders for Mask2Former-style models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[70]  arXiv:2404.15243 (cross-list from cs.NI) [pdf, other]
Title: UCINet0: A Machine Learning based Receiver for 5G NR PUCCH Format 0
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Signal Processing (eess.SP)
[71]  arXiv:2404.15224 (cross-list from cs.CV) [pdf, other]
Title: Deep Models for Multi-View 3D Object Recognition: A Review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[72]  arXiv:2404.15217 (cross-list from cs.CV) [pdf, other]
Title: Towards Large-Scale Training of Pathology Foundation Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[73]  arXiv:2404.15213 (cross-list from cs.HC) [pdf, other]
Title: Automatic Classification of Subjective Time Perception Using Multi-modal Physiological Data of Air Traffic Controllers
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[74]  arXiv:2404.15211 (cross-list from cs.DC) [pdf, other]
Title: LACS: Learning-Augmented Algorithms for Carbon-Aware Resource Scaling with Uncertain Demand
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[75]  arXiv:2404.15207 (cross-list from cs.CE) [pdf, other]
Title: Simulation-Free Determination of Microstructure Representative Volume Element Size via Fisher Scores
Journal-ref: APL Mach. Learn. 2(2): 026101 (2024)
Subjects: Computational Engineering, Finance, and Science (cs.CE); Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG); Applications (stat.AP)
[76]  arXiv:2404.15204 (cross-list from cs.PL) [pdf, other]
Title: Towards a high-performance AI compiler with upstream MLIR
Comments: 13 pages, 8 figures, presented at CGO C4ML 2024 & MLIR Workshop EuroLLVM 2024
Subjects: Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[77]  arXiv:2404.15197 (cross-list from cs.NI) [pdf, other]
Title: Multi-Task Learning as enabler for General-Purpose AI-native RAN
Comments: Accepted for 2024 IEEE ICC Workshop on Edge Learning over 5G Mobile Networks and Beyond
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[78]  arXiv:2404.15193 (cross-list from cs.NE) [pdf, other]
Title: Structurally Flexible Neural Networks: Evolving the Building Blocks for General Agents
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[79]  arXiv:2404.15176 (cross-list from eess.AS) [pdf, other]
Title: Voice Passing : a Non-Binary Voice Gender Prediction System for evaluating Transgender voice transition
Comments: 5 pages, 1 figure, keywords: Transgender voice, Gender perception, Speaker gender classification, CNN, X-Vector
Journal-ref: Proc. INTERSPEECH 2023, 5207-5211
Subjects: Audio and Speech Processing (eess.AS); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD)
[80]  arXiv:2404.15168 (cross-list from eess.AS) [pdf, other]
Title: Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech
Subjects: Audio and Speech Processing (eess.AS); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD)
[81]  arXiv:2404.15155 (cross-list from cs.CL) [pdf, other]
Title: Adaptive Collaboration Strategy for LLMs in Medical Decision Making
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[82]  arXiv:2404.15149 (cross-list from cs.CL) [pdf, other]
Title: Bias patterns in the application of LLMs for clinical decision support: A comprehensive study
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[83]  arXiv:2404.15098 (cross-list from eess.SY) [pdf, other]
Title: Uncertainty Quantification of Data-Driven Output Predictors in the Output Error Setting
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[84]  arXiv:2404.15096 (cross-list from cs.RO) [pdf, other]
Title: Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot
Comments: Accepted by Ubiquitous Robots 2024
Subjects: Robotics (cs.RO); Machine Learning (cs.LG)
[85]  arXiv:2404.15081 (cross-list from cs.CV) [pdf, other]
Title: Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models
Comments: Published at CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[86]  arXiv:2404.15045 (cross-list from cs.CL) [pdf, other]
Title: Multi-Head Mixture-of-Experts
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87]  arXiv:2404.15024 (cross-list from cs.CV) [pdf, other]
Title: A Learning Paradigm for Interpretable Gradients
Comments: VISAPP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[88]  arXiv:2404.14999 (cross-list from cs.DB) [pdf, other]
Title: A Unified Replay-based Continuous Learning Framework for Spatio-Temporal Prediction on Streaming Data
Comments: Accepted by ICDE 2024
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[89]  arXiv:2404.14994 (cross-list from cs.CL) [pdf, other]
Title: Transformers Can Represent $n$-gram Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Formal Languages and Automata Theory (cs.FL); Machine Learning (cs.LG)
[90]  arXiv:2404.14966 (cross-list from cs.CV) [pdf, other]
Title: Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[91]  arXiv:2404.14943 (cross-list from cs.CL) [pdf, other]
Title: Does It Make Sense to Explain a Black Box With Another Black Box?
Comments: This article was originally published in French at the Journal TAL. VOL 64 n{\deg}3/2023. arXiv admin note: substantial text overlap with arXiv:2402.10888
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[92]  arXiv:2404.14942 (cross-list from cs.CR) [pdf, other]
Title: Manipulating Recommender Systems: A Survey of Poisoning Attacks and Countermeasures
Subjects: Cryptography and Security (cs.CR); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[93]  arXiv:2404.14913 (cross-list from eess.AS) [pdf, other]
Title: Additive Margin in Contrastive Self-Supervised Frameworks to Learn Discriminative Speaker Representations
Comments: accepted at Odyssey 2024: The Speaker and Language Recognition Workshop. arXiv admin note: text overlap with arXiv:2306.03664
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[94]  arXiv:2404.14906 (cross-list from cs.CV) [pdf, other]
Title: Driver Activity Classification Using Generalizable Representations from Vision-Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[95]  arXiv:2404.14901 (cross-list from cs.SE) [pdf, other]
Title: Beyond Code Generation: An Observational Study of ChatGPT Usage in Software Engineering Practice
Comments: Accepted at the ACM International Conference on the Foundations of Software Engineering (FSE) 2024
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[96]  arXiv:2404.14873 (cross-list from stat.ML) [pdf, ps, other]
Title: Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data
Comments: 16 pages, 10 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[97]  arXiv:2404.14869 (cross-list from cs.HC) [pdf, other]
Title: EEGEncoder: Advancing BCI with Transformer-Based Motor Imagery Classification
Authors: Wangdan Liao
Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[98]  arXiv:2404.14850 (cross-list from cs.CL) [pdf, other]
Title: Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models
Comments: 30 pages, 4 figures, 8 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
[99]  arXiv:2404.14836 (cross-list from eess.SY) [src]
Title: Probabilistic forecasting of power system imbalance using neural network-based ensembles
Comments: One of the co-authors objected with having it on Arxiv already
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[100]  arXiv:2404.14811 (cross-list from eess.SP) [pdf, other]
Title: FLARE: A New Federated Learning Framework with Adjustable Learning Rates over Resource-Constrained Wireless Networks
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[101]  arXiv:2404.14795 (cross-list from cs.CL) [pdf, other]
Title: Talk Too Much: Poisoning Large Language Models under Token Limit
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[102]  arXiv:2404.14786 (cross-list from cs.AI) [pdf, other]
Title: LLM-Enhanced Causal Discovery in Temporal Domain from Interventional Data
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Methodology (stat.ME)
[103]  arXiv:2404.14777 (cross-list from cs.CL) [pdf, other]
Title: CT-Agent: Clinical Trial Multi-Agent with Large Language Model-based Reasoning
Authors: Ling Yue, Tianfan Fu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[104]  arXiv:2404.14760 (cross-list from cs.CL) [pdf, other]
Title: Retrieval Augmented Generation for Domain-specific Question Answering
Comments: AAAI 2024 (Association for the Advancement of Artificial Intelligence) Scientific Document Understanding Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[105]  arXiv:2404.14758 (cross-list from math.OC) [pdf, other]
Title: Second-order Information Promotes Mini-Batch Robustness in Variance-Reduced Gradients
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[106]  arXiv:2404.14743 (cross-list from stat.ML) [pdf, other]
Title: Gradient Guidance for Diffusion Models: An Optimization Perspective
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[107]  arXiv:2404.14700 (cross-list from eess.AS) [pdf, other]
Title: FlashSpeech: Efficient Zero-Shot Speech Synthesis
Comments: Efficient zero-shot speech synthesis
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[108]  arXiv:2404.14680 (cross-list from cs.CL) [pdf, other]
Title: Automated Multi-Language to English Machine Translation Using Generative Pre-Trained Transformers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[109]  arXiv:2404.14661 (cross-list from cs.CV) [pdf, other]
Title: First Mapping the Canopy Height of Primeval Forests in the Tallest Tree Area of Asia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Earth and Planetary Astrophysics (astro-ph.EP); Machine Learning (cs.LG)
[110]  arXiv:2404.14653 (cross-list from cs.CV) [pdf, ps, other]
Title: Machine Vision Based Assessment of Fall Color Changes in Apple Trees: Exploring Relationship with Leaf Nitrogen Concentration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[111]  arXiv:2404.14651 (cross-list from nlin.AO) [pdf, other]
Title: Forecasting the Forced Van der Pol Equation with Frequent Phase Shifts Using a Reservoir Computer
Subjects: Adaptation and Self-Organizing Systems (nlin.AO); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[112]  arXiv:2404.14631 (cross-list from cs.CL) [pdf, other]
Title: Learning Word Embedding with Better Distance Weighting and Window Size Scheduling
Authors: Chaohao Yang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[113]  arXiv:2404.14625 (cross-list from cs.RO) [pdf, other]
Title: Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Comments: Accepted at the Genetic and Evolutionary Computation Conference 2024 Evolutionary Machine Learning track as a full paper
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[114]  arXiv:2404.14619 (cross-list from cs.CL) [pdf, other]
Title: OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115]  arXiv:2404.14602 (cross-list from eess.SY) [pdf, other]
Title: Adaptive Bayesian Optimization for High-Precision Motion Systems
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Robotics (cs.RO)
[116]  arXiv:2404.14586 (cross-list from cs.IT) [pdf, other]
Title: Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy Channels
Comments: Submitted to IEEE Transactions on Communications
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[117]  arXiv:2404.14551 (cross-list from hep-th) [pdf, other]
Title: Learning S-Matrix Phases with Neural Operators
Comments: 36 pages, 8 figures
Subjects: High Energy Physics - Theory (hep-th); Machine Learning (cs.LG)
[118]  arXiv:2404.14527 (cross-list from cs.DC) [pdf, other]
Title: Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[119]  arXiv:2404.14507 (cross-list from cs.CV) [pdf, other]
Title: Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120]  arXiv:2404.14497 (cross-list from cs.NI) [pdf, other]
Title: Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning
Comments: Accepted by IFIP/IEEE Networking 2024
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Signal Processing (eess.SP)
[121]  arXiv:2404.14463 (cross-list from cs.CL) [pdf, other]
Title: DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews
Comments: Accepted to Clinical NLP workshop at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[122]  arXiv:2404.14461 (cross-list from cs.CL) [pdf, other]
Title: Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
Comments: Competition Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[123]  arXiv:2404.14460 (cross-list from stat.ML) [pdf, other]
Title: Inference of Causal Networks using a Topological Threshold
Comments: 17 pages, 12 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[124]  arXiv:2404.14449 (cross-list from cs.CL) [pdf, ps, other]
Title: Predicting Question Quality on StackOverflow with Neural Networks
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[125]  arXiv:2404.14441 (cross-list from cs.CV) [pdf, ps, other]
Title: Optimizing Contrail Detection: A Deep Learning Approach with EfficientNet-b4 Encoding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[126]  arXiv:2404.14419 (cross-list from cs.SE) [pdf, other]
Title: Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[127]  arXiv:2404.14418 (cross-list from cs.SI) [pdf, other]
Title: Mitigating Cascading Effects in Large Adversarial Graph Environments
Comments: 10 pages, 7 figures
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[128]  arXiv:2404.14416 (cross-list from physics.geo-ph) [pdf, other]
Title: Conditional diffusion models for downscaling & bias correction of Earth system model precipitation
Subjects: Geophysics (physics.geo-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[129]  arXiv:2404.13630 (cross-list from cs.SE) [pdf, ps, other]
Title: Utilizing Deep Learning to Optimize Software Development Processes
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)

Tue, 23 Apr 2024 (showing first 121 of 186 entries)

[130]  arXiv:2404.14388 [pdf, other]
Title: STROOBnet Optimization via GPU-Accelerated Proximal Recurrence Strategies
Comments: 10 pages, 17 figures, 2023 IEEE International Conference on Big Data (BigData)
Journal-ref: 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 2023, pp. 2920-2929
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[131]  arXiv:2404.14367 [pdf, other]
Title: Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Subjects: Machine Learning (cs.LG)
[132]  arXiv:2404.14326 [pdf, ps, other]
Title: Machine Learning Techniques for MRI Data Processing at Expanding Scale
Authors: Taro Langner
Comments: Book chapter pre-print
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2404.14271 [pdf, other]
Title: Sparse Explanations of Neural Networks Using Pruned Layer-Wise Relevance Propagation
Comments: 15 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[134]  arXiv:2404.14265 [pdf, other]
Title: Deep Learning as Ricci Flow
Subjects: Machine Learning (cs.LG); Differential Geometry (math.DG)
[135]  arXiv:2404.14202 [pdf, other]
Title: Rotting Infinitely Many-armed Bandits beyond the Worst-case Rotting: An Adaptive Approach
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[136]  arXiv:2404.14197 [pdf, other]
Title: SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion
Subjects: Machine Learning (cs.LG)
[137]  arXiv:2404.14164 [pdf, other]
Title: New Solutions Based on the Generalized Eigenvalue Problem for the Data Collaboration Analysis
Comments: 16 pages, 9 figures, preprint
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[138]  arXiv:2404.14161 [pdf, other]
Title: Multidimensional Interpolants
Authors: Dohoon Lee, Kyogu Lee
Comments: 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[139]  arXiv:2404.14107 [pdf, other]
Title: PGNAA Spectral Classification of Aluminium and Copper Alloys with Machine Learning
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[140]  arXiv:2404.14076 [pdf, other]
Title: Noise contrastive estimation with soft targets for conditional models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[141]  arXiv:2404.14073 [pdf, other]
Title: Towards Robust Trajectory Representations: Isolating Environmental Confounders with Causal Learning
Comments: The paper has been accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[142]  arXiv:2404.14064 [pdf, other]
Title: Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2404.14061 [pdf, other]
Title: FedTAD: Topology-aware Data-free Knowledge Distillation for Subgraph Federated Learning
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Social and Information Networks (cs.SI)
[144]  arXiv:2404.14047 [pdf, other]
Title: How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study
Subjects: Machine Learning (cs.LG)
[145]  arXiv:2404.14017 [pdf, other]
Title: Hybrid Ensemble-Based Travel Mode Prediction
Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Advances in Intelligent Data Analysis XXII. IDA 2024. Lecture Notes in Computer Science, vol 14641. Springer, and is available online at Cham this https URL The preprint includes 12+22 pages, 1+1 figures
Journal-ref: Advances in Intelligent Data Analysis XXII, IDA 2024, LNCS, vol 14641, (2024), 191-202
Subjects: Machine Learning (cs.LG)
[146]  arXiv:2404.14016 [pdf, other]
Title: Ungeneralizable Examples
Comments: Accepted by CVPR2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2404.14006 [pdf, other]
Title: Distilled Datamodel with Reverse Gradient Matching
Comments: Accepted by CVPR2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2404.13990 [pdf, other]
Title: QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models -- Extended Version
Comments: 15 pages. An extended version of "QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models" accepted at PVLDB 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[149]  arXiv:2404.13964 [pdf, other]
Title: An Economic Solution to Copyright Challenges of Generative AI
Subjects: Machine Learning (cs.LG); General Economics (econ.GN); Methodology (stat.ME)
[150]  arXiv:2404.13954 [pdf, ps, other]
Title: A survey of air combat behavior modeling using machine learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[151]  arXiv:2404.13946 [pdf, other]
Title: Dual Model Replacement:invisible Multi-target Backdoor Attack based on Federal Learning
Subjects: Machine Learning (cs.LG)
[152]  arXiv:2404.13910 [pdf, other]
Title: Integrated Gradient Correlation: a Dataset-wise Attribution Method
Authors: Pierre Lelièvre, Chien-Chung Chen (National Taiwan University)
Comments: 12 pages, 8 figures, source code at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[153]  arXiv:2404.13904 [pdf, other]
Title: Deep Regression Representation Learning with Topology
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[154]  arXiv:2404.13895 [pdf, other]
Title: Optimal Design for Human Feedback
Subjects: Machine Learning (cs.LG)
[155]  arXiv:2404.13891 [pdf, other]
Title: Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Comments: Accepted to 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[156]  arXiv:2404.13879 [pdf, other]
Title: Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation
Subjects: Machine Learning (cs.LG)
[157]  arXiv:2404.13860 [pdf, other]
Title: Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
[158]  arXiv:2404.13853 [pdf, other]
Title: ICST-DNET: An Interpretable Causal Spatio-Temporal Diffusion Network for Traffic Speed Prediction
Subjects: Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[159]  arXiv:2404.13846 [pdf, other]
Title: Filtered Direct Preference Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[160]  arXiv:2404.13844 [pdf, other]
Title: ColA: Collaborative Adaptation with Gradient Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[161]  arXiv:2404.13841 [pdf, other]
Title: Fair Concurrent Training of Multiple Models in Federated Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[162]  arXiv:2404.13815 [pdf, other]
Title: Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference
Authors: Yujin Han, Difan Zou
Subjects: Machine Learning (cs.LG)
[163]  arXiv:2404.13785 [pdf, ps, other]
Title: How to Inverting the Leverage Score Distribution?
Subjects: Machine Learning (cs.LG)
[164]  arXiv:2404.13752 [pdf, other]
Title: Towards General Conceptual Model Editing via Adversarial Representation Engineering
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
[165]  arXiv:2404.13736 [pdf, other]
Title: Interval Abstractions for Robust Counterfactual Explanations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[166]  arXiv:2404.13733 [pdf, other]
Title: Elucidating the Design Space of Dataset Condensation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[167]  arXiv:2404.13715 [pdf, other]
Title: TF2AIF: Facilitating development and deployment of accelerated AI models on the cloud-edge continuum
Comments: to be published in EUCNC & 6G Summit 2024
Subjects: Machine Learning (cs.LG)
[168]  arXiv:2404.13663 [pdf, other]
Title: Cumulative Hazard Function Based Efficient Multivariate Temporal Point Process Learning
Authors: Bingqing Liu
Comments: 8 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[169]  arXiv:2404.13655 [pdf, other]
Title: SPGNN: Recognizing Salient Subgraph Patterns via Enhanced Graph Convolution and Pooling
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[170]  arXiv:2404.13647 [pdf, other]
Title: Mean Aggregator Is More Robust Than Robust Aggregators Under Label Poisoning Attacks
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG)
[171]  arXiv:2404.13634 [pdf, other]
Title: Bt-GAN: Generating Fair Synthetic Healthdata via Bias-transforming Generative Adversarial Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[172]  arXiv:2404.13631 [pdf, other]
Title: Fermi-Bose Machine
Comments: 17 pages, 6 figures, a physics inspired machine without backpropagation and enhanced adversarial robustness
Subjects: Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
[173]  arXiv:2404.13604 [pdf, other]
Title: CKGConv: General Graph Convolution with Continuous Kernels
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[174]  arXiv:2404.13588 [pdf, other]
Title: Machine Unlearning via Null Space Calibration
Comments: Accepted by IJCAI-2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[175]  arXiv:2404.13571 [pdf, other]
Title: Test-Time Training on Graphs with Large Language Models (LLMs)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[176]  arXiv:2404.13528 [pdf, other]
Title: SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[177]  arXiv:2404.13515 [pdf, other]
Title: FedTrans: Efficient Federated Learning Over Heterogeneous Clients via Model Transformation
Journal-ref: MLSys (2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[178]  arXiv:2404.13506 [pdf, other]
Title: Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[179]  arXiv:2404.13503 [pdf, other]
Title: Predict to Minimize Swap Regret for All Payoff-Bounded Tasks
Authors: Lunjia Hu, Yifan Wu
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[180]  arXiv:2404.13500 [pdf, ps, other]
Title: Generalized Regression with Conditional GANs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[181]  arXiv:2404.13476 [pdf, other]
Title: A Framework for Feasible Counterfactual Exploration incorporating Causality, Sparsity and Density
Comments: 10 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[182]  arXiv:2404.13456 [pdf, other]
Title: Real-Time Safe Control of Neural Network Dynamic Models with Sound Approximation
Comments: L4DC 2024, 12 pages, 3 figures, 4 tables
Subjects: Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
[183]  arXiv:2404.13423 [pdf, other]
Title: PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Subjects: Machine Learning (cs.LG)
[184]  arXiv:2404.13421 [pdf, other]
Title: MultiConfederated Learning: Inclusive Non-IID Data handling with Decentralized Federated Learning
Journal-ref: Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing, SAC '24, 1587-1595, April 2024. ACM
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[185]  arXiv:2404.13401 [pdf, other]
Title: Approximate Algorithms For $k$-Sparse Wasserstein Barycenter With Outliers
Subjects: Machine Learning (cs.LG)
[186]  arXiv:2404.13393 [pdf, other]
Title: Transfer Learning for Molecular Property Predictions from Small Data Sets
Subjects: Machine Learning (cs.LG); Chemical Physics (physics.chem-ph)
[187]  arXiv:2404.13381 [pdf, other]
Title: DNA: Differentially private Neural Augmentation for contact tracing
Comments: Privacy Regulation and Protection in Machine Learning Workshop at ICLR 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Multiagent Systems (cs.MA); Populations and Evolution (q-bio.PE)
[188]  arXiv:2404.13347 [pdf, other]
Title: Augmenting Safety-Critical Driving Scenarios while Preserving Similarity to Expert Trajectories
Comments: Accepted to 35th IEEE Intelligent Vehicles Symposium, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[189]  arXiv:2404.13344 [pdf, other]
Title: GRANOLA: Adaptive Normalization for Graph Neural Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[190]  arXiv:2404.13327 [pdf, other]
Title: Comparative Analysis on Snowmelt-Driven Streamflow Forecasting Using Machine Learning Techniques
Comments: 17 pages, 4 Tables, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[191]  arXiv:2404.13322 [pdf, other]
Title: MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[192]  arXiv:2404.13318 [pdf, other]
Title: EHRFL: Federated Learning Framework for Heterogeneous EHRs and Precision-guided Selection of Participating Clients
Subjects: Machine Learning (cs.LG)
[193]  arXiv:2404.13300 [pdf, other]
Title: Capturing Momentum: Tennis Match Analysis Using Machine Learning and Time Series Theory
Comments: 16 pages, 18 figures
Subjects: Machine Learning (cs.LG)
[194]  arXiv:2404.13278 [pdf, other]
Title: Federated Transfer Learning with Task Personalization for Condition Monitoring in Ultrasonic Metal Welding
Comments: 37 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Signal Processing (eess.SP)
[195]  arXiv:2404.13260 [pdf, ps, other]
Title: Predicting Diabetes with Machine Learning Analysis of Income and Health Factors
Subjects: Machine Learning (cs.LG)
[196]  arXiv:2404.13257 [pdf, other]
Title: ST-SSMs: Spatial-Temporal Selective State of Space Model for Traffic Forecasting
Comments: 17 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[197]  arXiv:2404.13244 [pdf, other]
Title: Intelligent Agents for Auction-based Federated Learning: A Survey
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
[198]  arXiv:2404.13238 [pdf, other]
Title: Personalized Wireless Federated Learning for Large Language Models
Comments: 8 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[199]  arXiv:2404.13235 [pdf, other]
Title: TrialDura: Hierarchical Attention Transformer for Interpretable Clinical Trial Duration Prediction
Subjects: Machine Learning (cs.LG)
[200]  arXiv:2404.13224 [pdf, other]
Title: Model-Based Counterfactual Explanations Incorporating Feature Space Attributes for Tabular Data
Comments: 11 pages, 5 figures, 8 tables
Subjects: Machine Learning (cs.LG)
[201]  arXiv:2404.13218 [pdf, other]
Title: On the Temperature of Machine Learning Systems
Authors: Dong Zhang
Comments: 44 pages, 8 figures, 3 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[202]  arXiv:2404.13194 [pdf, other]
Title: Privacy-Preserving Debiasing using Data Augmentation and Machine Unlearning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[203]  arXiv:2404.13182 [pdf, other]
Title: Spectral Convolutional Conditional Neural Processes
Subjects: Machine Learning (cs.LG)
[204]  arXiv:2404.13139 [pdf, other]
Title: Explainable AI for Fair Sepsis Mortality Predictive Model
Comments: Accepted to the 22nd International Conference on Artificial Intelligence in Medicine (AIME'24)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[205]  arXiv:2404.13056 [pdf, other]
Title: Variational Bayesian Optimal Experimental Design with Normalizing Flows
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
[206]  arXiv:2404.14408 (cross-list from cs.CL) [pdf, other]
Title: SpaceByte: Towards Deleting Tokenization from Large Language Modeling
Authors: Kevin Slagle
Comments: 9+9 pages, 3+1 figures, 2+4 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[207]  arXiv:2404.14402 (cross-list from math.AP) [pdf, ps, other]
Title: A mean curvature flow arising in adversarial training
Subjects: Analysis of PDEs (math.AP); Machine Learning (cs.LG)
[208]  arXiv:2404.14397 (cross-list from cs.CL) [pdf, other]
[209]  arXiv:2404.14395 (cross-list from cs.CL) [pdf, other]
Title: PARAMANU-GANITA: Language Model with Mathematical Capabilities
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[210]  arXiv:2404.14389 (cross-list from cs.NI) [pdf, other]
Title: Poisoning Attacks on Federated Learning-based Wireless Traffic Prediction
Comments: Accepted by IFIP/IEEE Networking 2024
Subjects: Networking and Internet Architecture (cs.NI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[211]  arXiv:2404.14358 (cross-list from math.OC) [pdf, other]
Title: A General Continuous-Time Formulation of Stochastic ADMM and Its Variants
Authors: Chris Junchi Li
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[212]  arXiv:2404.14332 (cross-list from hep-ex) [pdf, other]
Title: Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion
Comments: Submission to SciPost
Subjects: High Energy Physics - Experiment (hep-ex); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); High Energy Physics - Phenomenology (hep-ph)
[213]  arXiv:2404.14322 (cross-list from eess.IV) [pdf, ps, other]
Title: A Novel Approach to Chest X-ray Lung Segmentation Using U-net and Modified Convolutional Block Attention Module
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[214]  arXiv:2404.14319 (cross-list from eess.SY) [pdf, other]
Title: Multi-Agent Hybrid SAC for Joint SS-DSA in CRNs
Comments: 10 pages. Currently under review for ACM MobiHoc 2024
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
[215]  arXiv:2404.14312 (cross-list from math.NA) [pdf, other]
Title: Structure-preserving neural networks for the regularzied entropy-based closure of the Boltzmann moment system
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
[216]  arXiv:2404.14276 (cross-list from stat.ML) [pdf, other]
Title: A Bayesian Approach for Prioritising Driving Behaviour Investigations in Telematic Auto Insurance Policies
Comments: International Congress of Actuaries (2023)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[217]  arXiv:2404.14270 (cross-list from cs.CL) [pdf, other]
Title: What do Transformers Know about Government?
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[218]  arXiv:2404.14244 (cross-list from cs.CR) [pdf, other]
Title: AI-Generated Faces in the Real World: A Large-Scale Case Study of Twitter Profile Images
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[219]  arXiv:2404.14243 (cross-list from cs.IR) [pdf, other]
Title: Turbo-CF: Matrix Decomposition-Free Graph Filtering for Fast Recommendation
Comments: 5 pages, 4 figures, 4 tables; 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024) (to appear) (Please cite our conference version.)
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[220]  arXiv:2404.14240 (cross-list from cs.IR) [pdf, other]
Title: Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity
Comments: 10 pages, 6 figures, 4 tables; 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024) (to appear) (Please cite our conference version.)
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[221]  arXiv:2404.14233 (cross-list from cs.CV) [pdf, other]
Title: Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[222]  arXiv:2404.14212 (cross-list from physics.comp-ph) [pdf, other]
Title: Toward Routing River Water in Land Surface Models with Recurrent Neural Networks
Comments: 27 pages, 10 figures, to be submitted in HESS (EGU)
Subjects: Computational Physics (physics.comp-ph); Machine Learning (cs.LG); Geophysics (physics.geo-ph)
[223]  arXiv:2404.14188 (cross-list from eess.IV) [pdf, other]
Title: Experimental Validation of Ultrasound Beamforming with End-to-End Deep Learning for Single Plane Wave Imaging
Comments: 8 pages, 9 figures, currently submitted to IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[224]  arXiv:2404.14146 (cross-list from cond-mat.mtrl-sci) [pdf, ps, other]
Title: Physics-based reward driven image analysis in microscopy
Comments: 12 pages, 4 figures
Subjects: Materials Science (cond-mat.mtrl-sci); Machine Learning (cs.LG)
[225]  arXiv:2404.14068 (cross-list from cs.AI) [pdf, other]
Title: Holistic Safety and Responsibility Evaluations of Advanced AI Models
Comments: 10 pages excluding bibliography
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[226]  arXiv:2404.14063 (cross-list from cs.SD) [pdf, other]
Title: LVNS-RAVE: Diversified audio generation with RAVE and Latent Vector Novelty Search
Comments: Accepted to GECCO 24 Companion
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[227]  arXiv:2404.14062 (cross-list from cs.CV) [pdf, other]
Title: GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[228]  arXiv:2404.14033 (cross-list from cs.DC) [pdf, other]
Title: Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous Environments
Comments: Accepted at IEEE/ACM CCGrid'24
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[229]  arXiv:2404.14027 (cross-list from cs.CV) [pdf, other]
Title: OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[230]  arXiv:2404.13941 (cross-list from eess.SY) [pdf, other]
Title: Autoencoder-assisted Feature Ensemble Net for Incipient Faults
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[231]  arXiv:2404.13885 (cross-list from cs.CY) [pdf, other]
Title: Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[232]  arXiv:2404.13831 (cross-list from math.OC) [pdf, other]
Title: Data-Driven Performance Guarantees for Classical and Learned Optimizers
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[233]  arXiv:2404.13808 (cross-list from cs.IR) [pdf, other]
Title: General Item Representation Learning for Cold-start Content Recommendations
Comments: 14 pages
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[234]  arXiv:2404.13804 (cross-list from cs.DC) [pdf, other]
Title: Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless Networks
Comments: Published in IEEE Transactions on Mobile Computing (TMC). arXiv admin note: substantial text overlap with arXiv:2112.11256
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI); Systems and Control (eess.SY)
[235]  arXiv:2404.13786 (cross-list from eess.SY) [pdf, other]
Title: Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[236]  arXiv:2404.13779 (cross-list from cs.CL) [pdf, other]
Title: Automated Text Mining of Experimental Methodologies from Biomedical Literature
Authors: Ziqing Guo
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[237]  arXiv:2404.13770 (cross-list from cs.CV) [pdf, other]
Title: EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder
Comments: 15 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[238]  arXiv:2404.13731 (cross-list from stat.ML) [pdf, ps, other]
Title: Training-Conditional Coverage Bounds for Uniformly Stable Learning Algorithms
Comments: Accepted to the ISIT 2024 workshop on Information-Theoretic Methods for Trustworthy Machine Learning (IT-TML)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[239]  arXiv:2404.13706 (cross-list from cs.CV) [pdf, other]
Title: Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[240]  arXiv:2404.13704 (cross-list from eess.IV) [pdf, other]
Title: PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[241]  arXiv:2404.13702 (cross-list from astro-ph.CO) [pdf, other]
Title: Learning Galaxy Intrinsic Alignment Correlations
Comments: 15 pages, 6 figures, 1 table. Accepted at the Data-centric Machine Learning Research (DMLR) Workshop at ICLR 2024
Subjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); Astrophysics of Galaxies (astro-ph.GA); Machine Learning (cs.LG)
[242]  arXiv:2404.13701 (cross-list from cs.CV) [pdf, other]
Title: Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[243]  arXiv:2404.13698 (cross-list from cs.RO) [pdf, other]
Title: Resampling-free Particle Filters in High-dimensions
Comments: Published at ICRA 2024, 7 pages, 5 figures
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Machine Learning (stat.ML)
[244]  arXiv:2404.13690 (cross-list from cs.CR) [pdf, other]
Title: Detecting Compromised IoT Devices Using Autoencoders with Sequential Hypothesis Testing
Comments: 2023 IEEE International Conference on Big Data (BigData)
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[245]  arXiv:2404.13682 (cross-list from cs.DB) [pdf, other]
Title: Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie
Comments: Pre-print of paper accepted at SIGMOD (DEEM2024)
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[246]  arXiv:2404.13671 (cross-list from cs.CV) [pdf, other]
Title: FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[247]  arXiv:2404.13669 (cross-list from math.OC) [pdf, other]
Title: Rate Analysis of Coupled Distributed Stochastic Approximation for Misspecified Optimization
Comments: 27 pages, 6 figures
Subjects: Optimization and Control (math.OC); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[248]  arXiv:2404.13652 (cross-list from cs.RO) [pdf, other]
Title: BANSAI: Towards Bridging the AI Adoption Gap in Industrial Robotics with Neurosymbolic Programming
Comments: 6 pages, 3 figures, accepted at the 2024 CIRP International Conference on Manufacturing Systems (CMS)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[249]  arXiv:2404.13649 (cross-list from stat.ML) [pdf, other]
Title: Distributional Principal Autoencoders
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[250]  arXiv:2404.13648 (cross-list from cs.CV) [pdf, other]
Title: Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Comments: Accepted by ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ total of 632 entries: 1-250 | 251-500 | 501-632 ]
[ showing 250 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)