We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 100

[ total of 632 entries: 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | 251-300 | ... | 601-632 ]
[ showing 50 entries per page: fewer | more | all ]

Wed, 24 Apr 2024 (continued, showing last 29 of 129 entries)

[101]  arXiv:2404.14795 (cross-list from cs.CL) [pdf, other]
Title: Talk Too Much: Poisoning Large Language Models under Token Limit
Comments: 20 pages
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[102]  arXiv:2404.14786 (cross-list from cs.AI) [pdf, other]
Title: LLM-Enhanced Causal Discovery in Temporal Domain from Interventional Data
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Methodology (stat.ME)
[103]  arXiv:2404.14777 (cross-list from cs.CL) [pdf, other]
Title: CT-Agent: Clinical Trial Multi-Agent with Large Language Model-based Reasoning
Authors: Ling Yue, Tianfan Fu
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[104]  arXiv:2404.14760 (cross-list from cs.CL) [pdf, other]
Title: Retrieval Augmented Generation for Domain-specific Question Answering
Comments: AAAI 2024 (Association for the Advancement of Artificial Intelligence) Scientific Document Understanding Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[105]  arXiv:2404.14758 (cross-list from math.OC) [pdf, other]
Title: Second-order Information Promotes Mini-Batch Robustness in Variance-Reduced Gradients
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
[106]  arXiv:2404.14743 (cross-list from stat.ML) [pdf, other]
Title: Gradient Guidance for Diffusion Models: An Optimization Perspective
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[107]  arXiv:2404.14700 (cross-list from eess.AS) [pdf, other]
Title: FlashSpeech: Efficient Zero-Shot Speech Synthesis
Comments: Efficient zero-shot speech synthesis
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[108]  arXiv:2404.14680 (cross-list from cs.CL) [pdf, other]
Title: Automated Multi-Language to English Machine Translation Using Generative Pre-Trained Transformers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[109]  arXiv:2404.14661 (cross-list from cs.CV) [pdf, other]
Title: First Mapping the Canopy Height of Primeval Forests in the Tallest Tree Area of Asia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Earth and Planetary Astrophysics (astro-ph.EP); Machine Learning (cs.LG)
[110]  arXiv:2404.14653 (cross-list from cs.CV) [pdf, ps, other]
Title: Machine Vision Based Assessment of Fall Color Changes in Apple Trees: Exploring Relationship with Leaf Nitrogen Concentration
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[111]  arXiv:2404.14651 (cross-list from nlin.AO) [pdf, other]
Title: Forecasting the Forced Van der Pol Equation with Frequent Phase Shifts Using a Reservoir Computer
Subjects: Adaptation and Self-Organizing Systems (nlin.AO); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[112]  arXiv:2404.14631 (cross-list from cs.CL) [pdf, other]
Title: Learning Word Embedding with Better Distance Weighting and Window Size Scheduling
Authors: Chaohao Yang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[113]  arXiv:2404.14625 (cross-list from cs.RO) [pdf, other]
Title: Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
Comments: Accepted at the Genetic and Evolutionary Computation Conference 2024 Evolutionary Machine Learning track as a full paper
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[114]  arXiv:2404.14619 (cross-list from cs.CL) [pdf, other]
Title: OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115]  arXiv:2404.14602 (cross-list from eess.SY) [pdf, other]
Title: Adaptive Bayesian Optimization for High-Precision Motion Systems
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Robotics (cs.RO)
[116]  arXiv:2404.14586 (cross-list from cs.IT) [pdf, other]
Title: Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy Channels
Comments: Submitted to IEEE Transactions on Communications
Subjects: Information Theory (cs.IT); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[117]  arXiv:2404.14551 (cross-list from hep-th) [pdf, other]
Title: Learning S-Matrix Phases with Neural Operators
Comments: 36 pages, 8 figures
Subjects: High Energy Physics - Theory (hep-th); Machine Learning (cs.LG)
[118]  arXiv:2404.14527 (cross-list from cs.DC) [pdf, other]
Title: Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[119]  arXiv:2404.14507 (cross-list from cs.CV) [pdf, other]
Title: Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120]  arXiv:2404.14497 (cross-list from cs.NI) [pdf, other]
Title: Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning
Comments: Accepted by IFIP/IEEE Networking 2024
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG); Signal Processing (eess.SP)
[121]  arXiv:2404.14463 (cross-list from cs.CL) [pdf, other]
Title: DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews
Comments: Accepted to Clinical NLP workshop at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[122]  arXiv:2404.14461 (cross-list from cs.CL) [pdf, other]
Title: Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
Comments: Competition Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[123]  arXiv:2404.14460 (cross-list from stat.ML) [pdf, other]
Title: Inference of Causal Networks using a Topological Threshold
Comments: 17 pages, 12 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[124]  arXiv:2404.14449 (cross-list from cs.CL) [pdf, ps, other]
Title: Predicting Question Quality on StackOverflow with Neural Networks
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[125]  arXiv:2404.14441 (cross-list from cs.CV) [pdf, ps, other]
Title: Optimizing Contrail Detection: A Deep Learning Approach with EfficientNet-b4 Encoding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[126]  arXiv:2404.14419 (cross-list from cs.SE) [pdf, other]
Title: Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[127]  arXiv:2404.14418 (cross-list from cs.SI) [pdf, other]
Title: Mitigating Cascading Effects in Large Adversarial Graph Environments
Comments: 10 pages, 7 figures
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[128]  arXiv:2404.14416 (cross-list from physics.geo-ph) [pdf, other]
Title: Conditional diffusion models for downscaling & bias correction of Earth system model precipitation
Subjects: Geophysics (physics.geo-ph); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Atmospheric and Oceanic Physics (physics.ao-ph)
[129]  arXiv:2404.13630 (cross-list from cs.SE) [pdf, ps, other]
Title: Utilizing Deep Learning to Optimize Software Development Processes
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)

Tue, 23 Apr 2024 (showing first 21 of 186 entries)

[130]  arXiv:2404.14388 [pdf, other]
Title: STROOBnet Optimization via GPU-Accelerated Proximal Recurrence Strategies
Comments: 10 pages, 17 figures, 2023 IEEE International Conference on Big Data (BigData)
Journal-ref: 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 2023, pp. 2920-2929
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[131]  arXiv:2404.14367 [pdf, other]
Title: Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Subjects: Machine Learning (cs.LG)
[132]  arXiv:2404.14326 [pdf, ps, other]
Title: Machine Learning Techniques for MRI Data Processing at Expanding Scale
Authors: Taro Langner
Comments: Book chapter pre-print
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[133]  arXiv:2404.14271 [pdf, other]
Title: Sparse Explanations of Neural Networks Using Pruned Layer-Wise Relevance Propagation
Comments: 15 pages, 5 figures
Subjects: Machine Learning (cs.LG)
[134]  arXiv:2404.14265 [pdf, other]
Title: Deep Learning as Ricci Flow
Subjects: Machine Learning (cs.LG); Differential Geometry (math.DG)
[135]  arXiv:2404.14202 [pdf, other]
Title: Rotting Infinitely Many-armed Bandits beyond the Worst-case Rotting: An Adaptive Approach
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[136]  arXiv:2404.14197 [pdf, other]
Title: SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion
Subjects: Machine Learning (cs.LG)
[137]  arXiv:2404.14164 [pdf, other]
Title: New Solutions Based on the Generalized Eigenvalue Problem for the Data Collaboration Analysis
Comments: 16 pages, 9 figures, preprint
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[138]  arXiv:2404.14161 [pdf, other]
Title: Multidimensional Interpolants
Authors: Dohoon Lee, Kyogu Lee
Comments: 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[139]  arXiv:2404.14107 [pdf, other]
Title: PGNAA Spectral Classification of Aluminium and Copper Alloys with Machine Learning
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci)
[140]  arXiv:2404.14076 [pdf, other]
Title: Noise contrastive estimation with soft targets for conditional models
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[141]  arXiv:2404.14073 [pdf, other]
Title: Towards Robust Trajectory Representations: Isolating Environmental Confounders with Causal Learning
Comments: The paper has been accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[142]  arXiv:2404.14064 [pdf, other]
Title: Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2404.14061 [pdf, other]
Title: FedTAD: Topology-aware Data-free Knowledge Distillation for Subgraph Federated Learning
Comments: Accepted by IJCAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Social and Information Networks (cs.SI)
[144]  arXiv:2404.14047 [pdf, other]
Title: How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study
Subjects: Machine Learning (cs.LG)
[145]  arXiv:2404.14017 [pdf, other]
Title: Hybrid Ensemble-Based Travel Mode Prediction
Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Advances in Intelligent Data Analysis XXII. IDA 2024. Lecture Notes in Computer Science, vol 14641. Springer, and is available online at Cham this https URL The preprint includes 12+22 pages, 1+1 figures
Journal-ref: Advances in Intelligent Data Analysis XXII, IDA 2024, LNCS, vol 14641, (2024), 191-202
Subjects: Machine Learning (cs.LG)
[146]  arXiv:2404.14016 [pdf, other]
Title: Ungeneralizable Examples
Comments: Accepted by CVPR2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[147]  arXiv:2404.14006 [pdf, other]
Title: Distilled Datamodel with Reverse Gradient Matching
Comments: Accepted by CVPR2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[148]  arXiv:2404.13990 [pdf, other]
Title: QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models -- Extended Version
Comments: 15 pages. An extended version of "QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models" accepted at PVLDB 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[149]  arXiv:2404.13964 [pdf, other]
Title: An Economic Solution to Copyright Challenges of Generative AI
Subjects: Machine Learning (cs.LG); General Economics (econ.GN); Methodology (stat.ME)
[150]  arXiv:2404.13954 [pdf, ps, other]
Title: A survey of air combat behavior modeling using machine learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[ total of 632 entries: 1-50 | 51-100 | 101-150 | 151-200 | 201-250 | 251-300 | ... | 601-632 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)