We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions, skipping first 103

[ total of 1248 entries: 1-50 | 4-53 | 54-103 | 104-153 | 154-203 | 204-253 | 254-303 | ... | 1204-1248 ]
[ showing 50 entries per page: fewer | more | all ]

Fri, 31 May 2024 (continued, showing 50 of 181 entries)

[104]  arXiv:2405.20165 (cross-list from stat.ML) [pdf, other]
Title: Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[105]  arXiv:2405.20139 (cross-list from cs.CL) [pdf, other]
Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[106]  arXiv:2405.20127 (cross-list from math.OC) [pdf, other]
Title: SPAM: Stochastic Proximal Point Method with Momentum Variance Reduction for Non-convex Cross-Device Federated Learning
Comments: The main part of the paper is around 9 pages. It contains the proposed algorithms, the main theoretical results and the experimental setting. The proofs of the main results and other technicalities are deferred to the Appendix
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
[107]  arXiv:2405.20124 (cross-list from stat.ML) [pdf, other]
Title: A Geometric Unification of Distributionally Robust Covariance Estimators: Shrinking the Spectrum by Inflating the Ambiguity Set
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
[108]  arXiv:2405.20094 (cross-list from math.NA) [pdf, other]
Title: Low-dimensional approximations of the conditional law of Volterra processes: a non-positive curvature approach
Comments: Main body: 25 Pages, Appendices 29 Pages, 14 Tables, 6 Figures
Subjects: Numerical Analysis (math.NA); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Differential Geometry (math.DG); Computational Finance (q-fin.CP)
[109]  arXiv:2405.20091 (cross-list from cs.CV) [pdf, other]
Title: Visual Attention Analysis in Online Learning
Comments: Accepted in CEDI 2024 (VII Congreso Espa\~nol de Inform\'atica), A Coru\~na, Spain
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[110]  arXiv:2405.20086 (cross-list from math.ST) [pdf, other]
Title: Analysis of a multi-target linear shrinkage covariance estimator
Authors: Benoit Oriol
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[111]  arXiv:2405.20079 (cross-list from cs.CL) [pdf, other]
Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[112]  arXiv:2405.20071 (cross-list from physics.med-ph) [pdf, ps, other]
Title: A Staged Approach using Machine Learning and Uncertainty Quantification to Predict the Risk of Hip Fracture
Comments: 29 pages, 5 figures, 6 tables
Subjects: Medical Physics (physics.med-ph); Machine Learning (cs.LG)
[113]  arXiv:2405.20053 (cross-list from cs.CL) [pdf, other]
Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[114]  arXiv:2405.20052 (cross-list from eess.SP) [pdf, other]
Title: A Hardware-Efficient EMG Decoder with an Attractor-based Neural Network for Next-Generation Hand Prostheses
Comments: \c{opyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG)
[115]  arXiv:2405.20039 (cross-list from stat.ML) [pdf, other]
Title: Task-Agnostic Machine Learning-Assisted Inference
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[116]  arXiv:2405.20018 (cross-list from cs.MA) [pdf, other]
Title: Safe Multi-agent Reinforcement Learning with Natural Language Constraints
Comments: 23 pages, 6 figures
Subjects: Multiagent Systems (cs.MA); Computation and Language (cs.CL); Machine Learning (cs.LG)
[117]  arXiv:2405.19995 (cross-list from stat.ML) [pdf, other]
Title: Symmetries in Overparametrized Neural Networks: A Mean-Field View
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
[118]  arXiv:2405.19988 (cross-list from cs.RO) [pdf, other]
Title: Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
Comments: 10 pages in the main text, 16 pages including references and supplementary materials. 4 figures and 3 tables in the main text, 1 table in supplementary materials
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[119]  arXiv:2405.19985 (cross-list from stat.ME) [pdf, other]
Title: Targeted Sequential Indirect Experiment Design
Subjects: Methodology (stat.ME); Machine Learning (cs.LG)
[120]  arXiv:2405.19977 (cross-list from cs.DS) [pdf, other]
Title: Consistent Submodular Maximization
Comments: To appear at ICML 24
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)
[121]  arXiv:2405.19971 (cross-list from cs.CR) [pdf, other]
Title: GasTrace: Detecting Sandwich Attack Malicious Accounts in Ethereum
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[122]  arXiv:2405.19967 (cross-list from cs.CL) [pdf, other]
Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[123]  arXiv:2405.19954 (cross-list from cs.CR) [pdf, other]
Title: GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[124]  arXiv:2405.19931 (cross-list from cs.CV) [pdf, other]
Title: Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks
Comments: Preprint. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[125]  arXiv:2405.19912 (cross-list from stat.ML) [pdf, other]
Title: Robust Kernel Hypothesis Testing under Data Corruption
Comments: 26 pages, 2 figures, 2 algorithms
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[126]  arXiv:2405.19889 (cross-list from eess.SP) [pdf, other]
Title: Deep Joint Semantic Coding and Beamforming for Near-Space Airship-Borne Massive MIMO Network
Comments: Major Revision by IEEE JSAC
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[127]  arXiv:2405.19886 (cross-list from cs.NI) [pdf, other]
Title: Federated Learning with Multi-resolution Model Broadcast
Subjects: Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
[128]  arXiv:2405.19874 (cross-list from cs.CL) [pdf, other]
Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?
Comments: Preprint. Code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[129]  arXiv:2405.19805 (cross-list from cs.CC) [pdf, ps, other]
Title: Complexity of Deciding Injectivity and Surjectivity of ReLU Neural Networks
Comments: 17 pages
Subjects: Computational Complexity (cs.CC); Discrete Mathematics (cs.DM); Machine Learning (cs.LG)
[130]  arXiv:2405.19787 (cross-list from cs.CL) [pdf, other]
Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[131]  arXiv:2405.19784 (cross-list from cs.DB) [pdf, ps, other]
Title: PixelsDB: Serverless and Natural-Language-Aided Data Analytics with Flexible Service Levels and Prices
Comments: 4 pages, 3 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[132]  arXiv:2405.19783 (cross-list from cs.CV) [pdf, other]
Title: Instruction-Guided Visual Masking
Comments: preprint, 21 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[133]  arXiv:2405.19779 (cross-list from cs.NE) [pdf, other]
Title: Automatic Graph Topology-Aware Transformer
Comments: This work has been submitted to the IEEE (Under Second Review). Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Neural and Evolutionary Computing (cs.NE); Graphics (cs.GR); Machine Learning (cs.LG)
[134]  arXiv:2405.19760 (cross-list from stat.ML) [pdf, ps, other]
Title: Identifiability of a statistical model with two latent vectors: Importance of the dimensionality relation and application to graph embedding
Authors: Hiroaki Sasaki
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
[135]  arXiv:2405.19732 (cross-list from cs.CV) [pdf, other]
Title: Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[136]  arXiv:2405.19730 (cross-list from cs.AI) [pdf, ps, other]
Title: Research on Foundation Model for Spatial Data Intelligence: China's 2024 White Paper on Strategic Development of Spatial Data Intelligence
Authors: Shaohua Wang (1), Xing Xie (2), Yong Li (3), Danhuai Guo (4), Zhi Cai (5), Yu Liu (6), Yang Yue (7), Xiao Pan (8), Feng Lu (9), Huayi Wu (10), Zhipeng Gui (10), Zhiming Ding (11), Bolong Zheng (12), Fuzheng Zhang (13), Tao Qin (2), Jingyuan Wang (14), Chuang Tao (15), Zhengchao Chen (1), Hao Lu (16), Jiayi Li (10), Hongyang Chen (17), Peng Yue (10), Wenhao Yu (18), Yao Yao (18), Leilei Sun (14), Yong Zhang (5), Longbiao Chen (19), Xiaoping Du (20), Xiang Li (21), Xueying Zhang (22), Kun Qin (10), Zhaoya Gong (6), Weihua Dong (23), Xiaofeng Meng (24) ((1) Aerospace Information Research Institute, Chinese Academy of Sciences,(2) Microsoft Research Asia, (3) Tsinghua University, (4) Beijing University of Chemical Technology, (5) Beijing University of Technology, (6) Peking University, (7) Shenzhen University, (8) Shijiazhuang Tiedao University, (9) Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, (10) Wuhan University, (11) Institute of Software, Chinese Academy of Sciences, (12) Huazhong University of Science and Technology, (13) Kuaishou Natural Language Processing Center and Audio Center, (14) Beijing University of Aeronautics and Astronautics, (15) Shanghai Figure Interesting Information Technology Co., Ltd., (16) SuperMap Software Co., Ltd., (17) Zhejiang Lab, (18) China University of Geosciences (Wuhan), (19) Xiamen University, (20) Key Laboratory of Digital Earth, Chinese Academy of Sciences, (21) East China Normal University, (22) Nanjing Normal University, (23) Beijing Normal University, (24) Renmin University of China)
Comments: in Chinese language
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[137]  arXiv:2405.19715 (cross-list from cs.CL) [pdf, other]
Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[138]  arXiv:2405.19704 (cross-list from stat.ML) [pdf, other]
Title: Enhancing Sufficient Dimension Reduction via Hellinger Correlation
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[139]  arXiv:2405.19697 (cross-list from math.OC) [pdf, other]
Title: Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Comments: 43 pages, 1 figure, 1 table
Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[140]  arXiv:2405.19683 (cross-list from cs.CR) [pdf, other]
Title: Breaking Indistinguishability with Transfer Learning: A First Look at SPECK32/64 Lightweight Block Ciphers
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[141]  arXiv:2405.19681 (cross-list from stat.ML) [pdf, other]
Title: Bayesian Online Natural Gradient (BONG)
Comments: 41 pages, 11 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Computation (stat.CO)
[142]  arXiv:2405.19672 (cross-list from eess.IV) [pdf, other]
Title: CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[143]  arXiv:2405.19665 (cross-list from eess.SY) [pdf, ps, other]
Title: A novel fault localization with data refinement for hydroelectric units
Comments: 6pages,4 figures,Conference on Decision and Control(CDC) conference
Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[144]  arXiv:2405.19648 (cross-list from cs.CL) [pdf, other]
Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[145]  arXiv:2405.19644 (cross-list from cs.CV) [pdf, other]
Title: EgoSurgery-Phase: A Dataset of Surgical Phase Recognition from Egocentric Open Surgery Videos
Comments: Early accepted by MICCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[146]  arXiv:2405.19616 (cross-list from cs.AI) [pdf, other]
Title: Easy Problems That LLMs Get Wrong
Comments: AutogenAI Ltd. Associated code at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[147]  arXiv:2405.19610 (cross-list from stat.ML) [pdf, other]
Title: Factor Augmented Tensor-on-Tensor Neural Networks
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
[148]  arXiv:2405.19586 (cross-list from cs.CV) [pdf, other]
Title: SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Comments: ICML 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[149]  arXiv:2405.19567 (cross-list from cs.AI) [pdf, other]
Title: Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding
Comments: Code available at: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[150]  arXiv:2405.19562 (cross-list from cs.CY) [pdf, other]
Title: Selective Explanations
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Machine Learning (cs.LG)
[151]  arXiv:2405.19553 (cross-list from math.ST) [pdf, ps, other]
Title: Convergence Bounds for Sequential Monte Carlo on Multimodal Distributions using Soft Decomposition
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
[152]  arXiv:2405.19544 (cross-list from cs.AI) [pdf, other]
Title: One-Shot Safety Alignment for Large Language Models via Optimal Dualization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[153]  arXiv:2405.19542 (cross-list from eess.SP) [pdf, other]
Title: Anatomical Region Recognition and Real-time Bone Tracking Methods by Dynamically Decoding A-Mode Ultrasound Signals
Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Robotics (cs.RO)
[ total of 1248 entries: 1-50 | 4-53 | 54-103 | 104-153 | 154-203 | 204-253 | 254-303 | ... | 1204-1248 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)