Computation and Language

Authors and titles for recent submissions, skipping first 151

[ total of 540 entries: 1-45 | 17-61 | 62-106 | 107-151 | 152-196 | 197-241 | 242-286 | 287-331 | ... | 512-540 ]
[ showing 45 entries per page: fewer | more | all ]

Mon, 27 May 2024 (continued, showing 45 of 72 entries)

[152]  arXiv:2405.15202 [pdf, other]
Title: Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
Comments: accepted to NAACL 2024 TrustNLP workshop
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[153]  arXiv:2405.15198 [pdf, other]
Title: RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference
Subjects: Computation and Language (cs.CL)
[154]  arXiv:2405.15185 [pdf, other]
Title: An Evaluation of Estimative Uncertainty in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[155]  arXiv:2405.15179 [pdf, other]
Title: VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks
Subjects: Computation and Language (cs.CL)
[156]  arXiv:2405.15165 [pdf, other]
Title: A Solution-based LLM API-using Methodology for Academic Information Seeking
Comments: 22 pages, 13 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[157]  arXiv:2405.15152 [pdf, other]
Title: Machine Unlearning in Large Language Models
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[158]  arXiv:2405.15134 [pdf, other]
Title: Efficient Biomedical Entity Linking: Clinical Text Standardization with Low-Resource Techniques
Subjects: Computation and Language (cs.CL)
[159]  arXiv:2405.15122 [pdf, other]
Title: Generalizable and Scalable Multistage Biomedical Concept Normalization Leveraging Large Language Models
Subjects: Computation and Language (cs.CL)
[160]  arXiv:2405.15110 [pdf, other]
Title: CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Comments: To appear in Findings ACL 2024
Subjects: Computation and Language (cs.CL)
[161]  arXiv:2405.15097 [pdf, other]
Title: Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding
Comments: Accepted NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[162]  arXiv:2405.15077 [pdf, other]
Title: Eliciting Informative Text Evaluations with Large Language Models
Comments: Accepted by the Twenty-Fifth ACM Conference on Economics and Computation (EC'24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[163]  arXiv:2405.15071 [pdf, other]
Title: Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Comments: 22 pages, 16 figures. Code and data: this https URL
Subjects: Computation and Language (cs.CL)
[164]  arXiv:2405.15070 [pdf, other]
Title: Optimizing example selection for retrieval-augmented machine translation with translation memories
Comments: TALN conference, French, 10 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[165]  arXiv:2405.15067 [pdf, other]
Title: Promoting Constructive Deliberation: Reframing for Receptiveness
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2405.15064 [pdf, other]
Title: Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning
Comments: Camera-Ready version for IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[167]  arXiv:2405.15039 [pdf, other]
Title: CEEBERT: Cross-Domain Inference in Early Exit BERT
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[168]  arXiv:2405.15032 [pdf, other]
Title: Aya 23: Open Weight Releases to Further Multilingual Progress
Subjects: Computation and Language (cs.CL)
[169]  arXiv:2405.15028 [pdf, other]
Title: AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[170]  arXiv:2405.15012 [pdf, other]
Title: Extracting Prompts by Inverting LLM Outputs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[171]  arXiv:2405.15007 [pdf, other]
Title: RE-Adapt: Reverse Engineered Adaptation of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[172]  arXiv:2405.14992 [pdf, other]
Title: Linking In-context Learning in Transformers to Human Episodic Memory
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[173]  arXiv:2405.14962 [pdf, ps, other]
Title: Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction
Subjects: Computation and Language (cs.CL)
[174]  arXiv:2405.14899 [pdf, other]
Title: DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[175]  arXiv:2405.15766 (cross-list from cs.AI) [pdf, other]
Title: Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
Comments: ACL Findings 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[176]  arXiv:2405.15729 (cross-list from cs.SE) [pdf, other]
Title: Optimizing Large Language Models for OpenAPI Code Completion
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[177]  arXiv:2405.15683 (cross-list from cs.CV) [pdf, other]
Title: VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap
Comments: Preprint. Under review. Code will be released on paper acceptance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[178]  arXiv:2405.15638 (cross-list from cs.CV) [pdf, other]
Title: M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[179]  arXiv:2405.15556 (cross-list from cs.LG) [pdf, other]
Title: Certifiably Robust RAG against Retrieval Corruption
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[180]  arXiv:2405.15485 (cross-list from cs.AI) [pdf, other]
Title: Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181]  arXiv:2405.15374 (cross-list from cs.IR) [pdf, other]
Title: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph
Comments: for the associated repository, see this http URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[182]  arXiv:2405.15362 (cross-list from cs.LG) [pdf, other]
Title: Pipeline Parallelism with Controllable Memory
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[183]  arXiv:2405.15302 (cross-list from cs.AI) [pdf, other]
Title: Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[184]  arXiv:2405.15232 (cross-list from cs.CV) [pdf, other]
Title: DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Comments: 25 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[185]  arXiv:2405.15216 (cross-list from cs.LG) [pdf, other]
Title: Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Comments: under review
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[186]  arXiv:2405.15189 (cross-list from cs.SE) [pdf, other]
Title: SOAP: Enhancing Efficiency of Generated Code via Self-Optimization
Comments: 31 pages, 18 figures, and 8 tables
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[187]  arXiv:2405.15145 (cross-list from cs.AI) [pdf, other]
Title: CulturePark: Boosting Cross-cultural Understanding in Large Language Models
Comments: Technical report; 28 pages
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
[188]  arXiv:2405.15143 (cross-list from cs.LG) [pdf, other]
Title: Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[189]  arXiv:2405.15130 (cross-list from cs.SE) [pdf, other]
Title: OptLLM: Optimal Assignment of Queries to Large Language Models
Comments: This paper is accepted by ICWS 2024
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[190]  arXiv:2405.15115 (cross-list from cs.LG) [pdf, other]
Title: Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[191]  arXiv:2405.15092 (cross-list from cs.AI) [pdf, other]
Title: Dissociation of Faithful and Unfaithful Reasoning in LLMs
Comments: code published at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[192]  arXiv:2405.15025 (cross-list from cs.LG) [pdf, other]
Title: OAC: Output-adaptive Calibration for Accurate Post-training Quantization
Authors: Ali Edalati (1), Alireza Ghaffari (1 and 2), Masoud Asgharian (2), Lu Hou (1), Boxing Chen (1), Vahid Partovi Nia (1) ((1) Huawei Noah's Ark Lab, (2) Department of Mathematics and Statistics, McGill University)
Comments: 20 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[193]  arXiv:2405.14982 (cross-list from cs.LG) [pdf, other]
Title: In-context Time Series Predictor
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[194]  arXiv:2405.14974 (cross-list from cs.CV) [pdf, other]
Title: LOVA3: Learning to Visual Question Answering, Asking and Assessment
Comments: The code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[195]  arXiv:2405.14917 (cross-list from cs.LG) [pdf, other]
Title: SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
Comments: 22 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[196]  arXiv:2405.14908 (cross-list from cs.LG) [pdf, other]
Title: Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)