We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 350

[ total of 427 entries: 1-250 | 101-350 | 351-427 ]
[ showing 250 entries per page: fewer | more | all ]

Tue, 28 May 2024 (continued, showing last 5 of 126 entries)

[351]  arXiv:2405.15902 (cross-list from cs.CR) [pdf, other]
Title: Hacc-Man: An Arcade Game for Jailbreaking LLMs
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[352]  arXiv:2405.15877 (cross-list from cs.LG) [pdf, other]
Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[353]  arXiv:2405.15793 (cross-list from cs.SE) [pdf, other]
Title: SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Comments: First two authors contributed equally. Code and demo at this https URL
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[354]  arXiv:2405.15787 (cross-list from cs.IR) [pdf, ps, other]
Title: Extracting chemical food safety hazards from the scientific literature automatically using large language models
Comments: 31 pages, 5 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[355]  arXiv:2405.15784 (cross-list from cs.IR) [pdf, other]
Title: CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Mon, 27 May 2024

[356]  arXiv:2405.15765 [pdf, other]
Title: Scaling Laws for Discriminative Classification in Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[357]  arXiv:2405.15760 [pdf, other]
Title: GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction
Comments: Accepted to ACL 2024 (main conference)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[358]  arXiv:2405.15750 [pdf, other]
Title: Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see this http URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359]  arXiv:2405.15708 [pdf, other]
Title: EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[360]  arXiv:2405.15640 [pdf, other]
Title: GECKO: Generative Language Model for English, Code and Korean
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[361]  arXiv:2405.15604 [pdf, other]
Title: Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges
Comments: 35 pages, 2 figures, 2 tables, Under review
Subjects: Computation and Language (cs.CL)
[362]  arXiv:2405.15590 [pdf, ps, other]
Title: Profiling checkpointing schedules in adjoint ST-AD
Subjects: Computation and Language (cs.CL)
[363]  arXiv:2405.15585 [pdf, other]
Title: Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems
Subjects: Computation and Language (cs.CL)
[364]  arXiv:2405.15525 [pdf, other]
Title: Sparse Matrix in Large Language Model Fine-tuning
Comments: 14 pages
Subjects: Computation and Language (cs.CL)
[365]  arXiv:2405.15523 [pdf, other]
Title: Mosaic Memory: Fuzzy Duplication in Copyright Traps for Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[366]  arXiv:2405.15471 [pdf, other]
Title: Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Subjects: Computation and Language (cs.CL)
[367]  arXiv:2405.15454 [pdf, other]
Title: Linearly Controlled Language Generation with Performative Guarantees
Subjects: Computation and Language (cs.CL); Systems and Control (eess.SY)
[368]  arXiv:2405.15453 [pdf, other]
Title: Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[369]  arXiv:2405.15452 [pdf, other]
Title: Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top
Comments: 18 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[370]  arXiv:2405.15370 [pdf, other]
Title: Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection
Subjects: Computation and Language (cs.CL)
[371]  arXiv:2405.15349 [pdf, other]
Title: UnKE: Unstructured Knowledge Editing in Large Language Models
Subjects: Computation and Language (cs.CL)
[372]  arXiv:2405.15346 [pdf, other]
Title: BiSup: Bidirectional Quantization Error Suppression for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[373]  arXiv:2405.15334 [pdf, other]
Title: Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation
Subjects: Computation and Language (cs.CL)
[374]  arXiv:2405.15329 [pdf, other]
Title: Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework
Subjects: Computation and Language (cs.CL)
[375]  arXiv:2405.15320 [pdf, other]
Title: Organic Data-Driven Approach for Turkish Grammatical Error Correction and LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[376]  arXiv:2405.15319 [pdf, other]
Title: Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
Comments: Preprint; The project link: $\href{https://llm-stacking.github.io/}{this https URL}$
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[377]  arXiv:2405.15318 [pdf, other]
Title: Are Long-LLMs A Necessity For Long-Context Tasks?
Comments: 18 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[378]  arXiv:2405.15307 [pdf, other]
Title: Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
Comments: Accepted to ACL Findings 2024
Subjects: Computation and Language (cs.CL)
[379]  arXiv:2405.15306 [pdf, other]
Title: DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ
Comments: Project page: this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[380]  arXiv:2405.15208 [pdf, other]
Title: Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs
Comments: Accepted for publication at LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[381]  arXiv:2405.15202 [pdf, other]
Title: Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
Comments: accepted to NAACL2024 TrustNLP workshop
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[382]  arXiv:2405.15198 [pdf, other]
Title: RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference
Subjects: Computation and Language (cs.CL)
[383]  arXiv:2405.15185 [pdf, other]
Title: An Evaluation of Estimative Uncertainty in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[384]  arXiv:2405.15179 [pdf, other]
Title: VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks
Subjects: Computation and Language (cs.CL)
[385]  arXiv:2405.15165 [pdf, other]
Title: A Solution-based LLM API-using Methodology for Academic Information Seeking
Comments: 22 pages, 13 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[386]  arXiv:2405.15152 [pdf, other]
Title: Machine Unlearning in Large Language Models
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[387]  arXiv:2405.15134 [pdf, other]
Title: Efficient Biomedical Entity Linking: Clinical Text Standardization with Low-Resource Techniques
Subjects: Computation and Language (cs.CL)
[388]  arXiv:2405.15122 [pdf, other]
Title: Generalizable and Scalable Multistage Biomedical Concept Normalization Leveraging Large Language Models
Subjects: Computation and Language (cs.CL)
[389]  arXiv:2405.15110 [pdf, other]
Title: CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Comments: To appear in Findings ACL 2024
Subjects: Computation and Language (cs.CL)
[390]  arXiv:2405.15097 [pdf, other]
Title: Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding
Comments: Accepted NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[391]  arXiv:2405.15077 [pdf, other]
Title: Eliciting Informative Text Evaluations with Large Language Models
Comments: Accepted by the Twenty-Fifth ACM Conference on Economics and Computation (EC'24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[392]  arXiv:2405.15071 [pdf, other]
Title: Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Comments: 22 pages, 16 figures. Code and data: this https URL
Subjects: Computation and Language (cs.CL)
[393]  arXiv:2405.15070 [pdf, other]
Title: Optimizing example selection for retrieval-augmented machine translation with translation memories
Comments: TALN conference, French, 10 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[394]  arXiv:2405.15067 [pdf, other]
Title: Promoting Constructive Deliberation: Reframing for Receptiveness
Subjects: Computation and Language (cs.CL)
[395]  arXiv:2405.15064 [pdf, other]
Title: Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning
Comments: Camera-Ready version for IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[396]  arXiv:2405.15039 [pdf, other]
Title: CEEBERT: Cross-Domain Inference in Early Exit BERT
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[397]  arXiv:2405.15032 [pdf, other]
Title: Aya 23: Open Weight Releases to Further Multilingual Progress
Subjects: Computation and Language (cs.CL)
[398]  arXiv:2405.15028 [pdf, other]
Title: AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[399]  arXiv:2405.15012 [pdf, other]
Title: Extracting Prompts by Inverting LLM Outputs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[400]  arXiv:2405.15007 [pdf, other]
Title: RE-Adapt: Reverse Engineered Adaptation of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[401]  arXiv:2405.14992 [pdf, other]
Title: Linking In-context Learning in Transformers to Human Episodic Memory
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[402]  arXiv:2405.14962 [pdf, ps, other]
Title: Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction
Subjects: Computation and Language (cs.CL)
[403]  arXiv:2405.14899 [pdf, other]
Title: DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[404]  arXiv:2405.15766 (cross-list from cs.AI) [pdf, other]
Title: Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
Comments: ACL Findings 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[405]  arXiv:2405.15729 (cross-list from cs.SE) [pdf, other]
Title: Optimizing Large Language Models for OpenAPI Code Completion
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[406]  arXiv:2405.15683 (cross-list from cs.CV) [pdf, other]
Title: VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap
Comments: Preprint. Under review. Code will be released on paper acceptance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[407]  arXiv:2405.15638 (cross-list from cs.CV) [pdf, other]
Title: M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[408]  arXiv:2405.15556 (cross-list from cs.LG) [pdf, other]
Title: Certifiably Robust RAG against Retrieval Corruption
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[409]  arXiv:2405.15485 (cross-list from cs.AI) [pdf, other]
Title: Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[410]  arXiv:2405.15374 (cross-list from cs.IR) [pdf, other]
Title: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph
Comments: for the associated repository, see this http URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[411]  arXiv:2405.15362 (cross-list from cs.LG) [pdf, other]
Title: Pipeline Parallelism with Controllable Memory
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[412]  arXiv:2405.15302 (cross-list from cs.AI) [pdf, other]
Title: Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[413]  arXiv:2405.15232 (cross-list from cs.CV) [pdf, other]
Title: DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Comments: 25 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[414]  arXiv:2405.15216 (cross-list from cs.LG) [pdf, other]
Title: Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Comments: under review
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[415]  arXiv:2405.15189 (cross-list from cs.SE) [pdf, other]
Title: SOAP: Enhancing Efficiency of Generated Code via Self-Optimization
Comments: 31 pages, 18 figures, and 8 tables
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[416]  arXiv:2405.15145 (cross-list from cs.AI) [pdf, other]
Title: CulturePark: Boosting Cross-cultural Understanding in Large Language Models
Comments: Technical report; 28 pages
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
[417]  arXiv:2405.15143 (cross-list from cs.LG) [pdf, other]
Title: Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[418]  arXiv:2405.15130 (cross-list from cs.SE) [pdf, other]
Title: OptLLM: Optimal Assignment of Queries to Large Language Models
Comments: This paper is accepted by ICWS 2024
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[419]  arXiv:2405.15115 (cross-list from cs.LG) [pdf, other]
Title: Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[420]  arXiv:2405.15092 (cross-list from cs.AI) [pdf, other]
Title: Dissociation of Faithful and Unfaithful Reasoning in LLMs
Comments: code published at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[421]  arXiv:2405.15025 (cross-list from cs.LG) [pdf, other]
Title: OAC: Output-adaptive Calibration for Accurate Post-training Quantization
Authors: Ali Edalati (1), Alireza Ghaffari (1 and 2), Masoud Asgharian (2), Lu Hou (1), Boxing Chen (1), Vahid Partovi Nia (1) ((1) Huawei Noah's Ark Lab, (2) Department of Mathematics and Statistics, McGill University)
Comments: 20 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[422]  arXiv:2405.14982 (cross-list from cs.LG) [pdf, other]
Title: In-context Time Series Predictor
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[423]  arXiv:2405.14974 (cross-list from cs.CV) [pdf, other]
Title: LOVA3: Learning to Visual Question Answering, Asking and Assessment
Comments: The code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[424]  arXiv:2405.14917 (cross-list from cs.LG) [pdf, other]
Title: SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
Comments: 22 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[425]  arXiv:2405.14908 (cross-list from cs.LG) [pdf, other]
Title: Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[426]  arXiv:2405.14905 (cross-list from eess.IV) [pdf, other]
Title: Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation
Comments: The code is available at this https URL or this https URL
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[427]  arXiv:2405.14191 (cross-list from cs.CR) [pdf, other]
Title: S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
Comments: 18 pages, 11 figures
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[ total of 427 entries: 1-250 | 101-350 | 351-427 ]
[ showing 250 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)