We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 166

[ total of 515 entries: 1-100 | 67-166 | 167-266 | 267-366 | 367-466 | 467-515 ]
[ showing 100 entries per page: fewer | more | all ]

Tue, 28 May 2024 (continued, showing last 41 of 126 entries)

[167]  arXiv:2405.17345 (cross-list from cs.AI) [pdf, other]
Title: Exploring and steering the moral compass of Large Language Models
Authors: Alejandro Tlaie
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168]  arXiv:2405.17217 (cross-list from cs.HC) [pdf, other]
Title: Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools
Authors: Daniel Buschek
Comments: 19 pages, 7 figures, 2 tables, ACM DIS 2024
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[169]  arXiv:2405.17130 (cross-list from cs.LG) [pdf, other]
Title: Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[170]  arXiv:2405.17104 (cross-list from cs.CV) [pdf, other]
Title: LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[171]  arXiv:2405.17088 (cross-list from cs.LG) [pdf, other]
Title: Phase Transitions in the Output Distribution of Large Language Models
Comments: 21 pages, 4 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172]  arXiv:2405.17076 (cross-list from cs.AI) [pdf, other]
Title: Leveraging small language models for Text2SPARQL tasks to improve the resilience of AI assistance
Comments: To appear in Proceedings of the Workshop on Linked Data-driven Resilience Research 2024 (D2R2) co-located with Extended Semantic Web Conference 2024 (ESWC 2024)
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[173]  arXiv:2405.17044 (cross-list from cs.AI) [pdf, other]
Title: Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models
Comments: 10 pages; 5 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG)
[174]  arXiv:2405.16994 (cross-list from cs.AI) [pdf, other]
Title: Vision-and-Language Navigation Generative Pretrained Transformer
Authors: Wen Hanlin
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[175]  arXiv:2405.16919 (cross-list from cs.CV) [pdf, other]
Title: VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[176]  arXiv:2405.16869 (cross-list from cs.AI) [pdf, other]
Title: Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion
Comments: Work in progress. Code and data will be released at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[177]  arXiv:2405.16845 (cross-list from cs.LG) [pdf, other]
Title: On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
Comments: 37pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[178]  arXiv:2405.16751 (cross-list from cs.AI) [pdf, other]
Title: LLM-Based Cooperative Agents using Information Relevance and Plan Validation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[179]  arXiv:2405.16712 (cross-list from cs.LG) [pdf, other]
Title: Zamba: A Compact 7B SSM Hybrid Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180]  arXiv:2405.16700 (cross-list from cs.CV) [pdf, other]
Title: Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Comments: Project page: this https URL 37 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181]  arXiv:2405.16682 (cross-list from cs.LG) [pdf, other]
Title: A Systematic Review of Federated Generative Models
Comments: 24 Pages, 3 Figures, 5 Tables
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[182]  arXiv:2405.16677 (cross-list from eess.AS) [pdf, other]
Title: Crossmodal ASR Error Correction with Discrete Speech Units
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[183]  arXiv:2405.16669 (cross-list from cs.HC) [pdf, other]
Title: Low-resourced Languages and Online Knowledge Repositories: A Need-Finding Study
Comments: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI 2024)
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[184]  arXiv:2405.16662 (cross-list from cs.LO) [pdf, ps, other]
Title: Conjunctive categorial grammars and Lambek grammars with additives
Comments: This article is an extended version of the conference presentation "Conjunctive categorial grammars" at the Mathematics of Language 2017 meeting (London, UK, July 13-14, 2017; proceedings published in ACL Anthology, W17-3414)
Subjects: Logic in Computer Science (cs.LO); Computation and Language (cs.CL); Logic (math.LO)
[185]  arXiv:2405.16640 (cross-list from cs.AI) [pdf, other]
Title: A Survey of Multimodal Large Language Model from A Data-centric Perspective
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[186]  arXiv:2405.16546 (cross-list from cs.IR) [pdf, other]
Title: Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
Comments: Accepted by Findings of ACL 2024; Datasets Link: this https URL
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[187]  arXiv:2405.16528 (cross-list from cs.LG) [pdf, other]
Title: LoQT: Low Rank Adapters for Quantized Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[188]  arXiv:2405.16510 (cross-list from cs.AI) [pdf, other]
Title: Meta-Task Planning for Language Agents
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[189]  arXiv:2405.16473 (cross-list from cs.CV) [pdf, other]
Title: M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought
Comments: Accepted at ACL2024 Main Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[190]  arXiv:2405.16442 (cross-list from cs.CY) [pdf, ps, other]
Title: Development of an open education resources (OER) system: a comparative analysis and implementation approach
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[191]  arXiv:2405.16434 (cross-list from cs.AI) [pdf, other]
Title: The Importance of Directional Feedback for LLM-based Optimizers
Comments: Presented at Foundation Models for Decision Making at NeurIPS 2023
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[192]  arXiv:2405.16413 (cross-list from cs.AI) [pdf, other]
Title: Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Applications (stat.AP)
[193]  arXiv:2405.16411 (cross-list from cs.LG) [pdf, other]
Title: Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[194]  arXiv:2405.16406 (cross-list from cs.LG) [pdf, other]
Title: SpinQuant -- LLM quantization with learned rotations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2405.16247 (cross-list from cs.AI) [pdf, other]
Title: AutoManual: Generating Instruction Manuals by LLM Agents via Interactive Environmental Learning
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[196]  arXiv:2405.16205 (cross-list from cs.AI) [pdf, ps, other]
Title: GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases
Comments: 30 pages with 10 figures and/or tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[197]  arXiv:2405.16136 (cross-list from cs.AI) [pdf, other]
Title: C3LLM: Conditional Multimodal Content Generation Using Large Language Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[198]  arXiv:2405.16128 (cross-list from cs.AI) [pdf, other]
Title: How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect
Comments: To appear at CogSci 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[199]  arXiv:2405.16122 (cross-list from cs.AI) [pdf, other]
Title: Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars
Comments: 23 pages, 1 figure, 23 tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[200]  arXiv:2405.16043 (cross-list from cs.LG) [pdf, other]
Title: Theoretical Analysis of Weak-to-Strong Generalization
Comments: 36 pages, 3 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[201]  arXiv:2405.15973 (cross-list from cs.CV) [pdf, other]
Title: Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Comments: 15 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[202]  arXiv:2405.15943 (cross-list from cs.LG) [pdf, other]
Title: Transformers represent belief state geometry in their residual stream
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[203]  arXiv:2405.15902 (cross-list from cs.CR) [pdf, other]
Title: Hacc-Man: An Arcade Game for Jailbreaking LLMs
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[204]  arXiv:2405.15877 (cross-list from cs.LG) [pdf, other]
Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[205]  arXiv:2405.15793 (cross-list from cs.SE) [pdf, other]
Title: SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Comments: First two authors contributed equally. Code and demo at this https URL
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[206]  arXiv:2405.15787 (cross-list from cs.IR) [pdf, ps, other]
Title: Extracting chemical food safety hazards from the scientific literature automatically using large language models
Comments: 31 pages, 5 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[207]  arXiv:2405.15784 (cross-list from cs.IR) [pdf, other]
Title: CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Mon, 27 May 2024 (showing first 59 of 72 entries)

[208]  arXiv:2405.15765 [pdf, other]
Title: Scaling Laws for Discriminative Classification in Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[209]  arXiv:2405.15760 [pdf, other]
Title: GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction
Comments: Accepted to ACL 2024 (main conference)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[210]  arXiv:2405.15750 [pdf, other]
Title: Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see this http URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[211]  arXiv:2405.15708 [pdf, other]
Title: EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[212]  arXiv:2405.15640 [pdf, other]
Title: GECKO: Generative Language Model for English, Code and Korean
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[213]  arXiv:2405.15604 [pdf, other]
Title: Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges
Comments: 35 pages, 2 figures, 2 tables, Under review
Subjects: Computation and Language (cs.CL)
[214]  arXiv:2405.15590 [pdf, ps, other]
Title: Profiling checkpointing schedules in adjoint ST-AD
Subjects: Computation and Language (cs.CL)
[215]  arXiv:2405.15585 [pdf, other]
Title: Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems
Subjects: Computation and Language (cs.CL)
[216]  arXiv:2405.15525 [pdf, other]
Title: Sparse Matrix in Large Language Model Fine-tuning
Comments: 14 pages
Subjects: Computation and Language (cs.CL)
[217]  arXiv:2405.15523 [pdf, other]
Title: Mosaic Memory: Fuzzy Duplication in Copyright Traps for Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[218]  arXiv:2405.15471 [pdf, other]
Title: Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Subjects: Computation and Language (cs.CL)
[219]  arXiv:2405.15454 [pdf, other]
Title: Linearly Controlled Language Generation with Performative Guarantees
Subjects: Computation and Language (cs.CL); Systems and Control (eess.SY)
[220]  arXiv:2405.15453 [pdf, other]
Title: Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[221]  arXiv:2405.15452 [pdf, other]
Title: Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top
Comments: 18 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[222]  arXiv:2405.15370 [pdf, other]
Title: Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection
Subjects: Computation and Language (cs.CL)
[223]  arXiv:2405.15349 [pdf, other]
Title: UnKE: Unstructured Knowledge Editing in Large Language Models
Subjects: Computation and Language (cs.CL)
[224]  arXiv:2405.15346 [pdf, other]
Title: BiSup: Bidirectional Quantization Error Suppression for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[225]  arXiv:2405.15334 [pdf, other]
Title: Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation
Subjects: Computation and Language (cs.CL)
[226]  arXiv:2405.15329 [pdf, other]
Title: Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework
Subjects: Computation and Language (cs.CL)
[227]  arXiv:2405.15320 [pdf, other]
Title: Organic Data-Driven Approach for Turkish Grammatical Error Correction and LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[228]  arXiv:2405.15319 [pdf, other]
Title: Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
Comments: Preprint; The project link: $\href{https://llm-stacking.github.io/}{this https URL}$
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[229]  arXiv:2405.15318 [pdf, other]
Title: Are Long-LLMs A Necessity For Long-Context Tasks?
Comments: 18 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[230]  arXiv:2405.15307 [pdf, other]
Title: Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
Comments: Accepted to ACL Findings 2024
Subjects: Computation and Language (cs.CL)
[231]  arXiv:2405.15306 [pdf, other]
Title: DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ
Comments: Project page: this https URL
Subjects: Computation and Language (cs.CL)
[232]  arXiv:2405.15208 [pdf, other]
Title: Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs
Comments: Accepted for publication at LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[233]  arXiv:2405.15202 [pdf, other]
Title: Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
Comments: accepted to NAACL2024 TrustNLP workshop
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[234]  arXiv:2405.15198 [pdf, other]
Title: RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference
Subjects: Computation and Language (cs.CL)
[235]  arXiv:2405.15185 [pdf, other]
Title: An Evaluation of Estimative Uncertainty in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[236]  arXiv:2405.15179 [pdf, other]
Title: VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks
Subjects: Computation and Language (cs.CL)
[237]  arXiv:2405.15165 [pdf, other]
Title: A Solution-based LLM API-using Methodology for Academic Information Seeking
Comments: 22 pages, 13 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[238]  arXiv:2405.15152 [pdf, other]
Title: Machine Unlearning in Large Language Models
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[239]  arXiv:2405.15134 [pdf, other]
Title: Efficient Biomedical Entity Linking: Clinical Text Standardization with Low-Resource Techniques
Subjects: Computation and Language (cs.CL)
[240]  arXiv:2405.15122 [pdf, other]
Title: Generalizable and Scalable Multistage Biomedical Concept Normalization Leveraging Large Language Models
Subjects: Computation and Language (cs.CL)
[241]  arXiv:2405.15110 [pdf, other]
Title: CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Comments: To appear in Findings ACL 2024
Subjects: Computation and Language (cs.CL)
[242]  arXiv:2405.15097 [pdf, other]
Title: Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding
Comments: Accepted NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[243]  arXiv:2405.15077 [pdf, other]
Title: Eliciting Informative Text Evaluations with Large Language Models
Comments: Accepted by the Twenty-Fifth ACM Conference on Economics and Computation (EC'24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[244]  arXiv:2405.15071 [pdf, other]
Title: Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Comments: 22 pages, 16 figures. Code and data: this https URL
Subjects: Computation and Language (cs.CL)
[245]  arXiv:2405.15070 [pdf, other]
Title: Optimizing example selection for retrieval-augmented machine translation with translation memories
Comments: TALN conference, French, 10 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[246]  arXiv:2405.15067 [pdf, other]
Title: Promoting Constructive Deliberation: Reframing for Receptiveness
Subjects: Computation and Language (cs.CL)
[247]  arXiv:2405.15064 [pdf, other]
Title: Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning
Comments: Camera-Ready version for IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[248]  arXiv:2405.15039 [pdf, other]
Title: CEEBERT: Cross-Domain Inference in Early Exit BERT
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[249]  arXiv:2405.15032 [pdf, other]
Title: Aya 23: Open Weight Releases to Further Multilingual Progress
Subjects: Computation and Language (cs.CL)
[250]  arXiv:2405.15028 [pdf, other]
Title: AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[251]  arXiv:2405.15012 [pdf, other]
Title: Extracting Prompts by Inverting LLM Outputs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[252]  arXiv:2405.15007 [pdf, other]
Title: RE-Adapt: Reverse Engineered Adaptation of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[253]  arXiv:2405.14992 [pdf, other]
Title: Linking In-context Learning in Transformers to Human Episodic Memory
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[254]  arXiv:2405.14962 [pdf, ps, other]
Title: Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction
Subjects: Computation and Language (cs.CL)
[255]  arXiv:2405.14899 [pdf, other]
Title: DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[256]  arXiv:2405.15766 (cross-list from cs.AI) [pdf, other]
Title: Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
Comments: ACL Findings 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[257]  arXiv:2405.15729 (cross-list from cs.SE) [pdf, other]
Title: Optimizing Large Language Models for OpenAPI Code Completion
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[258]  arXiv:2405.15683 (cross-list from cs.CV) [pdf, other]
Title: VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap
Comments: Preprint. Under review. Code will be released on paper acceptance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[259]  arXiv:2405.15638 (cross-list from cs.CV) [pdf, other]
Title: M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[260]  arXiv:2405.15556 (cross-list from cs.LG) [pdf, other]
Title: Certifiably Robust RAG against Retrieval Corruption
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[261]  arXiv:2405.15485 (cross-list from cs.AI) [pdf, other]
Title: Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[262]  arXiv:2405.15374 (cross-list from cs.IR) [pdf, other]
Title: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph
Comments: for the associated repository, see this http URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[263]  arXiv:2405.15362 (cross-list from cs.LG) [pdf, other]
Title: Pipeline Parallelism with Controllable Memory
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[264]  arXiv:2405.15302 (cross-list from cs.AI) [pdf, other]
Title: Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[265]  arXiv:2405.15232 (cross-list from cs.CV) [pdf, other]
Title: DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Comments: 25 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[266]  arXiv:2405.15216 (cross-list from cs.LG) [pdf, other]
Title: Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Comments: under review
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 515 entries: 1-100 | 67-166 | 167-266 | 267-366 | 367-466 | 467-515 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)