We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions

[ total of 427 entries: 1-156 | 157-312 | 313-427 ]
[ showing 156 entries per page: fewer | more | all ]

Fri, 31 May 2024

[1]  arXiv:2405.20335 [pdf, other]
Title: Xwin-LM: Strong and Scalable Alignment Practice for LLMs
Subjects: Computation and Language (cs.CL)
[2]  arXiv:2405.20318 [pdf, other]
Title: CausalQuest: Collecting Natural Causal Questions for AI Agents
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[3]  arXiv:2405.20315 [pdf, other]
Title: ANAH: Analytical Annotation of Hallucinations in Large Language Models
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[4]  arXiv:2405.20314 [pdf, ps, other]
Title: S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs
Subjects: Computation and Language (cs.CL)
[5]  arXiv:2405.20304 [pdf, other]
Title: Group Robust Preference Optimization in Reward-free RLHF
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[6]  arXiv:2405.20285 [pdf, other]
Title: Who Writes the Review, Human or AI?
Subjects: Computation and Language (cs.CL)
[7]  arXiv:2405.20274 [pdf, other]
Title: ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection
Comments: arXiv admin note: text overlap with arXiv:2309.13297
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8]  arXiv:2405.20269 [pdf, ps, other]
Title: IsraParlTweet: The Israeli Parliamentary and Twitter Resource
Comments: Presented at LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[9]  arXiv:2405.20267 [pdf, other]
Title: Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions
Subjects: Computation and Language (cs.CL)
[10]  arXiv:2405.20253 [pdf, other]
Title: Evaluating Large Language Model Biases in Persona-Steered Generation
Comments: Accepted to Findings of ACL 2024. Code and data available at this https URL
Subjects: Computation and Language (cs.CL)
[11]  arXiv:2405.20252 [pdf, other]
Title: Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2405.20245 [pdf, other]
Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use
Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[13]  arXiv:2405.20215 [pdf, other]
Title: TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models
Subjects: Computation and Language (cs.CL)
[14]  arXiv:2405.20204 [pdf, other]
Title: Jina CLIP: Your CLIP Model Is Also Your Text Retriever
Comments: 4 pages, ICML2024 workshop submission
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[15]  arXiv:2405.20192 [pdf, other]
Title: TAIA: Large Language Models are Out-of-Distribution Data Learners
Comments: 25 pages
Subjects: Computation and Language (cs.CL)
[16]  arXiv:2405.20179 [pdf, other]
Title: Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[17]  arXiv:2405.20175 [pdf, other]
Title: InstructionCP: A fast approach to transfer Large Language Models into target language
Comments: 10 pages, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[18]  arXiv:2405.20163 [pdf, other]
Title: Reasoning about concepts with LLMs: Inconsistencies abound
Comments: 15 pages, 5 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[19]  arXiv:2405.20145 [pdf, other]
Title: Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers
Comments: Accepted for publication at the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP-WS) 2024; 11 pages, 1 figure, 9 tables
Subjects: Computation and Language (cs.CL)
[20]  arXiv:2405.20139 [pdf, other]
Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[21]  arXiv:2405.20131 [pdf, other]
Title: Language Models Need Inductive Biases to Count Inductively
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22]  arXiv:2405.20092 [pdf, other]
Title: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[23]  arXiv:2405.20089 [pdf, other]
Title: The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities
Comments: Accepted to ACL 2024 (long, main)
Subjects: Computation and Language (cs.CL)
[24]  arXiv:2405.20079 [pdf, other]
Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[25]  arXiv:2405.20053 [pdf, other]
Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[26]  arXiv:2405.19967 [pdf, other]
Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[27]  arXiv:2405.19958 [pdf, other]
Title: Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation
Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[28]  arXiv:2405.19874 [pdf, other]
Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?
Comments: Preprint. Code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[29]  arXiv:2405.19856 [pdf, other]
Title: DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[30]  arXiv:2405.19846 [pdf, other]
Title: Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[31]  arXiv:2405.19842 [pdf, other]
Title: Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32]  arXiv:2405.19831 [pdf, other]
Title: Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text
Comments: 10 pages, 2 figures, 2 tables. Accepted to ARES 2024 (IWAPS)
Subjects: Computation and Language (cs.CL)
[33]  arXiv:2405.19799 [pdf, other]
Title: Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation
Subjects: Computation and Language (cs.CL)
[34]  arXiv:2405.19795 [pdf, other]
Title: SLM as Guardian: Pioneering AI Safety with Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35]  arXiv:2405.19793 [pdf, other]
Title: PDDLEGO: Iterative Planning in Textual Environments
Comments: In *SEM 2024
Subjects: Computation and Language (cs.CL)
[36]  arXiv:2405.19787 [pdf, other]
Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[37]  arXiv:2405.19778 [pdf, other]
Title: Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona
Comments: preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[38]  arXiv:2405.19763 [pdf, other]
Title: Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding
Comments: Accept at ACL2024 Main
Subjects: Computation and Language (cs.CL)
[39]  arXiv:2405.19744 [pdf, other]
Title: X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions
Comments: ACL 2024. Our codes, data and model weights are available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[40]  arXiv:2405.19740 [pdf, other]
Title: PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations
Comments: 23 pages, 12 figures, 10 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[41]  arXiv:2405.19737 [pdf, other]
Title: Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42]  arXiv:2405.19715 [pdf, other]
Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[43]  arXiv:2405.19701 [pdf, other]
Title: Significance of Chain of Thought in Gender Bias Mitigation for English-Dravidian Machine Translation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44]  arXiv:2405.19670 [pdf, other]
Title: One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Comments: working in progress, repo: this https URL
Subjects: Computation and Language (cs.CL)
[45]  arXiv:2405.19660 [pdf, other]
Title: PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[46]  arXiv:2405.19648 [pdf, other]
Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[47]  arXiv:2405.19635 [pdf, other]
Title: GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment
Subjects: Computation and Language (cs.CL)
[48]  arXiv:2405.19575 [pdf, other]
Title: A Deep Convolutional Neural Network-based Model for Aspect and Polarity Classification in Hausa Movie Reviews
Comments: To be published in the proceedings of ICCAIT 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[49]  arXiv:2405.19563 [pdf, other]
Title: Unlearning Climate Misinformation in Large Language Models
Subjects: Computation and Language (cs.CL)
[50]  arXiv:2405.19538 [pdf, other]
Title: CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients
Comments: 13 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[51]  arXiv:2405.19519 [pdf, other]
Title: Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[52]  arXiv:2405.19487 [pdf, other]
Title: A Full-duplex Speech Dialogue Scheme Based On Large Language Models
Subjects: Computation and Language (cs.CL)
[53]  arXiv:2405.19462 [pdf, other]
Title: Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[54]  arXiv:2405.19433 [pdf, other]
Title: Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals
Subjects: Computation and Language (cs.CL)
[55]  arXiv:2405.19426 [pdf, other]
Title: Deep Learning for Assessment of Oral Reading Fluency
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[56]  arXiv:2405.19425 [pdf, other]
Title: Adaptive In-conversation Team Building for Language Model Agents
Subjects: Computation and Language (cs.CL)
[57]  arXiv:2405.20341 (cross-list from cs.LG) [pdf, other]
Title: From Zero to Hero: Cold-Start Anomaly Detection
Comments: ACL 2024. Our code is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[58]  arXiv:2405.20309 (cross-list from cs.LG) [pdf, other]
Title: Large Language Models Can Self-Improve At Web Agent Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[59]  arXiv:2405.20271 (cross-list from cs.LG) [pdf, other]
Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Comments: Accepted to ICML 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]
Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[61]  arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]
Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition
Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)
Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[62]  arXiv:2405.20101 (cross-list from cs.SD) [pdf, other]
Title: Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[63]  arXiv:2405.20003 (cross-list from cs.LG) [pdf, other]
Title: Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[64]  arXiv:2405.19954 (cross-list from cs.CR) [pdf, other]
Title: GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[65]  arXiv:2405.19877 (cross-list from cs.AI) [pdf, other]
Title: KNOW: A Real-World Ontology for Knowledge Capture with Large Language Models
Authors: Arto Bendiken
Comments: 5 pages, 1 figure
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[66]  arXiv:2405.19782 (cross-list from cs.SE) [pdf, other]
Title: Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[67]  arXiv:2405.19732 (cross-list from cs.CV) [pdf, other]
Title: Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[68]  arXiv:2405.19716 (cross-list from cs.CV) [pdf, other]
Title: Enhancing Large Vision Language Models with Self-Training on Image Comprehension
Comments: 19 pages, 14 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[69]  arXiv:2405.19616 (cross-list from cs.AI) [pdf, other]
Title: Easy Problems That LLMs Get Wrong
Comments: AutogenAI Ltd. Associated code at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[70]  arXiv:2405.19597 (cross-list from cs.LG) [pdf, other]
Title: SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors
Comments: 17 pages, 5 figures, 14 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[71]  arXiv:2405.19592 (cross-list from cs.LG) [pdf, other]
Title: Why Larger Language Models Do In-context Learning Differently?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[72]  arXiv:2405.19562 (cross-list from cs.CY) [pdf, other]
Title: Selective Explanations
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Machine Learning (cs.LG)
[73]  arXiv:2405.19561 (cross-list from cs.AI) [pdf, other]
Title: Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[74]  arXiv:2405.19534 (cross-list from cs.LG) [pdf, other]
Title: Preference Learning Algorithms Do Not Learn Preference Rankings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[75]  arXiv:2405.19343 (cross-list from cs.SD) [pdf, other]
Title: Luganda Speech Intent Recognition for IoT Applications
Comments: Presented as a conference paper at ICLR 2024/AfricaNLP
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[76]  arXiv:2405.19342 (cross-list from cs.SD) [pdf, other]
Title: Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

Thu, 30 May 2024

[77]  arXiv:2405.19327 [pdf, other]
[78]  arXiv:2405.19325 [pdf, other]
Title: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution
Subjects: Computation and Language (cs.CL)
[79]  arXiv:2405.19323 [pdf, other]
Title: Are Large Language Models Chameleons?
Comments: 16 pages,8 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[80]  arXiv:2405.19299 [pdf, other]
Title: Expert-Guided Extinction of Toxic Tokens for Debiased Generation
Subjects: Computation and Language (cs.CL)
[81]  arXiv:2405.19290 [pdf, other]
Title: Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation
Comments: Accepted by ACL2024 Findings
Subjects: Computation and Language (cs.CL)
[82]  arXiv:2405.19285 [pdf, other]
Title: MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection
Subjects: Computation and Language (cs.CL)
[83]  arXiv:2405.19266 [pdf, other]
Title: PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Comments: A Technical Report on a Powerful Chinese Medical Large Language Model
Subjects: Computation and Language (cs.CL)
[84]  arXiv:2405.19265 [pdf, other]
Title: AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Comments: Preprint with 20 pages and 20 figures. Source code and models at this https URL
Subjects: Computation and Language (cs.CL)
[85]  arXiv:2405.19262 [pdf, other]
Title: Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[86]  arXiv:2405.19261 [pdf, other]
Title: Faster Cascades via Speculative Decoding
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87]  arXiv:2405.19222 [pdf, other]
Title: Lower Bounds on the Expressivity of Recurrent Neural Language Models
Subjects: Computation and Language (cs.CL)
[88]  arXiv:2405.19220 [pdf, other]
Title: WRDScore: New Metric for Evaluation of Natural Language Generation Models
Authors: Ravil Mussabayev
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89]  arXiv:2405.19139 [pdf, other]
Title: DGRC: An Effective Fine-tuning Framework for Distractor Generation in Chinese Multi-choice Reading Comprehension
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[90]  arXiv:2405.19109 [pdf, other]
Title: PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL)
[91]  arXiv:2405.19094 [pdf, other]
Title: Faithful Chart Summarization with ChaTS-Pi
Comments: To be published in the proceedings of the 2024 Annual Meeting of the Association for Computational Linguistics
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[92]  arXiv:2405.19093 [pdf, other]
Title: Multi-stage Retrieve and Re-rank Model for Automatic Medical Coding Recommendation
Comments: Accepted to NAACL 2024 -- camera-ready version
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[93]  arXiv:2405.19088 [pdf, other]
Title: Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[94]  arXiv:2405.19086 [pdf, other]
Title: MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors
Authors: Renzhi Wang, Piji Li
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2405.19084 [pdf, other]
Title: Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification
Comments: Accepted to LREC-COLING 2024 -- camera-ready version
Subjects: Computation and Language (cs.CL)
[96]  arXiv:2405.19041 [pdf, other]
Title: BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[97]  arXiv:2405.19010 [pdf, other]
Title: Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Comments: 15 pages, 3 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[98]  arXiv:2405.18974 [pdf, other]
Title: Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection
Comments: 13pages, 4 figures (Accepted to Findings of ACL 2024)
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2405.18952 [pdf, other]
Title: Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
Authors: Peter Devine
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[100]  arXiv:2405.18922 [pdf, other]
Title: Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective
Comments: ACL 2024 main conference
Subjects: Computation and Language (cs.CL)
[101]  arXiv:2405.18915 [pdf, other]
Title: Towards Faithful Chain-of-Thought: Large Language Models are Bridging Reasoners
Comments: 25 pages, under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[102]  arXiv:2405.18906 [pdf, other]
Title: Language Generation with Strictly Proper Scoring Rules
Comments: ICML 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[103]  arXiv:2405.18845 [pdf, other]
Title: Simulation, Modelling and Classification of Wiki Contributors: Spotting The Good, The Bad, and The Ugly
Journal-ref: Simulation Modelling Practice and Theory, 120, 102616 (2022)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2405.18822 [pdf, other]
Title: Toxicity Detection for Free
Subjects: Computation and Language (cs.CL)
[105]  arXiv:2405.18741 [pdf, other]
Title: Genshin: General Shield for Natural Language Processing with Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106]  arXiv:2405.18740 [pdf, other]
Title: Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs
Subjects: Computation and Language (cs.CL)
[107]  arXiv:2405.18727 [pdf, other]
Title: CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control
Comments: 28 pages, 7 figures, 9 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[108]  arXiv:2405.18719 [pdf, other]
Title: Contextual Position Encoding: Learning to Count What's Important
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109]  arXiv:2405.18718 [pdf, other]
Title: Efficient Model-agnostic Alignment via Bayesian Persuasion
Subjects: Computation and Language (cs.CL)
[110]  arXiv:2405.18682 [pdf, other]
Title: Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[111]  arXiv:2405.18662 [pdf, other]
Title: Understanding Intrinsic Socioeconomic Biases in Large Language Models
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[112]  arXiv:2405.18653 [pdf, other]
Title: Recent Advances of Foundation Language Models-based Continual Learning: A Survey
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2405.18649 [pdf, other]
Title: Training LLMs to Better Self-Debug and Explain Code
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[114]  arXiv:2405.18638 [pdf, other]
Title: ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models
Comments: Accepted in ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[115]  arXiv:2405.18613 [pdf, ps, other]
Title: GLOCON Database: Design Decisions and User Manual (v1.0)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Databases (cs.DB); Machine Learning (cs.LG)
[116]  arXiv:2405.18605 [pdf, ps, other]
Title: BioBERT-based Deep Learning and Merged ChemProt-DrugProt for Enhanced Biomedical Relation Extraction
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Molecular Networks (q-bio.MN)
[117]  arXiv:2405.18540 [pdf, other]
Title: Learning diverse attacks on large language models for robust red-teaming and safety tuning
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[118]  arXiv:2405.18492 [pdf, other]
Title: LLMs and Memorization: On Quality and Specificity of Copyright Compliance
Comments: 10 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2405.18448 [pdf, other]
Title: Multi-objective Representation for Numbers in Clinical Narratives Using CamemBERT-bio
Comments: Under the revision. arXiv admin note: substantial text overlap with arXiv:2404.10171
Subjects: Computation and Language (cs.CL); Signal Processing (eess.SP)
[120]  arXiv:2405.19335 (cross-list from cs.CV) [pdf, other]
Title: X-VILA: Cross-Modality Alignment for Large Language Model
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[121]  arXiv:2405.19334 (cross-list from cs.AI) [pdf, other]
Title: LLMs Meet Multimodal Generation and Editing: A Survey
Comments: 51 Pages with 16 Figures, 12 Tables, and 534 References. GitHub Repository at: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2405.19316 (cross-list from cs.LG) [pdf, other]
Title: Robust Preference Optimization through Reward Model Distillation
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[123]  arXiv:2405.19315 (cross-list from cs.CV) [pdf, other]
Title: Matryoshka Query Transformer for Large Vision-Language Models
Comments: Preprint. Our code and model are publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[124]  arXiv:2405.19313 (cross-list from cs.AI) [pdf, other]
Title: Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); General Economics (econ.GN)
[125]  arXiv:2405.19209 (cross-list from cs.CV) [pdf, other]
Title: VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Comments: 20 pages, first three authors contributed equally; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[126]  arXiv:2405.19186 (cross-list from cs.CV) [pdf, other]
Title: MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification
Authors: Laura Fieback (1,2), Jakob Spiegelberg (1), Hanno Gottschalk (2) ((1) Volkswagen AG, (2) TU Berlin)
Comments: 18 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[127]  arXiv:2405.19076 (cross-list from cs.CV) [pdf, other]
Title: Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design
Subjects: Computer Vision and Pattern Recognition (cs.CV); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci); Computation and Language (cs.CL); Machine Learning (cs.LG)
[128]  arXiv:2405.19026 (cross-list from cs.LG) [pdf, other]
Title: DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[129]  arXiv:2405.18991 (cross-list from cs.CV) [pdf, other]
Title: EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture
Comments: 6 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[130]  arXiv:2405.18937 (cross-list from cs.CV) [pdf, other]
Title: Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[131]  arXiv:2405.18874 (cross-list from cond-mat.dis-nn) [pdf, other]
Title: Are queries and keys always relevant? A case study on Transformer wave functions
Comments: 9 pages, 4 figures
Subjects: Disordered Systems and Neural Networks (cond-mat.dis-nn); Computation and Language (cs.CL); Computational Physics (physics.comp-ph)
[132]  arXiv:2405.18870 (cross-list from cs.AI) [pdf, other]
Title: LLMs achieve adult human performance on higher-order theory of mind tasks
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[133]  arXiv:2405.18776 (cross-list from cs.CR) [pdf, other]
Title: LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models
Comments: 18 pages, 15 figures
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[134]  arXiv:2405.18742 (cross-list from cs.AI) [pdf, other]
Title: Musical Phrase Segmentation via Grammatical Induction
Comments: Extended version of a paper appearing in the proceedings of IJCAI 2024 that includes additional material in an appendix. Please cite the IJCAI version
Journal-ref: Proceedings of the International Joint Conference on Artificial Intelligence, 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[135]  arXiv:2405.18721 (cross-list from cs.CV) [pdf, other]
Title: Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Comments: Accepted by TPAMI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[136]  arXiv:2405.18711 (cross-list from cs.AI) [pdf, other]
Title: Calibrating Reasoning in Language Models with Internal Consistency
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[137]  arXiv:2405.18688 (cross-list from cs.LG) [pdf, other]
Title: Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[138]  arXiv:2405.18672 (cross-list from cs.CV) [pdf, other]
Title: LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[139]  arXiv:2405.18669 (cross-list from cs.LG) [pdf, other]
Title: Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities
Comments: Under review at NeurIPS
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[140]  arXiv:2405.18642 (cross-list from cs.AI) [pdf, other]
Title: JADS: A Framework for Self-supervised Joint Aspect Discovery and Summarization
Comments: preprint
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[141]  arXiv:2405.18639 (cross-list from q-bio.NC) [pdf, other]
Title: Improving Speech Decoding from ECoG with Self-Supervised Pretraining
Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142]  arXiv:2405.18634 (cross-list from cs.LG) [pdf, other]
Title: A Theoretical Understanding of Self-Correction through In-context Alignment
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[143]  arXiv:2405.18628 (cross-list from cs.LG) [pdf, other]
Title: Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference
Comments: The code for this implementation is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[144]  arXiv:2405.18620 (cross-list from cs.HC) [pdf, other]
Title: RealitySummary: On-Demand Mixed Reality Document Enhancement using Large Language Models
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[145]  arXiv:2405.18572 (cross-list from cs.LG) [pdf, other]
Title: Low-rank finetuning for LLMs: A fairness perspective
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[146]  arXiv:2405.18570 (cross-list from cs.CV) [pdf, other]
Title: Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[147]  arXiv:2405.18542 (cross-list from cs.AI) [pdf, other]
Title: Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[148]  arXiv:2405.17653 (cross-list from cs.LG) [pdf, other]
Title: InversionView: A General-Purpose Method for Reading Information from Neural Activations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 29 May 2024 (showing first 8 of 81 entries)

[149]  arXiv:2405.18433 [pdf, other]
Title: Notes on Applicability of GPT-4 to Document Understanding
Subjects: Computation and Language (cs.CL)
[150]  arXiv:2405.18414 [pdf, other]
Title: Don't Forget to Connect! Improving RAG with Graph-based Reranking
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[151]  arXiv:2405.18400 [pdf, other]
Title: Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
Comments: 22 pages, 15 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[152]  arXiv:2405.18375 [pdf, other]
Title: Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning
Authors: Phakphum Artkaew
Subjects: Computation and Language (cs.CL)
[153]  arXiv:2405.18369 [pdf, other]
Title: PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[154]  arXiv:2405.18359 [pdf, other]
Title: Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[155]  arXiv:2405.18358 [pdf, other]
Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[156]  arXiv:2405.18357 [pdf, other]
Title: Faithful Logical Reasoning via Symbolic Chain-of-Thought
Comments: Accepted by ACL 2024 (main proceeding)
Subjects: Computation and Language (cs.CL)
[ total of 427 entries: 1-156 | 157-312 | 313-427 ]
[ showing 156 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)