We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 162

[ total of 333 entries: 1-151 | 12-162 | 163-313 | 314-333 ]
[ showing 151 entries per page: fewer | more | all ]

Fri, 3 May 2024 (continued, showing last 60 of 71 entries)

[163]  arXiv:2405.01376 [pdf, other]
Title: Topics in the Study of the Pragmatic Functions of Phonetic Reduction in Dialog
Subjects: Computation and Language (cs.CL)
[164]  arXiv:2405.01359 [pdf, other]
Title: GAIA: A General AI Assistant for Intelligent Accelerator Operations
Authors: Frank Mayet
Subjects: Computation and Language (cs.CL); Accelerator Physics (physics.acc-ph)
[165]  arXiv:2405.01345 [pdf, other]
Title: The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2405.01299 [pdf, other]
Title: The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
Comments: LREC-COLING NLPerspectives workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[167]  arXiv:2405.01293 [pdf, ps, other]
Title: Low-resource speech recognition and dialect identification of Irish in a multi-task framework
Comments: 7 pages. Accepted to Odyssey 2024 - The Speaker and Language Recognition Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[168]  arXiv:2405.01280 [pdf, other]
Title: Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation
Subjects: Computation and Language (cs.CL)
[169]  arXiv:2405.01249 [pdf, ps, other]
Title: Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[170]  arXiv:2405.01216 [pdf, other]
Title: DMON: A Simple yet Effective Approach for Argument Structure Learning
Comments: COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[171]  arXiv:2405.01159 [pdf, other]
Title: TartuNLP at EvaLatin 2024: Emotion Polarity Detection
Comments: Accepted to The Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2024)
Subjects: Computation and Language (cs.CL)
[172]  arXiv:2405.01139 [pdf, other]
Title: It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised Learning
Comments: work in progress
Subjects: Computation and Language (cs.CL)
[173]  arXiv:2405.01121 [pdf, other]
Title: Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[174]  arXiv:2405.01022 [pdf, other]
Title: UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[175]  arXiv:2405.00997 [pdf, other]
Title: The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment
Comments: Accepted to the LREC-COLING 2024 conference
Subjects: Computation and Language (cs.CL)
[176]  arXiv:2405.00988 [pdf, other]
Title: Context-Aware Clustering using Large Language Models
Comments: 16 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[177]  arXiv:2405.00982 [pdf, other]
Title: On the Evaluation of Machine-Generated Reports
Comments: 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paper
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[178]  arXiv:2405.00980 [pdf, other]
Title: A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News
Comments: Accepted by LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[179]  arXiv:2405.00972 [pdf, other]
Title: CACTUS: Chemistry Agent Connecting Tool-Usage to Science
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph); Quantitative Methods (q-bio.QM)
[180]  arXiv:2405.00970 [pdf, other]
Title: How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses
Comments: International Journal of Artificial Intelligence in Education
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[181]  arXiv:2405.00966 [pdf, other]
Title: Efficient Compression of Multitask Multilingual Speech Models
Comments: Master Thesis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[182]  arXiv:2405.00948 [pdf, other]
Title: Modeling Empathetic Alignment in Conversation
Comments: Camera-ready version for NAACL 2024
Subjects: Computation and Language (cs.CL)
[183]  arXiv:2405.00903 [pdf, other]
Title: A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media
Comments: 15 pages; 4 tables; 4 figures
Subjects: Computation and Language (cs.CL)
[184]  arXiv:2405.00888 [pdf, other]
Title: DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling
Comments: Accepted at NAACL 2024
Subjects: Computation and Language (cs.CL)
[185]  arXiv:2405.00864 [pdf, other]
Title: Math Multiple Choice Question Generation via Human-Large Language Model Collaboration
Comments: 17th International Conference on Educational Data Mining (EDM 2024)
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2405.00828 [pdf, other]
Title: WIBA: What Is Being Argued? A Comprehensive Approach to Argument Mining
Comments: 8 pages, 2 figures, submitted to The 16th International Conference on Advances in Social Networks Analysis and Mining (ASONAM) '24
Subjects: Computation and Language (cs.CL)
[187]  arXiv:2405.00823 [pdf, other]
Title: WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[188]  arXiv:2405.00821 [pdf, other]
Title: Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media
Subjects: Computation and Language (cs.CL)
[189]  arXiv:2405.00801 [pdf, ps, other]
Title: "Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real Time
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2405.00732 [pdf, other]
Title: LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[191]  arXiv:2405.00728 [pdf, ps, other]
Title: Evaluating the Application of ChatGPT in Outpatient Triage Guidance: A Comparative Study
Comments: 8 pages, 1 figure, conference(International Ergonomics Association)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[192]  arXiv:2405.00722 [pdf, other]
Title: LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[193]  arXiv:2405.00718 [pdf, other]
Title: Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[194]  arXiv:2405.00717 [pdf, other]
Title: Exploring News Summarization and Enrichment in a Highly Resource-Scarce Indian Language: A Case Study of Mizo
Comments: Accepted at LREC-COLING2024 WILDRE Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[195]  arXiv:2405.00716 [pdf, other]
Title: Large Language Models in Healthcare: A Comprehensive Benchmark
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[196]  arXiv:2405.00715 [pdf, other]
Title: Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[197]  arXiv:2405.00711 [pdf, other]
Title: Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[198]  arXiv:2405.00710 [pdf, ps, other]
Title: Homonym Sense Disambiguation in the Georgian Language
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[199]  arXiv:2405.00709 [pdf, other]
Title: Evaluating Tool-Augmented Agents in Remote Sensing Platforms
Comments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[200]  arXiv:2405.00708 [pdf, other]
Title: Interactive Analysis of LLMs using Meaningful Counterfactuals
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[201]  arXiv:2405.00706 [pdf, ps, other]
Title: Science Written by Generative AI is Perceived as Less Intelligent, but More Credible and Trustworthy than Science Written by Humans
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[202]  arXiv:2405.00705 [pdf, other]
Title: SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[203]  arXiv:2405.00704 [pdf, ps, other]
Title: A Survey on the Real Power of ChatGPT
Comments: 9 pages, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204]  arXiv:2405.01509 (cross-list from cs.CR) [pdf, other]
Title: Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models
Comments: not decided
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[205]  arXiv:2405.01483 (cross-list from cs.CV) [pdf, other]
Title: MANTIS: Interleaved Multi-Image Instruction Tuning
Comments: 9 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[206]  arXiv:2405.01413 (cross-list from cs.CV) [pdf, other]
Title: MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
Comments: 17 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[207]  arXiv:2405.01310 (cross-list from cs.IR) [pdf, other]
Title: Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation
Comments: 6 pages, 3 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[208]  arXiv:2405.01259 (cross-list from cs.AI) [pdf, other]
Title: Identification of Entailment and Contradiction Relations between Natural Language Sentences: A Neurosymbolic Approach
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[209]  arXiv:2405.01229 (cross-list from cs.LG) [pdf, ps, other]
Title: Boosting Jailbreak Attack with Momentum
Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
[210]  arXiv:2405.01097 (cross-list from cs.CY) [pdf, other]
Title: Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification
Comments: Accepted for publication at the ACM Conference on Fairness, Accountability, and Transparency 2024 (ACM FAccT'24). This is a preprint manuscript (authors' own version before final copy-editing)
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR); Software Engineering (cs.SE)
[211]  arXiv:2405.01040 (cross-list from cs.CV) [pdf, other]
Title: Few Shot Class Incremental Learning using Vision-Language models
Comments: under review at Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[212]  arXiv:2405.00981 (cross-list from cs.AI) [pdf, other]
Title: Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[213]  arXiv:2405.00978 (cross-list from cs.IR) [pdf, other]
Title: Language Fairness in Multilingual Information Retrieval
Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[214]  arXiv:2405.00977 (cross-list from cs.IR) [pdf, other]
Title: Distillation for Multilingual Information Retrieval
Comments: 6 pages, 1 figure, accepted at SIGIR 2024 as short paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[215]  arXiv:2405.00975 (cross-list from cs.IR) [pdf, other]
Title: PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval
Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[216]  arXiv:2405.00949 (cross-list from cs.LG) [pdf, other]
Title: The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Chemical Physics (physics.chem-ph); Biomolecules (q-bio.BM)
[217]  arXiv:2405.00942 (cross-list from cs.CV) [pdf, other]
Title: LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[218]  arXiv:2405.00899 (cross-list from cs.HC) [pdf, other]
Title: Characterising the Creative Process in Humans and Large Language Models
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[219]  arXiv:2405.00740 (cross-list from cs.CV) [pdf, other]
Title: Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Comments: 14 pages, 8 figures, 7 tables, to be published at ICML2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[220]  arXiv:2405.00693 (cross-list from cs.RO) [pdf, other]
Title: Large Language Models for Human-Robot Interaction: Opportunities and Risks
Authors: Jesse Atuhurra
Subjects: Robotics (cs.RO); Computation and Language (cs.CL)
[221]  arXiv:2405.00688 (cross-list from cs.RO) [pdf, ps, other]
Title: Understanding Social Perception, Interactions, and Safety Aspects of Sidewalk Delivery Robots Using Sentiment Analysis
Authors: Yuchen Du, Tho V. Le
Comments: 34 pages, 7 figures, 2 tables
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[222]  arXiv:2405.00522 (cross-list from econ.GN) [pdf, other]
Title: DAM: A Universal Dual Attention Mechanism for Multimodal Timeseries Cryptocurrency Trend Forecasting
Subjects: General Economics (econ.GN); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computational Finance (q-fin.CP)

Thu, 2 May 2024

[223]  arXiv:2405.00664 [pdf, other]
Title: Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[224]  arXiv:2405.00659 [pdf, other]
Title: NLU-STR at SemEval-2024 Task 1: Generative-based Augmentation and Encoder-based Scoring for Semantic Textual Relatedness
Subjects: Computation and Language (cs.CL)
[225]  arXiv:2405.00657 [pdf, other]
Title: RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization
Comments: NAACL 2024 Main & Long Conference Paper (Oral Presentation)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[226]  arXiv:2405.00632 [pdf, other]
Title: When Quantization Affects Confidence of Large Language Models?
Comments: Accepted to NAACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[227]  arXiv:2405.00622 [pdf, other]
Title: Causal Evaluation of Language Models
Comments: 315 pages, 230 figures, 21 tables. Project website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[228]  arXiv:2405.00611 [pdf, other]
Title: Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling
Subjects: Computation and Language (cs.CL)
[229]  arXiv:2405.00602 [pdf, other]
Title: Investigating Automatic Scoring and Feedback using Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[230]  arXiv:2405.00588 [pdf, other]
Title: Are Models Biased on Text without Gender-related Language?
Comments: In International Conference on Learning Representations 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[231]  arXiv:2405.00578 [pdf, other]
Title: The Real, the Better: Aligning Large Language Models with Online Human Behaviors
Comments: 11 pages, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[232]  arXiv:2405.00557 [pdf, other]
Title: Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[233]  arXiv:2405.00543 [pdf, other]
Title: New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[234]  arXiv:2405.00536 [pdf, other]
Title: A Legal Framework for Natural Language Processing Model Training in Portugal
Comments: LEGAL2024 Legal and Ethical Issues in Human Language Technologies, LREC 2024
Subjects: Computation and Language (cs.CL); Emerging Technologies (cs.ET)
[235]  arXiv:2405.00492 [pdf, other]
Title: Is Temperature the Creativity Parameter of Large Language Models?
Comments: To be published in the Proceedings of the 15th International Conference on Computational Creativity (ICCC'24), 8 pages, 2 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[236]  arXiv:2405.00467 [pdf, other]
Title: Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing
Comments: Accepted to Workshop on Insights from Negative Results in NLP 2024 (co-located with NAACL 2024)
Subjects: Computation and Language (cs.CL)
[237]  arXiv:2405.00465 [pdf, other]
Title: BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine
Subjects: Computation and Language (cs.CL)
[238]  arXiv:2405.00402 [pdf, other]
Title: Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models
Subjects: Computation and Language (cs.CL)
[239]  arXiv:2405.00390 [pdf, other]
Title: CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
Comments: 25 pages, 7 figures, and 18 tables
Subjects: Computation and Language (cs.CL)
[240]  arXiv:2405.00361 [pdf, other]
Title: AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts
Subjects: Computation and Language (cs.CL)
[241]  arXiv:2405.00332 [pdf, other]
Title: A Careful Examination of Large Language Model Performance on Grade School Arithmetic
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[242]  arXiv:2405.00321 [pdf, other]
Title: DFKI-NLP at SemEval-2024 Task 2: Towards Robust LLMs Using Data Perturbations and MinMax Training
Subjects: Computation and Language (cs.CL)
[243]  arXiv:2405.00302 [pdf, other]
Title: Generating Feedback-Ladders for Logical Errors in Programming using Large Language Models
Comments: Published on the 17th EDM 2024 - Posters and Demos Track
Subjects: Computation and Language (cs.CL)
[244]  arXiv:2405.00301 [pdf, other]
Title: LITO: Learnable Intervention for Truthfulness Optimization
Comments: 14 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[245]  arXiv:2405.00291 [pdf, other]
Title: How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses
Comments: 11 pages, full research paper, EDM 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[246]  arXiv:2405.00289 [pdf, other]
Title: Adversarial Attacks and Defense for Conversation Entailment Task
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[247]  arXiv:2405.00273 [pdf, other]
Title: Social Life Simulation for Non-Cognitive Skills Learning
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[248]  arXiv:2405.00263 [pdf, other]
Title: Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[249]  arXiv:2405.00253 [pdf, other]
Title: CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[250]  arXiv:2405.00216 [pdf, other]
Title: Graphical Reasoning: LLM-based Semi-Open Relation Extraction
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[251]  arXiv:2405.00208 [pdf, other]
Title: A Primer on the Inner Workings of Transformer-based Language Models
Subjects: Computation and Language (cs.CL)
[252]  arXiv:2405.00204 [pdf, other]
Title: General Purpose Verification for Chain of Thought Prompting
Comments: 22 pages, preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[253]  arXiv:2405.00201 [pdf, other]
Title: SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[254]  arXiv:2405.00200 [pdf, other]
Title: In-Context Learning with Long-Context Models: An In-Depth Exploration
Comments: 27 pages; preprint
Subjects: Computation and Language (cs.CL)
[255]  arXiv:2405.00175 [pdf, other]
Title: Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[256]  arXiv:2405.00155 [pdf, other]
Title: HistNERo: Historical Named Entity Recognition for the Romanian Language
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computation and Language (cs.CL)
[257]  arXiv:2405.00134 [pdf, other]
Title: Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns
Comments: 22 pages, 2 figures. Accepted at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[258]  arXiv:2405.00675 (cross-list from cs.LG) [pdf, other]
Title: Self-Play Preference Optimization for Language Model Alignment
Comments: 25 pages, 4 figures, 5 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[259]  arXiv:2405.00566 (cross-list from cs.CE) [pdf, other]
Title: NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance
Subjects: Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); General Finance (q-fin.GN)
[260]  arXiv:2405.00523 (cross-list from cs.AI) [pdf, other]
Title: CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
Comments: LREC-COLING 2024 Accepted
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[261]  arXiv:2405.00516 (cross-list from cs.LG) [pdf, other]
Title: Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
Comments: ACM 2024, Avila Spain. 9 pages
Journal-ref: ACM SAC Conference 2024, Avila, Spain, Article 4, 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[262]  arXiv:2405.00494 (cross-list from cs.AI) [pdf, other]
Title: GOLD: Geometry Problem Solver with Natural Language Description
Comments: Accepted in NAACL 2024 Findings
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[263]  arXiv:2405.00489 (cross-list from cs.LG) [pdf, other]
Title: Explainable Automatic Grading with Neural Additive Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Applications (stat.AP)
[264]  arXiv:2405.00461 (cross-list from cs.RO) [pdf, other]
Title: Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning
Comments: ICRA 2024 Full-day Workshop: C4SR+: Continuum, Compliant, Cooperative, Cognitive
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[265]  arXiv:2405.00449 (cross-list from cs.LG) [pdf, other]
Title: RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Neural and Evolutionary Computing (cs.NE)
[266]  arXiv:2405.00438 (cross-list from cs.LG) [pdf, other]
Title: MetaRM: Shifted Distributions Alignment via Meta-Learning
Comments: 11 pages, 6 figures. arXiv admin note: text overlap with arXiv:2401.06080
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[267]  arXiv:2405.00123 (cross-list from cs.LG) [pdf, other]
Title: Graph Neural Network Approach to Semantic Type Detection in Tables
Journal-ref: In Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 121-133. Singapore: Springer Nature Singapore, 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[268]  arXiv:2405.00099 (cross-list from cs.AI) [pdf, other]
Title: Creative Beam Search
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[269]  arXiv:2405.00021 (cross-list from cs.CV) [pdf, other]
Title: SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 1 May 2024 (showing first 44 of 64 entries)

[270]  arXiv:2404.19737 [pdf, other]
Title: Better & Faster Large Language Models via Multi-token Prediction
Subjects: Computation and Language (cs.CL)
[271]  arXiv:2404.19733 [pdf, other]
Title: Iterative Reasoning Preference Optimization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[272]  arXiv:2404.19714 [pdf, other]
Title: ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents
Comments: 4 pages
Subjects: Computation and Language (cs.CL)
[273]  arXiv:2404.19713 [pdf, ps, other]
Title: Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models
Authors: Scott Sumpter
Comments: 22 pages but 12 are appendices which are examples of the main text. 3 figures, 4 tables
Subjects: Computation and Language (cs.CL)
[274]  arXiv:2404.19705 [pdf, other]
Title: When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[275]  arXiv:2404.19597 [pdf, other]
Title: Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Comments: work in progress
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[276]  arXiv:2404.19563 [pdf, other]
Title: RepEval: Effective Text Evaluation with LLM Representation
Subjects: Computation and Language (cs.CL)
[277]  arXiv:2404.19553 [pdf, other]
Title: Extending Llama-3's Context Ten-Fold Overnight
Subjects: Computation and Language (cs.CL)
[278]  arXiv:2404.19543 [pdf, other]
Title: RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
Authors: Yucheng Hu, Yuxing Lu
Comments: 30 pages, 7 figures. Draft version 1
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[279]  arXiv:2404.19509 [pdf, other]
Title: Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom
Comments: 14 pages, 8 tables and 5 figures
Subjects: Computation and Language (cs.CL)
[280]  arXiv:2404.19505 [pdf, other]
Title: Context-Aware Machine Translation with Source Coreference Explanation
Comments: Accepted to TACL. This is a pre-MIT Press publication version
Subjects: Computation and Language (cs.CL)
[281]  arXiv:2404.19486 [pdf, other]
Title: Safe Training with Sensitive In-domain Data: Leveraging Data Fragmentation To Mitigate Linkage Attacks
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[282]  arXiv:2404.19482 [pdf, other]
Title: FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking
Authors: Vinay Setty
Comments: Accepted in SIGIR 2024 (demo track)
Subjects: Computation and Language (cs.CL)
[283]  arXiv:2404.19442 [pdf, other]
Title: Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages
Comments: Working paper
Subjects: Computation and Language (cs.CL)
[284]  arXiv:2404.19432 [pdf, other]
Title: Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[285]  arXiv:2404.19430 [pdf, other]
Title: Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation
Comments: Accepted to *SEM 2024
Subjects: Computation and Language (cs.CL)
[286]  arXiv:2404.19409 [pdf, other]
Title: Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning
Subjects: Computation and Language (cs.CL)
[287]  arXiv:2404.19369 [pdf, ps, other]
Title: Evaluating Telugu Proficiency in Large Language Models_ A Comparative Analysis of ChatGPT and Gemini
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[288]  arXiv:2404.19364 [pdf, other]
Title: Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models
Subjects: Computation and Language (cs.CL)
[289]  arXiv:2404.19363 [pdf, other]
Title: Expressivity and Speech Synthesis
Comments: Invited contribution. Under review
Subjects: Computation and Language (cs.CL)
[290]  arXiv:2404.19359 [pdf, other]
Title: Evaluating Lexicon Incorporation for Depression Symptom Estimation
Comments: Accepted to Clinical NLP workshop at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[291]  arXiv:2404.19335 [pdf, other]
Title: StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation
Comments: Submitted to ACL 2024
Subjects: Computation and Language (cs.CL)
[292]  arXiv:2404.19328 [pdf, other]
Title: Computational Approaches for Integrating out Subjectivity in Cognate Synonym Selection
Comments: Experiments available on GitHub (this https URL, this https URL)
Subjects: Computation and Language (cs.CL); Populations and Evolution (q-bio.PE)
[293]  arXiv:2404.19319 [pdf, other]
Title: Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget
Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024
Subjects: Computation and Language (cs.CL)
[294]  arXiv:2404.19316 [pdf, other]
Title: QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)
Subjects: Computation and Language (cs.CL)
[295]  arXiv:2404.19315 [pdf, other]
Title: Modeling Orthographic Variation in Occitan's Dialects
Authors: Zachary William Hopton (Language and Space Lab, University of Zurich), Noëmi Aepli (Department of Computational Linguistics, University of Zurich)
Comments: Accepted at VarDial 2024: The Eleventh Workshop on NLP for Similar Languages, Varieties and Dialects
Subjects: Computation and Language (cs.CL)
[296]  arXiv:2404.19310 [pdf, other]
Title: Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation
Comments: Accepted to VarDial 2024 (the eleventh Workshop on NLP for Similar Languages, Varieties and Dialects 2024), Mexico City
Subjects: Computation and Language (cs.CL)
[297]  arXiv:2404.19296 [pdf, other]
Title: Octopus v4: Graph of language models
Authors: Wei Chen, Zhiyuan Li
Subjects: Computation and Language (cs.CL)
[298]  arXiv:2404.19260 [pdf, ps, other]
Title: Aspect and Opinion Term Extraction Using Graph Attention Network
Authors: Abir Chakraborty
Subjects: Computation and Language (cs.CL)
[299]  arXiv:2404.19254 [pdf, other]
Title: Suvach -- Generated Hindi QA benchmark
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[300]  arXiv:2404.19252 [pdf, other]
Title: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts
Subjects: Computation and Language (cs.CL)
[301]  arXiv:2404.19245 [pdf, other]
Title: HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
Comments: 19 pages, 7 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[302]  arXiv:2404.19232 [pdf, other]
Title: GRAMMAR: Grounded and Modular Methodology for Assessment of Domain-Specific Retrieval-Augmented Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[303]  arXiv:2404.19192 [pdf, other]
Title: Mix of Experts Language Model for Named Entity Recognition
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[304]  arXiv:2404.19178 [pdf, other]
Title: Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics
Subjects: Computation and Language (cs.CL)
[305]  arXiv:2404.19175 [pdf, other]
Title: Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset
Subjects: Computation and Language (cs.CL)
[306]  arXiv:2404.19159 [pdf, other]
Title: What Drives Performance in Multilingual Language Models?
Comments: Accepted at VarDial @ NAACL 2024
Subjects: Computation and Language (cs.CL)
[307]  arXiv:2404.19154 [pdf, other]
Title: RTF: Region-based Table Filling Method for Relational Triple Extraction
Comments: Rejected by EMNLP 2023
Subjects: Computation and Language (cs.CL)
[308]  arXiv:2404.19124 [pdf, other]
Title: Accelerating Production LLMs with Combined Token/Embedding Speculators
Subjects: Computation and Language (cs.CL)
[309]  arXiv:2404.19119 [pdf, ps, other]
Title: Effects of Added Emphasis and Pause in Audio Delivery of Health Information
Authors: Arif Ahmed (1), Gondy Leroy (1), Stephen A. Rains (1), Philip Harber (1), David Kauchak (2), Prosanta Barai (1) ((1) The University of Arizona, (2) Pomona College)
Comments: This manuscript is accepted to American Medical Informatics Association summit, 2024
Subjects: Computation and Language (cs.CL)
[310]  arXiv:2404.19094 [pdf, other]
Title: In-Context Symbolic Regression: Leveraging Language Models for Function Discovery
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[311]  arXiv:2404.19063 [pdf, other]
Title: SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications
Comments: 11 pages, 19 figures, and tables
Subjects: Computation and Language (cs.CL)
[312]  arXiv:2404.19055 [pdf, other]
Title: Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models
Authors: Houjun Liu
Comments: 7 pages, 2 figures
Subjects: Computation and Language (cs.CL)
[313]  arXiv:2404.19048 [pdf, other]
Title: A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[ total of 333 entries: 1-151 | 12-162 | 163-313 | 314-333 ]
[ showing 151 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)