We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions

[ total of 317 entries: 1-256 | 257-317 ]
[ showing 256 entries per page: fewer | more | all ]

Tue, 23 Apr 2024

[1]  arXiv:2404.14408 [pdf, other]
Title: SpaceByte: Towards Deleting Tokenization from Large Language Modeling
Authors: Kevin Slagle
Comments: 9+9 pages, 3+1 figures, 2+4 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2]  arXiv:2404.14397 [pdf, other]
[3]  arXiv:2404.14395 [pdf, other]
Title: PARAMANU-GANITA: Language Model with Mathematical Capabilities
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4]  arXiv:2404.14387 [pdf, other]
Title: A Survey on Self-Evolution of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[5]  arXiv:2404.14372 [pdf, other]
Title: Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph
Comments: 17 Pages, Under Review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[6]  arXiv:2404.14361 [pdf, other]
Title: Better Synthetic Data by Retrieving and Transforming Existing Datasets
Subjects: Computation and Language (cs.CL)
[7]  arXiv:2404.14355 [pdf, other]
Title: Calc-CMU at SemEval-2024 Task 7: Pre-Calc -- Learning to Use the Calculator Improves Numeracy in Language Models
Comments: NumEval at SemEval, NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[8]  arXiv:2404.14339 [pdf, other]
Title: Zero-shot Cross-lingual Stance Detection via Adversarial Language Adaptation
Subjects: Computation and Language (cs.CL)
[9]  arXiv:2404.14316 [pdf, other]
Title: Automated Long Answer Grading with RiceChem Dataset
Subjects: Computation and Language (cs.CL)
[10]  arXiv:2404.14313 [pdf, other]
Title: Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels
Subjects: Computation and Language (cs.CL)
[11]  arXiv:2404.14301 [pdf, other]
Title: Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2404.14294 [pdf, other]
Title: A Survey on Efficient Inference for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[13]  arXiv:2404.14270 [pdf, other]
Title: What do Transformers Know about Government?
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[14]  arXiv:2404.14219 [pdf, other]
[15]  arXiv:2404.14215 [pdf, other]
Title: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction
Subjects: Computation and Language (cs.CL)
[16]  arXiv:2404.14209 [pdf, ps, other]
[17]  arXiv:2404.14192 [pdf, other]
Title: Swap distance minimization beyond entropy minimization in word order variation
Subjects: Computation and Language (cs.CL); Physics and Society (physics.soc-ph)
[18]  arXiv:2404.14183 [pdf, other]
Title: SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Comments: 23 pages, 12 tables
Journal-ref: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Subjects: Computation and Language (cs.CL)
[19]  arXiv:2404.14122 [pdf, other]
Title: Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?
Subjects: Computation and Language (cs.CL)
[20]  arXiv:2404.14057 [pdf, ps, other]
Title: Bored to Death: Artificial Intelligence Research Reveals the Role of Boredom in Suicide Behavior
Journal-ref: www.frontiersin.org/journals/psychiatry/articles/10.3389/fpsyt.2024.1328122
Subjects: Computation and Language (cs.CL)
[21]  arXiv:2404.14052 [pdf, other]
Title: Differential contributions of machine learning and statistical analysis to language and cognitive sciences
Authors: Kun Sun, Rong Wang
Subjects: Computation and Language (cs.CL); Methodology (stat.ME)
[22]  arXiv:2404.14043 [pdf, other]
Title: LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation
Subjects: Computation and Language (cs.CL)
[23]  arXiv:2404.14024 [pdf, other]
Title: Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks
Subjects: Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[24]  arXiv:2404.13985 [pdf, other]
Title: Information Re-Organization Improves Reasoning in Large Language Models
Comments: 10 pages, 3 figures
Subjects: Computation and Language (cs.CL)
[25]  arXiv:2404.13968 [pdf, other]
Title: Protecting Your LLMs with Information Bottleneck
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[26]  arXiv:2404.13957 [pdf, other]
Title: How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[27]  arXiv:2404.13948 [pdf, other]
Title: Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations
Comments: Under Review
Subjects: Computation and Language (cs.CL)
[28]  arXiv:2404.13940 [pdf, other]
Title: A User-Centric Benchmark for Evaluating Large Language Models
Subjects: Computation and Language (cs.CL)
[29]  arXiv:2404.13925 [pdf, other]
Title: MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit
Subjects: Computation and Language (cs.CL)
[30]  arXiv:2404.13919 [pdf, other]
Title: Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models
Comments: under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[31]  arXiv:2404.13906 [pdf, other]
Title: Generating Attractive and Authentic Copywriting from Customer Reviews
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32]  arXiv:2404.13899 [pdf, other]
Title: Towards Better Text-to-Image Generation Alignment via Attention Modulation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[33]  arXiv:2404.13874 [pdf, other]
Title: VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
Comments: Work in process
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[34]  arXiv:2404.13865 [pdf, other]
Title: Context-Enhanced Language Models for Generating Multi-Paper Citations
Comments: 14 pages, 7 figures, 11th International Conference, BDA 2023, Delhi, India
Journal-ref: Big Data and Artificial Intelligence 2023, Delhi, India, December 7, 80 94
Subjects: Computation and Language (cs.CL)
[35]  arXiv:2404.13855 [pdf, other]
Title: Understanding the role of FFNs in driving multilingual behaviour in LLMs
Comments: 10 pages
Subjects: Computation and Language (cs.CL)
[36]  arXiv:2404.13813 [pdf, other]
Title: From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
Comments: 17 pages, 15 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[37]  arXiv:2404.13793 [pdf, other]
Title: Lightweight Connective Detection Using Gradient Boosting
Comments: 7 pages, 2 figures, 5 tables
Subjects: Computation and Language (cs.CL)
[38]  arXiv:2404.13781 [pdf, other]
Title: Evaluating Retrieval Quality in Retrieval-Augmented Generation
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[39]  arXiv:2404.13779 [pdf, other]
Title: Automated Text Mining of Experimental Methodologies from Biomedical Literature
Authors: Ziqing Guo
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[40]  arXiv:2404.13764 [pdf, other]
Title: Using Adaptive Empathetic Responses for Teaching English
Comments: Accepted to BEA workshop at NAACL 2024
Subjects: Computation and Language (cs.CL)
[41]  arXiv:2404.13760 [pdf, other]
Title: How to Encode Domain Information in Relation Classification
Comments: Accepted at LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[42]  arXiv:2404.13751 [pdf, other]
Title: Embarrassingly Simple Unsupervised Aspect Based Sentiment Tuple Extraction
Comments: 4 pages, 4 tables, 3 figures, 2 appendix pages
Subjects: Computation and Language (cs.CL)
[43]  arXiv:2404.13660 [pdf, other]
Title: Trojan Detection in Large Language Models: Insights from The Trojan Detection Challenge
Subjects: Computation and Language (cs.CL)
[44]  arXiv:2404.13645 [pdf, other]
Title: PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure
Comments: Accepted at IJCAI 2024
Subjects: Computation and Language (cs.CL)
[45]  arXiv:2404.13628 [pdf, other]
Title: Mixture of LoRA Experts
Comments: 17 pages, 11 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[46]  arXiv:2404.13627 [pdf, other]
Title: NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[47]  arXiv:2404.13613 [pdf, other]
Title: The Branch Not Taken: Predicting Branching in Online Conversations
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[48]  arXiv:2404.13599 [pdf, other]
Title: "A good pun is its own reword": Can Large Language Models Understand Puns?
Subjects: Computation and Language (cs.CL)
[49]  arXiv:2404.13547 [pdf, other]
Title: E-QGen: Educational Lecture Abstract-based Question Generation System
Comments: IJCAI 2024 Demo Paper
Subjects: Computation and Language (cs.CL)
[50]  arXiv:2404.13504 [pdf, other]
Title: IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models
Subjects: Computation and Language (cs.CL)
[51]  arXiv:2404.13465 [pdf, other]
Title: Do "English" Named Entity Recognizers Work Well on Global Englishes?
Comments: EMNLP Findings 2023
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[52]  arXiv:2404.13439 [pdf, other]
Title: Fine-Grained Named Entities for Corona News
Comments: Published at SWAT4HCLS 2023: The 14th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences
Subjects: Computation and Language (cs.CL)
[53]  arXiv:2404.13397 [pdf, ps, other]
Title: Retrieval-Augmented Generation-based Relation Extraction
Comments: Submitted to Semantic Web Journal. Under Review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[54]  arXiv:2404.13390 [pdf, other]
Title: Explanation based Bias Decoupling Regularization for Natural Language Inference
Subjects: Computation and Language (cs.CL)
[55]  arXiv:2404.13364 [pdf, other]
Title: MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering
Comments: Accepted at the International Conference on Natural Language Processing (ICON 2023)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[56]  arXiv:2404.13362 [pdf, other]
Title: Semantically Corrected Amharic Automatic Speech Recognition
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[57]  arXiv:2404.13350 [pdf, ps, other]
Title: Swa Bhasha: Message-Based Singlish to Sinhala Transliteration
Comments: 6 pages, 6 figures, 2 Tables, Presented at International Conference on Innovations in Info-business and Technology, Colombo, February 2022
Subjects: Computation and Language (cs.CL)
[58]  arXiv:2404.13343 [pdf, other]
Title: UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions
Comments: Accepted at BEA 2024 (NAACL Workshop)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[59]  arXiv:2404.13307 [pdf, other]
Title: Beyond Accuracy: Investigating Error Types in GPT-4 Responses to USMLE Questions
Comments: 10 pages, 4 figures. Accepted for publication at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)
Subjects: Computation and Language (cs.CL)
[60]  arXiv:2404.13292 [pdf, other]
Title: Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[61]  arXiv:2404.13289 [pdf, other]
Title: Double Mixture: Towards Continual Event Detection from Speech
Comments: The first two authors contributed equally to this work
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[62]  arXiv:2404.13246 [pdf, other]
Title: ISQA: Informative Factuality Feedback for Scientific Summarization
Comments: 18 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[63]  arXiv:2404.13192 [pdf, other]
Title: Heterogeneous Subgraph Transformer for Fake News Detection
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[64]  arXiv:2404.13149 [pdf, other]
Title: Beyond Self-Consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer Staging
Comments: accepted to the 22nd International Conference on Artificial Intelligence in Medicine (AIME'24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[65]  arXiv:2404.13104 [pdf, other]
Title: Multi Class Depression Detection Through Tweets using Artificial Intelligence
Comments: 33 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66]  arXiv:2404.13099 [pdf, other]
Title: Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks
Comments: 10 pages, 3 figures, NeurIPS 2023 Workshop on Generative AI for Education (GAIED)
Journal-ref: NeurIPS 2023 Workshop on Generative AI for Education (GAIED)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[67]  arXiv:2404.13087 [pdf, other]
Title: Demystifying Legalese: An Automated Approach for Summarizing and Analyzing Overlaps in Privacy Policies and Terms of Service
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[68]  arXiv:2404.13082 [pdf, other]
Title: TREACLE: Thrifty Reasoning via Context-Aware LLM and Prompt Selection
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[69]  arXiv:2404.13081 [pdf, other]
Title: SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs
Comments: Accepted at ICLR 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[70]  arXiv:2404.13079 [pdf, other]
Title: Relational Graph Convolutional Networks for Sentiment Analysis
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[71]  arXiv:2404.13078 [pdf, other]
Title: Empowering Interdisciplinary Research with BERT-Based Models: An Approach Through SciBERT-CNN with Topic Modeling
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[72]  arXiv:2404.13077 [pdf, ps, other]
Title: Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning
Comments: 16 pages, 5 figures, presented at the 2nd International Conference on NLP & AI (NLPAI 2024)
Journal-ref: International Journal on Cybernetics & Informatics (IJCI), vol. 13, no. 2, pp. 15-31, Apr. 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[73]  arXiv:2404.13076 [pdf, other]
Title: LLM Evaluators Recognize and Favor Their Own Generations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[74]  arXiv:2404.13074 [pdf, other]
Title: Towards Compositionally Generalizable Semantic Parsing in Large Language Models: A Survey
Authors: Amogh Mannekote
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[75]  arXiv:2404.13071 [pdf, other]
Title: Modeling Emotions and Ethics with Large Language Models
Authors: Edward Y. Chang
Comments: 9 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[76]  arXiv:2404.13070 [pdf, other]
Title: Evidence from counterfactual tasks supports emergent analogical reasoning in large language models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[77]  arXiv:2404.13069 [pdf, other]
Title: Subtle Signs of Scribal Intent in the Voynich Manuscript
Comments: Submitted to Histocrypt 2024
Subjects: Computation and Language (cs.CL)
[78]  arXiv:2404.13067 [pdf, other]
Title: Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach
Comments: ICME 2024 Accepted
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[79]  arXiv:2404.13066 [pdf, other]
Title: Leveraging Large Language Model as Simulated Patients for Clinical Education
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[80]  arXiv:2404.13065 [pdf, other]
Title: Intellecta Cognitiva: A Comprehensive Dataset for Advancing Academic Knowledge and Machine Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[81]  arXiv:2404.13057 [pdf, other]
Title: "Hey..! This medicine made me sick": Sentiment Analysis of User-Generated Drug Reviews using Machine Learning Techniques
Subjects: Computation and Language (cs.CL)
[82]  arXiv:2404.13050 [pdf, other]
Title: FlowMind: Automatic Workflow Generation with LLMs
Comments: Published in ACM ICAIF 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[83]  arXiv:2404.14394 (cross-list from cs.AI) [pdf, other]
Title: A Multimodal Automated Interpretability Agent
Comments: 25 pages, 13 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[84]  arXiv:2404.14368 (cross-list from cs.CV) [pdf, other]
Title: Graphic Design with Large Multimodal Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[85]  arXiv:2404.14233 (cross-list from cs.CV) [pdf, other]
Title: Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[86]  arXiv:2404.13885 (cross-list from cs.CY) [pdf, other]
Title: Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[87]  arXiv:2404.13847 (cross-list from cs.CV) [pdf, other]
Title: EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[88]  arXiv:2404.13846 (cross-list from cs.LG) [pdf, other]
Title: Filtered Direct Preference Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[89]  arXiv:2404.13792 (cross-list from cs.MM) [pdf, other]
Title: Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome
Comments: 14 pages, 10 figures, Accepted by Persuasive Technology 2024
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[90]  arXiv:2404.13784 (cross-list from cs.CR) [pdf, other]
Title: Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[91]  arXiv:2404.13752 (cross-list from cs.LG) [pdf, other]
Title: Towards General Conceptual Model Editing via Adversarial Representation Engineering
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
[92]  arXiv:2404.13721 (cross-list from cs.AI) [pdf, ps, other]
Title: The Framework of a Design Process Language
Authors: Arnulf Hagen
Comments: PhD dissertation, 1993, Norwegian Institute of Technology
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[93]  arXiv:2404.13633 (cross-list from cs.HC) [pdf, other]
Title: Incorporating Different Verbal Cues to Improve Text-Based Computer-Delivered Health Messaging
Authors: Samuel Rhys Cox
Comments: PhD thesis - National University of Singapore, November 2023
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[94]  arXiv:2404.13630 (cross-list from cs.SE) [pdf, ps, other]
Title: Utilizing Deep Learning to Optimize Software Development Processes
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[95]  arXiv:2404.13611 (cross-list from cs.CV) [pdf, other]
Title: Video sentence grounding with temporally global textual knowledge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[96]  arXiv:2404.13565 (cross-list from cs.CV) [pdf, other]
Title: Exploring Diverse Methods in Visual Question Answering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[97]  arXiv:2404.13556 (cross-list from cs.IR) [pdf, other]
Title: ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[98]  arXiv:2404.13530 (cross-list from cs.CV) [pdf, other]
Title: Listen Then See: Video Alignment with Speaker Attention
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[99]  arXiv:2404.13506 (cross-list from cs.LG) [pdf, other]
Title: Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[100]  arXiv:2404.13402 (cross-list from cs.CR) [pdf, other]
Title: Intrusion Detection at Scale with the Assistance of a Command-line Language Model
Comments: Accepted by IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), industry track
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[101]  arXiv:2404.13370 (cross-list from cs.CV) [pdf, other]
Title: Movie101v2: Improved Movie Narration Benchmark
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[102]  arXiv:2404.13238 (cross-list from cs.LG) [pdf, other]
Title: Personalized Wireless Federated Learning for Large Language Models
Comments: 8 pages, 5 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[103]  arXiv:2404.13208 (cross-list from cs.CR) [pdf, other]
Title: The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[104]  arXiv:2404.13163 (cross-list from econ.GN) [pdf, other]
Title: A national longitudinal dataset of skills taught in U.S. higher education curricula
Comments: 44 pages, 21 figures, 10 tables
Subjects: General Economics (econ.GN); Computation and Language (cs.CL)

Mon, 22 Apr 2024

[105]  arXiv:2404.13033 [pdf, other]
Title: Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs
Comments: 23 pages, 12 figures, 14 tables
Subjects: Computation and Language (cs.CL)
[106]  arXiv:2404.13020 [pdf, other]
Title: Stronger Random Baselines for In-Context Learning
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[107]  arXiv:2404.12957 [pdf, other]
Title: Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[108]  arXiv:2404.12938 [pdf, other]
Title: MAiDE-up: Multilingual Deception Detection of GPT-generated Hotel Reviews
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109]  arXiv:2404.12933 [pdf, other]
Title: Cross-cultural Inspiration Detection and Analysis in Real and LLM-generated Social Media Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[110]  arXiv:2404.12897 [pdf, other]
Title: Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning
Subjects: Computation and Language (cs.CL)
[111]  arXiv:2404.12879 [pdf, other]
Title: Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Subjects: Computation and Language (cs.CL)
[112]  arXiv:2404.12866 [pdf, other]
Title: How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning?
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[113]  arXiv:2404.12845 [pdf, other]
Title: TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages
Comments: 11 pages, 3 figures
Journal-ref: Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, pp. 120-130, March 2024
Subjects: Computation and Language (cs.CL)
[114]  arXiv:2404.12829 [pdf, other]
Title: LiMe: a Latin Corpus of Late Medieval Criminal Sentences
Comments: to be published in: LT4HALA@LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[115]  arXiv:2404.12827 [pdf, ps, other]
Title: CT-ADE: An Evaluation Benchmark for Adverse Drug Event Prediction from Clinical Trial Results
Subjects: Computation and Language (cs.CL)
[116]  arXiv:2404.12788 [pdf, other]
Title: REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking
Comments: Accepted at NAACL Industry Track 2024
Subjects: Computation and Language (cs.CL)
[117]  arXiv:2404.12753 [pdf, other]
Title: AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation
Comments: 18 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2404.12744 [pdf, other]
Title: Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches
Comments: 16 pages, work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2404.12728 [pdf, other]
Title: Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Subjects: Computation and Language (cs.CL)
[120]  arXiv:2404.12726 [pdf, other]
Title: Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works
Subjects: Computation and Language (cs.CL)
[121]  arXiv:2404.12715 [pdf, other]
Title: Enabling Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration
Comments: 12 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2404.12698 [pdf, other]
Title: Neural Semantic Parsing with Extremely Rich Symbolic Meaning Representations
Comments: This manuscript has been submitted to Computational Linguistics journal on 2024-03-15
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2404.12659 [pdf, ps, other]
Title: SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis
Subjects: Computation and Language (cs.CL)
[124]  arXiv:2404.12642 [pdf, other]
Title: Cooperative Sentiment Agents for Multimodal Sentiment Analysis
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[125]  arXiv:2404.12628 [pdf, other]
Title: Efficient infusion of self-supervised representations in Automatic Speech Recognition
Comments: Accepted to ENLSP workshop, NeurIPS 2023
Subjects: Computation and Language (cs.CL)
[126]  arXiv:2404.12618 [pdf, ps, other]
Title: CORI: CJKV Benchmark with Romanization Integration -- A step towards Cross-lingual Transfer Beyond Textual Scripts
Comments: Accepted at LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[127]  arXiv:2404.12596 [pdf, other]
Title: Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation
Comments: Published in: 2024 5th International Conference on Advancements in Computational Sciences (ICACS) with IEEE
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[128]  arXiv:2404.12580 [pdf, other]
Title: iTBLS: A Dataset of Interactive Conversations Over Tabular Information
Comments: 14 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[129]  arXiv:2404.12560 [pdf, other]
Title: Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL
Comments: 10 pages, 3 figures, 3 tables
Subjects: Computation and Language (cs.CL); Databases (cs.DB)
[130]  arXiv:2404.12545 [pdf, other]
Title: Latent Concept-based Explanation of NLP Models
Subjects: Computation and Language (cs.CL)
[131]  arXiv:2404.12494 [pdf, other]
Title: BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2404.12493 [pdf, other]
Title: EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[133]  arXiv:2404.12491 [pdf, other]
Title: GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134]  arXiv:2404.12489 [pdf, other]
Title: Grammatical Error Correction for Code-Switched Sentences by Learners of English
Subjects: Computation and Language (cs.CL)
[135]  arXiv:2404.12464 [pdf, other]
Title: NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models
Comments: Preprint. In Review
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2404.12452 [pdf, other]
Title: Characterizing LLM Abstention Behavior in Science QA with Context Perturbations
Subjects: Computation and Language (cs.CL)
[137]  arXiv:2404.12447 [pdf, other]
Title: AmbigDocs: Reasoning across Documents on Different Entities under the Same Name
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2404.12444 [pdf, other]
Title: mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?
Comments: Accepted at Findings of NAACL 2024. Project Webpage: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139]  arXiv:2404.13043 (cross-list from cs.CV) [pdf, other]
Title: Data Alignment for Zero-Shot Concept Generation in Dermatology AI
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[140]  arXiv:2404.13039 (cross-list from cs.CV) [pdf, other]
Title: LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
Comments: 10 pages, 4 figures, Accepted by CVPRW2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[141]  arXiv:2404.13013 (cross-list from cs.CV) [pdf, other]
Title: Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[142]  arXiv:2404.12994 (cross-list from cs.IR) [pdf, other]
Title: Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs
Comments: Accepted at SIGIR 2024 long paper track
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[143]  arXiv:2404.12872 (cross-list from cs.DB) [pdf, other]
Title: LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency
Comments: 12 pages
Subjects: Databases (cs.DB); Computation and Language (cs.CL)
[144]  arXiv:2404.12843 (cross-list from cs.LG) [pdf, other]
Title: Towards Logically Consistent Language Models via Probabilistic Reasoning
Comments: Accepted at ICLR 2024 Workshop on Reliable and Responsible Foundation Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[145]  arXiv:2404.12720 (cross-list from cs.CV) [pdf, other]
Title: PDF-MVQA: A Dataset for Multimodal Information Retrieval in PDF-based Visual Question Answering
Comments: Accepted by IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[146]  arXiv:2404.12670 (cross-list from cs.IR) [pdf, other]
Title: Towards Human-centered Proactive Conversational Agents
Comments: Accepted by SIGIR 2024 (Perspectives Track)
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[147]  arXiv:2404.12652 (cross-list from cs.CV) [pdf, other]
Title: Pre-trained Vision-Language Models Learn Discoverable Visual Concepts
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[148]  arXiv:2404.12608 (cross-list from cs.DB) [pdf, other]
Title: Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations
Comments: full version of a paper to appear in SIGMOD 2024
Subjects: Databases (cs.DB); Computation and Language (cs.CL); Programming Languages (cs.PL)
[149]  arXiv:2404.12535 (cross-list from cs.LG) [pdf, other]
Title: HalluciBot: Is There No Such Thing as a Bad Question?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[150]  arXiv:2404.12526 (cross-list from cs.LG) [pdf, other]
Title: Adaptive Memory Replay for Continual Learning
Comments: CVPR-W 2024 (Spotlight)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[151]  arXiv:2404.12457 (cross-list from cs.DC) [pdf, other]
Title: RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[152]  arXiv:2404.12394 (cross-list from cs.LG) [pdf, other]
Title: A Big Data Analytics System for Predicting Suicidal Ideation in Real-Time Based on Social Media Streaming Data
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[153]  arXiv:2404.12045 (cross-list from cs.AI) [pdf, other]
Title: RAM: Towards an Ever-Improving Memory System by Learning from Communications
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[154]  arXiv:2404.11891 (cross-list from cs.AI) [pdf, other]
Title: Large Language Models Can Plan Your Travels Rigorously with Formal Verification Tools
Comments: 31 pages, 3 figures, 4 tables, submitted to ACL RR
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)

Fri, 19 Apr 2024

[155]  arXiv:2404.12387 [pdf, other]
Title: Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2404.12365 [pdf, other]
Title: When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes
Comments: Accepted to NAACL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[157]  arXiv:2404.12342 [pdf, other]
Title: Large Language Models in Targeted Sentiment Analysis
Comments: Fine-tuned Flan-T5-xl outperforms the top #1 results of transformer-based classifier in RuSentNE-2023 competition, to appear in Lobachevskii Journal of Mathematics No.8/2024 proceedings
Subjects: Computation and Language (cs.CL)
[158]  arXiv:2404.12318 [pdf, other]
Title: Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment
Subjects: Computation and Language (cs.CL)
[159]  arXiv:2404.12299 [pdf, other]
Title: Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
Comments: 23 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[160]  arXiv:2404.12291 [pdf, ps, other]
Title: Augmenting emotion features in irony detection with Large language modeling
Comments: 11 pages, 3 tables, 2 figures. Accepted by the 25th Chinese Lexical Semantics Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[161]  arXiv:2404.12289 [pdf, other]
Title: Resilience through Scene Context in Visual Referring Expression Generation
Subjects: Computation and Language (cs.CL)
[162]  arXiv:2404.12283 [pdf, ps, other]
Title: Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting
Subjects: Computation and Language (cs.CL)
[163]  arXiv:2404.12274 [pdf, other]
Title: Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
Comments: Accepted by NAACL 2024. Jiabao, Bairu, Zhen, Guanhua contributed equally. This is an updated version of the paper: arXiv:2307.07171
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[164]  arXiv:2404.12253 [pdf, other]
Title: Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[165]  arXiv:2404.12242 [pdf, other]
Title: CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News
Comments: 13 pages, 7 figures, accepted to LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2404.12241 [pdf, other]
[167]  arXiv:2404.12224 [pdf, other]
Title: Length Generalization of Causal Transformers without Position Encoding
Subjects: Computation and Language (cs.CL)
[168]  arXiv:2404.12195 [pdf, other]
Title: OpenBezoar: Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data
Comments: 25 pages, 27 Figures, 8 Tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[169]  arXiv:2404.12177 [pdf, ps, other]
Title: EuSQuAD: Automatically Translated and Aligned SQuAD2.0 for Basque
Comments: Under review in the journal of Procesamiento de Lenguaje Natural
Subjects: Computation and Language (cs.CL)
[170]  arXiv:2404.12174 [pdf, other]
Title: Claim Check-Worthiness Detection: How Well do LLMs Grasp Annotation Guidelines?
Subjects: Computation and Language (cs.CL)
[171]  arXiv:2404.12171 [pdf, other]
Title: Stance Detection on Social Media with Fine-Tuned Large Language Models
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[172]  arXiv:2404.12152 [pdf, other]
Title: FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge
Subjects: Computation and Language (cs.CL)
[173]  arXiv:2404.12145 [pdf, other]
Title: From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[174]  arXiv:2404.12096 [pdf, other]
Title: LongEmbed: Extending Embedding Models for Long Context Retrieval
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[175]  arXiv:2404.12065 [pdf, other]
Title: RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models
Comments: 8 pages, submitted to ACL Rolling Review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Emerging Technologies (cs.ET); Multiagent Systems (cs.MA)
[176]  arXiv:2404.12059 [pdf, other]
Title: Constituents Correspond to Word Sequence Patterns among Sentences with Equivalent Predicate-Argument Structures: Unsupervised Constituency Parsing by Span Matching
Subjects: Computation and Language (cs.CL)
[177]  arXiv:2404.12050 [pdf, other]
Title: emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information
Comments: The dataset is available in this https URL
Subjects: Computation and Language (cs.CL)
[178]  arXiv:2404.12042 [pdf, other]
Title: Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse
Subjects: Computation and Language (cs.CL)
[179]  arXiv:2404.12041 [pdf, other]
Title: Can We Catch the Elephant? The Evolvement of Hallucination Evaluation on Natural Language Generation: A Survey
Comments: 19 pages in total, with 9 pages as main body. Under review as a conference paper at CoLM 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[180]  arXiv:2404.12038 [pdf, other]
Title: Uncovering Safety Risks in Open-source LLMs through Concept Activation Vector
Subjects: Computation and Language (cs.CL)
[181]  arXiv:2404.12022 [pdf, other]
Title: Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration
Subjects: Computation and Language (cs.CL)
[182]  arXiv:2404.12014 [pdf, other]
Title: Enhance Robustness of Language Models Against Variation Attack through Graph Integration
Comments: 12 pages, 4 figures, accepted by COLING 2024
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[183]  arXiv:2404.12013 [pdf, other]
Title: Sequential Compositional Generalization in Multimodal Models
Comments: Accepted to the main conference of NAACL (2024) as a long paper
Subjects: Computation and Language (cs.CL)
[184]  arXiv:2404.12010 [pdf, other]
Title: ParaFusion: A Large-Scale LLM-Driven English Paraphrase Dataset Infused with High-Quality Lexical and Syntactic Diversity
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[185]  arXiv:2404.12006 [pdf, other]
Title: Variational Multi-Modal Hypergraph Attention Network for Multi-Modal Relation Extraction
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2404.11999 [pdf, other]
Title: Token-level Direct Preference Optimization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187]  arXiv:2404.11978 [pdf, other]
Title: EVIT: Event-Oriented Instruction Tuning for Event Reasoning
Subjects: Computation and Language (cs.CL)
[188]  arXiv:2404.11972 [pdf, other]
Title: Aligning Language Models to Explicitly Handle Ambiguity
Subjects: Computation and Language (cs.CL)
[189]  arXiv:2404.11968 [pdf, other]
Title: P-NAL: an Effective and Interpretable Entity Alignment Method
Comments: 13 pages, 2 figures
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2404.11932 [pdf, other]
Title: CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment
Comments: 11 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[191]  arXiv:2404.11916 [pdf, other]
Title: SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up
Comments: 6 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[192]  arXiv:2404.11912 [pdf, other]
Title: TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[193]  arXiv:2404.11845 [pdf, other]
Title: Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
Comments: LREC-COLING2024
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[194]  arXiv:2404.11826 [pdf, other]
Title: AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence
Comments: 19 pages, 11 figures
Subjects: Computation and Language (cs.CL)
[195]  arXiv:2404.11809 [pdf, other]
Title: Sharing Parameter by Conjugation for Knowledge Graph Embeddings in Complex Space
Comments: 8 pages, 1 figure, 6 tables, accepted at TextGraphs-16 workshop held in conjunction with COLING 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[196]  arXiv:2404.11793 [pdf, other]
Title: Enhancing Argument Summarization: Prioritizing Exhaustiveness in Key Point Generation and Introducing an Automatic Coverage Evaluation Metric
Comments: NAACL 2024 Main Conference
Subjects: Computation and Language (cs.CL)
[197]  arXiv:2404.11782 [pdf, other]
Title: REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[198]  arXiv:2404.11757 [pdf, other]
Title: Language Models Still Struggle to Zero-shot Reason about Time Series
Subjects: Computation and Language (cs.CL)
[199]  arXiv:2404.11752 [pdf, ps, other]
Title: Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[200]  arXiv:2404.11730 [pdf, other]
Title: Missed Connections: Lateral Thinking Puzzles for Large Language Models
Comments: 8 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[201]  arXiv:2404.11726 [pdf, other]
Title: Investigating Gender Bias in Turkish Language Models
Comments: arXiv admin note: text overlap with arXiv:1903.10561 by other authors
Subjects: Computation and Language (cs.CL)
[202]  arXiv:2404.11717 [pdf, other]
Title: How often are errors in natural language reasoning due to paraphrastic variability?
Comments: accepted to TACL 2024 (pre-MIT Press publication version)
Subjects: Computation and Language (cs.CL)
[203]  arXiv:2404.11691 [pdf, ps, other]
Title: Improvement in Semantic Address Matching using Natural Language Processing
Comments: 5 pages, 7 tables, 2021 2nd International Conference for Emerging Technology (INCET)
Journal-ref: 2021 2nd International Conference for Emerging Technology (INCET), Belagavi, India, 2021, pp. 1-5
Subjects: Computation and Language (cs.CL)
[204]  arXiv:2404.11682 [pdf, other]
Title: How Well Can You Articulate that Idea? Insights from Automated Formative Assessment
Subjects: Computation and Language (cs.CL)
[205]  arXiv:2404.11672 [pdf, other]
Title: MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory
Subjects: Computation and Language (cs.CL)
[206]  arXiv:2404.12390 (cross-list from cs.CV) [pdf, other]
Title: BLINK: Multimodal Large Language Models Can See but Not Perceive
Comments: Multimodal Benchmark, Project Url: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[207]  arXiv:2404.12273 (cross-list from cs.AI) [pdf, other]
Title: FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom
Comments: In Progress
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[208]  arXiv:2404.12150 (cross-list from cs.LG) [pdf, other]
Title: Aligning language models with human preferences
Authors: Tomasz Korbak
Comments: PhD thesis
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[209]  arXiv:2404.12132 (cross-list from cs.SD) [pdf, other]
Title: Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[210]  arXiv:2404.12104 (cross-list from cs.CV) [pdf, other]
Title: Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models
Comments: 42 pages, 17 figures, 29 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[211]  arXiv:2404.12077 (cross-list from cs.SD) [pdf, other]
Title: TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches
Authors: Rong Wang, Kun Sun
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[212]  arXiv:2404.11870 (cross-list from cs.LG) [pdf, ps, other]
Title: Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory
Comments: Preprint
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[213]  arXiv:2404.11619 (cross-list from eess.AS) [pdf, ps, other]
Title: Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech
Comments: 2 pages
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)

Thu, 18 Apr 2024 (showing first 43 of 57 entries)

[214]  arXiv:2404.11588 [pdf, other]
Title: Related Work and Citation Text Generation: A Survey
Subjects: Computation and Language (cs.CL)
[215]  arXiv:2404.11553 [pdf, other]
Title: Quantifying Multilingual Performance of Large Language Models Across Languages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[216]  arXiv:2404.11539 [pdf, other]
Title: Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis
Comments: 10 pages
Subjects: Computation and Language (cs.CL)
[217]  arXiv:2404.11532 [pdf, other]
Title: Select and Reorder: A Novel Approach for Neural Sign Language Production
Comments: 8 Pages, 5 Figures, 7 Tables, LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[218]  arXiv:2404.11531 [pdf, other]
Title: Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization
Subjects: Computation and Language (cs.CL)
[219]  arXiv:2404.11502 [pdf, other]
Title: Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[220]  arXiv:2404.11500 [pdf, other]
Title: Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models
Comments: Accepted to the main conference of NAACL (2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[221]  arXiv:2404.11499 [pdf, other]
Title: A Data-Driven Representation for Sign Language Production
Comments: 8 Pages, 3 Figures, 7 Tables, 18th IEEE International Conference on Automatic Face and Gesture Recognition 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[222]  arXiv:2404.11470 [pdf, other]
Title: A Federated Learning Approach to Privacy Preserving Offensive Language Identification
Comments: Accepted to TRAC 2024 (Fourth Workshop on Threat, Aggression and Cyberbullying) at LREC-COLING 2024 (The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[223]  arXiv:2404.11459 [pdf, other]
Title: Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
Authors: Wei Chen, Zhiyuan Li
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[224]  arXiv:2404.11449 [pdf, other]
Title: AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[225]  arXiv:2404.11446 [pdf, other]
Title: Open-Ended Wargames with Large Language Models
Comments: 15 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[226]  arXiv:2404.11384 [pdf, other]
Title: Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning
Comments: 11 pages, 4 figures, 4 tables. Accepted to NAACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[227]  arXiv:2404.11349 [pdf, other]
Title: TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu
Comments: Accepted at LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[228]  arXiv:2404.11315 [pdf, other]
Title: To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese
Comments: 13 pages; accepted by LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[229]  arXiv:2404.11288 [pdf, other]
Title: A Preference-driven Paradigm for Enhanced Translation with Large Language Models
Comments: Accepted to NAACL 2024 (long, main)
Subjects: Computation and Language (cs.CL)
[230]  arXiv:2404.11262 [pdf, other]
Title: Sampling-based Pseudo-Likelihood for Membership Inference Attacks
Subjects: Computation and Language (cs.CL)
[231]  arXiv:2404.11225 [pdf, other]
Title: In-Context Learning State Vector with Inner and Momentum Optimization
Comments: 17 pages, 7 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[232]  arXiv:2404.11216 [pdf, other]
Title: Position Engineering: Boosting Large Language Models through Positional Information Manipulation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[233]  arXiv:2404.11206 [pdf, other]
Title: Prompt-tuning for Clickbait Detection via Text Summarization
Subjects: Computation and Language (cs.CL)
[234]  arXiv:2404.11201 [pdf, other]
Title: Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Subjects: Computation and Language (cs.CL)
[235]  arXiv:2404.11184 [pdf, other]
Title: FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document
Subjects: Computation and Language (cs.CL)
[236]  arXiv:2404.11141 [pdf, other]
Title: Context-Aware Siamese Networks for Efficient Emotion Recognition in Conversation
Authors: Barbara Gendron (LORIA, Uni.lu), Gaël Guibon (LORIA)
Subjects: Computation and Language (cs.CL)
[237]  arXiv:2404.11132 [pdf, other]
Title: A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation
Authors: Bin Zhang, Junli Wang
Subjects: Computation and Language (cs.CL)
[238]  arXiv:2404.11124 [pdf, other]
Title: What's under the hood: Investigating Automatic Metrics on Meeting Summarization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[239]  arXiv:2404.11109 [pdf, other]
Title: Consistency Training by Synthetic Question Generation for Conversational Question Answering
Subjects: Computation and Language (cs.CL)
[240]  arXiv:2404.11095 [pdf, other]
Title: Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Comments: 27 pages, 3 figures, 12 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[241]  arXiv:2404.11086 [pdf, other]
Title: ViLLM-Eval: A Comprehensive Evaluation Suite for Vietnamese Large Language Models
Comments: arXiv admin note: text overlap with arXiv:2305.08322 by other authors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[242]  arXiv:2404.11061 [pdf, other]
Title: Unified Examination of Entity Linking in Absence of Candidate Sets
Subjects: Computation and Language (cs.CL)
[243]  arXiv:2404.11055 [pdf, other]
Title: On the Causal Nature of Sentiment Analysis
Comments: An enhanced version of our previous exploration in arXiv:2305.01764
Subjects: Computation and Language (cs.CL)
[244]  arXiv:2404.11045 [pdf, other]
Title: Offset Unlearning for Large Language Models
Subjects: Computation and Language (cs.CL)
[245]  arXiv:2404.10975 [pdf, other]
Title: Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
Comments: CogSci 2024
Subjects: Computation and Language (cs.CL)
[246]  arXiv:2404.10960 [pdf, other]
Title: Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[247]  arXiv:2404.10952 [pdf, other]
Title: Can Language Models Solve Olympiad Programming?
Comments: Code and data: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
[248]  arXiv:2404.10939 [pdf, other]
Title: More Room for Language: Investigating the Effect of Retrieval on Language Models
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[249]  arXiv:2404.10924 [pdf, other]
Title: Binder: Hierarchical Concept Representation through Order Embedding of Binary Vectors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[250]  arXiv:2404.10922 [pdf, other]
Title: Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
Comments: NAACL Findings 2024
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[251]  arXiv:2404.10917 [pdf, other]
Title: Which questions should I answer? Salience Prediction of Inquisitive Questions
Subjects: Computation and Language (cs.CL)
[252]  arXiv:2404.10887 [pdf, other]
Title: Search Beyond Queries: Training Smaller Language Models for Web Interactions via Reinforcement Learning
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[253]  arXiv:2404.10877 [pdf, other]
Title: Incubating Text Classifiers Following User Instruction with Nothing but LLM
Subjects: Computation and Language (cs.CL)
[254]  arXiv:2404.10859 [pdf, other]
Title: Forcing Diffuse Distributions out of Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[255]  arXiv:2404.10857 [pdf, other]
Title: D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
Subjects: Computation and Language (cs.CL)
[256]  arXiv:2404.10848 [pdf, other]
Title: A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[ total of 317 entries: 1-256 | 257-317 ]
[ showing 256 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)