Computation and Language

Authors and titles for recent submissions

[ total of 346 entries: 1-147 | 148-294 | 295-346 ]

Fri, 19 Apr 2024

[1]  arXiv:2404.12387 [pdf, other]
Title: Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2]  arXiv:2404.12365 [pdf, other]
Title: When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes
Comments: Accepted to NAACL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[3]  arXiv:2404.12342 [pdf, other]
Title: Large Language Models in Targeted Sentiment Analysis
Comments: Fine-tuned Flan-T5-xl outperforms the top #1 results of transformer-based classifier in RuSentNE-2023 competition, to appear in Lobachevskii Journal of Mathematics No.8/2024 proceedings
Subjects: Computation and Language (cs.CL)
[4]  arXiv:2404.12318 [pdf, other]
Title: Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment
Subjects: Computation and Language (cs.CL)
[5]  arXiv:2404.12299 [pdf, other]
Title: Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
Comments: 23 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[6]  arXiv:2404.12291 [pdf, ps, other]
Title: Augmenting emotion features in irony detection with Large language modeling
Comments: 11 pages, 3 tables, 2 figures. Submitted to the 25th Chinese Lexical Semantics Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[7]  arXiv:2404.12289 [pdf, other]
Title: Resilience through Scene Context in Visual Referring Expression Generation
Subjects: Computation and Language (cs.CL)
[8]  arXiv:2404.12283 [pdf, ps, other]
Title: Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting
Subjects: Computation and Language (cs.CL)
[9]  arXiv:2404.12274 [pdf, other]
Title: Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
Comments: Accepted by NAACL 2024. Jiabao, Bairu, Zhen, Guanhua contributed equally. This is an updated version of the paper: arXiv:2307.07171
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[10]  arXiv:2404.12253 [pdf, other]
Title: Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Subjects: Computation and Language (cs.CL)
[11]  arXiv:2404.12242 [pdf, other]
Title: CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News
Comments: 13 pages, 7 figures, accepted to LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2404.12241 [pdf, other]
[13]  arXiv:2404.12224 [pdf, other]
Title: Length Generalization of Causal Transformers without Position Encoding
Subjects: Computation and Language (cs.CL)
[14]  arXiv:2404.12195 [pdf, other]
Title: OpenBezoar: Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data
Comments: 25 pages, 27 Figures, 8 Tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[15]  arXiv:2404.12177 [pdf, ps, other]
Title: EuSQuAD: Automatically Translated and Aligned SQuAD2.0 for Basque
Comments: Under review in the journal of Procesamiento de Lenguaje Natural
Subjects: Computation and Language (cs.CL)
[16]  arXiv:2404.12174 [pdf, other]
Title: Claim Check-Worthiness Detection: How Well do LLMs Grasp Annotation Guidelines?
Subjects: Computation and Language (cs.CL)
[17]  arXiv:2404.12171 [pdf, other]
Title: Stance Detection on Social Media with Fine-Tuned Large Language Models
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[18]  arXiv:2404.12152 [pdf, other]
Title: FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge
Subjects: Computation and Language (cs.CL)
[19]  arXiv:2404.12145 [pdf, other]
Title: From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[20]  arXiv:2404.12096 [pdf, other]
Title: LongEmbed: Extending Embedding Models for Long Context Retrieval
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[21]  arXiv:2404.12065 [pdf, other]
Title: RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models
Comments: 8 pages, submitted to ACL Rolling Review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Emerging Technologies (cs.ET); Multiagent Systems (cs.MA)
[22]  arXiv:2404.12059 [pdf, other]
Title: Constituents Correspond to Word Sequence Patterns among Sentences with Equivalent Predicate-Argument Structures: Unsupervised Constituency Parsing by Span Matching
Subjects: Computation and Language (cs.CL)
[23]  arXiv:2404.12050 [pdf, other]
Title: emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information
Comments: The dataset is available in this https URL
Subjects: Computation and Language (cs.CL)
[24]  arXiv:2404.12042 [pdf, other]
Title: Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse
Subjects: Computation and Language (cs.CL)
[25]  arXiv:2404.12041 [pdf, other]
Title: Can We Catch the Elephant? The Evolvement of Hallucination Evaluation on Natural Language Generation: A Survey
Comments: 19 pages in total, with 9 pages as main body. Under review as a conference paper at CoLM 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[26]  arXiv:2404.12038 [pdf, other]
Title: Uncovering Safety Risks in Open-source LLMs through Concept Activation Vector
Subjects: Computation and Language (cs.CL)
[27]  arXiv:2404.12022 [pdf, other]
Title: Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration
Subjects: Computation and Language (cs.CL)
[28]  arXiv:2404.12014 [pdf, other]
Title: Enhance Robustness of Language Models Against Variation Attack through Graph Integration
Comments: 12 pages, 4 figures, accepted by COLING 2024
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[29]  arXiv:2404.12013 [pdf, other]
Title: Sequential Compositional Generalization in Multimodal Models
Comments: Accepted to the main conference of NAACL (2024) as a long paper
Subjects: Computation and Language (cs.CL)
[30]  arXiv:2404.12010 [pdf, other]
Title: ParaFusion: A Large-Scale LLM-Driven English Paraphrase Dataset Infused with High-Quality Lexical and Syntactic Diversity
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[31]  arXiv:2404.12006 [pdf, other]
Title: Variational Multi-Modal Hypergraph Attention Network for Multi-Modal Relation Extraction
Subjects: Computation and Language (cs.CL)
[32]  arXiv:2404.11999 [pdf, other]
Title: Token-level Direct Preference Optimization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[33]  arXiv:2404.11978 [pdf, other]
Title: EVIT: Event-Oriented Instruction Tuning for Event Reasoning
Subjects: Computation and Language (cs.CL)
[34]  arXiv:2404.11972 [pdf, other]
Title: Aligning Language Models to Explicitly Handle Ambiguity
Subjects: Computation and Language (cs.CL)
[35]  arXiv:2404.11968 [pdf, other]
Title: P-NAL: an Effective and Interpretable Entity Alignment Method
Comments: 13 pages, 2 figures
Subjects: Computation and Language (cs.CL)
[36]  arXiv:2404.11932 [pdf, other]
Title: CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment
Comments: 11 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[37]  arXiv:2404.11916 [pdf, other]
Title: SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up
Comments: 6 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[38]  arXiv:2404.11912 [pdf, other]
Title: TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[39]  arXiv:2404.11845 [pdf, other]
Title: Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
Comments: LREC-COLING2024
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[40]  arXiv:2404.11826 [pdf, other]
Title: AdvisorQA: Towards Helpful and Harmless Advice-seeking Question Answering with Collective Intelligence
Comments: 19 pages, 11 figures
Subjects: Computation and Language (cs.CL)
[41]  arXiv:2404.11809 [pdf, other]
Title: Sharing Parameter by Conjugation for Knowledge Graph Embeddings in Complex Space
Comments: 8 pages, 1 figure, 6 tables, accepted at TextGraphs-16 workshop held in conjunction with COLING 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[42]  arXiv:2404.11793 [pdf, other]
Title: Enhancing Argument Summarization: Prioritizing Exhaustiveness in Key Point Generation and Introducing an Automatic Coverage Evaluation Metric
Comments: NAACL 2024 Main Conference
Subjects: Computation and Language (cs.CL)
[43]  arXiv:2404.11782 [pdf, other]
Title: REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[44]  arXiv:2404.11757 [pdf, other]
Title: Language Models Still Struggle to Zero-shot Reason about Time Series
Subjects: Computation and Language (cs.CL)
[45]  arXiv:2404.11752 [pdf, ps, other]
Title: Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[46]  arXiv:2404.11730 [pdf, other]
Title: Missed Connections: Lateral Thinking Puzzles for Large Language Models
Comments: 8 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[47]  arXiv:2404.11726 [pdf, other]
Title: Investigating Gender Bias in Turkish Language Models
Comments: arXiv admin note: text overlap with arXiv:1903.10561 by other authors
Subjects: Computation and Language (cs.CL)
[48]  arXiv:2404.11717 [pdf, other]
Title: How often are errors in natural language reasoning due to paraphrastic variability?
Comments: accepted to TACL 2024 (pre-MIT Press publication version)
Subjects: Computation and Language (cs.CL)
[49]  arXiv:2404.11691 [pdf, ps, other]
Title: Improvement in Semantic Address Matching using Natural Language Processing
Comments: 5 pages, 7 tables, 2021 2nd International Conference for Emerging Technology (INCET)
Journal-ref: 2021 2nd International Conference for Emerging Technology (INCET), Belagavi, India, 2021, pp. 1-5
Subjects: Computation and Language (cs.CL)
[50]  arXiv:2404.11682 [pdf, other]
Title: How Well Can You Articulate that Idea? Insights from Automated Formative Assessment
Subjects: Computation and Language (cs.CL)
[51]  arXiv:2404.11672 [pdf, other]
Title: MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory
Subjects: Computation and Language (cs.CL)
[52]  arXiv:2404.12390 (cross-list from cs.CV) [pdf, other]
Title: BLINK: Multimodal Large Language Models Can See but Not Perceive
Comments: Multimodal Benchmark, Project Url: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[53]  arXiv:2404.12273 (cross-list from cs.AI) [pdf, other]
Title: FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom
Comments: In Progress
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[54]  arXiv:2404.12150 (cross-list from cs.LG) [pdf, other]
Title: Aligning language models with human preferences
Authors: Tomasz Korbak
Comments: PhD thesis
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[55]  arXiv:2404.12132 (cross-list from cs.SD) [pdf, other]
Title: Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[56]  arXiv:2404.12104 (cross-list from cs.CV) [pdf, other]
Title: Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models
Comments: 42 pages, 17 figures, 29 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[57]  arXiv:2404.12077 (cross-list from cs.SD) [pdf, other]
Title: TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches
Authors: Rong Wang, Kun Sun
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[58]  arXiv:2404.11870 (cross-list from cs.LG) [pdf, ps, other]
Title: Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory
Comments: Preprint
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[59]  arXiv:2404.11619 (cross-list from eess.AS) [pdf, ps, other]
Title: Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech
Comments: 2 pages
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)

Thu, 18 Apr 2024

[60]  arXiv:2404.11588 [pdf, other]
Title: Related Work and Citation Text Generation: A Survey
Subjects: Computation and Language (cs.CL)
[61]  arXiv:2404.11553 [pdf, other]
Title: Quantifying Multilingual Performance of Large Language Models Across Languages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[62]  arXiv:2404.11539 [pdf, other]
Title: Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis
Comments: 10 pages
Subjects: Computation and Language (cs.CL)
[63]  arXiv:2404.11532 [pdf, other]
Title: Select and Reorder: A Novel Approach for Neural Sign Language Production
Comments: 8 Pages, 5 Figures, 7 Tables, LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[64]  arXiv:2404.11531 [pdf, other]
Title: Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization
Subjects: Computation and Language (cs.CL)
[65]  arXiv:2404.11502 [pdf, other]
Title: Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66]  arXiv:2404.11500 [pdf, other]
Title: Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models
Comments: Accepted to the main conference of NAACL (2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[67]  arXiv:2404.11499 [pdf, other]
Title: A Data-Driven Representation for Sign Language Production
Comments: 8 Pages, 3 Figures, 7 Tables, 18th IEEE International Conference on Automatic Face and Gesture Recognition 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[68]  arXiv:2404.11470 [pdf, other]
Title: A Federated Learning Approach to Privacy Preserving Offensive Language Identification
Comments: Accepted to TRAC 2024 (Fourth Workshop on Threat, Aggression and Cyberbullying) at LREC-COLING 2024 (The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[69]  arXiv:2404.11459 [pdf, other]
Title: Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
Authors: Wei Chen, Zhiyuan Li
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[70]  arXiv:2404.11449 [pdf, other]
Title: AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[71]  arXiv:2404.11446 [pdf, other]
Title: Open-Ended Wargames with Large Language Models
Comments: 15 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[72]  arXiv:2404.11384 [pdf, other]
Title: Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning
Comments: 11 pages, 4 figures, 4 tables. Accepted to NAACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[73]  arXiv:2404.11349 [pdf, other]
Title: TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu
Comments: Accepted at LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[74]  arXiv:2404.11315 [pdf, other]
Title: To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese
Comments: 13 pages; accepted by LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[75]  arXiv:2404.11288 [pdf, other]
Title: A Preference-driven Paradigm for Enhanced Translation with Large Language Models
Comments: Accepted to NAACL 2024 (long, main)
Subjects: Computation and Language (cs.CL)
[76]  arXiv:2404.11262 [pdf, other]
Title: Sampling-based Pseudo-Likelihood for Membership Inference Attacks
Subjects: Computation and Language (cs.CL)
[77]  arXiv:2404.11225 [pdf, other]
Title: In-Context Learning State Vector with Inner and Momentum Optimization
Comments: 17 pages, 7 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[78]  arXiv:2404.11216 [pdf, other]
Title: Position Engineering: Boosting Large Language Models through Positional Information Manipulation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[79]  arXiv:2404.11206 [pdf, other]
Title: Prompt-tuning for Clickbait Detection via Text Summarization
Subjects: Computation and Language (cs.CL)
[80]  arXiv:2404.11201 [pdf, other]
Title: Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Subjects: Computation and Language (cs.CL)
[81]  arXiv:2404.11184 [pdf, other]
Title: FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document
Subjects: Computation and Language (cs.CL)
[82]  arXiv:2404.11141 [pdf, other]
Title: Context-Aware Siamese Networks for Efficient Emotion Recognition in Conversation
Authors: Barbara Gendron (LORIA, Uni.lu), Gaël Guibon (LORIA)
Subjects: Computation and Language (cs.CL)
[83]  arXiv:2404.11132 [pdf, other]
Title: A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation
Authors: Bin Zhang, Junli Wang
Subjects: Computation and Language (cs.CL)
[84]  arXiv:2404.11124 [pdf, other]
Title: What's under the hood: Investigating Automatic Metrics on Meeting Summarization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[85]  arXiv:2404.11109 [pdf, other]
Title: Consistency Training by Synthetic Question Generation for Conversational Question Answering
Subjects: Computation and Language (cs.CL)
[86]  arXiv:2404.11095 [pdf, other]
Title: Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues
Comments: 27 pages, 3 figures, 12 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[87]  arXiv:2404.11086 [pdf, other]
Title: ViLLM-Eval: A Comprehensive Evaluation Suite for Vietnamese Large Language Models
Comments: arXiv admin note: text overlap with arXiv:2305.08322 by other authors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[88]  arXiv:2404.11061 [pdf, other]
Title: Unified Examination of Entity Linking in Absence of Candidate Sets
Subjects: Computation and Language (cs.CL)
[89]  arXiv:2404.11055 [pdf, other]
Title: On the Causal Nature of Sentiment Analysis
Comments: An enhanced version of our previous exploration in arXiv:2305.01764
Subjects: Computation and Language (cs.CL)
[90]  arXiv:2404.11045 [pdf, other]
Title: Offset Unlearning for Large Language Models
Subjects: Computation and Language (cs.CL)
[91]  arXiv:2404.10975 [pdf, other]
Title: Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
Comments: CogSci 2024
Subjects: Computation and Language (cs.CL)
[92]  arXiv:2404.10960 [pdf, other]
Title: Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[93]  arXiv:2404.10952 [pdf, other]
Title: Can Language Models Solve Olympiad Programming?
Comments: Code and data: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
[94]  arXiv:2404.10939 [pdf, other]
Title: More Room for Language: Investigating the Effect of Retrieval on Language Models
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2404.10924 [pdf, other]
Title: Binder: Hierarchical Concept Representation through Order Embedding of Binary Vectors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96]  arXiv:2404.10922 [pdf, other]
Title: Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
Comments: NAACL Findings 2024
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[97]  arXiv:2404.10917 [pdf, other]
Title: Which questions should I answer? Salience Prediction of Inquisitive Questions
Subjects: Computation and Language (cs.CL)
[98]  arXiv:2404.10887 [pdf, other]
Title: Search Beyond Queries: Training Smaller Language Models for Web Interactions via Reinforcement Learning
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2404.10877 [pdf, other]
Title: Incubating Text Classifiers Following User Instruction with Nothing but LLM
Subjects: Computation and Language (cs.CL)
[100]  arXiv:2404.10859 [pdf, other]
Title: Forcing Diffuse Distributions out of Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101]  arXiv:2404.10857 [pdf, other]
Title: D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
Subjects: Computation and Language (cs.CL)
[102]  arXiv:2404.10848 [pdf, other]
Title: A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2404.10830 [pdf, other]
Title: Fewer Truncations Improve Language Modeling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2404.11584 (cross-list from cs.AI) [pdf, other]
Title: The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Comments: 13 pages, 6 figures, 38 references
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[105]  arXiv:2404.11538 (cross-list from cs.LG) [pdf, other]
Title: GenFighter: A Generative and Evolutive Textual Attack Removal
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[106]  arXiv:2404.11457 (cross-list from cs.IR) [pdf, other]
Title: Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[107]  arXiv:2404.11447 (cross-list from cs.AI) [pdf, ps, other]
Title: Research on emotionally intelligent dialogue generation based on automatic dialogue system
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[108]  arXiv:2404.11205 (cross-list from cs.CV) [pdf, other]
Title: Kathakali Hand Gesture Recognition With Minimal Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109]  arXiv:2404.11049 (cross-list from cs.LG) [pdf, other]
Title: Stepwise Alignment for Constrained Language Model Policy Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[110]  arXiv:2404.11036 (cross-list from cs.LG) [pdf, other]
Title: Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[111]  arXiv:2404.11023 (cross-list from cs.HC) [pdf, other]
Title: Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
Comments: Position Paper, Under Review, 19 pages, 2 figures
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[112]  arXiv:2404.11018 (cross-list from cs.LG) [pdf, other]
Title: Many-Shot In-Context Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113]  arXiv:2404.10981 (cross-list from cs.IR) [pdf, other]
Title: A Survey on Retrieval-Augmented Text Generation for Large Language Models
Comments: Ongoing work
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[114]  arXiv:2404.10934 (cross-list from cs.LG) [pdf, other]
Title: Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Industry Track)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[115]  arXiv:2404.10933 (cross-list from cs.AI) [pdf, other]
Title: LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
Comments: 9 pages, 9 figures, accepted to IJCAI 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[116]  arXiv:2404.10838 (cross-list from cs.CV) [pdf, other]
Title: Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Wed, 17 Apr 2024 (showing first 31 of 47 entries)

[117]  arXiv:2404.10774 [pdf, other]
Title: MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Comments: LLM-AggreFact benchmark, MiniCheck models, data generation code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2404.10719 [pdf, other]
Title: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Comments: 16 pages, 2 figures, 14 tables
Subjects: Computation and Language (cs.CL)
[119]  arXiv:2404.10710 [pdf, other]
Title: Dual Modalities of Text: Visual and Textual Generative Pre-training
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2404.10704 [pdf, other]
Title: Question Difficulty Ranking for Multiple-Choice Reading Comprehension
Comments: 7 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121]  arXiv:2404.10696 [pdf, other]
Title: Integrating knowledge bases to improve coreference and bridging resolution for the chemical domain
Comments: working in progress
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2404.10652 [pdf, other]
Title: ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Comments: Preprint submitted to IJCV
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2404.10642 [pdf, other]
Title: Self-playing Adversarial Language Game Enhances LLM Reasoning
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[124]  arXiv:2404.10630 [pdf, other]
Title: HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[125]  arXiv:2404.10555 [pdf, other]
Title: Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training
Comments: 7 pages
Subjects: Computation and Language (cs.CL); Computational Finance (q-fin.CP)
[126]  arXiv:2404.10552 [pdf, other]
Title: Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[127]  arXiv:2404.10513 [pdf, other]
Title: CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[128]  arXiv:2404.10508 [pdf, other]
Title: White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[129]  arXiv:2404.10503 [pdf, other]
Title: A Sentiment Analysis of Medical Text Based on Deep Learning
Authors: Yinan Chen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130]  arXiv:2404.10500 [pdf, other]
Title: When Emotional Stimuli meet Prompt Designing: An Auto-Prompt Graphical Paradigm
Comments: 9 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131]  arXiv:2404.10475 [pdf, other]
Title: Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2404.10464 [pdf, other]
Title: DESTEIN: Navigating Detoxification of Language Models via Universal Steering Pairs and Head-wise Activation Fusion
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[133]  arXiv:2404.10440 [pdf, other]
Title: Language Proficiency and F0 Entrainment: A Study of L2 English Imitation in Italian, French, and Slovak Speakers
Comments: Accepted at Speech Prosody 2024
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[134]  arXiv:2404.10384 [pdf, other]
Title: Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[135]  arXiv:2404.10346 [pdf, other]
Title: Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
Comments: Preprint Under Review
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2404.10315 [pdf, other]
Title: Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience
Subjects: Computation and Language (cs.CL)
[137]  arXiv:2404.10306 [pdf, other]
Title: Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Comments: 43 pages, 10 figures
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2404.10297 [pdf, other]
Title: Future Language Modeling from Temporal Document History
Comments: Accepted by ICLR 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139]  arXiv:2404.10268 [pdf, other]
Title: Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation
Comments: Accepted to the main conference of LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[140]  arXiv:2404.10259 [pdf, other]
Title: Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[141]  arXiv:2404.10229 [pdf, other]
Title: Generative Text Steganography with Large Language Model
Subjects: Computation and Language (cs.CL)
[142]  arXiv:2404.10199 [pdf, other]
Title: CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[143]  arXiv:2404.10198 [pdf, other]
Title: How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[144]  arXiv:2404.10180 [pdf, other]
Title: Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
Comments: 9 pages, 3 figures, accepted by NAACL 2024 - Industry Track
Journal-ref: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics - Industry Track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[145]  arXiv:2404.10174 [pdf, other]
Title: On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning
Subjects: Computation and Language (cs.CL)
[146]  arXiv:2404.10150 [pdf, other]
Title: TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition
Comments: Accepted to NAACL 2024 (long, main)
Subjects: Computation and Language (cs.CL); Databases (cs.DB); Information Retrieval (cs.IR)
[147]  arXiv:2404.10136 [pdf, other]
Title: Language Model Cascades: Token-level uncertainty and beyond
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)