We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions

[ total of 299 entries: 1-140 | 141-280 | 281-299 ]
[ showing 140 entries per page: fewer | more | all ]

Thu, 9 May 2024

[1]  arXiv:2405.05254 [pdf, other]
Title: You Only Cache Once: Decoder-Decoder Architectures for Language Models
Subjects: Computation and Language (cs.CL)
[2]  arXiv:2405.05253 [pdf, other]
Title: Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge
Comments: 7 pages, 4 figures, 2 tables. Accepted for publication at the 29th annual ACM conference on Innovation and Technology in Computer Science Education (ITiCSE 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[3]  arXiv:2405.05248 [pdf, other]
Title: LLMs with Personalities in Multi-issue Negotiation Games
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[4]  arXiv:2405.05204 [pdf, ps, other]
Title: CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation
Comments: 28 pages, 3 figures, 4 tables. 5 Appendices
Subjects: Computation and Language (cs.CL)
[5]  arXiv:2405.05189 [pdf, other]
Title: MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning
Comments: Under review at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[6]  arXiv:2405.05176 [pdf, other]
Title: Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming
Comments: 18 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[7]  arXiv:2405.05161 [pdf, ps, other]
Title: Motion Capture Analysis of Verb and Adjective Types in Austrian Sign Language
Comments: 10 pages, 7 figures
Subjects: Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[8]  arXiv:2405.05116 [pdf, other]
Title: XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples
Subjects: Computation and Language (cs.CL)
[9]  arXiv:2405.05109 [pdf, other]
Title: QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs
Comments: 16 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[10]  arXiv:2405.05060 [pdf, other]
Title: Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models
Comments: 5 pages excluding references, 3 figures; accepted at Clinical NLP Workshop @ NAACL 2024
Subjects: Computation and Language (cs.CL)
[11]  arXiv:2405.05049 [pdf, ps, other]
Title: Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2405.05008 [pdf, other]
Title: ADELIE: Aligning Large Language Models on Information Extraction
Subjects: Computation and Language (cs.CL)
[13]  arXiv:2405.04960 [pdf, other]
Title: P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
Subjects: Computation and Language (cs.CL)
[14]  arXiv:2405.04955 [pdf, other]
Title: Improving Long Text Understanding with Knowledge Distilled from Summarization Model
Comments: arXiv admin note: text overlap with arXiv:2110.04741
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[15]  arXiv:2405.04897 [pdf, ps, other]
Title: Machine Learning-based NLP for Emotion Classification on a Cholera X Dataset
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[16]  arXiv:2405.04872 [pdf, other]
Title: Logical Negation Augmenting and Debiasing for Prompt-based Methods
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[17]  arXiv:2405.04829 [pdf, other]
Title: Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages
Comments: 8 pages, accepted in NAACL-SRW, 2024
Subjects: Computation and Language (cs.CL)
[18]  arXiv:2405.04828 [pdf, other]
Title: ChuXin: 1.6B Technical Report
Comments: Technical Report
Subjects: Computation and Language (cs.CL)
[19]  arXiv:2405.04820 [pdf, other]
Title: APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[20]  arXiv:2405.04819 [pdf, other]
Title: DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature
Comments: Under Review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[21]  arXiv:2405.04818 [pdf, other]
Title: ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
Comments: 18 pages, 7 figures, under review. Data available here: this https URL
Subjects: Computation and Language (cs.CL)
[22]  arXiv:2405.04793 [pdf, other]
Title: Zero-shot LLM-guided Counterfactual Generation for Text
Comments: arXiv admin note: text overlap with arXiv:2309.13340
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[23]  arXiv:2405.04781 [pdf, other]
Title: CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization
Subjects: Computation and Language (cs.CL)
[24]  arXiv:2405.04777 [pdf, other]
Title: Empathy Through Multimodality in Conversational Interfaces
Comments: 7 pages, 2 figures, 2 tables, conference paper
Subjects: Computation and Language (cs.CL)
[25]  arXiv:2405.04756 [pdf, other]
Title: BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[26]  arXiv:2405.04726 [pdf, other]
Title: Learning Phonotactics from Linguistic Informants
Subjects: Computation and Language (cs.CL)
[27]  arXiv:2405.04685 [pdf, other]
Title: Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[28]  arXiv:2405.04655 [pdf, other]
Title: Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense
Subjects: Computation and Language (cs.CL)
[29]  arXiv:2405.04590 [pdf, other]
Title: Language Modeling Using Tensor Trains
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[30]  arXiv:2405.04585 [pdf, other]
Title: PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models
Authors: Arpit Aggarwal
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[31]  arXiv:2405.05175 (cross-list from cs.CR) [pdf, other]
Title: Air Gap: Protecting Privacy-Conscious Conversational Agents
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[32]  arXiv:2405.05136 (cross-list from cs.CY) [pdf, other]
Title: Integrating LSTM and BERT for Long-Sequence Data Analysis in Intelligent Tutoring Systems
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[33]  arXiv:2405.05135 (cross-list from cs.SE) [pdf, ps, other]
Title: Lessons from the Use of Natural Language Inference (NLI) in Requirements Engineering Tasks
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[34]  arXiv:2405.04950 (cross-list from cs.CV) [pdf, other]
Title: VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Comments: 17 pages; Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[35]  arXiv:2405.04758 (cross-list from cs.CR) [pdf, other]
Title: Honeyfile Camouflage: Hiding Fake Files in Plain Sight
Comments: 3rd Workshop on the security implications of Deepfakes and Cheapfakes (WDC) co-located at ACM ASIACCS 2024
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[36]  arXiv:2405.04620 (cross-list from hep-ph) [pdf, ps, other]
Title: Folded context condensation in Path Integral formalism for infinite context transformers
Comments: 7 pages, 2 figures
Subjects: High Energy Physics - Phenomenology (hep-ph); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)

Wed, 8 May 2024

[37]  arXiv:2405.04532 [pdf, other]
Title: QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Comments: The first three authors contribute equally to this project and are listed in the alphabetical order. Yujun Lin leads the quantization algorithm, Haotian Tang and Shang Yang lead the GPU kernels and the serving system. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[38]  arXiv:2405.04520 [pdf, other]
Title: NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[39]  arXiv:2405.04515 [pdf, other]
Title: A Transformer with Stack Attention
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[40]  arXiv:2405.04513 [pdf, other]
Title: Switchable Decision: Dynamic Neural Generation Networks
Comments: Accepted to ICML 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[41]  arXiv:2405.04495 [pdf, other]
Title: Toward In-Context Teaching: Adapting Examples to Students' Misconceptions
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[42]  arXiv:2405.04435 [pdf, other]
Title: Fast Exact Retrieval for Nearest-neighbor Lookup (FERN)
Authors: Richard Zhu
Comments: NAACL 2024 SRW
Subjects: Computation and Language (cs.CL); Data Structures and Algorithms (cs.DS)
[43]  arXiv:2405.04434 [pdf, other]
Title: DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Authors: DeepSeek-AI
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44]  arXiv:2405.04325 [pdf, other]
Title: Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
Subjects: Computation and Language (cs.CL)
[45]  arXiv:2405.04304 [pdf, other]
Title: Accelerating Speculative Decoding using Dynamic Speculation Length
Subjects: Computation and Language (cs.CL)
[46]  arXiv:2405.04296 [pdf, other]
Title: Open Implementation and Study of BEST-RQ for Speech Processing
Comments: Accepted in IEEE ICASSP 2024 workshop on Self-supervision in Audio, Speech and Beyond (SASB 2024)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[47]  arXiv:2405.04292 [pdf, other]
Title: Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning
Comments: Accepted in ICON 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[48]  arXiv:2405.04286 [pdf, other]
Title: Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Subjects: Computation and Language (cs.CL)
[49]  arXiv:2405.04271 [pdf, other]
Title: Generating Feature Vectors from Phonetic Transcriptions in Cross-Linguistic Data Formats
Comments: To appear in the Proceedings of the 2024 Meeting of the Society for Computation in Linguistics (SCiL)
Subjects: Computation and Language (cs.CL)
[50]  arXiv:2405.04219 [pdf, other]
Title: Iterative Experience Refinement of Software-Developing Agents
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Software Engineering (cs.SE)
[51]  arXiv:2405.04170 [pdf, other]
Title: D-NLP at SemEval-2024 Task 2: Evaluating Clinical Inference Capabilities of Large Language Models
Authors: Duygu Altinok
Comments: accepted to SemEval-2024, ranked 9th on Task 2
Subjects: Computation and Language (cs.CL)
[52]  arXiv:2405.04165 [pdf, other]
Title: LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection
Comments: 7 pages
Subjects: Computation and Language (cs.CL)
[53]  arXiv:2405.04163 [pdf, other]
Title: MEDVOC: Vocabulary Adaptation for Fine-tuning Pre-trained Language Models on Medical Text Summarization
Comments: 13 pages, Accepted to the 33rd International Joint Conference on Artificial Intelligence, IJCAI 2024 (Main) Track
Subjects: Computation and Language (cs.CL)
[54]  arXiv:2405.04160 [pdf, other]
Title: A Causal Explainable Guardrails for Large Language Models
Comments: 23 pages
Subjects: Computation and Language (cs.CL)
[55]  arXiv:2405.04128 [pdf, other]
Title: Fine-grained Speech Sentiment Analysis in Chinese Psychological Support Hotlines Based on Large-scale Pre-trained Model
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[56]  arXiv:2405.04086 [pdf, other]
Title: Optimizing Language Model's Reasoning Abilities with Weak Supervision
Subjects: Computation and Language (cs.CL)
[57]  arXiv:2405.04065 [pdf, other]
Title: FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference
Comments: 14 pages
Subjects: Computation and Language (cs.CL)
[58]  arXiv:2405.04053 [pdf, other]
Title: Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[59]  arXiv:2405.04048 [pdf, other]
Title: Philosophy of Cognitive Science in the Age of Deep Learning
Comments: Forthcoming in WIREs Cognitive Science
Subjects: Computation and Language (cs.CL)
[60]  arXiv:2405.04039 [pdf, other]
Title: Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize Hallucinations
Comments: 9 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[61]  arXiv:2405.03960 [pdf, other]
Title: ESIHGNN: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition
Journal-ref: published at ICASSP 2024
Subjects: Computation and Language (cs.CL)
[62]  arXiv:2405.03939 [pdf, other]
Title: Long Context Alignment with Short Instructions and Synthesized Positions
Comments: preview
Subjects: Computation and Language (cs.CL)
[63]  arXiv:2405.03920 [pdf, other]
Title: A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection
Comments: 6 pages, 1 figure, shorter version in SIAM International Conference on Data Mining (SDM) 2024
Journal-ref: Proc. SDM 2024, 396-399
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[64]  arXiv:2405.03845 [pdf, other]
Title: Self-Improving Customer Review Response Generation Based on LLMs
Comments: 18 pages, 4 figure, 8 figures in Appendix, accepted to LREC-COLING 2024 workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[65]  arXiv:2405.03832 [pdf, other]
Title: Guylingo: The Republic of Guyana Creole Corpora
Comments: Accepted to NAACL 2024 Main Conference Special Theme Track: Languages of Latin America
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66]  arXiv:2405.03794 [pdf, other]
Title: Detecting Anti-Semitic Hate Speech using Transformer-based Large Language Models
Subjects: Computation and Language (cs.CL)
[67]  arXiv:2405.03764 [pdf, other]
Title: GOVERN: Gradient Orientation Vote Ensemble for Multi-Teacher Reinforced Distillation
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[68]  arXiv:2405.03695 [pdf, other]
Title: Evaluating Large Language Models for Material Selection
Comments: arXiv admin note: text overlap with arXiv:2307.03109 by other authors
Subjects: Computation and Language (cs.CL)
[69]  arXiv:2405.04404 (cross-list from cs.CV) [pdf, other]
Title: Vision Mamba: A Comprehensive Survey and Taxonomy
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[70]  arXiv:2405.04346 (cross-list from cs.LG) [pdf, other]
Title: Revisiting character-level adversarial attacks
Comments: Accepted in ICML 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[71]  arXiv:2405.04324 (cross-list from cs.AI) [pdf, other]
Title: Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Software Engineering (cs.SE)
[72]  arXiv:2405.04136 (cross-list from cs.AI) [pdf, other]
Title: Enriched BERT Embeddings for Scholarly Publication Classification
Comments: 8 pages, 2 figures, NSLP2024 conference
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[73]  arXiv:2405.04118 (cross-list from cs.LG) [pdf, other]
Title: Policy Learning with a Language Bottleneck
Comments: 18 pages, 13 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[74]  arXiv:2405.03998 (cross-list from cs.HC) [pdf, other]
Title: Sketch Then Generate: Providing Incremental User Feedback and Guiding LLM Code Generation through Language-Oriented Code Sketches
Comments: 4 pages
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[75]  arXiv:2405.03952 (cross-list from cs.SD) [pdf, other]
Title: HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech
Journal-ref: publised at ICASSP 2024
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[76]  arXiv:2405.03932 (cross-list from cs.AI) [pdf, other]
Title: CleanGraph: Human-in-the-loop Knowledge Graph Refinement and Completion
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[77]  arXiv:2405.03862 (cross-list from cs.AI) [pdf, other]
Title: Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration
Comments: 16 pages, 8 figures, 3 tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 7 May 2024 (showing first 63 of 82 entries)

[78]  arXiv:2405.03688 [pdf, other]
Title: Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames
Comments: 15 pages, 9 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[79]  arXiv:2405.03677 [pdf, other]
Title: Towards A Human-in-the-Loop LLM Approach to Collaborative Discourse Analysis
Comments: In press at the 25th international conference on Artificial Intelligence in Education (AIED) Late-Breaking Results (LBR) track
Subjects: Computation and Language (cs.CL)
[80]  arXiv:2405.03595 [pdf, other]
Title: GREEN: Generative Radiology Report Evaluation and Error Notation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[81]  arXiv:2405.03594 [pdf, other]
Title: Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[82]  arXiv:2405.03553 [pdf, other]
Title: AlphaMath Almost Zero: process Supervision without process
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[83]  arXiv:2405.03548 [pdf, other]
Title: MAmmoTH2: Scaling Instructions from the Web
Comments: Work in Progress
Subjects: Computation and Language (cs.CL)
[84]  arXiv:2405.03425 [pdf, other]
Title: Gaussian Stochastic Weight Averaging for Bayesian Low-Rank Adaptation of Large Language Models
Comments: 14 pages, 1 figure, 2 tables
Subjects: Computation and Language (cs.CL)
[85]  arXiv:2405.03387 [pdf, ps, other]
Title: The high dimensional psychological profile and cultural bias of ChatGPT
Authors: Hang Yuan (1), Zhongyue Che (1), Shao Li (1), Yue Zhang, Xiaomeng Hu (2), Siyang Luo (1) ((1) Sun Yat-Sen University, (2) Renmin University of China)
Subjects: Computation and Language (cs.CL)
[86]  arXiv:2405.03371 [pdf, other]
Title: Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom
Comments: 12 pages, WWW'2024
Subjects: Computation and Language (cs.CL)
[87]  arXiv:2405.03359 [pdf, ps, other]
Title: MedDoc-Bot: A Chat Tool for Comparative Analysis of Large Language Models in the Context of the Pediatric Hypertension Guideline
Comments: {copyright} 2024 IEEE. This work has been accepted for publication and presentation at the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, to be held in Orlando, Florida, USA, July 15-19, 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[88]  arXiv:2405.03279 [pdf, other]
Title: Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning
Comments: 14 pages, 4 figures, 6 tables
Subjects: Computation and Language (cs.CL)
[89]  arXiv:2405.03207 [pdf, other]
Title: A Philosophical Introduction to Language Models - Part II: The Way Forward
Subjects: Computation and Language (cs.CL)
[90]  arXiv:2405.03206 [pdf, other]
Title: Vietnamese AI Generated Text Detection
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[91]  arXiv:2405.03205 [pdf, other]
Title: Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions
Authors: Ruizhe Li, Yanjun Gao
Comments: Work in process
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[92]  arXiv:2405.03170 [pdf, other]
Title: Oracle-Checker Scheme for Evaluating a Generative Large Language Model
Subjects: Computation and Language (cs.CL)
[93]  arXiv:2405.03153 [pdf, ps, other]
Title: Exploring the Potential of the Large Language Models (LLMs) in Identifying Misleading News Headlines
Comments: 5 pages, 2 tables, 1st HEAL Workshop at CHI Conference on Human Factors in Computing Systems, May 12, Honolulu, HI, USA 2024
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[94]  arXiv:2405.03138 [pdf, other]
Title: CRAFT: Extracting and Tuning Cultural Instructions from the Wild
Comments: 6 pages
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2405.03133 [pdf, other]
Title: Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
Comments: 21 pages, 12 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[96]  arXiv:2405.03111 [pdf, ps, other]
Title: An Active Inference Agent for Simulating Human Translation Processes in a Hierarchical Architecture: Integrating the Task Segment Framework and the HOF taxonomy
Authors: Michael Carl
Subjects: Computation and Language (cs.CL)
[97]  arXiv:2405.03098 [pdf, other]
Title: FairMonitor: A Dual-framework for Detecting Stereotypes and Biases in Large Language Models
Subjects: Computation and Language (cs.CL)
[98]  arXiv:2405.03085 [pdf, other]
Title: Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2405.03084 [pdf, ps, other]
Title: Analyzing Emotional Trends from X platform using SenticNet: A Comparative Analysis with Cryptocurrency Price
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[100]  arXiv:2405.03004 [pdf, other]
Title: Exploring prompts to elicit memorization in masked language model-based named entity recognition
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101]  arXiv:2405.03000 [pdf, other]
Title: MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning
Comments: Work in Progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[102]  arXiv:2405.02985 [pdf, ps, other]
Title: Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[103]  arXiv:2405.02984 [pdf, other]
Title: E-TSL: A Continuous Educational Turkish Sign Language Dataset with Baseline Methods
Comments: 7 pages, 3 figures, 4 tables, submitted to IEEE conference
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104]  arXiv:2405.02937 [pdf, other]
Title: Unraveling the Dominance of Large Language Models Over Transformer Models for Bangla Natural Language Inference: A Comprehensive Study
Comments: Accepted in 4th International Conference on Computing and Communication Networks (ICCCNet-2024)
Subjects: Computation and Language (cs.CL)
[105]  arXiv:2405.02935 [pdf, other]
Title: Enabling Patient-side Disease Prediction via the Integration of Patient Narratives
Subjects: Computation and Language (cs.CL)
[106]  arXiv:2405.02933 [pdf, other]
Title: Relay Decoding: Concatenating Large Language Models for Machine Translation
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[107]  arXiv:2405.02925 [pdf, other]
Title: A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU
Comments: LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[108]  arXiv:2405.02887 [pdf, other]
Title: Sentiment Analysis Across Languages: Evaluation Before and After Machine Translation to English
Comments: 6 pages, 3 Figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109]  arXiv:2405.02861 [pdf, other]
Title: Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models
Comments: 24 pages, 17 figures, 10 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[110]  arXiv:2405.02817 [pdf, other]
Title: HuixiangDou-CR: Coreference Resolution in Group Chats
Authors: Huanjun Kong
Comments: 5 pages, 3 tables, 3 figures
Subjects: Computation and Language (cs.CL)
[111]  arXiv:2405.02816 [pdf, other]
Title: Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
Comments: To appear in the proceedings of SIGIR 2024
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[112]  arXiv:2405.02814 [pdf, other]
Title: NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stimuli
Comments: This paper has been accepted by IJCAI 2024
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2405.02765 [pdf, other]
Title: Detecting Edited Knowledge in Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114]  arXiv:2405.02764 [pdf, other]
Title: Assessing Adversarial Robustness of Large Language Models: An Empirical Study
Comments: 16 pages, 9 figures, 10 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[115]  arXiv:2405.02750 [pdf, other]
Title: Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding
Comments: Accepted to NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[116]  arXiv:2405.02743 [pdf, other]
Title: Beyond Performance: Quantifying and Mitigating Label Bias in LLMs
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[117]  arXiv:2405.02738 [pdf, other]
Title: Relations Prediction for Knowledge Graph Completion using Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2405.02732 [pdf, other]
Title: Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[119]  arXiv:2405.02712 [pdf, other]
Title: CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
Subjects: Computation and Language (cs.CL)
[120]  arXiv:2405.02710 [pdf, other]
Title: Enhancing News Summarization with ELearnFit through Efficient In-Context Learning and Efficient Fine-Tuning
Comments: 9 Pages
Subjects: Computation and Language (cs.CL)
[121]  arXiv:2405.02677 [pdf, other]
Title: Evaluating the Ability of Computationally Extracted Narrative Maps to Encode Media Framing
Comments: Text2Story Workshop 2024
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[122]  arXiv:2405.02673 [pdf, other]
Title: On the Information Redundancy in Non-Autoregressive Translation
Comments: 10 pages, 10 tables
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2405.02659 [pdf, other]
Title: R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models
Subjects: Computation and Language (cs.CL)
[124]  arXiv:2405.02650 [pdf, other]
Title: Identifying Narrative Patterns and Outliers in Holocaust Testimonies Using Topic Modeling
Comments: 9 pages, 7 figures, LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125]  arXiv:2405.02602 [pdf, other]
Title: Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?
Comments: 9 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[126]  arXiv:2405.02578 [pdf, ps, other]
Title: Mixat: A Data Set of Bilingual Emirati-English Speech
Comments: SIGUL 2024
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2405.02573 [pdf, other]
Title: A Combination of BERT and Transformer for Vietnamese Spelling Correction
Comments: 13 pages
Journal-ref: ACIIDS 2022, LNCS, vol 13757, Springer, Cham
Subjects: Computation and Language (cs.CL)
[128]  arXiv:2405.02559 [pdf, ps, other]
Title: A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129]  arXiv:2405.02517 [pdf, other]
Title: Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization
Comments: 13 pages, 2 figures, to be published in Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Subjects: Computation and Language (cs.CL)
[130]  arXiv:2405.02501 [pdf, other]
Title: Beyond Helpfulness and Harmlessness: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning
Comments: Paper accepted at ICML 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131]  arXiv:2405.02472 [pdf, other]
Title: Semantic Scaling: Bayesian Ideal Point Estimates with Large Language Models
Authors: Michael Burnham
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2405.02454 [pdf, other]
Title: What is Sentiment Meant to Mean to Language Models?
Authors: Michael Burnham
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[133]  arXiv:2405.02421 [pdf, other]
Title: What does the Knowledge Neuron Thesis Have to do with Knowledge?
Comments: ICLR 2024 (Spotlight)
Subjects: Computation and Language (cs.CL)
[134]  arXiv:2405.02411 [pdf, other]
Title: The Call for Socially Aware Language Technologies
Subjects: Computation and Language (cs.CL)
[135]  arXiv:2405.02353 [pdf, other]
Title: Early Transformers: A study on Efficient Training of Transformer Models through Early-Bird Lottery Tickets
Authors: Shravan Cheekati
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[136]  arXiv:2405.02318 [pdf, other]
Title: NL2FOL: Translating Natural Language to First-Order Logic for Logical Fallacy Detection
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[137]  arXiv:2405.03689 (cross-list from cs.CV) [pdf, other]
Title: Pose Priors from Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[138]  arXiv:2405.03685 (cross-list from cs.CV) [pdf, other]
Title: Language-Image Models with 3D Understanding
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[139]  arXiv:2405.03452 (cross-list from cs.CY) [pdf, ps, other]
Title: Large Language Models (LLMs) as Agents for Augmented Democracy
Comments: 15 pages main manuscript with 3 figures. 12 pages of supplementary material
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[140]  arXiv:2405.03162 (cross-list from cs.CV) [pdf, other]
[ total of 299 entries: 1-140 | 141-280 | 281-299 ]
[ showing 140 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)