Computation and Language

Authors and titles for recent submissions

Thu, 9 May 2024

[1] arXiv:2405.05254 [pdf, other]: Title: You Only Cache Once: Decoder-Decoder Architectures for Language Models

Authors: Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei

Subjects: Computation and Language (cs.CL)
[2] arXiv:2405.05253 [pdf, other]: Title: Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge

Authors: Charles Koutcheme, Nicola Dainese, Sami Sarsa, Arto Hellas, Juho Leinonen, Paul Denny

Comments: 7 pages, 4 figures, 2 tables. Accepted for publication at the 29th annual ACM conference on Innovation and Technology in Computer Science Education (ITiCSE 2024)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[3] arXiv:2405.05248 [pdf, other]: Title: LLMs with Personalities in Multi-issue Negotiation Games

Authors: Sean Noh, Ho-Chun Herbert Chang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[4] arXiv:2405.05204 [pdf, ps, other]: Title: CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation

Authors: Drew Walker, Annie Thorne, Sudeshna Das, Jennifer Love, Hannah LF Cooper, Melvin Livingston III, Abeed Sarker

Comments: 28 pages, 3 figures, 4 tables. 5 Appendices

Subjects: Computation and Language (cs.CL)
[5] arXiv:2405.05189 [pdf, other]: Title: MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning

Authors: Inderjeet Nair, Lu Wang

Comments: Under review at ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[6] arXiv:2405.05176 [pdf, other]: Title: Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming

Authors: Tommaso Pasini, Alejo López-Ávila, Husam Quteineh, Gerasimos Lampouras, Jinhua Du, Yubing Wang, Ze Li, Yusen Sun

Comments: 18 pages, 1 figure

Subjects: Computation and Language (cs.CL)
[7] arXiv:2405.05161 [pdf, ps, other]: Title: Motion Capture Analysis of Verb and Adjective Types in Austrian Sign Language

Authors: Julia Krebs, Evie Malaia, Ronnie B. Wilbur, Isabella Fessl, Hans-Peter Wiesinger, Hermann Schwameder, Dietmar Roehm

Comments: 10 pages, 7 figures

Subjects: Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[8] arXiv:2405.05116 [pdf, other]: Title: XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

Authors: Peiqin Lin, André F. T. Martins, Hinrich Schütze

Subjects: Computation and Language (cs.CL)
[9] arXiv:2405.05109 [pdf, other]: Title: QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs

Authors: Weijia Zhang, Vaishali Pal, Jia-Hong Huang, Evangelos Kanoulas, Maarten de Rijke

Comments: 16 pages, 3 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[10] arXiv:2405.05060 [pdf, other]: Title: Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models

Authors: Aylin Gunal, Baihan Lin, Djallel Bouneffouf

Comments: 5 pages excluding references, 3 figures; accepted at Clinical NLP Workshop @ NAACL 2024

Subjects: Computation and Language (cs.CL)
[11] arXiv:2405.05049 [pdf, ps, other]: Title: Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources

Authors: Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi

Subjects: Computation and Language (cs.CL)
[12] arXiv:2405.05008 [pdf, other]: Title: ADELIE: Aligning Large Language Models on Information Extraction

Authors: Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li

Subjects: Computation and Language (cs.CL)
[13] arXiv:2405.04960 [pdf, other]: Title: P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models

Authors: Guochao Jiang, Zepeng Ding, Yuchen Shi, Deqing Yang

Subjects: Computation and Language (cs.CL)
[14] arXiv:2405.04955 [pdf, other]: Title: Improving Long Text Understanding with Knowledge Distilled from Summarization Model

Authors: Yan Liu, Yazheng Yang, Xiaokang Chen

Comments: arXiv admin note: text overlap with arXiv:2110.04741

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[15] arXiv:2405.04897 [pdf, ps, other]: Title: Machine Learning-based NLP for Emotion Classification on a Cholera X Dataset

Authors: Paul Jideani, Aurona Gerber

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[16] arXiv:2405.04872 [pdf, other]: Title: Logical Negation Augmenting and Debiasing for Prompt-based Methods

Authors: Yitian Li, Jidong Tian, Hao He, Yaohui Jin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[17] arXiv:2405.04829 [pdf, other]: Title: Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages

Authors: Sankalp Bahad, Pruthwik Mishra, Karunesh Arora, Rakesh Chandra Balabantaray, Dipti Misra Sharma, Parameswari Krishnamurthy

Comments: 8 pages, accepted in NAACL-SRW, 2024

Subjects: Computation and Language (cs.CL)
[18] arXiv:2405.04828 [pdf, other]: Title: ChuXin: 1.6B Technical Report

Authors: Xiaomin Zhuang, Yufan Jiang, Qiaozhi He, Zhihua Wu

Comments: Technical Report

Subjects: Computation and Language (cs.CL)
[19] arXiv:2405.04820 [pdf, other]: Title: APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching

Authors: Yikuan Xia, Jiazun Chen, Xinchi Li, Jun Gao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[20] arXiv:2405.04819 [pdf, other]: Title: DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature

Authors: Dawei Li, Shu Yang, Zhen Tan, Jae Young Baik, Sunkwon Yun, Joseph Lee, Aaron Chacko, Bojian Hou, Duy Duong-Tran, Ying Ding, Huan Liu, Li Shen, Tianlong Chen

Comments: Under Review

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[21] arXiv:2405.04818 [pdf, other]: Title: ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation

Authors: Ana Brassard, Benjamin Heinzerling, Keito Kudo, Keisuke Sakaguchi, Kentaro Inui

Comments: 18 pages, 7 figures, under review. Data available here: this https URL

Subjects: Computation and Language (cs.CL)
[22] arXiv:2405.04793 [pdf, other]: Title: Zero-shot LLM-guided Counterfactual Generation for Text

Authors: Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu

Comments: arXiv admin note: text overlap with arXiv:2309.13340

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[23] arXiv:2405.04781 [pdf, other]: Title: CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization

Authors: Zheyan Qu, Lu Yin, Zitong Yu, Wenbo Wang, Xing Zhang

Subjects: Computation and Language (cs.CL)
[24] arXiv:2405.04777 [pdf, other]: Title: Empathy Through Multimodality in Conversational Interfaces

Authors: Mahyar Abbasian, Iman Azimi, Mohammad Feli, Amir M. Rahmani, Ramesh Jain

Comments: 7 pages, 2 figures, 2 tables, conference paper

Subjects: Computation and Language (cs.CL)
[25] arXiv:2405.04756 [pdf, other]: Title: BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models

Authors: Chu Fei Luo, Ahmad Ghawanmeh, Xiaodan Zhu, Faiza Khan Khattak

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[26] arXiv:2405.04726 [pdf, other]: Title: Learning Phonotactics from Linguistic Informants

Authors: Canaan Breiss, Alexis Ross, Amani Maina-Kilaas, Roger Levy, Jacob Andreas

Subjects: Computation and Language (cs.CL)
[27] arXiv:2405.04685 [pdf, other]: Title: Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking

Authors: Emre Can Acikgoz, Mete Erdogan, Deniz Yuret

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[28] arXiv:2405.04655 [pdf, other]: Title: Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense

Authors: Siqi Shen, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Soujanya Poria, Rada Mihalcea

Subjects: Computation and Language (cs.CL)
[29] arXiv:2405.04590 [pdf, other]: Title: Language Modeling Using Tensor Trains

Authors: Zhan Su, Yuqin Zhou, Fengran Mo, Jakob Grue Simonsen

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[30] arXiv:2405.04585 [pdf, other]: Title: PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models

Authors: Arpit Aggarwal

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[31] arXiv:2405.05175 (cross-list from cs.CR) [pdf, other]: Title: Air Gap: Protecting Privacy-Conscious Conversational Agents

Authors: Eugene Bagdasaryan, Ren Yi, Sahra Ghalebikesabi, Peter Kairouz, Marco Gruteser, Sewoong Oh, Borja Balle, Daniel Ramage

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[32] arXiv:2405.05136 (cross-list from cs.CY) [pdf, other]: Title: Integrating LSTM and BERT for Long-Sequence Data Analysis in Intelligent Tutoring Systems

Authors: Zhaoxing Li, Jujie Yang, Jindi Wang, Lei Shi, Sebastian Stein

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[33] arXiv:2405.05135 (cross-list from cs.SE) [pdf, ps, other]: Title: Lessons from the Use of Natural Language Inference (NLI) in Requirements Engineering Tasks

Authors: Mohamad Fazelnia, Viktoria Koscinski, Spencer Herzog, Mehdi Mirakhorli

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[34] arXiv:2405.04950 (cross-list from cs.CV) [pdf, other]: Title: VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

Authors: Yunxin Li, Baotian Hu, Haoyuan Shi, Wei Wang, Longyue Wang, Min Zhang

Comments: 17 pages; Accepted by ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[35] arXiv:2405.04758 (cross-list from cs.CR) [pdf, other]: Title: Honeyfile Camouflage: Hiding Fake Files in Plain Sight

Authors: Roelien C. Timmer, David Liebowitz, Surya Nepal, Salil S. Kanhere

Comments: 3rd Workshop on the security implications of Deepfakes and Cheapfakes (WDC) co-located at ACM ASIACCS 2024

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[36] arXiv:2405.04620 (cross-list from hep-ph) [pdf, ps, other]: Title: Folded context condensation in Path Integral formalism for infinite context transformers

Authors: Won-Gi Paeng, Daesuk Kwon

Comments: 7 pages, 2 figures

Subjects: High Energy Physics - Phenomenology (hep-ph); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)

Wed, 8 May 2024

[37] arXiv:2405.04532 [pdf, other]: Title: QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Authors: Yujun Lin, Haotian Tang, Shang Yang, Zhekai Zhang, Guangxuan Xiao, Chuang Gan, Song Han

Comments: The first three authors contribute equally to this project and are listed in the alphabetical order. Yujun Lin leads the quantization algorithm, Haotian Tang and Shang Yang lead the GPU kernels and the serving system. Code is available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Performance (cs.PF)
[38] arXiv:2405.04520 [pdf, other]: Title: NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts

Authors: Shudan Zhang, Hanlin Zhao, Xiao Liu, Qinkai Zheng, Zehan Qi, Xiaotao Gu, Xiaohan Zhang, Yuxiao Dong, Jie Tang

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[39] arXiv:2405.04515 [pdf, other]: Title: A Transformer with Stack Attention

Authors: Jiaoda Li, Jennifer C. White, Mrinmaya Sachan, Ryan Cotterell

Comments: NAACL 2024

Subjects: Computation and Language (cs.CL)
[40] arXiv:2405.04513 [pdf, other]: Title: Switchable Decision: Dynamic Neural Generation Networks

Authors: Shujian Zhang, Korawat Tanwisuth, Chengyue Gong, Pengcheng He, Mingyuan Zhou

Comments: Accepted to ICML 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[41] arXiv:2405.04495 [pdf, other]: Title: Toward In-Context Teaching: Adapting Examples to Students' Misconceptions

Authors: Alexis Ross, Jacob Andreas

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[42] arXiv:2405.04435 [pdf, other]: Title: Fast Exact Retrieval for Nearest-neighbor Lookup (FERN)

Authors: Richard Zhu

Comments: NAACL 2024 SRW

Subjects: Computation and Language (cs.CL); Data Structures and Algorithms (cs.DS)
[43] arXiv:2405.04434 [pdf, other]: Title: DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Authors: DeepSeek-AI

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44] arXiv:2405.04325 [pdf, other]: Title: Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation

Authors: Atharvan Dogra, Ameet Deshpande, John Nay, Tanmay Rajpurohit, Ashwin Kalyan, Balaraman Ravindran

Subjects: Computation and Language (cs.CL)
[45] arXiv:2405.04304 [pdf, other]: Title: Accelerating Speculative Decoding using Dynamic Speculation Length

Authors: Jonathan Mamou, Oren Pereg, Daniel Korat, Moshe Berchansky, Nadav Timor, Moshe Wasserblat, Roy Schwartz

Subjects: Computation and Language (cs.CL)
[46] arXiv:2405.04296 [pdf, other]: Title: Open Implementation and Study of BEST-RQ for Speech Processing

Authors: Ryan Whetten, Titouan Parcollet, Marco Dinarelli, Yannick Estève

Comments: Accepted in IEEE ICASSP 2024 workshop on Self-supervision in Audio, Speech and Beyond (SASB 2024)

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[47] arXiv:2405.04292 [pdf, other]: Title: Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning

Authors: Sayantan Pal, Souvik Das, Rohini K. Srihari

Comments: Accepted in ICON 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[48] arXiv:2405.04286 [pdf, other]: Title: Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore

Authors: Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xuebo Liu, Lidia S. Chao, Min Zhang

Subjects: Computation and Language (cs.CL)
[49] arXiv:2405.04271 [pdf, other]: Title: Generating Feature Vectors from Phonetic Transcriptions in Cross-Linguistic Data Formats

Authors: Arne Rubehn, Jessica Nieder, Robert Forkel, Johann-Mattis List

Comments: To appear in the Proceedings of the 2024 Meeting of the Society for Computation in Linguistics (SCiL)

Subjects: Computation and Language (cs.CL)
[50] arXiv:2405.04219 [pdf, other]: Title: Iterative Experience Refinement of Software-Developing Agents

Authors: Chen Qian, Jiahao Li, Yufan Dang, Wei Liu, YiFei Wang, Zihao Xie, Weize Chen, Cheng Yang, Yingli Zhang, Zhiyuan Liu, Maosong Sun

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Software Engineering (cs.SE)
[51] arXiv:2405.04170 [pdf, other]: Title: D-NLP at SemEval-2024 Task 2: Evaluating Clinical Inference Capabilities of Large Language Models

Authors: Duygu Altinok

Comments: accepted to SemEval-2024, ranked 9th on Task 2

Subjects: Computation and Language (cs.CL)
[52] arXiv:2405.04165 [pdf, other]: Title: LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection

Authors: Jasraj Singh, Fang Liu, Hong Xu, Bee Chin Ng, Wei Zhang

Comments: 7 pages

Subjects: Computation and Language (cs.CL)
[53] arXiv:2405.04163 [pdf, other]: Title: MEDVOC: Vocabulary Adaptation for Fine-tuning Pre-trained Language Models on Medical Text Summarization

Authors: Gunjan Balde, Soumyadeep Roy, Mainack Mondal, Niloy Ganguly

Comments: 13 pages, Accepted to the 33rd International Joint Conference on Artificial Intelligence, IJCAI 2024 (Main) Track

Subjects: Computation and Language (cs.CL)
[54] arXiv:2405.04160 [pdf, other]: Title: A Causal Explainable Guardrails for Large Language Models

Authors: Zhixuan Chu, Yan Wang, Longfei Li, Zhibo Wang, Zhan Qin, Kui Ren

Comments: 23 pages

Subjects: Computation and Language (cs.CL)
[55] arXiv:2405.04128 [pdf, other]: Title: Fine-grained Speech Sentiment Analysis in Chinese Psychological Support Hotlines Based on Large-scale Pre-trained Model

Authors: Zhonglong Chen, Changwei Song, Yining Chen, Jianqiang Li, Guanghui Fu, Yongsheng Tong, Qing Zhao

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[56] arXiv:2405.04086 [pdf, other]: Title: Optimizing Language Model's Reasoning Abilities with Weak Supervision

Authors: Yongqi Tong, Sizhe Wang, Dawei Li, Yifan Wang, Simeng Han, Zi Lin, Chengsong Huang, Jiaxin Huang, Jingbo Shang

Subjects: Computation and Language (cs.CL)
[57] arXiv:2405.04065 [pdf, other]: Title: FlashBack: Efficient Retrieval-Augmented Language Modeling for Long Context Inference

Authors: Runheng Liu, Xingchen Xiao, Heyan Huang, Zewen Chi, Zhijing Wu

Comments: 14 pages

Subjects: Computation and Language (cs.CL)
[58] arXiv:2405.04053 [pdf, other]: Title: Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT

Authors: Hassan Shakil, Atqiya Munawara Mahi, Phuoc Nguyen, Zeydy Ortiz, Mamoun T. Mardini

Comments: 10 pages, 5 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[59] arXiv:2405.04048 [pdf, other]: Title: Philosophy of Cognitive Science in the Age of Deep Learning

Authors: Raphaël Millière

Comments: Forthcoming in WIREs Cognitive Science

Subjects: Computation and Language (cs.CL)
[60] arXiv:2405.04039 [pdf, other]: Title: Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize Hallucinations

Authors: Hassan Shakil, Zeydy Ortiz, Grant C. Forbes

Comments: 9 pages, 3 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[61] arXiv:2405.03960 [pdf, other]: Title: ESIHGNN: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition

Authors: Xupeng Zha, Huan Zhao, Zixing Zhang

Journal-ref: published at ICASSP 2024

Subjects: Computation and Language (cs.CL)
[62] arXiv:2405.03939 [pdf, other]: Title: Long Context Alignment with Short Instructions and Synthesized Positions

Authors: Wenhao Wu, Yizhong Wang, Yao Fu, Xiang Yue, Dawei Zhu, Sujian Li

Comments: preview

Subjects: Computation and Language (cs.CL)
[63] arXiv:2405.03920 [pdf, other]: Title: A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection

Authors: Dainis Boumber, Rakesh M. Verma, Fatima Zahra Qachfar

Comments: 6 pages, 1 figure, shorter version in SIAM International Conference on Data Mining (SDM) 2024

Journal-ref: Proc. SDM 2024, 396-399

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[64] arXiv:2405.03845 [pdf, other]: Title: Self-Improving Customer Review Response Generation Based on LLMs

Authors: Guy Azov, Tatiana Pelc, Adi Fledel Alon, Gila Kamhi

Comments: 18 pages, 4 figures, 8 figures in Appendix, accepted to LREC-COLING 2024 workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[65] arXiv:2405.03832 [pdf, other]: Title: Guylingo: The Republic of Guyana Creole Corpora

Authors: Christopher Clarke, Roland Daynauth, Charlene Wilkinson, Hubert Devonish, Jason Mars

Comments: Accepted to NAACL 2024 Main Conference Special Theme Track: Languages of Latin America

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66] arXiv:2405.03794 [pdf, other]: Title: Detecting Anti-Semitic Hate Speech using Transformer-based Large Language Models

Authors: Dengyi Liu, Minghao Wang, Andrew G. Catlin

Subjects: Computation and Language (cs.CL)
[67] arXiv:2405.03764 [pdf, other]: Title: GOVERN: Gradient Orientation Vote Ensemble for Multi-Teacher Reinforced Distillation

Authors: Wenjie Zhou, Zhenxin Ding, Xiaodong Zhang, Haibo Shi, Junfeng Wang, Dawei Yin

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[68] arXiv:2405.03695 [pdf, other]: Title: Evaluating Large Language Models for Material Selection

Authors: Daniele Grandi, Yash Patawari Jain, Allin Groom, Brandon Cramer, Christopher McComb

Comments: arXiv admin note: text overlap with arXiv:2307.03109 by other authors

Subjects: Computation and Language (cs.CL)
[69] arXiv:2405.04404 (cross-list from cs.CV) [pdf, other]: Title: Vision Mamba: A Comprehensive Survey and Taxonomy

Authors: Xiao Liu, Chenxu Zhang, Lei Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[70] arXiv:2405.04346 (cross-list from cs.LG) [pdf, other]: Title: Revisiting character-level adversarial attacks

Authors: Elias Abad Rocamora, Yongtao Wu, Fanghui Liu, Grigorios G. Chrysos, Volkan Cevher

Comments: Accepted in ICML 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[71] arXiv:2405.04324 (cross-list from cs.AI) [pdf, other]: Title: Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Yi Zhou, Chris Johnson, Aanchal Goyal, Hima Patel, Yousaf Shah, Petros Zerfos, Heiko Ludwig, Asim Munawar, Maxwell Crouse, Pavan Kapanipathi, Shweta Salaria, Bob Calio, Sophia Wen, Seetharami Seelam, Brian Belgodere, Carlos Fonseca, Amith Singhee, Nirmit Desai, David D. Cox, Ruchir Puri, Rameswar Panda

Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Software Engineering (cs.SE)
[72] arXiv:2405.04136 (cross-list from cs.AI) [pdf, other]: Title: Enriched BERT Embeddings for Scholarly Publication Classification

Authors: Benjamin Wolff, Eva Seidlmayer, Konrad U. Förstner

Comments: 8 pages, 2 figures, NSLP2024 conference

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[73] arXiv:2405.04118 (cross-list from cs.LG) [pdf, other]: Title: Policy Learning with a Language Bottleneck

Authors: Megha Srivastava, Cedric Colas, Dorsa Sadigh, Jacob Andreas

Comments: 18 pages, 13 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[74] arXiv:2405.03998 (cross-list from cs.HC) [pdf, other]: Title: Sketch Then Generate: Providing Incremental User Feedback and Guiding LLM Code Generation through Language-Oriented Code Sketches

Authors: Chen Zhu-Tian, Zeyu Xiong, Xiaoshuo Yao, Elena Glassman

Comments: 4 pages

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[75] arXiv:2405.03952 (cross-list from cs.SD) [pdf, other]: Title: HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech

Authors: Zhongren Dong, Zixing Zhang, Weixiang Xu, Jing Han, Jianjun Ou, Björn W. Schuller

Journal-ref: published at ICASSP 2024

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[76] arXiv:2405.03932 (cross-list from cs.AI) [pdf, other]: Title: CleanGraph: Human-in-the-loop Knowledge Graph Refinement and Completion

Authors: Tyler Bikaun, Michael Stewart, Wei Liu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[77] arXiv:2405.03862 (cross-list from cs.AI) [pdf, other]: Title: Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration

Authors: Razan Baltaji, Babak Hemmatian, Lav R. Varshney

Comments: 16 pages, 8 figures, 3 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 7 May 2024 (showing first 63 of 82 entries)

[78] arXiv:2405.03688 [pdf, other]: Title: Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames

Authors: Keith Burghardt, Kai Chen, Kristina Lerman

Comments: 15 pages, 9 figures

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[79] arXiv:2405.03677 [pdf, other]: Title: Towards A Human-in-the-Loop LLM Approach to Collaborative Discourse Analysis

Authors: Clayton Cohn, Caitlin Snyder, Justin Montenegro, Gautam Biswas

Comments: In press at the 25th international conference on Artificial Intelligence in Education (AIED) Late-Breaking Results (LBR) track

Subjects: Computation and Language (cs.CL)
[80] arXiv:2405.03595 [pdf, other]: Title: GREEN: Generative Radiology Report Evaluation and Error Notation

Authors: Sophie Ostmeier, Justin Xu, Zhihong Chen, Maya Varma, Louis Blankemeier, Christian Bluethgen, Arne Edward Michalson, Michael Moseley, Curtis Langlotz, Akshay S Chaudhari, Jean-Benoit Delbrouck

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[81] arXiv:2405.03594 [pdf, other]: Title: Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Authors: Abhinav Agarwalla, Abhay Gupta, Alexandre Marques, Shubhra Pandit, Michael Goin, Eldar Kurtic, Kevin Leong, Tuan Nguyen, Mahmoud Salem, Dan Alistarh, Sean Lie, Mark Kurtz

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[82] arXiv:2405.03553 [pdf, other]: Title: AlphaMath Almost Zero: process Supervision without process

Authors: Guoxin Chen, Minpeng Liao, Chengxi Li, Kai Fan

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[83] arXiv:2405.03548 [pdf, other]: Title: MAmmoTH2: Scaling Instructions from the Web

Authors: Xiang Yue, Tuney Zheng, Ge Zhang, Wenhu Chen

Comments: Work in Progress

Subjects: Computation and Language (cs.CL)
[84] arXiv:2405.03425 [pdf, other]: Title: Gaussian Stochastic Weight Averaging for Bayesian Low-Rank Adaptation of Large Language Models

Authors: Emre Onal, Klemens Flöge, Emma Caldwell, Arsen Sheverdin, Vincent Fortuin

Comments: 14 pages, 1 figure, 2 tables

Subjects: Computation and Language (cs.CL)
[85] arXiv:2405.03387 [pdf, ps, other]: Title: The high dimensional psychological profile and cultural bias of ChatGPT

Authors: Hang Yuan (1), Zhongyue Che (1), Shao Li (1), Yue Zhang, Xiaomeng Hu (2), Siyang Luo (1) ((1) Sun Yat-Sen University, (2) Renmin University of China)

Subjects: Computation and Language (cs.CL)
[86] arXiv:2405.03371 [pdf, other]: Title: Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom

Authors: Bo Wang, Jing Ma, Hongzhan Lin, Zhiwei Yang, Ruichao Yang, Yuan Tian, Yi Chang

Comments: 12 pages, WWW'2024

Subjects: Computation and Language (cs.CL)
[87] arXiv:2405.03359 [pdf, ps, other]: Title: MedDoc-Bot: A Chat Tool for Comparative Analysis of Large Language Models in the Context of the Pediatric Hypertension Guideline

Authors: Mohamed Yaseen Jabarulla, Steffen Oeltze-Jafra, Philipp Beerbaum, Theodor Uden

Comments: © 2024 IEEE. This work has been accepted for publication and presentation at the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, to be held in Orlando, Florida, USA, July 15-19, 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[88] arXiv:2405.03279 [pdf, other]: Title: Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning

Authors: Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue

Comments: 14 pages, 4 figures, 6 tables

Subjects: Computation and Language (cs.CL)
[89] arXiv:2405.03207 [pdf, other]: Title: A Philosophical Introduction to Language Models - Part II: The Way Forward

Authors: Raphaël Millière, Cameron Buckner

Subjects: Computation and Language (cs.CL)
[90] arXiv:2405.03206 [pdf, other]: Title: Vietnamese AI Generated Text Detection

Authors: Quang-Dan Tran, Van-Quan Nguyen, Quang-Huy Pham, K. B. Thang Nguyen, Trong-Hop Do

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[91] arXiv:2405.03205 [pdf, other]: Title: Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions

Authors: Ruizhe Li, Yanjun Gao

Comments: Work in process

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[92] arXiv:2405.03170 [pdf, other]: Title: Oracle-Checker Scheme for Evaluating a Generative Large Language Model

Authors: Yueling Jenny Zeng, Li-C. Wang, Thomas Ibbetson

Subjects: Computation and Language (cs.CL)
[93] arXiv:2405.03153 [pdf, ps, other]: Title: Exploring the Potential of the Large Language Models (LLMs) in Identifying Misleading News Headlines

Authors: Md Main Uddin Rony, Md Mahfuzul Haque, Mohammad Ali, Ahmed Shatil Alam, Naeemul Hassan

Comments: 5 pages, 2 tables, 1st HEAL Workshop at CHI Conference on Human Factors in Computing Systems, May 12, Honolulu, HI, USA 2024

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[94] arXiv:2405.03138 [pdf, other]: Title: CRAFT: Extracting and Tuning Cultural Instructions from the Wild

Authors: Bin Wang, Geyu Lin, Zhengyuan Liu, Chengwei Wei, Nancy F. Chen

Comments: 6 pages

Subjects: Computation and Language (cs.CL)
[95] arXiv:2405.03133 [pdf, other]: Title: Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training

Authors: Zexuan Zhong, Mengzhou Xia, Danqi Chen, Mike Lewis

Comments: 21 pages, 12 figures

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[96] arXiv:2405.03111 [pdf, ps, other]: Title: An Active Inference Agent for Simulating Human Translation Processes in a Hierarchical Architecture: Integrating the Task Segment Framework and the HOF taxonomy

Authors: Michael Carl

Subjects: Computation and Language (cs.CL)
[97] arXiv:2405.03098 [pdf, other]: Title: FairMonitor: A Dual-framework for Detecting Stereotypes and Biases in Large Language Models

Authors: Yanhong Bai, Jiabao Zhao, Jinxin Shi, Zhentao Xie, Xingjiao Wu, Liang He

Subjects: Computation and Language (cs.CL)
[98] arXiv:2405.03085 [pdf, other]: Title: Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation

Authors: Kaize Shi, Xueyao Sun, Qing Li, Guandong Xu

Subjects: Computation and Language (cs.CL)
[99] arXiv:2405.03084 [pdf, ps, other]: Title: Analyzing Emotional Trends from X platform using SenticNet: A Comparative Analysis with Cryptocurrency Price

Authors: Moein Shahiki Tash, Zahra Ahani, Olga Kolesnikova, Grigori Sidorov

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[100] arXiv:2405.03004 [pdf, other]: Title: Exploring prompts to elicit memorization in masked language model-based named entity recognition

Authors: Yuxi Xia, Anastasiia Sedova, Pedro Henrique Luz de Araujo, Vasiliki Kougia, Lisa Nußbaumer, Benjamin Roth

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101] arXiv:2405.03000 [pdf, other]: Title: MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning

Authors: Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Hang Wu, Carl Yang, May D. Wang

Comments: Work in Progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[102] arXiv:2405.02985 [pdf, ps, other]: Title: Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education

Authors: Owen Henkel, Adam Boxer, Libby Hills, Bill Roberts

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[103] arXiv:2405.02984 [pdf, other]: Title: E-TSL: A Continuous Educational Turkish Sign Language Dataset with Baseline Methods

Authors: Şükrü Öztürk, Hacer Yalim Keles

Comments: 7 pages, 3 figures, 4 tables, submitted to IEEE conference

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104] arXiv:2405.02937 [pdf, other]: Title: Unraveling the Dominance of Large Language Models Over Transformer Models for Bangla Natural Language Inference: A Comprehensive Study

Authors: Fatema Tuj Johora Faria, Mukaffi Bin Moin, Asif Iftekher Fahim, Pronay Debnath, Faisal Muhammad Shah

Comments: Accepted in 4th International Conference on Computing and Communication Networks (ICCCNet-2024)

Subjects: Computation and Language (cs.CL)
[105] arXiv:2405.02935 [pdf, other]: Title: Enabling Patient-side Disease Prediction via the Integration of Patient Narratives

Authors: Zhixiang Su, Yinan Zhang, Jiazheng Jing, Jie Xiao, Zhiqi Shen

Subjects: Computation and Language (cs.CL)
[106] arXiv:2405.02933 [pdf, other]: Title: Relay Decoding: Concatenating Large Language Models for Machine Translation

Authors: Chengpeng Fu, Xiaocheng Feng, Yichong Huang, Wenshuai Huo, Baohang Li, Hui Wang, Bin Qin, Ting Liu

Comments: Work in progress

Subjects: Computation and Language (cs.CL)
[107] arXiv:2405.02925 [pdf, other]: Title: A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU

Authors: Guanhua Chen, Yutong Yao, Derek F. Wong, Lidia S. Chao

Comments: LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[108] arXiv:2405.02887 [pdf, other]: Title: Sentiment Analysis Across Languages: Evaluation Before and After Machine Translation to English

Authors: Aekansh Kathunia, Mohammad Kaif, Nalin Arora, N Narotam

Comments: 6 pages, 3 Figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109] arXiv:2405.02861 [pdf, other]: Title: Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Authors: Yang Liu, Melissa Xiaohui Qin, Hongming Li, Chao Huang

Comments: 24 pages, 17 figures, 10 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[110] arXiv:2405.02817 [pdf, other]: Title: HuixiangDou-CR: Coreference Resolution in Group Chats

Authors: Huanjun Kong

Comments: 5 pages, 3 tables, 3 figures

Subjects: Computation and Language (cs.CL)
[111] arXiv:2405.02816 [pdf, other]: Title: Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization

Authors: Hamed Zamani, Michael Bendersky

Comments: To appear in the proceedings of SIGIR 2024

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[112] arXiv:2405.02814 [pdf, other]: Title: NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stimuli

Authors: Xu Wang, Cheng Li, Yi Chang, Jindong Wang, Yuan Wu

Comments: This paper has been accepted by IJCAI 2024

Subjects: Computation and Language (cs.CL)
[113] arXiv:2405.02765 [pdf, other]: Title: Detecting Edited Knowledge in Language Models

Authors: Paul Youssef, Zhixue Zhao, Jörg Schlötterer, Christin Seifert

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114] arXiv:2405.02764 [pdf, other]: Title: Assessing Adversarial Robustness of Large Language Models: An Empirical Study

Authors: Zeyu Yang, Zhao Meng, Xiaochen Zheng, Roger Wattenhofer

Comments: 16 pages, 9 figures, 10 tables

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[115] arXiv:2405.02750 [pdf, other]: Title: Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Authors: Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

Comments: Accepted to NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[116] arXiv:2405.02743 [pdf, other]: Title: Beyond Performance: Quantifying and Mitigating Label Bias in LLMs

Authors: Yuval Reif, Roy Schwartz

Comments: NAACL 2024

Subjects: Computation and Language (cs.CL)
[117] arXiv:2405.02738 [pdf, other]: Title: Relations Prediction for Knowledge Graph Completion using Large Language Models

Authors: Sakher Khalil Alqaaidi, Krzysztof Kochut

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118] arXiv:2405.02732 [pdf, other]: Title: Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents

Authors: Sneha Singhania, Simon Razniewski, Gerhard Weikum

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[119] arXiv:2405.02712 [pdf, other]: Title: CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions

Authors: Hanchong Zhang, Ruisheng Cao, Hongshen Xu, Lu Chen, Kai Yu

Subjects: Computation and Language (cs.CL)
[120] arXiv:2405.02710 [pdf, other]: Title: Enhancing News Summarization with ELearnFit through Efficient In-Context Learning and Efficient Fine-Tuning

Authors: Che Guan, Andrew Chin, Puya Vahabi

Comments: 9 Pages

Subjects: Computation and Language (cs.CL)
[121] arXiv:2405.02677 [pdf, other]: Title: Evaluating the Ability of Computationally Extracted Narrative Maps to Encode Media Framing

Authors: Sebastián Concha Macías, Brian Keith Norambuena

Comments: Text2Story Workshop 2024

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[122] arXiv:2405.02673 [pdf, other]: Title: On the Information Redundancy in Non-Autoregressive Translation

Authors: Zhihao Wang, Longyue Wang, Jinsong Su, Junfeng Yao, Zhaopeng Tu

Comments: 10 pages, 10 tables

Subjects: Computation and Language (cs.CL)
[123] arXiv:2405.02659 [pdf, other]: Title: R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models

Authors: Taolin Zhang, Dongyang Li, Qizhou Chen, Chengyu Wang, Longtao Huang, Hui Xue, Xiaofeng He, Jun Huang

Subjects: Computation and Language (cs.CL)
[124] arXiv:2405.02650 [pdf, other]: Title: Identifying Narrative Patterns and Outliers in Holocaust Testimonies Using Topic Modeling

Authors: Maxim Ifergan, Renana Keydar, Omri Abend, Amit Pinchevski

Comments: 9 pages, 7 figures, LREC-COLING 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125] arXiv:2405.02602 [pdf, other]: Title: Astro-NER -- Astronomy Named Entity Recognition: Is GPT a Good Domain Expert Annotator?

Authors: Julia Evans, Sameer Sadruddin, Jennifer D'Souza

Comments: 9 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[126] arXiv:2405.02578 [pdf, ps, other]: Title: Mixat: A Data Set of Bilingual Emirati-English Speech

Authors: Maryam Al Ali, Hanan Aldarmaki

Comments: SIGUL 2024

Subjects: Computation and Language (cs.CL)
[127] arXiv:2405.02573 [pdf, other]: Title: A Combination of BERT and Transformer for Vietnamese Spelling Correction

Authors: Hieu Ngo Trung, Duong Tran Ham, Tin Huynh, Kiem Hoang

Comments: 13 pages

Journal-ref: ACIIDS 2022, LNCS, vol 13757, Springer, Cham

Subjects: Computation and Language (cs.CL)
[128] arXiv:2405.02559 [pdf, ps, other]: Title: A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare

Authors: Thomas Yu Chow Tam, Sonish Sivarajkumar, Sumit Kapoor, Alisa V Stolyar, Katelyn Polanska, Karleigh R McCarthy, Hunter Osterhoudt, Xizhi Wu, Shyam Visweswaran, Sunyang Fu, Piyush Mathur, Giovanni E. Cacciamani, Cong Sun, Yifan Peng, Yanshan Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129] arXiv:2405.02517 [pdf, other]: Title: Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization

Authors: Alvin Po-Chun Chen, Ray Groshan, Sean von Bayern

Comments: 13 pages, 2 figures, to be published in Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

Subjects: Computation and Language (cs.CL)
[130] arXiv:2405.02501 [pdf, other]: Title: Beyond Helpfulness and Harmlessness: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning

Authors: Hyeong Kyu Choi, Yixuan Li

Comments: Paper accepted at ICML 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131] arXiv:2405.02472 [pdf, other]: Title: Semantic Scaling: Bayesian Ideal Point Estimates with Large Language Models

Authors: Michael Burnham

Subjects: Computation and Language (cs.CL)
[132] arXiv:2405.02454 [pdf, other]: Title: What is Sentiment Meant to Mean to Language Models?

Authors: Michael Burnham

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[133] arXiv:2405.02421 [pdf, other]: Title: What does the Knowledge Neuron Thesis Have to do with Knowledge?

Authors: Jingcheng Niu, Andrew Liu, Zining Zhu, Gerald Penn

Comments: ICLR 2024 (Spotlight)

Subjects: Computation and Language (cs.CL)
[134] arXiv:2405.02411 [pdf, other]: Title: The Call for Socially Aware Language Technologies

Authors: Diyi Yang, Dirk Hovy, David Jurgens, Barbara Plank

Subjects: Computation and Language (cs.CL)
[135] arXiv:2405.02353 [pdf, other]: Title: Early Transformers: A study on Efficient Training of Transformer Models through Early-Bird Lottery Tickets

Authors: Shravan Cheekati

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[136] arXiv:2405.02318 [pdf, other]: Title: NL2FOL: Translating Natural Language to First-Order Logic for Logical Fallacy Detection

Authors: Abhinav Lalwani, Lovish Chopra, Christopher Hahn, Caroline Trippel, Zhijing Jin, Mrinmaya Sachan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[137] arXiv:2405.03689 (cross-list from cs.CV) [pdf, other]: Title: Pose Priors from Language Models

Authors: Sanjay Subramanian, Evonne Ng, Lea Müller, Dan Klein, Shiry Ginosar, Trevor Darrell

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[138] arXiv:2405.03685 (cross-list from cs.CV) [pdf, other]: Title: Language-Image Models with 3D Understanding

Authors: Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[139] arXiv:2405.03452 (cross-list from cs.CY) [pdf, ps, other]: Title: Large Language Models (LLMs) as Agents for Augmented Democracy

Authors: Jairo Gudiño-Rosero, Umberto Grandi, César A. Hidalgo

Comments: 15 pages main manuscript with 3 figures. 12 pages of supplementary material

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[140] arXiv:2405.03162 (cross-list from cs.CV) [pdf, other]: Title: Advancing Multimodal Medical Capabilities of Gemini

Authors: Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen Zhou, Ira Ktena, Atilla Kiraly, Faruk Ahmed, Farhad Hormozdiari, Tiam Jaroensri, Eric Wang, Ellery Wulczyn, Fayaz Jamil, Theo Guidroz, Chuck Lau, Siyuan Qiao, Yun Liu, Akshay Goel, Kendall Park, Arnav Agharwal, Nick George, Yang Wang, Ryutaro Tanno, David G. T. Barrett, Wei-Hung Weng, S. Sara Mahdavi, Khaled Saab, Tao Tu, Sreenivasa Raju Kalidindi, Mozziyar Etemadi, Jorge Cuadros, Gregory Sorensen, Yossi Matias, Katherine Chou, Greg Corrado, Joelle Barral, Shravya Shetty, David Fleet, S. M. Ali Eslami, Daniel Tse, Shruthi Prabhakara, Cory McLean, Dave Steiner, Rory Pilgrim, Christopher Kelly, Shekoofeh Azizi, Daniel Golden

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)




Wed, 8 May 2024

Tue, 7 May 2024 (showing first 63 of 82 entries)