Computation and Language

Authors and titles for recent submissions

Mon, 6 May 2024
Fri, 3 May 2024
Thu, 2 May 2024
Wed, 1 May 2024
Tue, 30 Apr 2024

[ total of 350 entries: 1-340 | 341-350 ]
[ showing 340 entries per page: fewer | more | all ]

Mon, 6 May 2024

[1] arXiv:2405.02287 [pdf, other]: Title: Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

Authors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi Tay

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2] arXiv:2405.02228 [pdf, other]: Title: REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs

Authors: Deepa Tilwani, Yash Saxena, Ali Mohammadi, Edward Raff, Amit Sheth, Srinivasan Parthasarathy, Manas Gaur

Comments: Submitted to ACL ARR April 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[3] arXiv:2405.02195 [pdf, ps, other]: Title: Impact of emoji exclusion on the performance of Arabic sarcasm detection models

Authors: Ghalyah H. Aleryani, Wael Deabes, Khaled Albishre, Alaa E. Abdel-Hakim

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4] arXiv:2405.02178 [pdf, other]: Title: Assessing and Verifying Task Utility in LLM-Powered Applications

Authors: Negar Arabzadeh, Siging Huo, Nikhil Mehta, Qinqyun Wu, Chi Wang, Ahmed Awadallah, Charles L. A. Clarke, Julia Kiseleva

Comments: arXiv admin note: text overlap with arXiv:2402.09015

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[5] arXiv:2405.02175 [pdf, other]: Title: Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset

Authors: Hsuvas Borkakoty, Luis Espinosa-Anke

Comments: Short paper

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[6] arXiv:2405.02165 [pdf, other]: Title: EEG2TEXT: Open Vocabulary EEG-to-Text Decoding with EEG Pre-Training and Multi-View Transformer

Authors: Hanwen Liu, Daniel Hajialigol, Benny Antony, Aiguo Han, Xuan Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[7] arXiv:2405.02144 [pdf, other]: Title: MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain

Authors: Chao Jiang, Wei Xu

Subjects: Computation and Language (cs.CL)
[8] arXiv:2405.02134 [pdf, other]: Title: Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection

Authors: Guillem Ramírez, Alexandra Birch, Ivan Titov

Subjects: Computation and Language (cs.CL)
[9] arXiv:2405.02128 [pdf, ps, other]: Title: Single and Multi-Hop Question-Answering Datasets for Reticular Chemistry with GPT-4-Turbo

Authors: Nakul Rampal, Kaiyu Wang, Matthew Burigana, Lingxiang Hou, Juri Al-Johani, Anna Sackmann, Hanan S. Murayshid, Walaa Abdullah Al-Sumari, Arwa M. Al-Abdulkarim, Nahla Eid Al-Hazmi, Majed O. Al-Awad, Christian Borgs, Jennifer T. Chayes, Omar M. Yaghi

Subjects: Computation and Language (cs.CL); Materials Science (cond-mat.mtrl-sci)
[10] arXiv:2405.02079 [pdf, other]: Title: Argumentative Large Language Models for Explainable and Contestable Decision-Making

Authors: Gabriel Freedman, Adam Dejl, Deniz Gorur, Xiang Yin, Antonio Rago, Francesca Toni

Comments: 19 pages, 17 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[11] arXiv:2405.02040 [pdf, ps, other]: Title: Large Multimodal Model based Standardisation of Pathology Reports with Confidence and their Prognostic Significance

Authors: Ethar Alzaid, Gabriele Pergola, Harriet Evans, David Snead, Fayyaz Minhas

Comments: 19 pages, 6 figures

Subjects: Computation and Language (cs.CL)
[12] arXiv:2405.02024 [pdf, other]: Title: Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT

Authors: Patrick Krauss, Jannik Hösch, Claus Metzner, Andreas Maier, Peter Uhrig, Achim Schilling

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[13] arXiv:2405.02010 [pdf, other]: Title: The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification

Authors: Minh Duc Bui, Katharina von der Wense

Comments: Accepted to the 4th Workshop on Trustworthy Natural Language Processing (TrustNLP) at NAACL 2024

Subjects: Computation and Language (cs.CL)
[14] arXiv:2405.01997 [pdf, ps, other]: Title: Exploring Combinatorial Problem Solving with Large Language Models: A Case Study on the Travelling Salesman Problem Using GPT-3.5 Turbo

Authors: Mahmoud Masoud, Ahmed Abdelhay, Mohammed Elhenawy

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[15] arXiv:2405.01976 [pdf, other]: Title: Conformal Prediction for Natural Language Processing: A Survey

Authors: Margarida M. Campos, António Farinhas, Chrysoula Zerva, Mário A.T. Figueiredo, André F.T. Martins

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[16] arXiv:2405.01972 [pdf, other]: Title: A quantitative and typological study of Early Slavic participle clauses and their competition

Authors: Nilo Pedrazzini

Comments: 259 pages, 138 figures. DPhil Thesis in Linguistics submitted and defended at the University of Oxford (December 2023). This manuscript is a version formatted for improved readability and broader dissemination

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[17] arXiv:2405.01943 [pdf, other]: Title: Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models

Authors: Zhiyu Guo, Hidetaka Kamigaito, Taro Wanatnabe

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[18] arXiv:2405.01942 [pdf, other]: Title: CRCL at SemEval-2024 Task 2: Simple prompt optimizations

Authors: Clément Brutti-Mairesse, Loïc Verlingue

Journal-ref: SemEval-2024

Subjects: Computation and Language (cs.CL)
[19] arXiv:2405.01930 [pdf, other]: Title: OARelatedWork: A Large-Scale Dataset of Related Work Sections with Full-texts from Open Access Sources

Authors: Martin Docekal, Martin Fajcik, Pavel Smrz

Subjects: Computation and Language (cs.CL)
[20] arXiv:2405.01924 [pdf, other]: Title: Semi-Parametric Retrieval via Binary Token Index

Authors: Jiawei Zhou, Li Dong, Furu Wei, Lei Chen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[21] arXiv:2405.01886 [pdf, other]: Title: Aloe: A Family of Fine-tuned Open Healthcare LLMs

Authors: Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés Dario Garcia-Gasulla

Comments: Five appendix

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22] arXiv:2405.01884 [pdf, other]: Title: Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction

Authors: Wanlong Liu, Li Zhou, Dingyi Zeng, Yichen Xiao, Shaohuan Cheng, Chen Zhang, Grandee Lee, Malu Zhang, Wenyu Chen

Subjects: Computation and Language (cs.CL)
[23] arXiv:2405.01883 [pdf, other]: Title: DALLMi: Domain Adaption for LLM-based Multi-label Classifier

Authors: Miruna Beţianu, Abele Mălan, Marco Aldinucci, Robert Birke, Lydia Chen

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[24] arXiv:2405.01873 [pdf, other]: Title: Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram Language

Authors: Md Robiul Islam, Al Amin, Aniqua Nusrat Zereen

Comments: This paper contains 6 pages, 8 figures

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[25] arXiv:2405.01868 [pdf, other]: Title: Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems

Authors: Chuang Li, Yang Deng, Hengchang Hu, Min-Yen Kan, Haizhou Li

Comments: Main paper 8 pages; References and Appendix 9 pages; 7 figures and 14 tables

Subjects: Computation and Language (cs.CL)
[26] arXiv:2405.01858 [pdf, other]: Title: SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India

Authors: Salam Michael Singh, Shubhmoy Kumar Garg, Amitesh Misra, Aaditeshwar Seth, Tanmoy Chakraborty

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[27] arXiv:2405.01842 [pdf, ps, other]: Title: SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore

Authors: Ri Chi Ng, Nirmalendu Prakash, Ming Shan Hee, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

Subjects: Computation and Language (cs.CL)
[28] arXiv:2405.01827 [pdf, other]: Title: SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-training

Authors: Jin Wang, Liang-Chih Yu, Xuejie Zhang

Comments: Accepted by LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[29] arXiv:2405.01799 [pdf, other]: Title: Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features

Authors: Chuanbo Hu, Wenqi Li, Mindi Ruan, Xiangxu Yu, Lynn K. Paul, Shuo Wang, Xin Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[30] arXiv:2405.01796 [pdf, other]: Title: TOPICAL: TOPIC Pages AutomagicaLly

Authors: John Giorgi, Amanpreet Singh, Doug Downey, Sergey Feldman, Lucy Lu Wang

Comments: 10 pages, 7 figures, 2 tables, NAACL System Demonstrations 2024

Subjects: Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR)
[31] arXiv:2405.01790 [pdf, other]: Title: Understanding Position Bias Effects on Fairness in Social Multi-Document Summarization

Authors: Olubusayo Olabisi, Ameeta Agrawal

Comments: Accepted at VarDial 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32] arXiv:2405.01783 [pdf, ps, other]: Title: Layers of technology in pluriversal design. Decolonising language technology with the LiveLanguage initiative

Authors: Gertraud Koch, Gábor Bella, Paula Helm, Fausto Giunchiglia

Subjects: Computation and Language (cs.CL)
[33] arXiv:2405.01769 [pdf, other]: Title: A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law

Authors: Zhiyu Zoey Chen, Jing Ma, Xinlu Zhang, Nan Hao, An Yan, Armineh Nourbakhsh, Xianjun Yang, Julian McAuley, Linda Petzold, William Yang Wang

Comments: 35 pages, 6 figures

Subjects: Computation and Language (cs.CL)
[34] arXiv:2405.01768 [pdf, other]: Title: CoS: Enhancing Personalization and Mitigating Bias with Context Steering

Authors: Jerry Zhi-Yang He, Sashrika Pandey, Mariah L. Schrum, Anca Dragan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35] arXiv:2405.01740 [pdf, ps, other]: Title: The Psychosocial Impacts of Generative AI Harms

Authors: Faye-Marie Vassel, Evan Shieh, Cassidy R. Sugimoto, Thema Monroe-White

Comments: Presented in Impact of GenAI on Social and Individual Well-being at AAAI 2024 Spring Symposium Series (2024)

Subjects: Computation and Language (cs.CL)
[36] arXiv:2405.01738 [pdf, other]: Title: Question Suggestion for Conversational Shopping Assistants Using Product Metadata

Authors: Nikhita Vedula, Oleg Rokhlenko, Shervin Malmasi

Comments: 5 pages, 1 figure

Subjects: Computation and Language (cs.CL)
[37] arXiv:2405.01724 [pdf, other]: Title: Large Language Models are Inconsistent and Biased Evaluators

Authors: Rickard Stureborg, Dimitris Alikaniotis, Yoshi Suhara

Comments: 9 pages, 7 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[38] arXiv:2405.01686 [pdf, other]: Title: Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language Models

Authors: Hye Sun Yun, David Pogrebitskiy, Iain J. Marshall, Byron C. Wallace

Comments: 24 pages, 7 figures, 6 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[39] arXiv:2405.01682 [pdf, other]: Title: Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource Language

Authors: Liam Hazan, Gili Focht, Naama Gavrielov, Roi Reichart, Talar Hagopian, Mary-Louise C. Greer, Ruth Cytter Kuint, Dan Turner, Moti Freiman

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[40] arXiv:2405.01678 [pdf, other]: Title: 1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential Privacy

Authors: Stephen Meisenbacher, Maulik Chevli, Florian Matthes

Comments: 12 pages, 7 figures, 7 tables, 10th ACM International Workshop on Security and Privacy Analytics (IWSPA 2024)

Subjects: Computation and Language (cs.CL)
[41] arXiv:2405.01660 [pdf, other]: Title: Investigating Wit, Creativity, and Detectability of Large Language Models in Domain-Specific Writing Style Adaptation of Reddit's Showerthoughts

Authors: Tolga Buz, Benjamin Frost, Nikola Genchev, Moritz Schneider, Lucie-Aimée Kaffee, Gerard de Melo

Comments: Accepted to *SEM 2024 (StarSEM) conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42] arXiv:2405.01649 [pdf, other]: Title: Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning

Authors: Tianle Xia, Liang Ding, Guojia Wan, Yibing Zhan, Bo Du, Dacheng Tao

Comments: arXiv admin note: text overlap with arXiv:2305.01157, arXiv:2212.09567 by other authors

Subjects: Computation and Language (cs.CL)
[43] arXiv:2405.01610 [pdf, other]: Title: Automating the Analysis of Public Saliency and Attitudes towards Biodiversity from Digital Media

Authors: Noah Giebink, Amrita Gupta, Diogo Verìssimo, Charlotte H. Chang, Tony Chang, Angela Brennan, Brett Dickson, Alex Bowmer, Jonathan Baillie

Comments: v0.1, 21 pages with 10 figures

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[44] arXiv:2405.01601 [pdf, other]: Title: Efficient Sample-Specific Encoder Perturbations

Authors: Yassir Fathullah, Mark J. F. Gales

Comments: To appear in NAACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[45] arXiv:2405.01597 [pdf, other]: Title: Improving Disease Detection from Social Media Text via Self-Augmentation and Contrastive Learning

Authors: Pervaiz Iqbal Khan, Andreas Dengel, Sheraz Ahmed

Subjects: Computation and Language (cs.CL)
[46] arXiv:2405.01593 [pdf, other]: Title: Large Language Model Agent for Fake News Detection

Authors: Xinyi Li, Yongfeng Zhang, Edward C. Malthouse

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[47] arXiv:2405.01592 [pdf, ps, other]: Title: Text and Audio Simplification: Human vs. ChatGPT

Authors: Gondy Leroy, David Kauchak, Philip Harber, Ankit Pal, Akash Shukla

Comments: AMIA Summit, Boston, 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[48] arXiv:2405.01591 [pdf, other]: Title: Simplifying Multimodality: Unimodal Approach to Multimodal Challenges in Radiology with General-Domain Large Language Model

Authors: Seonhee Cho, Choonghan Kim, Jiho Lee, Chetan Chilkunda, Sujin Choi, Joo Heung Yoon

Comments: Under review

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[49] arXiv:2405.01590 [pdf, other]: Title: 101 Billion Arabic Words Dataset

Authors: Manel Aloui, Hasna Chouikhi, Ghaith Chaabane, Haithem Kchaou, Chehir Dhaouadi

Subjects: Computation and Language (cs.CL)
[50] arXiv:2405.01589 [pdf, ps, other]: Title: GPT-4 passes most of the 297 written Polish Board Certification Examinations

Authors: Jakub Pokrywka, Jeremi Kaczmarek, Edward Gorzelańczyk

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[51] arXiv:2405.01588 [pdf, other]: Title: Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL

Authors: Yongjin Yang, Sihyeon Kim, SangMook Kim, Gyubok Lee, Se-Young Yun, Edward Choi

Comments: DPFM Workshop, ICLR 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[52] arXiv:2405.01587 [pdf, ps, other]: Title: Improve Academic Query Resolution through BERT-based Question Extraction from Images

Authors: Nidhi Kamal, Saurabh Yadav, Jorawar Singh, Aditi Avasthi

Journal-ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[53] arXiv:2405.01586 [pdf, other]: Title: Transfer Learning and Transformer Architecture for Financial Sentiment Analysis

Authors: Tohida Rehman, Raghubir Bose, Samiran Chattopadhyay, Debarshi Kumar Sanyal

Comments: 12 pages, 9 figures

Journal-ref: Proceedings of International Conference on Computational Intelligence, Data Science and Cloud Computing: IEM-ICDC 2021,pages 17--27

Subjects: Computation and Language (cs.CL)
[54] arXiv:2405.01584 [pdf, other]: Title: Lightweight Conceptual Dictionary Learning for Text Classification Using Information Compression

Authors: Li Wan, Tansu Alpcan, Margreta Kuijper, Emanuele Viterbo

Comments: 12 pages, TKDE format

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Signal Processing (eess.SP)
[55] arXiv:2405.01583 [pdf, other]: Title: MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning

Authors: Nadia Saeed

Comments: 7 pages, 3 figures, Clinical NLP 2024 workshop proceedings in Shared Task

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[56] arXiv:2405.01582 [pdf, other]: Title: Text Quality-Based Pruning for Efficient Training of Language Models

Authors: Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Shang-Wen Li, Armen Aghajanyan, Gargi Ghosh

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[57] arXiv:2405.01581 [pdf, other]: Title: The Mercurial Top-Level Ontology of Large Language Models

Authors: Nele Köhler, Fabian Neuhaus

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[58] arXiv:2405.01577 [pdf, other]: Title: HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models

Authors: Tanmay Sen, Ansuman Das, Mrinmay Sen

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[59] arXiv:2405.01576 [pdf, other]: Title: Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI Assistant

Authors: Olli Järviniemi, Evan Hubinger

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[60] arXiv:2405.02267 (cross-list from cs.LG) [pdf, other]: Title: Structural Pruning of Pre-trained Language Models via Neural Architecture Search

Authors: Aaron Klein, Jacek Golebiowski, Xingchen Ma, Valerio Perrone, Cedric Archambeau

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[61] arXiv:2405.02132 (cross-list from cs.SD) [pdf, other]: Title: Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

Authors: Xuelong Geng, Tianyi Xu, Kun Wei, Bingshen Mu, Hongfei Xue, He Wang, Yangze Li, Pengcheng Guo, Yuhang Dai, Longhao Li, Mingchen Shao, Lei Xie

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[62] arXiv:2405.02124 (cross-list from eess.AS) [pdf, other]: Title: TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer

Authors: Noé Tits, Prernna Bhatnagar, Thierry Dutoit

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[63] arXiv:2405.02105 (cross-list from cs.AI) [pdf, other]: Title: Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph

Authors: Vladyslav Nechakhin, Jennifer D'Souza, Steffen Eger

Comments: 22 pages, 11 figures. In review at this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
[64] arXiv:2405.01988 (cross-list from cs.SD) [pdf, other]: Title: Joint sentiment analysis of lyrics and audio in music

Authors: Lea Schaab, Anna Kruspe

Comments: published at DAGA 2024

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[65] arXiv:2405.01744 (cross-list from cs.LG) [pdf, other]: Title: ALCM: Autonomous LLM-Augmented Causal Discovery Framework

Authors: Elahe Khatibi, Mahyar Abbasian, Zhongqi Yang, Iman Azimi, Amir M. Rahmani

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Methodology (stat.ME)
[66] arXiv:2405.01585 (cross-list from cs.AI) [pdf, other]: Title: Tabular Embedding Model (TEM): Finetuning Embedding Models For Tabular RAG Applications

Authors: Sujit Khanna, Shishir Subedi

Comments: 11 pages, 5 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[67] arXiv:2405.01575 (cross-list from cs.SE) [pdf, other]: Title: Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024

Authors: Thuy Nguyen Thi, Anh Nguyen Viet, Thin Dang Van, Ngan Nguyen Luu Thuy

Comments: Software mention recognition, Named entity recognition, Transformer, Three-stage framework

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[68] arXiv:2405.01563 (cross-list from cs.LG) [pdf, other]: Title: Mitigating LLM Hallucinations via Conformal Abstention

Authors: Yasin Abbasi Yadkori, Ilja Kuzborskij, David Stutz, András György, Adam Fisch, Arnaud Doucet, Iuliya Beloshapka, Wei-Hung Weng, Yao-Yuan Yang, Csaba Szepesvári, Ali Taylan Cemgil, Nenad Tomasev

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[69] arXiv:2405.01556 (cross-list from cs.SE) [pdf, other]: Title: Semantically Aligned Question and Code Generation for Automated Insight Generation

Authors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust Verbruggen

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Fri, 3 May 2024

[70] arXiv:2405.01535 [pdf, other]: Title: Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Authors: Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo

Comments: Work in Progress

Subjects: Computation and Language (cs.CL)
[71] arXiv:2405.01525 [pdf, other]: Title: FLAME: Factuality-Aware Alignment for Large Language Models

Authors: Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[72] arXiv:2405.01511 [pdf, other]: Title: D2PO: Discriminator-Guided DPO with Response Evaluation Models

Authors: Prasann Singhal, Nathan Lambert, Scott Niekum, Tanya Goyal, Greg Durrett

Comments: 20 pages, 12 figures

Subjects: Computation and Language (cs.CL)
[73] arXiv:2405.01502 [pdf, other]: Title: Analyzing the Role of Semantic Representations in the Era of Large Language Models

Authors: Zhijing Jin, Yuen Chen, Fernando Gonzalez, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona Diab

Comments: NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[74] arXiv:2405.01490 [pdf, other]: Title: Controllable Text Generation in the Instruction-Tuning Era

Authors: Dhananjay Ashok, Barnabas Poczos

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[75] arXiv:2405.01481 [pdf, other]: Title: NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Authors: Gerald Shen, Zhilin Wang, Olivier Delalleau, Jiaqi Zeng, Yi Dong, Daniel Egert, Shengyang Sun, Jimmy Zhang, Sahil Jain, Ali Taghibakhshi, Markel Sanz Ausin, Ashwath Aithal, Oleksii Kuchaiev

Comments: 13 pages, 4 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[76] arXiv:2405.01474 [pdf, other]: Title: V-FLUTE: Visual Figurative Language Understanding with Textual Explanations

Authors: Arkadiy Saakyan, Shreyas Kulkarni, Tuhin Chakrabarty, Smaranda Muresan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2405.01470 [pdf, other]: Title: WildChat: 1M ChatGPT Interaction Logs in the Wild

Authors: Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Yejin Choi, Yuntian Deng

Comments: accepted by ICLR 2024

Subjects: Computation and Language (cs.CL)
[78] arXiv:2405.01458 [pdf, other]: Title: UQA: Corpus for Urdu Question Answering

Authors: Samee Arif, Sualeha Farid, Awais Athar, Agha Ali Raza

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[79] arXiv:2405.01403 [pdf, other]: Title: Unsupervised Flow Discovery from Task-oriented Dialogues

Authors: Patrícia Ferreira, Daniel Martins, Ana Alves, Catarina Silva, Hugo Gonçalo Oliveira

Comments: 12 pages, 4 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[80] arXiv:2405.01379 [pdf, other]: Title: Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving

Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

Subjects: Computation and Language (cs.CL)
[81] arXiv:2405.01376 [pdf, other]: Title: Topics in the Study of the Pragmatic Functions of Phonetic Reduction in Dialog

Authors: Nigel G. Ward, Carlos A. Ortega

Subjects: Computation and Language (cs.CL)
[82] arXiv:2405.01359 [pdf, other]: Title: GAIA: A General AI Assistant for Intelligent Accelerator Operations

Authors: Frank Mayet

Subjects: Computation and Language (cs.CL); Accelerator Physics (physics.acc-ph)
[83] arXiv:2405.01345 [pdf, other]: Title: The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights

Authors: Wenhao Zhu, Shujian Huang, Fei Yuan, Cheng Chen, Jiajun Chen, Alexandra Birch

Subjects: Computation and Language (cs.CL)
[84] arXiv:2405.01299 [pdf, other]: Title: The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation

Authors: Maja Pavlovic, Massimo Poesio

Comments: LREC-COLING NLPerspectives workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[85] arXiv:2405.01293 [pdf, ps, other]: Title: Low-resource speech recognition and dialect identification of Irish in a multi-task framework

Authors: Liam Lonergan, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide

Comments: 7 pages. Accepted to Odyssey 2024 - The Speaker and Language Recognition Workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[86] arXiv:2405.01280 [pdf, other]: Title: Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation

Authors: Hao Wang, Tetsuro Morimura, Ukyo Honda, Daisuke Kawahara

Subjects: Computation and Language (cs.CL)
[87] arXiv:2405.01249 [pdf, ps, other]: Title: Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices

Authors: Jamil Zaghir, Marco Naguib, Mina Bjelogrlic, Aurélie Névéol, Xavier Tannier, Christian Lovis

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[88] arXiv:2405.01216 [pdf, other]: Title: DMON: A Simple yet Effective Approach for Argument Structure Learning

Authors: Wei Sun, Mingxiao Li, Jingyuan Sun, Jesse Davis, Marie-Francine Moens

Comments: COLING 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89] arXiv:2405.01159 [pdf, other]: Title: TartuNLP at EvaLatin 2024: Emotion Polarity Detection

Authors: Aleksei Dorkin, Kairit Sirts

Comments: Accepted to The Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2024)

Subjects: Computation and Language (cs.CL)
[90] arXiv:2405.01139 [pdf, other]: Title: It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised Learning

Authors: Brielen Madureira, David Schlangen

Comments: work in progress

Subjects: Computation and Language (cs.CL)
[91] arXiv:2405.01121 [pdf, other]: Title: Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts

Authors: Lotem Golany, Filippo Galgani, Maya Mamo, Nimrod Parasol, Omer Vandsburger, Nadav Bar, Ido Dagan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[92] arXiv:2405.01022 [pdf, other]: Title: UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation

Authors: Juhwan Choi, Yeonghwa Kim, Seunguk Yu, JungMin Yun, YoungBin Kim

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[93] arXiv:2405.00997 [pdf, other]: Title: The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment

Authors: Chris Chinenye Emezue, Ifeoma Okoh, Chinedu Mbonu, Chiamaka Chukwuneke, Daisy Lal, Ignatius Ezeani, Paul Rayson, Ijemma Onwuzulike, Chukwuma Okeke, Gerald Nweya, Bright Ogbonna, Chukwuebuka Oraegbunam, Esther Chidinma Awo-Ndubuisi, Akudo Amarachukwu Osuagwu, Obioha Nmezi

Comments: Accepted to the LREC-COLING 2024 conference

Subjects: Computation and Language (cs.CL)
[94] arXiv:2405.00988 [pdf, other]: Title: Context-Aware Clustering using Large Language Models

Authors: Sindhu Tipirneni, Ravinarayana Adkathimar, Nurendra Choudhary, Gaurush Hiranandani, Rana Ali Amjad, Vassilis N. Ioannidis, Changhe Yuan, Chandan K. Reddy

Comments: 16 pages

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[95] arXiv:2405.00982 [pdf, other]: Title: On the Evaluation of Machine-Generated Reports

Authors: James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Kayi, Kate Sanders, Marc Mason, Noah Hibbler

Comments: 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paper

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[96] arXiv:2405.00980 [pdf, other]: Title: A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News

Authors: Zhe Niu, Ronglai Zuo, Brian Mak, Fangyun Wei

Comments: Accepted by LREC-COLING 2024

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2405.00972 [pdf, other]: Title: CACTUS: Chemistry Agent Connecting Tool-Usage to Science

Authors: Andrew D. McNaughton, Gautham Ramalaxmi, Agustin Kruel, Carter R. Knutson, Rohith A. Varikoti, Neeraj Kumar

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph); Quantitative Methods (q-bio.QM)
[98] arXiv:2405.00970 [pdf, other]: Title: How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses

Authors: Jionghao Lin, Zifei Han, Danielle R. Thomas, Ashish Gurung, Shivang Gupta, Vincent Aleven, Kenneth R. Koedinger

Comments: International Journal of Artificial Intelligence in Education

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[99] arXiv:2405.00966 [pdf, other]: Title: Efficient Compression of Multitask Multilingual Speech Models

Authors: Thomas Palmeira Ferraz

Comments: Master Thesis

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[100] arXiv:2405.00948 [pdf, other]: Title: Modeling Empathetic Alignment in Conversation

Authors: Jiamin Yang, David Jurgens

Comments: Camera-ready version for NAACL 2024

Subjects: Computation and Language (cs.CL)
[101] arXiv:2405.00903 [pdf, other]: Title: A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media

Authors: Ayaz Mehmood, Muhammad Tayyab Zamir, Muhammad Asif Ayub, Nasir Ahmad, Kashif Ahmad

Comments: 15 pages; 4 tables; 4 figures

Subjects: Computation and Language (cs.CL)
[102] arXiv:2405.00888 [pdf, other]: Title: DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling

Authors: Shikhar Tuli, Chi-Heng Lin, Yen-Chang Hsu, Niraj K. Jha, Yilin Shen, Hongxia Jin

Comments: Accepted at NAACL 2024

Subjects: Computation and Language (cs.CL)
[103] arXiv:2405.00864 [pdf, other]: Title: Math Multiple Choice Question Generation via Human-Large Language Model Collaboration

Authors: Jaewook Lee, Digory Smith, Simon Woodhead, Andrew Lan

Comments: 17th International Conference on Educational Data Mining (EDM 2024)

Subjects: Computation and Language (cs.CL)
[104] arXiv:2405.00828 [pdf, other]: Title: WIBA: What Is Being Argued? A Comprehensive Approach to Argument Mining

Authors: Arman Irani, Ju Yeon Park, Kevin Esterling, Michalis Faloutsos

Comments: 8 pages, 2 figures, submitted to The 16th International Conference on Advances in Social Networks Analysis and Mining (ASONAM) '24

Subjects: Computation and Language (cs.CL)
[105] arXiv:2405.00823 [pdf, other]: Title: WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting

Authors: Olly Styles, Sam Miller, Patricio Cerda-Mardini, Tanaya Guha, Victor Sanchez, Bertie Vidgen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[106] arXiv:2405.00821 [pdf, other]: Title: Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media

Authors: Gregorios Katsios, Ning Sa, Ankita Bhaumik, Tomek Strzalkowski

Subjects: Computation and Language (cs.CL)
[107] arXiv:2405.00801 [pdf, ps, other]: Title: "Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real Time

Authors: Scott Rome, Tianwen Chen, Raphael Tang, Luwei Zhou, Ferhan Ture

Subjects: Computation and Language (cs.CL)
[108] arXiv:2405.00732 [pdf, other]: Title: LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Authors: Justin Zhao, Timothy Wang, Wael Abid, Geoffrey Angus, Arnav Garg, Jeffery Kinnison, Alex Sherstinsky, Piero Molino, Travis Addair, Devvret Rishi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[109] arXiv:2405.00728 [pdf, ps, other]: Title: Evaluating the Application of ChatGPT in Outpatient Triage Guidance: A Comparative Study

Authors: Dou Liu, Ying Han, Xiandi Wang, Xiaomei Tan, Di Liu, Guangwu Qian, Kang Li, Dan Pu, Rong Yin

Comments: 8 pages, 1 figure, conference(International Ergonomics Association)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[110] arXiv:2405.00722 [pdf, other]: Title: LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study

Authors: Van Bach Nguyen, Paul Youssef, Jörg Schlötterer, Christin Seifert

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[111] arXiv:2405.00718 [pdf, other]: Title: Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models

Authors: Xu Ji, Jianyi Zhang, Ziyin Zhou, Zhangchi Zhao, Qianqian Qiao, Kaiying Han, Md Imran Hossen, Xiali Hei

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[112] arXiv:2405.00717 [pdf, other]: Title: Exploring News Summarization and Enrichment in a Highly Resource-Scarce Indian Language: A Case Study of Mizo

Authors: Abhinaba Bala, Ashok Urlana, Rahul Mishra, Parameswari Krishnamurthy

Comments: Accepted at LREC-COLING2024 WILDRE Workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[113] arXiv:2405.00716 [pdf, other]: Title: Large Language Models in Healthcare: A Comprehensive Benchmark

Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Lei Clifton, David A. Clifton

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114] arXiv:2405.00715 [pdf, other]: Title: Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation

Authors: Hanyin Wang, Chufan Gao, Bolun Liu, Qiping Xu, Guleid Hussein, Mohamad El Labban, Kingsley Iheasirim, Hariprasad Korsapati, Jimeng Sun

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115] arXiv:2405.00711 [pdf, other]: Title: Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities

Authors: Xiaomin Yu, Yezhaohui Wang, Yanfang Chen, Zhen Tao, Dinghao Xi, Shichao Song, Simin Niu, Zhiyu Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[116] arXiv:2405.00710 [pdf, ps, other]: Title: Homonym Sense Disambiguation in the Georgian Language

Authors: Davit Melikidze, Alexander Gamkrelidze

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[117] arXiv:2405.00709 [pdf, other]: Title: Evaluating Tool-Augmented Agents in Remote Sensing Platforms

Authors: Simranjit Singh, Michael Fore, Dimitrios Stamoulis

Comments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[118] arXiv:2405.00708 [pdf, other]: Title: Interactive Analysis of LLMs using Meaningful Counterfactuals

Authors: Furui Cheng, Vilém Zouhar, Robin Shing Moon Chan, Daniel Fürst, Hendrik Strobelt, Mennatallah El-Assady

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[119] arXiv:2405.00706 [pdf, ps, other]: Title: Science Written by Generative AI is Perceived as Less Intelligent, but More Credible and Trustworthy than Science Written by Humans

Authors: David M. Markowitz

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[120] arXiv:2405.00705 [pdf, other]: Title: SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning

Authors: Yexiao He, Ziyao Wang, Zheyu Shen, Guoheng Sun, Yucong Dai, Yongkai Wu, Hongyi Wang, Ang Li

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[121] arXiv:2405.00704 [pdf, ps, other]: Title: A Survey on the Real Power of ChatGPT

Authors: Ming Liu, Ran Liu, Hua Wang, Wray Buntine

Comments: 9 pages, 2 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[122] arXiv:2405.01509 (cross-list from cs.CR) [pdf, other]: Title: Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models

Authors: Minhao Bai, Kaiyi Pang, Yongfeng Huang

Comments: not decided

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[123] arXiv:2405.01483 (cross-list from cs.CV) [pdf, other]: Title: MANTIS: Interleaved Multi-Image Instruction Tuning

Authors: Dongfu Jiang, Xuan He, Huaye Zeng, Cong Wei, Max Ku, Qian Liu, Wenhu Chen

Comments: 9 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[124] arXiv:2405.01413 (cross-list from cs.CV) [pdf, other]: Title: MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

Authors: Yuan Tang, Xu Han, Xianzhi Li, Qiao Yu, Yixue Hao, Long Hu, Min Chen

Comments: 17 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[125] arXiv:2405.01310 (cross-list from cs.IR) [pdf, other]: Title: Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation

Authors: Dr. Selva Kumar S, Afifah Khan Mohammed Ajmal Khan, Imadh Ajaz Banday, Manikantha Gada, Vibha Venkatesh Shanbhag

Comments: 6 pages, 3 figures

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[126] arXiv:2405.01259 (cross-list from cs.AI) [pdf, other]: Title: Identification of Entailment and Contradiction Relations between Natural Language Sentences: A Neurosymbolic Approach

Authors: Xuyao Feng, Anthony Hunter

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[127] arXiv:2405.01229 (cross-list from cs.LG) [pdf, ps, other]: Title: Boosting Jailbreak Attack with Momentum

Authors: Yihao Zhang, Zeming Wei

Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
[128] arXiv:2405.01097 (cross-list from cs.CY) [pdf, other]: Title: Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification

Authors: Dimitri Staufer, Frank Pallas, Bettina Berendt

Comments: Accepted for publication at the ACM Conference on Fairness, Accountability, and Transparency 2024 (ACM FAccT'24). This is a preprint manuscript (authors' own version before final copy-editing)

Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR); Software Engineering (cs.SE)
[129] arXiv:2405.01040 (cross-list from cs.CV) [pdf, other]: Title: Few Shot Class Incremental Learning using Vision-Language models

Authors: Anurag Kumar, Chinmay Bharti, Saikat Dutta, Srikrishna Karanam, Biplab Banerjee

Comments: under review at Pattern Recognition Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[130] arXiv:2405.00981 (cross-list from cs.AI) [pdf, other]: Title: Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation

Authors: David Eric Austin, Anton Korikov, Armin Toroghi, Scott Sanner

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[131] arXiv:2405.00978 (cross-list from cs.IR) [pdf, other]: Title: Language Fairness in Multilingual Information Retrieval

Authors: Eugene Yang, Thomas Jänich, James Mayfield, Dawn Lawrie

Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[132] arXiv:2405.00977 (cross-list from cs.IR) [pdf, other]: Title: Distillation for Multilingual Information Retrieval

Authors: Eugene Yang, Dawn Lawrie, James Mayfield

Comments: 6 pages, 1 figure, accepted at SIGIR 2024 as short paper

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[133] arXiv:2405.00975 (cross-list from cs.IR) [pdf, other]: Title: PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval

Authors: Dawn Lawrie, Efsun Kayi, Eugene Yang, James Mayfield, Douglas W. Oard

Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[134] arXiv:2405.00949 (cross-list from cs.LG) [pdf, other]: Title: The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA

Authors: Lee Youngmin, Lang S.I.D. Andrew, Cai Duoduo, Wheat R. Stephen

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Chemical Physics (physics.chem-ph); Biomolecules (q-bio.BM)
[135] arXiv:2405.00942 (cross-list from cs.CV) [pdf, other]: Title: LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs

Authors: Somesh Singh, Harini S I, Yaman K Singla, Veeky Baths, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[136] arXiv:2405.00899 (cross-list from cs.HC) [pdf, other]: Title: Characterising the Creative Process in Humans and Large Language Models

Authors: Surabhi S. Nath, Peter Dayan, Claire Stevenson

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[137] arXiv:2405.00740 (cross-list from cs.CV) [pdf, other]: Title: Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Authors: Samuel Lavoie, Polina Kirichenko, Mark Ibrahim, Mahmoud Assran, Andrew Gordon Wildon, Aaron Courville, Nicolas Ballas

Comments: 14 pages, 8 figures, 7 tables, to be published at ICML2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[138] arXiv:2405.00693 (cross-list from cs.RO) [pdf, other]: Title: Large Language Models for Human-Robot Interaction: Opportunities and Risks

Authors: Jesse Atuhurra

Subjects: Robotics (cs.RO); Computation and Language (cs.CL)
[139] arXiv:2405.00688 (cross-list from cs.RO) [pdf, ps, other]: Title: Understanding Social Perception, Interactions, and Safety Aspects of Sidewalk Delivery Robots Using Sentiment Analysis

Authors: Yuchen Du, Tho V. Le

Comments: 34 pages, 7 figures, 2 tables

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[140] arXiv:2405.00522 (cross-list from econ.GN) [pdf, other]: Title: DAM: A Universal Dual Attention Mechanism for Multimodal Timeseries Cryptocurrency Trend Forecasting

Authors: Yihang Fu, Mingyu Zhou, Luyao Zhang

Subjects: General Economics (econ.GN); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computational Finance (q-fin.CP)

Thu, 2 May 2024

[141] arXiv:2405.00664 [pdf, other]: Title: Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3

Authors: Junsang Yoon, Akshat Gupta, Gopala Anumanchipalli

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[142] arXiv:2405.00659 [pdf, other]: Title: NLU-STR at SemEval-2024 Task 1: Generative-based Augmentation and Encoder-based Scoring for Semantic Textual Relatedness

Authors: Sanad Malaysha, Mustafa Jarrar, Mohammed Khalilia

Subjects: Computation and Language (cs.CL)
[143] arXiv:2405.00657 [pdf, other]: Title: RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization

Authors: Dongqi Pu, Vera Demberg

Comments: NAACL 2024 Main & Long Conference Paper (Oral Presentation)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[144] arXiv:2405.00632 [pdf, other]: Title: When Quantization Affects Confidence of Large Language Models?

Authors: Irina Proskurina, Luc Brun, Guillaume Metzler, Julien Velcin

Comments: Accepted to NAACL 2024 Findings

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[145] arXiv:2405.00622 [pdf, other]: Title: Causal Evaluation of Language Models

Authors: Sirui Chen, Bo Peng, Meiqi Chen, Ruiqi Wang, Mengying Xu, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Yu Qiao, Chaochao Lu

Comments: 315 pages, 230 figures, 21 tables. Project website: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[146] arXiv:2405.00611 [pdf, other]: Title: Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling

Authors: Yida Mu, Peizhen Bai, Kalina Bontcheva, Xingyi Song

Subjects: Computation and Language (cs.CL)
[147] arXiv:2405.00602 [pdf, other]: Title: Investigating Automatic Scoring and Feedback using Large Language Models

Authors: Gloria Ashiya Katuka, Alexander Gain, Yen-Yun Yu

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[148] arXiv:2405.00588 [pdf, other]: Title: Are Models Biased on Text without Gender-related Language?

Authors: Catarina G Belém, Preethi Seshadri, Yasaman Razeghi, Sameer Singh

Comments: In International Conference on Learning Representations 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[149] arXiv:2405.00578 [pdf, other]: Title: The Real, the Better: Aligning Large Language Models with Online Human Behaviors

Authors: Guanying Jiang, Lingyong Yan, Haibo Shi, Dawei Yin

Comments: 11 pages, 6 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[150] arXiv:2405.00557 [pdf, other]: Title: Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment

Authors: Zhili Liu, Yunhao Gou, Kai Chen, Lanqing Hong, Jiahui Gao, Fei Mi, Yu Zhang, Zhenguo Li, Xin Jiang, Qun Liu, James T. Kwok

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[151] arXiv:2405.00543 [pdf, other]: Title: New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis

Authors: Quy Hoang Nguyen, Minh-Van Truong Nguyen, Kiet Van Nguyen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[152] arXiv:2405.00536 [pdf, other]: Title: A Legal Framework for Natural Language Processing Model Training in Portugal

Authors: Rúben Almeida, Evelin Amorim

Comments: LEGAL2024 Legal and Ethical Issues in Human Language Technologies, LREC 2024

Subjects: Computation and Language (cs.CL); Emerging Technologies (cs.ET)
[153] arXiv:2405.00492 [pdf, other]: Title: Is Temperature the Creativity Parameter of Large Language Models?

Authors: Max Peeperkorn, Tom Kouwenhoven, Dan Brown, Anna Jordanous

Comments: To be published in the Proceedings of the 15th International Conference on Computational Creativity (ICCC'24), 8 pages, 2 figures, 2 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[154] arXiv:2405.00467 [pdf, other]: Title: Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing

Authors: KV Aditya Srivatsa, Kaushal Kumar Maurya, Ekaterina Kochmar

Comments: Accepted to Workshop on Insights from Negative Results in NLP 2024 (co-located with NAACL 2024)

Subjects: Computation and Language (cs.CL)
[155] arXiv:2405.00465 [pdf, other]: Title: BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine

Authors: Mingchen Li, Halil Kilicoglu, Hua Xu, Rui Zhang

Subjects: Computation and Language (cs.CL)
[156] arXiv:2405.00402 [pdf, other]: Title: Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models

Authors: Leonardo Ranaldi, Andrè Freitas

Subjects: Computation and Language (cs.CL)
[157] arXiv:2405.00390 [pdf, other]: Title: CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models

Authors: Hongzhan Lin, Zixin Chen, Ziyang Luo, Mingfei Cheng, Jing Ma, Guang Chen

Comments: 25 pages, 7 figures, and 18 tables

Subjects: Computation and Language (cs.CL)
[158] arXiv:2405.00361 [pdf, other]: Title: AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts

Authors: Zefang Liu, Jiahua Luo

Subjects: Computation and Language (cs.CL)
[159] arXiv:2405.00332 [pdf, other]: Title: A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Authors: Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[160] arXiv:2405.00321 [pdf, other]: Title: DFKI-NLP at SemEval-2024 Task 2: Towards Robust LLMs Using Data Perturbations and MinMax Training

Authors: Bhuvanesh Verma, Lisa Raithel

Subjects: Computation and Language (cs.CL)
[161] arXiv:2405.00302 [pdf, other]: Title: Generating Feedback-Ladders for Logical Errors in Programming using Large Language Models

Authors: Hasnain Heickal, Andrew Lan

Comments: Published on the 17th EDM 2024 - Posters and Demos Track

Subjects: Computation and Language (cs.CL)
[162] arXiv:2405.00301 [pdf, other]: Title: LITO: Learnable Intervention for Truthfulness Optimization

Authors: Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, Lu Wang

Comments: 14 pages, 5 figures

Subjects: Computation and Language (cs.CL)
[163] arXiv:2405.00291 [pdf, other]: Title: How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses

Authors: Jionghao Lin, Eason Chen, Zeifei Han, Ashish Gurung, Danielle R. Thomas, Wei Tan, Ngoc Dang Nguyen, Kenneth R. Koedinger

Comments: 11 pages, full research paper, EDM 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[164] arXiv:2405.00289 [pdf, other]: Title: Adversarial Attacks and Defense for Conversation Entailment Task

Authors: Zhenning Yang, Ryan Krawec, Liang-Yuan Wu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[165] arXiv:2405.00273 [pdf, other]: Title: Social Life Simulation for Non-Cognitive Skills Learning

Authors: Zihan Yan, Yaohong Xiang, Yun Huang

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[166] arXiv:2405.00263 [pdf, other]: Title: Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge

Authors: Bin Xiao, Chunan Shi, Xiaonan Nie, Fan Yang, Xiangwei Deng, Lei Su, Weipeng Chen, Bin Cui

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[167] arXiv:2405.00253 [pdf, other]: Title: CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification

Authors: Yuchen Tian, Weixiang Yan, Qian Yang, Qian Chen, Wen Wang, Ziyang Luo, Lei Ma

Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[168] arXiv:2405.00216 [pdf, other]: Title: Graphical Reasoning: LLM-based Semi-Open Relation Extraction

Authors: Yicheng Tao, Yiqun Wang, Longju Bai

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[169] arXiv:2405.00208 [pdf, other]: Title: A Primer on the Inner Workings of Transformer-based Language Models

Authors: Javier Ferrando, Gabriele Sarti, Arianna Bisazza, Marta R. Costa-jussà

Subjects: Computation and Language (cs.CL)
[170] arXiv:2405.00204 [pdf, other]: Title: General Purpose Verification for Chain of Thought Prompting

Authors: Robert Vacareanu, Anurag Pratik, Evangelia Spiliopoulou, Zheng Qi, Giovanni Paolini, Neha Anna John, Jie Ma, Yassine Benajiba, Miguel Ballesteros

Comments: 22 pages, preprint

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[171] arXiv:2405.00201 [pdf, other]: Title: SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models

Authors: Samir Arora, Liangliang Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[172] arXiv:2405.00200 [pdf, other]: Title: In-Context Learning with Long-Context Models: An In-Depth Exploration

Authors: Amanda Bertsch, Maor Ivgi, Uri Alon, Jonathan Berant, Matthew R. Gormley, Graham Neubig

Comments: 27 pages; preprint

Subjects: Computation and Language (cs.CL)
[173] arXiv:2405.00175 [pdf, other]: Title: Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models

Authors: Alireza Salemi, Hamed Zamani

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[174] arXiv:2405.00155 [pdf, other]: Title: HistNERo: Historical Named Entity Recognition for the Romanian Language

Authors: Andrei-Marius Avram, Andreea Iuga, George-Vlad Manolache, Vlad-Cristian Matei, Răzvan-Gabriel Micliuş, Vlad-Andrei Muntean, Manuel-Petru Sorlescu, Dragoş-Andrei Şerban, Adrian-Dinu Urse, Vasile Păiş, Dumitru-Clementin Cercel

Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)

Subjects: Computation and Language (cs.CL)
[175] arXiv:2405.00134 [pdf, other]: Title: Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns

Authors: Goya van Boven, Yupei Du, Dong Nguyen

Comments: 22 pages, 2 figures. Accepted at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[176] arXiv:2405.00675 (cross-list from cs.LG) [pdf, other]: Title: Self-Play Preference Optimization for Language Model Alignment

Authors: Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu

Comments: 25 pages, 4 figures, 5 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[177] arXiv:2405.00566 (cross-list from cs.CE) [pdf, other]: Title: NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance

Authors: Huan-Yi Su, Ke Wu, Yu-Hao Huang, Wu-Jun Li

Subjects: Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); General Finance (q-fin.GN)
[178] arXiv:2405.00523 (cross-list from cs.AI) [pdf, other]: Title: CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions

Authors: Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang, Jihun Choi

Comments: LREC-COLING 2024 Accepted

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[179] arXiv:2405.00516 (cross-list from cs.LG) [pdf, other]: Title: Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning

Authors: Lucas-Andreï Thil, Mirela Popa, Gerasimos Spanakis

Comments: ACM 2024, Avila Spain. 9 pages

Journal-ref: ACM SAC Conference 2024, Avila, Spain, Article 4, 9 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180] arXiv:2405.00494 (cross-list from cs.AI) [pdf, other]: Title: GOLD: Geometry Problem Solver with Natural Language Description

Authors: Jiaxin Zhang, Yashar Moshfeghi

Comments: Accepted in NAACL 2024 Findings

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[181] arXiv:2405.00489 (cross-list from cs.LG) [pdf, other]: Title: Explainable Automatic Grading with Neural Additive Models

Authors: Aubrey Condor, Zachary Pardos

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Applications (stat.AP)
[182] arXiv:2405.00461 (cross-list from cs.RO) [pdf, other]: Title: Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning

Authors: Huan Xu, Jinlin Wu, Guanglin Cao, Zhen Lei, Zhen Chen, Hongbin Liu

Comments: ICRA 2024 Full-day Workshop: C4SR+: Continuum, Compliant, Cooperative, Cognitive

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[183] arXiv:2405.00449 (cross-list from cs.LG) [pdf, other]: Title: RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models

Authors: Mohamed Manzour Hussien, Angie Nataly Melo, Augusto Luis Ballardini, Carlota Salinas Maldonado, Rubén Izquierdo, Miguel Ángel Sotelo

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Neural and Evolutionary Computing (cs.NE)
[184] arXiv:2405.00438 (cross-list from cs.LG) [pdf, other]: Title: MetaRM: Shifted Distributions Alignment via Meta-Learning

Authors: Shihan Dou, Yan Liu, Enyu Zhou, Tianlong Li, Haoxiang Jia, Limao Xiong, Xin Zhao, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

Comments: 11 pages, 6 figures. arXiv admin note: text overlap with arXiv:2401.06080

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[185] arXiv:2405.00123 (cross-list from cs.LG) [pdf, other]: Title: Graph Neural Network Approach to Semantic Type Detection in Tables

Authors: Ehsan Hoseinzade, Ke Wang

Journal-ref: In Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 121-133. Singapore: Springer Nature Singapore, 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[186] arXiv:2405.00099 (cross-list from cs.AI) [pdf, other]: Title: Creative Beam Search

Authors: Giorgio Franceschelli, Mirco Musolesi

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[187] arXiv:2405.00021 (cross-list from cs.CV) [pdf, other]: Title: SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials

Authors: Wonjoong Kim, Sangwu Park, Yeonjun In, Seokwon Han, Chanyoung Park

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 1 May 2024

[188] arXiv:2404.19737 [pdf, other]: Title: Better & Faster Large Language Models via Multi-token Prediction

Authors: Fabian Gloeckle, Badr Youbi Idrissi, Baptiste Rozière, David Lopez-Paz, Gabriel Synnaeve

Subjects: Computation and Language (cs.CL)
[189] arXiv:2404.19733 [pdf, other]: Title: Iterative Reasoning Preference Optimization

Authors: Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho, He He, Sainbayar Sukhbaatar, Jason Weston

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[190] arXiv:2404.19714 [pdf, other]: Title: ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents

Authors: Hoang-Thang Ta, Abu Bakar Siddiqur Rahman, Lotfollah Najjar, Alexander Gelbukh

Comments: 4 pages

Subjects: Computation and Language (cs.CL)
[191] arXiv:2404.19713 [pdf, ps, other]: Title: Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models

Authors: Scott Sumpter

Comments: 22 pages but 12 are appendices which are examples of the main text. 3 figures, 4 tables

Subjects: Computation and Language (cs.CL)
[192] arXiv:2404.19705 [pdf, other]: Title: When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively

Authors: Tiziano Labruna, Jon Ander Campos, Gorka Azkune

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[193] arXiv:2404.19597 [pdf, other]: Title: Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Authors: Xuanli He, Jun Wang, Qiongkai Xu, Pasquale Minervini, Pontus Stenetorp, Benjamin I. P. Rubinstein, Trevor Cohn

Comments: work in progress

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[194] arXiv:2404.19563 [pdf, other]: Title: RepEval: Effective Text Evaluation with LLM Representation

Authors: Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu Zhou

Subjects: Computation and Language (cs.CL)
[195] arXiv:2404.19553 [pdf, other]: Title: Extending Llama-3's Context Ten-Fold Overnight

Authors: Peitian Zhang, Ninglu Shao, Zheng Liu, Shitao Xiao, Hongjin Qian, Qiwei Ye, Zhicheng Dou

Subjects: Computation and Language (cs.CL)
[196] arXiv:2404.19543 [pdf, other]: Title: RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing

Authors: Yucheng Hu, Yuxing Lu

Comments: 30 pages, 7 figures. Draft version 1

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[197] arXiv:2404.19509 [pdf, other]: Title: Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom

Authors: Shisen Yue, Siyuan Song, Xinyuan Cheng, Hai Hu

Comments: 14 pages, 8 tables and 5 figures

Subjects: Computation and Language (cs.CL)
[198] arXiv:2404.19505 [pdf, other]: Title: Context-Aware Machine Translation with Source Coreference Explanation

Authors: Huy Hien Vu, Hidetaka Kamigaito, Taro Watanabe

Comments: Accepted to TACL. This is a pre-MIT Press publication version

Subjects: Computation and Language (cs.CL)
[199] arXiv:2404.19486 [pdf, other]: Title: Safe Training with Sensitive In-domain Data: Leveraging Data Fragmentation To Mitigate Linkage Attacks

Authors: Mariia Ignashina, Julia Ive

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[200] arXiv:2404.19482 [pdf, other]: Title: FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking

Authors: Vinay Setty

Comments: Accepted in SIGIR 2024 (demo track)

Subjects: Computation and Language (cs.CL)
[201] arXiv:2404.19442 [pdf, other]: Title: Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages

Authors: David Ifeoluwa Adelani, A. Seza Doğruöz, Iyanuoluwa Shode, Anuoluwapo Aremu

Comments: Working paper

Subjects: Computation and Language (cs.CL)
[202] arXiv:2404.19432 [pdf, other]: Title: Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships

Authors: D. Panas, S. Seth, V. Belle

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[203] arXiv:2404.19430 [pdf, other]: Title: Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation

Authors: Aleksei Dorkin, Kairit Sirts

Comments: Accepted to *SEM 2024

Subjects: Computation and Language (cs.CL)
[204] arXiv:2404.19409 [pdf, other]: Title: Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning

Authors: Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier Pietquin

Subjects: Computation and Language (cs.CL)
[205] arXiv:2404.19369 [pdf, ps, other]: Title: Evaluating Telugu Proficiency in Large Language Models_ A Comparative Analysis of ChatGPT and Gemini

Authors: Katikela Sreeharsha Kishore, Rahimanuddin Shaik

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[206] arXiv:2404.19364 [pdf, other]: Title: Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models

Authors: Yunhao Zhang, Shaonan Wang, Xinyi Dong, Jiajun Yu, Chengqing Zong

Subjects: Computation and Language (cs.CL)
[207] arXiv:2404.19363 [pdf, other]: Title: Expressivity and Speech Synthesis

Authors: Andreas Triantafyllopoulos, Björn W. Schuller

Comments: Invited contribution. Under review

Subjects: Computation and Language (cs.CL)
[208] arXiv:2404.19359 [pdf, other]: Title: Evaluating Lexicon Incorporation for Depression Symptom Estimation

Authors: Kirill Milintsevich, Gaël Dias, Kairit Sirts

Comments: Accepted to Clinical NLP workshop at NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[209] arXiv:2404.19335 [pdf, other]: Title: StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation

Authors: Xiaoming Liu, Chen Liu, Zhaohan Zhang, Chengzhengxu Li, Longtian Wang, Yu Lan, Chao Shen

Comments: Submitted to ACL 2024

Subjects: Computation and Language (cs.CL)
[210] arXiv:2404.19328 [pdf, other]: Title: Computational Approaches for Integrating out Subjectivity in Cognate Synonym Selection

Authors: Luise Häuser, Gerhard Jäger, Alexandros Stamatakis

Comments: Experiments available on GitHub (this https URL, this https URL)

Subjects: Computation and Language (cs.CL); Populations and Evolution (q-bio.PE)
[211] arXiv:2404.19319 [pdf, other]: Title: Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

Authors: Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024

Subjects: Computation and Language (cs.CL)
[212] arXiv:2404.19316 [pdf, other]: Title: QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

Authors: Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao

Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

Subjects: Computation and Language (cs.CL)
[213] arXiv:2404.19315 [pdf, other]: Title: Modeling Orthographic Variation in Occitan's Dialects

Authors: Zachary William Hopton (Language and Space Lab, University of Zurich), Noëmi Aepli (Department of Computational Linguistics, University of Zurich)

Comments: Accepted at VarDial 2024: The Eleventh Workshop on NLP for Similar Languages, Varieties and Dialects

Subjects: Computation and Language (cs.CL)
[214] arXiv:2404.19310 [pdf, other]: Title: Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation

Authors: Eyal Liron Dolev, Clemens Fidel Lutz, Noëmi Aepli

Comments: Accepted to VarDial 2024 (the eleventh Workshop on NLP for Similar Languages, Varieties and Dialects 2024), Mexico City

Subjects: Computation and Language (cs.CL)
[215] arXiv:2404.19296 [pdf, other]: Title: Octopus v4: Graph of language models

Authors: Wei Chen, Zhiyuan Li

Subjects: Computation and Language (cs.CL)
[216] arXiv:2404.19260 [pdf, ps, other]: Title: Aspect and Opinion Term Extraction Using Graph Attention Network

Authors: Abir Chakraborty

Subjects: Computation and Language (cs.CL)
[217] arXiv:2404.19254 [pdf, other]: Title: Suvach -- Generated Hindi QA benchmark

Authors: Vaishak Narayanan, Prabin Raj KP, Saifudheen Nouphal

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[218] arXiv:2404.19252 [pdf, other]: Title: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts

Authors: Cuong Nhat Vo, Khanh Bao Huynh, Son T. Luu, Trong-Hop Do

Subjects: Computation and Language (cs.CL)
[219] arXiv:2404.19245 [pdf, other]: Title: HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Authors: Chunlin Tian, Zhan Shi, Zhijiang Guo, Li Li, Chengzhong Xu

Comments: 19 pages, 7 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[220] arXiv:2404.19232 [pdf, other]: Title: GRAMMAR: Grounded and Modular Methodology for Assessment of Domain-Specific Retrieval-Augmented Language Model

Authors: Xinzhe Li, Ming Liu, Shang Gao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[221] arXiv:2404.19192 [pdf, other]: Title: Mix of Experts Language Model for Named Entity Recognition

Authors: Xinwei Chen, Kun Li, Tianyou Song, Jiangjian Guo

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[222] arXiv:2404.19178 [pdf, other]: Title: Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics

Authors: James A. Michaelov, Catherine Arnett, Benjamin K. Bergen

Subjects: Computation and Language (cs.CL)
[223] arXiv:2404.19175 [pdf, other]: Title: Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset

Authors: Zhihao Zhang, Feiqi Cao, Yingbin Mo, Yiran Zhang, Josiah Poon, Caren Han

Subjects: Computation and Language (cs.CL)
[224] arXiv:2404.19159 [pdf, other]: Title: What Drives Performance in Multilingual Language Models?

Authors: Sina Bagheri Nezhad, Ameeta Agrawal

Comments: Accepted at VarDial @ NAACL 2024

Subjects: Computation and Language (cs.CL)
[225] arXiv:2404.19154 [pdf, other]: Title: RTF: Region-based Table Filling Method for Relational Triple Extraction

Authors: Ning An, Lei Hei, Yong Jiang, Weiping Meng, Jingjing Hu, Boran Huang, Feiliang Ren

Comments: Rejected by EMNLP 2023

Subjects: Computation and Language (cs.CL)
[226] arXiv:2404.19124 [pdf, other]: Title: Accelerating Production LLMs with Combined Token/Embedding Speculators

Authors: Davis Wertheimer, Joshua Rosenkranz, Thomas Parnell, Sahil Suneja, Pavithra Ranganathan, Raghu Ganti, Mudhakar Srivatsa

Subjects: Computation and Language (cs.CL)
[227] arXiv:2404.19119 [pdf, ps, other]: Title: Effects of Added Emphasis and Pause in Audio Delivery of Health Information

Authors: Arif Ahmed (1), Gondy Leroy (1), Stephen A. Rains (1), Philip Harber (1), David Kauchak (2), Prosanta Barai (1) ((1) The University of Arizona, (2) Pomona College)

Comments: This manuscript is accepted to American Medical Informatics Association summit, 2024

Subjects: Computation and Language (cs.CL)
[228] arXiv:2404.19094 [pdf, other]: Title: In-Context Symbolic Regression: Leveraging Language Models for Function Discovery

Authors: Matteo Merler, Nicola Dainese, Katsiaryna Haitsiukevich

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[229] arXiv:2404.19063 [pdf, other]: Title: SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications

Authors: Liang Xu, Lei Zhu, Yaotong Wu, Hang Xue

Comments: 11 pages, 19 figures, and tables

Subjects: Computation and Language (cs.CL)
[230] arXiv:2404.19055 [pdf, other]: Title: Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models

Authors: Houjun Liu

Comments: 7 pages, 2 figures

Subjects: Computation and Language (cs.CL)
[231] arXiv:2404.19048 [pdf, other]: Title: A Framework for Real-time Safeguarding the Text Generation of Large Language Model

Authors: Ximing Dong, Dayi Lin, Shaowei Wang, Ahmed E. Hassan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[232] arXiv:2404.19007 [pdf, other]: Title: How Did We Get Here? Summarizing Conversation Dynamics

Authors: Yilun Hua, Nicholas Chernogor, Yuzhe Gu, Seoyeon Julie Jeong, Miranda Luo, Cristian Danescu-Niculescu-Mizil

Comments: To appear in the Proceedings of NAACL 2024. Data available in ConvoKit this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[233] arXiv:2404.18988 [pdf, other]: Title: Markovian Agents for Truthful Language Modeling

Authors: Scott Viteri, Max Lamparth, Peter Chatain, Clark Barrett

Comments: 21 pages, 6 figures

Subjects: Computation and Language (cs.CL)
[234] arXiv:2404.18977 [pdf, other]: Title: Computational Job Market Analysis with Natural Language Processing

Authors: Mike Zhang

Comments: Ph.D. Thesis (315 total pages, 52 figures). The thesis slightly modified with this https URL ISBN (electronic): 978-87-7949-414-5

Subjects: Computation and Language (cs.CL)
[235] arXiv:2404.18971 [pdf, other]: Title: Credible, Unreliable or Leaked?: Evidence Verification for Enhanced Automated Fact-checking

Authors: Zacharias Chrysidis, Stefanos-Iordanis Papadopoulos, Symeon Papadopoulos, Panagiotis C. Petrantonakis

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
[236] arXiv:2404.18942 [pdf, other]: Title: GuideWalk -- Heterogeneous Data Fusion for Enhanced Learning -- A Multiclass Document Classification Case

Authors: Sarmad N. Mohammed, Semra Gündüç

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[237] arXiv:2404.19753 (cross-list from cs.CV) [pdf, other]: Title: DOCCI: Descriptions of Connected and Contrasting Images

Authors: Yasumasa Onoe, Sunayana Rane, Zachary Berger, Yonatan Bitton, Jaemin Cho, Roopal Garg, Alexander Ku, Zarana Parekh, Jordi Pont-Tuset, Garrett Tanzer, Su Wang, Jason Baldridge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[238] arXiv:2404.19721 (cross-list from cs.AI) [pdf, ps, other]: Title: PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games

Authors: Steph Buongiorno, Lawrence Jake Klinkert, Tanishq Chawla, Zixin Zhuang, Corey Clark

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[239] arXiv:2404.19708 (cross-list from cs.LG) [pdf, other]: Title: Harmonic LLMs are Trustworthy

Authors: Nicholas S. Kersting, Mohammad Rahman, Suchismitha Vedala, Yang Wang

Comments: 15 pages, 4 figures, 14 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[240] arXiv:2404.19696 (cross-list from cs.CV) [pdf, other]: Title: Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners

Authors: Chun Feng, Joy Hsu, Weiyu Liu, Jiajun Wu

Comments: CVPR 2024. The first two authors contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[241] arXiv:2404.19484 (cross-list from cs.LG) [pdf, other]: Title: More Compute Is What You Need

Authors: Zhen Guo

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[242] arXiv:2404.19360 (cross-list from cs.CV) [pdf, other]: Title: Large Language Model Informed Patent Image Retrieval

Authors: Hao-Cheng Lo, Jung-Mei Chu, Jieh Hsiang, Chun-Chieh Cho

Comments: 8 pages. Under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[243] arXiv:2404.19318 (cross-list from cs.SE) [pdf, other]: Title: Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores

Authors: Yuvraj Virk, Premkumar Devanbu, Toufique Ahmed

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[244] arXiv:2404.19317 (cross-list from cs.CV) [pdf, other]: Title: Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition

Authors: Solène Tarride, Christopher Kermorvant

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[245] arXiv:2404.19234 (cross-list from cs.AI) [pdf, other]: Title: Multi-hop Question Answering over Knowledge Graphs using Large Language Models

Authors: Abir Chakraborty

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Databases (cs.DB)
[246] arXiv:2404.19221 (cross-list from cs.CV) [pdf, other]: Title: Transcrib3D: 3D Referring Expression Resolution through Large Language Models

Authors: Jiading Fang, Xiangshan Tan, Shengjie Lin, Igor Vasiljevic, Vitor Guizilini, Hongyuan Mei, Rares Ambrus, Gregory Shakhnarovich, Matthew R Walter

Comments: CORLW 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[247] arXiv:2404.19128 (cross-list from cs.CV) [pdf, other]: Title: Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM

Authors: Navid Rajabi, Jana Kosecka

Comments: Accepted to CVPR 2024, Second Workshop on Foundation Models (WFM)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[248] arXiv:2404.19071 (cross-list from cs.HC) [pdf, other]: Title: Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP

Authors: Sanjana Gautam, Mukund Srinath

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[249] arXiv:2404.19065 (cross-list from cs.AI) [pdf, other]: Title: HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models

Authors: Gabriel Sarch, Sahil Somani, Raghav Kapoor, Michael J. Tarr, Katerina Fragkiadaki

Comments: Videos and code this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[250] arXiv:2404.18976 (cross-list from cs.LG) [pdf, other]: Title: Foundations of Multisensory Artificial Intelligence

Authors: Paul Pu Liang

Comments: CMU Machine Learning Department PhD Thesis

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[251] arXiv:2404.18963 (cross-list from cs.LG) [pdf, other]: Title: RE-GrievanceAssist: Enhancing Customer Experience through ML-Powered Complaint Management

Authors: Venkatesh C, Harshit Oberoi, Anurag Kumar Pandey, Anil Goyal, Nikhil Sikka

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

Tue, 30 Apr 2024 (showing first 89 of 99 entries)

[252] arXiv:2404.18923 [pdf, other]: Title: Holmes: Benchmark the Linguistic Competence of Language Models

Authors: Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou, Iryna Gurevych

Subjects: Computation and Language (cs.CL)
[253] arXiv:2404.18911 [pdf, other]: Title: Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Authors: Fangcheng Liu, Yehui Tang, Zhenhua Liu, Yunsheng Ni, Kai Han, Yunhe Wang

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[254] arXiv:2404.18880 [pdf, ps, other]: Title: Spivavtor: An Instruction Tuned Ukrainian Text Editing Model

Authors: Aman Saini, Artem Chernodub, Vipul Raheja, Vivek Kulkarni

Comments: Accepted to UNLP Workshop 2024

Subjects: Computation and Language (cs.CL)
[255] arXiv:2404.18870 [pdf, other]: Title: More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness

Authors: Aaron J. Li, Satyapriya Krishna, Himabindu Lakkaraju

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[256] arXiv:2404.18865 [pdf, other]: Title: Truth-value judgment in language models: belief directions are context sensitive

Authors: Stefan F. Schouten, Peter Bloem, Ilia Markov, Piek Vossen

Subjects: Computation and Language (cs.CL)
[257] arXiv:2404.18851 [pdf, other]: Title: A Comprehensive Rubric for Annotating Pathological Speech

Authors: Mario Corrales-Astorgano, David Escudero-Mancebo, Lourdes Aguilar, Valle Flores-Lucas, Valentín Cardeñoso-Payo, Carlos Vivaracho-Pascual, César González-Ferreras

Comments: Submitted to LREC-Coling 2024

Subjects: Computation and Language (cs.CL)
[258] arXiv:2404.18832 [pdf, other]: Title: It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments

Authors: Petter Mæhlum, David Samuel, Rebecka Maria Norman, Elma Jelin, Øyvind Andresen Bjertnæs, Lilja Øvrelid, Erik Velldal

Subjects: Computation and Language (cs.CL)
[259] arXiv:2404.18824 [pdf, other]: Title: Benchmarking Benchmark Leakage in Large Language Models

Authors: Ruijie Xu, Zengzhi Wang, Run-Ze Fan, Pengfei Liu

Comments: 30 pages; Homepage: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[260] arXiv:2404.18810 [pdf, other]: Title: Unknown Script: Impact of Script on Cross-Lingual Transfer

Authors: Wondimagegnhue Tsegaye Tufa, Ilia Markov, Piek Vossen

Comments: Paper accepted to NAACL Student Research Workshop (SRW) 2024

Subjects: Computation and Language (cs.CL)
[261] arXiv:2404.18796 [pdf, other]: Title: Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Authors: Pat Verga, Sebastian Hofstatter, Sophia Althammer, Yixuan Su, Aleksandra Piktus, Arkady Arkhangorodsky, Minjie Xu, Naomi White, Patrick Lewis

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[262] arXiv:2404.18784 [pdf, other]: Title: Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input

Authors: Tessa Masis, Brendan O'Connor

Comments: NLP+CSS workshop at NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[263] arXiv:2404.18759 [pdf, ps, other]: Title: Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective

Authors: Juraj Vladika, Stephen Meisenbacher, Martina Preis, Alexandra Klymenko, Florian Matthes

Comments: 10 pages, 6 tables, 30th Americas Conference on Information Systems (AMCIS 2024)

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[264] arXiv:2404.18739 [pdf, other]: Title: Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification

Authors: Artem Abzaliev, Humberto Pérez Espinosa, Rada Mihalcea

Comments: to be published in LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[265] arXiv:2404.18726 [pdf, other]: Title: The Constant in HATE: Analyzing Toxicity in Reddit across Topics and Languages

Authors: Wondimagegnhue Tsegaye Tufa, Ilia Markov, Piek Vossen

Comments: Accepted to TRAC 2024

Subjects: Computation and Language (cs.CL)
[266] arXiv:2404.18708 [pdf, other]: Title: Iconic Gesture Semantics

Authors: Andy Lücking, Alexander Henlein, Alexander Mehler

Comments: 39 pages, 28 figures, under revision

Subjects: Computation and Language (cs.CL)
[267] arXiv:2404.18684 [pdf, other]: Title: Work Smarter...Not Harder: Efficient Minimization of Dependency Length in SOV Languages

Authors: Sidharth Ranjan, Titus von der Malsburg

Comments: Accepted at CogSci-2024 as talk with full paper publication

Subjects: Computation and Language (cs.CL); Theoretical Economics (econ.TH); Optimization and Control (math.OC)
[268] arXiv:2404.18655 [pdf, other]: Title: Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods

Authors: Haeun Yu, Pepa Atanasova, Isabelle Augenstein

Comments: 14 pages, 6 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[269] arXiv:2404.18624 [pdf, other]: Title: Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?

Authors: Letitia Parcalabescu, Anette Frank

Comments: 27 pages, from which 12 pages contain the text of the main paper. 8 figures, 11 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[270] arXiv:2404.18615 [pdf, other]: Title: The SAMER Arabic Text Simplification Corpus

Authors: Bashar Alhafni, Reem Hazim, Juan Piñeros Liberato, Muhamed Al Khalil, Nizar Habash

Comments: Accepted to LREC-COLING 2024. 15 pages, 6 tables, 1 figure

Subjects: Computation and Language (cs.CL)
[271] arXiv:2404.18585 [pdf, other]: Title: FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question Answering

Authors: Wei Zhou, Mohsen Mesgar, Heike Adel, Annemarie Friedrich

Comments: Accepted at NAACL 2024

Subjects: Computation and Language (cs.CL)
[272] arXiv:2404.18570 [pdf, other]: Title: Analyzing Semantic Change through Lexical Replacements

Authors: Francesco Periti, Pierluigi Cassotti, Haim Dubossarsky, Nina Tahmasebi

Subjects: Computation and Language (cs.CL)
[273] arXiv:2404.18564 [pdf, other]: Title: Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning

Authors: Wen-Yu Chang, Yun-Nung Chen

Comments: arXiv admin note: substantial text overlap with arXiv:2308.14266

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[274] arXiv:2404.18557 [pdf, other]: Title: Can GPT-4 do L2 analytic assessment?

Authors: Stefano Bannò, Hari Krishna Vydana, Kate M. Knill, Mark J. F. Gales

Comments: Accepted for the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2024)

Subjects: Computation and Language (cs.CL)
[275] arXiv:2404.18543 [pdf, other]: Title: Time Machine GPT

Authors: Felix Drinkall, Eghbal Rahimikia, Janet B. Pierrehumbert, Stefan Zohren

Comments: NAACL Findings 2024

Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[276] arXiv:2404.18534 [pdf, other]: Title: Evaluating and Mitigating Linguistic Discrimination in Large Language Models

Authors: Guoliang Dong, Haoyu Wang, Jun Sun, Xinyu Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Software Engineering (cs.SE)
[277] arXiv:2404.18532 [pdf, other]: Title: MileBench: Benchmarking MLLMs in Long Context

Authors: Dingjie Song, Shunian Chen, Guiming Hardy Chen, Fei Yu, Xiang Wan, Benyou Wang

Comments: 29 pages, 13 figures, 14 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[278] arXiv:2404.18510 [pdf, other]: Title: Explainability of Machine Learning Approaches in Forensic Linguistics: A Case Study in Geolinguistic Authorship Profiling

Authors: Dana Roemling, Yves Scherrer, Aleksandra Miletic

Subjects: Computation and Language (cs.CL)
[279] arXiv:2404.18466 [pdf, other]: Title: HFT: Half Fine-Tuning for Large Language Models

Authors: Tingfeng Hui, Zhenyu Zhang, Shuohuan Wang, Weiran Xu, Yu Sun, Hua Wu

Comments: Work in progress

Subjects: Computation and Language (cs.CL)
[280] arXiv:2404.18460 [pdf, other]: Title: Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in

Authors: Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[281] arXiv:2404.18443 [pdf, other]: Title: BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers

Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl Yang

Comments: Work in progress. The model and data will be uploaded to \url{this https URL}

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Quantitative Methods (q-bio.QM)
[282] arXiv:2404.18410 [pdf, other]: Title: Mixture-of-Instructions: Comprehensive Alignment of a Large Language Model through the Mixture of Diverse System Prompting Instructions

Authors: Bowen Xu, Shaoyu Wu, Kai Liu, Lulu Hu

Subjects: Computation and Language (cs.CL)
[283] arXiv:2404.18398 [pdf, other]: Title: MM-TTS: A Unified Framework for Multimodal, Prompt-Induced Emotional Text-to-Speech Synthesis

Authors: Xiang Li, Zhi-Qi Cheng, Jun-Yan He, Xiaojiang Peng, Alexander G. Hauptmann

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[284] arXiv:2404.18384 [pdf, other]: Title: Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions

Authors: Jordan Meadows, Tamsin James, Andre Freitas

Subjects: Computation and Language (cs.CL)
[285] arXiv:2404.18371 [pdf, other]: Title: QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and Beyond

Authors: Tomoki Fukuma, Koki Noda, Toshihide Ubukata Kousuke Hoso, Yoshiharu Ichikawa, Kyosuke Kambe, Yu Masubuch, Fujio Toriumi

Comments: Under review as a conference paper at COLM 2024

Subjects: Computation and Language (cs.CL)
[286] arXiv:2404.18359 [pdf, other]: Title: FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models

Authors: Wei Li, Ren Ma, Jiang Wu, Chenya Gu, Jiahui Peng, Jinyang Len, Songyang Zhang, Hang Yan, Dahua Lin, Conghui He

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[287] arXiv:2404.18286 [pdf, other]: Title: Comparing LLM prompting with Cross-lingual transfer performance on Indigenous and Low-resource Brazilian Languages

Authors: David Ifeoluwa Adelani, A. Seza Doğruöz, André Coneglian, Atul Kr. Ojha

Comments: Accepted to the Americas NLP Workshop at NAACL 2024 (this https URL)

Subjects: Computation and Language (cs.CL)
[288] arXiv:2404.18276 [pdf, ps, other]: Title: Bias Neutralization Framework: Measuring Fairness in Large Language Models with Bias Intelligence Quotient (BiQ)

Authors: Malur Narayan, John Pasmore, Elton Sampaio, Vijay Raghavan, Gabriella Waters

Comments: 41 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[289] arXiv:2404.18271 [pdf, other]: Title: Parameter-Efficient Tuning Large Language Models for Graph Representation Learning

Authors: Qi Zhu, Da Zheng, Xiang Song, Shichang Zhang, Bowen Jin, Yizhou Sun, George Karypis

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[290] arXiv:2404.18264 [pdf, other]: Title: Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin

Authors: Pin-Jie Lin, Merel Scholman, Muhammed Saeed, Vera Demberg

Comments: Accepted to LREC-COLING 2024 Main Conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[291] arXiv:2404.18257 [pdf, other]: Title: Mapping 'when'-clauses in Latin American and Caribbean languages: an experiment in subtoken-based typology

Authors: Nilo Pedrazzini

Comments: 10 pages, 6 figures. To be published in the 2024 Proceedings of the Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP)

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[292] arXiv:2404.18255 [pdf, other]: Title: PatentGPT: A Large Language Model for Intellectual Property

Authors: Zilong Bai, Ruiji Zhang, Linqing Chen, Qijun Cai, Yuan Zhong, Cong Wang, Yan Fang, Jie Fang, Jing Sun, Weikuan Wang, Lizhi Zhou, Haoran Hua, Tian Qiu, Chaochao Wang, Cheng Sun, Jianping Lu, Yixin Wang, Yubin Xia, Meng Hu, Haowen Liu, Peng Xu, Licong Xu, Fu Bian, Xiaolong Gu, Lisha Zhang, Weilei Wang, Changyang Tu

Comments: 19 pages, 9 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[293] arXiv:2404.18243 [pdf, other]: Title: LEGENT: Open Platform for Embodied Agents

Authors: Zhili Cheng, Zhitong Wang, Jinyi Hu, Shengding Hu, An Liu, Yuge Tu, Pengkai Li, Lei Shi, Zhiyuan Liu, Maosong Sun

Comments: Demo Paper

Subjects: Computation and Language (cs.CL)
[294] arXiv:2404.18231 [pdf, other]: Title: From Persona to Personalization: A Survey on Role-Playing Language Agents

Authors: Jiangjie Chen, Xintao Wang, Rui Xu, Siyu Yuan, Yikai Zhang, Wei Shi, Jian Xie, Shuang Li, Ruihan Yang, Tinghui Zhu, Aili Chen, Nianqi Li, Lida Chen, Caiyu Hu, Siye Wu, Scott Ren, Ziquan Fu, Yanghua Xiao

Comments: Preprint

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[295] arXiv:2404.18228 [pdf, other]: Title: TextGram: Towards a better domain-adaptive pretraining

Authors: Sharayu Hiwarkhedkar, Saloni Mittal, Vidula Magdum, Omkar Dhekane, Raviraj Joshi, Geetanjali Kale, Arnav Ladkat

Comments: Accepted at SPELLL 2023

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[296] arXiv:2404.18216 [pdf, other]: Title: L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi

Authors: Saloni Mittal, Vidula Magdum, Omkar Dhekane, Sharayu Hiwarkhedkar, Raviraj Joshi

Comments: Accepted at SPELLL 2023

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[297] arXiv:2404.18191 [pdf, other]: Title: Exploring the Robustness of In-Context Learning with Noisy Labels

Authors: Chen Cheng, Xinzhi Yu, Haodong Wen, Jingsong Sun, Guanzhang Yue, Yihao Zhang, Zeming Wei

Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Optimization and Control (math.OC)
[298] arXiv:2404.18180 [pdf, other]: Title: EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter

Authors: Comfort Eseohen Ilevbare, Jesujoba O. Alabi, David Ifeoluwa Adelani, Firdous Damilola Bakare, Oluwatoyin Bunmi Abiola, Oluwaseyi Adesina Adeyemo

Comments: AfricaNLP workshop @ ICLR2024 and WOAH @ NAACL2024

Subjects: Computation and Language (cs.CL)
[299] arXiv:2404.18154 [pdf, other]: Title: Explaining vague language

Authors: Paul Égré, Benjamin Spector

Subjects: Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT); Information Theory (cs.IT)
[300] arXiv:2404.18085 [pdf, other]: Title: CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model

Authors: Zhengpeng Shi, Haoran Luo

Comments: preprint

Subjects: Computation and Language (cs.CL)
[301] arXiv:2404.18072 [pdf, ps, other]: Title: Contextual Spelling Correction with Language Model for Low-resource Setting

Authors: Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah, Subarna Shakya

Comments: 8 pages

Subjects: Computation and Language (cs.CL)
[302] arXiv:2404.18071 [pdf, ps, other]: Title: Can Perplexity Predict Fine-Tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali

Authors: Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah, Subarna Shakya

Comments: 11 pages

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[303] arXiv:2404.18057 [pdf, other]: Title: Efficient LLM Inference with Kcache

Authors: Qiaozhi He, Zhihua Wu

Comments: Technical Report, 8 pages

Subjects: Computation and Language (cs.CL)
[304] arXiv:2404.18043 [pdf, ps, other]: Title: Utilizing Large Language Models for Information Extraction from Real Estate Transactions

Authors: Yu Zhao, Haoxiang Gao

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[305] arXiv:2404.18040 [pdf, other]: Title: Fashion Recommendation: Outfit Compatibility using GNN

Authors: Samaksh Gulati

Subjects: Computation and Language (cs.CL)
[306] arXiv:2404.18031 [pdf, other]: Title: Quality Estimation with $k$-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation

Authors: Tu Anh Dinh, Tobias Palzer, Jan Niehues

Comments: Accepted to EAMT 2024

Subjects: Computation and Language (cs.CL)
[307] arXiv:2404.17999 [pdf, other]: Title: MediFact at MEDIQA-CORR 2024: Why AI Needs a Human Touch

Authors: Nadia Saeed

Comments: 7 pages, 4 figures, Clinical NLP 2024 Workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[308] arXiv:2404.17991 [pdf, other]: Title: Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension

Authors: Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg

Subjects: Computation and Language (cs.CL)
[309] arXiv:2404.17985 [pdf, other]: Title: Detection of Conspiracy Theories Beyond Keyword Bias in German-Language Telegram Using Large Language Models

Authors: Milena Pustet, Elisabeth Steffen, Helena Mihaljević

Comments: Accepted to the 8th Workshop on Online Abuse and Harms (WOAH), ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[310] arXiv:2404.17975 [pdf, ps, other]: Title: Automating Customer Needs Analysis: A Comparative Study of Large Language Models in the Travel Industry

Authors: Simone Barandoni, Filippo Chiarello, Lorenzo Cascone, Emiliano Marrale, Salvatore Puccio

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[311] arXiv:2404.17968 [pdf, other]: Title: Usefulness of Emotional Prosody in Neural Machine Translation

Authors: Charles Brazier, Jean-Luc Rouas

Comments: 5 pages, In Proceedings of the 11th International Conference on Speech Prosody (SP), Leiden, The Netherlands, 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[312] arXiv:2404.17949 [pdf, other]: Title: Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering

Authors: Chenhao Cui, Yufan Jiang, Shuangzhi Wu, Zhoujun Li

Comments: 10 pages, 1 figures.This article supersedes arXiv:2011.03292

Subjects: Computation and Language (cs.CL)
[313] arXiv:2404.17918 [pdf, other]: Title: I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures

Authors: Timothee Mickus, Raúl Vázquez, Joseph Attieh

Subjects: Computation and Language (cs.CL)
[314] arXiv:2404.17912 [pdf, other]: Title: SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models

Authors: Manav Nitin Kapadnis, Sohan Patnaik, Abhilash Nandy, Sourjyadip Ray, Pawan Goyal, Debdoot Sheet

Comments: 8 pages, 3 figures, 4 tables, Accepted as oral at Clinical NLP workshop at NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[315] arXiv:2404.17897 [pdf, other]: Title: Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language Models

Authors: Zhongzhen Huang, Kui Xue, Yongqi Fan, Linjie Mu, Ruoyu Liu, Tong Ruan, Shaoting Zhang, Xiaofan Zhang

Subjects: Computation and Language (cs.CL)
[316] arXiv:2404.17877 [pdf, ps, other]: Title: PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning

Authors: Yubo Feng, Lishuang Li, Yi Xiang, Xueyang Qin

Comments: NLPCC 2023 Best Student Paper

Journal-ref: Natural Language Processing and Chinese Computing (NLPCC 2023)

Subjects: Computation and Language (cs.CL)
[317] arXiv:2404.17874 [pdf, other]: Title: From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets

Authors: Manuel Tonneau, Diyi Liu, Samuel Fraiberger, Ralph Schroeder, Scott A. Hale, Paul Röttger

Comments: Accepted at WOAH (NAACL 2024)

Subjects: Computation and Language (cs.CL)
[318] arXiv:2404.17862 [pdf, other]: Title: Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum

Authors: Tao Meng, Fuchen Zhang, Yuntao Shou, Wei Ai, Nan Yin, Keqin Li

Comments: 10 pages, 4 figures

Subjects: Computation and Language (cs.CL)
[319] arXiv:2404.17858 [pdf, other]: Title: Revisiting Multi-modal Emotion Learning with Broad State Space Models and Probability-guidance Fusion

Authors: Yuntao Shou, Tao Meng, Fuchen Zhang, Nan Yin, Keqin Li

Comments: 10 pages, 6 figures

Subjects: Computation and Language (cs.CL)
[320] arXiv:2404.17841 [pdf, other]: Title: Toxicity Classification in Ukrainian

Authors: Daryna Dementieva, Valeriia Khylenko, Nikolay Babakov, Georg Groh

Comments: Accepted to WOAH, NAACL, 2024. arXiv admin note: text overlap with arXiv:2404.02043

Subjects: Computation and Language (cs.CL)
[321] arXiv:2404.17835 [pdf, other]: Title: VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition

Authors: Junyi Biana, Weiqi Zhai, Xiaodi Huang, Jiaxuan Zheng, Shanfeng Zhu

Subjects: Computation and Language (cs.CL)
[322] arXiv:2404.17832 [pdf, other]: Title: Evaluation of Few-Shot Learning for Classification Tasks in the Polish Language

Authors: Tsimur Hadeliya, Dariusz Kajtoch

Comments: 34 pages, 3 figures, 10 tables

Subjects: Computation and Language (cs.CL)
[323] arXiv:2404.17809 [pdf, other]: Title: Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction

Authors: Guozheng Li, Peng Wang, Wenjun Ke, Yikai Guo, Ke Ji, Ziyu Shang, Jiajun Liu, Zijie Xu

Comments: IJCAI 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[324] arXiv:2404.17808 [pdf, other]: Title: Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal

Authors: Haoran Lian, Yizhe Xiong, Jianwei Niu, Shasha Mo, Zhenpeng Su, Zijia Lin, Peng Liu, Hui Chen, Guiguang Ding

Subjects: Computation and Language (cs.CL)
[325] arXiv:2404.17807 [pdf, other]: Title: Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors

Authors: Guozheng Li, Peng Wang, Jiajun Liu, Yikai Guo, Ke Ji, Ziyu Shang, Zijie Xu

Comments: IJCAI 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[326] arXiv:2404.17802 [pdf, other]: Title: Empirical Analysis of Dialogue Relation Extraction with Large Language Models

Authors: Guozheng Li, Zijie Xu, Ziyu Shang, Jiajun Liu, Ke Ji, Yikai Guo

Comments: IJCAI 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[327] arXiv:2404.17790 [pdf, other]: Title: Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities

Authors: Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, Naoaki Okazaki

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[328] arXiv:2404.17785 [pdf, other]: Title: Temporal Scaling Law for Large Language Models

Authors: Yizhe Xiong, Xiansheng Chen, Xin Ye, Hui Chen, Zijia Lin, Haoran Lian, Jianwei Niu, Guiguang Ding

Comments: Work in progress

Subjects: Computation and Language (cs.CL)
[329] arXiv:2404.17779 [pdf, other]: Title: Medical Vision-Language Pre-Training for Brain Abnormalities

Authors: Masoud Monajatipoor, Zi-Yi Dou, Aichi Chien, Nanyun Peng, Kai-Wei Chang

Subjects: Computation and Language (cs.CL)
[330] arXiv:2404.17778 [pdf, other]: Title: MRScore: Evaluating Radiology Report Generation with LLM-based Reward System

Authors: Yunyi Liu, Zhanyu Wang, Yingshu Li, Xinyu Liang, Lingqiao Liu, Lei Wang, Luping Zhou

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[331] arXiv:2404.17733 [pdf, other]: Title: Building a Large Japanese Web Corpus for Large Language Models

Authors: Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki

Comments: 17 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[332] arXiv:2404.17729 [pdf, other]: Title: CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving

Authors: Pei Chen, Boran Han, Shuai Zhang

Comments: Accepted to NAACL 2024

Subjects: Computation and Language (cs.CL)
[333] arXiv:2404.17662 [pdf, other]: Title: PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games

Authors: Qinglin Zhu, Runcong Zhao, Jinhua Du, Lin Gui, Yulan He

Subjects: Computation and Language (cs.CL)
[334] arXiv:2404.17642 [pdf, other]: Title: Empowering Large Language Models for Textual Data Augmentation

Authors: Yichuan Li, Kaize Ding, Jianling Wang, Kyumin Lee

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[335] arXiv:2404.18928 (cross-list from cs.CV) [pdf, other]: Title: Stylus: Automatic Adapter Selection for Diffusion Models

Authors: Michael Luo, Justin Wong, Brandon Trabucco, Yanping Huang, Joseph E. Gonzalez, Zhifeng Chen, Ruslan Salakhutdinov, Ion Stoica

Comments: Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[336] arXiv:2404.18922 (cross-list from cs.LG) [pdf, other]: Title: DPO Meets PPO: Reinforced Token Optimization for RLHF

Authors: Han Zhong, Guhao Feng, Wei Xiong, Li Zhao, Di He, Jiang Bian, Liwei Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[337] arXiv:2404.18722 (cross-list from cs.CV) [pdf, ps, other]: Title: Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library

Authors: Solène Tarride, Yoann Schneider, Marie Generali-Lince, Mélodie Boillet, Bastien Abadie, Christopher Kermorvant

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[338] arXiv:2404.18518 (cross-list from cs.DL) [pdf, ps, other]: Title: From ChatGPT, DALL-E 3 to Sora: How has Generative AI Changed Digital Humanities Research and Services?

Authors: Jiangfeng Liu, Ziyi Wang, Jing Xie, Lei Pei

Comments: 21 pages, 3 figures

Subjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[339] arXiv:2404.18470 (cross-list from cs.CE) [pdf, other]: Title: ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance Prediction

Authors: Yupeng Cao, Zhi Chen, Qingyun Pei, Prashant Kumar, K.P. Subbalakshmi, Papa Momar Ndiaye

Comments: 15 pages, 3 figures, 5 tables

Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Risk Management (q-fin.RM); Trading and Market Microstructure (q-fin.TR)
[340] arXiv:2404.18416 (cross-list from cs.AI) [pdf, other]: Title: Capabilities of Gemini Models in Medicine

Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G.T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby, Nenad Tomasev, Jan Freyberg, Charles Lau, Jonas Kemp, Jeremy Lai, Shekoofeh Azizi, Kimberly Kanada, SiWai Man, Kavita Kulkarni, Ruoxi Sun, Siamak Shakeri, Luheng He, Ben Caine, Albert Webson, Natasha Latysheva, Melvin Johnson, Philip Mansfield, Jian Lu, Ehud Rivlin, Jesper Anderson, Bradley Green, Renee Wong, Jonathan Krause, Jonathon Shlens, Ewa Dominowska, S. M. Ali Eslami, Katherine Chou, Claire Cui, Oriol Vinyals, Koray Kavukcuoglu, et al. (12 additional authors not shown)

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Mon, 6 May 2024
Fri, 3 May 2024
Thu, 2 May 2024
Wed, 1 May 2024
Tue, 30 Apr 2024

[ total of 350 entries: 1-340 | 341-350 ]
[ showing 340 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)

> cs > cs.CL

Computation and Language

Authors and titles for recent submissions

Mon, 6 May 2024

Fri, 3 May 2024

Thu, 2 May 2024

Wed, 1 May 2024

Tue, 30 Apr 2024 (showing first 89 of 99 entries)