Computation and Language

Authors and titles for recent submissions, skipping first 103

[ total of 352 entries; showing entries 104-203, 100 entries per page ]

Tue, 16 Apr 2024 (continued, showing last 34 of 137 entries)

[104] arXiv:2404.09385 (cross-list from eess.AS) [pdf, other]: Title: A Large-Scale Evaluation of Speech Foundation Models

Authors: Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee

Comments: The extended journal version for SUPERB and SUPERB-SG. Accepted to TASLP. The arxiv version is further refined

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Signal Processing (eess.SP)
[105] arXiv:2404.09384 (cross-list from cs.SE) [pdf, other]: Title: Tasks People Prompt: A Taxonomy of LLM Downstream Tasks in Software Verification and Falsification Approaches

Authors: Víctor A. Braberman, Flavia Bonomo-Braberman, Yiannis Charalambous, Juan G. Colonna, Lucas C. Cordeiro, Rosiane de Freitas

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[106] arXiv:2404.09375 (cross-list from cs.HC) [pdf, other]: Title: Deceptive Patterns of Intelligent and Interactive Writing Assistants

Authors: Karim Benharrak, Tim Zindulka, Daniel Buschek

Comments: Published as a workshop paper to the In2Writing workshop at CHI 2024

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[107] arXiv:2404.09356 (cross-list from cs.CY) [pdf, other]: Title: LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

Authors: Vishwas Sathish, Hannah Lin, Aditya K Kamath, Anish Nyayachavadi

Comments: 11 total pages, 7 page text, 4 page references, 3 figures (with subfigures), 1 table

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Emerging Technologies (cs.ET)
[108] arXiv:2404.09275 (cross-list from cs.CV) [pdf, other]: Title: TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning

Authors: Quang Minh Dinh, Minh Khoi Ho, Anh Quan Dang, Hung Phong Tran

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[109] arXiv:2404.09249 (cross-list from cs.SE) [pdf, other]: Title: Test Code Generation for Telecom Software Systems using Two-Stage Generative Model

Authors: Mohamad Nabeel, Doumitrou Daniil Nimara, Tahar Zanouda

Comments: 6 pages, 5 figures, Accepted at 1st Workshop on The Impact of Large Language Models on 6G Networks - IEEE International Conference on Communications (ICC) 2024

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[110] arXiv:2404.09248 (cross-list from cs.LG) [pdf, other]: Title: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

Authors: Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[111] arXiv:2404.09173 (cross-list from cs.LG) [pdf, other]: Title: TransformerFAM: Feedback attention is working memory

Authors: Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar

Comments: 24 pages, 12 figures, 14 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[112] arXiv:2404.09155 (cross-list from cs.LG) [pdf, other]: Title: Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding

Authors: Jiang Li, Xiangdong Su, Yeyun Gong, Guanglai Gao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113] arXiv:2404.09123 (cross-list from cs.LG) [pdf, other]: Title: Provable Interactive Learning with Hindsight Instruction Feedback

Authors: Dipendra Misra, Aldo Pacchiano, Robert E. Schapire

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[114] arXiv:2404.09091 (cross-list from cs.IR) [pdf, other]: Title: Semantic In-Domain Product Identification for Search Queries

Authors: Sanat Sharma, Jayant Kumar, Twisha Naik, Zhaoyu Lu, Arvind Srikantan, Tracy Holloway King

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[115] arXiv:2404.09066 (cross-list from cs.CR) [pdf, other]: Title: CodeCloak: A Method for Evaluating and Mitigating Code Leakage by LLM Code Assistants

Authors: Amit Finkman, Eden Bar-Kochva, Avishag Shapira, Dudu Mimran, Yuval Elovici, Asaf Shabtai

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG); Programming Languages (cs.PL)
[116] arXiv:2404.09022 (cross-list from cs.LG) [pdf, other]: Title: Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies

Authors: Benjue Weng

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[117] arXiv:2404.08958 (cross-list from cs.CV) [pdf, other]: Title: AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning

Authors: Yuwei Tang, Zhenyi Lin, Qilong Wang, Pengfei Zhu, Qinghua Hu

Comments: Accepted by CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[118] arXiv:2404.08940 (cross-list from cs.IR) [pdf, other]: Title: Introducing Super RAGs in Mistral 8x7B-v1

Authors: Ayush Thakur, Raghav Gupta

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[119] arXiv:2404.08886 (cross-list from cs.CV) [pdf, other]: Title: EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM

Authors: Henry Peng Zou, Gavin Heqing Yu, Ziwei Fan, Dan Bu, Han Liu, Peng Dai, Dongmei Jia, Cornelia Caragea

Comments: Accepted by NAACL 2024 Industry Track

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[120] arXiv:2404.08885 (cross-list from cs.PL) [pdf, other]: Title: Is Next Token Prediction Sufficient for GPT? Exploration on Code Logic Comprehension

Authors: Mengnan Qi, Yufan Huang, Yongqiang Yao, Maoquan Wang, Bin Gu, Neel Sundaresan

Subjects: Programming Languages (cs.PL); Computation and Language (cs.CL); Machine Learning (cs.LG)
[121] arXiv:2404.08877 (cross-list from cs.SE) [pdf, other]: Title: Aligning LLMs for FL-free Program Repair

Authors: Junjielong Xu, Ying Fu, Shin Hwei Tan, Pinjia He

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[122] arXiv:2404.08846 (cross-list from cs.LG) [pdf, other]: Title: Experimental Design for Active Transductive Inference in Large Language Models

Authors: Subhojyoti Mukherjee, Ge Liu, Aniket Deshmukh, Anusha Lalitha, Yifei Ma, Branislav Kveton

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[123] arXiv:2404.08819 (cross-list from cs.LG) [pdf, other]: Title: The Illusion of State in State-Space Models

Authors: William Merrill, Jackson Petty, Ashish Sabharwal

Comments: Preprint

Subjects: Machine Learning (cs.LG); Computational Complexity (cs.CC); Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL)
[124] arXiv:2404.08801 (cross-list from cs.LG) [pdf, other]: Title: Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Authors: Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang, Jonathan May, Luke Zettlemoyer, Omer Levy, Chunting Zhou

Comments: 9 pages, 6 figures and 8 tables

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[125] arXiv:2404.08793 (cross-list from cs.CR) [pdf, other]: Title: JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models

Authors: Yingchaojie Feng, Zhizhang Chen, Zhining Kang, Sijia Wang, Minfeng Zhu, Wei Zhang, Wei Chen

Comments: Submitted to VIS 2024

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[126] arXiv:2404.08763 (cross-list from cs.LG) [pdf, other]: Title: CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models

Authors: Je-Yong Lee, Donghyun Lee, Genghan Zhang, Mo Tiwari, Azalia Mirhoseini

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[127] arXiv:2404.08727 (cross-list from cs.DB) [pdf, other]: Title: Can LLMs substitute SQL? Comparing Resource Utilization of Querying LLMs versus Traditional Relational Databases

Authors: Xiang Zhang, Khatoon Khedri, Reza Rawassizadeh

Comments: 13 pages, 2 figures, 5 tables

Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[128] arXiv:2404.08720 (cross-list from cs.LG) [pdf, other]: Title: Exploring Contrastive Learning for Long-Tailed Multi-Label Text Classification

Authors: Alexandre Audibert, Aurélien Gauffre, Massih-Reza Amini

Comments: 14 pages, 2 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[129] arXiv:2404.08707 (cross-list from cs.LG) [pdf, other]: Title: Large Language Model Can Continue Evolving From Mistakes

Authors: Haokun Zhao, Haixia Han, Jie Shi, Chengyu Du, Jiaqing Liang, Yanghua Xiao

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[130] arXiv:2404.08692 (cross-list from cs.IR) [pdf, other]: Title: Apollonion: Profile-centric Dialog Agent

Authors: Shangyu Chen, Zibo Zhao, Yuanyuan Zhao, Xiang Li

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[131] arXiv:2404.08677 (cross-list from cs.IR) [pdf, other]: Title: PMG: Personalized Multimodal Generation with Large Language Models

Authors: Xiaoteng Shen, Rui Zhang, Xiaoyan Zhao, Jieming Zhu, Xi Xiao

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[132] arXiv:2404.08675 (cross-list from cs.IR) [pdf, other]: Title: RecGPT: Generative Personalized Prompts for Sequential Recommendation via ChatGPT Training Paradigm

Authors: Yabin Zhang, Wenhui Yu, Erhan Zhang, Xu Chen, Lantao Hu, Peng Jiang, Kun Gai

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[133] arXiv:2404.08672 (cross-list from cs.IR) [pdf, other]: Title: Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

Authors: Hwiyeol Jo, Taiwoo Park, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Hyunwoo Lee, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[134] arXiv:2404.08665 (cross-list from cs.IR) [pdf, other]: Title: Targeted aspect-based emotion analysis to detect opportunities and precaution in financial Twitter messages

Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Ana Barros-Vila, Francisco J. González-Castaño

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG); Social and Information Networks (cs.SI); Trading and Market Microstructure (q-fin.TR)
[135] arXiv:2404.08664 (cross-list from cs.IR) [pdf, other]: Title: Identifying Banking Transaction Descriptions via Support Vector Machine Short-Text Classification Based on a Specialized Labelled Corpus

Authors: Silvia García-Méndez, Milagros Fernández-Gavilanes, Jonathan Juncal-Martínez, Francisco J. González-Castaño, Oscar Barba Seara

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[136] arXiv:2301.07150 (cross-list from cs.RO) [pdf, other]: Title: Embodied Agents for Efficient Exploration and Smart Scene Description

Authors: Roberto Bigazzi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA 2023)

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2007.07268 (cross-list from cs.CV) [pdf, other]: Title: Explore and Explain: Self-supervised Navigation and Recounting

Authors: Roberto Bigazzi, Federico Landi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

Comments: ICPR 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)

Mon, 15 Apr 2024

[138] arXiv:2404.08634 [pdf, other]: Title: Pre-training Small Base LMs with Fewer Tokens

Authors: Sunny Sanyal, Sujay Sanghavi, Alexandros G. Dimakis

Comments: 15 pages, 6 figures, 10 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[139] arXiv:2404.08627 [pdf, other]: Title: Is ChatGPT Transforming Academics' Writing Style?

Authors: Mingmeng Geng, Roberto Trotta

Comments: 15 pages, 19 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Digital Libraries (cs.DL); Machine Learning (cs.LG)
[140] arXiv:2404.08617 [pdf, ps, other]: Title: Synthetic Dataset Creation and Fine-Tuning of Transformer Models for Question Answering in Serbian

Authors: Aleksa Cvetanović, Predrag Tadić

Subjects: Computation and Language (cs.CL)
[141] arXiv:2404.08579 [pdf, other]: Title: Small Models Are (Still) Effective Cross-Domain Argument Extractors

Authors: William Gantt, Aaron Steven White

Comments: ACL Rolling Review Short Paper

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[142] arXiv:2404.08567 [pdf, other]: Title: CATP: Cross-Attention Token Pruning for Accuracy Preserved Multimodal Model Inference

Authors: Ruqi Liao, Chuqing Zhao, Jin Li, Weiqi Feng

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[143] arXiv:2404.08559 [pdf, other]: Title: MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking

Authors: Tianwen Tang, Tong Zhu, Haodong Liu, Yin Bai, Jia Cheng, Wenliang Chen

Comments: Accepted to LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[144] arXiv:2404.08538 [pdf, other]: Title: VertAttack: Taking advantage of Text Classifiers' horizontal vision

Authors: Jonathan Rusert

Comments: 14 pages, 4 figures, accepted to NAACL 2024

Subjects: Computation and Language (cs.CL)
[145] arXiv:2404.08491 [pdf, other]: Title: Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation

Authors: Haozhe Zhao, Zefan Cai, Shuzheng Si, Liang Chen, Yufeng He, Kaikai An, Baobao Chang

Comments: NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[146] arXiv:2404.08488 [pdf, ps, other]: Title: Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian

Authors: Stefano De Paoli

Subjects: Computation and Language (cs.CL)
[147] arXiv:2404.08403 [pdf, other]: Title: Learning representations of learning representations

Authors: Rita González-Márquez, Dmitry Kobak

Subjects: Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG)
[148] arXiv:2404.08382 [pdf, other]: Title: Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think

Authors: Xinpeng Wang, Chengzhi Hu, Bolei Ma, Paul Röttger, Barbara Plank

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[149] arXiv:2404.08368 [pdf, other]: Title: ASR advancements for indigenous languages: Quechua, Guarani, Bribri, Kotiria, and Wa'ikhana

Authors: Monica Romero, Sandra Gomez, Iván G. Torre

Subjects: Computation and Language (cs.CL)
[150] arXiv:2404.08359 [pdf, other]: Title: Improving Health Question Answering with Reliable and Time-Aware Evidence Retrieval

Authors: Juraj Vladika, Florian Matthes

Comments: Accepted to NAACL 2024 (Findings)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[151] arXiv:2404.08354 [pdf, other]: Title: Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks

Authors: Xiao Zhang, Chunliu Wang, Rik van Noord, Johan Bos

Subjects: Computation and Language (cs.CL)
[152] arXiv:2404.08345 [pdf, other]: Title: FastSpell: the LangId Magic Spell

Authors: Marta Bañón, Jaume Zaragoza-Bernabeu, Gema Ramírez-Sánchez, Sergio Ortiz-Rojas

Subjects: Computation and Language (cs.CL)
[153] arXiv:2404.08335 [pdf, other]: Title: Toward a Theory of Tokenization in LLMs

Authors: Nived Rajaraman, Jiantao Jiao, Kannan Ramchandran

Comments: 58 pages, 10 figures

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[154] arXiv:2404.08313 [pdf, other]: Title: The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing

Authors: Muzhi Li, Minda Hu, Irwin King, Ho-fung Leung

Comments: Accepted in NAACL2024 main

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[155] arXiv:2404.08263 [pdf, other]: Title: Relational Prompt-based Pre-trained Language Models for Social Event Detection

Authors: Pu Li, Xiaoyan Yu, Hao Peng, Yantuan Xian, Linqin Wang, Li Sun, Jingyun Zhang, Philip S. Yu

Comments: ACM TOIS Under Review

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[156] arXiv:2404.08262 [pdf, ps, other]: Title: Pretraining and Updating Language- and Domain-specific Large Language Model: A Case Study in Japanese Business Domain

Authors: Kosuke Takahashi, Takahiro Omi, Kosuke Arima, Tatsuya Ishigaki

Comments: 9 pages. preprint of COLM2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[157] arXiv:2404.08259 [pdf, ps, other]: Title: Investigating Neural Machine Translation for Low-Resource Languages: Using Bavarian as a Case Study

Authors: Wan-Hua Her, Udo Kruschwitz

Comments: Preprint accepted at the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages (SIGUL 2024)

Subjects: Computation and Language (cs.CL)
[158] arXiv:2404.08191 [pdf, other]: Title: Measuring Cross-lingual Transfer in Bytes

Authors: Leandro Rodrigues de Souza, Thales Sales Almeida, Roberto Lotufo, Rodrigo Nogueira

Comments: NAACL 2024

Subjects: Computation and Language (cs.CL)
[159] arXiv:2404.08156 [pdf, other]: Title: Multimodal Contextual Dialogue Breakdown Detection for Conversational AI Models

Authors: Md Messal Monem Miah, Ulie Schnaithmann, Arushi Raghuvanshi, Youngseo Son

Comments: Published in NAACL 2024 Industry Track

Subjects: Computation and Language (cs.CL)
[160] arXiv:2404.08155 [pdf, other]: Title: Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls

Authors: Amin Hosseiny Marani, Ulie Schnaithmann, Youngseo Son, Akil Iyer, Manas Paldhe, Arushi Raghuvanshi

Comments: Published in NAACL 2024 Industry Track

Subjects: Computation and Language (cs.CL)
[161] arXiv:2404.08148 [pdf, other]: Title: Distilling Algorithmic Reasoning from LLMs via Explaining Solution Programs

Authors: Jierui Li, Raymond Mooney

Comments: pre-print

Subjects: Computation and Language (cs.CL)
[162] arXiv:2404.08118 [pdf, ps, other]: Title: HLTCOE at TREC 2023 NeuCLIR Track

Authors: Eugene Yang, Dawn Lawrie, James Mayfield

Comments: 6 pages. Part of TREC 2023 Proceedings

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[163] arXiv:2404.08092 [pdf, ps, other]: Title: Data-Augmentation-Based Dialectal Adaptation for LLMs

Authors: Fahim Faisal, Antonios Anastasopoulos

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[164] arXiv:2404.08078 [pdf, other]: Title: SQBC: Active Learning using LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions

Authors: Stefan Sylvius Wagner, Maike Behrendt, Marc Ziegele, Stefan Harmeling

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[165] arXiv:2404.08066 [pdf, other]: Title: MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference

Authors: Mobashir Sadat, Cornelia Caragea

Comments: Accepted to the NAACL 2024 Main Conference

Subjects: Computation and Language (cs.CL)
[166] arXiv:2404.08555 (cross-list from cs.LG) [pdf, other]: Title: RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[167] arXiv:2404.08517 (cross-list from cs.SE) [pdf, other]: Title: Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward

Authors: Xuan Xie, Jiayang Song, Zhehua Zhou, Yuheng Huang, Da Song, Lei Ma

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[168] arXiv:2404.08511 (cross-list from cs.AI) [pdf, other]: Title: Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery

Authors: Shiva Aryal, Tuyen Do, Bisesh Heyojoo, Sandeep Chataut, Bichar Dip Shrestha Gurung, Venkataramana Gadhamshetty, Etienne Gnimpieba

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[169] arXiv:2404.08509 (cross-list from cs.DC) [pdf, other]: Title: Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

Authors: Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Başar, Ravishankar K. Iyer

Comments: Accepted at AIOps'24

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[170] arXiv:2404.08495 (cross-list from cs.LG) [pdf, other]: Title: Dataset Reset Policy Optimization for RLHF

Authors: Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Comments: 28 pages, 6 tables, 3 Figures, 3 Algorithms

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[171] arXiv:2404.08480 (cross-list from cs.LG) [pdf, other]: Title: Decoding AI: The inside story of data analysis in ChatGPT

Authors: Ozan Evkaya, Miguel de Carvalho

Comments: 15 pages with figures and appendix

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computation (stat.CO)
[172] arXiv:2404.08417 (cross-list from cs.LG) [pdf, other]: Title: AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees

Authors: William Fleshman, Aleem Khan, Marc Marone, Benjamin Van Durme

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[173] arXiv:2404.08309 (cross-list from cs.CR) [pdf, other]: Title: Subtoxic Questions: Dive Into Attitude Change of LLM's Response in Jailbreak Attempts

Authors: Tianyu Zhang, Zixuan Zhao, Jiaqi Huang, Jingyu Hua, Sheng Zhong

Comments: 4 pages, 2 figures. This paper was submitted to the 7th Deep Learning Security and Privacy Workshop (DLSP 2024) and was accepted as an extended abstract; see this https URL

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[174] arXiv:2404.08189 (cross-list from cs.LG) [pdf, other]: Title: Reducing hallucination in structured outputs via Retrieval-Augmented Generation

Authors: Patrice Béchard, Orlando Marquez Ayala

Comments: To be presented at NAACL 2024. 11 pages and 4 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[175] arXiv:2404.08164 (cross-list from stat.ML) [pdf, other]: Title: Language Model Prompt Selection via Simulation Optimization

Authors: Haoting Zhang, Jinghai He, Rhonda Righter, Zeyu Zheng

Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[176] arXiv:2404.08134 (cross-list from cs.IR) [pdf, other]: Title: Extending Translate-Train for ColBERT-X to African Language CLIR

Authors: Eugene Yang, Dawn J. Lawrie, Paul McNamee, James Mayfield

Comments: 10 pages, 2 figures. System description paper for HLTCOE's participation in CIRAL@FIRE 2023

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[177] arXiv:2404.08111 (cross-list from cs.CV) [pdf, other]: Title: S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing

Authors: Guangzhi Wang, Tianyi Chen, Kamran Ghasedi, HsiangTao Wu, Tianyu Ding, Chris Nuesmeyer, Ilya Zharkov, Mohan Kankanhalli, Luming Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[178] arXiv:2404.08080 (cross-list from cs.LG) [pdf, other]: Title: Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models

Authors: Tanmay Gautam, Youngsuk Park, Hao Zhou, Parameswaran Raman, Wooseok Ha

Comments: 29 pages, 25 tables, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Optimization and Control (math.OC)
[179] arXiv:2404.08020 (cross-list from cs.AI) [pdf, other]: Title: Augmenting Knowledge Graph Hierarchies Using Neural Transformers

Authors: Sanat Sharma, Mayank Poddar, Jayant Kumar, Kosta Blank, Tracy King

Comments: European Conference on Information Retrieval 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[180] arXiv:2404.08018 (cross-list from cs.SE) [pdf, other]: Title: Analyzing the Performance of Large Language Models on Code Summarization

Authors: Rajarshi Haldar, Julia Hockenmaier

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[181] arXiv:2404.08008 (cross-list from cs.LG) [pdf, other]: Title: Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition

Authors: Kehua Feng, Keyan Ding, Kede Ma, Zhihua Wang, Qiang Zhang, Huajun Chen

Comments: 32 pages, 6 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[182] arXiv:2404.08001 (cross-list from hep-ph) [pdf, other]: Title: Xiwu: A Basis Flexible and Learnable LLM for High Energy Physics

Authors: Zhengde Zhang, Yiyu Zhang, Haodong Yao, Jianwen Luo, Rui Zhao, Bo Huang, Jiameng Zhao, Yipu Liao, Ke Li, Lina Zhao, Jun Cao, Fazhi Qi, Changzheng Yuan

Comments: 15 pages, 8 figures

Subjects: High Energy Physics - Phenomenology (hep-ph); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); Computational Physics (physics.comp-ph)
[183] arXiv:2404.07999 (cross-list from cs.LG) [pdf, other]: Title: A Multi-Level Framework for Accelerating Training Transformer Models

Authors: Longwei Zou, Han Zhang, Yangdong Deng

Comments: ICLR 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

Fri, 12 Apr 2024 (showing first 20 of 59 entries)

[184] arXiv:2404.07982 [pdf, other]: Title: Language Imbalance Can Boost Cross-lingual Generalisation

Authors: Anton Schäfer, Shauli Ravfogel, Thomas Hofmann, Tiago Pimentel, Imanol Schlag

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[185] arXiv:2404.07979 [pdf, other]: Title: LLoCO: Learning Long Contexts Offline

Authors: Sijun Tan, Xiuyu Li, Shishir Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Ada Popa

Comments: The first two authors contributed equally to this work

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186] arXiv:2404.07965 [pdf, other]: Title: Rho-1: Not All Tokens Are What You Need

Authors: Zhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu, Yelong Shen, Ruochen Xu, Chen Lin, Yujiu Yang, Jian Jiao, Nan Duan, Weizhu Chen

Comments: First two authors equal contribution

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187] arXiv:2404.07922 [pdf, other]: Title: LaVy: Vietnamese Multimodal Large Language Model

Authors: Chi Tran, Huong Le Thanh

Comments: 4 pages

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[188] arXiv:2404.07921 [pdf, other]: Title: AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs

Authors: Zeyi Liao, Huan Sun

Subjects: Computation and Language (cs.CL)
[189] arXiv:2404.07904 [pdf, other]: Title: HGRN2: Gated Linear RNNs with State Expansion

Authors: Zhen Qin, Songlin Yang, Weixuan Sun, Xuyang Shen, Dong Li, Weigao Sun, Yiran Zhong

Comments: Technical Report. Yiran Zhong is the corresponding author. The source code is available at this https URL

Subjects: Computation and Language (cs.CL)
[190] arXiv:2404.07900 [pdf, other]: Title: High-Dimension Human Value Representation in Large Language Models

Authors: Samuel Cahyawijaya, Delong Chen, Yejin Bang, Leila Khalatbari, Bryan Wilie, Ziwei Ji, Etsuko Ishii, Pascale Fung

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[191] arXiv:2404.07879 [pdf, other]: Title: Analyzing Toxicity in Deep Conversations: A Reddit Case Study

Authors: Vigneshwaran Shankaran, Rajesh Sharma

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[192] arXiv:2404.07851 [pdf, other]: Title: Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Authors: Dayeon Ki, Marine Carpuat

Comments: 21 pages, 8 figures

Journal-ref: NAACL 2024 Findings

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[193] arXiv:2404.07840 [pdf, other]: Title: On Training Data Influence of GPT Models

Authors: Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Keze Wang, Hua Wu

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[194] arXiv:2404.07836 [pdf, ps, other]: Title: Question Generation in Knowledge-Driven Dialog: Explainability and Evaluation

Authors: Juliette Faille, Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona, Claire Gardent

Subjects: Computation and Language (cs.CL)
[195] arXiv:2404.07814 [pdf, ps, other]: Title: MultiLS-SP/CA: Lexical Complexity Prediction and Lexical Simplification Resources for Catalan and Spanish

Authors: Stefan Bott, Horacio Saggion, Nelson Peréz Rojas, Martin Solis Salazar, Saul Calderon Ramirez

Comments: Submitted to the 40th edition of the SEPLN Conference. Under Revision

Subjects: Computation and Language (cs.CL)
[196] arXiv:2404.07792 [pdf, other]: Title: Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation

Authors: Stephen Bothwell, Abigail Swenor, David Chiang

Comments: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[197] arXiv:2404.07775 [pdf, other]: Title: Discourse-Aware In-Context Learning for Temporal Expression Normalization

Authors: Akash Kumar Gautam, Lukas Lange, Jannik Strötgen

Comments: Accepted at NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[198] arXiv:2404.07768 [pdf, ps, other]: Title: Using Letter Positional Probabilities to Assess Word Complexity

Authors: Michael Dalvean

Comments: 25 Pages, 15 Tables

Subjects: Computation and Language (cs.CL)
[199] arXiv:2404.07765 [pdf, other]: Title: AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports

Authors: Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Pujari, Annemarie Friedrich

Comments: Accepted at LREC-COLING 2024. Corpus available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[200] arXiv:2404.07738 [pdf, other]: Title: ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Authors: Jinheon Baek, Sujay Kumar Jauhar, Silviu Cucerzan, Sung Ju Hwang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[201] arXiv:2404.07720 [pdf, other]: Title: Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language Models

Authors: Andreas Säuberli, Simon Clematide

Comments: Accepted for publication at the 3rd Workshop on Tools and Resources for People with REAding DIfficulties (READI) at LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[202] arXiv:2404.07677 [pdf, other]: Title: ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs

Authors: Lei Sun, Zhengwei Tao, Youdi Li, Hiroshi Arakawa

Comments: LLM+KG

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[203] arXiv:2404.07673 [pdf, other]: Title: Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars

Authors: Andrés Lou, Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Víctor M. Sánchez-Cartagena

Comments: 13 pages, 3 figures, 8 tables, Submitted to NAACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
