Computation and Language

Authors and titles for recent submissions, skipping first 110

[ total of 515 entries ]

Tue, 28 May 2024 (continued, showing last 97 of 126 entries)

[111] arXiv:2405.16908 [pdf, other]: Title: Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?

Authors: Gal Yona, Roee Aharoni, Mor Geva

Subjects: Computation and Language (cs.CL)
[112] arXiv:2405.16884 [pdf, other]: Title: Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching

Authors: Tianshu Wang, Hongyu Lin, Xiaoyang Chen, Xianpei Han, Hao Wang, Zhenyu Zeng, Le Sun

Comments: Under revision. Code is available at this https URL

Subjects: Computation and Language (cs.CL); Databases (cs.DB)
[113] arXiv:2405.16856 [pdf, other]: Title: Can We Trust LLMs? Mitigate Overconfidence Bias in LLMs through Knowledge Transfer

Authors: Haoyan Yang, Yixuan Wang, Xingyin Xu, Hanyuan Zhang, Yirong Bian

Subjects: Computation and Language (cs.CL)
[114] arXiv:2405.16821 [pdf, other]: Title: Perturbation-Restrained Sequential Model Editing

Authors: Jun-Yu Ma, Hong Wang, Hao-Xiang Xu, Zhen-Hua Ling, Jia-Chen Gu

Subjects: Computation and Language (cs.CL)
[115] arXiv:2405.16810 [pdf, ps, other]: Title: Performance evaluation of Reddit Comments using Machine Learning and Natural Language Processing methods in Sentiment Analysis

Authors: Xiaoxia Zhang, Xiuyuan Qi, Zixin Teng

Comments: 11 pages, 5 figures, to be published in Computational and Experimental Simulations in Engineering - Proceedings of ICCES 2024 - Volume 2

Subjects: Computation and Language (cs.CL)
[116] arXiv:2405.16806 [pdf, other]: Title: Entity Alignment with Noisy Annotations from Large Language Models

Authors: Shengyuan Chen, Qinggang Zhang, Junnan Dong, Wen Hua, Qing Li, Xiao Huang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117] arXiv:2405.16802 [pdf, other]: Title: AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation

Authors: Jianqiao Lu, Zhiyang Dou, Hongru Wang, Zeyu Cao, Jianbo Dai, Yingjia Wan, Yinya Huang, Zhijiang Guo

Comments: 20 pages, 1 figure, 13 tables

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[118] arXiv:2405.16720 [pdf, other]: Title: Large Scale Knowledge Washing

Authors: Yu Wang, Ruihan Wu, Zexue He, Xiusi Chen, Julian McAuley

Subjects: Computation and Language (cs.CL)
[119] arXiv:2405.16714 [pdf, other]: Title: Crafting Interpretable Embeddings by Asking LLMs Questions

Authors: Vinamra Benara, Chandan Singh, John X. Morris, Richard Antonello, Ion Stoica, Alexander G. Huth, Jianfeng Gao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[120] arXiv:2405.16702 [pdf, other]: Title: Accurate and Nuanced Open-QA Evaluation Through Textual Entailment

Authors: Peiran Yao, Denilson Barbosa

Comments: To appear at ACL 2024 (Findings)

Subjects: Computation and Language (cs.CL)
[121] arXiv:2405.16684 [pdf, other]: Title: gzip Predicts Data-dependent Scaling Laws

Authors: Rohan Pandey

Comments: 9 pages, 9 figures

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[122] arXiv:2405.16681 [pdf, other]: Title: Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization

Authors: Amir Saeidi, Shivanshu Verma, Aswin RRV, Chitta Baral

Subjects: Computation and Language (cs.CL)
[123] arXiv:2405.16661 [pdf, other]: Title: RLSF: Reinforcement Learning via Symbolic Feedback

Authors: Piyush Jha, Prithwish Jana, Arnav Arora, Vijay Ganesh

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[124] arXiv:2405.16635 [pdf, other]: Title: Compressing Lengthy Context With UltraGist

Authors: Peitian Zhang, Zheng Liu, Shitao Xiao, Ninglu Shao, Qiwei Ye, Zhicheng Dou

Subjects: Computation and Language (cs.CL)
[125] arXiv:2405.16631 [pdf, other]: Title: Let Silence Speak: Enhancing Fake News Detection with Generated Comments from Large Language Models

Authors: Qiong Nan, Qiang Sheng, Juan Cao, Beizhe Hu, Danding Wang, Jintao Li

Comments: 11 pages, 5 figures, 8 tables

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[126] arXiv:2405.16584 [pdf, other]: Title: MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations

Authors: Yuxin Wang, Ivory Yang, Saeed Hassanpour, Soroush Vosoughi

Comments: Accepted at ACL 2024

Subjects: Computation and Language (cs.CL)
[127] arXiv:2405.16579 [pdf, other]: Title: Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

Authors: Shanghaoran Quan

Subjects: Computation and Language (cs.CL)
[128] arXiv:2405.16571 [pdf, other]: Title: A Preliminary Empirical Study on Prompt-based Unsupervised Keyphrase Extraction

Authors: Mingyang Song, Yi Feng, Liping Jing

Comments: work in progress

Subjects: Computation and Language (cs.CL)
[129] arXiv:2405.16552 [pdf, other]: Title: SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation

Authors: Ziqin Luo, Haixia Han, Haokun Zhao, Guochao Jiang, Chengyu Du, Tingyun Li, Jiaqing Liang, Deqing Yang, Yanghua Xiao

Comments: The relevant code will be released in subsequent versions

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130] arXiv:2405.16533 [pdf, other]: Title: Chain of Tools: Large Language Model is an Automatic Multi-tool Learner

Authors: Zhengliang Shi, Shen Gao, Xiuyi Chen, Yue Feng, Lingyong Yan, Haibo Shi, Dawei Yin, Zhumin Chen, Suzan Verberne, Zhaochun Ren

Comments: Work in progress

Subjects: Computation and Language (cs.CL)
[131] arXiv:2405.16482 [pdf, other]: Title: DarijaBanking: A New Resource for Overcoming Language Barriers in Banking Intent Detection for Moroccan Arabic Speakers

Authors: Abderrahman Skiredj, Ferdaous Azhari, Ismail Berrada, Saad Ezzini

Subjects: Computation and Language (cs.CL)
[132] arXiv:2405.16433 [pdf, other]: Title: CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

Authors: Chenhao Zhang, Renhao Li, Minghuan Tan, Min Yang, Jingwei Zhu, Di Yang, Jiahao Zhao, Guancheng Ye, Chengming Li, Xiping Hu, Derek F. Wong

Comments: Accepted to Findings of ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[133] arXiv:2405.16422 [pdf, ps, other]: Title: AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm

Authors: Hao Wang, Jianwei Li, Zhengyu Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134] arXiv:2405.16420 [pdf, other]: Title: M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions

Authors: Zheng Wang, Shu Xian Teo, Jieer Ouyang, Yongjun Xu, Wei Shi

Comments: This paper has been accepted by ACL 2024

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[135] arXiv:2405.16412 [pdf, other]: Title: KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge

Authors: Pengcheng Jiang, Lang Cao, Cao Xiao, Parminder Bhatia, Jimeng Sun, Jiawei Han

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[136] arXiv:2405.16402 [pdf, other]: Title: Assessing Empathy in Large Language Models with Real-World Physician-Patient Interactions

Authors: Man Luo, Christopher J. Warren, Lu Cheng, Haidar M. Abdul-Muhsin, Imon Banerjee

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[137] arXiv:2405.16388 [pdf, other]: Title: Multi-Reference Preference Optimization for Large Language Models

Authors: Hung Le, Quan Tran, Dung Nguyen, Kien Do, Saloni Mittal, Kelechi Ogueji, Svetha Venkatesh

Comments: 20 pages

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[138] arXiv:2405.16376 [pdf, other]: Title: STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making

Authors: Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang

Comments: 39 pages, 4 figures

Subjects: Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT)
[139] arXiv:2405.16337 [pdf, other]: Title: Learning to Reason via Program Generation, Emulation, and Search

Authors: Nathaniel Weir, Muhammad Khalifa, Linlu Qiu, Orion Weller, Peter Clark

Comments: 16 pages, 10 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[140] arXiv:2405.16295 [pdf, ps, other]: Title: Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

Authors: Yuhao Chen, Zhimu Wang, Bo Wen, Farhana Zulkernine

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[141] arXiv:2405.16284 [pdf, ps, other]: Title: Generating clickbait spoilers with an ensemble of large language models

Authors: Mateusz Woźny, Mateusz Lango

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[142] arXiv:2405.16282 [pdf, other]: Title: Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

Authors: Abhishek Kumar, Robert Morabito, Sanzhar Umbet, Jad Kabbara, Ali Emami

Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[143] arXiv:2405.16281 [pdf, other]: Title: ConStat: Performance-Based Contamination Detection in Large Language Models

Authors: Jasper Dekoninck, Mark Niklas Müller, Martin Vechev

Subjects: Computation and Language (cs.CL)
[144] arXiv:2405.16277 [pdf, other]: Title: Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge

Authors: Brendan Park, Madeline Janecek, Naser Ezzati-Jivan, Yifeng Li, Ali Emami

Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[145] arXiv:2405.16229 [pdf, other]: Title: No Two Devils Alike: Unveiling Distinct Mechanisms of Fine-tuning Attacks

Authors: Chak Tou Leong, Yi Cheng, Kaishuai Xu, Jian Wang, Hanlin Wang, Wenjie Li

Comments: work in progress

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[146] arXiv:2405.16178 [pdf, other]: Title: Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

Authors: Yun Zhu, Jia-Chen Gu, Caitlin Sikora, Ho Ko, Yinxiao Liu, Chu-Cheng Lin, Lei Shu, Liangchen Luo, Lei Meng, Bang Liu, Jindong Chen

Subjects: Computation and Language (cs.CL)
[147] arXiv:2405.16176 [pdf, other]: Title: Bi-reachability in Petri nets with data

Authors: Łukasz Kamiński, Sławomir Lasota

Subjects: Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL); Logic in Computer Science (cs.LO)
[148] arXiv:2405.16155 [pdf, other]: Title: Improving Multi-lingual Alignment Through Soft Contrastive Learning

Authors: Minsu Park, Seyeon Choi, Chanyeol Choi, Jun-Seong Kim, Jy-yong Sohn

Comments: 8 pages, 1 figure, Accepted at NAACL SRW 2024

Subjects: Computation and Language (cs.CL)
[149] arXiv:2405.16153 [pdf, other]: Title: DefSent+: Improving sentence embeddings of language models by projecting definition sentences into a quasi-isotropic or isotropic vector space of unlimited dictionary entries

Authors: Xiaodong Liu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[150] arXiv:2405.16150 [pdf, other]: Title: 5W1H Extraction With Large Language Models

Authors: Yang Cao, Yangsong Lan, Feiyan Zhai, Piji Li

Comments: IJCNN 2024

Subjects: Computation and Language (cs.CL)
[151] arXiv:2405.16129 [pdf, other]: Title: iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

Authors: Harshit Gupta, Manav Chaudhary, Tathagata Raha, Shivansh Subramanian, Vasudeva Varma

Subjects: Computation and Language (cs.CL)
[152] arXiv:2405.16115 [pdf, other]: Title: SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology

Authors: Mikhail Kulyabin, Gleb Sokolov, Aleksandr Galaida, Andreas Maier, Tomas Arias-Vergara

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[153] arXiv:2405.16089 [pdf, other]: Title: COLT: Towards Completeness-Oriented Tool Retrieval for Large Language Models

Authors: Changle Qu, Sunhao Dai, Xiaochi Wei, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Jun Xu, Ji-Rong Wen

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[154] arXiv:2405.16064 [pdf, other]: Title: Keypoint-based Progressive Chain-of-Thought Distillation for LLMs

Authors: Kaituo Feng, Changsheng Li, Xiaolu Zhang, Jun Zhou, Ye Yuan, Guoren Wang

Comments: Accepted by ICML 2024

Subjects: Computation and Language (cs.CL)
[155] arXiv:2405.16057 [pdf, other]: Title: SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

Authors: Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[156] arXiv:2405.16042 [pdf, other]: Title: Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention

Authors: Andrew Li, Xianle Feng, Siddhant Narang, Austin Peng, Tianle Cai, Raj Sanjay Shah, Sashank Varma

Comments: Accepted by CogSci-24

Subjects: Computation and Language (cs.CL)
[157] arXiv:2405.15984 [pdf, other]: Title: Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models

Authors: Simon Chi Lok Yu, Jie He, Pasquale Minervini, Jeff Z. Pan

Comments: 29 pages, 6 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[158] arXiv:2405.15964 [pdf, other]: Title: A hierarchical Bayesian model for syntactic priming

Authors: Weijie Xu, Richard Futrell

Comments: 6 pages; accepted to CogSci 2024

Subjects: Computation and Language (cs.CL)
[159] arXiv:2405.15936 [pdf, other]: Title: Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

Authors: Sergio Rojas-Galeano

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[160] arXiv:2405.15924 [pdf, other]: Title: SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation

Authors: Kun Zhao, Bohao Yang, Chen Tang, Chenghua Lin, Liang Zhan

Comments: Accepted by ACL2024 Findings

Subjects: Computation and Language (cs.CL)
[161] arXiv:2405.15896 [pdf, other]: Title: Enhancing Augmentative and Alternative Communication with Card Prediction and Colourful Semantics

Authors: Jayr Pereira, Francisco Rodrigues, Jaylton Pereira, Cleber Zanchettin, Robson Fidalgo

Subjects: Computation and Language (cs.CL)
[162] arXiv:2405.15818 [pdf, other]: Title: DuanzAI: Slang-Enhanced LLM with Prompt for Humor Understanding

Authors: Yesian Rohn

Subjects: Computation and Language (cs.CL)
[163] arXiv:2405.17430 (cross-list from cs.CV) [pdf, other]: Title: Matryoshka Multimodal Models

Authors: Mu Cai, Jianwei Yang, Jianfeng Gao, Yong Jae Lee

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[164] arXiv:2405.17423 (cross-list from cs.CV) [pdf, other]: Title: Privacy-Aware Visual Language Models

Authors: Laurens Samson, Nimrod Barazani, Sennay Ghebreab, Yuki M. Asano

Comments: preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[165] arXiv:2405.17390 (cross-list from cs.IR) [pdf, ps, other]: Title: KSW: Khmer Stop Word based Dictionary for Keyword Extraction

Authors: Nimol Thuon, Wangrui Zhang, Sada Thuon

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[166] arXiv:2405.17382 (cross-list from cs.LG) [pdf, other]: Title: ReMoDetect: Reward Models Recognize Aligned LLM's Generations

Authors: Hyunseok Lee, Jihoon Tack, Jinwoo Shin

Comments: 20 pages

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[167] arXiv:2405.17345 (cross-list from cs.AI) [pdf, other]: Title: Exploring and steering the moral compass of Large Language Models

Authors: Alejandro Tlaie

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168] arXiv:2405.17217 (cross-list from cs.HC) [pdf, other]: Title: Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools

Authors: Daniel Buschek

Comments: 19 pages, 7 figures, 2 tables, ACM DIS 2024

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[169] arXiv:2405.17130 (cross-list from cs.LG) [pdf, other]: Title: Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training

Authors: Enes Altinisik, Safa Messaoud, Husrev Taha Sencar, Hassan Sajjad, Sanjay Chawla

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[170] arXiv:2405.17104 (cross-list from cs.CV) [pdf, other]: Title: LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding

Authors: Haoyu Zhao, Wenhang Ge, Ying-cong Chen

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[171] arXiv:2405.17088 (cross-list from cs.LG) [pdf, other]: Title: Phase Transitions in the Output Distribution of Large Language Models

Authors: Julian Arnold, Flemming Holtorf, Frank Schäfer, Niels Lörch

Comments: 21 pages, 4 figures

Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172] arXiv:2405.17076 (cross-list from cs.AI) [pdf, other]: Title: Leveraging small language models for Text2SPARQL tasks to improve the resilience of AI assistance

Authors: Felix Brei, Johannes Frey, Lars-Peter Meyer

Comments: To appear in Proceedings of the Workshop on Linked Data-driven Resilience Research 2024 (D2R2) co-located with Extended Semantic Web Conference 2024 (ESWC 2024)

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[173] arXiv:2405.17044 (cross-list from cs.AI) [pdf, other]: Title: Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models

Authors: Xuemei Gu, Mario Krenn

Comments: 10 pages; 5 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG)
[174] arXiv:2405.16994 (cross-list from cs.AI) [pdf, other]: Title: Vision-and-Language Navigation Generative Pretrained Transformer

Authors: Wen Hanlin

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[175] arXiv:2405.16919 (cross-list from cs.CV) [pdf, other]: Title: VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

Authors: Zejun Li, Ruipu Luo, Jiwen Zhang, Minghui Qiu, Zhongyu Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[176] arXiv:2405.16869 (cross-list from cs.AI) [pdf, other]: Title: Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion

Authors: Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Wen Zhang, Huajun Chen

Comments: Work in progress. Code and data will be released at this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[177] arXiv:2405.16845 (cross-list from cs.LG) [pdf, other]: Title: On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability

Authors: Chenyu Zheng, Wei Huang, Rongzhen Wang, Guoqiang Wu, Jun Zhu, Chongxuan Li

Comments: 37 pages

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[178] arXiv:2405.16751 (cross-list from cs.AI) [pdf, other]: Title: LLM-Based Cooperative Agents using Information Relevance and Plan Validation

Authors: SeungWon Seo, Junhyeok Lee, SeongRae Noh, HyeongYeop Kang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[179] arXiv:2405.16712 (cross-list from cs.LG) [pdf, other]: Title: Zamba: A Compact 7B SSM Hybrid Model

Authors: Paolo Glorioso, Quentin Anthony, Yury Tokpanov, James Whittington, Jonathan Pilault, Adam Ibrahim, Beren Millidge

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180] arXiv:2405.16700 (cross-list from cs.CV) [pdf, other]: Title: Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs

Authors: Mustafa Shukor, Matthieu Cord

Comments: Project page: this https URL. 37 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181] arXiv:2405.16682 (cross-list from cs.LG) [pdf, other]: Title: A Systematic Review of Federated Generative Models

Authors: Ashkan Vedadi Gargary, Emiliano De Cristofaro

Comments: 24 Pages, 3 Figures, 5 Tables

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[182] arXiv:2405.16677 (cross-list from eess.AS) [pdf, other]: Title: Crossmodal ASR Error Correction with Discrete Speech Units

Authors: Yuanchao Li, Pinzhen Chen, Peter Bell, Catherine Lai

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[183] arXiv:2405.16669 (cross-list from cs.HC) [pdf, other]: Title: Low-resourced Languages and Online Knowledge Repositories: A Need-Finding Study

Authors: Hellina Hailu Nigatu, John Canny, Sarah E. Chasins

Comments: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI 2024)

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[184] arXiv:2405.16662 (cross-list from cs.LO) [pdf, ps, other]: Title: Conjunctive categorial grammars and Lambek grammars with additives

Authors: Stepan L. Kuznetsov, Alexander Okhotin

Comments: This article is an extended version of the conference presentation "Conjunctive categorial grammars" at the Mathematics of Language 2017 meeting (London, UK, July 13-14, 2017; proceedings published in ACL Anthology, W17-3414)

Subjects: Logic in Computer Science (cs.LO); Computation and Language (cs.CL); Logic (math.LO)
[185] arXiv:2405.16640 (cross-list from cs.AI) [pdf, other]: Title: A Survey of Multimodal Large Language Model from A Data-centric Perspective

Authors: Tianyi Bai, Hao Liang, Binwang Wan, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Conghui He, Binhang Yuan, Wentao Zhang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[186] arXiv:2405.16546 (cross-list from cs.IR) [pdf, other]: Title: Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

Authors: Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen

Comments: Accepted by Findings of ACL 2024; Datasets Link: this https URL

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[187] arXiv:2405.16528 (cross-list from cs.LG) [pdf, other]: Title: LoQT: Low Rank Adapters for Quantized Training

Authors: Sebastian Loeschcke, Mads Toftrup, Michael J. Kastoryano, Serge Belongie, Vésteinn Snæbjarnarson

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[188] arXiv:2405.16510 (cross-list from cs.AI) [pdf, other]: Title: Meta-Task Planning for Language Agents

Authors: Cong Zhang, Derrick Goh Xin Deik, Dexun Li, Hao Zhang, Yong Liu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[189] arXiv:2405.16473 (cross-list from cs.CV) [pdf, other]: Title: M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought

Authors: Qiguang Chen, Libo Qin, Jin Zhang, Zhi Chen, Xiao Xu, Wanxiang Che

Comments: Accepted at ACL2024 Main Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[190] arXiv:2405.16442 (cross-list from cs.CY) [pdf, ps, other]: Title: Development of an open education resources (OER) system: a comparative analysis and implementation approach

Authors: Nimol Thuon, Wangrui Zhang

Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[191] arXiv:2405.16434 (cross-list from cs.AI) [pdf, other]: Title: The Importance of Directional Feedback for LLM-based Optimizers

Authors: Allen Nie, Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Comments: Presented at Foundation Models for Decision Making at NeurIPS 2023

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[192] arXiv:2405.16413 (cross-list from cs.AI) [pdf, other]: Title: Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models

Authors: Jiankun Wang, Sumyeong Ahn, Taykhoom Dalal, Xiaodan Zhang, Weishen Pan, Qiannan Zhang, Bin Chen, Hiroko H. Dodge, Fei Wang, Jiayu Zhou

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Applications (stat.AP)
[193] arXiv:2405.16411 (cross-list from cs.LG) [pdf, other]: Title: Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers

Authors: Jiuxiang Gu, Yingyu Liang, Zhenmei Shi, Zhao Song, Yufa Zhou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[194] arXiv:2405.16406 (cross-list from cs.LG) [pdf, other]: Title: SpinQuant -- LLM quantization with learned rotations

Authors: Zechun Liu, Changsheng Zhao, Igor Fedorov, Bilge Soran, Dhruv Choudhary, Raghuraman Krishnamoorthi, Vikas Chandra, Yuandong Tian, Tijmen Blankevoort

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2405.16247 (cross-list from cs.AI) [pdf, other]: Title: AutoManual: Generating Instruction Manuals by LLM Agents via Interactive Environmental Learning

Authors: Minghao Chen, Yihang Li, Yanting Yang, Shiyu Yu, Binbin Lin, Xiaofei He

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[196] arXiv:2405.16205 (cross-list from cs.AI) [pdf, ps, other]: Title: GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases

Authors: Zhizheng Wang, Qiao Jin, Chih-Hsuan Wei, Shubo Tian, Po-Ting Lai, Qingqing Zhu, Chi-Ping Day, Christina Ross, Zhiyong Lu

Comments: 30 pages with 10 figures and/or tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[197] arXiv:2405.16136 (cross-list from cs.AI) [pdf, other]: Title: C3LLM: Conditional Multimodal Content Generation Using Large Language Models

Authors: Zixuan Wang, Qinkai Duan, Yu-Wing Tai, Chi-Keung Tang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[198] arXiv:2405.16128 (cross-list from cs.AI) [pdf, other]: Title: How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect

Authors: Siddhartha K. Vemuri, Raj Sanjay Shah, Sashank Varma

Comments: To appear at CogSci 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[199] arXiv:2405.16122 (cross-list from cs.AI) [pdf, other]: Title: Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

Comments: 23 pages, 1 figure, 23 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[200] arXiv:2405.16043 (cross-list from cs.LG) [pdf, other]: Title: Theoretical Analysis of Weak-to-Strong Generalization

Authors: Hunter Lang, David Sontag, Aravindan Vijayaraghavan

Comments: 36 pages, 3 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[201] arXiv:2405.15973 (cross-list from cs.CV) [pdf, other]: Title: Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

Authors: Xiyao Wang, Jiuhai Chen, Zhaoyang Wang, Yuhang Zhou, Yiyang Zhou, Huaxiu Yao, Tianyi Zhou, Tom Goldstein, Parminder Bhatia, Furong Huang, Cao Xiao

Comments: 15 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[202] arXiv:2405.15943 (cross-list from cs.LG) [pdf, other]: Title: Transformers represent belief state geometry in their residual stream

Authors: Adam S. Shai, Sarah E. Marzen, Lucas Teixeira, Alexander Gietelink Oldenziel, Paul M. Riechers

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[203] arXiv:2405.15902 (cross-list from cs.CR) [pdf, other]: Title: Hacc-Man: An Arcade Game for Jailbreaking LLMs

Authors: Matheus Valentim, Jeanette Falk, Nanna Inie

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[204] arXiv:2405.15877 (cross-list from cs.LG) [pdf, other]: Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications

Authors: Yang Li, Changsheng Zhao, Hyungtak Lee, Ernie Chang, Yangyang Shi, Vikas Chandra

Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[205] arXiv:2405.15793 (cross-list from cs.SE) [pdf, other]: Title: SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

Authors: John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

Comments: First two authors contributed equally. Code and demo at this https URL

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[206] arXiv:2405.15787 (cross-list from cs.IR) [pdf, ps, other]: Title: Extracting chemical food safety hazards from the scientific literature automatically using large language models

Authors: Neris Özen, Wenjuan Mu, Esther D. van Asselt, Leonieke M. van den Bulk

Comments: 31 pages, 5 figures

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[207] arXiv:2405.15784 (cross-list from cs.IR) [pdf, other]: Title: CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

Authors: Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Mon, 27 May 2024 (showing first 3 of 72 entries)

[208] arXiv:2405.15765 [pdf, other]: Title: Scaling Laws for Discriminative Classification in Large Language Models

Authors: Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[209] arXiv:2405.15760 [pdf, other]: Title: GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction

Authors: Virginia K. Felkner, Jennifer A. Thompson, Jonathan May

Comments: Accepted to ACL 2024 (main conference)

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[210] arXiv:2405.15750 [pdf, other]: Title: Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence

Authors: Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld

Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see this http URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
