Computation and Language

Authors and titles for recent submissions, skipping first 151

[ total of 540 entries; showing entries 152-196, 45 per page ]

Mon, 27 May 2024 (continued, showing 45 of 72 entries)

[152] arXiv:2405.15202 [pdf, other]: Title: Cross-Task Defense: Instruction-Tuning LLMs for Content Safety

Authors: Yu Fu, Wen Xiao, Jia Chen, Jiachen Li, Evangelos Papalexakis, Aichi Chien, Yue Dong

Comments: accepted to NAACL2024 TrustNLP workshop

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[153] arXiv:2405.15198 [pdf, other]: Title: RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference

Authors: Lianming Huang, Shangyu Wu, Yufei Cui, Ying Xiong, Xue Liu, Tei-Wei Kuo, Nan Guan, Chun Jason Xue

Subjects: Computation and Language (cs.CL)
[154] arXiv:2405.15185 [pdf, other]: Title: An Evaluation of Estimative Uncertainty in Large Language Models

Authors: Zhisheng Tang, Ke Shen, Mayank Kejriwal

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[155] arXiv:2405.15179 [pdf, other]: Title: VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks

Authors: Yang Li, Shaobo Han, Shihao Ji

Subjects: Computation and Language (cs.CL)
[156] arXiv:2405.15165 [pdf, other]: Title: A Solution-based LLM API-using Methodology for Academic Information Seeking

Authors: Yuanchun Wang, Jifan Yu, Zijun Yao, Jing Zhang, Yuyang Xie, Shangqing Tu, Yiyang Fu, Youhe Feng, Jinkai Zhang, Jingyao Zhang, Bowen Huang, Yuanyao Li, Huihui Yuan, Lei Hou, Juanzi Li, Jie Tang

Comments: 22 pages, 13 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[157] arXiv:2405.15152 [pdf, other]: Title: Machine Unlearning in Large Language Models

Authors: Saaketh Koundinya Gundavarapu, Shreya Agarwal, Arushi Arora, Chandana Thimmalapura Jagadeeshaiah

Comments: 10 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[158] arXiv:2405.15134 [pdf, other]: Title: Efficient Biomedical Entity Linking: Clinical Text Standardization with Low-Resource Techniques

Authors: Akshit Achara, Sanand Sasidharan, Gagan N

Subjects: Computation and Language (cs.CL)
[159] arXiv:2405.15122 [pdf, other]: Title: Generalizable and Scalable Multistage Biomedical Concept Normalization Leveraging Large Language Models

Authors: Nicholas J Dobbins

Subjects: Computation and Language (cs.CL)
[160] arXiv:2405.15110 [pdf, other]: Title: CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems

Authors: Abbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi

Comments: To appear in Findings ACL 2024

Subjects: Computation and Language (cs.CL)
[161] arXiv:2405.15097 [pdf, other]: Title: Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding

Authors: Suyoung Kim, Jiyeon Hwang, Ho-Young Jung

Comments: Accepted NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[162] arXiv:2405.15077 [pdf, other]: Title: Eliciting Informative Text Evaluations with Large Language Models

Authors: Yuxuan Lu, Shengwei Xu, Yichi Zhang, Yuqing Kong, Grant Schoenebeck

Comments: Accepted by the Twenty-Fifth ACM Conference on Economics and Computation (EC'24)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[163] arXiv:2405.15071 [pdf, other]: Title: Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Authors: Boshi Wang, Xiang Yue, Yu Su, Huan Sun

Comments: 22 pages, 16 figures. Code and data: this https URL

Subjects: Computation and Language (cs.CL)
[164] arXiv:2405.15070 [pdf, other]: Title: Optimizing example selection for retrieval-augmented machine translation with translation memories

Authors: Maxime Bouthors, Josep Crego, François Yvon

Comments: TALN conference, French, 10 pages, 7 figures

Subjects: Computation and Language (cs.CL)
[165] arXiv:2405.15067 [pdf, other]: Title: Promoting Constructive Deliberation: Reframing for Receptiveness

Authors: Gauri Kambhatla, Matthew Lease, Ashwin Rajadesingan

Subjects: Computation and Language (cs.CL)
[166] arXiv:2405.15064 [pdf, other]: Title: Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning

Authors: Fangjun Li, David C. Hogg, Anthony G. Cohn

Comments: Camera-Ready version for IJCAI 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[167] arXiv:2405.15039 [pdf, other]: Title: CEEBERT: Cross-Domain Inference in Early Exit BERT

Authors: Divya Jyoti Bajpai, Manjesh Kumar Hanawal

Comments: Accepted at ACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[168] arXiv:2405.15032 [pdf, other]: Title: Aya 23: Open Weight Releases to Further Multilingual Progress

Authors: Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Kelly Marchisio, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

Subjects: Computation and Language (cs.CL)
[169] arXiv:2405.15028 [pdf, other]: Title: AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

Authors: Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[170] arXiv:2405.15012 [pdf, other]: Title: Extracting Prompts by Inverting LLM Outputs

Authors: Collin Zhang, John X. Morris, Vitaly Shmatikov

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[171] arXiv:2405.15007 [pdf, other]: Title: RE-Adapt: Reverse Engineered Adaptation of Large Language Models

Authors: William Fleshman, Benjamin Van Durme

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[172] arXiv:2405.14992 [pdf, other]: Title: Linking In-context Learning in Transformers to Human Episodic Memory

Authors: Li Ji-An, Corey Y. Zhou, Marcus K. Benna, Marcelo G. Mattar

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[173] arXiv:2405.14962 [pdf, ps, other]: Title: Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction

Authors: Kotaro Nagayama, Shota Kato, Manabu Kano

Subjects: Computation and Language (cs.CL)
[174] arXiv:2405.14899 [pdf, other]: Title: DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Authors: Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[175] arXiv:2405.15766 (cross-list from cs.AI) [pdf, other]: Title: Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development

Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal

Comments: ACL Findings 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[176] arXiv:2405.15729 (cross-list from cs.SE) [pdf, other]: Title: Optimizing Large Language Models for OpenAPI Code Completion

Authors: Bohdan Petryshyn, Mantas Lukoševičius

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[177] arXiv:2405.15683 (cross-list from cs.CV) [pdf, other]: Title: VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap

Authors: Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha

Comments: Preprint. Under review. Code will be released on paper acceptance

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[178] arXiv:2405.15638 (cross-list from cs.CV) [pdf, other]: Title: M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models

Authors: Hongyu Wang, Jiayu Xu, Senwei Xie, Ruiping Wang, Jialin Li, Zhaojie Xie, Bin Zhang, Chuyan Xiong, Xilin Chen

Comments: Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[179] arXiv:2405.15556 (cross-list from cs.LG) [pdf, other]: Title: Certifiably Robust RAG against Retrieval Corruption

Authors: Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[180] arXiv:2405.15485 (cross-list from cs.AI) [pdf, other]: Title: Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs

Authors: Siyuan Guo, Aniket Didolkar, Nan Rosemary Ke, Anirudh Goyal, Ferenc Huszár, Bernhard Schölkopf

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181] arXiv:2405.15374 (cross-list from cs.IR) [pdf, other]: Title: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

Authors: Runsong Jia, Bowen Zhang, Sergio J. Rodríguez Méndez, Pouya G. Omran

Comments: for the associated repository, see this http URL

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[182] arXiv:2405.15362 (cross-list from cs.LG) [pdf, other]: Title: Pipeline Parallelism with Controllable Memory

Authors: Penghui Qi, Xinyi Wan, Nyamdavaa Amar, Min Lin

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[183] arXiv:2405.15302 (cross-list from cs.AI) [pdf, other]: Title: Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation

Authors: Zhiwei Wang, Yunji Wang, Zhongwang Zhang, Zhangchen Zhou, Hui Jin, Tianyang Hu, Jiacheng Sun, Zhenguo Li, Yaoyu Zhang, Zhi-Qin John Xu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[184] arXiv:2405.15232 (cross-list from cs.CV) [pdf, other]: Title: DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Authors: Run Luo, Yunshui Li, Longze Chen, Wanwei He, Ting-En Lin, Ziqiang Liu, Lei Zhang, Zikai Song, Xiaobo Xia, Tongliang Liu, Min Yang, Binyuan Hui

Comments: 25 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[185] arXiv:2405.15216 (cross-list from cs.LG) [pdf, other]: Title: Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Authors: Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly

Comments: under review

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[186] arXiv:2405.15189 (cross-list from cs.SE) [pdf, other]: Title: SOAP: Enhancing Efficiency of Generated Code via Self-Optimization

Authors: Dong Huang, Jianbo Dai, Han Weng, Puzhen Wu, Yuhao Qing, Jie M. Zhang, Heming Cui, Zhijiang Guo

Comments: 31 pages, 18 figures, and 8 tables

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[187] arXiv:2405.15145 (cross-list from cs.AI) [pdf, other]: Title: CulturePark: Boosting Cross-cultural Understanding in Large Language Models

Authors: Cheng Li, Damien Teney, Linyi Yang, Qingsong Wen, Xing Xie, Jindong Wang

Comments: Technical report; 28 pages

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
[188] arXiv:2405.15143 (cross-list from cs.LG) [pdf, other]: Title: Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Authors: Cong Lu, Shengran Hu, Jeff Clune

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[189] arXiv:2405.15130 (cross-list from cs.SE) [pdf, other]: Title: OptLLM: Optimal Assignment of Queries to Large Language Models

Authors: Yueyue Liu, Hongyu Zhang, Yuantian Miao, Van-Hoang Le, Zhiqiang Li

Comments: This paper is accepted by ICWS 2024

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[190] arXiv:2405.15115 (cross-list from cs.LG) [pdf, other]: Title: Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification

Authors: Shang Liu, Zhongze Cai, Guanting Chen, Xiaocheng Li

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[191] arXiv:2405.15092 (cross-list from cs.AI) [pdf, other]: Title: Dissociation of Faithful and Unfaithful Reasoning in LLMs

Authors: Evelyn Yee, Alice Li, Chenyu Tang, Yeon Ho Jung, Ramamohan Paturi, Leon Bergen

Comments: code published at this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[192] arXiv:2405.15025 (cross-list from cs.LG) [pdf, other]: Title: OAC: Output-adaptive Calibration for Accurate Post-training Quantization

Authors: Ali Edalati (1), Alireza Ghaffari (1 and 2), Masoud Asgharian (2), Lu Hou (1), Boxing Chen (1), Vahid Partovi Nia (1) ((1) Huawei Noah's Ark Lab, (2) Department of Mathematics and Statistics, McGill University)

Comments: 20 pages, 4 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[193] arXiv:2405.14982 (cross-list from cs.LG) [pdf, other]: Title: In-context Time Series Predictor

Authors: Jiecheng Lu, Yan Sun, Shihao Yang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[194] arXiv:2405.14974 (cross-list from cs.CV) [pdf, other]: Title: LOVA3: Learning to Visual Question Answering, Asking and Assessment

Authors: Henry Hengyuan Zhao, Pan Zhou, Difei Gao, Mike Zheng Shou

Comments: The code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[195] arXiv:2405.14917 (cross-list from cs.LG) [pdf, other]: Title: SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

Authors: Wei Huang, Haotong Qin, Yangdong Liu, Yawei Li, Xianglong Liu, Luca Benini, Michele Magno, Xiaojuan Qi

Comments: 22 pages

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[196] arXiv:2405.14908 (cross-list from cs.LG) [pdf, other]: Title: Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining

Authors: Ce Ge, Zhijian Ma, Daoyuan Chen, Yaliang Li, Bolin Ding

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
