Computation and Language

Authors and titles for recent submissions, skipping first 350

[ total of 427 entries: showing 351-427 ]

Tue, 28 May 2024 (continued, showing last 5 of 126 entries)

[351] arXiv:2405.15902 (cross-list from cs.CR) [pdf, other]: Title: Hacc-Man: An Arcade Game for Jailbreaking LLMs

Authors: Matheus Valentim, Jeanette Falk, Nanna Inie

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[352] arXiv:2405.15877 (cross-list from cs.LG) [pdf, other]: Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications

Authors: Yang Li, Changsheng Zhao, Hyungtak Lee, Ernie Chang, Yangyang Shi, Vikas Chandra

Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[353] arXiv:2405.15793 (cross-list from cs.SE) [pdf, other]: Title: SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

Authors: John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

Comments: First two authors contributed equally. Code and demo at this https URL

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[354] arXiv:2405.15787 (cross-list from cs.IR) [pdf, ps, other]: Title: Extracting chemical food safety hazards from the scientific literature automatically using large language models

Authors: Neris Özen, Wenjuan Mu, Esther D. van Asselt, Leonieke M. van den Bulk

Comments: 31 pages, 5 figures

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[355] arXiv:2405.15784 (cross-list from cs.IR) [pdf, other]: Title: CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

Authors: Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Mon, 27 May 2024

[356] arXiv:2405.15765 [pdf, other]: Title: Scaling Laws for Discriminative Classification in Large Language Models

Authors: Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[357] arXiv:2405.15760 [pdf, other]: Title: GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction

Authors: Virginia K. Felkner, Jennifer A. Thompson, Jonathan May

Comments: Accepted to ACL 2024 (main conference)

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[358] arXiv:2405.15750 [pdf, other]: Title: Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence

Authors: Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld

Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see this http URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359] arXiv:2405.15708 [pdf, other]: Title: EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences

Authors: Jocelyn Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park

Comments: Accepted to ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[360] arXiv:2405.15640 [pdf, other]: Title: GECKO: Generative Language Model for English, Code and Korean

Authors: Sungwoo Oh, Donggyu Kim

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[361] arXiv:2405.15604 [pdf, other]: Title: Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges

Authors: Jonas Becker, Jan Philip Wahle, Bela Gipp, Terry Ruas

Comments: 35 pages, 2 figures, 2 tables, Under review

Subjects: Computation and Language (cs.CL)
[362] arXiv:2405.15590 [pdf, ps, other]: Title: Profiling checkpointing schedules in adjoint ST-AD

Authors: Laurent Hascoët, Jean-Luc Bouchot, Shreyas Sunil Gaikwad, Sri Hari Krishna Narayanan, Jan Hückelheim

Subjects: Computation and Language (cs.CL)
[363] arXiv:2405.15585 [pdf, other]: Title: Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems

Authors: Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam

Subjects: Computation and Language (cs.CL)
[364] arXiv:2405.15525 [pdf, other]: Title: Sparse Matrix in Large Language Model Fine-tuning

Authors: Haoze He, Juncheng Billy Li, Xuan Jiang, Heather Miller

Comments: 14 pages

Subjects: Computation and Language (cs.CL)
[365] arXiv:2405.15523 [pdf, other]: Title: Mosaic Memory: Fuzzy Duplication in Copyright Traps for Large Language Models

Authors: Igor Shilov, Matthieu Meeus, Yves-Alexandre de Montjoye

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[366] arXiv:2405.15471 [pdf, other]: Title: Emergence of a High-Dimensional Abstraction Phase in Language Transformers

Authors: Emily Cheng, Diego Doimo, Corentin Kervadec, Iuri Macocco, Jade Yu, Alessandro Laio, Marco Baroni

Subjects: Computation and Language (cs.CL)
[367] arXiv:2405.15454 [pdf, other]: Title: Linearly Controlled Language Generation with Performative Guarantees

Authors: Emily Cheng, Marco Baroni, Carmen Amo Alonso

Subjects: Computation and Language (cs.CL); Systems and Control (eess.SY)
[368] arXiv:2405.15453 [pdf, other]: Title: Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks

Authors: Munief Hassan Tahir, Sana Shams, Layba Fiaz, Farah Adeeba, Sarmad Hussain

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[369] arXiv:2405.15452 [pdf, other]: Title: Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top

Authors: Keyuan Cheng, Muhammad Asif Ali, Shu Yang, Gang Lin, Yuxuan Zhai, Haoyang Fei, Ke Xu, Lu Yu, Lijie Hu, Di Wang

Comments: 18 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[370] arXiv:2405.15370 [pdf, other]: Title: Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection

Authors: Jun Liu, Chaoyun Zhang, Jiaxu Qian, Minghua Ma, Si Qin, Chetan Bansal, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

Subjects: Computation and Language (cs.CL)
[371] arXiv:2405.15349 [pdf, other]: Title: UnKE: Unstructured Knowledge Editing in Large Language Models

Authors: Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

Subjects: Computation and Language (cs.CL)
[372] arXiv:2405.15346 [pdf, other]: Title: BiSup: Bidirectional Quantization Error Suppression for Large Language Models

Authors: Minghui Zou, Ronghui Guo, Sai Zhang, Xiaowang Zhang, Zhiyong Feng

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[373] arXiv:2405.15334 [pdf, other]: Title: Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation

Authors: Shuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang Ni

Subjects: Computation and Language (cs.CL)
[374] arXiv:2405.15329 [pdf, other]: Title: Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework

Authors: Minzhi Li, Zhengyuan Liu, Shumin Deng, Shafiq Joty, Nancy F. Chen, Min-Yen Kan

Subjects: Computation and Language (cs.CL)
[375] arXiv:2405.15320 [pdf, other]: Title: Organic Data-Driven Approach for Turkish Grammatical Error Correction and LLMs

Authors: Asım Ersoy, Olcay Taner Yıldız

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[376] arXiv:2405.15319 [pdf, other]: Title: Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

Authors: Wenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu

Comments: Preprint; project link: https://llm-stacking.github.io/

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[377] arXiv:2405.15318 [pdf, other]: Title: Are Long-LLMs A Necessity For Long-Context Tasks?

Authors: Hongjin Qian, Zheng Liu, Peitian Zhang, Kelong Mao, Yujia Zhou, Xu Chen, Zhicheng Dou

Comments: 18 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[378] arXiv:2405.15307 [pdf, other]: Title: Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation

Authors: Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng

Comments: Accepted to ACL Findings 2024

Subjects: Computation and Language (cs.CL)
[379] arXiv:2405.15306 [pdf, other]: Title: DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Authors: Jonas Belouadi, Simone Paolo Ponzetto, Steffen Eger

Comments: Project page: this https URL

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2405.15208 [pdf, other]: Title: Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs

Authors: Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong

Comments: Accepted for publication at LREC-COLING 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[381] arXiv:2405.15202 [pdf, other]: Title: Cross-Task Defense: Instruction-Tuning LLMs for Content Safety

Authors: Yu Fu, Wen Xiao, Jia Chen, Jiachen Li, Evangelos Papalexakis, Aichi Chien, Yue Dong

Comments: Accepted to the NAACL 2024 TrustNLP workshop

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[382] arXiv:2405.15198 [pdf, other]: Title: RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference

Authors: Lianming Huang, Shangyu Wu, Yufei Cui, Ying Xiong, Xue Liu, Tei-Wei Kuo, Nan Guan, Chun Jason Xue

Subjects: Computation and Language (cs.CL)
[383] arXiv:2405.15185 [pdf, other]: Title: An Evaluation of Estimative Uncertainty in Large Language Models

Authors: Zhisheng Tang, Ke Shen, Mayank Kejriwal

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[384] arXiv:2405.15179 [pdf, other]: Title: VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks

Authors: Yang Li, Shaobo Han, Shihao Ji

Subjects: Computation and Language (cs.CL)
[385] arXiv:2405.15165 [pdf, other]: Title: A Solution-based LLM API-using Methodology for Academic Information Seeking

Authors: Yuanchun Wang, Jifan Yu, Zijun Yao, Jing Zhang, Yuyang Xie, Shangqing Tu, Yiyang Fu, Youhe Feng, Jinkai Zhang, Jingyao Zhang, Bowen Huang, Yuanyao Li, Huihui Yuan, Lei Hou, Juanzi Li, Jie Tang

Comments: 22 pages, 13 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[386] arXiv:2405.15152 [pdf, other]: Title: Machine Unlearning in Large Language Models

Authors: Saaketh Koundinya Gundavarapu, Shreya Agarwal, Arushi Arora, Chandana Thimmalapura Jagadeeshaiah

Comments: 10 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[387] arXiv:2405.15134 [pdf, other]: Title: Efficient Biomedical Entity Linking: Clinical Text Standardization with Low-Resource Techniques

Authors: Akshit Achara, Sanand Sasidharan, Gagan N

Subjects: Computation and Language (cs.CL)
[388] arXiv:2405.15122 [pdf, other]: Title: Generalizable and Scalable Multistage Biomedical Concept Normalization Leveraging Large Language Models

Authors: Nicholas J Dobbins

Subjects: Computation and Language (cs.CL)
[389] arXiv:2405.15110 [pdf, other]: Title: CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems

Authors: Abbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi

Comments: To appear in Findings ACL 2024

Subjects: Computation and Language (cs.CL)
[390] arXiv:2405.15097 [pdf, other]: Title: Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding

Authors: Suyoung Kim, Jiyeon Hwang, Ho-Young Jung

Comments: Accepted NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[391] arXiv:2405.15077 [pdf, other]: Title: Eliciting Informative Text Evaluations with Large Language Models

Authors: Yuxuan Lu, Shengwei Xu, Yichi Zhang, Yuqing Kong, Grant Schoenebeck

Comments: Accepted by the Twenty-Fifth ACM Conference on Economics and Computation (EC'24)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[392] arXiv:2405.15071 [pdf, other]: Title: Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Authors: Boshi Wang, Xiang Yue, Yu Su, Huan Sun

Comments: 22 pages, 16 figures. Code and data: this https URL

Subjects: Computation and Language (cs.CL)
[393] arXiv:2405.15070 [pdf, other]: Title: Optimizing example selection for retrieval-augmented machine translation with translation memories

Authors: Maxime Bouthors, Josep Crego, François Yvon

Comments: TALN conference, French, 10 pages, 7 figures

Subjects: Computation and Language (cs.CL)
[394] arXiv:2405.15067 [pdf, other]: Title: Promoting Constructive Deliberation: Reframing for Receptiveness

Authors: Gauri Kambhatla, Matthew Lease, Ashwin Rajadesingan

Subjects: Computation and Language (cs.CL)
[395] arXiv:2405.15064 [pdf, other]: Title: Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning

Authors: Fangjun Li, David C. Hogg, Anthony G. Cohn

Comments: Camera-Ready version for IJCAI 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[396] arXiv:2405.15039 [pdf, other]: Title: CEEBERT: Cross-Domain Inference in Early Exit BERT

Authors: Divya Jyoti Bajpai, Manjesh Kumar Hanawal

Comments: Accepted at ACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[397] arXiv:2405.15032 [pdf, other]: Title: Aya 23: Open Weight Releases to Further Multilingual Progress

Authors: Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Kelly Marchisio, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

Subjects: Computation and Language (cs.CL)
[398] arXiv:2405.15028 [pdf, other]: Title: AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

Authors: Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[399] arXiv:2405.15012 [pdf, other]: Title: Extracting Prompts by Inverting LLM Outputs

Authors: Collin Zhang, John X. Morris, Vitaly Shmatikov

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[400] arXiv:2405.15007 [pdf, other]: Title: RE-Adapt: Reverse Engineered Adaptation of Large Language Models

Authors: William Fleshman, Benjamin Van Durme

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[401] arXiv:2405.14992 [pdf, other]: Title: Linking In-context Learning in Transformers to Human Episodic Memory

Authors: Li Ji-An, Corey Y. Zhou, Marcus K. Benna, Marcelo G. Mattar

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[402] arXiv:2405.14962 [pdf, ps, other]: Title: Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction

Authors: Kotaro Nagayama, Shota Kato, Manabu Kano

Subjects: Computation and Language (cs.CL)
[403] arXiv:2405.14899 [pdf, other]: Title: DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Authors: Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[404] arXiv:2405.15766 (cross-list from cs.AI) [pdf, other]: Title: Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development

Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal

Comments: ACL Findings 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2405.15729 (cross-list from cs.SE) [pdf, other]: Title: Optimizing Large Language Models for OpenAPI Code Completion

Authors: Bohdan Petryshyn, Mantas Lukoševičius

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[406] arXiv:2405.15683 (cross-list from cs.CV) [pdf, other]: Title: VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap

Authors: Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha

Comments: Preprint. Under review. Code will be released on paper acceptance

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[407] arXiv:2405.15638 (cross-list from cs.CV) [pdf, other]: Title: M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models

Authors: Hongyu Wang, Jiayu Xu, Senwei Xie, Ruiping Wang, Jialin Li, Zhaojie Xie, Bin Zhang, Chuyan Xiong, Xilin Chen

Comments: Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[408] arXiv:2405.15556 (cross-list from cs.LG) [pdf, other]: Title: Certifiably Robust RAG against Retrieval Corruption

Authors: Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[409] arXiv:2405.15485 (cross-list from cs.AI) [pdf, other]: Title: Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs

Authors: Siyuan Guo, Aniket Didolkar, Nan Rosemary Ke, Anirudh Goyal, Ferenc Huszár, Bernhard Schölkopf

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[410] arXiv:2405.15374 (cross-list from cs.IR) [pdf, other]: Title: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

Authors: Runsong Jia, Bowen Zhang, Sergio J. Rodríguez Méndez, Pouya G. Omran

Comments: for the associated repository, see this http URL

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[411] arXiv:2405.15362 (cross-list from cs.LG) [pdf, other]: Title: Pipeline Parallelism with Controllable Memory

Authors: Penghui Qi, Xinyi Wan, Nyamdavaa Amar, Min Lin

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[412] arXiv:2405.15302 (cross-list from cs.AI) [pdf, other]: Title: Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation

Authors: Zhiwei Wang, Yunji Wang, Zhongwang Zhang, Zhangchen Zhou, Hui Jin, Tianyang Hu, Jiacheng Sun, Zhenguo Li, Yaoyu Zhang, Zhi-Qin John Xu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[413] arXiv:2405.15232 (cross-list from cs.CV) [pdf, other]: Title: DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Authors: Run Luo, Yunshui Li, Longze Chen, Wanwei He, Ting-En Lin, Ziqiang Liu, Lei Zhang, Zikai Song, Xiaobo Xia, Tongliang Liu, Min Yang, Binyuan Hui

Comments: 25 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[414] arXiv:2405.15216 (cross-list from cs.LG) [pdf, other]: Title: Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Authors: Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly

Comments: under review

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[415] arXiv:2405.15189 (cross-list from cs.SE) [pdf, other]: Title: SOAP: Enhancing Efficiency of Generated Code via Self-Optimization

Authors: Dong Huang, Jianbo Dai, Han Weng, Puzhen Wu, Yuhao Qing, Jie M. Zhang, Heming Cui, Zhijiang Guo

Comments: 31 pages, 18 figures, and 8 tables

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[416] arXiv:2405.15145 (cross-list from cs.AI) [pdf, other]: Title: CulturePark: Boosting Cross-cultural Understanding in Large Language Models

Authors: Cheng Li, Damien Teney, Linyi Yang, Qingsong Wen, Xing Xie, Jindong Wang

Comments: Technical report; 28 pages

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multiagent Systems (cs.MA)
[417] arXiv:2405.15143 (cross-list from cs.LG) [pdf, other]: Title: Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Authors: Cong Lu, Shengran Hu, Jeff Clune

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[418] arXiv:2405.15130 (cross-list from cs.SE) [pdf, other]: Title: OptLLM: Optimal Assignment of Queries to Large Language Models

Authors: Yueyue Liu, Hongyu Zhang, Yuantian Miao, Van-Hoang Le, Zhiqiang Li

Comments: This paper is accepted by ICWS 2024

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[419] arXiv:2405.15115 (cross-list from cs.LG) [pdf, other]: Title: Towards Better Understanding of In-Context Learning Ability from In-Context Uncertainty Quantification

Authors: Shang Liu, Zhongze Cai, Guanting Chen, Xiaocheng Li

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[420] arXiv:2405.15092 (cross-list from cs.AI) [pdf, other]: Title: Dissociation of Faithful and Unfaithful Reasoning in LLMs

Authors: Evelyn Yee, Alice Li, Chenyu Tang, Yeon Ho Jung, Ramamohan Paturi, Leon Bergen

Comments: code published at this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[421] arXiv:2405.15025 (cross-list from cs.LG) [pdf, other]: Title: OAC: Output-adaptive Calibration for Accurate Post-training Quantization

Authors: Ali Edalati (1), Alireza Ghaffari (1 and 2), Masoud Asgharian (2), Lu Hou (1), Boxing Chen (1), Vahid Partovi Nia (1) ((1) Huawei Noah's Ark Lab, (2) Department of Mathematics and Statistics, McGill University)

Comments: 20 pages, 4 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[422] arXiv:2405.14982 (cross-list from cs.LG) [pdf, other]: Title: In-context Time Series Predictor

Authors: Jiecheng Lu, Yan Sun, Shihao Yang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[423] arXiv:2405.14974 (cross-list from cs.CV) [pdf, other]: Title: LOVA3: Learning to Visual Question Answering, Asking and Assessment

Authors: Henry Hengyuan Zhao, Pan Zhou, Difei Gao, Mike Zheng Shou

Comments: The code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[424] arXiv:2405.14917 (cross-list from cs.LG) [pdf, other]: Title: SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

Authors: Wei Huang, Haotong Qin, Yangdong Liu, Yawei Li, Xianglong Liu, Luca Benini, Michele Magno, Xiaojuan Qi

Comments: 22 pages

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[425] arXiv:2405.14908 (cross-list from cs.LG) [pdf, other]: Title: Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining

Authors: Ce Ge, Zhijian Ma, Daoyuan Chen, Yaliang Li, Bolin Ding

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[426] arXiv:2405.14905 (cross-list from eess.IV) [pdf, other]: Title: Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation

Authors: Kang Liu, Zhuoqi Ma, Xiaolu Kang, Zhusi Zhong, Zhicheng Jiao, Grayson Baird, Harrison Bai, Qiguang Miao

Comments: The code is available at this https URL or this https URL

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[427] arXiv:2405.14191 (cross-list from cs.CR) [pdf, other]: Title: S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models

Authors: Xiaohan Yuan, Jinfeng Li, Dongxia Wang, Yuefeng Chen, Xiaofeng Mao, Longtao Huang, Hui Xue, Wenhai Wang, Kui Ren, Jingyi Wang

Comments: 18 pages, 11 figures

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)

