Computation and Language

Authors and titles for recent submissions, skipping first 166

[ total of 515 entries: 1-100 | 67-166 | 167-266 | 267-366 | 367-466 | 467-515 ]
[ showing 100 entries per page: fewer | more | all ]

Tue, 28 May 2024 (continued, showing last 41 of 126 entries)

[167] arXiv:2405.17345 (cross-list from cs.AI) [pdf, other]: Title: Exploring and steering the moral compass of Large Language Models

Authors: Alejandro Tlaie

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168] arXiv:2405.17217 (cross-list from cs.HC) [pdf, other]: Title: Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools

Authors: Daniel Buschek

Comments: 19 pages, 7 figures, 2 tables, ACM DIS 2024

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[169] arXiv:2405.17130 (cross-list from cs.LG) [pdf, other]: Title: Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training

Authors: Enes Altinisik, Safa Messaoud, Husrev Taha Sencar, Hassan Sajjad, Sanjay Chawla

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[170] arXiv:2405.17104 (cross-list from cs.CV) [pdf, other]: Title: LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding

Authors: Haoyu Zhao, Wenhang Ge, Ying-cong Chen

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[171] arXiv:2405.17088 (cross-list from cs.LG) [pdf, other]: Title: Phase Transitions in the Output Distribution of Large Language Models

Authors: Julian Arnold, Flemming Holtorf, Frank Schäfer, Niels Lörch

Comments: 21 pages, 4 figures

Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172] arXiv:2405.17076 (cross-list from cs.AI) [pdf, other]: Title: Leveraging small language models for Text2SPARQL tasks to improve the resilience of AI assistance

Authors: Felix Brei, Johannes Frey, Lars-Peter Meyer

Comments: To appear in Proceedings of the Workshop on Linked Data-driven Resilience Research 2024 (D2R2) co-located with Extended Semantic Web Conference 2024 (ESWC 2024)

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[173] arXiv:2405.17044 (cross-list from cs.AI) [pdf, other]: Title: Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models

Authors: Xuemei Gu, Mario Krenn

Comments: 10 pages; 5 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG)
[174] arXiv:2405.16994 (cross-list from cs.AI) [pdf, other]: Title: Vision-and-Language Navigation Generative Pretrained Transformer

Authors: Wen Hanlin

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[175] arXiv:2405.16919 (cross-list from cs.CV) [pdf, other]: Title: VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

Authors: Zejun Li, Ruipu Luo, Jiwen Zhang, Minghui Qiu, Zhongyu Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[176] arXiv:2405.16869 (cross-list from cs.AI) [pdf, other]: Title: Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion

Authors: Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Wen Zhang, Huajun Chen

Comments: Work in progress. Code and data will be released at this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[177] arXiv:2405.16845 (cross-list from cs.LG) [pdf, other]: Title: On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability

Authors: Chenyu Zheng, Wei Huang, Rongzhen Wang, Guoqiang Wu, Jun Zhu, Chongxuan Li

Comments: 37pages

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[178] arXiv:2405.16751 (cross-list from cs.AI) [pdf, other]: Title: LLM-Based Cooperative Agents using Information Relevance and Plan Validation

Authors: SeungWon Seo, Junhyeok Lee, SeongRae Noh, HyeongYeop Kang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[179] arXiv:2405.16712 (cross-list from cs.LG) [pdf, other]: Title: Zamba: A Compact 7B SSM Hybrid Model

Authors: Paolo Glorioso, Quentin Anthony, Yury Tokpanov, James Whittington, Jonathan Pilault, Adam Ibrahim, Beren Millidge

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180] arXiv:2405.16700 (cross-list from cs.CV) [pdf, other]: Title: Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs

Authors: Mustafa Shukor, Matthieu Cord

Comments: Project page: this https URL 37 Pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181] arXiv:2405.16682 (cross-list from cs.LG) [pdf, other]: Title: A Systematic Review of Federated Generative Models

Authors: Ashkan Vedadi Gargary, Emiliano De Cristofaro

Comments: 24 Pages, 3 Figures, 5 Tables

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[182] arXiv:2405.16677 (cross-list from eess.AS) [pdf, other]: Title: Crossmodal ASR Error Correction with Discrete Speech Units

Authors: Yuanchao Li, Pinzhen Chen, Peter Bell, Catherine Lai

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[183] arXiv:2405.16669 (cross-list from cs.HC) [pdf, other]: Title: Low-resourced Languages and Online Knowledge Repositories: A Need-Finding Study

Authors: Hellina Hailu Nigatu, John Canny, Sarah E. Chasins

Comments: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI 2024)

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[184] arXiv:2405.16662 (cross-list from cs.LO) [pdf, ps, other]: Title: Conjunctive categorial grammars and Lambek grammars with additives

Authors: Stepan L. Kuznetsov, Alexander Okhotin

Comments: This article is an extended version of the conference presentation "Conjunctive categorial grammars" at the Mathematics of Language 2017 meeting (London, UK, July 13-14, 2017; proceedings published in ACL Anthology, W17-3414)

Subjects: Logic in Computer Science (cs.LO); Computation and Language (cs.CL); Logic (math.LO)
[185] arXiv:2405.16640 (cross-list from cs.AI) [pdf, other]: Title: A Survey of Multimodal Large Language Model from A Data-centric Perspective

Authors: Tianyi Bai, Hao Liang, Binwang Wan, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Conghui He, Binhang Yuan, Wentao Zhang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[186] arXiv:2405.16546 (cross-list from cs.IR) [pdf, other]: Title: Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

Authors: Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen

Comments: Accepted by Findings of ACL 2024; Datasets Link: this https URL

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[187] arXiv:2405.16528 (cross-list from cs.LG) [pdf, other]: Title: LoQT: Low Rank Adapters for Quantized Training

Authors: Sebastian Loeschcke, Mads Toftrup, Michael J. Kastoryano, Serge Belongie, Vésteinn Snæbjarnarson

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[188] arXiv:2405.16510 (cross-list from cs.AI) [pdf, other]: Title: Meta-Task Planning for Language Agents

Authors: Cong Zhang, Derrick Goh Xin Deik, Dexun Li, Hao Zhang, Yong Liu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[189] arXiv:2405.16473 (cross-list from cs.CV) [pdf, other]: Title: M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought

Authors: Qiguang Chen, Libo Qin, Jin Zhang, Zhi Chen, Xiao Xu, Wanxiang Che

Comments: Accepted at ACL2024 Main Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[190] arXiv:2405.16442 (cross-list from cs.CY) [pdf, ps, other]: Title: Development of an open education resources (OER) system: a comparative analysis and implementation approach

Authors: Nimol Thuon, Wangrui Zhang

Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[191] arXiv:2405.16434 (cross-list from cs.AI) [pdf, other]: Title: The Importance of Directional Feedback for LLM-based Optimizers

Authors: Allen Nie, Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Comments: Presented at Foundation Models for Decision Making at NeurIPS 2023

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[192] arXiv:2405.16413 (cross-list from cs.AI) [pdf, other]: Title: Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models

Authors: Jiankun Wang, Sumyeong Ahn, Taykhoom Dalal, Xiaodan Zhang, Weishen Pan, Qiannan Zhang, Bin Chen, Hiroko H. Dodge, Fei Wang, Jiayu Zhou

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Applications (stat.AP)
[193] arXiv:2405.16411 (cross-list from cs.LG) [pdf, other]: Title: Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers

Authors: Jiuxiang Gu, Yingyu Liang, Zhenmei Shi, Zhao Song, Yufa Zhou

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[194] arXiv:2405.16406 (cross-list from cs.LG) [pdf, other]: Title: SpinQuant -- LLM quantization with learned rotations

Authors: Zechun Liu, Changsheng Zhao, Igor Fedorov, Bilge Soran, Dhruv Choudhary, Raghuraman Krishnamoorthi, Vikas Chandra, Yuandong Tian, Tijmen Blankevoort

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[195] arXiv:2405.16247 (cross-list from cs.AI) [pdf, other]: Title: AutoManual: Generating Instruction Manuals by LLM Agents via Interactive Environmental Learning

Authors: Minghao Chen, Yihang Li, Yanting Yang, Shiyu Yu, Binbin Lin, Xiaofei He

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[196] arXiv:2405.16205 (cross-list from cs.AI) [pdf, ps, other]: Title: GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases

Authors: Zhizheng Wang, Qiao Jin, Chih-Hsuan Wei, Shubo Tian, Po-Ting Lai, Qingqing Zhu, Chi-Ping Day, Christina Ross, Zhiyong Lu

Comments: 30 pages with 10 figures and/or tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[197] arXiv:2405.16136 (cross-list from cs.AI) [pdf, other]: Title: C3LLM: Conditional Multimodal Content Generation Using Large Language Models

Authors: Zixuan Wang, Qinkai Duan, Yu-Wing Tai, Chi-Keung Tang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[198] arXiv:2405.16128 (cross-list from cs.AI) [pdf, other]: Title: How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect

Authors: Siddhartha K. Vemuri, Raj Sanjay Shah, Sashank Varma

Comments: To appear at CogSci 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[199] arXiv:2405.16122 (cross-list from cs.AI) [pdf, other]: Title: Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars

Authors: Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

Comments: 23 pages, 1 figure, 23 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[200] arXiv:2405.16043 (cross-list from cs.LG) [pdf, other]: Title: Theoretical Analysis of Weak-to-Strong Generalization

Authors: Hunter Lang, David Sontag, Aravindan Vijayaraghavan

Comments: 36 pages, 3 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[201] arXiv:2405.15973 (cross-list from cs.CV) [pdf, other]: Title: Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

Authors: Xiyao Wang, Jiuhai Chen, Zhaoyang Wang, Yuhang Zhou, Yiyang Zhou, Huaxiu Yao, Tianyi Zhou, Tom Goldstein, Parminder Bhatia, Furong Huang, Cao Xiao

Comments: 15 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[202] arXiv:2405.15943 (cross-list from cs.LG) [pdf, other]: Title: Transformers represent belief state geometry in their residual stream

Authors: Adam S. Shai, Sarah E. Marzen, Lucas Teixeira, Alexander Gietelink Oldenziel, Paul M. Riechers

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[203] arXiv:2405.15902 (cross-list from cs.CR) [pdf, other]: Title: Hacc-Man: An Arcade Game for Jailbreaking LLMs

Authors: Matheus Valentim, Jeanette Falk, Nanna Inie

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[204] arXiv:2405.15877 (cross-list from cs.LG) [pdf, other]: Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications

Authors: Yang Li, Changsheng Zhao, Hyungtak Lee, Ernie Chang, Yangyang Shi, Vikas Chandra

Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[205] arXiv:2405.15793 (cross-list from cs.SE) [pdf, other]: Title: SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

Authors: John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press

Comments: First two authors contributed equally. Code and demo at this https URL

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[206] arXiv:2405.15787 (cross-list from cs.IR) [pdf, ps, other]: Title: Extracting chemical food safety hazards from the scientific literature automatically using large language models

Authors: Neris Özen, Wenjuan Mu, Esther D. van Asselt, Leonieke M. van den Bulk

Comments: 31 pages, 5 figures

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[207] arXiv:2405.15784 (cross-list from cs.IR) [pdf, other]: Title: CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

Authors: Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Mon, 27 May 2024 (showing first 59 of 72 entries)

[208] arXiv:2405.15765 [pdf, other]: Title: Scaling Laws for Discriminative Classification in Large Language Models

Authors: Dean Wyatte, Fatemeh Tahmasbi, Ming Li, Thomas Markovich

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[209] arXiv:2405.15760 [pdf, other]: Title: GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction

Authors: Virginia K. Felkner, Jennifer A. Thompson, Jonathan May

Comments: Accepted to ACL 2024 (main conference)

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[210] arXiv:2405.15750 [pdf, other]: Title: Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence

Authors: Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld

Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see this http URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[211] arXiv:2405.15708 [pdf, other]: Title: EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences

Authors: Jocelyn Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park

Comments: Accepted to ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[212] arXiv:2405.15640 [pdf, other]: Title: GECKO: Generative Language Model for English, Code and Korean

Authors: Sungwoo Oh, Donggyu Kim

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[213] arXiv:2405.15604 [pdf, other]: Title: Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges

Authors: Jonas Becker, Jan Philip Wahle, Bela Gipp, Terry Ruas

Comments: 35 pages, 2 figures, 2 tables, Under review

Subjects: Computation and Language (cs.CL)
[214] arXiv:2405.15590 [pdf, ps, other]: Title: Profiling checkpointing schedules in adjoint ST-AD

Authors: Laurent Hascoët, Jean-Luc Bouchot, Shreyas Sunil Gaikwad, Sri Hari Krishna Narayanan, Jan Hückelheim

Subjects: Computation and Language (cs.CL)
[215] arXiv:2405.15585 [pdf, other]: Title: Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems

Authors: Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam

Subjects: Computation and Language (cs.CL)
[216] arXiv:2405.15525 [pdf, other]: Title: Sparse Matrix in Large Language Model Fine-tuning

Authors: Haoze He, Juncheng Billy Li, Xuan Jiang, Heather Miller

Comments: 14 pages

Subjects: Computation and Language (cs.CL)
[217] arXiv:2405.15523 [pdf, other]: Title: Mosaic Memory: Fuzzy Duplication in Copyright Traps for Large Language Models

Authors: Igor Shilov, Matthieu Meeus, Yves-Alexandre de Montjoye

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[218] arXiv:2405.15471 [pdf, other]: Title: Emergence of a High-Dimensional Abstraction Phase in Language Transformers

Authors: Emily Cheng, Diego Doimo, Corentin Kervadec, Iuri Macocco, Jade Yu, Alessandro Laio, Marco Baroni

Subjects: Computation and Language (cs.CL)
[219] arXiv:2405.15454 [pdf, other]: Title: Linearly Controlled Language Generation with Performative Guarantees

Authors: Emily Cheng, Marco Baroni, Carmen Amo Alonso

Subjects: Computation and Language (cs.CL); Systems and Control (eess.SY)
[220] arXiv:2405.15453 [pdf, other]: Title: Benchmarking Pre-trained Large Language Models' Potential Across Urdu NLP tasks

Authors: Munief Hassan Tahir, Sana Shams, Layba Fiaz, Farah Adeeba, Sarmad Hussain

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[221] arXiv:2405.15452 [pdf, other]: Title: Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top

Authors: Keyuan Cheng, Muhammad Asif Ali, Shu Yang, Gang Lin, Yuxuan Zhai, Haoyang Fei, Ke Xu, Lu Yu, Lijie Hu, Di Wang

Comments: 18 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[222] arXiv:2405.15370 [pdf, other]: Title: Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection

Authors: Jun Liu, Chaoyun Zhang, Jiaxu Qian, Minghua Ma, Si Qin, Chetan Bansal, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

Subjects: Computation and Language (cs.CL)
[223] arXiv:2405.15349 [pdf, other]: Title: UnKE: Unstructured Knowledge Editing in Large Language Models

Authors: Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

Subjects: Computation and Language (cs.CL)
[224] arXiv:2405.15346 [pdf, other]: Title: BiSup: Bidirectional Quantization Error Suppression for Large Language Models

Authors: Minghui Zou, Ronghui Guo, Sai Zhang, Xiaowang Zhang, Zhiyong Feng

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[225] arXiv:2405.15334 [pdf, other]: Title: Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation

Authors: Shuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang Ni

Subjects: Computation and Language (cs.CL)
[226] arXiv:2405.15329 [pdf, other]: Title: Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework

Authors: Minzhi Li, Zhengyuan Liu, Shumin Deng, Shafiq Joty, Nancy F. Chen, Min-Yen Kan

Subjects: Computation and Language (cs.CL)
[227] arXiv:2405.15320 [pdf, other]: Title: Organic Data-Driven Approach for Turkish Grammatical Error Correction and LLMs

Authors: Asım Ersoy, Olcay Taner Yıldız

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[228] arXiv:2405.15319 [pdf, other]: Title: Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

Authors: Wenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu

Comments: Preprint; The project link: $\href{https://llm-stacking.github.io/}{this https URL}$

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[229] arXiv:2405.15318 [pdf, other]: Title: Are Long-LLMs A Necessity For Long-Context Tasks?

Authors: Hongjin Qian, Zheng Liu, Peitian Zhang, Kelong Mao, Yujia Zhou, Xu Chen, Zhicheng Dou

Comments: 18 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[230] arXiv:2405.15307 [pdf, other]: Title: Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation

Authors: Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng

Comments: Accepted to ACL Findings 2024

Subjects: Computation and Language (cs.CL)
[231] arXiv:2405.15306 [pdf, other]: Title: DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Authors: Jonas Belouadi, Simone Paolo Ponzetto, Steffen Eger

Comments: Project page: this https URL

Subjects: Computation and Language (cs.CL)
[232] arXiv:2405.15208 [pdf, other]: Title: Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs

Authors: Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong

Comments: Accepted for publication at LREC-COLING 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[233] arXiv:2405.15202 [pdf, other]: Title: Cross-Task Defense: Instruction-Tuning LLMs for Content Safety

Authors: Yu Fu, Wen Xiao, Jia Chen, Jiachen Li, Evangelos Papalexakis, Aichi Chien, Yue Dong

Comments: accepted to NAACL2024 TrustNLP workshop

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[234] arXiv:2405.15198 [pdf, other]: Title: RAEE: A Training-Free Retrieval-Augmented Early Exiting Framework for Efficient Inference

Authors: Lianming Huang, Shangyu Wu, Yufei Cui, Ying Xiong, Xue Liu, Tei-Wei Kuo, Nan Guan, Chun Jason Xue

Subjects: Computation and Language (cs.CL)
[235] arXiv:2405.15185 [pdf, other]: Title: An Evaluation of Estimative Uncertainty in Large Language Models

Authors: Zhisheng Tang, Ke Shen, Mayank Kejriwal

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[236] arXiv:2405.15179 [pdf, other]: Title: VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks

Authors: Yang Li, Shaobo Han, Shihao Ji

Subjects: Computation and Language (cs.CL)
[237] arXiv:2405.15165 [pdf, other]: Title: A Solution-based LLM API-using Methodology for Academic Information Seeking

Authors: Yuanchun Wang, Jifan Yu, Zijun Yao, Jing Zhang, Yuyang Xie, Shangqing Tu, Yiyang Fu, Youhe Feng, Jinkai Zhang, Jingyao Zhang, Bowen Huang, Yuanyao Li, Huihui Yuan, Lei Hou, Juanzi Li, Jie Tang

Comments: 22 pages, 13 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[238] arXiv:2405.15152 [pdf, other]: Title: Machine Unlearning in Large Language Models

Authors: Saaketh Koundinya Gundavarapu, Shreya Agarwal, Arushi Arora, Chandana Thimmalapura Jagadeeshaiah

Comments: 10 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[239] arXiv:2405.15134 [pdf, other]: Title: Efficient Biomedical Entity Linking: Clinical Text Standardization with Low-Resource Techniques

Authors: Akshit Achara, Sanand Sasidharan, Gagan N

Subjects: Computation and Language (cs.CL)
[240] arXiv:2405.15122 [pdf, other]: Title: Generalizable and Scalable Multistage Biomedical Concept Normalization Leveraging Large Language Models

Authors: Nicholas J Dobbins

Subjects: Computation and Language (cs.CL)
[241] arXiv:2405.15110 [pdf, other]: Title: CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems

Authors: Abbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi

Comments: To appear in Findings ACL 2024

Subjects: Computation and Language (cs.CL)
[242] arXiv:2405.15097 [pdf, other]: Title: Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding

Authors: Suyoung Kim, Jiyeon Hwang, Ho-Young Jung

Comments: Accepted NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[243] arXiv:2405.15077 [pdf, other]: Title: Eliciting Informative Text Evaluations with Large Language Models

Authors: Yuxuan Lu, Shengwei Xu, Yichi Zhang, Yuqing Kong, Grant Schoenebeck

Comments: Accepted by the Twenty-Fifth ACM Conference on Economics and Computation (EC'24)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[244] arXiv:2405.15071 [pdf, other]: Title: Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Authors: Boshi Wang, Xiang Yue, Yu Su, Huan Sun

Comments: 22 pages, 16 figures. Code and data: this https URL

Subjects: Computation and Language (cs.CL)
[245] arXiv:2405.15070 [pdf, other]: Title: Optimizing example selection for retrieval-augmented machine translation with translation memories

Authors: Maxime Bouthors, Josep Crego, François Yvon

Comments: TALN conference, French, 10 pages, 7 figures

Subjects: Computation and Language (cs.CL)
[246] arXiv:2405.15067 [pdf, other]: Title: Promoting Constructive Deliberation: Reframing for Receptiveness

Authors: Gauri Kambhatla, Matthew Lease, Ashwin Rajadesingan

Subjects: Computation and Language (cs.CL)
[247] arXiv:2405.15064 [pdf, other]: Title: Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning

Authors: Fangjun Li, David C. Hogg, Anthony G. Cohn

Comments: Camera-Ready version for IJCAI 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[248] arXiv:2405.15039 [pdf, other]: Title: CEEBERT: Cross-Domain Inference in Early Exit BERT

Authors: Divya Jyoti Bajpai, Manjesh Kumar Hanawal

Comments: Accepted at ACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[249] arXiv:2405.15032 [pdf, other]: Title: Aya 23: Open Weight Releases to Further Multilingual Progress

Authors: Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Kelly Marchisio, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

Subjects: Computation and Language (cs.CL)
[250] arXiv:2405.15028 [pdf, other]: Title: AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

Authors: Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[251] arXiv:2405.15012 [pdf, other]: Title: Extracting Prompts by Inverting LLM Outputs

Authors: Collin Zhang, John X. Morris, Vitaly Shmatikov

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[252] arXiv:2405.15007 [pdf, other]: Title: RE-Adapt: Reverse Engineered Adaptation of Large Language Models

Authors: William Fleshman, Benjamin Van Durme

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[253] arXiv:2405.14992 [pdf, other]: Title: Linking In-context Learning in Transformers to Human Episodic Memory

Authors: Li Ji-An, Corey Y. Zhou, Marcus K. Benna, Marcelo G. Mattar

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[254] arXiv:2405.14962 [pdf, ps, other]: Title: Data Augmentation Method Utilizing Template Sentences for Variable Definition Extraction

Authors: Kotaro Nagayama, Shota Kato, Manabu Kano

Subjects: Computation and Language (cs.CL)
[255] arXiv:2405.14899 [pdf, other]: Title: DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

Authors: Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[256] arXiv:2405.15766 (cross-list from cs.AI) [pdf, other]: Title: Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development

Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal

Comments: ACL Findings 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2405.15729 (cross-list from cs.SE) [pdf, other]: Title: Optimizing Large Language Models for OpenAPI Code Completion

Authors: Bohdan Petryshyn, Mantas Lukoševičius

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[258] arXiv:2405.15683 (cross-list from cs.CV) [pdf, other]: Title: VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap

Authors: Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha

Comments: Preprint. Under review. Code will be released on paper acceptance

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[259] arXiv:2405.15638 (cross-list from cs.CV) [pdf, other]: Title: M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models

Authors: Hongyu Wang, Jiayu Xu, Senwei Xie, Ruiping Wang, Jialin Li, Zhaojie Xie, Bin Zhang, Chuyan Xiong, Xilin Chen

Comments: Work in progress

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[260] arXiv:2405.15556 (cross-list from cs.LG) [pdf, other]: Title: Certifiably Robust RAG against Retrieval Corruption

Authors: Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[261] arXiv:2405.15485 (cross-list from cs.AI) [pdf, other]: Title: Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs

Authors: Siyuan Guo, Aniket Didolkar, Nan Rosemary Ke, Anirudh Goyal, Ferenc Huszár, Bernhard Schölkopf

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[262] arXiv:2405.15374 (cross-list from cs.IR) [pdf, other]: Title: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

Authors: Runsong Jia, Bowen Zhang, Sergio J. Rodríguez Méndez, Pouya G. Omran

Comments: for the associated repository, see this http URL

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[263] arXiv:2405.15362 (cross-list from cs.LG) [pdf, other]: Title: Pipeline Parallelism with Controllable Memory

Authors: Penghui Qi, Xinyi Wan, Nyamdavaa Amar, Min Lin

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[264] arXiv:2405.15302 (cross-list from cs.AI) [pdf, other]: Title: Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation

Authors: Zhiwei Wang, Yunji Wang, Zhongwang Zhang, Zhangchen Zhou, Hui Jin, Tianyang Hu, Jiacheng Sun, Zhenguo Li, Yaoyu Zhang, Zhi-Qin John Xu

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[265] arXiv:2405.15232 (cross-list from cs.CV) [pdf, other]: Title: DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Authors: Run Luo, Yunshui Li, Longze Chen, Wanwei He, Ting-En Lin, Ziqiang Liu, Lei Zhang, Zikai Song, Xiaobo Xia, Tongliang Liu, Min Yang, Binyuan Hui

Comments: 25 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[266] arXiv:2405.15216 (cross-list from cs.LG) [pdf, other]: Title: Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Authors: Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly

Comments: under review

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)

[ total of 515 entries: 1-100 | 67-166 | 167-266 | 267-366 | 367-466 | 467-515 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)

> cs > cs.CL

Computation and Language

Authors and titles for recent submissions, skipping first 166

Tue, 28 May 2024 (continued, showing last 41 of 126 entries)

Mon, 27 May 2024 (showing first 59 of 72 entries)