Computation and Language

Authors and titles for recent submissions, skipping first 127

[ total of 497 entries: 1-75 | 53-127 | 128-202 | 203-277 | 278-352 | 353-427 | 428-497 ]
[ showing 75 entries per page: fewer | more | all ]

Thu, 6 Jun 2024 (continued, showing last 47 of 90 entries)

[128] arXiv:2406.02886 [pdf, other]: Title: PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

Authors: Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Haorui Wang, Zhen Qin, Feng Han, Jialu Liu, Simon Baumgartner, Michael Bendersky, Chao Zhang

Comments: Findings of ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129] arXiv:2406.02882 [pdf, other]: Title: Outdated Issue Aware Decoding for Factual Knowledge Editing

Authors: Zengkui Sun, Yijin Liu, Jiaan Wang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

Comments: ACL2024 Findings, Codes are at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130] arXiv:2406.02876 [pdf, other]: Title: LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation

Authors: Zengkui Sun, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

Comments: ACL2024 Findings, Codes are at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131] arXiv:2406.02864 [pdf, other]: Title: NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models

Authors: Ancheng Xu, Minghuan Tan, Lei Wang, Min Yang, Ruifeng Xu

Comments: Findings of ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132] arXiv:2406.02863 [pdf, ps, other]: Title: LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation

Authors: Yi-Pei Chen, KuanChao Chu, Hideki Nakayama

Comments: Presented in AAAI 2024 Spring Symposium. The first two authors contributed equally

Subjects: Computation and Language (cs.CL)
[133] arXiv:2406.02856 [pdf, other]: Title: Xmodel-LM Technical Report

Authors: Yichuan Wang, Yang Liu, Yu Yan, Xucheng Huang, Ling Jiang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134] arXiv:2406.02832 [pdf, other]: Title: Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms

Authors: Firas Trabelsi, David Vilar, Mara Finkelstein, Markus Freitag

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[135] arXiv:2406.02830 [pdf, other]: Title: Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies

Authors: Changye Li, Zhecheng Sheng, Trevor Cohen, Serguei Pakhomov

Comments: Accepted to ACL 2024 findings

Subjects: Computation and Language (cs.CL)
[136] arXiv:2406.02826 [pdf, other]: Title: Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes

Authors: Yu-Wen Chen, Julia Hirschberg

Comments: Clinical NLP Workshop 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[137] arXiv:2406.02818 [pdf, other]: Title: Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Authors: Yusen Zhang, Ruoxi Sun, Yanfei Chen, Tomas Pfister, Rui Zhang, Sercan Ö. Arik

Comments: 19 pages, 6 figures

Subjects: Computation and Language (cs.CL)
[138] arXiv:2406.02787 [pdf, other]: Title: Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities

Authors: Wenyue Hua, Kaijie Zhu, Lingyao Li, Lizhou Fan, Shuhang Lin, Mingyu Jin, Haochen Xue, Zelong Li, JinDong Wang, Yongfeng Zhang

Comments: 22 pages, 9 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[139] arXiv:2406.02756 [pdf, other]: Title: Aligning Large Language Models via Fine-grained Supervision

Authors: Dehong Xu, Liang Qiu, Minseok Kim, Faisal Ladhak, Jaeyoung Do

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[140] arXiv:2406.02746 [pdf, other]: Title: RATT: A Thought Structure for Coherent and Correct LLM Reasoning

Authors: Jinghan Zhang, Xiting Wang, Weijieying Ren, Lu Jiang, Dongjie Wang, Kunpeng Liu

Subjects: Computation and Language (cs.CL)
[141] arXiv:2406.02733 [pdf, other]: Title: Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation

Authors: Min-Jae Hwang, Ilia Kulikov, Benjamin Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee

Comments: Accepted to ACL 2024 (findings)

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142] arXiv:2406.02721 [pdf, other]: Title: Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller

Authors: Min Cai, Yuchen Zhang, Shichang Zhang, Fan Yin, Difan Zou, Yisong Yue, Ziniu Hu

Comments: 41 pages, 12 figures, 61 tables; Website: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[143] arXiv:2406.02657 [pdf, other]: Title: Block Transformer: Global-to-Local Language Modeling for Fast Inference

Authors: Namgyu Ho, Sangmin Bae, Taehyeon Kim, Hyunjik Jo, Yireun Kim, Tal Schuster, Adam Fisch, James Thorne, Se-Young Yun

Comments: 30 pages, 21 figures, 5 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[144] arXiv:2406.02577 [pdf, other]: Title: Are PPO-ed Language Models Hackable?

Authors: Suraj Anand, David Getzen

Comments: 8 pages, 4 figures

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[145] arXiv:2406.02575 [pdf, other]: Title: Cross-Modal Safety Alignment: Is textual unlearning all you need?

Authors: Trishna Chakraborty, Erfan Shayegani, Zikui Cai, Nael Abu-Ghazaleh, M. Salman Asif, Yue Dong, Amit K. Roy-Chowdhury, Chengyu Song

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[146] arXiv:2406.03482 (cross-list from cs.LG) [pdf, other]: Title: QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

Authors: Amir Zandieh, Majid Daliri, Insu Han

Comments: 13 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Performance (cs.PF)
[147] arXiv:2406.03476 (cross-list from cs.LG) [pdf, other]: Title: Does your data spark joy? Performance gains from domain upsampling at the end of training

Authors: Cody Blakeney, Mansheej Paul, Brett W. Larsen, Sean Owen, Jonathan Frankle

Comments: The first three authors contributed equally

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[148] arXiv:2406.03445 (cross-list from cs.LG) [pdf, other]: Title: Pre-trained Large Language Models Use Fourier Features to Compute Addition

Authors: Tianyi Zhou, Deqing Fu, Vatsal Sharan, Robin Jia

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[149] arXiv:2406.03299 (cross-list from cs.AI) [pdf, other]: Title: The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games

Authors: Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu, Maria Glushanina, Mikhail Baklashkin, Andrey V. Savchenko, Ilya Makarov

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[150] arXiv:2406.03287 (cross-list from cs.NE) [pdf, other]: Title: SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

Authors: Xingrun Xing, Zheng Zhang, Ziyi Ni, Shitao Xiao, Yiming Ju, Siqi Fan, Yequan Wang, Jiajun Zhang, Guoqi Li

Subjects: Neural and Evolutionary Computing (cs.NE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[151] arXiv:2406.03280 (cross-list from cs.LG) [pdf, other]: Title: FusionBench: A Comprehensive Benchmark of Deep Model Fusion

Authors: Anke Tang, Li Shen, Yong Luo, Han Hu, Bo Do, Dacheng Tao

Comments: Project homepage: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[152] arXiv:2406.03248 (cross-list from cs.IR) [pdf, other]: Title: Large Language Models as Evaluators for Recommendation Explanations

Authors: Xiaoyu Zhang, Yishan Li, Jiayin Wang, Bowen Sun, Weizhi Ma, Peijie Sun, Min Zhang

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[153] arXiv:2406.03068 (cross-list from cs.LG) [pdf, other]: Title: How Truncating Weights Improves Reasoning in Language Models

Authors: Lei Chen, Joan Bruna, Alberto Bietti

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[154] arXiv:2406.03008 (cross-list from cs.CV) [pdf, other]: Title: DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences

Authors: Yidong Huang, Jacob Sansom, Ziqiao Ma, Felix Gervits, Joyce Chai

Comments: First Vision and Language for Autonomous Driving and Robotics Workshop (VLADR @ CVPR 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[155] arXiv:2406.02969 (cross-list from cs.LG) [pdf, other]: Title: Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models

Authors: Raeid Saqur, Anastasis Kratsios, Florian Krach, Yannick Limmer, Jacob-Junqi Tian, John Willes, Blanka Horvath, Frank Rudzicz

Comments: 29 pages, 5 Appendix sections

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computational Finance (q-fin.CP); Mathematical Finance (q-fin.MF)
[156] arXiv:2406.02958 (cross-list from cs.LG) [pdf, other]: Title: PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

Authors: Charlie Hou, Akshat Shrivastava, Hongyuan Zhan, Rylan Conway, Trang Le, Adithya Sagar, Giulia Fanti, Daniel Lazar

Comments: ICML 2024 (Oral)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[157] arXiv:2406.02950 (cross-list from eess.AS) [pdf, other]: Title: 4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders

Authors: Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Brian Yan, Jiatong Shi, Yifan Peng, Shinji Watanabe

Comments: submitted to IEEE/ACM Transactions on Audio Speech and Language Processing

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[158] arXiv:2406.02943 (cross-list from cs.IR) [pdf, ps, other]: Title: The Task-oriented Queries Benchmark (ToQB)

Authors: Keun Soo Yim

Comments: Data available on GitHub, this https URL

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Neural and Evolutionary Computing (cs.NE)
[159] arXiv:2406.02925 (cross-list from eess.AS) [pdf, other]: Title: SYN2REAL: Leveraging Task Arithmetic for Mitigating Synthetic-Real Discrepancies in ASR Domain Adaptation

Authors: Hsuan Su, Hua Farn, Shang-Tse Chen, Hung-yi Lee

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[160] arXiv:2406.02924 (cross-list from cs.LG) [pdf, other]: Title: Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models

Authors: Peijie Dong, Lujun Li, Zhenheng Tang, Xiang Liu, Xinglin Pan, Qiang Wang, Xiaowen Chu

Comments: Accepted by ICML2024, 29 pages, 4 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[161] arXiv:2406.02900 (cross-list from cs.LG) [pdf, other]: Title: Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Authors: Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, Bradley Knox, Chelsea Finn, Scott Niekum

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[162] arXiv:2406.02844 (cross-list from cs.IR) [pdf, other]: Title: Item-Language Model for Conversational Recommendation

Authors: Li Yang, Anushya Subbiah, Hardik Patel, Judith Yue Li, Yanwei Song, Reza Mirghaderi, Vikram Aggarwal

Comments: 15 pages, 3 figures

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[163] arXiv:2406.02804 (cross-list from cs.AI) [pdf, other]: Title: $\texttt{ACCORD}$: Closing the Commonsense Measurability Gap

Authors: François Roewer-Després, Jinyue Feng, Zining Zhu, Frank Rudzicz

Comments: For leaderboard and dataset download, see this https URL. For source code, see this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[164] arXiv:2406.02798 (cross-list from cs.DL) [pdf, ps, other]: Title: Promotional Language and the Adoption of Innovative Ideas in Science

Authors: Hao Peng, Huilian Sophie Qiu, Henrik Barslund Fosse, Brian Uzzi

Subjects: Digital Libraries (cs.DL); Computation and Language (cs.CL); Computers and Society (cs.CY)
[165] arXiv:2406.02795 (cross-list from cs.HC) [pdf, other]: Title: ArguMentor: Augmenting User Experiences with Counter-Perspectives

Authors: Priya Pitre, Kurt Luther

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[166] arXiv:2406.02791 (cross-list from cs.AI) [pdf, other]: Title: Language Models can Infer Action Semantics for Classical Planners from Environment Feedback

Authors: Wang Zhu, Ishika Singh, Robin Jia, Jesse Thomason

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[167] arXiv:2406.02592 (cross-list from cs.LG) [pdf, other]: Title: LOLAMEME: Logic, Language, Memory, Mechanistic Framework

Authors: Jay Desai, Xiaobo Guo, Srinivasan H. Sengamedu

Comments: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168] arXiv:2406.02566 (cross-list from eess.AS) [pdf, other]: Title: Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition

Authors: Ognjen Kundacina, Vladimir Vincan, Dragisa Miskovic

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[169] arXiv:2406.02565 (cross-list from cs.SD) [pdf, other]: Title: Sequence-to-sequence models in peer-to-peer learning: A practical application

Authors: Robert Šajina, Ivo Ipšić

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Multiagent Systems (cs.MA); Audio and Speech Processing (eess.AS)
[170] arXiv:2406.02563 (cross-list from eess.AS) [pdf, other]: Title: A cost minimization approach to fix the vocabulary size in a tokenizer for an End-to-End ASR system

Authors: Sunil Kumar Kopparapu, Ashish Panda

Comments: 5 pages, 4 figures

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[171] arXiv:2406.02562 (cross-list from eess.AS) [pdf, other]: Title: Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices

Authors: Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko

Comments: Table 2 is revised

Journal-ref: ICASSP 2024 Workshop(HSCMA 2024) paper

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172] arXiv:2406.02560 (cross-list from eess.AS) [pdf, other]: Title: Less Peaky and More Accurate CTC Forced Alignment by Label Priors

Authors: Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur

Comments: Accepted by ICASSP 2024. Github repo: this https URL

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[173] arXiv:2406.02555 (cross-list from eess.AS) [pdf, ps, other]: Title: PhoWhisper: Automatic Speech Recognition for Vietnamese

Authors: Thanh-Thien Le, Linh The Nguyen, Dat Quoc Nguyen

Comments: Accepted to ICLR 2024 Tiny Papers Track

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[174] arXiv:2406.02554 (cross-list from eess.AS) [pdf, other]: Title: Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition

Authors: Shijian Deng, Erin E. Kosloski, Siddhi Patel, Zeke A. Barnett, Yiyang Nan, Alexander Kaplan, Sisira Aarukapalli, William T. Doan, Matthew Wang, Harsh Singh, Pamela R. Rollins, Yapeng Tian

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)

Wed, 5 Jun 2024 (showing first 28 of 93 entries)

[175] arXiv:2406.02537 [pdf, other]: Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners

Authors: Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić

Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[176] arXiv:2406.02536 [pdf, other]: Title: Mitigate Position Bias in Large Language Models via Scaling a Single Dimension

Authors: Yijiong Yu, Huiqiang Jiang, Xufang Luo, Qianhui Wu, Chin-Yew Lin, Dongsheng Li, Yuqing Yang, Yongfeng Huang, Lili Qiu

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[177] arXiv:2406.02532 [pdf, other]: Title: SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices

Authors: Ruslan Svirschevski, Avner May, Zhuoming Chen, Beidi Chen, Zhihao Jia, Max Ryabinin

Comments: preprint. arXiv admin note: text overlap with arXiv:2312.17238 by other authors

Subjects: Computation and Language (cs.CL)
[178] arXiv:2406.02528 [pdf, other]: Title: Scalable MatMul-free Language Modeling

Authors: Rui-Jie Zhu, Yu Zhang, Ethan Sifferman, Tyler Sheaves, Yiqiao Wang, Dustin Richmond, Peng Zhou, Jason K. Eshraghian

Subjects: Computation and Language (cs.CL)
[179] arXiv:2406.02524 [pdf, other]: Title: CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks

Authors: Maciej Besta, Lorenzo Paleari, Ales Kubicek, Piotr Nyczyk, Robert Gerstenberger, Patrick Iff, Tomasz Lehmann, Hubert Niewiadomski, Torsten Hoefler

Subjects: Computation and Language (cs.CL)
[180] arXiv:2406.02517 [pdf, other]: Title: Deterministic Reversible Data Augmentation for Neural Machine Translation

Authors: Jiashu Yao, Heyan Huang, Zeming Liu, Yuhang Guo

Comments: Findings of ACL 2024

Subjects: Computation and Language (cs.CL)
[181] arXiv:2406.02481 [pdf, other]: Title: Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion

Authors: Jakub Hoscilowicz, Pawel Popiolek, Jan Rudkowski, Jedrzej Bieniasz, Artur Janicki

Comments: Work in progress. Code is available at this https URL

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[182] arXiv:2406.02472 [pdf, other]: Title: Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding

Authors: Zhihan Zhang, Yixin Cao, Chenchen Ye, Yunshan Ma, Lizi Liao, Tat-Seng Chua

Comments: Accepted to ACL 2024

Subjects: Computation and Language (cs.CL)
[183] arXiv:2406.02449 [pdf, other]: Title: Representations as Language: An Information-Theoretic Framework for Interpretability

Authors: Henry Conklin, Kenny Smith

Comments: 6 pages, 3 Figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[184] arXiv:2406.02396 [pdf, other]: Title: The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding

Authors: Kenneth Enevoldsen, Márton Kardos, Niklas Muennighoff, Kristoffer Laigaard Nielbo

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[185] arXiv:2406.02394 [pdf, other]: Title: Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data

Authors: Maxime Griot, Jean Vanderdonckt, Demet Yuksel, Coralie Hemptinne

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186] arXiv:2406.02378 [pdf, other]: Title: On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept

Authors: Guangliang Liu, Haitao Mao, Bochuan Cao, Zhiyu Xue, Kristen Johnson, Jiliang Tang, Rongrong Wang

Comments: 22 pages, 7 figures

Subjects: Computation and Language (cs.CL)
[187] arXiv:2406.02376 [pdf, other]: Title: Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs

Authors: Zhiwei Cao, Qian Cao, Yu Lu, Ningxin Peng, Luyang Huang, Shanbo Cheng, Jinsong Su

Comments: Accepted to ACL 2024

Subjects: Computation and Language (cs.CL)
[188] arXiv:2406.02350 [pdf, other]: Title: LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing

Authors: Maojun Sun

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[189] arXiv:2406.02338 [pdf, other]: Title: Linguistic Fingerprint in Transformer Models: How Language Variation Influences Parameter Selection in Irony Detection

Authors: Michele Mastromattei, Fabio Massimo Zanzotto

Journal-ref: Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives) @ LREC-COLING 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[190] arXiv:2406.02335 [pdf, other]: Title: Probing the Category of Verbal Aspect in Transformer Language Models

Authors: Anisia Katinskaia, Roman Yangarber

Subjects: Computation and Language (cs.CL)
[191] arXiv:2406.02331 [pdf, other]: Title: Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering

Authors: ChaeHun Park, Koanho Lee, Hyesu Lim, Jaeseok Kim, Junmo Park, Yu-Jung Heo, Du-Seong Chang, Jaegul Choo

Comments: ACL 2024 Findings Accepted

Subjects: Computation and Language (cs.CL)
[192] arXiv:2406.02329 [pdf, other]: Title: On Affine Homotopy between Language Encoders

Authors: Robin SM Chan, Reda Boumasmoud, Anej Svete, Yuxin Ren, Qipeng Guo, Zhijing Jin, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, Mennatallah El-Assady, Ryan Cotterell

Comments: 10 pages

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[193] arXiv:2406.02325 [pdf, other]: Title: Technical Language Processing for Telecommunications Specifications

Authors: Felipe A. Rodriguez Y.

Comments: Still not published

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[194] arXiv:2406.02301 [pdf, other]: Title: mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models

Authors: Huiyuan Lai, Malvina Nissim

Comments: Accepted to ACL 2024 main

Subjects: Computation and Language (cs.CL)
[195] arXiv:2406.02267 [pdf, ps, other]: Title: Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation

Authors: Nathaniel Berger, Stefan Riezler, Miriam Exel, Matthias Huck

Comments: To appear at The 25th Annual Conference of the European Association for Machine Translation (EAMT 2024)

Subjects: Computation and Language (cs.CL)
[196] arXiv:2406.02266 [pdf, ps, other]: Title: Enhancing Retrieval-Augmented LMs with a Two-stage Consistency Learning Compressor

Authors: Chuankai Xu, Dongming Zhao, Bo Wang, Hanwen Xing

Subjects: Computation and Language (cs.CL)
[197] arXiv:2406.02251 [pdf, other]: Title: Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning

Authors: Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn W. Schuller

Comments: Accepted to ACL 2024 Findings. arXiv admin note: text overlap with arXiv:2212.11382

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[198] arXiv:2406.02245 [pdf, other]: Title: Description Boosting for Zero-Shot Entity and Relation Classification

Authors: Gabriele Picco, Leopold Fuchs, Marcos Martínez Galindo, Alberto Purpura, Vanessa López, Hoang Thanh Lam

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[199] arXiv:2406.02237 [pdf, other]: Title: Self-Modifying State Modeling for Simultaneous Machine Translation

Authors: Donglei Yu, Xiaomian Kang, Yuchen Liu, Yu Zhou, Chengqing Zong

Comments: Accepted to ACL 2024 main conference. 15 pages, 13 figures, 9 tables

Subjects: Computation and Language (cs.CL)
[200] arXiv:2406.02224 [pdf, other]: Title: FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models

Authors: Tao Fan, Guoqiang Ma, Yan Kang, Hanlin Gu, Lixin Fan, Qiang Yang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[201] arXiv:2406.02169 [src]: Title: A multilingual dataset for offensive language and hate speech detection for hausa, yoruba and igbo languages

Authors: Saminu Mohammad Aliyu, Gregory Maksha Wajiga, Muhammad Murtala

Comments: The experimental result was erroneously reported and we also omitted other authors

Subjects: Computation and Language (cs.CL)
[202] arXiv:2406.02148 [pdf, other]: Title: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models

Authors: Qingkai Min, Qipeng Guo, Xiangkun Hu, Songfang Huang, Zheng Zhang, Yue Zhang

Comments: Accepted to ACL-24 Main

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
