Computation and Language

Authors and titles for recent submissions

[ total of 427 entries: 1-156 | 157-312 | 313-427 ]
[ showing 156 entries per page: fewer | more | all ]

Fri, 31 May 2024

[1] arXiv:2405.20335 [pdf, other]: Title: Xwin-LM: Strong and Scalable Alignment Practice for LLMs

Authors: Bolin Ni, JingCheng Hu, Yixuan Wei, Houwen Peng, Zheng Zhang, Gaofeng Meng, Han Hu

Subjects: Computation and Language (cs.CL)
[2] arXiv:2405.20318 [pdf, other]: Title: CausalQuest: Collecting Natural Causal Questions for AI Agents

Authors: Roberto Ceraolo, Dmitrii Kharlapenko, Amélie Reymond, Rada Mihalcea, Mrinmaya Sachan, Bernhard Schölkopf, Zhijing Jin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[3] arXiv:2405.20315 [pdf, other]: Title: ANAH: Analytical Annotation of Hallucinations in Large Language Models

Authors: Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen

Comments: Accepted by ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[4] arXiv:2405.20314 [pdf, ps, other]: Title: S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs

Authors: Wei Zhong, Manasa Bharadwaj

Subjects: Computation and Language (cs.CL)
[5] arXiv:2405.20304 [pdf, other]: Title: Group Robust Preference Optimization in Reward-free RLHF

Authors: Shyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas, Viraj Mehta, Pier Giuseppe Sessa, Haitham Bou Ammar, Ilija Bogunovic

Comments: Preprint

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[6] arXiv:2405.20285 [pdf, other]: Title: Who Writes the Review, Human or AI?

Authors: Panagiotis C. Theocharopoulos, Spiros V. Georgakopoulos, Sotiris K. Tasoulis, Vassilis P. Plagianakos

Subjects: Computation and Language (cs.CL)
[7] arXiv:2405.20274 [pdf, other]: Title: ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection

Authors: Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka, Thamar Solorio

Comments: arXiv admin note: text overlap with arXiv:2309.13297

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2405.20269 [pdf, ps, other]: Title: IsraParlTweet: The Israeli Parliamentary and Twitter Resource

Authors: Guy Mor-Lan, Effi Levi, Tamir Sheafer, Shaul R. Shenhav

Comments: Presented at LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[9] arXiv:2405.20267 [pdf, other]: Title: Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

Authors: Ruochen Zhao, Wenxuan Zhang, Yew Ken Chia, Deli Zhao, Lidong Bing

Subjects: Computation and Language (cs.CL)
[10] arXiv:2405.20253 [pdf, other]: Title: Evaluating Large Language Model Biases in Persona-Steered Generation

Authors: Andy Liu, Mona Diab, Daniel Fried

Comments: Accepted to Findings of ACL 2024. Code and data available at this https URL

Subjects: Computation and Language (cs.CL)
[11] arXiv:2405.20252 [pdf, other]: Title: Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization

Authors: Yuchi Liu, Jaskirat Singh, Gaowen Liu, Ali Payani, Liang Zheng

Subjects: Computation and Language (cs.CL)
[12] arXiv:2405.20245 [pdf, other]: Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use

Authors: Franz Louis Cesista, Rui Aguiar, Jason Kim, Paolo Acilo

Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[13] arXiv:2405.20215 [pdf, other]: Title: TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models

Authors: Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li

Subjects: Computation and Language (cs.CL)
[14] arXiv:2405.20204 [pdf, other]: Title: Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Authors: Andreas Koukounas, Georgios Mastrapas, Michael Günther, Bo Wang, Scott Martens, Isabelle Mohr, Saba Sturua, Mohammad Kalim Akram, Joan Fontanals Martínez, Saahil Ognawala, Susana Guzman, Maximilian Werk, Nan Wang, Han Xiao

Comments: 4 pages, ICML2024 workshop submission

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[15] arXiv:2405.20192 [pdf, other]: Title: TAIA: Large Language Models are Out-of-Distribution Data Learners

Authors: Shuyang Jiang, Yusheng Liao, Ya Zhang, Yu Wang, Yanfeng Wang

Comments: 25 pages

Subjects: Computation and Language (cs.CL)
[16] arXiv:2405.20179 [pdf, other]: Title: Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs

Authors: Zichao Hu, Junyi Jessy Li, Arjun Guha, Joydeep Biswas

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[17] arXiv:2405.20175 [pdf, other]: Title: InstructionCP: A fast approach to transfer Large Language Models into target language

Authors: Kuang-Ming Chen, Hung-yi Lee

Comments: 10 pages, 1 figure

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[18] arXiv:2405.20163 [pdf, other]: Title: Reasoning about concepts with LLMs: Inconsistencies abound

Authors: Rosario Uceda-Sosa, Karthikeyan Natesan Ramamurthy, Maria Chang, Moninder Singh

Comments: 15 pages, 5 figures, 3 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[19] arXiv:2405.20145 [pdf, other]: Title: Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers

Authors: Frederick Riemenschneider, Kevin Krahn

Comments: Accepted for publication at the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP-WS) 2024; 11 pages, 1 figure, 9 tables

Subjects: Computation and Language (cs.CL)
[20] arXiv:2405.20139 [pdf, other]: Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning

Authors: Costas Mavromatis, George Karypis

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[21] arXiv:2405.20131 [pdf, other]: Title: Language Models Need Inductive Biases to Count Inductively

Authors: Yingshan Chang, Yonatan Bisk

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22] arXiv:2405.20092 [pdf, other]: Title: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation

Authors: Jingchang Chen, Hongxuan Tang, Zheng Chu, Qianglong Chen, Zekun Wang, Ming Liu, Bing Qin

Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[23] arXiv:2405.20089 [pdf, other]: Title: The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities

Authors: David Stap, Eva Hasler, Bill Byrne, Christof Monz, Ke Tran

Comments: Accepted to ACL 2024 (long, main)

Subjects: Computation and Language (cs.CL)
[24] arXiv:2405.20079 [pdf, other]: Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

Authors: Elena Grazia Gado, Tommaso Martorella, Luca Zunino, Paola Mejia-Domenzain, Vinitra Swamy, Jibril Frej, Tanja Käser

Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[25] arXiv:2405.20053 [pdf, other]: Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads

Authors: Avelina Asada Hadji-Kyriacou, Ognjen Arandjelovic

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[26] arXiv:2405.19967 [pdf, other]: Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification

Authors: Hossam M. Zawbaa, Wael Rashwan, Sourav Dutta, Haytham Assem

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[27] arXiv:2405.19958 [pdf, other]: Title: Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation

Authors: Yi Liu, Xiangyu Liu, Xiangrong Zhu, Wei Hu

Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[28] arXiv:2405.19874 [pdf, other]: Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?

Authors: Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

Comments: Preprint. Code at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[29] arXiv:2405.19856 [pdf, other]: Title: DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Authors: Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yuqi Zhu, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401

Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[30] arXiv:2405.19846 [pdf, other]: Title: Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model

Authors: Chaochen Gao, Xing Wu, Qi Fu, Songlin Hu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[31] arXiv:2405.19842 [pdf, other]: Title: Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation

Authors: Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32] arXiv:2405.19831 [pdf, other]: Title: Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text

Authors: Stephen Meisenbacher, Florian Matthes

Comments: 10 pages, 2 figures, 2 tables. Accepted to ARES 2024 (IWAPS)

Subjects: Computation and Language (cs.CL)
[33] arXiv:2405.19799 [pdf, other]: Title: Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation

Authors: Jiahui Xu, Feng Jiang, Anningzhe Gao, Haizhou Li

Subjects: Computation and Language (cs.CL)
[34] arXiv:2405.19795 [pdf, other]: Title: SLM as Guardian: Pioneering AI Safety with Small Language Models

Authors: Ohjoon Kwon, Donghyeon Jeon, Nayoung Choi, Gyu-Hwung Cho, Changbong Kim, Hyunwoo Lee, Inho Kang, Sun Kim, Taiwoo Park

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35] arXiv:2405.19793 [pdf, other]: Title: PDDLEGO: Iterative Planning in Textual Environments

Authors: Li Zhang, Peter Jansen, Tianyi Zhang, Peter Clark, Chris Callison-Burch, Niket Tandon

Comments: In *SEM 2024

Subjects: Computation and Language (cs.CL)
[36] arXiv:2405.19787 [pdf, other]: Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers

Authors: Dylan Zhang, Justin Wang, Francois Charton

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[37] arXiv:2405.19778 [pdf, other]: Title: Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona

Authors: Jeiyoon Park, Chanjun Park, Heuiseok Lim

Comments: preprint

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[38] arXiv:2405.19763 [pdf, other]: Title: Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding

Authors: Kuo Liao, Shuang Li, Meng Zhao, Liqun Liu, Mengge Xue, Zhenyu Hu, Honglin Han, Chengguo Yin

Comments: Accept at ACL2024 Main

Subjects: Computation and Language (cs.CL)
[39] arXiv:2405.19744 [pdf, other]: Title: X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions

Authors: Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong

Comments: ACL 2024. Our codes, data and model weights are available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[40] arXiv:2405.19740 [pdf, other]: Title: PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations

Authors: Jiatong Li, Renjun Hu, Kunzhe Huang, Yan Zhuang, Qi Liu, Mengxiao Zhu, Xing Shi, Wei Lin

Comments: 23 pages, 12 figures, 10 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[41] arXiv:2405.19737 [pdf, other]: Title: Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation

Authors: Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42] arXiv:2405.19715 [pdf, other]: Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths

Authors: Kaixuan Huang, Xudong Guo, Mengdi Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[43] arXiv:2405.19701 [pdf, other]: Title: Significance of Chain of Thought in Gender Bias Mitigation for English-Dravidian Machine Translation

Authors: Lavanya Prahallad, Radhika Mamidi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44] arXiv:2405.19670 [pdf, other]: Title: One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models

Authors: Yutao Zhu, Zhaoheng Huang, Zhicheng Dou, Ji-Rong Wen

Comments: working in progress, repo: this https URL

Subjects: Computation and Language (cs.CL)
[45] arXiv:2405.19660 [pdf, other]: Title: PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

Authors: Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Shaun M. Eack, Travis Labrum, Samuel M. Murphy, Nev Jones, Kate Hardy, Hong Shen, Fei Fang, Zhiyu Zoey Chen

Comments: Work in progress

Subjects: Computation and Language (cs.CL)
[46] arXiv:2405.19648 [pdf, other]: Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Authors: Ernesto Quevedo, Jorge Yero, Rachel Koerner, Pablo Rivas, Tomas Cerny

Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[47] arXiv:2405.19635 [pdf, other]: Title: GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment

Authors: Yao Yao, Zuchao Li, Hai Zhao

Subjects: Computation and Language (cs.CL)
[48] arXiv:2405.19575 [pdf, other]: Title: A Deep Convolutional Neural Network-based Model for Aspect and Polarity Classification in Hausa Movie Reviews

Authors: Umar Ibrahim, Abubakar Yakubu Zandam, Fatima Muhammad Adam, Aminu Musa

Comments: To be published in the proceedings of ICCAIT 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[49] arXiv:2405.19563 [pdf, other]: Title: Unlearning Climate Misinformation in Large Language Models

Authors: Michael Fore, Simranjit Singh, Chaehong Lee, Amritanshu Pandey, Antonios Anastasopoulos, Dimitrios Stamoulis

Subjects: Computation and Language (cs.CL)
[50] arXiv:2405.19538 [pdf, other]: Title: CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients

Authors: Pierre Chambon, Jean-Benoit Delbrouck, Thomas Sounack, Shih-Cheng Huang, Zhihong Chen, Maya Varma, Steven QH Truong, Chu The Chuong, Curtis P. Langlotz

Comments: 13 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[51] arXiv:2405.19519 [pdf, other]: Title: Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data

Authors: Sudeshna Das, Yao Ge, Yuting Guo, Swati Rajwal, JaMor Hairston, Jeanne Powell, Drew Walker, Snigdha Peddireddy, Sahithi Lakamana, Selen Bozkurt, Matthew Reyna, Reza Sameni, Yunyu Xiao, Sangmi Kim, Rasheeta Chandler, Natalie Hernandez, Danielle Mowery, Rachel Wightman, Jennifer Love, Anthony Spadaro, Jeanmarie Perrone, Abeed Sarker

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[52] arXiv:2405.19487 [pdf, other]: Title: A Full-duplex Speech Dialogue Scheme Based On Large Language Models

Authors: Peng Wang, Songshuo Lu, Yaohua Tang, Sijie Yan, Yuanjun Xiong, Wei Xia

Subjects: Computation and Language (cs.CL)
[53] arXiv:2405.19462 [pdf, other]: Title: Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning

Authors: Everlyn Asiko Chimoto, Jay Gala, Orevaoghene Ahia, Julia Kreutzer, Bruce A. Bassett, Sara Hooker

Comments: Accepted to ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[54] arXiv:2405.19433 [pdf, other]: Title: Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals

Authors: Yupei Wang, Renfen Hu, Zhe Zhao

Subjects: Computation and Language (cs.CL)
[55] arXiv:2405.19426 [pdf, other]: Title: Deep Learning for Assessment of Oral Reading Fluency

Authors: Mithilesh Vaidya, Binaya Kumar Sahoo, Preeti Rao

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[56] arXiv:2405.19425 [pdf, other]: Title: Adaptive In-conversation Team Building for Language Model Agents

Authors: Linxin Song, Jiale Liu, Jieyu Zhang, Shaokun Zhang, Ao Luo, Shijian Wang, Qingyun Wu, Chi Wang

Subjects: Computation and Language (cs.CL)
[57] arXiv:2405.20341 (cross-list from cs.LG) [pdf, other]: Title: From Zero to Hero: Cold-Start Anomaly Detection

Authors: Tal Reiss, George Kour, Naama Zwerdling, Ateret Anaby-Tavor, Yedid Hoshen

Comments: ACL 2024. Our code is available at this https URL

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[58] arXiv:2405.20309 (cross-list from cs.LG) [pdf, other]: Title: Large Language Models Can Self-Improve At Web Agent Tasks

Authors: Ajay Patel, Markus Hofmarcher, Claudiu Leoveanu-Condrei, Marius-Constantin Dinu, Chris Callison-Burch, Sepp Hochreiter

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[59] arXiv:2405.20271 (cross-list from cs.LG) [pdf, other]: Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Authors: Massimo Bini, Karsten Roth, Zeynep Akata, Anna Khoreva

Comments: Accepted to ICML 2024. Code available at this https URL

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[60] arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]: Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization

Authors: Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas, Varre Chaitanya, Shwetha Somasundaram

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[61] arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]: Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition

Authors: Alaa Nfissi, Wassim Bouachir, Nizar Bouguila, Brian Mishara

Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)

Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[62] arXiv:2405.20101 (cross-list from cs.SD) [pdf, other]: Title: Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting

Authors: Ihab Asaad, Maxime Jacquelin, Olivier Perrotin, Laurent Girin, Thomas Hueber

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[63] arXiv:2405.20003 (cross-list from cs.LG) [pdf, other]: Title: Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities

Authors: Alexander Nikitin, Jannik Kossen, Yarin Gal, Pekka Marttinen

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[64] arXiv:2405.19954 (cross-list from cs.CR) [pdf, other]: Title: GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation

Authors: Ehud Malul, Yair Meidan, Dudu Mimran, Yuval Elovici, Asaf Shabtai

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[65] arXiv:2405.19877 (cross-list from cs.AI) [pdf, other]: Title: KNOW: A Real-World Ontology for Knowledge Capture with Large Language Models

Authors: Arto Bendiken

Comments: 5 pages, 1 figure

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[66] arXiv:2405.19782 (cross-list from cs.SE) [pdf, other]: Title: Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion

Authors: Wei Cheng, Yuhan Wu, Wei Hu

Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[67] arXiv:2405.19732 (cross-list from cs.CV) [pdf, other]: Title: Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization

Authors: Zixian Guo, Ming Liu, Zhilong Ji, Jinfeng Bai, Yiwen Guo, Wangmeng Zuo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[68] arXiv:2405.19716 (cross-list from cs.CV) [pdf, other]: Title: Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Authors: Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, James Zou, Kai-Wei Chang, Wei Wang

Comments: 19 pages, 14 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[69] arXiv:2405.19616 (cross-list from cs.AI) [pdf, other]: Title: Easy Problems That LLMs Get Wrong

Authors: Sean Williams, James Huckle

Comments: AutogenAI Ltd. Associated code at this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[70] arXiv:2405.19597 (cross-list from cs.LG) [pdf, other]: Title: SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors

Authors: Vijay Lingam, Atula Tejaswi, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi

Comments: 17 pages, 5 figures, 14 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[71] arXiv:2405.19592 (cross-list from cs.LG) [pdf, other]: Title: Why Larger Language Models Do In-context Learning Differently?

Authors: Zhenmei Shi, Junyi Wei, Zhuoyan Xu, Yingyu Liang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[72] arXiv:2405.19562 (cross-list from cs.CY) [pdf, other]: Title: Selective Explanations

Authors: Lucas Monteiro Paes, Dennis Wei, Flavio P. Calmon

Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Machine Learning (cs.LG)
[73] arXiv:2405.19561 (cross-list from cs.AI) [pdf, other]: Title: Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models

Authors: Venkat Venkatasubramanian, Arijit Chakraborty

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[74] arXiv:2405.19534 (cross-list from cs.LG) [pdf, other]: Title: Preference Learning Algorithms Do Not Learn Preference Rankings

Authors: Angelica Chen, Sadhika Malladi, Lily H. Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[75] arXiv:2405.19343 (cross-list from cs.SD) [pdf, other]: Title: Luganda Speech Intent Recognition for IoT Applications

Authors: Andrew Katumba, Sudi Murindanyi, John Trevor Kasule, Elvis Mugume

Comments: Presented as a conference paper at ICLR 2024/AfricaNLP

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[76] arXiv:2405.19342 (cross-list from cs.SD) [pdf, other]: Title: Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants

Authors: Chloé Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau, Alice Coucke

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

Thu, 30 May 2024

[77] arXiv:2405.19327 [pdf, other]: Title: MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kaijing Ma, Minghao Liu, Morry Niu, Noah Wang, Quehry Que, Ruibo Liu, Sine Liu, Shawn Guo, Soren Gao, Wangchunshu Zhou, Xinyue Zhang, Yizhi Zhou, Yubo Wang, Yuelin Bai, Yuhan Zhang, Yuxiang Zhang, Zenith Wang, Zhenzhu Yang, Zijian Zhao, Jiajun Zhang, Wanli Ouyang, Wenhao Huang, Wenhu Chen

Comments: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[78] arXiv:2405.19325 [pdf, other]: Title: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution

Authors: Minghan Li, Xilun Chen, Ari Holtzman, Beidi Chen, Jimmy Lin, Wen-tau Yih, Xi Victoria Lin

Subjects: Computation and Language (cs.CL)
[79] arXiv:2405.19323 [pdf, other]: Title: Are Large Language Models Chameleons?

Authors: Mingmeng Geng, Sihong He, Roberto Trotta

Comments: 16 pages,8 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[80] arXiv:2405.19299 [pdf, other]: Title: Expert-Guided Extinction of Toxic Tokens for Debiased Generation

Authors: Xueyao Sun, Kaize Shi, Haoran Tang, Guandong Xu, Qing Li

Subjects: Computation and Language (cs.CL)
[81] arXiv:2405.19290 [pdf, other]: Title: Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation

Authors: Langlin Huang, Yang Feng

Comments: Accepted by ACL2024 Findings

Subjects: Computation and Language (cs.CL)
[82] arXiv:2405.19285 [pdf, other]: Title: MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection

Authors: Michael Regan, Shira Wein, George Baker, Emilio Monti

Subjects: Computation and Language (cs.CL)
[83] arXiv:2405.19266 [pdf, other]: Title: PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications

Authors: Dingkang Yang, Jinjie Wei, Dongling Xiao, Shunli Wang, Tong Wu, Gang Li, Mingcheng Li, Shuaibing Wang, Jiawei Chen, Yue Jiang, Qingyao Xu, Ke Li, Peng Zhai, Lihua Zhang

Comments: A Technical Report on a Powerful Chinese Medical Large Language Model

Subjects: Computation and Language (cs.CL)
[84] arXiv:2405.19265 [pdf, other]: Title: AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data

Authors: Zifan Song, Yudong Wang, Wenwei Zhang, Kuikun Liu, Chengqi Lyu, Demin Song, Qipeng Guo, Hang Yan, Dahua Lin, Kai Chen, Cairong Zhao

Comments: Preprint with 20 pages and 20 figures. Source code and models at this https URL

Subjects: Computation and Language (cs.CL)
[85] arXiv:2405.19262 [pdf, other]: Title: Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models

Authors: Zhanhui Zhou, Zhixuan Liu, Jie Liu, Zhichen Dong, Chao Yang, Yu Qiao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[86] arXiv:2405.19261 [pdf, other]: Title: Faster Cascades via Speculative Decoding

Authors: Harikrishna Narasimhan, Wittawat Jitkrittum, Ankit Singh Rawat, Seungyeon Kim, Neha Gupta, Aditya Krishna Menon, Sanjiv Kumar

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87] arXiv:2405.19222 [pdf, other]: Title: Lower Bounds on the Expressivity of Recurrent Neural Language Models

Authors: Anej Svete, Franz Nowak, Anisha Mohamed Sahabdeen, Ryan Cotterell

Subjects: Computation and Language (cs.CL)
[88] arXiv:2405.19220 [pdf, other]: Title: WRDScore: New Metric for Evaluation of Natural Language Generation Models

Authors: Ravil Mussabayev

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89] arXiv:2405.19139 [pdf, other]: Title: DGRC: An Effective Fine-tuning Framework for Distractor Generation in Chinese Multi-choice Reading Comprehension

Authors: Runfeng Lin, Dacheng Xu, Huijiang Wang, Zebiao Chen, Yating Wang, Shouqiang Liu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[90] arXiv:2405.19109 [pdf, other]: Title: PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering

Authors: Fangzhi Xu, Qika Lin, Tianzhe Zhao, Jiawei Han, Jun Liu

Comments: Accepted by ACL 2024

Subjects: Computation and Language (cs.CL)
[91] arXiv:2405.19094 [pdf, other]: Title: Faithful Chart Summarization with ChaTS-Pi

Authors: Syrine Krichene, Francesco Piccinno, Fangyu Liu, Julian Martin Eisenschlos

Comments: To be published in the proceedings of the 2024 Annual Meeting of the Association for Computational Linguistics

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[92] arXiv:2405.19093 [pdf, other]: Title: Multi-stage Retrieve and Re-rank Model for Automatic Medical Coding Recommendation

Authors: Xindi Wang, Robert E. Mercer, Frank Rudzicz

Comments: Accepted to NAACL 2024 -- camera-ready version

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[93] arXiv:2405.19088 [pdf, other]: Title: Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions

Authors: Zhe Hu, Tuo Liang, Jing Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, Jing Ma, Yu Yin

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[94] arXiv:2405.19086 [pdf, other]: Title: MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors

Authors: Renzhi Wang, Piji Li

Subjects: Computation and Language (cs.CL)
[95] arXiv:2405.19084 [pdf, other]: Title: Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification

Authors: Xindi Wang, Robert E. Mercer, Frank Rudzicz

Comments: Accepted to LREC-COLING 2024 -- camera-ready version

Subjects: Computation and Language (cs.CL)
[96] arXiv:2405.19041 [pdf, other]: Title: BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation

Authors: Chen Wang, Minpeng Liao, Zhongqiang Huang, Jiajun Zhang

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[97] arXiv:2405.19010 [pdf, other]: Title: Evaluating the External and Parametric Knowledge Fusion of Large Language Models

Authors: Hao Zhang, Yuyang Zhang, Xiaoguang Li, Wenxuan Shi, Haonan Xu, Huanshuo Liu, Yasheng Wang, Lifeng Shang, Qun Liu, Yong Liu, Ruiming Tang

Comments: 15 pages, 3 figures, 3 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[98] arXiv:2405.18974 [pdf, other]: Title: Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection

Authors: Songtao Liu, Bang Wang, Wei Xiang, Han Xu, Minghua Xu

Comments: 13pages, 4 figures (Accepted to Findings of ACL 2024)

Subjects: Computation and Language (cs.CL)
[99] arXiv:2405.18952 [pdf, other]: Title: Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Authors: Peter Devine

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[100] arXiv:2405.18922 [pdf, other]: Title: Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective

Authors: Chenze Shao, Fandong Meng, Jiali Zeng, Jie Zhou

Comments: ACL 2024 main conference

Subjects: Computation and Language (cs.CL)
[101] arXiv:2405.18915 [pdf, other]: Title: Towards Faithful Chain-of-Thought: Large Language Models are Bridging Reasoners

Authors: Jiachun Li, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

Comments: 25 pages, under review

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[102] arXiv:2405.18906 [pdf, other]: Title: Language Generation with Strictly Proper Scoring Rules

Authors: Chenze Shao, Fandong Meng, Yijin Liu, Jie Zhou

Comments: ICML 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[103] arXiv:2405.18845 [pdf, other]: Title: Simulation, Modelling and Classification of Wiki Contributors: Spotting The Good, The Bad, and The Ugly

Authors: Silvia García Méndez, Fátima Leal, Benedita Malheiro, Juan Carlos Burguillo Rial, Bruno Veloso, Adriana E. Chis, Horacio González Vélez

Journal-ref: Simulation Modelling Practice and Theory, 120, 102616 (2022)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104] arXiv:2405.18822 [pdf, other]: Title: Toxicity Detection for Free

Authors: Zhanhao Hu, Julien Piet, Geng Zhao, Jiantao Jiao, David Wagner

Subjects: Computation and Language (cs.CL)
[105] arXiv:2405.18741 [pdf, other]: Title: Genshin: General Shield for Natural Language Processing with Large Language Models

Authors: Xiao Peng, Tao Liu, Ying Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106] arXiv:2405.18740 [pdf, other]: Title: Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs

Authors: Jialiang Xu, Michael Moor, Jure Leskovec

Subjects: Computation and Language (cs.CL)
[107] arXiv:2405.18727 [pdf, other]: Title: CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control

Authors: Huanshuo Liu, Hao Zhang, Zhijiang Guo, Kuicai Dong, Xiangyang Li, Yi Quan Lee, Cong Zhang, Yong Liu

Comments: 28 pages, 7 figures, 9 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[108] arXiv:2405.18719 [pdf, other]: Title: Contextual Position Encoding: Learning to Count What's Important

Authors: Olga Golovneva, Tianlu Wang, Jason Weston, Sainbayar Sukhbaatar

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109] arXiv:2405.18718 [pdf, other]: Title: Efficient Model-agnostic Alignment via Bayesian Persuasion

Authors: Fengshuo Bai, Mingzhi Wang, Zhaowei Zhang, Boyuan Chen, Yinda Xu, Ying Wen, Yaodong Yang

Subjects: Computation and Language (cs.CL)
[110] arXiv:2405.18682 [pdf, other]: Title: Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension

Authors: Shubham Vatsal, Ayush Singh

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[111] arXiv:2405.18662 [pdf, other]: Title: Understanding Intrinsic Socioeconomic Biases in Large Language Models

Authors: Mina Arzaghi, Florian Carichon, Golnoosh Farnadi

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[112] arXiv:2405.18653 [pdf, other]: Title: Recent Advances of Foundation Language Models-based Continual Learning: A Survey

Authors: Yutao Yang, Jie Zhou, Xuanwen Ding, Tianyu Huai, Shunyu Liu, Qin Chen, Liang He, Yuan Xie

Subjects: Computation and Language (cs.CL)
[113] arXiv:2405.18649 [pdf, other]: Title: Training LLMs to Better Self-Debug and Explain Code

Authors: Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[114] arXiv:2405.18638 [pdf, other]: Title: ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models

Authors: Aparna Elangovan, Ling Liu, Lei Xu, Sravan Bodapati, Dan Roth

Comments: Accepted in ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[115] arXiv:2405.18613 [pdf, ps, other]: Title: GLOCON Database: Design Decisions and User Manual (v1.0)

Authors: Ali Hürriyetoğlu, Osman Mutlu, Fırat Duruşan, Erdem Yörük

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Databases (cs.DB); Machine Learning (cs.LG)
[116] arXiv:2405.18605 [pdf, ps, other]: Title: BioBERT-based Deep Learning and Merged ChemProt-DrugProt for Enhanced Biomedical Relation Extraction

Authors: Bridget T. McInnes, Jiawei Tang, Darshini Mahendran, Mai H. Nguyen

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Molecular Networks (q-bio.MN)
[117] arXiv:2405.18540 [pdf, other]: Title: Learning diverse attacks on large language models for robust red-teaming and safety tuning

Authors: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh Jain

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[118] arXiv:2405.18492 [pdf, other]: Title: LLMs and Memorization: On Quality and Specificity of Copyright Compliance

Authors: Felix B Mueller, Rebekka Görge, Anna K Bernzen, Janna C Pirk, Maximilian Poretschkin

Comments: 10 pages, 3 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119] arXiv:2405.18448 [pdf, other]: Title: Multi-objective Representation for Numbers in Clinical Narratives Using CamemBERT-bio

Authors: Boammani Aser Lompo, Thanh-Dung Le

Comments: Under the revision. arXiv admin note: substantial text overlap with arXiv:2404.10171

Subjects: Computation and Language (cs.CL); Signal Processing (eess.SP)
[120] arXiv:2405.19335 (cross-list from cs.CV) [pdf, other]: Title: X-VILA: Cross-Modality Alignment for Large Language Model

Authors: Hanrong Ye, De-An Huang, Yao Lu, Zhiding Yu, Wei Ping, Andrew Tao, Jan Kautz, Song Han, Dan Xu, Pavlo Molchanov, Hongxu Yin

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[121] arXiv:2405.19334 (cross-list from cs.AI) [pdf, other]: Title: LLMs Meet Multimodal Generation and Editing: A Survey

Authors: Yingqing He, Zhaoyang Liu, Jingye Chen, Zeyue Tian, Hongyu Liu, Xiaowei Chi, Runtao Liu, Ruibin Yuan, Yazhou Xing, Wenhai Wang, Jifeng Dai, Yong Zhang, Wei Xue, Qifeng Liu, Yike Guo, Qifeng Chen

Comments: 51 Pages with 16 Figures, 12 Tables, and 534 References. GitHub Repository at: this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2405.19316 (cross-list from cs.LG) [pdf, other]: Title: Robust Preference Optimization through Reward Model Distillation

Authors: Adam Fisch, Jacob Eisenstein, Vicky Zayats, Alekh Agarwal, Ahmad Beirami, Chirag Nagpal, Pete Shaw, Jonathan Berant

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[123] arXiv:2405.19315 (cross-list from cs.CV) [pdf, other]: Title: Matryoshka Query Transformer for Large Vision-Language Models

Authors: Wenbo Hu, Zi-Yi Dou, Liunian Harold Li, Amita Kamath, Nanyun Peng, Kai-Wei Chang

Comments: Preprint. Our code and model are publicly available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[124] arXiv:2405.19313 (cross-list from cs.AI) [pdf, other]: Title: Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice

Authors: Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); General Economics (econ.GN)
[125] arXiv:2405.19209 (cross-list from cs.CV) [pdf, other]: Title: VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

Authors: Ziyang Wang, Shoubin Yu, Elias Stengel-Eskin, Jaehong Yoon, Feng Cheng, Gedas Bertasius, Mohit Bansal

Comments: 20 pages, first three authors contributed equally; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[126] arXiv:2405.19186 (cross-list from cs.CV) [pdf, other]: Title: MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification

Authors: Laura Fieback (1,2), Jakob Spiegelberg (1), Hanno Gottschalk (2) ((1) Volkswagen AG, (2) TU Berlin)

Comments: 18 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[127] arXiv:2405.19076 (cross-list from cs.CV) [pdf, other]: Title: Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design

Authors: Markus J. Buehler

Subjects: Computer Vision and Pattern Recognition (cs.CV); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci); Computation and Language (cs.CL); Machine Learning (cs.LG)
[128] arXiv:2405.19026 (cross-list from cs.LG) [pdf, other]: Title: DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

Authors: Andrew Zhao, Quentin Xu, Matthieu Lin, Shenzhi Wang, Yong-jin Liu, Zilong Zheng, Gao Huang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[129] arXiv:2405.18991 (cross-list from cs.CV) [pdf, other]: Title: EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

Authors: Jiaqi Xu, Xinyi Zou, Kunzhe Huang, Yunkuo Chen, Bo Liu, MengLi Cheng, Xing Shi, Jun Huang

Comments: 6 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[130] arXiv:2405.18937 (cross-list from cs.CV) [pdf, other]: Title: Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding

Authors: Junjie Fei, Mahmoud Ahmed, Jian Ding, Eslam Mohamed Bakr, Mohamed Elhoseiny

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[131] arXiv:2405.18874 (cross-list from cond-mat.dis-nn) [pdf, other]: Title: Are queries and keys always relevant? A case study on Transformer wave functions

Authors: Riccardo Rende, Luciano Loris Viteritti

Comments: 9 pages, 4 figures

Subjects: Disordered Systems and Neural Networks (cond-mat.dis-nn); Computation and Language (cs.CL); Computational Physics (physics.comp-ph)
[132] arXiv:2405.18870 (cross-list from cs.AI) [pdf, other]: Title: LLMs achieve adult human performance on higher-order theory of mind tasks

Authors: Winnie Street, John Oliver Siy, Geoff Keeling, Adrien Baranes, Benjamin Barnett, Michael McKibben, Tatenda Kanyere, Alison Lentz, Blaise Aguera y Arcas, Robin I. M. Dunbar

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[133] arXiv:2405.18776 (cross-list from cs.CR) [pdf, other]: Title: LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models

Authors: Qin Yang, Meisam Mohammad, Han Wang, Ali Payani, Ashish Kundu, Kai Shu, Yan Yan, Yuan Hong

Comments: 18 pages, 15 figures

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[134] arXiv:2405.18742 (cross-list from cs.AI) [pdf, other]: Title: Musical Phrase Segmentation via Grammatical Induction

Authors: Reed Perkins, Dan Ventura

Comments: Extended version of a paper appearing in the proceedings of IJCAI 2024 that includes additional material in an appendix. Please cite the IJCAI version

Journal-ref: Proceedings of the International Joint Conference on Artificial Intelligence, 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[135] arXiv:2405.18721 (cross-list from cs.CV) [pdf, other]: Title: Correctable Landmark Discovery via Large Models for Vision-Language Navigation

Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang

Comments: Accepted by TPAMI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[136] arXiv:2405.18711 (cross-list from cs.AI) [pdf, other]: Title: Calibrating Reasoning in Language Models with Internal Consistency

Authors: Zhihui Xie, Jizhou Guo, Tong Yu, Shuai Li

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[137] arXiv:2405.18688 (cross-list from cs.LG) [pdf, other]: Title: Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation

Authors: Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia Cui, Ying Wen, Yaodong Yang, Bo Xu, Lei Han

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[138] arXiv:2405.18672 (cross-list from cs.CV) [pdf, other]: Title: LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification

Authors: Renyi Qu, Mark Yatskar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[139] arXiv:2405.18669 (cross-list from cs.LG) [pdf, other]: Title: Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

Authors: Vicky Zayats, Peter Chen, Melissa Merrari, Dirk Padfield

Comments: Under review at NeurIPS

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[140] arXiv:2405.18642 (cross-list from cs.AI) [pdf, other]: Title: JADS: A Framework for Self-supervised Joint Aspect Discovery and Summarization

Authors: Xiaobo Guo, Jay Desai, Srinivasan H. Sengamedu

Comments: preprint

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[141] arXiv:2405.18639 (cross-list from q-bio.NC) [pdf, other]: Title: Improving Speech Decoding from ECoG with Self-Supervised Pretraining

Authors: Brian A. Yuan, Joseph G. Makin

Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142] arXiv:2405.18634 (cross-list from cs.LG) [pdf, other]: Title: A Theoretical Understanding of Self-Correction through In-context Alignment

Authors: Yifei Wang, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[143] arXiv:2405.18628 (cross-list from cs.LG) [pdf, other]: Title: Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference

Authors: Hao (Mark) Chen, Wayne Luk, Ka Fai Cedric Yiu, Rui Li, Konstantin Mishchenko, Stylianos I. Venieris, Hongxiang Fan

Comments: The code for this implementation is available at this https URL

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[144] arXiv:2405.18620 (cross-list from cs.HC) [pdf, other]: Title: RealitySummary: On-Demand Mixed Reality Document Enhancement using Large Language Models

Authors: Aditya Gunturu, Shivesh Jadon, Nandi Zhang, Jarin Thundathil, Wesley Willett, Ryo Suzuki

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[145] arXiv:2405.18572 (cross-list from cs.LG) [pdf, other]: Title: Low-rank finetuning for LLMs: A fairness perspective

Authors: Saswat Das, Marco Romanelli, Cuong Tran, Zarreen Reza, Bhavya Kailkhura, Ferdinando Fioretto

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[146] arXiv:2405.18570 (cross-list from cs.CV) [pdf, other]: Title: Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap

Authors: Abrar Fahim, Alex Murphy, Alona Fyshe

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[147] arXiv:2405.18542 (cross-list from cs.AI) [pdf, other]: Title: Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities

Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Francisco J. González-Castaño, Enrique Costa-Montenegro

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[148] arXiv:2405.17653 (cross-list from cs.LG) [pdf, other]: Title: InversionView: A General-Purpose Method for Reading Information from Neural Activations

Authors: Xinting Huang, Madhur Panwar, Navin Goyal, Michael Hahn

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 29 May 2024 (showing first 8 of 81 entries)

[149] arXiv:2405.18433 [pdf, other]: Title: Notes on Applicability of GPT-4 to Document Understanding

Authors: Łukasz Borchmann

Subjects: Computation and Language (cs.CL)
[150] arXiv:2405.18414 [pdf, other]: Title: Don't Forget to Connect! Improving RAG with Graph-based Reranking

Authors: Jialin Dong, Bahare Fatemi, Bryan Perozzi, Lin F. Yang, Anton Tsitsulin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[151] arXiv:2405.18400 [pdf, other]: Title: Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass

Authors: Ethan Shen, Alan Fan, Sarah M Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati

Comments: 22 pages, 15 figures

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[152] arXiv:2405.18375 [pdf, other]: Title: Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning

Authors: Phakphum Artkaew

Subjects: Computation and Language (cs.CL)
[153] arXiv:2405.18369 [pdf, other]: Title: PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework

Authors: Eshaan Agarwal, Vivek Dani, Tanuja Ganu, Akshay Nambi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[154] arXiv:2405.18359 [pdf, other]: Title: Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs

Authors: Somnath Kumar, Vaibhav Balloli, Mercy Ranjit, Kabir Ahuja, Tanuja Ganu, Sunayana Sitaram, Kalika Bali, Akshay Nambi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[155] arXiv:2405.18358 [pdf, other]: Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning

Authors: Somnath Kumar, Yash Gadhia, Tanuja Ganu, Akshay Nambi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[156] arXiv:2405.18357 [pdf, other]: Title: Faithful Logical Reasoning via Symbolic Chain-of-Thought

Authors: Jundong Xu, Hao Fei, Liangming Pan, Qian Liu, Mong-Li Lee, Wynne Hsu

Comments: Accepted by ACL 2024 (main proceeding)

Subjects: Computation and Language (cs.CL)

[ total of 427 entries: 1-156 | 157-312 | 313-427 ]
[ showing 156 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)

> cs > cs.CL

Computation and Language

Authors and titles for recent submissions

Fri, 31 May 2024

Thu, 30 May 2024

Wed, 29 May 2024 (showing first 8 of 81 entries)