Computation and Language

Authors and titles for recent submissions, skipping first 88

[ total of 432 entries: 1-50 | 39-88 | 89-138 | 139-188 | 189-238 | 239-288 | ... | 389-432 ]
[ showing 50 entries per page: fewer | more | all ]

Fri, 31 May 2024 (continued, showing 50 of 76 entries)

[89] arXiv:2405.20245 [pdf, other]: Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use

Authors: Franz Louis Cesista, Rui Aguiar, Jason Kim, Paolo Acilo

Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[90] arXiv:2405.20215 [pdf, other]: Title: TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models

Authors: Chen Zhang, Chengguang Tang, Dading Chong, Ke Shi, Guohua Tang, Feng Jiang, Haizhou Li

Subjects: Computation and Language (cs.CL)
[91] arXiv:2405.20204 [pdf, other]: Title: Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Authors: Andreas Koukounas, Georgios Mastrapas, Michael Günther, Bo Wang, Scott Martens, Isabelle Mohr, Saba Sturua, Mohammad Kalim Akram, Joan Fontanals Martínez, Saahil Ognawala, Susana Guzman, Maximilian Werk, Nan Wang, Han Xiao

Comments: 4 pages, ICML2024 workshop submission

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[92] arXiv:2405.20192 [pdf, other]: Title: TAIA: Large Language Models are Out-of-Distribution Data Learners

Authors: Shuyang Jiang, Yusheng Liao, Ya Zhang, Yu Wang, Yanfeng Wang

Comments: 25 pages

Subjects: Computation and Language (cs.CL)
[93] arXiv:2405.20179 [pdf, other]: Title: Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs

Authors: Zichao Hu, Junyi Jessy Li, Arjun Guha, Joydeep Biswas

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[94] arXiv:2405.20175 [pdf, other]: Title: InstructionCP: A fast approach to transfer Large Language Models into target language

Authors: Kuang-Ming Chen, Hung-yi Lee

Comments: 10 pages, 1 figure

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[95] arXiv:2405.20163 [pdf, other]: Title: Reasoning about concepts with LLMs: Inconsistencies abound

Authors: Rosario Uceda-Sosa, Karthikeyan Natesan Ramamurthy, Maria Chang, Moninder Singh

Comments: 15 pages, 5 figures, 3 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96] arXiv:2405.20145 [pdf, other]: Title: Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers

Authors: Frederick Riemenschneider, Kevin Krahn

Comments: Accepted for publication at the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP-WS) 2024; 11 pages, 1 figure, 9 tables

Subjects: Computation and Language (cs.CL)
[97] arXiv:2405.20139 [pdf, other]: Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning

Authors: Costas Mavromatis, George Karypis

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[98] arXiv:2405.20131 [pdf, other]: Title: Language Models Need Inductive Biases to Count Inductively

Authors: Yingshan Chang, Yonatan Bisk

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[99] arXiv:2405.20092 [pdf, other]: Title: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation

Authors: Jingchang Chen, Hongxuan Tang, Zheng Chu, Qianglong Chen, Zekun Wang, Ming Liu, Bing Qin

Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[100] arXiv:2405.20089 [pdf, other]: Title: The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities

Authors: David Stap, Eva Hasler, Bill Byrne, Christof Monz, Ke Tran

Comments: Accepted to ACL 2024 (long, main)

Subjects: Computation and Language (cs.CL)
[101] arXiv:2405.20079 [pdf, other]: Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

Authors: Elena Grazia Gado, Tommaso Martorella, Luca Zunino, Paola Mejia-Domenzain, Vinitra Swamy, Jibril Frej, Tanja Käser

Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[102] arXiv:2405.20053 [pdf, other]: Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads

Authors: Avelina Asada Hadji-Kyriacou, Ognjen Arandjelovic

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103] arXiv:2405.19967 [pdf, other]: Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification

Authors: Hossam M. Zawbaa, Wael Rashwan, Sourav Dutta, Haytham Assem

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104] arXiv:2405.19958 [pdf, other]: Title: Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation

Authors: Yi Liu, Xiangyu Liu, Xiangrong Zhu, Wei Hu

Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[105] arXiv:2405.19874 [pdf, other]: Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?

Authors: Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

Comments: Preprint. Code at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[106] arXiv:2405.19856 [pdf, other]: Title: DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Authors: Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yuqi Zhu, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401

Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[107] arXiv:2405.19846 [pdf, other]: Title: Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model

Authors: Chaochen Gao, Xing Wu, Qi Fu, Songlin Hu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[108] arXiv:2405.19842 [pdf, other]: Title: Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation

Authors: Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109] arXiv:2405.19831 [pdf, other]: Title: Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text

Authors: Stephen Meisenbacher, Florian Matthes

Comments: 10 pages, 2 figures, 2 tables. Accepted to ARES 2024 (IWAPS)

Subjects: Computation and Language (cs.CL)
[110] arXiv:2405.19799 [pdf, other]: Title: Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation

Authors: Jiahui Xu, Feng Jiang, Anningzhe Gao, Haizhou Li

Subjects: Computation and Language (cs.CL)
[111] arXiv:2405.19795 [pdf, other]: Title: SLM as Guardian: Pioneering AI Safety with Small Language Models

Authors: Ohjoon Kwon, Donghyeon Jeon, Nayoung Choi, Gyu-Hwung Cho, Changbong Kim, Hyunwoo Lee, Inho Kang, Sun Kim, Taiwoo Park

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[112] arXiv:2405.19793 [pdf, other]: Title: PDDLEGO: Iterative Planning in Textual Environments

Authors: Li Zhang, Peter Jansen, Tianyi Zhang, Peter Clark, Chris Callison-Burch, Niket Tandon

Comments: In *SEM 2024

Subjects: Computation and Language (cs.CL)
[113] arXiv:2405.19787 [pdf, other]: Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers

Authors: Dylan Zhang, Justin Wang, Francois Charton

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[114] arXiv:2405.19778 [pdf, other]: Title: Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona

Authors: Jeiyoon Park, Chanjun Park, Heuiseok Lim

Comments: preprint

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[115] arXiv:2405.19763 [pdf, other]: Title: Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding

Authors: Kuo Liao, Shuang Li, Meng Zhao, Liqun Liu, Mengge Xue, Zhenyu Hu, Honglin Han, Chengguo Yin

Comments: Accept at ACL2024 Main

Subjects: Computation and Language (cs.CL)
[116] arXiv:2405.19744 [pdf, other]: Title: X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions

Authors: Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong

Comments: ACL 2024. Our codes, data and model weights are available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117] arXiv:2405.19740 [pdf, other]: Title: PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations

Authors: Jiatong Li, Renjun Hu, Kunzhe Huang, Yan Zhuang, Qi Liu, Mengxiao Zhu, Xing Shi, Wei Lin

Comments: 23 pages, 12 figures, 10 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[118] arXiv:2405.19737 [pdf, other]: Title: Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation

Authors: Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119] arXiv:2405.19715 [pdf, other]: Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths

Authors: Kaixuan Huang, Xudong Guo, Mengdi Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[120] arXiv:2405.19701 [pdf, other]: Title: Significance of Chain of Thought in Gender Bias Mitigation for English-Dravidian Machine Translation

Authors: Lavanya Prahallad, Radhika Mamidi

Comments: 6 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121] arXiv:2405.19670 [pdf, other]: Title: One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models

Authors: Yutao Zhu, Zhaoheng Huang, Zhicheng Dou, Ji-Rong Wen

Comments: working in progress, repo: this https URL

Subjects: Computation and Language (cs.CL)
[122] arXiv:2405.19660 [pdf, other]: Title: PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

Authors: Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Shaun M. Eack, Travis Labrum, Samuel M. Murphy, Nev Jones, Kate Hardy, Hong Shen, Fei Fang, Zhiyu Zoey Chen

Comments: Work in progress

Subjects: Computation and Language (cs.CL)
[123] arXiv:2405.19648 [pdf, other]: Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Authors: Ernesto Quevedo, Jorge Yero, Rachel Koerner, Pablo Rivas, Tomas Cerny

Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[124] arXiv:2405.19635 [pdf, other]: Title: GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment

Authors: Yao Yao, Zuchao Li, Hai Zhao

Subjects: Computation and Language (cs.CL)
[125] arXiv:2405.19575 [pdf, other]: Title: A Deep Convolutional Neural Network-based Model for Aspect and Polarity Classification in Hausa Movie Reviews

Authors: Umar Ibrahim, Abubakar Yakubu Zandam, Fatima Muhammad Adam, Aminu Musa

Comments: To be published in the proceedings of ICCAIT 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[126] arXiv:2405.19563 [pdf, other]: Title: Unlearning Climate Misinformation in Large Language Models

Authors: Michael Fore, Simranjit Singh, Chaehong Lee, Amritanshu Pandey, Antonios Anastasopoulos, Dimitrios Stamoulis

Subjects: Computation and Language (cs.CL)
[127] arXiv:2405.19538 [pdf, other]: Title: CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients

Authors: Pierre Chambon, Jean-Benoit Delbrouck, Thomas Sounack, Shih-Cheng Huang, Zhihong Chen, Maya Varma, Steven QH Truong, Chu The Chuong, Curtis P. Langlotz

Comments: 13 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[128] arXiv:2405.19519 [pdf, other]: Title: Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data

Authors: Sudeshna Das, Yao Ge, Yuting Guo, Swati Rajwal, JaMor Hairston, Jeanne Powell, Drew Walker, Snigdha Peddireddy, Sahithi Lakamana, Selen Bozkurt, Matthew Reyna, Reza Sameni, Yunyu Xiao, Sangmi Kim, Rasheeta Chandler, Natalie Hernandez, Danielle Mowery, Rachel Wightman, Jennifer Love, Anthony Spadaro, Jeanmarie Perrone, Abeed Sarker

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129] arXiv:2405.19487 [pdf, other]: Title: A Full-duplex Speech Dialogue Scheme Based On Large Language Models

Authors: Peng Wang, Songshuo Lu, Yaohua Tang, Sijie Yan, Yuanjun Xiong, Wei Xia

Subjects: Computation and Language (cs.CL)
[130] arXiv:2405.19462 [pdf, other]: Title: Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning

Authors: Everlyn Asiko Chimoto, Jay Gala, Orevaoghene Ahia, Julia Kreutzer, Bruce A. Bassett, Sara Hooker

Comments: Accepted to ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[131] arXiv:2405.19433 [pdf, other]: Title: Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals

Authors: Yupei Wang, Renfen Hu, Zhe Zhao

Subjects: Computation and Language (cs.CL)
[132] arXiv:2405.19426 [pdf, other]: Title: Deep Learning for Assessment of Oral Reading Fluency

Authors: Mithilesh Vaidya, Binaya Kumar Sahoo, Preeti Rao

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[133] arXiv:2405.19425 [pdf, other]: Title: Adaptive In-conversation Team Building for Language Model Agents

Authors: Linxin Song, Jiale Liu, Jieyu Zhang, Shaokun Zhang, Ao Luo, Shijian Wang, Qingyun Wu, Chi Wang

Subjects: Computation and Language (cs.CL)
[134] arXiv:2405.20341 (cross-list from cs.LG) [pdf, other]: Title: From Zero to Hero: Cold-Start Anomaly Detection

Authors: Tal Reiss, George Kour, Naama Zwerdling, Ateret Anaby-Tavor, Yedid Hoshen

Comments: ACL 2024. Our code is available at this https URL

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[135] arXiv:2405.20309 (cross-list from cs.LG) [pdf, other]: Title: Large Language Models Can Self-Improve At Web Agent Tasks

Authors: Ajay Patel, Markus Hofmarcher, Claudiu Leoveanu-Condrei, Marius-Constantin Dinu, Chris Callison-Burch, Sepp Hochreiter

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[136] arXiv:2405.20271 (cross-list from cs.LG) [pdf, other]: Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

Authors: Massimo Bini, Karsten Roth, Zeynep Akata, Anna Khoreva

Comments: Accepted to ICML 2024. Code available at this https URL

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]: Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization

Authors: Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas, Varre Chaitanya, Shwetha Somasundaram

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[138] arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]: Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition

Authors: Alaa Nfissi, Wassim Bouachir, Nizar Bouguila, Brian Mishara

Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)

Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

[ total of 432 entries: 1-50 | 39-88 | 89-138 | 139-188 | 189-238 | 239-288 | ... | 389-432 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help (Access key information)

> cs > cs.CL

Computation and Language

Authors and titles for recent submissions, skipping first 88

Fri, 31 May 2024 (continued, showing 50 of 76 entries)