Computation and Language

Authors and titles for recent submissions, skipping first 436

Mon, 15 Apr 2024 (continued, showing last 25 of 46 entries)

[322] arXiv:2404.08156 [pdf, other]: Title: Multimodal Contextual Dialogue Breakdown Detection for Conversational AI Models

Authors: Md Messal Monem Miah, Ulie Schnaithmann, Arushi Raghuvanshi, Youngseo Son

Comments: Published in NAACL 2024 Industry Track

Subjects: Computation and Language (cs.CL)
[323] arXiv:2404.08155 [pdf, other]: Title: Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls

Authors: Amin Hosseiny Marani, Ulie Schnaithmann, Youngseo Son, Akil Iyer, Manas Paldhe, Arushi Raghuvanshi

Comments: Published in NAACL 2024 Industry Track

Subjects: Computation and Language (cs.CL)
[324] arXiv:2404.08148 [pdf, other]: Title: Distilling Algorithmic Reasoning from LLMs via Explaining Solution Programs

Authors: Jierui Li, Raymond Mooney

Comments: pre-print

Subjects: Computation and Language (cs.CL)
[325] arXiv:2404.08118 [pdf, ps, other]: Title: HLTCOE at TREC 2023 NeuCLIR Track

Authors: Eugene Yang, Dawn Lawrie, James Mayfield

Comments: 6 pages. Part of TREC 2023 Proceedings

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[326] arXiv:2404.08092 [pdf, ps, other]: Title: Data-Augmentation-Based Dialectal Adaptation for LLMs

Authors: Fahim Faisal, Antonios Anastasopoulos

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[327] arXiv:2404.08078 [pdf, other]: Title: SQBC: Active Learning using LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions

Authors: Stefan Sylvius Wagner, Maike Behrendt, Marc Ziegele, Stefan Harmeling

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[328] arXiv:2404.08066 [pdf, other]: Title: MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference

Authors: Mobashir Sadat, Cornelia Caragea

Comments: Accepted to the NAACL 2024 Main Conference

Subjects: Computation and Language (cs.CL)
[329] arXiv:2404.08555 (cross-list from cs.LG) [pdf, other]: Title: RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[330] arXiv:2404.08517 (cross-list from cs.SE) [pdf, other]: Title: Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward

Authors: Xuan Xie, Jiayang Song, Zhehua Zhou, Yuheng Huang, Da Song, Lei Ma

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[331] arXiv:2404.08511 (cross-list from cs.AI) [pdf, other]: Title: Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery

Authors: Shiva Aryal, Tuyen Do, Bisesh Heyojoo, Sandeep Chataut, Bichar Dip Shrestha Gurung, Venkataramana Gadhamshetty, Etienne Gnimpieba

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[332] arXiv:2404.08509 (cross-list from cs.DC) [pdf, other]: Title: Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

Authors: Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Başar, Ravishankar K. Iyer

Comments: Accepted at AIOps'24

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[333] arXiv:2404.08495 (cross-list from cs.LG) [pdf, other]: Title: Dataset Reset Policy Optimization for RLHF

Authors: Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Comments: 28 pages, 6 tables, 3 Figures, 3 Algorithms

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[334] arXiv:2404.08480 (cross-list from cs.LG) [pdf, other]: Title: Decoding AI: The inside story of data analysis in ChatGPT

Authors: Ozan Evkaya, Miguel de Carvalho

Comments: 15 pages with figures and appendix

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computation (stat.CO)
[335] arXiv:2404.08417 (cross-list from cs.LG) [pdf, other]: Title: AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees

Authors: William Fleshman, Aleem Khan, Marc Marone, Benjamin Van Durme

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[336] arXiv:2404.08309 (cross-list from cs.CR) [pdf, other]: Title: Subtoxic Questions: Dive Into Attitude Change of LLM's Response in Jailbreak Attempts

Authors: Tianyu Zhang, Zixuan Zhao, Jiaqi Huang, Jingyu Hua, Sheng Zhong

Comments: 4 pages, 2 figures. This paper was submitted to The 7th Deep Learning Security and Privacy Workshop (DLSP 2024) and was accepted as an extended abstract.

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[337] arXiv:2404.08189 (cross-list from cs.LG) [pdf, other]: Title: Reducing hallucination in structured outputs via Retrieval-Augmented Generation

Authors: Patrice Béchard, Orlando Marquez Ayala

Comments: To be presented at NAACL 2024. 11 pages and 4 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[338] arXiv:2404.08164 (cross-list from stat.ML) [pdf, other]: Title: Language Model Prompt Selection via Simulation Optimization

Authors: Haoting Zhang, Jinghai He, Rhonda Righter, Zeyu Zheng

Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[339] arXiv:2404.08134 (cross-list from cs.IR) [pdf, other]: Title: Extending Translate-Train for ColBERT-X to African Language CLIR

Authors: Eugene Yang, Dawn J. Lawrie, Paul McNamee, James Mayfield

Comments: 10 pages, 2 figures. System description paper for HLTCOE's participation in CIRAL@FIRE 2023

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[340] arXiv:2404.08111 (cross-list from cs.CV) [pdf, other]: Title: S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing

Authors: Guangzhi Wang, Tianyi Chen, Kamran Ghasedi, HsiangTao Wu, Tianyu Ding, Chris Nuesmeyer, Ilya Zharkov, Mohan Kankanhalli, Luming Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[341] arXiv:2404.08080 (cross-list from cs.LG) [pdf, other]: Title: Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models

Authors: Tanmay Gautam, Youngsuk Park, Hao Zhou, Parameswaran Raman, Wooseok Ha

Comments: 29 pages, 25 tables, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Optimization and Control (math.OC)
[342] arXiv:2404.08020 (cross-list from cs.AI) [pdf, other]: Title: Augmenting Knowledge Graph Hierarchies Using Neural Transformers

Authors: Sanat Sharma, Mayank Poddar, Jayant Kumar, Kosta Blank, Tracy King

Comments: European Conference on Information Retrieval 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[343] arXiv:2404.08018 (cross-list from cs.SE) [pdf, other]: Title: Analyzing the Performance of Large Language Models on Code Summarization

Authors: Rajarshi Haldar, Julia Hockenmaier

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[344] arXiv:2404.08008 (cross-list from cs.LG) [pdf, other]: Title: Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition

Authors: Kehua Feng, Keyan Ding, Kede Ma, Zhihua Wang, Qiang Zhang, Huajun Chen

Comments: 32 pages, 6 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[345] arXiv:2404.08001 (cross-list from hep-ph) [pdf, other]: Title: Xiwu: A Basis Flexible and Learnable LLM for High Energy Physics

Authors: Zhengde Zhang, Yiyu Zhang, Haodong Yao, Jianwen Luo, Rui Zhao, Bo Huang, Jiameng Zhao, Yipu Liao, Ke Li, Lina Zhao, Jun Cao, Fazhi Qi, Changzheng Yuan

Comments: 15 pages, 8 figures

Subjects: High Energy Physics - Phenomenology (hep-ph); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); Computational Physics (physics.comp-ph)
[346] arXiv:2404.07999 (cross-list from cs.LG) [pdf, other]: Title: A Multi-Level Framework for Accelerating Training Transformer Models

Authors: Longwei Zou, Han Zhang, Yangdong Deng

Comments: ICLR 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
