Computation and Language

Authors and titles for recent submissions, skipping first 102

[ total of 346 entries: 1-25 | ... | 28-52 | 53-77 | 78-102 | 103-127 | 128-152 | 153-177 | 178-202 | ... | 328-346 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 18 Apr 2024 (continued, showing last 14 of 57 entries)

[103] arXiv:2404.10830 [pdf, other]: Title: Fewer Truncations Improve Language Modeling

Authors: Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104] arXiv:2404.11584 (cross-list from cs.AI) [pdf, other]: Title: The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey

Authors: Tula Masterman, Sandi Besen, Mason Sawtell, Alex Chao

Comments: 13 pages,6 figures,38 references

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[105] arXiv:2404.11538 (cross-list from cs.LG) [pdf, other]: Title: GenFighter: A Generative and Evolutive Textual Attack Removal

Authors: Md Athikul Islam, Edoardo Serra, Sushil Jajodia

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[106] arXiv:2404.11457 (cross-list from cs.IR) [pdf, other]: Title: Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models

Authors: Sunhao Dai, Chen Xu, Shicheng Xu, Liang Pang, Zhenhua Dong, Jun Xu

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[107] arXiv:2404.11447 (cross-list from cs.AI) [pdf, ps, other]: Title: Research on emotionally intelligent dialogue generation based on automatic dialogue system

Authors: Jin Wang, JinFei Wang, Shuying Dai, Jiqiang Yu, Keqin Li

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[108] arXiv:2404.11205 (cross-list from cs.CV) [pdf, other]: Title: Kathakali Hand Gesture Recognition With Minimal Data

Authors: Kavitha Raju, Nandini J. Warrier

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109] arXiv:2404.11049 (cross-list from cs.LG) [pdf, other]: Title: Stepwise Alignment for Constrained Language Model Policy Optimization

Authors: Akifumi Wachi, Thien Q Tran, Rei Sato, Takumi Tanabe, Yohei Akimoto

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[110] arXiv:2404.11036 (cross-list from cs.LG) [pdf, other]: Title: Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement

Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[111] arXiv:2404.11023 (cross-list from cs.HC) [pdf, other]: Title: Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions

Authors: Leena Mathur, Paul Pu Liang, Louis-Philippe Morency

Comments: Position Paper, Under Review, 19 pages, 2 figures

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[112] arXiv:2404.11018 (cross-list from cs.LG) [pdf, other]: Title: Many-Shot In-Context Learning

Authors: Rishabh Agarwal, Avi Singh, Lei M. Zhang, Bernd Bohnet, Stephanie Chan, Ankesh Anand, Zaheer Abbas, Azade Nova, John D. Co-Reyes, Eric Chu, Feryal Behbahani, Aleksandra Faust, Hugo Larochelle

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113] arXiv:2404.10981 (cross-list from cs.IR) [pdf, other]: Title: A Survey on Retrieval-Augmented Text Generation for Large Language Models

Authors: Yizheng Huang, Jimmy Huang

Comments: Ongoing work

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[114] arXiv:2404.10934 (cross-list from cs.LG) [pdf, other]: Title: Shears: Unstructured Sparsity with Neural Low-rank Adapter Search

Authors: J. Pablo Muñoz, Jinjie Yuan, Nilesh Jain

Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Industry Track)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[115] arXiv:2404.10933 (cross-list from cs.AI) [pdf, other]: Title: LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs

Authors: Taeho Kim, Yanming Wang, Vatshank Chaturvedi, Lokesh Gupta, Seyeon Kim, Yongin Kwon, Sangtae Ha

Comments: 9 pages, 9 figures, accepted to IJCAI 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[116] arXiv:2404.10838 (cross-list from cs.CV) [pdf, other]: Title: Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning

Authors: Zhengyang Liang, Meiyu Liang, Wei Huang, Yawen Li, Zhe Xue

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Wed, 17 Apr 2024 (showing first 11 of 47 entries)

[117] arXiv:2404.10774 [pdf, other]: Title: MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Authors: Liyan Tang, Philippe Laban, Greg Durrett

Comments: LLM-AggreFact benchmark, MiniCheck models, data generation code at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118] arXiv:2404.10719 [pdf, other]: Title: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Authors: Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu

Comments: 16 pages, 2 figures, 14 tables

Subjects: Computation and Language (cs.CL)
[119] arXiv:2404.10710 [pdf, other]: Title: Dual Modalities of Text: Visual and Textual Generative Pre-training

Authors: Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[120] arXiv:2404.10704 [pdf, other]: Title: Question Difficulty Ranking for Multiple-Choice Reading Comprehension

Authors: Vatsal Raina, Mark Gales

Comments: 7 pages, 3 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121] arXiv:2404.10696 [pdf, other]: Title: Integrating knowledge bases to improve coreference and bridging resolution for the chemical domain

Authors: Pengcheng Lu, Massimo Poesio

Comments: working in progress

Subjects: Computation and Language (cs.CL)
[122] arXiv:2404.10652 [pdf, other]: Title: ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images

Authors: Quan Van Nguyen, Dan Quang Tran, Huy Quang Pham, Thang Kien-Bao Nguyen, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Comments: Preprint submitted to IJCV

Subjects: Computation and Language (cs.CL)
[123] arXiv:2404.10642 [pdf, other]: Title: Self-playing Adversarial Language Game Enhances LLM Reasoning

Authors: Pengyu Cheng, Tianhao Hu, Han Xu, Zhisong Zhang, Yong Dai, Lei Han, Nan Du

Comments: Preprint

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[124] arXiv:2404.10630 [pdf, other]: Title: HLAT: High-quality Large Language Model Pre-trained on AWS Trainium

Authors: Haozheng Fan, Hao Zhou, Guangtai Huang, Parameswaran Raman, Xinwei Fu, Gaurav Gupta, Dhananjay Ram, Yida Wang, Jun Huan

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[125] arXiv:2404.10555 [pdf, other]: Title: Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training

Authors: Masanori Hirano, Kentaro Imajo

Comments: 7 pages

Subjects: Computation and Language (cs.CL); Computational Finance (q-fin.CP)
[126] arXiv:2404.10552 [pdf, other]: Title: Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning

Authors: Xiao Wang, Tianze Chen, Xianjun Yang, Qi Zhang, Xun Zhao, Dahua Lin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[127] arXiv:2404.10513 [pdf, other]: Title: CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity

Authors: Moshe Berchansky, Daniel Fleischer, Moshe Wasserblat, Peter Izsak

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

[ total of 346 entries: 1-25 | ... | 28-52 | 53-77 | 78-102 | 103-127 | 128-152 | 153-177 | 178-202 | ... | 328-346 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help (Access key information)

> cs > cs.CL

Computation and Language

Authors and titles for recent submissions, skipping first 102

Thu, 18 Apr 2024 (continued, showing last 14 of 57 entries)

Wed, 17 Apr 2024 (showing first 11 of 47 entries)