Computation and Language
Authors and titles for recent submissions
[ total of 350 entries: 1-340 | 341-350 ][ showing 340 entries per page: fewer | more | all ]
Mon, 6 May 2024
- [1] arXiv:2405.02287 [pdf, other]
-
Title: Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language modelsAuthors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi TaySubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [2] arXiv:2405.02228 [pdf, other]
-
Title: REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMsAuthors: Deepa Tilwani, Yash Saxena, Ali Mohammadi, Edward Raff, Amit Sheth, Srinivasan Parthasarathy, Manas GaurComments: Submitted to ACL ARR April 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [3] arXiv:2405.02195 [pdf, ps, other]
-
Title: Impact of emoji exclusion on the performance of Arabic sarcasm detection modelsSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [4] arXiv:2405.02178 [pdf, other]
-
Title: Assessing and Verifying Task Utility in LLM-Powered ApplicationsAuthors: Negar Arabzadeh, Siging Huo, Nikhil Mehta, Qinqyun Wu, Chi Wang, Ahmed Awadallah, Charles L. A. Clarke, Julia KiselevaComments: arXiv admin note: text overlap with arXiv:2402.09015Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [5] arXiv:2405.02175 [pdf, other]
-
Title: Hoaxpedia: A Unified Wikipedia Hoax Articles DatasetComments: Short paperSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [6] arXiv:2405.02165 [pdf, other]
-
Title: EEG2TEXT: Open Vocabulary EEG-to-Text Decoding with EEG Pre-Training and Multi-View TransformerSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [7] arXiv:2405.02144 [pdf, other]
-
Title: MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical DomainSubjects: Computation and Language (cs.CL)
- [8] arXiv:2405.02134 [pdf, other]
-
Title: Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier SelectionSubjects: Computation and Language (cs.CL)
- [9] arXiv:2405.02128 [pdf, ps, other]
-
Title: Single and Multi-Hop Question-Answering Datasets for Reticular Chemistry with GPT-4-TurboAuthors: Nakul Rampal, Kaiyu Wang, Matthew Burigana, Lingxiang Hou, Juri Al-Johani, Anna Sackmann, Hanan S. Murayshid, Walaa Abdullah Al-Sumari, Arwa M. Al-Abdulkarim, Nahla Eid Al-Hazmi, Majed O. Al-Awad, Christian Borgs, Jennifer T. Chayes, Omar M. YaghiSubjects: Computation and Language (cs.CL); Materials Science (cond-mat.mtrl-sci)
- [10] arXiv:2405.02079 [pdf, other]
-
Title: Argumentative Large Language Models for Explainable and Contestable Decision-MakingComments: 19 pages, 17 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [11] arXiv:2405.02040 [pdf, ps, other]
-
Title: Large Multimodal Model based Standardisation of Pathology Reports with Confidence and their Prognostic SignificanceComments: 19 pages, 6 figuresSubjects: Computation and Language (cs.CL)
- [12] arXiv:2405.02024 [pdf, other]
-
Title: Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERTSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [13] arXiv:2405.02010 [pdf, other]
-
Title: The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text ClassificationComments: Accepted to the 4th Workshop on Trustworthy Natural Language Processing (TrustNLP) at NAACL 2024Subjects: Computation and Language (cs.CL)
- [14] arXiv:2405.01997 [pdf, ps, other]
-
Title: Exploring Combinatorial Problem Solving with Large Language Models: A Case Study on the Travelling Salesman Problem Using GPT-3.5 TurboSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [15] arXiv:2405.01976 [pdf, other]
-
Title: Conformal Prediction for Natural Language Processing: A SurveyAuthors: Margarida M. Campos, António Farinhas, Chrysoula Zerva, Mário A.T. Figueiredo, André F.T. MartinsSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [16] arXiv:2405.01972 [pdf, other]
-
Title: A quantitative and typological study of Early Slavic participle clauses and their competitionAuthors: Nilo PedrazziniComments: 259 pages, 138 figures. DPhil Thesis in Linguistics submitted and defended at the University of Oxford (December 2023). This manuscript is a version formatted for improved readability and broader disseminationSubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [17] arXiv:2405.01943 [pdf, other]
-
Title: Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [18] arXiv:2405.01942 [pdf, other]
-
Title: CRCL at SemEval-2024 Task 2: Simple prompt optimizationsJournal-ref: SemEval-2024Subjects: Computation and Language (cs.CL)
- [19] arXiv:2405.01930 [pdf, other]
-
Title: OARelatedWork: A Large-Scale Dataset of Related Work Sections with Full-texts from Open Access SourcesSubjects: Computation and Language (cs.CL)
- [20] arXiv:2405.01924 [pdf, other]
-
Title: Semi-Parametric Retrieval via Binary Token IndexSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [21] arXiv:2405.01886 [pdf, other]
-
Title: Aloe: A Family of Fine-tuned Open Healthcare LLMsAuthors: Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés Dario Garcia-GasullaComments: Five appendixSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [22] arXiv:2405.01884 [pdf, other]
-
Title: Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument ExtractionAuthors: Wanlong Liu, Li Zhou, Dingyi Zeng, Yichen Xiao, Shaohuan Cheng, Chen Zhang, Grandee Lee, Malu Zhang, Wenyu ChenSubjects: Computation and Language (cs.CL)
- [23] arXiv:2405.01883 [pdf, other]
-
Title: DALLMi: Domain Adaption for LLM-based Multi-label ClassifierSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [24] arXiv:2405.01873 [pdf, other]
-
Title: Enhancing Bangla Language Next Word Prediction and Sentence Completion through Extended RNN with Bi-LSTM Model On N-gram LanguageComments: This paper contains 6 pages, 8 figuresSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [25] arXiv:2405.01868 [pdf, other]
-
Title: Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender SystemsComments: Main paper 8 pages; References and Appendix 9 pages; 7 figures and 14 tablesSubjects: Computation and Language (cs.CL)
- [26] arXiv:2405.01858 [pdf, other]
-
Title: SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural IndiaAuthors: Salam Michael Singh, Shubhmoy Kumar Garg, Amitesh Misra, Aaditeshwar Seth, Tanmoy ChakrabortySubjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
- [27] arXiv:2405.01842 [pdf, ps, other]
-
Title: SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of SingaporeSubjects: Computation and Language (cs.CL)
- [28] arXiv:2405.01827 [pdf, other]
-
Title: SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-trainingComments: Accepted by LREC-COLING 2024Subjects: Computation and Language (cs.CL)
- [29] arXiv:2405.01799 [pdf, other]
-
Title: Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct FeaturesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [30] arXiv:2405.01796 [pdf, other]
-
Title: TOPICAL: TOPIC Pages AutomagicaLlyComments: 10 pages, 7 figures, 2 tables, NAACL System Demonstrations 2024Subjects: Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR)
- [31] arXiv:2405.01790 [pdf, other]
-
Title: Understanding Position Bias Effects on Fairness in Social Multi-Document SummarizationComments: Accepted at VarDial 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [32] arXiv:2405.01783 [pdf, ps, other]
-
Title: Layers of technology in pluriversal design. Decolonising language technology with the LiveLanguage initiativeSubjects: Computation and Language (cs.CL)
- [33] arXiv:2405.01769 [pdf, other]
-
Title: A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and LawAuthors: Zhiyu Zoey Chen, Jing Ma, Xinlu Zhang, Nan Hao, An Yan, Armineh Nourbakhsh, Xianjun Yang, Julian McAuley, Linda Petzold, William Yang WangComments: 35 pages, 6 figuresSubjects: Computation and Language (cs.CL)
- [34] arXiv:2405.01768 [pdf, other]
-
Title: CoS: Enhancing Personalization and Mitigating Bias with Context SteeringSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [35] arXiv:2405.01740 [pdf, ps, other]
-
Title: The Psychosocial Impacts of Generative AI HarmsComments: Presented in Impact of GenAI on Social and Individual Well-being at AAAI 2024 Spring Symposium Series (2024)Subjects: Computation and Language (cs.CL)
- [36] arXiv:2405.01738 [pdf, other]
-
Title: Question Suggestion for Conversational Shopping Assistants Using Product MetadataComments: 5 pages, 1 figureSubjects: Computation and Language (cs.CL)
- [37] arXiv:2405.01724 [pdf, other]
-
Title: Large Language Models are Inconsistent and Biased EvaluatorsComments: 9 pages, 7 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [38] arXiv:2405.01686 [pdf, other]
-
Title: Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language ModelsComments: 24 pages, 7 figures, 6 tablesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [39] arXiv:2405.01682 [pdf, other]
-
Title: Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource LanguageAuthors: Liam Hazan, Gili Focht, Naama Gavrielov, Roi Reichart, Talar Hagopian, Mary-Louise C. Greer, Ruth Cytter Kuint, Dan Turner, Moti FreimanSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [40] arXiv:2405.01678 [pdf, other]
-
Title: 1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential PrivacyComments: 12 pages, 7 figures, 7 tables, 10th ACM International Workshop on Security and Privacy Analytics (IWSPA 2024)Subjects: Computation and Language (cs.CL)
- [41] arXiv:2405.01660 [pdf, other]
-
Title: Investigating Wit, Creativity, and Detectability of Large Language Models in Domain-Specific Writing Style Adaptation of Reddit's ShowerthoughtsAuthors: Tolga Buz, Benjamin Frost, Nikola Genchev, Moritz Schneider, Lucie-Aimée Kaffee, Gerard de MeloComments: Accepted to *SEM 2024 (StarSEM) conferenceSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [42] arXiv:2405.01649 [pdf, other]
-
Title: Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum TuningSubjects: Computation and Language (cs.CL)
- [43] arXiv:2405.01610 [pdf, other]
-
Title: Automating the Analysis of Public Saliency and Attitudes towards Biodiversity from Digital MediaAuthors: Noah Giebink, Amrita Gupta, Diogo Verìssimo, Charlotte H. Chang, Tony Chang, Angela Brennan, Brett Dickson, Alex Bowmer, Jonathan BaillieComments: v0.1, 21 pages with 10 figuresSubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [44] arXiv:2405.01601 [pdf, other]
-
Title: Efficient Sample-Specific Encoder PerturbationsComments: To appear in NAACL 2024Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [45] arXiv:2405.01597 [pdf, other]
-
Title: Improving Disease Detection from Social Media Text via Self-Augmentation and Contrastive LearningSubjects: Computation and Language (cs.CL)
- [46] arXiv:2405.01593 [pdf, other]
-
Title: Large Language Model Agent for Fake News DetectionSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [47] arXiv:2405.01592 [pdf, ps, other]
-
Title: Text and Audio Simplification: Human vs. ChatGPTComments: AMIA Summit, Boston, 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [48] arXiv:2405.01591 [pdf, other]
-
Title: Simplifying Multimodality: Unimodal Approach to Multimodal Challenges in Radiology with General-Domain Large Language ModelComments: Under reviewSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
- [49] arXiv:2405.01590 [pdf, other]
-
Title: 101 Billion Arabic Words DatasetSubjects: Computation and Language (cs.CL)
- [50] arXiv:2405.01589 [pdf, ps, other]
-
Title: GPT-4 passes most of the 297 written Polish Board Certification ExaminationsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [51] arXiv:2405.01588 [pdf, other]
-
Title: Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQLComments: DPFM Workshop, ICLR 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [52] arXiv:2405.01587 [pdf, ps, other]
-
Title: Improve Academic Query Resolution through BERT-based Question Extraction from ImagesJournal-ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [53] arXiv:2405.01586 [pdf, other]
-
Title: Transfer Learning and Transformer Architecture for Financial Sentiment AnalysisComments: 12 pages, 9 figuresJournal-ref: Proceedings of International Conference on Computational Intelligence, Data Science and Cloud Computing: IEM-ICDC 2021,pages 17--27Subjects: Computation and Language (cs.CL)
- [54] arXiv:2405.01584 [pdf, other]
-
Title: Lightweight Conceptual Dictionary Learning for Text Classification Using Information CompressionComments: 12 pages, TKDE formatSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Signal Processing (eess.SP)
- [55] arXiv:2405.01583 [pdf, other]
-
Title: MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal LearningAuthors: Nadia SaeedComments: 7 pages, 3 figures, Clinical NLP 2024 workshop proceedings in Shared TaskSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [56] arXiv:2405.01582 [pdf, other]
-
Title: Text Quality-Based Pruning for Efficient Training of Language ModelsAuthors: Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Shang-Wen Li, Armen Aghajanyan, Gargi GhoshSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [57] arXiv:2405.01581 [pdf, other]
-
Title: The Mercurial Top-Level Ontology of Large Language ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [58] arXiv:2405.01577 [pdf, other]
-
Title: HateTinyLLM : Hate Speech Detection Using Tiny Large Language ModelsSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [59] arXiv:2405.01576 [pdf, other]
-
Title: Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI AssistantSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [60] arXiv:2405.02267 (cross-list from cs.LG) [pdf, other]
-
Title: Structural Pruning of Pre-trained Language Models via Neural Architecture SearchSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
- [61] arXiv:2405.02132 (cross-list from cs.SD) [pdf, other]
-
Title: Unveiling the Potential of LLM-Based ASR on Chinese Open-Source DatasetsAuthors: Xuelong Geng, Tianyi Xu, Kun Wei, Bingshen Mu, Hongfei Xue, He Wang, Yangze Li, Pengcheng Guo, Yuhang Dai, Longhao Li, Mingchen Shao, Lei XieSubjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
- [62] arXiv:2405.02124 (cross-list from eess.AS) [pdf, other]
-
Title: TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge TransferSubjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [63] arXiv:2405.02105 (cross-list from cs.AI) [pdf, other]
-
Title: Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge GraphComments: 22 pages, 11 figures. In review at this https URLSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Theory (cs.IT)
- [64] arXiv:2405.01988 (cross-list from cs.SD) [pdf, other]
-
Title: Joint sentiment analysis of lyrics and audio in musicComments: published at DAGA 2024Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [65] arXiv:2405.01744 (cross-list from cs.LG) [pdf, other]
-
Title: ALCM: Autonomous LLM-Augmented Causal Discovery FrameworkSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Methodology (stat.ME)
- [66] arXiv:2405.01585 (cross-list from cs.AI) [pdf, other]
-
Title: Tabular Embedding Model (TEM): Finetuning Embedding Models For Tabular RAG ApplicationsComments: 11 pages, 5 figuresSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [67] arXiv:2405.01575 (cross-list from cs.SE) [pdf, other]
-
Title: Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024Comments: Software mention recognition, Named entity recognition, Transformer, Three-stage frameworkSubjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [68] arXiv:2405.01563 (cross-list from cs.LG) [pdf, other]
-
Title: Mitigating LLM Hallucinations via Conformal AbstentionAuthors: Yasin Abbasi Yadkori, Ilja Kuzborskij, David Stutz, András György, Adam Fisch, Arnaud Doucet, Iuliya Beloshapka, Wei-Hung Weng, Yao-Yuan Yang, Csaba Szepesvári, Ali Taylan Cemgil, Nenad TomasevSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [69] arXiv:2405.01556 (cross-list from cs.SE) [pdf, other]
-
Title: Semantically Aligned Question and Code Generation for Automated Insight GenerationAuthors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust VerbruggenSubjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Fri, 3 May 2024
- [70] arXiv:2405.01535 [pdf, other]
-
Title: Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language ModelsAuthors: Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon SeoComments: Work in ProgressSubjects: Computation and Language (cs.CL)
- [71] arXiv:2405.01525 [pdf, other]
-
Title: FLAME: Factuality-Aware Alignment for Large Language ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [72] arXiv:2405.01511 [pdf, other]
-
Title: D2PO: Discriminator-Guided DPO with Response Evaluation ModelsComments: 20 pages, 12 figuresSubjects: Computation and Language (cs.CL)
- [73] arXiv:2405.01502 [pdf, other]
-
Title: Analyzing the Role of Semantic Representations in the Era of Large Language ModelsAuthors: Zhijing Jin, Yuen Chen, Fernando Gonzalez, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona DiabComments: NAACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [74] arXiv:2405.01490 [pdf, other]
-
Title: Controllable Text Generation in the Instruction-Tuning EraSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [75] arXiv:2405.01481 [pdf, other]
-
Title: NeMo-Aligner: Scalable Toolkit for Efficient Model AlignmentAuthors: Gerald Shen, Zhilin Wang, Olivier Delalleau, Jiaqi Zeng, Yi Dong, Daniel Egert, Shengyang Sun, Jimmy Zhang, Sahil Jain, Ali Taghibakhshi, Markel Sanz Ausin, Ashwath Aithal, Oleksii KuchaievComments: 13 pages, 4 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [76] arXiv:2405.01474 [pdf, other]
-
Title: V-FLUTE: Visual Figurative Language Understanding with Textual ExplanationsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
- [77] arXiv:2405.01470 [pdf, other]
-
Title: WildChat: 1M ChatGPT Interaction Logs in the WildComments: accepted by ICLR 2024Subjects: Computation and Language (cs.CL)
- [78] arXiv:2405.01458 [pdf, other]
-
Title: UQA: Corpus for Urdu Question AnsweringSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [79] arXiv:2405.01403 [pdf, other]
-
Title: Unsupervised Flow Discovery from Task-oriented DialoguesComments: 12 pages, 4 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [80] arXiv:2405.01379 [pdf, other]
-
Title: Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem ProvingSubjects: Computation and Language (cs.CL)
- [81] arXiv:2405.01376 [pdf, other]
-
Title: Topics in the Study of the Pragmatic Functions of Phonetic Reduction in DialogSubjects: Computation and Language (cs.CL)
- [82] arXiv:2405.01359 [pdf, other]
-
Title: GAIA: A General AI Assistant for Intelligent Accelerator OperationsAuthors: Frank MayetSubjects: Computation and Language (cs.CL); Accelerator Physics (physics.acc-ph)
- [83] arXiv:2405.01345 [pdf, other]
-
Title: The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened InsightsSubjects: Computation and Language (cs.CL)
- [84] arXiv:2405.01299 [pdf, other]
-
Title: The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct RepresentationComments: LREC-COLING NLPerspectives workshopSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [85] arXiv:2405.01293 [pdf, ps, other]
-
Title: Low-resource speech recognition and dialect identification of Irish in a multi-task frameworkComments: 7 pages. Accepted to Odyssey 2024 - The Speaker and Language Recognition WorkshopSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [86] arXiv:2405.01280 [pdf, other]
-
Title: Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine TranslationSubjects: Computation and Language (cs.CL)
- [87] arXiv:2405.01249 [pdf, ps, other]
-
Title: Prompt engineering paradigms for medical applications: scoping review and recommendations for better practicesAuthors: Jamil Zaghir, Marco Naguib, Mina Bjelogrlic, Aurélie Névéol, Xavier Tannier, Christian LovisSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [88] arXiv:2405.01216 [pdf, other]
-
Title: DMON: A Simple yet Effective Approach for Argument Structure LearningComments: COLING 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [89] arXiv:2405.01159 [pdf, other]
-
Title: TartuNLP at EvaLatin 2024: Emotion Polarity DetectionComments: Accepted to The Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2024)Subjects: Computation and Language (cs.CL)
- [90] arXiv:2405.01139 [pdf, other]
-
Title: It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised LearningComments: work in progressSubjects: Computation and Language (cs.CL)
- [91] arXiv:2405.01121 [pdf, other]
-
Title: Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting TranscriptsAuthors: Lotem Golany, Filippo Galgani, Maya Mamo, Nimrod Parasol, Omer Vandsburger, Nadav Bar, Ido DaganSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [92] arXiv:2405.01022 [pdf, other]
-
Title: UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset GenerationSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [93] arXiv:2405.00997 [pdf, other]
-
Title: The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal EnrichmentAuthors: Chris Chinenye Emezue, Ifeoma Okoh, Chinedu Mbonu, Chiamaka Chukwuneke, Daisy Lal, Ignatius Ezeani, Paul Rayson, Ijemma Onwuzulike, Chukwuma Okeke, Gerald Nweya, Bright Ogbonna, Chukwuebuka Oraegbunam, Esther Chidinma Awo-Ndubuisi, Akudo Amarachukwu Osuagwu, Obioha NmeziComments: Accepted to the LREC-COLING 2024 conferenceSubjects: Computation and Language (cs.CL)
- [94] arXiv:2405.00988 [pdf, other]
-
Title: Context-Aware Clustering using Large Language ModelsAuthors: Sindhu Tipirneni, Ravinarayana Adkathimar, Nurendra Choudhary, Gaurush Hiranandani, Rana Ali Amjad, Vassilis N. Ioannidis, Changhe Yuan, Chandan K. ReddyComments: 16 pagesSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [95] arXiv:2405.00982 [pdf, other]
-
Title: On the Evaluation of Machine-Generated ReportsAuthors: James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Kayi, Kate Sanders, Marc Mason, Noah HibblerComments: 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paperSubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [96] arXiv:2405.00980 [pdf, other]
-
Title: A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV NewsComments: Accepted by LREC-COLING 2024Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [97] arXiv:2405.00972 [pdf, other]
-
Title: CACTUS: Chemistry Agent Connecting Tool-Usage to ScienceAuthors: Andrew D. McNaughton, Gautham Ramalaxmi, Agustin Kruel, Carter R. Knutson, Rohith A. Varikoti, Neeraj KumarSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph); Quantitative Methods (q-bio.QM)
- [98] arXiv:2405.00970 [pdf, other]
-
Title: How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee ResponsesAuthors: Jionghao Lin, Zifei Han, Danielle R. Thomas, Ashish Gurung, Shivang Gupta, Vincent Aleven, Kenneth R. KoedingerComments: International Journal of Artificial Intelligence in EducationSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [99] arXiv:2405.00966 [pdf, other]
-
Title: Efficient Compression of Multitask Multilingual Speech ModelsAuthors: Thomas Palmeira FerrazComments: Master ThesisSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [100] arXiv:2405.00948 [pdf, other]
-
Title: Modeling Empathetic Alignment in ConversationComments: Camera-ready version for NAACL 2024Subjects: Computation and Language (cs.CL)
- [101] arXiv:2405.00903 [pdf, other]
-
Title: A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social MediaComments: 15 pages; 4 tables; 4 figuresSubjects: Computation and Language (cs.CL)
- [102] arXiv:2405.00888 [pdf, other]
-
Title: DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token SamplingComments: Accepted at NAACL 2024Subjects: Computation and Language (cs.CL)
- [103] arXiv:2405.00864 [pdf, other]
-
Title: Math Multiple Choice Question Generation via Human-Large Language Model CollaborationComments: 17th International Conference on Educational Data Mining (EDM 2024)Subjects: Computation and Language (cs.CL)
- [104] arXiv:2405.00828 [pdf, other]
-
Title: WIBA: What Is Being Argued? A Comprehensive Approach to Argument MiningComments: 8 pages, 2 figures, submitted to The 16th International Conference on Advances in Social Networks Analysis and Mining (ASONAM) '24Subjects: Computation and Language (cs.CL)
- [105] arXiv:2405.00823 [pdf, other]
-
Title: WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace SettingAuthors: Olly Styles, Sam Miller, Patricio Cerda-Mardini, Tanaya Guha, Victor Sanchez, Bertie VidgenSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
- [106] arXiv:2405.00821 [pdf, other]
-
Title: Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social MediaSubjects: Computation and Language (cs.CL)
- [107] arXiv:2405.00801 [pdf, ps, other]
-
Title: "Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real TimeSubjects: Computation and Language (cs.CL)
- [108] arXiv:2405.00732 [pdf, other]
-
Title: LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical ReportAuthors: Justin Zhao, Timothy Wang, Wael Abid, Geoffrey Angus, Arnav Garg, Jeffery Kinnison, Alex Sherstinsky, Piero Molino, Travis Addair, Devvret RishiSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [109] arXiv:2405.00728 [pdf, ps, other]
-
Title: Evaluating the Application of ChatGPT in Outpatient Triage Guidance: A Comparative StudyAuthors: Dou Liu, Ying Han, Xiandi Wang, Xiaomei Tan, Di Liu, Guangwu Qian, Kang Li, Dan Pu, Rong YinComments: 8 pages, 1 figure, conference(International Ergonomics Association)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [110] arXiv:2405.00722 [pdf, other]
-
Title: LLMs for Generating and Evaluating Counterfactuals: A Comprehensive StudySubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [111] arXiv:2405.00718 [pdf, other]
-
Title: Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language ModelsAuthors: Xu Ji, Jianyi Zhang, Ziyin Zhou, Zhangchi Zhao, Qianqian Qiao, Kaiying Han, Md Imran Hossen, Xiali HeiSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [112] arXiv:2405.00717 [pdf, other]
-
Title: Exploring News Summarization and Enrichment in a Highly Resource-Scarce Indian Language: A Case Study of MizoComments: Accepted at LREC-COLING2024 WILDRE WorkshopSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [113] arXiv:2405.00716 [pdf, other]
-
Title: Large Language Models in Healthcare: A Comprehensive BenchmarkSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [114] arXiv:2405.00715 [pdf, other]
-
Title: Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note GenerationAuthors: Hanyin Wang, Chufan Gao, Bolun Liu, Qiping Xu, Guleid Hussein, Mohamad El Labban, Kingsley Iheasirim, Hariprasad Korsapati, Jimeng SunSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [115] arXiv:2405.00711 [pdf, other]
-
Title: Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and OpportunitiesAuthors: Xiaomin Yu, Yezhaohui Wang, Yanfang Chen, Zhen Tao, Dinghao Xi, Shichao Song, Simin Niu, Zhiyu LiSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [116] arXiv:2405.00710 [pdf, ps, other]
-
Title: Homonym Sense Disambiguation in the Georgian LanguageSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [117] arXiv:2405.00709 [pdf, other]
-
Title: Evaluating Tool-Augmented Agents in Remote Sensing PlatformsComments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) WorkshopSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [118] arXiv:2405.00708 [pdf, other]
-
Title: Interactive Analysis of LLMs using Meaningful CounterfactualsAuthors: Furui Cheng, Vilém Zouhar, Robin Shing Moon Chan, Daniel Fürst, Hendrik Strobelt, Mennatallah El-AssadySubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [119] arXiv:2405.00706 [pdf, ps, other]
-
Title: Science Written by Generative AI is Perceived as Less Intelligent, but More Credible and Trustworthy than Science Written by HumansAuthors: David M. MarkowitzSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [120] arXiv:2405.00705 [pdf, other]
-
Title: SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-TuningAuthors: Yexiao He, Ziyao Wang, Zheyu Shen, Guoheng Sun, Yucong Dai, Yongkai Wu, Hongyi Wang, Ang LiSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [121] arXiv:2405.00704 [pdf, ps, other]
-
Title: A Survey on the Real Power of ChatGPTComments: 9 pages, 2 tablesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [122] arXiv:2405.01509 (cross-list from cs.CR) [pdf, other]
-
Title: Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language ModelsComments: not decidedSubjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [123] arXiv:2405.01483 (cross-list from cs.CV) [pdf, other]
-
Title: MANTIS: Interleaved Multi-Image Instruction TuningComments: 9 pages, 3 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [124] arXiv:2405.01413 (cross-list from cs.CV) [pdf, other]
-
Title: MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D PriorsComments: 17 pages, 9 figuresSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [125] arXiv:2405.01310 (cross-list from cs.IR) [pdf, other]
-
Title: Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease RemediationAuthors: Dr. Selva Kumar S, Afifah Khan Mohammed Ajmal Khan, Imadh Ajaz Banday, Manikantha Gada, Vibha Venkatesh ShanbhagComments: 6 pages, 3 figuresSubjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
- [126] arXiv:2405.01259 (cross-list from cs.AI) [pdf, other]
-
Title: Identification of Entailment and Contradiction Relations between Natural Language Sentences: A Neurosymbolic ApproachSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [127] arXiv:2405.01229 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Boosting Jailbreak Attack with MomentumComments: ICLR 2024 Workshop on Reliable and Responsible Foundation ModelsSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
- [128] arXiv:2405.01097 (cross-list from cs.CY) [pdf, other]
-
Title: Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-IdentificationComments: Accepted for publication at the ACM Conference on Fairness, Accountability, and Transparency 2024 (ACM FAccT'24). This is a preprint manuscript (authors' own version before final copy-editing)Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR); Software Engineering (cs.SE)
- [129] arXiv:2405.01040 (cross-list from cs.CV) [pdf, other]
-
Title: Few Shot Class Incremental Learning using Vision-Language modelsComments: under review at Pattern Recognition LettersSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
- [130] arXiv:2405.00981 (cross-list from cs.AI) [pdf, other]
-
Title: Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference ElicitationSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [131] arXiv:2405.00978 (cross-list from cs.IR) [pdf, other]
-
Title: Language Fairness in Multilingual Information RetrievalComments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paperSubjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
- [132] arXiv:2405.00977 (cross-list from cs.IR) [pdf, other]
-
Title: Distillation for Multilingual Information RetrievalComments: 6 pages, 1 figure, accepted at SIGIR 2024 as short paperSubjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
- [133] arXiv:2405.00975 (cross-list from cs.IR) [pdf, other]
-
Title: PLAID SHIRTTT for Large-Scale Streaming Dense RetrievalComments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paperSubjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
- [134] arXiv:2405.00949 (cross-list from cs.LG) [pdf, other]
-
Title: The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMASubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Chemical Physics (physics.chem-ph); Biomolecules (q-bio.BM)
- [135] arXiv:2405.00942 (cross-list from cs.CV) [pdf, other]
-
Title: LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMsAuthors: Somesh Singh, Harini S I, Yaman K Singla, Veeky Baths, Rajiv Ratn Shah, Changyou Chen, Balaji KrishnamurthySubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [136] arXiv:2405.00899 (cross-list from cs.HC) [pdf, other]
-
Title: Characterising the Creative Process in Humans and Large Language ModelsSubjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
- [137] arXiv:2405.00740 (cross-list from cs.CV) [pdf, other]
-
Title: Modeling Caption Diversity in Contrastive Vision-Language PretrainingAuthors: Samuel Lavoie, Polina Kirichenko, Mark Ibrahim, Mahmoud Assran, Andrew Gordon Wildon, Aaron Courville, Nicolas BallasComments: 14 pages, 8 figures, 7 tables, to be published at ICML2024Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [138] arXiv:2405.00693 (cross-list from cs.RO) [pdf, other]
-
Title: Large Language Models for Human-Robot Interaction: Opportunities and RisksAuthors: Jesse AtuhurraSubjects: Robotics (cs.RO); Computation and Language (cs.CL)
- [139] arXiv:2405.00688 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Understanding Social Perception, Interactions, and Safety Aspects of Sidewalk Delivery Robots Using Sentiment AnalysisComments: 34 pages, 7 figures, 2 tablesSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [140] arXiv:2405.00522 (cross-list from econ.GN) [pdf, other]
-
Title: DAM: A Universal Dual Attention Mechanism for Multimodal Timeseries Cryptocurrency Trend ForecastingSubjects: General Economics (econ.GN); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computational Finance (q-fin.CP)
Thu, 2 May 2024
- [141] arXiv:2405.00664 [pdf, other]
-
Title: Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [142] arXiv:2405.00659 [pdf, other]
-
Title: NLU-STR at SemEval-2024 Task 1: Generative-based Augmentation and Encoder-based Scoring for Semantic Textual RelatednessSubjects: Computation and Language (cs.CL)
- [143] arXiv:2405.00657 [pdf, other]
-
Title: RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive SummarizationComments: NAACL 2024 Main & Long Conference Paper (Oral Presentation)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [144] arXiv:2405.00632 [pdf, other]
-
Title: When Quantization Affects Confidence of Large Language Models?Comments: Accepted to NAACL 2024 FindingsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [145] arXiv:2405.00622 [pdf, other]
-
Title: Causal Evaluation of Language ModelsAuthors: Sirui Chen, Bo Peng, Meiqi Chen, Ruiqi Wang, Mengying Xu, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Yu Qiao, Chaochao LuComments: 315 pages, 230 figures, 21 tables. Project website: this https URLSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [146] arXiv:2405.00611 [pdf, other]
-
Title: Addressing Topic Granularity and Hallucination in Large Language Models for Topic ModellingSubjects: Computation and Language (cs.CL)
- [147] arXiv:2405.00602 [pdf, other]
-
Title: Investigating Automatic Scoring and Feedback using Large Language ModelsSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [148] arXiv:2405.00588 [pdf, other]
-
Title: Are Models Biased on Text without Gender-related Language?Comments: In International Conference on Learning Representations 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [149] arXiv:2405.00578 [pdf, other]
-
Title: The Real, the Better: Aligning Large Language Models with Online Human BehaviorsComments: 11 pages, 6 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [150] arXiv:2405.00557 [pdf, other]
-
Title: Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-AlignmentAuthors: Zhili Liu, Yunhao Gou, Kai Chen, Lanqing Hong, Jiahui Gao, Fei Mi, Yu Zhang, Zhenguo Li, Xin Jiang, Qun Liu, James T. KwokSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [151] arXiv:2405.00543 [pdf, other]
-
Title: New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment AnalysisSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [152] arXiv:2405.00536 [pdf, other]
-
Title: A Legal Framework for Natural Language Processing Model Training in PortugalComments: LEGAL2024 Legal and Ethical Issues in Human Language Technologies, LREC 2024Subjects: Computation and Language (cs.CL); Emerging Technologies (cs.ET)
- [153] arXiv:2405.00492 [pdf, other]
-
Title: Is Temperature the Creativity Parameter of Large Language Models?Comments: To be published in the Proceedings of the 15th International Conference on Computational Creativity (ICCC'24), 8 pages, 2 figures, 2 tablesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [154] arXiv:2405.00467 [pdf, other]
-
Title: Harnessing the Power of Multiple Minds: Lessons Learned from LLM RoutingComments: Accepted to Workshop on Insights from Negative Results in NLP 2024 (co-located with NAACL 2024)Subjects: Computation and Language (cs.CL)
- [155] arXiv:2405.00465 [pdf, other]
-
Title: BiomedRAG: A Retrieval Augmented Large Language Model for BiomedicineSubjects: Computation and Language (cs.CL)
- [156] arXiv:2405.00402 [pdf, other]
-
Title: Self-Refine Instruction-Tuning for Aligning Reasoning in Language ModelsSubjects: Computation and Language (cs.CL)
- [157] arXiv:2405.00390 [pdf, other]
-
Title: CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal ModelsComments: 25 pages, 7 figures, and 18 tablesSubjects: Computation and Language (cs.CL)
- [158] arXiv:2405.00361 [pdf, other]
-
Title: AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation ExpertsSubjects: Computation and Language (cs.CL)
- [159] arXiv:2405.00332 [pdf, other]
-
Title: A Careful Examination of Large Language Model Performance on Grade School ArithmeticAuthors: Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer YueSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [160] arXiv:2405.00321 [pdf, other]
-
Title: DFKI-NLP at SemEval-2024 Task 2: Towards Robust LLMs Using Data Perturbations and MinMax TrainingSubjects: Computation and Language (cs.CL)
- [161] arXiv:2405.00302 [pdf, other]
-
Title: Generating Feedback-Ladders for Logical Errors in Programming using Large Language ModelsComments: Published on the 17th EDM 2024 - Posters and Demos TrackSubjects: Computation and Language (cs.CL)
- [162] arXiv:2405.00301 [pdf, other]
-
Title: LITO: Learnable Intervention for Truthfulness OptimizationComments: 14 pages, 5 figuresSubjects: Computation and Language (cs.CL)
- [163] arXiv:2405.00291 [pdf, other]
-
Title: How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended ResponsesAuthors: Jionghao Lin, Eason Chen, Zeifei Han, Ashish Gurung, Danielle R. Thomas, Wei Tan, Ngoc Dang Nguyen, Kenneth R. KoedingerComments: 11 pages, full research paper, EDM 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [164] arXiv:2405.00289 [pdf, other]
-
Title: Adversarial Attacks and Defense for Conversation Entailment TaskSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [165] arXiv:2405.00273 [pdf, other]
-
Title: Social Life Simulation for Non-Cognitive Skills LearningSubjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
- [166] arXiv:2405.00263 [pdf, other]
-
Title: Clover: Regressive Lightweight Speculative Decoding with Sequential KnowledgeSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [167] arXiv:2405.00253 [pdf, other]
-
Title: CodeHalu: Code Hallucinations in LLMs Driven by Execution-based VerificationSubjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
- [168] arXiv:2405.00216 [pdf, other]
-
Title: Graphical Reasoning: LLM-based Semi-Open Relation ExtractionSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [169] arXiv:2405.00208 [pdf, other]
-
Title: A Primer on the Inner Workings of Transformer-based Language ModelsSubjects: Computation and Language (cs.CL)
- [170] arXiv:2405.00204 [pdf, other]
-
Title: General Purpose Verification for Chain of Thought PromptingAuthors: Robert Vacareanu, Anurag Pratik, Evangelia Spiliopoulou, Zheng Qi, Giovanni Paolini, Neha Anna John, Jie Ma, Yassine Benajiba, Miguel BallesterosComments: 22 pages, preprintSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [171] arXiv:2405.00201 [pdf, other]
-
Title: SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [172] arXiv:2405.00200 [pdf, other]
-
Title: In-Context Learning with Long-Context Models: An In-Depth ExplorationComments: 27 pages; preprintSubjects: Computation and Language (cs.CL)
- [173] arXiv:2405.00175 [pdf, other]
-
Title: Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language ModelsSubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [174] arXiv:2405.00155 [pdf, other]
-
Title: HistNERo: Historical Named Entity Recognition for the Romanian LanguageAuthors: Andrei-Marius Avram, Andreea Iuga, George-Vlad Manolache, Vlad-Cristian Matei, Răzvan-Gabriel Micliuş, Vlad-Andrei Muntean, Manuel-Petru Sorlescu, Dragoş-Andrei Şerban, Adrian-Dinu Urse, Vasile Păiş, Dumitru-Clementin CercelComments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)Subjects: Computation and Language (cs.CL)
- [175] arXiv:2405.00134 [pdf, other]
-
Title: Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary PronounsComments: 22 pages, 2 figures. Accepted at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [176] arXiv:2405.00675 (cross-list from cs.LG) [pdf, other]
-
Title: Self-Play Preference Optimization for Language Model AlignmentComments: 25 pages, 4 figures, 5 tablesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
- [177] arXiv:2405.00566 (cross-list from cs.CE) [pdf, other]
-
Title: NumLLM: Numeric-Sensitive Large Language Model for Chinese FinanceSubjects: Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); General Finance (q-fin.GN)
- [178] arXiv:2405.00523 (cross-list from cs.AI) [pdf, other]
-
Title: CookingSense: A Culinary Knowledgebase with Multidisciplinary AssertionsComments: LREC-COLING 2024 AcceptedSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [179] arXiv:2405.00516 (cross-list from cs.LG) [pdf, other]
-
Title: Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement LearningComments: ACM 2024, Avila Spain. 9 pagesJournal-ref: ACM SAC Conference 2024, Avila, Spain, Article 4, 9 pagesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [180] arXiv:2405.00494 (cross-list from cs.AI) [pdf, other]
-
Title: GOLD: Geometry Problem Solver with Natural Language DescriptionComments: Accepted in NAACL 2024 FindingsSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [181] arXiv:2405.00489 (cross-list from cs.LG) [pdf, other]
-
Title: Explainable Automatic Grading with Neural Additive ModelsSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Applications (stat.AP)
- [182] arXiv:2405.00461 (cross-list from cs.RO) [pdf, other]
-
Title: Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound ScanningComments: ICRA 2024 Full-day Workshop: C4SR+: Continuum, Compliant, Cooperative, CognitiveSubjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
- [183] arXiv:2405.00449 (cross-list from cs.LG) [pdf, other]
-
Title: RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language ModelsAuthors: Mohamed Manzour Hussien, Angie Nataly Melo, Augusto Luis Ballardini, Carlota Salinas Maldonado, Rubén Izquierdo, Miguel Ángel SoteloSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Neural and Evolutionary Computing (cs.NE)
- [184] arXiv:2405.00438 (cross-list from cs.LG) [pdf, other]
-
Title: MetaRM: Shifted Distributions Alignment via Meta-LearningAuthors: Shihan Dou, Yan Liu, Enyu Zhou, Tianlong Li, Haoxiang Jia, Limao Xiong, Xin Zhao, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing HuangComments: 11 pages, 6 figures. arXiv admin note: text overlap with arXiv:2401.06080Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
- [185] arXiv:2405.00123 (cross-list from cs.LG) [pdf, other]
-
Title: Graph Neural Network Approach to Semantic Type Detection in TablesJournal-ref: In Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 121-133. Singapore: Springer Nature Singapore, 2024Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
- [186] arXiv:2405.00099 (cross-list from cs.AI) [pdf, other]
-
Title: Creative Beam SearchSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [187] arXiv:2405.00021 (cross-list from cs.CV) [pdf, other]
-
Title: SIMPLOT: Enhancing Chart Question Answering by Distilling EssentialsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Wed, 1 May 2024
- [188] arXiv:2404.19737 [pdf, other]
-
Title: Better & Faster Large Language Models via Multi-token PredictionSubjects: Computation and Language (cs.CL)
- [189] arXiv:2404.19733 [pdf, other]
-
Title: Iterative Reasoning Preference OptimizationAuthors: Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho, He He, Sainbayar Sukhbaatar, Jason WestonSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [190] arXiv:2404.19714 [pdf, other]
-
Title: ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescentsComments: 4 pagesSubjects: Computation and Language (cs.CL)
- [191] arXiv:2404.19713 [pdf, ps, other]
-
Title: Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language ModelsAuthors: Scott SumpterComments: 22 pages but 12 are appendices which are examples of the main text. 3 figures, 4 tablesSubjects: Computation and Language (cs.CL)
- [192] arXiv:2404.19705 [pdf, other]
-
Title: When to Retrieve: Teaching LLMs to Utilize Information Retrieval EffectivelySubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [193] arXiv:2404.19597 [pdf, other]
-
Title: Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction TuningAuthors: Xuanli He, Jun Wang, Qiongkai Xu, Pasquale Minervini, Pontus Stenetorp, Benjamin I. P. Rubinstein, Trevor CohnComments: work in progressSubjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
- [194] arXiv:2404.19563 [pdf, other]
-
Title: RepEval: Effective Text Evaluation with LLM RepresentationAuthors: Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu ZhouSubjects: Computation and Language (cs.CL)
- [195] arXiv:2404.19553 [pdf, other]
-
Title: Extending Llama-3's Context Ten-Fold OvernightSubjects: Computation and Language (cs.CL)
- [196] arXiv:2404.19543 [pdf, other]
-
Title: RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language ProcessingComments: 30 pages, 7 figures. Draft version 1Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [197] arXiv:2404.19509 [pdf, other]
-
Title: Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcomComments: 14 pages, 8 tables and 5 figuresSubjects: Computation and Language (cs.CL)
- [198] arXiv:2404.19505 [pdf, other]
-
Title: Context-Aware Machine Translation with Source Coreference ExplanationComments: Accepted to TACL. This is a pre-MIT Press publication versionSubjects: Computation and Language (cs.CL)
- [199] arXiv:2404.19486 [pdf, other]
-
Title: Safe Training with Sensitive In-domain Data: Leveraging Data Fragmentation To Mitigate Linkage AttacksSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [200] arXiv:2404.19482 [pdf, other]
-
Title: FactCheck Editor: Multilingual Text Editor with End-to-End fact-checkingAuthors: Vinay SettyComments: Accepted in SIGIR 2024 (demo track)Subjects: Computation and Language (cs.CL)
- [201] arXiv:2404.19442 [pdf, other]
-
Title: Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource LanguagesComments: Working paperSubjects: Computation and Language (cs.CL)
- [202] arXiv:2404.19432 [pdf, other]
- [203] arXiv:2404.19430 [pdf, other]
-
Title: Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary CreationComments: Accepted to *SEM 2024Subjects: Computation and Language (cs.CL)
- [204] arXiv:2404.19409 [pdf, other]
-
Title: Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement LearningAuthors: Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier PietquinSubjects: Computation and Language (cs.CL)
- [205] arXiv:2404.19369 [pdf, ps, other]
-
Title: Evaluating Telugu Proficiency in Large Language Models_ A Comparative Analysis of ChatGPT and GeminiSubjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
- [206] arXiv:2404.19364 [pdf, other]
-
Title: Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible ModelsSubjects: Computation and Language (cs.CL)
- [207] arXiv:2404.19363 [pdf, other]
-
Title: Expressivity and Speech SynthesisComments: Invited contribution. Under reviewSubjects: Computation and Language (cs.CL)
- [208] arXiv:2404.19359 [pdf, other]
-
Title: Evaluating Lexicon Incorporation for Depression Symptom EstimationComments: Accepted to Clinical NLP workshop at NAACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [209] arXiv:2404.19335 [pdf, other]
-
Title: StablePT: Towards Stable Prompting for Few-shot Learning via Input SeparationComments: Submitted to ACL 2024Subjects: Computation and Language (cs.CL)
- [210] arXiv:2404.19328 [pdf, other]
-
Title: Computational Approaches for Integrating out Subjectivity in Cognate Synonym SelectionSubjects: Computation and Language (cs.CL); Populations and Evolution (q-bio.PE)
- [211] arXiv:2404.19319 [pdf, other]
-
Title: Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) BudgetComments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024Subjects: Computation and Language (cs.CL)
- [212] arXiv:2404.19316 [pdf, other]
-
Title: QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question AnsweringAuthors: Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing XiaoComments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)Subjects: Computation and Language (cs.CL)
- [213] arXiv:2404.19315 [pdf, other]
-
Title: Modeling Orthographic Variation in Occitan's DialectsAuthors: Zachary William Hopton (Language and Space Lab, University of Zurich), Noëmi Aepli (Department of Computational Linguistics, University of Zurich)Comments: Accepted at VarDial 2024: The Eleventh Workshop on NLP for Similar Languages, Varieties and DialectsSubjects: Computation and Language (cs.CL)
- [214] arXiv:2404.19310 [pdf, other]
-
Title: Does Whisper understand Swiss German? An automatic, qualitative, and human evaluationComments: Accepted to VarDial 2024 (the eleventh Workshop on NLP for Similar Languages, Varieties and Dialects 2024), Mexico CitySubjects: Computation and Language (cs.CL)
- [215] arXiv:2404.19296 [pdf, other]
-
Title: Octopus v4: Graph of language modelsSubjects: Computation and Language (cs.CL)
- [216] arXiv:2404.19260 [pdf, ps, other]
-
Title: Aspect and Opinion Term Extraction Using Graph Attention NetworkAuthors: Abir ChakrabortySubjects: Computation and Language (cs.CL)
- [217] arXiv:2404.19254 [pdf, other]
-
Title: Suvach -- Generated Hindi QA benchmarkSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [218] arXiv:2404.19252 [pdf, other]
-
Title: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media TextsSubjects: Computation and Language (cs.CL)
- [219] arXiv:2404.19245 [pdf, other]
-
Title: HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-TuningComments: 19 pages, 7 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [220] arXiv:2404.19232 [pdf, other]
- [221] arXiv:2404.19192 [pdf, other]
-
Title: Mix of Experts Language Model for Named Entity RecognitionSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [222] arXiv:2404.19178 [pdf, other]
-
Title: Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension MetricsSubjects: Computation and Language (cs.CL)
- [223] arXiv:2404.19175 [pdf, other]
-
Title: Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation DatasetSubjects: Computation and Language (cs.CL)
- [224] arXiv:2404.19159 [pdf, other]
-
Title: What Drives Performance in Multilingual Language Models?Comments: Accepted at VarDial @ NAACL 2024Subjects: Computation and Language (cs.CL)
- [225] arXiv:2404.19154 [pdf, other]
-
Title: RTF: Region-based Table Filling Method for Relational Triple ExtractionComments: Rejected by EMNLP 2023Subjects: Computation and Language (cs.CL)
- [226] arXiv:2404.19124 [pdf, other]
-
Title: Accelerating Production LLMs with Combined Token/Embedding SpeculatorsAuthors: Davis Wertheimer, Joshua Rosenkranz, Thomas Parnell, Sahil Suneja, Pavithra Ranganathan, Raghu Ganti, Mudhakar SrivatsaSubjects: Computation and Language (cs.CL)
- [227] arXiv:2404.19119 [pdf, ps, other]
-
Title: Effects of Added Emphasis and Pause in Audio Delivery of Health InformationAuthors: Arif Ahmed (1), Gondy Leroy (1), Stephen A. Rains (1), Philip Harber (1), David Kauchak (2), Prosanta Barai (1) ((1) The University of Arizona, (2) Pomona College)Comments: This manuscript is accepted to American Medical Informatics Association summit, 2024Subjects: Computation and Language (cs.CL)
- [228] arXiv:2404.19094 [pdf, other]
-
Title: In-Context Symbolic Regression: Leveraging Language Models for Function DiscoverySubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [229] arXiv:2404.19063 [pdf, other]
-
Title: SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and ApplicationsComments: 11 pages, 19 figures, and tablesSubjects: Computation and Language (cs.CL)
- [230] arXiv:2404.19055 [pdf, other]
-
Title: Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language ModelsAuthors: Houjun LiuComments: 7 pages, 2 figuresSubjects: Computation and Language (cs.CL)
- [231] arXiv:2404.19048 [pdf, other]
-
Title: A Framework for Real-time Safeguarding the Text Generation of Large Language ModelSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [232] arXiv:2404.19007 [pdf, other]
-
Title: How Did We Get Here? Summarizing Conversation DynamicsAuthors: Yilun Hua, Nicholas Chernogor, Yuzhe Gu, Seoyeon Julie Jeong, Miranda Luo, Cristian Danescu-Niculescu-MizilComments: To appear in the Proceedings of NAACL 2024. Data available in ConvoKit this https URLSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
- [233] arXiv:2404.18988 [pdf, other]
-
Title: Markovian Agents for Truthful Language ModelingComments: 21 pages, 6 figuresSubjects: Computation and Language (cs.CL)
- [234] arXiv:2404.18977 [pdf, other]
-
Title: Computational Job Market Analysis with Natural Language ProcessingAuthors: Mike ZhangComments: Ph.D. Thesis (315 total pages, 52 figures). The thesis slightly modified with this https URL ISBN (electronic): 978-87-7949-414-5Subjects: Computation and Language (cs.CL)
- [235] arXiv:2404.18971 [pdf, other]
-
Title: Credible, Unreliable or Leaked?: Evidence Verification for Enhanced Automated Fact-checkingAuthors: Zacharias Chrysidis, Stefanos-Iordanis Papadopoulos, Symeon Papadopoulos, Panagiotis C. PetrantonakisSubjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
- [236] arXiv:2404.18942 [pdf, other]
-
Title: GuideWalk -- Heterogeneous Data Fusion for Enhanced Learning -- A Multiclass Document Classification CaseSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
- [237] arXiv:2404.19753 (cross-list from cs.CV) [pdf, other]
-
Title: DOCCI: Descriptions of Connected and Contrasting ImagesAuthors: Yasumasa Onoe, Sunayana Rane, Zachary Berger, Yonatan Bitton, Jaemin Cho, Roopal Garg, Alexander Ku, Zarana Parekh, Jordi Pont-Tuset, Garrett Tanzer, Su Wang, Jason BaldridgeSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [238] arXiv:2404.19721 (cross-list from cs.AI) [pdf, ps, other]
-
Title: PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video GamesSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [239] arXiv:2404.19708 (cross-list from cs.LG) [pdf, other]
-
Title: Harmonic LLMs are TrustworthyComments: 15 pages, 4 figures, 14 tablesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
- [240] arXiv:2404.19696 (cross-list from cs.CV) [pdf, other]
-
Title: Naturally Supervised 3D Visual Grounding with Language-Regularized Concept LearnersComments: CVPR 2024. The first two authors contributed equallySubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [241] arXiv:2404.19484 (cross-list from cs.LG) [pdf, other]
-
Title: More Compute Is What You NeedAuthors: Zhen GuoSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [242] arXiv:2404.19360 (cross-list from cs.CV) [pdf, other]
-
Title: Large Language Model Informed Patent Image RetrievalComments: 8 pages. Under reviewSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [243] arXiv:2404.19318 (cross-list from cs.SE) [pdf, other]
-
Title: Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence ScoresSubjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
- [244] arXiv:2404.19317 (cross-list from cs.CV) [pdf, other]
-
Title: Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text RecognitionSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [245] arXiv:2404.19234 (cross-list from cs.AI) [pdf, other]
-
Title: Multi-hop Question Answering over Knowledge Graphs using Large Language ModelsAuthors: Abir ChakrabortySubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Databases (cs.DB)
- [246] arXiv:2404.19221 (cross-list from cs.CV) [pdf, other]
-
Title: Transcrib3D: 3D Referring Expression Resolution through Large Language ModelsAuthors: Jiading Fang, Xiangshan Tan, Shengjie Lin, Igor Vasiljevic, Vitor Guizilini, Hongyuan Mei, Rares Ambrus, Gregory Shakhnarovich, Matthew R WalterComments: CORLW 2023Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [247] arXiv:2404.19128 (cross-list from cs.CV) [pdf, other]
-
Title: Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAMComments: Accepted to CVPR 2024, Second Workshop on Foundation Models (WFM)Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [248] arXiv:2404.19071 (cross-list from cs.HC) [pdf, other]
-
Title: Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLPSubjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
- [249] arXiv:2404.19065 (cross-list from cs.AI) [pdf, other]
-
Title: HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language ModelsComments: Videos and code this https URLSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [250] arXiv:2404.18976 (cross-list from cs.LG) [pdf, other]
-
Title: Foundations of Multisensory Artificial IntelligenceAuthors: Paul Pu LiangComments: CMU Machine Learning Department PhD ThesisSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
- [251] arXiv:2404.18963 (cross-list from cs.LG) [pdf, other]
-
Title: RE-GrievanceAssist: Enhancing Customer Experience through ML-Powered Complaint ManagementSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
Tue, 30 Apr 2024 (showing first 89 of 99 entries)
- [252] arXiv:2404.18923 [pdf, other]
-
Title: Holmes: Benchmark the Linguistic Competence of Language ModelsSubjects: Computation and Language (cs.CL)
- [253] arXiv:2404.18911 [pdf, other]
-
Title: Kangaroo: Lossless Self-Speculative Decoding via Double Early ExitingSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [254] arXiv:2404.18880 [pdf, ps, other]
-
Title: Spivavtor: An Instruction Tuned Ukrainian Text Editing ModelComments: Accepted to UNLP Workshop 2024Subjects: Computation and Language (cs.CL)
- [255] arXiv:2404.18870 [pdf, other]
-
Title: More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model TrustworthinessSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [256] arXiv:2404.18865 [pdf, other]
-
Title: Truth-value judgment in language models: belief directions are context sensitiveSubjects: Computation and Language (cs.CL)
- [257] arXiv:2404.18851 [pdf, other]
-
Title: A Comprehensive Rubric for Annotating Pathological SpeechAuthors: Mario Corrales-Astorgano, David Escudero-Mancebo, Lourdes Aguilar, Valle Flores-Lucas, Valentín Cardeñoso-Payo, Carlos Vivaracho-Pascual, César González-FerrerasComments: Submitted to LREC-Coling 2024Subjects: Computation and Language (cs.CL)
- [258] arXiv:2404.18832 [pdf, other]
-
Title: It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient CommentsAuthors: Petter Mæhlum, David Samuel, Rebecka Maria Norman, Elma Jelin, Øyvind Andresen Bjertnæs, Lilja Øvrelid, Erik VelldalSubjects: Computation and Language (cs.CL)
- [259] arXiv:2404.18824 [pdf, other]
-
Title: Benchmarking Benchmark Leakage in Large Language ModelsComments: 30 pages; Homepage: this https URLSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [260] arXiv:2404.18810 [pdf, other]
-
Title: Unknown Script: Impact of Script on Cross-Lingual TransferComments: Paper accepted to NAACL Student Research Workshop (SRW) 2024Subjects: Computation and Language (cs.CL)
- [261] arXiv:2404.18796 [pdf, other]
-
Title: Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse ModelsAuthors: Pat Verga, Sebastian Hofstatter, Sophia Althammer, Yixuan Su, Aleksandra Piktus, Arkady Arkhangorodsky, Minjie Xu, Naomi White, Patrick LewisSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [262] arXiv:2404.18784 [pdf, other]
-
Title: Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User InputComments: NLP+CSS workshop at NAACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [263] arXiv:2404.18759 [pdf, ps, other]
-
Title: Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German PerspectiveComments: 10 pages, 6 tables, 30th Americas Conference on Information Systems (AMCIS 2024)Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
- [264] arXiv:2404.18739 [pdf, other]
-
Title: Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark ClassificationComments: to be published in LREC-COLING 2024Subjects: Computation and Language (cs.CL)
- [265] arXiv:2404.18726 [pdf, other]
-
Title: The Constant in HATE: Analyzing Toxicity in Reddit across Topics and LanguagesComments: Accepted to TRAC 2024Subjects: Computation and Language (cs.CL)
- [266] arXiv:2404.18708 [pdf, other]
-
Title: Iconic Gesture SemanticsComments: 39 pages, 28 figures, under revisionSubjects: Computation and Language (cs.CL)
- [267] arXiv:2404.18684 [pdf, other]
-
Title: Work Smarter...Not Harder: Efficient Minimization of Dependency Length in SOV LanguagesComments: Accepted at CogSci-2024 as talk with full paper publicationSubjects: Computation and Language (cs.CL); Theoretical Economics (econ.TH); Optimization and Control (math.OC)
- [268] arXiv:2404.18655 [pdf, other]
-
Title: Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution MethodsComments: 14 pages, 6 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [269] arXiv:2404.18624 [pdf, other]
-
Title: Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?Comments: 27 pages, from which 12 pages contain the text of the main paper. 8 figures, 11 tablesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [270] arXiv:2404.18615 [pdf, other]
-
Title: The SAMER Arabic Text Simplification CorpusComments: Accepted to LREC-COLING 2024. 15 pages, 6 tables, 1 figureSubjects: Computation and Language (cs.CL)
- [271] arXiv:2404.18585 [pdf, other]
-
Title: FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question AnsweringComments: Accepted at NAACL 2024Subjects: Computation and Language (cs.CL)
- [272] arXiv:2404.18570 [pdf, other]
-
Title: Analyzing Semantic Change through Lexical ReplacementsSubjects: Computation and Language (cs.CL)
- [273] arXiv:2404.18564 [pdf, other]
-
Title: Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought ReasoningComments: arXiv admin note: substantial text overlap with arXiv:2308.14266Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [274] arXiv:2404.18557 [pdf, other]
-
Title: Can GPT-4 do L2 analytic assessment?Comments: Accepted for the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2024)Subjects: Computation and Language (cs.CL)
- [275] arXiv:2404.18543 [pdf, other]
-
Title: Time Machine GPTComments: NAACL Findings 2024Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
- [276] arXiv:2404.18534 [pdf, other]
-
Title: Evaluating and Mitigating Linguistic Discrimination in Large Language ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Software Engineering (cs.SE)
- [277] arXiv:2404.18532 [pdf, other]
-
Title: MileBench: Benchmarking MLLMs in Long ContextComments: 29 pages, 13 figures, 14 tablesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
- [278] arXiv:2404.18510 [pdf, other]
-
Title: Explainability of Machine Learning Approaches in Forensic Linguistics: A Case Study in Geolinguistic Authorship ProfilingSubjects: Computation and Language (cs.CL)
- [279] arXiv:2404.18466 [pdf, other]
-
Title: HFT: Half Fine-Tuning for Large Language ModelsComments: Work in progressSubjects: Computation and Language (cs.CL)
- [280] arXiv:2404.18460 [pdf, other]
-
Title: Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them inSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [281] arXiv:2404.18443 [pdf, other]
-
Title: BMRetriever: Tuning Large Language Models as Better Biomedical Text RetrieversAuthors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl YangComments: Work in progress. The model and data will be uploaded to \url{this https URL}Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Quantitative Methods (q-bio.QM)
- [282] arXiv:2404.18410 [pdf, other]
- [283] arXiv:2404.18398 [pdf, other]
-
Title: MM-TTS: A Unified Framework for Multimodal, Prompt-Induced Emotional Text-to-Speech SynthesisSubjects: Computation and Language (cs.CL); Multimedia (cs.MM)
- [284] arXiv:2404.18384 [pdf, other]
-
Title: Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal InterventionsSubjects: Computation and Language (cs.CL)
- [285] arXiv:2404.18371 [pdf, other]
-
Title: QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and BeyondAuthors: Tomoki Fukuma, Koki Noda, Toshihide Ubukata Kousuke Hoso, Yoshiharu Ichikawa, Kyosuke Kambe, Yu Masubuch, Fujio ToriumiComments: Under review as a conference paper at COLM 2024Subjects: Computation and Language (cs.CL)
- [286] arXiv:2404.18359 [pdf, other]
-
Title: FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language ModelsAuthors: Wei Li, Ren Ma, Jiang Wu, Chenya Gu, Jiahui Peng, Jinyang Len, Songyang Zhang, Hang Yan, Dahua Lin, Conghui HeSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [287] arXiv:2404.18286 [pdf, other]
-
Title: Comparing LLM prompting with Cross-lingual transfer performance on Indigenous and Low-resource Brazilian LanguagesComments: Accepted to the Americas NLP Workshop at NAACL 2024 (this https URL)Subjects: Computation and Language (cs.CL)
- [288] arXiv:2404.18276 [pdf, ps, other]
-
Title: Bias Neutralization Framework: Measuring Fairness in Large Language Models with Bias Intelligence Quotient (BiQ)Comments: 41 pagesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [289] arXiv:2404.18271 [pdf, other]
-
Title: Parameter-Efficient Tuning Large Language Models for Graph Representation LearningSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [290] arXiv:2404.18264 [pdf, other]
-
Title: Modeling Orthographic Variation Improves NLP Performance for Nigerian PidginComments: Accepted to LREC-COLING 2024 Main ConferenceSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [291] arXiv:2404.18257 [pdf, other]
-
Title: Mapping 'when'-clauses in Latin American and Caribbean languages: an experiment in subtoken-based typologyAuthors: Nilo PedrazziniComments: 10 pages, 6 figures. To be published in the 2024 Proceedings of the Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP)Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [292] arXiv:2404.18255 [pdf, other]
-
Title: PatentGPT: A Large Language Model for Intellectual PropertyAuthors: Zilong Bai, Ruiji Zhang, Linqing Chen, Qijun Cai, Yuan Zhong, Cong Wang, Yan Fang, Jie Fang, Jing Sun, Weikuan Wang, Lizhi Zhou, Haoran Hua, Tian Qiu, Chaochao Wang, Cheng Sun, Jianping Lu, Yixin Wang, Yubin Xia, Meng Hu, Haowen Liu, Peng Xu, Licong Xu, Fu Bian, Xiaolong Gu, Lisha Zhang, Weilei Wang, Changyang TuComments: 19 pages, 9 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [293] arXiv:2404.18243 [pdf, other]
-
Title: LEGENT: Open Platform for Embodied AgentsAuthors: Zhili Cheng, Zhitong Wang, Jinyi Hu, Shengding Hu, An Liu, Yuge Tu, Pengkai Li, Lei Shi, Zhiyuan Liu, Maosong SunComments: Demo PaperSubjects: Computation and Language (cs.CL)
- [294] arXiv:2404.18231 [pdf, other]
-
Title: From Persona to Personalization: A Survey on Role-Playing Language AgentsAuthors: Jiangjie Chen, Xintao Wang, Rui Xu, Siyu Yuan, Yikai Zhang, Wei Shi, Jian Xie, Shuang Li, Ruihan Yang, Tinghui Zhu, Aili Chen, Nianqi Li, Lida Chen, Caiyu Hu, Siye Wu, Scott Ren, Ziquan Fu, Yanghua XiaoComments: PreprintSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [295] arXiv:2404.18228 [pdf, other]
-
Title: TextGram: Towards a better domain-adaptive pretrainingAuthors: Sharayu Hiwarkhedkar, Saloni Mittal, Vidula Magdum, Omkar Dhekane, Raviraj Joshi, Geetanjali Kale, Arnav LadkatComments: Accepted at SPELLL 2023Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [296] arXiv:2404.18216 [pdf, other]
-
Title: L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in MarathiComments: Accepted at SPELLL 2023Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [297] arXiv:2404.18191 [pdf, other]
-
Title: Exploring the Robustness of In-Context Learning with Noisy LabelsComments: ICLR 2024 Workshop on Reliable and Responsible Foundation ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Optimization and Control (math.OC)
- [298] arXiv:2404.18180 [pdf, other]
-
Title: EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian TwitterAuthors: Comfort Eseohen Ilevbare, Jesujoba O. Alabi, David Ifeoluwa Adelani, Firdous Damilola Bakare, Oluwatoyin Bunmi Abiola, Oluwaseyi Adesina AdeyemoComments: AfricaNLP workshop @ ICLR2024 and WOAH @ NAACL2024Subjects: Computation and Language (cs.CL)
- [299] arXiv:2404.18154 [pdf, other]
-
Title: Explaining vague languageSubjects: Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT); Information Theory (cs.IT)
- [300] arXiv:2404.18085 [pdf, other]
-
Title: CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language ModelComments: preprintSubjects: Computation and Language (cs.CL)
- [301] arXiv:2404.18072 [pdf, ps, other]
-
Title: Contextual Spelling Correction with Language Model for Low-resource SettingComments: 8 pagesSubjects: Computation and Language (cs.CL)
- [302] arXiv:2404.18071 [pdf, ps, other]
-
Title: Can Perplexity Predict Fine-Tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for NepaliComments: 11 pagesSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [303] arXiv:2404.18057 [pdf, other]
-
Title: Efficient LLM Inference with KcacheComments: Technical Report, 8 pagesSubjects: Computation and Language (cs.CL)
- [304] arXiv:2404.18043 [pdf, ps, other]
-
Title: Utilizing Large Language Models for Information Extraction from Real Estate TransactionsSubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [305] arXiv:2404.18040 [pdf, other]
-
Title: Fashion Recommendation: Outfit Compatibility using GNNAuthors: Samaksh GulatiSubjects: Computation and Language (cs.CL)
- [306] arXiv:2404.18031 [pdf, other]
-
Title: Quality Estimation with $k$-nearest Neighbors and Automatic Evaluation for Model-specific Quality EstimationComments: Accepted to EAMT 2024Subjects: Computation and Language (cs.CL)
- [307] arXiv:2404.17999 [pdf, other]
-
Title: MediFact at MEDIQA-CORR 2024: Why AI Needs a Human TouchAuthors: Nadia SaeedComments: 7 pages, 4 figures, Clinical NLP 2024 WorkshopSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [308] arXiv:2404.17991 [pdf, other]
-
Title: Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading ComprehensionSubjects: Computation and Language (cs.CL)
- [309] arXiv:2404.17985 [pdf, other]
-
Title: Detection of Conspiracy Theories Beyond Keyword Bias in German-Language Telegram Using Large Language ModelsComments: Accepted to the 8th Workshop on Online Abuse and Harms (WOAH), ACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
- [310] arXiv:2404.17975 [pdf, ps, other]
-
Title: Automating Customer Needs Analysis: A Comparative Study of Large Language Models in the Travel IndustrySubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [311] arXiv:2404.17968 [pdf, other]
-
Title: Usefulness of Emotional Prosody in Neural Machine TranslationComments: 5 pages, In Proceedings of the 11th International Conference on Speech Prosody (SP), Leiden, The Netherlands, 2024Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [312] arXiv:2404.17949 [pdf, other]
-
Title: Transfer Learning Enhanced Single-choice Decision for Multi-choice Question AnsweringComments: 10 pages, 1 figures.This article supersedes arXiv:2011.03292Subjects: Computation and Language (cs.CL)
- [313] arXiv:2404.17918 [pdf, other]
-
Title: I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation ArchitecturesSubjects: Computation and Language (cs.CL)
- [314] arXiv:2404.17912 [pdf, other]
-
Title: SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language ModelsAuthors: Manav Nitin Kapadnis, Sohan Patnaik, Abhilash Nandy, Sourjyadip Ray, Pawan Goyal, Debdoot SheetComments: 8 pages, 3 figures, 4 tables, Accepted as oral at Clinical NLP workshop at NAACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [315] arXiv:2404.17897 [pdf, other]
-
Title: Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language ModelsAuthors: Zhongzhen Huang, Kui Xue, Yongqi Fan, Linjie Mu, Ruoyu Liu, Tong Ruan, Shaoting Zhang, Xiaofan ZhangSubjects: Computation and Language (cs.CL)
- [316] arXiv:2404.17877 [pdf, ps, other]
-
Title: PromptCL: Improving Event Representation via Prompt Template and Contrastive LearningComments: NLPCC 2023 Best Student PaperJournal-ref: Natural Language Processing and Chinese Computing (NLPCC 2023)Subjects: Computation and Language (cs.CL)
- [317] arXiv:2404.17874 [pdf, other]
-
Title: From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech DatasetsComments: Accepted at WOAH (NAACL 2024)Subjects: Computation and Language (cs.CL)
- [318] arXiv:2404.17862 [pdf, other]
-
Title: Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph SpectrumComments: 10 pages, 4 figuresSubjects: Computation and Language (cs.CL)
- [319] arXiv:2404.17858 [pdf, other]
-
Title: Revisiting Multi-modal Emotion Learning with Broad State Space Models and Probability-guidance FusionComments: 10 pages, 6 figuresSubjects: Computation and Language (cs.CL)
- [320] arXiv:2404.17841 [pdf, other]
-
Title: Toxicity Classification in UkrainianComments: Accepted to WOAH, NAACL, 2024. arXiv admin note: text overlap with arXiv:2404.02043Subjects: Computation and Language (cs.CL)
- [321] arXiv:2404.17835 [pdf, other]
-
Title: VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity RecognitionSubjects: Computation and Language (cs.CL)
- [322] arXiv:2404.17832 [pdf, other]
-
Title: Evaluation of Few-Shot Learning for Classification Tasks in the Polish LanguageComments: 34 pages, 3 figures, 10 tablesSubjects: Computation and Language (cs.CL)
- [323] arXiv:2404.17809 [pdf, other]
-
Title: Recall, Retrieve and Reason: Towards Better In-Context Relation ExtractionComments: IJCAI 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [324] arXiv:2404.17808 [pdf, other]
-
Title: Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token RemovalAuthors: Haoran Lian, Yizhe Xiong, Jianwei Niu, Shasha Mo, Zhenpeng Su, Zijia Lin, Peng Liu, Hui Chen, Guiguang DingSubjects: Computation and Language (cs.CL)
- [325] arXiv:2404.17807 [pdf, other]
-
Title: Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation ExtractorsComments: IJCAI 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [326] arXiv:2404.17802 [pdf, other]
-
Title: Empirical Analysis of Dialogue Relation Extraction with Large Language ModelsComments: IJCAI 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [327] arXiv:2404.17790 [pdf, other]
-
Title: Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language CapabilitiesAuthors: Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, Naoaki OkazakiSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [328] arXiv:2404.17785 [pdf, other]
-
Title: Temporal Scaling Law for Large Language ModelsAuthors: Yizhe Xiong, Xiansheng Chen, Xin Ye, Hui Chen, Zijia Lin, Haoran Lian, Jianwei Niu, Guiguang DingComments: Work in progressSubjects: Computation and Language (cs.CL)
- [329] arXiv:2404.17779 [pdf, other]
-
Title: Medical Vision-Language Pre-Training for Brain AbnormalitiesSubjects: Computation and Language (cs.CL)
- [330] arXiv:2404.17778 [pdf, other]
-
Title: MRScore: Evaluating Radiology Report Generation with LLM-based Reward SystemSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [331] arXiv:2404.17733 [pdf, other]
-
Title: Building a Large Japanese Web Corpus for Large Language ModelsAuthors: Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae MizukiComments: 17 pagesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [332] arXiv:2404.17729 [pdf, other]
-
Title: CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem SolvingComments: Accepted to NAACL 2024Subjects: Computation and Language (cs.CL)
- [333] arXiv:2404.17662 [pdf, other]
-
Title: PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery GamesSubjects: Computation and Language (cs.CL)
- [334] arXiv:2404.17642 [pdf, other]
-
Title: Empowering Large Language Models for Textual Data AugmentationSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [335] arXiv:2404.18928 (cross-list from cs.CV) [pdf, other]
-
Title: Stylus: Automatic Adapter Selection for Diffusion ModelsAuthors: Michael Luo, Justin Wong, Brandon Trabucco, Yanping Huang, Joseph E. Gonzalez, Zhifeng Chen, Ruslan Salakhutdinov, Ion StoicaComments: Project Website: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
- [336] arXiv:2404.18922 (cross-list from cs.LG) [pdf, other]
-
Title: DPO Meets PPO: Reinforced Token Optimization for RLHFSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
- [337] arXiv:2404.18722 (cross-list from cs.CV) [pdf, ps, other]
-
Title: Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source LibraryAuthors: Solène Tarride, Yoann Schneider, Marie Generali-Lince, Mélodie Boillet, Bastien Abadie, Christopher KermorvantSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [338] arXiv:2404.18518 (cross-list from cs.DL) [pdf, ps, other]
-
Title: From ChatGPT, DALL-E 3 to Sora: How has Generative AI Changed Digital Humanities Research and Services?Comments: 21 pages, 3 figuresSubjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
- [339] arXiv:2404.18470 (cross-list from cs.CE) [pdf, other]
-
Title: ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance PredictionComments: 15 pages, 3 figures, 5 tablesSubjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Risk Management (q-fin.RM); Trading and Market Microstructure (q-fin.TR)
- [340] arXiv:2404.18416 (cross-list from cs.AI) [pdf, other]
-
Title: Capabilities of Gemini Models in MedicineAuthors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G.T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby, Nenad Tomasev, Jan Freyberg, Charles Lau, Jonas Kemp, Jeremy Lai, Shekoofeh Azizi, Kimberly Kanada, SiWai Man, Kavita Kulkarni, Ruoxi Sun, Siamak Shakeri, Luheng He, Ben Caine, Albert Webson, Natasha Latysheva, Melvin Johnson, Philip Mansfield, Jian Lu, Ehud Rivlin, Jesper Anderson, Bradley Green, Renee Wong, Jonathan Krause, Jonathon Shlens, Ewa Dominowska, S. M. Ali Eslami, Katherine Chou, Claire Cui, Oriol Vinyals, Koray Kavukcuoglu, et al. (12 additional authors not shown)Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[ showing 340 entries per page: fewer | more | all ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)