Computation and Language
Authors and titles for recent submissions
[ total of 353 entries: 1-147 | 148-294 | 295-353 ][ showing 147 entries per page: fewer | more | all ]
Wed, 24 Apr 2024
- [1] arXiv:2404.15269 [pdf, other]
-
Title: Aligning LLM Agents by Learning Latent Preference from User EditsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [2] arXiv:2404.15247 [pdf, other]
-
Title: XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-ExpertsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Software Engineering (cs.SE)
- [3] arXiv:2404.15238 [pdf, other]
-
Title: CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language TechnologiesAuthors: Weiyan Shi, Ryan Li, Yutong Zhang, Caleb Ziems, Chunhua yu, Raya Horesh, Rogério Abreu de Paula, Diyi YangComments: 32 pages, 7 figures, preprintSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [4] arXiv:2404.15219 [pdf, other]
-
Title: The Power of the Noisy Channel: Unsupervised End-to-End Task-Oriented Dialogue with LLMsComments: 16 Pages, 7 FiguresSubjects: Computation and Language (cs.CL)
- [5] arXiv:2404.15206 [pdf, other]
-
Title: Does Instruction Tuning Make LLMs More Consistent?Subjects: Computation and Language (cs.CL)
- [6] arXiv:2404.15196 [pdf, other]
-
Title: Setting up the Data Printer with Improved English to Ukrainian Machine TranslationSubjects: Computation and Language (cs.CL)
- [7] arXiv:2404.15166 [pdf, other]
-
Title: Pixels and Predictions: Potential of GPT-4V in Meteorological Imagery Analysis and Forecast CommunicationAuthors: John R. Lawson, Montgomery L. Flora, Kevin H. Goebbert, Seth N. Lyman, Corey K. Potvin, David M. Schultz, Adam J. Stepanek, Joseph E. Trujillo-FalcónComments: Supplementary material PDF attached. Submitted to Artificial Intelligence for the Earth Systems (American Meteorological Society) on 18 April 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Atmospheric and Oceanic Physics (physics.ao-ph)
- [8] arXiv:2404.15159 [pdf, other]
-
Title: MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA based Mixture of ExpertsAuthors: Dengchun Li, Yingzi Ma, Naizheng Wang, Zhiyuan Cheng, Lei Duan, Jie Zuo, Cal Yang, Mingjie TangComments: 11 pages, 4 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [9] arXiv:2404.15157 [pdf, other]
-
Title: FASTTRACK: Fast and Accurate Fact Tracing for LLMsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [10] arXiv:2404.15156 [pdf, other]
-
Title: Regressive Side Effects of Training Language Models to Mimic Student MisconceptionsSubjects: Computation and Language (cs.CL)
- [11] arXiv:2404.15155 [pdf, other]
-
Title: Adaptive Collaboration Strategy for LLMs in Medical Decision MakingAuthors: Yubin Kim, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Cynthia Breazeal, Hae Won ParkSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [12] arXiv:2404.15154 [pdf, other]
-
Title: Do not think pink elephant!Comments: This paper is accepted in CVPRWSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [13] arXiv:2404.15153 [pdf, other]
-
Title: Expert Router: Orchestrating Efficient Language Model Inference through Prompt ClassificationSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Performance (cs.PF)
- [14] arXiv:2404.15149 [pdf, other]
-
Title: Bias patterns in the application of LLMs for clinical decision support: A comprehensive studySubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [15] arXiv:2404.15104 [pdf, other]
-
Title: Identifying Fairness Issues in Automatically Generated Testing ContentAuthors: Kevin Stowe, Benny Longwill, Alyssa Francis, Tatsuya Aoyama, Debanjan Ghosh, Swapna SomasundaranComments: 18 pages, 3 figures, accepted to the 19th Workshop on Innovative Use of NLP for Building Educational ApplicationsSubjects: Computation and Language (cs.CL)
- [16] arXiv:2404.15103 [pdf, other]
-
Title: Multi-view Content-aware Indexing for Long Document RetrievalAuthors: Kuicai Dong, Derrick Goh Xin Deik, Yi Quan Lee, Hao Zhang, Xiangyang Li, Cong Zhang, Yong LiuSubjects: Computation and Language (cs.CL)
- [17] arXiv:2404.15067 [pdf, other]
-
Title: Enhancing Textual Personality Detection toward Social Media: Integrating Long-term and Short-term PerspectivesAuthors: Haohao Zhu, Xiaokun Zhang, Junyu Lu, Youlin Wu, Zewen Bai, Changrong Min, Liang Yang, Bo Xu, Dongyu Zhang, Hongfei LinComments: 11 pages, 9 figuresSubjects: Computation and Language (cs.CL)
- [18] arXiv:2404.15045 [pdf, other]
-
Title: Multi-Head Mixture-of-ExpertsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [19] arXiv:2404.15004 [pdf, other]
-
Title: TAXI: Evaluating Categorical Knowledge Editing for Language ModelsSubjects: Computation and Language (cs.CL)
- [20] arXiv:2404.15003 [pdf, other]
-
Title: Comparison of Current Approaches to Lemmatization: A Case Study in EstonianComments: 6 pages, 2 figuresJournal-ref: Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 280-285, May 2023Subjects: Computation and Language (cs.CL)
- [21] arXiv:2404.14994 [pdf, other]
-
Title: Transformers Can Represent $n$-gram Language ModelsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Complexity (cs.CC); Formal Languages and Automata Theory (cs.FL); Machine Learning (cs.LG)
- [22] arXiv:2404.14963 [pdf, other]
-
Title: Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Perfect ReasonersSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [23] arXiv:2404.14943 [pdf, other]
-
Title: Does It Make Sense to Explain a Black Box With Another Black Box?Comments: This article was originally published in French at the Journal TAL. VOL 64 n{\deg}3/2023. arXiv admin note: substantial text overlap with arXiv:2402.10888Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [24] arXiv:2404.14914 [pdf, other]
-
Title: Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language ModelsAuthors: Kostiantyn Omelianchuk, Andrii Liubonko, Oleksandr Skurzhanskyi, Artem Chernodub, Oleksandr Korniienko, Igor SamokhinSubjects: Computation and Language (cs.CL)
- [25] arXiv:2404.14897 [pdf, other]
-
Title: Beyond the Speculative Game: A Survey of Speculative Execution in Large Language ModelsComments: 10 pages, 4 figures, 1 table, rejected from IJCAI 2024, revision in progressSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [26] arXiv:2404.14883 [pdf, ps, other]
-
Title: Language in Vivo vs. in Silico: Size Matters but Larger Language Models Still Do Not Comprehend Language on a Par with HumansSubjects: Computation and Language (cs.CL)
- [27] arXiv:2404.14850 [pdf, other]
-
Title: Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language ModelsAuthors: Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang HongComments: 30 pages, 4 figures, 8 tablesSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
- [28] arXiv:2404.14827 [pdf, other]
-
Title: Sentence-Level or Token-Level? A Comprehensive Study on Knowledge DistillationSubjects: Computation and Language (cs.CL)
- [29] arXiv:2404.14812 [pdf, other]
-
Title: Pattern-Aware Chain-of-Thought Prompting in Large Language ModelsSubjects: Computation and Language (cs.CL)
- [30] arXiv:2404.14809 [pdf, other]
-
Title: A Survey of Large Language Models on Generative Graph Analytics: Query, Learning, and ApplicationsComments: 31 pages including references, 22 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
- [31] arXiv:2404.14795 [pdf, other]
-
Title: Talk Too Much: Poisoning Large Language Models under Token LimitComments: 20 pagesSubjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [32] arXiv:2404.14779 [pdf, other]
-
Title: Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient ApproachesAuthors: Clément Christophe, Praveen K Kanithi, Prateek Munjal, Tathagata Raha, Nasir Hayat, Ronnie Rajan, Ahmed Al-Mahrooqi, Avani Gupta, Muhammad Umar Salman, Gurpreet Gosal, Bhargav Kanakiya, Charles Chen, Natalia Vassilieva, Boulbaba Ben Amor, Marco AF Pimentel, Shadab KhanComments: Published at AAAI 2024 Spring Symposium - Clinical Foundation ModelsSubjects: Computation and Language (cs.CL)
- [33] arXiv:2404.14777 [pdf, other]
-
Title: CT-Agent: Clinical Trial Multi-Agent with Large Language Model-based ReasoningSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [34] arXiv:2404.14772 [pdf, other]
-
Title: Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language ModelsAuthors: Chris Samarinas, Pracha Promthaw, Atharva Nijasure, Hansi Zeng, Julian Killingback, Hamed ZamaniSubjects: Computation and Language (cs.CL)
- [35] arXiv:2404.14760 [pdf, other]
-
Title: Retrieval Augmented Generation for Domain-specific Question AnsweringAuthors: Sanat Sharma, David Seunghyun Yoon, Franck Dernoncourt, Dewang Sultania, Karishma Bagga, Mengjiao Zhang, Trung Bui, Varun KotteComments: AAAI 2024 (Association for the Advancement of Artificial Intelligence) Scientific Document Understanding WorkshopSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- [36] arXiv:2404.14741 [pdf, other]
-
Title: Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question AnsweringAuthors: Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Kang Liu, Jun ZhaoSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [37] arXiv:2404.14740 [pdf, other]
-
Title: Modeling the Sacred: Considerations when Using Considerations when Using Religious Texts in Natural Language ProcessingAuthors: Ben HutchinsonComments: Findings of NAACL2024Subjects: Computation and Language (cs.CL)
- [38] arXiv:2404.14723 [pdf, other]
-
Title: Insights into Alignment: Evaluating DPO and its Variants Across Multiple TasksSubjects: Computation and Language (cs.CL)
- [39] arXiv:2404.14716 [pdf, other]
-
Title: Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual ModalitiesComments: 16 pages, 6 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [40] arXiv:2404.14695 [pdf, other]
-
Title: MisgenderMender: A Community-Informed Approach to Interventions for MisgenderingComments: NAACL 2024Subjects: Computation and Language (cs.CL)
- [41] arXiv:2404.14680 [pdf, other]
-
Title: Automated Multi-Language to English Machine Translation Using Generative Pre-Trained TransformersSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [42] arXiv:2404.14631 [pdf, other]
-
Title: Learning Word Embedding with Better Distance Weighting and Window Size SchedulingAuthors: Chaohao YangSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [43] arXiv:2404.14619 [pdf, other]
-
Title: OpenELM: An Efficient Language Model Family with Open-source Training and Inference FrameworkAuthors: Sachin Mehta, Mohammad Hossein Sekhavat, Qingqing Cao, Maxwell Horton, Yanzi Jin, Chenfan Sun, Iman Mirzadeh, Mahyar Najibi, Dmitry Belenko, Peter Zatloukal, Mohammad RastegariSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [44] arXiv:2404.14607 [pdf, other]
-
Title: Q-Tuning: Queue-based Prompt Tuning for Lifelong Few-shot Language LearningComments: Accepted to NAACL 2024 findingsSubjects: Computation and Language (cs.CL)
- [45] arXiv:2404.14604 [pdf, other]
-
Title: Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension TrainingSubjects: Computation and Language (cs.CL)
- [46] arXiv:2404.14567 [pdf, other]
-
Title: WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language ModelsSubjects: Computation and Language (cs.CL)
- [47] arXiv:2404.14544 [pdf, other]
-
Title: WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and CorrectionSubjects: Computation and Language (cs.CL)
- [48] arXiv:2404.14469 [pdf, other]
-
Title: SnapKV: LLM Knows What You are Looking for Before GenerationAuthors: Yuhong Li, Yingbing Huang, Bowen Yang, Bharat Venkitesh, Acyr Locatelli, Hanchen Ye, Tianle Cai, Patrick Lewis, Deming ChenSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [49] arXiv:2404.14467 [pdf, other]
-
Title: Integrating Chemistry Knowledge in Large Language Models via Prompt EngineeringComments: 43 pages, 17 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [50] arXiv:2404.14465 [pdf, other]
-
Title: Benchmarking Advanced Text Anonymisation Methods: A Comparative Study on Novel and Traditional ApproachesAuthors: Dimitris Asimopoulos, Ilias Siniosoglou, Vasileios Argyriou, Thomai Karamitsou, Eleftherios Fountoukidis, Sotirios K. Goudos, Ioannis D. Moscholios, Konstantinos E. Psannis, Panagiotis SarigiannidisSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [51] arXiv:2404.14464 [pdf, other]
-
Title: Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question AnsweringComments: Keywords: Muti-hop Question Answering; Retrieval-Augmented Generation; Tree of Thought; Reasoning TLDR: We proposed a tree-based dynamic, iterative retrieval framework for multi-hop question answeringSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- [52] arXiv:2404.14463 [pdf, other]
-
Title: DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical InterviewsAuthors: Sergio Burdisso, Ernesto Reyes-Ramírez, Esaú Villatoro-Tello, Fernando Sánchez-Vega, Pastor López-Monroy, Petr MotlicekComments: Accepted to Clinical NLP workshop at NAACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [53] arXiv:2404.14461 [pdf, other]
-
Title: Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMsAuthors: Javier Rando, Francesco Croce, Kryštof Mitka, Stepan Shabalin, Maksym Andriushchenko, Nicolas Flammarion, Florian TramèrComments: Competition ReportSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- [54] arXiv:2404.14454 [pdf, other]
-
Title: Reinforcement of Explainability of ChatGPT Prompts by Embedding Breast Cancer Self-Screening Rules into AI ResponsesComments: 9 pages, 5 figures, 3 algorithms, 1 table, to be presented as a Poster at the ICCS'24Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [55] arXiv:2404.14453 [pdf, other]
-
Title: EPI-SQL: Enhancing Text-to-SQL Translation with Error-Prevention InstructionsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
- [56] arXiv:2404.14449 [pdf, ps, other]
-
Title: Predicting Question Quality on StackOverflow with Neural NetworksSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [57] arXiv:2404.14443 [pdf, ps, other]
-
Title: Evaluation of Machine Translation Based on Semantic Dependencies and KeywordsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [58] arXiv:2404.14415 [pdf, other]
-
Title: Domain Adaptation in Intent Classification Systems: A ReviewSubjects: Computation and Language (cs.CL)
- [59] arXiv:2404.15272 (cross-list from cs.CV) [pdf, other]
-
Title: CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body ScenariosComments: 12 pages, 5 figures, 3 tablesSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [60] arXiv:2404.15271 (cross-list from cs.CV) [pdf, other]
-
Title: Automatic Layout Planning for Visually-Rich Documents with Instruction-Following ModelsSubjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [61] arXiv:2404.15228 (cross-list from cs.CV) [pdf, other]
-
Title: Re-Thinking Inverse Graphics With Large Language ModelsComments: 31 pages; project page: this https URLSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [62] arXiv:2404.15190 (cross-list from cs.AI) [pdf, other]
-
Title: Socratic Planner: Inquiry-Based Zero-Shot Planning for Embodied Instruction FollowingComments: 14 pages, 6 figuresSubjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
- [63] arXiv:2404.15146 (cross-list from cs.LG) [pdf, other]
-
Title: Rethinking LLM Memorization through the Lens of Adversarial CompressionComments: this https URLSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
- [64] arXiv:2404.15127 (cross-list from cs.CV) [pdf, other]
-
Title: MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language LearningSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [65] arXiv:2404.14989 (cross-list from cs.IR) [pdf, other]
-
Title: A Reproducibility Study of PLAIDComments: SIGIR 2024 (reproducibility track)Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
- [66] arXiv:2404.14977 (cross-list from cs.SI) [pdf, other]
-
Title: Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-caseAuthors: Muhammad Asif Auyb, Muhammad Tayyab Zamir, Imran Khan, Hannia Naseem, Nasir Ahmad, Kashif AhmadComments: 11 pages, 6 figures, and 3 tablesSubjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL)
- [67] arXiv:2404.14946 (cross-list from cs.SD) [pdf, other]
-
Title: StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness AnnotationsComments: Accepted by ICASSP 2024Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, pp. 11521-11525Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
- [68] arXiv:2404.14928 (cross-list from cs.LG) [pdf, other]
-
Title: Graph Machine Learning in the Era of Large Language Models (LLMs)Authors: Wenqi Fan, Shijie Wang, Jiani Huang, Zhikai Chen, Yu Song, Wenzhuo Tang, Haitao Mao, Hui Liu, Xiaorui Liu, Dawei Yin, Qing LiSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Social and Information Networks (cs.SI)
- [69] arXiv:2404.14901 (cross-list from cs.SE) [pdf, other]
-
Title: Beyond Code Generation: An Observational Study of ChatGPT Usage in Software Engineering PracticeComments: Accepted at the ACM International Conference on the Foundations of Software Engineering (FSE) 2024Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
- [70] arXiv:2404.14851 (cross-list from cs.IR) [pdf, other]
-
Title: From Matching to Generation: A Survey on Generative Information RetrievalSubjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [71] arXiv:2404.14831 (cross-list from cs.DB) [pdf, other]
-
Title: Towards Universal Dense Blocking for Entity ResolutionComments: Code and data are available at this this https URLSubjects: Databases (cs.DB); Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [72] arXiv:2404.14749 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Semantic Cells: Evolutional Process to Acquire Sense Diversity of ItemsComments: 18 pages, 3 figures, 1 tableSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
- [73] arXiv:2404.14736 (cross-list from cs.HC) [pdf, ps, other]
-
Title: Qualitative Approaches to Voice UXJournal-ref: ACM Computing Surveys (2024)Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [74] arXiv:2404.14715 (cross-list from cs.CV) [pdf, other]
-
Title: FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and CorrectionAuthors: Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo LuoSubjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
- [75] arXiv:2404.14700 (cross-list from eess.AS) [pdf, other]
-
Title: FlashSpeech: Efficient Zero-Shot Speech SynthesisAuthors: Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei XueComments: Efficient zero-shot speech synthesisSubjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
- [76] arXiv:2404.14687 (cross-list from cs.MM) [pdf, other]
-
Title: Pegasus-v1 Technical ReportAuthors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon, Genie Heo, Henry Choi, Jenna Kang, Kevin Han, Noah Seo, Sunny Nguyen, Ryan Won, Yeonhoo Park, Anthony Giuliani, Dave Chung, Hans Yoon, James Le, Jenny Ahn, June Lee, Maninder Saini, Meredith Sanders, Soyoung Lee, Sue Kim, Travis CoutureSubjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [77] arXiv:2404.14662 (cross-list from cs.LG) [pdf, other]
-
Title: NExT: Teaching Large Language Models to Reason about Code ExecutionAuthors: Ansong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng YinComments: 35 pagesSubjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Programming Languages (cs.PL); Software Engineering (cs.SE)
- [78] arXiv:2404.14618 (cross-list from cs.LG) [pdf, other]
-
Title: Hybrid LLM: Cost-Efficient and Quality-Aware Query RoutingAuthors: Dujian Ding, Ankur Mallick, Chi Wang, Robert Sim, Subhabrata Mukherjee, Victor Ruhle, Laks V.S. Lakshmanan, Ahmed Hassan AwadallahComments: Accepted to ICLR 2024 (main conference)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [79] arXiv:2404.14600 (cross-list from cs.IR) [pdf, other]
-
Title: Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous DecodingComments: Accepted to SIGIR 2024Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
- [80] arXiv:2404.14445 (cross-list from cs.LG) [pdf, other]
-
Title: A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language ModelsComments: 10 pages, 1 figure, 4 tablesSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
- [81] arXiv:2404.14432 (cross-list from cs.SI) [pdf, other]
-
Title: Monitoring Critical Infrastructure Facilities During Disasters Using Large Language ModelsComments: Accepted to appear at the 2024 ISCRAM conferenceSubjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [82] arXiv:2404.14419 (cross-list from cs.SE) [pdf, other]
-
Title: Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence SmoothingSubjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
- [83] arXiv:2404.12500 (cross-list from cs.HC) [pdf, other]
-
Title: UIClip: A Data-driven Model for Assessing User Interface DesignSubjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Tue, 23 Apr 2024 (showing first 64 of 104 entries)
- [84] arXiv:2404.14408 [pdf, other]
-
Title: SpaceByte: Towards Deleting Tokenization from Large Language ModelingAuthors: Kevin SlagleComments: 9+9 pages, 3+1 figures, 2+4 tablesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [85] arXiv:2404.14397 [pdf, other]
-
Title: RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?Authors: Adrian de Wynter, Ishaan Watts, Nektar Ege Altıntoprak, Tua Wongsangaroonsri, Minghui Zhang, Noura Farra, Lena Baur, Samantha Claudet, Pavel Gajdusek, Can Gören, Qilong Gu, Anna Kaminska, Tomasz Kaminski, Ruby Kuo, Akiko Kyuba, Jongho Lee, Kartik Mathur, Petter Merok, Ivana Milovanović, Nani Paananen, Vesa-Matti Paananen, Anna Pavlenko, Bruno Pereira Vidal, Luciano Strika, Yueh Tsao, Davide Turcato, Oleksandr Vakhno, Judit Velcsov, Anna Vickers, Stéphanie Visser, Herdyan Widarmanto, Andrey Zaikin, Si-Qing ChenComments: Work in progressSubjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
- [86] arXiv:2404.14395 [pdf, other]
-
Title: PARAMANU-GANITA: Language Model with Mathematical CapabilitiesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [87] arXiv:2404.14387 [pdf, other]
-
Title: A Survey on Self-Evolution of Large Language ModelsAuthors: Zhengwei Tao, Ting-En Lin, Xiancai Chen, Hangyu Li, Yuchuan Wu, Yongbin Li, Zhi Jin, Fei Huang, Dacheng Tao, Jingren ZhouSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [88] arXiv:2404.14372 [pdf, other]
-
Title: Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency GraphAuthors: Xiaochen Kev Gao, Feng Yao, Kewen Zhao, Beilei He, Animesh Kumar, Vish Krishnan, Jingbo ShangComments: 17 Pages, Under ReviewSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [89] arXiv:2404.14361 [pdf, other]
-
Title: Better Synthetic Data by Retrieving and Transforming Existing DatasetsSubjects: Computation and Language (cs.CL)
- [90] arXiv:2404.14355 [pdf, other]
-
Title: Calc-CMU at SemEval-2024 Task 7: Pre-Calc -- Learning to Use the Calculator Improves Numeracy in Language ModelsComments: NumEval at SemEval, NAACL 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [91] arXiv:2404.14339 [pdf, other]
-
Title: Zero-shot Cross-lingual Stance Detection via Adversarial Language AdaptationSubjects: Computation and Language (cs.CL)
- [92] arXiv:2404.14316 [pdf, other]
-
Title: Automated Long Answer Grading with RiceChem DatasetAuthors: Shashank Sonkar, Kangqi Ni, Lesa Tran Lu, Kristi Kincaid, John S. Hutchinson, Richard G. BaraniukSubjects: Computation and Language (cs.CL)
- [93] arXiv:2404.14313 [pdf, other]
-
Title: Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference LabelsAuthors: Jan-Philipp Fränken, Eric Zelikman, Rafael Rafailov, Kanishk Gandhi, Tobias Gerstenberg, Noah D. GoodmanSubjects: Computation and Language (cs.CL)
- [94] arXiv:2404.14301 [pdf, other]
-
Title: Marking: Visual Grading with Highlighting Errors and Annotating Missing BitsSubjects: Computation and Language (cs.CL)
- [95] arXiv:2404.14294 [pdf, other]
-
Title: A Survey on Efficient Inference for Large Language ModelsAuthors: Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu WangSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [96] arXiv:2404.14270 [pdf, other]
-
Title: What do Transformers Know about Government?Authors: Jue Hou, Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu, Roman YangarberSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [97] arXiv:2404.14219 [pdf, other]
-
Title: Phi-3 Technical Report: A Highly Capable Language Model Locally on Your PhoneAuthors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, et al. (34 additional authors not shown)Comments: 12 pagesSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [98] arXiv:2404.14215 [pdf, other]
-
Title: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple ExtractionAuthors: Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu SongSubjects: Computation and Language (cs.CL)
- [99] arXiv:2404.14209 [pdf, ps, other]
-
Title: EnzChemRED, a rich enzyme chemistry relation extraction datasetAuthors: Po-Ting Lai, Elisabeth Coudert, Lucila Aimo, Kristian Axelsen, Lionel Breuza, Edouard de Castro, Marc Feuermann, Anne Morgat, Lucille Pourcel, Ivo Pedruzzi, Sylvain Poux, Nicole Redaschi, Catherine Rivoire, Anastasia Sveshnikova, Chih-Hsuan Wei, Robert Leaman, Ling Luo, Zhiyong Lu, Alan BridgeSubjects: Computation and Language (cs.CL)
- [100] arXiv:2404.14192 [pdf, other]
-
Title: Swap distance minimization beyond entropy minimization in word order variationSubjects: Computation and Language (cs.CL); Physics and Society (physics.soc-ph)
- [101] arXiv:2404.14183 [pdf, other]
-
Title: SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text DetectionAuthors: Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Chenxi Whitehouse, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav NakovComments: 23 pages, 12 tablesJournal-ref: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)Subjects: Computation and Language (cs.CL)
- [102] arXiv:2404.14122 [pdf, other]
-
Title: Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?Subjects: Computation and Language (cs.CL)
- [103] arXiv:2404.14057 [pdf, ps, other]
-
Title: Bored to Death: Artificial Intelligence Research Reveals the Role of Boredom in Suicide BehaviorAuthors: Shir Lissak, Yaakov Ophir, Refael Tikochinski, Anat Brunstein Klomek, Itay Sisso, Eyal Fruchter, Roi ReichartJournal-ref: www.frontiersin.org/journals/psychiatry/articles/10.3389/fpsyt.2024.1328122Subjects: Computation and Language (cs.CL)
- [104] arXiv:2404.14052 [pdf, other]
- [105] arXiv:2404.14043 [pdf, other]
-
Title: LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented GenerationSubjects: Computation and Language (cs.CL)
- [106] arXiv:2404.14024 [pdf, other]
-
Title: Exploring neural oscillations during speech perception via surrogate gradient spiking neural networksSubjects: Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
- [107] arXiv:2404.13985 [pdf, other]
-
Title: Information Re-Organization Improves Reasoning in Large Language ModelsComments: 10 pages, 3 figuresSubjects: Computation and Language (cs.CL)
- [108] arXiv:2404.13968 [pdf, other]
-
Title: Protecting Your LLMs with Information BottleneckAuthors: Zichuan Liu, Zefan Wang, Linjie Xu, Jinyu Wang, Lei Song, Tianchun Wang, Chunlin Chen, Wei Cheng, Jiang BianSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
- [109] arXiv:2404.13957 [pdf, other]
-
Title: How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHOComments: 9 pagesSubjects: Computation and Language (cs.CL)
- [110] arXiv:2404.13948 [pdf, other]
-
Title: Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level PerturbationsComments: Under ReviewSubjects: Computation and Language (cs.CL)
- [111] arXiv:2404.13940 [pdf, other]
-
Title: A User-Centric Benchmark for Evaluating Large Language ModelsSubjects: Computation and Language (cs.CL)
- [112] arXiv:2404.13925 [pdf, other]
-
Title: MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkitSubjects: Computation and Language (cs.CL)
- [113] arXiv:2404.13919 [pdf, other]
-
Title: Navigating the Path of Writing: Outline-guided Text Generation with Large Language ModelsComments: under reviewSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
- [114] arXiv:2404.13906 [pdf, other]
-
Title: Generating Attractive and Authentic Copywriting from Customer ReviewsSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [115] arXiv:2404.13899 [pdf, other]
-
Title: Towards Better Text-to-Image Generation Alignment via Attention ModulationSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- [116] arXiv:2404.13874 [pdf, other]
-
Title: VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language ModelsComments: Work in processSubjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
- [117] arXiv:2404.13865 [pdf, other]
-
Title: Context-Enhanced Language Models for Generating Multi-Paper CitationsAuthors: Avinash Anand, Kritarth Prasad, Ujjwal Goel, Mohit Gupta, Naman Lal, Astha Verma, Rajiv Ratn ShahComments: 14 pages, 7 figures, 11th International Conference, BDA 2023, Delhi, IndiaJournal-ref: Big Data and Artificial Intelligence 2023, Delhi, India, December 7, 80 94Subjects: Computation and Language (cs.CL)
- [118] arXiv:2404.13855 [pdf, other]
-
Title: Understanding the role of FFNs in driving multilingual behaviour in LLMsComments: 10 pagesSubjects: Computation and Language (cs.CL)
- [119] arXiv:2404.13813 [pdf, other]
-
Title: From LLM to NMT: Advancing Low-Resource Machine Translation with ClaudeComments: 17 pages, 15 figuresSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [120] arXiv:2404.13793 [pdf, other]
-
Title: Lightweight Connective Detection Using Gradient BoostingComments: 7 pages, 2 figures, 5 tablesSubjects: Computation and Language (cs.CL)
- [121] arXiv:2404.13781 [pdf, other]
-
Title: Evaluating Retrieval Quality in Retrieval-Augmented GenerationSubjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- [122] arXiv:2404.13779 [pdf, other]
-
Title: Automated Text Mining of Experimental Methodologies from Biomedical LiteratureAuthors: Ziqing GuoSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [123] arXiv:2404.13764 [pdf, other]
-
Title: Using Adaptive Empathetic Responses for Teaching EnglishComments: Accepted to BEA workshop at NAACL 2024Subjects: Computation and Language (cs.CL)
- [124] arXiv:2404.13760 [pdf, other]
-
Title: How to Encode Domain Information in Relation ClassificationAuthors: Elisa Bassignana, Viggo Unmack Gascou, Frida Nøhr Laustsen, Gustav Kristensen, Marie Haahr Petersen, Rob van der Goot, Barbara PlankComments: Accepted at LREC-COLING 2024Subjects: Computation and Language (cs.CL)
- [125] arXiv:2404.13751 [pdf, other]
-
Title: Embarrassingly Simple Unsupervised Aspect Based Sentiment Tuple ExtractionComments: 4 pages, 4 tables, 3 figures, 2 appendix pagesSubjects: Computation and Language (cs.CL)
- [126] arXiv:2404.13660 [pdf, other]
-
Title: Trojan Detection in Large Language Models: Insights from The Trojan Detection ChallengeSubjects: Computation and Language (cs.CL)
- [127] arXiv:2404.13645 [pdf, other]
-
Title: PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical StructureComments: Accepted at IJCAI 2024Subjects: Computation and Language (cs.CL)
- [128] arXiv:2404.13628 [pdf, other]
-
Title: Mixture of LoRA ExpertsComments: 17 pages, 11 figuresSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
- [129] arXiv:2404.13627 [pdf, other]
-
Title: NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation SurroundingAuthors: Chunkit Chan, Cheng Jiayang, Yauwai Yim, Zheye Deng, Wei Fan, Haoran Li, Xin Liu, Hongming Zhang, Weiqi Wang, Yangqiu SongSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [130] arXiv:2404.13613 [pdf, other]
-
Title: The Branch Not Taken: Predicting Branching in Online ConversationsSubjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [131] arXiv:2404.13599 [pdf, other]
-
Title: "A good pun is its own reword": Can Large Language Models Understand Puns?Subjects: Computation and Language (cs.CL)
- [132] arXiv:2404.13547 [pdf, other]
-
Title: E-QGen: Educational Lecture Abstract-based Question Generation SystemComments: IJCAI 2024 Demo PaperSubjects: Computation and Language (cs.CL)
- [133] arXiv:2404.13504 [pdf, other]
-
Title: IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained ModelsSubjects: Computation and Language (cs.CL)
- [134] arXiv:2404.13465 [pdf, other]
-
Title: Do "English" Named Entity Recognizers Work Well on Global Englishes?Comments: EMNLP Findings 2023Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [135] arXiv:2404.13439 [pdf, other]
-
Title: Fine-Grained Named Entities for Corona NewsComments: Published at SWAT4HCLS 2023: The 14th International Conference on Semantic Web Applications and Tools for Health Care and Life SciencesSubjects: Computation and Language (cs.CL)
- [136] arXiv:2404.13397 [pdf, ps, other]
-
Title: Retrieval-Augmented Generation-based Relation ExtractionComments: Submitted to Semantic Web Journal. Under ReviewSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [137] arXiv:2404.13390 [pdf, other]
-
Title: Explanation based Bias Decoupling Regularization for Natural Language InferenceSubjects: Computation and Language (cs.CL)
- [138] arXiv:2404.13364 [pdf, other]
-
Title: MahaSQuAD: Bridging Linguistic Divides in Marathi Question-AnsweringComments: Accepted at the International Conference on Natural Language Processing (ICON 2023)Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- [139] arXiv:2404.13362 [pdf, other]
-
Title: Semantically Corrected Amharic Automatic Speech RecognitionSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
- [140] arXiv:2404.13350 [pdf, ps, other]
-
Title: Swa Bhasha: Message-Based Singlish to Sinhala TransliterationComments: 6 pages, 6 figures, 2 Tables, Presented at International Conference on Innovations in Info-business and Technology, Colombo, February 2022Subjects: Computation and Language (cs.CL)
- [141] arXiv:2404.13343 [pdf, other]
-
Title: UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice QuestionsComments: Accepted at BEA 2024 (NAACL Workshop)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [142] arXiv:2404.13307 [pdf, other]
-
Title: Beyond Accuracy: Investigating Error Types in GPT-4 Responses to USMLE QuestionsAuthors: Soumyadeep Roy, Aparup Khatua, Fatemeh Ghoochani, Uwe Hadler, Wolfgang Nejdl, Niloy GangulyComments: 10 pages, 4 figures. Accepted for publication at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)Subjects: Computation and Language (cs.CL)
- [143] arXiv:2404.13292 [pdf, other]
-
Title: Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization ChallengeAuthors: Khuyagbaatar Batsuren, Ekaterina Vylomova, Verna Dankers, Tsetsuukhei Delgerbaatar, Omri Uzan, Yuval Pinter, Gábor BellaSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [144] arXiv:2404.13289 [pdf, other]
-
Title: Double Mixture: Towards Continual Event Detection from SpeechAuthors: Jingqi Kang, Tongtong Wu, Jinming Zhao, Guitao Wang, Yinwei Wei, Hao Yang, Guilin Qi, Yuan-Fang Li, Gholamreza HaffariComments: The first two authors contributed equally to this workSubjects: Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- [145] arXiv:2404.13246 [pdf, other]
-
Title: ISQA: Informative Factuality Feedback for Scientific SummarizationComments: 18 pages, 4 figuresSubjects: Computation and Language (cs.CL)
- [146] arXiv:2404.13192 [pdf, other]
-
Title: Heterogeneous Subgraph Transformer for Fake News DetectionSubjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- [147] arXiv:2404.13149 [pdf, other]
-
Title: Beyond Self-Consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer StagingComments: accepted to the 22nd International Conference on Artificial Intelligence in Medicine (AIME'24)Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[ showing 147 entries per page: fewer | more | all ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, new, 2404, contact, help (Access key information)