Computation and Language

Authors and titles for recent submissions, skipping first 110

[ total of 515 entries: 1-100 | 11-110 | 111-210 | 211-310 | 311-410 | 411-510 | 511-515 ]

Tue, 28 May 2024 (continued, showing last 97 of 126 entries)

[111]  arXiv:2405.16908 [pdf, other]
Title: Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?
Subjects: Computation and Language (cs.CL)
[112]  arXiv:2405.16884 [pdf, other]
Title: Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching
Comments: Under revision. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Databases (cs.DB)
[113]  arXiv:2405.16856 [pdf, other]
Title: Can We Trust LLMs? Mitigate Overconfidence Bias in LLMs through Knowledge Transfer
Subjects: Computation and Language (cs.CL)
[114]  arXiv:2405.16821 [pdf, other]
Title: Perturbation-Restrained Sequential Model Editing
Subjects: Computation and Language (cs.CL)
[115]  arXiv:2405.16810 [pdf, ps, other]
Title: Performance evaluation of Reddit Comments using Machine Learning and Natural Language Processing methods in Sentiment Analysis
Comments: 11 pages, 5 figures, to be published in Computational and Experimental Simulations in Engineering - Proceedings of ICCES 2024 - Volume 2
Subjects: Computation and Language (cs.CL)
[116]  arXiv:2405.16806 [pdf, other]
Title: Entity Alignment with Noisy Annotations from Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117]  arXiv:2405.16802 [pdf, other]
Title: AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation
Comments: 20 pages, 1 figure, 13 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[118]  arXiv:2405.16720 [pdf, other]
Title: Large Scale Knowledge Washing
Subjects: Computation and Language (cs.CL)
[119]  arXiv:2405.16714 [pdf, other]
Title: Crafting Interpretable Embeddings by Asking LLMs Questions
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[120]  arXiv:2405.16702 [pdf, other]
Title: Accurate and Nuanced Open-QA Evaluation Through Textual Entailment
Comments: To appear at ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL)
[121]  arXiv:2405.16684 [pdf, other]
Title: gzip Predicts Data-dependent Scaling Laws
Authors: Rohan Pandey
Comments: 9 pages, 9 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[122]  arXiv:2405.16681 [pdf, other]
Title: Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2405.16661 [pdf, other]
Title: RLSF: Reinforcement Learning via Symbolic Feedback
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[124]  arXiv:2405.16635 [pdf, other]
Title: Compressing Lengthy Context With UltraGist
Subjects: Computation and Language (cs.CL)
[125]  arXiv:2405.16631 [pdf, other]
Title: Let Silence Speak: Enhancing Fake News Detection with Generated Comments from Large Language Models
Comments: 11 pages, 5 figures, 8 tables
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[126]  arXiv:2405.16584 [pdf, other]
Title: MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2405.16579 [pdf, other]
Title: Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Authors: Shanghaoran Quan
Subjects: Computation and Language (cs.CL)
[128]  arXiv:2405.16571 [pdf, other]
Title: A Preliminary Empirical Study on Prompt-based Unsupervised Keyphrase Extraction
Comments: work in progress
Subjects: Computation and Language (cs.CL)
[129]  arXiv:2405.16552 [pdf, other]
Title: SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation
Comments: The relevant code will be released in subsequent versions
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130]  arXiv:2405.16533 [pdf, other]
Title: Chain of Tools: Large Language Model is an Automatic Multi-tool Learner
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[131]  arXiv:2405.16482 [pdf, other]
Title: DarijaBanking: A New Resource for Overcoming Language Barriers in Banking Intent Detection for Moroccan Arabic Speakers
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2405.16433 [pdf, other]
Title: CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling
Comments: Accepted to Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[133]  arXiv:2405.16422 [pdf, ps, other]
Title: AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134]  arXiv:2405.16420 [pdf, other]
Title: M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions
Comments: This paper has been accepted by ACL 2024
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[135]  arXiv:2405.16412 [pdf, other]
Title: KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[136]  arXiv:2405.16402 [pdf, other]
Title: Assessing Empathy in Large Language Models with Real-World Physician-Patient Interactions
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[137]  arXiv:2405.16388 [pdf, other]
Title: Multi-Reference Preference Optimization for Large Language Models
Comments: 20 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[138]  arXiv:2405.16376 [pdf, other]
Title: STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making
Comments: 39 pages, 4 figures
Subjects: Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT)
[139]  arXiv:2405.16337 [pdf, other]
Title: Learning to Reason via Program Generation, Emulation, and Search
Comments: 16 pages, 10 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[140]  arXiv:2405.16295 [pdf, ps, other]
Title: Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[141]  arXiv:2405.16284 [pdf, ps, other]
Title: Generating clickbait spoilers with an ensemble of large language models
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[142]  arXiv:2405.16282 [pdf, other]
Title: Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models
Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[143]  arXiv:2405.16281 [pdf, other]
Title: ConStat: Performance-Based Contamination Detection in Large Language Models
Subjects: Computation and Language (cs.CL)
[144]  arXiv:2405.16277 [pdf, other]
Title: Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[145]  arXiv:2405.16229 [pdf, other]
Title: No Two Devils Alike: Unveiling Distinct Mechanisms of Fine-tuning Attacks
Comments: work in progress
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[146]  arXiv:2405.16178 [pdf, other]
Title: Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection
Subjects: Computation and Language (cs.CL)
[147]  arXiv:2405.16176 [pdf, other]
Title: Bi-reachability in Petri nets with data
Subjects: Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL); Logic in Computer Science (cs.LO)
[148]  arXiv:2405.16155 [pdf, other]
Title: Improving Multi-lingual Alignment Through Soft Contrastive Learning
Comments: 8 pages, 1 figure, Accepted at NAACL SRW 2024
Subjects: Computation and Language (cs.CL)
[149]  arXiv:2405.16153 [pdf, other]
Title: DefSent+: Improving sentence embeddings of language models by projecting definition sentences into a quasi-isotropic or isotropic vector space of unlimited dictionary entries
Authors: Xiaodong Liu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[150]  arXiv:2405.16150 [pdf, other]
Title: 5W1H Extraction With Large Language Models
Comments: IJCNN 2024
Subjects: Computation and Language (cs.CL)
[151]  arXiv:2405.16129 [pdf, other]
Title: iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers
Subjects: Computation and Language (cs.CL)
[152]  arXiv:2405.16115 [pdf, other]
Title: SNOBERT: A Benchmark for clinical notes entity linking in the SNOMED CT clinical terminology
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[153]  arXiv:2405.16089 [pdf, other]
Title: COLT: Towards Completeness-Oriented Tool Retrieval for Large Language Models
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[154]  arXiv:2405.16064 [pdf, other]
Title: Keypoint-based Progressive Chain-of-Thought Distillation for LLMs
Comments: Accepted by ICML 2024
Subjects: Computation and Language (cs.CL)
[155]  arXiv:2405.16057 [pdf, other]
Title: SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[156]  arXiv:2405.16042 [pdf, other]
Title: Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention
Comments: Accepted by CogSci-24
Subjects: Computation and Language (cs.CL)
[157]  arXiv:2405.15984 [pdf, other]
Title: Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models
Comments: 29 pages, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[158]  arXiv:2405.15964 [pdf, other]
Title: A hierarchical Bayesian model for syntactic priming
Comments: 6 pages; accepted to CogSci 2024
Subjects: Computation and Language (cs.CL)
[159]  arXiv:2405.15936 [pdf, other]
Title: Zero-Shot Spam Email Classification Using Pre-trained Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[160]  arXiv:2405.15924 [pdf, other]
Title: SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation
Comments: Accepted by ACL2024 Findings
Subjects: Computation and Language (cs.CL)
[161]  arXiv:2405.15896 [pdf, other]
Title: Enhancing Augmentative and Alternative Communication with Card Prediction and Colourful Semantics
Subjects: Computation and Language (cs.CL)
[162]  arXiv:2405.15818 [pdf, other]
Title: DuanzAI: Slang-Enhanced LLM with Prompt for Humor Understanding
Authors: Yesian Rohn
Subjects: Computation and Language (cs.CL)
[163]  arXiv:2405.17430 (cross-list from cs.CV) [pdf, other]
Title: Matryoshka Multimodal Models
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[164]  arXiv:2405.17423 (cross-list from cs.CV) [pdf, other]
Title: Privacy-Aware Visual Language Models
Comments: preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[165]  arXiv:2405.17390 (cross-list from cs.IR) [pdf, ps, other]
Title: KSW: Khmer Stop Word based Dictionary for Keyword Extraction
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[166]  arXiv:2405.17382 (cross-list from cs.LG) [pdf, other]
Title: ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Comments: 20 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[167]  arXiv:2405.17345 (cross-list from cs.AI) [pdf, other]
Title: Exploring and steering the moral compass of Large Language Models
Authors: Alejandro Tlaie
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168]  arXiv:2405.17217 (cross-list from cs.HC) [pdf, other]
Title: Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools
Authors: Daniel Buschek
Comments: 19 pages, 7 figures, 2 tables, ACM DIS 2024
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[169]  arXiv:2405.17130 (cross-list from cs.LG) [pdf, other]
Title: Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[170]  arXiv:2405.17104 (cross-list from cs.CV) [pdf, other]
Title: LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[171]  arXiv:2405.17088 (cross-list from cs.LG) [pdf, other]
Title: Phase Transitions in the Output Distribution of Large Language Models
Comments: 21 pages, 4 figures
Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172]  arXiv:2405.17076 (cross-list from cs.AI) [pdf, other]
Title: Leveraging small language models for Text2SPARQL tasks to improve the resilience of AI assistance
Comments: To appear in Proceedings of the Workshop on Linked Data-driven Resilience Research 2024 (D2R2) co-located with Extended Semantic Web Conference 2024 (ESWC 2024)
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[173]  arXiv:2405.17044 (cross-list from cs.AI) [pdf, other]
Title: Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models
Comments: 10 pages; 5 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG)
[174]  arXiv:2405.16994 (cross-list from cs.AI) [pdf, other]
Title: Vision-and-Language Navigation Generative Pretrained Transformer
Authors: Wen Hanlin
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[175]  arXiv:2405.16919 (cross-list from cs.CV) [pdf, other]
Title: VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[176]  arXiv:2405.16869 (cross-list from cs.AI) [pdf, other]
Title: Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion
Comments: Work in progress. Code and data will be released at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[177]  arXiv:2405.16845 (cross-list from cs.LG) [pdf, other]
Title: On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
Comments: 37 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[178]  arXiv:2405.16751 (cross-list from cs.AI) [pdf, other]
Title: LLM-Based Cooperative Agents using Information Relevance and Plan Validation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[179]  arXiv:2405.16712 (cross-list from cs.LG) [pdf, other]
Title: Zamba: A Compact 7B SSM Hybrid Model
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[180]  arXiv:2405.16700 (cross-list from cs.CV) [pdf, other]
Title: Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Comments: Project page: this https URL 37 Pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181]  arXiv:2405.16682 (cross-list from cs.LG) [pdf, other]
Title: A Systematic Review of Federated Generative Models
Comments: 24 Pages, 3 Figures, 5 Tables
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[182]  arXiv:2405.16677 (cross-list from eess.AS) [pdf, other]
Title: Crossmodal ASR Error Correction with Discrete Speech Units
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[183]  arXiv:2405.16669 (cross-list from cs.HC) [pdf, other]
Title: Low-resourced Languages and Online Knowledge Repositories: A Need-Finding Study
Comments: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI 2024)
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[184]  arXiv:2405.16662 (cross-list from cs.LO) [pdf, ps, other]
Title: Conjunctive categorial grammars and Lambek grammars with additives
Comments: This article is an extended version of the conference presentation "Conjunctive categorial grammars" at the Mathematics of Language 2017 meeting (London, UK, July 13-14, 2017; proceedings published in ACL Anthology, W17-3414)
Subjects: Logic in Computer Science (cs.LO); Computation and Language (cs.CL); Logic (math.LO)
[185]  arXiv:2405.16640 (cross-list from cs.AI) [pdf, other]
Title: A Survey of Multimodal Large Language Model from A Data-centric Perspective
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[186]  arXiv:2405.16546 (cross-list from cs.IR) [pdf, other]
Title: Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
Comments: Accepted by Findings of ACL 2024; Datasets Link: this https URL
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[187]  arXiv:2405.16528 (cross-list from cs.LG) [pdf, other]
Title: LoQT: Low Rank Adapters for Quantized Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[188]  arXiv:2405.16510 (cross-list from cs.AI) [pdf, other]
Title: Meta-Task Planning for Language Agents
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[189]  arXiv:2405.16473 (cross-list from cs.CV) [pdf, other]
Title: M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought
Comments: Accepted at ACL2024 Main Conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[190]  arXiv:2405.16442 (cross-list from cs.CY) [pdf, ps, other]
Title: Development of an open education resources (OER) system: a comparative analysis and implementation approach
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[191]  arXiv:2405.16434 (cross-list from cs.AI) [pdf, other]
Title: The Importance of Directional Feedback for LLM-based Optimizers
Comments: Presented at Foundation Models for Decision Making at NeurIPS 2023
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[192]  arXiv:2405.16413 (cross-list from cs.AI) [pdf, other]
Title: Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Applications (stat.AP)
[193]  arXiv:2405.16411 (cross-list from cs.LG) [pdf, other]
Title: Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[194]  arXiv:2405.16406 (cross-list from cs.LG) [pdf, other]
Title: SpinQuant -- LLM quantization with learned rotations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[195]  arXiv:2405.16247 (cross-list from cs.AI) [pdf, other]
Title: AutoManual: Generating Instruction Manuals by LLM Agents via Interactive Environmental Learning
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[196]  arXiv:2405.16205 (cross-list from cs.AI) [pdf, ps, other]
Title: GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases
Comments: 30 pages with 10 figures and/or tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[197]  arXiv:2405.16136 (cross-list from cs.AI) [pdf, other]
Title: C3LLM: Conditional Multimodal Content Generation Using Large Language Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[198]  arXiv:2405.16128 (cross-list from cs.AI) [pdf, other]
Title: How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect
Comments: To appear at CogSci 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[199]  arXiv:2405.16122 (cross-list from cs.AI) [pdf, other]
Title: Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars
Comments: 23 pages, 1 figure, 23 tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
[200]  arXiv:2405.16043 (cross-list from cs.LG) [pdf, other]
Title: Theoretical Analysis of Weak-to-Strong Generalization
Comments: 36 pages, 3 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[201]  arXiv:2405.15973 (cross-list from cs.CV) [pdf, other]
Title: Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Comments: 15 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[202]  arXiv:2405.15943 (cross-list from cs.LG) [pdf, other]
Title: Transformers represent belief state geometry in their residual stream
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[203]  arXiv:2405.15902 (cross-list from cs.CR) [pdf, other]
Title: Hacc-Man: An Arcade Game for Jailbreaking LLMs
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[204]  arXiv:2405.15877 (cross-list from cs.LG) [pdf, other]
Title: Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Subjects: Machine Learning (cs.LG); Hardware Architecture (cs.AR); Computation and Language (cs.CL)
[205]  arXiv:2405.15793 (cross-list from cs.SE) [pdf, other]
Title: SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Comments: First two authors contributed equally. Code and demo at this https URL
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[206]  arXiv:2405.15787 (cross-list from cs.IR) [pdf, ps, other]
Title: Extracting chemical food safety hazards from the scientific literature automatically using large language models
Comments: 31 pages, 5 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[207]  arXiv:2405.15784 (cross-list from cs.IR) [pdf, other]
Title: CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Mon, 27 May 2024 (showing first 3 of 72 entries)

[208]  arXiv:2405.15765 [pdf, other]
Title: Scaling Laws for Discriminative Classification in Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[209]  arXiv:2405.15760 [pdf, other]
Title: GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction
Comments: Accepted to ACL 2024 (main conference)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[210]  arXiv:2405.15750 [pdf, other]
Title: Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see this http URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
