We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 153

[ total of 352 entries: 1-100 | 54-153 | 154-253 | 254-352 ]
[ showing 100 entries per page: fewer | more | all ]

Mon, 15 Apr 2024 (continued, showing last 30 of 46 entries)

[154]  arXiv:2404.08313 [pdf, other]
Title: The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing
Comments: Accepted in NAACL2024 main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[155]  arXiv:2404.08263 [pdf, other]
Title: Relational Prompt-based Pre-trained Language Models for Social Event Detection
Comments: ACM TOIS Under Review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[156]  arXiv:2404.08262 [pdf, ps, other]
Title: Pretraining and Updating Language- and Domain-specific Large Language Model: A Case Study in Japanese Business Domain
Comments: 9 pages. preprint of COLM2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[157]  arXiv:2404.08259 [pdf, ps, other]
Title: Investigating Neural Machine Translation for Low-Resource Languages: Using Bavarian as a Case Study
Comments: Preprint accepted at the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages (SIGUL 2024)
Subjects: Computation and Language (cs.CL)
[158]  arXiv:2404.08191 [pdf, other]
Title: Measuring Cross-lingual Transfer in Bytes
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[159]  arXiv:2404.08156 [pdf, other]
Title: Multimodal Contextual Dialogue Breakdown Detection for Conversational AI Models
Comments: Published in NAACL 2024 Industry Track
Subjects: Computation and Language (cs.CL)
[160]  arXiv:2404.08155 [pdf, other]
Title: Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls
Comments: Published in NAACL 2024 Industry Track
Subjects: Computation and Language (cs.CL)
[161]  arXiv:2404.08148 [pdf, other]
Title: Distilling Algorithmic Reasoning from LLMs via Explaining Solution Programs
Comments: pre-print
Subjects: Computation and Language (cs.CL)
[162]  arXiv:2404.08118 [pdf, ps, other]
Title: HLTCOE at TREC 2023 NeuCLIR Track
Comments: 6 pages. Part of TREC 2023 Proceedings
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[163]  arXiv:2404.08092 [pdf, ps, other]
Title: Data-Augmentation-Based Dialectal Adaptation for LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[164]  arXiv:2404.08078 [pdf, other]
Title: SQBC: Active Learning using LLM-Generated Synthetic Data for Stance Detection in Online Political Discussions
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[165]  arXiv:2404.08066 [pdf, other]
Title: MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
Comments: Accepted to the NAACL 2024 Main Conference
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2404.08555 (cross-list from cs.LG) [pdf, other]
Title: RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[167]  arXiv:2404.08517 (cross-list from cs.SE) [pdf, other]
Title: Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[168]  arXiv:2404.08511 (cross-list from cs.AI) [pdf, other]
Title: Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[169]  arXiv:2404.08509 (cross-list from cs.DC) [pdf, other]
Title: Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction
Comments: Accepted at AIOps'24
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[170]  arXiv:2404.08495 (cross-list from cs.LG) [pdf, other]
Title: Dataset Reset Policy Optimization for RLHF
Comments: 28 pages, 6 tables, 3 Figures, 3 Algorithms
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[171]  arXiv:2404.08480 (cross-list from cs.LG) [pdf, other]
Title: Decoding AI: The inside story of data analysis in ChatGPT
Comments: 15 pages with figures and appendix
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computation (stat.CO)
[172]  arXiv:2404.08417 (cross-list from cs.LG) [pdf, other]
Title: AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[173]  arXiv:2404.08309 (cross-list from cs.CR) [pdf, other]
Title: Subtoxic Questions: Dive Into Attitude Change of LLM's Response in Jailbreak Attempts
Comments: 4 pages, 2 figures. This paper was submitted to The 7th Deep Learning Security and Privacy Workshop (DLSP 2024) and was accepted as extended abstract, see this https URL
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[174]  arXiv:2404.08189 (cross-list from cs.LG) [pdf, other]
Title: Reducing hallucination in structured outputs via Retrieval-Augmented Generation
Comments: To be presented at NAACL 2024. 11 pages and 4 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[175]  arXiv:2404.08164 (cross-list from stat.ML) [pdf, other]
Title: Language Model Prompt Selection via Simulation Optimization
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[176]  arXiv:2404.08134 (cross-list from cs.IR) [pdf, other]
Title: Extending Translate-Train for ColBERT-X to African Language CLIR
Comments: 10 pages, 2 figures. System description paper for HLTCOE's participation in CIRAL@FIRE 2023
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[177]  arXiv:2404.08111 (cross-list from cs.CV) [pdf, other]
Title: S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[178]  arXiv:2404.08080 (cross-list from cs.LG) [pdf, other]
Title: Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models
Comments: 29 pages, 25 tables, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Optimization and Control (math.OC)
[179]  arXiv:2404.08020 (cross-list from cs.AI) [pdf, other]
Title: Augmenting Knowledge Graph Hierarchies Using Neural Transformers
Comments: European Conference on Information Retrieval 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[180]  arXiv:2404.08018 (cross-list from cs.SE) [pdf, other]
Title: Analyzing the Performance of Large Language Models on Code Summarization
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[181]  arXiv:2404.08008 (cross-list from cs.LG) [pdf, other]
Title: Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Comments: 32 pages, 6 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[182]  arXiv:2404.08001 (cross-list from hep-ph) [pdf, other]
Title: Xiwu: A Basis Flexible and Learnable LLM for High Energy Physics
Comments: 15 pages, 8 figures
Subjects: High Energy Physics - Phenomenology (hep-ph); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); High Energy Physics - Experiment (hep-ex); Computational Physics (physics.comp-ph)
[183]  arXiv:2404.07999 (cross-list from cs.LG) [pdf, other]
Title: A Multi-Level Framework for Accelerating Training Transformer Models
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

Fri, 12 Apr 2024

[184]  arXiv:2404.07982 [pdf, other]
Title: Language Imbalance Can Boost Cross-lingual Generalisation
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[185]  arXiv:2404.07979 [pdf, other]
Title: LLoCO: Learning Long Contexts Offline
Comments: The first two authors contributed equally to this work
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186]  arXiv:2404.07965 [pdf, other]
Title: Rho-1: Not All Tokens Are What You Need
Comments: First two authors equal contribution
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187]  arXiv:2404.07922 [pdf, other]
Title: LaVy: Vietnamese Multimodal Large Language Model
Comments: 4 pages
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[188]  arXiv:2404.07921 [pdf, other]
Title: AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs
Authors: Zeyi Liao, Huan Sun
Subjects: Computation and Language (cs.CL)
[189]  arXiv:2404.07904 [pdf, other]
Title: HGRN2: Gated Linear RNNs with State Expansion
Comments: Techinical Report. Yiran Zhong is the corresponding author. The source code is available at this https URL
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2404.07900 [pdf, other]
Title: High-Dimension Human Value Representation in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[191]  arXiv:2404.07879 [pdf, other]
Title: Analyzing Toxicity in Deep Conversations: A Reddit Case Study
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[192]  arXiv:2404.07851 [pdf, other]
Title: Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
Comments: 21 pages, 8 figures
Journal-ref: NAACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[193]  arXiv:2404.07840 [pdf, other]
Title: On Training Data Influence of GPT Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[194]  arXiv:2404.07836 [pdf, ps, other]
Title: Question Generation in Knowledge-Driven Dialog: Explainability and Evaluation
Subjects: Computation and Language (cs.CL)
[195]  arXiv:2404.07814 [pdf, ps, other]
Title: MultiLS-SP/CA: Lexical Complexity Prediction and Lexical Simplification Resources for Catalan and Spanish
Comments: Submitted to the 40th edition of the SEPLN Conference. Under Revision
Subjects: Computation and Language (cs.CL)
[196]  arXiv:2404.07792 [pdf, other]
Title: Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation
Comments: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[197]  arXiv:2404.07775 [pdf, other]
Title: Discourse-Aware In-Context Learning for Temporal Expression Normalization
Comments: Accepted at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[198]  arXiv:2404.07768 [pdf, ps, other]
Title: Using Letter Positional Probabilities to Assess Word Complexity
Authors: Michael Dalvean
Comments: 25 Pages, 15 Tables
Subjects: Computation and Language (cs.CL)
[199]  arXiv:2404.07765 [pdf, other]
Title: AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports
Comments: Accepted at LREC-COLING 2024. Corpus available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[200]  arXiv:2404.07738 [pdf, other]
Title: ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[201]  arXiv:2404.07720 [pdf, other]
Title: Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language Models
Comments: Accepted for publication at the 3rd Workshop on Tools and Resources for People with REAding DIfficulties (READI) at LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[202]  arXiv:2404.07677 [pdf, other]
Title: ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs
Comments: LLM+KG
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[203]  arXiv:2404.07673 [pdf, other]
Title: Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars
Comments: 13 pages, 3 figures, 8 tables, Submitted to NAACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[204]  arXiv:2404.07654 [pdf, ps, other]
Title: rollama: An R package for using generative large language models through Ollama
Subjects: Computation and Language (cs.CL)
[205]  arXiv:2404.07647 [pdf, other]
Title: Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Subjects: Computation and Language (cs.CL)
[206]  arXiv:2404.07616 [pdf, other]
Title: Audio Dialogues: Dialogues dataset for audio and music understanding
Comments: Demo website: this https URL
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[207]  arXiv:2404.07613 [pdf, other]
Title: Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain
Comments: LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[208]  arXiv:2404.07611 [pdf, other]
Title: NoticIA: A Clickbait Article Summarization Dataset in Spanish
Comments: Under review in the journal Procesamiento del Lenguaje Natural
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[209]  arXiv:2404.07584 [pdf, other]
Title: UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs
Subjects: Computation and Language (cs.CL)
[210]  arXiv:2404.07549 [pdf, other]
Title: Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective
Comments: The code is publicly available at this https URL
Subjects: Computation and Language (cs.CL)
[211]  arXiv:2404.07546 [pdf, other]
Title: Decomposing Label Space, Format and Discrimination: Rethinking How LLMs Respond and Solve Tasks via In-Context Learning
Comments: 36 pages, 8 figures
Subjects: Computation and Language (cs.CL)
[212]  arXiv:2404.07544 [pdf, other]
Title: From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples
Comments: 50 pages, 48 figures, preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[213]  arXiv:2404.07503 [pdf, other]
Title: Best Practices and Lessons Learned on Synthetic Data for Language Models
Subjects: Computation and Language (cs.CL)
[214]  arXiv:2404.07501 [pdf, other]
Title: Leveraging Data Augmentation for Process Information Extraction
Comments: Accepted at BPMDS 2024 (this https URL), to be printed
Subjects: Computation and Language (cs.CL)
[215]  arXiv:2404.07498 [pdf, other]
Title: Interactive Prompt Debugging with Sequence Salience
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[216]  arXiv:2404.07475 [pdf, ps, other]
Title: Laissez-Faire Harms: Algorithmic Biases in Generative Language Models
Comments: 16 pages (44 if including supplementals), 4 figures (20 if including supplementals)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[217]  arXiv:2404.07470 [pdf, other]
Title: Scalable Language Model with Generalized Continual Learning
Comments: The Twelfth International Conference on Learning Representations
Subjects: Computation and Language (cs.CL)
[218]  arXiv:2404.07461 [pdf, other]
Title: "Confidently Nonsensical?'': A Critical Survey on the Perspectives and Challenges of 'Hallucinations' in NLP
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[219]  arXiv:2404.07413 [pdf, other]
Title: JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[220]  arXiv:2404.07376 [pdf, other]
Title: LLMs in Biomedicine: A study on clinical Named Entity Recognition
Subjects: Computation and Language (cs.CL)
[221]  arXiv:2404.07304 [pdf, other]
Title: We're Calling an Intervention: Taking a Closer Look at Language Model Adaptation to Different Types of Linguistic Variation
Comments: Preprint. Under review
Subjects: Computation and Language (cs.CL)
[222]  arXiv:2404.07229 [pdf, other]
Title: Personality-affected Emotion Generation in Dialog Systems
Comments: Accepted by ACM Transactions on Information Systems
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[223]  arXiv:2404.07989 (cross-list from cs.CV) [pdf, other]
Title: Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
Comments: Code and models are released at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[224]  arXiv:2404.07981 (cross-list from cs.IR) [pdf, other]
Title: Manipulating Large Language Models to Increase Product Visibility
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[225]  arXiv:2404.07972 (cross-list from cs.AI) [pdf, other]
Title: OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Comments: 51 pages, 21 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[226]  arXiv:2404.07963 (cross-list from cs.CY) [pdf, other]
Title: EduAgent: Generative Student Agents in Learning
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[227]  arXiv:2404.07954 (cross-list from cs.IR) [pdf, ps, other]
Title: An efficient domain-independent approach for supervised keyphrase extraction and ranking
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[228]  arXiv:2404.07917 (cross-list from cs.AI) [pdf, other]
Title: DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[229]  arXiv:2404.07839 (cross-list from cs.LG) [pdf, other]
[230]  arXiv:2404.07824 (cross-list from cs.CV) [pdf, other]
Title: Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[231]  arXiv:2404.07622 (cross-list from cs.CV) [pdf, other]
Title: Multi-Image Visual Question Answering for Unsupervised Anomaly Detection
Comments: 13 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[232]  arXiv:2404.07520 (cross-list from cs.CV) [pdf, other]
Title: PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination
Authors: Anant Khandelwal
Comments: Accepted at CVPR 2024 LIMIT, 12 pages, 8 Tables, 2 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[233]  arXiv:2404.07471 (cross-list from cs.SE) [pdf, other]
Title: Structure-aware Fine-tuning for Code Pre-trained Models
Comments: Accepted by COLING 2024
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[234]  arXiv:2404.07448 (cross-list from cs.CV) [pdf, other]
Title: Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[235]  arXiv:2404.07439 (cross-list from cs.AI) [pdf, other]
Title: Behavior Trees Enable Structured Programming of Language Model Agents
Authors: Richard Kelley
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[236]  arXiv:2404.07377 (cross-list from cs.LG) [pdf, other]
Title: Deep Generative Sampling in the Dual Divergence Space: A Data-efficient & Interpretative Approach for Generative AI
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[237]  arXiv:2404.07341 (cross-list from eess.AS) [pdf, other]
Title: Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[238]  arXiv:2404.07242 (cross-list from cs.CR) [pdf, other]
Title: Sandwich attack: Multi-language Mixture Adaptive Attack on LLMs
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[239]  arXiv:2404.07234 (cross-list from cs.CR) [pdf, other]
Title: Goal-guided Generative Prompt Injection Attack on Large Language Models
Comments: 22 pages, 8 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[240]  arXiv:2404.07221 (cross-list from cs.IR) [pdf, other]
Title: Improving Retrieval for RAG based Question Answering Models on Financial Documents
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG); General Finance (q-fin.GN)
[241]  arXiv:2404.07220 (cross-list from cs.IR) [pdf, other]
Title: Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers
Comments: Pre-print version of paper submitted to conference
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[242]  arXiv:2404.07214 (cross-list from cs.CV) [pdf, other]
Title: Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions
Comments: The most extensive and up to date Survey on Visual Language Models covering 76 Visual Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Thu, 11 Apr 2024 (showing first 11 of 48 entries)

[243]  arXiv:2404.07143 [pdf, other]
Title: Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Comments: 9 pages, 4 figures, 4 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[244]  arXiv:2404.07135 [pdf, other]
Title: Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[245]  arXiv:2404.07117 [pdf, other]
Title: Continuous Language Model Interpolation for Dynamic and Controllable Text Generation
Comments: 20 pages, 22 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[246]  arXiv:2404.07108 [pdf, other]
Title: From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications
Comments: 9 pages, 2 figures, under review
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[247]  arXiv:2404.07103 [pdf, other]
Title: Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
Comments: 21 pages. Code: this https URL
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[248]  arXiv:2404.07084 [pdf, other]
Title: Dynamic Generation of Personalities with Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[249]  arXiv:2404.07066 [pdf, other]
Title: Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
Comments: 12 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[250]  arXiv:2404.07060 [pdf, other]
Title: Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Comments: NAACL 2024 (Findings)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[251]  arXiv:2404.07053 [pdf, other]
Title: Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[252]  arXiv:2404.07036 [pdf, other]
Title: A Computational Analysis of the Dehumanisation of Migrants from Syria and Ukraine in Slovene News Media
Comments: The first authors have contributted equally. Accepted at LREC-COLING
Subjects: Computation and Language (cs.CL)
[253]  arXiv:2404.07017 [pdf, other]
Title: Improving Language Model Reasoning with Self-motivated Learning
Comments: Accepted at LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[ total of 352 entries: 1-100 | 54-153 | 154-253 | 254-352 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)