We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions

[ total of 427 entries: 1-191 | 192-382 | 383-427 ]
[ showing 191 entries per page: fewer | more | all ]

Fri, 31 May 2024

[1]  arXiv:2405.20335 [pdf, other]
Title: Xwin-LM: Strong and Scalable Alignment Practice for LLMs
Subjects: Computation and Language (cs.CL)
[2]  arXiv:2405.20318 [pdf, other]
Title: CausalQuest: Collecting Natural Causal Questions for AI Agents
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[3]  arXiv:2405.20315 [pdf, other]
Title: ANAH: Analytical Annotation of Hallucinations in Large Language Models
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[4]  arXiv:2405.20314 [pdf, ps, other]
Title: S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs
Subjects: Computation and Language (cs.CL)
[5]  arXiv:2405.20304 [pdf, other]
Title: Group Robust Preference Optimization in Reward-free RLHF
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[6]  arXiv:2405.20285 [pdf, other]
Title: Who Writes the Review, Human or AI?
Subjects: Computation and Language (cs.CL)
[7]  arXiv:2405.20274 [pdf, other]
Title: ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection
Comments: arXiv admin note: text overlap with arXiv:2309.13297
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8]  arXiv:2405.20269 [pdf, ps, other]
Title: IsraParlTweet: The Israeli Parliamentary and Twitter Resource
Comments: Presented at LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[9]  arXiv:2405.20267 [pdf, other]
Title: Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions
Subjects: Computation and Language (cs.CL)
[10]  arXiv:2405.20253 [pdf, other]
Title: Evaluating Large Language Model Biases in Persona-Steered Generation
Comments: Accepted to Findings of ACL 2024. Code and data available at this https URL
Subjects: Computation and Language (cs.CL)
[11]  arXiv:2405.20252 [pdf, other]
Title: Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2405.20245 [pdf, other]
Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use
Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[13]  arXiv:2405.20215 [pdf, other]
Title: TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models
Subjects: Computation and Language (cs.CL)
[14]  arXiv:2405.20204 [pdf, other]
Title: Jina CLIP: Your CLIP Model Is Also Your Text Retriever
Comments: 4 pages, ICML2024 workshop submission
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[15]  arXiv:2405.20192 [pdf, other]
Title: TAIA: Large Language Models are Out-of-Distribution Data Learners
Comments: 25 pages
Subjects: Computation and Language (cs.CL)
[16]  arXiv:2405.20179 [pdf, other]
Title: Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[17]  arXiv:2405.20175 [pdf, other]
Title: InstructionCP: A fast approach to transfer Large Language Models into target language
Comments: 10 pages, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[18]  arXiv:2405.20163 [pdf, other]
Title: Reasoning about concepts with LLMs: Inconsistencies abound
Comments: 15 pages, 5 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[19]  arXiv:2405.20145 [pdf, other]
Title: Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers
Comments: Accepted for publication at the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP-WS) 2024; 11 pages, 1 figure, 9 tables
Subjects: Computation and Language (cs.CL)
[20]  arXiv:2405.20139 [pdf, other]
Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[21]  arXiv:2405.20131 [pdf, other]
Title: Language Models Need Inductive Biases to Count Inductively
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22]  arXiv:2405.20092 [pdf, other]
Title: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[23]  arXiv:2405.20089 [pdf, other]
Title: The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities
Comments: Accepted to ACL 2024 (long, main)
Subjects: Computation and Language (cs.CL)
[24]  arXiv:2405.20079 [pdf, other]
Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[25]  arXiv:2405.20053 [pdf, other]
Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[26]  arXiv:2405.19967 [pdf, other]
Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[27]  arXiv:2405.19958 [pdf, other]
Title: Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation
Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[28]  arXiv:2405.19874 [pdf, other]
Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?
Comments: Preprint. Code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[29]  arXiv:2405.19856 [pdf, other]
Title: DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[30]  arXiv:2405.19846 [pdf, other]
Title: Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[31]  arXiv:2405.19842 [pdf, other]
Title: Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32]  arXiv:2405.19831 [pdf, other]
Title: Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text
Comments: 10 pages, 2 figures, 2 tables. Accepted to ARES 2024 (IWAPS)
Subjects: Computation and Language (cs.CL)
[33]  arXiv:2405.19799 [pdf, other]
Title: Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation
Subjects: Computation and Language (cs.CL)
[34]  arXiv:2405.19795 [pdf, other]
Title: SLM as Guardian: Pioneering AI Safety with Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35]  arXiv:2405.19793 [pdf, other]
Title: PDDLEGO: Iterative Planning in Textual Environments
Comments: In *SEM 2024
Subjects: Computation and Language (cs.CL)
[36]  arXiv:2405.19787 [pdf, other]
Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[37]  arXiv:2405.19778 [pdf, other]
Title: Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona
Comments: preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[38]  arXiv:2405.19763 [pdf, other]
Title: Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding
Comments: Accept at ACL2024 Main
Subjects: Computation and Language (cs.CL)
[39]  arXiv:2405.19744 [pdf, other]
Title: X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions
Comments: ACL 2024. Our codes, data and model weights are available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[40]  arXiv:2405.19740 [pdf, other]
Title: PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations
Comments: 23 pages, 12 figures, 10 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[41]  arXiv:2405.19737 [pdf, other]
Title: Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42]  arXiv:2405.19715 [pdf, other]
Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[43]  arXiv:2405.19701 [pdf, other]
Title: Significance of Chain of Thought in Gender Bias Mitigation for English-Dravidian Machine Translation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44]  arXiv:2405.19670 [pdf, other]
Title: One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Comments: working in progress, repo: this https URL
Subjects: Computation and Language (cs.CL)
[45]  arXiv:2405.19660 [pdf, other]
Title: PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[46]  arXiv:2405.19648 [pdf, other]
Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[47]  arXiv:2405.19635 [pdf, other]
Title: GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment
Subjects: Computation and Language (cs.CL)
[48]  arXiv:2405.19575 [pdf, other]
Title: A Deep Convolutional Neural Network-based Model for Aspect and Polarity Classification in Hausa Movie Reviews
Comments: To be published in the proceedings of ICCAIT 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[49]  arXiv:2405.19563 [pdf, other]
Title: Unlearning Climate Misinformation in Large Language Models
Subjects: Computation and Language (cs.CL)
[50]  arXiv:2405.19538 [pdf, other]
Title: CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients
Comments: 13 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[51]  arXiv:2405.19519 [pdf, other]
Title: Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[52]  arXiv:2405.19487 [pdf, other]
Title: A Full-duplex Speech Dialogue Scheme Based On Large Language Models
Subjects: Computation and Language (cs.CL)
[53]  arXiv:2405.19462 [pdf, other]
Title: Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[54]  arXiv:2405.19433 [pdf, other]
Title: Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals
Subjects: Computation and Language (cs.CL)
[55]  arXiv:2405.19426 [pdf, other]
Title: Deep Learning for Assessment of Oral Reading Fluency
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[56]  arXiv:2405.19425 [pdf, other]
Title: Adaptive In-conversation Team Building for Language Model Agents
Subjects: Computation and Language (cs.CL)
[57]  arXiv:2405.20341 (cross-list from cs.LG) [pdf, other]
Title: From Zero to Hero: Cold-Start Anomaly Detection
Comments: ACL 2024. Our code is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[58]  arXiv:2405.20309 (cross-list from cs.LG) [pdf, other]
Title: Large Language Models Can Self-Improve At Web Agent Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[59]  arXiv:2405.20271 (cross-list from cs.LG) [pdf, other]
Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Comments: Accepted to ICML 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[60]  arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]
Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[61]  arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]
Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition
Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)
Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[62]  arXiv:2405.20101 (cross-list from cs.SD) [pdf, other]
Title: Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[63]  arXiv:2405.20003 (cross-list from cs.LG) [pdf, other]
Title: Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[64]  arXiv:2405.19954 (cross-list from cs.CR) [pdf, other]
Title: GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[65]  arXiv:2405.19877 (cross-list from cs.AI) [pdf, other]
Title: KNOW: A Real-World Ontology for Knowledge Capture with Large Language Models
Authors: Arto Bendiken
Comments: 5 pages, 1 figure
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[66]  arXiv:2405.19782 (cross-list from cs.SE) [pdf, other]
Title: Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion
Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[67]  arXiv:2405.19732 (cross-list from cs.CV) [pdf, other]
Title: Two Optimizers Are Better Than One: LLM Catalyst for Enhancing Gradient-Based Optimization
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[68]  arXiv:2405.19716 (cross-list from cs.CV) [pdf, other]
Title: Enhancing Large Vision Language Models with Self-Training on Image Comprehension
Comments: 19 pages, 14 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[69]  arXiv:2405.19616 (cross-list from cs.AI) [pdf, other]
Title: Easy Problems That LLMs Get Wrong
Comments: AutogenAI Ltd. Associated code at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[70]  arXiv:2405.19597 (cross-list from cs.LG) [pdf, other]
Title: SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors
Comments: 17 pages, 5 figures, 14 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[71]  arXiv:2405.19592 (cross-list from cs.LG) [pdf, other]
Title: Why Larger Language Models Do In-context Learning Differently?
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[72]  arXiv:2405.19562 (cross-list from cs.CY) [pdf, other]
Title: Selective Explanations
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Machine Learning (cs.LG)
[73]  arXiv:2405.19561 (cross-list from cs.AI) [pdf, other]
Title: Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[74]  arXiv:2405.19534 (cross-list from cs.LG) [pdf, other]
Title: Preference Learning Algorithms Do Not Learn Preference Rankings
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[75]  arXiv:2405.19343 (cross-list from cs.SD) [pdf, other]
Title: Luganda Speech Intent Recognition for IoT Applications
Comments: Presented as a conference paper at ICLR 2024/AfricaNLP
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[76]  arXiv:2405.19342 (cross-list from cs.SD) [pdf, other]
Title: Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)

Thu, 30 May 2024

[77]  arXiv:2405.19327 [pdf, other]
[78]  arXiv:2405.19325 [pdf, other]
Title: Nearest Neighbor Speculative Decoding for LLM Generation and Attribution
Subjects: Computation and Language (cs.CL)
[79]  arXiv:2405.19323 [pdf, other]
Title: Are Large Language Models Chameleons?
Comments: 16 pages,8 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[80]  arXiv:2405.19299 [pdf, other]
Title: Expert-Guided Extinction of Toxic Tokens for Debiased Generation
Subjects: Computation and Language (cs.CL)
[81]  arXiv:2405.19290 [pdf, other]
Title: Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation
Comments: Accepted by ACL2024 Findings
Subjects: Computation and Language (cs.CL)
[82]  arXiv:2405.19285 [pdf, other]
Title: MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection
Subjects: Computation and Language (cs.CL)
[83]  arXiv:2405.19266 [pdf, other]
Title: PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Comments: A Technical Report on a Powerful Chinese Medical Large Language Model
Subjects: Computation and Language (cs.CL)
[84]  arXiv:2405.19265 [pdf, other]
Title: AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Comments: Preprint with 20 pages and 20 figures. Source code and models at this https URL
Subjects: Computation and Language (cs.CL)
[85]  arXiv:2405.19262 [pdf, other]
Title: Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[86]  arXiv:2405.19261 [pdf, other]
Title: Faster Cascades via Speculative Decoding
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87]  arXiv:2405.19222 [pdf, other]
Title: Lower Bounds on the Expressivity of Recurrent Neural Language Models
Subjects: Computation and Language (cs.CL)
[88]  arXiv:2405.19220 [pdf, other]
Title: WRDScore: New Metric for Evaluation of Natural Language Generation Models
Authors: Ravil Mussabayev
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[89]  arXiv:2405.19139 [pdf, other]
Title: DGRC: An Effective Fine-tuning Framework for Distractor Generation in Chinese Multi-choice Reading Comprehension
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[90]  arXiv:2405.19109 [pdf, other]
Title: PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL)
[91]  arXiv:2405.19094 [pdf, other]
Title: Faithful Chart Summarization with ChaTS-Pi
Comments: To be published in the proceedings of the 2024 Annual Meeting of the Association for Computational Linguistics
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[92]  arXiv:2405.19093 [pdf, other]
Title: Multi-stage Retrieve and Re-rank Model for Automatic Medical Coding Recommendation
Comments: Accepted to NAACL 2024 -- camera-ready version
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[93]  arXiv:2405.19088 [pdf, other]
Title: Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[94]  arXiv:2405.19086 [pdf, other]
Title: MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors
Authors: Renzhi Wang, Piji Li
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2405.19084 [pdf, other]
Title: Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification
Comments: Accepted to LREC-COLING 2024 -- camera-ready version
Subjects: Computation and Language (cs.CL)
[96]  arXiv:2405.19041 [pdf, other]
Title: BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[97]  arXiv:2405.19010 [pdf, other]
Title: Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Comments: 15 pages, 3 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[98]  arXiv:2405.18974 [pdf, other]
Title: Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection
Comments: 13pages, 4 figures (Accepted to Findings of ACL 2024)
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2405.18952 [pdf, other]
Title: Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
Authors: Peter Devine
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[100]  arXiv:2405.18922 [pdf, other]
Title: Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective
Comments: ACL 2024 main conference
Subjects: Computation and Language (cs.CL)
[101]  arXiv:2405.18915 [pdf, other]
Title: Towards Faithful Chain-of-Thought: Large Language Models are Bridging Reasoners
Comments: 25 pages, under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[102]  arXiv:2405.18906 [pdf, other]
Title: Language Generation with Strictly Proper Scoring Rules
Comments: ICML 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[103]  arXiv:2405.18845 [pdf, other]
Title: Simulation, Modelling and Classification of Wiki Contributors: Spotting The Good, The Bad, and The Ugly
Journal-ref: Simulation Modelling Practice and Theory, 120, 102616 (2022)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2405.18822 [pdf, other]
Title: Toxicity Detection for Free
Subjects: Computation and Language (cs.CL)
[105]  arXiv:2405.18741 [pdf, other]
Title: Genshin: General Shield for Natural Language Processing with Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106]  arXiv:2405.18740 [pdf, other]
Title: Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs
Subjects: Computation and Language (cs.CL)
[107]  arXiv:2405.18727 [pdf, other]
Title: CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control
Comments: 28 pages, 7 figures, 9 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[108]  arXiv:2405.18719 [pdf, other]
Title: Contextual Position Encoding: Learning to Count What's Important
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109]  arXiv:2405.18718 [pdf, other]
Title: Efficient Model-agnostic Alignment via Bayesian Persuasion
Subjects: Computation and Language (cs.CL)
[110]  arXiv:2405.18682 [pdf, other]
Title: Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[111]  arXiv:2405.18662 [pdf, other]
Title: Understanding Intrinsic Socioeconomic Biases in Large Language Models
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[112]  arXiv:2405.18653 [pdf, other]
Title: Recent Advances of Foundation Language Models-based Continual Learning: A Survey
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2405.18649 [pdf, other]
Title: Training LLMs to Better Self-Debug and Explain Code
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[114]  arXiv:2405.18638 [pdf, other]
Title: ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models
Comments: Accepted in ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[115]  arXiv:2405.18613 [pdf, ps, other]
Title: GLOCON Database: Design Decisions and User Manual (v1.0)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Databases (cs.DB); Machine Learning (cs.LG)
[116]  arXiv:2405.18605 [pdf, ps, other]
Title: BioBERT-based Deep Learning and Merged ChemProt-DrugProt for Enhanced Biomedical Relation Extraction
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Molecular Networks (q-bio.MN)
[117]  arXiv:2405.18540 [pdf, other]
Title: Learning diverse attacks on large language models for robust red-teaming and safety tuning
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[118]  arXiv:2405.18492 [pdf, other]
Title: LLMs and Memorization: On Quality and Specificity of Copyright Compliance
Comments: 10 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2405.18448 [pdf, other]
Title: Multi-objective Representation for Numbers in Clinical Narratives Using CamemBERT-bio
Comments: Under the revision. arXiv admin note: substantial text overlap with arXiv:2404.10171
Subjects: Computation and Language (cs.CL); Signal Processing (eess.SP)
[120]  arXiv:2405.19335 (cross-list from cs.CV) [pdf, other]
Title: X-VILA: Cross-Modality Alignment for Large Language Model
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[121]  arXiv:2405.19334 (cross-list from cs.AI) [pdf, other]
Title: LLMs Meet Multimodal Generation and Editing: A Survey
Comments: 51 Pages with 16 Figures, 12 Tables, and 534 References. GitHub Repository at: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[122]  arXiv:2405.19316 (cross-list from cs.LG) [pdf, other]
Title: Robust Preference Optimization through Reward Model Distillation
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[123]  arXiv:2405.19315 (cross-list from cs.CV) [pdf, other]
Title: Matryoshka Query Transformer for Large Vision-Language Models
Comments: Preprint. Our code and model are publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[124]  arXiv:2405.19313 (cross-list from cs.AI) [pdf, other]
Title: Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); General Economics (econ.GN)
[125]  arXiv:2405.19209 (cross-list from cs.CV) [pdf, other]
Title: VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Comments: 20 pages, first three authors contributed equally; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[126]  arXiv:2405.19186 (cross-list from cs.CV) [pdf, other]
Title: MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification
Authors: Laura Fieback (1,2), Jakob Spiegelberg (1), Hanno Gottschalk (2) ((1) Volkswagen AG, (2) TU Berlin)
Comments: 18 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[127]  arXiv:2405.19076 (cross-list from cs.CV) [pdf, other]
Title: Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design
Subjects: Computer Vision and Pattern Recognition (cs.CV); Mesoscale and Nanoscale Physics (cond-mat.mes-hall); Materials Science (cond-mat.mtrl-sci); Computation and Language (cs.CL); Machine Learning (cs.LG)
[128]  arXiv:2405.19026 (cross-list from cs.LG) [pdf, other]
Title: DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[129]  arXiv:2405.18991 (cross-list from cs.CV) [pdf, other]
Title: EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture
Comments: 6 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[130]  arXiv:2405.18937 (cross-list from cs.CV) [pdf, other]
Title: Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[131]  arXiv:2405.18874 (cross-list from cond-mat.dis-nn) [pdf, other]
Title: Are queries and keys always relevant? A case study on Transformer wave functions
Comments: 9 pages, 4 figures
Subjects: Disordered Systems and Neural Networks (cond-mat.dis-nn); Computation and Language (cs.CL); Computational Physics (physics.comp-ph)
[132]  arXiv:2405.18870 (cross-list from cs.AI) [pdf, other]
Title: LLMs achieve adult human performance on higher-order theory of mind tasks
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[133]  arXiv:2405.18776 (cross-list from cs.CR) [pdf, other]
Title: LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models
Comments: 18 pages, 15 figures
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[134]  arXiv:2405.18742 (cross-list from cs.AI) [pdf, other]
Title: Musical Phrase Segmentation via Grammatical Induction
Comments: Extended version of a paper appearing in the proceedings of IJCAI 2024 that includes additional material in an appendix. Please cite the IJCAI version
Journal-ref: Proceedings of the International Joint Conference on Artificial Intelligence, 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[135]  arXiv:2405.18721 (cross-list from cs.CV) [pdf, other]
Title: Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Comments: Accepted by TPAMI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[136]  arXiv:2405.18711 (cross-list from cs.AI) [pdf, other]
Title: Calibrating Reasoning in Language Models with Internal Consistency
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[137]  arXiv:2405.18688 (cross-list from cs.LG) [pdf, other]
Title: Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[138]  arXiv:2405.18672 (cross-list from cs.CV) [pdf, other]
Title: LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[139]  arXiv:2405.18669 (cross-list from cs.LG) [pdf, other]
Title: Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities
Comments: Under review at NeurIPS
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[140]  arXiv:2405.18642 (cross-list from cs.AI) [pdf, other]
Title: JADS: A Framework for Self-supervised Joint Aspect Discovery and Summarization
Comments: preprint
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[141]  arXiv:2405.18639 (cross-list from q-bio.NC) [pdf, other]
Title: Improving Speech Decoding from ECoG with Self-Supervised Pretraining
Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142]  arXiv:2405.18634 (cross-list from cs.LG) [pdf, other]
Title: A Theoretical Understanding of Self-Correction through In-context Alignment
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[143]  arXiv:2405.18628 (cross-list from cs.LG) [pdf, other]
Title: Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference
Comments: The code for this implementation is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[144]  arXiv:2405.18620 (cross-list from cs.HC) [pdf, other]
Title: RealitySummary: On-Demand Mixed Reality Document Enhancement using Large Language Models
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[145]  arXiv:2405.18572 (cross-list from cs.LG) [pdf, other]
Title: Low-rank finetuning for LLMs: A fairness perspective
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[146]  arXiv:2405.18570 (cross-list from cs.CV) [pdf, other]
Title: Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[147]  arXiv:2405.18542 (cross-list from cs.AI) [pdf, other]
Title: Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[148]  arXiv:2405.17653 (cross-list from cs.LG) [pdf, other]
Title: InversionView: A General-Purpose Method for Reading Information from Neural Activations
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 29 May 2024 (showing first 43 of 81 entries)

[149]  arXiv:2405.18433 [pdf, other]
Title: Notes on Applicability of GPT-4 to Document Understanding
Subjects: Computation and Language (cs.CL)
[150]  arXiv:2405.18414 [pdf, other]
Title: Don't Forget to Connect! Improving RAG with Graph-based Reranking
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[151]  arXiv:2405.18400 [pdf, other]
Title: Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
Comments: 22 pages, 15 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[152]  arXiv:2405.18375 [pdf, other]
Title: Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning
Authors: Phakphum Artkaew
Subjects: Computation and Language (cs.CL)
[153]  arXiv:2405.18369 [pdf, other]
Title: PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[154]  arXiv:2405.18359 [pdf, other]
Title: Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[155]  arXiv:2405.18358 [pdf, other]
Title: MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[156]  arXiv:2405.18357 [pdf, other]
Title: Faithful Logical Reasoning via Symbolic Chain-of-Thought
Comments: Accepted by ACL 2024 (main proceeding)
Subjects: Computation and Language (cs.CL)
[157]  arXiv:2405.18350 [pdf, other]
Title: A System for Automatic English Text Expansion
Journal-ref: (2019) IEEE Access, 7, 123320-123333
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[158]  arXiv:2405.18348 [pdf, other]
Title: Can Automatic Metrics Assess High-Quality Translations?
Comments: work in progress
Subjects: Computation and Language (cs.CL)
[159]  arXiv:2405.18344 [pdf, other]
Title: The Battle of LLMs: A Comparative Study in Conversational QA Tasks
Comments: 9 pages, 4 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[160]  arXiv:2405.18335 [pdf, other]
Title: Interpretable classification of wiki-review streams
Journal-ref: (2023) IEEE Access
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[161]  arXiv:2405.18308 [pdf, other]
Title: Joint Lemmatization and Morphological Tagging with LEMMING
Comments: EMNLP 2015; Honorable Mention for Best Short Paper
Subjects: Computation and Language (cs.CL)
[162]  arXiv:2405.18292 [pdf, other]
Title: Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning
Authors: Renzhi Wang, Piji Li
Comments: Accepted at Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[163]  arXiv:2405.18241 [pdf, ps, other]
Title: Active Use of Latent Constituency Representation in both Humans and Large Language Models
Comments: 62 pages, 5 figures. Under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[164]  arXiv:2405.18203 [pdf, other]
Title: IAPT: Instruction-Aware Prompt Tuning for Large Language Models
Comments: Accepted by ACL-2024
Subjects: Computation and Language (cs.CL)
[165]  arXiv:2405.18115 [pdf, other]
Title: The Knesset Corpus: An Annotated Corpus of Hebrew Parliamentary Proceedings
Authors: Gili Goldin (1), Nick Howell (2), Noam Ordan (2), Ella Rabinovich (3), Shuly Wintner (1) ((1) Department of Computer Science, University of Haifa, Israel, (2) IAHLT, Israel, (3) School of Computer Science, The Academic College of Tel-Aviv Yaffo, Israel)
Comments: 28 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2405.18113 [pdf, other]
Title: Facilitating Multi-Role and Multi-Behavior Collaboration of Large Language Models for Online Job Seeking and Recruiting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[167]  arXiv:2405.18111 [pdf, other]
Title: ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
Comments: 16 pages
Subjects: Computation and Language (cs.CL)
[168]  arXiv:2405.18061 [pdf, other]
Title: Context is Important in Depressive Language: A Study of the Interaction Between the Sentiments and Linguistic Markers in Reddit Discussions
Subjects: Computation and Language (cs.CL)
[169]  arXiv:2405.18060 [pdf, ps, other]
Title: PRFashion24: A Dataset for Sentiment Analysis of Fashion Products Reviews in Persian
Comments: 8 page
Subjects: Computation and Language (cs.CL)
[170]  arXiv:2405.18035 [pdf, other]
Title: Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis
Comments: ACL Findings 2024
Subjects: Computation and Language (cs.CL)
[171]  arXiv:2405.18028 [pdf, other]
Title: Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[172]  arXiv:2405.18027 [pdf, other]
Title: TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
Comments: ACL 2024 Findings. Code and dataset are released at this https URL
Subjects: Computation and Language (cs.CL)
[173]  arXiv:2405.18015 [pdf, other]
Title: MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction
Comments: Under review; feedback welcome
Subjects: Computation and Language (cs.CL)
[174]  arXiv:2405.18009 [pdf, other]
Title: Exploring Context Window of Large Language Models via Decomposed Positional Vectors
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[175]  arXiv:2405.17992 [pdf, other]
Title: fMRI predictors based on language models of increasing complexity recover brain left lateralization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
[176]  arXiv:2405.17980 [pdf, other]
Title: Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering
Subjects: Computation and Language (cs.CL)
[177]  arXiv:2405.17978 [pdf, other]
Title: FASTopic: A Fast, Adaptive, Stable, and Transferable Topic Modeling Paradigm
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[178]  arXiv:2405.17977 [pdf, other]
Title: Aligning to Thousands of Preferences via System Message Generalization
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[179]  arXiv:2405.17974 [pdf, other]
Title: Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations
Comments: Presented in LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[180]  arXiv:2405.17969 [pdf, other]
Title: Knowledge Circuits in Pretrained Transformers
Comments: Work in progress, 25 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[181]  arXiv:2405.17964 [pdf, other]
Title: Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection
Subjects: Computation and Language (cs.CL)
[182]  arXiv:2405.17957 [pdf, other]
Title: Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[183]  arXiv:2405.17935 [pdf, other]
Title: Tool Learning with Large Language Models: A Survey
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[184]  arXiv:2405.17931 [pdf, other]
Title: Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[185]  arXiv:2405.17915 [pdf, other]
Title: Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
Comments: 13 pages, 5 figures, ACL 2024
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2405.17900 [pdf, other]
Title: Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning
Comments: Accepted by the 20th International Conference on Intelligent Computing (ICIC 2024)
Subjects: Computation and Language (cs.CL)
[187]  arXiv:2405.17893 [pdf, other]
Title: Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Comments: 12 pages, 4 figures, accepted by NAACL 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[188]  arXiv:2405.17840 [pdf, other]
[189]  arXiv:2405.17830 [pdf, other]
Title: More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2405.17822 [pdf, other]
Title: Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[191]  arXiv:2405.17809 [pdf, other]
Title: TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 427 entries: 1-191 | 192-382 | 383-427 ]
[ showing 191 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)