We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 103

[ total of 489 entries: 1-98 | 6-103 | 104-201 | 202-299 | 300-397 | 398-489 ]
[ showing 98 entries per page: fewer | more | all ]

Wed, 5 Jun 2024 (continued, showing last 80 of 93 entries)

[104]  arXiv:2406.02350 [pdf, other]
Title: LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing
Authors: Maojun Sun
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[105]  arXiv:2406.02338 [pdf, other]
Title: Linguistic Fingerprint in Transformer Models: How Language Variation Influences Parameter Selection in Irony Detection
Journal-ref: Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives) @ LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106]  arXiv:2406.02335 [pdf, other]
Title: Probing the Category of Verbal Aspect in Transformer Language Models
Subjects: Computation and Language (cs.CL)
[107]  arXiv:2406.02331 [pdf, other]
Title: Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Comments: ACL 2024 Findings Accepted
Subjects: Computation and Language (cs.CL)
[108]  arXiv:2406.02329 [pdf, other]
Title: On Affine Homotopy between Language Encoders
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[109]  arXiv:2406.02325 [pdf, other]
Title: Technical Language Processing for Telecommunications Specifications
Comments: Still not published
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[110]  arXiv:2406.02301 [pdf, other]
Title: mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Comments: Accepted to ACL 2024 main
Subjects: Computation and Language (cs.CL)
[111]  arXiv:2406.02267 [pdf, ps, other]
Title: Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation
Comments: To appear at The 25th Annual Conference of the European Association for Machine Translation (EAMT 2024)
Subjects: Computation and Language (cs.CL)
[112]  arXiv:2406.02266 [pdf, ps, other]
Title: Enhancing Retrieval-Augmented LMs with a Two-stage Consistency Learning Compressor
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2406.02251 [pdf, other]
Title: Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
Comments: Accepted to ACL 2024 Findings. arXiv admin note: text overlap with arXiv:2212.11382
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114]  arXiv:2406.02245 [pdf, other]
Title: Description Boosting for Zero-Shot Entity and Relation Classification
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[115]  arXiv:2406.02237 [pdf, other]
Title: Self-Modifying State Modeling for Simultaneous Machine Translation
Comments: Accept to ACL 2024 main conference. 15 pages, 13 figures, 9 tables
Subjects: Computation and Language (cs.CL)
[116]  arXiv:2406.02224 [pdf, other]
Title: FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117]  arXiv:2406.02169 [pdf, ps, other]
Title: A multilingual dataset for offensive language and hate speech detection for hausa, yoruba and igbo languages
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[118]  arXiv:2406.02148 [pdf, other]
Title: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models
Comments: Accepted to ACL-24 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2406.02143 [pdf, other]
Title: Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models
Comments: ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL)
[120]  arXiv:2406.02134 [pdf, other]
Title: The current status of large language models in summarizing radiology report impressions
Subjects: Computation and Language (cs.CL)
[121]  arXiv:2406.02120 [pdf, other]
Title: Diver: Large Language Model Decoding with Span-Level Mutual Information Verification
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2406.02110 [pdf, other]
Title: UniOQA: A Unified Framework for Knowledge Graph Question Answering with Large Language Models
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[123]  arXiv:2406.02106 [pdf, other]
Title: MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset
Subjects: Computation and Language (cs.CL)
[124]  arXiv:2406.02100 [pdf, other]
Title: Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
Comments: Accept by Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[125]  arXiv:2406.02080 [pdf, other]
Title: LongSSM: On the Length Extension of State-space Models in Language Modelling
Authors: Shida Wang
Comments: 23 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[126]  arXiv:2406.02079 [pdf, ps, other]
Title: Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2406.02069 [pdf, other]
Title: PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[128]  arXiv:2406.02060 [pdf, ps, other]
Title: I've got the "Answer"! Interpretation of LLMs Hidden States in Question Answering
Comments: Accepted for NLDB-2024 conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129]  arXiv:2406.02050 [pdf, other]
Title: Analyzing Social Biases in Japanese Large Language Models
Subjects: Computation and Language (cs.CL)
[130]  arXiv:2406.02044 [pdf, ps, other]
Title: QROA: A Black-Box Query-Response Optimization Attack on LLMs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[131]  arXiv:2406.02030 [pdf, other]
Title: Multimodal Reasoning with Multimodal Knowledge Graph
Comments: Accepted by ACL 2024 (Main Conference)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132]  arXiv:2406.02018 [pdf, other]
Title: Why Would You Suggest That? Human Trust in Language Model Responses
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[133]  arXiv:2406.02002 [pdf, other]
Title: Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Comments: Accepted to IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134]  arXiv:2406.01988 [pdf, other]
Title: Personalized Topic Selection Model for Topic-Grounded Dialogue
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[135]  arXiv:2406.01983 [pdf, other]
Title: RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models
Comments: Work is in progress
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2406.01981 [pdf, other]
Title: Zyda: A 1.3T Dataset for Open Language Modeling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[137]  arXiv:2406.01976 [pdf, other]
Title: Conditional Language Learning with Context
Authors: Xiao Zhang, Miao Li, Ji Wu
Comments: To appear at the 41st International Conference on Machine Learning (ICML 2024)
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2406.01943 [pdf, ps, other]
Title: Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs
Authors: Nik Bear Brown
Comments: An extensive survey of the literature specifying algorithms and techniques enhancing the trustworthiness and understanding of Large Language Models (LLMs)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139]  arXiv:2406.01940 [pdf, other]
Title: Process-Driven Autoformalization in Lean 4
Comments: 22 pages, 1 figures, 11 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[140]  arXiv:2406.01934 [pdf, other]
Title: Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[141]  arXiv:2406.01931 [pdf, other]
Title: Dishonesty in Helpful and Harmless Alignment
Subjects: Computation and Language (cs.CL)
[142]  arXiv:2406.01919 [pdf, other]
Title: OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection
Comments: Accepted by ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[143]  arXiv:2406.01879 [pdf, other]
Title: Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Comments: 12 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[144]  arXiv:2406.01873 [pdf, other]
Title: CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models
Comments: Accepted by ACL Findings 2024
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[145]  arXiv:2406.01866 [pdf, other]
Title: #EpiTwitter: Public Health Messaging During the COVID-19 Pandemic
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[146]  arXiv:2406.01863 [pdf, other]
Title: Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Subjects: Computation and Language (cs.CL)
[147]  arXiv:2406.01860 [pdf, other]
Title: Eliciting the Priors of Large Language Models using Iterated In-Context Learning
Subjects: Computation and Language (cs.CL)
[148]  arXiv:2406.01855 [pdf, other]
Title: TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[149]  arXiv:2406.01835 [pdf, other]
Title: An Open Multilingual System for Scoring Readability of Wikipedia
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[150]  arXiv:2406.01806 [pdf, other]
Title: Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[151]  arXiv:2406.01775 [pdf, other]
Title: OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[152]  arXiv:2406.01771 [pdf, other]
Title: LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Comments: Accepted to Findings of ACL 2024. The code, datasets, and models are publicly available at this https URL
Subjects: Computation and Language (cs.CL)
[153]  arXiv:2406.01749 [pdf, ps, other]
Title: Towards Harnessing Large Language Models for Comprehension of Conversational Grounding
Comments: Accepted to IWSDS 2024
Subjects: Computation and Language (cs.CL)
[154]  arXiv:2406.01721 [pdf, other]
Title: Rotation and Permutation for Advanced Outlier Management and Efficient Quantization of LLMs
Comments: 26 pages, 13 figures
Subjects: Computation and Language (cs.CL)
[155]  arXiv:2406.02543 (cross-list from cs.LG) [pdf, other]
Title: To Believe or Not to Believe Your LLM
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[156]  arXiv:2406.02539 (cross-list from cs.CV) [pdf, other]
Title: Parrot: Multilingual Visual Instruction Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[157]  arXiv:2406.02488 (cross-list from eess.AS) [pdf, other]
Title: Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[158]  arXiv:2406.02469 (cross-list from cs.LG) [pdf, other]
Title: Landscape-Aware Growing: The Power of a Little LAG
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[159]  arXiv:2406.02377 (cross-list from cs.IR) [pdf, other]
Title: XRec: Large Language Models for Explainable Recommendation
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[160]  arXiv:2406.02368 (cross-list from cs.IR) [pdf, other]
Title: Large Language Models Make Sample-Efficient Recommender Systems
Comments: Accepted by Frontier of Computer Science
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[161]  arXiv:2406.02356 (cross-list from cs.LG) [pdf, other]
Title: Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks
Comments: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[162]  arXiv:2406.02332 (cross-list from cs.LG) [pdf, other]
Title: Extended Mind Transformers
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[163]  arXiv:2406.02265 (cross-list from cs.CV) [pdf, other]
Title: Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning
Comments: 9 pages, long paper at ACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[164]  arXiv:2406.02208 (cross-list from cs.CV) [pdf, other]
Title: Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Comments: IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[165]  arXiv:2406.02166 (cross-list from cs.SD) [pdf, other]
Title: Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[166]  arXiv:2406.02135 (cross-list from cs.IR) [pdf, other]
Title: Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval
Comments: Accepted by ECML-PKDD'24 as Outstanding Paper. 8 pages, 2 figures, 7 tables
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[167]  arXiv:2406.02133 (cross-list from eess.AS) [pdf, other]
Title: SimulTron: On-Device Simultaneous Speech to Speech Translation
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[168]  arXiv:2406.02128 (cross-list from cs.LG) [pdf, other]
Title: Iteration Head: A Mechanistic Study of Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[169]  arXiv:2406.02061 (cross-list from cs.LG) [pdf, other]
Title: Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
Comments: v1
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[170]  arXiv:2406.02009 (cross-list from eess.AS) [pdf, other]
Title: Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
Comments: Accepted by Interspeech 2024
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[171]  arXiv:2406.02004 (cross-list from cs.CR) [pdf, ps, other]
Title: Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[172]  arXiv:2406.01946 (cross-list from cs.CR) [pdf, other]
Title: Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[173]  arXiv:2406.01914 (cross-list from cs.CV) [pdf, other]
Title: HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[174]  arXiv:2406.01895 (cross-list from cs.LG) [pdf, other]
Title: Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Comments: 32 pages, 16 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[175]  arXiv:2406.01876 (cross-list from cs.DB) [pdf, other]
Title: GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
Comments: KDD 2024 Camera Ready; 11 pages, 8 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[176]  arXiv:2406.01789 (cross-list from cs.LG) [pdf, ps, other]
Title: AI-based Classification of Customer Support Tickets: State of the Art and Implementation with AutoML
Journal-ref: Proceedings of the IWEMB 2021/2022: Fifth and Sixth International Workshop on Entrepreneurship, Electronic and Mobile Business
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[177]  arXiv:2406.01638 (cross-list from cs.LG) [pdf, other]
Title: TimeCMA: Towards LLM-Empowered Time Series Forecasting via Cross-Modality Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[178]  arXiv:2406.01633 (cross-list from cs.IR) [pdf, other]
Title: On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
Comments: Preprint of UAI'24 conference publication
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[179]  arXiv:2406.01624 (cross-list from eess.AS) [pdf, other]
Title: Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition
Comments: Published in: Springer Nature International Journal of Applied Intelligence (2024)
Journal-ref: Applied Intelligence (2024)
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[180]  arXiv:2406.01609 (cross-list from cs.IR) [pdf, other]
Title: Judgement Citation Retrieval using Contextual Similarity
Comments: 14 pages, 16 images, Submitted to Multimedia Tools and Applications Springer journal
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181]  arXiv:2406.01608 (cross-list from cs.IR) [pdf, other]
Title: Detecting Deceptive Dark Patterns in E-commerce Platforms
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[182]  arXiv:2406.01607 (cross-list from cs.IR) [pdf, other]
Title: Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Authors: Hongliu Cao
Comments: 45 pages
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[183]  arXiv:2406.01606 (cross-list from cs.IR) [pdf, other]
Title: SymTax: Symbiotic Relationship and Taxonomy Fusion for Effective Citation Recommendation
Comments: Accepted in ACL 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 4 Jun 2024 (showing first 18 of 153 entries)

[184]  arXiv:2406.01574 [pdf, other]
Title: MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Subjects: Computation and Language (cs.CL)
[185]  arXiv:2406.01563 [pdf, other]
Title: LoFiT: Localized Fine-tuning on LLM Representations
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2406.01549 [pdf, other]
Title: An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Comments: ACL24 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187]  arXiv:2406.01538 [pdf, other]
Title: What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores
Comments: 10 pages, 4 figures in the main paper
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[188]  arXiv:2406.01514 [pdf, other]
Title: Decoupled Alignment for Robust Plug-and-Play Adaptation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[189]  arXiv:2406.01512 [pdf, other]
Title: MAD: Multi-Alignment MEG-to-Text Decoding
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2406.01506 [pdf, other]
Title: The Geometry of Categorical and Hierarchical Concepts in Large Language Models
Comments: Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[191]  arXiv:2406.01495 [pdf, other]
Title: Reflection-Reinforced Self-Training for Language Agents
Subjects: Computation and Language (cs.CL)
[192]  arXiv:2406.01468 [pdf, other]
Title: Understanding Token Probability Encoding in Output Embeddings
Comments: 15 pages, 17 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[193]  arXiv:2406.01446 [pdf, ps, other]
Title: Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach
Authors: Ara Yeroyan (Data Science Department, American University of Armenia), Nikolay Karpov (Nvidia, NeMo Conversational AI team)
Comments: 13 pages, 10 figures (including ablation studies), to be published in 2024 IEEE Spoken Language Technology Workshop. Additionally, the associated software package can be accessed at (this https URL) for practical applications and further development
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[194]  arXiv:2406.01441 [pdf, other]
Title: LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation
Subjects: Computation and Language (cs.CL)
[195]  arXiv:2406.01436 [pdf, other]
Title: Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
Subjects: Computation and Language (cs.CL)
[196]  arXiv:2406.01428 [pdf, ps, other]
Title: Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[197]  arXiv:2406.01392 [pdf, other]
Title: Sparsity-Accelerated Training for Large Language Models
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[198]  arXiv:2406.01382 [pdf, other]
Title: Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function
Comments: To appear in ICML 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[199]  arXiv:2406.01375 [pdf, other]
Title: D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
Subjects: Computation and Language (cs.CL)
[200]  arXiv:2406.01372 [pdf, ps, other]
Title: Linguistic Analysis, Description, and Typological Exploration with Categorial Grammar (TheBench Guide)
Authors: Cem Bozsahin
Subjects: Computation and Language (cs.CL)
[201]  arXiv:2406.01363 [pdf, other]
Title: Privacy in LLM-based Recommendation: Recent Advances and Future Directions
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[ total of 489 entries: 1-98 | 6-103 | 104-201 | 202-299 | 300-397 | 398-489 ]
[ showing 98 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)