We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 127

[ total of 497 entries: 1-75 | 53-127 | 128-202 | 203-277 | 278-352 | 353-427 | 428-497 ]
[ showing 75 entries per page: fewer | more | all ]

Thu, 6 Jun 2024 (continued, showing last 47 of 90 entries)

[128]  arXiv:2406.02886 [pdf, other]
Title: PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129]  arXiv:2406.02882 [pdf, other]
Title: Outdated Issue Aware Decoding for Factual Knowledge Editing
Comments: ACL2024 Findings, Codes are at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130]  arXiv:2406.02876 [pdf, other]
Title: LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
Comments: ACL2024 Findings, Codes are at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131]  arXiv:2406.02864 [pdf, other]
Title: NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132]  arXiv:2406.02863 [pdf, ps, other]
Title: LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation
Comments: Presented in AAAI 2024 Spring Symposium. The first two authors contributed equally
Subjects: Computation and Language (cs.CL)
[133]  arXiv:2406.02856 [pdf, other]
Title: Xmodel-LM Technical Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134]  arXiv:2406.02832 [pdf, other]
Title: Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[135]  arXiv:2406.02830 [pdf, other]
Title: Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies
Comments: Accepted to ACL 2024 findings
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2406.02826 [pdf, other]
Title: Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes
Comments: Clinical NLP Workshop 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[137]  arXiv:2406.02818 [pdf, other]
Title: Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
Comments: 19 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2406.02787 [pdf, other]
Title: Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
Comments: 22 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[139]  arXiv:2406.02756 [pdf, other]
Title: Aligning Large Language Models via Fine-grained Supervision
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[140]  arXiv:2406.02746 [pdf, other]
Title: RATT: AThought Structure for Coherent and Correct LLMReasoning
Subjects: Computation and Language (cs.CL)
[141]  arXiv:2406.02733 [pdf, other]
Title: Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Comments: Accepted to ACL 2024 (findings)
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142]  arXiv:2406.02721 [pdf, other]
Title: Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Comments: 41 pages, 12 figures, 61 tables; Website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[143]  arXiv:2406.02657 [pdf, other]
Title: Block Transformer: Global-to-Local Language Modeling for Fast Inference
Comments: 30 pages, 21 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[144]  arXiv:2406.02577 [pdf, other]
Title: Are PPO-ed Language Models Hackable?
Comments: 8 pages, 4 figures
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[145]  arXiv:2406.02575 [pdf, other]
Title: Cross-Modal Safety Alignment: Is textual unlearning all you need?
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[146]  arXiv:2406.03482 (cross-list from cs.LG) [pdf, other]
Title: QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Performance (cs.PF)
[147]  arXiv:2406.03476 (cross-list from cs.LG) [pdf, other]
Title: Does your data spark joy? Performance gains from domain upsampling at the end of training
Comments: The first three authors contributed equally
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[148]  arXiv:2406.03445 (cross-list from cs.LG) [pdf, other]
Title: Pre-trained Large Language Models Use Fourier Features to Compute Addition
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[149]  arXiv:2406.03299 (cross-list from cs.AI) [pdf, other]
Title: The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[150]  arXiv:2406.03287 (cross-list from cs.NE) [pdf, other]
Title: SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
Subjects: Neural and Evolutionary Computing (cs.NE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[151]  arXiv:2406.03280 (cross-list from cs.LG) [pdf, other]
Title: FusionBench: A Comprehensive Benchmark of Deep Model Fusion
Comments: Project homepage: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[152]  arXiv:2406.03248 (cross-list from cs.IR) [pdf, other]
Title: Large Language Models as Evaluators for Recommendation Explanations
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[153]  arXiv:2406.03068 (cross-list from cs.LG) [pdf, other]
Title: How Truncating Weights Improves Reasoning in Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[154]  arXiv:2406.03008 (cross-list from cs.CV) [pdf, other]
Title: DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences
Comments: First Vision and Language for Autonomous Driving and Robotics Workshop (VLADR @ CVPR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[155]  arXiv:2406.02969 (cross-list from cs.LG) [pdf, other]
Title: Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models
Comments: 29 pages, 5 Appendix sections
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computational Finance (q-fin.CP); Mathematical Finance (q-fin.MF)
[156]  arXiv:2406.02958 (cross-list from cs.LG) [pdf, other]
Title: PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs
Comments: ICML 2024 (Oral)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[157]  arXiv:2406.02950 (cross-list from eess.AS) [pdf, other]
Title: 4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders
Comments: submitted to IEEE/ACM Transactions on Audio Speech and Language Processing
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[158]  arXiv:2406.02943 (cross-list from cs.IR) [pdf, ps, other]
Title: The Task-oriented Queries Benchmark (ToQB)
Authors: Keun Soo Yim
Comments: Data available on GitHub, this https URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Neural and Evolutionary Computing (cs.NE)
[159]  arXiv:2406.02925 (cross-list from eess.AS) [pdf, other]
Title: SYN2REAL: Leveraging Task Arithmetic for Mitigating Synthetic-Real Discrepancies in ASR Domain Adaptation
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[160]  arXiv:2406.02924 (cross-list from cs.LG) [pdf, other]
Title: Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models
Comments: Accepted by ICML2024, 29 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[161]  arXiv:2406.02900 (cross-list from cs.LG) [pdf, other]
Title: Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[162]  arXiv:2406.02844 (cross-list from cs.IR) [pdf, other]
Title: Item-Language Model for Conversational Recommendation
Comments: 15 pages, 3 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[163]  arXiv:2406.02804 (cross-list from cs.AI) [pdf, other]
Title: $\texttt{ACCORD}$: Closing the Commonsense Measurability Gap
Comments: For leaderboard and dataset download, see this https URL For source code, see this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[164]  arXiv:2406.02798 (cross-list from cs.DL) [pdf, ps, other]
Title: Promotional Language and the Adoption of Innovative Ideas in Science
Subjects: Digital Libraries (cs.DL); Computation and Language (cs.CL); Computers and Society (cs.CY)
[165]  arXiv:2406.02795 (cross-list from cs.HC) [pdf, other]
Title: ArguMentor: Augmenting User Experiences with Counter-Perspectives
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[166]  arXiv:2406.02791 (cross-list from cs.AI) [pdf, other]
Title: Language Models can Infer Action Semantics for Classical Planners from Environment Feedback
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[167]  arXiv:2406.02592 (cross-list from cs.LG) [pdf, other]
Title: LOLAMEME: Logic, Language, Memory, Mechanistic Framework
Comments: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168]  arXiv:2406.02566 (cross-list from eess.AS) [pdf, other]
Title: Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[169]  arXiv:2406.02565 (cross-list from cs.SD) [pdf, other]
Title: Sequence-to-sequence models in peer-to-peer learning: A practical application
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Multiagent Systems (cs.MA); Audio and Speech Processing (eess.AS)
[170]  arXiv:2406.02563 (cross-list from eess.AS) [pdf, other]
Title: A cost minimization approach to fix the vocabulary size in a tokenizer for an End-to-End ASR system
Comments: 5 pages, 4 figures
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[171]  arXiv:2406.02562 (cross-list from eess.AS) [pdf, other]
Title: Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices
Comments: Table 2 is revised
Journal-ref: ICASSP 2024 Workshop(HSCMA 2024) paper
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[172]  arXiv:2406.02560 (cross-list from eess.AS) [pdf, other]
Title: Less Peaky and More Accurate CTC Forced Alignment by Label Priors
Comments: Accepted by ICASSP 2024. Github repo: this https URL
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[173]  arXiv:2406.02555 (cross-list from eess.AS) [pdf, ps, other]
Title: PhoWhisper: Automatic Speech Recognition for Vietnamese
Comments: Accepted to ICLR 2024 Tiny Papers Track
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[174]  arXiv:2406.02554 (cross-list from eess.AS) [pdf, other]
Title: Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)

Wed, 5 Jun 2024 (showing first 28 of 93 entries)

[175]  arXiv:2406.02537 [pdf, other]
Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[176]  arXiv:2406.02536 [pdf, other]
Title: Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[177]  arXiv:2406.02532 [pdf, other]
Title: SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Comments: preprint. arXiv admin note: text overlap with arXiv:2312.17238 by other authors
Subjects: Computation and Language (cs.CL)
[178]  arXiv:2406.02528 [pdf, other]
Title: Scalable MatMul-free Language Modeling
Subjects: Computation and Language (cs.CL)
[179]  arXiv:2406.02524 [pdf, other]
Title: CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
Subjects: Computation and Language (cs.CL)
[180]  arXiv:2406.02517 [pdf, other]
Title: Deterministic Reversible Data Augmentation for Neural Machine Translation
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[181]  arXiv:2406.02481 [pdf, other]
Title: Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion
Comments: Work in progress. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[182]  arXiv:2406.02472 [pdf, other]
Title: Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[183]  arXiv:2406.02449 [pdf, other]
Title: Representations as Language: An Information-Theoretic Framework for Interpretability
Comments: 6 pages, 3 Figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[184]  arXiv:2406.02396 [pdf, other]
Title: The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[185]  arXiv:2406.02394 [pdf, other]
Title: Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186]  arXiv:2406.02378 [pdf, other]
Title: On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept
Comments: 22 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[187]  arXiv:2406.02376 [pdf, other]
Title: Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[188]  arXiv:2406.02350 [pdf, other]
Title: LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing
Authors: Maojun Sun
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[189]  arXiv:2406.02338 [pdf, other]
Title: Linguistic Fingerprint in Transformer Models: How Language Variation Influences Parameter Selection in Irony Detection
Journal-ref: Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives) @ LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[190]  arXiv:2406.02335 [pdf, other]
Title: Probing the Category of Verbal Aspect in Transformer Language Models
Subjects: Computation and Language (cs.CL)
[191]  arXiv:2406.02331 [pdf, other]
Title: Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Comments: ACL 2024 Findings Accepted
Subjects: Computation and Language (cs.CL)
[192]  arXiv:2406.02329 [pdf, other]
Title: On Affine Homotopy between Language Encoders
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[193]  arXiv:2406.02325 [pdf, other]
Title: Technical Language Processing for Telecommunications Specifications
Comments: Still not published
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[194]  arXiv:2406.02301 [pdf, other]
Title: mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Comments: Accepted to ACL 2024 main
Subjects: Computation and Language (cs.CL)
[195]  arXiv:2406.02267 [pdf, ps, other]
Title: Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation
Comments: To appear at The 25th Annual Conference of the European Association for Machine Translation (EAMT 2024)
Subjects: Computation and Language (cs.CL)
[196]  arXiv:2406.02266 [pdf, ps, other]
Title: Enhancing Retrieval-Augmented LMs with a Two-stage Consistency Learning Compressor
Subjects: Computation and Language (cs.CL)
[197]  arXiv:2406.02251 [pdf, other]
Title: Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
Comments: Accepted to ACL 2024 Findings. arXiv admin note: text overlap with arXiv:2212.11382
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[198]  arXiv:2406.02245 [pdf, other]
Title: Description Boosting for Zero-Shot Entity and Relation Classification
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[199]  arXiv:2406.02237 [pdf, other]
Title: Self-Modifying State Modeling for Simultaneous Machine Translation
Comments: Accept to ACL 2024 main conference. 15 pages, 13 figures, 9 tables
Subjects: Computation and Language (cs.CL)
[200]  arXiv:2406.02224 [pdf, other]
Title: FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[201]  arXiv:2406.02169 [src]
Title: A multilingual dataset for offensive language and hate speech detection for hausa, yoruba and igbo languages
Comments: The experimental result was erroneously reported and we also omitted other authors
Subjects: Computation and Language (cs.CL)
[202]  arXiv:2406.02148 [pdf, other]
Title: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models
Comments: Accepted to ACL-24 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[ total of 497 entries: 1-75 | 53-127 | 128-202 | 203-277 | 278-352 | 353-427 | 428-497 ]
[ showing 75 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)