We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 49

[ total of 497 entries: 1-100 | 50-149 | 150-249 | 250-349 | 350-449 | 450-497 ]
[ showing 100 entries per page: fewer | more | all ]

Fri, 7 Jun 2024 (continued, showing last 35 of 84 entries)

[50]  arXiv:2406.03790 [pdf, other]
Title: End-to-End Trainable Soft Retriever for Low-resource Relation Extraction
Comments: preprint
Subjects: Computation and Language (cs.CL)
[51]  arXiv:2406.03776 [pdf, other]
Title: XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags
Comments: ACL 2024 camera ready
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[52]  arXiv:2406.03772 [pdf, other]
Title: Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word Structure
Authors: Yang Hou, Zhenghua Li
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[53]  arXiv:2406.03749 [pdf, other]
Title: NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Subjects: Computation and Language (cs.CL)
[54]  arXiv:2406.03746 [pdf, other]
Title: Efficient Knowledge Infusion via KG-LLM Alignment
Comments: ACL2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[55]  arXiv:2406.03725 [pdf, other]
Title: LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification
Comments: ACL 2024 main conference
Subjects: Computation and Language (cs.CL)
[56]  arXiv:2406.03712 [pdf, other]
Title: A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[57]  arXiv:2406.03703 [pdf, other]
Title: Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation
Comments: findings of ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[58]  arXiv:2406.03699 [pdf, other]
Title: M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering
Comments: Accepted at ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL)
[59]  arXiv:2406.03689 [pdf, other]
Title: Evaluating the World Model Implicit in a Generative Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[60]  arXiv:2406.03673 [pdf, other]
Title: Linguistically Conditioned Semantic Textual Similarity
Comments: To appear in the ACL 2024 main proceedings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[61]  arXiv:2406.03666 [pdf, other]
Title: What Makes Language Models Good-enough?
Comments: To appear in Findings of ACL2024
Subjects: Computation and Language (cs.CL)
[62]  arXiv:2406.03642 [pdf, other]
Title: Is Free Self-Alignment Possible?
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[63]  arXiv:2406.03618 [pdf, other]
Title: TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools
Comments: Website (this https URL), Huggingface (this https URL)
Subjects: Computation and Language (cs.CL)
[64]  arXiv:2406.03600 [pdf, other]
Title: Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning
Comments: Accepted by ACL Findings 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[65]  arXiv:2406.03592 [pdf, other]
Title: Measuring Retrieval Complexity in Question Answering Systems
Comments: Accepted to ACL 2024 (findings)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66]  arXiv:2406.03589 [pdf, other]
Title: Ranking Manipulation for Conversational Search Engines
Subjects: Computation and Language (cs.CL)
[67]  arXiv:2406.04344 (cross-list from cs.LG) [pdf, other]
Title: Verbalized Machine Learning: Revisiting Machine Learning with Language Models
Comments: Technical Report v1 (92 pages, 15 figures)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[68]  arXiv:2406.04313 (cross-list from cs.LG) [pdf, other]
Title: Improving Alignment and Robustness with Short Circuiting
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[69]  arXiv:2406.04298 (cross-list from cs.IR) [pdf, other]
Title: Measuring and Addressing Indexical Bias in Information Retrieval
Comments: ACL 2024
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[70]  arXiv:2406.04292 (cross-list from cs.IR) [pdf, other]
Title: VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Comments: Accepted to ACL 2024 main conference
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[71]  arXiv:2406.04274 (cross-list from cs.LG) [pdf, ps, other]
Title: Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[72]  arXiv:2406.04264 (cross-list from cs.CV) [pdf, other]
Title: MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[73]  arXiv:2406.04240 (cross-list from cs.LG) [pdf, other]
Title: Hypernetworks for Personalizing ASR to Atypical Speech
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[74]  arXiv:2406.04229 (cross-list from cs.LG) [pdf, other]
Title: The CLRS-Text Algorithmic Reasoning Language Benchmark
Comments: Preprint, under review. Comments welcome
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
[75]  arXiv:2406.04151 (cross-list from cs.AI) [pdf, other]
Title: AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Comments: Project site: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[76]  arXiv:2406.04116 (cross-list from cs.AI) [pdf, ps, other]
Title: Promoting Fairness and Diversity in Speech Datasets for Mental Health and Neurological Disorders Research
Comments: 34 pages
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[77]  arXiv:2406.03857 (cross-list from cs.LG) [pdf, other]
Title: MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[78]  arXiv:2406.03807 (cross-list from cs.AI) [pdf, other]
Title: Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering
Comments: 46pages first version
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[79]  arXiv:2406.03736 (cross-list from cs.LG) [pdf, other]
Title: Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[80]  arXiv:2406.03718 (cross-list from cs.CR) [pdf, other]
Title: Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning
Comments: Accepted to ACL 2024 Findings
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[81]  arXiv:2406.03707 (cross-list from cs.LG) [pdf, other]
Title: What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
Comments: 15 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[82]  arXiv:2406.03706 (cross-list from cs.SD) [pdf, other]
Title: Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model
Comments: Accepted by Interspeech 2024
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[83]  arXiv:2406.03637 (cross-list from eess.AS) [pdf, other]
Title: Style Mixture of Experts for Expressive Text-To-Speech Synthesis
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[84]  arXiv:2406.03614 (cross-list from cs.LG) [pdf, ps, other]
Title: Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs
Authors: Alexander Bakumenko (1), Kateřina Hlaváčková-Schindler (2), Claudia Plant (2), Nina C. Hubig (1) ((1) Clemson University, USA, (2) University of Vienna, Austria)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Risk Management (q-fin.RM)

Thu, 6 Jun 2024 (showing first 65 of 90 entries)

[85]  arXiv:2406.03496 [pdf, other]
Title: Wings: Learning Multimodal LLMs without Text-only Forgetting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[86]  arXiv:2406.03487 [pdf, other]
Title: Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[87]  arXiv:2406.03486 [pdf, other]
Title: BIPED: Pedagogically Informed Tutoring System for ESL Education
Comments: ACL 2024
Subjects: Computation and Language (cs.CL)
[88]  arXiv:2406.03479 [pdf, other]
Title: MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization
Subjects: Computation and Language (cs.CL)
[89]  arXiv:2406.03452 [pdf, other]
Title: Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types
Subjects: Computation and Language (cs.CL)
[90]  arXiv:2406.03450 [pdf, other]
Title: What is the Best Way for ChatGPT to Translate Poetry?
Comments: 19 pages, 1 figure. The paper has been accepted by ACL 2024(Main Conference)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[91]  arXiv:2406.03442 [pdf, ps, other]
Title: Are language models rational? The case of coherence norms and belief revision
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[92]  arXiv:2406.03441 [pdf, other]
Title: Cycles of Thought: Measuring LLM Confidence through Stable Explanations
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[93]  arXiv:2406.03397 [pdf, other]
Title: Automating Turkish Educational Quiz Generation Using Large Language Models
Comments: Accepted Paper for ISPR 2024
Subjects: Computation and Language (cs.CL)
[94]  arXiv:2406.03368 [pdf, other]
[95]  arXiv:2406.03363 [pdf, other]
Title: LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback
Subjects: Computation and Language (cs.CL)
[96]  arXiv:2406.03339 [pdf, other]
Title: The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[97]  arXiv:2406.03239 [pdf, other]
Title: Document-level Claim Extraction and Decontextualisation for Fact-Checking
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[98]  arXiv:2406.03235 [pdf, other]
Title: Error-preserving Automatic Speech Recognition of Young English Learners' Language
Comments: Accepted at ACL 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[99]  arXiv:2406.03221 [pdf, other]
Title: Linking Named Entities in Diderot's \textit{Encyclopédie} to Wikidata
Authors: Pierre Nugues
Comments: 6 pages, 3 figures
Journal-ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 10610--10615
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[100]  arXiv:2406.03202 [pdf, other]
Title: ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction
Comments: preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[101]  arXiv:2406.03199 [pdf, other]
Title: Bayesian WeakS-to-Strong from Text Classification to Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[102]  arXiv:2406.03198 [pdf, other]
Title: The Impossibility of Fair LLMs
Comments: Presented at the 1st Human-Centered Evaluation and Auditing of Language Models (HEAL) workshop at CHI 2024
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Applications (stat.AP); Machine Learning (stat.ML)
[103]  arXiv:2406.03181 [pdf, other]
Title: Missci: Reconstructing Fallacies in Misrepresented Science
Comments: ACL 2024 (main)
Subjects: Computation and Language (cs.CL)
[104]  arXiv:2406.03170 [pdf, other]
Title: StatBot.Swiss: Bilingual Open Data Exploration in Natural Language
Comments: This work is accepted at ACL Findings 2024
Subjects: Computation and Language (cs.CL)
[105]  arXiv:2406.03158 [pdf, other]
Title: CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs
Comments: The paper is accepted by The Conference on Uncertainty in Artificial Intelligence (UAI), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106]  arXiv:2406.03151 [pdf, other]
Title: Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
Comments: Published on ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[107]  arXiv:2406.03127 [pdf, other]
Title: Towards Real-world Scenario: Imbalanced New Intent Discovery
Comments: ACL 2024
Subjects: Computation and Language (cs.CL)
[108]  arXiv:2406.03125 [pdf, other]
Title: Space Decomposition for Sentence Embedding
Comments: ACL Finding 2024. The code and pre-trained models are available at this https URL
Subjects: Computation and Language (cs.CL)
[109]  arXiv:2406.03092 [pdf, other]
Title: FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models
Subjects: Computation and Language (cs.CL)
[110]  arXiv:2406.03079 [pdf, other]
Title: Cryptocurrency Frauds for Dummies: How ChatGPT introduces us to fraud?
Comments: To be published in ACM journal "Digital Government: Research and Practice"
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[111]  arXiv:2406.03075 [pdf, other]
Title: Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Comments: 18 pages, 3 figures
Subjects: Computation and Language (cs.CL)
[112]  arXiv:2406.03062 [pdf, other]
Title: RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2406.03049 [pdf, other]
Title: StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
Comments: Accepted to ACL 2024 main conference, Project Page: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[114]  arXiv:2406.03030 [pdf, other]
Title: From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation
Journal-ref: In Findings of the Association for Computational Linguistics (ACL 2024)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[115]  arXiv:2406.03009 [pdf, other]
Title: Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models
Comments: Accepted as a long findings paper at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[116]  arXiv:2406.03007 [pdf, other]
Title: BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[117]  arXiv:2406.03004 [pdf, other]
Title: Evaluation of data inconsistency for multi-modal sentiment analysis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2406.02974 [pdf, ps, other]
Title: Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese
Comments: Accepted to the 23rd China National Conference on Computational Linguistics (CCL 2024)
Subjects: Computation and Language (cs.CL)
[119]  arXiv:2406.02962 [pdf, other]
Title: Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[120]  arXiv:2406.02959 [pdf, other]
Title: Adversarial Moment-Matching Distillation of Large Language Models
Authors: Chen Jia
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[121]  arXiv:2406.02921 [pdf, other]
Title: Text Injection for Neural Contextual Biasing
Comments: 5 pages, 1 figure
Journal-ref: Interspeech 2024, Kos Island, Greece
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[122]  arXiv:2406.02919 [pdf, other]
Title: MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge
Comments: Accepted by IJCAI 2024
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2406.02911 [pdf, other]
Title: Improving In-Context Learning with Prediction Feedback for Sentiment Analysis
Comments: Accepted by ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL)
[124]  arXiv:2406.02903 [pdf, other]
Title: Open Grounded Planning: Challenges and Benchmark Construction
Comments: Accept to ACL 2024 main conference
Subjects: Computation and Language (cs.CL)
[125]  arXiv:2406.02902 [pdf, other]
Title: S$^2$GSL: Incorporating Segment to Syntactic Enhanced Graph Structure Learning for Aspect-based Sentiment Analysis
Subjects: Computation and Language (cs.CL)
[126]  arXiv:2406.02893 [pdf, other]
Title: Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task
Comments: 11 pages, 5 figures, 3 tables
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2406.02888 [pdf, other]
Title: HYDRA: Model Factorization Framework for Black-Box LLM Personalization
Comments: 24 pages, 6 figures, work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[128]  arXiv:2406.02886 [pdf, other]
Title: PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129]  arXiv:2406.02882 [pdf, other]
Title: Outdated Issue Aware Decoding for Factual Knowledge Editing
Comments: ACL2024 Findings, Codes are at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130]  arXiv:2406.02876 [pdf, other]
Title: LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
Comments: ACL2024 Findings, Codes are at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131]  arXiv:2406.02864 [pdf, other]
Title: NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132]  arXiv:2406.02863 [pdf, ps, other]
Title: LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation
Comments: Presented in AAAI 2024 Spring Symposium. The first two authors contributed equally
Subjects: Computation and Language (cs.CL)
[133]  arXiv:2406.02856 [pdf, other]
Title: Xmodel-LM Technical Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134]  arXiv:2406.02832 [pdf, other]
Title: Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[135]  arXiv:2406.02830 [pdf, other]
Title: Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies
Comments: Accepted to ACL 2024 findings
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2406.02826 [pdf, other]
Title: Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes
Comments: Clinical NLP Workshop 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[137]  arXiv:2406.02818 [pdf, other]
Title: Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
Comments: 19 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2406.02787 [pdf, other]
Title: Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
Comments: 22 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[139]  arXiv:2406.02756 [pdf, other]
Title: Aligning Large Language Models via Fine-grained Supervision
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[140]  arXiv:2406.02746 [pdf, other]
Title: RATT: AThought Structure for Coherent and Correct LLMReasoning
Subjects: Computation and Language (cs.CL)
[141]  arXiv:2406.02733 [pdf, other]
Title: Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Comments: Accepted to ACL 2024 (findings)
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[142]  arXiv:2406.02721 [pdf, other]
Title: Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Comments: 41 pages, 12 figures, 61 tables; Website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[143]  arXiv:2406.02657 [pdf, other]
Title: Block Transformer: Global-to-Local Language Modeling for Fast Inference
Comments: 30 pages, 21 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[144]  arXiv:2406.02577 [pdf, other]
Title: Are PPO-ed Language Models Hackable?
Comments: 8 pages, 4 figures
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[145]  arXiv:2406.02575 [pdf, other]
Title: Cross-Modal Safety Alignment: Is textual unlearning all you need?
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[146]  arXiv:2406.03482 (cross-list from cs.LG) [pdf, other]
Title: QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Performance (cs.PF)
[147]  arXiv:2406.03476 (cross-list from cs.LG) [pdf, other]
Title: Does your data spark joy? Performance gains from domain upsampling at the end of training
Comments: The first three authors contributed equally
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[148]  arXiv:2406.03445 (cross-list from cs.LG) [pdf, other]
Title: Pre-trained Large Language Models Use Fourier Features to Compute Addition
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[149]  arXiv:2406.03299 (cross-list from cs.AI) [pdf, other]
Title: The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[ total of 497 entries: 1-100 | 50-149 | 150-249 | 250-349 | 350-449 | 450-497 ]
[ showing 100 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)