We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions

[ total of 489 entries: 1-250 | 251-489 ]
[ showing 250 entries per page: fewer | more | all ]

Thu, 6 Jun 2024

[1]  arXiv:2406.03496 [pdf, other]
Title: Wings: Learning Multimodal LLMs without Text-only Forgetting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2]  arXiv:2406.03487 [pdf, other]
Title: Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3]  arXiv:2406.03486 [pdf, other]
Title: BIPED: Pedagogically Informed Tutoring System for ESL Education
Comments: ACL 2024
Subjects: Computation and Language (cs.CL)
[4]  arXiv:2406.03479 [pdf, other]
Title: MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization
Subjects: Computation and Language (cs.CL)
[5]  arXiv:2406.03452 [pdf, other]
Title: Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types
Subjects: Computation and Language (cs.CL)
[6]  arXiv:2406.03450 [pdf, other]
Title: What is the Best Way for ChatGPT to Translate Poetry?
Comments: 19 pages, 1 figure. The paper has been accepted by ACL 2024(Main Conference)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[7]  arXiv:2406.03442 [pdf, ps, other]
Title: Are language models rational? The case of coherence norms and belief revision
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[8]  arXiv:2406.03441 [pdf, other]
Title: Cycles of Thought: Measuring LLM Confidence through Stable Explanations
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[9]  arXiv:2406.03397 [pdf, other]
Title: Automating Turkish Educational Quiz Generation Using Large Language Models
Comments: Accepted Paper for ISPR 2024
Subjects: Computation and Language (cs.CL)
[10]  arXiv:2406.03368 [pdf, other]
[11]  arXiv:2406.03363 [pdf, other]
Title: LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2406.03339 [pdf, other]
Title: The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[13]  arXiv:2406.03239 [pdf, other]
Title: Document-level Claim Extraction and Decontextualisation for Fact-Checking
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[14]  arXiv:2406.03235 [pdf, other]
Title: Error-preserving Automatic Speech Recognition of Young English Learners' Language
Comments: Accepted at ACL 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[15]  arXiv:2406.03221 [pdf, other]
Title: Linking Named Entities in Diderot's \textit{Encyclopédie} to Wikidata
Authors: Pierre Nugues
Comments: 6 pages, 3 figures
Journal-ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 10610--10615
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[16]  arXiv:2406.03202 [pdf, other]
Title: ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction
Comments: preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[17]  arXiv:2406.03199 [pdf, other]
Title: Bayesian WeakS-to-Strong from Text Classification to Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[18]  arXiv:2406.03198 [pdf, other]
Title: The Impossibility of Fair LLMs
Comments: Presented at the 1st Human-Centered Evaluation and Auditing of Language Models (HEAL) workshop at CHI 2024
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Applications (stat.AP); Machine Learning (stat.ML)
[19]  arXiv:2406.03181 [pdf, other]
Title: Missci: Reconstructing Fallacies in Misrepresented Science
Comments: ACL 2024 (main)
Subjects: Computation and Language (cs.CL)
[20]  arXiv:2406.03170 [pdf, other]
Title: StatBot.Swiss: Bilingual Open Data Exploration in Natural Language
Comments: This work is accepted at ACL Findings 2024
Subjects: Computation and Language (cs.CL)
[21]  arXiv:2406.03158 [pdf, other]
Title: CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs
Comments: The paper is accepted by The Conference on Uncertainty in Artificial Intelligence (UAI), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22]  arXiv:2406.03151 [pdf, other]
Title: Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
Comments: Published on ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[23]  arXiv:2406.03127 [pdf, other]
Title: Towards Real-world Scenario: Imbalanced New Intent Discovery
Comments: ACL 2024
Subjects: Computation and Language (cs.CL)
[24]  arXiv:2406.03125 [pdf, other]
Title: Space Decomposition for Sentence Embedding
Comments: ACL Finding 2024. The code and pre-trained models are available at this https URL
Subjects: Computation and Language (cs.CL)
[25]  arXiv:2406.03092 [pdf, other]
Title: FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models
Subjects: Computation and Language (cs.CL)
[26]  arXiv:2406.03079 [pdf, other]
Title: Cryptocurrency Frauds for Dummies: How ChatGPT introduces us to fraud?
Comments: To be published in ACM journal "Digital Government: Research and Practice"
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[27]  arXiv:2406.03075 [pdf, other]
Title: Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Comments: 18 pages, 3 figures
Subjects: Computation and Language (cs.CL)
[28]  arXiv:2406.03062 [pdf, other]
Title: RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Subjects: Computation and Language (cs.CL)
[29]  arXiv:2406.03049 [pdf, other]
Title: StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
Comments: Accepted to ACL 2024 main conference, Project Page: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[30]  arXiv:2406.03030 [pdf, other]
Title: From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation
Journal-ref: In Findings of the Association for Computational Linguistics (ACL 2024)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[31]  arXiv:2406.03009 [pdf, other]
Title: Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models
Comments: Accepted as a long findings paper at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[32]  arXiv:2406.03007 [pdf, other]
Title: BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[33]  arXiv:2406.03004 [pdf, other]
Title: Evaluation of data inconsistency for multi-modal sentiment analysis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[34]  arXiv:2406.02974 [pdf, ps, other]
Title: Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese
Comments: Accepted to the 23rd China National Conference on Computational Linguistics (CCL 2024)
Subjects: Computation and Language (cs.CL)
[35]  arXiv:2406.02962 [pdf, other]
Title: Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[36]  arXiv:2406.02959 [pdf, other]
Title: Adversarial Moment-Matching Distillation of Large Language Models
Authors: Chen Jia
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[37]  arXiv:2406.02921 [pdf, other]
Title: Text Injection for Neural Contextual Biasing
Comments: 5 pages, 1 figure
Journal-ref: Interspeech 2024, Kos Island, Greece
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[38]  arXiv:2406.02919 [pdf, other]
Title: MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge
Comments: Accepted by IJCAI 2024
Subjects: Computation and Language (cs.CL)
[39]  arXiv:2406.02911 [pdf, other]
Title: Improving In-Context Learning with Prediction Feedback for Sentiment Analysis
Comments: Accepted by ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL)
[40]  arXiv:2406.02903 [pdf, other]
Title: Open Grounded Planning: Challenges and Benchmark Construction
Comments: Accept to ACL 2024 main conference
Subjects: Computation and Language (cs.CL)
[41]  arXiv:2406.02902 [pdf, other]
Title: S$^2$GSL: Incorporating Segment to Syntactic Enhanced Graph Structure Learning for Aspect-based Sentiment Analysis
Subjects: Computation and Language (cs.CL)
[42]  arXiv:2406.02893 [pdf, other]
Title: Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task
Comments: 11 pages, 5 figures, 3 tables
Subjects: Computation and Language (cs.CL)
[43]  arXiv:2406.02888 [pdf, other]
Title: HYDRA: Model Factorization Framework for Black-Box LLM Personalization
Comments: 24 pages, 6 figures, work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[44]  arXiv:2406.02886 [pdf, other]
Title: PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[45]  arXiv:2406.02882 [pdf, other]
Title: Outdated Issue Aware Decoding for Factual Knowledge Editing
Comments: ACL2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[46]  arXiv:2406.02876 [pdf, other]
Title: LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
Comments: ACL2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[47]  arXiv:2406.02864 [pdf, other]
Title: NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[48]  arXiv:2406.02863 [pdf, ps, other]
Title: LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation
Comments: Presented in AAAI 2024 Spring Symposium. The first two authors contributed equally
Subjects: Computation and Language (cs.CL)
[49]  arXiv:2406.02856 [pdf, other]
Title: Xmodel-LM Technical Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[50]  arXiv:2406.02832 [pdf, other]
Title: Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[51]  arXiv:2406.02830 [pdf, other]
Title: Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies
Comments: Accepted to ACL 2024 findings
Subjects: Computation and Language (cs.CL)
[52]  arXiv:2406.02826 [pdf, other]
Title: Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes
Comments: Clinical NLP Workshop 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[53]  arXiv:2406.02818 [pdf, other]
Title: Chain of Agents: Large Language Models Collaborating on Long-Context Tasks
Comments: 19 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[54]  arXiv:2406.02787 [pdf, other]
Title: Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
Comments: 22 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[55]  arXiv:2406.02756 [pdf, other]
Title: Aligning Large Language Models via Fine-grained Supervision
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[56]  arXiv:2406.02746 [pdf, other]
Title: RATT: AThought Structure for Coherent and Correct LLMReasoning
Subjects: Computation and Language (cs.CL)
[57]  arXiv:2406.02733 [pdf, other]
Title: Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Comments: Accepted to ACL 2024 (findings)
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[58]  arXiv:2406.02721 [pdf, other]
Title: Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
Comments: 41 pages, 12 figures, 61 tables; Website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[59]  arXiv:2406.02657 [pdf, other]
Title: Block Transformer: Global-to-Local Language Modeling for Fast Inference
Comments: 30 pages, 21 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[60]  arXiv:2406.02577 [pdf, other]
Title: Are PPO-ed Language Models Hackable?
Comments: 8 pages, 4 figures
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[61]  arXiv:2406.02575 [pdf, other]
Title: Cross-Modal Safety Alignment: Is textual unlearning all you need?
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[62]  arXiv:2406.03482 (cross-list from cs.LG) [pdf, other]
Title: QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Performance (cs.PF)
[63]  arXiv:2406.03476 (cross-list from cs.LG) [pdf, other]
Title: Does your data spark joy? Performance gains from domain upsampling at the end of training
Comments: The first three authors contributed equally
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[64]  arXiv:2406.03445 (cross-list from cs.LG) [pdf, other]
Title: Pre-trained Large Language Models Use Fourier Features to Compute Addition
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[65]  arXiv:2406.03299 (cross-list from cs.AI) [pdf, other]
Title: The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[66]  arXiv:2406.03287 (cross-list from cs.NE) [pdf, other]
Title: SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
Subjects: Neural and Evolutionary Computing (cs.NE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[67]  arXiv:2406.03280 (cross-list from cs.LG) [pdf, other]
Title: FusionBench: A Comprehensive Benchmark of Deep Model Fusion
Comments: Project homepage: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[68]  arXiv:2406.03248 (cross-list from cs.IR) [pdf, other]
Title: Large Language Models as Evaluators for Recommendation Explanations
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[69]  arXiv:2406.03068 (cross-list from cs.LG) [pdf, other]
Title: How Truncating Weights Improves Reasoning in Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[70]  arXiv:2406.03008 (cross-list from cs.CV) [pdf, other]
Title: DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences
Comments: First Vision and Language for Autonomous Driving and Robotics Workshop (VLADR @ CVPR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[71]  arXiv:2406.02969 (cross-list from cs.LG) [pdf, other]
Title: Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models
Comments: 29 pages, 5 Appendix sections
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computational Finance (q-fin.CP); Mathematical Finance (q-fin.MF)
[72]  arXiv:2406.02958 (cross-list from cs.LG) [pdf, other]
Title: PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs
Comments: ICML 2024 (Oral)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[73]  arXiv:2406.02950 (cross-list from eess.AS) [pdf, other]
Title: 4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders
Comments: submitted to IEEE/ACM Transactions on Audio Speech and Language Processing
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[74]  arXiv:2406.02943 (cross-list from cs.IR) [pdf, ps, other]
Title: The Task-oriented Queries Benchmark (ToQB)
Authors: Keun Soo Yim
Comments: Data available on GitHub, this https URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Neural and Evolutionary Computing (cs.NE)
[75]  arXiv:2406.02925 (cross-list from eess.AS) [pdf, other]
Title: SYN2REAL: Leveraging Task Arithmetic for Mitigating Synthetic-Real Discrepancies in ASR Domain Adaptation
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[76]  arXiv:2406.02924 (cross-list from cs.LG) [pdf, other]
Title: Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models
Comments: Accepted by ICML2024, 29 pages, 4 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[77]  arXiv:2406.02900 (cross-list from cs.LG) [pdf, other]
Title: Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[78]  arXiv:2406.02844 (cross-list from cs.IR) [pdf, other]
Title: Item-Language Model for Conversational Recommendation
Comments: 15 pages, 3 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[79]  arXiv:2406.02804 (cross-list from cs.AI) [pdf, other]
Title: $\texttt{ACCORD}$: Closing the Commonsense Measurability Gap
Comments: For leaderboard and dataset download, see this https URL For source code, see this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[80]  arXiv:2406.02798 (cross-list from cs.DL) [pdf, ps, other]
Title: Promotional Language and the Adoption of Innovative Ideas in Science
Subjects: Digital Libraries (cs.DL); Computation and Language (cs.CL); Computers and Society (cs.CY)
[81]  arXiv:2406.02795 (cross-list from cs.HC) [pdf, other]
Title: ArguMentor: Augmenting User Experiences with Counter-Perspectives
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[82]  arXiv:2406.02791 (cross-list from cs.AI) [pdf, other]
Title: Language Models can Infer Action Semantics for Classical Planners from Environment Feedback
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[83]  arXiv:2406.02592 (cross-list from cs.LG) [pdf, other]
Title: LOLAMEME: Logic, Language, Memory, Mechanistic Framework
Comments: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[84]  arXiv:2406.02566 (cross-list from eess.AS) [pdf, other]
Title: Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[85]  arXiv:2406.02565 (cross-list from cs.SD) [pdf, other]
Title: Sequence-to-sequence models in peer-to-peer learning: A practical application
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Multiagent Systems (cs.MA); Audio and Speech Processing (eess.AS)
[86]  arXiv:2406.02563 (cross-list from eess.AS) [pdf, other]
Title: A cost minimization approach to fix the vocabulary size in a tokenizer for an End-to-End ASR system
Comments: 5 pages, 4 figures
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[87]  arXiv:2406.02562 (cross-list from eess.AS) [pdf, other]
Title: Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices
Comments: Table 2 is revised
Journal-ref: ICASSP 2024 Workshop(HSCMA 2024) paper
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[88]  arXiv:2406.02560 (cross-list from eess.AS) [pdf, other]
Title: Less Peaky and More Accurate CTC Forced Alignment by Label Priors
Comments: Accepted by ICASSP 2024. Github repo: this https URL
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[89]  arXiv:2406.02555 (cross-list from eess.AS) [pdf, ps, other]
Title: PhoWhisper: Automatic Speech Recognition for Vietnamese
Comments: Accepted to ICLR 2024 Tiny Papers Track
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[90]  arXiv:2406.02554 (cross-list from eess.AS) [pdf, other]
Title: Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)

Wed, 5 Jun 2024

[91]  arXiv:2406.02537 [pdf, other]
Title: TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
Comments: 9 pages, 3 figures, 3 tables (21 pages, 4 figures, 15 tables including references and appendices)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[92]  arXiv:2406.02536 [pdf, other]
Title: Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[93]  arXiv:2406.02532 [pdf, other]
Title: SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Comments: preprint. arXiv admin note: text overlap with arXiv:2312.17238 by other authors
Subjects: Computation and Language (cs.CL)
[94]  arXiv:2406.02528 [pdf, other]
Title: Scalable MatMul-free Language Modeling
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2406.02524 [pdf, other]
Title: CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
Subjects: Computation and Language (cs.CL)
[96]  arXiv:2406.02517 [pdf, other]
Title: Deterministic Reversible Data Augmentation for Neural Machine Translation
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[97]  arXiv:2406.02481 [pdf, other]
Title: Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion
Comments: Work in progress. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[98]  arXiv:2406.02472 [pdf, other]
Title: Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2406.02449 [pdf, other]
Title: Representations as Language: An Information-Theoretic Framework for Interpretability
Comments: 6 pages, 3 Figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[100]  arXiv:2406.02396 [pdf, other]
Title: The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[101]  arXiv:2406.02394 [pdf, other]
Title: Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[102]  arXiv:2406.02378 [pdf, other]
Title: On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept
Comments: 22 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[103]  arXiv:2406.02376 [pdf, other]
Title: Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[104]  arXiv:2406.02350 [pdf, other]
Title: LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing
Authors: Maojun Sun
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[105]  arXiv:2406.02338 [pdf, other]
Title: Linguistic Fingerprint in Transformer Models: How Language Variation Influences Parameter Selection in Irony Detection
Journal-ref: Proceedings of the 3rd Workshop on Perspectivist Approaches to NLP (NLPerspectives) @ LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106]  arXiv:2406.02335 [pdf, other]
Title: Probing the Category of Verbal Aspect in Transformer Language Models
Subjects: Computation and Language (cs.CL)
[107]  arXiv:2406.02331 [pdf, other]
Title: Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Comments: ACL 2024 Findings Accepted
Subjects: Computation and Language (cs.CL)
[108]  arXiv:2406.02329 [pdf, other]
Title: On Affine Homotopy between Language Encoders
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[109]  arXiv:2406.02325 [pdf, other]
Title: Technical Language Processing for Telecommunications Specifications
Comments: Still not published
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[110]  arXiv:2406.02301 [pdf, other]
Title: mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Comments: Accepted to ACL 2024 main
Subjects: Computation and Language (cs.CL)
[111]  arXiv:2406.02267 [pdf, ps, other]
Title: Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation
Comments: To appear at The 25th Annual Conference of the European Association for Machine Translation (EAMT 2024)
Subjects: Computation and Language (cs.CL)
[112]  arXiv:2406.02266 [pdf, ps, other]
Title: Enhancing Retrieval-Augmented LMs with a Two-stage Consistency Learning Compressor
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2406.02251 [pdf, other]
Title: Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
Comments: Accepted to ACL 2024 Findings. arXiv admin note: text overlap with arXiv:2212.11382
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114]  arXiv:2406.02245 [pdf, other]
Title: Description Boosting for Zero-Shot Entity and Relation Classification
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[115]  arXiv:2406.02237 [pdf, other]
Title: Self-Modifying State Modeling for Simultaneous Machine Translation
Comments: Accept to ACL 2024 main conference. 15 pages, 13 figures, 9 tables
Subjects: Computation and Language (cs.CL)
[116]  arXiv:2406.02224 [pdf, other]
Title: FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117]  arXiv:2406.02169 [pdf, ps, other]
Title: A multilingual dataset for offensive language and hate speech detection for hausa, yoruba and igbo languages
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[118]  arXiv:2406.02148 [pdf, other]
Title: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models
Comments: Accepted to ACL-24 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2406.02143 [pdf, other]
Title: Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models
Comments: ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL)
[120]  arXiv:2406.02134 [pdf, other]
Title: The current status of large language models in summarizing radiology report impressions
Subjects: Computation and Language (cs.CL)
[121]  arXiv:2406.02120 [pdf, other]
Title: Diver: Large Language Model Decoding with Span-Level Mutual Information Verification
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2406.02110 [pdf, other]
Title: UniOQA: A Unified Framework for Knowledge Graph Question Answering with Large Language Models
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[123]  arXiv:2406.02106 [pdf, other]
Title: MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset
Subjects: Computation and Language (cs.CL)
[124]  arXiv:2406.02100 [pdf, other]
Title: Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
Comments: Accept by Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[125]  arXiv:2406.02080 [pdf, other]
Title: LongSSM: On the Length Extension of State-space Models in Language Modelling
Authors: Shida Wang
Comments: 23 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Dynamical Systems (math.DS)
[126]  arXiv:2406.02079 [pdf, ps, other]
Title: Assessing the Performance of Chinese Open Source Large Language Models in Information Extraction Tasks
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2406.02069 [pdf, other]
Title: PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[128]  arXiv:2406.02060 [pdf, ps, other]
Title: I've got the "Answer"! Interpretation of LLMs Hidden States in Question Answering
Comments: Accepted for NLDB-2024 conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129]  arXiv:2406.02050 [pdf, other]
Title: Analyzing Social Biases in Japanese Large Language Models
Subjects: Computation and Language (cs.CL)
[130]  arXiv:2406.02044 [pdf, ps, other]
Title: QROA: A Black-Box Query-Response Optimization Attack on LLMs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[131]  arXiv:2406.02030 [pdf, other]
Title: Multimodal Reasoning with Multimodal Knowledge Graph
Comments: Accepted by ACL 2024 (Main Conference)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[132]  arXiv:2406.02018 [pdf, other]
Title: Why Would You Suggest That? Human Trust in Language Model Responses
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[133]  arXiv:2406.02002 [pdf, other]
Title: Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue
Comments: Accepted to IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134]  arXiv:2406.01988 [pdf, other]
Title: Personalized Topic Selection Model for Topic-Grounded Dialogue
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[135]  arXiv:2406.01983 [pdf, other]
Title: RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models
Comments: Work is in progress
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2406.01981 [pdf, other]
Title: Zyda: A 1.3T Dataset for Open Language Modeling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[137]  arXiv:2406.01976 [pdf, other]
Title: Conditional Language Learning with Context
Authors: Xiao Zhang, Miao Li, Ji Wu
Comments: To appear at the 41st International Conference on Machine Learning (ICML 2024)
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2406.01943 [pdf, ps, other]
Title: Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs
Authors: Nik Bear Brown
Comments: An extensive survey of the literature specifying algorithms and techniques enhancing the trustworthiness and understanding of Large Language Models (LLMs)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139]  arXiv:2406.01940 [pdf, other]
Title: Process-Driven Autoformalization in Lean 4
Comments: 22 pages, 1 figures, 11 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[140]  arXiv:2406.01934 [pdf, other]
Title: Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[141]  arXiv:2406.01931 [pdf, other]
Title: Dishonesty in Helpful and Harmless Alignment
Subjects: Computation and Language (cs.CL)
[142]  arXiv:2406.01919 [pdf, other]
Title: OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection
Comments: Accepted by ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[143]  arXiv:2406.01879 [pdf, other]
Title: Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Comments: 12 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[144]  arXiv:2406.01873 [pdf, other]
Title: CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models
Comments: Accepted by ACL Findings 2024
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[145]  arXiv:2406.01866 [pdf, other]
Title: #EpiTwitter: Public Health Messaging During the COVID-19 Pandemic
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[146]  arXiv:2406.01863 [pdf, other]
Title: Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Subjects: Computation and Language (cs.CL)
[147]  arXiv:2406.01860 [pdf, other]
Title: Eliciting the Priors of Large Language Models using Iterated In-Context Learning
Subjects: Computation and Language (cs.CL)
[148]  arXiv:2406.01855 [pdf, other]
Title: TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[149]  arXiv:2406.01835 [pdf, other]
Title: An Open Multilingual System for Scoring Readability of Wikipedia
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[150]  arXiv:2406.01806 [pdf, other]
Title: Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[151]  arXiv:2406.01775 [pdf, other]
Title: OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[152]  arXiv:2406.01771 [pdf, other]
Title: LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Comments: Accepted to Findings of ACL 2024. The code, datasets, and models are publicly available at this https URL
Subjects: Computation and Language (cs.CL)
[153]  arXiv:2406.01749 [pdf, ps, other]
Title: Towards Harnessing Large Language Models for Comprehension of Conversational Grounding
Comments: Accepted to IWSDS 2024
Subjects: Computation and Language (cs.CL)
[154]  arXiv:2406.01721 [pdf, other]
Title: Rotation and Permutation for Advanced Outlier Management and Efficient Quantization of LLMs
Comments: 26 pages, 13 figures
Subjects: Computation and Language (cs.CL)
[155]  arXiv:2406.02543 (cross-list from cs.LG) [pdf, other]
Title: To Believe or Not to Believe Your LLM
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[156]  arXiv:2406.02539 (cross-list from cs.CV) [pdf, other]
Title: Parrot: Multilingual Visual Instruction Tuning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[157]  arXiv:2406.02488 (cross-list from eess.AS) [pdf, other]
Title: Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[158]  arXiv:2406.02469 (cross-list from cs.LG) [pdf, other]
Title: Landscape-Aware Growing: The Power of a Little LAG
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[159]  arXiv:2406.02377 (cross-list from cs.IR) [pdf, other]
Title: XRec: Large Language Models for Explainable Recommendation
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[160]  arXiv:2406.02368 (cross-list from cs.IR) [pdf, other]
Title: Large Language Models Make Sample-Efficient Recommender Systems
Comments: Accepted by Frontier of Computer Science
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[161]  arXiv:2406.02356 (cross-list from cs.LG) [pdf, other]
Title: Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks
Comments: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[162]  arXiv:2406.02332 (cross-list from cs.LG) [pdf, other]
Title: Extended Mind Transformers
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[163]  arXiv:2406.02265 (cross-list from cs.CV) [pdf, other]
Title: Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning
Comments: 9 pages, long paper at ACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[164]  arXiv:2406.02208 (cross-list from cs.CV) [pdf, other]
Title: Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Comments: IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[165]  arXiv:2406.02166 (cross-list from cs.SD) [pdf, other]
Title: Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[166]  arXiv:2406.02135 (cross-list from cs.IR) [pdf, other]
Title: Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval
Comments: Accepted by ECML-PKDD'24 as Outstanding Paper. 8 pages, 2 figures, 7 tables
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[167]  arXiv:2406.02133 (cross-list from eess.AS) [pdf, other]
Title: SimulTron: On-Device Simultaneous Speech to Speech Translation
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[168]  arXiv:2406.02128 (cross-list from cs.LG) [pdf, other]
Title: Iteration Head: A Mechanistic Study of Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[169]  arXiv:2406.02061 (cross-list from cs.LG) [pdf, other]
Title: Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
Comments: v1
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[170]  arXiv:2406.02009 (cross-list from eess.AS) [pdf, other]
Title: Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
Comments: Accepted by Interspeech 2024
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[171]  arXiv:2406.02004 (cross-list from cs.CR) [pdf, ps, other]
Title: Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[172]  arXiv:2406.01946 (cross-list from cs.CR) [pdf, other]
Title: Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[173]  arXiv:2406.01914 (cross-list from cs.CV) [pdf, other]
Title: HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[174]  arXiv:2406.01895 (cross-list from cs.LG) [pdf, other]
Title: Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Comments: 32 pages, 16 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
[175]  arXiv:2406.01876 (cross-list from cs.DB) [pdf, other]
Title: GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
Comments: KDD 2024 Camera Ready; 11 pages, 8 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[176]  arXiv:2406.01789 (cross-list from cs.LG) [pdf, ps, other]
Title: AI-based Classification of Customer Support Tickets: State of the Art and Implementation with AutoML
Journal-ref: Proceedings of the IWEMB 2021/2022: Fifth and Sixth International Workshop on Entrepreneurship, Electronic and Mobile Business
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[177]  arXiv:2406.01638 (cross-list from cs.LG) [pdf, other]
Title: TimeCMA: Towards LLM-Empowered Time Series Forecasting via Cross-Modality Alignment
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[178]  arXiv:2406.01633 (cross-list from cs.IR) [pdf, other]
Title: On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots
Comments: Preprint of UAI'24 conference publication
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[179]  arXiv:2406.01624 (cross-list from eess.AS) [pdf, other]
Title: Unveiling Hidden Factors: Explainable AI for Feature Boosting in Speech Emotion Recognition
Comments: Published in: Springer Nature International Journal of Applied Intelligence (2024)
Journal-ref: Applied Intelligence (2024)
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[180]  arXiv:2406.01609 (cross-list from cs.IR) [pdf, other]
Title: Judgement Citation Retrieval using Contextual Similarity
Comments: 14 pages, 16 images, Submitted to Multimedia Tools and Applications Springer journal
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181]  arXiv:2406.01608 (cross-list from cs.IR) [pdf, other]
Title: Detecting Deceptive Dark Patterns in E-commerce Platforms
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[182]  arXiv:2406.01607 (cross-list from cs.IR) [pdf, other]
Title: Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Authors: Hongliu Cao
Comments: 45 pages
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[183]  arXiv:2406.01606 (cross-list from cs.IR) [pdf, other]
Title: SymTax: Symbiotic Relationship and Taxonomy Fusion for Effective Citation Recommendation
Comments: Accepted in ACL 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 4 Jun 2024 (showing first 67 of 153 entries)

[184]  arXiv:2406.01574 [pdf, other]
Title: MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Subjects: Computation and Language (cs.CL)
[185]  arXiv:2406.01563 [pdf, other]
Title: LoFiT: Localized Fine-tuning on LLM Representations
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2406.01549 [pdf, other]
Title: An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Comments: ACL24 Main
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187]  arXiv:2406.01538 [pdf, other]
Title: What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores
Comments: 10 pages, 4 figures in the main paper
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[188]  arXiv:2406.01514 [pdf, other]
Title: Decoupled Alignment for Robust Plug-and-Play Adaptation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[189]  arXiv:2406.01512 [pdf, other]
Title: MAD: Multi-Alignment MEG-to-Text Decoding
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2406.01506 [pdf, other]
Title: The Geometry of Categorical and Hierarchical Concepts in Large Language Models
Comments: Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[191]  arXiv:2406.01495 [pdf, other]
Title: Reflection-Reinforced Self-Training for Language Agents
Subjects: Computation and Language (cs.CL)
[192]  arXiv:2406.01468 [pdf, other]
Title: Understanding Token Probability Encoding in Output Embeddings
Comments: 15 pages, 17 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[193]  arXiv:2406.01446 [pdf, ps, other]
Title: Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach
Authors: Ara Yeroyan (Data Science Department, American University of Armenia), Nikolay Karpov (Nvidia, NeMo Conversational AI team)
Comments: 13 pages, 10 figures (including ablation studies), to be published in 2024 IEEE Spoken Language Technology Workshop. Additionally, the associated software package can be accessed at (this https URL) for practical applications and further development
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[194]  arXiv:2406.01441 [pdf, other]
Title: LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation
Subjects: Computation and Language (cs.CL)
[195]  arXiv:2406.01436 [pdf, other]
Title: Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
Subjects: Computation and Language (cs.CL)
[196]  arXiv:2406.01428 [pdf, ps, other]
Title: Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[197]  arXiv:2406.01392 [pdf, other]
Title: Sparsity-Accelerated Training for Large Language Models
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[198]  arXiv:2406.01382 [pdf, other]
Title: Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function
Comments: To appear in ICML 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[199]  arXiv:2406.01375 [pdf, other]
Title: D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
Subjects: Computation and Language (cs.CL)
[200]  arXiv:2406.01372 [pdf, ps, other]
Title: Linguistic Analysis, Description, and Typological Exploration with Categorial Grammar (TheBench Guide)
Authors: Cem Bozsahin
Subjects: Computation and Language (cs.CL)
[201]  arXiv:2406.01363 [pdf, other]
Title: Privacy in LLM-based Recommendation: Recent Advances and Future Directions
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[202]  arXiv:2406.01359 [pdf, other]
Title: R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[203]  arXiv:2406.01333 [pdf, other]
Title: Probing Language Models for Pre-training Data Detection
Comments: Accepted by ACL-2024 main conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204]  arXiv:2406.01311 [pdf, other]
Title: FactGenius: Combining Zero-Shot Prompting and Fuzzy Relation Mining to Improve Fact Verification with Knowledge Graphs
Authors: Sushant Gautam
Comments: accepted and presented at the 6th IN5550 Workshop on Neural Natural Language Processing (WNNLP 2024) at the University of Oslo, Norway
Subjects: Computation and Language (cs.CL)
[205]  arXiv:2406.01306 [pdf, other]
Title: Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding
Comments: Accepted as a long paper in ACL 2024 findings
Subjects: Computation and Language (cs.CL)
[206]  arXiv:2406.01304 [pdf, other]
Title: CodeR: Issue Resolving with Multi-Agent and Task Graphs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[207]  arXiv:2406.01297 [pdf, other]
Title: When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Subjects: Computation and Language (cs.CL)
[208]  arXiv:2406.01288 [pdf, other]
Title: Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[209]  arXiv:2406.01283 [pdf, other]
Title: Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
Comments: Accepted to EMNLP 2023 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[210]  arXiv:2406.01276 [pdf, other]
Title: EduNLP: Towards a Unified and Modularized Library for Educational Resources
Subjects: Computation and Language (cs.CL)
[211]  arXiv:2406.01252 [pdf, other]
Title: Towards Scalable Automated Alignment of LLMs: A Survey
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[212]  arXiv:2406.01238 [pdf, other]
Title: EffiQA: Efficient Question-Answering with Strategic Multi-Model Collaboration on Knowledge Graphs
Comments: 10 pages, 4 figures, 3 tables
Subjects: Computation and Language (cs.CL)
[213]  arXiv:2406.01224 [pdf, other]
Title: Demonstration Augmentation for Zero-shot In-context Learning
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[214]  arXiv:2406.01213 [pdf, other]
Title: Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition
Comments: Accepted by IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[215]  arXiv:2406.01198 [pdf, ps, other]
Title: Automatic Essay Multi-dimensional Scoring with Fine-tuning and Multiple Regression
Authors: Kun Sun, Rong Wang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[216]  arXiv:2406.01179 [pdf, other]
Title: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
Comments: Accepted to ACL 2024 main conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[217]  arXiv:2406.01171 [pdf, other]
Title: Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
Subjects: Computation and Language (cs.CL)
[218]  arXiv:2406.01145 [pdf, other]
Title: Explore then Determine: A GNN-LLM Synergy Framework for Reasoning over Knowledge Graph
Subjects: Computation and Language (cs.CL)
[219]  arXiv:2406.01126 [pdf, other]
Title: TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese Medicine
Comments: 20 pages, 15 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[220]  arXiv:2406.01096 [pdf, ps, other]
Title: Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Journal-ref: International Journal of Innovative Science and Research Technology: Vol. 9 (2024): No. 5, 1499-1508
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[221]  arXiv:2406.01070 [pdf, other]
Title: Guiding ChatGPT to Generate Salient Domain Summaries
Subjects: Computation and Language (cs.CL)
[222]  arXiv:2406.01052 [pdf, other]
Title: MACT: Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing
Authors: Jiangming Liu
Comments: Accepted by LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[223]  arXiv:2406.01045 [pdf, other]
Title: Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[224]  arXiv:2406.01026 [pdf, other]
Title: Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors
Comments: Accept at ACL2024 Main
Journal-ref: ACL 2024
Subjects: Computation and Language (cs.CL)
[225]  arXiv:2406.01021 [pdf, ps, other]
Title: Combining Qualitative and Computational Approaches for Literary Analysis of Finnish Novels
Comments: Accepted in Scandinavian Studies Journal, issue 97.3 (2025)
Subjects: Computation and Language (cs.CL)
[226]  arXiv:2406.01014 [pdf, other]
Title: Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
Comments: 22 pages, 11 figures, 10 Tables
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[227]  arXiv:2406.01006 [pdf, other]
Title: SemCoder: Training Code Language Models with Comprehensive Semantics
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[228]  arXiv:2406.00984 [pdf, other]
Title: Predicting Drug-Gene Relations via Analogy Tasks with Word Embeddings
Subjects: Computation and Language (cs.CL)
[229]  arXiv:2406.00983 [pdf, other]
Title: Take its Essence, Discard its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[230]  arXiv:2406.00980 [pdf, other]
Title: Selectively Answering Visual Questions
Comments: To be published in the findings of the 2024 Annual Meeting of the Association for Computational Linguistics
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[231]  arXiv:2406.00976 [pdf, other]
Title: Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer
Comments: Accept in ACL2024-main
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[232]  arXiv:2406.00975 [pdf, other]
Title: Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[233]  arXiv:2406.00969 [pdf, other]
Title: Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media
Subjects: Computation and Language (cs.CL)
[234]  arXiv:2406.00954 [pdf, other]
Title: Annotation Guidelines-Based Knowledge Augmentation: Towards Enhancing Large Language Models for Educational Text Classification
Comments: The manuscript has been submitted for peer review to the IEEE Transactions on Learning Technologies
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[235]  arXiv:2406.00944 [pdf, other]
Title: Unveil the Duality of Retrieval-Augmented Generation: Theoretical Analysis and Practical Solution
Comments: 23 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[236]  arXiv:2406.00936 [pdf, other]
Title: A Survey of Useful LLM Evaluation
Subjects: Computation and Language (cs.CL)
[237]  arXiv:2406.00922 [pdf, other]
Title: MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Comments: 29 pages, 12 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[238]  arXiv:2406.00899 [pdf, other]
Title: YODAS: Youtube-Oriented Dataset for Audio and Speech
Comments: ASRU 2023
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[239]  arXiv:2406.00888 [pdf, other]
Title: Show, Don't Tell: Aligning Language Models with Demonstrated Feedback
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[240]  arXiv:2406.00867 [pdf, ps, other]
Title: Formality Style Transfer in Persian
Comments: 20 pages, 4 figures, 8 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[241]  arXiv:2406.00842 [pdf, other]
Title: The Power of Summary-Source Alignments
Comments: Accepted to ACL-Findings 2024
Subjects: Computation and Language (cs.CL)
[242]  arXiv:2406.00839 [pdf, other]
Title: FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models
Comments: 16 pages, 8 figures. The paper has been accepted by ACL 2024 (Findings), with Kaixin Lan and Tao Fang contributing equally, and Derek F. Wong serving as the corresponding author
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[243]  arXiv:2406.00832 [pdf, other]
Title: BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[244]  arXiv:2406.00789 [pdf, ps, other]
Title: Developing an efficient corpus using Ensemble Data cleaning approach
Authors: Md Taimur Ahad
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[245]  arXiv:2406.00787 [pdf, other]
Title: Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation
Subjects: Computation and Language (cs.CL)
[246]  arXiv:2406.00770 [pdf, other]
Title: Automatic Instruction Evolving for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[247]  arXiv:2406.00755 [pdf, other]
Title: Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Comments: ACL Findings 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[248]  arXiv:2406.00751 [pdf, other]
Title: How well do distributed representations convey contextual lexical semantics: a Thesis Proposal
Authors: Zhu Liu
Comments: 6 pages
Subjects: Computation and Language (cs.CL)
[249]  arXiv:2406.00697 [pdf, other]
Title: Topic Modeling for Short Texts with Large Language Models
Subjects: Computation and Language (cs.CL)
[250]  arXiv:2406.00656 [pdf, other]
Title: Presence or Absence: Are Unknown Word Usages in Dictionaries?
Subjects: Computation and Language (cs.CL)
[ total of 489 entries: 1-250 | 251-489 ]
[ showing 250 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)