Computation and Language

Authors and titles for recent submissions, skipping first 365

Tue, 14 May 2024 (continued, showing last 21 of 122 entries)

[144]  arXiv:2405.06643 [pdf, ps, other]
Title: Levels of AI Agents: from Rules to Large Language Models
Authors: Yu Huang
Subjects: Computation and Language (cs.CL)
[145]  arXiv:2405.07960 (cross-list from cs.HC) [pdf, other]
Title: AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[146]  arXiv:2405.07863 (cross-list from cs.LG) [pdf, other]
Title: RLHF Workflow: From Reward Modeling to Online RLHF
Comments: 26 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[147]  arXiv:2405.07840 (cross-list from cs.HC) [pdf, other]
Title: Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[148]  arXiv:2405.07803 (cross-list from cs.IT) [pdf, other]
Title: Decoding Geometric Properties in Non-Random Data from First Information-Theoretic Principles
Comments: arXiv admin note: substantial text overlap with arXiv:2303.16045
Subjects: Information Theory (cs.IT); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Information Retrieval (cs.IR); Statistics Theory (math.ST)
[149]  arXiv:2405.07682 (cross-list from cs.SD) [pdf, other]
Title: FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
Comments: IJCAI 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[150]  arXiv:2405.07671 (cross-list from cs.FL) [pdf, other]
Title: Constructing a BPE Tokenization DFA
Subjects: Formal Languages and Automata Theory (cs.FL); Computation and Language (cs.CL); Machine Learning (cs.LG)
[151]  arXiv:2405.07667 (cross-list from cs.CR) [pdf, other]
Title: Backdoor Removal for Generative Large Language Models
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[152]  arXiv:2405.07663 (cross-list from cs.CV) [pdf, other]
Title: Sign Stitching: A Novel Approach to Sign Language Production
Comments: 18 pages, 3 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[153]  arXiv:2405.07500 (cross-list from cs.IR) [pdf, other]
Title: PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking
Journal-ref: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (Short-Paper Track), 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[154]  arXiv:2405.07081 (cross-list from cs.DB) [pdf, other]
Title: T-curator: a trust based curation tool for LOD logs
Authors: Dihia Lanasri
Subjects: Databases (cs.DB); Computation and Language (cs.CL)
[155]  arXiv:2405.07010 (cross-list from cs.CY) [pdf, other]
Title: Deciphering public attention to geoengineering and climate issues using machine learning and dynamic analysis
Comments: 46 pages, 6 main figures and SI
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[156]  arXiv:2405.06919 (cross-list from cs.CY) [pdf, other]
Title: Automating Thematic Analysis: How LLMs Analyse Controversial Topics
Comments: 18 pages, 6 figures
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[157]  arXiv:2405.06886 (cross-list from cs.IR) [pdf, other]
Title: Event GDR: Event-Centric Generative Document Retrieval
Comments: Accepted to WWW 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[158]  arXiv:2405.06808 (cross-list from q-fin.RM) [pdf, other]
Title: Large Language Model in Financial Regulatory Interpretation
Subjects: Risk Management (q-fin.RM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[159]  arXiv:2405.06772 (cross-list from cs.CR) [pdf, other]
Title: CANAL -- Cyber Activity News Alerting Language Model: Empirical Approach vs. Expensive LLM
Comments: Published in 2024 IEEE 3rd International Conference on AI in Cybersecurity (ICAIC), Conference Date: 07-09 February 2024
Journal-ref: 2024 IEEE 3rd International Conference on AI in Cybersecurity (ICAIC), Houston, TX, USA, 2024, pp. 1-12
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[160]  arXiv:2405.06762 (cross-list from cs.HC) [pdf, other]
Title: LIVE: LaTex Interactive Visual Editing
Authors: Jinwei Lin
Comments: 8 pages, double column, ieee
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[161]  arXiv:2405.06725 (cross-list from q-bio.NC) [pdf, other]
Title: On the Shape of Brainscores for Large Language Models (LLMs)
Authors: Jingkai Li
Comments: The Figure 10 from arXiv:1710.04019, Figure 6.28 from arXiv:2403.13825, and captions are both from this https URL, where the case in my paper is Figure 3, and has already cited its original source. I believe both arXiv:1710.04019 and arXiv:2403.13825 should cite the original source, rather than force me to cite them
Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[162]  arXiv:2405.06708 (cross-list from q-bio.GN) [pdf, other]
Title: LangCell: Language-Cell Pre-training for Cell Identity Understanding
Comments: 27 pages, 21 figures, conference
Subjects: Genomics (q-bio.GN); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[163]  arXiv:2405.06690 (cross-list from q-bio.BM) [pdf, other]
Title: DrugLLM: Open Large Language Model for Few-shot Molecule Generation
Comments: 17 pages, 3 figures
Subjects: Biomolecules (q-bio.BM); Computation and Language (cs.CL); Machine Learning (cs.LG)
[164]  arXiv:2405.06662 (cross-list from q-bio.BM) [pdf, ps, other]
Title: Language Interaction Network for Clinical Trial Approval Estimation
Subjects: Biomolecules (q-bio.BM); Computation and Language (cs.CL); Machine Learning (cs.LG)

Mon, 13 May 2024

[165]  arXiv:2405.06640 [pdf, other]
Title: Linearizing Large Language Models
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2405.06604 [pdf, other]
Title: Explaining Text Similarity in Transformer Models
Comments: Accepted to NAACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[167]  arXiv:2405.06563 [pdf, other]
[168]  arXiv:2405.06551 [pdf, other]
Title: ADSumm: Annotated Ground-truth Summary Datasets for Disaster Tweet Summarization
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[169]  arXiv:2405.06545 [pdf, other]
Title: Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[170]  arXiv:2405.06541 [pdf, other]
Title: ATSumm: Auxiliary information enhanced approach for abstractive disaster Tweet Summarization with sparse training data
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[171]  arXiv:2405.06524 [pdf, other]
Title: Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts
Subjects: Computation and Language (cs.CL)
[172]  arXiv:2405.06499 [pdf, other]
Title: Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks
Comments: accepted in the 10th Games and NLP 2024 workshop at LREC 2024
Subjects: Computation and Language (cs.CL)
[173]  arXiv:2405.06483 [pdf, other]
Title: LyS at SemEval-2024 Task 3: An Early Prototype for End-to-End Multimodal Emotion Linking as Graph-Based Parsing
Comments: Accepted at SemEval 2024
Subjects: Computation and Language (cs.CL)
[174]  arXiv:2405.06459 [pdf, other]
Title: Are EEG-to-Text Models Working?
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[175]  arXiv:2405.06454 [pdf, other]
Title: E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction
Subjects: Computation and Language (cs.CL)
[176]  arXiv:2405.06424 [pdf, other]
Title: Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation
Comments: Accepted to ICML 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[177]  arXiv:2405.06414 [pdf, other]
Title: Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?
Comments: Educational Data Mining 2024
Subjects: Computation and Language (cs.CL)
[178]  arXiv:2405.06410 [pdf, other]
Title: Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL
Comments: Accepted by ICIC 2024
Subjects: Computation and Language (cs.CL)
[179]  arXiv:2405.06373 [pdf, other]
Title: LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Comments: 10 pages, 6 figures, Under review as a conference paper at COLM 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[180]  arXiv:2405.06346 [pdf, other]
Title: Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology
Comments: Accepted to FAccT 2024
Subjects: Computation and Language (cs.CL)
[181]  arXiv:2405.06321 [pdf, other]
Title: Correlation Dimension of Natural Language in a Statistical Manifold
Comments: Published at Physical Review Research
Journal-ref: Du, X., & Tanaka-Ishii, K. (2024). Correlation dimension of natural language in a statistical manifold. Physical Review Research, 6(2), L022028
Subjects: Computation and Language (cs.CL); Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI)
[182]  arXiv:2405.06306 [pdf, other]
Title: A NLP Approach to "Review Bombing" in Metacritic PC Videogames User Ratings
Comments: 11 pages, 4 figures. Accepted by Discover Artificial Intelligence but withdrawn due to APC
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[183]  arXiv:2405.06295 [pdf, other]
Title: Aspect-oriented Consumer Health Answer Summarization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[184]  arXiv:2405.06275 [pdf, other]
Title: Pruning as a Domain-specific LLM Extractor
Comments: NAACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[185]  arXiv:2405.06258 [pdf, other]
Title: Automatic Generation of Model and Data Cards: A Step Towards Responsible AI
Comments: NAACL 2024 Main Poster
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2405.06239 [pdf, other]
Title: SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora
Authors: Faisal Qarah
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187]  arXiv:2405.06221 [pdf, other]
Title: For the Misgendered Chinese in Gender Bias Research: Multi-Task Learning with Knowledge Distillation for Pinyin Name-Gender Prediction
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[188]  arXiv:2405.06211 [pdf, other]
Title: A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[189]  arXiv:2405.06204 [pdf, other]
Title: HC$^2$L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding
Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: text overlap with arXiv:2312.03716
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[190]  arXiv:2405.06150 [pdf, other]
Title: Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech
Comments: Accepted to NAACL 2024
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Audio and Speech Processing (eess.AS)
[191]  arXiv:2405.06145 [pdf, other]
Title: Reddit-Impacts: A Named Entity Recognition Dataset for Analyzing Clinical and Social Effects of Substance Use Derived from Social Media
Comments: 7 pages, 1 figure, 4 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[192]  arXiv:2405.06134 [pdf, other]
Title: Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[193]  arXiv:2405.06105 [pdf, ps, other]
Title: Can Perplexity Reflect Large Language Model's Ability in Long Text Understanding?
Subjects: Computation and Language (cs.CL)
[194]  arXiv:2405.06067 [pdf, other]
Title: HMT: Hierarchical Memory Transformer for Long Context Language Processing
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[195]  arXiv:2405.06059 [pdf, other]
Title: A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[196]  arXiv:2405.06639 (cross-list from cs.LG) [pdf, other]
Title: Value Augmented Sampling for Language Model Alignment and Personalization
Comments: Website: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[197]  arXiv:2405.06636 (cross-list from cs.CV) [pdf, other]
Title: Federated Document Visual Question Answering: A Pilot Study
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[198]  arXiv:2405.06634 (cross-list from cs.CV) [pdf, other]
Title: Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark
Comments: 11 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[199]  arXiv:2405.06626 (cross-list from cs.LG) [pdf, other]
Title: Characterizing the Accuracy - Efficiency Trade-off of Low-rank Decomposition in Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[200]  arXiv:2405.06549 (cross-list from stat.AP) [pdf, ps, other]
Title: Sampling the Swadesh List to Identify Similar Languages with Tree Spaces
Comments: 19 pages, 26 figures
Subjects: Applications (stat.AP); Computation and Language (cs.CL)
[201]  arXiv:2405.06468 (cross-list from cs.CV) [pdf, other]
Title: Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[202]  arXiv:2405.06331 (cross-list from cs.LG) [pdf, other]
Title: LMD3: Language Model Data Density Dependence
Comments: 10 pages in the main body
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[203]  arXiv:2405.06319 (cross-list from cs.CV) [pdf, other]
Title: Decoding Emotions in Abstract Art: Cognitive Plausibility of CLIP in Recognizing Color-Emotion Associations
Comments: To appear in the Proceedings of the Annual Meeting of the Cognitive Science Society 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[204]  arXiv:2405.06301 (cross-list from cs.LG) [pdf, ps, other]
Title: Learning from String Sequences
Comments: 10 pages, 1 figure, 4 tables, Technical Report
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[205]  arXiv:2405.06270 (cross-list from cs.LG) [pdf, other]
Title: XAI4LLM. Let Machine Learning Models and LLMs Collaborate for Enhanced In-Context Learning in Healthcare
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[206]  arXiv:2405.06219 (cross-list from cs.LG) [pdf, other]
Title: SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[207]  arXiv:2405.06196 (cross-list from cs.CV) [pdf, other]
Title: VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with Lightweight Blocks
Comments: 12 pages, 5 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[208]  arXiv:2405.06129 (cross-list from cs.IR) [pdf, ps, other]
Title: Narrative to Trajectory (N2T+): Extracting Routes of Life or Death from Human Trafficking Text Corpora
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[209]  arXiv:2405.06093 (cross-list from cs.LG) [pdf, other]
Title: Selective Fine-tuning on LLM-labeled Data May Reduce Reliance on Human Annotation: A Case Study Using Schedule-of-Event Table Detection
Comments: 21 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[210]  arXiv:2405.06064 (cross-list from cs.AI) [pdf, other]
Title: LLMs for XAI: Future Directions for Explaining Explanations
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[211]  arXiv:2405.06058 (cross-list from cs.AI) [pdf, other]
Title: Large Language Models Show Human-like Social Desirability Biases in Survey Responses
Comments: 3 pages, 2 figures, submitted to PNAS Nexus
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC)
[212]  arXiv:2405.06001 (cross-list from cs.LG) [pdf, other]
Title: LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[213]  arXiv:2405.05990 (cross-list from cs.CR) [pdf, other]
Title: Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)

Fri, 10 May 2024

[214]  arXiv:2405.05966 [pdf, other]
Title: Natural Language Processing RELIES on Linguistics
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[215]  arXiv:2405.05957 [pdf, other]
Title: OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning
Subjects: Computation and Language (cs.CL)
[216]  arXiv:2405.05955 [pdf, other]
Title: Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning
Subjects: Computation and Language (cs.CL)
[217]  arXiv:2405.05938 [pdf, other]
Title: DOLOMITES: Domain-Specific Long-Form Methodical Tasks
Comments: Dataset link coming soon
Subjects: Computation and Language (cs.CL)
[218]  arXiv:2405.05904 [pdf, other]
Title: Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Subjects: Computation and Language (cs.CL)
[219]  arXiv:2405.05894 [pdf, other]
Title: Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons
Subjects: Computation and Language (cs.CL)
[220]  arXiv:2405.05777 [pdf, other]
Title: Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[221]  arXiv:2405.05776 [pdf, other]
Title: Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions
Comments: 8 pages, 3 figures, to appear in the Proceedings of the 46th Annual Conference of the Cognitive Science Society (2024)
Subjects: Computation and Language (cs.CL)
[222]  arXiv:2405.05741 [pdf, ps, other]
Title: Can large language models understand uncommon meanings of common words?
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[223]  arXiv:2405.05723 [pdf, other]
Title: Computational lexical analysis of Flamenco genres
Comments: 21 pages, 29 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[224]  arXiv:2405.05705 [pdf, other]
Title: Detecting Statements in Text: A Domain-Agnostic Few-Shot Solution
Comments: Paper accepted for publication at NOCAPS workshop at ICWSM 2024 conference
Subjects: Computation and Language (cs.CL)
[225]  arXiv:2405.05688 [pdf, other]
Title: Evaluating Dialect Robustness of Language Models via Conversation Understanding
Comments: 13 pages, 7 figures, 6 tables
Subjects: Computation and Language (cs.CL)
[226]  arXiv:2405.05616 [pdf, other]
Title: G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge for Commonsense Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[227]  arXiv:2405.05610 [pdf, other]
Title: Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[228]  arXiv:2405.05583 [pdf, other]
Title: OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
Comments: 19 pages, 8 tables, 8 figures
Subjects: Computation and Language (cs.CL)
[229]  arXiv:2405.05572 [pdf, other]
Title: From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[230]  arXiv:2405.05513 [pdf, ps, other]
Title: Automatic question generation for propositional logical equivalences
Subjects: Computation and Language (cs.CL); Discrete Mathematics (cs.DM)
[231]  arXiv:2405.05506 [pdf, other]
Title: Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias
Comments: Submitted for review
Subjects: Computation and Language (cs.CL)
[232]  arXiv:2405.05496 [pdf, other]
Title: Boosting Large Language Models with Continual Learning for Aspect-based Sentiment Analysis
Subjects: Computation and Language (cs.CL)
[233]  arXiv:2405.05493 [pdf, ps, other]
Title: Parameter-Efficient Fine-Tuning With Adapters
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[234]  arXiv:2405.05478 [pdf, other]
Title: Using Machine Translation to Augment Multilingual Classification
Authors: Adam King
Subjects: Computation and Language (cs.CL)
[235]  arXiv:2405.05466 [pdf, other]
Title: Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[236]  arXiv:2405.05444 [pdf, ps, other]
Title: Evaluating Students' Open-ended Written Responses with LLMs: Using the RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large
Comments: 18 pages, 6 tables, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[237]  arXiv:2405.05418 [pdf, other]
Title: Mitigating Exaggerated Safety in Large Language Models
Comments: 17 pages, 8 figures, 2 tables
Subjects: Computation and Language (cs.CL)
[238]  arXiv:2405.05417 [pdf, ps, other]
Title: Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models
Comments: 16 pages, 4 figures. For associated code, see this https URL
Subjects: Computation and Language (cs.CL)
[239]  arXiv:2405.05378 [pdf, other]
Title: "They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[240]  arXiv:2405.05376 [pdf, other]
Title: Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[241]  arXiv:2405.05374 [pdf, other]
Title: Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
Comments: 17 pages, 11 Figures, 9 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[242]  arXiv:2405.05348 [pdf, other]
Title: The Effect of Model Size on LLM Post-hoc Explainability via LIME
Comments: Published at ICLR 2024 Workshop on Secure and Trustworthy Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[243]  arXiv:2405.05345 [pdf, other]
Title: QuaLLM: An LLM-based Framework to Extract Quantitative Insights from Online Forums
Comments: Accepted to CHI LLM as Research Tools Workshop (2024)
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[244]  arXiv:2405.05860 (cross-list from cs.LG) [pdf, other]
Title: The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computers and Society (cs.CY)
[245]  arXiv:2405.05760 (cross-list from cs.CV) [pdf, other]
Title: Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[246]  arXiv:2405.05758 (cross-list from cs.HC) [pdf, other]
Title: Exploring the Potential of Human-LLM Synergy in Advancing Qualitative Analysis: A Case Study on Mental-Illness Stigma
Comments: 55 pages
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Computers and Society (cs.CY)
[247]  arXiv:2405.05678 (cross-list from cs.HC) [pdf, ps, other]
Title: Beyond Prompts: Learning from Human Communication for Enhanced AI Intent Alignment
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[248]  arXiv:2405.05615 (cross-list from cs.CV) [pdf, other]
Title: Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
Comments: Accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[249]  arXiv:2405.05600 (cross-list from cs.IR) [pdf, other]
Title: Can We Use Large Language Models to Fill Relevance Judgment Holes?
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[250]  arXiv:2405.05581 (cross-list from cs.HC) [pdf, other]
Title: One vs. Many: Comprehending Accurate Information from Multiple Erroneous and Inconsistent AI Generations
Comments: Accepted to FAccT 2024
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[251]  arXiv:2405.05435 (cross-list from cs.CR) [pdf, other]
Title: Analysis and prevention of AI-based phishing email attacks
Comments: Electronics, accepted
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[252]  arXiv:2405.05386 (cross-list from cs.LG) [pdf, other]
Title: Interpretability Needs a New Paradigm
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[253]  arXiv:2405.05347 (cross-list from cs.SE) [pdf, other]
Title: Benchmarking Educational Program Repair
Comments: 15 pages, 2 figures, 3 tables. Non-archival report presented at the NeurIPS'23 Workshop on Generative AI for Education (GAIED)
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[254]  arXiv:2405.05329 (cross-list from cs.DC) [pdf, other]
Title: KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache Generation
Comments: preprint for ICML 2024
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[255]  arXiv:2405.05294 (cross-list from cs.HC) [pdf, other]
Title: Harmonizing Program Induction with Rate-Distortion Theory
Comments: CogSci 2024
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Information Theory (cs.IT); Machine Learning (cs.LG); Symbolic Computation (cs.SC); Machine Learning (stat.ML)

Thu, 9 May 2024

[256]  arXiv:2405.05254 [pdf, other]
Title: You Only Cache Once: Decoder-Decoder Architectures for Language Models
Subjects: Computation and Language (cs.CL)
[257]  arXiv:2405.05253 [pdf, other]
Title: Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge
Comments: 7 pages, 4 figures, 2 tables. Accepted for publication at the 29th annual ACM conference on Innovation and Technology in Computer Science Education (ITiCSE 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[258]  arXiv:2405.05248 [pdf, other]
Title: LLMs with Personalities in Multi-issue Negotiation Games
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[259]  arXiv:2405.05204 [pdf, ps, other]
Title: CARE-SD: Classifier-based analysis for recognizing and eliminating stigmatizing and doubt marker labels in electronic health records: model development and validation
Comments: 28 pages, 3 figures, 4 tables. 5 Appendices
Subjects: Computation and Language (cs.CL)
[260]  arXiv:2405.05189 [pdf, other]
Title: MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning
Comments: Under review at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[261]  arXiv:2405.05176 [pdf, other]
Title: Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming
Comments: 18 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[262]  arXiv:2405.05161 [pdf, ps, other]
Title: Motion Capture Analysis of Verb and Adjective Types in Austrian Sign Language
Comments: 10 pages, 7 figures
Subjects: Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[263]  arXiv:2405.05116 [pdf, other]
Title: XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples
Subjects: Computation and Language (cs.CL)
[264]  arXiv:2405.05109 [pdf, other]
Title: QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs
Comments: 16 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[265]  arXiv:2405.05060 [pdf, other]
Title: Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models
Comments: 5 pages excluding references, 3 figures; accepted at Clinical NLP Workshop @ NAACL 2024
Subjects: Computation and Language (cs.CL)
[266]  arXiv:2405.05049 [pdf, ps, other]
Title: Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources
Subjects: Computation and Language (cs.CL)
[267]  arXiv:2405.05008 [pdf, other]
Title: ADELIE: Aligning Large Language Models on Information Extraction
Subjects: Computation and Language (cs.CL)
[268]  arXiv:2405.04960 [pdf, other]
Title: P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
Subjects: Computation and Language (cs.CL)
[269]  arXiv:2405.04955 [pdf, other]
Title: Improving Long Text Understanding with Knowledge Distilled from Summarization Model
Comments: arXiv admin note: text overlap with arXiv:2110.04741
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[270]  arXiv:2405.04897 [pdf, ps, other]
Title: Machine Learning-based NLP for Emotion Classification on a Cholera X Dataset
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[271]  arXiv:2405.04872 [pdf, other]
Title: Logical Negation Augmenting and Debiasing for Prompt-based Methods
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[272]  arXiv:2405.04829 [pdf, other]
Title: Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages
Comments: 8 pages, accepted in NAACL-SRW, 2024
Subjects: Computation and Language (cs.CL)
[273]  arXiv:2405.04828 [pdf, other]
Title: ChuXin: 1.6B Technical Report
Comments: Technical Report
Subjects: Computation and Language (cs.CL)
[274]  arXiv:2405.04820 [pdf, other]
Title: APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[275]  arXiv:2405.04819 [pdf, other]
Title: DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature
Comments: Under Review; Incorrect author name revised
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[276]  arXiv:2405.04818 [pdf, other]
Title: ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation
Comments: 18 pages, 7 figures, under review. Data available here: this https URL
Subjects: Computation and Language (cs.CL)
[277]  arXiv:2405.04793 [pdf, other]
Title: Zero-shot LLM-guided Counterfactual Generation for Text
Comments: arXiv admin note: text overlap with arXiv:2309.13340
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[278]  arXiv:2405.04781 [pdf, other]
Title: CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization
Subjects: Computation and Language (cs.CL)
[279]  arXiv:2405.04777 [pdf, other]
Title: Empathy Through Multimodality in Conversational Interfaces
Comments: 7 pages, 2 figures, 2 tables, conference paper
Subjects: Computation and Language (cs.CL)
[280]  arXiv:2405.04756 [pdf, other]
Title: BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[281]  arXiv:2405.04726 [pdf, other]
Title: Learning Phonotactics from Linguistic Informants
Subjects: Computation and Language (cs.CL)
[282]  arXiv:2405.04685 [pdf, other]
Title: Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[283]  arXiv:2405.04655 [pdf, other]
Title: Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense
Subjects: Computation and Language (cs.CL)
[284]  arXiv:2405.04590 [pdf, other]
Title: Language Modeling Using Tensor Trains
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[285]  arXiv:2405.04585 [pdf, other]
Title: PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models
Authors: Arpit Aggarwal
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[286]  arXiv:2405.05175 (cross-list from cs.CR) [pdf, other]
Title: Air Gap: Protecting Privacy-Conscious Conversational Agents
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[287]  arXiv:2405.05136 (cross-list from cs.CY) [pdf, other]
Title: Integrating LSTM and BERT for Long-Sequence Data Analysis in Intelligent Tutoring Systems
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[288]  arXiv:2405.05135 (cross-list from cs.SE) [pdf, ps, other]
Title: Lessons from the Use of Natural Language Inference (NLI) in Requirements Engineering Tasks
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Machine Learning (cs.LG)
[289]  arXiv:2405.04950 (cross-list from cs.CV) [pdf, other]
Title: VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Comments: 17 pages; Accepted by ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[290]  arXiv:2405.04758 (cross-list from cs.CR) [pdf, other]
Title: Honeyfile Camouflage: Hiding Fake Files in Plain Sight
Comments: 3rd Workshop on the security implications of Deepfakes and Cheapfakes (WDC) co-located at ACM ASIACCS 2024
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[291]  arXiv:2405.04620 (cross-list from hep-ph) [pdf, ps, other]
Title: Folded context condensation in Path Integral formalism for infinite context transformers
Comments: 7 pages, 2 figures
Subjects: High Energy Physics - Phenomenology (hep-ph); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
