We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 89

[ total of 346 entries: 1-50 | 40-89 | 90-139 | 140-189 | 190-239 | 240-289 | ... | 340-346 ]
[ showing 50 entries per page: fewer | more | all ]

Thu, 18 Apr 2024 (continued, showing last 27 of 57 entries)

[90]  arXiv:2404.11045 [pdf, other]
Title: Offset Unlearning for Large Language Models
Subjects: Computation and Language (cs.CL)
[91]  arXiv:2404.10975 [pdf, other]
Title: Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models
Comments: CogSci 2024
Subjects: Computation and Language (cs.CL)
[92]  arXiv:2404.10960 [pdf, other]
Title: Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[93]  arXiv:2404.10952 [pdf, other]
Title: Can Language Models Solve Olympiad Programming?
Comments: Code and data: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
[94]  arXiv:2404.10939 [pdf, other]
Title: More Room for Language: Investigating the Effect of Retrieval on Language Models
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2404.10924 [pdf, other]
Title: Binder: Hierarchical Concept Representation through Order Embedding of Binary Vectors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96]  arXiv:2404.10922 [pdf, other]
Title: Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
Comments: NAACL Findings 2024
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[97]  arXiv:2404.10917 [pdf, other]
Title: Which questions should I answer? Salience Prediction of Inquisitive Questions
Subjects: Computation and Language (cs.CL)
[98]  arXiv:2404.10887 [pdf, other]
Title: Search Beyond Queries: Training Smaller Language Models for Web Interactions via Reinforcement Learning
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2404.10877 [pdf, other]
Title: Incubating Text Classifiers Following User Instruction with Nothing but LLM
Subjects: Computation and Language (cs.CL)
[100]  arXiv:2404.10859 [pdf, other]
Title: Forcing Diffuse Distributions out of Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101]  arXiv:2404.10857 [pdf, other]
Title: D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
Subjects: Computation and Language (cs.CL)
[102]  arXiv:2404.10848 [pdf, other]
Title: A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2404.10830 [pdf, other]
Title: Fewer Truncations Improve Language Modeling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2404.11584 (cross-list from cs.AI) [pdf, other]
Title: The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Comments: 13 pages,6 figures,38 references
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[105]  arXiv:2404.11538 (cross-list from cs.LG) [pdf, other]
Title: GenFighter: A Generative and Evolutive Textual Attack Removal
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[106]  arXiv:2404.11457 (cross-list from cs.IR) [pdf, other]
Title: Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[107]  arXiv:2404.11447 (cross-list from cs.AI) [pdf, ps, other]
Title: Research on emotionally intelligent dialogue generation based on automatic dialogue system
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[108]  arXiv:2404.11205 (cross-list from cs.CV) [pdf, other]
Title: Kathakali Hand Gesture Recognition With Minimal Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109]  arXiv:2404.11049 (cross-list from cs.LG) [pdf, other]
Title: Stepwise Alignment for Constrained Language Model Policy Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[110]  arXiv:2404.11036 (cross-list from cs.LG) [pdf, other]
Title: Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[111]  arXiv:2404.11023 (cross-list from cs.HC) [pdf, other]
Title: Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
Comments: Position Paper, Under Review, 19 pages, 2 figures
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[112]  arXiv:2404.11018 (cross-list from cs.LG) [pdf, other]
Title: Many-Shot In-Context Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113]  arXiv:2404.10981 (cross-list from cs.IR) [pdf, other]
Title: A Survey on Retrieval-Augmented Text Generation for Large Language Models
Comments: Ongoing work
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[114]  arXiv:2404.10934 (cross-list from cs.LG) [pdf, other]
Title: Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Industry Track)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[115]  arXiv:2404.10933 (cross-list from cs.AI) [pdf, other]
Title: LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
Comments: 9 pages, 9 figures, accepted to IJCAI 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[116]  arXiv:2404.10838 (cross-list from cs.CV) [pdf, other]
Title: Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Wed, 17 Apr 2024 (showing first 23 of 47 entries)

[117]  arXiv:2404.10774 [pdf, other]
Title: MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Comments: LLM-AggreFact benchmark, MiniCheck models, data generation code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2404.10719 [pdf, other]
Title: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Comments: 16 pages, 2 figures, 14 tables
Subjects: Computation and Language (cs.CL)
[119]  arXiv:2404.10710 [pdf, other]
Title: Dual Modalities of Text: Visual and Textual Generative Pre-training
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2404.10704 [pdf, other]
Title: Question Difficulty Ranking for Multiple-Choice Reading Comprehension
Comments: 7 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121]  arXiv:2404.10696 [pdf, other]
Title: Integrating knowledge bases to improve coreference and bridging resolution for the chemical domain
Comments: working in progress
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2404.10652 [pdf, other]
Title: ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Comments: Preprint submitted to IJCV
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2404.10642 [pdf, other]
Title: Self-playing Adversarial Language Game Enhances LLM Reasoning
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[124]  arXiv:2404.10630 [pdf, other]
Title: HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[125]  arXiv:2404.10555 [pdf, other]
Title: Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training
Comments: 7 pages
Subjects: Computation and Language (cs.CL); Computational Finance (q-fin.CP)
[126]  arXiv:2404.10552 [pdf, other]
Title: Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[127]  arXiv:2404.10513 [pdf, other]
Title: CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[128]  arXiv:2404.10508 [pdf, other]
Title: White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[129]  arXiv:2404.10503 [pdf, other]
Title: A Sentiment Analysis of Medical Text Based on Deep Learning
Authors: Yinan Chen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130]  arXiv:2404.10500 [pdf, other]
Title: When Emotional Stimuli meet Prompt Designing: An Auto-Prompt Graphical Paradigm
Comments: 9 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131]  arXiv:2404.10475 [pdf, other]
Title: Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2404.10464 [pdf, other]
Title: DESTEIN: Navigating Detoxification of Language Models via Universal Steering Pairs and Head-wise Activation Fusion
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[133]  arXiv:2404.10440 [pdf, other]
Title: Language Proficiency and F0 Entrainment: A Study of L2 English Imitation in Italian, French, and Slovak Speakers
Comments: Accepted at Speech Prosody 2024
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[134]  arXiv:2404.10384 [pdf, other]
Title: Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[135]  arXiv:2404.10346 [pdf, other]
Title: Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
Comments: Preprint Under Review
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2404.10315 [pdf, other]
Title: Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience
Subjects: Computation and Language (cs.CL)
[137]  arXiv:2404.10306 [pdf, other]
Title: Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Comments: 43 pages, 10 figures
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2404.10297 [pdf, other]
Title: Future Language Modeling from Temporal Document History
Comments: Accepted by ICLR 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139]  arXiv:2404.10268 [pdf, other]
Title: Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation
Comments: Accepted to the main conference of LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[ total of 346 entries: 1-50 | 40-89 | 90-139 | 140-189 | 190-239 | 240-289 | ... | 340-346 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)