We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 93

[ total of 346 entries: 1-25 | 19-43 | 44-68 | 69-93 | 94-118 | 119-143 | 144-168 | 169-193 | ... | 344-346 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 18 Apr 2024 (continued, showing last 23 of 57 entries)

[94]  arXiv:2404.10939 [pdf, other]
Title: More Room for Language: Investigating the Effect of Retrieval on Language Models
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2404.10924 [pdf, other]
Title: Binder: Hierarchical Concept Representation through Order Embedding of Binary Vectors
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96]  arXiv:2404.10922 [pdf, other]
Title: Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
Comments: NAACL Findings 2024
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[97]  arXiv:2404.10917 [pdf, other]
Title: Which questions should I answer? Salience Prediction of Inquisitive Questions
Subjects: Computation and Language (cs.CL)
[98]  arXiv:2404.10887 [pdf, other]
Title: Search Beyond Queries: Training Smaller Language Models for Web Interactions via Reinforcement Learning
Comments: 9 pages
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2404.10877 [pdf, other]
Title: Incubating Text Classifiers Following User Instruction with Nothing but LLM
Subjects: Computation and Language (cs.CL)
[100]  arXiv:2404.10859 [pdf, other]
Title: Forcing Diffuse Distributions out of Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101]  arXiv:2404.10857 [pdf, other]
Title: D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
Subjects: Computation and Language (cs.CL)
[102]  arXiv:2404.10848 [pdf, other]
Title: A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2404.10830 [pdf, other]
Title: Fewer Truncations Improve Language Modeling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2404.11584 (cross-list from cs.AI) [pdf, other]
Title: The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Comments: 13 pages,6 figures,38 references
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[105]  arXiv:2404.11538 (cross-list from cs.LG) [pdf, other]
Title: GenFighter: A Generative and Evolutive Textual Attack Removal
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[106]  arXiv:2404.11457 (cross-list from cs.IR) [pdf, other]
Title: Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[107]  arXiv:2404.11447 (cross-list from cs.AI) [pdf, ps, other]
Title: Research on emotionally intelligent dialogue generation based on automatic dialogue system
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[108]  arXiv:2404.11205 (cross-list from cs.CV) [pdf, other]
Title: Kathakali Hand Gesture Recognition With Minimal Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109]  arXiv:2404.11049 (cross-list from cs.LG) [pdf, other]
Title: Stepwise Alignment for Constrained Language Model Policy Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[110]  arXiv:2404.11036 (cross-list from cs.LG) [pdf, other]
Title: Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[111]  arXiv:2404.11023 (cross-list from cs.HC) [pdf, other]
Title: Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
Comments: Position Paper, Under Review, 19 pages, 2 figures
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[112]  arXiv:2404.11018 (cross-list from cs.LG) [pdf, other]
Title: Many-Shot In-Context Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113]  arXiv:2404.10981 (cross-list from cs.IR) [pdf, other]
Title: A Survey on Retrieval-Augmented Text Generation for Large Language Models
Comments: Ongoing work
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[114]  arXiv:2404.10934 (cross-list from cs.LG) [pdf, other]
Title: Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Industry Track)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[115]  arXiv:2404.10933 (cross-list from cs.AI) [pdf, other]
Title: LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
Comments: 9 pages, 9 figures, accepted to IJCAI 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[116]  arXiv:2404.10838 (cross-list from cs.CV) [pdf, other]
Title: Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Wed, 17 Apr 2024 (showing first 2 of 47 entries)

[117]  arXiv:2404.10774 [pdf, other]
Title: MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Comments: LLM-AggreFact benchmark, MiniCheck models, data generation code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2404.10719 [pdf, other]
Title: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Comments: 16 pages, 2 figures, 14 tables
Subjects: Computation and Language (cs.CL)
[ total of 346 entries: 1-25 | 19-43 | 44-68 | 69-93 | 94-118 | 119-143 | 144-168 | 169-193 | ... | 344-346 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)