We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 102

[ total of 346 entries: 1-25 | ... | 28-52 | 53-77 | 78-102 | 103-127 | 128-152 | 153-177 | 178-202 | ... | 328-346 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 18 Apr 2024 (continued, showing last 14 of 57 entries)

[103]  arXiv:2404.10830 [pdf, other]
Title: Fewer Truncations Improve Language Modeling
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2404.11584 (cross-list from cs.AI) [pdf, other]
Title: The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Comments: 13 pages,6 figures,38 references
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[105]  arXiv:2404.11538 (cross-list from cs.LG) [pdf, other]
Title: GenFighter: A Generative and Evolutive Textual Attack Removal
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[106]  arXiv:2404.11457 (cross-list from cs.IR) [pdf, other]
Title: Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[107]  arXiv:2404.11447 (cross-list from cs.AI) [pdf, ps, other]
Title: Research on emotionally intelligent dialogue generation based on automatic dialogue system
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[108]  arXiv:2404.11205 (cross-list from cs.CV) [pdf, other]
Title: Kathakali Hand Gesture Recognition With Minimal Data
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[109]  arXiv:2404.11049 (cross-list from cs.LG) [pdf, other]
Title: Stepwise Alignment for Constrained Language Model Policy Optimization
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[110]  arXiv:2404.11036 (cross-list from cs.LG) [pdf, other]
Title: Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[111]  arXiv:2404.11023 (cross-list from cs.HC) [pdf, other]
Title: Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
Comments: Position Paper, Under Review, 19 pages, 2 figures
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Machine Learning (cs.LG)
[112]  arXiv:2404.11018 (cross-list from cs.LG) [pdf, other]
Title: Many-Shot In-Context Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[113]  arXiv:2404.10981 (cross-list from cs.IR) [pdf, other]
Title: A Survey on Retrieval-Augmented Text Generation for Large Language Models
Comments: Ongoing work
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[114]  arXiv:2404.10934 (cross-list from cs.LG) [pdf, other]
Title: Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Industry Track)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[115]  arXiv:2404.10933 (cross-list from cs.AI) [pdf, other]
Title: LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs
Comments: 9 pages, 9 figures, accepted to IJCAI 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[116]  arXiv:2404.10838 (cross-list from cs.CV) [pdf, other]
Title: Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Comments: 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Wed, 17 Apr 2024 (showing first 11 of 47 entries)

[117]  arXiv:2404.10774 [pdf, other]
Title: MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Comments: LLM-AggreFact benchmark, MiniCheck models, data generation code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2404.10719 [pdf, other]
Title: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Comments: 16 pages, 2 figures, 14 tables
Subjects: Computation and Language (cs.CL)
[119]  arXiv:2404.10710 [pdf, other]
Title: Dual Modalities of Text: Visual and Textual Generative Pre-training
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[120]  arXiv:2404.10704 [pdf, other]
Title: Question Difficulty Ranking for Multiple-Choice Reading Comprehension
Comments: 7 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121]  arXiv:2404.10696 [pdf, other]
Title: Integrating knowledge bases to improve coreference and bridging resolution for the chemical domain
Comments: working in progress
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2404.10652 [pdf, other]
Title: ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Comments: Preprint submitted to IJCV
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2404.10642 [pdf, other]
Title: Self-playing Adversarial Language Game Enhances LLM Reasoning
Comments: Preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[124]  arXiv:2404.10630 [pdf, other]
Title: HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[125]  arXiv:2404.10555 [pdf, other]
Title: Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training
Comments: 7 pages
Subjects: Computation and Language (cs.CL); Computational Finance (q-fin.CP)
[126]  arXiv:2404.10552 [pdf, other]
Title: Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[127]  arXiv:2404.10513 [pdf, other]
Title: CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[ total of 346 entries: 1-25 | ... | 28-52 | 53-77 | 78-102 | 103-127 | 128-152 | 153-177 | 178-202 | ... | 328-346 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2404, contact, help  (Access key information)