We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions, skipping first 88

[ total of 432 entries: 1-50 | 39-88 | 89-138 | 139-188 | 189-238 | 239-288 | ... | 389-432 ]
[ showing 50 entries per page: fewer | more | all ]

Fri, 31 May 2024 (continued, showing 50 of 76 entries)

[89]  arXiv:2405.20245 [pdf, other]
Title: Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use
Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[90]  arXiv:2405.20215 [pdf, other]
Title: TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models
Subjects: Computation and Language (cs.CL)
[91]  arXiv:2405.20204 [pdf, other]
Title: Jina CLIP: Your CLIP Model Is Also Your Text Retriever
Comments: 4 pages, ICML2024 workshop submission
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[92]  arXiv:2405.20192 [pdf, other]
Title: TAIA: Large Language Models are Out-of-Distribution Data Learners
Comments: 25 pages
Subjects: Computation and Language (cs.CL)
[93]  arXiv:2405.20179 [pdf, other]
Title: Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[94]  arXiv:2405.20175 [pdf, other]
Title: InstructionCP: A fast approach to transfer Large Language Models into target language
Comments: 10 pages, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[95]  arXiv:2405.20163 [pdf, other]
Title: Reasoning about concepts with LLMs: Inconsistencies abound
Comments: 15 pages, 5 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96]  arXiv:2405.20145 [pdf, other]
Title: Heidelberg-Boston @ SIGTYP 2024 Shared Task: Enhancing Low-Resource Language Analysis With Character-Aware Hierarchical Transformers
Comments: Accepted for publication at the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP-WS) 2024; 11 pages, 1 figure, 9 tables
Subjects: Computation and Language (cs.CL)
[97]  arXiv:2405.20139 [pdf, other]
Title: GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[98]  arXiv:2405.20131 [pdf, other]
Title: Language Models Need Inductive Biases to Count Inductively
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[99]  arXiv:2405.20092 [pdf, other]
Title: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[100]  arXiv:2405.20089 [pdf, other]
Title: The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities
Comments: Accepted to ACL 2024 (long, main)
Subjects: Computation and Language (cs.CL)
[101]  arXiv:2405.20079 [pdf, other]
Title: Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning
Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[102]  arXiv:2405.20053 [pdf, other]
Title: Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[103]  arXiv:2405.19967 [pdf, other]
Title: Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[104]  arXiv:2405.19958 [pdf, other]
Title: Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation
Comments: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[105]  arXiv:2405.19874 [pdf, other]
Title: Is In-Context Learning Sufficient for Instruction Following in LLMs?
Comments: Preprint. Code at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[106]  arXiv:2405.19856 [pdf, other]
Title: DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[107]  arXiv:2405.19846 [pdf, other]
Title: Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[108]  arXiv:2405.19842 [pdf, other]
Title: Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109]  arXiv:2405.19831 [pdf, other]
Title: Just Rewrite It Again: A Post-Processing Method for Enhanced Semantic Similarity and Privacy Preservation of Differentially Private Rewritten Text
Comments: 10 pages, 2 figures, 2 tables. Accepted to ARES 2024 (IWAPS)
Subjects: Computation and Language (cs.CL)
[110]  arXiv:2405.19799 [pdf, other]
Title: Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation
Subjects: Computation and Language (cs.CL)
[111]  arXiv:2405.19795 [pdf, other]
Title: SLM as Guardian: Pioneering AI Safety with Small Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[112]  arXiv:2405.19793 [pdf, other]
Title: PDDLEGO: Iterative Planning in Textual Environments
Comments: In *SEM 2024
Subjects: Computation and Language (cs.CL)
[113]  arXiv:2405.19787 [pdf, other]
Title: From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO); Programming Languages (cs.PL)
[114]  arXiv:2405.19778 [pdf, other]
Title: Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona
Comments: preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[115]  arXiv:2405.19763 [pdf, other]
Title: Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding
Comments: Accept at ACL2024 Main
Subjects: Computation and Language (cs.CL)
[116]  arXiv:2405.19744 [pdf, other]
Title: X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions
Comments: ACL 2024. Our codes, data and model weights are available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117]  arXiv:2405.19740 [pdf, other]
Title: PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations
Comments: 23 pages, 12 figures, 10 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[118]  arXiv:2405.19737 [pdf, other]
Title: Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2405.19715 [pdf, other]
Title: SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[120]  arXiv:2405.19701 [pdf, other]
Title: Significance of Chain of Thought in Gender Bias Mitigation for English-Dravidian Machine Translation
Comments: 6 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121]  arXiv:2405.19670 [pdf, other]
Title: One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Comments: working in progress, repo: this https URL
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2405.19660 [pdf, other]
Title: PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2405.19648 [pdf, other]
Title: Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Comments: ICAI'24 - The 26th Int'l Conf on Artificial Intelligence
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[124]  arXiv:2405.19635 [pdf, other]
Title: GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment
Subjects: Computation and Language (cs.CL)
[125]  arXiv:2405.19575 [pdf, other]
Title: A Deep Convolutional Neural Network-based Model for Aspect and Polarity Classification in Hausa Movie Reviews
Comments: To be published in the proceedings of ICCAIT 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[126]  arXiv:2405.19563 [pdf, other]
Title: Unlearning Climate Misinformation in Large Language Models
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2405.19538 [pdf, other]
Title: CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients
Comments: 13 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[128]  arXiv:2405.19519 [pdf, other]
Title: Two-layer retrieval augmented generation framework for low-resource medical question-answering: proof of concept using Reddit data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[129]  arXiv:2405.19487 [pdf, other]
Title: A Full-duplex Speech Dialogue Scheme Based On Large Language Models
Subjects: Computation and Language (cs.CL)
[130]  arXiv:2405.19462 [pdf, other]
Title: Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Comments: Accepted to ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[131]  arXiv:2405.19433 [pdf, other]
Title: Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2405.19426 [pdf, other]
Title: Deep Learning for Assessment of Oral Reading Fluency
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[133]  arXiv:2405.19425 [pdf, other]
Title: Adaptive In-conversation Team Building for Language Model Agents
Subjects: Computation and Language (cs.CL)
[134]  arXiv:2405.20341 (cross-list from cs.LG) [pdf, other]
Title: From Zero to Hero: Cold-Start Anomaly Detection
Comments: ACL 2024. Our code is available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[135]  arXiv:2405.20309 (cross-list from cs.LG) [pdf, other]
Title: Large Language Models Can Self-Improve At Web Agent Tasks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[136]  arXiv:2405.20271 (cross-list from cs.LG) [pdf, other]
Title: ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Comments: Accepted to ICML 2024. Code available at this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2405.20213 (cross-list from cs.AI) [pdf, other]
Title: PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[138]  arXiv:2405.20172 (cross-list from cs.SD) [pdf, other]
Title: Iterative Feature Boosting for Explainable Speech Emotion Recognition
Comments: Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)
Journal-ref: 2023 International Conference on Machine Learning and Applications (ICMLA), Jacksonville, FL, USA, 2023, pp. 543-549
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[ total of 432 entries: 1-50 | 39-88 | 89-138 | 139-188 | 189-238 | 239-288 | ... | 389-432 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)