Computer Science > Computation and Language
Title: WangchanLion and WangchanX MRC Eval
(Submitted on 24 Mar 2024 (v1), last revised 23 Apr 2024 (this version, v2))
Abstract: This technical report describes the development of WangchanLion, an instruction fine-tuned model focusing on Machine Reading Comprehension (MRC) in the Thai language. Our model is based on SEA-LION and a collection of instruction-following datasets. To promote open research and reproducibility, we publicly release all training data, code, and the final model weights under the Apache-2 license. To assess the model's contextual understanding capability, we conducted extensive experimental studies using two Thai MRC datasets, XQuAD and Iapp_wiki_qa_squad. Experimental results demonstrate the model's ability to comprehend the context and produce an answer faithful to the reference answer in 0-shot and 1-shot settings. In addition, our evaluation goes beyond traditional MRC: we propose a new evaluation scheme that assesses the answer's correctness, helpfulness, conciseness, and contextuality. Our code is available publicly at this https URL
Submission history
From: Wannaphong Phatthiyaphaibun
[v1] Sun, 24 Mar 2024 12:49:30 GMT (963kb,D)
[v2] Tue, 23 Apr 2024 12:31:30 GMT (965kb,D)