References & Citations
Computer Science > Computation and Language
Title: Unsupervised Domain Adaptation on Reading Comprehension
(Submitted on 13 Nov 2019 (this version), latest version 27 Jul 2020 (v5))
Abstract: Reading comprehension (RC) has been studied in a variety of datasets with the boosted performance brought by deep neural networks. However, the generalization capability of these models across different domains remains unclear. To alleviate this issue, we are going to investigate unsupervised domain adaptation on RC, wherein a model is trained on labeled source domain and to be applied to the target domain with only unlabeled samples. We first show that even with the powerful BERT contextual representation, the performance is still unsatisfactory when the model trained on one dataset is directly applied to another target dataset. To solve this, we provide a novel conditional adversarial self-training method (CASe). Specifically, our approach leverages a BERT model fine-tuned on the source dataset along with the confidence filtering to generate reliable pseudo-labeled samples in the target domain for self-training. On the other hand, it further reduces domain distribution discrepancy through conditional adversarial learning across domains. Extensive experiments show our approach achieves comparable accuracy to supervised models on multiple large-scale benchmark datasets.
Submission history
From: Yu Cao [view email][v1] Wed, 13 Nov 2019 00:54:39 GMT (417kb,D)
[v2] Mon, 13 Jan 2020 02:19:15 GMT (417kb,D)
[v3] Sat, 25 Jan 2020 10:59:15 GMT (417kb,D)
[v4] Thu, 7 May 2020 09:37:36 GMT (417kb,D)
[v5] Mon, 27 Jul 2020 02:44:59 GMT (417kb,D)
Link back to: arXiv, form interface, contact.