What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets

Yang, Jianing; Zhu, Yuying; Wang, Yongxin; Yi, Ruitao; Zadeh, Amir; Morency, Louis-Philippe

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2007

Computer Science > Computation and Language

Title: What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets

Authors: Jianing Yang, Yuying Zhu, Yongxin Wang, Ruitao Yi, Amir Zadeh, Louis-Philippe Morency

(Submitted on 7 Jul 2020)

Abstract: Question answering biases in video QA datasets can mislead multimodal model to overfit to QA artifacts and jeopardize the model's ability to generalize. Understanding how strong these QA biases are and where they come from helps the community measure progress more accurately and provide researchers insights to debug their models. In this paper, we analyze QA biases in popular video question answering datasets and discover pretrained language models can answer 37-48% questions correctly without using any multimodal context information, far exceeding the 20% random guess baseline for 5-choose-1 multiple-choice questions. Our ablation study shows biases can come from annotators and type of questions. Specifically, annotators that have been seen during training are better predicted by the model and reasoning, abstract questions incur more biases than factual, direct questions. We also show empirically that using annotator-non-overlapping train-test splits can reduce QA biases for video QA datasets.

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2007.03626 [cs.CL]
	(or arXiv:2007.03626v1 [cs.CL] for this version)

Submission history

From: Jianing Yang [view email]
[v1] Tue, 7 Jul 2020 17:00:11 GMT (52kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2007.03626

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets

Submission history