MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields

Lu, Jiaying; Qian, Yongchen; Zhao, Shifan; Xi, Yuanzhe; Yang, Carl

doi:10.18653/v1/2023.findings-emnlp.354

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2302

Change to browse by:

Computer Science > Machine Learning

Title: MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields

Authors: Jiaying Lu, Yongchen Qian, Shifan Zhao, Yuanzhe Xi, Carl Yang

(Submitted on 6 Feb 2023 (v1), last revised 17 Oct 2023 (this version, v2))

Abstract: Previous research has demonstrated the advantages of integrating data from multiple sources over traditional unimodal data, leading to the emergence of numerous novel multimodal applications. We propose a multimodal classification benchmark MuG with eight datasets that allows researchers to evaluate and improve their models. These datasets are collected from four various genres of games that cover tabular, textual, and visual modalities. We conduct multi-aspect data analysis to provide insights into the benchmark, including label balance ratios, percentages of missing features, distributions of data within each modality, and the correlations between labels and input modalities. We further present experimental results obtained by several state-of-the-art unimodal classifiers and multimodal classifiers, which demonstrate the challenging and multimodal-dependent properties of the benchmark. MuG is released at this https URL with the data, tutorials, and implemented baselines.

Subjects:	Machine Learning (cs.LG)
Journal reference:	In Findings of the Association for Computational Linguistics: EMNLP 2023
DOI:	10.18653/v1/2023.findings-emnlp.354
Cite as:	arXiv:2302.02978 [cs.LG]
	(or arXiv:2302.02978v2 [cs.LG] for this version)

Submission history

From: Jiaying Lu [view email]
[v1] Mon, 6 Feb 2023 18:09:06 GMT (570kb,D)
[v2] Tue, 17 Oct 2023 16:03:38 GMT (533kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2302.02978

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields

Submission history