Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data
(Submitted on 14 Jun 2021 (v1), last revised 4 Jan 2022 (this version, v2))
Abstract: Modern kernel-based two-sample tests have shown great success in distinguishing complex, high-dimensional distributions with appropriate learned kernels. Previous work has demonstrated that this kernel learning procedure succeeds, assuming a considerable number of observed samples from each distribution. In realistic scenarios with very limited numbers of data samples, however, it can be challenging to identify a kernel powerful enough to distinguish complex distributions. We address this issue by introducing the problem of meta two-sample testing (M2ST), which aims to exploit (abundant) auxiliary data on related tasks to find an algorithm that can quickly identify a powerful test on new target tasks. We propose two specific algorithms for this task: a generic scheme which improves over baselines and a more tailored approach which performs even better. We provide both theoretical justification and empirical evidence that our proposed meta-testing schemes out-perform learning kernel-based tests directly from scarce observations, and identify when such schemes will be successful.
Submission history
From: Danica J. Sutherland [view email][v1] Mon, 14 Jun 2021 17:52:50 GMT (6028kb,D)
[v2] Tue, 4 Jan 2022 23:59:42 GMT (6003kb,D)
Link back to: arXiv, form interface, contact.