Which Invariance Should We Transfer? A Causal Minimax Learning Approach

Liu, Mingzhou; Zheng, Xiangyu; Sun, Xinwei; Fang, Fang; Wang, Yizhou

(Submitted on 5 Jul 2021 (v1), last revised 30 May 2023 (this version, v5))

Abstract: A major barrier to deploying current machine learning models is their lack of reliability under dataset shift. To resolve this problem, most existing studies attempt to transfer stable information to unseen environments. In particular, methods based on independent causal mechanisms propose removing mutable causal mechanisms via the do-operator. Compared to previous methods, the resulting stable predictors are more effective in identifying stable information. However, a key question remains: which subset of this whole stable information should the model transfer in order to achieve optimal generalization ability? To answer this question, we present a comprehensive minimax analysis from a causal perspective. Specifically, we first provide a graphical condition under which the whole stable set is optimal. When this condition fails, we show with an example that, surprisingly, the whole stable set is not the optimal one to transfer, even though it fully exploits the stable information. To identify the optimal subset in this case, we propose to estimate the worst-case risk with a novel optimization scheme over the intervention functions on mutable causal mechanisms. We then propose an efficient algorithm to search for the subset with minimal worst-case risk, based on a newly defined equivalence relation between stable subsets. Compared to the exponential cost of exhaustively searching over all subsets, our search strategy has polynomial complexity. The effectiveness and efficiency of our methods are demonstrated on synthetic data and on the diagnosis of Alzheimer's disease.

Comments:	Accepted version of ICML-23
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2107.01876 [stat.ML]
	(or arXiv:2107.01876v5 [stat.ML] for this version)

Submission history

From: Mingzhou Liu [view email]
[v1] Mon, 5 Jul 2021 09:07:29 GMT (107kb,D)
[v2] Thu, 7 Jul 2022 03:46:16 GMT (2559kb,D)
[v3] Sun, 12 Feb 2023 07:56:59 GMT (3351kb,D)
[v4] Wed, 24 May 2023 13:05:01 GMT (2642kb,D)
[v5] Tue, 30 May 2023 13:37:27 GMT (2729kb,D)
