Computer Science > Machine Learning
Title: Discovering Invariances in Healthcare Neural Networks
(Submitted on 8 Nov 2019 (v1), last revised 3 Mar 2020 (this version, v3))
Abstract: We study the invariance characteristics of pre-trained predictive models by empirically learning transformations on the input that leave the prediction function approximately unchanged. To learn invariant transformations, we minimize the Wasserstein distance between the predictive distribution conditioned on the data instances and the predictive distribution conditioned on the transformed data instances. To avoid finding degenerate or perturbative transformations, we add a similarity regularization to discourage similarity between the data and its transformed values. We theoretically analyze the correctness of the algorithm and the structure of the solutions. Applying the proposed technique to clinical time series data, we discover variables that commonly-used LSTM models do not rely on for their prediction, especially when the LSTM is trained to be adversarially robust. We also analyze the invariances of BioBERT on clinical notes and discover words that it is invariant to.
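The objective described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy `predict` model, the `objective` helper, the use of an empirical 1-D Wasserstein-1 distance over a batch of predicted probabilities, the exponential form of the similarity penalty, and the weight `lam` are all assumptions chosen for illustration. The sketch only evaluates the objective for three fixed candidate transformations rather than learning one.

```python
import numpy as np

def predict(x):
    """Hypothetical stand-in for a pre-trained model.

    A logistic model over three features whose weight on feature 2 is zero,
    so its predictions do not depend on that feature.
    """
    w = np.array([1.5, -2.0, 0.0])
    return 1.0 / (1.0 + np.exp(-(x @ w)))

def wasserstein_1d(p, q):
    """Empirical Wasserstein-1 distance between two equal-size 1-D samples."""
    return float(np.mean(np.abs(np.sort(p) - np.sort(q))))

def objective(x, transform, lam=0.1):
    """Distance between predictions on x and on transform(x), plus a
    penalty that is largest when transform(x) stays close to x.

    The penalty form exp(-mean squared difference) is an assumption.
    """
    tx = transform(x)
    dist = wasserstein_1d(predict(x), predict(tx))
    similarity = float(np.exp(-np.mean((x - tx) ** 2)))
    return dist + lam * similarity

rng = np.random.default_rng(0)
x = rng.normal(size=(256, 3))

candidates = {
    "identity": lambda x: x,                                    # degenerate
    "shift_ignored": lambda x: x + np.array([0.0, 0.0, 3.0]),   # moves unused feature
    "shift_used": lambda x: x + np.array([3.0, 0.0, 0.0]),      # moves a used feature
}
scores = {name: objective(x, t) for name, t in candidates.items()}
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

Under these assumptions, the shift along the unused feature scores lowest: it leaves the predictions unchanged while moving the data away from its original values. The identity map is penalized by the similarity term, and the shift along a used feature is penalized by the distance term.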
Submission history
From: Mohammad Taha Bahadori
[v1] Fri, 8 Nov 2019 14:48:05 GMT (290kb,D)
[v2] Tue, 14 Jan 2020 03:40:38 GMT (314kb,D)
[v3] Tue, 3 Mar 2020 18:00:06 GMT (357kb,D)