If Influence Functions are the Answer, Then What is the Question?

Bae, Juhan; Ng, Nathan; Lo, Alston; Ghassemi, Marzyeh; Grosse, Roger


(Submitted on 12 Sep 2022)

Abstract: Influence functions efficiently estimate the effect of removing a single training data point on a model's learned parameters. While influence estimates align well with leave-one-out retraining for linear models, recent works have shown this alignment is often poor in neural networks. In this work, we investigate the specific factors that cause this discrepancy by decomposing it into five separate terms. We study the contributions of each term on a variety of architectures and datasets and how they vary with factors such as network width and training time. While practical influence function estimates may be a poor match to leave-one-out retraining for nonlinear networks, we show they are often a good approximation to a different object we term the proximal Bregman response function (PBRF). Since the PBRF can still be used to answer many of the questions motivating influence functions, such as identifying influential or mislabeled examples, our results suggest that current algorithms for influence function estimation give more informative results than previous error analyses would suggest.
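To make the abstract's claim about linear models concrete, here is a minimal sketch that compares the classical influence-function estimate with exact leave-one-out retraining. It is not the paper's code. The ridge-regression objective, the synthetic data, and the helper names (`fit_ridge`, `influence_estimate`, `leave_one_out_change`) are all assumptions chosen for illustration. The estimate uses the standard first-order formula: removing point k changes the parameters by roughly (1/n) H^{-1} ∇L_k(θ*), where H is the Hessian of the training objective.

```python
import numpy as np


def fit_ridge(X, y, lam, n_total):
    """Minimize (1/n_total) * sum_i 0.5*(x_i @ theta - y_i)**2 + 0.5*lam*||theta||^2.

    Returns the minimizer and the (constant) Hessian of the objective.
    n_total is passed explicitly so the leave-one-out objective keeps
    the same 1/n normalization as the full objective.
    """
    d = X.shape[1]
    H = X.T @ X / n_total + lam * np.eye(d)
    theta = np.linalg.solve(H, X.T @ y / n_total)
    return theta, H


def influence_estimate(X, y, theta, H, k):
    """First-order estimate of (theta without point k) - theta."""
    n = X.shape[0]
    grad_k = X[k] * (X[k] @ theta - y[k])  # gradient of point k's squared-error loss
    return np.linalg.solve(H, grad_k) / n


def leave_one_out_change(X, y, lam, theta, k):
    """Exact parameter change obtained by retraining without point k."""
    n = X.shape[0]
    keep = np.arange(n) != k
    theta_loo, _ = fit_ridge(X[keep], y[keep], lam, n_total=n)
    return theta_loo - theta


# Synthetic data (an assumption for illustration, not from the paper).
rng = np.random.default_rng(0)
n, d, lam = 500, 5, 0.1
X = rng.normal(size=(n, d))
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)

theta, H = fit_ridge(X, y, lam, n_total=n)
k = 0
approx = influence_estimate(X, y, theta, H, k)
exact = leave_one_out_change(X, y, lam, theta, k)

rel_err = np.linalg.norm(approx - exact) / np.linalg.norm(exact)
cosine = approx @ exact / (np.linalg.norm(approx) * np.linalg.norm(exact))
print(f"relative error: {rel_err:.4f}, cosine similarity: {cosine:.6f}")
```

In this quadratic setting the two vectors point in the same direction and differ only by a factor tied to the leverage of the removed point, which shrinks as n grows. For nonlinear networks no such closed-form relationship holds, which is the discrepancy the paper analyzes.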

Comments:	28 pages, 6 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2209.05364 [cs.LG]
	(or arXiv:2209.05364v1 [cs.LG] for this version)

Submission history

From: Juhan Bae
[v1] Mon, 12 Sep 2022 16:17:43 GMT (2181kb,D)
