Probing as Quantifying Inductive Bias

Immer, Alexander; Hennigen, Lucas Torroba; Fortuin, Vincent; Cotterell, Ryan

(Submitted on 15 Oct 2021 (v1), last revised 24 Mar 2022 (this version, v2))

Abstract: Pre-trained contextual representations have led to dramatic performance improvements on a range of downstream tasks. Such performance improvements have motivated researchers to quantify and understand the linguistic information encoded in these representations. In general, researchers quantify the amount of linguistic information through probing, an endeavor which consists of training a supervised model to predict a linguistic property directly from the contextual representations. Unfortunately, this definition of probing has been subject to extensive criticism in the literature, and has been observed to lead to paradoxical and counter-intuitive results. In the theoretical portion of this paper, we take the position that the goal of probing ought to be measuring the amount of inductive bias that the representations encode on a specific task. We further describe a Bayesian framework that operationalizes this goal and allows us to quantify the representations' inductive bias. In the empirical portion of the paper, we apply our framework to a variety of NLP tasks. Our results suggest that our proposed framework alleviates many previous problems found in probing. Moreover, we are able to offer concrete evidence that -- for some tasks -- fastText can offer a better inductive bias than BERT.

Comments:	ACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2110.08388 [cs.CL]
	(or arXiv:2110.08388v2 [cs.CL] for this version)

Submission history

From: Lucas Torroba Hennigen
[v1] Fri, 15 Oct 2021 22:01:16 GMT (573kb,D)
[v2] Thu, 24 Mar 2022 23:12:27 GMT (547kb,D)
