Automatic Rule Induction for Efficient Semi-Supervised Learning

Pryzant, Reid; Yang, Ziyi; Xu, Yichong; Zhu, Chenguang; Zeng, Michael

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2205

Change to browse by:

Computer Science > Computation and Language

Title: Automatic Rule Induction for Efficient Semi-Supervised Learning

Authors: Reid Pryzant, Ziyi Yang, Yichong Xu, Chenguang Zhu, Michael Zeng

(Submitted on 18 May 2022 (v1), revised 19 May 2022 (this version, v2), latest version 14 Oct 2022 (v5))

Abstract: Semi-supervised learning has shown promise in allowing NLP models to generalize from small amounts of labeled data. Meanwhile, pretrained transformer models act as black-box correlation engines that are difficult to explain and sometimes behave unreliably. In this paper, we propose tackling both of these challenges via Automatic Rule Induction (ARI), a simple and general-purpose framework for the automatic discovery and integration of symbolic rules into pretrained transformer models. First, we extract weak symbolic rules from low-capacity machine learning models trained on small amounts of labeled data. Next, we use an attention mechanism to integrate these rules into high-capacity pretrained transformer models. Last, the rule-augmented system becomes part of a self-training framework to boost supervision signal on unlabeled data. These steps can be layered beneath a variety of existing weak supervision and semi-supervised NLP algorithms in order to improve performance and interpretability. Experiments across nine sequence classification and relation extraction tasks suggest that ARI can improve state-of-the-art methods with no manual effort and minimal computational overhead.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.09067 [cs.CL]
	(or arXiv:2205.09067v2 [cs.CL] for this version)

Submission history

From: Reid Pryzant [view email]
[v1] Wed, 18 May 2022 16:50:20 GMT (445kb,D)
[v2] Thu, 19 May 2022 16:18:40 GMT (445kb,D)
[v3] Fri, 20 May 2022 16:42:21 GMT (446kb,D)
[v4] Tue, 11 Oct 2022 20:32:49 GMT (645kb,D)
[v5] Fri, 14 Oct 2022 17:10:39 GMT (645kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.09067v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Automatic Rule Induction for Efficient Semi-Supervised Learning

Submission history