TiltedBERT: Resource Adjustable Version of BERT

Kachuee, Sajjad; Sharifkhani, Mohammad

(Submitted on 10 Jan 2022 (this version), latest version 20 Dec 2023 (v7))

Abstract: In this paper, we propose a novel adjustable fine-tuning method that reduces the training and inference time of the BERT model on downstream tasks. The method first identifies the more important word vectors in each layer using our proposed redundancy metric and then eliminates the less important word vectors using our proposed strategy. The word vector elimination rate in each layer is controlled by the Tilt-Rate hyper-parameter, and the model learns to work with considerably fewer Floating Point Operations (FLOPs) than the original BERT-base model. The method needs no extra training steps and can be generalized to other transformer-based models. Extensive experiments show that the word vectors in higher layers carry a substantial amount of redundancy, which can be eliminated to decrease training and inference time. Experimental results on sentiment analysis, classification, and regression datasets, and on benchmarks such as IMDB and GLUE, show that the method is effective across various datasets. Applied to the BERT-base model, our method decreases inference time by up to 5.3 times with less than 0.85% accuracy degradation on average. After the fine-tuning stage, the inference time of the model can be adjusted through the method's offline-tuning property over a wide range of Tilt-Rate values. We also propose a mathematical speedup analysis that accurately estimates the speedup of our method. With this analysis, the Tilt-Rate hyper-parameter can be selected before fine-tuning or during the offline-tuning stage.
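
The general idea of layer-wise word vector elimination can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract specifies neither the redundancy metric nor the elimination strategy, so the sketch substitutes the L2 norm of each vector as a stand-in importance score and assumes that every layer keeps a fixed fraction `1 - tilt_rate` of its incoming vectors. The function names `prune_tokens`, `keep_schedule`, and `token_ratio_speedup` are hypothetical, and the speedup figure is a crude token-count proxy rather than the paper's mathematical speedup analysis.

```python
import math


def prune_tokens(vectors, keep):
    """Keep the `keep` highest-scoring vectors, preserving their original order.

    ASSUMPTION: the score is the L2 norm of each vector, a stand-in for the
    paper's redundancy metric, which the abstract does not define.
    """
    keep = max(1, min(keep, len(vectors)))
    scores = [math.sqrt(sum(x * x for x in v)) for v in vectors]
    ranked = sorted(range(len(vectors)), key=lambda i: scores[i], reverse=True)
    survivors = sorted(ranked[:keep])  # restore sequence order
    return [vectors[i] for i in survivors]


def keep_schedule(seq_len, num_layers, tilt_rate):
    """Number of vectors kept after each layer.

    ASSUMPTION: each layer keeps a fraction (1 - tilt_rate) of its input;
    the actual role of the Tilt-Rate in the paper may differ.
    """
    kept, n = [], seq_len
    for _ in range(num_layers):
        n = max(1, int(round(n * (1.0 - tilt_rate))))
        kept.append(n)
    return kept


def token_ratio_speedup(seq_len, kept):
    """Rough speedup proxy: vectors processed without pruning divided by
    vectors processed with pruning. Layer 0 sees the full sequence; layer i
    sees the output of layer i - 1. Ignores the quadratic attention cost."""
    processed = [seq_len] + kept[:-1]
    return (seq_len * len(kept)) / sum(processed)


if __name__ == "__main__":
    schedule = keep_schedule(seq_len=128, num_layers=12, tilt_rate=0.2)
    print("kept per layer:", schedule)
    print("speedup proxy:", token_ratio_speedup(128, schedule))
```

Because the proxy counts only the number of vectors each layer processes, it understates the savings in self-attention, whose cost grows quadratically with sequence length.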

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2201.03327 [cs.CL]
	(or arXiv:2201.03327v1 [cs.CL] for this version)

Submission history

From: Sajjad Kachuee [view email]
[v1] Mon, 10 Jan 2022 13:04:39 GMT (413kb,D)
[v2] Fri, 14 Jan 2022 14:45:24 GMT (487kb,D)
[v3] Fri, 11 Feb 2022 07:53:45 GMT (581kb,D)
[v4] Thu, 17 Mar 2022 14:04:59 GMT (578kb,D)
[v5] Tue, 21 Jun 2022 17:27:55 GMT (546kb,D)
[v6] Wed, 7 Sep 2022 13:53:53 GMT (786kb,D)
[v7] Wed, 20 Dec 2023 17:41:51 GMT (2185kb,D)
