Revisiting Structured Dropout

Zhao, Yiren; Dada, Oluwatomisin; Gao, Xitong; Mullins, Robert D

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2210

Computer Science > Machine Learning

Title: Revisiting Structured Dropout

Authors: Yiren Zhao, Oluwatomisin Dada, Xitong Gao, Robert D Mullins

(Submitted on 5 Oct 2022)

Abstract: Large neural networks are often overparameterised and prone to overfitting, Dropout is a widely used regularization technique to combat overfitting and improve model generalization. However, unstructured Dropout is not always effective for specific network architectures and this has led to the formation of multiple structured Dropout approaches to improve model performance and, sometimes, reduce the computational resources required for inference. In this work, we revisit structured Dropout comparing different Dropout approaches to natural language processing and computer vision tasks for multiple state-of-the-art networks. Additionally, we devise an approach to structured Dropout we call \textbf{\emph{ProbDropBlock}} which drops contiguous blocks from feature maps with a probability given by the normalized feature salience values. We find that with a simple scheduling strategy the proposed approach to structured Dropout consistently improved model performance compared to baselines and other Dropout approaches on a diverse range of tasks and models. In particular, we show \textbf{\emph{ProbDropBlock}} improves RoBERTa finetuning on MNLI by $0.22\%$, and training of ResNet50 on ImageNet by $0.28\%$.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2210.02570 [cs.LG]
	(or arXiv:2210.02570v1 [cs.LG] for this version)

Submission history

From: Yiren Zhao [view email]
[v1] Wed, 5 Oct 2022 21:26:57 GMT (326kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2210.02570

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Revisiting Structured Dropout

Submission history