Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications

Tian, Yongqiang; Zhang, Wuqi; Wen, Ming; Cheung, Shing-Chi; Sun, Chengnian; Ma, Shiqing; Jiang, Yu

doi:10.1145/3583564

Full-text links:

Download:

Current browse context:

cs.SE

< prev | next >

new | recent | 2112

Computer Science > Software Engineering

Title: Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications

Authors: Yongqiang Tian, Wuqi Zhang, Ming Wen, Shing-Chi Cheung, Chengnian Sun, Shiqing Ma, Yu Jiang

(Submitted on 6 Dec 2021 (v1), last revised 4 Feb 2023 (this version, v2))

Abstract: Model compression can significantly reduce the sizes of deep neural network (DNN) models, and thus facilitates the dissemination of sophisticated, sizable DNN models, especially for their deployment on mobile or embedded devices. However, the prediction results of compressed models may deviate from those of their original models. To help developers thoroughly understand the impact of model compression, it is essential to test these models to find those deviated behaviors before dissemination. However, this is a non-trivial task because the architectures and gradients of compressed models are usually not available.
To this end, we propose DFLARE, a novel, search-based, black-box testing technique to automatically find triggering inputs that result in deviated behaviors in image classification tasks. DFLARE iteratively applies a series of mutation operations to a given seed image, until a triggering input is found. For better efficacy and efficiency, DFLARE models the search problem as Markov Chains and leverages the Metropolis-Hasting algorithm to guide the selection of mutation operators in each iteration. Further, DFLARE utilizes a novel fitness function to prioritize the mutated inputs that either cause large differences between two models' outputs, or trigger previously unobserved models' probability vectors. We evaluated DFLARE on 21 compressed models for image classification tasks with three datasets. The results show that DFLARE outperforms the baseline in terms of efficacy and efficiency. We also demonstrated that the triggering inputs found by DFLARE can be used to repair up to 48.48% deviated behaviors in image classification tasks and further decrease the effectiveness of DFLARE on the repaired models.

Comments:	This is the author version. The DOI of the published version is this http URL Please see the full abstract in the pdf
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Journal reference:	ACM Transactions on Software Engineering and Methodology, 2023
DOI:	10.1145/3583564
Cite as:	arXiv:2112.02819 [cs.SE]
	(or arXiv:2112.02819v2 [cs.SE] for this version)

Submission history

From: Yongqiang Tian [view email]
[v1] Mon, 6 Dec 2021 07:12:49 GMT (99kb,D)
[v2] Sat, 4 Feb 2023 23:23:19 GMT (675kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2112.02819

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Software Engineering

Title: Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications

Submission history