Deep Fidelity in DNN Watermarking: A Study of Backdoor Watermarking for Classification Models

Hua, Guang; Teoh, Andrew Beng Jin

doi:10.1016/j.patcog.2023.109844

Title: Deep Fidelity in DNN Watermarking: A Study of Backdoor Watermarking for Classification Models

Authors: Guang Hua, Andrew Beng Jin Teoh

(Submitted on 1 Aug 2022 (v1), last revised 1 Nov 2023 (this version, v2))

Abstract: Backdoor watermarking is a promising paradigm to protect the copyright of deep neural network (DNN) models. In the existing works on this subject, researchers have intensively focused on watermarking robustness, while the concept of fidelity, which is concerned with the preservation of the model's original functionality, has received less attention. In this paper, focusing on deep image classification models, we show that the prevailing practice of measuring fidelity by learning accuracy alone is inadequate to characterize backdoor fidelity. Meanwhile, we show that the analogous concept of embedding distortion in multimedia watermarking, interpreted as the total weight loss (TWL) in DNN backdoor watermarking, is also problematic for fidelity measurement. To address this challenge, we propose the concept of deep fidelity, which states that the backdoor watermarked DNN model should preserve both the feature representation and the decision boundary of the unwatermarked host model. To achieve deep fidelity, we propose two loss functions, termed penultimate feature loss (PFL) and softmax probability-distribution loss (SPL), to preserve the feature representation, while the decision boundary is preserved by the proposed fix last layer (FixLL) treatment, inspired by the recent discovery that deep learning with a fixed classifier causes no loss of learning accuracy. With the above designs, both embedding-from-scratch and fine-tuning strategies are implemented to evaluate the deep fidelity of backdoor embedding, whose advantages over the existing methods are verified via experiments using ResNet18 for MNIST and CIFAR-10 classification, and a wide residual network (i.e., WRN28_10) for the CIFAR-100 task. PyTorch code is available at this https URL
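
The abstract names the two feature-preserving losses but does not give their formulas, so the following is only a minimal sketch of what such terms could look like. It assumes PFL is a mean squared error between the penultimate features of the host and watermarked models, and SPL is a KL divergence between their softmax outputs. Both forms, all function names, and the weights `alpha` and `beta` are illustrative assumptions, not the authors' definitions. The sketch uses only the Python standard library on plain lists; the FixLL treatment (keeping the last layer unchanged during embedding) is not shown.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def feature_loss(host_feat, wm_feat):
    """ASSUMED form of a penultimate feature loss: mean squared error
    between host-model and watermarked-model penultimate features."""
    return sum((h - w) ** 2 for h, w in zip(host_feat, wm_feat)) / len(host_feat)


def distribution_loss(host_logits, wm_logits, eps=1e-12):
    """ASSUMED form of a softmax probability-distribution loss:
    KL(host || watermarked) between the two softmax outputs."""
    p = softmax(host_logits)
    q = softmax(wm_logits)
    # eps guards against division by zero if a probability underflows.
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)


def fidelity_penalty(host_feat, wm_feat, host_logits, wm_logits,
                     alpha=1.0, beta=1.0):
    """Hypothetical weighted sum of the two terms; alpha and beta are
    illustrative weights, not values taken from the paper."""
    return (alpha * feature_loss(host_feat, wm_feat)
            + beta * distribution_loss(host_logits, wm_logits))
```

Under these assumptions, both terms are zero when the watermarked model reproduces the host model's features and logits exactly, and grow as the two models diverge on the same input.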

Comments:	Published in Pattern Recognition
Subjects:	Cryptography and Security (cs.CR)
Journal reference:	Pattern Recognition, Vol. 144, Dec. 2023
DOI:	10.1016/j.patcog.2023.109844
Cite as:	arXiv:2208.00563 [cs.CR]
	(or arXiv:2208.00563v2 [cs.CR] for this version)

Submission history

From: Guang Hua Dr.
[v1] Mon, 1 Aug 2022 01:30:36 GMT (3480kb,D)
[v2] Wed, 1 Nov 2023 03:00:59 GMT (6046kb,D)
