Adversarial Contrastive Distillation with Adaptive Denoising
Yuzheng Wang (Fudan University); Zhaoyu Chen (Fudan University); Dingkang Yang (Fudan University); Yang Liu (Fudan University); Siao Liu (Fudan University); Wenqiang Zhang (Fudan University); Lizhe Qi (Fudan University)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
Adversarial Robustness Distillation (ARD) is a novel method to boost the robustness of small models. Unlike general adversarial training, its robust knowledge transfer can be less easily restricted by the model capacity. However, the teacher model that provides the robustness of knowledge does not always make correct predictions, interfering with the student’s robust performance. Besides, in the previous ARD methods, the robustness comes entirely from one-to-one imitation, ignoring the relationship between examples. To this end, we propose a novel structured ARD method called Contrastive Relationship DeNoise Distillation (CRDND). We design an adaptive compensation module to model the instability of the teacher. Moreover, we utilize the contrastive relationship to explore implicit robustness knowledge among multiple examples. Experimental results on multiple attack benchmarks show CRDND can transfer robust knowledge efficiently and achieves state-of-the-art performance.