Revealing Backdoors, Post-Training, In Dnn Classifiers Via Novel Inference On Optimized Perturbations Inducing Group Misclassification

Zhen Xiang, David Miller, George Kesidis

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 10:33

04 May 2020

Recently, a special type of data poisoning (DP) attack against deep neural network (DNN) classifiers, known as a backdoor, was proposed. These attacks do not seek to degrade classification accuracy, but rather to have the classifier learn to classify to a target class whenever the backdoor pattern is present in a test example. Here, we address the challenging post-training detection of backdoor attacks in DNN image classifiers, wherein the defender does not have access to the poisoned training set, but only to the trained classifier itself, as well as to clean (unpoisoned) examples from the classification domain. We propose a defense against imperceptible backdoor attacks based on perturbation optimization and novel, robust detection inference. Our method detects whether the trained DNN has been backdoor-attacked and infers the source and target classes involved in an attack. It outperforms alternative defenses for several backdoor patterns, data sets, and attack settings.

Tags:

sps conference

icassp 2020 virtual conference

May 2020

icassp 2020

Revealing Backdoors, Post-Training, In Dnn Classifiers Via Novel Inference On Optimized Perturbations Inducing Group Misclassification

Zhen Xiang, David Miller, George Kesidis

Value-Added Bundle(s) Including this Product

ICASSP 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society