Fooled by Imagination: Adversarial Attack to Image Captioning via Perturbation in Complex Domain

Shaofeng Zhang, Zheng Wang, Xing Xu, Xiang Guan, Yang Yang

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 08:04

07 Jul 2020

Adversarial attacks are very successful on image classification, but there are few researches on vision-language systems, such as image captioning. In this paper, we study the robustness of a CNN+RNN based image captioning system being subjected to adversarial noises in complex domain.
In particular, we propose \textbf{Fooled-by-Imagination}, a novel algorithm for crafting adversarial examples with semantic embedding of targeted caption as perturbation in complex domain. The proposed algorithm explores the great merit of complex values in introducing imaginary part for modeling adversarial perturbation, and maintains the similarity of the image in real part. Our approach provides two evaluation approaches, which check whether neural image captioning systems can be fooled to output some randomly chosen captions or keywords. Besides, our method has good transferability under black-box setting.
At last, our extensive experiments show that our algorithm can successfully craft visually-similar adversarial examples with randomly targeted captions or keywords at a higher success rate.

Tags:

icme 2020

sps conference

Fooled by Imagination: Adversarial Attack to Image Captioning via Perturbation in Complex Domain

Shaofeng Zhang, Zheng Wang, Xing Xu, Xiang Guan, Yang Yang

Value-Added Bundle(s) Including this Product

ICME 2020 Virtual Conference - Presentation Videos Product Bundle

More Like This

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

IEEE ICASSP 2024, 1 4-19 April 2024, Seoul, Korea. Conference Presentation Videos Bundle

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

Join the IEEE Signal Processing Society