Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks

Dong, Yinpeng; Pang, Tianyu; Su, Hang; Zhu, Jun

Computer Science > Computer Vision and Pattern Recognition

arXiv:1904.02884 (cs)

[Submitted on 5 Apr 2019]

Title:Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks

Authors:Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu

View PDF

Abstract:Deep neural networks are vulnerable to adversarial examples, which can mislead classifiers by adding imperceptible perturbations. An intriguing property of adversarial examples is their good transferability, making black-box attacks feasible in real-world applications. Due to the threat of adversarial attacks, many methods have been proposed to improve the robustness. Several state-of-the-art defenses are shown to be robust against transferable adversarial examples. In this paper, we propose a translation-invariant attack method to generate more transferable adversarial examples against the defense models. By optimizing a perturbation over an ensemble of translated images, the generated adversarial example is less sensitive to the white-box model being attacked and has better transferability. To improve the efficiency of attacks, we further show that our method can be implemented by convolving the gradient at the untranslated image with a pre-defined kernel. Our method is generally applicable to any gradient-based attack method. Extensive experiments on the ImageNet dataset validate the effectiveness of the proposed method. Our best attack fools eight state-of-the-art defenses at an 82% success rate on average based only on the transferability, demonstrating the insecurity of the current defense techniques.

Comments:	CVPR 2019 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:1904.02884 [cs.CV]
	(or arXiv:1904.02884v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.02884

Submission history

From: Yinpeng Dong [view email]
[v1] Fri, 5 Apr 2019 06:15:51 UTC (7,928 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators