
Chair of Machine Learning
Chair of Communication Networks
Department of Electrical and Computer Engineering
Technical University of Munich

Review on Towards Deep Learning Models


Resistant to Adversarial Attacks

Anh Minh Nguyen


Seminar Machine Learning
17.06.2020

©2016 Technical University of Munich


Main ideas from Madry et al. 2017 [1]

 Show that deep learning models can be made resistant to adversarial attacks.
 A reliable adversarial training method.
 PGD attacks.
 The Madry Defense Model.

PGD algorithm (Projected Gradient Descent)

 Main idea:
 Start from a random perturbation inside the ε-ball around a sample
 Take a gradient step in the direction of greatest loss
 Project the perturbation back into the ball if necessary
 Repeat steps 2–3 until convergence (see the sketch below)

[Figure: a perturbation δ is projected back onto the ε-ball, e.g. projection onto the ℓ∞ ball and onto the ℓ2 ball]
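A minimal sketch of the attack loop described above, written in PyTorch for an ℓ∞ ball. The helper name, the default ε/α/step values, and the [0, 1] pixel clamp are illustrative assumptions, not the authors' reference implementation:

```python
import torch
import torch.nn.functional as F

def pgd_linf(model, x, y, eps=0.3, alpha=0.01, steps=40):
    """PGD attack inside an l_inf ball of radius eps around x."""
    # 1. Start from a random perturbation inside the eps-ball
    delta = torch.empty_like(x).uniform_(-eps, eps)
    delta.requires_grad_(True)

    for _ in range(steps):
        # 2. Take a gradient step in the direction of greatest loss
        loss = F.cross_entropy(model(x + delta), y)
        loss.backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()
            # 3. Project back into the eps-ball and keep pixels in [0, 1]
            delta.clamp_(-eps, eps)
            delta.copy_(torch.clamp(x + delta, 0, 1) - x)
        delta.grad.zero_()

    return (x + delta).detach()
```

Projection onto the ℓ∞ ball is a coordinate-wise clamp; for an ℓ2 ball one would instead rescale δ to norm ε whenever it leaves the ball.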

PGD algorithm

 PGD is used as the inner attacker when training the Madry Defense model, i.e. the saddle-point problem

min_θ E_(x,y)∼D [ max_{‖δ‖∞ ≤ ε} L(θ, x + δ, y) ]

 ⇒ an ℓ∞-bounded attack

Baseline models (Source A)

 MNIST
 CNN: 2 convolutional layers with 32 and 64 filters, 2 max-pooling layers, and a fully-connected layer with 1024 units (see the sketch below)
 ε = 0.3 (ℓ∞), 100,000 training iterations

 CIFAR-10
 ResNet model
 ε = 8 (ℓ∞, on the 0–255 pixel scale), 100,000 training iterations

Evaluation

 Attack models used for evaluation:


 White-box attacks with PGD, varying the number of iterations and restarts or the loss function (A)
 Black-box attacks from an independently trained copy of the network (A')
 Black-box attacks from a different CNN architecture (B)

Evaluation on MNIST

White-box attacks

Black-box attacks

 The Madry defense model works well against these transferable attacks, achieving considerably high accuracy.

Evaluation on CIFAR-10

White-box attacks

Black-box attacks

 The Madry defense model works well against transferable attacks.
 It fails to reach the same performance as the model trained on MNIST.

Is the Madry Defense Model that robust?

Resistance against different ℓ∞- and ℓ2-bounded attacks

 MNIST experiment:
 Types of attacks:
 ℓ∞ PGD with 100 steps, ε increasing from 0 to 0.5
 Decision-Based Attack (DBA) (Brendel et al. 2017) with 2000 steps
 ℓ2 PGD with 100 steps, ε increasing from 0 to 6

 Defense models to evaluate:
 Adversarially trained model against ℓ∞ PGD with ε = 0.3 (baseline model A)
 Standard (naturally) trained model with the same architecture as A

Resistance against different ℓ∞- and ℓ2-bounded attacks

 Works well against ℓ∞ PGD attacks for ε up to the training budget (ε ≤ 0.3).
 Robust against DBA attacks.
 Outperforms the naturally trained model.
 Poor robustness against ℓ2-bounded PGD attacks and against ℓ∞ PGD attacks with large ε.
Resistance against different ℓ∞- and ℓ2-bounded attacks

 CIFAR-10 experiment:
 Types of attacks:
 ℓ∞ PGD with 100 steps, ε increasing from 0 to 30
 ℓ2 PGD with 100 steps, ε increasing from 0 to 100

 Defense model to evaluate:
 Adversarially trained model against ℓ∞ PGD with ε = 8 (baseline model A)

Resistance against different ℓ∞- and ℓ2-bounded attacks

 Poor robustness against ℓ∞ attacks with large ε.
 Poor robustness against ℓ2-bounded attacks.
 Lower performance compared to the MNIST model.

Resistance against different ℓ∞- and ℓ2-bounded attacks

 Conclusion:
 The Madry Defense model only achieves considerably high accuracy against ℓ∞-bounded adversaries with ε ≤ 0.3.
 The model underperforms against ℓ2-bounded attacks: at those budgets the perturbations become large enough to visibly change the image and may even change the ground-truth label
→ visual distortion

 Sample adversarial examples with ℓ2 norm bounded by 4

Weaknesses of Madry Defense Model

Weaknesses of Madry Defense Model

Experiments in Schott et al. (2018) [2]

 For each model and each norm, measure how the model's accuracy decreases as the adversarial perturbation size increases
 Models used:
 CNN
 Madry defense model
 Nearest Neighbor
 Analysis by Synthesis model (ABS)
 Binary models (Binary CNN, Binary ABS), which preprocess images into binary inputs (input binarization; see the sketch below)
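Input binarization itself is a one-line preprocessing step; a sketch with an assumed threshold of 0.5:

```python
import torch

def binarize(x, threshold=0.5):
    """Map grayscale pixels to {0, 1} before feeding them to the classifier."""
    return (x > threshold).float()
```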

Experiments in Schott et al. (2018)

 The Madry defense model performs poorly against ℓ2-bounded and ℓ0-bounded attacks (in some cases even worse than a standard CNN).
 In the ℓ∞ case, the input-binarization models are more robust than the Madry model for large perturbations.
→ The model overfits to the ℓ∞ metric.

 The Madry defense model achieves strong performance on MNIST largely because of the quasi-binary nature of the dataset.

Experiments in Schott et al. (2018)

 Robustness against unrecognizable images:


 Unrecognizable images [5]:
 Also known as distal adversarials, rubbish-class examples, or fooling images
 Images that do not resemble anything from the training set and typically look like noise, yet are classified by the model with high confidence

Experiments in Schott et al. (2018)

 Compare the behavior of the CNN, the Madry defense model, and the ABS model when generating a fooling image for a fixed label using gradient ascent (see the sketch below).

 The Madry model readily assigns a wrong label to such unrecognizable images → it is more vulnerable to distal adversarials.

[Figure: generated images that are classified as 'one' with a probability above 90%]
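A rough sketch of the gradient-ascent procedure for such fooling images, assuming we start from uniform noise and maximize the log-probability of a fixed target class; step size and iteration count are placeholders:

```python
import torch
import torch.nn.functional as F

def fooling_image(model, target_class, shape=(1, 1, 28, 28), steps=200, lr=0.1):
    """Start from noise and run gradient ascent on the target-class log-probability."""
    x = torch.rand(shape, requires_grad=True)
    for _ in range(steps):
        log_prob = F.log_softmax(model(x), dim=1)[0, target_class]
        log_prob.backward()
        with torch.no_grad():
            x += lr * x.grad      # ascend: increase the target-class score
            x.clamp_(0, 1)        # keep pixels in the valid range
        x.grad.zero_()
    return x.detach()
```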

Weaknesses of Madry Defense Model

Weaknesses of Madry Defense Model

 From Sharma et al. (2018) [3]:

 The PGD attacks used to train and evaluate the Madry model were constrained to perturb each input by at most ε under the ℓ∞ distortion metric.
→ Reduces the power of the attacks
→ Imposes unrealistic constraints on attackers
→ Weakens the significance of the robustness claim

Sharma et al. 2018 results

 The Madry model achieves poor performance once the attacker is allowed perturbations beyond the ℓ∞ training budget.

 PGD generates adversarial examples with a high level of visual distortion at these larger ε.

Sharma et al. 2018 results

 Elastic-net attack to deep neural networks (EAD):

 Generalizes the C&W attack by combining ℓ1 and ℓ2 regularization
 Formulation: minimize c · f(x, t) + β · ‖x − x₀‖₁ + ‖x − x₀‖₂²  subject to  x ∈ [0, 1]^p

o Increasing κ increases the required margin between the predicted score of the target class and those of the remaining classes.
o Therefore, increasing κ improves transferability but compromises visual quality
→ more reliable (more transferable) adversarial examples
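A sketch of the two ingredients under the formulation above: the C&W-style margin with confidence κ and the elastic-net distortion term. The constant names c and β follow the formula; the functions below are an illustration, not the authors' code:

```python
import torch

def cw_margin(logits, target, kappa=0.0):
    """Targeted C&W margin f(x, t): push the target logit above all others by at least kappa."""
    target_logit = logits[0, target]
    mask = torch.ones(logits.size(1), dtype=torch.bool)
    mask[target] = False
    other_max = logits[0, mask].max()
    return torch.clamp(other_max - target_logit, min=-kappa)

def ead_objective(logits, target, x_adv, x_orig, c=1.0, beta=1e-2, kappa=0.0):
    """Elastic-net objective: c * f(x, t) + beta * ||x - x0||_1 + ||x - x0||_2^2."""
    diff = (x_adv - x_orig).flatten()
    return c * cw_margin(logits, target, kappa) + beta * diff.abs().sum() + (diff ** 2).sum()
```

The ℓ1 term encourages sparse perturbations (few changed pixels), which is what distinguishes EAD from purely ℓ2- or ℓ∞-driven attacks.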

Sharma et al. 2018 results

 Adversarial examples generated by EAD show distortion comparable to PGD-generated ones, but with better visual quality.

Sharma et al. 2018 results

 Explanation:

 Comparing three attacks that reach the same attack success rate (ASR):
- PGD and I-FGM have slightly smaller ℓ∞ distortion but much larger ℓ1 and ℓ2 distortion, leading to greater visual distortion
→ the examples lose their adversarial (imperceptible) nature.

 This illustrates the drawback of using ℓ∞ distortion as the sole distortion metric in the Madry model.

Weaknesses of Madry Defense Model

 Running-time complexity of PGD adversarial training

→ with M minibatches per epoch and N PGD steps per example, the number of gradient computations is O(MN) per epoch

→ roughly N times slower than standard training, which needs O(M) gradient computations per epoch

Weaknesses of Madry Defense Model

 Fast Adversarial Training [4]

 Idea: combine FGSM adversarial training + random initialization + DAWNBench techniques such as cyclic learning rates

→ Each epoch costs only about twice the number of gradient computations of standard training (see the sketch below)

→ Cyclic learning rates reduce the number of epochs needed
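A minimal sketch of the FGSM-with-random-start step: one attack gradient plus one weight-update gradient per batch. The ε and α values are typical CIFAR-10 settings and are assumptions; the cyclic learning-rate schedule would be handled by a separate scheduler:

```python
import torch
import torch.nn.functional as F

def fast_adv_step(model, x, y, optimizer, eps=8/255, alpha=10/255):
    """FGSM adversarial training step with a random start (in the spirit of Wong et al. 2020)."""
    # Random initialization inside the eps-ball, then a single FGSM step
    delta = torch.empty_like(x).uniform_(-eps, eps).requires_grad_(True)
    loss = F.cross_entropy(model(x + delta), y)
    loss.backward()
    with torch.no_grad():
        delta += alpha * delta.grad.sign()
        delta.clamp_(-eps, eps)
    # One weight update on the adversarial batch:
    # only two gradient computations per batch in total (attack + update)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x + delta.detach()), y)
    loss.backward()
    optimizer.step()
```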

Weaknesses of Madry Defense Model

 Result:

[Table: time to train a CIFAR-10 classifier to 45% robust accuracy using various adversarial training methods, with and without the DAWNBench techniques of cyclic learning rates and mixed-precision arithmetic]

Summary
 Key problems of the Madry Defense Model: since PGD generates attack samples independently for each data sample, it does not necessarily lead to good generalization in terms of risk minimization.
 Overfits to the ℓ∞ metric.
 Vulnerable to unrecognizable images (distal adversarials).
 Adversarial examples show a high level of visual distortion at large ε.
→ Can be addressed with optimization-based approaches (ABS, EAD)
 Runtime problem: O(MN) gradient computations in a single epoch
→ Can be resolved with Fast Adversarial Training, using FGSM combined with the DAWNBench techniques

References

[1] Madry et al. (2017). Towards Deep Learning Models Resistant to Adversarial Attacks. [Link]
[2] Schott et al. (2018). Towards the First Adversarially Robust Neural Network Model on MNIST. [Link]
[3] Sharma et al. (2018). Attacking the Madry Defense Model with L1-based Adversarial Examples. [Link]
[4] Wong et al. (2020). Fast is Better than Free: Revisiting Adversarial Training. [Link]
[5] Nguyen et al. (2014). Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images. [Link]

Questions?

