Convolutional Neural Networks
Eunbyung Park
Assistant Professor
Department of Artificial Intelligence
Eunbyung Park (silverbottlep.github.io)
Convolution
1D Convolution
• Convolution is a mathematical operation on two functions ($f$, $g$) that produces a third function $f * g$

Continuous: $(f * g)(t) := \int_{-\infty}^{\infty} f(t - \tau)\, g(\tau)\, d\tau$

Discrete: $(f * g)[t] := \sum_{\tau} f[t - \tau]\, g[\tau]$
1D Convolution $(f * g)[t] := \sum_{\tau} f[t - \tau]\, g[\tau]$
• Flip the filter and slide it along the signal
• [Figure: signal $f = [1, 3, 2, -1]$, filter $g = [1, 2, 1]$; the flipped filter slides over $f$, producing $(f * g)[t]$ step by step]
  - $0\cdot1 + 0\cdot2 + 1\cdot1 = 1$
  - $0\cdot1 + 1\cdot2 + 3\cdot1 = 5$
  - $1\cdot1 + 3\cdot2 + 2\cdot1 = 9$
  - $3\cdot1 + 2\cdot2 + (-1)\cdot1 = 6$
  - $2\cdot1 + (-1)\cdot2 + 0\cdot1 = 0$
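The step-by-step computation above can be checked with a direct implementation of the discrete convolution (a plain-Python sketch; the signal and filter values are taken from the slides):

```python
def conv1d(f, g):
    """Full discrete convolution: (f*g)[t] = sum_tau f[t - tau] * g[tau]."""
    n = len(f) + len(g) - 1
    out = []
    for t in range(n):
        s = 0
        for tau in range(len(g)):
            if 0 <= t - tau < len(f):  # treat out-of-range samples as zero
                s += f[t - tau] * g[tau]
        out.append(s)
    return out

f = [1, 3, 2, -1]   # signal from the slides
g = [1, 2, 1]       # filter from the slides
print(conv1d(f, g))  # -> [1, 5, 9, 6, 0, -1]
```

The first five entries match the five positions worked out on the slides; the full convolution also includes the final position where only the last sample of $f$ overlaps the filter.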
1D Convolution
• Example
[Animation: Convolution – Wikipedia]
1D Convolution
• Gaussian filter
[Figure: input signal $f(t)$, Gaussian filter $g(t)$, and the smoothed output $f(t) * g(t)$]
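Gaussian filtering is just the 1D convolution above with a bell-shaped filter. A minimal sketch (the signal, sigma, and radius are illustrative choices, not from the slides):

```python
import math

def gaussian_kernel(sigma, radius):
    """Discrete Gaussian filter g[t], normalized so its weights sum to 1."""
    g = [math.exp(-(t * t) / (2 * sigma * sigma)) for t in range(-radius, radius + 1)]
    s = sum(g)
    return [v / s for v in g]

def smooth(f, g):
    """'Same'-size convolution of signal f with a symmetric filter g."""
    r = len(g) // 2
    out = []
    for t in range(len(f)):
        s = 0.0
        for k, w in enumerate(g):
            idx = t + k - r
            if 0 <= idx < len(f):
                s += f[idx] * w
        out.append(s)
    return out

g = gaussian_kernel(sigma=1.0, radius=2)
noisy = [0, 0, 1, 5, 1, 0, 0]
print(smooth(noisy, g))  # the sharp peak is spread out over its neighbours
```

Because the weights sum to 1, smoothing preserves the overall signal level while damping sharp peaks.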
2D Convolution

$(f * g)(s, t) := \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} f(s - \tau_1, t - \tau_2)\, g(\tau_1, \tau_2)\, d\tau_1\, d\tau_2$

$(f * g)[s, t] := \sum_{\tau_1} \sum_{\tau_2} f[s - \tau_1, t - \tau_2]\, g[\tau_1, \tau_2]$
2D Convolution
• One input channel, e.g. a grayscale image
• Padding=1, stride=1
• [Figure: a 3×3 filter slides over the zero-padded 5×5 input; the first output values are 16, 28, 24, …]
2D Convolution
[Figure: the input convolved with a set of filters produces the feature maps]
2D Convolution
• Input_channel=1, output_channel=1, padding=1, stride=1
• [Figure: the 3×3 filter slides over the zero-padded input; the first output row begins 16, 28, 24, …]
2D Convolution
• Input_channel=1, output_channel=1, padding=1, stride=1

Input (5×5, zero-padded to 7×7):
1 3 2 3 3
3 1 2 1 1
3 3 3 1 2
2 2 1 2 1
2 3 2 1 2

Filter (3×3):
1 3 2
1 3 3
3 1 1

Output (5×5):
16 28 24 28 16
27 41 38 37 21
33 40 33 25 18
32 40 37 29 17
25 27 21 20 12
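The full example can be reproduced with a direct implementation (a plain-Python sketch; note that, as is conventional in CNNs, the filter is not flipped, so this is technically cross-correlation):

```python
def conv2d(x, w, padding=1, stride=1):
    """2D convolution (CNN convention, no filter flip) with zero padding."""
    h, wi = len(x), len(x[0])
    k = len(w)
    p = padding
    # zero-pad the input on all sides
    xp = [[0] * (wi + 2 * p) for _ in range(h + 2 * p)]
    for i in range(h):
        for j in range(wi):
            xp[i + p][j + p] = x[i][j]
    oh = (h + 2 * p - k) // stride + 1
    ow = (wi + 2 * p - k) // stride + 1
    out = [[0] * ow for _ in range(oh)]
    for i in range(oh):
        for j in range(ow):
            out[i][j] = sum(
                xp[i * stride + a][j * stride + b] * w[a][b]
                for a in range(k) for b in range(k)
            )
    return out

x = [[1, 3, 2, 3, 3],
     [3, 1, 2, 1, 1],
     [3, 3, 3, 1, 2],
     [2, 2, 1, 2, 1],
     [2, 3, 2, 1, 2]]
w = [[1, 3, 2],
     [1, 3, 3],
     [3, 1, 1]]
print(conv2d(x, w, padding=1, stride=1)[0])  # -> [16, 28, 24, 28, 16]
```

The same function with `stride=2` reproduces the 3×3 strided output, and with `padding=2` the 7×7 output shown on the following slides.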
2D Convolution
• Input_channel=1, output_channel=1, padding=1, stride=2
• [Figure: the filter now moves two pixels at a time; the output values 16, 24, 16, 33, … are produced one by one]
2D Convolution
• Input_channel=1, output_channel=1, padding=1, stride=2

Same 5×5 input (zero-padded) and 3×3 filter as before. Output (3×3):
16 24 16
33 33 18
25 21 12
2D Convolution
• Input_channel=1, output_channel=1, padding=2, stride=1

Same 5×5 input, zero-padded to 9×9, same 3×3 filter. Output (7×7):
 1  4  8 14 12 12  9
 6 16 28 24 28 16  6
14 27 41 38 37 21 10
17 33 40 33 25 18  6
14 32 40 37 29 17  9
10 25 27 21 20 12  3
 4 12 15 11  9  7  2
2D Convolution
• Input_channel=3, output_channel=1, padding=1, stride=1
• [Figure: three zero-padded input channels; one filter with three channels slides over all of them, and the per-channel results are summed into a single output feature map]
2D Convolution
• Input_channel=3, output_channel=2, padding=1, stride=1
• [Figure: two filters, each with three channels, slide over the three zero-padded input channels; each filter produces one of the two output feature maps]
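With multiple channels the filter bank has shape (output_channels, input_channels, k, k): each output channel has its own filter, and each filter spans all input channels. A quick parameter count for the configurations on these slides (bias terms included, one per output channel, which is the usual convention):

```python
def conv_params(c_in, c_out, k, bias=True):
    """Number of learnable parameters in a 2D convolution layer."""
    n = c_out * c_in * k * k  # one k x k kernel per (input, output) channel pair
    if bias:
        n += c_out            # one bias per output channel
    return n

print(conv_params(3, 1, 3))    # -> 28    (1*3*3*3 weights + 1 bias)
print(conv_params(3, 2, 3))    # -> 56    (2*3*3*3 weights + 2 biases)
print(conv_params(64, 64, 3))  # -> 36928 (64*64*3*3 weights + 64 biases)
```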
2D Convolution
• Input_channel=64, output_channel=64, kernel_size=3, padding=1, stride=1
• [Figure: 64 filters, each of size 64×3×3; each filter produces one of the 64 output feature maps]
Convolutions in PyTorch
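A minimal sketch of the layer above in PyTorch (the batch size and 32×32 spatial size are illustrative choices, not from the slides):

```python
import torch
import torch.nn as nn

# convolution layer from the previous slides: 64 -> 64 channels, 3x3 kernel
conv = nn.Conv2d(in_channels=64, out_channels=64,
                 kernel_size=3, padding=1, stride=1)

x = torch.randn(1, 64, 32, 32)  # (batch, channels, height, width)
y = conv(x)
print(y.shape)  # padding=1 with a 3x3 kernel and stride=1 preserves spatial size
```

The layer's weight tensor has shape (output_channels, input_channels, kernel_size, kernel_size), matching the filter-bank picture on the previous slides.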
Max Pooling
• Takes the maximum value within each window
• Used to reduce the size of feature maps
• Example) stride=2, padding=1, 3×3 window

Input (5×5, zero-padded to 7×7):
1 3 2 3 3
3 1 2 1 1
3 3 3 1 2
2 2 1 2 1
2 3 2 1 2

[Figure: the 3×3 window slides with stride 2; the first output row is 3, 3, 3]
Max Pooling in PyTorch
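The pooling example above can be reproduced directly in PyTorch (a minimal sketch using the same 5×5 input as the previous slides):

```python
import torch
import torch.nn as nn

pool = nn.MaxPool2d(kernel_size=3, stride=2, padding=1)

# the 5x5 input from the max pooling example
x = torch.tensor([[1, 3, 2, 3, 3],
                  [3, 1, 2, 1, 1],
                  [3, 3, 3, 1, 2],
                  [2, 2, 1, 2, 1],
                  [2, 3, 2, 1, 2]], dtype=torch.float32)

y = pool(x.unsqueeze(0).unsqueeze(0))  # add batch and channel dimensions
print(y.squeeze())  # 3x3 output: the feature map is halved in each dimension
```

Note one subtlety: `nn.MaxPool2d` pads with negative infinity rather than zeros, which makes no difference here because all inputs are positive.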
AlexNet
[Figure: AlexNet architecture; Dropout is applied in the fully connected layers]
(Understanding AlexNet | LearnOpenCV)
Fully Connected Layer vs Convolutional Layer
• Translation equivariance and parameter sharing
• Fully connected: a 32-channel 64×64 feature map is flattened into a vector of $32 \cdot 64 \cdot 64 = 131072$ values, multiplied as $Wx$ with $W \in \mathbb{R}^{131072 \times 131072}$, then reshaped back
[Figure: Flatten → FC Layer → Reshape]
Fully Connected Layer vs Convolutional Layer
• Translation equivariance and parameter sharing
• Convolutional: a 3×3 convolution mapping 32 channels to 32 channels reuses the same weights $W \in \mathbb{R}^{32 \times 32 \times 3 \times 3}$ at every spatial location of the 64×64 feature map
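The parameter-sharing argument above is easy to quantify with the numbers from the slides:

```python
# Fully connected layer on the flattened 32x64x64 feature map
d = 32 * 64 * 64                 # 131072 values after flattening
fc_params = d * d                # weight matrix W in R^{131072 x 131072}

# 3x3 convolution, 32 -> 32 channels, shared across all spatial positions
conv_params = 32 * 32 * 3 * 3    # W in R^{32 x 32 x 3 x 3}

print(fc_params)    # -> 17179869184 (~17 billion parameters)
print(conv_params)  # -> 9216
```

The convolutional layer uses roughly a million times fewer parameters, precisely because the same small filter is shared across every spatial location.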
Visualization of Learned Filter
• First layer conv filters
Applied Deep Learning - Part 4: Convolutional Neural Networks | by Arden Dertat | Towards Data Science
Visualization of Learned Feature Maps
Applied Deep Learning - Part 4: Convolutional Neural Networks | by Arden Dertat | Towards Data Science
ImageNet Large Scale Visual Recognition Challenge
(ILSVRC)
ILSVRC
• ImageNet is an image database organized according to the WordNet
hierarchy (nouns)
• 1000 object classes
• About 1.2M training images, 50K validation images, 100K test images
ILSVRC winners by year:
• 2012: AlexNet
• 2013: ZFNet
• 2014: GoogLeNet (VGGNet runner-up)
• 2015: ResNet
• 2016: Trimps-Soushen (Inception + WRN)
• 2017: SENet
Architecture comparison of AlexNet, VGGNet, ResNet, Inception, DenseNet | by Khush Patel | Towards Data Science
VGGNet
[Figure: VGG architecture]
Architecture comparison of AlexNet, VGGNet, ResNet, Inception, DenseNet | by Khush Patel | Towards Data Science
GoogLeNet
• Winner of ILSVRC 2014
• Also called ‘Inception’
• [Figure: Inception module — parallel convolutions and max pooling whose outputs are joined by concatenation]
Dropout
• A simple way to train deep neural networks for improving generalization performance
• Avoiding co-adaptation: a hidden unit cannot rely on other hidden units being present
• Model averaging
Improving neural networks by preventing co-adaptation of feature detectors, hinton et al, arXiv 2012
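The idea can be sketched in a few lines. This is the now-standard "inverted" dropout variant, which scales the surviving units by $1/(1-p)$ at training time so that no rescaling is needed at test time (the original paper instead scales the weights at test time):

```python
import random

def dropout(x, p, training=True):
    """Inverted dropout: zero each unit with probability p, scale survivors."""
    if not training or p == 0.0:
        return list(x)  # identity at test time
    return [0.0 if random.random() < p else v / (1.0 - p) for v in x]

random.seed(0)
h = [0.5, -1.2, 0.8, 2.0, -0.3]
print(dropout(h, p=0.5))  # each unit is either zeroed or doubled
```

Because each unit is randomly absent during training, no unit can rely on specific other units being present.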
Stochastic Depth (a.k.a. DropPath)
• Train short networks, then use the full deep network at test time
• During training, randomly drop a subset of layers and bypass them with the identity function
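A simplified sketch of one residual block with stochastic depth (scalar activations for clarity; the published method additionally rescales the residual branch by its survival probability at test time, which is omitted here):

```python
import random

def stochastic_depth_block(x, residual_fn, survival_prob, training=True):
    """Residual block that is sometimes skipped entirely during training."""
    if training and random.random() > survival_prob:
        return x                 # block dropped: identity shortcut only
    return x + residual_fn(x)    # x_{l+1} = x_l + f(x_l)

random.seed(0)
y = stochastic_depth_block(1.0, lambda v: 0.5 * v, survival_prob=0.8)
print(y)  # either 1.0 (block dropped) or 1.5 (block kept)
```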
Batch Normalization
• Normalize each feature $x_1, x_2, \dots$ over the $N$ examples in a batch:

$\mu_1 = \frac{1}{N}\sum_{i=1}^{N} x_1^{(i)}$ (mean), then center: $x_1^{(i)} := x_1^{(i)} - \mu_1$

$\sigma_1^2 = \frac{1}{N}\sum_{i=1}^{N} \big(x_1^{(i)}\big)^2$ (variance of the centered values), then scale: $x_1^{(i)} := x_1^{(i)} / \sigma_1$

After normalization each feature is standardized: $x_1 \sim N(0, 1)$, $x_2 \sim N(0, 1)$.

In general, with mean $\mu$ and standard deviation $\sigma$:

$z = \dfrac{x - \mu}{\sigma}, \qquad z \sim N(0, 1)$
Batch Normalization
• When inputs are un-normalized, the loss surface is more skewed (elongated)
• This happens when the input feature scales differ greatly from one another
Batch Normalization
• Normalizing inputs (and also hidden units) based on mini-batch statistics
• Computing mean and variance from the current batch
• During testing, the batch may be too small for reliable statistics (e.g. batch size 1), so we instead use the mean and variance accumulated during the training phase
Batch normalization: accelerating deep network training by reducing internal covariate shift, Ioffe et al, ICML 2015
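The core normalization step can be sketched in a few lines (scalar features for clarity; the learnable scale and shift parameters $\gamma, \beta$ of the full method are omitted, and the batch values are illustrative):

```python
import math

def batch_norm(xs, eps=1e-5):
    """Normalize a batch of scalars using its own mean and variance."""
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / n
    return [(x - mu) / math.sqrt(var + eps) for x in xs]

batch = [10.0, 12.0, 9.0, 14.0, 11.0]
z = batch_norm(batch)
# after normalization the batch has (approximately) zero mean and unit variance
```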
Batch Normalization
Batch normalization: accelerating deep network training by reducing internal covariate shift, Ioffe et al, ICML 2015
Batch Normalization in CNN
$\mu \in \mathbb{R}^{?}, \quad \sigma^2 \in \mathbb{R}^{?}$
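For the question on the slide: in a CNN, batch-norm statistics are computed per channel, averaging over the batch and both spatial dimensions, so $\mu, \sigma^2 \in \mathbb{R}^{C}$. A sketch with plain nested lists (the batch, channel, and spatial sizes are illustrative):

```python
def channel_mean(x):
    """Per-channel mean of a batch of feature maps x[n][c][h][w]."""
    n, c = len(x), len(x[0])
    h, w = len(x[0][0]), len(x[0][0][0])
    mu = [0.0] * c
    for ch in range(c):
        total = 0.0
        for img in x:           # average over the batch...
            for row in img[ch]:  # ...and over both spatial dimensions
                total += sum(row)
        mu[ch] = total / (n * h * w)
    return mu

# batch of 2 images, 3 channels, 2x2 spatial; channel c is filled with value c
x = [[[[c] * 2 for _ in range(2)] for c in range(3)] for _ in range(2)]
print(channel_mean(x))  # -> [0.0, 1.0, 2.0]: one statistic per channel
```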
Why Batch Normalization Works?
1. Normalization usually makes loss surface less ‘skewed’
2. BN may reduce the internal covariate shift
• [1502.03167] Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (arxiv.org)
$\mu^l = \frac{1}{H}\sum_{i=1}^{H} a_i^l, \qquad \sigma^l = \sqrt{\frac{1}{H}\sum_{i=1}^{H} \big(a_i^l - \mu^l\big)^2}$