0% found this document useful (0 votes)

290 views

Understanding of Convolutional Neural Network (CNN) - Deep Learning

Convolutional neural networks (CNNs) are a type of neural network used for image recognition and classification. CNNs process images by passing them through multiple convolutional and pooling layers to extract features, followed by one or more fully connected layers to classify the image. Key aspects of CNNs include the use of filters in convolutional layers to detect features in images like edges or shapes, pooling layers that reduce the spatial size of representations, and multiple convolutional layers allowing the network to learn hierarchical representations of images. CNNs have achieved human-level performance on many image recognition tasks.

Uploaded by

Kashaf Bakali

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

290 views

Understanding of Convolutional Neural Network (CNN) - Deep Learning

Uploaded by

Kashaf Bakali

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Understanding of Convolutional

Neural Network (CNN) — Deep

Learning
Prabhu
Mar 4, 2018 · 5 min read

In neural networks, Convolutional neural network (ConvNets or CNNs) is one of the

main categories to do images recognition, images classifications. Objects detections,
recognition faces etc., are some of the areas where CNNs are widely used.

CNN image classifications takes an input image, process it and classify it under certain
categories (Eg., Dog, Cat, Tiger, Lion). Computers sees an input image as array of pixels
and it depends on the image resolution. Based on the image resolution, it will see h x w
x d( h = Height, w = Width, d = Dimension ). Eg., An image of 6 x 6 x 3 array of
matrix of RGB (3 refers to RGB values) and an image of 4 x 4 x 1 array of matrix of
grayscale image.

Figure 1 : Array of RGB Matrix

Technically, deep learning CNN models to train and test, each input image will pass it
through a series of convolution layers with filters (Kernals), Pooling, fully connected
layers (FC) and apply Softmax function to classify an object with probabilistic values
between 0 and 1. The below figure is a complete flow of CNN to process an input image
and classifies the objects based on values.

Figure 2 : Neural network with many convolutional layers

Convolution Layer

Convolution is the first layer to extract features from an input image. Convolution
preserves the relationship between pixels by learning image features using small
squares of input data. It is a mathematical operation that takes two inputs such as
image matrix and a filter or kernal

Figure 3: Image matrix multiplies kernel or lter matrix

Consider a 5 x 5 whose image pixel values are 0, 1 and filter matrix 3 x 3 as shown in
below
Figure 4: Image matrix multiplies kernel or lter matrix

Then the convolution of 5 x 5 image matrix multiplies with 3 x 3 filter matrix which is
called “Feature Map” as output shown in below

Figure 5: 3 x 3 Output matrix

Convolution of an image with different filters can perform operations such as edge
detection, blur and sharpen by applying filters. The below example shows various
convolution image after applying different types of filters (Kernels).
Figure 7 : Some common lters

Strides

Stride is the number of pixels shifts over the input matrix. When the stride is 1 then we
move the filters to 1 pixel at a time. When the stride is 2 then we move the filters to 2
pixels at a time and so on. The below figure shows convolution would work with a
stride of 2.

Figure 6 : Stride of 2 pixels

Padding

Sometimes filter does not fit perfectly fit the input image. We have two options:

Pad the picture with zeros (zero-padding) so that it fits

Drop the part of the image where the filter did not fit. This is called valid padding
which keeps only valid part of the image.

Non Linearity (ReLU)

ReLU stands for Rectified Linear Unit for a non-linear operation. The output is ƒ(x) =
max(0,x).
Why ReLU is important : ReLU’s purpose is to introduce non-linearity in our ConvNet.
Since, the real world data would want our ConvNet to learn would be non-negative
linear values.

Figure 7 : ReLU operation

There are other non linear functions such as tanh or sigmoid can also be used instead of
ReLU. Most of the data scientists uses ReLU since performance wise ReLU is better than
other two.

Pooling Layer

Pooling layers section would reduce the number of parameters when the images are
too large. Spatial pooling also called subsampling or downsampling which reduces the
dimensionality of each map but retains the important information. Spatial pooling can
be of different types:

Max Pooling

Average Pooling

Sum Pooling

Max pooling take the largest element from the rectified feature map. Taking the largest
element could also take the average pooling. Sum of all elements in the feature map
call as sum pooling.
Figure 8 : Max Pooling

Fully Connected Layer

The layer we call as FC layer, we flattened our matrix into vector and feed it into a fully
connected layer like neural network.

Figure 9 : After pooling layer, attened as FC layer

In the above diagram, feature map matrix will be converted as vector (x1, x2, x3, …).
With the fully connected layers, we combined these features together to create a
model. Finally, we have an activation function such as softmax or sigmoid to classify
the outputs as cat, dog, car, truck etc.,
Figure 10 : Complete CNN architecture

Summary

Provide input image into convolution layer

Choose parameters, apply filters with strides, padding if requires. Perform

convolution on the image and apply ReLU activation to the matrix.

Perform pooling to reduce dimensionality size

Add as many convolutional layers until satisfied

Flatten the output and feed into a fully connected layer (FC Layer)

Output the class using an activation function (Logistic Regression with cost
functions) and classifies images.

In the next post, I would like to talk about some popular CNN architectures such as
AlexNet, VGGNet, GoogLeNet and ResNet.

References :

https://www.mathworks.com/discovery/convolutional-neural-network.html

https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-
Understanding-Convolutional-Neural-Networks/

https://ujjwalkarn.me/2016/08/11/intuitive-explanation-convnets/

https://blog.datawow.io/interns-explain-cnn-8a669d053f8b.

Machine Learning Cnn Convolution Neural Net Image Recognition Neural Networks

About Help Legal

HOIS OGTC Guidance Notes For HOIS-RP-103 v1
100% (2)
HOIS OGTC Guidance Notes For HOIS-RP-103 v1
98 pages
Session 1
0% (1)
Session 1
13 pages
Computer Vision55
100% (1)
Computer Vision55
268 pages
Deep Learning Notes Andrew NG
No ratings yet
Deep Learning Notes Andrew NG
54 pages
Digital Image Processing Solution of Homework
No ratings yet
Digital Image Processing Solution of Homework
7 pages
Homework 4
No ratings yet
Homework 4
3 pages
Order Flow Analysis
No ratings yet
Order Flow Analysis
28 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
No ratings yet
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
73 pages
Deep Learning CNN
No ratings yet
Deep Learning CNN
204 pages
Fully Convolutional Neural Network
No ratings yet
Fully Convolutional Neural Network
7 pages
Data Augmentation Techniques I
No ratings yet
Data Augmentation Techniques I
23 pages
Plant Disease Identification
No ratings yet
Plant Disease Identification
17 pages
22 Selected Top Papers On Deep Learning
No ratings yet
22 Selected Top Papers On Deep Learning
393 pages
Image Caption Generator
No ratings yet
Image Caption Generator
13 pages
CNN 1
No ratings yet
CNN 1
23 pages
Generative Adversarial Networks (GANs)
No ratings yet
Generative Adversarial Networks (GANs)
51 pages
Ebook Deep Learning Objective Type Questions
No ratings yet
Ebook Deep Learning Objective Type Questions
102 pages
Object Detection Using Image Processing
No ratings yet
Object Detection Using Image Processing
17 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
35 pages
Image Segmentation
No ratings yet
Image Segmentation
80 pages
Object Detection Using Deep Learning
No ratings yet
Object Detection Using Deep Learning
45 pages
Introduction To Image Processing and Computer Vision 2 PDF
100% (2)
Introduction To Image Processing and Computer Vision 2 PDF
179 pages
Deep Learning
100% (1)
Deep Learning
49 pages
Image Processing-Chapter 1
No ratings yet
Image Processing-Chapter 1
8 pages
Image Compression Using DCT
100% (1)
Image Compression Using DCT
10 pages
Emotion Detection
No ratings yet
Emotion Detection
23 pages
Feature Engineering Handout
No ratings yet
Feature Engineering Handout
33 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
47 pages
Face Detection & Emotion Recognition
No ratings yet
Face Detection & Emotion Recognition
26 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
66 pages
Machine Learning
No ratings yet
Machine Learning
29 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
24 pages
Convolutional Neural Networks (Part I)
No ratings yet
Convolutional Neural Networks (Part I)
61 pages
Tutorial Math Deep Learning 2018 PDF
No ratings yet
Tutorial Math Deep Learning 2018 PDF
103 pages
Deep Learning Patterns and Practices 1st Edition Andrew Ferlitsch 2024 scribd download
100% (3)
Deep Learning Patterns and Practices 1st Edition Andrew Ferlitsch 2024 scribd download
40 pages
A Survey of Graph Neural Networks in Various Learning Paradigms Methods, Applications, and Challenges
No ratings yet
A Survey of Graph Neural Networks in Various Learning Paradigms Methods, Applications, and Challenges
70 pages
How To Develop LSTM Models For Time Series Forecasting
100% (1)
How To Develop LSTM Models For Time Series Forecasting
188 pages
Deep Learning For Image Processing Using MATLAB
No ratings yet
Deep Learning For Image Processing Using MATLAB
19 pages
(Advances in Computer Vision and Pattern Recognition) Ke Gu, Hongyan Liu, Chengxu Zhou - Quality Assessment of Visual Content-Springer (2022)
No ratings yet
(Advances in Computer Vision and Pattern Recognition) Ke Gu, Hongyan Liu, Chengxu Zhou - Quality Assessment of Visual Content-Springer (2022)
256 pages
ANN Matlab
No ratings yet
ANN Matlab
13 pages
Deep Learning
100% (4)
Deep Learning
100 pages
Image Captioning Using CNN & LSTM: Digital Signal Processing Laboratory (EEE - 316)
No ratings yet
Image Captioning Using CNN & LSTM: Digital Signal Processing Laboratory (EEE - 316)
24 pages
Computer Vision & Image Processing
67% (3)
Computer Vision & Image Processing
34 pages
Final Year Project Thesis
No ratings yet
Final Year Project Thesis
25 pages
Basics of Deep Learning
100% (1)
Basics of Deep Learning
17 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
Deep Reinforcement Learning: Overcoming The Challenges of Deep Learning in Discrete and Continuous Markov Decision Processes
No ratings yet
Deep Reinforcement Learning: Overcoming The Challenges of Deep Learning in Discrete and Continuous Markov Decision Processes
110 pages
JNTUK R20 B.Tech CSE 3-2 Machine Learning Unit 3 Notes
No ratings yet
JNTUK R20 B.Tech CSE 3-2 Machine Learning Unit 3 Notes
21 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
Vision Systems Applications PDF
No ratings yet
Vision Systems Applications PDF
618 pages
Digital Image Processing Segmntation Lab With Python
No ratings yet
Digital Image Processing Segmntation Lab With Python
9 pages
Graph Neural Networks
100% (1)
Graph Neural Networks
27 pages
Speech Emotion Recognition With Deep Learning
No ratings yet
Speech Emotion Recognition With Deep Learning
10 pages
An Analysis of Convolutional Neural Network Architectures
No ratings yet
An Analysis of Convolutional Neural Network Architectures
54 pages
Deep Learning
No ratings yet
Deep Learning
5 pages
Deep Reinforcement Learning
100% (1)
Deep Reinforcement Learning
410 pages
Object Detection - Week 1 - Object Detection in 20 Years - Final
No ratings yet
Object Detection - Week 1 - Object Detection in 20 Years - Final
280 pages
Deep Learning Notes
100% (1)
Deep Learning Notes
16 pages
Effective Amazon Machine Learning
From Everand
Effective Amazon Machine Learning
Alexis Perrier
No ratings yet
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
From Everand
Connectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control
Sebastian Thelen
5/5 (1)
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
From Everand
Machine Learning with Python: Design and Develop Machine Learning and Deep Learning Technique using real world code examples
Abhishek Vijayvargia
No ratings yet
Intelligent Waste Separator PDF
No ratings yet
Intelligent Waste Separator PDF
15 pages
Understanding Backpropagation Algorithm - Towards Data Science
No ratings yet
Understanding Backpropagation Algorithm - Towards Data Science
11 pages
Turn Python Scripts Into Beautiful ML Tools - Towards Data Science PDF
No ratings yet
Turn Python Scripts Into Beautiful ML Tools - Towards Data Science PDF
14 pages
Turn Python Scripts Into Beautiful ML Tools - Towards Data Science
100% (1)
Turn Python Scripts Into Beautiful ML Tools - Towards Data Science
14 pages
Introduction To Machine Learning Top-Down Approach - Towards Data Science
No ratings yet
Introduction To Machine Learning Top-Down Approach - Towards Data Science
6 pages
Evaluated Kinetic Data For Combustion Modeling
No ratings yet
Evaluated Kinetic Data For Combustion Modeling
642 pages
Outliers in Time Series Data
No ratings yet
Outliers in Time Series Data
8 pages
Mars PDF
No ratings yet
Mars PDF
15 pages
Pre-Service Teachers Competency and Behavior in Teaching Ibong Adarna: Input For Instructional Program
No ratings yet
Pre-Service Teachers Competency and Behavior in Teaching Ibong Adarna: Input For Instructional Program
24 pages
Syllabus Geography
No ratings yet
Syllabus Geography
41 pages
Statistics and Data Collection
100% (1)
Statistics and Data Collection
11 pages
Mids 21
No ratings yet
Mids 21
10 pages
What Statistical Analysis Should I Use?: Sunday, June 4, 2017 04:22 AM
No ratings yet
What Statistical Analysis Should I Use?: Sunday, June 4, 2017 04:22 AM
364 pages
Analytical Chemistry Finals Reviewer
No ratings yet
Analytical Chemistry Finals Reviewer
10 pages
Classification With Logistic Regression, Newton's Method For Optimization, Generalized Linear Models
No ratings yet
Classification With Logistic Regression, Newton's Method For Optimization, Generalized Linear Models
55 pages
SENIOR HIGH SCHOOL-Practical Research 1: A. Most Essential Learning Competency (MELC)
No ratings yet
SENIOR HIGH SCHOOL-Practical Research 1: A. Most Essential Learning Competency (MELC)
6 pages
Download full Introduction to Statistical Methods Design of Experiments and Statistical Quality Control Dharmaraja Selvamuthu ebook all chapters
100% (1)
Download full Introduction to Statistical Methods Design of Experiments and Statistical Quality Control Dharmaraja Selvamuthu ebook all chapters
55 pages
Chapter - 13 Probability Topic: Conditional Probability
No ratings yet
Chapter - 13 Probability Topic: Conditional Probability
7 pages
Practice Assignment 1.1 - Not Graded
No ratings yet
Practice Assignment 1.1 - Not Graded
7 pages
Statistical Analysis in JASP 2024
No ratings yet
Statistical Analysis in JASP 2024
189 pages
Level 2 Quants Notes
No ratings yet
Level 2 Quants Notes
7 pages
PSY Final Exam PDF
No ratings yet
PSY Final Exam PDF
4 pages
Matematik Öğretmen Adaylarının Cebirsel, Uzamsal, Olasılıksal ve Orantısal
No ratings yet
Matematik Öğretmen Adaylarının Cebirsel, Uzamsal, Olasılıksal ve Orantısal
16 pages
Dmbi Assignment 3
No ratings yet
Dmbi Assignment 3
5 pages
Quantitative Analysis Technical Terms
No ratings yet
Quantitative Analysis Technical Terms
4 pages
Beine Docquier Rapoport 2008
No ratings yet
Beine Docquier Rapoport 2008
22 pages
K Nearest Neighbor Based Model For Intrusion Detection System
No ratings yet
K Nearest Neighbor Based Model For Intrusion Detection System
5 pages
Specification Accredited A Level Gce Mathematics B Mei h640
No ratings yet
Specification Accredited A Level Gce Mathematics B Mei h640
92 pages
Pastor Stambaugh
No ratings yet
Pastor Stambaugh
54 pages
Common Method Variance
No ratings yet
Common Method Variance
8 pages
Faktor-Faktor Yang Berhubungan Dengan Kunjungan Ibu Balita Ke Posyandu Di Wilayah Kerja Puskesmas Banjarbaru Selatan TAHUN 2021
No ratings yet
Faktor-Faktor Yang Berhubungan Dengan Kunjungan Ibu Balita Ke Posyandu Di Wilayah Kerja Puskesmas Banjarbaru Selatan TAHUN 2021
16 pages
12.industrial Safety Engineering PDF
43% (7)
12.industrial Safety Engineering PDF
15 pages
IE 27 ProbSet 2 PDF
No ratings yet
IE 27 ProbSet 2 PDF
1 page