Neural Network (RNN & CNN)

Recurrent Neural Networks (RNNs) are designed for sequential data, allowing them to consider previous inputs, making them effective for tasks like language translation and speech recognition. In contrast, Convolutional Neural Networks (CNNs) are optimized for spatial data, particularly images, using filters to identify patterns and requiring less preprocessing. While CNNs excel in image-related tasks, RNNs are better suited for temporal data analysis such as text and speech.


Neural Network

RNN
A recurrent neural network (RNN) is a type of artificial neural network that uses
sequential data or time-series data.
These deep learning algorithms are commonly used for ordinal or temporal problems,
such as language translation, natural language processing (NLP), speech recognition, and
image captioning; they are incorporated into popular applications such as Siri, voice
search, and Google Translate.
Like feedforward and convolutional neural networks (CNNs), recurrent neural
networks utilize training data to learn.
What makes an RNN so powerful is that it takes into consideration not just the
current input but also the previous inputs, which allows it to remember what
happened earlier in the sequence.
RNN
Naïve Bayes, for example, treats a sentence as a set of independent words, using
only the frequency of each word without considering the composition or order of the
words in the sentence, even though order makes a huge difference to the meaning of
a sentence.
 An RNN, unlike those classic algorithms, works well on sequence data because it takes
word i as input and combines it with the output computed for word i-1; the same
operation is then applied for word i+1, and so on.
 This is why it is called a recurrent neural network: the network applies the same
operations to each word i of the sentence.
Feed-Forward NN vs RNN
In a feed-forward neural network, the information only moves in one direction — from
the input layer, through the hidden layers, to the output layer. The information moves
straight through the network.
Feed-forward neural networks have no memory of the input they receive and are bad at
predicting what's coming next. Because a feed-forward network only considers the
current input, it has no notion of order in time. It simply can't remember anything about
what happened in the past except what it learned during training.
In an RNN, the information cycles through a loop. When the network makes a decision,
it considers the current input and also what it has learned from the inputs it received
previously.
RNN
An RNN is a special type of artificial neural network adapted to work with time-series
data or data that involves sequences. Ordinary feedforward neural networks are only
meant for data points that are independent of each other.
RNNs are better suited to analyzing temporal, sequential data, such as text or videos.
A CNN has a different architecture from an RNN. CNNs are "feed-forward neural
networks" that use filters and pooling layers, whereas RNNs feed results back into the
network
The main and most important feature of an RNN is the hidden state, which remembers
information about the sequence. This hidden state acts as a "memory" of what has been
calculated so far.
RNN
In the examples above, for each training instance (a sentence) we map each word to an
output: if the word is a name (John, Ellen, ...) we map it to 1; otherwise we map it to 0.
So, to train an RNN to recognize names within sentences, the RNN architecture would
look like the following.
RNN
For this training example we have 5 words, which means 5 time steps. At each step t we
calculate the activation a<t> and the prediction y<t> using the shared weights Wa, Wx,
Wy and biases ba, by:

a<t> = tanh(Wa a<t-1> + Wx x<t> + ba)
y<t> = sigmoid(Wy a<t> + by)
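A minimal sketch of this forward pass in NumPy (the toy sizes, random weights, and
input vectors are illustrative assumptions, not part of the original notes; the variable
names mirror Wa, Wx, Wy, ba, by above):

import numpy as np

def rnn_forward(x_seq, Wa, Wx, Wy, ba, by):
    # Run one forward pass over a sequence, reusing the same weights at every step.
    a = np.zeros(Wa.shape[0])                 # initial hidden state a<0>
    ys = []
    for x in x_seq:                           # one step t per word
        a = np.tanh(Wa @ a + Wx @ x + ba)     # a<t>
        y = 1 / (1 + np.exp(-(Wy @ a + by)))  # y<t>: sigmoid, e.g. P(word is a name)
        ys.append(y.item())
    return ys

# Toy sizes: 5 words, 10-dimensional word vectors, 16 hidden units, scalar output.
rng = np.random.default_rng(0)
n_x, n_a = 10, 16
Wa = rng.normal(0, 0.1, (n_a, n_a))
Wx = rng.normal(0, 0.1, (n_a, n_x))
Wy = rng.normal(0, 0.1, (1, n_a))
ba, by = np.zeros(n_a), np.zeros(1)
sentence = [rng.normal(size=n_x) for _ in range(5)]
print(rnn_forward(sentence, Wa, Wx, Wy, ba, by))  # 5 per-word predictions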
RNN
Backpropagation through time is like going back in time to compute the derivative of
the loss function with respect to the parameters.
After multiple iterations over several training examples, we are able to minimize the
loss function, and the predicted output converges to the real output. We can then use
the optimized weights to detect names in future sentences.
Convolutional Neural Network
The CNN is another type of neural network that can uncover key information in both
time series and image data. For this reason, it is highly valuable for image-related tasks,
such as image recognition, object classification and pattern recognition.
To identify patterns within an image, a CNN leverages principles from linear algebra,
such as matrix multiplication. CNNs can also classify audio and signal data.
CNN
A Convolutional Neural Network (ConvNet/CNN) is a Deep Learning algorithm that
can take in an input image, assign importance (learnable weights and biases) to various
aspects/objects in the image, and be able to differentiate one from the other.
 The pre-processing required for a ConvNet is much lower than for other
classification algorithms.
While in primitive methods filters are hand-engineered, with enough training,
ConvNets have the ability to learn these filters/characteristics.
Why CNN?
Regular artificial neural networks do not scale very well. For example, in CIFAR, a
dataset that is commonly used for training computer vision models, the images are only
of size 32x32 px and have 3 color channels. That means that a single fully-connected
neuron in the first hidden layer of such a network would have 32x32x3 = 3072
weights. That is still manageable. But now imagine a bigger image, for example
300x300x3: a single neuron would then have 270,000 weights, training of which
demands a great deal of computational power!
A huge neural network like that demands a lot of resources, but even then it remains
prone to overfitting, because the large number of parameters enables it to simply
memorize the dataset.
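To make the scaling concrete, a quick back-of-the-envelope check (a trivial sketch; the
helper name is made up for illustration):

# Weights needed by ONE fully-connected neuron in the first hidden layer:
# one weight per input value, i.e. width * height * channels.
def weights_per_neuron(width, height, channels=3):
    return width * height * channels

print(weights_per_neuron(32, 32))    # 3072   (CIFAR-sized image)
print(weights_per_neuron(300, 300))  # 270000 (larger image)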
CNN
Convolution is a mathematical operation that allows the merging of two sets of
information. In the case of a CNN, convolution is applied to the input data to filter the
information and produce a feature map.
CNNs use parameter sharing: all neurons in a particular feature map share weights,
which makes the whole system less computationally intensive.
CNN, RNN, MLP
CNN
Convolution is one of the main building blocks of a CNN. The term convolution refers
to the mathematical combination of two functions to produce a third function. It merges
two sets of information.
In the case of a CNN, the convolution is performed on the input data with the use of
a filter or kernel (these terms are used interchangeably) to then produce a feature map.
We execute a convolution by sliding the filter over the input. At every location, the
filter and the patch of input it covers are multiplied element-wise, and the sum of the
products is written to the feature map.
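A sketch of this sliding-window operation in NumPy (single channel, stride 1, no
padding; the example image and edge-detecting kernel are illustrative assumptions):

import numpy as np

def conv2d(image, kernel):
    # Slide the kernel over the image; at each location, multiply element-wise
    # and sum the products into one cell of the feature map.
    ih, iw = image.shape
    kh, kw = kernel.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1))  # the feature map shrinks
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + kh, j:j + kw]
            out[i, j] = np.sum(patch * kernel)
    return out

image = np.arange(25, dtype=float).reshape(5, 5)  # toy 5x5 input
kernel = np.array([[1., 0., -1.],
                   [1., 0., -1.],                 # a simple vertical-edge filter
                   [1., 0., -1.]])
print(conv2d(image, kernel).shape)  # (3, 3): a 5x5 input and a 3x3 filter give 3x3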
CNN
We perform numerous convolutions on our input, where each operation uses a different
filter. This results in different feature maps. In the end, we take all of these feature maps
and put them together as the final output of the convolution layer.
Just like any other Neural Network, we use an activation function to make our output
non-linear. In the case of a Convolutional Neural Network, the output of the convolution
will be passed through the activation function. This could be the ReLU activation
function.
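Continuing the NumPy sketch above, stacking the feature maps from several filters and
passing the result through ReLU (the second filter is an illustrative assumption):

filters = [kernel, kernel.T]        # e.g. vertical- and horizontal-edge filters
feature_maps = np.stack([conv2d(image, f) for f in filters])  # shape (2, 3, 3)
activated = np.maximum(0.0, feature_maps)  # ReLU: negative values become zero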
CNN
[Figures: a max pooling example, average pooling, and the resulting pooling output]
Because the size of the feature map is always smaller than the input, we have to do something to
prevent our feature map from shrinking. This is where we use padding.
A layer of zero-value pixels is added to surround the input with zeros, so that our feature map
will not shrink. In addition to keeping the spatial size constant after performing convolution,
padding also improves performance and makes sure the kernel and stride size will fit in the input.
After a convolution layer, it is common to add a pooling layer in between CNN layers. The
function of pooling is to continuously reduce the dimensionality to reduce the number of
parameters and computation in the network. This shortens the training time and
controls overfitting.
The most frequent type of pooling is max pooling, which takes the maximum value in each
window. These window sizes need to be specified beforehand. This decreases the feature map size
while at the same time keeping the significant information.
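A minimal max-pooling sketch in NumPy (2x2 non-overlapping windows; the example
feature map is an illustrative assumption):

import numpy as np

def max_pool(fmap, size=2):
    # Take the maximum of each non-overlapping size x size window.
    h, w = fmap.shape
    trimmed = fmap[:h - h % size, :w - w % size]
    return trimmed.reshape(h // size, size, w // size, size).max(axis=(1, 3))

fmap = np.array([[1., 3., 2., 1.],
                 [4., 6., 5., 0.],
                 [7., 2., 9., 8.],
                 [1., 0., 3., 4.]])
print(max_pool(fmap))  # [[6. 5.]
                       #  [7. 9.]]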
CNN
Padding. Padding expands the input matrix by adding fake pixels to the borders of the
matrix. This is done because convolution reduces the size of the matrix. For example, a
5x5 matrix turns into a 3x3 matrix when a 3x3 filter goes over it.
Striding. It often happens that when working with a convolutional layer, you need to
get an output that is smaller than the input. One way to achieve this is to use a pooling
layer. Another way is to use striding. The idea behind stride is to skip some positions
as the kernel slides over the input: for example, moving 2 or 3 pixels at a time instead
of 1. This reduces the spatial resolution and makes the network more computationally
efficient.
CNN
Thus, when using a CNN, the four important hyperparameters we have to decide on
are:
the kernel size
the filter count (that is, how many filters do we want to use)
the stride
the padding
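To illustrate where these hyperparameters appear in practice, a convolution layer in a
framework such as PyTorch takes them directly as arguments (this snippet assumes
PyTorch is installed and is not part of the original notes):

import torch
import torch.nn as nn

conv = nn.Conv2d(
    in_channels=3,    # e.g. an RGB image
    out_channels=16,  # filter count: how many filters / feature maps we want
    kernel_size=3,    # kernel size: 3x3 filters
    stride=1,         # stride: move the filter 1 pixel at a time
    padding=1,        # padding: one ring of zeros keeps a 3x3 kernel size-preserving
)
x = torch.randn(1, 3, 32, 32)  # one 32x32 RGB image
print(conv(x).shape)           # torch.Size([1, 16, 32, 32]) - spatial size preserved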
CNN vs RNN
CNN: Suitable for spatial data such as images.
RNN: Suitable for temporal data, also called sequential data.

CNN: Considered to be more powerful than RNN.
RNN: Includes less feature compatibility when compared to CNN.

CNN: Takes fixed-size inputs and generates fixed-size outputs.
RNN: Can handle arbitrary input/output lengths.

CNN: A type of feed-forward artificial neural network with variations of multilayer
perceptrons, designed to use minimal amounts of preprocessing.
RNN: Unlike feed-forward neural networks, can use its internal memory to process
arbitrary sequences of inputs.

CNN: Uses a connectivity pattern between the neurons inspired by the organization of
the animal visual cortex, whose individual neurons are arranged in such a way that they
respond to overlapping regions tiling the visual field.
RNN: Uses time-series information: what a user spoke last will impact what he/she
will speak next.

CNN: Ideal for image and video processing.
RNN: Ideal for text and speech analysis.
Example
Examples of CNNs in computer vision include face recognition, image classification, etc.

Applications of Recurrent Neural Networks:
 Prediction problems
 Machine Translation
 Speech Recognition
 Language Modelling and Generating Text
 Video Tagging
