Recurrent Neural Networks (RNNs)
• Recurrent neural networks are another type of neural network that is
dominating difficult machine learning problems involving sequences of
inputs.
• Recurrent Neural Networks (RNNs) are a special type of neural
network designed for “sequence problems”.
• Recurrent neural networks have connections that form loops,
adding feedback and memory to the network over time.
• This memory allows this type of network to learn and generalize
across sequences of inputs rather than individual patterns.
• A powerful type of Recurrent Neural Network is the Long Short-Term
Memory (LSTM) network.
• Use cases: a diverse array of problems, e.g., language translation and
automatic captioning of images and videos.
Support For Sequences in Neural
Networks
• Certain problem types are best framed with a sequence as either the
input or the output. Example → a univariate time series problem, such
as the price of a stock over time.
• This dataset can be framed as a prediction problem for a classical
feedforward Multilayer Perceptron network by defining a window size
(e.g., 5) and training the network to make short-term predictions
from the fixed-size window of inputs (see the windowing sketch below).
• This strategy would work, but is very limited. The window of inputs adds
memory to the problem, but is limited to just a fixed number of points
and must be chosen with sufficient knowledge of the problem.
• A naive window would not capture the broader trends over minutes,
hours and days that might be relevant to making a prediction. From one
prediction to the next, the network only knows about the specific inputs
it is provided.
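As a rough illustration of the fixed-window framing described above, the sketch below turns a univariate series into (input window, next value) pairs suitable for a feedforward network; the window size of 5 and the series values are arbitrary assumptions for the example.

```python
import numpy as np

def make_windows(series, window_size=5):
    """Frame a univariate series as (window, next-value) pairs
    for a feedforward (MLP-style) predictor."""
    X, y = [], []
    for i in range(len(series) - window_size):
        X.append(series[i:i + window_size])   # fixed-size input window
        y.append(series[i + window_size])     # value to predict
    return np.array(X), np.array(y)

# Example: a made-up "stock price" series
prices = np.array([10.0, 10.2, 10.1, 10.4, 10.6, 10.5, 10.9, 11.0])
X, y = make_windows(prices, window_size=5)
print(X.shape, y.shape)  # (3, 5) (3,) -> 3 training pairs, 5 inputs each
```

The fixed window is the only "memory" such a model gets, which is exactly the limitation noted above: anything outside the window is invisible to the network.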
Problems that involve sequences
Following is the taxonomy of sequence problems that require a
mapping of an input to an output:
✓ One-to-Many: sequence output, for image captioning.
✓ Many-to-One: sequence input, for sentiment classification.
✓ Many-to-Many: sequence in and out, for machine translation.
✓ Synchronized Many-to-Many: synced sequences in and out,
for video classification.
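As a rough sketch of how these four patterns differ in practice, the array shapes below are illustrative assumptions (batch size, sequence length, and feature sizes are made up) showing what the inputs and outputs of each mapping might look like:

```python
import numpy as np

batch, steps, features = 32, 10, 8   # assumed sizes for illustration

# One-to-Many (e.g., image captioning): one input, a sequence of outputs
x_one_to_many  = np.zeros((batch, features))          # single input per example
y_one_to_many  = np.zeros((batch, steps, features))   # output sequence

# Many-to-One (e.g., sentiment classification): sequence in, one label out
x_many_to_one  = np.zeros((batch, steps, features))
y_many_to_one  = np.zeros((batch, 1))

# Many-to-Many (e.g., machine translation): sequence in, sequence out
# (input and output lengths may differ)
x_many_to_many = np.zeros((batch, steps, features))
y_many_to_many = np.zeros((batch, steps + 2, features))

# Synchronized Many-to-Many (e.g., video classification): one output per input step
x_synced       = np.zeros((batch, steps, features))
y_synced       = np.zeros((batch, steps, 1))
```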
Challenges / Issues → Solutions
For recurrent networks to be effective on real problems, two major
issues had to be resolved:
1) How to train the network with Backpropagation?
2) How to stop gradients from vanishing or exploding during
training?
How to Train Recurrent Neural
Networks?
• Backpropagation breaks down in a recurrent neural network, because
of the recurrent or loop connections. This was addressed with a
modification of the Backpropagation technique called
Backpropagation Through Time or BPTT.
• Instead of performing Backpropagation on the recurrent network
directly, the structure of the network is unrolled over time, creating
copies of the neurons that have recurrent connections.
• For example: a single neuron with a connection to itself (A → A) could
be represented as two neurons with the same weight values (A → B).
This allows the cyclic graph of a recurrent neural network to be turned
into an acyclic graph like a classic feedforward neural network, and
Backpropagation can be applied.
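A minimal sketch of the unrolling idea, assuming a vanilla RNN cell with made-up sizes: the same weight matrices are reused at every time step, so the unrolled computation looks like a feedforward network whose "layers" all share weights, and ordinary Backpropagation can then be applied to it.

```python
import numpy as np

rng = np.random.default_rng(0)
input_size, hidden_size, steps = 4, 3, 5   # assumed sizes for illustration

# One set of weights, shared by every unrolled copy of the cell
W_xh = rng.normal(scale=0.1, size=(hidden_size, input_size))
W_hh = rng.normal(scale=0.1, size=(hidden_size, hidden_size))
b_h  = np.zeros(hidden_size)

x_seq = rng.normal(size=(steps, input_size))   # an input sequence
h = np.zeros(hidden_size)                      # initial state

# Unrolled forward pass: each loop iteration is one "copy" of the neuron
hidden_states = []
for t in range(steps):
    h = np.tanh(W_xh @ x_seq[t] + W_hh @ h + b_h)
    hidden_states.append(h)

# BPTT would now run standard Backpropagation backwards through these
# `steps` copies, summing the gradients for the shared W_xh, W_hh, b_h.
```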
How to Have Stable Gradients During
Training?
• When Backpropagation is used in very deep neural networks and
in unrolled recurrent neural networks, the gradients that are
calculated in order to update the weights can become unstable.
• They can become very large (the exploding gradient problem) or
very small (the vanishing gradient problem). These unstable gradients
are in turn used to update the weights in the network, making training
unstable and the network unreliable.
• This problem is alleviated in deep Multilayer Perceptron networks
through the use of the Rectifier transfer (activation) function.
• In recurrent neural network architectures, this problem has been
alleviated using a new type of architecture called the Long Short-
Term Memory (LSTM) network.
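As a rough numerical illustration of why unrolled networks are prone to this (the recurrent weight magnitudes here are arbitrary assumptions), the backpropagated gradient picks up one factor of the recurrent weight per unrolled step, so it shrinks or grows geometrically with sequence length:

```python
steps = 50

for w in (0.5, 1.5):                 # assumed recurrent weight magnitudes
    g = 1.0
    for _ in range(steps):
        g *= w                       # one factor of w per unrolled step
    print(f"|w| = {w}: gradient after {steps} steps ~ {g:.3e}")

# |w| = 0.5 -> ~8.882e-16  (vanishing gradient)
# |w| = 1.5 -> ~6.376e+08  (exploding gradient)
```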
Long Short-Term Memory Networks
• LSTM network is a recurrent neural network that is trained using
Backpropagation through Time and overcomes the vanishing
gradient problem.
• Instead of neurons, LSTM networks have memory blocks that are
connected into layers.
• A block has components that make it smarter than a classical
neuron and a memory for recent sequences. A block contains gates
that manage the block's state and output. A unit operates upon an
input sequence, and each gate within a unit uses the sigmoid
activation function to control whether it is triggered, making the
change of state and the addition of information flowing through the
unit conditional.
Contd…
There are three types of gates within a memory unit:
1. Forget Gate: conditionally decides what information to discard
from the unit.
2. Input Gate: conditionally decides which values from the input
should update the memory state.
3. Output Gate: conditionally decides what to output based on input
and the memory of the unit.
Each unit is like a mini state machine where the gates of the units
have weights that are learned during the training procedure.
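A minimal sketch of a single LSTM step, assuming made-up weight shapes and a NumPy-only implementation (real libraries fuse these operations), showing how the three sigmoid gates condition what is forgotten, stored, and output:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One LSTM memory-block step.
    W, U, b each hold parameters for the four internal transforms:
    forget gate (f), input gate (i), candidate state (g), output gate (o)."""
    f = sigmoid(W["f"] @ x + U["f"] @ h_prev + b["f"])   # what to discard
    i = sigmoid(W["i"] @ x + U["i"] @ h_prev + b["i"])   # what to write
    g = np.tanh(W["g"] @ x + U["g"] @ h_prev + b["g"])   # candidate values
    o = sigmoid(W["o"] @ x + U["o"] @ h_prev + b["o"])   # what to output
    c = f * c_prev + i * g          # updated memory (cell) state
    h = o * np.tanh(c)              # block output
    return h, c

# Example with assumed sizes: 4 inputs, 3 hidden units
rng = np.random.default_rng(0)
n_in, n_hid = 4, 3
W = {k: rng.normal(scale=0.1, size=(n_hid, n_in)) for k in "figo"}
U = {k: rng.normal(scale=0.1, size=(n_hid, n_hid)) for k in "figo"}
b = {k: np.zeros(n_hid) for k in "figo"}

h, c = np.zeros(n_hid), np.zeros(n_hid)
for x in rng.normal(size=(5, n_in)):      # walk over a 5-step input sequence
    h, c = lstm_step(x, h, c, W, U, b)
print(h)
```

The weights inside W, U, and b are exactly what is learned during training, which is why each unit behaves like a mini state machine whose transitions are tuned to the data.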