RNN and LSTM

Recurrent neural networks (RNNs) are neural networks that can process sequential data by incorporating information about previous elements in the sequence into the current state. Unlike feedforward neural networks, RNNs contain loops that allow information to persist. Long short-term memory (LSTM) networks are a type of RNN designed to avoid the long-term dependency problem by using gates to control the flow of information. RNNs and LSTMs are well-suited for tasks involving sequential data like natural language processing, speech recognition, and time series prediction.


Recurrent Neural Networks (RNN)

An RNN is a type of neural network architecture specifically designed to work with sequential data. Unlike traditional feedforward neural networks, RNNs have connections that create a loop, allowing information to be passed from one step of the sequence to the next.
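
As a minimal sketch (not from the slides; the sizes and weight names below are illustrative), the recurrence can be written in a few lines of NumPy: the same weights are applied at every step, and the hidden state h carries information forward.

    import numpy as np

    # Illustrative sizes, not taken from the slides.
    input_size, hidden_size, seq_len = 4, 8, 10
    rng = np.random.default_rng(0)

    W_xh = rng.normal(scale=0.1, size=(hidden_size, input_size))   # input-to-hidden weights
    W_hh = rng.normal(scale=0.1, size=(hidden_size, hidden_size))  # hidden-to-hidden: the "loop"
    b_h = np.zeros(hidden_size)

    x = rng.normal(size=(seq_len, input_size))  # one sequence of 10 time steps
    h = np.zeros(hidden_size)                   # initial hidden state

    for t in range(seq_len):
        # The same weights are reused at every step; h carries the past forward.
        h = np.tanh(W_xh @ x[t] + W_hh @ h + b_h)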

Neural Networks
▪ Output depends on the current input only
▪ No cycles or loops in the network
▪ No memory about the past

Recurrent Neural Networks
▪ Can handle sequential data
▪ Considers the current input and also the previously received inputs
▪ Can memorize inputs due to its internal memory


WHY RNN?
Applications of RNN
1 Time Series Prediction
2 NLP: Text Classification, Sentiment Analysis, Document Summary, Question Answering
3 Machine Translation: translate the input into a different language
4 Image Captioning: caption the image by analysing the activities in it
5 Speech Recognition
PURPOSE

RNNs are well-suited for tasks where the temporal order of the data matters. This
includes applications such as time series prediction, natural language processing,
and speech recognition.

Networks with loops in them, allowing information to persist.

An unrolled recurrent neural network

Recurrent Neuron

CHALLENGES

RNNs face challenges during training, particularly the vanishing gradient problem. This occurs when the gradients of the loss function become extremely small during backpropagation, making it difficult for the network to learn and capture long-term dependencies.
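
A tiny illustrative sketch (the numbers are made up, not from the slides) of why this happens: backpropagating through many tanh steps multiplies the gradient by a factor that is usually well below 1 at every step, so the product shrinks toward zero.

    import numpy as np

    grad = 1.0
    for _ in range(50):
        z = 1.5                              # a typical pre-activation value (illustrative)
        grad *= (1 - np.tanh(z) ** 2) * 0.9  # tanh derivative times a recurrent-weight factor
    print(grad)  # effectively zero: the learning signal for early time steps has vanished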

Solution to the RNN Problem: LSTM

LSTMs were introduced to address the shortcomings of traditional RNNs, especially the vanishing gradient problem. This problem hinders the ability of RNNs to capture long-range dependencies in the data.

Long Short Term Memory networks - “LSTM”

▪ A special kind of RNN, capable of learning long-term dependencies
▪ Introduced by Hochreiter & Schmidhuber (1997)

In standard RNNs, this repeating module will have a very simple structure, such as a single tanh layer.

LSTMs also have this chain-like structure, but the repeating module has a different structure. Instead of having a single neural network layer, there are four, interacting in a very special way.
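
A minimal sketch of one step of the repeating module, assuming the standard LSTM formulation (forget, input, candidate and output layers); the NumPy names and sizes below are illustrative, not a specific library's API.

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    input_size, hidden_size = 4, 8
    rng = np.random.default_rng(0)

    # One weight matrix and bias per interacting layer: forget, input, candidate, output.
    def make_layer():
        return (rng.normal(scale=0.1, size=(hidden_size, hidden_size + input_size)),
                np.zeros(hidden_size))

    (W_f, b_f), (W_i, b_i), (W_c, b_c), (W_o, b_o) = (make_layer() for _ in range(4))

    def lstm_step(x_t, h_prev, c_prev):
        z = np.concatenate([h_prev, x_t])   # [h_{t-1}, x_t]
        f = sigmoid(W_f @ z + b_f)          # forget gate: how much of the old cell state to keep
        i = sigmoid(W_i @ z + b_i)          # input gate: how much new information to admit
        c_tilde = np.tanh(W_c @ z + b_c)    # candidate values to add to the cell state
        c = f * c_prev + i * c_tilde        # updated cell state
        o = sigmoid(W_o @ z + b_o)          # output gate: what to expose as the hidden state
        h = o * np.tanh(c)
        return h, c

    h, c = np.zeros(hidden_size), np.zeros(hidden_size)
    h, c = lstm_step(rng.normal(size=input_size), h, c)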

Notation

LSTM Breakdown
▪ Forget Gates
▪ Input Gates
▪ Hidden State
▪ Output Gates

An LSTM has three of these gates, to protect and control the cell state.

The sigmoid layer outputs numbers between zero and one, describing how much of each component should be let through. A value of zero means “let nothing through,” while a value of one means “let everything through!”
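
To make that concrete, here is a tiny sketch (arbitrary numbers, purely illustrative) of a sigmoid output acting as a soft mask on another vector:

    import numpy as np

    gate = 1.0 / (1.0 + np.exp(-np.array([-6.0, 0.0, 6.0])))  # roughly [0.00, 0.50, 1.00]
    signal = np.array([3.0, 3.0, 3.0])
    print(gate * signal)  # roughly [0.0, 1.5, 3.0]: near zero blocks, near one lets through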

Forget Gates
What is kept from previous states

Let’s go to the example of a language model trying to predict the next word based on all the previous ones. In such a problem, the cell state might include the gender of the present subject, so that the correct pronouns can be used. When we see a new subject, we want to forget the gender of the old subject.
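
In the standard formulation this step is written as f_t = σ(W_f · [h_{t-1}, x_t] + b_f): a vector of values between 0 and 1 that multiplies the previous cell state (the line f = sigmoid(W_f @ z + b_f) in the sketch above).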

Input Gates
What is added to the cell state

In the example of our language model, we’d want to add the gender of the new subject to the cell state, to replace the old one we’re forgetting.
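
In the standard formulation this is done in two parts: the input gate i_t = σ(W_i · [h_{t-1}, x_t] + b_i) decides which values to update, and a tanh layer proposes candidate values C̃_t = tanh(W_C · [h_{t-1}, x_t] + b_C) that may be written into the cell state.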

Hidden State
Carries previous information

In the case of the language model, this is where we’d actually drop the information about the old subject’s gender and add the new information, as we decided in the previous steps.
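
In the standard formulation this combined update of the cell state is C_t = f_t * C_{t-1} + i_t * C̃_t: the forget gate scales down (or drops) parts of the old state, and the input gate scales the candidate values that get added.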

Output Gates
What is reported as output

For the language model example, since it just saw a subject, it might want to output information relevant to a verb, in case that’s what is coming next. For example, it might output whether the subject is singular or plural, so that we know what form a verb should be conjugated into if that’s what follows next.
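
In the standard formulation the output gate is o_t = σ(W_o · [h_{t-1}, x_t] + b_o), and the hidden state passed to the next step (and reported as output) is h_t = o_t * tanh(C_t).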

LSTM APPLICATIONS

LSTMs find applications in various domains, particularly in natural language processing tasks such as language translation and sentiment analysis. They excel in scenarios where understanding and retaining context over a sequence are crucial.
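
As a brief usage sketch (PyTorch shown only as one common choice; the data is random and the classification task is hypothetical), an LSTM layer can be applied to a batch of sequences like this:

    import torch
    import torch.nn as nn

    lstm = nn.LSTM(input_size=16, hidden_size=32, batch_first=True)
    x = torch.randn(8, 20, 16)                 # batch of 8 sequences, 20 steps, 16 features each
    output, (h_n, c_n) = lstm(x)               # output: (8, 20, 32); h_n and c_n: (1, 8, 32)
    logits = nn.Linear(32, 2)(output[:, -1])   # e.g. classify each sequence from its final step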

CONCLUSION

In conclusion, both RNNs and LSTMs are powerful tools for processing
sequential data. RNNs maintain a cyclic structure to retain memory, but
the vanishing gradient problem limits their effectiveness over long
sequences. LSTMs address this issue with a more sophisticated
architecture, allowing them to capture and retain long-term
dependencies more effectively. LSTMs have become instrumental in
various machine learning and artificial intelligence applications, where
understanding and utilizing context over time are critical.
