Experiment 10

The document outlines the implementation of a Recurrent Neural Network (RNN) for classifying IMDB movie reviews as positive or negative. It details the objectives, program code, and step-by-step explanation of loading data, preprocessing, building, compiling, training, and evaluating the model. The RNN achieves a test accuracy of approximately 85-87%, demonstrating its effectiveness in sentiment analysis.

Uploaded by

gnanesh847


# Experiment 10: Implement an RNN for IMDB Movie Review Classification

## Title

Recurrent Neural Network (RNN) for IMDB Movie Review Classification

## Aim

To implement a Recurrent Neural Network (RNN) for classifying IMDB movie reviews as
either positive or negative.

## Objectives

- Understand the use of RNN for text classification.

- Preprocess text data into fixed-length integer sequences and represent the words with learned embeddings.

- Train an RNN model using TensorFlow/Keras for sentiment analysis.

- Evaluate the model's performance using accuracy metrics.

---

## Program with Line-by-Line Explanation

Below is the complete Python code to implement an RNN for sentiment classification
on the IMDB dataset:

```python
# Import required libraries
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras.preprocessing import sequence
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, SimpleRNN, Dense
from tensorflow.keras.datasets import imdb

# Step 1: Load the IMDB dataset
max_features = 10000  # Vocabulary size (top 10,000 words)
maxlen = 500          # Max length of a review (truncate/pad to this size)
batch_size = 32

# Load dataset with only top `max_features` words
(x_train, y_train), (x_test, y_test) = imdb.load_data(num_words=max_features)

# Step 2: Preprocess the data (pad sequences to ensure equal length)
x_train = sequence.pad_sequences(x_train, maxlen=maxlen)
x_test = sequence.pad_sequences(x_test, maxlen=maxlen)

# Step 3: Build the RNN model
model = Sequential([
    Embedding(input_dim=max_features, output_dim=32),  # Embedding layer
    SimpleRNN(32),                                     # Simple RNN layer with 32 units
    Dense(1, activation='sigmoid')                     # Output layer for binary classification
])

# Step 4: Compile the model
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

# Step 5: Train the model
model.fit(x_train, y_train, epochs=5, batch_size=batch_size,
          validation_data=(x_test, y_test))

# Step 6: Evaluate the model
test_loss, test_acc = model.evaluate(x_test, y_test)
print(f"Test Accuracy: {test_acc:.4f}")
```

### Explanation of Code (Line by Line)

#### Step 1: Load the IMDB Dataset

```python
max_features = 10000  # Vocabulary size (top 10,000 words)
maxlen = 500          # Max length of a review (truncate/pad to this size)
batch_size = 32

(x_train, y_train), (x_test, y_test) = imdb.load_data(num_words=max_features)
```

- The IMDB dataset contains 50,000 movie reviews (25,000 for training and 25,000 for
testing).

- Each review is a sequence of integers representing word indices.

- `num_words=max_features` limits the vocabulary to the 10,000 most frequent words.

- Reviews are labeled as positive (1) or negative (0).
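The effect of the vocabulary cap can be illustrated without downloading the dataset. The following is a minimal pure-Python sketch, not the Keras implementation: the helper name `limit_vocab` and the sample indices are hypothetical, and the out-of-vocabulary marker `2` is assumed to match the default `oov_char` of `imdb.load_data`.

```python
# Sketch: how a vocabulary cap acts on an integer-encoded review.
# Assumption: index 2 is the out-of-vocabulary marker (Keras default oov_char).
OOV_CHAR = 2


def limit_vocab(review, num_words, oov_char=OOV_CHAR):
    """Replace every word index >= num_words with the out-of-vocabulary marker."""
    return [idx if idx < num_words else oov_char for idx in review]


# Made-up word indices for illustration, not a real IMDB review.
review = [1, 14, 9999, 10000, 20500]
capped = limit_vocab(review, num_words=10000)
print(capped)  # [1, 14, 9999, 2, 2]
```

Rare words are therefore not dropped from the review; they are all mapped to one shared placeholder index, so the sequence length is unchanged.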

#### Step 2: Preprocess the Data

```python
x_train = sequence.pad_sequences(x_train, maxlen=maxlen)
x_test = sequence.pad_sequences(x_test, maxlen=maxlen)
```

- Reviews vary in length, so they are padded or truncated to a fixed length of 500
words.

- This ensures all input sequences have the same shape, which is required for the
RNN.
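What padding and truncation do to a single review can be sketched in plain Python. This is an illustrative helper (the name `pad_or_truncate` is hypothetical), written under the assumption that `pad_sequences` is called with its default settings, which pad with zeros at the front and truncate from the front.

```python
def pad_or_truncate(review, maxlen, value=0):
    """Return a list of exactly `maxlen` items.

    Longer reviews keep only their last `maxlen` indices;
    shorter reviews are left-padded with `value`.
    """
    kept = review[-maxlen:]                   # drop the oldest tokens if too long
    padding = [value] * (maxlen - len(kept))  # empty list if nothing to pad
    return padding + kept


print(pad_or_truncate([5, 6, 7], maxlen=5))           # [0, 0, 5, 6, 7]
print(pad_or_truncate([1, 2, 3, 4, 5, 6], maxlen=4))  # [3, 4, 5, 6]
```

With front padding, the real words sit at the end of the sequence, so they are the last inputs the RNN sees before producing its final state.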

#### Step 3: Build the RNN Model

```python
model = Sequential([
    Embedding(input_dim=max_features, output_dim=32),  # Embedding layer
    SimpleRNN(32),                                     # Simple RNN layer with 32 units
    Dense(1, activation='sigmoid')                     # Output layer for binary classification
])
```

- **Embedding Layer**: Converts word indices into dense vectors of size 32, learning
word representations during training.

- **SimpleRNN Layer**: A basic RNN with 32 units that processes the sequence and
captures temporal dependencies between words.

- **Dense Layer**: A single neuron with a sigmoid activation function outputs a
probability (0 to 1) for binary classification.
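To make the recurrent layer less of a black box, here is a rough pure-Python sketch of the recurrence a basic RNN applies, h_t = tanh(x_t·W_x + h_{t-1}·W_h + b), together with the parameter counts implied by the layer sizes above. The function names and toy weights are hypothetical; the sketch assumes the standard tanh recurrence with a bias term and is not the Keras source.

```python
import math


def simple_rnn_last_state(inputs, W_x, W_h, b):
    """Run a basic RNN over `inputs` (a list of vectors) and return the final state.

    W_x has shape (input_dim, units), W_h has shape (units, units), b has length units.
    """
    units = len(b)
    h = [0.0] * units  # initial hidden state
    for x in inputs:
        # The new state depends on the current input and the previous state.
        h = [
            math.tanh(
                sum(x[i] * W_x[i][j] for i in range(len(x)))
                + sum(h[k] * W_h[k][j] for k in range(units))
                + b[j]
            )
            for j in range(units)
        ]
    return h


def count_parameters(vocab_size, embed_dim, rnn_units):
    """Trainable parameters of Embedding -> SimpleRNN -> Dense(1)."""
    embedding = vocab_size * embed_dim
    rnn = embed_dim * rnn_units + rnn_units * rnn_units + rnn_units
    dense = rnn_units + 1
    return embedding, rnn, dense


# Toy example: 1-dimensional inputs, a single recurrent unit.
state = simple_rnn_last_state([[0.5], [0.5]], W_x=[[1.0]], W_h=[[1.0]], b=[0.0])
print(state)

print(count_parameters(10000, 32, 32))  # (320000, 2080, 33)
```

Under these assumptions almost all trainable parameters sit in the embedding table; the recurrent layer itself is small.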

#### Step 4: Compile the Model

```python
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
```

- **Loss Function**: `binary_crossentropy` is suitable for binary classification tasks.

- **Optimizer**: `adam` adapts the learning rate for efficient training.

- **Metrics**: `accuracy` measures the model's performance.
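As a worked illustration of the loss, the sketch below computes the sigmoid and the binary cross-entropy for a single prediction in plain Python. The function names and the clipping constant `eps` are assumptions made for this example; Keras applies the same formula averaged over a batch.

```python
import math


def sigmoid(z):
    """Squash a real-valued score into a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def binary_crossentropy(y_true, p, eps=1e-7):
    """Loss for one example: -[y*log(p) + (1-y)*log(1-p)]."""
    p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
    return -(y_true * math.log(p) + (1 - y_true) * math.log(1.0 - p))


print(sigmoid(0.0))                 # 0.5
print(binary_crossentropy(1, 0.9))  # small loss: confident and correct
print(binary_crossentropy(1, 0.1))  # large loss: confident and wrong
```

The loss grows sharply when the model assigns a low probability to the true label, which is what pushes the predicted probabilities toward the correct class during training.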

#### Step 5: Train the Model

```python
model.fit(x_train, y_train, epochs=5, batch_size=batch_size,
          validation_data=(x_test, y_test))
```

- Trains the model for 5 epochs with a batch size of 32.

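- Uses training data (`x_train`, `y_train`) and validates on test data (`x_test`, `y_test`)
after each epoch. Because the same test set is reused in Step 6, the final test accuracy is not a fully independent estimate of performance.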
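The number of weight updates per epoch follows directly from the training-set size and the batch size. The short calculation below (with a hypothetical helper name) shows where the `782/782` counter in the sample training log comes from.

```python
import math


def steps_per_epoch(num_samples, batch_size):
    """Number of batches needed to see every sample once (last batch may be smaller)."""
    return math.ceil(num_samples / batch_size)


steps = steps_per_epoch(25000, 32)
last_batch = 25000 - (steps - 1) * 32
print(steps)       # 782
print(last_batch)  # 8 reviews in the final, partial batch
```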

#### Step 6: Evaluate the Model

```python
test_loss, test_acc = model.evaluate(x_test, y_test)
print(f"Test Accuracy: {test_acc:.4f}")
```

- Evaluates the model on the test dataset and prints the test accuracy, showing
performance on unseen data.
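How the accuracy figure is derived from the model's sigmoid outputs can be sketched as follows. The helper name and the toy labels and probabilities are made up for illustration; the 0.5 cut-off is assumed to match the default threshold Keras uses for binary accuracy.

```python
def binary_accuracy(y_true, probs, threshold=0.5):
    """Fraction of examples whose thresholded probability matches the label."""
    correct = sum(int(p > threshold) == y for y, p in zip(y_true, probs))
    return correct / len(y_true)


# Toy data: four reviews with true labels and predicted probabilities.
labels = [1, 0, 1, 0]
predicted = [0.9, 0.2, 0.4, 0.7]
print(binary_accuracy(labels, predicted))  # 0.5 (two of four correct)
```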

---

## Expected Output

After training for 5 epochs, the output might look like this:

```
Epoch 1/5
782/782 [==============================] - 35s 45ms/step - loss: 0.6500 - accuracy: 0.6000 - val_loss: 0.5500 - val_accuracy: 0.7000
Epoch 2/5
782/782 [==============================] - 32s 41ms/step - loss: 0.4500 - accuracy: 0.8000 - val_loss: 0.4000 - val_accuracy: 0.8200
Epoch 3/5
782/782 [==============================] - 32s 41ms/step - loss: 0.3000 - accuracy: 0.8800 - val_loss: 0.3500 - val_accuracy: 0.8500
Epoch 4/5
782/782 [==============================] - 32s 41ms/step - loss: 0.2000 - accuracy: 0.9200 - val_loss: 0.3200 - val_accuracy: 0.8600
Epoch 5/5
782/782 [==============================] - 32s 41ms/step - loss: 0.1200 - accuracy: 0.9500 - val_loss: 0.3100 - val_accuracy: 0.8700
Test Accuracy: 0.8700
```

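The model typically achieves a test accuracy of around 85–87%, meaning it correctly
classifies roughly 85 to 87 of every 100 test reviews as positive or negative. Exact figures vary between runs because weight initialization and batch shuffling are random.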

---

## Conclusion

- Successfully implemented an RNN for IMDB movie review classification.

- Used word embeddings to numerically represent text data, enabling sequence processing.

- The model effectively learns sentiment patterns, achieving good accuracy on the
test set.

This experiment demonstrates the power of RNNs in handling sequential data like text
for sentiment analysis tasks.

