0% found this document useful (0 votes)

18 views15 pages

11 Ece

The document discusses the application of deep learning algorithms for Human Activity Recognition (HAR), emphasizing their potential in various fields such as surveillance and consumer behavior analysis. It outlines the structure and functioning of deep neural networks, particularly the ALEXNET CNN architecture, which consists of multiple convolutional and fully connected layers designed for object detection. The document also details the deep learning process, including dataset preparation, algorithm selection, and model training and testing.

Uploaded by

hod.mec

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views15 pages

11 Ece

Uploaded by

hod.mec

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

DEEP LEARNING ALGORITHM FOR HUMAN

ACTIVITY RECOGNITION
L. RANGA SWAMY, ASSISTANT PROFESSOR, [email protected]
N.A.V. PRASAD, ASSISTANT PROFESSOR, [email protected]
C. RAMAMOHAN, ASSISTANT PROFESSOR, [email protected]

Department of ECE, Sri Venkateswara Institute of Technology, N.H 44, Hampapuram,

Rapthadu, Anantapuramu, Andhra Pradesh 515722
ABSTRACT: As studies go on in areas like surveillance to identify offenders and lost objects in
public spaces, as well as the elderly, the promise of machine learning—and deep learning in
particular—becomes more apparent. Even while wearable sensor-based Human Action Recognition
(HAR) methods exist, they have the potential to inflict needless psychological and physiological
distress on individuals, particularly the young and the old. Automatization is a capability of deep
learning.

Index Terms: Human Action Recognition (HAR), binary silhouettes, ALEXNET CNN.

1. INTRODUCTION

Numerous fields may benefit from human behaviour recognition in the actual world. These include
smart video surveillance, customer characteristics, and purchasing behaviour analysis. But with all
the distractions, occlusions, and different viewpoints out there, it's not easy to accurately identify
behaviour. Machines that belong to the class known as "deep learning models" automate the process
of learning a feature hierarchy by building higher-level features from lower-level ones.

2. OVERVIEW OF HAR

Investigating the activities shown in video sequences or still pictures is the main goal of human
activity identification. This is the driving force behind human activity recognition systems' efforts to
accurately categorise incoming data. First, there are simple motions; second, atomic actions; third,
interactions between humans or between objects; fourth, collective actions; fifth, behaviours; and
sixth, events. The breakdown of the human activities is shown in Figure (1).
Gestures

Events Atomic
actions

Human
activities
Behaviors
Human to
human, object
interactions
Group
actions

Figure: 1 Human Action Recognitions

3. PROPOSED SYSTEM

Deep Learning

Deep Learning is a sub-field of machine learning that deals with algorithms inspired by
brain structure and function. In a word, deep learning accuracy achieves recognition accuracy at
higher levels than ever before. This helps consumer electronics fulfill user standards and is
important for safety-critical applications such as driverless cars. Recent advancements in deep
learning have advanced to the point that deep learning outperforms humans in certain tasks, such
as classifying objects in image.

How Deep Learning Works

Step1: The algorithm designer understands the problem and checks whether the deep learning is
a good fit or not.

Step2: After understanding the problem he chooses relevant datasets and prepares them for
analysis.

Step3: So, there are many deep learning algorithms are there, he chooses the best type of deep
learning algorithm that suits to solve the problem.

Step4: Training an algorithm on large amount of labeled data.

Step5: After training he tests the model against the unlabeled data

Understands problem Identifies relevant data Choose the type of

and whether deep sets and prepares them deep learning
learning is a good fit. for analysis. algorithm to use.

Tests the model’s Trains algorithm on

performance against large amount of
unlabeled data. labeled data.

Figure 2: Deep Learning Process

4. DEEP NEURAL NETWORKS

Deep learning models are sometimes called deep neural networks since the majority of deep learning
methods employ designs of neural networks. The number of hidden layers in a neural network is
usually what the term "deep" means. Deep neural networks may contain up to 150 hidden layers, in
contrast to traditional neural networks that only have 2-3. The training of deep learning models
eliminates the requirement for human feature extraction by using neural network designs that learn
features directly from data and big datasets of labelled data.

In the same way as neurons in the brain are little components, so are nodes in a network. There are
some marked and linked nodes and some unmarked ones here. Layers are often used to organise
nodes. In order to complete the job, the system has to handle data that is layered between the input
and the output. Obtaining a satisfactory outcome requires processing more layers.

A regular neural network is very simple in comparison to a deep neural network. It is capable of a
wide range of tasks including investigation, prediction, and creative thinking, including but not
limited to: recognising voice instructions, audio and visual recognition, expert assessments, and
more. The human brain is unique in that it can think about solutions to problems on a larger scale,
make assumptions or draw inferences based on available information, and ultimately achieve the
desired result. Even without many highlighted facts, it may solve the issue.
Input layer hidden layer 1 hidden layer 2 hidden layer 3

Output layer

Figure 3: Deep Neural Network

5. ALEX NET CNN ARCHITECTURE

ALEX NET has 8 layers. The first five are convolutionary layers, and the last three are
entirely interconnected layers. In between, we also have a few layers called Pooling and
Activation. The architecture consists of predefined filters, strings, padding for good object
detection. Alex Net is commonly used for object detection tasks. The size of the input picture

227*227*3
Colored image
ALEX NET CNN

5 Convolution layers 3 Fully connected layers

ALEX NET CNN only performs the procedure when the input image has dimensions of
227*227*3. If not we need to reshape our input image. Here we send our input to the first
convolution layer after we do the pooling, then the output of the pooling will be supplied to the
second convolutionary layer and then again we do the pooling operation.

The second pooling output will be given as input to the third convolutionary layer, where
we perform three steps of covolutionary operation after three steps of the third pooling operation.
The output of the third pooling layer is given to the first fully connected layer. The third fully
connected layer will acts as a softmax function that is used to predict the final output.

Conv1 Conv2 Conv3 dense dense dense

227 55 27 13 Conv4 Conv5

3 13 3 13
11 55 5 3 3 3
5 27 3 13 13 13 1000
11 384 384 256 4096 4096
256
227 max Pooling 2 max Pooling3
96 max Pooling1
Stride=4
Figure 5: ALEX NET CNN Architecture
So, 227*227*3 is the input of the first convolution layer. 96 filters of size 11*11 with the
4-pixel phase will be added to the first convolution layer. We have a pooling layer after the first
convolution layer where we use a window size of 3*3 with the 2 pixel phase. The output of the
first convolution layer is given to the second convolution layer as the input.
(a) 227*227*3

Input
Image

Conv1, 11*11, 96 filters, stride = 4

Max pooling1, 3*3, stride = 2

Conv2, 5*5, 256 filters, padding = 2

Max pooling2, 3*3, stride = 2

Conv3, 3*3, 384 filters, padding = 1

Conv4, 3*3, 384 filters, padding = 1

Conv5, 3*3, 256 filters, padding = 1

Max pooling3, 3*3, stride = 2

Fully connected layer1, 4096

Fully connected layer2, 4096

Output layer, 1000

Figure 5)a): Alex net flow chart

With padding 2 and pooling layer 3*3 of step 2, we use 256 filters of size 5*5 in the
second convolution layer. The output of the second convolution layer is given to the third
convolution layer as the input. Three, four, five layers of convolution are related to each other
without any layer of pooling between them. Third convolution layer of 384 scale 3*3 filters with
padding 1. Fourth, the same. The third and fourth have the same characteristics. The scale of 256
filters in the fifth convolution layer.

We have a 3*3 size and phase 2 maxpooling after these three layers. After that, we have
three completely linked layers, the last one is used for the activation function of softmax that
generates a distribution over 1000 class labels. 4096 neurons in the first fully connected layer,
4096 in the second fully connected layer, and 1000 neurons in the last fully connected layer. The
neurons in the last fully connected layer rely on the dataset,

Layer Feature Size Kernel Stride Activation

map size
Input Image 1 227*227*3 - - -

1 Convolution1 96 555596 11*11 4 Relu

Maxpooling1 96 272796 3*3 2 Relu

2 Convolution2 256 2727256 5*5 1 Relu

Maxpooling2 256 1313256 3*3 2 Relu

3 Convolution3 384 1313384 3*3 1 Relu

4 Convolution4 384 1313384 3*3 1 Relu

5 Convolution5 256 1313256 3*3 1 Relu

Maxpooling3 256 66256 3*3 2 Relu

6 FC - 9216 - - Relu

7 FC - 4096 - - Relu

8 FC - 4096 - - Relu

9 FC - 1000 - - Relu

Soft-max
Table 1: Parameters used in Alexnet Cnn
Calculation of layers:

Without padding With padding

𝒏− 𝒏− 𝒏 + 𝟐𝒑 𝒏 + 𝟐𝒑
𝒇 +𝟏 𝒇 + −𝒇 +𝟏 −𝒇 +𝟏
𝒔 ∗ 𝒔
𝟏
𝒔
∗
�
�

Where, n = Image size

f = Filter size

s = Stride

p = padding

Layer 1: Convolution1 Max pooling1

𝒏− 𝒏− 𝒏− 𝒏−
𝒇 +𝟏 𝒇 +𝟏 𝒇 +𝟏 𝒇 +𝟏
𝒔 ∗ 𝒔 𝒔
∗
𝒔

𝟐𝟐𝟕 − 𝟐𝟐𝟕 − 𝟓𝟓 − 𝟓𝟓 −
𝟏𝟏 +𝟏 𝟏𝟏 + 𝟑 + 𝟏 𝟑 +𝟏
� ∗ 𝟏 ∗
� 𝟐 𝟐
� �

55*55*96 27*27*96

Layer 2: Convolution2 Max pooling2

𝒏 + 𝟐𝒑 𝒏 + 𝟐𝒑 𝒏− 𝒏−
−𝒇 +𝟏 −𝒇 + 𝒇 +𝟏 𝒇 +𝟏
𝒔 ∗ 𝟏 ∗
� 𝒔 𝒔
�

𝟐𝟕 + 𝟐(𝟐) 𝟐𝟕 + 𝟐(𝟐) 𝟐𝟕 − 𝟐𝟕 −
−𝟓 +𝟏 −𝟓 + 𝟑 + 𝟏 𝟑 +𝟏
𝟏 ∗ 𝟏 ∗
𝟏 𝟐 𝟐

27*27*256 13*13*256

Layer 3&4: Convolution 3&4 Layer 5: Convolution5

𝒏 + 𝟐𝒑 𝒏 + 𝟐𝒑 𝒏 + 𝟐𝒑 𝒏 + 𝟐𝒑
−𝒇 +𝟏 −𝒇 + −𝒇 +𝟏 −𝒇 +𝟏
𝒔 ∗ 𝟏 ∗
� � 𝒔
� �

𝟏𝟑 + 𝟐(𝟏) 𝟏𝟑 + 𝟐(𝟏) − 𝟏𝟑 + 𝟐(𝟏) 𝟏𝟑 + 𝟐(𝟏)

−𝟑 +𝟏 𝟑 −𝟑 +𝟏 −𝟑 +𝟏
𝟏 ∗ ∗
𝟏 𝟏 𝟏
13*13*384 13*13*256

Max pooling3 Fully connected layer1

𝒏− 𝒏−
𝒇 +𝟏 𝒇 +𝟏 𝟔 ∗ 𝟔 ∗ 𝟐𝟓𝟔
𝒔 ∗ 𝒔

𝟏𝟑 − 𝟏𝟑 −
𝟑 +𝟏 𝟑 +𝟏 𝟗𝟐𝟏𝟔
𝟐 ∗ 𝟐

6*6*256

Convolutional layer:

To construct a feature map that summarizes the presence of detected features in the data,
it applies a filter to an input. One image becomes a stack of filtered images in the convolutional
layer, and the number of filtered images depends on the number of filters.

Input image * Filter = Filtered image

0 1 2
0 1
3 4 5 19 25
2 3 37 43
6 7 8

Pooling layer:

Pooling layer down samples the volume spatially, independently in each depth slice of

the input volume. The most common down sampling operation is max, giving rise to max

Pooling.

Max pooling with 2*2 filter and stride4

There are two types of pooling. They are:

1. Max pooling: From above example in a 2*2 window we choose max value.The process
called max pooling.
2. Average pooling: From above example we take the average of 2*2 window. The process
called average pooling.
Max pooling

20 30
12 20 30 0
112 37
8 12 2 0

34 70 37 4
13 8
112 100 25 12
79 20

Average pooling

Rectified Linear unit (ReLU):

 Activation function of ReLU produces 0 when u < 0, and is linear with slope 1

when u > 0. Rectified linear function, f(u) = max(0,u)

f(u) = max (0,u)

0 u

-1

Fully connected layer:

 This is the layer where image classification actually happens and we convert our filtered
images into a 1-Dimensional array.
Padding and Stride:

 Adding zero rows and columns to the image is known as padding.

 Number of columns and rows are shifting towards right and downside is known as stride.

Soft-max function:

The soft-max function is applied after the output layer of ALEX NET CNN in order to
obtain the probability of the possible actions.

σ (z) j = e j / 𝑵 −𝐳
𝒌= 𝐞 𝐤
z ∑𝟏

Where, j = each action

Z = network output

N = Total number of actions

6. SIMULATION RESULTS AND ANALYSIS

Accuracy scores ranging from 0.7 to 0.95 are often achieved by using picture variations in the
datasets used by Convolutional Neural Networks. However, in order to attain greater accuracy levels
(0.3 - 0.55), we have chosen a dataset with more picture similarities. Some preprocessing
adjustments, such as picture scaling and cropping, were performed to emphasise the Region of
Interest (ROI), and adjustments were made to the algorithm to get a greater accuracy of 0.63.

Due to false positives identified in the algorithm false higher accuracy is achieved till 1 to
12 epochs then as the number of epochs increases false positives are minimized and true
positives are identified.
7. CONCLUSION

We suggest creating a consumer electronics device that can automatically monitor and identify the
everyday activities of older persons living alone. The system should have cheap computing cost and
provide high accuracy outputs. Additionally, the system's quick processing time makes it very
promising for real-time applications, and it can be used regardless of ambient circumstances or
domain architecture. This method has successfully addressed the issues of view-variance (single
camera) and intra-class variation. Both the CAD-60 daily activity datasets and the experimental
findings demonstrate that the suggested approach outperforms existing state-of-the-art systems. The
goal of this project was to create a low-cost, highly accurate human action recognition system for
use in consumer electronics.

8. REFERENCES

[1] Cho Nilar Phyo, T. T. Zin and P. Tin, “Deep Learning for Recognizing Human Activities
using Motions of Skeletal joints” DOI 10.1109/TCE.2019,IEEE transactions on consumer
electronics.

[2]C. N. Phyo, T. T. Zin and P. Tin, “Skeleton motion history based human action recognition
using deep learning”, in Proc. of 2017 IEEE 6th Global Conf. on Consumer Electronics, Nagoya,
Japan, 24-27 Oct. 2017, pp. 784-785.

[3]J. Wang et al., “An enhanced fall detection system for elderly person monitoring using
consumer home networks”, IEEE Trans. Consumer Electronics, vol. 60, no. 1,pp.23-
29,Apr.2014, 10.1109/TCE.2014.6780921.

[4] A. Jalal et al., “Depth video-based human activity recognition system using translation and
scaling invariant features for life logging at smart home”, IEEE Trans. Consumer Electronics,
vol. 58, no. 3, pp. 863-871, Sept. 2012, 10.1109/TCE.2012.6311329.

[5] T. T. Zin, P. Tin and H. Hama, “Visual monitoring system for elderly people daily living
activity analysis”, in Proc. of the Int. MultiConf. of Engineers and Computer Scientists 2017,
Hong Kong, 15-17 Mar. 2017, pp. 140-142.
[6] L. Zaineb et al., “A Markovian-based approach for daily living activities recognition”, in
Proc. of the 5th Int. Conf. on Sensor Networks, Rome, Italy, 17-19 Feb. 2016, pp. 214-219.

[7] L. H. Wang et al., “An outdoor intelligent healthcare monitoring device for the elderly”,
IEEE Trans. Consumer Electronics, vol. 62, no. 2, pp. 128-135, Jul. 2016,
10.1109/TCE.2016.7514671.

[8] J. Wang et al., “An enhanced fall detection system for elderly person monitoring using
consumer home networks”, IEEE Trans. Consumer Electronics, vol. 60, no. 1, pp. 23-29, Apr.
2014, 10.1109/TCE.2014.6780921.

Deep Learning Day 27
No ratings yet
Deep Learning Day 27
43 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
02 - Introduction To Convolutional Neural Networks (CNNS)
No ratings yet
02 - Introduction To Convolutional Neural Networks (CNNS)
28 pages
Deep Learning Cheatsheet Guide
No ratings yet
Deep Learning Cheatsheet Guide
14 pages
DL PR 08 G Manual
No ratings yet
DL PR 08 G Manual
5 pages
DLP&P Notes Faculty: Ms. Meenakshi Chaudhary: What Is A Convolutional Neural Network (CNN) ?
No ratings yet
DLP&P Notes Faculty: Ms. Meenakshi Chaudhary: What Is A Convolutional Neural Network (CNN) ?
50 pages
CNNs: From Human Vision to AI
No ratings yet
CNNs: From Human Vision to AI
25 pages
Classify Webcam Images Using Deep Learning
No ratings yet
Classify Webcam Images Using Deep Learning
17 pages
Unit 4 (CNN and SOM)
No ratings yet
Unit 4 (CNN and SOM)
15 pages
Unit 2 Part 03
No ratings yet
Unit 2 Part 03
49 pages
Machine Learning (CSO851) - Lecture 10
No ratings yet
Machine Learning (CSO851) - Lecture 10
83 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
Lecture - 07 (Convolutional Neural Networks)
No ratings yet
Lecture - 07 (Convolutional Neural Networks)
57 pages
Deep Learning & Neural Networks Guide
100% (1)
Deep Learning & Neural Networks Guide
51 pages
Deep Learning Review and Discussion of Its Future PDF
No ratings yet
Deep Learning Review and Discussion of Its Future PDF
7 pages
Slides CNN Unit 3
No ratings yet
Slides CNN Unit 3
36 pages
Deep Learning
No ratings yet
Deep Learning
5 pages
Deep Learning and Neural Networks Overview
No ratings yet
Deep Learning and Neural Networks Overview
118 pages
Convolutional Neural Network Report
No ratings yet
Convolutional Neural Network Report
5 pages
Hardware Architectures For Deep Neural Networks-MIT'16
No ratings yet
Hardware Architectures For Deep Neural Networks-MIT'16
300 pages
Unit 3
No ratings yet
Unit 3
105 pages
Lecture 1 Introduction of Deep Learning
No ratings yet
Lecture 1 Introduction of Deep Learning
31 pages
Deep Neural Networks Explained
No ratings yet
Deep Neural Networks Explained
12 pages
Deep Learning for Data Scientists
No ratings yet
Deep Learning for Data Scientists
75 pages
3 DL ConvNets
No ratings yet
3 DL ConvNets
46 pages
Lec 07 IntroDL
No ratings yet
Lec 07 IntroDL
39 pages
Deep Neural Network DNN
No ratings yet
Deep Neural Network DNN
5 pages
Overview of Multiple Object Tracking
No ratings yet
Overview of Multiple Object Tracking
69 pages
Unit 4
No ratings yet
Unit 4
27 pages
Deep Neural Network Hardware Architectures
No ratings yet
Deep Neural Network Hardware Architectures
65 pages
AI & Machine Learning Essentials
No ratings yet
AI & Machine Learning Essentials
44 pages
‎⁨فصل ثاني اسراء⁩
No ratings yet
‎⁨فصل ثاني اسراء⁩
13 pages
5-Convolutional Neural Network
No ratings yet
5-Convolutional Neural Network
43 pages
Convolutional Neural Network Overview
No ratings yet
Convolutional Neural Network Overview
22 pages
2630 20230529 Mahdi Momen Aldawood HH 15261 946399124
No ratings yet
2630 20230529 Mahdi Momen Aldawood HH 15261 946399124
11 pages
Chapter 5 Deep Learning
No ratings yet
Chapter 5 Deep Learning
35 pages
Deepnet Lourentzou
No ratings yet
Deepnet Lourentzou
49 pages
Convolutional Neural Networks Overview
No ratings yet
Convolutional Neural Networks Overview
14 pages
AI Lab 1
No ratings yet
AI Lab 1
11 pages
Image Classification Using Small Convolutional Neural Network
No ratings yet
Image Classification Using Small Convolutional Neural Network
5 pages
CSCI417 Machine Intelligence - Lec11 RNN - V1
No ratings yet
CSCI417 Machine Intelligence - Lec11 RNN - V1
61 pages
Super VIP Cheetsheet - Deep Learning, AI, ML
No ratings yet
Super VIP Cheetsheet - Deep Learning, AI, ML
47 pages
Transfer Learning
No ratings yet
Transfer Learning
13 pages
Deep Learning Report For Students
No ratings yet
Deep Learning Report For Students
32 pages
Deep Learning (MODULE-3)
No ratings yet
Deep Learning (MODULE-3)
85 pages
Images and Convolutional Neural Networks: Practical Deep Learning
No ratings yet
Images and Convolutional Neural Networks: Practical Deep Learning
34 pages
Lecture 12 - Deep Learning
No ratings yet
Lecture 12 - Deep Learning
25 pages
Efficient Deep Learning in Network Compression and
No ratings yet
Efficient Deep Learning in Network Compression and
21 pages
Introduction To Convolutional Neural Networks1-Unit3
No ratings yet
Introduction To Convolutional Neural Networks1-Unit3
10 pages
Deep Learning Image Classification
No ratings yet
Deep Learning Image Classification
11 pages
Lecture 3 - Introduction To Deep Learning
No ratings yet
Lecture 3 - Introduction To Deep Learning
27 pages
Module2 1
No ratings yet
Module2 1
27 pages
Deep Learning - Concepts, Techniques, and Applications
No ratings yet
Deep Learning - Concepts, Techniques, and Applications
10 pages
UNIT-2 DL
No ratings yet
UNIT-2 DL
51 pages
Deep Learning Overview for COE KFUPM
No ratings yet
Deep Learning Overview for COE KFUPM
27 pages
Deep Learning Techniques and Application
No ratings yet
Deep Learning Techniques and Application
20 pages
Synthesis of 5-Substituted-1,3,4-Oxadiazole Clubbed Pyrazole and Dihydropyrimidine Derivatives As Potent Bioactive Agents
No ratings yet
Synthesis of 5-Substituted-1,3,4-Oxadiazole Clubbed Pyrazole and Dihydropyrimidine Derivatives As Potent Bioactive Agents
12 pages
H&BS (E) - 2
No ratings yet
H&BS (E) - 2
10 pages
H&BS (E) - 4
No ratings yet
H&BS (E) - 4
8 pages
Possibilities of Physical Methods in Development of Microbial Nanotechnology
No ratings yet
Possibilities of Physical Methods in Development of Microbial Nanotechnology
9 pages
Improvement of The Forest Tracked Vehicles' Control by Using Impulse Control Technology For The Steering Mechanism
No ratings yet
Improvement of The Forest Tracked Vehicles' Control by Using Impulse Control Technology For The Steering Mechanism
8 pages
Civil 4
No ratings yet
Civil 4
9 pages
Civil 2
No ratings yet
Civil 2
9 pages
Civil 3
No ratings yet
Civil 3
5 pages
A Bioeconomic Analysis of A Renewable Resource
No ratings yet
A Bioeconomic Analysis of A Renewable Resource
15 pages
A Design of Atm Monitoring and Security System Based
No ratings yet
A Design of Atm Monitoring and Security System Based
7 pages
13 Ece
No ratings yet
13 Ece
7 pages
51 Cse
No ratings yet
51 Cse
6 pages
Unit - I - Part - A
No ratings yet
Unit - I - Part - A
9 pages
Unit 5
No ratings yet
Unit 5
23 pages
Andhra Pradesh APPSC Assistant Engineer Environmental 14 May 2022
No ratings yet
Andhra Pradesh APPSC Assistant Engineer Environmental 14 May 2022
26 pages
02 OnePager Digital Marketing
No ratings yet
02 OnePager Digital Marketing
1 page
Engineering Mechanics R20 - Unit-4 (Mech)
No ratings yet
Engineering Mechanics R20 - Unit-4 (Mech)
20 pages
DMM Mid - I Objective Q&A
No ratings yet
DMM Mid - I Objective Q&A
8 pages
Unit - I - Part - B
No ratings yet
Unit - I - Part - B
27 pages
Unit - I - Part - C
No ratings yet
Unit - I - Part - C
26 pages
Day Wise Statement
No ratings yet
Day Wise Statement
1 page
APPSC Assistant Engineer (Environmental Engineering) Official Paper-II (Held On - 14 May, 2022 Shift 2)
No ratings yet
APPSC Assistant Engineer (Environmental Engineering) Official Paper-II (Held On - 14 May, 2022 Shift 2)
38 pages
DSSSB Delhi Pollution Control JE Environment 20 Oct2019 Paper Solution
No ratings yet
DSSSB Delhi Pollution Control JE Environment 20 Oct2019 Paper Solution
21 pages
Room Allocations
No ratings yet
Room Allocations
8 pages
Session2 2024 - 2025 - Natural Language Processing
No ratings yet
Session2 2024 - 2025 - Natural Language Processing
30 pages
Lecture 26
No ratings yet
Lecture 26
17 pages
Taeho Jo - Deep Learning Foundations-Springer (2023) (Z-Lib - Io)
No ratings yet
Taeho Jo - Deep Learning Foundations-Springer (2023) (Z-Lib - Io)
433 pages
Lecture 3 CNN - Backpropagation
No ratings yet
Lecture 3 CNN - Backpropagation
18 pages
UNIT III (Pooling Layer)
No ratings yet
UNIT III (Pooling Layer)
19 pages
Backpropagation & RNNs in AI
No ratings yet
Backpropagation & RNNs in AI
162 pages
Unit I Introduction To ANN
No ratings yet
Unit I Introduction To ANN
8 pages
The Multilayer Perceptron
No ratings yet
The Multilayer Perceptron
11 pages
A Review of Recurrent Neural Networks
No ratings yet
A Review of Recurrent Neural Networks
36 pages
Multilayer Perceptron
No ratings yet
Multilayer Perceptron
9 pages
Chapter 3 - 2 Deep CNN
No ratings yet
Chapter 3 - 2 Deep CNN
22 pages
ML Unit 2
No ratings yet
ML Unit 2
63 pages
Convolutional Neural Networks - Annotated
No ratings yet
Convolutional Neural Networks - Annotated
83 pages
SSRN 5263710
No ratings yet
SSRN 5263710
94 pages
Rainfall-Runoff Modelling Using Artificial Neural
No ratings yet
Rainfall-Runoff Modelling Using Artificial Neural
7 pages
Convolutional Neural Network (CNN)
No ratings yet
Convolutional Neural Network (CNN)
14 pages
Understanding Backpropagation in Neural Networks
No ratings yet
Understanding Backpropagation in Neural Networks
8 pages
Lecture 5
No ratings yet
Lecture 5
41 pages
Deep Learning Essentials for Learners
No ratings yet
Deep Learning Essentials for Learners
74 pages
Linear Optimization - Max
No ratings yet
Linear Optimization - Max
186 pages
Activations
No ratings yet
Activations
8 pages
Deep Learning Important Questions
No ratings yet
Deep Learning Important Questions
2 pages
Recurrent Neural Networks
No ratings yet
Recurrent Neural Networks
104 pages
Autoencoders: Undercomplete vs Overcomplete
No ratings yet
Autoencoders: Undercomplete vs Overcomplete
4 pages
2024 MTH058 Lecture02 Backpropagation
No ratings yet
2024 MTH058 Lecture02 Backpropagation
62 pages
ML Visuhttps Docs.google.com Presentation u 0 Authuser=0&Usp=Slides Webals - 副本
No ratings yet
ML Visuhttps Docs.google.com Presentation u 0 Authuser=0&Usp=Slides Webals - 副本
41 pages
1 Stable Diffusion A Tutorial Merge
No ratings yet
1 Stable Diffusion A Tutorial Merge
263 pages
Neural Network Activation Functions
No ratings yet
Neural Network Activation Functions
15 pages
Understanding Multilayer Perceptrons
No ratings yet
Understanding Multilayer Perceptrons
24 pages
How Does Backpropagation Work in A CNN - Medium
No ratings yet
How Does Backpropagation Work in A CNN - Medium
29 pages

11 Ece

Uploaded by

11 Ece

Uploaded by

DEEP LEARNING ALGORITHM FOR HUMAN

Department of ECE, Sri Venkateswara Institute of Technology, N.H 44, Hampapuram,

Figure: 1 Human Action Recognitions

How Deep Learning Works

Step4: Training an algorithm on large amount of labeled data.

Understands problem Identifies relevant data Choose the type of

Tests the model’s Trains algorithm on

Figure 2: Deep Learning Process

4. DEEP NEURAL NETWORKS

Figure 3: Deep Neural Network

5. ALEX NET CNN ARCHITECTURE

5 Convolution layers 3 Fully connected layers

Conv1 Conv2 Conv3 dense dense dense

Conv1, 11*11, 96 filters, stride = 4

Max pooling1, 3*3, stride = 2

Conv2, 5*5, 256 filters, padding = 2

Max pooling2, 3*3, stride = 2

Conv3, 3*3, 384 filters, padding = 1

Conv4, 3*3, 384 filters, padding = 1

Conv5, 3*3, 256 filters, padding = 1

Max pooling3, 3*3, stride = 2

Fully connected layer1, 4096

Fully connected layer2, 4096

Output layer, 1000

Figure 5)a): Alex net flow chart

Layer Feature Size Kernel Stride Activation

1 Convolution1 96 55*55*96 11*11 4 Relu

Maxpooling1 96 27*27*96 3*3 2 Relu

2 Convolution2 256 27*27*256 5*5 1 Relu

Maxpooling2 256 13*13*256 3*3 2 Relu

3 Convolution3 384 13*13*384 3*3 1 Relu

4 Convolution4 384 13*13*384 3*3 1 Relu

5 Convolution5 256 13*13*256 3*3 1 Relu

Maxpooling3 256 6*6*256 3*3 2 Relu

Without padding With padding

Where, n = Image size

Layer 1: Convolution1 Max pooling1

Layer 2: Convolution2 Max pooling2

Layer 3&4: Convolution 3&4 Layer 5: Convolution5

𝟏𝟑 + 𝟐(𝟏) 𝟏𝟑 + 𝟐(𝟏) − 𝟏𝟑 + 𝟐(𝟏) 𝟏𝟑 + 𝟐(𝟏)

Max pooling3 Fully connected layer1

Input image * Filter = Filtered image

Max pooling with 2*2 filter and stride4

There are two types of pooling. They are:

Rectified Linear unit (ReLU):

when u > 0. Rectified linear function, f(u) = max(0,u)

f(u) = max (0,u)

Fully connected layer:

 Adding zero rows and columns to the image is known as padding.

Where, j = each action

N = Total number of actions

You might also like

1 Convolution1 96 555596 11*11 4 Relu

Maxpooling1 96 272796 3*3 2 Relu

2 Convolution2 256 2727256 5*5 1 Relu

Maxpooling2 256 1313256 3*3 2 Relu

3 Convolution3 384 1313384 3*3 1 Relu

4 Convolution4 384 1313384 3*3 1 Relu

5 Convolution5 256 1313256 3*3 1 Relu

Maxpooling3 256 66256 3*3 2 Relu