Artificial Intelligence
Introduction to Neural Networks and Deep Learning Frameworks
Copyright Intellipaat. All rights reserved.
Agenda
01 Topology of Neural Networks
02 Perceptrons
03 Activation Functions and Their Types
04 Perceptron Training Algorithm
05 Deep Learning Frameworks
06 What Are Tensors?
07 Computational Graph
08 Program Elements in TensorFlow
Topology of a Neural Network
Typically, artificial neural networks have a layered structure. The input layer picks up the input signals and passes them on to the next layer, also known as the 'hidden' layer (there may be more than one hidden layer in a neural network). Last comes the output layer, which delivers the result.
[Diagram: Input layer → Hidden layer → Output layer]
Well, everyone has heard about AI, but how many of you know that the inspiration behind artificial neural networks came from the biological neurons that are found within human brains?
Let us first understand the architecture of our biological neurons, which is very similar to that of artificial neurons.
Neurons: How Do They Work?
A neural network is a computer simulation of the way biological neurons work within a human brain
Dendrites: These branch-like structures extending away from the cell body receive messages from other neurons and allow them to travel to the cell body
Cell Body: It contains the nucleus, smooth and rough endoplasmic reticulum, Golgi apparatus, mitochondria, and other cellular components
Axon: An axon carries an electrical impulse from the cell body to another neuron
Now, let us understand artificial neurons in detail!
Artificial Neurons
▪ The most fundamental unit of a deep neural network is called an artificial neuron
▪ It takes an input, processes it, passes it through an activation function, and returns the output
▪ Such artificial neurons are called perceptrons
▪ A perceptron is a linear model used for binary classification
Schematic Representation of a Neuron in a Neural Network
Perceptron: How Does It Work?
▪ The three arrows correspond to the three inputs coming into the network
▪ The values [0.7, 0.6, 1.4] are the weights assigned to the corresponding inputs
▪ Each input is multiplied by its respective weight, and the sum is taken
▪ Consider the three inputs as x1, x2, and x3
▪ Let the three weights be w1, w2, and w3
$\text{Sum} = x_1 w_1 + x_2 w_2 + x_3 w_3$
$\text{Sum} = 0.7x_1 + 0.6x_2 + 1.4x_3$
▪ An offset is added to this sum. This offset is called the bias
▪ It is just a constant number, say 1, which is added for scaling purposes
$\text{New\_Sum} = 0.7x_1 + 0.6x_2 + 1.4x_3 + \text{bias}$
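As a quick sketch in Python: the weights below come from the slide, while the input values and the bias are assumed for illustration.

x = [0.5, 0.3, 0.2]                      # example inputs x1, x2, x3 (assumed)
w = [0.7, 0.6, 1.4]                      # weights w1, w2, w3 from the slide
bias = 1                                 # constant offset

# multiply each input by its weight, sum up, then add the bias
new_sum = sum(xi * wi for xi, wi in zip(x, w)) + bias
print(new_sum)                           # 0.35 + 0.18 + 0.28 + 1 = 1.81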
Why Do We Need Weights?
▪ Statistically, weights determine the relative importance of each input
▪ Mathematically, they are just the slope of the line
Why Do We Need Weights?
Will it rain if I wear a blue shirt?
[Diagram: inputs Humidity (x1) and Blue shirt (x2), with weights w1 and w2, feed a perceptron whose output answers "Will it rain?" (0/1)]
w2 is assigned a lower value because the significance of the input 'blue shirt' is less than that of 'humidity'
Why Do We Need Activation Functions?
We have two classes. One set is represented with triangles and the other with circles
Why Do We Need Activation Functions?
Draw me a linear decision boundary which can separate these two classes
Why Do We Need Activation Functions?
We will have to add a third dimension to create a linearly separable model, which is easier to deal with
Activation Functions
▪ They are used to convert the input signal of a node in an artificial neural network to an output signal
▪ That output signal is then used as an input in the next layer of the stack
▪ Activation functions introduce non-linear properties to our network
▪ A neural network without an activation function is essentially just a linear regression model
▪ The activation function applies a non-linear transformation to the input, making the network capable of learning and performing more complex tasks
Types of Activation Functions
Identity
Binary Step
Sigmoid
Tanh
ReLU
Leaky ReLU
Softmax
Identity Function
• A straight-line function where the activation is proportional to the input
• No matter how many layers we have, if all of them are linear in nature, the final activation of the last layer will be nothing but a linear function of the input of the first layer
• We use a linear function to solve a linear regression problem
• Range: $(-\infty, \infty)$
$f(x) = x$
Binary Step Function
• It is also known as the Heaviside step function or the unit step function; usually denoted by H or θ, it is a discontinuous function
• Its value is 0 for a negative argument and 1 for a positive argument
• Where it switches depends on the threshold value we define
• We use the binary step function to solve a binary classification problem
• Range: {0, 1}
$f(x) = \begin{cases} 0 & \text{for } x < 0 \\ 1 & \text{for } x \ge 0 \end{cases}$
Sigmoid Function
• The sigmoid function is an activation function that smoothly squashes values into the range between 0 and 1
• When we apply it to the weighted sum in place of x, the values are scaled in between 0 and 1
• Large negative numbers are scaled toward 0, and large positive numbers are scaled toward 1
• Range: (0, 1)
$f(x) = \frac{1}{1 + e^{-x}}$
Tanh Function
• It is a hyperbolic trigonometric function
• The Tanh activation almost always works better than the sigmoid function, as optimization is easier with it
• The advantage of Tanh is that it can deal more easily with negative numbers
• It is actually a mathematically shifted and scaled version of the sigmoid function
• Range: (−1, 1)
$f(x) = \tanh(x) = \frac{2}{1 + e^{-2x}} - 1$
ReLU Function
• ReLU stands for rectified linear unit
• It is the most widely used activation function
• It is primarily implemented in the hidden layers of a neural network
• This function outputs the maximum of zero and its input, so only positive values pass through during forward propagation
• Range: [0, ∞)
$f(x) = \begin{cases} 0 & \text{for } x < 0 \\ x & \text{for } x \ge 0 \end{cases}$
Leaky ReLU Function
• Leaky ReLU allows a small negative value to pass, so a gradient can still flow during backpropagation when we have a dead-ReLU problem
• This eventually lets the affected neuron activate again
• Range: (−∞, ∞)
$f(x) = \begin{cases} 0.01x & \text{for } x < 0 \\ x & \text{for } x \ge 0 \end{cases}$
Softmax Function
• The Softmax function is used when we have multiple classes
• It is useful for finding the class with the maximum probability
• The Softmax function is ideally used in the output layer of a classifier, where we are actually trying to obtain the probabilities that define the class of each input
• Range: (0, 1)
$\sigma(\mathbf{z})_j = \frac{e^{z_j}}{\sum_{k=1}^{K} e^{z_k}}, \quad j = 1, 2, \ldots, K$
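To make these formulas concrete, here is a minimal NumPy sketch of each activation function from this section; subtracting the maximum inside softmax is a standard numerical-stability trick, not something from the slides.

import numpy as np

def identity(x): return x                           # f(x) = x
def binary_step(x): return np.where(x < 0, 0, 1)    # 0 for x < 0, else 1
def sigmoid(x): return 1 / (1 + np.exp(-x))         # squashes into (0, 1)
def tanh(x): return np.tanh(x)                      # squashes into (-1, 1)
def relu(x): return np.maximum(0, x)                # max(0, x)
def leaky_relu(x, a=0.01): return np.where(x < 0, a * x, x)

def softmax(z):
    e = np.exp(z - np.max(z))                       # stability: subtract max
    return e / e.sum()                              # normalize to probabilities

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(sigmoid(x))                                   # values in (0, 1)
print(softmax(x))                                   # non-negative, sums to 1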
Got bored again? Let us get back to perceptrons and try to understand them better!
Training a Perceptron
By training a perceptron, we try to find a line, plane, or some hyperplane that can accurately separate two classes by adjusting weights and biases.
[Figure: the decision boundary improving over iterations — Error = 2, Error = 1, Error = 0]
Perceptron Training Algorithm
[Diagram: inputs X1 … Xn with weights W1 … Wn plus a bias feed an activation function that produces the output]
1. Initialize weights, bias, and threshold
2. Calculate the weighted sum and pass it through an activation function
3. Produce the output
4. If there is an error, update the weights, $W_{\text{new}} = W_{\text{old}} - LR \cdot \frac{\partial E}{\partial w}$, and repeat from step 2
5. If the output is correct, stop
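A minimal sketch of this loop using the classic perceptron learning rule; the toy data (an AND gate), learning rate, and epoch limit are assumptions for illustration.

import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])   # toy inputs (assumed)
y = np.array([0, 0, 0, 1])                       # AND-gate labels

w = np.zeros(2)                                  # 1. initialize weights ...
b = 0.0                                          #    ... and bias
lr = 0.1                                         # learning rate (LR)

for epoch in range(20):
    errors = 0
    for xi, target in zip(X, y):
        s = np.dot(xi, w) + b                    # 2. weighted sum
        output = 1 if s >= 0 else 0              # 3. step activation -> output
        error = target - output
        if error != 0:                           # 4. on error, update weights
            w += lr * error * xi
            b += lr * error
            errors += 1
    if errors == 0:                              # 5. all correct -> stop
        break

print(w, b)                                      # parameters separating the classes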
Benefits of Using Artificial Neural Networks
Organic Learning
Non-linear Data Processing
Fault Tolerance
Self-repairing
Let us now move toward Deep Learning frameworks!
Deep Learning Frameworks
These Deep Learning libraries help in implementing artificial neural networks
TensorFlow
TensorFlow is an open-source software library for high-performance numerical computation
Developed by Google
TensorFlow: Use Cases
Forecasting
Natural language processing
Text classification
Tagging
Google Translate
TensorFlow
TensorBoard: Used for visualizing TensorFlow computations and graphs
TensorFlow Serving: Used for rapid deployment of new algorithms/experiments while retaining the same server architecture and APIs
Keras
A high-level API which can run on top of TensorFlow, Theano, or CNTK
Keras
Supports recurrent neural networks as well as convolutional neural networks
Keras
Builds models by stacking layers on top of one another (see the sketch below)
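A minimal sketch of this layer-stacking style with the Keras Sequential API; the layer sizes and input dimension are assumptions for illustration.

from tensorflow import keras
from tensorflow.keras import layers

# stack layers one on top of another
model = keras.Sequential([
    layers.Dense(16, activation="relu", input_shape=(8,)),  # hidden layer
    layers.Dense(16, activation="relu"),                    # hidden layer
    layers.Dense(1, activation="sigmoid"),                  # output layer
])

model.compile(optimizer="adam", loss="binary_crossentropy")
model.summary()                                             # prints the stack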
PyTorch
A scientific computing framework developed by Facebook
PyTorch
'Pythonic' in nature
PyTorch
Offers dynamic computational graphs (see the sketch below)
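A tiny sketch of what 'dynamic' means here: PyTorch builds the graph on the fly as operations run (define-by-run); the values are assumed for illustration.

import torch

a = torch.tensor(10.0, requires_grad=True)
b = torch.tensor(20.0, requires_grad=True)

h = a * b + 30        # the graph for h is created right here, at run time
h.backward()          # gradients flow through the graph just built

print(a.grad)         # dh/da = b = 20
print(b.grad)         # dh/db = a = 10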
DL4J
A Deep Learning programming library written for Java
DL4J: Use Cases
Image recognition
Fraud detection
Text mining
Parts-of-speech tagging
Natural language processing
MXNet
Developed by the Apache Software Foundation
MXNet: Use Cases
Imaging
Speech recognition
Forecasting
NLP
What Are Tensors?
A tensor is a multi-dimensional array in which data is stored
[Diagram: a tensor is given as an input to a neural network]
Tensor Rank
Tensor rank represents the number of dimensions of an n-dimensional array

Rank | Math Entity                      | Example
0    | Scalar (magnitude only)          | s = 483
1    | Vector (magnitude and direction) | v = [1.1, 2.2, 3.3]
2    | Matrix (table of numbers)        | m = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
3    | 3-Tensor (cube of numbers)       | t = [[[2], [4], [6]], [[8], [10], [12]], [[14], [16], [18]]]
n    | n-Tensor                         | ……
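A quick sketch of these ranks with tf.constant (TensorFlow is assumed here since the deck uses it later; the NumPy equivalents look the same).

import tensorflow as tf

s = tf.constant(483)                                  # rank 0: scalar
v = tf.constant([1.1, 2.2, 3.3])                      # rank 1: vector
m = tf.constant([[1, 2, 3], [4, 5, 6], [7, 8, 9]])    # rank 2: matrix
t = tf.constant([[[2], [4], [6]],
                 [[8], [10], [12]],
                 [[14], [16], [18]]])                 # rank 3: cube of numbers

for x in (s, v, m, t):
    print(x.shape)                                    # (), (3,), (3, 3), (3, 3, 1)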
Computational Graph
Computation is done in the form of a graph:
a = 10, b = 20, c = 30
$h = (a \times b) + c$
[Graph: a and b feed a Multiplication node; its result and c feed an Addition node that outputs h]
Computational Graph
The computational graph is executed inside a session
a = 10, b = 20, c = 30
$h = (a \times b) + c$
[Diagram: the same multiply-add graph, shown running inside a Session]
Computational Graph
The computational graph is executed inside a session
Node → Mathematical operation
Edge → Tensor
[Diagram: the same graph, with nodes labeled as operations and edges as tensors]
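A minimal sketch of this graph in TensorFlow 1.x style, which matches the tf.placeholder examples later in this deck (in TensorFlow 2.x the same expression would simply run eagerly).

import tensorflow as tf

# build the graph: nodes are operations, edges carry tensors
a = tf.constant(10)
b = tf.constant(20)
c = tf.constant(30)
h = a * b + c            # a Multiplication node feeding an Addition node

# execute the graph inside a session
with tf.Session() as sess:
    print(sess.run(h))   # 230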
Program Elements in TensorFlow
Constant Placeholder Variable
Constant
Constants are program elements whose values do not change
a = tf.constant(10)
b = tf.constant(20)
Placeholder
A placeholder is a program element to which we can assign data at a later time
x = tf.placeholder(tf.float32)
y = tf.placeholder(tf.string)
Variable
A variable is a program element which allows us to add new trainable parameters to the graph
W = tf.Variable([3], dtype=tf.float32)
b = tf.Variable([0.4], dtype=tf.float32)
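Putting the three program elements together, a minimal TensorFlow 1.x-style sketch of a linear model y = W*x + b; the feed values are assumptions for illustration.

import tensorflow as tf

W = tf.Variable([3.0], dtype=tf.float32)           # trainable parameter
b = tf.Variable([0.4], dtype=tf.float32)           # trainable parameter
x = tf.placeholder(tf.float32)                     # data assigned at run time

y = W * x + b                                      # the model graph

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())    # variables need initializing
    print(sess.run(y, feed_dict={x: [1.0, 2.0]}))  # [3.4 6.4]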
Quiz
Quiz 1
A tensor is a single-dimensional array in which data is stored
A True
B False
Answer 1
A tensor is a single-dimensional array in which data is stored
A True
B False ✓ (a tensor is a multi-dimensional array)
Quiz 2
How many layers does a standard neural network have?
A 1
B 2
C 3
D 4 or more
Answer 2
How many layers does a standard neural network have?
A 1
B 2
C 3 ✓ (input, hidden, and output)
D 4 or more
Quiz 3
Is 'perceptron' just another name for 'neuron'?
A Yes
B No
Answer 3
Is 'perceptron' just another name for 'neuron'?
A Yes
B No
Thank you!
India: +91-7847955955
US: 1-800-216-8930 (TOLL FREE)
[email protected]
24/7 Chat with Our Course Advisor