Introduction to Neural Network
[Link]
PROFESSOR
CSE - AIML
SRM IST, Ramapuram
[Link] Prof/CSE - AIML 1
Unit-1 Introduction to Neural Network
Biological neuron, Motivation from the biological neuron, McCulloch-Pitts Neuron,
Perceptron, Perceptron learning algorithm, Representation power of a network of
perceptrons, Activation functions: Sigmoid, tanh, ReLU, leaky ReLU, Sigmoid neuron,
Gradient descent learning algorithm, Representation power of a multilayer network of
sigmoid neurons, Representation power of functions: complex functions in real-world
examples, Feedforward neural networks, Learning parameters, output and loss
functions of feedforward networks, Backpropagation learning algorithm, Applying the chain rule
across a neural network, Computing partial derivatives with respect to a weight
Biological Neuron
⦿ Neurons are the basic functional units of the nervous system. They generate
electrical signals called action potentials, which allow them to transmit
information quickly over long distances. Almost all neurons have three basic
functions essential for normal functioning.
⦿ These are to:
1. Receive signals (or information) from outside.
2. Process the incoming signals and determine whether or not the information should be passed along.
3. Communicate signals to target cells, which might be other neurons, muscles, or glands.
Main parts of biological neuron
⦿ Dendrite
Dendrites are responsible for receiving incoming signals from outside the neuron.
Incoming signals can be either excitatory, meaning they tend to make the neuron fire
(generate an electrical impulse), or inhibitory, meaning they tend to keep the neuron from firing.
⦿ Soma
The soma is the cell body, responsible for processing the input signals and deciding
whether the neuron should fire an output signal.
⦿ Axon
The axon is responsible for carrying the processed signal from the neuron to the relevant cells.
⦿ Synapse
A synapse is the connection between an axon and the dendrites of another neuron.
Artificial neuron
• An artificial neuron, also known as a perceptron, is the basic unit of a
neural network. In simple terms, it is a mathematical function based on a
model of biological neurons.
• It can also be seen as a simple logic gate with binary outputs.
Main Functions of Artificial neuron
• Takes inputs from the input layer
• Weighs them separately and sums them up
• Passes this sum through a nonlinear function to produce an output
Biological Neuron vs Artificial Neuron
McCulloch-Pitts Neuron Model
Binary neuron model (1943):
o Takes binary inputs (0 or 1).
o Applies weighted sum and threshold.
o Output is 1 if sum ≥ threshold, else 0.
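The rule above can be sketched in a few lines of Python (excitatory binary inputs only; the AND/OR thresholds below are illustrative, not from the slides):

```python
def mp_neuron(inputs, threshold):
    """McCulloch-Pitts neuron: binary inputs, plain sum, hard threshold."""
    total = sum(inputs)            # inputs are 0 or 1
    return 1 if total >= threshold else 0

# Over two inputs, threshold 2 behaves like AND, threshold 1 like OR
print(mp_neuron([1, 1], 2))  # 1
print(mp_neuron([1, 0], 2))  # 0
print(mp_neuron([1, 0], 1))  # 1
```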
Perceptron
Parts of Perceptron
⦿ Input layer
⦿ Weights and bias
⦿ Activation function
⦿ Output layer
Comparison between MP Neuron
Model and Perceptron Model
• Both the MP Neuron Model and the Perceptron Model work on linearly separable data.
• The MP Neuron Model accepts only Boolean inputs, whereas the Perceptron Model can
process any real-valued input.
• Inputs are not weighted in the MP Neuron Model, which makes it less flexible. The
Perceptron Model, on the other hand, assigns a weight to each input.
• In both models, the threshold can be adjusted to make the model fit the dataset.
Perceptron Learning Algorithm
1. First, multiply each input value by its corresponding weight and add the products to
determine the weighted sum: ∑wᵢxᵢ = w₁x₁ + w₂x₂ + … + wₙxₙ. Add another essential term,
the bias b, to the weighted sum to improve model performance: ∑wᵢxᵢ + b.
2. Next, an activation function is applied to this weighted sum, producing a binary or
continuous-valued output: Y = f(∑wᵢxᵢ + b).
3. Next, the difference between this output and the actual target value is computed to get the
error term E, generally as a squared error: E = (Y − Y_actual)². The steps up to this point
form the forward-propagation part of the algorithm.
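Steps 1–3 (the forward pass) can be sketched as follows; the input, weight, and bias values are made up for illustration:

```python
import numpy as np

x = np.array([1.0, 0.0])    # inputs (illustrative)
w = np.array([0.6, 0.6])    # weights (illustrative)
b = -0.5                    # bias (illustrative)

z = np.dot(w, x) + b        # step 1: weighted sum plus bias
y = 1 if z >= 0 else 0      # step 2: step activation -> binary output
target = 1
E = (y - target) ** 2       # step 3: squared error
print(z, y, E)
```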
Perceptron Learning Algorithm
4. Finally, optimize this error (the loss function) using an optimization algorithm. Generally,
some form of gradient descent is used to find the optimal values of the parameters (weights and
bias), with the learning rate as a user-chosen hyperparameter. This step forms the
backward-propagation part of the algorithm.
Importance of Weight and Bias
• Weight increases the steepness of the activation function: the weights decide how fast the
activation function triggers, whereas the bias is used to delay the triggering of the
activation function.
• The weight indicates the effectiveness of a particular input. The greater an input's weight,
the more impact it has on the network.
• Bias, on the other hand, is like the intercept added in a linear equation. It is an additional
parameter in the neural network used to adjust the output along with the weighted sum of the
inputs to the neuron.
• Bias is therefore a constant that helps the model fit the given data as well as possible.
Importance of Weight and Bias
• y = mx + c, where m = weight and c = bias
• If c were absent, the line could only pass through the origin, as shown in the figure.
• Without a bias, the model could only learn lines passing through the origin, which does not
match real-world scenarios.
• Introducing a bias makes the model more flexible.
Importance of Weight and Bias - Example
Change in weight
• Weight W1 changed from 1.0 to 4.0
• Weight W2 changed from -0.5 to 1.5
• As the weight increases, the steepness of the activation function increases.
• It can therefore be inferred that the larger the weight, the earlier the activation function triggers.
Importance of Weight and Bias - Example
Bias changed from -1.0 to -5.0
• The change in bias increases the input value at which the activation function triggers.
• From the graph it can be inferred that the bias helps control the value at which the
activation function triggers.
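The two effects illustrated above, weight controlling steepness and bias controlling the trigger point, can be checked numerically with a sigmoid neuron (the input and parameter values below are illustrative):

```python
import math

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

def neuron(x, w, b):
    return sigmoid(w * x + b)

# Larger weight -> steeper curve, so the same input gives a more extreme output
print(neuron(0.5, 1.0, 0.0))   # moderate output
print(neuron(0.5, 4.0, 0.0))   # closer to 1

# More negative bias -> a larger input is needed before the neuron "fires"
print(neuron(1.0, 1.0, -1.0))
print(neuron(1.0, 1.0, -5.0))  # far smaller output
```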
Example
output = sum(weights * inputs) + bias
y = f(∑xᵢwᵢ + b)
Activation Function
• An activation function is a function added to an artificial neural network in order to
help the network learn complex patterns in the data.
• Compared with the neuron-based model in our brains, the activation function is what
finally decides what is to be fired to the next neuron.
• That is exactly what an activation function does in an ANN as well: it takes the output
signal from the previous cell and converts it into some form that can be taken as input to
the next cell.
1. Sigmoid Function
2. Softmax
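The activation functions named in this unit (sigmoid, tanh, ReLU, leaky ReLU, softmax) can be sketched as follows; the leaky-ReLU slope of 0.01 is a common default, not a value from the slides:

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))           # squashes to (0, 1)

def tanh(z):
    return np.tanh(z)                     # squashes to (-1, 1)

def relu(z):
    return np.maximum(0, z)               # zero for negative inputs

def leaky_relu(z, alpha=0.01):
    return np.where(z > 0, z, alpha * z)  # small slope for negative inputs

def softmax(z):
    e = np.exp(z - np.max(z))             # shift by the max for numerical stability
    return e / e.sum()                    # outputs are positive and sum to 1

z = np.array([-2.0, 0.0, 2.0])
print(sigmoid(z), relu(z), softmax(z))
```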
Sigmoid Neuron
• Similar to perceptron but with sigmoid activation.
• Continuous output between 0 and 1.
• Useful for probabilistic interpretation.
Softmax Vs Sigmoid
Softmax
Single Layer Feed Forward
(Figure: input neurons X1, X2, X3 connected through weights w11 … w34 to output neurons
Y1 … Y4; each output neuron computes yj_in = ∑ wij·xi and applies an activation to give yj_out.)
Multi Layer Feed Forward
Simple Classification Problem
XOR Problem
• Most real-life classification problems are not linearly separable.
• A perceptron cannot learn to compute even a 2-bit XOR, as it is not linearly separable.
• There is no single straight line that separates the patterns producing 1s
{(0,1), (1,0)} from the patterns producing 0s {(0,0), (1,1)}.
• How to overcome this limitation?
1. Draw a curved decision surface. But a perceptron cannot model any curved surface.
2. Employ two decision lines (a multi-layered perceptron).
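Option 2 above, two decision lines combined by a second layer, can be sketched with hand-set weights (these weights are illustrative, not learned):

```python
import numpy as np

def step(z):
    # Hard-threshold activation: 1 if z >= 0, else 0
    return 1 if z >= 0 else 0

def mlp_xor(x1, x2):
    """Two hidden perceptrons draw the two decision lines; the output combines them."""
    x = np.array([x1, x2])
    h_or = step(np.dot([1, 1], x) - 0.5)    # fires when x1 OR x2
    h_and = step(np.dot([1, 1], x) - 1.5)   # fires when x1 AND x2
    return step(h_or - h_and - 0.5)         # OR but not AND = XOR

for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, mlp_xor(a, b))  # outputs 0, 1, 1, 0
```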
Perceptron Error
• In the Perceptron Learning Rule, the predicted output is compared with the known
output. If they do not match, the error is propagated backward to allow weight
adjustment to take place.
NEURAL NETWORK IMPLEMENTATION FROM SCRATCH
WHAT IS LOGICAL OR GATE?
• When at least one of the inputs is 1, the output of the OR gate is 1. This means
the output is 0 only when both inputs are 0.
TRUTH-TABLE FOR OR GATE:
x1 | x2 | output
0  | 0  | 0
0  | 1  | 1
1  | 0  | 1
1  | 1  | 1
PERCEPTRON FOR THE OR GATE:
ERROR CALCULATION:
WHAT IS GRADIENT DESCENT?
• Gradient Descent is an optimization algorithm used in machine
learning models to find the minimum value of a cost function.
• It does this by taking small steps in the direction that is opposite
to the gradient of the cost function until it reaches a local
minimum.
• The learning rate determines the size of each step and can be
adjusted to balance convergence speed and accuracy.
WHAT IS GRADIENT DESCENT?
• For updating the weight values, we are going to use a gradient descent algorithm.
• Gradient descent is a machine learning algorithm that operates iteratively to find
the optimal values of its parameters. It takes into account a user-defined learning
rate and the initial parameter values.
GRADIENT DESCENT WORKING
Working (iterative):
1. Start with initial values.
2. Calculate the cost.
3. Update the values using the update function.
4. Repeat until the cost function is minimized.
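The four steps above, applied to a simple one-parameter cost f(x) = (x − 3)², look like this (the cost function and starting point are illustrative):

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Repeatedly step opposite the gradient until near a minimum."""
    x = x0                       # 1. start with an initial value
    for _ in range(steps):
        x = x - lr * grad(x)     # 3. update using the update rule
    return x                     # 4. value (approximately) minimizing the cost

# f(x) = (x - 3)^2, so f'(x) = 2(x - 3); the minimum is at x = 3
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(round(x_min, 3))  # 3.0
```

Shrinking the learning rate slows convergence; making it too large can overshoot the minimum and diverge.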
WHY DO WE NEED IT?
• Usually, we look for a closed-form formula that gives us the optimal values for our
parameters. Gradient descent, however, finds those values by itself, iteratively.
Formula for the gradient descent algorithm
Learning Rate
DERIVATION OF THE FORMULA USED IN A NEURAL NETWORK
• What we want to find is how a particular weight value affects the error. To find
that, we are going to apply the chain rule.
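For a sigmoid neuron with squared error E = (out − target)², out = σ(net), and net = ∑wᵢxᵢ + b, the chain rule gives ∂E/∂wᵢ = (∂E/∂out)·(∂out/∂net)·(∂net/∂wᵢ). A sketch (the example values are assumptions for illustration):

```python
def weight_gradient(x_i, out, target):
    """Chain rule for one weight of a sigmoid neuron with squared error."""
    dE_dout = 2 * (out - target)    # derivative of (out - target)^2
    dout_dnet = out * (1 - out)     # derivative of the sigmoid, in terms of its output
    dnet_dw = x_i                   # net is linear in each weight
    return dE_dout * dout_dnet * dnet_dw

# With out = 0.5, target = 1, x_i = 1: 2(-0.5) * 0.25 * 1 = -0.25
print(weight_gradient(1.0, 0.5, 1.0))  # -0.25
```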
CALCULATING DERIVATIVES:
• In our case:
• Output = 0.68997, Target = 1
FINDING THE SECOND PART OF THE DERIVATIVE:
FINDING THE THIRD PART OF THE DERIVATIVE
Putting it all together:
• Putting it in our main equation:
w2 = 0.3 − (0.05) × (−0.06631)
w2 = 0.3033
Notice that the value of the weight has increased here. We could calculate all the
values in this way, but as we can see, it would be a lengthy process. So we are now
going to implement all the steps in Python.
SUMMARY OF THE MANUAL IMPLEMENTATION OF A NEURAL NETWORK:
a. Input for the perceptron:
b. Applying the sigmoid function for the predicted output:
c. Calculate the error:
d. Changing the weight value based on the gradient descent formula:
e. Calculating the derivative:
f. Individual derivatives:
g. Then we run the same code with the updated weight values.
IMPLEMENTATION OF A NEURAL NETWORK IN PYTHON:
10.1 Import the required libraries:
10.2 Assign input values:
10.3 Target output:
10.4 Assign the weights:
10.5 Adding bias values and assigning a learning rate:
10.6 Applying the sigmoid function:
10.7 Derivative of the sigmoid function:
10.8 The main logic for predicting the output and updating the weight values:
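Putting the steps above together, a minimal version of the full training loop might look like this (the initial weights, bias, learning rate, and epoch count are assumptions for illustration, not the slides' exact values):

```python
import numpy as np

# Input values: the four rows of the OR-gate truth table
inputs = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
# Target output for OR
target = np.array([[0], [1], [1], [1]], dtype=float)

# Weights, bias, and learning rate (illustrative initial values)
weights = np.array([[0.1], [0.2]])
bias = 0.3
lr = 0.05

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def sigmoid_derivative(out):
    # Derivative of the sigmoid, expressed in terms of its output
    return out * (1 - out)

# Main loop: predict, compute the error, update weights via the chain rule
for epoch in range(10000):
    net = np.dot(inputs, weights) + bias      # weighted sum plus bias
    out = sigmoid(net)                        # predicted output
    error = out - target                      # dE/dout (up to a factor of 2)
    delta = error * sigmoid_derivative(out)   # dE/dnet
    weights -= lr * np.dot(inputs.T, delta)   # dE/dw summed over the examples
    bias -= lr * delta.sum()

print(np.round(out.ravel()))  # predictions round to 0, 1, 1, 1
```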