CSE465: Pattern Recognition and Neural Network
Lecture 2: Perceptron
Sec: 3
Faculty: Silvia Ahmed (SvA)
Spring 2025
Today’s Topic
1. Perceptron:
• What is a Perceptron?
• Perceptron vs Neuron
• Geometric Intuition
• How to train a Perceptron?
What is a Perceptron?
• Fundamental building block of an ANN.
• It is an algorithm used for supervised ML.
• A Perceptron is a simple type of artificial neural network algorithm developed by Frank Rosenblatt in 1957.
• It is the basic unit of a neural network, taking multiple binary inputs and producing a single binary output.
• It computes a weighted sum of its inputs, applies an activation function, and produces an output.
Figure: Perceptron diagram (inputs x1, x2; weights w1, w2; bias b via a constant input of 1; summation Σ; activation A)
Different parts of Perceptron
• Input features: x1, x2
• Weights: w1, w2
• Bias: b (fed by a constant input of 1)
• Summation function that works as a dot product:
z = w1∙x1 + w2∙x2 + b
• Activation function A:
  • Signum function
  • Sigmoid
  • ReLU
  • tanh
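A minimal sketch of this forward pass in Python, assuming a step (signum-style) activation; the input and weight values are illustrative, not from the slide:

import numpy as np

def step(z):
    # Step activation: 1 if z >= 0, else 0
    return 1 if z >= 0 else 0

def perceptron(x, w, b):
    z = np.dot(w, x) + b   # weighted sum as a dot product
    return step(z)

x = np.array([78.0, 7.8])   # features x1, x2 (illustrative)
w = np.array([1.0, 2.0])    # weights w1, w2 (illustrative)
b = 3.0                     # bias
print(perceptron(x, w, b))  # 78 + 15.6 + 3 = 96.6 >= 0 -> 1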
Example use of a Perceptron
IQ, x1 | CGPA, x2 | Job Placement
78 | 7.8 | 1
69 | 5.1 | 0
… | … | …

1) Training: the main job is to learn the values of the weights and the bias from the training samples.
E.g., w1 = 1, w2 = 2, b = 3
2) Prediction: for a new sample where IQ = 100 and CGPA = 5.1:
z = 100 × 1 + 5.1 × 2 + 3 = 113.2 ≥ 0
So Job Placement = 1
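The same prediction as a quick runnable check (threshold at 0, as in the slide):

w1, w2, b = 1, 2, 3           # learned parameters from the example
iq, cgpa = 100, 5.1           # new sample
z = w1 * iq + w2 * cgpa + b   # weighted sum
print(z)                      # 113.2
print(1 if z >= 0 else 0)     # Job Placement = 1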
• Question: what if there are more than 2 features?

IQ, x1 | CGPA, x2 | State, x3 | Job Placement
78 | 7.8 | Dhaka | 1
69 | 5.1 | Khulna | 0
… | … | … | …

z = w1∙x1 + w2∙x2 + w3∙x3 + b
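The weighted sum stays a dot product no matter how many features there are. A small sketch (weights are illustrative; a categorical feature such as State would first need a numeric encoding, which is an assumption made here, not something the slide specifies):

import numpy as np

x = np.array([78.0, 7.8, 1.0])   # IQ, CGPA, State (assumed numeric encoding)
w = np.array([1.0, 2.0, 0.5])    # illustrative weights w1, w2, w3
b = 3.0

z = np.dot(w, x) + b             # w1*x1 + w2*x2 + w3*x3 + b
print(1 if z >= 0 else 0)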
Perceptron vs Neuron
• Deep learning is inspired by the nervous system.
Figure: Perceptron vs Neuron [2]
Interpretation
IQ, x1 | CGPA, x2 | Job Placement
78 | 7.8 | 1
69 | 5.1 | 0
… | … | …

Example: w1 = 2, w2 = 4, b = 1
• Weights depict the strength of each input connection.
• Weights mostly reflect feature importance (here w2 > w1, so CGPA influences the output more than IQ).
Geometric Intuition
z = w1∙x1 + w2∙x2 + b
y = f(z) = 1 if z ≥ 0, 0 if z < 0

Substituting w1 => A, w2 => B, b => C and x1 => x, x2 => y turns the decision boundary into the equation of a line:
Ax + By + C = 0
The perceptron outputs 1 in the region where Ax + By + C ≥ 0 and 0 in the region where Ax + By + C < 0 (e.g., a line in the IQ-CGPA plane separating the two Job Placement classes). A code sketch follows the bullets.
• A perceptron is a “line” and its main functionality is to create “regions”: 2D -> line, 3D -> plane, ≥4D -> hyperplane.
• A perceptron is a binary classifier.
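A small sketch of the geometric view: classification is just checking which side of the line a point falls on (the coefficients A = 2, B = 3, C = 5 are the ones the later slides use):

def region(x, y, A=2, B=3, C=5):
    # The sign of Ax + By + C tells which side of the line (x, y) lies on,
    # which is exactly the perceptron's prediction
    return 1 if A * x + B * y + C >= 0 else 0

print(region(4, 5))    # 2*4 + 3*5 + 5 = 28 >= 0 -> region 1
print(region(-2, -2))  # 2*(-2) + 3*(-2) + 5 = -5 < 0 -> region 0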
Logic AND
input 1 | input 2 | output
1 | 1 | 1
1 | 0 | 0
0 | 1 | 0
0 | 0 | 0
Logic OR
input 1 | input 2 | output
1 | 1 | 1
1 | 0 | 1
0 | 1 | 1
0 | 0 | 0
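Both AND and OR are linearly separable, so a single perceptron realizes each of them. A minimal sketch; these particular weights and biases are hand-picked assumptions, not values from the slides:

def step(z):
    return 1 if z >= 0 else 0

def gate(x1, x2, w1, w2, b):
    return step(w1 * x1 + w2 * x2 + b)

for x1, x2 in [(1, 1), (1, 0), (0, 1), (0, 0)]:
    and_out = gate(x1, x2, 1, 1, -1.5)   # fires only when both inputs are 1
    or_out = gate(x1, x2, 1, 1, -0.5)    # fires when at least one input is 1
    print(x1, x2, and_out, or_out)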
Logic XOR
input 1 | input 2 | output
1 | 1 | 0
1 | 0 | 1
0 | 1 | 1
0 | 0 | 0
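XOR is not linearly separable, so no single perceptron can compute it. A brute-force sketch (the search grid is an arbitrary choice) confirms that no (w1, w2, b) in the grid reproduces the XOR truth table:

import numpy as np

xor_table = [((1, 1), 0), ((1, 0), 1), ((0, 1), 1), ((0, 0), 0)]

found = False
for w1 in np.linspace(-2, 2, 21):
    for w2 in np.linspace(-2, 2, 21):
        for b in np.linspace(-2, 2, 21):
            if all((1 if w1 * x1 + w2 * x2 + b >= 0 else 0) == y
                   for (x1, x2), y in xor_table):
                found = True
print(found)  # False: no line separates XOR's classes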
Limitation
• Works only with linearly separable or “sort-of” linearly separable data
• TensorFlow Playground: [Link]

Dataset type | Noise | Learning rate | Activation
Gaussian | 15-20 | 0.01 | Sigmoid
Exclusive OR | 15-20 | 0.01 | Sigmoid
Perceptron Trick
• Main target is to get the decision boundary in the form:
∑_{i=0}^{n} wi∙xi = 0
(with x0 = 1, so that w0 plays the role of the bias b)
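A small sketch of this augmented form, folding the bias into the weight vector by prepending a constant feature x0 = 1 (reading the i = 0 term as the bias is the usual convention; the numbers are the earlier example's):

import numpy as np

w = np.array([3.0, 1.0, 2.0])    # [w0, w1, w2] with w0 = b = 3
x = np.array([1.0, 100.0, 5.1])  # [x0, x1, x2] with x0 = 1

z = np.dot(w, x)                 # sum_{i=0}^{2} wi*xi = b + w1*x1 + w2*x2
print(z)                         # 113.2, same as the earlier prediction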
Steps - 1
• Initialize: A = 1, B = 1, C = 0
• Randomly select one sample
Steps - 2
• Updated line: A = 2, B = 1.5, C = 0.4
• Randomly select one sample
Steps - 3
• Updated line: A = 4, B = 1.5, C = 0.4
• Randomly select one sample
Line Transformation
• Shown in [Link]/calculator
• Ax + By + C = 0

Main equation: 2x + 3y + 5 = 0
Change | Examples | Effect
Change in C | 2x+3y+10=0, 2x+3y+0=0 | Line shifts parallel to itself
Change in A | 4x+3y+5=0, x+3y+5=0 | Slope changes (line rotates about its y-intercept)
Change in B | 2x+6y+5=0, 2x+y+5=0 | Slope changes (line rotates about its x-intercept)
How much to transform?
The line is 2x + 3y + 5 = 0, i.e., coefficients (2, 3, 5).
• Wrongly “positive” point (4, 5): append a 1 to get (4, 5, 1) and subtract it from the coefficients:
(2, 3, 5) − (4, 5, 1) = (−2, −2, 4)
The minus operation brings the wrongly “positive” point to the correct “negative” zone.
• Wrongly “negative” point (1, 3): append a 1 to get (1, 3, 1) and add it to the coefficients:
(2, 3, 5) + (1, 3, 1) = (3, 6, 6)
The plus operation brings the wrongly “negative” point to the correct “positive” zone.
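The same two updates as a quick runnable check (values straight from the slide):

import numpy as np

coef = np.array([2, 3, 5])       # (A, B, C) of 2x + 3y + 5 = 0

pos_pt = np.array([4, 5, 1])     # wrongly "positive" point with 1 appended
print(coef - pos_pt)             # [-2 -2  4]

neg_pt = np.array([1, 3, 1])     # wrongly "negative" point with 1 appended
print(coef + neg_pt)             # [3 6 6]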
Live Desmos demonstration
Line: 2x + 3y + 5 = 0; points: (5, 2) and (−3, −2)
Learning rate
• The learning rate is a small number that controls how fast or slow a machine learning or deep learning model updates its internal parameters (like weights) during training.
• “It’s like the step size your model takes while learning. Too big, and it may trip and fall. Too small, and it may take forever to learn.”
• New coef = old coef ± learning rate × (point coordinate), sketched in code below.
• Why it's important:
  • If the learning rate is too high → the model may skip over the best solution and never settle.
  • If the learning rate is too low → the model will learn very slowly, taking a long time to improve (or getting stuck).
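A quick sketch of how η scales the perceptron-trick step from the earlier slide (the η values are chosen for illustration):

import numpy as np

coef = np.array([2.0, 3.0, 5.0])    # (A, B, C) of the line
point = np.array([4.0, 5.0, 1.0])   # misclassified "positive" point, 1 appended

for lr in (1.0, 0.1, 0.01):
    # Smaller learning rate -> smaller nudge to the line
    print(lr, coef - lr * point)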
Algorithm
• epoch = 1000, η = 0.01

Perceptron trick (two explicit cases):
for i in range(epoch):
    randomly select a point xi
    if xi ∈ N and ∑_{i=0}^{2} wi∙xi ≥ 0:
        w_new = w_old − η∙xi
    if xi ∈ P and ∑_{i=0}^{2} wi∙xi < 0:
        w_new = w_old + η∙xi

Compact form, covering both cases (runnable sketch below):
for i in range(epoch):
    randomly select a point xi
    w_new = w_old + η(yi − ŷi)∙xi

yi | ŷi | yi − ŷi
1 | 1 | 0
0 | 0 | 0
1 | 0 | 1
0 | 1 | −1
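A runnable version of the compact rule, assuming the augmented representation (x0 = 1 folded into the weight vector) and a step activation; the OR-gate data and the zero initialization are illustrative choices, not from the slides:

import numpy as np

def train_perceptron(X, y, epochs=1000, lr=0.01, seed=0):
    rng = np.random.default_rng(seed)
    Xa = np.c_[np.ones(len(X)), X]           # prepend x0 = 1 for the bias
    w = np.zeros(Xa.shape[1])
    for _ in range(epochs):
        i = rng.integers(len(Xa))            # randomly select a point
        y_hat = 1 if np.dot(w, Xa[i]) >= 0 else 0
        w = w + lr * (y[i] - y_hat) * Xa[i]  # w_new = w_old + η(y − ŷ)x
    return w

# Illustrative, linearly separable data: the OR gate
X = np.array([[1, 1], [1, 0], [0, 1], [0, 0]])
y = np.array([1, 1, 1, 0])
print(train_perceptron(X, y))   # weights of a separating line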
Problem with Perceptron Trick
• Which decision boundary is better? The trick gives no way to quantify the result.
• Convergence is not guaranteed.
Loss Function
• An error function (also called a loss function) measures how far off a
machine learning or deep learning model's predictions are from the actual
target values.
• It gives the model a numeric value that reflects its performance—lower
values mean better predictions.
• The error function guides the learning process by telling the optimizer
how to adjust the model’s parameters (like weights in a neural network)
during training.
• The loss is a function of the model's parameters: f(w1, w2, b)
Perceptron Loss Function
• Number of misclassified points
• (Perpendicular) distance of the misclassified points
• In practice:
  • Take the point and plug it into the line equation.
  • The magnitude |Ax + By + C| is proportional to the perpendicular distance, but the mathematics is much simpler than calculating the actual distance.

Example with the line 2x + 3y + 5 = 0:
Point (4, 5): 2(4) + 3(5) + 5 = 28
Point (−2, −2): |2(−2) + 3(−2) + 5| = |−5| = 5
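A quick sketch of this practical loss: sum |Ax + By + C| over the misclassified points. Treating both example points as misclassified is an assumption made here purely for illustration:

def signed_value(point, A=2, B=3, C=5):
    # Plug the point into the line equation Ax + By + C
    x, y = point
    return A * x + B * y + C

misclassified = [(4, 5), (-2, -2)]   # slide's example points
loss = sum(abs(signed_value(p)) for p in misclassified)
print(loss)  # 28 + 5 = 33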
More Loss Functions
• If the activation function is sigmoid:
  • Loss is binary cross-entropy (used in logistic regression)
  • So when the activation function is sigmoid, the perceptron is basically logistic regression
• Multi-class classification:
  • Activation: softmax
  • Loss: categorical cross-entropy
• Regression:
  • Activation: linear (no activation)
  • Loss: MSE
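A minimal sketch of the three activation/loss pairings; the formulas are the standard ones, and all numeric values here are made-up examples:

import numpy as np

# Sigmoid activation + binary cross-entropy (binary classification)
z = 0.8
p = 1 / (1 + np.exp(-z))                        # sigmoid output in (0, 1)
y = 1
bce = -(y * np.log(p) + (1 - y) * np.log(1 - p))

# Softmax activation + categorical cross-entropy (multi-class)
logits = np.array([2.0, 1.0, 0.1])
probs = np.exp(logits) / np.exp(logits).sum()   # softmax over classes
cce = -np.log(probs[0])                         # true class is class 0

# Linear (no) activation + mean squared error (regression)
y_true, y_pred = 3.0, 2.5
mse = (y_true - y_pred) ** 2

print(bce, cce, mse)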
Reference and further reading
1. Goodfellow, Ian, et al. “Deep Learning”.
2. Pramoditha, Rukshan. “The Concept of Artificial Neurons (Perceptrons) in Neural Networks.” Medium, Towards Data Science, 29 Dec. 2021, [Link]/the-concept-of-artificial-neurons-perceptrons-in-neural-networks-fab22249cbfc. Accessed 21 Jan. 2025.