
Introduction to Artificial Neural Networks (ANN)

Prepared by Stephanie Chua, edited by Liew SH


Outline

+ Definition
+ Human Biological Neuron
+ Artificial Neuron
+ Artificial Neural Network
+ Types of ANN
+ How do ANNs work?
+ Activation Functions
+ Applications of ANN
Definition

Neural networks, also known as artificial neural networks (ANNs) or simulated neural networks (SNNs), are a subset of machine learning and are at the heart of deep learning algorithms. Their name and structure are inspired by the human brain, mimicking the way that biological neurons signal to one another.
Human Biological Neuron

+ A biological neuron has three main components: dendrites, the soma (or cell body), and the axon.
+ Dendrites receive signals from other neurons.
+ The soma sums the incoming signals. When sufficient input is received, the cell fires; it transmits a signal over its axon to other cells.
Artificial Neuron

+ Once an input layer is determined, weights are assigned.
+ These weights help determine the importance of any given variable, with larger weights contributing more significantly to the output compared to other inputs.
+ All inputs are then multiplied by their respective weights and summed.
+ The weighted sum is then passed through an activation function, which determines the output. If that output exceeds a given threshold, it "fires" (or activates) the node, passing data to the next layer in the network.
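
A minimal sketch of this computation (NumPy assumed; the 0.5 threshold and the example weights are hypothetical, not from the lecture):

    import numpy as np

    def sigmoid(z):
        """Sigmoid activation: squashes any real number into (0, 1)."""
        return 1.0 / (1.0 + np.exp(-z))

    def artificial_neuron(inputs, weights, threshold=0.5):
        """Multiply inputs by weights, sum, activate, then fire if over threshold."""
        z = np.dot(inputs, weights)   # weighted sum of all inputs
        activation = sigmoid(z)       # activation function determines the output
        return 1 if activation > threshold else 0

    # Example: two inputs with hypothetical weights
    print(artificial_neuron(np.array([1.0, 1.0]), np.array([0.8, 0.2])))  # prints 1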
Biological Neuron vs Artificial Neuron

Source: https://www.researchgate.net/publication/325870973_Investigating_Keystroke_Dynamics_as_a_Two-Factor_Biometric_Security/figures?lo=1
Artificial Neural Network

+ Artificial neural networks (ANNs) are composed of node layers, containing an input layer, one or more hidden layers, and an output layer.

Source: https://www.ibm.com/cloud/learn/neural-networks
Artificial Neural Network

Training a neural network:

i. Forward propagation - Apply a set of weights to the input data and calculate an output. For the first forward propagation, the set of weights is selected randomly.

ii. Backward propagation - Measure the margin of error of the output and adjust the weights accordingly to decrease the error.

The ANN repeats both forward and backward propagation until the weights are calibrated to accurately predict an output.
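
A hedged sketch of this loop, reduced to a single neuron for brevity (NumPy assumed; the names and learning rate are illustrative, not from the lecture):

    import numpy as np

    rng = np.random.default_rng(0)
    weights = rng.random(2)          # first iteration: random weights in [0, 1)
    x, target = np.array([1.0, 1.0]), 0.0
    learning_rate = 0.5

    def forward(x, w):
        """Forward propagation: apply weights to the input and compute an output."""
        return 1.0 / (1.0 + np.exp(-np.dot(x, w)))   # weighted sum + sigmoid

    for epoch in range(1000):
        output = forward(x, weights)
        error = target - output                      # margin of error
        # Backward propagation: adjust the weights to decrease the error
        weights += learning_rate * error * output * (1 - output) * x

    print(forward(x, weights))       # approaches the target as the weights calibrate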
Types of ANN – Feedforward Neural Networks

+ The flow of information through the network is unidirectional, without going through loops.
+ Single-layer networks vs multilayer networks
• The number of layers depends on the complexity of the function that needs to be performed.
• A single-layer feedforward neural network consists of only two layers of neurons, with no hidden layers in between them.
• Multilayer neural networks consist of multiple hidden layers between the input and output layers, allowing for multiple stages of information processing.
• Feedforward networks are used to perform basic pattern and image recognition.
Single Layer vs Multilayer Neural Networks

Source: https://towardsdatascience.com/multi-layer-neural-networks-with-sigmoid-function-deep-learning-for-rookies-2-bf464f09eb7f
ANN - CALCULATION

We use (1,1) => 0 to demonstrate forward propagation.

Simple Dataset (XOR)

Input | Output
------+-------
 0,0  |   0
 0,1  |   1
 1,0  |   1
 1,1  |   0
ANN - CALCULATION

1. Assign weights to all synapses.

Note: Weights are selected randomly (based on a Gaussian distribution). Since it is the first iteration, the initial weights will be between 0 and 1. However, the final weights don't need to be.
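
A minimal initialization sketch (NumPy assumed; variable names are illustrative, and uniform values in [0, 1) are used to match the first-iteration range on the slide):

    import numpy as np

    rng = np.random.default_rng(42)
    w_input_hidden = rng.random((2, 3))   # w1..w6: 2 inputs -> 3 hidden neurons
    w_hidden_output = rng.random(3)       # w7..w9: 3 hidden neurons -> 1 output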


ANN - CALCULATION

2. Sum the products of the inputs with their respective weights:

(1*0.8) + (1*0.2) = 1
(1*0.4) + (1*0.9) = 1.3
(1*0.3) + (1*0.5) = 0.8

To get the final value, apply an activation function to the hidden layer sums.
Types of Activation Function

Let's choose sigmoid.
ANN - CALCULATION

3. Apply S(x) to the three hidden layer sums:

S(1.0) = 0.7311
S(1.3) = 0.7858
S(0.8) = 0.6900

Then, sum the products of the hidden layer results with the second set of weights (also determined at random the first time around) to determine the output sum.
ANN - CALCULATION

4. Calculate the sum of the products of the hidden layer results with the second set of weights, and apply the activation function.

Output sum
= (0.73 * 0.3) + (0.79 * 0.5) + (0.69 * 0.9)
= 1.235

S(1.235) = 0.7747
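
Steps 1-4 can be reproduced with a short NumPy sketch (weights taken from the worked example; small rounding differences aside, it prints the same numbers):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    x = np.array([1.0, 1.0])              # the input (1, 1)
    W1 = np.array([[0.8, 0.4, 0.3],       # w1..w6: input -> hidden
                   [0.2, 0.9, 0.5]])
    W2 = np.array([0.3, 0.5, 0.9])        # w7..w9: hidden -> output

    hidden_sum = x @ W1                   # [1.0, 1.3, 0.8]
    hidden_out = sigmoid(hidden_sum)      # [0.7311, 0.7858, 0.6900]
    output_sum = hidden_out @ W2          # approx. 1.233 (the slide rounds to 1.235)
    output = sigmoid(output_sum)          # approx. 0.7747
    print(hidden_sum, hidden_out, output_sum, output)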
ANN – Complete Diagram

Question 1: Why is the calculated output off the target mark?

Answer: Because our weights were initialized randomly.

Action: We need to adjust the weights via back propagation.
Back Propagation – Adjust the Weights

+ Calculating the incremental change to these weights happens in two steps:
1. Find the margin of error of the output result to back out the necessary change in the output sum (delta output sum).
2. Extract the change in weights by multiplying the delta output sum by the hidden layer results.

Output sum margin of error = target - calculated

Back Propagation – Adjust the Weights

Output sum margin of error = target - calculated
= 0 - 0.77
= -0.77

Calculate the delta output sum: take the derivative of the activation (sigmoid) function and apply it to the output sum.

Delta output sum = S'(output sum) * (output sum margin of error)
Delta output sum = S'(1.235) * (-0.77)
Delta output sum = -0.1344 (proposed change)
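
Since the sigmoid's derivative is S'(z) = S(z) * (1 - S(z)), this step can be sketched as:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def sigmoid_prime(z):
        # Derivative of the sigmoid: S'(z) = S(z) * (1 - S(z))
        s = sigmoid(z)
        return s * (1.0 - s)

    margin_of_error = 0.0 - 0.77                        # target - calculated
    delta_output_sum = sigmoid_prime(1.235) * margin_of_error
    print(delta_output_sum)                             # approx. -0.1344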
Back Propagation

hidden result 1 = 0.73
hidden result 2 = 0.79
hidden result 3 = 0.69

Delta weights = delta output sum * hidden layer results
Delta weights = -0.1344 * [0.73, 0.79, 0.69]
Delta weights = [-0.0983, -0.1056, -0.0941]

Hence:
old w7 = 0.3 -> new w7 = 0.202
old w8 = 0.5 -> new w8 = 0.394
old w9 = 0.9 -> new w9 = 0.806
Back Propagation

To determine the change in weights between the input and hidden layers, we perform similar calculations.

Delta hidden sum = delta output sum * hidden-to-output weights * S'(hidden sum)

Delta hidden sum = -0.1344 * [0.3, 0.5, 0.9] * S'([1, 1.3, 0.8])
Delta hidden sum = [-0.0403, -0.0672, -0.1209] * [0.1966, 0.1683, 0.2139]
Delta hidden sum = [-0.0079, -0.0113, -0.0259]
Back Propagation

Then multiply the results with the input data.

input 1 = 1
input 2 = 1

Delta weights = delta hidden sum * inputs
Delta weights = [-0.0079, -0.0113, -0.0259] * [1, 1]
Delta weights = [-0.0079, -0.0113, -0.0259, -0.0079, -0.0113, -0.0259]

old w1 = 0.8 -> new w1 = 0.7921
old w2 = 0.4 -> new w2 = 0.3887
old w3 = 0.3 -> new w3 = 0.2741
old w4 = 0.2 -> new w4 = 0.1921
old w5 = 0.9 -> new w5 = 0.8887
old w6 = 0.5 -> new w6 = 0.4741
Back Propagation

New weights:

old          new
------------------------
w1: 0.8      w1: 0.7921
w2: 0.4      w2: 0.3887
w3: 0.3      w3: 0.2741
w4: 0.2      w4: 0.1921
w5: 0.9      w5: 0.8887
w6: 0.5      w6: 0.4741
w7: 0.3      w7: 0.2020
w8: 0.5      w8: 0.3940
w9: 0.9      w9: 0.8060
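
The full backward pass can be sketched end-to-end in NumPy (assumptions: the update rule is w += delta, as the slides apply it, and all names are illustrative; working at full precision gives values close to, but not identical with, the slides' rounded numbers):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def sigmoid_prime(z):
        s = sigmoid(z)
        return s * (1.0 - s)

    x = np.array([1.0, 1.0])
    target = 0.0
    W1 = np.array([[0.8, 0.4, 0.3],
                   [0.2, 0.9, 0.5]])    # w1..w6
    W2 = np.array([0.3, 0.5, 0.9])      # w7..w9

    # Forward pass
    hidden_sum = x @ W1
    hidden_out = sigmoid(hidden_sum)
    output_sum = hidden_out @ W2

    # Backward pass
    delta_output_sum = sigmoid_prime(output_sum) * (target - sigmoid(output_sum))
    delta_W2 = delta_output_sum * hidden_out              # changes for w7..w9
    delta_hidden_sum = delta_output_sum * W2 * sigmoid_prime(hidden_sum)
    delta_W1 = np.outer(x, delta_hidden_sum)              # changes for w1..w6

    W1 += delta_W1
    W2 += delta_W2
    print(W1)   # approx. [[0.792, 0.389, 0.274], [0.192, 0.889, 0.474]]
    print(W2)   # approx. [0.201, 0.394, 0.807], close to the slides' rounded values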
Types of ANN – Recurrent Neural Networks

+ Recurrent neural networks (RNNs), as the name suggests, involve the recurrence of operations in the form of loops (see the sketch after this list). They are much more complicated than feedforward networks and can perform more complex tasks than basic image recognition.
+ While in feedforward neural networks connections only lead from one neuron to neurons in subsequent layers without any feedback, recurrent neural networks allow connections to lead back to neurons in the same layer, allowing for a broader range of operations.
+ One of the limitations of RNNs is that they are difficult to train and have a very short-term memory, which limits their functionality.
+ To overcome the memory limitation, a newer form of RNN, known as the Long Short-Term Memory (LSTM) network, is used. LSTMs extend the memory of RNNs to enable them to perform tasks involving longer-term memory.
+ The main application areas for RNNs include natural language processing problems such as speech and text recognition, text prediction, and natural language generation.
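
A minimal sketch of that recurrence (NumPy assumed; a single vanilla RNN cell with toy sizes and random values, purely illustrative):

    import numpy as np

    rng = np.random.default_rng(0)
    input_size, hidden_size = 3, 4
    W_xh = rng.standard_normal((input_size, hidden_size)) * 0.1   # input -> hidden
    W_hh = rng.standard_normal((hidden_size, hidden_size)) * 0.1  # hidden -> hidden (the loop)
    b = np.zeros(hidden_size)

    def rnn_step(x_t, h_prev):
        """One step of a vanilla RNN: the previous hidden state feeds back in."""
        return np.tanh(x_t @ W_xh + h_prev @ W_hh + b)

    h = np.zeros(hidden_size)
    for x_t in rng.standard_normal((5, input_size)):   # a sequence of 5 inputs
        h = rnn_step(x_t, h)                           # the state carries short-term memory
    print(h)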
Recurrent vs Feedforward Neural Networks

Source: https://machine-learning.paperspace.com/wiki/recurrent-neural-network-rnn

Source: https://towardsdatascience.com/implementation-of-rnn-lstm-and-gru-a4250bf6c090
Types of ANN – Convolutional Neural Networks

+ Convolutional neural networks (CNNs) are commonly associated with computer vision applications. Their architecture is specifically suited for performing complex visual analysis.
+ The convolutional neural network architecture is defined by a three-dimensional arrangement of neurons, instead of the standard two-dimensional array.
+ The first layer in such neural networks is called a convolutional layer.
+ Each neuron in the convolutional layer only processes the information from a small part of the visual field.
+ The convolutional layers are followed by rectified linear units, or ReLU, which enable the CNN to handle complicated information.
+ CNNs are mainly used in object recognition applications like machine vision and in self-driving vehicles.
Convolutional Neural Networks

Source: https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53

Source: https://www.simplilearn.com/tutorials/deep-learning-tutorial/convolutional-neural-network
Four Important Layers in Convolutional Neural Network

+ Convolution layer
• This is the first step in the process of extracting valuable features from an image.
• A convolution layer has several filters that perform the convolution operation. Every image is considered as a matrix of pixel values.
+ ReLU layer
• ReLU stands for rectified linear unit.
• Once the feature maps are extracted, the next step is to move them to a ReLU layer.
• ReLU performs an element-wise operation and sets all the negative pixels to 0.
• It introduces non-linearity to the network, and the generated output is a rectified feature map.
+ Pooling layer
• Pooling is a down-sampling operation that reduces the dimensionality of the feature map.
• The rectified feature map now goes through a pooling layer to generate a pooled feature map.
+ Fully connected layer
• The next step in the process is called flattening.
• Flattening is used to convert all the resultant 2-dimensional arrays from pooled feature maps into a single long continuous linear vector.
• The flattened matrix is fed as input to the fully connected layer to classify the image (see the sketch after this list).
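
As an illustration only (NumPy, toy sizes, random values; a real CNN learns its filters), the four layers chained together might look like:

    import numpy as np

    rng = np.random.default_rng(0)
    image = rng.random((6, 6))            # toy grayscale image as a matrix of pixel values
    kernel = rng.random((3, 3)) - 0.5     # one convolution filter

    # Convolution layer: slide the filter over the image to extract a feature map
    feature_map = np.array([[np.sum(image[i:i+3, j:j+3] * kernel)
                             for j in range(4)] for i in range(4)])

    # ReLU layer: element-wise, set all negative values to 0
    rectified = np.maximum(feature_map, 0)

    # Pooling layer: 2x2 max pooling down-samples the rectified feature map
    pooled = rectified.reshape(2, 2, 2, 2).max(axis=(1, 3))

    # Flattening + fully connected layer: one linear unit classifies the image
    flat = pooled.flatten()
    w, b = rng.random(flat.size), 0.1
    print(flat @ w + b)                   # the classification score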
How do ANNs work?

+ Each node, or artificial neuron, connects to another and has an associated weight and threshold. With bias b:

f(x) = 1 if Σ_j (x_j * w_j) + b ≥ 0
f(x) = 0 if Σ_j (x_j * w_j) + b < 0

Source: https://www.freecodecamp.org/news/deep-learning-neural-networks-explained-in-plain-english/
Activation Functions

+ Linear
• f(z) = z

+ Non-linear
• Rectified Linear Unit (ReLU)
  - ReLU ensures that the output is not negative: if z is greater than zero, the output remains z; if z is negative, the output is zero.
  - f(z) = max(0, z)
• Tanh
  - Hyperbolic tangent of z.
  - f(z) = tanh(z)
• Sigmoid
  - f(z) = 1 / (1 + e^(-z))
  - 1. Negate z by multiplying by -1.
  - 2. Take the exponential (e to the power) of the output in No. 1.
  - 3. Add 1 to the output in No. 2.
  - 4. Divide 1 by the output in No. 3.
• Softmax
  - Used in the output layer.
  - Calculates the probability distribution of the event over n events.
  - f(z_j) = e^(z_j) / Σ_{k=1..K} e^(z_k), for j = 1 … K
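
These functions in NumPy, as a sketch:

    import numpy as np

    def linear(z):
        return z

    def relu(z):
        return np.maximum(0, z)           # negative values become 0

    def tanh(z):
        return np.tanh(z)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))   # steps 1-4 from the slide in one line

    def softmax(z):
        e = np.exp(z - np.max(z))         # shift for numerical stability
        return e / e.sum()                # probabilities that sum to 1

    z = np.array([-1.0, 0.0, 2.0])
    print(relu(z), sigmoid(z), softmax(z))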
Weight Matrix

Source: https://www.researchgate.net/publication/292077006_Power-Efficient_Accelerator_Design_for_Neural_Networks_Using_Computation_Reuse/figures?lo=1
Example

Activation function - tanh
Activation function - Softmax

Since the second output node value is greater than the first, the computed output would be interpreted as the categorical value corresponding to (0, 1).

Source: https://visualstudiomagazine.com/articles/2014/11/01/use-python-with-your-neural-networks.aspx
Example – Abdominal Pain Prediction
Applications of ANN
+ Image recognition
+ Speech recognition
+ Facial recognition
+ Machine translation
+ Medical diagnosis
+ Stock market prediction
+ Fraud detection
+ Many more …
End of Lecture
