Lecture 7 - ANN

Uploaded by

Đoàn Ngoc Anh


Artificial Neural Network (ANN)

Outline:

1. Overview of ANN

2. Components of ANN

3. ANN training (forward and backward propagation)

4. ANN characteristics

5. ANN design
What is an ANN?

§ Computing system inspired by the biological neural network

§ A collection of connected units (or nodes) called artificial neurons

Ø Each node ~ biological neuron

Ø Each connection ~ synapse in brain

Biological neural networks

§ Our brain consists of ~100 billion neurons

§ A neuron may connect to as many as 100,000 other neurons

§ Signals “move” between neurons electrochemically

§ Biological neural network: an interconnected network of billions of neurons with trillions of interconnections between them
Structure of a biological neuron

§ Dendrite: receives signals from other neurons

§ Cell body: sums all the incoming signals to generate input

§ Axon: transfers signals to other neurons when the neuron “fires” (the sum reaches a threshold)

§ Synapses: points of interconnection between one neuron and other neurons
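McCulloch and Pitts neuron model (1943)

§ A mathematical computing paradigm that models the human neuron

§ Inputs $x_1, x_2, \dots, x_N$ are multiplied by weights $w_1, w_2, \dots, w_N$ and summed into the net input $u$:

$$u = \sum_{j=1}^{N} x_j w_j$$

§ The output is $y = f(u)$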
Perceptron neuron model

§ An enhanced version of the McCulloch-Pitts model:

Ø Incorporates the Hebbian learning rule for adjusting weights

Ø Adds a bias $b = \theta$

§ Net input:

$$u = \sum_{j=1}^{N} x_j w_j + \theta$$

§ Output:

$$y = f(u) = \begin{cases} 1, & u \ge 0 \\ 0, & u < 0 \end{cases}$$
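The perceptron equations above can be sketched in a few lines of Python. This is a minimal illustration only: the weights `w` and bias `theta` are hypothetical values chosen by hand so that the neuron computes a logical AND; they are not taken from the slides.

```python
import numpy as np

def perceptron(x, w, theta):
    """Return 1 if u = sum_j x_j * w_j + theta is >= 0, otherwise 0."""
    u = np.dot(x, w) + theta
    return 1 if u >= 0 else 0

# Hypothetical hand-picked parameters that realise a logical AND.
w = np.array([1.0, 1.0])
theta = -1.5

inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
and_outputs = [perceptron(np.array(x), w, theta) for x in inputs]
print(and_outputs)  # [0, 0, 0, 1]
```

Only the input (1, 1) pushes the net input above zero (u = 0.5), so only that input makes the neuron fire.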
General neuron model

[Figure: inputs $x_1, \dots, x_N$ pass through weights into the cell body, which applies the net function $u$ and the activation function $f(u)$ to produce the output $y$]

§ Net function:

$$u = \sum_{j=1}^{N} x_j w_j + \theta$$

§ Activation function: $y = f(u)$

Ø Ex: $y = f(u) = \dfrac{1}{1 + e^{-u}}$

§ $\{w_j;\ 1 \le j \le N\}$: synaptic weights

§ $\theta$: threshold
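As a small worked example of the general neuron model, the sketch below combines the linear net function with the sigmoid activation given as the example above. The input and weight values are made up for illustration.

```python
import math

def neuron(x, w, theta):
    """Linear net function followed by the logistic sigmoid activation."""
    u = sum(xj * wj for xj, wj in zip(x, w)) + theta  # net function
    return 1.0 / (1.0 + math.exp(-u))                 # activation function

# Illustrative values: u = 1.0*0.5 + 2.0*(-0.25) + 0.0 = 0, so y = 0.5.
y_zero = neuron([1.0, 2.0], [0.5, -0.25], 0.0)
print(y_zero)
```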
Popular net functions
Popular activation functions
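The tables that belonged to these two headings are not preserved in this text. As a rough stand-in, the sketch below implements a few activation functions that commonly appear in such lists. The step function and the logistic sigmoid appear elsewhere in these slides; including tanh and ReLU here is an assumption about what the original slide showed.

```python
import numpy as np

def step(u):
    """Threshold activation used by the perceptron."""
    return np.where(u >= 0, 1.0, 0.0)

def sigmoid(u):
    """Logistic sigmoid, output in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-u))

def tanh(u):
    """Hyperbolic tangent, output in (-1, 1)."""
    return np.tanh(u)

def relu(u):
    """Rectified linear unit."""
    return np.maximum(0.0, u)

u = np.array([-2.0, 0.0, 2.0])
for f in (step, sigmoid, tanh, relu):
    print(f.__name__, f(u))
```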
Multilayer perceptron model (MLP)

§ Layered network of perceptron neurons

§ Fully connected
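A forward pass through a fully connected MLP can be sketched as repeated application of the neuron model, one layer at a time. The layer sizes (3 inputs, 4 hidden neurons, 2 outputs), the random weights, and the choice of a sigmoid in every layer are assumptions made for illustration.

```python
import numpy as np

def sigmoid(u):
    return 1.0 / (1.0 + np.exp(-u))

def mlp_forward(x, layers):
    """Propagate x through a list of (W, b) pairs, one pair per layer."""
    a = x
    for W, b in layers:
        a = sigmoid(W @ a + b)  # every neuron sees every output of the previous layer
    return a

rng = np.random.default_rng(0)
layers = [
    (rng.normal(size=(4, 3)), np.zeros(4)),  # hidden layer: 3 inputs -> 4 neurons
    (rng.normal(size=(2, 4)), np.zeros(2)),  # output layer: 4 neurons -> 2 outputs
]
y = mlp_forward(np.array([0.5, -1.0, 2.0]), layers)
print(y.shape)
```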
ANN training process?

§ Calibrating all of the weights by repeating forward-backward propagation steps until the output is predicted accurately

§ Forward propagation:

Ø Applying a set of weights to the input data

Ø Calculating the output

§ Backward propagation:

Ø Measuring the error of the output (difference between desired output and actual output)

Ø Adjusting the weights to decrease the error in the next step

ANN training example – epoch 1, epoch 2, ..., epoch n

DESIRED VALUE   ACTUAL OUTPUT   ACTUAL OUTPUT   ACTUAL OUTPUT
                (epoch 1)       (epoch 2)       (epoch n)
0               0.21            0.194           0.119
0               0.156           0.143           0.056
1               0.78            0.802           0.884
1               0.83            0.895           0.926
Error back propagation learning

§ Step 1: initialization

§ Step 2: output calculation

§ Step 3: error calculation

$$E = \sum_{k=1}^{K} [e(k)]^2 = \sum_{k=1}^{K} [d(k) - z(k)]^2$$

d – desired output values (target)

z – actual outputs

§ Step 4: weight updating, then go back to step 2 until the stopping condition is satisfied
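To make step 3 concrete, the sketch below applies the squared-error formula to the desired values and actual outputs listed in the training example for epoch 1, epoch 2 and epoch n. The helper name `sum_squared_error` is introduced here for illustration.

```python
desired = [0, 0, 1, 1]
actual_by_epoch = {
    "epoch 1": [0.21, 0.156, 0.78, 0.83],
    "epoch 2": [0.194, 0.143, 0.802, 0.895],
    "epoch n": [0.119, 0.056, 0.884, 0.926],
}

def sum_squared_error(d, z):
    """E = sum_k [d(k) - z(k)]^2."""
    return sum((dk - zk) ** 2 for dk, zk in zip(d, z))

errors = {name: sum_squared_error(desired, z) for name, z in actual_by_epoch.items()}
for name, e in errors.items():
    print(f"{name}: E = {e:.4f}")
```

The error shrinks from epoch to epoch, which is the behaviour the repeated forward-backward steps are meant to produce.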
Weight updating

§ To achieve the minimum error

W – weight
E – error
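The update formula itself is not preserved in this text. A common choice, assumed here, is gradient descent: $W \leftarrow W - \eta \, \partial E / \partial W$, where $\eta$ is the learning rate. The sketch below applies that rule to a single linear neuron with one weight; the data, the "true" weight of 2.0, and the value of `eta` are hypothetical.

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
d = 2.0 * x          # desired outputs generated from a hypothetical true weight of 2.0

w = 0.0              # initial weight
eta = 0.01           # learning rate (assumed value)

for _ in range(100):
    z = w * x                            # forward pass: actual outputs
    grad = -2.0 * np.sum((d - z) * x)    # dE/dw for E = sum (d - z)^2
    w = w - eta * grad                   # move against the gradient

print(w)
```

Each step moves `w` in the direction that reduces E, so `w` approaches the value that minimizes the error.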
Learning rate choosing
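The effect of the learning rate can be illustrated on a toy error surface. The sketch below assumes $E(w) = w^2$ (so $dE/dw = 2w$) and runs the gradient descent update with three hypothetical learning rates.

```python
def gradient_descent(eta, steps=20, w0=1.0):
    """Minimise E(w) = w^2 starting from w0; returns the final weight."""
    w = w0
    for _ in range(steps):
        w = w - eta * 2.0 * w  # dE/dw = 2w
    return w

results = {eta: gradient_descent(eta) for eta in (0.01, 0.4, 1.1)}
for eta, w in results.items():
    print(f"eta = {eta}: w after 20 steps = {w:.3g}")
```

On this surface a very small rate is still far from the minimum at w = 0 after 20 steps, a moderate rate converges quickly, and a rate above 1 makes the weight oscillate with growing magnitude.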
Stopping conditions

§ Average squared error change: the absolute rate of change in the average squared error per epoch is sufficiently small (in the range 0.01 to 0.1).

§ Generalization-based criterion: after each epoch the ANN is tested for generalization. If the generalization performance is adequate, then stop.

§ Good generalization: the I/O mapping is nearly correct for new data
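The first stopping condition can be sketched as a check on the per-epoch history of the average squared error. The function name `should_stop` and the tolerance `tol` are hypothetical; the tolerance would be chosen per problem.

```python
def should_stop(mse_history, tol=0.01):
    """Stop when the absolute change in average squared error between
    the last two epochs falls below tol."""
    if len(mse_history) < 2:
        return False
    return abs(mse_history[-1] - mse_history[-2]) < tol

print(should_stop([0.50, 0.30]))    # large change: keep training
print(should_stop([0.300, 0.295]))  # small change: stop
```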
ANN characteristics

§ Parameters, hyperparameters

§ Shallow NN, deep NN

§ Underfitting, overfitting

§ Generalization
ANN parameters

§ Parameters: values that change while the ANN is trained

Ø Weights

Ø Biases

§ Hyperparameters: fixed settings of the ANN configuration, defined before training

Ø Learning rate

Ø Number of hidden layers

Ø Net function

Ø Activation function

Ø Number of examples in the training dataset…

Shallow NN and deep NN

§ Shallow NN:

Ø One hidden layer

Ø Used for simple problems

§ Deep NN:

Ø Many hidden layers

Ø Used for complex problems

Ø Each layer plays a specific role in solving the overall problem

Model fitting

§ Underfitting (high bias):

Ø Model is too simple for the data

Ø Training error is large, and validation/test error is large too.

Ø The model's predictions may be consistent, but they are systematically off because the initial assumption about the data is incorrect
Model fitting (cont)

§ Overfitting (high variance):

Ø Model is too complex for the data

Ø Model memorizes the training data rather than generalizing from it → error on the training set is small, error on the testing set is large
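Underfitting and overfitting can be illustrated without a neural network. The sketch below swaps in polynomial regression as a stand-in model: polynomials of degree 1, 3 and 9 are fitted to 10 noisy samples of a sine curve. The data, noise level and degrees are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
x_train = np.linspace(0.0, 1.0, 10)
y_train = np.sin(2 * np.pi * x_train) + rng.normal(scale=0.2, size=x_train.size)
x_test = np.linspace(0.05, 0.95, 10)
y_test = np.sin(2 * np.pi * x_test) + rng.normal(scale=0.2, size=x_test.size)

def mse(coeffs, x, y):
    """Mean squared error of the polynomial with the given coefficients."""
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

train_mse, test_mse = {}, {}
for degree in (1, 3, 9):
    coeffs = np.polyfit(x_train, y_train, degree)  # least-squares fit
    train_mse[degree] = mse(coeffs, x_train, y_train)
    test_mse[degree] = mse(coeffs, x_test, y_test)
    print(degree, train_mse[degree], test_mse[degree])
```

The training error always falls as the degree grows, and the degree-9 polynomial passes through all 10 training points. Comparing the test errors shows whether that extra flexibility actually generalizes.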
Model fitting (cont)

[Figure: example fits illustrating underfitting (bad model), good generalization (good model), and overfitting]

How to avoid underfitting?

§ Try a more complex model

Ø More powerful model with a larger number of parameters

Ø More layers

Ø More neurons per layer

§ Try a larger number of features

Ø Get additional features

Ø Feature engineering

§ Data cleaning, cross-validation (hold-out, K-fold, LOOCV)

How to avoid overfitting?

§ Try a simpler model

Ø Less powerful model with fewer parameters

Ø Fewer layers, fewer neurons per layer

§ Try a smaller number of features

Ø Remove unnecessary features

Ø Feature selection
How to avoid overfitting? (cont)

§ Enlarge data

Ø Data cleaning

Ø Cross validation (hold-out, K-fold, LOOCV)

Ø Data augmentation (rotate, flip, scale,…)

§ More regularization

Ø Early stopping

Ø Dropout

Ø L1, L2 regularization
Early stopping
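The figure for this slide is not preserved. One common form of early stopping, assumed here, monitors the validation loss after each epoch and stops once it has failed to improve for a fixed number of epochs (the `patience`). The validation losses below are made-up numbers.

```python
def early_stopping_epoch(val_losses, patience=2):
    """Return the index of the epoch with the best validation loss,
    stopping after `patience` consecutive epochs without improvement."""
    best_loss = float("inf")
    best_epoch = -1
    waited = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                break  # validation loss keeps rising: stop training
    return best_epoch

val_losses = [0.90, 0.70, 0.55, 0.50, 0.52, 0.56, 0.61]
best_epoch = early_stopping_epoch(val_losses)
print(best_epoch)  # 3
```

In practice the weights saved at the best epoch are the ones kept for the final model.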
Generalization

§ Good generalization: the I/O mapping is nearly correct for new data


§ Factors that influence generalization:

Ø Training set size

Ø ANN architecture

Ø Problem complexity

§ How to improve the generalization?

Ø Collect more data for training

Ø Train several networks then select the best one

Ø Avoid overfitting, avoid underfitting

ANN design process

§ Data collection and representation

§ Set up network topology

§ Create network parameters

§ Initialize weight and bias values

§ Training

§ Validation → redesign or use


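Data representation

One-hot encoding

$$d_{k,j} = \begin{cases} 1, & x_j \in C_k \\ 0, & x_j \notin C_k \end{cases}$$

The desired output vector for input $x_j$ has a 1 in the $k$th element and 0 elsewhere:

$$[0, \dots, 1, \dots, 0]^T$$

$C_k$ – class $k$

$x_j$ – input $j$

$d_{k,j}$ – desired output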
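One-hot encoding can be sketched with NumPy as follows. In this sketch each row of the result is the desired output vector for one input $x_j$, and class labels are assumed to be integers starting at 0; the function name `one_hot` is introduced for illustration.

```python
import numpy as np

def one_hot(labels, num_classes):
    """Return a (num_examples, num_classes) matrix with a single 1 per row,
    placed in the column of that example's class."""
    d = np.zeros((len(labels), num_classes))
    d[np.arange(len(labels)), labels] = 1.0
    return d

D = one_hot([2, 0, 1], num_classes=3)
print(D)
```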

Network topology

§ The way to connect neurons to form a network

§ Topology consists of:

Ø Neural framework: described by the number of neuron layers and the number of neurons per layer

Ø Interconnection structure: different kinds of connections such as interlayer connection, intralayer connection, self connection, sublayer connection
Types of ANN structure

§ Feedforward neural network: may have no hidden layer, one hidden layer, or multiple hidden layers

§ Radial basis function neural network

§ Self organizing neural network

§ Recurrent neural network

§ Convolutional neural network

§ Modular neural network


Network parameters

§ Learning rate

§ Activation function

§ Net function

§ Data preprocessing

§ Number of examples in the training data set

Heuristic 1

§ Maximization of information content: every training example presented to the backpropagation algorithm must maximize the information content.

Ø Use of an example that results in the largest training error.

Ø Use of an example that is radically different from all those previously used.
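Heuristic 2

§ Activation function: the network learns faster with antisymmetric functions than with nonsymmetric functions.

Ø Antisymmetric function: $f(-u) = -f(u)$, e.g. the hyperbolic tangent

Ø Nonsymmetric function: e.g. the logistic sigmoid

[Figure: an antisymmetric function and a nonsymmetric function]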

Heuristic 3

§ How many training examples?

Rule of thumb: the number of training examples should be at least five to ten times the number of weights in the network
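The rule of thumb can be turned into a quick estimate. The sketch below counts the weights of a fully connected network and multiplies by five and by ten. The layer sizes (7 inputs, 10 hidden neurons, 5 outputs) are hypothetical, and counting each bias as one extra weight per neuron is an assumption.

```python
def count_weights(layer_sizes):
    """Number of weights in a fully connected network,
    counting one bias per neuron as an extra weight."""
    return sum((n_in + 1) * n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))

n_weights = count_weights([7, 10, 5])   # (7+1)*10 + (10+1)*5 = 135
min_examples = (5 * n_weights, 10 * n_weights)
print(n_weights, min_examples)  # 135 (675, 1350)
```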


Initialization

§ Initializing weights and biases before the training process

§ Heuristics:

Ø Weights should be initialized to random, nonzero values

Ø Biases should be initialized to zero
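A minimal sketch of these two heuristics for one layer is shown below. Drawing the weights uniformly from a small interval around zero is one possible choice; the `scale` value of 0.1 and the layer sizes are assumptions, not values from the slides.

```python
import numpy as np

def init_layer(n_in, n_out, rng, scale=0.1):
    """Random weights in [-scale, scale) and zero biases for one layer."""
    W = rng.uniform(-scale, scale, size=(n_out, n_in))
    b = np.zeros(n_out)
    return W, b

rng = np.random.default_rng(42)
W, b = init_layer(n_in=7, n_out=10, rng=rng)
print(W.shape, b.shape)
```

Random weights break the symmetry between neurons in the same layer; if all weights started at the same value, those neurons would receive identical updates.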


Learning modes

§ Online learning: learning as the data comes in (one example at a time)

Ø Sequential mode or stochastic mode

§ Offline learning: learning over the entire dataset

Ø Batch mode: updating parameters after consuming the whole batch
ANN sequential training mode

§ Present the first input/output pair, x(1)-y(1)

§ Perform a sequence of forward and backward computations

§ Update the weights

§ Do the same for x(2)-y(2), …, x(N)-y(N)

§ The learning process continues on an epoch-by-epoch basis until the stopping condition is satisfied
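The sequential mode can be sketched as a nested loop: an outer loop over epochs and an inner loop that updates the weights after every single example. For brevity the sketch uses one linear neuron without bias instead of a full network; the data, the target weights (1.0, -2.0), the learning rate and the stopping threshold are all hypothetical.

```python
import numpy as np

X = np.array([[0.0, 1.0], [1.0, 0.0], [1.0, 1.0], [2.0, 1.0]])
y = X @ np.array([1.0, -2.0])   # desired outputs from hypothetical target weights

w = np.zeros(2)
eta = 0.1                        # learning rate (assumed value)

for epoch in range(1000):
    for x_n, y_n in zip(X, y):            # present one example at a time
        z = w @ x_n                       # forward computation
        w = w + eta * (y_n - z) * x_n     # backward computation and weight update
    if np.mean((X @ w - y) ** 2) < 1e-12:  # stopping condition checked per epoch
        break

print(w)
```

In batch mode the inner loop would instead accumulate the updates over all N examples and apply them once per epoch.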

Validation methods

§ Hold-out

§ K-fold cross-validation

§ Leave-one-out cross-validation (LOOCV)
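The index bookkeeping behind K-fold cross-validation can be sketched in plain NumPy. The generator name `k_fold_indices` and the seed are hypothetical; setting k equal to the number of samples gives LOOCV, and a single train/validation split corresponds to hold-out.

```python
import numpy as np

def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, validation_indices) for each of the k folds."""
    rng = np.random.default_rng(seed)
    indices = rng.permutation(n_samples)   # shuffle once
    folds = np.array_split(indices, k)     # k nearly equal parts
    for i in range(k):
        val = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train, val

splits = list(k_fold_indices(n_samples=10, k=5))
for train, val in splits:
    print("train:", sorted(train.tolist()), "validation:", sorted(val.tolist()))
```

Every sample is used for validation exactly once, and the k validation scores are usually averaged.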
Performance

§ Confusion matrix → precision, recall, accuracy, F1-score, ROC, AUC, IoU,…

§ Algorithm complexity, cost,…
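For a two-class problem, several of these metrics follow directly from the four cells of the confusion matrix. The counts in the sketch below are made-up numbers for illustration.

```python
def binary_metrics(tp, fp, fn, tn):
    """Precision, recall, accuracy and F1-score from confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall,
            "accuracy": accuracy, "f1": f1}

metrics = binary_metrics(tp=40, fp=10, fn=20, tn=30)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```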

Convolutional neural network (CNN)

CNN to classify handwritten digits

§ Convolution layer

§ Pooling layer

§ Classification
Convolutional neural network

Layer (type)                      Output Shape            Param #
cv0 (Conv2D)                      (None, 128, 128, 16)    448
max_pooling2d_11 (MaxPooling2D)   (None, 64, 64, 16)      0
cv1 (Conv2D)                      (None, 64, 64, 32)      4640
max_pooling2d_12 (MaxPooling2D)   (None, 32, 32, 32)      0
cv2 (Conv2D)                      (None, 32, 32, 64)      18496
max_pooling2d_13 (MaxPooling2D)   (None, 16, 16, 64)      0
cv3 (Conv2D)                      (None, 16, 16, 128)     73856
max_pooling2d_14 (MaxPooling2D)   (None, 8, 8, 128)       0
cv4 (Conv2D)                      (None, 8, 8, 256)       295168
max_pooling2d_15 (MaxPooling2D)   (None, 4, 4, 256)       0
cv5 (Conv2D)                      (None, 4, 4, 128)       32896
cv6 (Conv2D)                      (None, 4, 4, 64)        8256
cv7 (Conv2D)                      (None, 4, 4, 32)        2080
flatten_3 (Flatten)               (None, 512)             0
hiddenlayer1 (Dense)              (None, 512)             262656
hiddenlayer2 (Dense)              (None, 128)             65664
dense_3 (Dense)                   (None, 51)              6579
activation_3 (Activation)         (None, 51)              0
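The Param # column can be checked by hand. A convolution layer with a k×k kernel has (k·k·C_in + 1)·C_out parameters and a dense layer has (N_in + 1)·N_out. The kernel sizes and the 3-channel input in the sketch below are not stated in the summary; they are inferred here because they reproduce the listed counts (3×3 kernels for cv0 to cv4, 1×1 kernels for cv5 to cv7).

```python
def conv2d_params(kernel, c_in, c_out):
    """Weights of a kernel x kernel convolution plus one bias per filter."""
    return (kernel * kernel * c_in + 1) * c_out

def dense_params(n_in, n_out):
    """Weights of a fully connected layer plus one bias per neuron."""
    return (n_in + 1) * n_out

# (kernel size, input channels, output channels); kernel sizes are inferred.
conv_layers = [(3, 3, 16), (3, 16, 32), (3, 32, 64), (3, 64, 128),
               (3, 128, 256), (1, 256, 128), (1, 128, 64), (1, 64, 32)]
# Flatten gives 4 * 4 * 32 = 512 features feeding the dense layers.
dense_layers = [(512, 512), (512, 128), (128, 51)]

computed = ([conv2d_params(*layer) for layer in conv_layers]
            + [dense_params(*layer) for layer in dense_layers])
listed = [448, 4640, 18496, 73856, 295168, 32896, 8256, 2080,
          262656, 65664, 6579]

print(computed == listed, sum(computed))
```

Pooling, flatten and activation layers have no trainable parameters, which is why their Param # is 0.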
Deep learning crash course for beginners

https://www.youtube.com/watch?v=VyWAvY2CF9c

Course developed by Jason Dsouza

Duration: 1hr 30 minutes

Exercise 1

§ Students complete the exercise in the groups assigned for this course, following these steps:

Ø Use a self-collected leaf dataset (3 + 2) or (2 + 2)

Ø Train and test an ANN model (with 10 and 15 hidden-layer neurons, in turn), using Hu's moments as features and a learning rate of 𝜂 = 0.1. Evaluate the ANN model with 5-fold cross-validation. Comment on the results.

Ø Write a report.

Exercise 2

§ Students complete the exercise in the groups assigned for this course, following these steps:

Ø Use a self-collected leaf dataset (3 + 2) or (2 + 2)

Ø Train and test an ANN model (with 10 and 15 hidden-layer neurons, in turn), using HOG features with parameters of your own choice. Evaluate the ANN model with 5-fold cross-validation. Comment on the results.

Ø Write a report.

Example: Green vegetables

Diếp cá (fish herb), Rau má (cica), Lá lốt (piper lolot), Lá bạc hà (mint), Lá ngò (cilantro)
