Week 4
The Self-Organizing Map (SOM) was developed by the Finnish professor Teuvo Kohonen in the 1980s. This network is also known as a Topology Preserving Map. The name is used because the positions of the nodes vary at the start of the training procedure, and once the network has learned the given input patterns, the topology (the locations of the neural nodes) becomes fixed.
Each node is provided with a weight vector, which is simply the position of that node in the input space (the map). The job of training is to adjust these weight vectors so that the distance in the map is reduced; each weight vector moves towards the input. Thus, the map reduces a higher-dimensional input space to two dimensions; this is the dimensionality reduction process. After training, the SOM can classify an input by selecting the nearest node, i.e. the node whose weight vector has the smallest distance to the input vector.
This transformation is performed in an orderly manner. SOM uses only a two-dimensional discretized input space, known as the map, for its operation. Instead of error-correction learning, SOM uses competitive (winner-takes-all) learning.
Step 1: Initialize the Weights Wij. Initialize the learning rate and topological neighbourhood
parameters
Step 2: While stop condition is false Do steps 3 to 9
Step 3: For each input Vector x, do steps 4 to 6
Step 4: For each unit j calculate D(j) = Σi (Wij – xi)²
Step 5: Find the index J for which D(J) is minimum
Step 6: For all units j within a specified neighbourhood of J, and for all i:
Wij(new) = Wij(old) + α [xi – Wij(old)]
Step 7: Update Learning Rate
Step 8: Reduce the radius of Topological Neighbourhood at specific time periods
Step 9: Test for stop condition
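The steps above can be summarized in a short sketch. The following Python/NumPy fragment is a minimal illustration only; the grid size, decay factors and stopping rule are assumptions made for the example, not values prescribed by the algorithm.

import numpy as np

def train_som(inputs, grid_h=10, grid_w=10, epochs=100, alpha=0.5, radius=3.0):
    # Step 1: initialize the weights Wij, the learning rate and the neighbourhood radius
    n_features = inputs.shape[1]
    weights = np.random.rand(grid_h, grid_w, n_features)
    rows, cols = np.indices((grid_h, grid_w))
    for epoch in range(epochs):                # Steps 2 and 9: stop after a fixed number of epochs (assumed rule)
        for x in inputs:                       # Step 3: for each input vector x
            # Step 4: D(j) = sum_i (Wij - xi)^2 for every map unit j
            D = ((weights - x) ** 2).sum(axis=2)
            # Step 5: index of the unit with minimum D(j) (the winner)
            win_r, win_c = np.unravel_index(D.argmin(), D.shape)
            # Step 6: update every unit inside the winner's neighbourhood
            hood = (np.abs(rows - win_r) <= radius) & (np.abs(cols - win_c) <= radius)
            weights[hood] += alpha * (x - weights[hood])
        alpha *= 0.99                          # Step 7: update (reduce) the learning rate
        radius = max(radius * 0.95, 1.0)       # Step 8: shrink the topological neighbourhood
    return weights

After training, an input is classified by repeating Steps 4 and 5 alone: the node with the smallest distance (the closest weight vector) gives its cluster.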
• Initialization
The weight vectors, the learning rate and the topological neighbourhood parameters are set to their starting values, as in Step 1 of the algorithm above.
• Competition
For each given input pattern, the neurons calculate a discriminant function (here the Euclidean distance function is used). This discriminant function acts as the basis for the competition among the neurons. The neuron with the smallest distance value is selected as the winning neuron (winner-takes-all law).
• Cooperation
The winning neuron determines the spatial locations of the excited neurons in its neighbourhood (the topological map). Thus, cooperation between the neurons is established by the winning neuron in that rearranged neighbourhood.
• Adaptation
The winning neuron, by adjusting its weight values, tries to minimize the discriminant function (distance value) between itself and the inputs. When similar inputs are provided, the response of the winning neuron is further enhanced.
Merits
• Easy to interpret
• Dimensionality Reduction
• Capable of handling different types of classification problems
• Can cluster large, complex input sets
• SOM training time is short
• Simple algorithm
• Easy to implement
Demerits
It does not build a generative model for the data, i.e., the model does not learn how the data are generated.
It does not handle categorical data well, and mixed-type data even less so.
Preparing the model is slow, and it is hard to train against slowly evolving data.
Applications
• Character Recognition
• Speech Recognition
• Texture Recognition
• Image Clustering
• Data Clustering
• Classification problems
• Dimensionality reduction applications
• Seismic analysis
• Failure Analysis etc
2.2.1 Algorithm
Step 1: Initialize the weight vectors using ‘m’ training vectors, where ‘m’ is the number of different classes/clusters. Set the learning rate α to a small value (near zero)
Step 2: While the stop condition is false, do steps 3 to 6
Step 3: For each input training vector X, do steps 4 to 5
Step 4: Find J such that D(J) is minimum
Step 5: Update the weights of the Jth neural unit as given below
If T = CJ then
WJ(new) = WJ(old) + α [x – WJ(old)]  [move the weight vector W towards the input X]
If T ≠ CJ then
WJ(new) = WJ(old) – α [x – WJ(old)]  [move the weight vector W away from the input X]
Step 6: Reduce Learning Rate α
Step 7: Test for the stop condition (either a fixed number of iterations has been reached or the learning rate α has reached a very small value)
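A minimal sketch of this update rule is given below. The decay factor, epoch count and function names are assumptions made for illustration; the weight vectors and their class labels (Cj) are supplied by the caller as described in Step 1.

import numpy as np

def train_lvq(X, T, init_weights, weight_classes, alpha=0.1, epochs=50, alpha_min=1e-4):
    # Step 1: weights initialized to 'm' training vectors; weight_classes[j] is the class Cj of unit j
    W = init_weights.copy()
    for _ in range(epochs):                      # Step 2: repeat until the stop condition is met
        for x, t in zip(X, T):                   # Step 3: each training vector X with target class T
            # Step 4: J = index of the weight vector with minimum squared distance D(J)
            J = ((W - x) ** 2).sum(axis=1).argmin()
            # Step 5: move towards x if the target class matches CJ, away from x otherwise
            if t == weight_classes[J]:
                W[J] += alpha * (x - W[J])
            else:
                W[J] -= alpha * (x - W[J])
        alpha *= 0.95                            # Step 6: reduce the learning rate
        if alpha < alpha_min:                    # Step 7: stop when alpha is very small
            break
    return W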
2.2.2 Merits
Demerits
o All input feature vectors are connected to the middle (hidden) layer
o Hidden nodes are connected into groups, and each group denotes a particular class ‘K’
o Each node present in the hidden layer corresponds to a Gaussian function centred on its feature vector for that Kth class
o All of these Gaussian function outputs of a group/class are fed to the Kth Output unit
o Hence, we have only ‘K’ output units
o PNN is closely related to the Parzen-window PDF estimator (mixed Gaussian estimator)
o For any output node ‘K’, all Gaussian values (from the previous hidden layer) for that output class are summed up
o This summed value is scaled to a Probability Density Function (PDF)
o If class 1 contains ‘P’ feature vectors and class 2 contains ‘Q’ feature vectors, then P nodes are present in the hidden layer for class 1 and Q nodes for class 2
o The equation for the Gaussian function for any input is given below
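A standard form of the PNN Gaussian kernel, stated here as an assumption, is gi(x) = e^(–‖x – xi‖² ∕ 2σ²), where xi is the stored feature vector of hidden node i and σ is the spread (smoothing) parameter. A minimal sketch of the resulting classifier follows; the data layout and names are illustrative only.

import numpy as np

def pnn_classify(x, class_exemplars, sigma=1.0):
    # class_exemplars: list indexed by class K; each entry is an array of the stored
    # feature vectors, i.e. the hidden nodes belonging to that class
    scores = []
    for exemplars in class_exemplars:
        # one Gaussian per hidden node, centred on its stored feature vector
        d2 = ((exemplars - x) ** 2).sum(axis=1)
        g = np.exp(-d2 / (2.0 * sigma ** 2))
        # output node K: sum (and normalize) the Gaussian values of its class
        scores.append(g.sum() / len(exemplars))
    return int(np.argmax(scores))                # class with the largest PDF estimate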
2.3.1. Algorithm
Figure 3: General Probabilistic Neural Network Architecture Diagram
Cascade correlation addresses both issues, the slow rate of convergence and the need to fix the number of hidden nodes before training, by dynamically adding hidden units to the architecture, but only the minimum number necessary to achieve the specified error tolerance for the training set.
Furthermore, a two-step weight-training process ensures that only one layer of weights is being
trained at any time.
A cascade correlation net consists of input units, hidden units, and output units. Input
units are connected directly to output units with adjustable weighted connections.
Connections from inputs to a hidden unit are trained when the hidden unit is added to
the net and are then frozen. Connections from the hidden units to the output units are adjustable.
Cascade correlation starts with a minimal network, consisting only of the required input and
output units (and a bias input that is always equal to 1). This net is trained until no further
improvement is obtained; the error for each output unit is then computed (summed over all
training patterns).
Next, one hidden unit is added to the net in a two-step process. During the first step, a
candidate unit is connected to each of the input units, but is not connected to the output units.
The weights on the connections from the input units to the candidate unit are adjusted to
maximize the correlation between the candidate's output and the residual error at the output
units. The residual error is the difference between the target and the computed output,
multiplied by the derivative of the output unit's activation function, i.e., the quantity that would
be propagated back from the output units in the backpropagation algorithm. When this training
is completed, the weights are frozen and the candidate unit becomes a hidden unit in the net.
The second step in which the new unit is added to the net now commences. The new
hidden unit is connected to the output units, the weights on the connections being adjustable.
Now all connections to the output units are trained. (The connections from the input units are
trained again, and the new connections from the hidden unit are trained for the first time.)
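The quantity maximized in the first (candidate-training) step is the magnitude of the correlation between the candidate's output and the residual output errors. The small sketch below computes that quantity; the array shapes and function name are assumptions for illustration.

import numpy as np

def candidate_correlation(v, e):
    # v: candidate unit's output for every training pattern, shape (num_patterns,)
    # e: residual error at every output unit, shape (num_patterns, num_outputs)
    v_c = v - v.mean()               # centre the candidate's outputs
    e_c = e - e.mean(axis=0)         # centre each output unit's errors
    cov = v_c @ e_c                  # covariance with each output unit's error, shape (num_outputs,)
    return np.abs(cov).sum()         # value the candidate's input weights are trained to maximize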
Figure 4. Schematic Representation of Cascade Correlation Network
2.4.1. Merits
2.4.2. Applications
o The General Regression Neural Network (GRNN) was proposed by D. F. Specht in 1991
o GRNN is a Single pass learning Network
o GRNN uses a Gaussian activation function in its hidden (pattern) layer
o GRNN is based on Function Approximation or Function estimation procedures
o Output is estimated using weighted average of the outputs of training dataset, where the
weight is calculated using the Euclidean distance between the training data and test
data
o If the distance is large, the weight will be very small; if the distance is small, more weight is given to that output
o Contains 4 layers: (1) Input layer (2) Hidden (pattern) Layer (3) Summation Layer (4)
Output (division) Layer
o GRNN’s estimator is given by the equation
Ŷ(x) = [ Σi Y(xi) e^(-di² ∕ 2σ²) ] ∕ [ Σi e^(-di² ∕ 2σ²) ]
Where x = input (test) vector
xi = ith training sample
Y(xi) = output for sample i
di² = squared Euclidean distance between x and xi
e^(-di² ∕ 2σ²) = activation function value; this value is taken as the weight
σ = spread constant (the only unknown parameter)
Select the σ for which the MSE is minimum
To calculate the optimum value of σ, first divide the samples into two parts: one part is used to train the network and the other to test it. Apply the GRNN to the test data based on the training data and calculate the MSE for different values of σ. Select the minimum MSE and its corresponding σ. The architecture diagram of the GRNN is given in Figure 5.
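A minimal sketch of this estimator and of the hold-out selection of σ is given below; the candidate values of σ are assumptions chosen only for illustration.

import numpy as np

def grnn_predict(x, X_train, y_train, sigma):
    # weight for each training sample i: e^(-di^2 / 2 sigma^2), di^2 = squared Euclidean distance
    d2 = ((X_train - x) ** 2).sum(axis=1)
    w = np.exp(-d2 / (2.0 * sigma ** 2))
    # estimator: weighted average of the training outputs Y(xi)
    return (w @ y_train) / w.sum()

def select_sigma(X_train, y_train, X_test, y_test, candidates=(0.1, 0.3, 0.5, 1.0, 2.0)):
    # apply GRNN to the test part for several sigma values and keep the one with minimum MSE
    best_sigma, best_mse = None, np.inf
    for s in candidates:
        preds = np.array([grnn_predict(x, X_train, y_train, s) for x in X_test])
        mse = ((preds - y_test) ** 2).mean()
        if mse < best_mse:
            best_sigma, best_mse = s, mse
    return best_sigma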
Figure 5. General Regression Neural Network
Consider the characters given in Figure 6. The objective is to recognise a particular alphabet, say ‘A’ in this example. Using image analysis models, the particular alphabet is segmented and converted into intensity (grey-scale pixel) values. The general workflow is shown in Figure 7. The first procedure is segmentation, the process of subdividing the image into sub-blocks. The alphabet “A” is therefore isolated using appropriate segmentation procedures such as thresholding, region growing, or edge-detector-based algorithms.
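A minimal sketch of the simplest of these procedures, threshold-based segmentation, is shown below; the threshold value is an assumption, and region growing or edge detection would require more elaborate routines.

import numpy as np

def threshold_segment(gray_image, threshold=128):
    # gray_image: 2-D array of grey-scale (intensity) values in the range 0-255
    # pixels darker than the threshold are treated as part of the character
    return (gray_image < threshold).astype(np.uint8)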
Figure 6. Input to Character recognition system
Figure 8a. Character Pattern values
Figure 8b. Character Pattern conversion into intensity
Figures 6, 7 and 8 are adapted from Praveen Kumar et al. (2012), “Character Recognition using Neural Network”, IJST, Vol. 3, Issue 2, pp. 978–981.
From Figure 8b, texture features, shape features and/or boundary features can be extracted. These feature values are known as exemplars, and they are the actual input to the neural network. Consider any neural network: its input is the feature table created as explained above, which is shown in Figure 9. This table is provided as input to the neural system.
Figure 9. Adapted from Yusuf Perwej et al. (2011), “Neural Networks for Handwritten English Alphabet Recognition”, International Journal of Computer Applications (0975–8887), Vol. 20, No. 7, April 2011.
Figure 10. ANN implementation of character recognition system
Figure 10: Adapted from Anita Pal et al. (2010), “Handwritten English Character Recognition Using Neural Network”, International Journal of Computer Science & Communication, Vol. 1, No. 2, July–December 2010, pp. 141–144.
If the current input features match the trained feature set, the output produces “1”, which denotes that the particular trained alphabet has been recognised; otherwise the output is “0”, meaning not recognised.
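A minimal sketch of this final matching step, using a nearest-exemplar comparison with a distance tolerance, is given below; the tolerance value and the stored exemplar array are assumptions made for illustration.

import numpy as np

def recognise(feature_vector, trained_exemplars, tolerance=0.5):
    # trained_exemplars: feature vectors stored during training for the target alphabet
    d2 = ((trained_exemplars - feature_vector) ** 2).sum(axis=1)
    # output 1 if the current features match a trained feature set, else 0
    return 1 if d2.min() <= tolerance else 0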
REFERENCE BOOKS