Data Warehouse and Mining – Unit 3
› Basic issues regarding classification and prediction
› Classification by Decision Tree,
› Bayesian classification,
› Classification by back propagation,
› Associative classification,
› Prediction:
› Statistical-Based Algorithms,
› Decision Tree-Based Algorithms,
› Neural Network-Based Algorithms,
› Rule-Based Algorithms,
› Other Classification Methods,
› Combining Techniques,
› Classifier Accuracy and Error Measures
Rule-based Classification
› Rules are a good way of representing information or bits of
knowledge. A rule-based classifier uses a set of IF-THEN rules for
classification. An IF-THEN rule is an expression of the form
› IF condition THEN conclusion.
› An example is rule R1,
R1: IF age = youth AND student = yes THEN buys_computer = yes.
› If the condition (i.e., all the attribute tests) in a rule antecedent
holds true for a given tuple, we say that the rule antecedent is
satisfied (or simply, that the rule is satisfied) and that the rule
covers the tuple.
› If a rule is satisfied by a tuple X, the rule is said to be triggered.
› A rule R can be assessed by its coverage and accuracy.
Given a class-labeled data set, D, let n_covers be the number
of tuples covered by R, n_correct be the number of tuples
covered by R that it correctly classifies, and |D| be the
number of tuples in D. We can define the coverage and
accuracy of R as

coverage(R) = n_covers / |D|
accuracy(R) = n_correct / n_covers
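› A minimal sketch in Python of these two measures (the tuple and rule representations, and all helper names, are illustrative assumptions, not from the slides):

```python
# Sketch: coverage and accuracy of an IF-THEN rule over a class-labeled
# data set D. Tuples are dicts; a rule is (antecedent dict, predicted class).

def covers(antecedent, t):
    """True if every attribute test in the antecedent holds for tuple t."""
    return all(t.get(attr) == value for attr, value in antecedent.items())

def coverage_and_accuracy(antecedent, predicted_class, D, label="class"):
    covered = [t for t in D if covers(antecedent, t)]
    n_covers = len(covered)
    n_correct = sum(1 for t in covered if t[label] == predicted_class)
    coverage = n_covers / len(D)                          # n_covers / |D|
    accuracy = n_correct / n_covers if n_covers else 0.0  # n_correct / n_covers
    return coverage, accuracy

# Rule R1: IF age = youth AND student = yes THEN buys_computer = yes
R1 = ({"age": "youth", "student": "yes"}, "yes")
```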
Rule-based Classification
› Example:
› Rule accuracy and coverage. Consider rule R1, which covers 2 of the 14 tuples.
› It can correctly classify both tuples. Therefore,
› coverage(R1) = 2/14 = 14.28% and accuracy(R1) = 2/2 = 100%.
› That is, a rule’s coverage is the percentage of tuples that are covered by the rule
(i.e., their attribute values hold true for the rule’s antecedent). For a rule’s
accuracy, we look at the tuples that it covers and see what percentage of them
the rule can correctly classify.
› X=(age = youth, income = medium, student = yes, credit rating
= fair).
› We would like to classify X according to buys_computer. X
satisfies R1, which triggers the rule.
› If R1 is the only rule satisfied, then the rule fires by returning
the class prediction for X.
› Note that triggering does not always mean firing because there
may be more than one rule that is satisfied! If more than one
rule is triggered, we have a potential problem.
› What if they each specify a different class? Or what if no rule is
satisfied by X?
Conflict resolution strategies for selecting the rule that fires
› If more than one rule is triggered, we need a conflict resolution
strategy to figure out which rule gets to fire and assign its class
prediction to X.
› There are many possible strategies. We look at two, namely size
ordering and rule ordering.
› The size ordering scheme assigns the highest priority to the
triggering rule that has the “toughest” requirements, where
toughness is measured by the rule antecedent size. That is, the
triggering rule with the most attribute tests is fired.
› Rule ordering comes in two flavors: class-based ordering and rule-based ordering.
› With class-based ordering, the classes are sorted in order of decreasing
“importance” such as by decreasing order of prevalence. That is, all the rules for
the most prevalent (or most frequent) class come first, the rules for the next
prevalent class come next, and so on. Alternatively, they may be sorted based
on the misclassification cost per class. Within each class, the rules are not
ordered—they don’t have to be because they all predict the same class (and so
there can be no class conflict).
› With rule-based ordering, the rules are organized into one long priority list,
according to some measure of rule quality, such as accuracy, coverage, or size
(number of attribute tests in the rule antecedent), or based on advice from
domain experts. When rule ordering is used, the rule set is known as a decision
list. With rule ordering, the triggering rule that appears earliest in the list has the
highest priority, and so it gets to fire its class prediction. Any other rule that
satisfies X is ignored. Most rule-based classification systems use a class-based
rule-ordering strategy.
› What if no rule is satisfied by X? Fall back to a default rule, which fires only when no other rule covers the tuple.
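› A minimal sketch of size ordering with a default-rule fallback (the rule format and names are assumptions for illustration):

```python
# Sketch: size ordering with a default rule. A rule is
# (antecedent dict, predicted class).

def classify(rules, X, default_class):
    # Rules whose antecedent is satisfied by X are "triggered".
    triggered = [(ante, cls) for ante, cls in rules
                 if all(X.get(a) == v for a, v in ante.items())]
    if not triggered:
        return default_class              # no rule is satisfied by X
    # Size ordering: the triggering rule with the most attribute tests fires.
    toughest, cls = max(triggered, key=lambda r: len(r[0]))
    return cls
```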
Forming Rules from Decision Trees
Sample rule (one root-to-leaf path):
IF age = youth AND student = no
THEN buys_computer = no
› A disjunction (logical OR) is implied between each of the extracted
rules. Because the rules are extracted directly from the tree, they
are mutually exclusive and exhaustive.
› Mutually exclusive means that we cannot have rule conflicts here
because no two rules will be triggered for the same tuple. (We have
one rule per leaf, and any tuple can map to only one leaf.)
› Exhaustive means there is one rule for each possible attribute–value
combination, so that this set of rules does not require a default
rule. Therefore, the order of the rules does not matter—they are
unordered.
› So, for the rules extracted from the previous tree: are they mutually
exclusive, and are they exhaustive? (A sketch of the extraction follows.)
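› A minimal sketch of reading rules off a tree, one rule per root-to-leaf path (the node encoding is hypothetical; the tree mirrors the buys_computer example):

```python
# Sketch: extracting one IF-THEN rule per leaf by walking every
# root-to-leaf path. Encoding: a leaf is a class-label string; an
# internal node is (attribute, {branch_value: subtree}).

def extract_rules(node, path=()):
    if isinstance(node, str):                  # leaf: emit one rule
        return [(dict(path), node)]
    attribute, branches = node
    rules = []
    for value, subtree in branches.items():
        rules += extract_rules(subtree, path + ((attribute, value),))
    return rules

tree = ("age", {
    "youth": ("student", {"no": "buys_computer = no",
                          "yes": "buys_computer = yes"}),
    "middle_aged": "buys_computer = yes",
    "senior": ("credit_rating", {"fair": "buys_computer = yes",
                                 "excellent": "buys_computer = no"}),
})
for antecedent, conclusion in extract_rules(tree):
    print("IF " + " AND ".join(f"{a} = {v}" for a, v in antecedent.items())
          + " THEN " + conclusion)
```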
Classification by Backpropagation
› Backpropagation is a neural network learning algorithm
› Multilayer Feed-forward NN: A multilayer feed-forward neural network consists of
an input layer, one or more hidden layers, and an output layer.
• Input units
• Input layer
• Weights
• Neurodes
• Hidden layer(s)
• Output layer
• Fully connected NN
Defining a network topology
› Before training can begin, the user must decide on the
network topology by specifying the number of units in the
input layer, the number of hidden layers (if more than
one), the number of units in each hidden layer, and the
number of units in the output layer.
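› One possible way to record such a topology, with one weight per connection and one bias per non-input unit (the sizes and names here are illustrative assumptions):

```python
import random

# Sketch: fixing a topology before training. layer_sizes[0] is the number
# of input units, middle entries are hidden-layer sizes, and the last
# entry is the number of output units.
layer_sizes = [3, 2, 1]   # 3 inputs, one hidden layer of 2 units, 1 output

# Weights between consecutive layers and biases for non-input units,
# initialized to small random numbers (see the next slide).
weights = [[[random.uniform(-0.5, 0.5) for _ in range(layer_sizes[l])]
            for _ in range(layer_sizes[l + 1])]
           for l in range(len(layer_sizes) - 1)]
biases = [[random.uniform(-0.5, 0.5) for _ in range(n)]
          for n in layer_sizes[1:]]
```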
Backpropagation
› Backpropagation learns by iteratively processing a data set of training tuples,
comparing the network’s prediction for each tuple with the actual known target
value. The target value may be the known class label of the training tuple (for
classification problems) or a continuous value (for numeric prediction).
› For each training tuple, the weights are modified so as to minimize the mean-
squared error between the network’s prediction and the actual target value.
› These modifications are made in the “backwards” direction (i.e., from the output
layer) through each hidden layer down to the first hidden layer (hence the name
backpropagation).
› Although it is not guaranteed, in general the weights will eventually converge,
and the learning process stops.
Backpropagation algorithm
› Initialize the weights: The weights in the network are initialized to small random
numbers (e.g., ranging from -1.0 to 1.0, or -0.5 to 0.5). Each unit has a bias
associated with it. The biases are similarly initialized to small random numbers.
› Each training tuple, X, is processed by the following steps:
› Propagate the inputs forward: First, the training tuple is fed to the network’s input
layer. The inputs pass through the input units, unchanged. That is, for an input
unit, j, its output, Oj , is equal to its input value, Ij .
› Next, the net input and output of each unit in the hidden and output layers are
computed. The net input to a unit in the hidden or output layers is computed as a
linear combination of its inputs.
› Each such unit has a number of inputs to it that are, in fact, the outputs of the
units connected to it in the previous layer. Each connection has a weight. To
compute the net input to the unit, each input connected to the unit is multiplied by
its corresponding weight, and this is summed.
Backpropagation algorithm
› Given a unit, j, in a hidden or output layer, the net input, I_j, to unit j is

I_j = Σ_i w_ij O_i + θ_j

› where w_ij is the weight of the connection from unit i in the previous layer to unit j; O_i is the output of
unit i from the previous layer; and θ_j is the bias of the unit. The bias acts as a threshold in that it
serves to vary the activity of the unit.
› Each unit in the hidden and output layers takes its net input and then applies an activation function
to it. The function symbolizes the activation of the neuron represented by the unit. The logistic, or
sigmoid, function is used.
› Given the net input I_j to unit j, then O_j, the output of unit j, is computed as

O_j = 1 / (1 + e^(-I_j))
› This function is also referred to as a squashing function, because it maps a large input domain onto
the smaller range of 0 to 1. The logistic function is nonlinear and differentiable, allowing the
backpropagation algorithm to model classification problems that are linearly inseparable.
› We compute the output values, Oj , for each hidden layer, up to and including the output layer, which
gives the network’s prediction.
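› A minimal sketch of this forward pass, reusing the weights/biases layout from the earlier topology sketch (the function names are assumptions):

```python
import math

def sigmoid(I):
    """Logistic (squashing) function: maps net input into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-I))

def feed_forward(x, weights, biases):
    """Propagate the inputs forward; returns the output list O of every
    layer (the last list is the network's prediction)."""
    outputs = [list(x)]                  # input units pass values unchanged
    for W, b in zip(weights, biases):
        layer = []
        for j in range(len(W)):
            # Net input: I_j = sum_i w_ij * O_i + theta_j
            I_j = sum(w * o for w, o in zip(W[j], outputs[-1])) + b[j]
            layer.append(sigmoid(I_j))   # O_j = 1 / (1 + e^(-I_j))
        outputs.append(layer)
    return outputs
```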
› Backpropagate the error: The error is propagated backward by updating the
weights and biases to reflect the error of the network’s prediction. For a unit j in
the output layer, the error Err_j is computed by

Err_j = O_j (1 - O_j)(T_j - O_j)

› where O_j is the actual output of unit j, and T_j is the known target value of the
given training tuple. Note that O_j (1 - O_j) is the derivative of the logistic function.
› To compute the error of a hidden layer unit j, the weighted sum of the errors of
the units connected to unit j in the next layer is considered. The error of a
hidden layer unit j is

Err_j = O_j (1 - O_j) Σ_k Err_k w_jk

› where w_jk is the weight of the connection from unit j to a unit k in the next higher
layer, and Err_k is the error of unit k.
› The weights and biases are updated to reflect the propagated errors.
› Changes in weights are represented by Δw_ij and calculated as:

Δw_ij = (l) Err_j O_i
w_ij = w_ij + Δw_ij
› Here l is the learning rate, a constant typically having a value between 0.0 and 1.0.
› A rule of thumb is to set the learning rate to 1/t, where t is the number of
iterations through the training set so far.
› Biases are updated by the following equations, where Δθ_j is the change in bias θ_j:

Δθ_j = (l) Err_j
θ_j = θ_j + Δθ_j
› The weights and biases are updated after the presentation of each tuple. This is
referred to as case updating.
› Alternatively, the weight and bias increments could be accumulated in variables,
so that the weights and biases are updated after all the tuples in the training set
have been presented. This latter strategy is called epoch updating, where one
iteration through the training set is an epoch.
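› A minimal sketch of the backward pass with case updating, following the Err and Δ formulas above (it reuses the structures from the earlier sketches; names are illustrative):

```python
def backpropagate(outputs, target, weights, biases, l_rate):
    """One backward pass with case updating; `outputs` is the per-layer
    list returned by feed_forward."""
    # Output-layer error: Err_j = O_j (1 - O_j)(T_j - O_j)
    errors = [o * (1 - o) * (t - o) for o, t in zip(outputs[-1], target)]
    for layer in range(len(weights) - 1, -1, -1):
        O_prev = outputs[layer]
        if layer > 0:
            # Hidden-layer error: Err_j = O_j (1 - O_j) * sum_k Err_k w_jk
            prev_errors = [O_prev[j] * (1 - O_prev[j]) *
                           sum(errors[k] * weights[layer][k][j]
                               for k in range(len(errors)))
                           for j in range(len(O_prev))]
        for k, err_k in enumerate(errors):
            for j in range(len(O_prev)):
                weights[layer][k][j] += l_rate * err_k * O_prev[j]  # w += Δw
            biases[layer][k] += l_rate * err_k                      # θ += Δθ
        if layer > 0:
            errors = prev_errors
```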
› Terminating condition: Training stops when all Δw_ij in the
previous epoch were so small as to be below some
specified threshold; or
› The percentage of tuples misclassified in the previous
epoch is below some threshold; or
› A prespecified number of epochs has expired.
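› A minimal training-loop sketch tying the pieces together; it implements two of the three terminating conditions (the thresholds and names are illustrative assumptions):

```python
def train(data, weights, biases, max_epochs=1000, err_threshold=0.05):
    """Training loop with case updating. Stops when the fraction of
    misclassified tuples in an epoch falls below a threshold, or after
    a prespecified number of epochs."""
    for epoch in range(1, max_epochs + 1):
        l_rate = 1.0 / epoch             # rule of thumb: l = 1/t
        misclassified = 0
        for x, target in data:
            outputs = feed_forward(x, weights, biases)
            if [round(o) for o in outputs[-1]] != list(target):
                misclassified += 1
            backpropagate(outputs, target, weights, biases, l_rate)
        if misclassified / len(data) < err_threshold:
            break                        # terminating condition met
```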
› ASSIGNMENT: Work out the example given in the book.
Associative classification
› Steps:
1. Mine the data for frequent itemsets, that is, find commonly
occurring attribute–value pairs in the data.
2. Analyze the frequent itemsets to generate association rules per
class, which satisfy confidence and support criteria.
3. Organize the rules to form a rule-based classifier.
› Three types:
– Classification Based on Associations (CBA)
– Classification based on Multiple Association Rules (CMAR)
– Classification based on Predictive Association Rules (CPAR)
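› A minimal CBA-style sketch of steps 2 and 3: keep class-labeled rules meeting minimum support and confidence, then order them into a decision list (the rule format and thresholds are assumptions):

```python
# Sketch: `rules` are class association rules of the form
# (antecedent_itemset, class_label, support, confidence), e.g. produced
# by a frequent-itemset miner in steps 1-2.

def build_classifier(rules, min_sup=0.01, min_conf=0.6):
    kept = [r for r in rules if r[2] >= min_sup and r[3] >= min_conf]
    # Order into a decision list: higher confidence first, ties broken
    # by higher support; the earliest satisfied rule fires.
    kept.sort(key=lambda r: (r[3], r[2]), reverse=True)
    return kept
```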