Bayesian Classification: Why?
◼ A statistical classifier: performs probabilistic prediction, i.e.,
predicts class membership probabilities
◼ Foundation: Based on Bayes’ Theorem.
◼ Performance: A simple Bayesian classifier, the naïve Bayesian
classifier, performs comparably to decision tree and selected
neural network classifiers
◼ Incremental: Each training example can incrementally
increase/decrease the probability that a hypothesis is correct —
prior knowledge can be combined with observed data
◼ Standard: Even when Bayesian methods are computationally
intractable, they can provide a standard of optimal decision
making against which other methods can be measured
Bayes’ Theorem: Basics
◼ Total probability theorem: P(B) = Σ_{i=1}^{M} P(B|Ai) P(Ai)
◼ Bayes’ theorem: P(H|X) = P(X|H) P(H) / P(X)
◼ Let X be a data sample (“evidence”): class label is unknown
◼ Let H be a hypothesis that X belongs to class C
◼ Classification is to determine P(H|X) (i.e., the posterior probability): the
probability that the hypothesis holds given the observed data sample X
◼ P(H) (prior probability): the initial probability
◼ E.g., X will buy computer, regardless of age, income, …
◼ P(X): probability that sample data is observed
◼ P(X|H) (likelihood): the probability of observing the sample X, given that
the hypothesis holds
◼ E.g., given that X will buy a computer, the probability that X is
aged 31…40 with medium income
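As a quick check of these two formulas, here is a minimal Python sketch; all numeric values below are hypothetical, chosen only to exercise the identities.

```python
# Minimal sketch of the two identities above; all numbers are hypothetical.

def total_probability(likelihoods, priors):
    """P(B) = sum_i P(B|Ai) * P(Ai), over a partition A1..AM."""
    return sum(l * p for l, p in zip(likelihoods, priors))

def bayes(likelihood, prior, evidence):
    """P(H|X) = P(X|H) * P(H) / P(X)."""
    return likelihood * prior / evidence

# Two mutually exclusive hypotheses H and not-H with hypothetical values:
p_x = total_probability([0.8, 0.3], [0.4, 0.6])  # P(X) = 0.8*0.4 + 0.3*0.6 = 0.50
print(bayes(0.8, 0.4, p_x))                      # P(H|X) = 0.32 / 0.50 = 0.64
```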
Prediction Based on Bayes’ Theorem
◼ Given training data X, posteriori probability of a hypothesis H,
P(H|X), follows the Bayes’ theorem
P(H|X) = P(X|H) P(H) / P(X)
◼ Informally, this can be viewed as
posterior = likelihood × prior / evidence
◼ Predicts that X belongs to Ci iff the probability P(Ci|X) is the
highest among P(Ck|X) over all k classes
◼ Practical difficulty: It requires initial knowledge of many
probabilities, involving significant computational cost
Classification Is to Derive the Maximum Posteriori
◼ Let D be a training set of tuples and their associated class
labels, and each tuple is represented by an n-D attribute vector
X = (x1, x2, …, xn)
◼ Suppose there are m classes C1, C2, …, Cm.
◼ Classification is to derive the maximum a posteriori (MAP) class,
i.e., the class Ci with maximal P(Ci|X)
◼ This can be derived from Bayes’ theorem
P(Ci|X) = P(X|Ci) P(Ci) / P(X)
◼ Since P(X) is constant for all classes, only
P(X|Ci) P(Ci)
needs to be maximized (a minimal sketch of this rule follows below)
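A minimal Python sketch of this decision rule; the scores dict below is hypothetical (its values happen to match the worked example later in this section).

```python
# MAP decision: P(X) is the same for every class, so comparing the
# unnormalized products P(X|Ci) * P(Ci) is enough.
def map_class(scores):
    """scores maps class label -> P(X|Ci) * P(Ci)."""
    return max(scores, key=scores.get)

print(map_class({"yes": 0.028, "no": 0.007}))  # -> "yes"
```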
Naïve Bayes Classifier
◼ A simplifying assumption: attributes are conditionally
independent given the class (i.e., no dependence relation
between attributes within a class):
P(X|Ci) = ∏_{k=1}^{n} P(xk|Ci) = P(x1|Ci) × P(x2|Ci) × … × P(xn|Ci)
◼ This greatly reduces the computation cost: only the per-class
value counts need to be collected
◼ If Ak is categorical, P(xk|Ci) is the # of tuples in Ci having value xk
for Ak divided by |Ci, D| (# of tuples of Ci in D)
◼ If Ak is continuous-valued, P(xk|Ci) is usually computed from a
Gaussian distribution with mean μ and standard deviation σ:
g(x, μ, σ) = (1 / (√(2π) σ)) e^(−(x−μ)² / (2σ²))
and P(xk|Ci) = g(xk, μCi, σCi)
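A minimal sketch of the continuous case, assuming μ and σ have already been estimated from the class-Ci training tuples; the income numbers below are hypothetical.

```python
import math

def gaussian(x, mu, sigma):
    """g(x, mu, sigma) = 1 / (sqrt(2*pi)*sigma) * exp(-(x - mu)**2 / (2*sigma**2))"""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

# Hypothetical: incomes in class Ci have mean 110 and standard deviation 25,
# so the density used for P(income = 120 | Ci) is:
print(gaussian(120, 110, 25))  # ~0.0147
```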
Naïve Bayes Classifier: Training Dataset
Example:
age     income   student   credit_rating   buys_computer
<=30    high     no        fair            no
<=30    high     no        excellent       no
31…40   high     no        fair            yes
>40     medium   no        fair            yes
>40     low      yes       fair            yes
>40     low      yes       excellent       no
31…40   low      yes       excellent       yes
<=30    medium   no        fair            no
<=30    low      yes       fair            yes
>40     medium   yes       fair            yes
<=30    medium   yes       excellent       yes
31…40   medium   no        excellent       yes
31…40   high     yes       fair            yes
>40     medium   no        excellent       no

Class:
C1: buys_computer = ‘yes’    C2: buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
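For reproducibility, here is a sketch encoding the 14 training tuples above in Python; the field names are illustrative shorthand for the attribute names, and the later sketches in this section reuse DATA and X.

```python
# The 14 training tuples above, encoded as records (field names are
# illustrative shorthand for the attribute names in the table).
DATA = [
    {"age": "<=30",   "income": "high",   "student": "no",  "credit": "fair",      "buys": "no"},
    {"age": "<=30",   "income": "high",   "student": "no",  "credit": "excellent", "buys": "no"},
    {"age": "31..40", "income": "high",   "student": "no",  "credit": "fair",      "buys": "yes"},
    {"age": ">40",    "income": "medium", "student": "no",  "credit": "fair",      "buys": "yes"},
    {"age": ">40",    "income": "low",    "student": "yes", "credit": "fair",      "buys": "yes"},
    {"age": ">40",    "income": "low",    "student": "yes", "credit": "excellent", "buys": "no"},
    {"age": "31..40", "income": "low",    "student": "yes", "credit": "excellent", "buys": "yes"},
    {"age": "<=30",   "income": "medium", "student": "no",  "credit": "fair",      "buys": "no"},
    {"age": "<=30",   "income": "low",    "student": "yes", "credit": "fair",      "buys": "yes"},
    {"age": ">40",    "income": "medium", "student": "yes", "credit": "fair",      "buys": "yes"},
    {"age": "<=30",   "income": "medium", "student": "yes", "credit": "excellent", "buys": "yes"},
    {"age": "31..40", "income": "medium", "student": "no",  "credit": "excellent", "buys": "yes"},
    {"age": "31..40", "income": "high",   "student": "yes", "credit": "fair",      "buys": "yes"},
    {"age": ">40",    "income": "medium", "student": "no",  "credit": "excellent", "buys": "no"},
]

# The unseen tuple X to classify:
X = {"age": "<=30", "income": "medium", "student": "yes", "credit": "fair"}
```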
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
◼ Compute P(Ci) for each class:
◼ P(C1) = P(buys_computer = “yes”) = 9/14 = 0.643
◼ P(C2) = P(buys_computer = “no”) = 5/14 = 0.357
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
◼ Compute P(X|Ci) for each class as the product of the per-attribute
conditionals (a counting sketch follows below):
P(X|C1) = P(x1|C1) × P(x2|C1) × … × P(xn|C1)
P(X|C2) = P(x1|C2) × P(x2|C2) × … × P(xn|C2)
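Each factor P(xk|Ci) is a simple count ratio over the training set. A sketch continuing the DATA encoding above (cond_prob is an illustrative helper, not from the slides):

```python
# Estimate P(value | class) by counting, per the categorical rule above.
def cond_prob(attr, value, cls, data):
    in_class = [t for t in data if t["buys"] == cls]
    return sum(t[attr] == value for t in in_class) / len(in_class)

print(cond_prob("age", "<=30", "yes", DATA))  # 2/9 ~ 0.222
print(cond_prob("age", "<=30", "no",  DATA))  # 3/5 = 0.600
```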
Naïve Bayes Classifier: Conditional Probabilities (Age)
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
Age     Buys Computer   Count   Total   P(Age | Buys Computer)
<=30    Yes             2       9       2/9 ≈ 0.222
<=30    No              3       5       3/5 = 0.600
31…40   Yes             4       9       4/9 ≈ 0.444
31…40   No              0       5       0/5 = 0.000
>40     Yes             3       9       3/9 ≈ 0.333
>40     No              2       5       2/5 = 0.400
Naïve Bayes Classifier: Conditional Probabilities (Income)
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
Income   Buys Computer   Count   Total   P(Income | Buys Computer)
High     Yes             2       9       2/9 ≈ 0.222
High     No              2       5       2/5 = 0.400
Medium   Yes             4       9       4/9 ≈ 0.444
Medium   No              2       5       2/5 = 0.400
Low      Yes             3       9       3/9 ≈ 0.333
Low      No              1       5       1/5 = 0.200
Naïve Bayes Classifier: Conditional Probabilities (Student)
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
Student   Buys Computer   Count   Total   P(Student | Buys Computer)
Yes       Yes             6       9       6/9 ≈ 0.667
Yes       No              1       5       1/5 = 0.200
No        Yes             3       9       3/9 ≈ 0.333
No        No              4       5       4/5 = 0.800
Naïve Bayes Classifier: Conditional Probabilities (Credit Rating)
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
Data to be classified:
X = (age <=30, Income = medium, Student = yes, Credit_rating = Fair)
Credit Rating   Buys Computer   Count   Total   P(Credit Rating | Buys Computer)
Fair            Yes             6       9       6/9 ≈ 0.667
Fair            No              2       5       2/5 = 0.400
Excellent       Yes             3       9       3/9 ≈ 0.333
Excellent       No              3       5       3/5 = 0.600
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
◼ Compute P(X|Ci) for each class
P(X|C1) = P(X|buys_computer = “yes”)
= P(age <=30 | yes) × P(income = medium | yes) × P(student = yes | yes) × P(credit_rating = fair | yes)
= 0.222 × 0.444 × 0.667 × 0.667 = 0.044
P(X|C2) = P(X|buys_computer = “no”)
= 0.6 × 0.4 × 0.2 × 0.4 = 0.019
Naïve Bayes Classifier: An Example
Class:
C1:buys_computer = ‘yes’ C2:buys_computer = ‘no’
◼ Compute P(X|Ci) * P(Ci) for each class
P(X|C1) * P(C1) = 0.044 * 0.643 = 0.028
P(X|C2) * P(C2) = 0.019 * 0.357 = 0.007
◼ Decision
P(X|C1) P(C1) = 0.028 > P(X|C2) P(C2) = 0.007
Therefore, X belongs to class C1, i.e., “buys_computer = yes”
(the sketch below ties the steps together)
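An end-to-end sketch of the prediction, reusing DATA, X, and cond_prob from the earlier sketches:

```python
from math import prod

# Naive Bayes prediction: pick the class maximizing P(X|Ci) * P(Ci).
def predict(x, data, target="buys"):
    scores = {}
    for cls in {t[target] for t in data}:
        prior = sum(t[target] == cls for t in data) / len(data)              # P(Ci)
        likelihood = prod(cond_prob(a, v, cls, data) for a, v in x.items())  # P(X|Ci)
        scores[cls] = likelihood * prior                                     # P(X|Ci) * P(Ci)
    return max(scores, key=scores.get), scores

label, scores = predict(X, DATA)
print(scores)  # {'yes': ~0.028, 'no': ~0.007}
print(label)   # 'yes'
```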
Solved Example on Bayes Theorem
◼ Researchers investigated the effectiveness of using the
Hologic Sahara Sonometer, a portable device that
measures bone mineral density (BMD) in the ankle, in
predicting a fracture. They used a Hologic estimated
bone mineral density value of 0.57 as a cutoff. The
investigation yielded the following data (2×2 table
recovered from the calculations on the next slides):

                     Fracture (+D)   No Fracture (−D)   Total
Test positive (+T)        214              670            884
Test negative (−T)         73              330            403
Total                     287             1000           1287
Solved Example on Bayes Theorem
a) Calculate the sensitivity of using a BMD value of 0.57
as a cutoff value for predicting fracture.
b) Calculate the specificity of using a BMD value of 0.57
as a cutoff value for predicting fracture.
c) If it is estimated that 10 percent of the U.S.
population have a confirmed bone fracture, what is
the predictive value positive of using a BMD value of
0.57 as a cutoff value for predicting fracture? That is,
we wish to estimate the probability that a subject
who tests positive at the 0.57 cutoff has a confirmed
bone fracture.
Solved Example on Bayes Theorem
a) Sensitivity = P(+T | +D) = 214/287 = 0.7456 = 74.56%
b) Specificity = P(−T | −D) = 330/1000 = 0.33 = 33%
c) Predictive value positive:
P(+D | +T) = P(+T | +D) P(+D) / P(+T)
Solved Example on Bayes Theorem
c) Predictive value positive
P(+T) = P(+T | +D) P(+D) + P(+T | −D) P(−D)
= (214/287)(0.1) + (670/1000)(0.9) = 0.6776
◼ P(+D | +T) = P(+T | +D) P(+D) / P(+T) = (0.7456 × 0.1) / 0.6776 ≈ 0.11
Thus, even with a positive test, the estimated probability of a
confirmed fracture is only about 11%, because the assumed
prevalence P(+D) is low.
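The same computation as a Python sketch, using the counts from the reconstructed 2×2 table and the assumed 10 percent prevalence:

```python
# Predictive value positive via Bayes' theorem.
sensitivity = 214 / 287       # P(+T | +D)
false_pos   = 670 / 1000      # P(+T | -D) = 1 - specificity
prior_d     = 0.10            # P(+D), assumed prevalence of fracture

p_t = sensitivity * prior_d + false_pos * (1 - prior_d)  # total probability, P(+T)
ppv = sensitivity * prior_d / p_t                        # Bayes: P(+D | +T)
print(round(p_t, 4), round(ppv, 2))                      # 0.6776 0.11
```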