
Lecture-8

Machine Learning with


Python
Naïve Bayes Classifier Algorithm
❖Naïve Bayes algorithm is a supervised learning algorithm, which is based
on Bayes theorem and used for solving classification problems.
❖It is mainly used in text classification that includes a high-dimensional training
dataset.
❖It is a probabilistic classifier, which means it predicts on the basis of the
probability that an object belongs to a class.
❖Some popular applications of the Naïve Bayes algorithm are spam filtering,
sentiment analysis, and classifying articles.
Why is it called Naïve Bayes?
The name Naïve Bayes combines two words, Naïve and Bayes, which can be
described as follows:
❖Naïve: It is called naïve because it assumes that the occurrence of a certain
feature is independent of the occurrence of the other features. For example, if a
fruit is identified on the basis of color, shape, and taste, then a red, spherical,
and sweet fruit is recognized as an apple. Each feature individually contributes
to the identification, without depending on the others.
❖Bayes: It is called Bayes because it depends on the principle of Bayes' Theorem.
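The independence assumption above can be sketched numerically. The per-feature likelihoods below are made-up illustrative values, not figures from the slides:

```python
# Sketch of the naive independence assumption with made-up numbers:
# under the assumption, P(red, spherical, sweet | apple) is approximated
# by the product of the individual feature likelihoods.
p_red_apple = 0.8        # assumed P(red | apple)
p_spherical_apple = 0.9  # assumed P(spherical | apple)
p_sweet_apple = 0.7      # assumed P(sweet | apple)

p_features_apple = p_red_apple * p_spherical_apple * p_sweet_apple
print(round(p_features_apple, 3))  # 0.504
```

The real joint likelihood would require modeling how the features co-occur; the naive product is what makes the classifier cheap to train.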
Bayes' Theorem
❖Bayes' theorem is also known as Bayes' Rule or Bayes' law, which is used to
determine the probability of a hypothesis with prior knowledge. It depends on
the conditional probability.
❖The formula for Bayes' theorem is given as:

P(A|B) = P(B|A) * P(A) / P(B)

Where,
P(A|B) is the Posterior probability: the probability of hypothesis A given the
observed evidence B.
P(B|A) is the Likelihood: the probability of observing evidence B given that
hypothesis A is true.
P(A) is the Prior probability: the probability of hypothesis A before the
evidence is observed.
P(B) is the Marginal probability: the overall probability of the evidence B.
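As a quick numeric illustration of the formula, here is a sketch with made-up spam-filter probabilities (all variable names and numbers are assumptions for illustration):

```python
# Bayes' theorem with made-up spam-filter numbers (illustrative only).
p_spam = 0.2        # P(A): prior probability that a message is spam
p_word_spam = 0.5   # P(B|A): likelihood of the word "free" in spam
p_word_ham = 0.05   # P(B|not A): likelihood of "free" in non-spam

# P(B): marginal probability of seeing the word at all
p_word = p_word_spam * p_spam + p_word_ham * (1 - p_spam)

# P(A|B): posterior probability the message is spam, given the word
p_spam_word = p_word_spam * p_spam / p_word
print(round(p_spam_word, 4))  # 0.7143
```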
Working of Naïve Bayes' Classifier
❖Suppose we have a dataset of weather conditions with a corresponding target
variable "Play". Using this dataset, we need to decide whether or not we should
play on a particular day, according to the weather conditions.
So to solve this problem, we need to follow the below steps:
❖Convert the given dataset into frequency tables.
❖Generate Likelihood table by finding the probabilities of given features.
❖Now, use Bayes theorem to calculate the posterior probability.
Problem: If the weather is sunny, then the Player should play or not?
Solution: To solve this, first consider the below dataset:
Sample Dataset: [table of weather conditions vs. the "Play" target omitted here]
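The three steps above can be sketched in Python. Since the slide's table is not reproduced in this text, the counts below follow the classic 14-row weather/"Play" example and are an assumption:

```python
from collections import Counter

# Assumed counts from the classic 14-row weather/"Play" example
# (the slide's actual table is not reproduced here).
data = ([("Sunny", "No")] * 3 + [("Sunny", "Yes")] * 2 +
        [("Overcast", "Yes")] * 4 +
        [("Rainy", "Yes")] * 3 + [("Rainy", "No")] * 2)

# Step 1: frequency tables
freq = Counter(data)                # counts of (weather, play) pairs
play = Counter(p for _, p in data)  # counts of the "Play" classes

# Step 2: likelihood table entries
p_sunny_given_yes = freq[("Sunny", "Yes")] / play["Yes"]  # 2/9
p_yes = play["Yes"] / len(data)                           # 9/14
p_sunny = sum(v for (w, _), v in freq.items() if w == "Sunny") / len(data)  # 5/14

# Step 3: Bayes' theorem -> posterior P(Yes | Sunny)
p_yes_given_sunny = p_sunny_given_yes * p_yes / p_sunny
print(round(p_yes_given_sunny, 2))  # 0.4
```

With these counts the posterior P(Yes | Sunny) comes out above 0.5's complement cutoff for "No" (0.4 vs 0.6), so the player would not play; with other counts the decision can flip.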
Advantages of Naïve Bayes Classifier:
❖Naïve Bayes is one of the fastest and simplest ML algorithms for predicting the
class of a dataset.
❖It can be used for Binary as well as Multi-class Classifications.
❖It performs well in Multi-class predictions as compared to the other Algorithms.
❖It is the most popular choice for text classification problems.
Disadvantages of Naïve Bayes Classifier:
❖Naive Bayes assumes that all features are independent or unrelated, so it
cannot learn the relationship between features.
Applications of Naïve Bayes Classifier:
❖It is used for Credit Scoring.
❖It is used in medical data classification.
❖It can be used for real-time prediction, because the Naïve Bayes classifier is an
eager learner: the model is built at training time, so prediction is fast.
❖It is used in Text classification such as Spam filtering and Sentiment analysis.
Types of Naïve Bayes Model:
There are three types of Naive Bayes Model, which are given below:
❖Gaussian: The Gaussian model assumes that features follow a normal
distribution. This means if predictors take continuous values instead of discrete,
then the model assumes that these values are sampled from the Gaussian
distribution.
❖Multinomial: The Multinomial Naïve Bayes classifier is used when the data is
multinomial distributed. It is primarily used for document classification
problems, it means a particular document belongs to which category such as
Sports, Politics, education, etc.
The classifier uses the frequency of words for the predictors.
❖Bernoulli: The Bernoulli classifier works similarly to the Multinomial classifier,
but the predictor variables are independent Boolean variables, such as whether a
particular word is present in a document or not. This model is also popular for
document classification tasks.
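A minimal sketch of the three variants using scikit-learn (assuming scikit-learn and NumPy are installed; the toy data below is random and purely illustrative, not the slides' dataset):

```python
# Sketch of the three Naive Bayes variants with random toy data.
import numpy as np
from sklearn.naive_bayes import GaussianNB, MultinomialNB, BernoulliNB

rng = np.random.default_rng(0)

# Gaussian NB: continuous features, assumed normal per class
Xc = rng.normal(size=(100, 3))
yc = (Xc[:, 0] > 0).astype(int)
gnb_pred = GaussianNB().fit(Xc, yc).predict(Xc[:2])

# Multinomial NB: non-negative count features, e.g. word frequencies
Xm = rng.integers(0, 5, size=(100, 6))
ym = rng.integers(0, 2, size=100)
mnb_pred = MultinomialNB().fit(Xm, ym).predict(Xm[:2])

# Bernoulli NB: binary features, e.g. word present / absent
Xb = (Xm > 0).astype(int)
bnb_pred = BernoulliNB().fit(Xb, ym).predict(Xb[:2])

print(gnb_pred, mnb_pred, bnb_pred)
```

In practice the choice among the three follows the feature type: continuous measurements, word counts, or word presence/absence.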
Bayesian Network
In statistics, probabilistic models are used to define relationships between
variables and to calculate the probability of each variable.
Many problems involve a large number of variables. In such cases, a fully
conditional model requires a huge amount of data to cover every combination of
values, which can be intractable to compute in real time.
There have been several attempts to simplify the conditional probability
calculations, such as Naïve Bayes, but these are often not sufficient, because
they discard the dependencies between variables.
❖The solution is to develop a model that preserves the conditional dependencies
between random variables where they exist, and the conditional independences
everywhere else. This leads us to the concept of Bayesian Networks.
❖Bayesian Networks help us effectively visualize the probabilistic model for a
domain and study the relationships between random variables in the form of a
user-friendly graph.
❖Real-world applications are often probabilistic in nature, and to represent the
relationships between multiple events, we need a Bayesian network.
❖ It can also be used in various tasks including prediction, anomaly detection,
diagnostics, automated insight, reasoning, time series prediction, and decision
making under uncertainty.
❖A Bayesian network is a probabilistic graphical model which represents a set
of variables and their conditional dependencies using a directed acyclic
graph.
❖It is also called a Bayes network, belief network, decision network, or Bayesian
model.
❖Bayesian networks are probabilistic, because these networks are built from
a probability distribution, and also use probability theory for prediction and
anomaly detection.

A Bayesian network can be used for building models from data and expert opinions,
and it consists of two parts:
❖A Directed Acyclic Graph (DAG)
❖Tables of conditional probabilities.
The generalized form of a Bayesian network that represents and solves decision
problems under uncertain knowledge is known as an Influence Diagram.
❖Each node corresponds to a random variable, which can be continuous or
discrete.
❖Arcs (directed arrows) represent causal relationships or conditional
probabilities between random variables; each directed link connects a pair of
nodes in the graph.
❖A link means that one node directly influences the other; if there is no
directed link between two nodes, they are independent of each other.
The Bayesian network has mainly two components:
❖The causal component (the graph structure)
❖The actual numbers (the conditional probabilities)
Each node in the Bayesian network has a conditional probability distribution
P(Xi | Parents(Xi)), which quantifies the effect of the parents on that node.
A Bayesian network is based on the joint probability distribution and
conditional probability.
Joint Probability Distribution
❖If we have variables x1, x2, x3, ..., xn, then the probability of a particular
combination of values of x1, x2, x3, ..., xn is given by the joint probability
distribution.
❖P[x1, x2, x3, ..., xn] can be expanded with the chain rule of probability:
= P[x1 | x2, x3, ..., xn] P[x2, x3, ..., xn]
= P[x1 | x2, x3, ..., xn] P[x2 | x3, ..., xn] ... P[xn-1 | xn] P[xn]
❖In general, for each variable Xi we can write:
P(Xi | Xi-1, ..., X1) = P(Xi | Parents(Xi)),
so the joint distribution factorizes as P(x1, ..., xn) = Π P(Xi | Parents(Xi)).
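The factorization can be sketched for a tiny chain-shaped network A → B → C, where C depends only on its parent B (all numbers below are made up for illustration):

```python
# Chain-rule factorization for a chain A -> B -> C (made-up numbers).
p_a = 0.3                              # P(A = True)
p_b_given_a = {True: 0.8, False: 0.4}  # P(B = True | A = a)
p_c_given_b = {True: 0.9, False: 0.1}  # P(C = True | B = b)

# Full chain rule: P(A, B, C) = P(A) * P(B | A) * P(C | A, B);
# the network structure lets us replace P(C | A, B) with P(C | B).
joint = p_a * p_b_given_a[True] * p_c_given_b[True]  # P(A=T, B=T, C=T)
print(round(joint, 3))  # 0.216
```

With n binary variables, the full joint table has 2^n entries, while the factorized form only needs one small table per node; that is the practical payoff of the network structure.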
Local Markov Property
❖Bayesian Networks satisfy the Local Markov Property: a node is conditionally
independent of its non-descendants, given its parents. For example, if node D
has parent A and node B is a non-descendant of D, then P(D|A, B) equals
P(D|A), because D is independent of its non-descendant B given A.
❖This property helps us simplify the joint distribution. The Local Markov
Property also leads to the concept of a Markov Random Field, a random field
around a variable that is said to follow Markov properties.
Example
Harry installed a new burglar alarm at his home to detect burglary. The alarm
responds reliably to a burglary, but it also responds to minor earthquakes.
Harry has two neighbors, David and Sophia, who have taken responsibility for
informing Harry at work when they hear the alarm. David always calls Harry when
he hears the alarm, but sometimes he confuses the phone ringing with the alarm
and calls then too. Sophia, on the other hand, likes to listen to loud music, so
sometimes she fails to hear the alarm. Here we would like to compute the
probability of the Burglary Alarm event.
Problem:
Calculate the probability that the alarm has sounded, but neither a burglary
nor an earthquake has occurred, and both David and Sophia have called Harry.
Solution
❖The Bayesian network for the above problem is given below. The network structure
shows that Burglary and Earthquake are the parent nodes of Alarm and directly
affect the probability of the alarm going off, while David's and Sophia's calls
depend only on the alarm.
❖The network represents our assumptions that David and Sophia do not directly
perceive the burglary, do not notice a minor earthquake, and do not confer with
each other before calling. The conditional distribution for each node is given as
a conditional probability table, or CPT.
❖Each row in a CPT must sum to 1, because the entries in a row represent an
exhaustive set of cases for the variable.
❖In a CPT, a Boolean variable with k Boolean parents requires 2^k rows of
probabilities. Hence, if there are two parents, the CPT will contain 4 rows.
Let's take the observed probabilities for the Burglary and Earthquake components:
P(B = True) = 0.002, the probability of a burglary.
P(B = False) = 0.998, the probability of no burglary.
P(E = True) = 0.001, the probability of a minor earthquake.
P(E = False) = 0.999, the probability that no earthquake occurred.
From the formula of the joint distribution, we can write the problem statement as
a product of conditional probabilities:
P(S, D, A, ¬B, ¬E) = P(S|A) * P(D|A) * P(A|¬B ∧ ¬E) * P(¬B) * P(¬E)
= 0.75 * 0.91 * 0.001 * 0.998 * 0.999
= 0.00068045
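The computation above can be checked directly. P(S|A) = 0.75, P(D|A) = 0.91, and P(A|¬B, ¬E) = 0.001 are the CPT values used in this worked example:

```python
# Check of the alarm-network query P(S, D, A, ¬B, ¬E) using the
# CPT values quoted in the worked example.
p_not_b = 0.998  # P(¬B): no burglary
p_not_e = 0.999  # P(¬E): no earthquake
p_a = 0.001      # P(A | ¬B, ¬E): alarm sounds with neither cause
p_d = 0.91       # P(D | A): David calls given the alarm
p_s = 0.75       # P(S | A): Sophia calls given the alarm

joint = p_s * p_d * p_a * p_not_b * p_not_e
print(round(joint, 8))  # 0.00068045
```

Note how the factorization needs only five small numbers, instead of a full joint table over all five Boolean variables (2^5 = 32 entries).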
Thank You!!
