No Name ID
01 Henok Tadesse NSR/0932/13
02 Melkamu Yitayih NSR/1179/13
03 Adise Adane NSR/0124/12
04 Birhan Ayenew NSR/0393/13
05 Tizazu Mekuant NSR/1641/13
Submission to Mr. Gebreyes G.
1. Introduction to Bayes Theorem in Machine Learning
Bayes theorem is also known with some other name such as Bayes rule or Bayes Law. Bayes
theorem helps to determine the probability of an event with random knowledge. It is used to
calculate the probability of occurring one event while other one already occurred. It is a best
method to relate the condition probability and marginal probability.
In simple words, we can say that Bayes theorem helps to produce more accurate results. Bayes theorem is used to estimate the precision of values and provides a method for calculating conditional probability. Although the calculation itself is simple, it lets us compute the conditional probability of events where intuition often fails. Some data scientists assume that Bayes theorem is used mostly in the financial industry, but that is not the case: besides finance, Bayes theorem is also applied extensively in health and medicine, research and surveys, the aeronautical sector, etc.
Bayes theorem is one of the most popular machine learning concepts; it helps to calculate the probability of one event occurring, under uncertain knowledge, given that another event has already occurred.
Bayes' theorem can be derived using the product rule and the conditional probability of event X given a known event Y.
According to the product rule, the probability of events X and Y occurring together can be expressed in two ways:
P(X ∩ Y) = P(X|Y) P(Y)
P(X ∩ Y) = P(Y|X) P(X)
Mathematically, Bayes theorem is obtained by equating the two right-hand sides and dividing by P(Y). We get:
P(X|Y) = P(Y|X) P(X) / P(Y)
Here, X plays the role of the hypothesis and Y plays the role of the evidence (the observed event).
P(X|Y) is called the posterior probability. It is the probability of the hypothesis X after the evidence Y has been observed.
P(Y|X) is called the likelihood. It is the probability of the evidence when the hypothesis is true.
P(X) is called the prior probability, the probability of the hypothesis before considering the evidence.
P(Y) is called the marginal probability. It is defined as the probability of the evidence under any consideration.
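To make the formula concrete, here is a minimal Python sketch of the computation; the probability values used are hypothetical and chosen only to show the arithmetic.

# Bayes' rule: posterior = likelihood * prior / evidence.
def bayes_posterior(likelihood, prior, evidence):
    """Return P(X|Y) given P(Y|X), P(X) and P(Y)."""
    return likelihood * prior / evidence

# Hypothetical numbers: P(Y|X) = 0.5, P(X) = 0.25, P(Y) = 0.5.
print(bayes_posterior(likelihood=0.5, prior=0.25, evidence=0.5))  # 0.25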
1. Experiment
An experiment is defined as a planned operation carried out under controlled conditions, such as tossing a coin, drawing a card, or rolling a die.
2. Sample Space
During an experiment, each result we can get is called a possible outcome, and the set of all possible outcomes of an experiment is known as the sample space. For example, if we are rolling a die, the sample space will be:
S1 = {1, 2, 3, 4, 5, 6}
Similarly, if our experiment consists of tossing a coin and recording its outcome, then the sample space will be:
S2 = {Head, Tail}
3. Event
An event is defined as a subset of the sample space of an experiment; in other words, it is a set of outcomes.
Assume that in our experiment of rolling a die there are two events A and B such that:
A = the event that an even number is obtained = {2, 4, 6}
B = the event that a number greater than 4 is obtained = {5, 6}
Probability of the event A, P(A) = Number of favourable outcomes / Total number of possible outcomes
P(A) = 3/6 = 1/2 = 0.5
Similarly, probability of the event B, P(B) = Number of favourable outcomes / Total number of possible outcomes
P(B) = 2/6 = 1/3 = 0.333
Union of events A and B:
A∪B = {2, 4, 5, 6}
Disjoint Event: If the intersection of the events A and B is an empty (null) set, then such events are known as disjoint events, also called mutually exclusive events.
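The following Python sketch (added for illustration, not part of the original text) represents the die-rolling sample space and the two events as sets and reproduces the probabilities, the union, and a disjointness check.

# Sample space and events for a single die roll, represented as Python sets.
sample_space = {1, 2, 3, 4, 5, 6}
A = {2, 4, 6}   # event A: an even number is obtained
B = {5, 6}      # event B: a number greater than 4 is obtained

# Probability of an event = favourable outcomes / total possible outcomes.
def probability(event):
    return len(event) / len(sample_space)

print(probability(A))   # 0.5
print(probability(B))   # 0.333...
print(A | B)            # union of A and B: {2, 4, 5, 6}
print(A & B == set())   # disjoint? False, because A and B share the outcome 6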
4. Random Variable:
A random variable is a real-valued function that maps the sample space of an experiment onto the real line. A random variable takes on various values, each with some probability. Strictly speaking, it is neither random nor a variable: it behaves as a function, and it can be discrete, continuous, or a combination of both.
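As a small added illustration, the sketch below treats a random variable for a coin toss as an explicit mapping from outcomes to real numbers; the particular values 0 and 1 are an arbitrary choice.

# A random variable viewed as a function from the sample space to the real line.
sample_space = ["Head", "Tail"]
X = {"Head": 1, "Tail": 0}            # X maps each outcome to a real value
prob = {"Head": 0.5, "Tail": 0.5}     # each outcome has some probability

# Probability that the random variable takes the value 1.
p_x_is_1 = sum(prob[o] for o in sample_space if X[o] == 1)
print(p_x_is_1)  # 0.5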
5. Exhaustive Event:
As the name suggests, a set of events in which at least one of the events must occur when the experiment is performed is called an exhaustive set of events of the experiment.
Thus, two events A and B are said to be exhaustive if either A or B must definitely occur; if in addition they cannot occur together, they are also mutually exclusive. For example, while tossing a coin the outcome is either a Head or a Tail, so these two events are exhaustive (and mutually exclusive).
6. Independent Event:
Two events are said to be independent when the occurrence of one event does not affect the occurrence of the other event. In simple words, the probability of the outcome of one event does not depend on the other.
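The short sketch below (an added illustration) checks independence numerically for two die-roll events by comparing P(A and C) with P(A) * P(C); the event C is chosen here only for the example.

# Two events are independent iff P(A and C) == P(A) * P(C).
sample_space = {1, 2, 3, 4, 5, 6}
A = {2, 4, 6}      # an even number is obtained
C = {1, 2, 3, 4}   # a number less than or equal to 4 is obtained

def probability(event):
    return len(event) / len(sample_space)

lhs = probability(A & C)               # P(A and C) = 2/6
rhs = probability(A) * probability(C)  # (3/6) * (4/6) = 2/6
print(abs(lhs - rhs) < 1e-12)          # True, so A and C are independent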
7. Conditional Probability:
Conditional probability is defined as the probability of an event A given that another event B has already occurred (i.e., A conditional on B). This is represented by P(A|B), and we can define it as:
P(A|B) = P(A ∩ B) / P(B), provided that P(B) > 0.
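Using the die-roll events from the earlier example, the conditional probability can be computed directly from outcome counts; the short sketch below is added for illustration.

# Conditional probability from counts: P(A|B) = P(A and B) / P(B).
sample_space = {1, 2, 3, 4, 5, 6}
A = {2, 4, 6}   # an even number is obtained
B = {5, 6}      # a number greater than 4 is obtained

p_B = len(B) / len(sample_space)            # 2/6
p_A_and_B = len(A & B) / len(sample_space)  # only the outcome 6, so 1/6
print(p_A_and_B / p_B)                      # P(A|B) = 0.5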
8. Marginal Probability:
Marginal probability is defined as the probability of an event irrespective of the outcome of another event. For two events A and B it can be computed using the law of total probability:
P(A) = P(A|B) P(B) + P(A|~B) P(~B)
Here ~B represents the event that B does not occur.
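A quick numeric illustration of this law of total probability; the probability values are hypothetical and only demonstrate the formula.

# Marginal probability via the law of total probability.
p_A_given_B, p_B = 0.7, 0.4        # hypothetical values
p_A_given_notB, p_notB = 0.2, 0.6
p_A = p_A_given_B * p_B + p_A_given_notB * p_notB
print(round(p_A, 2))  # 0.4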
2. Naïve Bayes Classifier in Machine Learning
The Naïve Bayes classifier is a popular supervised machine learning algorithm used for classification tasks such as text classification. It belongs to the family of generative learning algorithms, which means that it models the distribution of the inputs for a given class or category. This approach is based on the assumption that the features of the input data are conditionally independent given the class, allowing the algorithm to make predictions quickly and accurately. In simple terms, a Naive Bayes classifier assumes that the presence of one feature in a class is unrelated to the presence of any other feature.
In statistics, naive Bayes classifiers are considered simple probabilistic classifiers that apply Bayes' theorem. This theorem is based on the probability of a hypothesis, given the data and some prior knowledge. The naive Bayes classifier assumes that all features in the input data are independent of each other, which is often not true in real-world scenarios. However, despite this simplifying assumption, the naive Bayes classifier is widely used because of its efficiency and good performance in practice.
Moreover, it is worth noting that naive Bayes classifiers are among the simplest Bayesian network models, yet they can achieve high accuracy levels when coupled with kernel density estimation. This technique uses a kernel function to estimate the probability density function of the input data, allowing the classifier to improve its performance in complex scenarios where the data distribution is not well defined. As a result, the naive Bayes classifier is a powerful tool in machine learning, particularly in text classification, spam filtering, and sentiment analysis.
For example, a fruit may be considered to be an apple if it is red, round, and about 3 inches in diameter. Even if these features depend on each other or on the existence of the other features, all of these properties independently contribute to the probability that the fruit is an apple, and that is why the classifier is called "naive".
A Naive Bayes model is easy to build and particularly useful for very large data sets. Along with its simplicity, Naive Bayes is known to perform well even in comparison with highly sophisticated classification methods.
Bayes theorem provides a way of computing the posterior probability P(c|x) from P(c), P(x) and P(x|c):
P(c|x) = P(x|c) P(c) / P(x)
Here,
P(c|x) is the posterior probability of the class (c, the target) given the predictor (x, the attributes).
P(c) is the prior probability of the class.
P(x|c) is the likelihood, which is the probability of the predictor given the class.
P(x) is the prior probability of the predictor.
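When the predictor x consists of several attributes x1, ..., xn, the naive conditional-independence assumption lets the likelihood factor into a product over the attributes. The LaTeX snippet below writes out this standard form of the Naive Bayes formula; it is added here for clarity and is not taken verbatim from the original text.

% Naive Bayes formula under the conditional-independence assumption
P(c \mid x_1, \ldots, x_n)
  = \frac{P(c) \prod_{i=1}^{n} P(x_i \mid c)}{P(x_1, \ldots, x_n)}
  \propto P(c) \prod_{i=1}^{n} P(x_i \mid c)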
Let's understand it using an example. Consider a training data set of 14 weather observations and the corresponding target variable 'Play' (indicating whether the players played). Now, we need to classify whether the players will play or not based on the weather condition. Let's follow the steps below to perform it.
Step 1: Convert the data set into a frequency table of weather condition versus Play.
Step 2: Create a likelihood table by finding the probabilities, for example the probability of Overcast is 4/14 = 0.29 and the probability of playing (Yes) is 9/14 = 0.64.
Step 3: Use the Naive Bayes equation to calculate the posterior probability for each class; the class with the highest posterior probability is the outcome of the prediction.
Problem: Players will play if the weather is sunny. Is this statement correct?
Here we have P(Sunny|Yes) = 3/9 = 0.33, P(Sunny) = 5/14 = 0.36, and P(Yes) = 9/14 = 0.64.
Now, P(Yes|Sunny) = P(Sunny|Yes) * P(Yes) / P(Sunny) = 0.33 * 0.64 / 0.36 = 0.60.
For the other class, P(No|Sunny) = P(Sunny|No) * P(No) / P(Sunny), where P(Sunny|No) = 2/5 = 0.40 and P(No) = 5/14 = 0.36, which gives P(No|Sunny) = 0.40.
Since P(Yes|Sunny) is the higher posterior probability, the statement is correct: the players are predicted to play when the weather is sunny.
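The same calculation as a short Python sketch. It uses only the counts quoted above (9 Yes and 5 No out of 14 records, and 5 Sunny days of which 3 were Yes); it is an illustration, not part of the original worked example.

# Naive Bayes posterior for the single attribute "Sunny" in the weather example.
p_yes, p_no = 9 / 14, 5 / 14          # class priors
p_sunny = 5 / 14                      # evidence (marginal probability of Sunny)
p_sunny_given_yes = 3 / 9             # likelihoods
p_sunny_given_no = 2 / 5

p_yes_given_sunny = p_sunny_given_yes * p_yes / p_sunny   # ~0.60
p_no_given_sunny = p_sunny_given_no * p_no / p_sunny      # ~0.40
print("Play" if p_yes_given_sunny > p_no_given_sunny else "Do not play")  # Play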
The Naive Bayes classifier uses a similar method to predict the probability of each class based on various attributes. This algorithm is mostly used in text classification and in problems with multiple classes.
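To connect this with the text-classification use case mentioned above, here is a minimal sketch using scikit-learn's MultinomialNB; the library choice and the toy sentences and labels are assumptions made for illustration and are not part of the original text.

# Minimal Naive Bayes text classification sketch (illustrative toy data).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Tiny made-up training corpus with two classes.
texts = ["the players will play today", "a great match and a win",
         "it is sunny and hot outside", "rainy and overcast all day"]
labels = ["sport", "sport", "weather", "weather"]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(texts)      # bag-of-words count features
model = MultinomialNB().fit(X, labels)   # multinomial Naive Bayes classifier

print(model.predict(vectorizer.transform(["sunny day for a match"])))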