
Lokesh T00691325

The document contains a series of tasks related to machine learning, including predicting class labels using a Simple Bayesian Classifier, discussing the Leave-One-Out method for validation, explaining overfitting in inductive inference, calculating linear regression parameters, and solving the XOR problem using Support Vector Machines with a specific kernel. Each task provides detailed calculations and explanations, demonstrating various concepts in data mining and machine learning. The document serves as an examination paper for a machine learning course.


NAME: Lokesh Reddy Syamala

ML EXAM I Date: 10/12/2021

(For tasks 1, 4 and 5, explain/calculate every step and do not use libraries)

Task 1: (20 points) For the training set given below, predict the classification of the
following sample X = {2,1,1, Class =?}
using Simple Bayesian Classifier

Sample   A1   A2   A3   C
  1       1    2    1   1
  2       0    0    1   1
  3       2    1    2   2
  4       1    2    1   2
  5       0    1    2   1
  6       2    2    2   2
  7       1    0    1   1

# Training set in matrix form (rows are samples; columns are A1, A2, A3)
Sample = [[1,2,1],[0,0,1],[2,1,2],[1,2,1],[0,1,2],[2,2,2],[1,0,1]]
# Class labels C
Class = [1,1,2,2,1,2,1]
# N: number of samples
N = len(Sample)
# n1: number of samples with class 1
n1 = Class.count(1)
# p1: prior probability of class 1
p1 = n1/N
# n2: number of samples with class 2
n2 = Class.count(2)
# p2: prior probability of class 2
p2 = n2/N
# All the counts below are initialised to 1 to avoid zero probabilities
# Number of A1 attributes being 2 when class is 1
count1_A1_2 = 1
# Number of A2 attributes being 1 when class is 1
count1_A2_1 = 1
# Number of A3 attributes being 1 when class is 1
count1_A3_1 = 1
# Number of A1 attributes being 2 when class is 2
count2_A1_2 = 1
# Number of A2 attributes being 1 when class is 2
count2_A2_1 = 1
# Number of A3 attributes being 1 when class is 2
count2_A3_1 = 1
# Iterating through all the samples and updating the counts accordingly
for i in range(N):
    if Sample[i][0] == 2:
        if Class[i] == 1:
            count1_A1_2 += 1
        else:
            count2_A1_2 += 1
    if Sample[i][1] == 1:
        if Class[i] == 1:
            count1_A2_1 += 1
        else:
            count2_A2_1 += 1
    if Sample[i][2] == 1:
        if Class[i] == 1:
            count1_A3_1 += 1
        else:
            count2_A3_1 += 1

# Finding the probability of the class being 1 when Sample = {2,1,1}
result1 = (count1_A1_2/n1)*(count1_A2_1/n1)*(count1_A3_1/n1)*p1
# Finding the probability of the class being 2 when Sample = {2,1,1}
result2 = (count2_A1_2/n2)*(count2_A2_1/n2)*(count2_A3_1/n2)*p2
# Comparing the calculated probabilities and outputting the class
# corresponding to the larger probability
if result1 > result2:
    print("Predicted class is 1")
else:
    print("Predicted class is 2")

Output:
Predicted class is 2
Task 2: (20 points) In which situations would you recommend the Leave-One-Out method for validation of data mining results?

The leave-one-out approach is essentially n-fold cross-validation, where n is the number of instances in the dataset. It is recommended in two situations: when other validation schemes do not give reliable results, and when we want to use as much of the data as possible for training. Each instance is held out one at a time, and the learning scheme is trained on the remaining instances.
The model is assessed by its accuracy on the held-out instance, scored 1 or 0 for success or failure. The final error estimate is the average of all n evaluations, one for each member of the dataset.
The approach has two advantages. First, the largest amount of data feasible is used for training in each iteration, which presumably enhances the likelihood that the classifier will be correct. Second, the procedure is deterministic, with no random sampling, so it is pointless to repeat it: the result will be the same each time.
Set against this is the high computing cost, as the entire learning procedure must be repeated n times, which is typically impractical for large datasets. Nonetheless, leave-one-out offers the best chance of getting the most out of a small dataset and obtaining the most accurate estimate possible.
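The procedure described above can be sketched in a few lines of plain Python. This is a minimal illustration on an assumed toy dataset, using a simple 1-nearest-neighbour rule as the learning scheme so that no libraries are needed; any classifier could take its place.

```python
# Leave-one-out cross-validation sketch: hold out each instance in turn,
# train on the rest, score 1 or 0, and average the n results.

def nearest_neighbour_predict(train_X, train_y, x):
    """Predict the class of x as the class of its closest training point."""
    best_i = min(range(len(train_X)),
                 key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)))
    return train_y[best_i]

def leave_one_out_error(X, y):
    """Average 0/1 error over all n held-out instances."""
    n = len(X)
    errors = 0
    for i in range(n):
        train_X = X[:i] + X[i+1:]   # all instances except the i-th
        train_y = y[:i] + y[i+1:]
        if nearest_neighbour_predict(train_X, train_y, X[i]) != y[i]:
            errors += 1
    return errors / n

# Assumed toy dataset, for illustration only
X = [[1, 2], [0, 0], [2, 1], [1, 2], [0, 1], [2, 2], [1, 0]]
y = [1, 1, 2, 2, 1, 2, 1]
print(leave_one_out_error(X, y))
```

Note that the loop runs the full training procedure n times, which is exactly the computational cost objection raised above.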

Task 3: (20 points) What is meant by the term overfitting in the context of inductive
inference? Give example(s) and solution(s)

The process of reaching a general conclusion from specific examples is known as inductive inference. Overfitting occurs when a model learns the detail and noise in the training data to the point where this degrades the model's performance on fresh data. In other words, the model picks up noise and random fluctuations in the training data and learns them as if they were concepts.
For example, assume a model is trained on a dataset of 1000 skills that customers will pick, together with the outcomes based on the information they supply. On the data it was trained with, the model has a 99% accuracy rate, but when it is run on a completely new dataset, only 50% accuracy is observed. The model is unable to generalize beyond its training data; this is a case of overfitting.
To avoid this issue, it is usually preferable to cross-validate. In cross-validation, the original training data is used to produce multiple splits, and the model is then tuned using these splits. In other circumstances, more data and cleaner samples can also be used.
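The train/test accuracy gap described above can be made concrete with a small sketch (a toy setup assumed for illustration, not the 1000-skill example itself): a "model" that simply memorizes its training pairs scores perfectly on the data it has seen but no better than guessing on fresh data whose labels are pure noise.

```python
import random

random.seed(0)
# Labels are random noise, so there is no real concept to learn.
train = [(i, random.choice([0, 1])) for i in range(20)]
test = [(i, random.choice([0, 1])) for i in range(20, 40)]

memory = dict(train)   # the "model": a lookup table of the training data

def predict(x):
    # Recall the memorized label; guess 0 for inputs never seen before.
    return memory.get(x, 0)

train_acc = sum(predict(x) == y for x, y in train) / len(train)
test_acc = sum(predict(x) == y for x, y in test) / len(test)
print(train_acc, test_acc)  # training accuracy is 1.0; test accuracy near chance
```

The perfect training score is exactly the symptom of overfitting: the model has learned the noise itself, which cannot generalize.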
Task 4 : ( 20 points) Given the data set with two dimensions X and Y:

X Y

1 4
4 2
3 3
5 2

Use a linear regression method to calculate the parameters α and β where y = α + βx.


(calculate every step and not using libraries)

X − Mx    Y − My    (X − Mx)²    (X − Mx)(Y − My)
 -2.25      1.25      5.0625        -2.8125
  0.75     -0.75      0.5625        -0.5625
 -0.25      0.25      0.0625        -0.0625
  1.75     -0.75      3.0625        -1.3125

                      SS = 8.75     SP = -4.75

Σx = 13; Σy = 11; Mean of X (Mx) = 3.25; Mean of Y (My) = 2.75; SS = 8.75; SP = -4.75

β = SP / SS = -4.75 / 8.75 = -0.54
α = My − β·Mx = 2.75 − (−0.54 × 3.25) = 4.51
y = 4.51 − 0.54x
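The arithmetic above can be checked step by step with built-ins only, following the same SS/SP formulation:

```python
# Verify the hand calculation: means, SS, SP, then slope and intercept.
X = [1, 4, 3, 5]
Y = [4, 2, 3, 2]
n = len(X)
Mx = sum(X) / n        # mean of X = 3.25
My = sum(Y) / n        # mean of Y = 2.75
SS = sum((x - Mx) ** 2 for x in X)                    # sum of squares = 8.75
SP = sum((x - Mx) * (y - My) for x, y in zip(X, Y))   # sum of products = -4.75
beta = SP / SS                 # slope, about -0.54
alpha = My - beta * Mx         # intercept, about 4.51
print(round(alpha, 2), round(beta, 2))  # 4.51 -0.54
```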
Task 5: Support Vector Machines (SVM). The Mercer kernel used to solve the XOR
problem is given by K(xi, xj) = (1 + xiᵀxj)^p. What is the smallest positive integer p for which
the XOR problem is solved? Show the kernel and XOR problem solution using SVM (20
points)

Let the two-dimensional vectors be xi = [xi1, xi2] and xj = [xj1, xj2].

Here, K(xi, xj) = (1 + xiᵀxj)^p.
With p = 1 the kernel is linear, and a linear decision boundary cannot separate XOR, so try p = 2.
Let the kernel K(xi, xj) = (1 + xiᵀxj)²
We should show K(xi, xj) = Φ(xi)ᵀΦ(xj).
Therefore,
K(xi, xj) = (1 + xi1·xj1 + xi2·xj2)²

= 1 + xi1²xj1² + 2·xi1xj1·xi2xj2 + xi2²xj2² + 2·xi1xj1 + 2·xi2xj2

= Φ(xi)ᵀΦ(xj), where Φ(x) = [1, x1², √2·x1x2, x2², √2·x1, √2·x2]ᵀ

In this six-dimensional feature space the XOR points become linearly separable. Hence, the smallest positive integer p that solves the XOR problem is 2.
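The expansion above can be verified numerically: the explicit feature map Φ reproduces the kernel for every pair of XOR points, and the √2·x1·x2 coordinate alone already separates the two classes. This sketch assumes the common ±1 encoding of the XOR inputs.

```python
import math

def phi(x):
    """Explicit feature map for the p = 2 polynomial kernel."""
    x1, x2 = x
    return [1, x1**2, math.sqrt(2)*x1*x2, x2**2,
            math.sqrt(2)*x1, math.sqrt(2)*x2]

def kernel(xi, xj, p=2):
    """K(xi, xj) = (1 + xi.xj)^p for two-dimensional inputs."""
    return (1 + xi[0]*xj[0] + xi[1]*xj[1]) ** p

points = [(-1, -1), (-1, 1), (1, -1), (1, 1)]
labels = [-1, 1, 1, -1]   # XOR with +/-1 encoding (assumed)

# Phi(xi).Phi(xj) equals the kernel value for every pair of XOR points
for a in points:
    for b in points:
        dot = sum(u*v for u, v in zip(phi(a), phi(b)))
        assert abs(dot - kernel(a, b)) < 1e-9

# The sqrt(2)*x1*x2 coordinate takes opposite signs on the two XOR classes,
# so a linear boundary in feature space separates them.
for x, y in zip(points, labels):
    print(x, y, phi(x)[2])
```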
