0% found this document useful (0 votes)

33 views

Assignment II Machine Learning

The document describes an assignment for a machine learning course involving support vector machines (SVM). It includes: 1) A description of the SVM algorithm and how it works. 2) Examples of preprocessing a student performance dataset in Python, including cleaning, aggregating and transforming the data. 3) An example Python code to build an SVM classification model on a social network advertising dataset, including data preprocessing, training and evaluating the model.

Uploaded by

Hussein Ibrahim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views

Assignment II Machine Learning

Uploaded by

Hussein Ibrahim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

SCHOOL OF TECHNOLOGY

BACHELOR OF INFORMATION SECURITY AND FORENSICS &

BACHELOR OF SOFTWARE DEVELOPMENT & BACHELOR IN INFORMATION
FORENSICS AND SECURITY
MACHINE LEARNING
JANUARY-APRIL 2023
ASSIGNMENT II

MEMBERS.
Ibrahim Hussein 19/05592 BISF
Moses Kipngeno 19/05914 BISF
Everlyne Nelius Irungu 19/05463 BISF
Alice Njeri Kuria 19/05790 BISF
Collins Njoroge 19/02573 BISF
ACTIVITY
1. Describe the Support Vector Machine algorithm.

Support Vector Machine (SVM) is a powerful machine learning algorithm used for
classification and regression tasks.
It works by finding the best hyper plane that separates the data points into different
classes in a high-dimensional space.
The SVM algorithm works through:
i. Data preprocessing: the input data is first preprocessed to ensure that it is in a suitable
format for Support Vector Machine. It may include scaling, normalization and other
transformations to ensure that the data is centered and the features are on similar
scales.
ii. Feature mapping: SVM maps the input data into a higher dimensional space using a
kernel function. This helps find a hyper plane that can effectively separate the data
points given.
iii. Hyper plane selection: SVM then searches for the optimal hyper plane that separates
the data points with maximum margin. The margin is (the distance between the hyper
plane and the closest data points from each class). The larger the margin, the more
confident the algorithm is about its classification.
iv. Support vector identification: The data points closest to the hyper plane on each side
are known as support vectors. These support vectors determine the position of the
hyper plane and are used to calculate the margin.
v. Classification: Once the optimal hyper plane is found, SVM uses it to classify new
data points based on which side of the hyper plane they fall on. If the data point falls
on the positive side of the hyper plane, it is classified as one class, and if it falls on
the negative side, it is classified as the other class.

SVM can therefore handle both linear and non-linearly separable data by using different
kernel functions. Kernel functions used in SVM include linear, polynomial, radial basis
function (RBF), and sigmoid.
SVM is a powerful algorithm for classification tasks and can handle high dimensional
datasets with complex decision boundaries as seen above.
SVM disadvantage is that it’s still not suitable for large datasets because of its high
training time.
2. Preprocess a selected dataset
Data preprocessing is the process of preparing the raw data and making it suitable for machine
learning models. Data preprocessing includes data cleaning for making the data ready to be given
to machine learning model
Below is a dataset containing student performances. We apply various data preprocessing
commands to the dataset as shown below.
import pandas as pd
import numpy as np

#read csv
df_excel = pd.read_csv('StudentsPerformance.csv')
df_excel

#first look
df_excel.describe()

#calculate specific columns

df_excel['math score'].sum()
df_excel['math score'].mean()
df_excel['math score'].max()
df_excel['math score'].min()
df_excel['math score'].count()

#calculate specific rows

df_excel['average'] = (df_excel['math score'] + df_excel['reading score']

+ df_excel['writing score'])/3
df_excel.mean(axis=1)
df_excel.head()

# count
df_excel['gender'].value_counts()

# if condition
df_excel['pass/fail'] = np.where(df_excel['average'] > 70, 'Pass', 'Fail')
df_excel.head()

# multiple conditions
conditions = [
(df_excel['average']>=90),
(df_excel['average']>=80) & (df_excel['average']<90),
(df_excel['average']>=70) & (df_excel['average']<80),
(df_excel['average']>=60) & (df_excel['average']<70),
(df_excel['average']>=50) & (df_excel['average']<60),
(df_excel['average']<50),
]

values = ['A', 'B', 'C', 'D', 'E', 'F']

df_excel['grades'] = np.select(conditions, values)
df_excel.head()

# show first 5 rows

df_excel[['average', 'pass/fail', 'grades']].head()
3. Using an example in Python and a sample dataset build an SVM model.

# Support Vector Machine

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the datasets

datasets = pd.read_csv('Social_Network_Ads.csv')
X = datasets.iloc[:, [2,3]].values
Y = datasets.iloc[:, 4].values

# Splitting the dataset into the Training set and Test set

from sklearn.model_selection import train_test_split

X_Train, X_Test, Y_Train, Y_Test = train_test_split(X, Y, test_size = 0.25,
random_state = 0)

# Feature Scaling

from sklearn.preprocessing import StandardScaler

sc_X = StandardScaler()
X_Train = sc_X.fit_transform(X_Train)
X_Test = sc_X.transform(X_Test)

# Fitting the classifier into the Training set

from sklearn.svm import SVC

classifier = SVC(kernel = 'linear', random_state = 0)
classifier.fit(X_Train, Y_Train)

# Predicting the test set results

Y_Pred = classifier.predict(X_Test)

# Making the Confusion Matrix

from sklearn.metrics import confusion_matrix

cm = confusion_matrix(Y_Test, Y_Pred)

# Visualising the Training set results

from matplotlib.colors import ListedColormap

X_Set, Y_Set = X_Train, Y_Train
X1, X2 = np.meshgrid(np.arange(start = X_Set[:, 0].min() - 1, stop = X_Set[:,
0].max() + 1, step = 0.01),
np.arange(start = X_Set[:, 1].min() - 1, stop = X_Set[:,
1].max() + 1, step = 0.01))
plt.contourf(X1, X2, classifier.predict(np.array([X1.ravel(),
X2.ravel()]).T).reshape(X1.shape),
alpha = 0.75, cmap = ListedColormap(('red', 'green')))
plt.xlim(X1.min(), X1.max())
plt.ylim(X2.min(), X2.max())
for i, j in enumerate(np.unique(Y_Set)):
plt.scatter(X_Set[Y_Set == j, 0], X_Set[Y_Set == j, 1],
c = ListedColormap(('red', 'green'))(i), label = j)
plt.title('Support Vector Machine (Training set)')
plt.xlabel('Age')
plt.ylabel('Estimated Salary')
plt.legend()
plt.show()

# Visualising the Test set results

from matplotlib.colors import ListedColormap

X_Set, Y_Set = X_Test, Y_Test
X1, X2 = np.meshgrid(np.arange(start = X_Set[:, 0].min() - 1, stop = X_Set[:,
0].max() + 1, step = 0.01),
np.arange(start = X_Set[:, 1].min() - 1, stop = X_Set[:,
1].max() + 1, step = 0.01))
plt.contourf(X1, X2, classifier.predict(np.array([X1.ravel(),
X2.ravel()]).T).reshape(X1.shape),
alpha = 0.75, cmap = ListedColormap(('red', 'green')))
plt.xlim(X1.min(), X1.max())
plt.ylim(X2.min(), X2.max())
for i, j in enumerate(np.unique(Y_Set)):
plt.scatter(X_Set[Y_Set == j, 0], X_Set[Y_Set == j, 1],
c = ListedColormap(('red', 'green'))(i), label = j)
plt.title('Support Vector Machine (Test set)')
plt.xlabel('Age')
plt.ylabel('Estimated Salary')
plt.legend()
plt.show()

Form 5 Science Notes - Hamidi Yusoff
No ratings yet
Form 5 Science Notes - Hamidi Yusoff
151 pages
SVM Using Python
No ratings yet
SVM Using Python
24 pages
UNIT-II-Support Vector Machine Algorithm
No ratings yet
UNIT-II-Support Vector Machine Algorithm
13 pages
B24 ML Exp-3
No ratings yet
B24 ML Exp-3
10 pages
UNIT 3 AAM
No ratings yet
UNIT 3 AAM
30 pages
Understanding Support Vector Machine Algorithm From Examples
No ratings yet
Understanding Support Vector Machine Algorithm From Examples
10 pages
Aim of The Experiment-Software Required - Theory
No ratings yet
Aim of The Experiment-Software Required - Theory
6 pages
06 Support - Vector - Machine
No ratings yet
06 Support - Vector - Machine
8 pages
MLT_07
No ratings yet
MLT_07
8 pages
SVM7
No ratings yet
SVM7
53 pages
ML W8 Merged
No ratings yet
ML W8 Merged
27 pages
SVM Implementation
No ratings yet
SVM Implementation
8 pages
Prediction On Iris
No ratings yet
Prediction On Iris
14 pages
SVM Unit 2
No ratings yet
SVM Unit 2
12 pages
svmdoc
No ratings yet
svmdoc
7 pages
Title: Implement Support Vector Machine Classifier: Department of Computer Science and Engineering
No ratings yet
Title: Implement Support Vector Machine Classifier: Department of Computer Science and Engineering
5 pages
What Is Support Vector Machine
No ratings yet
What Is Support Vector Machine
13 pages
Lab 6 Dsa
No ratings yet
Lab 6 Dsa
15 pages
A Introduction To SVM PDF
No ratings yet
A Introduction To SVM PDF
48 pages
ML Assignment-8
No ratings yet
ML Assignment-8
3 pages
Presented By: M. Saqib Iqbal Gull Muhammad Presented To: Mr. Imran Ali Khan Artificial Intelligence National College of Bussiness Administration & Economics Multan
No ratings yet
Presented By: M. Saqib Iqbal Gull Muhammad Presented To: Mr. Imran Ali Khan Artificial Intelligence National College of Bussiness Administration & Economics Multan
11 pages
This Is
No ratings yet
This Is
7 pages
Support Vector Machine
No ratings yet
Support Vector Machine
9 pages
Support Vactor Machine Final
No ratings yet
Support Vactor Machine Final
11 pages
SVM LAB.7
No ratings yet
SVM LAB.7
4 pages
Exp 5
No ratings yet
Exp 5
14 pages
Unit2 notes What is a Support Vector Machine
No ratings yet
Unit2 notes What is a Support Vector Machine
11 pages
classification
No ratings yet
classification
4 pages
ML-Lecture-14-SVM
No ratings yet
ML-Lecture-14-SVM
15 pages
Support Vector Machin, An Excellent Tool
No ratings yet
Support Vector Machin, An Excellent Tool
36 pages
Support Vector Machine
No ratings yet
Support Vector Machine
52 pages
ML5&6&7&8&9&10
No ratings yet
ML5&6&7&8&9&10
35 pages
SVM
No ratings yet
SVM
11 pages
ML Practical 3
No ratings yet
ML Practical 3
5 pages
Detailed SVM Presentation
No ratings yet
Detailed SVM Presentation
15 pages
AP for NLP-LO2
No ratings yet
AP for NLP-LO2
38 pages
Lab Program (SVM From Scratch)
No ratings yet
Lab Program (SVM From Scratch)
2 pages
ML Unit 3
No ratings yet
ML Unit 3
14 pages
Support Vector Machine
No ratings yet
Support Vector Machine
9 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
UNIT - 2-1
No ratings yet
UNIT - 2-1
7 pages
Homework 2: SVM, Kernel Methods, Ensemble Learning, Learning Theory
No ratings yet
Homework 2: SVM, Kernel Methods, Ensemble Learning, Learning Theory
12 pages
UNIT - 2
No ratings yet
UNIT - 2
15 pages
Deep Learning
No ratings yet
Deep Learning
25 pages
UNIT 1,2,3
No ratings yet
UNIT 1,2,3
17 pages
Classification Review
No ratings yet
Classification Review
8 pages
SVM
No ratings yet
SVM
9 pages
svm using iris dataset by hyparlink
No ratings yet
svm using iris dataset by hyparlink
19 pages
Purva Rawale Prcatical 4 BDA
No ratings yet
Purva Rawale Prcatical 4 BDA
6 pages
Ann Unit III
No ratings yet
Ann Unit III
20 pages
Understanding Support Vector Machine Algorithm From Examples Along With Code
No ratings yet
Understanding Support Vector Machine Algorithm From Examples Along With Code
11 pages
Machine Learning
No ratings yet
Machine Learning
32 pages
Experiment # 10
No ratings yet
Experiment # 10
10 pages
PML Lab Exp 10
No ratings yet
PML Lab Exp 10
3 pages
SVM Experimentxtended
No ratings yet
SVM Experimentxtended
3 pages
SVM
No ratings yet
SVM
40 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
22 pages
CSL0777 L23
No ratings yet
CSL0777 L23
39 pages
Unit II 2.2 ML Kernel Machines SVM
No ratings yet
Unit II 2.2 ML Kernel Machines SVM
50 pages
Support Vector Machine: Fundamentals and Applications
From Everand
Support Vector Machine: Fundamentals and Applications
Fouad Sabry
No ratings yet
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Year 11 Gcse Literature & Language Revision Booklet
No ratings yet
Year 11 Gcse Literature & Language Revision Booklet
35 pages
PDF
No ratings yet
PDF
2 pages
UNISA 2.5_Integrated questions_Income tax_updated
No ratings yet
UNISA 2.5_Integrated questions_Income tax_updated
6 pages
Guidelines For Seismic Evaluation and Design of Petrochemical Facilities
No ratings yet
Guidelines For Seismic Evaluation and Design of Petrochemical Facilities
10 pages
What Is Diction - Learn 8 Different Types of Diction in Writing With Examples - 2022 - MasterClass
No ratings yet
What Is Diction - Learn 8 Different Types of Diction in Writing With Examples - 2022 - MasterClass
9 pages
IGCSE Geography Notes On Population
No ratings yet
IGCSE Geography Notes On Population
10 pages
Concepts of Political Science
100% (2)
Concepts of Political Science
57 pages
BS 8544_2013
No ratings yet
BS 8544_2013
104 pages
Frank O Gehry
No ratings yet
Frank O Gehry
14 pages
Bootstrap
No ratings yet
Bootstrap
90 pages
CUETApplicationForm 223511117036
No ratings yet
CUETApplicationForm 223511117036
1 page
Harpsichord User Manual v1.0.1
No ratings yet
Harpsichord User Manual v1.0.1
12 pages
Chapter 4-Plumbing Fixtures
No ratings yet
Chapter 4-Plumbing Fixtures
22 pages
0 Tejaswi Intership
No ratings yet
0 Tejaswi Intership
11 pages
MVC Architecture in OAF
No ratings yet
MVC Architecture in OAF
2 pages
Nonverbalmessages 2.0
No ratings yet
Nonverbalmessages 2.0
21 pages
Resume
No ratings yet
Resume
3 pages
BF39STP Service Manual
No ratings yet
BF39STP Service Manual
18 pages
Run Jump Throw Assessment
No ratings yet
Run Jump Throw Assessment
31 pages
Assigment 2a
No ratings yet
Assigment 2a
9 pages
Sealweld Catalogue CAD PDF
No ratings yet
Sealweld Catalogue CAD PDF
79 pages
Eligibility Criteria 2024
No ratings yet
Eligibility Criteria 2024
20 pages
Introduction To Vertical Roller Mill
No ratings yet
Introduction To Vertical Roller Mill
35 pages
Reindl PDF
No ratings yet
Reindl PDF
6 pages
Commerce Project
100% (3)
Commerce Project
20 pages
Unit 3 Teaching Grammar
No ratings yet
Unit 3 Teaching Grammar
53 pages
Kil Flam
No ratings yet
Kil Flam
2 pages
TABLET RETRIEVAL CHECKLIST Mwaa
No ratings yet
TABLET RETRIEVAL CHECKLIST Mwaa
2 pages
Gamasutra - 10 Years of Behavioral Game Design With Bungie's Research Boss2
No ratings yet
Gamasutra - 10 Years of Behavioral Game Design With Bungie's Research Boss2
8 pages

Assignment II Machine Learning

Uploaded by

Assignment II Machine Learning

Uploaded by

SCHOOL OF TECHNOLOGY

BACHELOR OF INFORMATION SECURITY AND FORENSICS &

#calculate specific columns

#calculate specific rows

df_excel['average'] = (df_excel['math score'] + df_excel['reading score']

values = ['A', 'B', 'C', 'D', 'E', 'F']

# show first 5 rows

# Support Vector Machine

# Importing the datasets

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler

# Fitting the classifier into the Training set

from sklearn.svm import SVC

# Predicting the test set results

# Making the Confusion Matrix

from sklearn.metrics import confusion_matrix

# Visualising the Training set results

from matplotlib.colors import ListedColormap

# Visualising the Test set results

from matplotlib.colors import ListedColormap

You might also like