ML Lab Manual (CSE)
LAB MANUAL
MACHINE LEARNING
Contents
I. Course Outcomes
II. Syllabus
CO   Course Outcomes
CO1  Compare machine learning algorithms based on their advantages and limitations, and choose the best one for a given situation.
PROGRAM OUTCOMES (POs): CO-PO/PSO mapping (columns PO1-PO12, then PSO i-ii)

      PO1   PO2   PO3   PO4   PO5   PO6   PO7   PO8   PO9   PO10  PO11  PO12  i     ii
CO1   3     3     2     3     3     2     1     2     2     0     1     3     3     3
CO2   2     3     2     3     3     2     1     2     2     0     1     3     2     2
CO3   3     2     3     3     2     1     2     2     0     1     3     2     2     3
CO4   3     2     3     3     2     1     2     2     0     1     3     2     2     2
CO5   3     2     3     3     2     1     2     2     0     1     3     2     3     2
CO6   3     2     3     3     2     1     2     2     0     1     3     2     2     3
Avg   2.83  2.3   2.6   3     2.3   1.3   1.6   2     0.6   0.6   2.3   2.3   2.3   2.5
Machine Learning Lab Manual
Course Objective: The objective of this lab is to give an overview of the various
machine learning techniques and to demonstrate them using Python.
Course Outcomes: After completing the course, the student will be able to attain the outcomes listed in the Course Outcomes table above.
List of Experiments
1. The probability that it is Friday and that a student is absent is 3%. Since there are
5 school days in a week, the probability that it is Friday is 20%. What is the
probability that a student is absent given that today is Friday? Apply Bayes' rule in
Python to get the result. (Ans: 15%)
4. Given the following data, which specifies classifications for nine combinations of
VAR1 and VAR2, predict a classification for a case where VAR1=0.906 and
VAR2=0.606, using the result of k-means clustering with 3 means (i.e., 3 centroids).
1. The probability that it is Friday and that a student is absent is 3%. Since there are 5 school
days in a week, the probability that it is Friday is 20%. What is the probability that a student
is absent given that today is Friday? Apply Bayes' rule in Python to get the result. (Ans:
15%)
probAbsentFriday = 0.03
probFriday = 0.2
# Bayes' rule: P(Absent|Friday) = P(Friday|Absent) * P(Absent) / P(Friday)
# Since P(Friday|Absent) * P(Absent) = P(Friday ∩ Absent) = 0.03,
# this reduces to P(Absent|Friday) = P(Friday ∩ Absent) / P(Friday)
bayesResult = probAbsentFriday / probFriday
print(bayesResult * 100)
Output: 15.0
Python Select from MySQL Table
You’ll learn the following MySQL SELECT operations from Python, using a MySQL driver
module such as MySQL Connector/Python or PyMySQL (the example below uses PyMySQL).
• Execute the SELECT query and process the result set returned by the query in
Python.
• Use Python variables in a where clause of a SELECT query to pass dynamic
values.
• Use fetchall(), fetchmany(), and fetchone() methods of a cursor class to fetch all or
limited rows from a table.
Next, prepare a SQL SELECT query to fetch rows from a table. You can select all
rows or a limited set of rows, as required. If a WHERE condition is used, it
decides which rows are fetched.
For example, SELECT col1, col2, …, colN FROM MySQL_table WHERE id = 10;
returns the row whose id column equals 10 (not the tenth row).
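A minimal sketch of passing a Python variable into a WHERE clause, assuming a hypothetical students table in the College database used later in this section:

import pymysql

conn = pymysql.connect(host='localhost', user='root', password='pass', db='College')
cur = conn.cursor()
# Pass the dynamic value as a query parameter (safer than string formatting)
student_id = 10
cur.execute("SELECT name, branch FROM students WHERE id = %s", (student_id,))
print(cur.fetchone())  # fetchone() returns a single row, or None if no match
conn.close()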
Iterate over the row list using a for loop and access each row individually
(access each row's column data using a column name or an index number).
import pymysql

def mysqlconnect():
    # Connect to the MySQL database
    conn = pymysql.connect(
        host='localhost',
        user='root',
        password="pass",
        db='College',
    )
    cur = conn.cursor()
    # Run a simple query and fetch the whole result set
    cur.execute("select @@version")
    output = cur.fetchall()
    print(output)
    conn.close()

mysqlconnect()
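A sketch of the row iteration described above, again assuming a hypothetical students table:

import pymysql

conn = pymysql.connect(host='localhost', user='root', password='pass', db='College')
cur = conn.cursor()
cur.execute("SELECT id, name, branch FROM students")
for row in cur.fetchall():
    # With the default cursor, each row is a tuple indexed by column position
    print(row[0], row[1], row[2])
conn.close()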
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Loading data and splitting it into train and test sets
irisData = load_iris()
X_train, X_test, y_train, y_test = train_test_split(irisData.data, irisData.target, random_state=42)
knn = KNeighborsClassifier(n_neighbors=7)
knn.fit(X_train, y_train)
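A quick check of the fitted model, continuing from the code above (a sketch; the exact numbers depend on the train/test split):

# Predict classes for a few held-out samples and report the mean accuracy
print(knn.predict(X_test[:5]))
print(knn.score(X_test, y_test))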
4. Given the following data, which specifies classifications for nine combinations of
VAR1 and VAR2, predict a classification for a case where VAR1=0.906 and
VAR2=0.606, using the result of k-means clustering with 3 means (i.e., 3 centroids).
import numpy as np
from sklearn.cluster import KMeans

# Known classifications for the nine (VAR1, VAR2) combinations
y = np.array([0, 1, 1, 0, 1, 0, 1, 1, 1])
# X holds the nine (VAR1, VAR2) pairs from the data table; see the sketch below
kmeans = KMeans(n_clusters=3).fit(X)
kmeans.predict([[0.906, 0.606]])
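The manual's data table did not survive extraction here, so the following is a self-contained sketch. The nine (VAR1, VAR2) pairs are assumptions taken from the version of this exercise commonly circulated; they are consistent with the y labels above, but substitute your own table if it differs:

import numpy as np
from sklearn.cluster import KMeans

# Assumed (VAR1, VAR2) values for the nine cases
X = np.array([[1.713, 1.586], [0.180, 1.786], [0.353, 1.240],
              [0.940, 1.566], [1.486, 0.759], [1.266, 1.106],
              [1.540, 0.419], [0.459, 1.799], [0.773, 0.186]])

# Group the nine points around 3 centroids
kmeans = KMeans(n_clusters=3, random_state=0).fit(X)
# Assign the new case to its nearest centroid
print(kmeans.predict([[0.906, 0.606]]))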
5. The following training examples map descriptions of individuals onto high, medium and
low credit-worthiness.
The input attributes are (from left to right) income, recreation, job, status, age-group and
home-owner. Find the unconditional probability of 'golf' and the conditional probability of
'single' given 'medRisk' in the dataset.
totalRecords = 10
numberGolfRecreation = 4
# Unconditional probability of 'golf'
probGolf = numberGolfRecreation / totalRecords
print("Unconditional probability of golf:", probGolf)
# Bayes' rule: P(single|medRisk) = P(medRisk|single) * P(single) / P(medRisk),
# which reduces to P(medRisk ∩ single) / P(medRisk)
numberMedRiskSingle = 2
numberMedRisk = 3
probMedRiskSingle = numberMedRiskSingle / totalRecords
probMedRisk = numberMedRisk / totalRecords
conditionalProbability = probMedRiskSingle / probMedRisk
print("Conditional probability of single given medRisk:", conditionalProbability)
Output:
Unconditional probability of golf: 0.4
Conditional probability of single given medRisk: 0.6666666666666667
Regression
What Is Regression?
Regression analysis is one of the most important fields in statistics and machine learning. There
are many regression methods available. Linear regression is one of them.
For example, you can observe several employees of some company and try to understand how
their salaries depend on the features, such as experience, level of education, role, city they work
in, and so on.
This is a regression problem where data related to each employee represent one observation. The
presumption is that the experience, education, role, and city are the independent features, while
the salary depends on them.
Generally, in regression analysis, you consider some phenomenon of interest and have a
number of observations. Each observation has two or more features. Following the assumption
that (at least) one of the features depends on the others, you try to establish a relation among them.
In other words, you need to find a function that maps some features or variables to others sufficiently well.
The dependent features are called the dependent variables, outputs, or responses.
The independent features are called the independent variables, inputs, or predictors.
Linear Regression
Linear regression is probably one of the most important and widely used regression techniques.
It’s among the simplest regression methods. One of its main advantages is the ease of
interpreting results.
When implementing linear regression of some dependent variable 𝑦 on the set of independent
variables 𝐱 = (𝑥₁, …, 𝑥ᵣ), where 𝑟 is the number of predictors, you assume a linear relationship
between 𝑦 and 𝐱: 𝑦 = 𝛽₀ + 𝛽₁𝑥₁ + ⋯ + 𝛽ᵣ𝑥ᵣ + 𝜀. This equation is the regression equation. 𝛽₀, 𝛽₁,
…, 𝛽ᵣ are the regression coefficients, and 𝜀 is the random error.
Linear regression calculates the estimators of the regression coefficients or simply the predicted
weights, denoted with 𝑏₀, 𝑏₁, …, 𝑏ᵣ. They define the estimated regression function 𝑓(𝐱) = 𝑏₀ +
𝑏₁𝑥₁ + ⋯ + 𝑏ᵣ𝑥ᵣ. This function should capture the dependencies between the inputs and output
sufficiently well.
When implementing simple linear regression, you typically start with a given set of input-output
(𝑥-𝑦) pairs (green circles). These pairs are your observations. For example, the leftmost
observation (green circle) has the input 𝑥 = 5 and the actual output (response) 𝑦 = 5. The next
one has 𝑥 = 15 and 𝑦 = 20, and so on.
The estimated regression function (black line) has the equation 𝑓(𝑥) = 𝑏₀ + 𝑏₁𝑥. Your goal is to
calculate the optimal values of the predicted weights 𝑏₀ and 𝑏₁ that minimize the sum of squared
residuals (SSR) and determine the estimated regression function. The value of 𝑏₀, also called the
intercept, shows the point where the estimated regression line crosses the 𝑦 axis. It is the value of
the estimated response 𝑓(𝑥) for 𝑥 = 0. The value of 𝑏₁ determines the slope of the estimated regression line.
The predicted responses (red squares) are the points on the regression line that correspond to the
input values. For example, for the input 𝑥 = 5, the predicted response is 𝑓(5) = 8.33 (represented
with the leftmost red square).
The residuals (vertical dashed gray lines) can be calculated as 𝑦ᵢ - 𝑓(𝐱ᵢ) = 𝑦ᵢ - 𝑏₀ - 𝑏₁𝑥ᵢ for 𝑖 = 1,
…, 𝑛. They are the distances between the green circles and red squares. When you implement
linear regression, you are actually trying to minimize these distances and make the red squares as
close to the predefined green circles as possible.
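For simple linear regression, minimizing the SSR has a closed-form solution. With 𝑥̄ and 𝑦̄ denoting the means of the observed inputs and outputs, the optimal weights are
𝑏₁ = Σᵢ(𝑥ᵢ − 𝑥̄)(𝑦ᵢ − 𝑦̄) / Σᵢ(𝑥ᵢ − 𝑥̄)² and 𝑏₀ = 𝑦̄ − 𝑏₁𝑥̄.
This is exactly what the estimate_coef function in the program below computes.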
The package scikit-learn is a widely used Python library for machine learning, built on top of
NumPy and some other packages. It provides the means for preprocessing data, reducing
dimensionality, implementing regression, classification, clustering, and more. Like NumPy, scikit-
learn is also open source.
If you want to implement linear regression and need the functionality beyond the scope of scikit-
learn, you should consider statsmodels. It’s a powerful Python package for the estimation of
statistical models, performing tests, and more. It’s open source as well.
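As a minimal sketch of scikit-learn's linear-regression API, using the six observations described earlier (only the first two pairs, 𝑥 = 5, 𝑦 = 5 and 𝑥 = 15, 𝑦 = 20, appear in the text above; the remaining values are assumed from the same well-known example):

import numpy as np
from sklearn.linear_model import LinearRegression

# Inputs must be a 2-D array of shape (n_samples, n_features)
x = np.array([5, 15, 25, 35, 45, 55]).reshape((-1, 1))
y = np.array([5, 20, 14, 32, 22, 38])

# Fit the model and inspect the estimated weights
model = LinearRegression().fit(x, y)
print("intercept (b0):", model.intercept_)
print("slope (b1):", model.coef_)
print("f(5):", model.predict([[5]]))  # about 8.33, as described above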
There are five basic steps when you're implementing linear regression:
1. Import the packages and classes you need.
2. Provide data to work with, and eventually do appropriate transformations.
3. Create a regression model and fit it with the existing data.
4. Check the results of the model fitting to know whether the model is satisfactory.
5. Apply the model for predictions.
import numpy as np
import matplotlib.pyplot as plt

def estimate_coef(x, y):
    # number of observations/points
    n = np.size(x)
    # means of the x and y vectors
    m_x = np.mean(x)
    m_y = np.mean(y)
    # cross-deviation and deviation about x
    SS_xy = np.sum(y * x) - n * m_y * m_x
    SS_xx = np.sum(x * x) - n * m_x * m_x
    # regression coefficients
    b_1 = SS_xy / SS_xx
    b_0 = m_y - b_1 * m_x
    return (b_0, b_1)

def plot_regression_line(x, y, b):
    # plotting the actual points as a scatter plot
    plt.scatter(x, y, color="m", marker="o", s=30)
    # predicted response vector
    y_pred = b[0] + b[1] * x
    # plotting the regression line
    plt.plot(x, y_pred, color="g")
    # putting labels
    plt.xlabel('x')
    plt.ylabel('y')
    plt.show()

def main():
    # observations (the manual's original y values were not preserved;
    # the y array below is example data)
    x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
    y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])
    # estimating coefficients
    b = estimate_coef(x, y)
    print("Estimated coefficients:\nb_0 = {} \nb_1 = {}".format(b[0], b[1]))
    # plotting the regression line
    plot_regression_line(x, y, b)

if __name__ == "__main__":
    main()

Output (as printed in the manual's original run; the example y data above will give different coefficients):
Estimated coefficients:
b_0 = -0.0586206896552
b_1 = 1.45747126437
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import accuracy_score, recall_score, precision_score, confusion_matrix

# Load the labelled messages and map the pos/neg labels to 1/0
msg = pd.read_csv('document.csv', names=['message', 'label'])
msg['labelnum'] = msg.label.map({'pos': 1, 'neg': 0})
X = msg.message
y = msg.labelnum
Xtrain, Xtest, ytrain, ytest = train_test_split(X, y)

# Build the document-term matrix from the training messages
count_v = CountVectorizer()
Xtrain_dm = count_v.fit_transform(Xtrain)
Xtest_dm = count_v.transform(Xtest)
df = pd.DataFrame(Xtrain_dm.toarray(), columns=count_v.get_feature_names())

# Train a multinomial naive Bayes classifier and evaluate it
clf = MultinomialNB()
clf.fit(Xtrain_dm, ytrain)
pred = clf.predict(Xtest_dm)
print('Accuracy Metrics:')
print('Accuracy:', accuracy_score(ytest, pred))
print('Recall:', recall_score(ytest, pred))
print('Precision:', precision_score(ytest, pred))
print('Confusion Matrix:\n', confusion_matrix(ytest, pred))
document.csv:
He is my sworn enemy,neg
My boss is horrible,neg
I love to dance,pos
Output:
Accuracy Metrics:
Accuracy: 0.6
Recall: 0.6666666666666666
Precision: 0.6666666666666666
Confusion Matrix:
[[1 1]
[1 2]]
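To see where these numbers come from, read the confusion matrix with rows as the true classes and columns as the predicted classes, in the order [neg, pos]: TN = 1, FP = 1, FN = 1, TP = 2. Then accuracy = (TP + TN)/total = 3/5 = 0.6, recall = TP/(TP + FN) = 2/3 ≈ 0.667, and precision = TP/(TP + FP) = 2/3 ≈ 0.667, matching the printed metrics.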
import numpy

def cal_pop_fitness(equation_inputs, pop):
    # The fitness function calculates the sum of products between each input
    # and its corresponding weight.
    fitness = numpy.sum(pop * equation_inputs, axis=1)
    return fitness

def select_mating_pool(pop, fitness, num_parents):
    # Selecting the best individuals in the current generation as parents for
    # producing the offspring of the next generation.
    parents = numpy.empty((num_parents, pop.shape[1]))
    for parent_num in range(num_parents):
        max_fitness_idx = numpy.where(fitness == numpy.max(fitness))
        max_fitness_idx = max_fitness_idx[0][0]
        parents[parent_num, :] = pop[max_fitness_idx, :]
        # Exclude this solution from being selected again.
        fitness[max_fitness_idx] = -99999999999
    return parents

def crossover(parents, offspring_size):
    offspring = numpy.empty(offspring_size)
    # The point at which crossover takes place between two parents.
    # Usually, it is at the center.
    crossover_point = numpy.uint8(offspring_size[1] / 2)
    for k in range(offspring_size[0]):
        # Indices of the two parents to mate.
        parent1_idx = k % parents.shape[0]
        parent2_idx = (k + 1) % parents.shape[0]
        # The new offspring takes the first half of its genes from the first
        # parent and the second half from the second parent.
        offspring[k, 0:crossover_point] = parents[parent1_idx, 0:crossover_point]
        offspring[k, crossover_point:] = parents[parent2_idx, crossover_point:]
    return offspring

def mutation(offspring_crossover, num_mutations=1):
    # Mutation changes num_mutations genes in each offspring at random.
    mutations_counter = numpy.uint8(offspring_crossover.shape[1] / num_mutations)
    for idx in range(offspring_crossover.shape[0]):
        gene_idx = mutations_counter - 1
        for mutation_num in range(num_mutations):
            # Add a random value to the selected gene.
            random_value = numpy.random.uniform(-1.0, 1.0, 1)
            offspring_crossover[idx, gene_idx] = offspring_crossover[idx, gene_idx] + random_value
            gene_idx = gene_idx + mutations_counter
    return offspring_crossover
# The helper functions defined above (cal_pop_fitness, select_mating_pool,
# crossover, mutation) are assumed to be in the same file.
import numpy

"""
The target is to maximize this equation:
    y = w1x1 + w2x2 + w3x3 + w4x4 + w5x5 + w6x6
    where (x1,x2,x3,x4,x5,x6) = (4,-2,3.5,5,-11,-4.7)
What are the best values for the 6 weights w1 to w6?
We are going to use the genetic algorithm to find the best possible values
after a number of generations.
"""

# Inputs of the equation.
equation_inputs = [4, -2, 3.5, 5, -11, -4.7]
# Number of weights to optimize.
num_weights = len(equation_inputs)

"""
Genetic algorithm parameters:
    Population size
    Mating pool size
"""
sol_per_pop = 8
num_parents_mating = 4

# Defining the population size and creating the initial population.
pop_size = (sol_per_pop, num_weights)
new_population = numpy.random.uniform(low=-4.0, high=4.0, size=pop_size)
print(new_population)

best_outputs = []
num_generations = 1000
for generation in range(num_generations):
    print("Generation : ", generation)
    # Measuring the fitness of each chromosome in the population.
    fitness = cal_pop_fitness(equation_inputs, new_population)
    print("Fitness")
    print(fitness)
    # Track the best result in the current population.
    best_outputs.append(numpy.max(numpy.sum(new_population * equation_inputs, axis=1)))
    # Selecting the best parents in the population for mating.
    parents = select_mating_pool(new_population, fitness,
                                 num_parents_mating)
    print("Parents")
    print(parents)
    # Generating the next generation using crossover.
    offspring_crossover = crossover(parents,
                                    offspring_size=(pop_size[0]-parents.shape[0], num_weights))
    print("Crossover")
    print(offspring_crossover)
    # Adding some variation to the offspring using mutation.
    offspring_mutation = mutation(offspring_crossover, num_mutations=2)
    print("Mutation")
    print(offspring_mutation)
    # Creating the new population from the parents and the offspring.
    new_population[0:parents.shape[0], :] = parents
    new_population[parents.shape[0]:, :] = offspring_mutation
# Getting the best solution after all generations have finished.
# At first, the fitness is calculated for each solution in the final generation.
fitness = cal_pop_fitness(equation_inputs, new_population)
# Then return the index of that solution corresponding to the best fitness.
best_match_idx = numpy.where(fitness == numpy.max(fitness))
print("Best solution : ", new_population[best_match_idx, :])
print("Best solution fitness : ", fitness[best_match_idx])

import matplotlib.pyplot
matplotlib.pyplot.plot(best_outputs)
matplotlib.pyplot.xlabel("Iteration")
matplotlib.pyplot.ylabel("Fitness")
matplotlib.pyplot.show()
Output (abridged; the weight and fitness arrays printed for each generation are omitted):
Generation : 0
Fitness
Parents
Crossover
Mutation
...
Generation : 999
Fitness
Parents
Crossover
Mutation
The best solution and its fitness are printed last, followed by the fitness-vs-iteration plot.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import accuracy_score, recall_score, precision_score, confusion_matrix

# Load the labelled messages and map the pos/neg labels to 1/0
msg = pd.read_csv('document.csv', names=['message', 'label'])
msg['labelnum'] = msg.label.map({'pos': 1, 'neg': 0})
X = msg.message
y = msg.labelnum
Xtrain, Xtest, ytrain, ytest = train_test_split(X, y)

# Build the document-term matrix
count_v = CountVectorizer()
Xtrain_dm = count_v.fit_transform(Xtrain)
Xtest_dm = count_v.transform(Xtest)
df = pd.DataFrame(Xtrain_dm.toarray(), columns=count_v.get_feature_names())

# The manual omits the classifier definition here; a multinomial naive Bayes
# model is assumed, as in the earlier program.
clf = MultinomialNB()
clf.fit(Xtrain_dm, ytrain)
pred = clf.predict(Xtest_dm)
print('Accuracy Metrics:')
print('Accuracy:', accuracy_score(ytest, pred))
print('Recall:', recall_score(ytest, pred))
print('Precision:', precision_score(ytest, pred))
print('Confusion Matrix:\n', confusion_matrix(ytest, pred))
document.csv:
He is my sworn enemy,neg
My boss is horrible,neg
I love to dance,pos
Output:
Accuracy Metrics:
Accuracy: 0.8
Recall: 1.0
Precision: 0.75
Confusion Matrix:
[[1 1]
[0 3]]