LAB-5
Build a Logistic Regression model for a given dataset.
OBSERVATION:
CODE:
# Standard imports for numerical work, dataframes, and visualization.
import math
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt  # NOTE(review): module name reconstructed from plt usage below — confirm
import seaborn as sns
import plotly.express as px  # NOTE(review): module name reconstructed from px usage below — confirm
import pprint
import pickle
# In [4]:
# Load the breast-cancer dataset. NOTE(review): the original CSV filename was
# lost in transcription — "data.csv" assumed; TODO confirm.
df = pd.read_csv('data.csv')

# In [5]:
# Inspect the first rows of the dataframe.
df.head()

# In [6]:
df.drop('id', axis=1, inplace=True)  # drop redundant columns

# In [7]:
# Encode the label into 1/0: 'M' (malignant) -> 1, anything else -> 0.
df['diagnosis'] = (df['diagnosis'] == 'M').astype(int)

# In [8]:
# Pairwise correlation matrix of all (now numeric) columns.
corr = df.corr()

# In [9]:
# Visualize the correlation matrix as an annotated heatmap.
plt.figure(figsize=(20, 20))
sns.heatmap(corr, cmap='mako_r', annot=True)
plt.show()
# In [12]:
# Get the absolute value of the correlation with the target
cor_target = abs(corr["diagnosis"])
# Select highly correlated features (threshold = 0.2)
relevant_features = cor_target[cor_target > 0.2]
# Collect the names of the features
names = [index for index, value in relevant_features.items()]
# Drop the target variable itself from the results
names.remove('diagnosis')
# Display the selected feature names
pprint.pprint(names)
['radius_mean',
'texture_mean',
'perimeter_mean',
'area_mean',
'smoothness_mean',
'compactness_mean',
'concavity_mean',
'concave points_mean',
'symmetry_mean',
'radius_se',
'perimeter_se',
'area_se',
'compactness_se',
'concavity_se',
'concave points_se',
'radius_worst',
'texture_worst',
'perimeter_worst',
'area_worst',
'smoothness_worst',
'compactness_worst',
'concavity_worst',
'concave points_worst',
'symmetry_worst',
'fractal_dimension_worst']
# In [13]:
# Feature matrix and label vector as plain NumPy arrays for the custom model.
X = df[names].values
y = df['diagnosis'].values
# In [14]:
def train_test_split(X, y, random_state=42, test_size=0.2):
    """
    Splits the data into training and testing sets.

    Parameters:
        X (numpy.ndarray): Features array of shape (n_samples, n_features).
        y (numpy.ndarray): Target array of shape (n_samples,).
        random_state (int): Seed for the random number generator. Default is 42.
        test_size (float): Proportion of samples to include in the test set.
            Default is 0.2.

    Returns:
        Tuple[numpy.ndarray]: A tuple containing X_train, X_test, y_train, y_test.
    """
    # Get number of samples
    n_samples = X.shape[0]
    # Seed the global RNG so the split is reproducible
    np.random.seed(random_state)
    # Shuffle the sample indices
    shuffled_indices = np.random.permutation(np.arange(n_samples))
    # Number of samples in the test set (separate name so the
    # test_size parameter is not shadowed by the integer count)
    n_test = int(n_samples * test_size)
    # Split the indices into test and train
    test_indices = shuffled_indices[:n_test]
    train_indices = shuffled_indices[n_test:]
    # Split the features and target arrays into test and train
    X_train, X_test = X[train_indices], X[test_indices]
    y_train, y_test = y[train_indices], y[test_indices]
    return X_train, X_test, y_train, y_test
# In [15]:
# 80/20 split with the default seed (42) for reproducibility.
X_train, X_test, y_train, y_test = train_test_split(X, y)
# In [16]:
def standardize_data(X_train, X_test):
    """
    Standardizes the input data using mean and standard deviation.

    Parameters:
        X_train (numpy.ndarray): Training data.
        X_test (numpy.ndarray): Testing data.

    Returns:
        Tuple of standardized training and testing data.
    """
    # Statistics come from the training data only, to avoid test-set leakage.
    mean = np.mean(X_train, axis=0)
    std = np.std(X_train, axis=0)
    # Standardize both sets with the training statistics.
    X_train = (X_train - mean) / std
    X_test = (X_test - mean) / std
    return X_train, X_test
# Z-score both splits using statistics computed from the training set only.
X_train, X_test = standardize_data(X_train, X_test)
# In [17]:
def sigmoid(z):
    """
    Compute the sigmoid function for a given input.

    The sigmoid maps any real-valued number to a value between 0 and 1 and is
    the activation used in logistic regression and neural networks.

    Parameters:
        z (float or numpy.ndarray): The input value(s) for which to compute
            the sigmoid.

    Returns:
        float or numpy.ndarray: The sigmoid of the input value(s).

    Example:
        >>> sigmoid(0)
        0.5
    """
    # Formula: 1 / (1 + e^(-z)).
    return 1 / (1 + np.exp(-z))
# In [18]:
# Visualize the logistic function over [-12, 12].
z = np.linspace(-12, 12, 200)
fig = px.line(x=z, y=sigmoid(z), title='Logistic Function', template="plotly_dark")
fig.update_layout(
    title_font_color="#41BEE9",
    xaxis=dict(color="#41BEE9"),
    yaxis=dict(color="#41BEE9")
)
fig.show()
# In [19]:
class LogisticRegression:
    """
    Logistic Regression model trained with batch gradient descent.

    Parameters:
        learning_rate (float): Step size for gradient-descent updates.

    Methods:
        initialize_parameter(): Initializes the parameters of the model.
        sigmoid(z): Computes the sigmoid activation function for given input z.
        forward(X): Computes forward propagation for given input X.
        compute_cost(predictions): Computes the cost function for given predictions.
        compute_gradient(predictions): Computes the gradients for the model.
        fit(X, y, iterations, plot_cost): Trains the model on X and labels y.
        predict(X): Predicts the labels for given input X.
        save_model(filename): Pickles the learned parameters to a file.
        load_model(filename): Loads a pickled model (classmethod).
    """

    def __init__(self, learning_rate=0.0001):
        # Fixed seed so weight initialization is reproducible across runs.
        np.random.seed(1)
        self.learning_rate = learning_rate

    @staticmethod
    def sigmoid(z):
        """Logistic function 1 / (1 + e^(-z)); maps reals into (0, 1)."""
        return 1 / (1 + np.exp(-z))

    def initialize_parameter(self):
        """
        Initializes the parameters of the model: one weight per feature
        plus a scalar bias.
        """
        # NOTE(review): the original initializer call was lost in
        # transcription; small random weights are assumed — confirm.
        self.W = np.random.randn(self.X.shape[1])
        self.b = 0.0

    def forward(self, X):
        """
        Computes forward propagation for given input X.

        Parameters:
            X (numpy.ndarray): Input array of shape (n_samples, n_features).

        Returns:
            numpy.ndarray: Predicted probabilities, shape (n_samples,).
        """
        Z = np.dot(X, self.W) + self.b
        return self.sigmoid(Z)

    def compute_cost(self, predictions):
        """
        Computes the binary cross-entropy cost for given predictions.

        Parameters:
            predictions (numpy.ndarray): Predicted probabilities.

        Returns:
            float: Mean cross-entropy cost of the model.
        """
        m = self.X.shape[0]  # number of training examples
        # Small epsilon (1e-8) avoids taking log of 0.
        cost = np.sum(
            (-np.log(predictions + 1e-8) * self.y)
            + (-np.log(1 - predictions + 1e-8)) * (1 - self.y)
        )
        return cost / m

    def compute_gradient(self, predictions):
        """
        Computes the gradients of the cost w.r.t. W and b.

        Parameters:
            predictions (numpy.ndarray): Predicted probabilities.
        """
        # number of training examples
        m = self.X.shape[0]
        # dJ/dW = X^T (p - y) / m  and  dJ/db = sum(p - y) / m
        self.dW = np.dot(self.X.T, (predictions - self.y)) / m
        self.db = np.sum(predictions - self.y) / m

    def fit(self, X, y, iterations, plot_cost=True):
        """
        Trains the model on given input X and labels y for the specified
        number of iterations.

        Parameters:
            X (numpy.ndarray): Features array of shape (n_samples, n_features).
            y (numpy.ndarray): Labels array of shape (n_samples,).
            iterations (int): Number of gradient-descent iterations.
            plot_cost (bool): Whether to plot cost over iterations or not.

        Returns:
            None.
        """
        self.X = X
        self.y = y
        self.initialize_parameter()
        costs = []
        for i in range(iterations):
            # forward propagation
            predictions = self.forward(self.X)
            # track the cost for the learning curve
            cost = self.compute_cost(predictions)
            costs.append(cost)
            # compute gradients
            self.compute_gradient(predictions)
            # gradient-descent parameter update
            self.W = self.W - self.learning_rate * self.dW
            self.b = self.b - self.learning_rate * self.db
            # print cost every 10000 iterations
            if i % 10000 == 0:
                print("Cost after iteration {}: {}".format(i, cost))
        if plot_cost:
            fig = px.line(y=costs, title="Cost vs Iteration", template="plotly_dark")
            fig.update_layout(
                title_font_color="#41BEE9",
                xaxis=dict(color="#41BEE9", title="Iterations"),
                yaxis=dict(color="#41BEE9", title="cost")
            )
            fig.show()

    def predict(self, X):
        """
        Predicts the labels for given input X.

        Parameters:
            X (numpy.ndarray): Input features array.

        Returns:
            numpy.ndarray: Predicted 0/1 labels (probabilities rounded at 0.5).
        """
        return np.round(self.forward(X))

    def save_model(self, filename=None):
        """
        Save the trained model to a file using pickle.

        Parameters:
            filename (str): The name of the file to save the model to.
        """
        model_data = {
            'learning_rate': self.learning_rate,
            'W': self.W,
            'b': self.b
        }
        with open(filename, 'wb') as file:
            pickle.dump(model_data, file)

    @classmethod
    def load_model(cls, filename):
        """
        Load a trained model from a file using pickle.

        Parameters:
            filename (str): The name of the file to load the model from.

        Returns:
            LogisticRegression: An instance of the LogisticRegression class
            with loaded parameters.
        """
        with open(filename, 'rb') as file:
            model_data = pickle.load(file)
        # Create a new instance and restore the learned parameters.
        loaded_model = cls(model_data['learning_rate'])
        loaded_model.W = model_data['W']
        loaded_model.b = model_data['b']
        return loaded_model
# In [21]:
# Train for 100k iterations on the standardized training split.
lg = LogisticRegression()
lg.fit(X_train, y_train, 100000)

# In [22]:
# Persist the learned parameters. NOTE(review): original filename lost in
# transcription — "model.pkl" assumed; must match the later load step.
lg.save_model("model.pkl")
# In [23]:
class ClassificationMetrics:
    """Static evaluation metrics for binary (0/1) classification."""

    @staticmethod
    def accuracy(y_true, y_pred):
        """
        Computes the accuracy of a classification model.

        Parameters:
            y_true (numpy array): True labels for each data point.
            y_pred (numpy array): Predicted labels for each data point.

        Returns:
            float: Fraction of predictions that match the true labels.
        """
        y_true = y_true.flatten()
        total_samples = len(y_true)
        correct_predictions = np.sum(y_true == y_pred)
        return correct_predictions / total_samples

    @staticmethod
    def precision(y_true, y_pred):
        """
        Computes the precision of a classification model.

        Parameters:
            y_true (numpy array): True labels for each data point.
            y_pred (numpy array): Predicted labels for each data point.

        Returns:
            float: TP / (TP + FP) — the proportion of true positives out of
            all positive predictions; 0.0 when nothing is predicted positive.
        """
        true_positives = np.sum((y_true == 1) & (y_pred == 1))
        false_positives = np.sum((y_true == 0) & (y_pred == 1))
        predicted_positives = true_positives + false_positives
        # Guard against division by zero when the model predicts no positives.
        if predicted_positives == 0:
            return 0.0
        return true_positives / predicted_positives

    @staticmethod
    def recall(y_true, y_pred):
        """
        Computes the recall (sensitivity) of a classification model.

        Parameters:
            y_true (numpy array): True labels for each data point.
            y_pred (numpy array): Predicted labels for each data point.

        Returns:
            float: TP / (TP + FN) — the proportion of true positives out of
            all actual positives; 0.0 when the dataset has no positives.
        """
        true_positives = np.sum((y_true == 1) & (y_pred == 1))
        false_negatives = np.sum((y_true == 1) & (y_pred == 0))
        actual_positives = true_positives + false_negatives
        # Guard against division by zero when there are no positive samples.
        if actual_positives == 0:
            return 0.0
        return true_positives / actual_positives

    @staticmethod
    def f1_score(y_true, y_pred):
        """
        Computes the F1-score of a classification model.

        Parameters:
            y_true (numpy array): True labels for each data point.
            y_pred (numpy array): Predicted labels for each data point.

        Returns:
            float: Harmonic mean of precision and recall; 0.0 when both are 0.
        """
        precision_value = ClassificationMetrics.precision(y_true, y_pred)
        recall_value = ClassificationMetrics.recall(y_true, y_pred)
        # Guard: the harmonic mean is 0 when both components are 0.
        if precision_value + recall_value == 0:
            return 0.0
        return 2 * (precision_value * recall_value) / (precision_value + recall_value)
# In [24]:
# Reload the persisted model. NOTE(review): filename lost in transcription —
# "model.pkl" assumed; must match the filename used when saving.
model = LogisticRegression.load_model("model.pkl")

# In [25]:
# Score the reloaded model on the held-out test split.
y_pred = model.predict(X_test)
accuracy = ClassificationMetrics.accuracy(y_test, y_pred)
precision = ClassificationMetrics.precision(y_test, y_pred)
recall = ClassificationMetrics.recall(y_test, y_pred)
f1_score = ClassificationMetrics.f1_score(y_test, y_pred)
print(f"Accuracy: {accuracy:.2%}")
print(f"Precision: {precision:.2%}")
print(f"Recall: {recall:.2%}")
print(f"F1-Score: {f1_score:.2%}")
OUTPUT: