COMP3202 - Intro to Machine Learning
Linear Methods for Classification
Linear Models for Classification
For tasks where the response variable is categorical, we need a method that models the posterior probabilities
Pr(Y = k | X = x),
where k is the class of instance x.
Linear Models for Classification
If Pr(Y = k | X = x) is linear in X, then
– the decision boundaries will be linear, and
– we can use a linear model.
Figure from The Elements of Statistical Learning by Hastie, Tibshirani and Friedman, 2009.
Why not use linear regression?
Pr(Y = k | X = x) must be modelled with a function that gives outputs between 0 and 1 for all values of X.
Fig. from An Introduction to Statistical Learning: with Applications in R by James, Witten, Hastie, and Tibshirani, 2013.
From linear to logistic regression
e is Euler's number, e ≈ 2.718281
Fig. from An Introduction to Statistical Learning: with Applications in R by James, Witten, Hastie, and Tibshirani, 2013.
Logistic Regression
We need to model the relationship between p(X) = Pr(Y = 1 | X = x) and X.
Consider using a linear model to represent the probabilities:
p(X) = β0 + β1X
Using this equation, the output for very large or very small input values could fall outside the range [0, 1]. (Why is this not sensible?)
To avoid this problem, we use the logistic function, which ensures the output lies within the range (0, 1):
p(X) = e^(β0 + β1X) / (1 + e^(β0 + β1X))
This function produces an S-shaped curve that, regardless of the value of X, produces a sensible output.
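A minimal sketch of this contrast (the coefficient values below are invented purely for illustration): the linear predictor can wander far outside [0, 1], while the logistic function always returns a value strictly between 0 and 1.

```python
import math

def logistic(z):
    """Logistic (sigmoid) function: maps any real z into the interval (0, 1)."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)          # this branch avoids overflow for large negative z
    return ez / (1.0 + ez)

# Hypothetical coefficients, chosen only for illustration
beta0, beta1 = -4.0, 0.002

for x in [-5000, 0, 2000, 10000]:
    linear = beta0 + beta1 * x          # can fall outside [0, 1]
    p = logistic(linear)                # always strictly between 0 and 1
    print(f"x={x:6d}  linear={linear:8.2f}  logistic={p:.6f}")
```

The S-shape comes from the exponential saturating at both ends: very negative inputs map near 0, very positive inputs map near 1.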
Logistic Regression
After a bit of manipulation of p(X) = e^(β0 + β1X) / (1 + e^(β0 + β1X)), we find that
p(X) / (1 − p(X)) = e^(β0 + β1X)
called the odds, which takes values between 0 and ∞.
By taking the logarithm of both sides:
log( p(X) / (1 − p(X)) ) = β0 + β1X
called the log-odds or logit.
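This identity is easy to check numerically (the coefficients below are made up for illustration): for any x, forming the odds p/(1 − p) and taking the log recovers the linear predictor β0 + β1x exactly.

```python
import math

beta0, beta1 = -3.0, 0.8   # hypothetical coefficients, for illustration only

def p(x):
    """Logistic model: p(x) = e^(b0 + b1*x) / (1 + e^(b0 + b1*x))."""
    return 1.0 / (1.0 + math.exp(-(beta0 + beta1 * x)))

for x in [0.0, 1.0, 2.5]:
    prob = p(x)
    odds = prob / (1.0 - prob)             # lies in (0, inf)
    log_odds = math.log(odds)              # equals beta0 + beta1 * x
    print(f"x={x}  p={prob:.4f}  odds={odds:.4f}  logit={log_odds:.4f}")
```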
Finding the Coefficients
Likelihood function:
ℓ(β0, β1) = ∏(i: yᵢ = 1) p(xᵢ) × ∏(i′: yᵢ′ = 0) (1 − p(xᵢ′))
Maximum likelihood is a very general approach used to estimate the βs that maximise this likelihood function.
Any statistical software can be used to estimate the βs (e.g. with optimisers such as SGD).
Generalized likelihood function:
ℓ(β) = ∏ᵢ p(xᵢ)^yᵢ (1 − p(xᵢ))^(1 − yᵢ)
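Maximum-likelihood fitting can be sketched with plain gradient ascent on the log-likelihood. The data below are synthetic, generated from known coefficients chosen only for illustration, so the estimates can be compared against the truth.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data drawn from a known logistic model (illustration only)
true_b0, true_b1 = -1.0, 2.0
x = rng.normal(size=500)
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-(true_b0 + true_b1 * x))))

# Gradient ascent on the log-likelihood
#   l(b0, b1) = sum_i [ y_i * log p(x_i) + (1 - y_i) * log(1 - p(x_i)) ]
b0, b1 = 0.0, 0.0
lr = 1.0
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(b0 + b1 * x)))
    b0 += lr * np.mean(y - p)        # d l / d b0, averaged over the sample
    b1 += lr * np.mean((y - p) * x)  # d l / d b1, averaged over the sample
print(f"estimates: b0={b0:.3f}, b1={b1:.3f} (true: {true_b0}, {true_b1})")
```

In practice, statistical packages use faster second-order methods (e.g. Newton's method), but the objective being maximised is the same log-likelihood.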
Logistic Regression
Maximum likelihood is used to estimate the βs.
[Figure: fitted logistic curve — probability of Y = 1 (y-axis) against X (x-axis).]
Fig. adapted from An Introduction to Statistical Learning: with Applications in R by James, Witten, Hastie, and Tibshirani, 2013.
Logistic Regression
● Logit is linear in X
● If βi is positive then increasing Xi will increase p(X)
● If βi is negative then increasing Xi will decrease p(X)
● Predict Y = 1 for any instance for which p(X) > threshold
● The decision boundary is the set of points for which the log-odds are zero.
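These rules can be stated directly in code (the coefficients below are hypothetical): with β0 = −4 and β1 = 2, the log-odds are zero at x = −β0/β1 = 2, which is exactly where p(x) = 0.5, so the predicted class flips there at the default threshold.

```python
import math

beta0, beta1 = -4.0, 2.0   # hypothetical coefficients, for illustration only

def p(x):
    return 1.0 / (1.0 + math.exp(-(beta0 + beta1 * x)))

def predict(x, threshold=0.5):
    """Predict Y = 1 whenever p(x) exceeds the threshold."""
    return 1 if p(x) > threshold else 0

boundary = -beta0 / beta1           # log-odds are zero here, so p(boundary) = 0.5
print(boundary)                     # 2.0
print(predict(1.9), predict(2.1))   # 0 1  -- the class flips at the boundary
```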
Example
Suppose we want to use logistic regression to predict whether an individual will default on their credit card payment on the basis of annual income, monthly credit card balance, and student status.
Example and Figure from An Introduction to Statistical Learning: with Applications in R by James, Witten, Hastie, and Tibshirani, 2013.
Making Predictions
Based on the estimated coefficients (β̂0 = −10.6513 and β̂1 = 0.0055 for balance, as reported in ISLR), the default probability for an individual with a balance of $1,000 is:
p(1000) = e^(−10.6513 + 0.0055 × 1000) / (1 + e^(−10.6513 + 0.0055 × 1000)) ≈ 0.00576
Fig. from An Introduction to Statistical Learning: with Applications in R by James, Witten, Hastie, and Tibshirani, 2013.
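The calculation can be carried out directly with the logistic function, using the balance-only coefficients reported in ISLR:

```python
import math

# Coefficients for the balance-only model as reported in ISLR
beta0, beta1 = -10.6513, 0.0055

def p_default(balance):
    z = beta0 + beta1 * balance
    return math.exp(z) / (1.0 + math.exp(z))

print(f"{p_default(1000):.5f}")  # about 0.00576 -- under 1%
print(f"{p_default(2000):.5f}")  # about 0.586  -- far riskier
```

Note how a doubling of the balance moves the probability from well under 1% to over 50%: the logit is linear in balance, but the probability is not.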
Confounding
Suppose that we construct a model of the probability of default
using only the feature student status and the coefficient
associated with this feature is 0.4049.
However, when we add the features balance and income to the
model the coefficient associated with the feature student status
is negative. Why?
Confounding
● Interpretation:
○ A student is less risky than a non-student with the same credit card balance
● Confounding occurs when features are correlated
● Results of linear models can change significantly depending on the features included.
● It is important to include all relevant features.
Example and Figure from An Introduction to Statistical Learning: with Applications in R by James, Witten, Hastie, and Tibshirani, 2013.
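The sign flip can be reproduced on synthetic data. The simulation below is invented for illustration (these numbers are not the ISLR Default data): students carry higher balances, but given the same balance they are less likely to default. A student-only model then picks up a positive student coefficient, while the model that also includes balance gives a negative one. The fit is a plain gradient-ascent logistic regression on standardised features.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 20000

# Synthetic data (illustration only): students carry higher balances,
# but *given* the same balance they are less likely to default.
student = rng.binomial(1, 0.3, n)
balance = rng.normal(1000.0 + 500.0 * student, 300.0)
logit = -8.0 + 0.006 * balance - 0.8 * student
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-logit)))

def fit(X, y, steps=5000, lr=0.5):
    """Logistic regression by gradient ascent on standardised features."""
    X = (X - X.mean(axis=0)) / X.std(axis=0)
    X = np.column_stack([np.ones(len(X)), X])
    b = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ b)))
        b += lr * X.T @ (y - p) / len(y)
    return b

b_marginal = fit(student[:, None].astype(float), y)
b_full = fit(np.column_stack([student, balance]).astype(float), y)

print("student coefficient, student-only model:", b_marginal[1])  # positive
print("student coefficient, full model:        ", b_full[1])      # negative
```

The marginal model blames students because student status is correlated with balance, the true driver of risk; once balance is in the model, the student coefficient reflects the within-balance comparison and turns negative.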