0% found this document useful (0 votes)

53 views39 pages

Iu 3.6.4 ML 101

Machine learning algorithms analyze large amounts of data to identify patterns and make predictions without being explicitly programmed. There are three key conditions for applying machine learning: 1) a pattern must exist in the input data, 2) there must be ample data to analyze, and 3) the problem behavior can be expressed mathematically. The machine learning process involves collecting data, preparing it, selecting an algorithm, training the model, and evaluating performance. Algorithms either understand relationships between inputs and outputs or identify intrinsic patterns in the input data.

Uploaded by

anunair.viji

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views39 pages

Iu 3.6.4 ML 101

Uploaded by

anunair.viji

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

IU 3.6.

4 Machine Learning 101

RISE 2.0

SEP 2022
13 hours

Contents Where Are We in the Journey & 5 mins

Learning Objectives

Agenda for Today

25 mins
• Course Intro + Machine Learning Techniques
• Chapter 1 – Linear Regression 3.5 hrs.

• Chapter 2 – Logistic Regression 3 hrs.

• Chapter 3 - Clustering 3 hrs.

• Chapter 4 – Recommender Systems 3 hrs.

1
Where are we in the learning journey?

IU 1.0 IU 2.0
IU 3.0
Orientation Business Digital Capstone
Business and Data Analytics core
(1 week) Essentials Essentials (4 weeks)
(14 weeks)
(1 week) (1 week)

IU 3.6.4 – Machine Learning 101

Career Development Journey: 5 Career Spotlights + 4 Career Buddies

Leadership and Personal Development Journey: Networking + Enrichment Sessions

2
Re-Cap

In previous session, we covered the following topics:

• Why Machine Learning matters

• Introduction to machine learning techniques
• Supervised and Unsupervised Learnings

Note: The following slides are a repeat of previous session. The Trainer can move to jupyter
notebooks directly and come back for project de-briefing

3
Understand the importance &
applications of machine learning

Learning
Overview of different machine learning
objectives techniques & associated steps

Practice the use of machine learning for

varied business applications

4
Course Introduction
5
Machine learning is a data analytics technique
that teaches computers to do what comes
naturally to humans and animals: learn from
experience

Machine learning algorithms use computational

Machine methods to “learn” information directly from
Learning data without relying on a predetermined
equation as a model

The algorithms adaptively improve their

performance as the number of samples
available for learning increases

* More reference on the external link library 6

Why machine learning matters?
With the rise in big data, machine learning has become a key technique for solving problems in areas, such as:

Retail & CPG Manufacturing Natural language processing

Understanding the future potential The monitoring of manufacturing Natural language processing (NLP) is
demand and sales for products is a key equipment is vital to any industrial about developing applications and
task for any retailer to better plan for process. Sometimes it is critical that services that are able to understand
inventory, cut down on production of equipment be monitored in real-time for human languages.
unnecessary products, decide pricing faults and anomalies to prevent damage
strategy. and correlate equipment behavior faults • Refer here for more details on NLP
to production line issues. Fault detection
• Refer here for example on price is the pre-cursor to predictive
forecasting maintenance.

• Refer here for more details

7
Why machine learning matters?
With the rise in big data, machine learning has become a key technique for solving problems in areas, such as:

Computational finance Image processing & computer vision Computational biology

Computational finance is also sometimes Image processing & computer vision is a Can be used for tumor detection, drug
referred to as "financial engineering," method to perform some operations on discovery, and DNA sequencing.
"financial mathematics," "mathematical an image, in order to get an enhanced
finance," or "quantitative finance." It image or to extract some useful informa • Refer here for more on Tumor
uses the tools of mathematics, statistics, tion. detection or try github
and computing to solve problems in • Refer here to understand more
finance like credit scoring and • Refer here for basic understanding about DNA sequencing
algorithmic trading. about facial recognition and a quick
tutorial
• Refer here for basic understanding • Refer here to understand more about
about credit risk models motion detection
• Refer here for more about trading

8
Course Outline
9
Chapter 1 (3.5 hours)
• Types of Regression
• Linear Regression model
• Model Training, Evaluation and Validation

Course Outline (I/II) Chapter 2 (3 hours)

• Logistic Regression model
• Model Training, Iteration and Validation
• Model Fit Statistic
• Class Imbalance

10
Chapter 3 (3 hours)
• Supervised vs Unsupervised
• Clustering model
• Common Methods: K-means

Course Outline (II/II) Chapter 4 (3 hours)

• Recommender Systems
• Common methods: Association rules learning, Market
Basket Analysis, Content-based recommendation

11
Introduction to machine
learning techniques
12
When should we use machine learning?

We consider using machine learning algorithms when we have a complex task or problem involving a large amount of
data and lots of variables, but no existing formula or equation.

For example, machine learning is a good option if you need to handle situations like these:

13
Three conditions must be met to apply machine learning to a problem

A pattern must exist in the input There must exist an ample amount The behavior in the problem can be
data that would help to arrive at a of data (examples, samples) to formulated as a mathematical
conclusion apply machine learning to a problem expression
• For instance, if we concluded the • For instance, if there are no product • Machine learning is used to derive
product reviews are random and do reviews for the webcam, it will be meaning from the data and perform
not offer any meaning, then it would difficult to arrive at a decision “structured learning” to arrive at a
be difficult to arrive at a decision by whether or not to buy the product mathematical approximation to
using them describe the behavior of the problem
• Handling these situation requires
• To solve a problem with machine simplifying the hypotheses & models
learning, the machine learning (use non-parametric approaches). *
algorithm must have a pattern to
infer from * More reference on the external link library

14
How Machine Learning Works?
Process Flow of Machine Learning

15
How Machine Learning Algorithm Works?

A machine learning algorithm performs a learning task where it either:

Understands relationships between input & an output Identifies intrinsic patterns in input data
• Given input data x & an output Y, the machine learning • The machine learning algorithm tries to find underlying
algorithm tries to find a relationship between x & Y, which structure or distributions in the data x
can be represented as: Y = f(x) • Since there is no output Y defined, there are no perfect
• The goal of machine learning algorithm would be to learn answers
the properties of this target function f, based on the given
data x

* More reference on the external link library

16
Overview of different techniques
Mainly there are 5 different categories of Machine Learning techniques that are used in the industry

17
Supervised & unsupervised
learning
18
What is supervised learning?

Supervised learning is where you have input variables (x) and an output variable (Y) and you use an
algorithm to learn the mapping function from the input to the output. Y = f(X)

The goal is to approximate the mapping function so well that when you have new input data (x) that you
can predict the output variables (Y) for that data.

It is called supervised learning because the process of an algorithm learning from the training dataset can
be thought of as a teacher supervising the learning process. We know the correct answers, the algorithm
iteratively makes predictions on the training data and is corrected by the teacher. Learning stops when
the algorithm achieves an acceptable level of performance.

19
Supervised learning problems

Supervised learning problems can be further grouped into regression and classification problems:
Regression Vs Classification

• Classification: A classification problem is when the output variable is a discrete category, such as “will
a customer default or not in loan payment?” or “was a transaction anomalous or not?” or "is the growth
on brain shown in MRI scan a tumor or not?"

• Regression: A regression problem is when the output variable is a real value, such as “estimating future
demand of a product” or “predicting revenue based on advertising spend”.

20
Regression Vs Classification algorithms

21
What is unsupervised learning?

In unsupervised learning, we only have input data (X) and no corresponding output variables

The goal for unsupervised learning is to model the underlying structure or distribution in the data in order
to learn more about the data

These are called unsupervised learning because unlike supervised learning above there is no correct
answers and there is no teacher. Algorithms are left to their own devises to discover and present the
interesting structure in the data

22
Unsupervised learning problems

Unsupervised learning problems can be further grouped into clustering and association mining problems: Clustering Vs
Association

• Clustering: A clustering problem is where you want to discover the inherent groupings in the data, such as "grouping
customers by purchasing behavior & demographic features"

• Association Mining: An association rule learning problem is where you want to discover rules that describe large
portions of your data, such as "if a customer bought milk, which other products would he/she likely buy?"

23
Clustering Vs Association algorithms

24
Choosing between supervised & unsupervised ML

25
The basic steps in using any machine learning technique

Step 1 - Identify if we have a target variable

Step 2 - Identify if the target variable is continuous or categorical (not valid if there is no target variable)

Step 3 - Identify the independent features which can explain the target variable

Step 4 - Make necessary transformations of data

Step 5 - Perform modeling based on data characteristics

26
Example 1 - Regression
Objective: Predict sales for every product-store combination for the next quarter

27
Example 1 - Regression
Objective: Predict sales for every product-store combination for the next quarter

Step 1 - Identify if we have a target variable

• From the data, we can observe that we have a target variable – sales

Step 2 - Identify if the target variable is continuous or categorical (not valid if there is no target variable)
• The target variable sales is continuous. This means we should go with regression

Step 3 - Identify the independent features which can explain the target variable
• Discount, visitor count, store area, holiday status can influence sales

Step 4 - Make necessary transformations of data

• For example, the holiday status can be changed from Yes / No to 1 / 0 so the algorithm can understand it

Step 5 - Perform modeling based on data characteristics

• After the data is cleaned & pre-processed, we can choose a regression algorithm based on how the data is structured
• If we observe a linear relationship between the response (sales) and the other independent features, we can choose
linear regression

28
Example 2 - Classification
Objective: Predict if a customer will default on loan payment in the next year

29
Example 2 - Classification
Objective: Predict if a customer will default on loan payment in the next year

Step 1 - Identify if we have a target variable

• From the data, we can observe that we have a target variable - default status

Step 2 - Identify if the target variable is continuous or categorical (not valid if there is no target variable)
• The target variable sales is categorical (yes/no). This means we should go with classification

Step 3 - Identify the independent features which can explain the target variable
• Sex, Education, Income, previous default indicator, state of origin can influence the default behavior

Step 4 - Make necessary transformations of data

• For example, categorical data columns like sex, education, state can be changed to numerical values so the
algorithm can understand them better

Step 5 - Perform modeling based on data characteristics

• After the data is cleaned & pre-processed, we can choose a classification algorithm

30
Course Deep Dive
31
Chapter 1 – Linear Regression

Exit to Demo Workbook

02
Chapter 2 – Logistic Regression

Exit to Demo Workbook

03
Chapter 3 - Clustering

Exit to Demo Workbook

04
Chapter 4 – Recommender Systems

Exit to Demo Workbook

Project Details post
Machine Learning
36
Identify the level of income qualification needed for
the families in Latin America.

Points to note:
• Many social programs have a hard time ensuring that the right
people are given enough aid.
• The client believes that new ML methods beyond traditional
Project De- econometrics, might help improve the model for this problem.
Brief • The project involves these main tasks:
EDA Identify the output variable.
EDA Understand the type of data.
EDA Check if there are any biases in your dataset.
EDA Check whether all members of the house have the same poverty level.
EDA Check if there is a house without a family head.
EDA Set poverty level of the members and the head of the house within a family.
EDA Count how many null values are existing in columns.
Data Cleaning Remove null value rows of the target variable.
Modeling Predict the accuracy using random forest classifier and 2 other algorithms
Modeling Discuss parameter tuning and find the optimal paramater for each algorithm
Modeling Check the accuracy with cross validation.

UNIT I - Introduction
No ratings yet
UNIT I - Introduction
76 pages
ML Unit 1
No ratings yet
ML Unit 1
21 pages
Machine Learning With R and Python
No ratings yet
Machine Learning With R and Python
290 pages
Machine Learning Unit 1
100% (7)
Machine Learning Unit 1
112 pages
Intro to Machine Learning Concepts
No ratings yet
Intro to Machine Learning Concepts
35 pages
ML@Chapter 1
No ratings yet
ML@Chapter 1
29 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
Supervised & Deep Learning Guide
No ratings yet
Supervised & Deep Learning Guide
83 pages
Intro To ML
No ratings yet
Intro To ML
26 pages
INTRODUCTION
No ratings yet
INTRODUCTION
51 pages
Machine Learning Course Overview
No ratings yet
Machine Learning Course Overview
225 pages
21CSC305P ML - Unit 1-E
No ratings yet
21CSC305P ML - Unit 1-E
137 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
45 pages
MLP Modules (1,2,3)
No ratings yet
MLP Modules (1,2,3)
98 pages
Unit I Machine Learning
No ratings yet
Unit I Machine Learning
78 pages
Machine Learning Tutorial For Beginners
No ratings yet
Machine Learning Tutorial For Beginners
15 pages
Machine Learning Overview & Benefits
No ratings yet
Machine Learning Overview & Benefits
15 pages
ML Chap1
No ratings yet
ML Chap1
26 pages
Unit V
No ratings yet
Unit V
67 pages
R22 Machine Learning Digital Notes Final
No ratings yet
R22 Machine Learning Digital Notes Final
143 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
19 pages
Turner, Ryan - Python Machine Learning - The Ultimate Beginner's Guide To Learn Python Machine Learning Step by Step Using Scikit-Learn and Tensorflow (2019)
No ratings yet
Turner, Ryan - Python Machine Learning - The Ultimate Beginner's Guide To Learn Python Machine Learning Step by Step Using Scikit-Learn and Tensorflow (2019)
144 pages
Machine Learning For Beginners Overview of Algorithm TypesStart Learning Machine Learning From Here
No ratings yet
Machine Learning For Beginners Overview of Algorithm TypesStart Learning Machine Learning From Here
13 pages
Introduction ML
No ratings yet
Introduction ML
25 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
UNIT I-Machine Learning
No ratings yet
UNIT I-Machine Learning
68 pages
Tirth PDF
No ratings yet
Tirth PDF
19 pages
MLUnit 1
No ratings yet
MLUnit 1
131 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
68 pages
Unit 1 PDF
No ratings yet
Unit 1 PDF
135 pages
ML Notes
No ratings yet
ML Notes
101 pages
CS 601-Machine Learning
No ratings yet
CS 601-Machine Learning
82 pages
ML 01
No ratings yet
ML 01
15 pages
Module 1 - Intro To ML - V2
No ratings yet
Module 1 - Intro To ML - V2
47 pages
Chapter Five
No ratings yet
Chapter Five
178 pages
Machine Learning - UNIT I
No ratings yet
Machine Learning - UNIT I
70 pages
ML Unit-1
No ratings yet
ML Unit-1
28 pages
ML Unit 1
No ratings yet
ML Unit 1
19 pages
Intro To Machine Learning
No ratings yet
Intro To Machine Learning
31 pages
Big-Data Unit-3
100% (1)
Big-Data Unit-3
54 pages
Summer of Science Report On - Intro To Machine Learning
No ratings yet
Summer of Science Report On - Intro To Machine Learning
36 pages
ML - Unit I - Final
No ratings yet
ML - Unit I - Final
132 pages
Machine Learning Section1 Ebook
No ratings yet
Machine Learning Section1 Ebook
12 pages
Module 1
No ratings yet
Module 1
54 pages
Lecture 3 B
No ratings yet
Lecture 3 B
6 pages
Machine Learning for Beginners
No ratings yet
Machine Learning for Beginners
27 pages
ML in Fashion Industry
No ratings yet
ML in Fashion Industry
40 pages
Introduction To Machine Learning Basics
No ratings yet
Introduction To Machine Learning Basics
12 pages
Machine Learning-Lecture 01
No ratings yet
Machine Learning-Lecture 01
28 pages
Supervised Learning (WWW - Anuupdates.org)
No ratings yet
Supervised Learning (WWW - Anuupdates.org)
60 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
9 pages
Machine Learning Classification, Regression and Clustering
No ratings yet
Machine Learning Classification, Regression and Clustering
77 pages
Machine Learning: BE Sixth Semester 20CS610
No ratings yet
Machine Learning: BE Sixth Semester 20CS610
211 pages
What Is Machine Learning?
No ratings yet
What Is Machine Learning?
6 pages
Department of Emerging Technology (SB) III B.Tech - I Semester
No ratings yet
Department of Emerging Technology (SB) III B.Tech - I Semester
12 pages
Introduction To ML
No ratings yet
Introduction To ML
48 pages
Unit 3 - DS - 1st Year
No ratings yet
Unit 3 - DS - 1st Year
5 pages
Introduction To ML Unit-1
No ratings yet
Introduction To ML Unit-1
90 pages
Kinematic Equation Practice Problems
No ratings yet
Kinematic Equation Practice Problems
3 pages
Variance - Wikipedia, The Free Encyclopedia
No ratings yet
Variance - Wikipedia, The Free Encyclopedia
18 pages
Biological Development
100% (2)
Biological Development
48 pages
Notes of Lubrication
No ratings yet
Notes of Lubrication
22 pages
5 - Kemrepair
No ratings yet
5 - Kemrepair
2 pages
Landmine Detection Using Autoencoders On Multipolarization GPR
No ratings yet
Landmine Detection Using Autoencoders On Multipolarization GPR
14 pages
Biometry Lecture 1
No ratings yet
Biometry Lecture 1
59 pages
Occupational Health and Safety at Work For Dummies, UK Edition - 978!1!119-28724-7
No ratings yet
Occupational Health and Safety at Work For Dummies, UK Edition - 978!1!119-28724-7
2 pages
Economy Housing Market Research
No ratings yet
Economy Housing Market Research
12 pages
Principles in Writing A Concept Paper
No ratings yet
Principles in Writing A Concept Paper
4 pages
Pre Test
No ratings yet
Pre Test
3 pages
WO Lecture 5
No ratings yet
WO Lecture 5
5 pages
Ebooks File An Introduction To Integral Transforms Patra All Chapters
100% (7)
Ebooks File An Introduction To Integral Transforms Patra All Chapters
55 pages
Worksheet-SCIENCE12 - General Physics 1 - Module 4 - Mechanical Waves - W1 PDF
No ratings yet
Worksheet-SCIENCE12 - General Physics 1 - Module 4 - Mechanical Waves - W1 PDF
3 pages
Good English Modifier
No ratings yet
Good English Modifier
18 pages
BSB41419 R2
No ratings yet
BSB41419 R2
4 pages
Canada's Federal Budget 2022
No ratings yet
Canada's Federal Budget 2022
304 pages
Getting The GMMA Right
No ratings yet
Getting The GMMA Right
3 pages
Science Quiz for Students
No ratings yet
Science Quiz for Students
9 pages
EDII, Chennai-Revised Mentors List PDF
No ratings yet
EDII, Chennai-Revised Mentors List PDF
6 pages
Experiment 1
No ratings yet
Experiment 1
3 pages
DESIGN P1 GR11 QP NOV2017 - English
No ratings yet
DESIGN P1 GR11 QP NOV2017 - English
13 pages
Holmen 200 Manual Ver 1 2 2
No ratings yet
Holmen 200 Manual Ver 1 2 2
24 pages
Risk Assesment Methology For Toxic Chemicals Evaporation
No ratings yet
Risk Assesment Methology For Toxic Chemicals Evaporation
10 pages
Offshore Asset Life Extension Guide
No ratings yet
Offshore Asset Life Extension Guide
9 pages
Week 2-Whlp-Grade-6
No ratings yet
Week 2-Whlp-Grade-6
86 pages
g11 Module Derivatives
No ratings yet
g11 Module Derivatives
3 pages
Updated Pre-Medical Leader Ph-123 (2024-25)
No ratings yet
Updated Pre-Medical Leader Ph-123 (2024-25)
4 pages
The Three Pillars of CSR
No ratings yet
The Three Pillars of CSR
22 pages
Hydrochloric Acid Inhibitor MSDS
No ratings yet
Hydrochloric Acid Inhibitor MSDS
4 pages

Iu 3.6.4 ML 101

Uploaded by

Iu 3.6.4 ML 101

Uploaded by

IU 3.6.

4 Machine Learning 101

Contents Where Are We in the Journey & 5 mins

Agenda for Today

• Chapter 2 – Logistic Regression 3 hrs.

• Chapter 3 - Clustering 3 hrs.

IU 3.6.4 – Machine Learning 101

Career Development Journey: 5 Career Spotlights + 4 Career Buddies

Leadership and Personal Development Journey: Networking + Enrichment Sessions

In previous session, we covered the following topics:

• Why Machine Learning matters

Practice the use of machine learning for

Machine learning algorithms use computational

The algorithms adaptively improve their

* More reference on the external link library 6

Retail & CPG Manufacturing Natural language processing

• Refer here for more details

Computational finance Image processing & computer vision Computational biology

Course Outline (I/II) Chapter 2 (3 hours)

Course Outline (II/II) Chapter 4 (3 hours)

A machine learning algorithm performs a learning task where it either:

* More reference on the external link library

Step 1 - Identify if we have a target variable

Step 4 - Make necessary transformations of data

Step 5 - Perform modeling based on data characteristics

Step 1 - Identify if we have a target variable

Step 4 - Make necessary transformations of data

Step 5 - Perform modeling based on data characteristics

Step 1 - Identify if we have a target variable

Step 4 - Make necessary transformations of data

Step 5 - Perform modeling based on data characteristics

Exit to Demo Workbook

Exit to Demo Workbook

Exit to Demo Workbook

Exit to Demo Workbook

You might also like