Machine Learning Theory

Machine learning is a subfield of artificial intelligence that gives computers the ability to learn without being explicitly programmed. It covers techniques such as regression, classification, clustering, and associations. Supervised learning involves predicting labels or targets using labeled training data, while unsupervised learning finds hidden patterns in unlabeled data. Common supervised algorithms are regression, classification using k-nearest neighbors, decision trees, logistic regression, and support vector machines. Unsupervised techniques include k-means and hierarchical clustering.

MACHINE LEARNING

“A subfield of computer science that gives computers the ability to learn without being explicitly
programmed”

Major machine learning techniques

- Regression/Estimation: predicting continuous values
- Classification: predicting the class or category of a case
- Clustering: finding the structure of data, summarizing
- Associations: finding items or events that frequently co-occur
- Anomaly detection: discovering abnormal and unusual cases
- Sequence mining: predicting next events, e.g. in a click-stream (Markov model, HMM)
- Dimension reduction: reducing the size of data (PCA)
- Recommendation systems: recommending items

Other important concepts

Artificial Intelligence: A wide field that tries to make computers intelligent in order to mimic the cognitive
functions of humans (computer vision, language processing, creativity, summarizing, etc.).
Machine Learning: The branch of AI that covers the statistical part of computer intelligence (classification,
clustering, neural networks, etc.).
Deep Learning: A subfield of machine learning that reaches a deeper level of automation than most machine
learning algorithms.

SUPERVISED LEARNING
Training a model on labelled data so that it can predict the labels of future, unseen instances. For
Supervised Learning the data must be labelled.

Supervised Learning Features:

- Useful for Classification (the process of predicting categories) and Regression (the process of predicting
continuous values).
- Has more evaluation methods than Unsupervised Learning.
- Has a more controlled environment than Unsupervised Learning.

Types of Supervised Techniques:

REGRESSION
The process of predicting continuous values. It is divided into two types: Simple Linear Regression, with one
independent variable (X1, Y1), and Multiple Linear Regression, with several (X1, X2, ..., Xn, Y1).

Advantages of Regression:
- Very fast to analyze.
- It does not require parameter tuning.
- It is easy to understand.
- It is highly interpretable.

By Cristóbal Veas https://www.linkedin.com/in/cristobal-veas/

Train and Test Data:
It is useful to separate the data into training and test sets so that the accuracy of the model is evaluated
on data it was not trained on. There are three ways to do this:

- Test on a portion of the train set: The test set is a portion of the train set. This gives
high training accuracy but low out-of-sample accuracy.
- Train/Test Split: The train and test sets are mutually exclusive, which gives a more accurate evaluation
of out-of-sample accuracy, but the result is highly dependent on which data ends up in each set.
- K-Fold Cross Validation: Uses multiple train/test splits and averages the results to produce a more
consistent out-of-sample accuracy.
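
The train/test split and K-fold ideas above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: the helper names (`train_test_split`, `k_fold_indices`, `cross_val_score`) and the toy "mean" model are assumptions made for the example.

```python
import random

def train_test_split(rows, test_ratio=0.2, seed=0):
    """Shuffle a copy of rows and split it into mutually exclusive train and test lists."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_ratio))
    return shuffled[n_test:], shuffled[:n_test]

def k_fold_indices(n_rows, k):
    """Yield (train_idx, test_idx) pairs; every row lands in a test fold exactly once."""
    indices = list(range(n_rows))
    fold_sizes = [n_rows // k + (1 if i < n_rows % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size

def cross_val_score(rows, k, fit, score):
    """Average the out-of-sample score over k train/test splits."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(rows), k):
        model = fit([rows[i] for i in train_idx])
        scores.append(score(model, [rows[i] for i in test_idx]))
    return sum(scores) / len(scores)

# Toy usage: the "model" is just the mean of y seen during training (hypothetical).
data = [(x, 2.0 * x) for x in range(10)]
fit_mean = lambda train_rows: sum(y for _, y in train_rows) / len(train_rows)
neg_mae = lambda model, test_rows: -sum(abs(y - model) for _, y in test_rows) / len(test_rows)

train, test = train_test_split(data, test_ratio=0.3)
avg_score = cross_val_score(data, k=5, fit=fit_mean, score=neg_mae)
```

Averaging over folds is what makes the K-fold estimate less dependent on one particular split.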

Types of Algorithms for Regression

- Simple Regression: A model that uses one independent variable to predict a dependent variable.

- Multiple Linear Regression: A model that uses many independent variables to predict a dependent
variable.

- Non-Linear Regression: Models to use when the relationship between the variables is not linear.
Examples of non-linear regression are polynomial (quadratic, cubic), logarithmic and logistic regressions, etc.

Methods to minimize the MSE (Mean Squared Error)

It is important to minimize the MSE to obtain the most accurate predictive model. There are two main
methods to find the parameters θ that minimize the MSE:

- Ordinary Least Squares: Uses linear algebra operations; suitable for datasets with fewer than 10k values.
- Optimization Algorithms: Use Gradient Descent; suitable for datasets of 10k values or more.
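
As a rough sketch of the two approaches for simple regression (one feature), the closed-form least-squares solution and gradient descent should reach nearly the same θ. The function names, learning rate and toy data below are assumptions chosen for illustration.

```python
def ols_fit(xs, ys):
    """Closed-form least squares for y = theta0 + theta1 * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    theta1 = sxy / sxx
    theta0 = mean_y - theta1 * mean_x
    return theta0, theta1

def gradient_descent_fit(xs, ys, lr=0.05, epochs=5000):
    """Minimize the MSE iteratively by stepping against its gradient."""
    theta0 = theta1 = 0.0
    n = len(xs)
    for _ in range(epochs):
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        grad0 = 2.0 / n * sum(errors)                              # dMSE/dtheta0
        grad1 = 2.0 / n * sum(e * x for e, x in zip(errors, xs))   # dMSE/dtheta1
        theta0 -= lr * grad0
        theta1 -= lr * grad1
    return theta0, theta1

# Hypothetical toy data, roughly y = 1 + 2x.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.1, 4.9, 7.2, 8.8]
ols_theta = ols_fit(xs, ys)
gd_theta = gradient_descent_fit(xs, ys)
```

The closed form is exact but needs the whole dataset at once, while gradient descent trades exactness for cheap incremental updates, which is why it is preferred on larger datasets.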

Evaluation metrics in Regression:
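
A minimal sketch of metrics commonly used to evaluate regression models, assuming the usual choices: mean absolute error (MAE), mean squared error (MSE), root mean squared error (RMSE) and the coefficient of determination (R²). The function names and sample values are illustrative.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error: average size of the errors."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def mse(y_true, y_pred):
    """Mean squared error: penalizes large errors more than small ones."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error: MSE expressed in the units of y."""
    return math.sqrt(mse(y_true, y_pred))

def r2_score(y_true, y_pred):
    """1 minus (residual sum of squares / total sum of squares)."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

y_true = [3.0, 5.0, 7.0]
y_pred = [2.0, 5.0, 9.0]
```

For MAE, MSE and RMSE lower is better; for R², values closer to 1 indicate a better fit.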

CLASSIFICATION
The process of categorizing some unknown items into a discrete set of categories or “classes”. It
corresponds to a supervised learning approach. The target attribute is a categorical variable.

Types of Algorithms for Classification

- K-Nearest Neighbors: An algorithm that assumes that similar things exist in close proximity. The
steps to use this algorithm are:
1. Pick a value for K.
2. Calculate the distance from the unknown case to all cases in the training data.
3. Select the K observations in the training data that are nearest to the unknown
data point.
4. Predict the response of the unknown data point using the most popular
response value among the K nearest neighbors.
It is important to determine the best value of K (the number of nearest neighbors of a specific
point). To do that, it is useful to plot the accuracy obtained for different values of K.
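
The four steps can be sketched as follows, using Euclidean distance (an assumption; other distances work too). The toy points, labels and the `accuracy` helper are hypothetical and only illustrate how accuracy could be compared across values of K.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k):
    """Steps 2-4: measure distances, keep the k nearest, take a majority vote."""
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

def accuracy(train_points, train_labels, test_points, test_labels, k):
    """Fraction of test points whose predicted label matches the true label."""
    hits = sum(
        knn_predict(train_points, train_labels, point, k) == label
        for point, label in zip(test_points, test_labels)
    )
    return hits / len(test_points)

# Hypothetical toy data: two well separated groups.
train_points = [(1, 1), (1, 2), (2, 1), (6, 6), (7, 6), (6, 7)]
train_labels = ["a", "a", "a", "b", "b", "b"]
test_points = [(2, 2), (6, 5)]
test_labels = ["a", "b"]

# Step 1 repeated for several K, to compare accuracies.
accuracy_by_k = {
    k: accuracy(train_points, train_labels, test_points, test_labels, k)
    for k in (1, 3, 5)
}
```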

- Decision Trees: It is used to go from observations about an item (represented in the branches)
to conclusions about the item's target value (represented in the leaves). The tree is built by choosing,
at each split, the attribute with the highest information gain, that is, the lowest weighted entropy after the split.
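
A small sketch of how entropy and information gain can be computed for one candidate split; the function names and the example labels are assumptions for illustration.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )

def information_gain(parent_labels, child_label_groups):
    """Entropy before the split minus the weighted entropy after it."""
    total = len(parent_labels)
    weighted_entropy = sum(
        len(group) / total * entropy(group) for group in child_label_groups
    )
    return entropy(parent_labels) - weighted_entropy

parent = ["yes", "yes", "no", "no"]
perfect_split = [["yes", "yes"], ["no", "no"]]   # each child is pure
useless_split = [["yes", "no"], ["yes", "no"]]   # children as mixed as the parent
```

A tree builder would evaluate `information_gain` for every candidate attribute and split on the one with the largest value.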

- Logistic Regression: A classification algorithm for categorical variables. It is analogous to linear
regression but predicts a categorical variable instead of a continuous one. This model is suitable when
the target is binary, when probabilistic results are required and when it is important to understand the
impact of a feature. Logistic Regression uses the sigmoid function to turn the linear output into a
probability. The training process is:
1. Initialize θ.
2. Calculate the predicted value for a customer.
3. Compare the predicted value with the real value and record the difference as the error.
4. Calculate the error for all customers; the total is the cost.
5. Change θ to reduce the cost.
6. Go back to step 2.
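
The training loop above could look like the following sketch, which assumes batch gradient descent on the log-loss cost. The learning rate, number of epochs and the one-feature toy data are assumptions, not values from the notes.

```python
import math

def sigmoid(z):
    """Squash any real number into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(theta, x):
    """Probability of class 1; theta[0] is the intercept."""
    return sigmoid(theta[0] + sum(t * xi for t, xi in zip(theta[1:], x)))

def train_logistic(features, labels, lr=0.5, epochs=2000):
    theta = [0.0] * (len(features[0]) + 1)            # step 1: initialize theta
    for _ in range(epochs):                           # step 6: repeat
        gradient = [0.0] * len(theta)
        for x, y in zip(features, labels):
            error = predict_proba(theta, x) - y       # steps 2-3: predict, compare
            gradient[0] += error                      # step 4: accumulate over all rows
            for j, xi in enumerate(x):
                gradient[j + 1] += error * xi
        theta = [t - lr * g / len(features)           # step 5: move theta to reduce cost
                 for t, g in zip(theta, gradient)]
    return theta

# Hypothetical data: small feature values belong to class 0, large ones to class 1.
features = [[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]]
labels = [0, 0, 0, 1, 1, 1]
theta = train_logistic(features, labels)
```

The sign and size of each fitted coefficient in `theta` indicate the direction and strength of a feature's impact, which is the interpretability benefit mentioned above.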

- Support Vector Machine (SVM): A supervised algorithm that classifies data by finding a
separator. The data is first mapped to a high-dimensional feature space using a kernel function
(kernelling), so that the classes become separable there. It is used for image recognition, text category
assignment, spam detection, sentiment analysis and gene expression classification.
Advantages (A) and disadvantages (D) of using this algorithm:
1. A. Accurate in high-dimensional spaces.
2. A. Memory efficient.
3. D. Prone to over-fitting.
4. D. No probability estimation.
5. D. Best suited to small datasets; training is slow on large ones.
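
The sketch below illustrates only the kernelling idea, not an SVM solver: one-dimensional data that no single threshold can separate becomes separable after mapping each x to (x, x²). The feature map, the helper `separable_by_threshold` and the data are assumptions chosen for the example.

```python
def feature_map(x):
    """Map a 1-D value into a 2-D feature space (x, x squared)."""
    return (x, x * x)

def separable_by_threshold(values, labels):
    """True if a single cut point on `values` puts each class on its own side."""
    ordered_labels = [label for _, label in sorted(zip(values, labels))]
    changes = sum(1 for a, b in zip(ordered_labels, ordered_labels[1:]) if a != b)
    return changes <= 1

# Hypothetical data: class 1 sits on both sides of class 0.
xs = [-3.0, -2.5, -0.5, 0.0, 0.5, 2.5, 3.0]
labels = [1, 1, 0, 0, 0, 1, 1]

mapped = [feature_map(x) for x in xs]
second_coordinate = [point[1] for point in mapped]

separable_in_1d = separable_by_threshold(xs, labels)
separable_after_mapping = separable_by_threshold(second_coordinate, labels)
```

In the mapped space a horizontal line on the second coordinate acts as the separator, which is what the SVM would search for.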

Evaluation Metrics in Classification
- Jaccard Index: The size of the intersection of the predicted and true labels divided by the size of
their union. The closer the value is to 1, the more accurate the model.

- Confusion Matrix: Each value of the matrix is the number of correct or wrong predictions for a class.
It is used to calculate precision, recall and the F-score. The closer the F-score is to 1, the more
accurate the model.

- Log Loss: Used when the model outputs a probability between 0 and 1 for a class label instead of the
label itself. The closer the value is to 0, the more accurate the model.
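
A minimal sketch of the three metrics for a binary problem, assuming class 1 is the positive class. The function names mirror common usage but are defined here from scratch, and the sample labels are made up.

```python
import math

def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for the chosen positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return tp, fp, fn, tn

def jaccard_index(y_true, y_pred):
    """Intersection over union of the positive predictions and positive truths."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fp + fn)

def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def log_loss(y_true, probabilities, eps=1e-15):
    """Average negative log-likelihood of the true labels (0 or 1)."""
    total = 0.0
    for t, p in zip(y_true, probabilities):
        p = min(max(p, eps), 1 - eps)   # clip to avoid log(0)
        total += t * math.log(p) + (1 - t) * math.log(1 - p)
    return -total / len(y_true)

y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 1]
y_prob = [0.9, 0.8, 0.4, 0.1, 0.2, 0.6]
```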

CLUSTERING
Dividing the population or data points into a number of groups such that data points in the same group
are more similar to each other than to the data points in other groups.
Clustering is an unsupervised learning process and is used for exploratory data analysis, summary
generation, outlier detection, finding duplicates, as a pre-processing step, etc.

Types of Algorithms for Clustering.

- K-means Algorithm: It is used for partitioning clustering, dividing the data into non-overlapping
subsets without any cluster-internal structure. The examples within a cluster are very similar, and
examples in different clusters are very different. K-means is used for medium and large sized databases,
produces sphere-like clusters and needs the number of clusters to be specified. The objectives of this algorithm are:
Intra-Cluster: Distances between examples inside the same cluster (minimized).
Inter-Cluster: Distances between examples in different clusters (maximized).

Steps to K-means Algorithm:

1. Initialize K centroids randomly.
2. Calculate the distance from each point to each centroid.
3. Assign each point to the closest centroid.
4. Calculate the SSE (sum of squared errors), which step 5 tries to reduce.
5. Compute the new centroids for each cluster.
6. Repeat until there are no more changes.
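
These steps can be sketched in plain Python as below. Picking the initial centroids from the data points, the iteration cap and the toy points are assumptions made for the example.

```python
import math
import random

def k_means(points, k, seed=0, max_iter=100):
    """Return (centroids, clusters, sse) after iterating until nothing changes."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)                     # step 1: random centroids
    for _ in range(max_iter):
        clusters = [[] for _ in range(k)]
        for point in points:                              # steps 2-3: nearest centroid
            nearest = min(range(k), key=lambda i: math.dist(point, centroids[i]))
            clusters[nearest].append(point)
        new_centroids = [                                 # step 5: mean of each cluster
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster else centroids[i]                  # keep old centroid if empty
            for i, cluster in enumerate(clusters)
        ]
        if new_centroids == centroids:                    # step 6: stop when stable
            break
        centroids = new_centroids
    sse = sum(                                            # step 4: sum of squared errors
        math.dist(point, centroids[i]) ** 2
        for i, cluster in enumerate(clusters)
        for point in cluster
    )
    return centroids, clusters, sse

# Hypothetical toy data: two compact, well separated groups.
points = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9), (8, 9.5)]
centroids, clusters, sse = k_means(points, k=2)
```

Because the result depends on the random start, it is common to run the algorithm several times and keep the run with the lowest SSE.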

- Hierarchical Clustering: Builds a hierarchy of clusters where each node is a cluster that consists of the
clusters of its daughter nodes. The agglomerative approach works bottom-up, merging clusters, and the
divisive approach works top-down, splitting them. The hierarchy is drawn as a dendrogram. The steps are:

Distance between clusters:

Advantages (A) and disadvantages (D) of Hierarchical Clustering:

1. A. Doesn't require the number of clusters to be specified.
2. A. Easy to implement.
3. A. Produces a dendrogram, which helps with understanding the data.
4. D. Can never undo any previous steps throughout the algorithm.
5. D. Generally has long runtimes.
6. D. Sometimes it is difficult to identify the number of clusters from the dendrogram.
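
A rough sketch of the agglomerative (bottom-up) approach, assuming single linkage (the distance between two clusters is the distance between their closest points) as the cluster distance. Both the linkage choice and the stopping rule `n_clusters` are assumptions for the example.

```python
import math

def single_linkage(cluster_a, cluster_b):
    """Distance between the two closest points of the two clusters."""
    return min(math.dist(p, q) for p in cluster_a for q in cluster_b)

def agglomerative(points, n_clusters, linkage=single_linkage):
    """Start with one cluster per point and merge the closest pair repeatedly."""
    clusters = [[point] for point in points]
    merges = []                                   # merge history, as a dendrogram would show
    while len(clusters) > n_clusters:
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        merges.append((clusters[i], clusters[j], linkage(clusters[i], clusters[j])))
        merged = clusters[i] + clusters[j]
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)] + [merged]
    return clusters, merges

# Hypothetical one-dimensional data, written as 1-tuples.
points = [(0,), (1,), (5,), (6,), (20,)]
clusters, merges = agglomerative(points, n_clusters=2)
merge_distances = [distance for _, _, distance in merges]
```

The search over all cluster pairs at every merge is what makes the runtime long, and a merge is never revisited, which matches the disadvantages listed above.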

- Density Based Clustering: Algorithms useful to locate regions of high density and separate outliers.
One of the most important is DBSCAN (Density-Based Spatial Clustering of Applications with
Noise), which works based on the density of objects. Each point is either a core, border or outlier point.
It is based on 2 parameters:
1. Radius of neighborhood: A radius such that, if it includes enough points,
we call the area a dense area.
2. Min number of neighbors: The minimum number of data points we want in a
neighborhood to define a cluster.

Advantages of DBSCAN:

1. Arbitrarily shaped clusters.
2. Robust to outliers.
3. Does not require specification of the number of clusters.
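
A compact sketch of DBSCAN with its two parameters, here called `eps` (radius of neighborhood) and `min_pts` (minimum number of neighbors). It assumes a point counts as its own neighbor and marks outliers with the label -1; the names and the toy data are illustrative.

```python
import math

def dbscan(points, eps, min_pts):
    """Return one label per point: a cluster id (0, 1, ...) or -1 for an outlier."""
    UNVISITED, NOISE = None, -1
    labels = [UNVISITED] * len(points)

    def neighbors(i):
        # All points within eps of point i, including i itself.
        return [j for j in range(len(points)) if math.dist(points[i], points[j]) <= eps]

    cluster_id = -1
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE                 # may later be claimed as a border point
            continue
        cluster_id += 1                       # point i is a core point: start a cluster
        labels[i] = cluster_id
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id        # border point reached from a core point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster_id
            j_neighbors = neighbors(j)
            if len(j_neighbors) >= min_pts:   # j is also a core point: keep expanding
                queue.extend(j_neighbors)
    return labels

# Hypothetical data: two dense squares and one isolated point.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
```

The number of clusters is never passed in; it falls out of how many dense regions the two parameters reveal.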

RECOMMENDER SYSTEMS
A recommender system captures the patterns of people's behavior and uses them to predict what else they might
want or like. Applications include what to buy, where to eat, which job to apply for, who you should be friends
with, and personalizing your experience on the web. The advantages are broader exposure, the possibility of
continual usage or purchase of products, and a better experience.

Implementing Recommendation Systems:

- Memory-Based: Uses the entire user-item dataset to generate a recommendation. Uses statistical
techniques to approximate users or items (Pearson correlation, cosine similarity, Euclidean
distance, etc.).
- Model-Based: Develops a model of users in an attempt to learn their preferences. Models can
be created using machine learning techniques like regression, clustering and classification.
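
The similarity measures named for the memory-based approach can be sketched as follows; the rating vectors are made-up examples, and Euclidean distance is taken directly from `math.dist`.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two rating vectors (1 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def pearson_correlation(a, b):
    """Cosine similarity of the mean-centered vectors, ranging from -1 to 1."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)

# Hypothetical ratings given by three users to the same three items.
user_1 = [1.0, 2.0, 3.0]
user_2 = [2.0, 4.0, 6.0]   # same taste as user_1, but rates higher
user_3 = [3.0, 2.0, 1.0]   # opposite taste

euclidean_1_2 = math.dist(user_1, user_2)
```

Pearson correlation removes each user's average rating first, so it is less affected by users who systematically rate high or low.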

Types of recommendation systems:

- Content-Based: Tries to recommend items to a user based on their profile.

Steps for content-based recommender system:

- Collaborative Filtering: Based on the fact that relationships exist between products and people's
interests. This algorithm has 2 different approaches:
1. User-Based collaborative filtering: Based on the user's neighbors, that is, users with similar preferences.
The steps are:
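
One possible sketch of user-based collaborative filtering, assuming cosine similarity over co-rated items and a similarity-weighted average of the neighbors' ratings. The users, items and ratings are hypothetical, and this is one common formulation rather than the only one.

```python
import math

def similarity(ratings, user_a, user_b):
    """Cosine similarity computed only over items both users have rated."""
    common = set(ratings[user_a]) & set(ratings[user_b])
    if not common:
        return 0.0
    dot = sum(ratings[user_a][i] * ratings[user_b][i] for i in common)
    norm_a = math.sqrt(sum(ratings[user_a][i] ** 2 for i in common))
    norm_b = math.sqrt(sum(ratings[user_b][i] ** 2 for i in common))
    return dot / (norm_a * norm_b)

def predict_rating(ratings, user, item):
    """Weighted average of the ratings neighbors gave to `item`."""
    weighted_sum = 0.0
    weight_total = 0.0
    for other in ratings:
        if other == user or item not in ratings[other]:
            continue
        weight = similarity(ratings, user, other)
        weighted_sum += weight * ratings[other][item]
        weight_total += weight
    return weighted_sum / weight_total if weight_total else None

# Hypothetical user-item ratings; "ana" has not rated item "D" yet.
ratings = {
    "ana":   {"A": 5, "B": 4, "C": 1},
    "ben":   {"A": 5, "B": 5, "C": 1, "D": 5},
    "carla": {"A": 1, "B": 1, "C": 5, "D": 1},
}
predicted_d = predict_rating(ratings, "ana", "D")
```

Because "ana" rates items much like "ben" and unlike "carla", the prediction for "D" leans toward the rating "ben" gave.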
