Machine Learning at Geeky Base

Machine Learning
Kan Ouivirach
Research & Development Engineer
www.kanouivirach.com

Outline
• What is Machine Learning?
• Main Types of Learning
• Model Validation, Selection, and Evaluation
• Applied Machine Learning Process
• Cautions

What is Machine Learning?
http://www.bigdata-madesimple.com/

–Arthur Samuel (1959)
“Field of study that gives computers the ability
to learn without being explicitly programmed.”

–Tom Mitchell (1997)
“A computer program is said to learn from
experience E with respect to some class of
tasks T and performance measure P, if its
performance at tasks in T, as measured by P,
improves with experience E.”

Statistics vs. Data Mining vs. Machine Learning vs. …?

Programming vs. Machine Learning?

Programming?
“Given a specification of a function f,
implement f that meets the specification.”
Machine Learning?
“Given example (x, y) pairs, induce f such
that y = f(x) for given pairs and generalizes
well for unseen x.”
–Peter Norvig (2014)
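
The contrast can be sketched in a few lines of Python. In this minimal, illustrative example, `f_programmed` is written directly from a specification, while `induce_line` estimates a function from example pairs by ordinary least squares. The function names and the toy data are assumptions made for illustration; they do not come from the talk.

```python
def f_programmed(x):
    """Programming: implement f directly from its specification (here, 2x + 1)."""
    return 2 * x + 1


def induce_line(pairs):
    """Machine learning: induce f(x) = slope * x + intercept from (x, y)
    examples using ordinary least squares."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


# Hypothetical training examples that happen to follow y = 2x + 1.
examples = [(0, 1), (1, 3), (2, 5), (3, 7)]
f_learned = induce_line(examples)

# Both functions are applied to an x that was not among the examples.
print(f_programmed(10), f_learned(10))
```

The learned function never sees the rule itself, only the pairs, which is why generalizing to unseen x is the hard part.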

Why is Machine Learning so hard?
http://veronicaforand.com/

http://www.thinkgeek.com/product/f0ba/
What do you see?

Dog and Cat?
http://thisvsthatshow.com/

Applications of Machine Learning
• Search Engines
• Medical Diagnosis
• Object Recognition
• Stock Market Analysis
• Credit Card Fraud Detection
• Speech Recognition
• etc.

Recommendation System on Amazon.com

Advertisement System on Facebook.com

Speech Recognition from Microsoft

Robot Localization
https://github.com/mjl/particle_filter_demo

Main Types of Learning
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning

Supervised Learning
y = f(x)
Given (x, y) pairs, find a function f that will map
new x to a proper y.

Supervised Learning Problems
• Regression
• Classification

http://thisvsthatshow.com/
Classification

k-Nearest Neighbors
http://bdewilde.github.io/blog/blogger/2012/10/26/classification-of-hand-written-digits-3/
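
A minimal sketch of k-nearest neighbors in plain Python, assuming Euclidean distance and a simple majority vote. The function name `knn_predict` and the toy "cat"/"dog" points are hypothetical and only serve to illustrate the idea.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Sort the training examples by Euclidean distance to the query point.
    neighbors = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: (features, label).
train = [
    ((1.0, 1.0), "cat"),
    ((1.2, 0.8), "cat"),
    ((0.9, 1.3), "cat"),
    ((4.0, 4.2), "dog"),
    ((4.1, 3.9), "dog"),
    ((3.8, 4.0), "dog"),
]

print(knn_predict(train, (1.1, 1.0)))
print(knn_predict(train, (4.0, 4.0)))
```

There is no training step here: the "model" is the stored data, and all the work happens at prediction time.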

Perceptron
One or more inputs, a processor, and a single output.
[Diagram: Input 0 and Input 1 feed into a Processor, which produces one Output]

Perceptron
https://datasciencelab.wordpress.com/2014/01/10/machine-learning-classics-the-perceptron/
The processor computes a weighted sum of its inputs: w0 * x0 + w1 * x1
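
A minimal sketch of the perceptron learning rule for two inputs. The slide shows only the weighted sum; the bias term, the threshold at zero, the learning rate of 1, and the logical-AND training set are assumptions added here so that the example is complete.

```python
def predict(weights, bias, inputs):
    """Fire (1) if the weighted sum of the inputs plus the bias is positive."""
    total = bias + sum(w * x for w, x in zip(weights, inputs))
    return 1 if total > 0 else 0


def train_perceptron(samples, epochs=20, learning_rate=1):
    """Perceptron rule: nudge the weights by the prediction error on each sample."""
    weights = [0, 0]
    bias = 0
    for _ in range(epochs):
        for inputs, target in samples:
            error = target - predict(weights, bias, inputs)
            weights = [w + learning_rate * error * x for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias


# Hypothetical training set: the logical AND of Input 0 and Input 1.
and_samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(and_samples)
predictions = [predict(weights, bias, inputs) for inputs, _ in and_samples]
print(weights, bias, predictions)
```

AND is linearly separable, so the rule settles on a separating line after a few passes; a problem such as XOR would never converge with a single perceptron.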

Probability Theory
https://seisanshi.wordpress.com/tag/probability/

Naive Bayes
[Diagram: class node Ck with arrows to the attribute nodes A1, A2, A3, …, An]
Bayes' rule:
P(Ck | A1, …, An) = P(Ck) * P(A1, …, An | Ck) / P(A1, …, An)
With the independence assumption, we then have
P(Ck | A1, …, An) ∝ P(Ck) * Π_i P(Ai | Ck)

Naive Bayes
No.  Content              Spam?
1    Party                Yes
2    Sale Discount        Yes
3    Party Sale Discount  Yes
4    Python Party         No
5    Python Programming   No

Naive Bayes
No.  Content              Spam?
1    Party                Yes
2    Sale Discount        Yes
3    Party Sale Discount  Yes
4    Python Party         No
5    Python Programming   No
P(Spam) = ?                  P(NotSpam) = ?
P(Party | Spam) = ?          P(Party | NotSpam) = ?
P(Programming | Spam) = ?    P(Programming | NotSpam) = ?

Naive Bayes
No.  Content              Spam?
1    Party                Yes
2    Sale Discount        Yes
3    Party Sale Discount  Yes
4    Python Party         No
5    Python Programming   No
P(Spam) = 3/5                P(NotSpam) = 2/5
P(Party | Spam) = 2/3        P(Party | NotSpam) = 1/2
P(Programming | Spam) = 0    P(Programming | NotSpam) = 1/2

Naive Bayes
P(Spam | Party, Programming) ∝ 3/5 * 2/3 * 0 = 0
P(NotSpam | Party, Programming) ∝ 2/5 * 1/2 * 1/2 = 0.1
P(NotSpam | Party, Programming) > P(Spam | Party, Programming)
“Party Programming” is NOT spam.
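
The arithmetic above can be reproduced with a short sketch. It counts, for each class, the fraction of messages that contain a word, exactly as in the five-message table, and uses `fractions.Fraction` to keep the values exact. The helper names `prior`, `likelihood`, and `score` are illustrative assumptions, and no smoothing is applied, so a word never seen in a class yields a score of zero, as on the slide.

```python
from fractions import Fraction

# The five training messages from the slides: (content, is_spam).
messages = [
    ("Party", True),
    ("Sale Discount", True),
    ("Party Sale Discount", True),
    ("Python Party", False),
    ("Python Programming", False),
]


def prior(is_spam):
    """P(class): fraction of all messages that belong to the class."""
    count = sum(1 for _, label in messages if label == is_spam)
    return Fraction(count, len(messages))


def likelihood(word, is_spam):
    """P(word | class): fraction of the class's messages containing the word."""
    docs = [text.split() for text, label in messages if label == is_spam]
    return Fraction(sum(1 for words in docs if word in words), len(docs))


def score(words, is_spam):
    """Unnormalized posterior: P(class) times the product of P(word | class)."""
    result = prior(is_spam)
    for word in words:
        result *= likelihood(word, is_spam)
    return result


query = ["Party", "Programming"]
spam_score = score(query, True)       # 3/5 * 2/3 * 0
not_spam_score = score(query, False)  # 2/5 * 1/2 * 1/2
print(spam_score, not_spam_score)
print("spam" if spam_score > not_spam_score else "not spam")
```

The scores are proportional to the posteriors rather than equal to them, because the shared denominator P(A1, …, An) is dropped; comparing them is still enough to pick the class.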

Decision Tree
Play tennis?
• Outlook = Sunny → check Humidity
  • High → No
  • Normal → Yes
• Outlook = Overcast → Yes
• Outlook = Rain → check Wind
  • Strong → No
  • Weak → Yes

Day  Outlook   Temp  Humidity  Wind    Play
D1   Sunny     Hot   High      Weak    No
D2   Sunny     Hot   High      Strong  No
D3   Overcast  Mild  High      Strong  Yes
D4   Rain      Cool  Normal    Strong  No
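
The tree on this slide can be written as nested conditions. The sketch below hard-codes the tree rather than learning it from data, and checks it against the four example days; the function name `play_tennis` is an illustrative assumption.

```python
def play_tennis(outlook, humidity, wind):
    """Walk the decision tree from the slide: Outlook first, then Humidity or Wind."""
    if outlook == "Overcast":
        return "Yes"
    if outlook == "Sunny":
        return "No" if humidity == "High" else "Yes"
    if outlook == "Rain":
        return "No" if wind == "Strong" else "Yes"
    raise ValueError(f"unknown outlook: {outlook!r}")


# The four example days: (day, outlook, temp, humidity, wind, play).
days = [
    ("D1", "Sunny", "Hot", "High", "Weak", "No"),
    ("D2", "Sunny", "Hot", "High", "Strong", "No"),
    ("D3", "Overcast", "Mild", "High", "Strong", "Yes"),
    ("D4", "Rain", "Cool", "Normal", "Strong", "No"),
]

for day, outlook, _temp, humidity, wind, play in days:
    print(day, play_tennis(outlook, humidity, wind), "expected", play)
```

Temperature is never consulted, which illustrates that a tree may ignore attributes that do not help separate the classes.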

Support Vector Machines
“Kernel Trick”
[Figure: the same data plotted in the current coordinate system (x, y) and in a new coordinate system (x, z)]
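
A minimal sketch of why a change of coordinates helps. It assumes a hypothetical mapping z = x² + y² and toy points arranged as an inner group surrounded by an outer ring; neither is taken from the slide. Note that this builds the new coordinate explicitly, whereas the kernel trick proper obtains the same effect through a kernel function without ever computing the new coordinates.

```python
def feature_map(x, y):
    """Map a point from (x, y) to (x, z), where z is its squared distance from the origin."""
    return x, x * x + y * y


# Hypothetical data: no straight line in the (x, y) plane separates these groups.
inner = [(0.5, 0.0), (0.0, -0.5), (-0.3, 0.4)]
outer = [(2.0, 0.0), (0.0, -2.0), (-1.5, 1.5)]

inner_mapped = [feature_map(x, y) for x, y in inner]
outer_mapped = [feature_map(x, y) for x, y in outer]

# In the (x, z) plane the horizontal line z = 1 separates the two groups.
threshold = 1.0
print(all(z < threshold for _, z in inner_mapped))
print(all(z > threshold for _, z in outer_mapped))
```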

Support Vector Machines
http://www.mblondel.org/journal/2010/09/19/support-vector-machines-in-python/
3 support vectors

Unsupervised Learning
f(x)
Given x, find a function f that gives a compact
description of x.

Unsupervised Learning
• k-Means Clustering
• Hierarchical Clustering
• Gaussian Mixture Models (GMMs)

k-Means Clustering
http://stackoverflow.com/questions/24645068/k-means-clustering-major-understanding-issue/24645894#24645894
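
A minimal sketch of k-means (Lloyd's algorithm) in plain Python: assign each point to its nearest centroid, move each centroid to the mean of its points, and repeat. The fixed iteration count, the hand-picked starting centroids, and the toy points are assumptions for illustration; practical implementations usually choose the starting centroids randomly and stop when assignments no longer change.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points to centroids and recomputing centroids."""
    centroids = list(centroids)
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for point in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: math.dist(point, centroids[i]),
            )
            clusters[nearest].append(point)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            tuple(sum(coords) / len(cluster) for coords in zip(*cluster))
            if cluster else centroid
            for cluster, centroid in zip(clusters, centroids)
        ]
    return centroids, clusters


# Hypothetical data: two visibly separate groups of points.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, centroids=[(0, 0), (10, 10)])
print(centroids)
print(clusters)
```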

Anomaly Detection
http://modernfarmer.com/2013/11/farm-pop-idioms/

http://boxesandarrows.com/designing-screens-using-cores-and-paths/

Reinforcement Learning
y = f(x), with feedback z
Given x and a feedback signal z (such as a reward), find a function f that generates y.

Flappy Bird Hack using
Reinforcement Learning
http://sarvagyavaish.github.io/FlappyBirdRL/

I’ve got a perfect classifier!
https://500px.com/photo/65907417/like-a-frog-trapped-inside-a-coconut-shell-by-ellena-susanti

Overfitting (High Variance)
http://blog.csdn.net/love_tea_cat/article/details/25972921
[Figure: a normal fit next to an overfitted one]

Underfitting (High Bias)
http://blog.csdn.net/love_tea_cat/article/details/25972921
[Figure: a normal fit next to an underfitted one]

How to Avoid Overfitting and Underfitting
• Using more data does NOT always help.
• Recommendations:
  • find a good number of features;
  • perform cross validation;
  • use regularization when overfitting is found.

Model Selection
• Use cross validation to find the best parameters for the model.
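
A minimal sketch of k-fold cross validation used for parameter selection. It splits the data into contiguous folds, scores a one-dimensional nearest-neighbor classifier for several candidate values of k, and keeps the best. The helper names, the candidate values, and the toy data are assumptions for illustration; real data should normally be shuffled before splitting.

```python
from collections import Counter


def k_fold_indices(n_samples, n_folds):
    """Yield (train_indices, test_indices) for each of n_folds contiguous folds."""
    sizes = [n_samples // n_folds + (1 if i < n_samples % n_folds else 0)
             for i in range(n_folds)]
    start = 0
    for size in sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


def knn_1d(train_x, train_y, query, k):
    """Majority vote among the k training values closest to `query`."""
    nearest = sorted(zip(train_x, train_y), key=lambda pair: abs(pair[0] - query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def cv_accuracy(xs, ys, k, n_folds=5):
    """Average accuracy over the folds: train on the rest, test on the held-out fold."""
    correct = 0
    for train_idx, test_idx in k_fold_indices(len(xs), n_folds):
        train_x = [xs[i] for i in train_idx]
        train_y = [ys[i] for i in train_idx]
        correct += sum(knn_1d(train_x, train_y, xs[i], k) == ys[i] for i in test_idx)
    return correct / len(xs)


# Hypothetical data: small values belong to class 0, large values to class 1.
xs = [1, 8, 2, 9, 3, 10, 4, 11, 5, 12]
ys = [0, 1, 0, 1, 0, 1, 0, 1, 0, 1]

scores = {k: cv_accuracy(xs, ys, k) for k in (1, 3, 5)}
best_k = max(scores, key=scores.get)
print(scores, best_k)
```

Every sample is used for testing exactly once and never appears in the training set of its own fold, which is what makes the score an honest estimate of performance on unseen data.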

Metrics
• Accuracy
• True Positive, False Positive, True Negative, False Negative
• Precision and Recall
• F1 Score
• etc.

Precision and Recall
http://en.wikipedia.org/wiki/Precision_and_recall
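
The metrics listed earlier can all be derived from the four confusion counts. The sketch below uses the standard definitions (precision = TP / (TP + FP), recall = TP / (TP + FN), F1 = harmonic mean of the two); the function names and the example labels are illustrative assumptions.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true positives, false positives, true negatives, and false negatives."""
    tp = fp = tn = fn = 0
    for truth, guess in zip(y_true, y_pred):
        if guess == positive and truth == positive:
            tp += 1
        elif guess == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def summarize(y_true, y_pred):
    """Compute accuracy, precision, recall, and F1 from the confusion counts."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical labels: 4 positives and 6 negatives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]

metrics = summarize(y_true, y_pred)
print(metrics)
```

In this example the classifier is right 70% of the time but finds only half of the positives, which is why accuracy alone can be misleading.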

Applied Machine Learning Process
http://machinelearningmastery.com/process-for-working-through-machine-learning-problems/

Define the Problem
https://youmustdesireit.wordpress.com/2014/03/05/developing-and-nurturing-creative-problem-solving/

Prepare Data
http://vpnexpress.net/big-data-use-a-vpn-block-data-collection/

Spot Check Algorithms
https://www.flickr.com/photos/withassociates/4385364607/sizes/l/

If two models fit the data equally well,
choose the simpler one.

Improve Results
http://www.mobilemechanicprosaustin.com/

Present Results
http://www.langevin.com/blog/2013/04/25/5-tips-for-projecting-confidence/presentation-skills-2/

Some Cautions
http://newventurist.com/
• Curse of dimensionality
• Correlation does NOT imply causation.
• Learn many models, not just ONE.
• More data beats a clever algorithm.
• Data alone are not enough.
Source: “A Few Useful Things to Know about Machine Learning,” Pedro Domingos (2012)

— Feature engineering is the key. —

Example of Feature Engineering
Are the data good to model the area’s cost?

Width (m)  Length (m)  Cost (baht)
100        100         1,200,000
500        50          1,300,000
100        80          1,000,000
400        100         1,500,000

Engineer features. They look better here.

Size (m²)  Cost (baht)
10,000     1,200,000
25,000     1,300,000
8,000      1,000,000
40,000     1,500,000
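
The same transformation as a short sketch: width and length are combined into a single engineered feature, the area, before modeling. The dictionary keys and the function name `add_area` are illustrative assumptions.

```python
# The four plots from the slide.
plots = [
    {"width_m": 100, "length_m": 100, "cost_baht": 1_200_000},
    {"width_m": 500, "length_m": 50, "cost_baht": 1_300_000},
    {"width_m": 100, "length_m": 80, "cost_baht": 1_000_000},
    {"width_m": 400, "length_m": 100, "cost_baht": 1_500_000},
]


def add_area(plot):
    """Replace width and length with one engineered feature: area in square metres."""
    return {
        "area_m2": plot["width_m"] * plot["length_m"],
        "cost_baht": plot["cost_baht"],
    }


engineered = [add_area(plot) for plot in plots]
for row in engineered:
    print(row["area_m2"], row["cost_baht"])
```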

Deep Learning at Microsoft’s Speech Group

Let’s get our hands dirty!
https://github.com/zkan/intro-to-machine-learning
