intro
1. Foundations
2. Regression
3. Classification
4. Clustering & Retrieval
5. Recommender Systems
6. Capstone
Data → ML Method (Classifier) → Intelligence
Input x: features derived from data
Learn x → y relationship
Predict y: categorical “output”, class or label
5 ©2015-2016 Emily Fox & Carlos Guestrin Machine Learning Specialization
Sentiment classifier
Input: x = sentence from review, e.g. "Easily best sushi in Seattle."
Sentence Sentiment Classifier (MODEL)
Output: y = predicted class (sentiment)
Input: x = webpage
Output: y = Education, Finance, or Technology
Spam filtering
Input: x = text of email, sender, IP,…
Output: y = Spam or Not spam
Image classification
Input: x = image pixels
Output: y = predicted object
Personalized medical diagnosis
Input: x → Disease Classifier (MODEL) → Output: y
Output: y = Healthy, Cold, Flu, Pneumonia, …
Impact of classification
• Core concept
• Visual
• Algorithm
• Practical
• Implement
• Advanced topics (OPTIONAL)
Overview of content
• Logistic regression (Modules 1, 2, 3)
• Handling missing data (Module 6)
• Stochastic gradient (Module 9)
• Ensembles
• Boosting
• Online learning
[Figure: linear classifier plotted against #awesome (axis ticks 0, 1, 2, 3, 4, …), with one region where Score(x) > 0 and one where Score(x) < 0]
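The sign rule on Score(x) can be sketched in a few lines of Python. This is a minimal illustration, not the course's implementation. The names `score`, `predict`, and `example_weights`, the word "awful", and all weight values are hypothetical and chosen only for the example. Assigning Score(x) = 0 to the negative class is also an arbitrary choice made here.

```python
def score(counts, weights, intercept=0.0):
    """Score(x): intercept plus the weighted sum of word counts."""
    return intercept + sum(weights.get(word, 0.0) * n for word, n in counts.items())


def predict(counts, weights, intercept=0.0):
    """Return +1 when Score(x) > 0, otherwise -1 (ties go to -1 by assumption)."""
    return 1 if score(counts, weights, intercept) > 0 else -1


# Hypothetical weights, for illustration only.
example_weights = {"awesome": 1.0, "awful": -1.5}

# Two "awesome" and one "awful": Score(x) = 2 * 1.0 + 1 * (-1.5) = 0.5 > 0
print(predict({"awesome": 2, "awful": 1}, example_weights))
```

Words without an entry in the weight table contribute nothing to the score in this sketch.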
Module 1: Logistic regression represents probabilities
$$\hat{P}(y = +1 \mid x, \hat{w}) = \frac{1}{1 + e^{-\hat{w}^{T} h(x)}}$$
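As a rough sketch, the probability above can be evaluated directly once the coefficients ŵ and the feature vector h(x) are available as plain lists. The function names `dot` and `prob_positive` are hypothetical, and the two-branch form is an assumption added here to avoid overflow for large negative scores. It computes the same quantity as the formula.

```python
import math


def dot(w, h):
    """Inner product w^T h for two equal-length sequences."""
    return sum(wj * hj for wj, hj in zip(w, h))


def prob_positive(w_hat, h_x):
    """P(y = +1 | x, w_hat) = 1 / (1 + exp(-w_hat^T h(x)))."""
    s = dot(w_hat, h_x)
    if s >= 0:
        return 1.0 / (1.0 + math.exp(-s))
    # For negative scores, use the equivalent form exp(s) / (1 + exp(s)),
    # which cannot overflow.
    e = math.exp(s)
    return e / (1.0 + e)


# A score of 0 gives probability 0.5.
print(prob_positive([0.0, 0.0], [1.0, 2.0]))
```

Positive scores map to probabilities above 0.5 and negative scores to probabilities below 0.5, so thresholding the probability at 0.5 matches thresholding the score at 0.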
[Figure: training error as a function of model complexity]
[Figure: decision tree. Nodes: Credit? (excellent / fair / poor), Income? (high / Low), Term? (3 years / 5 years). Leaves: Safe, Risky]
[Figure: Logistic Regression. Panels: Degree 1 features, Degree 2 features, Degree 6 features]
Simplify
Boosting
Reviews: need an automated, “authentic” marketing campaign
PRECISION: Did I (mistakenly) show a negative sentence???
RECALL: Did I not show a (great) positive sentence???
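The two questions correspond to the standard definitions: precision is the fraction of shown (predicted positive) sentences that are truly positive, and recall is the fraction of truly positive sentences that were shown. A minimal sketch, assuming labels are encoded as +1 and -1. The function name `precision_recall` and the convention of returning 0.0 when a denominator is zero are assumptions made for this example.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Return (precision, recall) for the given positive label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)  # shown, truly positive
    fp = sum(1 for t, p in pairs if p == positive and t != positive)  # shown, but negative
    fn = sum(1 for t, p in pairs if p != positive and t == positive)  # positive, not shown
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Four truly positive sentences, only one of them shown, no negatives shown:
# nothing negative slipped through, but most positives were missed.
print(precision_recall([1, 1, 1, 1, -1], [1, -1, -1, -1, -1]))
```

In that usage example, precision is high and recall is low, which is the trade-off the two questions describe.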
Module 9: Scaling to huge datasets & online learning
Gradient
Better
• Course 2: Regression
  - Data representation (input, output, features)
  - Linear regression model
  - Basic ML concepts:
    • ML algorithm
    • Gradient descent
    • Overfitting
    • Validation set and cross-validation
    • Bias-variance tradeoff
    • Regularization