L21 Intro ML
L21 Intro ML
Note: These slides have been borrowed and/or adapted from the Machine Learning courses offered by Pedro Domingos at the University of Washington: http://www.cs.washington.edu/education/courses/cse546/10wi Raymond Mooney at the University of Texas at Austin: http://www.cs.utexas.edu/~mooney/cs391L
A Few Quotes
A breakthrough in machine learning would be worth ten Microsofts (Bill Gates, Chairman, Microsoft) Machine learning is the next Internet
(Tony Tether, Director, DARPA)
Web rankings today are mostly a matter of machine learning (Prabhakar Raghavan, Dir. Research, Yahoo) Machine learning is going to result in a real revolution
(Greg Papadopoulos, CTO, Sun)
Magic?
No, more like gardening Seeds = Algorithms Nutrients = Data Gardener = You Plants = Programs
What is Learning?
Herbert Simon: Learning is any process by which a system improves performance from experience.
10
Sample Applications
Web search Computational biology Finance E-commerce Space exploration Robotics Information extraction Social networks Debugging [Your favorite area]
ML in a Nutshell
Tens of thousands of machine learning algorithms Hundreds new every year Every machine learning algorithm has three components:
Representation Evaluation Optimization
Representation
Decision trees Sets of rules / Logic programs Instances Graphical models (Bayes/Markov nets) Neural networks Support vector machines Model ensembles Etc.
Evaluation
Accuracy Precision and recall Squared error Likelihood Posterior probability Cost / Utility Margin Entropy K-L divergence Etc.
Optimization
Combinatorial optimization
E.g.: Greedy search
Convex optimization
E.g.: Gradient descent
Constrained optimization
E.g.: Linear programming
Types of Learning
Supervised (inductive) learning
Training data includes desired outputs
Unsupervised learning
Training data does not include desired outputs
Semi-supervised learning
Training data includes a few desired outputs
Reinforcement learning
Rewards from sequence of actions
Inductive Learning
Given examples of a function (X, F(X)) Predict function F(X) for new examples X
Discrete F(X): Classification Continuous F(X): Regression F(X) = Probability(X): Probability estimation
ML in Practice
Understanding domain, prior knowledge, and goals Data integration, selection, cleaning, pre-processing, etc. Learning models Interpreting results Consolidating and deploying discovered knowledge Loop
Inductive Learning