Course Overview
Course Overview
Machine Intelligence
Dr. Ghada Khoriba
Dr. Ensaf Hussien
Dr. Nermin Negied
Dr. Wael Gomaa
Spring 2024
1
Agenda
Course Objectives
Grading Distribution
3
4
5
Tentative Course Topics
1.Machine Learning Basics
2.Classifying with k-Nearest Neighbors
3.Splitting datasets one feature at a time: decision trees
4.Classifying with probability theory: naïve Bayes
5.Logistic regression
6.Support vector machines
7.Model Evaluation and Improvement: Cross-validation, Grid Search, Evaluation Metrics,
and Scoring
8.Ensemble learning and improving classification with the AdaBoost meta-algorithm.
9.Introduction to Neural Networks - Building NN for classification (binary/multiclass)
10.Convolutional Neural Network (CNN)
11.Pretrained models (VGG, Alexnet,..)
12.Machine learning pipeline and use cases.
6
References
7
Grading Criteria
Grading:
Course Work:
o Programming Assignments (2 times checkpoint): 15% (DataCamp group/ Machine learning track)
o Lecture Quizzes (2 times): 10%
o Lab Practical Quizzes (2 times): 10%
o Project:
Paper Report and UGRF: 5%
Project Implementation and Discussion: 20%
Midterm: 15%
Final Exam: 25%
8
Introduction to Machine Learning
The set of all tasks in which a computer can make decisions based on data
How humans make decisions. we make decisions in the following two ways:
•By using logic and reasoning
•By using our experience
MODEL
• A set of rules that represent our data and
can be used to make predictions
ALGORITHM
A procedure, or a set of steps, used to
solve a problem or perform a
computation. Usually, the goal of an
algorithm is to build a model.
Ref: https://miro.medium.com/ 10
ML Terminologies
parsimony, a simpler model with fewer parameters is favored over more complex models with more parameters, provided the models fit the data similarly well. 11
ML Applications
12
ML Applications
13
ML Applications
14
ML Applications
15
ML Applications
16
ML Applications
17
We can describe problems and their solutions using six characteristics
1. Problem class: What is the nature of the training 4. Model type: Will an intermediate model be
data and what kinds of queries will be made at made? What aspects of the data will be
the testing time? modeled? How will the model be used to make
2. Assumptions: What do we know about the predictions?
source of the data or the form of the solution? 5. Model class: What particular parametric class of
3. Evaluation criteria: What is the goal of the models will be used? What criterion will we use
prediction or estimation system? How will the to pick a particular model from the model class?
answers to individual queries be evaluated? How 6. Algorithm: What computational process will be
will the overall performance of the system be used to fit the model to the data and/or to make
measured? predictions?
18
Supervised, Unsupervised, and Reinforcement learning
Supervised learning can be subdivided into classification and regression based on the quantity we are trying
to predict.
• If our output 𝑦 is a discrete quantity (e.g. 𝐾 distinct classes) we have a classification problem.
• On the other hand, if our output y is a continuous quantity (e.g. a real number such as stock price) we have
a regression problem.
Thus, the nature of the problem changes based on the quantity y we are trying to predict. We want to get as
close as possible to the ground truth value of y.
22
Ref: Grokking Machine Learning, Luis G. Serrano 23
Unsupervised learning
The branch of machine learning that works with unlabeled data
• Given data about the size of houses on the real estate market, try to
predict their price. R
• Given a picture of Male/Female, We have to predict his/her age on
the basis of given picture. R
• Given a picture of Male/Female, We have to predict Whether She is
of High school, College, Graduate age.C
• Banks have to decide whether or not to give a loan to someone on the
basis of his credit history. C
26
How to find a Project Idea
27
27
Machine Learning Project
28
Read a Paper
29
https://saiamrit.github.io/technical-blog
Organization of a Paper
The majority of papers follow, more or less, the same convention of organization:
1.Title: Hopefully catchy ! Includes additional info about the authors and their institutions.
2.Abstract: High level summary of the entire work of the paper.
3.Introduction: Background info on the field and related research leading up to this paper.
4.Related works: Describe the already existing literature on the particular domain.
5.Methods: Highly detailed section on the study that was conducted, how it was set up, any
instruments used, and finally, the process and workflow.
6.Results: Authors talk about the data that was created or collected, it should read as an unbiased
account of what occurred.
7.Discussions: Here is where authors interpret the results, and convince the readers of their findings
and hypothesis.
8.References: Any other work that was cited in the body of the text will show up here.
9.Appendix: More figures, additional treatments on related math, or extra items of interest can find
their way in an appendix.
30
https://saiamrit.github.io/technical-blog
Read a Paper
Please check Andrew’s lecture, https://youtu.be/733m6qBH-jI?si=2t4MXkQL3S3u-5w9
3.Third Pass: Reading the entire paper with the intention to reimplement it.
31
https://saiamrit.github.io/technical-blog
How to write?
32
https://www.turing.com/kb/how-to-write-research-paper-in-machine-learning-area
33
https://www.turing.com/kb/how-to-write-research-paper-in-machine-learning-area