0% found this document useful (0 votes)
26 views

Course Logistics and Introduction: CS771: Introduction To Machine Learning Piyush Rai

This document provides information about the course CS771: Introduction to Machine Learning. It outlines the course logistics including the structure of lectures, additional discussion sessions, and materials being hosted on mooKIT. It describes the course team, workload which includes homework assignments and exams, and recommended textbooks and references. It discusses the course goals of introducing foundations of machine learning models and algorithms and developing the ability to understand, implement, evaluate, and choose the right ML approach for problems. It also provides a brief introduction to machine learning concepts.

Uploaded by

Raja
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
26 views

Course Logistics and Introduction: CS771: Introduction To Machine Learning Piyush Rai

This document provides information about the course CS771: Introduction to Machine Learning. It outlines the course logistics including the structure of lectures, additional discussion sessions, and materials being hosted on mooKIT. It describes the course team, workload which includes homework assignments and exams, and recommended textbooks and references. It discusses the course goals of introducing foundations of machine learning models and algorithms and developing the ability to understand, implement, evaluate, and choose the right ML approach for problems. It also provides a brief introduction to machine learning concepts.

Uploaded by

Raja
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 23

Course Logistics and Introduction

CS771: Introduction to Machine Learning


Piyush Rai
2
Course Logistics
 Course Name: Introduction to Machine Learning – CS771
 An introductory course – supposed to be your first intro to the subject

 Usually 3 lectures every week in form of videos (hosted on mooKIT)


 Think can these as Mon/Wed/Fri lectures in the usual classroom setting
 mooKIT URL: https://hello.iitk.ac.in/cs771a/ (CC id and password to be used for login)

 An additional discussion session every Monday, 6pm-7pm (via YouTube Live)

 All material will be posted on the mooKIT page for the course

 Q/A and announcements on Piazza. Please sign up


CS771: Intro to ML
3
Course Team

Soumya Banerjee Shivam Bansal Dhanajit Brahma


[email protected] [email protected] [email protected]

Amit Chandak Neeraj Matiyali Pratik Mazumder


[email protected] [email protected] [email protected]

CS771: Intro to ML
4
Course Team

Avik Pal Niravkumar Panchal Hemant Sadana


[email protected] [email protected] [email protected]

Rahul Sharma Piyush Rai


[email protected] [email protected]

CS771: Intro to ML
5
Workload and Grading Policy
 4 homework assignments (theory + programming) worth 50%
 Theory part: Derivations/analysis
 Programming part: Implement/use ML algos, analysis of results. Must be done in
Python (learn if not already familiar)
 Must be typeset in LaTeX (learn if not already familiar)
 To be submitted via Gradescope (login details will be provided)

 Quizzes and exams (mid-sem and end-sem) worth 50%


 Will be held online – details later
 Exact break-up of individual components will be announced in a few days
Python: https://www.geeksforgeeks.org/python-programming-language/

LaTeX: www.sharelatex.com/blog/latex-guides/beginners-tutorial.html
www.overleaf.com/learn/latex/Tutorials
CS771: Intro to ML
6
Textbook and References
 Many excellent texts but none “required”. Some include:

 Different books might vary in terms of


 Set of topics covered
 Flavor (e.g., classical statistics, deep learning, probabilistic/Bayesian, theory)
 Terminology and notation (beware of this especially)

 We will provide you the reading material from the relevant sources
CS771: Intro to ML
7
Course Goals

Credit: Rishika Agarwal (EE, graduated 2017) CS771: Intro to ML


8
Course Real Goals..
 Introduction to the foundations of machine learning models and algos

 Focus on developing the ability to


 Understand the underlying principles behind ML models and algos
 Understand how to implement and evaluate them
 Understand/develop intuition on choosing the right ML model/algo for your problem

 (Hopefully) inspire you to work on and learn more about ML

 Not an introduction to popular software frameworks and libraries, such as


scikit-learn, PyTorch, Tensorflow, etc
 Can explore once you have some understanding of various ML techniques
CS771: Intro to ML
Introduction to Machine Learning

CS771: Intro to ML
10
Machine Learning (ML)
 Designing algorithms that ingest data and learn a model of the data
 The learned model can be used to
 Detect patterns/structures/themes/trends etc. in the data
 Make predictions about future data and make decisions

 Modern ML algorithms are heavily “data-driven”


 No need to pre-define and hard-code all the rules (infeasible/impossible anyway)
 The rules are not “static”; can adapt as the ML algo ingests more and more data
CS771: Intro to ML
11
ML: From What It Does to How It Does It?
 ML enables intelligent systems to be data-driven rather than rule-driven
 How: By supplying training data and building statistical models of data
 Pictorial illustration of an ML model for binary classification:

A Linear Classifier (the statistical model)

CS771: Intro to ML
12
ML: From What It Does to How It Does It?
 ML enables intelligent systems to be data-driven rather than rule-driven
 How: By supplying training data and building statistical models of data
 Pictorial illustration of an ML model for binary classification:

A Probabilistic Classifier (the statistical model)

P(“cat”|image) P(“dog”|image)

CS771: Intro to ML
13
Overfitting = Bad ML
 Doing perfectly on training data is not good enough

 A good ML model must generalize well on unseen (test data)

 Simpler models should be preferred over more complex ones!


CS771: Intro to ML
14
ML Applications Abound..

Picture courtesy: gizmodo.com,rcdronearena.com,www.wiseyak.com,www.charlesdong.com CS771: Intro to ML


15
Key Enablers for Modern ML
 Availability of large amounts of data to train ML models

 Increased computing power (e.g., GPUs)

CS771: Intro to ML
16
ML: Some Success Stories

CS771: Intro to ML
17
ML: Some Success Stories

CS771: Intro to ML
18
ML: Some Success Stories

Picture courtesy: https://news.microsoft.com/ CS771: Intro to ML


19
ML: Some Success Stories
 Automatic Program Correction

Example from “Compilation error repair: for the student programs, from the student programs”, Ahmed et al (2018) CS771: Intro to ML
20
ML: Some Success Stories
 ML based colorimetry for water quality assessment

 Take uncontaminated water sample

 Spike it with known concentration of various compounds


(e.g., lead, iron, fluoride, etc)

 Dip a test strip (one square to measure each compound)


in the contaminated water for some time.

 Take a picture of the strip using a phone camera to


capture how the colors have changed

 Train an ML model to predict concentration levels of


various compounds based on color levels in the images
(work being done at IITK in collaboration with two startups - Earthface Analytics Pvt Ltd and Kritsnam Technologies Pvt Ltd) CS771: Intro to ML
21
Good ML Systems Should be Fair and Unbiased
 Good ML should not just be about getting high accuracies

 Should also ensure that the ML models are fair and unbiased
Criminals?

Not Criminals?

An image captioning system Don’t want a self-driving car Don’t want a predictive policing
should not always assume a that is more likely to hit black system that predicts criminality
specific gender in examples people than white people using facial features
like the above

 A lot of recent focus on Fairness and Transparency of ML systems


Picture courtesy: Bhargava and Forsyth (2019), https://www.thestranger.com/, Xiaolin Wu and Xi Zhang, “Automated Inference on Criminality Using Face Images” CS771: Intro to ML
22
Looking Back Before We Start: History of ML

- Human-like text generators (GPT-3)

CS771: Intro to ML
23
Next Class
 Various Flavors of ML problems

 Data and features

 Basic mathematical operations on data and features

CS771: Intro to ML

You might also like