Assignment m2 Machine Learning Final

The document discusses the application of machine learning (ML) concepts in real-world scenarios, including supervised learning for sales prediction, unsupervised learning for user grouping in video streaming, and reinforcement learning for drone delivery routes. It highlights the benefits and challenges of each approach while addressing ethical concerns in ML, particularly in healthcare. Additionally, it emphasizes the importance of model evaluation metrics beyond accuracy, suggesting methods like cross-validation to enhance reliability.


Assignment M2: Machine Learning

Abdul Sahil Ansari


24/5

Objective:

This assignment will help you connect the concepts of ML with real-world scenarios.
You are expected to think critically, analyze situations, and explain how ML can be
applied without writing any code.

Part A - Real-World Scenarios

1. Supervised Learning: Supermarket Sales Prediction

Which type of supervised learning (regression or classification) would be suitable here? Why?

Regression would be suitable here. The objective is to predict monthly sales, which is a continuous numerical value. Regression models are designed to predict continuous outcomes, unlike classification models, which predict discrete categories.

Suggest one benefit and one challenge of using supervised learning in this case.

Benefit: Supervised learning can provide highly accurate sales forecasts, enabling the supermarket to optimize inventory management, reduce waste, and plan marketing campaigns more effectively. By learning from historical data, the model can identify complex relationships between factors such as season, advertisements, and pricing, and their impact on sales.

Challenge: A significant challenge is the need for a large amount of high-quality, labeled historical sales data. If the data is incomplete, inconsistent, or lacks relevant features (e.g., competitor pricing, local events), the model's accuracy can be severely impacted. Additionally, the model might struggle to adapt to sudden, unforeseen market changes or new product introductions without retraining.
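As an illustrative sketch of the regression framing (the assignment itself requires no code), a one-variable least-squares fit of monthly sales against a hypothetical feature such as advertising spend might look like this; all numbers are invented:

```python
# Minimal one-variable least-squares regression, illustrating the
# "predict a continuous value" framing. All data is invented.
ad_spend = [10, 15, 20, 25, 30]          # hypothetical monthly ad spend (k$)
sales    = [120, 150, 185, 210, 245]     # hypothetical monthly sales (k$)

n = len(ad_spend)
mean_x = sum(ad_spend) / n
mean_y = sum(sales) / n

# Closed-form slope and intercept for simple linear regression.
slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(ad_spend, sales)) \
        / sum((x - mean_x) ** 2 for x in ad_spend)
intercept = mean_y - slope * mean_x

def predict(x):
    return intercept + slope * x

print(round(predict(35), 1))  # → 275.0, the forecast for a new ad-spend level
```

A real forecaster would use many features (season, pricing, promotions) and a library model, but the goal is the same: output a continuous number.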

2. Unsupervised Learning: Video Streaming Platform User Grouping

Which unsupervised learning technique could be useful?

Clustering techniques, such as K-Means clustering or hierarchical clustering, would be useful. These algorithms can group users into distinct segments based on similarities in their viewing habits (e.g., genres watched, watch times, frequency of viewing, interaction with recommendations) without requiring pre-defined labels.
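A minimal sketch of the K-Means idea on toy data, assuming each user is summarized by a single feature (say, hours watched per week); a real platform would use many features and a library implementation:

```python
import random

# Toy K-Means on one feature: hours watched per week (invented values).
hours = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0]
k = 2
random.seed(0)
centroids = random.sample(hours, k)

for _ in range(10):  # a few Lloyd's-algorithm iterations
    # Assignment step: attach each user to the nearest centroid.
    clusters = [[] for _ in range(k)]
    for h in hours:
        nearest = min(range(k), key=lambda i: abs(h - centroids[i]))
        clusters[nearest].append(h)
    # Update step: move each centroid to the mean of its cluster.
    centroids = [sum(c) / len(c) if c else centroids[i]
                 for i, c in enumerate(clusters)]

print(sorted(round(c, 2) for c in centroids))  # → [1.5, 9.5]
```

The two centroids settle on a "light viewer" group and a "heavy viewer" group without any labels being provided, which is the essence of unsupervised segmentation.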

How could this grouping improve customer experience?

Grouping users based on viewing habits allows the platform to provide highly
personalized movie recommendations. Instead of generic suggestions, users
would receive recommendations tailored to their specific cluster's preferences,
leading to a more relevant and enjoyable content discovery experience. This
personalization can increase user engagement, satisfaction, and retention, as
users feel the platform understands their tastes and offers content they are
genuinely interested in.

3. Reinforcement Learning: Drone Delivery Routes

How does reinforcement learning apply here?

Reinforcement learning (RL) is highly applicable here because the drones need to
learn optimal delivery routes through trial and error in a dynamic environment.
The drone acts as an agent, the environment is the delivery area (including
obstacles, traffic, weather), and the actions are movements and route choices.
The drone receives rewards for successful and efficient deliveries (e.g., reaching
the destination quickly, avoiding obstacles, minimizing fuel consumption) and
penalties for undesirable outcomes (e.g., delays, crashes, inefficient routes). Over
time, through continuous interaction with the environment and receiving
feedback (rewards/penalties), the RL algorithm will learn a policy that dictates
the best sequence of actions to take to optimize delivery routes.
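The agent/environment/reward loop described above can be sketched with tabular Q-learning on a toy one-dimensional corridor standing in for the delivery area; real drone routing would involve continuous states, rich sensors, and simulators, so this is purely illustrative:

```python
import random

random.seed(1)
# Toy corridor: states 0..4; the depot is state 0, the customer is state 4.
# Actions: 0 = move left, 1 = move right. Reward +10 on delivery, -1 per step.
N_STATES, GOAL = 5, 4
q = [[0.0, 0.0] for _ in range(N_STATES)]
alpha, gamma, epsilon = 0.5, 0.9, 0.1

for episode in range(200):
    s = 0
    while s != GOAL:
        # Epsilon-greedy: mostly exploit the best-known action, sometimes explore.
        a = random.randrange(2) if random.random() < epsilon \
            else max((0, 1), key=lambda x: q[s][x])
        s_next = max(0, s - 1) if a == 0 else min(N_STATES - 1, s + 1)
        reward = 10.0 if s_next == GOAL else -1.0
        # Q-learning update toward reward plus discounted best future value.
        q[s][a] += alpha * (reward + gamma * max(q[s_next]) - q[s][a])
        s = s_next

# The learned greedy policy should be "move right" in every state.
policy = [max((0, 1), key=lambda x: q[s][x]) for s in range(GOAL)]
print(policy)  # → [1, 1, 1, 1]
```

The per-step penalty plays the role of fuel and delay costs, and the terminal reward the successful delivery; the policy emerges purely from this feedback, which is exactly the trial-and-error learning the answer describes.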

What could be one possible risk of using this approach?


One significant risk is the potential for unforeseen or unsafe behaviors during
the learning process, especially in real-world deployment. Since RL involves
exploration and trial-and-error, the drones might initially attempt inefficient or
even dangerous routes or actions that could lead to accidents, property damage,
or injury to people. Ensuring safety during the training phase and implementing
robust safety protocols, such as simulation-based training and strict real-world
testing with human oversight, is crucial to mitigate this risk.

Part B - Case Study Reflection

Case Study: Helping in Early Disease Detection

Machine Learning (ML) holds immense potential in revolutionizing early disease detection, offering a proactive approach to healthcare. In this scenario, supervised learning would be the most suitable ML type. Specifically, classification algorithms would be employed to categorize individuals into discrete groups, such as 'diseased' or 'healthy', or to identify the presence of specific conditions based on various input features.

Examples of data that might be used include a wide array of patient information. This
could encompass demographic data (age, gender, ethnicity), medical history (pre-
existing conditions, family history of diseases), lifestyle factors (diet, exercise, smoking
habits), and crucially, diagnostic test results. The diagnostic data could range from
blood test markers, genetic sequences, imaging scans (e.g., X-rays, MRIs, CT scans), to
physiological measurements (e.g., blood pressure, heart rate). For instance, a model
could be trained on thousands of anonymized patient records, where each record
includes these features along with a confirmed diagnosis (the 'label').
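As an illustrative sketch only (the features and values are invented, and real diagnostic models are far more involved and heavily validated), a nearest-neighbour classifier over two numeric features shows the "features plus confirmed diagnosis label" framing:

```python
# Toy labeled records: (age, blood_marker) -> diagnosis label. Invented data.
records = [
    ((45, 1.2), "healthy"),
    ((50, 1.1), "healthy"),
    ((62, 3.8), "diseased"),
    ((70, 4.1), "diseased"),
]

def classify(patient):
    """1-nearest-neighbour: copy the label of the closest training record."""
    def dist(features):
        return sum((a - b) ** 2 for a, b in zip(features, patient))
    return min(records, key=lambda r: dist(r[0]))[1]

print(classify((65, 3.9)))  # → diseased
```

Each training record pairs features with a confirmed diagnosis, and the model assigns new patients to a discrete category, which is the classification setup the case study calls for.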

However, the application of ML in early disease detection raises significant ethical and
social concerns. A primary concern is data privacy and security. Medical data is highly
sensitive, and its collection, storage, and processing must adhere to stringent privacy
regulations (e.g., HIPAA, GDPR). There's also the risk of algorithmic bias. If the training
data disproportionately represents certain demographics or lacks diversity, the model
might perform poorly or inaccurately for underrepresented groups, leading to
disparities in healthcare access and outcomes. For example, a model trained primarily
on data from one ethnic group might misdiagnose or delay diagnosis for individuals
from another. Furthermore, the issue of false positives and false negatives is critical.
A false positive could lead to unnecessary anxiety, costly follow-up tests, and even
invasive procedures, while a false negative could delay crucial treatment, with
potentially life-threatening consequences. Ensuring transparency in how these models
arrive at their predictions and establishing clear accountability for their outcomes are
paramount to building trust and ensuring equitable healthcare.

Part C - Thinking About Model Evaluation

1. Why might accuracy alone not be enough to evaluate this model?

Accuracy alone might not be enough to evaluate a model predicting whether a student
will pass or fail an exam, especially if there's an imbalance in the dataset (e.g.,
significantly more students pass than fail). If 95% of students typically pass, a model
that simply predicts every student will pass would achieve 95% accuracy. While
seemingly high, this model is useless as it fails to identify any failing students.
Accuracy doesn't differentiate between the types of errors (false positives vs. false
negatives), which can have different implications. In this context, incorrectly predicting
a failing student will pass (false negative) is far more critical than incorrectly predicting
a passing student will fail (false positive), as it prevents timely intervention.
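The 95% example can be made concrete with a tiny calculation (the counts are invented): a "predict pass for everyone" model evaluated on 100 students of whom 5 actually fail:

```python
# 100 students: 95 pass (1), 5 fail (0). The model predicts "pass" for everyone.
actual    = [1] * 95 + [0] * 5
predicted = [1] * 100

accuracy = sum(a == p for a, p in zip(actual, predicted)) / len(actual)
failing_caught = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)

print(accuracy)        # → 0.95: looks strong
print(failing_caught)  # → 0: yet not a single at-risk student is flagged
```

This is why metrics that break errors down by class, such as precision and recall, are needed alongside accuracy.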

2. Between precision and recall, which would matter more in this situation? Explain your choice.

In this situation, recall would generally matter more than precision. Recall measures
the proportion of actual positive cases (students who will fail) that were correctly
identified by the model. A high recall means the model is good at catching most of the
students who are at risk of failing. The consequence of a false negative (predicting a
failing student will pass) is severe: the student might not receive the necessary support
or intervention, potentially leading to actual failure. While a low precision (many false
positives – predicting a passing student will fail) might lead to unnecessary
interventions for some students, it is less detrimental than missing a student who
genuinely needs help. The priority is to identify as many at-risk students as possible to
provide support.
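Treating "will fail" as the positive class, precision and recall follow directly from the confusion counts; the counts below are invented for illustration:

```python
# Confusion counts for the "will fail" class (invented example):
tp = 8   # students predicted to fail who did fail
fp = 12  # students flagged as at-risk who actually passed
fn = 2   # failing students the model missed

precision = tp / (tp + fp)  # of flagged students, how many really fail
recall    = tp / (tp + fn)  # of failing students, how many were caught

print(round(precision, 2))  # → 0.4: many unnecessary interventions
print(round(recall, 2))     # → 0.8: but most at-risk students are reached
```

A model with these numbers over-flags students, yet reaches 8 of the 10 who genuinely need help, which is the trade-off the answer argues for.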
3. Suggest one simple method (like cross-validation or A/B testing) to
make the model more reliable.

Cross-validation is a simple yet effective method to make the model more reliable.
Instead of training and evaluating the model on a single split of data (e.g., 80% train,
20% test), cross-validation involves partitioning the dataset into multiple subsets
(folds). The model is then trained and tested multiple times, with each fold serving as
the test set exactly once. For example, in 5-fold cross-validation, the data is divided
into five parts. The model is trained on four parts and tested on the remaining one, and
this process is repeated five times. The performance metrics (like recall) are then
averaged across all folds. This approach provides a more robust and less biased
estimate of the model's performance on unseen data, reducing the chance of
overfitting to a specific data split and giving a more reliable indication of its
generalization ability.
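The 5-fold procedure described above can be sketched as follows; `evaluate`-style scoring is stubbed out with a placeholder (here just the test-fold size) to keep the focus on the data flow, since the real pipeline's train-and-score step is not specified:

```python
def k_fold_splits(data, k=5):
    """Yield (train, test) lists, each fold serving as the test set exactly once."""
    fold_size = len(data) // k
    for i in range(k):
        test = data[i * fold_size:(i + 1) * fold_size]
        train = data[:i * fold_size] + data[(i + 1) * fold_size:]
        yield train, test

data = list(range(20))  # stand-in for 20 labeled student records
scores = []
for train, test in k_fold_splits(data, k=5):
    # Placeholder "score": in practice, fit on `train` and measure recall on `test`.
    scores.append(len(test))

print(len(scores))                # → 5: one score per fold
print(sum(scores) / len(scores))  # averaged across folds, as the text describes
```

Averaging a real metric (such as recall) across the five folds is what gives the more robust performance estimate the answer describes.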
