LDS7003M Group Presentation
LDS7003M Group Presentation
Contents
Module Details ........................................................................................................... 1
Assignment Description.............................................................................................. 2
Learning Outcomes .................................................................................................... 3
Advice and Guidance ................................................................................................. 4
How is this assessment marked? ............................................................................... 5
Marking Criteria .......................................................................................................... 6
Module Details
1
London School
Assignment Description
In this group project, you will use machine learning techniques to tackle a real-world problem. Your
primary objectives are to explore, preprocess, model, and evaluate various machine learning
algorithms, and then critically analyse their performance in the context of your chosen domain. The
goal is to predict a continuous numerical outcome based on multiple input features.
Scenario: Choose a dataset where the task involves predicting a continuous numerical outcome.
Possible scenarios include forecasting house prices, estimating sales revenue, or predicting patient
medical costs.
Dataset Selection: Select one of the following datasets for your analysis:
1. House Prices Dataset: This dataset contains 79 explanatory variables describing various
aspects of residential homes. The challenge is to predict the final price of each home.
2. Bike Rental Dataset: This dataset is aimed at predicting the total number of bike rentals (a
continuous variable) based on weather data, seasonal information, and other factors.
3. Medical Cost Personal Dataset: This dataset involves predicting the future medical costs of
patients based on their personal attributes such as age, BMI, smoking status, and more.
Task:
• Conduct exploratory data analysis (EDA) to identify trends, correlations, and outliers.
• Handle missing data and perform any necessary data cleaning.
• Perform feature engineering to create new relevant features.
• Scale and normalize the data as needed.
2. Model Development:
• Implement at least three different algorithms (e.g., Multiple Linear Regression, Support
Vector Regression, etc.,)
• Use k-fold cross-validation to assess the performance of each model.
3. Model Evaluation:
• Compare the models using appropriate regression metrics such as RMSE, MAE, and R-
squared.
• Use Recursive Feature Elimination (RFE) or another feature selection method to identify the
most important features.
• Perform hyperparameter tuning using GridSearch or another hyperparameter tuning method
to optimize the models.
2
London School
Assignment Description
• Discuss how the models can be applied in the context of your chosen dataset
• Explore potential impacts and applications of your findings in business or research scenarios.
• Analyse ethical considerations related to your dataset and predictive modelling. Address
issues such as data privacy, informed consent, and the potential consequences of model
predictions
Learning Outcomes
You must successfully achieve the following Learning Outcomes to pass this assessment:
7.1 Critically apply skills, techniques, and knowledge from a range of data analysis methods and
algorithms for enhancing and solving problems in various domains.
7.2 Develop abstract thinking and design ability to analytically demonstrate concepts relating to data
science.
7.3 Use research-based knowledge for the design of experiments, analysis, and interpretation of data
to provide valid results.
7.4 Critically evaluate and analyse advanced data science topics, and concepts, and implement them
in workplace.
7.5 Identify and implement appropriate programming and software tools to critically analyse big data
applications in workplace.
7.8 Critically analyse the data and apply predictive modelling technique in the field of Machine
Learning and Artificial Intelligence.
7.9 Critique legal, social, and ethical issues within the field of data science and applicable ancillary
sectors, as applied to contemporary research and industrial practice.
Assessment Guidance
Presentation Guidelines
• Presentation should include an introduction to the problem, a summary of your data
exploration and preprocessing steps, details of your model development and evaluation, and
a discussion of the domain and ethical implications.
• Use charts, graphs, and tables to illustrate key points and findings. Ensure that visuals are
clear and effectively communicate your results.
• Provide a thoughtful critique of your models' performance, including any limitations and
areas for improvement.
• Prepare to answer questions and engage in discussion about your methods and results.
3
London School
Deliverables
• Presentation Slides: Submit a deck that includes all necessary components of your project
in Moodle.
• Code and Documentation: Provide a link to your code repository and include
documentation on how to run your analysis.
General Guidance
General considerations
Please be aware that each step should be fully described in your assessment. Collaborate with your
group members effectively but ensure your individual understanding of the assignment and present
the individual task you are working on. Students should work together to ensure a cohesive and well-
coordinated presentation. Each individual’s contribution should complement the others, creating a
comprehensive and coherent narrative.
Additional Information
The work you present should be your own work, and not just copied from others. You can quote
from others, but you must say who the author is and use quotation marks or paraphrase. If you do
not do so, we will investigate your work for academic misconduct. This is particularly likely if your
Turnitin similarity score is above 25% and/or individual matches are above 6%.
If you require support with your study skills, please visit https://www.yorksj.ac.uk/students/study-skills/
Assessment Regulations
Please refer to the York St John University Code of Practice for Assessment and Academic Related
Matters 2024-25.
We ask that you pay particular attention to the academic misconduct policy. Penalties will be applied
where a student is found guilty of academic and/or ethical misconduct, including termination of
programme (Policy Link).
You are required to keep to the word limit set for an assessment and to note that you may be subject
to penalty if you exceed that limit. You are required to provide an accurate word count on the cover
sheet for each piece of work you submit (Policy Link).
For late or non-submission of work by the published deadline or an approved extended deadline, a
mark of 0NS will be recorded. Where a re-assessment opportunity exists, a student will normally be
permitted only one attempt to be re-assessed for a capped mark (Policy Link).
An extension to the published deadline may be granted to an individual student if they meet the
eligibility criteria of the (Policy Link).
4
London School
Furthermore, this assessment is marked using the assessment marking criteria or a similar rubric
that aligns with the University’s Generic Assessment Descriptors (see below). 1 This is to ensure all
assessment decisions are comparable regardless of the discipline or mode of assessment.
Please note that you must meet the required baseline standards (50 – 59%) which will include the
LOs and minimum expectations of the assessment. Further still, you must ensure you meet the
requirements of each grade boundary to progress to the next, i.e., you should demonstrate your
learning through the standards of the Pass, Merit and Distinction to reach a Distinction (70 – 84%).
These standards are designed to scaffold and build your learning to achieve your fullest potential in
each criterion being assessed.
1 A rubric is a type of scoring guide that markers use to set out specific components and expectations for an assignment for their students.
It is then used to guide the marking they undertake.
5
London School
Marking Criteria
Pass Grade Bands (100 – 50) (Learning Outcomes must be met)
Fail Grade Bands (49 – 0) (Learning Outcomes are not met)
Mark
Assessment Criteria Description
(100%)
Demonstrate a clear understanding of the problem domain and the relevance of predicting a continuous numerical
Problem
outcome based on input features. Articulate the significance and objectives of the analysis in the context of the chosen 10%
Comprehension
dataset.
Knowledge and Demonstrate understanding of data exploration techniques, including EDA, handling missing data, data cleaning, and
15%
Execution feature engineering. Appropriateness and effectiveness of preprocessing steps applied.
Implementation and Apply at least three different algorithms. Show an understanding of the models’ workings and provide a rationale for
25%
Variety their selection. Demonstrate proper use of k-fold cross-validation for performance assessment.
Compare models using appropriate regression metrics (RMSE, MAE, R-squared, etc). Use feature selection methods
Evaluation and
and hyperparameter tuning to optimise model performance. Critical analysis of model results and selection of the best- 25%
Optimization
performing model.
Discuss how the models can be applied in the context of the chosen dataset. Explore potential impacts and
applications in business or research scenarios and demonstrate a clear understanding of the practical implications.
Application and
10%
Ethical Impact
Analyse ethical considerations related to data privacy, security, and the responsible use of predictive models. Address
any potential biases and unintended consequences. Provide strategies to mitigate these issues.
Quality of Assess readability, grammar, structure, and completeness. Evaluate the effectiveness of communication, collaboration
15%
Presentation with the team, use of visuals, and overall presentation skills.
6
London School
7
London School
8
London School
Distinction (70 – Distinction (85 – Borderline Fail
Assessment Criteria Pass (50 – 59) Merit (60 – 69) Fail (30 - 44) Fail (0 - 29)
84) 100) (45 - 49)