PO687 Statistical Analysis Project Guide

This document outlines the requirements for an end of term statistics project. Students must: 1) Select a dataset, identify an outcome and predictor variable, and formulate working and null hypotheses. 2) Describe the two variables with appropriate univariate statistics and visualizations. 3) Create a graph to illustrate the bivariate relationship and test the hypothesis with a t-test or non-parametric equivalent. 4) Test the hypothesis with bivariate regression and interpret the results. 5) Expand the analysis by including two additional variables, generating hypotheses, and running/interpreting a multiple regression model with diagnostics. Compare the new model to the initial bivariate regression.

Uploaded by

pp3986

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views3 pages

PO687 Statistical Analysis Project Guide

Uploaded by

pp3986

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

PO687 End of term project

Dr Raluca Popp
December 7, 2020

The rationale behind the project:

• it will test all the stats skills you acquired this term - from formulating
hypotheses, to visualising relationships, running statistical analysis, pre-
senting and interpreting the results, but also data management, such as
recoding of variables, where needed;
• you have some freedom over the analysis you will run; You have to pick
one of the 3 datasets available to you, and you get two pick the variables
you will use in the analysis;
• rather than telling you exactly what methods to apply, you will need to
think about the variables you are using and which are the appropriate
statistical techniques to test the relationship(s) between the variables you
chose;
• think about it as a miniature research project, but one in which you don’t
need a theory and literature review part. Treat this as practice for your
dissertation next year (if you choose to write one).
Your seminar leaders will not show you how to run analysis on the three
dataset for the final project. Statistical analysis is run the same way, following
the same principles. If you learn which functions to run when, you will then be
able to apply them to any dataset.

A word on R code:
• It is not mandatory to add your R code to the assignment, but it is
recommended. It does not count towards the word limit (which is not
strict, anyway) and you will be not marked on it. However, it helps us
when marking the assignment.
• If you produce your document in Word, then you can add the code at the
end of the assignment.
• If you produce the assignment using RMarkdown, then you don’t need to
include the code at the end, as it is part of the document.

1
Formulate hypotheses
1. Pick a dataset among gss, nes and world. Inspect it, have a look at the
variables it contains and at the codebook. Select an outcome and a predictor
variable. These will be the central elements of your assignment. Remember
that the outcome variable needs to be interval, ratio or high-level ordinal - what
we call a continuous variable. Feel free to recode variables where you need to.
Formulate the working and the null hypotheses. (15 points)

Univariate statistics and visualisations

2. Describe the two variables. Create appropriate visualisations for each
variable, accompanied by the appropriate descriptive statistics (hint: it all de-
pends on the level of measurement). (15 points)

Visualise a bivariate relationship

3. Thinking about the type of variable you selected, create a graph that will
illustrate the relationship between your dependent and independent variables.
Remember that visualisations have to be nice to look at, represent the data
truthfully, be clear and informative. In other words, do not forget to add titles,
labels and so on. (15 points)

Hypothesis testing with a t-test or a non-parametric test

4. Test the hypothesis you formulated in Step 1 using a t-test or a non-
parametric test, depending on which one is appropriate (hint: remember it
depends on whether the variable is normally distributed or not). Report the
test statistics, and its associated p-value. Use the .05 cut off point for statistical
significance and interpret the results. (15 points)

Bivariate regression
5. Test the hypothesis you formulated in Step 1 using a regression model.
Present the regression results in a table and interpret them. Use the .05 cut off
point for statistical significance. (15 points)

Multiple regression
6. Expand on the relationship you tested above, by choosing another two
variables that could improve your model. Feel free to recode variables.
6a. Create hypotheses for each new variable (and your outcome variable).
(5 points)
6b. Present univariate analysis on the new variables (descriptive statistics
and visualisations). (5 points)
6c. Run a regression model that includes the new variables. Present the
regression results in a table and interpret them. Use the .05 cut off point for
statistical significance. Run regression diagnostics for your model and discuss
whether your model respects OLS assumptions. If it violates any assumptions,
you need to indicate how you would fix the issue. You don’t need to re-run the
model. (10 points)

2
6d. Compare the new regression model to the model from Step 5, using
the appropriate statistical test. Report the results and interpret them. Is the
second regression model more informative? (5 points)

ProjectInstructions GradeRubric
No ratings yet
ProjectInstructions GradeRubric
3 pages
Probability and Statistics Project Guide
No ratings yet
Probability and Statistics Project Guide
2 pages
Heart Disease Data Analysis with R
No ratings yet
Heart Disease Data Analysis with R
19 pages
Research Proposal and Exam Overview
No ratings yet
Research Proposal and Exam Overview
25 pages
Advanced Statistics Research Paper Guide
No ratings yet
Advanced Statistics Research Paper Guide
4 pages
Bivariate Analysis: Regression & Correlation
No ratings yet
Bivariate Analysis: Regression & Correlation
20 pages
Employment Data Analysis Assignment Guide
No ratings yet
Employment Data Analysis Assignment Guide
5 pages
Data Analytics & Visualization Exam 2021
No ratings yet
Data Analytics & Visualization Exam 2021
6 pages
Advanced Quantitative Methods Overview
No ratings yet
Advanced Quantitative Methods Overview
12 pages
SPSS Statistical Tests Guide
No ratings yet
SPSS Statistical Tests Guide
18 pages
Assignment STAT5002
No ratings yet
Assignment STAT5002
5 pages
Advanced Statistical Methods in R
No ratings yet
Advanced Statistical Methods in R
31 pages
Research Methodology Syllabus Overview
No ratings yet
Research Methodology Syllabus Overview
6 pages
Psychology Postgraduate Practice Test
No ratings yet
Psychology Postgraduate Practice Test
5 pages
Statistics and Econometrics Exam 2021
No ratings yet
Statistics and Econometrics Exam 2021
8 pages
Statistical Tools and R for Research
No ratings yet
Statistical Tools and R for Research
6 pages
Assignment 2 2020
No ratings yet
Assignment 2 2020
6 pages
Regression Analysis and Confidence Intervals
No ratings yet
Regression Analysis and Confidence Intervals
8 pages
Statistical Testing and Modelling in R
No ratings yet
Statistical Testing and Modelling in R
21 pages
Data Analysis of Student Performance
No ratings yet
Data Analysis of Student Performance
27 pages
Advanced Data Analysis Project Guidelines
No ratings yet
Advanced Data Analysis Project Guidelines
3 pages
STATS 101A: Data Analysis Overview
No ratings yet
STATS 101A: Data Analysis Overview
25 pages
Statistical Modeling with R: Key Techniques
No ratings yet
Statistical Modeling with R: Key Techniques
9 pages
Year 9 Data Project Guide: Linear Relationships
No ratings yet
Year 9 Data Project Guide: Linear Relationships
6 pages
Econometrics: Modelling Student Performance
No ratings yet
Econometrics: Modelling Student Performance
51 pages
Inferential Statistics Project Guidelines
No ratings yet
Inferential Statistics Project Guidelines
4 pages
Z-Test Implementation in R Programming
No ratings yet
Z-Test Implementation in R Programming
21 pages
OLS Regression Analysis and Predictions
No ratings yet
OLS Regression Analysis and Predictions
19 pages
Linear Regression Assumptions and Analysis
No ratings yet
Linear Regression Assumptions and Analysis
13 pages
Econometrics Assignment Assessment Brief
No ratings yet
Econometrics Assignment Assessment Brief
5 pages
Linear Regression Project Guidelines
No ratings yet
Linear Regression Project Guidelines
1 page
Data Presentation and Analysis Guide
No ratings yet
Data Presentation and Analysis Guide
5 pages
Statistical Report Writing Guidelines
No ratings yet
Statistical Report Writing Guidelines
6 pages
Predicting Carer Distress by Age
No ratings yet
Predicting Carer Distress by Age
20 pages
Statistical Methods and Assumptions Guide
No ratings yet
Statistical Methods and Assumptions Guide
9 pages
Winter 2024 Collaborative Learning Project
No ratings yet
Winter 2024 Collaborative Learning Project
8 pages
Bivariate Data Analysis Project Guide
No ratings yet
Bivariate Data Analysis Project Guide
4 pages
R Statistical Analysis Experiments Guide
No ratings yet
R Statistical Analysis Experiments Guide
5 pages
Data Analytics Course Overview
No ratings yet
Data Analytics Course Overview
6 pages
Statistical Analysis Techniques Course
No ratings yet
Statistical Analysis Techniques Course
3 pages
Project Working Document Review
No ratings yet
Project Working Document Review
11 pages
Collaborative Statistics Project Guide
No ratings yet
Collaborative Statistics Project Guide
3 pages
Stata 18 Econometrics Assignment Guide
No ratings yet
Stata 18 Econometrics Assignment Guide
4 pages
Project Guidelines for Statistics Course
No ratings yet
Project Guidelines for Statistics Course
5 pages
R Studio Data Analysis Guide
No ratings yet
R Studio Data Analysis Guide
7 pages
Correlation Analysis Project Guide
No ratings yet
Correlation Analysis Project Guide
4 pages
Statistical Inference with R Course
No ratings yet
Statistical Inference with R Course
5 pages
Project Guide
No ratings yet
Project Guide
2 pages
City Planning Considerations in Statistics
No ratings yet
City Planning Considerations in Statistics
8 pages
MAT 152 Signature Assignment Guide
No ratings yet
MAT 152 Signature Assignment Guide
4 pages
Data Processing in Research Methodology
No ratings yet
Data Processing in Research Methodology
28 pages
Statistical Analysis of Student Performance
No ratings yet
Statistical Analysis of Student Performance
3 pages
R Inbuilt Functions and Statistical Tests
No ratings yet
R Inbuilt Functions and Statistical Tests
6 pages
Assumptions in Multivariate Analysis
No ratings yet
Assumptions in Multivariate Analysis
20 pages
Linear Regression Exam Review Guide
No ratings yet
Linear Regression Exam Review Guide
4 pages
Probability and Statistics Course Overview
No ratings yet
Probability and Statistics Course Overview
13 pages
Resource Planning in Highway Projects
No ratings yet
Resource Planning in Highway Projects
5 pages
AG1165 Ritesh Bhagat
No ratings yet
AG1165 Ritesh Bhagat
228 pages
Urban Flood Modeling with GIS & HEC-HMS
No ratings yet
Urban Flood Modeling with GIS & HEC-HMS
17 pages
SmartPLS Analysis of Consumer Trust
No ratings yet
SmartPLS Analysis of Consumer Trust
186 pages
Real-Time Sign Language Recognition Review
No ratings yet
Real-Time Sign Language Recognition Review
25 pages
Diabetes Epidemic and Comorbidities in India
No ratings yet
Diabetes Epidemic and Comorbidities in India
9 pages
Plate Load Test Certification Report
No ratings yet
Plate Load Test Certification Report
1 page
Safety Measures in Construction and Dams
No ratings yet
Safety Measures in Construction and Dams
34 pages
Intro to InfraWorks 360 for Civil Design
No ratings yet
Intro to InfraWorks 360 for Civil Design
14 pages
Dam Rehabilitation and Improvement Overview
No ratings yet
Dam Rehabilitation and Improvement Overview
12 pages
Precast Air-Lane Analysis of Military Air Base Using Ansys
No ratings yet
Precast Air-Lane Analysis of Military Air Base Using Ansys
5 pages
Research Article Analytical Study On The Structure Behaviour of Regular and Irregular Space Frame by Staad - Pro V8I
No ratings yet
Research Article Analytical Study On The Structure Behaviour of Regular and Irregular Space Frame by Staad - Pro V8I
6 pages
Labor Productivity in Construction Analysis
No ratings yet
Labor Productivity in Construction Analysis
10 pages
Management Theory Assignment Guidelines
No ratings yet
Management Theory Assignment Guidelines
2 pages
Eggshell Powder as Cement Replacement
50% (2)
Eggshell Powder as Cement Replacement
12 pages
CFD Analysis of Box Vans in ANSYS
No ratings yet
CFD Analysis of Box Vans in ANSYS
16 pages
Comparative Study and Analysis of Unstiffened and Stiffened Plate With and Without Opening
100% (1)
Comparative Study and Analysis of Unstiffened and Stiffened Plate With and Without Opening
4 pages
Computer-Aided Foot Over Bridge Design
No ratings yet
Computer-Aided Foot Over Bridge Design
6 pages
Contract of Guarantee: Sec 126-147 Overview
No ratings yet
Contract of Guarantee: Sec 126-147 Overview
9 pages
Essentials of HRM Assignment Updated 1
No ratings yet
Essentials of HRM Assignment Updated 1
10 pages
Growth Strategies and Market Concentration Analysis
No ratings yet
Growth Strategies and Market Concentration Analysis
8 pages
Critique of Jejemon Language Research
No ratings yet
Critique of Jejemon Language Research
5 pages
Pandangan Ibu Bapa Dalam Pemberian Vaksin PICKid Pandangan Dari Sudut
No ratings yet
Pandangan Ibu Bapa Dalam Pemberian Vaksin PICKid Pandangan Dari Sudut
613 pages
Community Engagement in HWC Mitigation
No ratings yet
Community Engagement in HWC Mitigation
64 pages
ENG 102 Syllabus Spring 2026
No ratings yet
ENG 102 Syllabus Spring 2026
8 pages
M60A3 Tank Procedure Guides Overview
100% (2)
M60A3 Tank Procedure Guides Overview
125 pages
Comparative Politics: Methods and Challenges
No ratings yet
Comparative Politics: Methods and Challenges
7 pages
Optimizing Pistachio Oil Extraction Process
No ratings yet
Optimizing Pistachio Oil Extraction Process
12 pages
TRA 450 Research Methodology Outline
No ratings yet
TRA 450 Research Methodology Outline
2 pages
Panel Data's Revival in Retail Research
No ratings yet
Panel Data's Revival in Retail Research
10 pages
Project Risk Management Exam Guide
No ratings yet
Project Risk Management Exam Guide
15 pages
Product Management Expertise in Healthcare
No ratings yet
Product Management Expertise in Healthcare
10 pages
TLE 6 Week 6 Lesson Plan
100% (1)
TLE 6 Week 6 Lesson Plan
3 pages
Mechanical Engineering Student CV
No ratings yet
Mechanical Engineering Student CV
2 pages
Mediation Analysis with Stata's med4way
No ratings yet
Mediation Analysis with Stata's med4way
28 pages
Logic vs. Critical Thinking Explained
0% (1)
Logic vs. Critical Thinking Explained
4 pages
Emotional Factors in Children's Learning
No ratings yet
Emotional Factors in Children's Learning
8 pages
Optimizing Kochi Metro Ticketing Operations
No ratings yet
Optimizing Kochi Metro Ticketing Operations
15 pages
Linear Measurements To Determine Working Length of Curved Canals With Fine Files: Conventional Versus Digital Radiography
No ratings yet
Linear Measurements To Determine Working Length of Curved Canals With Fine Files: Conventional Versus Digital Radiography
6 pages
Strategic Competency Mapping in Talent Management
No ratings yet
Strategic Competency Mapping in Talent Management
20 pages
PUBLIC SECTOR - AN EMPLOYER of Choice
No ratings yet
PUBLIC SECTOR - AN EMPLOYER of Choice
31 pages
Project Work Book for AI & Data Science
No ratings yet
Project Work Book for AI & Data Science
76 pages
MedTech Internship Performance Study
No ratings yet
MedTech Internship Performance Study
6 pages
Meningkatkan Produksi ASI Pasca Melahirkan
No ratings yet
Meningkatkan Produksi ASI Pasca Melahirkan
9 pages
Understanding Research: Key Concepts
No ratings yet
Understanding Research: Key Concepts
10 pages
Critical Analysis of The Kite Runner
No ratings yet
Critical Analysis of The Kite Runner
3 pages
Investment Banking Audit: 100 Challenges & Solutions
No ratings yet
Investment Banking Audit: 100 Challenges & Solutions
14 pages
Avishkar Competition 2025-26 Guidelines
No ratings yet
Avishkar Competition 2025-26 Guidelines
6 pages
Working Scientifically Skills: Self-Evaluation: Planning Investigations
No ratings yet
Working Scientifically Skills: Self-Evaluation: Planning Investigations
4 pages
Work-Life Balance and Employee Retention
100% (1)
Work-Life Balance and Employee Retention
18 pages
Citation Masaiti, G. (2018) - Education in Zambia at Fifty Years
No ratings yet
Citation Masaiti, G. (2018) - Education in Zambia at Fifty Years
32 pages

PO687 Statistical Analysis Project Guide

Uploaded by

PO687 Statistical Analysis Project Guide

Uploaded by

PO687 End of term project

The rationale behind the project:

Univariate statistics and visualisations

Visualise a bivariate relationship

Hypothesis testing with a t-test or a non-parametric test

You might also like