
Assignment No 03

Name Sadiqua Iqbal

Roll no 19011513-029

Course Title Non-parametric

Submitted to Dr. Muqaddas Javeid

Department of Statistics

Hafiz Hayat Campus

Tests to detect normality:


The Kolmogorov–Smirnov test and the Shapiro–Wilk test are the most widely
used methods for testing the normality of data.

Shapiro–Wilk test:

The Shapiro-Wilk test is commonly used for small samples to determine whether
or not a sample fits a normal distribution.

The Kolmogorov–Smirnov test, in contrast, is a well-known test used for larger samples
(n > 1000).

First, we discuss the Kolmogorov–Smirnov test.

Definition:
The Kolmogorov–Smirnov test is a non-parametric test that
compares two probability distributions to determine if they are different. It is
used to test whether a sample comes from a specific distribution.

OR

In statistics,

the Kolmogorov–Smirnov test (K–S test or KS test) is a nonparametric


test of the equality of continuous (or discontinuous), one-dimensional probability
distributions that can be used to compare a sample with a reference probability
distribution (one-sample K–S test), or to compare two samples (two-sample K–S
test). 
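For illustration, here is a minimal sketch of the one-sample and two-sample forms in Python (assuming numpy and scipy are available; the data and variable names are only illustrative, not from this assignment):

# One-sample and two-sample Kolmogorov-Smirnov tests (illustrative sketch).
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
sample = rng.normal(loc=50, scale=5, size=100)    # illustrative sample
other = rng.uniform(low=40, high=60, size=100)    # a second, different sample

# One-sample K-S test: compare the sample with a reference N(50, 5) distribution.
d1, p1 = stats.kstest(sample, "norm", args=(50, 5))

# Two-sample K-S test: compare the two samples with each other.
d2, p2 = stats.ks_2samp(sample, other)

print(d1, p1)
print(d2, p2)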

ADVANTAGES OF KOLMOGOROV-SMIRNOV TEST:


Kolmogorov-Smirnov tests have the advantages that

(a) the distribution of the statistic does not depend on the cumulative distribution function being tested, and

(b) the test is exact.

DISADVANTAGES OF KOLMOGOROV-SMIRNOV TEST:


They have the disadvantage that they are more sensitive to deviations near the centre of the distribution
than at the tails.
Numerical No. 01
Given data:

Achievement Motivation
49
49
49
50
53
53
53
54
54
54
55
56
56
56
57
58
58
58
59
60
61
61
61
61
61
63
64
64
64
65

Solution:

Achievement Motivation    Sample size (N)    Mean    SD
49 30 57.2 4.787915823
49 30 57.2 4.787915823
49 30 57.2 4.787915823
50 30 57.2 4.787915823
53 30 57.2 4.787915823
53 30 57.2 4.787915823
53 30 57.2 4.787915823
54 30 57.2 4.787915823
54 30 57.2 4.787915823
54 30 57.2 4.787915823
55 30 57.2 4.787915823
56 30 57.2 4.787915823
56 30 57.2 4.787915823
56 30 57.2 4.787915823
57 30 57.2 4.787915823
58 30 57.2 4.787915823
58 30 57.2 4.787915823
58 30 57.2 4.787915823
59 30 57.2 4.787915823
60 30 57.2 4.787915823
61 30 57.2 4.787915823
61 30 57.2 4.787915823
61 30 57.2 4.787915823
61 30 57.2 4.787915823
61 30 57.2 4.787915823
63 30 57.2 4.787915823
64 30 57.2 4.787915823
64 30 57.2 4.787915823
64 30 57.2 4.787915823
65 30 57.2 4.787915823

Next part,

Rank    G = (Rank - 1)/n    Normal distribution F(x)    Difference (F - G)

1       0               0.043388936     0.043388936
2       0.033333333     0.043388936     0.010055603
3       0.066666667     0.043388936     -0.02327773
4       0.1             0.06631826      -0.03368174
5       0.133333333     0.190186726     0.056853393
6       0.166666667     0.190186726     0.023520059
7       0.2             0.190186726     -0.009813274
8       0.233333333     0.251955338     0.018622005
9       0.266666667     0.251955338     -0.014711329
10      0.3             0.251955338     -0.048044662
11      0.333333333     0.322941124     -0.010392209
12      0.366666667     0.401049717     0.03438305
13      0.4             0.401049717     0.001049717
14      0.433333333     0.401049717     -0.032283617
15      0.466666667     0.483340296     0.01667363
16      0.5             0.566349327     0.066349327
17      0.533333333     0.566349327     0.033015993
18      0.566666667     0.566349327     -0.00031734
19      0.6             0.64652165      0.04652165
20      0.633333333     0.720660782     0.087327448
21      0.666666667     0.786304686     0.119638019
22      0.7             0.786304686     0.086304686
23      0.733333333     0.786304686     0.052971352
24      0.766666667     0.786304686     0.019638019
25      0.8             0.786304686     -0.013695314
26      0.833333333     0.887125681     0.053792348
27      0.866666667     0.922231406     0.055564739
28      0.9             0.922231406     0.022231406
29      0.933333333     0.922231406     -0.011101927
30      0.966666667     0.948354214     -0.018312452

Here n = 30 and F(x) is the normal CDF evaluated at each ordered value using the sample mean and SD.
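The worksheet above can be reproduced with a short script. This is a minimal sketch (assuming Python with numpy and scipy; the variable names are only illustrative): it sorts the data, builds G = (Rank - 1)/n, evaluates the fitted normal CDF F, and takes the largest difference as the K-S statistic.

# Reproducing the K-S worksheet for the achievement motivation data (sketch).
import numpy as np
from scipy import stats

x = np.sort(np.array([49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
                      55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
                      61, 61, 61, 61, 61, 63, 64, 64, 64, 65], dtype=float))
n = len(x)
mean, sd = x.mean(), x.std(ddof=1)            # 57.2 and about 4.7879

rank = np.arange(1, n + 1)
G = (rank - 1) / n                            # empirical CDF step just below each point
F = stats.norm.cdf(x, loc=mean, scale=sd)     # fitted normal CDF at each value

D = (F - G).max()                             # largest difference, about 0.1196

# Decision at alpha = 0.05: the tabulated critical value used below is 0.242 for n = 30
# (the common large-sample approximation 1.36 / sqrt(n) gives about 0.248).
print(D, D < 0.242)                           # True -> do not reject normality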

Next,
Achievement motivation

Mean 57.2
Standard error 0.87414983
Median 57.5
Mode 61
Standard deviation 4.78791582
Sample Variance 22.9241379
Kurtosis -0.924434
Skewness -0.1505046
Range 16
Minimum 49
Maximum 65
Sum 1716
Count 30

Note:

Values of skewness and kurtosis between -2 and +2 are considered
acceptable for assuming a normal distribution.
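The descriptive statistics above, including the skewness and kurtosis check, can be reproduced in a few lines. The sketch below assumes Python with numpy and scipy; the +/-2 cut-off is the rule of thumb from the note.

# Descriptive statistics and the +/-2 skewness/kurtosis rule of thumb (sketch).
import numpy as np
from scipy import stats

x = np.array([49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
              55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
              61, 61, 61, 61, 61, 63, 64, 64, 64, 65], dtype=float)

print("mean:", x.mean())                            # 57.2
print("standard error:", stats.sem(x))              # about 0.874
print("median:", np.median(x))                      # 57.5
print("sample variance:", x.var(ddof=1))            # about 22.92
skew = stats.skew(x, bias=False)                    # about -0.15
kurt = stats.kurtosis(x, bias=False)                # excess kurtosis, about -0.92
print("skewness:", skew, "kurtosis:", kurt)

# Rule of thumb from the note: values between -2 and +2 are acceptable for assuming normality.
print("within +/-2:", abs(skew) < 2 and abs(kurt) < 2)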

Value of KS:
KS= 0.119638019

Null & Alternative Hypothesis :

Ho: Data is normal.

H1: Data is not normal

Now,

Check on SPSS:

Method:
Analyze

Explore
Then check normality.

Tests of Normality

                     Kolmogorov-Smirnov(a)              Shapiro-Wilk
                     Statistic    df    Sig.            Statistic    df    Sig.
Achievemotivation    .123         30    .200*           .948         30    .146

Normality :
The degree to which the sample data distribution corresponds to a
normal distribution (In graphical form, the normal distribution appears as
symmetrical and bell-shaped).

Descriptives
Statistic Std. Error
Achievemotivation    Mean                                  57.1000    .88454
                     95% Confidence Interval for Mean
                         Lower Bound                       55.2909
                         Upper Bound                       58.9091
5% Trimmed Mean 57.1296
Median 57.5000
Variance 23.472
Std. Deviation 4.84483
Minimum 49.00
Maximum 65.00
Range 16.00
Interquartile Range 8.00
Skewness -.102 .427
Kurtosis -1.025 .833

Histogram:
Graphical display of the distribution of a variable. By forming
frequency counts in categories, the shape of the variable’s distribution can be
shown. Used to make a visual comparison to the normal distribution…
QQ-plot:
A visual method for identifying whether two sets of data are drawn from
the same distribution. The QQ-plot shows a reference line at a 45-degree angle; if
the two data sets are drawn from the same distribution, the points will fall on that
line.
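Both plots can be produced directly from the data; the sketch below assumes Python with matplotlib and scipy (scipy.stats.probplot draws the Q-Q points together with the reference line).

# Histogram and normal Q-Q plot for the achievement motivation data (sketch).
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats

x = np.array([49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
              55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
              61, 61, 61, 61, 61, 63, 64, 64, 64, 65], dtype=float)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))

# Histogram: frequency counts in categories, for a visual comparison with a bell shape.
ax1.hist(x, bins=6, edgecolor="black")
ax1.set_title("Histogram of Achievement Motivation")

# Q-Q plot: sample quantiles against normal quantiles, with a 45-degree reference line.
stats.probplot(x, dist="norm", plot=ax2)
ax2.set_title("Normal Q-Q plot")

plt.tight_layout()
plt.show()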
Conclusion:
Our sample size is 30 and we test at the 0.05 level of significance.

So our critical value is 0.242.

Our KS test statistic is 0.11964,

which is less than the critical value, so we do not reject the null hypothesis:
there is no significant difference between the sample distribution and the normal distribution.

__________________________
(SHAPIRO-WILK TEST)

Data scientists usually have to check if data is normally distributed. An


example is the normality check on the residuals of  linear regression in order to
correctly use the F-test. One way to do that is through the Shapiro-Wilk test,
which is a hypothesis test applied to a sample with a null hypothesis that the
sample stems from a normal distribution .

Definition:
Shapiro-Wilk test is a hypothesis test that evaluates whether a data
set is normally distributed. It evaluates data from a sample with the null
hypothesis that the data set is normally distributed. A large p-value indicates that the
data set is normally distributed, while a low p-value indicates that it isn't normally
distributed.

Another definition:
The Shapiro-Wilk test is a hypothesis test that is applied to a
sample with a null hypothesis that the sample has been generated from a normal
distribution. If the p-value is low, we can reject such a null hypothesis and say
that the sample has not been generated from a normal distribution .

It’s an easy-to-use statistical tool that can help us find an answer to the
normality check we need, but it has one flaw: It doesn’t work well with large data
sets. The maximum allowed size for a data set depends on the implementation, but
in  Python , we see that a sample size larger than 5,000 will give us an approximate
calculation for the p-value
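In Python this check is essentially a one-liner; the sketch below (assuming scipy and numpy are installed, with illustrative data) shows the call and the p-value interpretation described above.

# Shapiro-Wilk test in Python (illustrative sketch).
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
sample = rng.normal(loc=0, scale=1, size=200)   # illustrative, normally distributed data

w, p = stats.shapiro(sample)
print("W =", w, "p-value =", p)

# Large p-value (> 0.05): no evidence against normality; small p-value: reject normality.
# For samples larger than about 5,000, scipy's p-value is only approximate.
if p > 0.05:
    print("Fail to reject H0: the sample may come from a normal distribution")
else:
    print("Reject H0: the sample is not normally distributed")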

Advantages of the Shapiro-Wilk Test:


1: The Shapiro-Wilk test for normality is a very simple-to-use statistical tool
for assessing the normality of a data set.

2: It is typically applied after visualizing the data set via a
histogram and/or a Q-Q plot.

3: It’s a very useful tool to ensure that a normality requirement is satisfied every
time we need it, and it must be present in every data scientist’s toolbox.

Assumption of Shapiro-Wilk Test:


1: If the Sig. value of the Shapiro-Wilk test is greater than 0.05, the data is normal.

Note:
If the significance value of the Shapiro-Wilk test is greater than 0.05,
the data is normal…
Null & Alternative Hypothesis:
Ho: The sample comes from a normal distribution.

H1: The sample does not come from a normal distribution.

Level of Significance:
α = 0.05

Test Statistic:
W = (Σ ai x(i))² / Σ (xi − x̄)², where the x(i) are the ordered sample values and the
coefficients ai are taken from the Shapiro-Wilk tables.

Given data:
Achievement Motivation-(xi)
49
49
49
50
53
53
53
54
54
54
55
56
56
56
57
58
58
58
59
60
61
61
61
61
61
63
64
64
64
65

Next,

Achievement Motivation (xi)    x̄       (xi − x̄)²    ai        (ai)(xi)

49 57.2 67.24 0.4254 20.8446
49 2401 0.2944 14.4256
49 2401 0.2487 12.1863
50 2500 0.2148 10.74
53 2809 0.187 9.911
53 2809 0.163 8.639
53 2809 0.1415 7.4995
54 2916 0.1219 6.5826
54 2916 0.1036 5.5944
54 2916 0.0862 4.6548
55 3025 0.0697 3.8335
56 3136 0.0537 3.0072
56 3136 0.0381 2.1336
56 3136 0.0227 1.2712
57 3249 0.0076 0.4332
58 3364 -0.4254 -24.6732
58 3364 -0.2944 -17.0752
58 3364 -0.2487 -14.4246
59 3481 -0.2148 -12.6732
60 3600 -0.187 -11.22
61 3721 -0.163 -9.943
61 3721 -0.1415 -8.6315
61 3721 -0.1219 -7.4359
61 3721 -0.1036 -6.3196
61 3721 -0.0862 -5.2582
63 3969 -0.0697 -4.3911
64 4096 -0.0537 -3.4368
64 4096 -0.0381 -2.4384
64 4096 -0.0227 -1.4528
65 4225 -0.0076 -0.494
96486.24 -18.111

Next,

W (numerator) = (−18.111)^2

W (denominator) = 96486.24

W = 18.112 / 96486.24 = 0.18771589

p-value= 0.927

Conclusion:

Since the p-value > 0.05, on the basis of the provided sample the researcher accepts the
null hypothesis that the sample comes from a normal distribution.
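As a cross-check on this example, the sketch below (assuming Python with numpy and scipy; the ai coefficients are the n = 30 table values listed above) computes W with the commonly used pairwise form of the numerator, b = Σ ai (x(n+1−i) − x(i)), and also calls scipy.stats.shapiro on the same data. Both results can be compared with the SPSS output reported earlier.

# Cross-checking the Shapiro-Wilk statistic for the achievement motivation data (sketch).
import numpy as np
from scipy import stats

x = np.sort(np.array([49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
                      55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
                      61, 61, 61, 61, 61, 63, 64, 64, 64, 65], dtype=float))
n = len(x)

# Tabulated coefficients a1..a15 for n = 30, as listed in the worksheet above.
a = np.array([0.4254, 0.2944, 0.2487, 0.2148, 0.1870, 0.1630, 0.1415, 0.1219,
              0.1036, 0.0862, 0.0697, 0.0537, 0.0381, 0.0227, 0.0076])

# Pairwise form of the numerator: b = sum of a_i * (x_(n+1-i) - x_(i)) over the first n/2 pairs.
b = np.sum(a * (x[::-1][:n // 2] - x[:n // 2]))
W_manual = b**2 / np.sum((x - x.mean())**2)

# Library cross-check.
W_scipy, p_scipy = stats.shapiro(x)

print("manual W:", W_manual)                 # close to the SPSS value reported earlier (.948)
print("scipy W:", W_scipy, "p-value:", p_scipy)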

________________________________________________________
