Assignment No 3 (Repaired)
Roll no 19011513-029
Department of statistics
Shapiro–Wilk test:
The Shapiro-Wilk test is commonly used for small samples to determine whether
or not a sample fits a normal distribution.
By contrast, the Kolmogorov-Smirnov test is a well-known test used for larger
samples, when n > 1000.
Now,
Definition:
The Kolmogorov-Smirnov test is a nonparametric test that
compares two probability distributions to determine whether they differ. It is
used to test whether a sample comes from a specific distribution.
OR
In statistics,
(a) The distribution of the test statistic does not depend on the cumulative distribution function being tested.
(b) The test is exact.
Achievement Motivation
49
49
49
50
53
53
53
54
54
54
55
56
56
56
57
58
58
58
59
60
61
61
61
61
61
63
64
64
64
65
Solution:
Next part,
Next,
Achievement motivation
Mean 57.2
Standard error 0.87414983
Median 57.5
Mode 61
Standard deviation 4.78791582
Sample Variance 22.9241379
Kurtosis -0.924434
Skewness -0.1505046
Range 16
Minimum 49
Maximum 65
Sum 1716
Count 30
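The figures in this table can be reproduced with Python's standard library. This is a minimal sketch: the skewness and kurtosis formulas are the bias-adjusted sample versions commonly used by spreadsheet tools, which is an assumption about how the table was produced.

```python
import math
import statistics

# Achievement-motivation scores as listed in the assignment (n = 30).
scores = [49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
          55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
          61, 61, 61, 61, 61, 63, 64, 64, 64, 65]

n = len(scores)
mean = statistics.mean(scores)
sd = statistics.stdev(scores)            # sample standard deviation (divisor n - 1)
variance = statistics.variance(scores)   # sample variance (divisor n - 1)
std_error = sd / math.sqrt(n)

# Bias-adjusted sample skewness and excess kurtosis (assumed formulas).
z = [(x - mean) / sd for x in scores]
skewness = n / ((n - 1) * (n - 2)) * sum(v ** 3 for v in z)
kurtosis = (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum(v ** 4 for v in z)
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))

summary = {
    "Mean": mean,
    "Standard error": std_error,
    "Median": statistics.median(scores),
    "Mode": statistics.mode(scores),
    "Standard deviation": sd,
    "Sample variance": variance,
    "Kurtosis": kurtosis,
    "Skewness": skewness,
    "Range": max(scores) - min(scores),
    "Sum": sum(scores),
    "Count": n,
}
for name, value in summary.items():
    print(f"{name:20s}{value}")
```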
Note:
Values of skewness and kurtosis between -2 and +2 are considered
acceptable for treating the data as normally distributed.
Value of KS:
KS= 0.119638019
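The KS value is the largest vertical gap between the empirical distribution of the sample and the normal curve. The sketch below assumes the normal curve uses the sample mean and sample standard deviation; the function name `ks_statistic_normal` is illustrative only.

```python
import statistics
from statistics import NormalDist

scores = [49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
          55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
          61, 61, 61, 61, 61, 63, 64, 64, 64, 65]


def ks_statistic_normal(data):
    """Largest gap between the empirical CDF of `data` and a normal CDF
    whose mean and SD are estimated from the same data."""
    xs = sorted(data)
    n = len(xs)
    fitted = NormalDist(statistics.mean(xs), statistics.stdev(xs))
    d = 0.0
    for i, x in enumerate(xs, start=1):
        f = fitted.cdf(x)
        # The empirical CDF jumps at each observation, so the gap is
        # checked just after the step (i / n) and just before it ((i - 1) / n).
        d = max(d, i / n - f, f - (i - 1) / n)
    return d


ks = ks_statistic_normal(scores)
print(f"KS = {ks:.6f}")
```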
Now,
Check on SPSS:
Method:
Analyze
Descriptive Statistics
Explore
Then, under Plots, tick "Normality plots with tests"
Tests of Normality
                     Kolmogorov-Smirnov(a)        Shapiro-Wilk
                     Statistic   df   Sig.        Statistic   df   Sig.
Achievemotivation    .123        30   .200*       .948        30   .146
Normality :
The degree to which the sample data distribution corresponds to a
normal distribution (In graphical form, the normal distribution appears as
symmetrical and bell-shaped).
Descriptives
Achievemotivation                                Statistic   Std. Error
Mean                                             57.1000     .88454
95% Confidence Interval for Mean   Lower Bound   55.2909
                                   Upper Bound   58.9091
5% Trimmed Mean                                  57.1296
Median                                           57.5000
Variance                                         23.472
Std. Deviation                                   4.84483
Minimum                                          49.00
Maximum                                          65.00
Range                                            16.00
Interquartile Range                              8.00
Skewness                                         -.102       .427
Kurtosis                                         -1.025      .833
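The standard error and confidence bounds in this output can be checked by hand. This is a small sketch using the SPSS figures as reported; the critical value 2.04523 (t distribution, 29 degrees of freedom, two-sided 95%) comes from a standard t table and is not part of the SPSS output.

```python
import math

# Values copied from the SPSS Descriptives output.
n = 30
mean = 57.1000
std_deviation = 4.84483

# Standard error of the mean = SD / sqrt(n).
std_error = std_deviation / math.sqrt(n)

# Two-sided 95% critical value of t with n - 1 = 29 degrees of freedom
# (taken from a t table; an assumption, not shown in the output).
t_crit = 2.04523

lower = mean - t_crit * std_error
upper = mean + t_crit * std_error
print(f"Std. Error  = {std_error:.5f}")
print(f"Lower Bound = {lower:.4f}")
print(f"Upper Bound = {upper:.4f}")
```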
Histogram:
A graphical display of the distribution of a variable. By forming
frequency counts in categories, it shows the shape of the variable's
distribution. It is used to make a visual comparison with the normal distribution.
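The frequency counts behind a histogram can be sketched in a few lines. The category width of 3 is a hypothetical choice for illustration; SPSS picks its own intervals.

```python
from collections import Counter

scores = [49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
          55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
          61, 61, 61, 61, 61, 63, 64, 64, 64, 65]

BIN_WIDTH = 3          # hypothetical category width
start = min(scores)    # first category starts at the smallest score

# Map every score to the index of its category and count per category.
counts = Counter((x - start) // BIN_WIDTH for x in scores)

# Print a text histogram: one row per category, one '#' per observation.
for k in range(max(counts) + 1):
    low = start + k * BIN_WIDTH
    high = low + BIN_WIDTH - 1
    print(f"{low}-{high}: {'#' * counts[k]} ({counts[k]})")
```

A roughly symmetric pattern of counts, highest in the middle categories, is what a bell-shaped distribution would produce.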
QQ-plot:
A visual method for checking whether two sets of data are drawn from
the same distribution. The QQ-plot shows a reference line at a 45-degree angle;
if the two data sets are drawn from the same distribution, the points fall on
that line.
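The points of a normal QQ-plot can be computed without any plotting library. The sketch assumes the plotting positions (i - 0.5)/n, which is one common convention and may differ from the one SPSS uses. The theoretical quantiles come from a normal distribution with the sample mean and standard deviation, so the reference line is y = x.

```python
import statistics
from statistics import NormalDist

scores = [49, 49, 49, 50, 53, 53, 53, 54, 54, 54,
          55, 56, 56, 56, 57, 58, 58, 58, 59, 60,
          61, 61, 61, 61, 61, 63, 64, 64, 64, 65]

xs = sorted(scores)
n = len(xs)
mean = statistics.mean(xs)
fitted = NormalDist(mean, statistics.stdev(xs))

# Theoretical quantile for the i-th smallest observation,
# using the assumed plotting position (i - 0.5) / n.
theoretical = [fitted.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]

# Each (theoretical, observed) pair is one point of the QQ-plot;
# points close to the line y = x support normality.
for t, x in zip(theoretical, xs):
    print(f"theoretical {t:6.2f}   observed {x:3d}   gap {x - t:+.2f}")
```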
Conclusion:
Our sample size is 30 and we test at the 0.05 level of significance.
The KS statistic (0.1196) is less than the critical value, so we fail to reject
the null hypothesis: there is no significant difference between the sample
distribution and the normal distribution.
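The comparison with the critical value can be sketched as follows. The critical value here uses the large-sample approximation sqrt(-ln(alpha/2)/2)/sqrt(n), which is an assumption; a printed KS table gives a slightly different exact value for n = 30.

```python
import math

n = 30
alpha = 0.05
ks_statistic = 0.119638019   # KS value reported in this assignment

# Large-sample approximation to the two-sided KS critical value
# (assumed here; roughly 1.36 / sqrt(n) at alpha = 0.05).
critical_value = math.sqrt(-0.5 * math.log(alpha / 2)) / math.sqrt(n)

reject_null = ks_statistic > critical_value
print(f"KS = {ks_statistic:.4f}, critical value = {critical_value:.4f}")
print("Reject H0" if reject_null else "Fail to reject H0")
```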
__________________________
(SHAPIRO-WILK TEST)
Definition:
The Shapiro-Wilk test is a hypothesis test that evaluates whether a data
set is normally distributed. It evaluates data from a sample with the null
hypothesis that the data set is normally distributed. A large p-value means the
data are consistent with a normal distribution; a low p-value indicates that
they are not normally distributed.
Another definition:
The Shapiro-Wilk test is a hypothesis test applied to a
sample, with the null hypothesis that the sample has been generated from a normal
distribution. If the p-value is low, we can reject the null hypothesis and say
that the sample has not been generated from a normal distribution.
It is an easy-to-use statistical tool that answers the normality check we need,
but it has one flaw: it does not work well with large data sets. The maximum
allowed size for a data set depends on the implementation; in Python, a sample
size larger than 5,000 gives only an approximate calculation of the p-value.
It is a very useful tool for checking that a normality requirement is satisfied
whenever we need it, and it belongs in every data scientist's toolbox.
Note:
If the significance value of the Shapiro-Wilk test is greater than 0.05,
the data are treated as normal.
Null & Alternative Hypothesis:
H0: the sample comes from a normal distribution.
H1: the sample does not come from a normal distribution.
Level Of Significance:
𝜶 =0.05
Test Statistic
Shapiro-Wilk Tables
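For reference, the Shapiro-Wilk statistic is usually written as below. The notation is the standard one and is not taken from this assignment: x_(i) is the i-th smallest observation, x-bar is the sample mean, and the coefficients a_i are read from the Shapiro-Wilk tables for the given n.

```latex
b = \sum_{i=1}^{\lfloor n/2 \rfloor} a_{n-i+1}\,\bigl(x_{(n-i+1)} - x_{(i)}\bigr),
\qquad
W = \frac{b^{2}}{\sum_{i=1}^{n} \bigl(x_i - \bar{x}\bigr)^{2}}
```

The numerator is the square of a weighted sum of differences between the largest and smallest ordered values, and the denominator is the sum of squared deviations from the mean, so W lies between 0 and 1, with values near 1 supporting normality.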
Given data:
Achievement Motivation-(xi)
49
49
49
50
53
53
53
54
54
54
55
56
56
56
57
58
58
58
59
60
61
61
61
61
61
63
64
64
64
65
Next,
Next,
W (numerator) = (-18.111)^2
W (denominator) = 96486.24
W = 18.112/96486.24 = 0.18771589
p-value = 0.927
Conclusion:
p-value > 0.05, so we fail to reject H0: the sample is consistent with a
normal distribution.
________________________________________________________