0% found this document useful (0 votes)
127 views24 pages

Statistical Analysis of Ungrouped Data

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Topics covered

  • Statistical Research,
  • Statistical Findings,
  • Cumulative Frequency,
  • Data Interpretation,
  • Data Sampling,
  • Data Visualization,
  • Scatter Plot,
  • Data Summarization,
  • Standard Deviation,
  • Data Representation
0% found this document useful (0 votes)
127 views24 pages

Statistical Analysis of Ungrouped Data

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Topics covered

  • Statistical Research,
  • Statistical Findings,
  • Cumulative Frequency,
  • Data Interpretation,
  • Data Sampling,
  • Data Visualization,
  • Scatter Plot,
  • Data Summarization,
  • Standard Deviation,
  • Data Representation

STATS

STATISTIEK
UNGROUPED DATA
ONGEGROEPEERDE DATA
• Range = highest value – lowest value
Omvang (variasiewysdte) = grootse waarde – kleinste waarde

𝟏
• 𝑸𝟏(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = (𝒏 + 𝟏)
𝟒 • Inter quartile range : IQR = 𝑄3 − 𝑄1
𝟏 Inter kwartiel variasiewydte : IKV = 𝑄3 − 𝑄1
• 𝑸𝟐(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = 𝟐
(𝒏 + 𝟏)
𝟑
• 𝑸𝟑(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = 𝟒
(𝒏 + 𝟏) Copyright
E vd Merwe
BOX AND WHISKER • Five number summary
• Vyfgetal opsomming
MOND EN SNOR • Minimum
𝟏
𝒔𝒚𝒎𝒎𝒆𝒕𝒓𝒊𝒄𝒂𝒍 𝑠𝑖𝑚𝑚𝑒𝑡𝑟𝑖𝑒𝑠 • 𝑸𝟏(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = 𝟒
(𝒏 + 𝟏)
𝟐𝟓% 𝟐𝟓% 𝟏
𝟐𝟓% 𝟐𝟓% • 𝑸𝟐(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = 𝟐
(𝒏 + 𝟏)
𝟑
• 𝑸𝟑 • Maximum
• 𝑸𝟑(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = (𝒏 + 𝟏)
• Minimum • 𝑸𝟏 • 𝑸𝟐 𝟒
Lower quartile Medain Upper quartile
Onderste kwartiel Mediaan Boonste kwartiel • Maximum

𝒔𝒌𝒆𝒘 𝒕𝒐 𝒕𝒉𝒆 𝒍𝒆𝒇𝒕 𝑠𝑘𝑒𝑒𝑓 𝑛𝑎 𝑙𝑖𝑛𝑘𝑠 𝒔𝒌𝒆𝒘 𝒕𝒐 𝒕𝒉𝒆 𝒓𝒊𝒈𝒉𝒕 𝑠𝑘𝑒𝑒𝑓 𝑛𝑎 𝑟𝑒𝑔𝑠
Distribution of data
Verspreiding van data
Normal distribution (symmetrical)
Normale verspreiding (simmetries)
Negative skew (left skewed) Positive skew (right skewed)
Positief skeef (skeef na links) Positief skeef (skeef na regs)
𝑥ҧ < 𝑄2 𝑥ҧ > 𝑄2

Normal distribution (symmetrical) Positive skew (right skewed)


Normale verspreiding (simmetries)
Positief skeef (skeef na regs)
Negative skew
Positive skew (left skewed)
(right skewed) Positief skeef
Positief skeef (skeef na links)
(skeef na regs)
Negative skew (left skewed)
Positief skeef (skeef na links)
Distribution of data
Verspreiding van data
Negative skew (left skewed) Normal distribution (symmetrical) Positive skew (right skewed)
Positief skeef (skeef na links) Normale verspreiding (simmetries) Positief skeef (skeef na regs)

𝑥ҧ < 𝑄2 𝑥ҧ = 𝑄2 𝑥ҧ > 𝑄2
JUNE 2016

QUESTION 1 VRAAG 1

On a certain day a tour operator sent 11 tour buses ‘n Toeroperateur het op ‘n sekere dag 11 toerbusse na 11
to 11 different destinations. The table shows the verskillende bestemmings toe gestuur. Die table hieronder
number of passengers on each bus. toon die getal passasiers op elke bus.

1.1 Calculate the mean number of passengers 1.1 Bereken die gemiddelde getal passasiers wat in ‘n
travelling in a tour bus. toerbus ry.
JUNE 2016

1.2 Write down the five-number summary of the data.


Skryf die vyf-getal-opsomming van die data neer.

1.3 Draw a box and whisker diagram for the data.


Skets ‘n mond-en-snor diagram vir die data.

1.4 Refer to the box and whisker diagram and


comment on the skewness of the data set.
Verwys na die mond-en-snor-diagram en lewer
kommentaar op die skeefheid van die datastel.
STANDARD DEVIATION ✓ ✓ ✓
JUNE 2016

STANDAARDAFWYKING
1.5 Calcultae the standard deviation for this data set.
How to find standard deviation on Casio?
Bereken die standaardafwyking vir hierdie dataset.
Enter the DATA:

• Press Mode (top right corner)


• then press 2 for STAT mode 1.6 A tour is regarded popular if the number of passengers
• then press 1 corresponding to the 1-Var option. on a tour is one standard deviation above the mean. How
many destinations were popular on this particular day?
• Now enter the data (= →enter & AC → save)
‘n Toer word as gewild beskou indien die getal passasiers
• Press SHIFT STAT (1) op ‘n toerbus een standaardafwyking bokant die gemiddelde is.
• VAR (4) Hoeveel bestemmings was op hierdie spesifieke dag gewild?
• 𝜎𝑥 (3)
1.1
MEMO
MORE BOX AND WHISKER QUESTIONS
MEER MOND EN SNOR VRAE
25% 25%
25% 25% Maart 2016

QUESTION 1
The box and whisker diagram shows the marks (out of 80)
obtained in a History test by a class of nine learners.

1.1 Comment on the skewness of the data.

1.2 Write down the range of the marks obtained. Maximum



1.3 If the learners had to obtain 32 marks to pass the minimum
test, estimate the percentage of the class that failed the
test.
VRAAG 1
Die mond- en- snordiagram toon punte (uit 80) wat ‘n klas van nege leerders in ‘n Geskiedenistoets behaal het.

1.1 Lewer kommentaar op die skeefheid van die data.

1.2 Skryf die omvang (variasiewydte) neer van die punte wat behaal is.

1.3 Indien die leerders 32 punte moes behaal het om die toets te slaag, beraam watter persentasie van die klas die toets
gedruip het.
Maart 2016

20 41 62 75 75 80

1.4 In ascending order, the second mark is 28, the third 1.4 In stygende volgorde is die tweede punt 28, die
mark is 36 and the sixth mark 69. derde punt 36 en die sesde punt 69.
The seventh and eight marks are the same. Die sewende en die agste punte is dieselfde. Die
The average mark for this test is 54. gemiddelde punt vir hierde toets is

Fill in the marks of the remaining learners in ascending Vul die punte van die oorblywende leerders in stygende
order. volgorde in.
SCATTER PLOT JUNE 2016

SPREIDIAGRAM
QUESTION 2

On the first day of each month information is recorded about the


temperature at midday (in °C) and the number of 500ml bootles
of water that were sold at the tuck shop of a certain school
during the lunch break. The data is shown in the table and
represented on the scatter plot. The least square regression line
for this data is drawn on the scatter plot.

VRAAG 2

Op die eerste skooldag van elke maand word inligting oor die
middaguur-temperatuur (in °C) en die getal 500 ml-bottels water
wat in pouse by die snoepie van ‘n sekere skool verkoop word
aangeteken. Die data word in die table getoon en op die
spreidiagram voorgestel. Die kleinste-regressielyn vir hierdie data is
op die spreidiagram geskets.
2.1 Identify an outlier in the data. JUNE 2016
Identifiseer ‘n uitskieter in die data.

2.2 Determine the equation of the least squares regression line.


Bepaal die vergelyking van die kleinste kwadrate-regressielyn.
CASIO: X Y
• Mode
18= 12=
• STAT (2)
• 2: A+Bx 21= 15=
• Complete table .
• SHIFT STAT (1) .
• 5: Reg .
• 1: A AC
• 2: B (save)
• 3: r

2.3 Estimate the number of 500 ml bottels of water


that will be sold if the temperature is 28°C at midday.
Skat die getal 500 ml-bottels water wat verkoop sal
word indien die middagtemperatuur 28°C is.
2.4 Refer to the scatter plot. Would you say that the relation between the temperature at midday JUNE 2016
and the number of 500 ml bottles of water sold is weak or strong? Motivate your answer.
Verwys na die spreidiagram. Sou jy sê dat die verband tussen die middaguurtemperatuur en die getal
500 ml-bottels water wat verkoop word, swak of sterk is? Motiveer jou antwoord.

2.4 Strong / Sterk


The majority of the points lie close to the regression line.
Die meerderheid punte lê naby die regressielyn.
OR/ OF
𝑟 = 0,98

2.5 Give a reason why the observed trend for this data
cannot continue indefinitely.
Gee ‘n rede waarom die waargenome neiging vir hierdie data
nie onbepaals kan aanhou nie.
GROUPED DATA
GEGROEPEERDE DATA
STATS
STATISTIEK

𝟏
• 𝑸𝟏(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = (𝒏)
𝟒
𝟏
• 𝑸𝟐(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = (𝒏)
𝟐

𝟑
• 𝑸𝟑(𝒑𝒐𝒔𝒊𝒕𝒊𝒐𝒏) = (𝒏)
𝟒
Cummulative frequency graph (Ogive) March 2016

Kummulatiewe frekwensie grafiek (Ogief)


QUESTION 2 Number of messages Number of days VRAAG 2
Getal boodskappe Aantal dae
A company recorded the number of 10 < 𝑥 ≤ 20
10+20
=15 2 ‘n Maatskappy het die getal
2 boodskappe aangeteken wat oor ‘n
messages sent. The data is shown in 20 < 𝑥 ≤ 30 25 8
the table. tydperk van 60 werksdae per e-pos
30 < 𝑥 ≤ 40 35 5 gestuur is. Die data word in die table
2.1 Estimate the mean number 40 < 𝑥 ≤ 50 45 10 getoon.
of messages sent per day, 50 < 𝑥 ≤ 60 55 12 2.1 Benader die gemiddelde getal
rounded off to TWO decimal
60 < 𝑥 ≤ 70 65 18 boodskappe gestuur, afgerond tot
places.
70 < 𝑥 ≤ 80 75 3 TWEE desimale plekke.
80 < 𝑥 ≤ 90 85 2
2.2 Draw a cumulative frequency graph (Ogive) of
the data.
Teken ‘n kummulatiewe frekwensie-grafiek (Ogief)
van die data.
2.3 Hence, estimate the number of days on which 65 or
more messages were sent.
Benader vervolgens die getal dae waarop 65 of meer
boodskappe gestuur word.
46
Number of messages Number
Getal boodskappe of days
Aantal
dae cum 𝑓 ( ; )
10 < 𝑥 ≤ 20 2 2 (20 ; 2)
20 < 𝑥 ≤ 30 8 10 (30 ; 10)
30 < 𝑥 ≤ 40 5 15 (40 ; 15)
40 < 𝑥 ≤ 50 10 25 (50 ; 25)
50 < 𝑥 ≤ 60 12 37 (60 ; 37)
60 < 𝑥 ≤ 70 18 55 (70 ; 55)
70 < 𝑥 ≤ 80 3 58 (80 ; 58)
2 60 (90 ; 60) 65
80 < 𝑥 ≤ 90
Histogram Nov 2014

QUESTION 2

The speed of 55 cars passing through a certain section


of a road are monitored for one hour. The speed limit
on this section of the road is 60km per hour. A
histogram is drawn to represent the data.

2.1 Identify the modal class of the data.

VRAAG 2 2.1 Benader die gemiddelde


getal boodskappe gestuur,
Die spoed van 55 motors wat deur ‘n sekere gedeelte afgerond tot TWEE desimale
van ‘n pad beweeg, word vir een uur gemonitor. Die plekke.
spoedgrens op hierdie gedeelte van die pad is 60km per
uur. ‘n Histogram is geskets om hierdie data voor te stel.

2.1 Identifiseer die modale klas.


Nov 2014

2.2.1 Complete the cumulative frequency column in


the table.

Voltooi die kummulatiewe frekwensie kolom in die


table.
( ; ) Nov 2014
(30;1) Modale klas
(40 ; 8)
(50;21)
Modal class
(60,38)
(70;47)
(80;52)
(90;54)
(100;55)

2.2.2 Draw a cumulative frequency graph (Ogive) of


the data.
Teken ‘n kummulatiewe frekwensie-grafiek (Ogief)
van die data.
Nov 2014
2.3 The traffic department sends speeding fines to
55
all motorists whose speed exceeds 66 km per hour.
Estimate the number of motorists who will receive a
speeding ticket.
Die verkeersdepartement stuur spoedboetes aan alle 44
motoriste wat ‘n spoed van 66 km per uur oorskry.
Skat die getal motoriste wat ‘n spoedboete sal
ontvang.

66
EXAMPLE/VOORBEELD (nov 2011)
1. Determine the median.
Bepaal die mediaan.

𝑸𝟐 = 𝟔, 𝟗

𝟑
× 𝟑𝟎 22,5
2. Determine the interquartile range 𝟒
Bepaal die interkwartielvariasiewydte.
𝟏
× 𝟑𝟎
𝟐
𝑸𝟑 − 𝑸𝟏 = 𝟖, 𝟓 − 𝟒, 𝟓 = 𝟒
𝟏
× 𝟑𝟎 7,5
𝟒

4,5 8,5
6,9
EXAMPLE/VOORBEELD (nov 2011)
3. Complete the frequency table for the ogive.
Voltooi die frekwensietabel vir die ogief.

Time 𝒇 𝒄𝒖𝒎 𝒇 Coordinates


Tyd 𝒌𝒖𝒎 𝒇 koordinate
1<𝑥≤3 3 3 (3; 3)
3<𝑥≤5 6 9 (5; 9)
5<𝑥≤7 7 16 (7; 16)
7<𝑥≤9 8 24 (9; 24)
9 < 𝑥 ≤ 11 5 29 (11; 29)
11 < 𝑥 ≤ 13 2 31 (13; 31)

You might also like