0% found this document useful (0 votes)
70 views

Maths Microproject

This document describes a micro project on preparing charts for ungrouped and grouped data. It includes the project proposal, rationale, aims, literature review, methodology, resources used, and outputs. The project was carried out by a group of 8 students. They identified given data as raw, ungrouped, and grouped data. They calculated metrics like range, mean deviation, standard deviation, and variance. Through this project, the students learned to apply statistical techniques to engineering problems and developed skills in teamwork and time management.

Uploaded by

shantanudhage0
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
70 views

Maths Microproject

This document describes a micro project on preparing charts for ungrouped and grouped data. It includes the project proposal, rationale, aims, literature review, methodology, resources used, and outputs. The project was carried out by a group of 8 students. They identified given data as raw, ungrouped, and grouped data. They calculated metrics like range, mean deviation, standard deviation, and variance. Through this project, the students learned to apply statistical techniques to engineering problems and developed skills in teamwork and time management.

Uploaded by

shantanudhage0
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 20

All India Shri Shivaji Memorial Society’s Polytechnic

Kennedy Road, Near RTO, Pune-411001


MICRO PROJECT ON Group 7
Title of micro project .-Prepared chart for ungrouped and grouped
data and Use techniques and methods of statistics to crack discipline specific problems

Course:Basic mathematics

Corse Code :311302

Academic Year: 2023-2024


Semester: EJ-1k

Name:

Roll No.:

Enrolment No:

Name of Micro project Guide:

All India Shri Shivaji Memorial Society’s Polytechnic,


Kennedy Road Near RTO, Pune-411 001
=======================================
CERTIFICATE
----------------------------------------------------------------------------

Certified that this micro project report titled‘prepared chart for ungrouped
and grouped datais the bonafide work of Ms./Mr.
Roll no. of first year diploma Electronics and Telecommunication
Engineering
for thecourse: Basic mathematics Course code:311302during the academic
year 2023-2024 , who carried out the micro project work under my
supervision.

Name & signature of course


teacher

INDEX
Sr. No. Content Page No.

1. Micro Project Proposal

2. Rationale

3. Aims/Benefits of the Micro Project

4. Literature Review

5. Course Outcome achieved

6. Actual Methodology Followed

7. Actual Resources Used

8. Output of the Micro Projects

9. Skill Developed / Learning outcome of the Micro


Project

10. Applications of the Micro Project


ANNEXURE I
1. Micro Project Proposal
Title- - Prepared chart for ungrouped and grouped data and Use techniques and methods of
statistics to crack discipline specific problems

1.1Aims/Benefit of the Micro Project


Identify given data as raw data , ungrouped data and groped data
Calculate range , mean deviation for raw data , ungrouped data and groped data
Calculate standard deviation and variance for ungrouped and grouped data
1.2 Course Outcomes Addressed:
CO5 Use techniques and methods of statistics to crack discipline specific problems.

1.3Proposed Methodology (Procedure that will be followed to do the micro project- in about
100-
200 words):
Finalize the topic
Find and Collect information
Arrange information
Proofread the information
Review from the teacher
Edit project report as per the suggestions of the teacher
Proofread and finalize the report
Submission of the report
1.4Action Plan:
Sr. Detail of Activity Planned Planned Name of
No. Start date Finish Responsible Team
Date Members

1. Finalize the topic

2. Find and Collect


information

3. Arrange information

4. Proofread the information

5. Review from the teacher

6. Edit project report as per


the suggestions of the
teacher

7. Proofread and finalize the


report

8. Submission of the report


1.5Resource Required:
Sr. Name of Resource/Material Specificatio Qty. Remark
No. n

1. www.wikipedia.com Internet

2. www. Encyclopedia .com Internet

1.6 Name of Team Members with Roll Nos:


Sr. No Name of Students Roll No

1
361
SURWASE SURAJ N
2
362
TATHE OMKAR SANTOSH
3
363
THOMBARE SAKSHI DADA
4
364
THORAT ABHIJEET VISHNU
5
365
UDHE VRUSHALI SANTOSH
6
366
WABLE PRANJALI P
7
367
WAGH NISHANT MILIND
8
368
WALAVALKAR TEJRAJ PRADIP

(Name & Signature of faculty)


ANNEXURE - II
1 Micro Project Report
Title- - Prepared chart for ungrouped and grouped dataUse techniques and methods
of statistics to crack discipline specific problems

2 Rationale:This project is an attempt to initiate the multi-dimensional logical thinking and


reasoning capacity of the students.It will help to apply the concept of statistics
engineering related problems .

3. Aims/Benefits of the Micro Project:


1 Identify given data as raw data , ungrouped data and groped data
2 .Calculate range , mean deviation for raw data , ungrouped data and groped data
3. Calculate standard deviation and variance for ungrouped and grouped data. .

4Course Outcomes Achieved:


CO5 Use techniques and methods of statistics to crack discipline specific problems
5Literature Review:
1.www.wikipedia.com
2www.solve mathematics .com
3. www.study .com
4.www.information .com
5.www. Encyclopedia .com

6Actaul Methodology Followed: (Write stepwise work done, data collocted and its
analysisis (if
any). The contribution of individual member may also be noted.) :
Sr. No./ Date Time Work Done
Hour No.

1. 1 Hour Decided the theme of microproject.

2. 1 Hour Discussed microproject with teacher.

3. 1 Hour Microproject theme approved by teacher.

4. 1 Hour Team Members discussed about microproject.

5. 1 Hour Assigned work to team members.

6. 1 Hour Collected information about microproject.

7. 1 Hour Viewed Team member’s progress

8. 1 Hour Reviewed work of memebers

9. 1 Hour Helped weak members complete their part

10. 1 Hour Completed Microproject


11. 1 Hour Edited and Merged Microproject

12. 1 Hour Demonstrated microproject to the teacher

13. 1 Hour Asked Teacher’s suggestions on our


microproject

14. 1 Hour Re-edited mistakes and faults

15. 1 Hour Finalized the microproject

16. 1 Hour Submitted micro project to the teacher

7Actual Resource Used:


Sr. Name of Resource/Material Specification Qty. Remark
No.

1. www.wikipedia.com Internet

2. www. Encyclopedia .com Internet

8.0 Outputs of the Micro project (Presentation of data, findings, drawing etc.):
9.0Skill developed / Learning outcome of the Micro-Project:
Studentwill able to :
1.Identify given data as raw data , ungrouped data and groped data
2 .Calculate range , mean deviation for raw data , ungrouped data and groped data
3. Calculate standard deviation and variance for ungrouped and grouped data.
4) Demonstrate working as a leader / team member

10Applications of the Micro-Project:


Apply the concept of statistics to solve engineering related problems .

Grouped vs. Ungrouped Data


Grouped Data – Data that has been organized into groups (into a frequency distribution).
If you see a table similar to the one below, you will know that you are
dealing with grouped data:

Class Frequency
0–5 4
6 – 10 5 The frequency of a class is
11 – 15 12 the number of numbers in
16 – 20 7 that class. For example, there
must have been four numbers
between 0 and 5.

Ungrouped Data – Data that has not been organized into groups.
Ungrouped data looks like a big ol’ list of numbers.

How to Group Data


On your exam, you may have to construct a frequency distribution. Constructing a
frequency distribution is the same thing as grouping data.

The first step in grouping data is deciding how large of a class interval to use.
(Class interval = Class size)

There are 2 formulas for determining the appropriate class interval. You must be able to
choose which one would be appropriate for any given problem.

1. Class interval = Use when the problem states the


number of classes to be used.

2. Class interval = Use when the problem does not


state the number of classes to be
used.

**Don’t forget to always round up to the nearest whole number when dealing with
class interval.**
Populations vs. Samples
Population – A collection of all possible individuals, objects, or measurements (Mason
7).

Please note that a population does not have to be huge in size. It is all a matter of how
the objects in the group are defined. For example, if we wanted to compute the average
GPA of the population of students taking BA254 at GRCC in the summer of 2002, our
population would only include approximately 75 people. On the other hand, if we
wanted to find the average GPA of all students who have ever taken BA254 at GRCC,
we would be dealing with a much larger number of students.

This is because of how each population was defined. In the first example, we gave our
population a definition that severely limited the number of GPA’s that would be included.
The second population was defined a little more broadly, and, therefore, more students
would fall under this definition.

Sample – A portion or part of the population. (Mason 7)

Measures of Central Tendency


Central Tendency – Where the numbers tend to cluster.
Where most of the numbers are at.
A single value that summarizes a set of data by locating the center
of the data (Mason 65).

4 Basic Measures of Central Tendency:

1. The Mean = The Average = The Arithmetic Mean

2. The Median = The Middle (of the road)


50% of the data fall above the median, and 50% fall below the median.

3. The Mode = The Most


The mode is the number(s) that appear(s) the most out of a given set of data. A
data set can have more than one mode value.

4. Geometric Mean = G.M.


The geometric mean is a measure of central tendency that is particularly useful
for averaging percentages (%). It is sometimes considered to be a more
“conservative” average. This is because of the fact that the GM is always less
than or equal to the Arithmetic Mean.
Measures of Dispersion

What is Dispersion? To measure the dispersion of a set of numbers means to measure


how spread out the numbers in the set are. Dispersion = Spread.

Some Measures of Dispersion:

1. Range = Highest Value – Lowest value


The range is the simplest measure of dispersion; it only takes into account the
highest and lowest values.

2. Mean Deviation = M.D.


M.D. involves all the values of a set of data into its calculation.

3. Variance = s2 or ó2

s2 = Sample variance
ó2 = Population variance

4. Standard Deviation = s or ó

s = Sample standard deviation


ó = Population standard deviation

**Please note that the variance is equal to the standard deviation squared.
This also means that the standard deviation is the square root of the variance.

Therefore, if we know either the standard deviation or the variance, we can


always determine the other very quickly.

For example, if standard deviation = 5, then variance = 52 = 25

Or

If Variance = 100, then standard deviation = = 10.


Formulas for Ungrouped Data
**Please remember how to distinguish between Grouped and Ungrouped Data (On
previous pages!)**

**Also, please note that you will have to differentiate between populations and samples
when dealing with ungrouped data.**

MEAN = ARITHMETIC MEAN = AVERAGE

= the sample mean =

ì = the population mean =

**The mean for ungrouped data will be calculated in the exact same way, regardless of
whether you are dealing with a sample or a population. Later in this class, you will need
to differentiate between ì and X. It is, therefore, a good idea to become aware of the
definitions of ì and X now.**

MODE = MOST

Whichever number or numbers that appear the most within a list (set) of data.

RANGE = A SIMPLE MEASURE OF SPREAD

Range = Highest value – Lowest value

VARIANCE = s2 or ó2

**When dealing with Ungrouped data, you must determine whether you
are dealing with a sample or a population when computing the standard
deviation or the variance. Be careful!!! (If it is a sample, you will likely
see the word “sample” in the problem. If it is a population, you may or
may not see the word “population” in the problem. In other words, if
you do not see the word “sample”, assume that you are dealing with a
population!)

s2 = Sample Variance =

ó2 = Population Variance = =
STANDARD DEVIATION = s or ó

s = Sample Stand. Dev. =

ó = Population Stand. Dev. = =

GEOMETRIC MEAN = G.M.

The GM is useful when averaging percentages and also when looking for an average
percentage increase over time (taking into account the effects of compounding).

There are 2 G.M. formulas on your formula sheet. You need to know when and
where to use each!
GEOMETRIC MEAN FORMULA #1

This formula is appropriate where you have a big ol’ list of numbers (ungrouped
data).

G.M. =

n = number of numbers
X1, X2 = each individual value
GEOMETRIC MEAN #2

This formula is appropriate where you have a beginning as well as an ending value.
This formula, therefore, also involves time, and it is likely to involve years and/or
dates.

G. M. =

n = number of time periods (e.g. number of years)


MEAN DEVIATION = M.D.

M.D. =

**The straight up and down brackets indicate absolute value. Please remember
that this means to add up all of the differences without paying attention to whether
they are negative or positive!

Mean deviation is another measure of spread (others include range, std. deviation,
variance).

MEDIAN (UNGROUPED)

Steps:

1. Locate the median.

Location of Median =
This formula tells us the rank of the median in terms of smallest to largest.

2. Put the numbers in order from smallest to largest.

3. Determine the value of the median by applying the number you got from the location
of median formula to the ascending list of numbers that you just created.

For example, if the location of the median is equal to 7, then the median is the 7th number
in order of smallest to largest.

Using another example, if the location of the median is equal to 7.5, then the median is
halfway (.5) between the 7th and the 8th numbers in order from smallest to largest.
Q1, Q3, and Quartile Deviation
Q1 = First Quartile

Q1 is the value that has 25% of the data set below it, and 75% above it.

Computing Q1 and Q3 is very similar to the way the median is determined. In fact, the
median could also be called Q2.

Steps:

1. Locate Q1.

Location of Q1=

2. Put the numbers in order from smallest to largest.

3. Determine the value of Q1 by applying the number you got from the location of Q1
formula to the ascending list of numbers that you just created.

Value of Q1 = (Higher number of the two – lower number of the two) * the
percentage to the right of the decimal place from you result to the location formula
+ the lower of the two numbers

Q3 = Third Quartile

Q3 is the value that has 25% of the data set below it, and 75% above it.

Computing Q1 and Q3 is very similar to the way the median is determined. In fact, the
median could also be called Q2.

Steps:

1. Locate Q3.

Location of Q3=

2. Put the numbers in order from smallest to largest.

3. Determine the value of Q3 by applying the number you got from the location of Q3
formula to the ascending list of numbers that you just created.

Value of Q3 = (Higher number of the two – lower number of the two) * the
percentage to the right of the decimal place from you result to the location formula
+ the lower of the two numbers
Formulas for Grouped Data Problems

**Please remember how to distinguish between Grouped and Ungrouped Data (On
previous pages)**

If you are presented with a problem consisting of Grouped Data, you will need to use
some formulas and techniques which are unique to Grouped Data Problems.

**Also, please note that you will not have to differentiate between populations and
samples when dealing with grouped data.**

**I will use the following example to illustrate all of the following calculations relating
to grouped data:

Class Frequency = f
0 up to 5 2
5 up to 10 7
10 up to 15 12
15 up to 20 6
20 up to 25 3

MEAN = ARITHMETIC MEAN = AVERAGE

MEAN for Grouped Data = =

Class Frequency = f Midpoint = X fX = f multiplied by X


0 up to 5 2 2.5 5
5 up to 10 7 7.5 52.5
10 up to 15 12 12.5 150
15 up to 20 6 17.5 105
20 up to 25 + 3 22.5 + 67.5
30 = n 380 = • fx = the sum
of fx
**Take note of the difference between how we figure out n in this
GROUPED data problem, as opposed to how we would solve for n in an
UNGROUPED data problem(Here we add the frequencies, in Ungrouped we
would just count the number of numbers). In either case, n is the number of
numbers!

= = 380/30 = 12.667
STANDARD DEVIATION (GROUPED) = s

STANDARD DEVIATION for Grouped Data = s =

Class f Midpoint = X fX = f multiplied by X X2 fX2


0 up to 5 2 2.5 5 6.25 12.5
5 up to 10 7 7.5 52.5 56.25 393.75
10 up to 15 12 12.5 150 156.25 1875
15 up to 20 6 17.5 105 306.25 1837.5
20 up to 25 + 3 22.5 + 67.5 506.25 + 1518.75
30 = n 380 = • fX 5637.5 = • fX2

s=

s = 5.331

VARIANCE (GROUPED) = s2

s = 5.331, so s2 = 5.3312 = 28.420

MEDIAN

Steps for Grouped Data Median:


1. Locate which class contains the median by using the following formula:

Location of median = = 15.5


This formula tells you what “place” the median would be in if the numbers were
placed in order from smallest to largest! In this example, the median is located
halfway between the 15th and the 16th numbers.

2. Now figure out which class would contain the median. (Remember the median in
this example is between the 15th and the 16th numbers in order from smallest to
largest.)

Class Frequency = f Cumulative Frequency


0 up to 5 2 2
5 up to 10 7 9
10 up to 15 12 21
15 up to 20 6 27
20 up to 25 3 30
The median is somewhere between 10 and 15. We know this because we can look at
the Cumulative Frequency and see that the 15th and the 16th numbers are in this class!
(In fact, the 10th through the 21st numbers were in this class)

3. Do the Math!

**This is one of the trickier formulas for this exam! Please take close note of
what the following variables stand for in this particular problem:

Median =
L = Lower limit of the class that contains the median = 10
n = Number of numbers = 30
CF = Number of numbers before the class containing the median = 9
f = number of numbers in the class containing the median = 12
i = class interval (size)

Median =
MODE (GROUPED) = MOST

The mode for a grouped data problem is the midpoint of the class with the highest
frequency (f).

The class 10 – 15 has the highest frequency (12), so the mode is the midpoint of
this class. MODE = 12.5

**Please take close note that the formulas used for computing the mean, the
median, and the standard deviation for GROUPED data, all contain
lowercase “f” in them !!!! I would recommend that you use this fact to help
remember which formulas are appropriate for the grouped data problems.

RANGE (GROUPED)

Range (Grouped) = Upper Limit of Highest Class – Lower Limit of Lowest Class

In our example, 0-5 is the lowest class and 0 is the lower limit of this class. Also,
20-25 is the highest class, and 25 is the upper limit of this class.

Range(Grouped) = 25 – 0 = 25
Uses of Standard Deviation

Chebyshev’s Theorem =
where k = the number of standard deviations from the mean.

Many years ago, Russian mathematician P.L. Chebyshev developed a theorem that allows
us to determine, for any set of data, the minimum proportion of values that lie within a
specified number of standard deviations from the mean.

Basically, Chebyshev’s Theorem states that at least 1- (1/k2) of the data must fall between
the points + ks and - ks.

So, one could say that at least 75% of the data in any given set will fall within 2 standard
deviations of the mean.

Proof
1-(1/k2) = 1 – (1/22) = 1 – ¼ = .75

Further, if it was known that the mean of the same set of data was equal to 20 and the
standard deviation was equal to 5, one could conclude that at least 75% would fall
between 10 and 30.

Proof
We proved that 75% fall within 2 standard deviations of the mean when we did that
calculation a couple of seconds ago.

Now we need to figure out how far two standard deviations from the mean is in our
particular example!

We need to compute + + ks and - ks:

+ ks = 20 + 2*5 = 20 +10 = 30

- ks = 20 – 2*5 = 20 – 10 = 10
Empirical Rule (For the “normal” distribution) ***(Grouped or Ungrouped)***

First of all, the normal distribution is a set of numbers where the 3 main measurements of
central tendency (the mean, median, and mode) are all approximately equal. Graphically,
a normal distribution is bell-shaped and symmetrical.

For the normal distribution, we can be more precise as to what percent of the data will
fall within a stated “distance” from the mean.

MEMORIZE THE FOLLOWING PERCENTAGES AND THE ASSOCIATED


INTERVALS

± 1S 68%
± 2S 95%
± 3S 99.7%

X ± M.D. 57.5%

In addition to memorizing the percentages dealing with the normal curve above,
make sure to remember that 57.5% of the data for any set of numbers will fall
between – M.D. and +M.D.

Coefficient of Variation = C.V.

CV =

“A direct comparison of 2 or more measures of dispersion (such as the standard deviation


for a distribution of annual incomes compared to the standard deviation for a distribution
of the number of days absent for the same group of employees) is impossible. It is
impossible because the two values are measured in different units. The absenteeism
would be measured in number of days, while the incomes would be measured in dollar
bills($). In order to make a meaningful comparison of the 2 standard deviations, we need
to convert them to a relative value.” (Mason 113) The measure that is often used to make
these types of comparisons is the Coefficient of Variation.
Coefficient of Skewness = Sk

Sk =
The coefficient of skewness is another measure of where the data in a distribution are
located. This formula might indicate that an extreme value (high or low) has affected the
mean in a significant way.

The coefficient of skewness generally ranges from –3 to +3.

If Sk<0, then distribution is negatively skewed.


If Sk>0, then distribution is positively skewed.
If Sk is approximately = to 0, then the data is normally distributed.

Name & Signature of faculty

A LL I NDIA S HRI S HIVAJI M EMORIAL S OCIETY ’ S P OLYTECHNIC


Kennedy Road, Near RTO, Pune-411001

You might also like