We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 20
CHAPTER IV
BIOSTATISTICS
Objective: ‘Tomake learner familiar with biostatistics as an important tool of analysis and
applications,
Desired outcome: The learner will be able to collect, organize and analyze data using parametric
and non-parametric tess. —
They will also be able to set up a hypothesis and verify the same using limits
of significance.
4.1 Probability Distributions:
Probability distributions are theoretical or expected frequency distributions when the number of
observations made is very large. They are useful tools in making inferences and decisions under
conditions of uncertainty based on limited data or theoretical considerations. Probability distributions
are either discrete or continuous depending on whether they define probabilities for discrete or
random variables.
Probability distributions are of many types but the most extensively used are Binomial distribution,
Poisson Distribution and Normal Distribution.
Binomial Distribution
Binomial distribution was discovered by Swiss mathematician James Bernoulli (1654-1705). Itisa
discrete probability distribution which is obtained when the probability ‘P’ of the happening of an
event is same in all the trials and there are only two outcomes in each trial. For example, tossing of
a coin has two outcomes - Head or tail likewise result of taking a test can be a success or failure.
The probability distribution of binomial distribution for X success in n trails is given by
POO="C, pra
=nl /x! nex! * p%qr™* For x=0,1, 2.1
where n=no, of trails. x= no. of success
p= probability of success q= probability of failure (q= 1p)
Characteristics
1. Itisa discrete probability distribution.
2. It has two parameters p or q (the probability of success or failure) and n (the number of
trails). The parameter n is always a positive integer.
3. In binomial distribution - Mean (jt) = np; Variance (6”) = npq ;
Standard deviation (6) = npq
4, Binomial distribution issymmetrcalifp=q~0.5; positively skewedifp< 0.5 and negatively
skewed if p> 0.5
5. “As‘n’ increases the binomial distribution approaches the normal distribution,
(& scanned with OKEN ScannerExample
hat is the probability of findi is
wha itty of Findings females in a sample of S fishes drawn one by one, if probability
Solution
Probability of finding female fish = 0.
=P
Probability of not finding female fish (i.e finding male fish
=05=4q
Applying binomial distribution,
P(x) = "Cy p* qr
Where n=5, x=3, p= 0.5=1/2
P(3) = C3 (1/2)3 (1/2)?
= 51/3! (5-3)! * (1/2)
1/34 21 * (1/2)
=5x4x3x2K1/3x2x1x2x1* (1/2)
5/16 = 0.312
Poisson Distribution
Poisson distribution was derived by French mathematician Siméon Denis Poisson (1837). Poisson
distribution is the probability distribution of discrete random variables of rare events whose
probability occurrence (p) is very small but number of variables (n) is very large such that np is
constant.
Formula
The probability distribution of a random variable of X is said to have Poissot
Pexy=e™ mix!
where P= Probability of success; x = variables 0,1,2,3,..
tant 2.7183; m=mean of distribution
n distribution.
= cons
Characteristics
1. Itisa discrete probability distribution, a limit
thas a single parameter, the mean of distribution.
ited form of Binomial distribution.
2.
3. Mean and variance are equal.
4. m increases, it will tend towards normal
It is positively skewed to the left. However, as
distribution.
5. In Poisson distribution, it is assumed that rare events occur randomly and independently.
an
a
(& scanned with OKEN ScannerExample
Asscience book with 585 pages contains 43 typological errors. Ifthese errors are randomly distributed
throughout the book, what is the probability that 10 pages selected at random will be free from
errors?
(Use e--735= 0.4795)
Solution
n=10
Typological errors 43 out of total 585 pages
P= 43/585=0.0735
Mean = m= np = 10 x 0.0735 = 0.735
Poisson Distribution P(x)= e-™ m* /x!
Probability of successes =1
P (0) = e735 x (0,735) 01
0.4795 x 1 = 0.4795
Normal Distribution
‘Normal distribution was first discovered by English Mathematician De Movire in 1733. Later it
was rediscovered and developed by Gauss (1809) and Laplace (1812) and hence also called as,
Gausian and Laplace distribution. Normal distribution is a continuous probability distribution which
is bell shaped unimodal and symmetrical. When normal distribution of a variable is represented
graphically, ittakes the shape of a symmetrical curve called as Normal Curve. This curve is asymptotic
to X- axis on either side.
Normal curve can be represented by the following equation
y=Vovie ¢--aV"
Where m = mean; 1 = constant = 3.14; o = standard deviation; e = constant = 2.7185
‘Characteristics
1. Normal distribution is a continuous distribution and can assumes any value from -2 to +00.
2. Normal distribution is identified by two parameters-mean (m) and standard deviation (6)
3. Mean + I a include 68.27% (2/3"4) of all the observations.
Mean + 2.6 include 95.45 % of all the observations.
Mean + 3 o include 99% of all the observations. (fig. 4.1)
‘Normal Curve is bell shaped and symmetrical about the line X= 1
Both the tails extend to infinity (asymptotic).
It is a unimodal i.e. has only one mode.
‘Mean=Median=Mode=p.
The center of the curve is defined by the mean and the spread of the curve by standard
deviation.
eXAyavs
92.
(& scanned with OKEN Scanner| til
coeffic .
9. The coefficient of skewness is zero and coefficient of kurtosis is 3.
10. Total area under the normal curve is 1
To.
i silts Ne
Mean Mean
Fig. 4.1: Area betweenm+16 ; m+1.966 ; m+2.586
Applications
1. It is used to approximate binomial and poisson distributions.
2. Itis used in sampling theory.
3. Itis used in statistical quality control.
4, Itis used in statistical hypothesis and tests of significance in which itis always assumed that
samples are drawn from a population with normal distribution.
Z transformation.
‘A random variable Z. which has a normal distribution with mean 1=0 and standard deviation @ = 1
js called standard normal distribution and curve is called standard normal curve. Tables giving the
reas under the standard normal curve are available for ready use. Total area bound by normal curve
and the X axis is 1 1f we have to calculate area under the curve that falls between two points on the
Xaxis for example between X= aand X=b, we can make use ofthe tables after transforming the
normal variable X to a standard normal variable Z using the formula : Z = X-w/o
where Z= standard normal variable; y= Mean; o= standard deviation
This is known as Z transformation.
Example
Weight of Rohu was found to be normally distibuted with mean at $00 gand standard deviation
50 . Find the standard normal variate of Rohu with 520 g.
Solution
Weight X = 520g,» =500g ando=50
= 520-500/50 = 20/50 =0.4
P—valne
When we perform a bypothesis test in statistics, a p-value helps in determining the significance of
‘our results. Hypothesis tests are used to test the validity of a claim that is made about a population.
This claim that’s on trial is called the null hypothesis. The alternative hypothesis is the one we
accep ifthe null hypothesis is concluded to be untrue, The evidence in the tial isthe data and the
983
(& scanned with OKEN Scannerstatistics that go along with it. All hypothesis tests use a p-value to weigh the strength of the
evidence (what the data tells about the population). The term significance level (alpha) is used to
refer toa pre-chosen probability and the term ‘P value’ is used to indicate a probability that_you
calculate after a given study. If your P value is less than the chosen significance level then you
reject the null hypothesis i.e. accept that your sample gives reasonable evidence to support the
alternative hypothesis. The choice of significance level at which you reject Hy is arbitrary.
Conventionally the 5% (less than 1 in 20 chance of being wrong), 1% and 0.1% (P <0.05, 0.01 and
0.001) levels have been used.
Probability
Probability may be defined as relative frequency or probable chances of occurrence of an event on
anaverage. For example, giving birth to a female in the first pregnancy; getting, head when unbiased
coin is tossed is Ys or 50%, Probability is denoted by symbol ‘p’ and its value ranges from zero to
one
0