Chapter Eight
Data Analysis
The goal of any research is to provide information out of row data
The row data after collection has to be processed and analyzed in line
with the outline (plan) laid down for the purpose at the time of
developing the research plan
Data Processing
Data possessing implies
editing, coding and classification
of collected data so that they are amenable to analysis
Editing
:
Is a process of examining the collected raw data to detect errors and
omission (extreme values) and to correct those when possible
It involves a careful scrutiny of completed questionnaires or schedules
It is done to assure that the data are accurate, consistent with other data
gathered, uniformly entered, and as complete as possible
Editing can be either field editing or central editing
Coding
:
Refers to the process of assigning numerical or other symbols to answers
so that responses can be put into a limited number of categories or
classes.
Coding is used when the researcher uses computer to analyze the data
otherwise it can be avoided
Classification
:
Most research studies result in a large volume of raw data, which
must be reduced into homogeneous group.
This is to classify the raw data or arranging data in-groups or classes on
the basis of common characteristics
Classification according to attributes
Classification according to class interval
Data Analysis
Data analysis is further transformation of the processed data to
look for patterns and relations among data groups
By analysis we mean the computation of certain indices or measures
along with searching for patterns or relationship that exist among the data
groups
Analysis particularly in case of survey or experimental data
involves estimating the values of unknown parameters of the population
and testing of hypothesis for drawing inferences.
Analysis can be categorized as
•
Descriptive Analysis
•
Inferential (Statistical) Analysis
Descriptive Analysis
Descriptive analysis refers to the transformation of raw data into a form
that will make them easy to understand and interpret
Descriptive response or observation is typically the first form of analysis
The calculation of averages, frequency distribution, and
percentage distribution is the most common form of summarizing
data
The most common forms of describing the processed data are:
•
Tabulation
•
Percentage
•
Measurements of central tendency
•
Measurements of dispersion
•
Measurement of asymmetry
•
Data transformation and index number
Inferential Analysis
Most researcher wishes to go beyond the simple tabulation of frequency
distribution and calculation of averages and /or dispersion
When the population is consisting of more than one variable it
is possible to measure the relationship between them
If we have data on two variables we said to have a bivariate variable, if
the data is more than two variables then the population is known as
multivariate population
In case of bivariate population; correlation can be found using
•
Cross tabulation
•
Karl Pearson’s coefficient of correlation: It is simple correlation and
commonly used
•
Charles Spearman’s coefficient of correlation
In case of multivariate population; correlation can be studied
through:
•
Coefficient of multiple correlation
•
Coefficient of partial correlation
Is there any cause and effect (causal relationship) between two variables
or between one variable on one side and two or more variables on the
other side?
This question can be answered by the use of regression analysis
There are different techniques of regression.
•
In case of bivariate population; cause and effect relationship can be
studied through simple regression.
•
In case of multivariate population; Causal relationship can be
studied through multiple regression analysis
Time series Analysis
;
Successive observations of the given phenomenon over a period
of time are analyzed through time series analysis.
It measures the relationship between variables and time(trend)
The past trend can be used to evaluate the success or failure
of management or any other policy.
Based on past trend the future patterns can be predicted and policy may
accordingly be formulated