Data Analysis Notes 1
Data Analysis Notes 1
STATISTICS
- is the scientific discipline that provides methods to help us POPULATION AND SAMPLE
make sense of data.
1. POPULATION - the entire collection of individuals or objects
- is about understanding the role that variability plays in about which information is desired.
- has the same meaning as the Latin word datum which EXAMPLES
data.)
- Scores of entire students of 2nd year standing civil engineering students
- can refer to the mere tabulation of numeric information as in - All children of any age who have older or younger siblings
2. SAMPLE
APPLICATION OF STATISTICS
- Today statistics is a necessity in studies and researches.
- Some of the most important subject areas which make use DEFINITION
of statistical theory and techniques are as follows:
agriculture, automatic data processing, biology, business, 1. VARIABLE
- Any characteristic whose value may change from one individual or object
demography, economics, education, engineering, to another or from one observation to another.
entertainment, environmental studies, fisheries, - Any characteristic to be measured on each individual such as
government, health, insurance, manufacturing, medicine, compressive strength, or specific gravity; or a characteristic to be
observed such as one’s locality.
- In general, uses of statistics may be described by the - The variable who is the predictor of the dependent variable
b. DEPENDENT VARIABLE
following statements:
- The variable whose value is being predicted
1. DESCRIPTIVE STATISTICS
- Observing variable yields data.
- the branch of statistics that includes methods for - SET OF DATA is a collection of values for a particular variable.
organizing and summarizing data.
2. INFERENTIAL STATISTICS
- the branch of statistics that involves generalizing from a UNIVARIATE DATA SET
sample to the population from which the sample was - Consists of observations on a single variable made on
selected and assessing the reliability of such individuals in a sample or population
generalizations.
- its main concern is to analyze the organized data leading TYPES OF UNIVARIATE DATA SET
to prediction or inferences.
1. CATEGORICAL DATA SET (QUALITATIVE)
1. DESCRIPTIVE STATISTICS
- A union leader measures the attitudes of the 248 steelworkers in his - Yields from non-numeric variables (e.g. sex, birthplace,
union chapter towards the proposition that unemployment benefits religious preference, texture, etc)
be extended by asking them if they approve or disapprove of 2. NUMERICAL DATA SET (QUANTITATIVE)
Philippines’ 100 largest metropolitan areas by combining several - Numerical in nature and can be ranked
are paid equal amounts for equal work. Accordingly, he surveys 39 - Possible values of the variable correspond to isolated
companies and compares the pay of male and female workers at
these companies, statistically equating for the company, years of points on the number line.
- Wearing seatbelts increases the chance of survival in automobile - Results from variables whose values can be counted
accidents.
using integral values (e.g. number of participants,
number of cars, etc)
2. CONTINUOUS DATA
3. INTERVAL DATA
- Possible values forms and entire interval on the number - Indicate an actual amount and there is equal unit of
line.
measurement separating each score, specifically equal
- Observations involves making measurements, as intervals.
opposed to counting
- Does not only include “greater than” and “less than”
- Results from variable that yields decimals or fraction
relationships, but also has a limit of measurement that
permits us to describe how much more or less one
NUMBER LINE object possesses than another.
4. RATIO SCALES
EXAMPLES
CONTINUOUS
1. NOMINAL DATA
EXAMPLE
2. ORDINAL DATA
3. INTERVAL DATA
- The scale to which measurement belong will be important in - If A earns 1000PhP a day and B earns 2000PhP a day, then B earns
twice as much as A.
DEFINITION
MEASUREMENT
1. NOMINAL DATA
2. ORDINAL DATA
DATA ANALYSIS
- is a process that begins with careful planning followed by
data collection, data description using graphical and
numerical summaries, data analysis, and finally
interpretation of results.