0% found this document useful (0 votes)
53 views

Stat20023.Week 1 - Basic Concepts

This document provides an overview of engineering data analysis and statistics. It defines key terminology like population, sample, parameter, and statistic. It explains the difference between descriptive and inferential statistics, and how they are used to understand data. Descriptive statistics involve presenting and summarizing data through tables, graphs, and numerical summaries. Inferential statistics draw conclusions about populations from samples using probability and estimation. The document also covers types of data, discrete vs continuous data, and levels of measurement for variables.

Uploaded by

Ryan Cortes
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
53 views

Stat20023.Week 1 - Basic Concepts

This document provides an overview of engineering data analysis and statistics. It defines key terminology like population, sample, parameter, and statistic. It explains the difference between descriptive and inferential statistics, and how they are used to understand data. Descriptive statistics involve presenting and summarizing data through tables, graphs, and numerical summaries. Inferential statistics draw conclusions about populations from samples using probability and estimation. The document also covers types of data, discrete vs continuous data, and levels of measurement for variables.

Uploaded by

Ryan Cortes
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

Polytechnic University of the Philippines

Stat20023: Engineering Data Analysis

Definitions and Terminology


Process of Statistics
Qualitative and Quantitative
Discrete and Continuous
Levels of Measurement
The Engineering Design Process

https://www.youtube.com/watch?v=MAhpfFt_mWM
Stat20023: Engineering Data Analysis
Develop a clear
What Engineers Do? description

Identify the
An engineer is someone who solves important factors
problems of interest to society with Conduct
the efficient application of scientific Propose or refine experiments
principles by: a model

• Refining existing products Manipulate the


model
• Designing new products or
processes Confirm the
solution

Conclusion and
Stat20023: Engineering Data Analysis recommendations
Statistics

The field of statistics deals with the collection, presentation, analysis,


and use of data to:

Design products
Make decisions Solve problems
and processes
Stat20023: Engineering Data Analysis
Statistics is the science of collecting, organizing, summarizing,
and analyzing information to draw conclusions or
answer questions

What information is referred to in the definition?


The information referred to the
definition is the data

Stat20023: Engineering Data Analysis


Statistical methods are used to help us describe
and understand variability.

Variability
By variability, we mean successive observations of a system
or phenomenon do not produce exactly the same result.

Probability
The chance that something will happen. How likely it is that some
event will occur.

Stat20023: Engineering Data Analysis


Where do engineering and statistics meet?
Biomedical Engineering:
• Create an algorithm to diagnose eye
disease based on photographs of the eye
• Can I say that my algorithm does a good
job at diagnosing? Does it do as well as a
doctor who looks at the photograph?
• Collect photos from sample of patients, get
diagnoses from both methods, compare to
the truth. See if the data suggest the
algorithm is reasonably accurate

Stat20023: Engineering Data Analysis


Where do engineering and statistics meet?
Civil Engineering
• Which intersection should
be improved first if we
want to most efficiently
reduce traffic accidents?
• Are there certain
characteristics that make
an intersection very
dangerous? Turning lanes,
traffic volume, etc.
https://philippineslifestyle.com/philippine-cities-uninhabitable-traffic-congestion/

Stat20023: Engineering Data Analysis


Applications of Probability and Statistics

Cellphone transmission over a noisy medium has a


probability of error. Number of extra bits sent depends on
statistics of the noise.

Weather prediction based on past observations makes


use of probabilities

Stat20023: Engineering Data Analysis


Applications of Probability and Statistics

Speech recognition is based on determining the


most likely (highest probability) spoken words
based on statistics of past speech. The statistics
may be speaker specific.

Stat20023: Engineering Data Analysis


Applications of Probability and Statistics
Image compression standards like jpeg make use
of unequal probabilities of pixel intensities.

https://www.lifewire.com/the-effect-of-compression-on-photographs-493726

Stat20023: Engineering Data Analysis


Definition of Terms

Universe Population Individual


Is the set of all Is the set of all A person or object
entities under study possible values of that is a member of
the variable the population being
studied
Stat20023: Engineering Data Analysis
Definition of Terms

Sample Parameter Statistic


Subset of the A numerical A numerical value
universe or the summary of a that describes a
population population sample or a number
computed from the
sample data
Stat20023: Engineering Data Analysis
Understand the Process of Statistics
Identify the research Collect the information needed
objective to answer the questions

Draw conclusion from the Organize and summarize


information the information
Inferential Statistics Descriptive Statistics
Uses methods that takes results Describe the information collected
obtained from a sample, extends them through numerical measurements,
to the population, and measures the charts, graphs, and tables
reliability of the result
Stat20023: Engineering Data Analysis
Descriptive Statistics
Presenting, organizing and summarizing data

https://covid19.who.int/table
Stat20023: Engineering Data Analysis
Descriptive Statistics

https://www.covid19.gov.ph/health/epidemiological-data-
Stat20023: Engineering Data Analysis analytics
Descriptive Statistics

https://www.covid19.gov.ph/health/epidemiological-data-
Stat20023: Engineering Data Analysis analytics
Statistical Inference
The Science of drawing statistical conclusions from specific data
using a knowledge of probability
Illustration Sample
Population

Stat20023: Engineering Data Analysis


Illustration
We want to know about these We have to work with
Population Sample
Random Selection

Parameter Inference
Statistic
(Population mean) (Sample mean)
Stat20023: Engineering Data Analysis
Exercise
The main campus at Polytechnic University of the Philippines (PUP) has a
population of approximately 42,000 students. A research question is "what
proportion of these students commutes with LRT2 regularly?" A survey was
administered to a sample of 987 Polytechnic University of the Philippines
students. Forty-three percent (43%) of the sampled students reported that
they used the LRT2 regularly. How confident can we be that 43% is close to
the actual proportion of all PUP students who commute by LRT2?
Identify the population, parameter, sample, and statistic

Stat20023: Engineering Data Analysis


Types of Data
Qualitative Quantitative
Involves attributes Represents quantity or amount
Sex

Occupation

Height/Length Temperature
Location

Time
Stat20023: Engineering Data Analysis
Discrete or Continuous

1. Chart to show a company’s profit over a


number of years
2. Chart to show favorite drink chosen by
customers in a restaurant
3. Chart to show the temperature on each
day of the week
4. Chart to show percentage of each sale of
ticket type at a concert

Stat20023: Engineering Data Analysis


Discrete or Continuous
Discrete Continuous
This type of data can’t be measured but it Continuous Data represents
can be counted. It basically represents measurements and therefore their
information that can be categorized into a values can’t be counted but they can
classification. An example is the number of be measured. An example would be
heads in 100 coin flips. the height of a person, which you can
describe by using intervals on the real
number line.

Stat20023: Engineering Data Analysis


Nominal Level
Data that can only be classified into categories and cannot be arranged
in an ordering scheme

Stat20023: Engineering Data Analysis Levels of Measurement


Ordinal Level
Data or categories that can be ranked; that is, one category is higher
than another. However, numerical differences between data values
cannot be determined.

Stat20023: Engineering Data Analysis Levels of Measurement


Interval Level

The distance between


numbers is a known, constant
size, but the zero value is
arbitrary.

Stat20023: Engineering Data Analysis Levels of Measurement


Ratio Level

Data possessing a natural


zero point and organized into
measures for which
differences are meaningful.

Stat20023: Engineering Data Analysis Levels of Measurement


Variable

Qualitative Quantitative

Ordinal Nominal Ordinal Interval Ratio

Stat20023: Engineering Data Analysis Levels of Measurement


Level of
Properties Examples Descriptive Statistics Graphs
Measurement
Nominal Discrete Binary Response Frequencies/Percentages
Order Less Names of People Mode

Ordinal Comparisons Likert Scales Frequencies


Ordered Categories Mode
Median
Percentiles
Interval Differences between Temperature Frequencies
ordered values have Mode
meaning Median, Mean
Standard Deviation

Ratio Continuous Money Mean


True 0 allows Weight Standard Deviation
ratio statements

Stat20023: Engineering Data Analysis


Engineering Applications of Statistics
1. Statistical process Control
2. Quality assessment
3. Model Building and Predicting
4. Communicating with and Acting on experimental results
5. Assessing Design Reliability
6. Experimental design
7. Simulation

Stat20023: Engineering Data Analysis


References and Resources
• The Engineering Method & Statistical Thinking, 2014 John Wiley and Sons, Inc.
• https://www.youtube.com/watch?v=MAhpfFt_mWM
• https://www.lifewire.com/the-effect-of-compression-on-photographs-493726
• https://philippineslifestyle.com/philippine-cities-uninhabitable-traffic-
congestion/
• https://www.covid19.gov.ph/health/epidemiological-data-analytics
• https://icon-library.com/icon/icon-for-decision-2.html
• https://icons8.com/icons/
• Icons made by <a href="https://www.flaticon.com/authors/freepik"
title="Freepik">Freepik</a> from <a href="https://www.flaticon.com/"
title="Flaticon"> www.flaticon.com</a>

Stat20023: Engineering Data Analysis

You might also like