STAT 1602: Introductory Statistics
1. Definition and Overview of Statistics
Statistics is the discipline of collecting, organizing, analyzing, interpreting, and presenting data.
It divides into:
- Descriptive statistics: summarize data (mean, median, mode, charts).
- Inferential statistics: draw conclusions about a population from a sample (hypothesis testing, confidence intervals).
Practice Questions:
1. Define descriptive and inferential statistics. Give an example of each.
2. Why use a sample over a full population?
3. List two measures of central tendency and two of dispersion.
2. Methods of Data Collection
Two main methods:
- Questionnaires: written surveys; efficient, but prone to low response rates.
- Interviews: verbal data collection, richer but costlier.
Practice Questions:
1. Give one advantage each of questionnaires and interviews.
2. When is an interview better than a questionnaire?
3. Would you use a questionnaire or interview for app feedback? Why?
3. Measures of Central Tendency
- Mean: average = sum / count.
- Median: middle value of the sorted data (or the average of the two middle values when the number of values is even).
- Mode: most frequent value.
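The three measures above can be checked with Python's standard statistics module. This is a minimal sketch; the data set is made up for illustration and is not taken from the course material.

```python
from statistics import mean, median, multimode

# Hypothetical data set, chosen only for illustration
data = [4, 1, 7, 4, 9, 5]

data_mean = mean(data)        # sum / count = 30 / 6
data_median = median(data)    # sorted: 1, 4, 4, 5, 7, 9 -> (4 + 5) / 2
data_modes = multimode(data)  # list of every value tied for most frequent

print(data_mean, data_median, data_modes)  # 5 4.5 [4]
```

multimode is used instead of mode so that a data set with several equally frequent values reports all of them.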
Practice Questions:
1. Compute mean, median, mode of {2, 3, 3, 5, 8, 10}.
2. When is median better than mean?
3. What is the mode in {"red", "blue", "blue", "green", "red", "blue"}?
4. Measures of Dispersion
- Mean deviation: average of the absolute differences from the mean.
- Variance: average of the squared differences from the mean (the population variance divides by n; the sample variance divides by n - 1).
- Standard deviation: square root of the variance.
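The definitions above can be sketched in Python as follows. The data set is hypothetical. The population formulas (divide by n) are shown first, with the sample variance (divide by n - 1) alongside for comparison; which one a given exercise expects is not stated here, so treat that choice as an assumption.

```python
from statistics import mean, pstdev, pvariance, variance

# Hypothetical data set, chosen only for illustration
data = [2, 4, 4, 6, 9]

m = mean(data)  # 25 / 5 = 5

# Mean deviation: average absolute distance from the mean
mean_dev = sum(abs(x - m) for x in data) / len(data)  # (3+1+1+1+4) / 5

pop_var = pvariance(data)    # squared deviations sum to 28; 28 / 5
pop_sd = pstdev(data)        # square root of the population variance
sample_var = variance(data)  # same sum, divided by n - 1: 28 / 4

print(mean_dev, pop_var, round(pop_sd, 3), sample_var)
```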
Practice Questions:
1. Compute variance and SD of {5, 7, 7, 9, 10}.
2. What does a larger SD mean?
3. How do you compute mean deviation?
5. Measures of Position
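- Quartiles (Q1, Q2, Q3): divide the ordered data into four equal parts; Q2 is the median.
- Deciles: divide the ordered data into ten equal parts.
- Percentiles: divide the ordered data into one hundred equal parts.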
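A short Python sketch of these cut points, using statistics.quantiles (Python 3.8 or later). The data set is hypothetical. Textbooks and software use several slightly different conventions for locating quartiles, so the values from this function (its default "exclusive" method) may differ from a hand method taught in class.

```python
from statistics import quantiles

# Hypothetical data set; quantiles() sorts it internally
data = [10, 2, 14, 6, 4, 12, 8]

q1, q2, q3 = quantiles(data, n=4)  # three cut points -> four parts
deciles = quantiles(data, n=10)    # nine cut points -> ten parts
percentiles = quantiles(data, n=100)  # 99 cut points -> hundred parts

print(q1, q2, q3)  # 4.0 8.0 12.0
print(len(deciles), len(percentiles))  # 9 99
```

With seven values the exclusive method places Q1, Q2, and Q3 exactly on the 2nd, 4th, and 6th sorted values, which is why no interpolation appears in the output.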
Practice Questions:
1. Find Q1, median, Q3 for {10, 20, 30, 40, 50, 60, 70, 80}.
2. What does the 90th percentile of a data set represent?
3. A student scores at the 80th percentile on an exam. What does this say about the student's performance?
6. Regression Analysis
- Linear regression: y = a + bx.
- a: y-intercept (value of y when x = 0); b: slope (change in y per one-unit increase in x).
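One common way to obtain a and b from data is the least-squares fit, sketched below. The notes above do not name a fitting method, so least squares is an assumption here; the function name fit_line and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    b = s_xy / s_xx        # slope
    a = y_bar - b * x_bar  # intercept: the line passes through (x_bar, y_bar)
    return a, b

# Hypothetical data, chosen only for illustration
xs = [1, 2, 3, 4]
ys = [2, 4, 5, 7]

a, b = fit_line(xs, ys)
print(f"y = {a:.2f} + {b:.2f}x")  # y = 0.50 + 1.60x

predicted = a + b * 5  # predicted y at x = 5
```

The sketch assumes the x values are not all identical; otherwise s_xx is zero and the slope is undefined.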
Practice Questions:
1. Interpret y = 30 + 2x, where x is study hours and y is test score.
2. What would a negative slope imply in a regression of salary on experience?
3. What does the y-intercept mean?
7. Correlation Analysis
- Pearson r: measures the strength and direction of a linear relationship; ranges from -1 to 1.
- r > 0: positive, r < 0: negative, r = 0: no linear correlation.
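Pearson r can be computed directly from deviations about the means, as in this minimal sketch. The function name pearson_r and the data are hypothetical, and the code assumes neither variable is constant (otherwise the denominator is zero).

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    return s_xy / sqrt(s_xx * s_yy)

# Hypothetical data, chosen only for illustration
xs = [1, 2, 3, 4]
r_pos = pearson_r(xs, [2, 4, 5, 7])  # y rises with x
r_neg = pearson_r(xs, [7, 5, 4, 2])  # y falls as x rises

print(round(r_pos, 3), round(r_neg, 3))  # 0.992 -0.992
```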
Practice Questions:
1. What does r = 0.85 mean?
2. Give examples of variables with positive, negative, and zero correlation.
3. Compute r for x = {1,2,3}, y = {2,4,6}.
8. Ratios, Rates, and Index Numbers
[... Section trimmed for brevity. Full content available on request ...]