Which of the following is false about bootstrapping?
A bootstrap confidence interval constructed
based on a biased sample will still yield an unbiased estimate for the population parameter of
interest.
Which of the following is false regarding paired data? Each observation in one data set is
subtracted from the average of the other data set's observations.
Which of the following is false about bootstrap and sampling distributions? Both distributions are
created by sampling with replacement from the population.
Bootstrap distributions are centered at the sample statistic, sampling distributions are
centered at the population parameter.
Both distributions get narrower as the standard deviation decreases.
Researchers studying IQ scores of mothers and fathers of ``gifted" children collected data from 36
gifted children and their parents. First, differences in IQ scores of the father and the mother were
calculated for each child (calculated as father's IQ score - mother's IQ score). The dot plot below
shows the bootstrap distribution of means of 200 bootstrap samples taken from this original sample
of differences in IQ scores. The mean of the bootstrap distribution is approximately -3.48 points and
the bootstrap standard error is 1.3 points. Assume the usual conditions for constructing a bootstrap
confidence interval are satisfied. Which of the following statements is false? Since 0 is apparently
an unusual value for the statistic, then at the 5% significance level we would fail to reject a
null hypothesis of that claims that the fathers' and mothers' average IQs are equal.
When doing inference on a single mean, which of the following is the correct justification for using
the tt-distribution rather than the normal distribution? Because the standard error estimate may
not be accurate.
How does the shape of the tt-distribution change as the sample size increases? It becomes more
normal looking
Air quality measurements were collected in a random sample of 25 country capitals in 2013, and
then again in the same cities in 2014. We would like to use these data to compare average air
quality between the two years. Which of the following tests is the most appropriate? paired t-test
with two-sided alternative hypothesis
The sample size is 26. The test statistic is calculated as T = 2.485. What is the p-value?
between 0.01 and 0.02
When doing an ANOVA, you observe large differences in means between groups. Within the
ANOVA framework this would most likely be interpreted as: Evidence strongly favoring the
alternative hypothesis.
Which of the following is most useful for checking the equal variance across groups condition for
ANOVA? Side-by-side box plots showing roughly equally sized boxes for each group.
Based on the ANOVA output below, what is the value of the F-statistic? Choose the closest answer.
1.87
A study compared five different methods for teaching descriptive statistics. The five methods were
traditional lecture and discussion, programmed textbook instruction, programmed text with lectures
computer instruction, and computer instruction with lectures. 45 students were randomly assigned, 9
to each method. After completing the course, students took a 1-hour exam. We are interested in
finding out if the average test scores are different for the different teaching methods. At least two
group means are significantly different from each other.
For given values of the sample mean and the sample standard deviation when n = 25, you conduct a
hypothesis test and obtain a p-value of 0.0667, which leads to non-rejection of the null hypothesis.
What will happen to the p-value if the sample size increases (and all else stays the same)?Decrease
A study compared five different methods for teaching descriptive statistics. The five methods were
traditional lecture and discussion, programmed textbook instruction, programmed text with lectures,
computer instruction, and computer instruction with lectures. 45 students were randomly assigned, 9
to each method. After completing the course, students took a 1-hour exam. We are interested in
finding out if the average test scores are different for the different teaching methods. 0.005
Suppose we wanted to compare the rates of return for two stocks: the technology company Intel and
the U.S. airline Southwest Airlines. To compare the rates of return, we take a random sample of 50
days of Intel's stock returns and another random sample of 50 days for Southwest's stock returns
(not necessarily the same days). These data should not be treated as paired. Why would these data
not be considered paired data? The data can't be considered paired data because the days for
which we have Intel data may be different from the days for which we have Southwest
Airlines data.
A study examining the relationship between weight of school children (4th to 6th graders) found a
95% confidence interval for the difference between the average number of school days missed by
overweight and normal weight children (We are 95% confident that overweight children on
average miss 1.3 to 2.8 days more than children with normal weight.
An insurance company wants to estimate (using a confidence interval) its average claim amount
using data from 20 randomly selected claims. Which of the following is false? A confidence
interval based on this sample is not accurate since the sample size is small.
The figure below shows three tt-distribution curves. Which curve has the highest degree of freedom?
Solid
My friend, Tom, believes that his supermarket's prices are lower than mine, and sets an alternative
hypothesis test reflecting this. We construct a list of 10 identical items and purchase them at our
respective stores. Tom wants to know if these data support his hypothesis. Which of the following is
the correct description of Tom's situation? Tom has a one-sided alternative hypothesis and
should do a paired t-test.
The sample sizes are n_1 = 20n1=20 and n_2 = 40n2=40. Your friend who is working on this
hypothesis test calculates a Z statistic, Z = 2.5. Which of the following is true? She should be using
a T statistic instead of a Z statistic.
Which of the following is not a condition required for comparing means across multiple groups using
ANOVA? There should be at least 10 successes and 10 failures.