Business Analytics (Evans)
Chapter 8 Trendlines and Regression Analysis
I. MCQ
1) A regression model that involves a single independent variable is called ________.
A) single regression
B) unit regression
C) simple regression
D) individual regression
2) Regression models of ________ data focus on predicting the future.
A) missing
B) time-series
C) panel
D) cross-sectional
The following table exhibits the age of antique furniture and the corresponding prices. Use
the table to answer the following question(s). (Hint: Use scatter diagram and the Excel
Trendline tool where necessary).
Number of years    Value ($)
78 930
91 1010
83 970
159 1950
134 1610
210 2880
88 980
178 2010
124 1370
72 900
3) What is the relationship between the age of the furniture and its value?
A) Nonlinear
B) Linear
C) Curvilinear
D) No relationship
4) Which of the following is true of linear functions used in predictive analytical models?
A) It is used when the rate of change in a variable decreases or increases quickly and then levels
out.
B) It is used when there is a steady decrease or increase over a range of a variable.
C) It is used when there is increase at a specific rate.
D) It is used when there is a rise or fall at a constantly increasing rate.
Downloaded by Qu?nh Nh? Lê Th?
5) ________ are mathematical functions used in predictive analytical models that
define phenomena that increase at a specific rate and are represented by the formula
y = ax^b.
A) Exponential functions
B) Power functions
C) Polynomial functions
D) Logarithmic functions
6) Which of the following mathematical functions, used in predictive analytical models, is
represented by the formula y = ax^3 + bx^2 + cx + d?
A) exponential functions
B) power functions
C) logarithmic functions
D) polynomial functions
7) In ________ functions, represented by y = ab^x, y rises or falls at constantly increasing rates.
A) logarithmic
B) power
C) exponential
D) polynomial
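The function families named in questions 4 to 7 can be compared side by side with a minimal Python sketch. The coefficient values below are arbitrary illustration values, not estimates from any data set in this chapter, and the logarithmic form y = a ln(x) + b is assumed to follow the form Excel's Trendline tool reports.

```python
import math

# Trendline function families; a, b, c, d are hypothetical coefficients
# chosen only to illustrate the shape of each formula.
def linear(x, a=2.0, b=1.0):
    return a * x + b                 # y = ax + b

def power(x, a=2.0, b=1.5):
    return a * x ** b                # y = ax^b

def exponential(x, a=2.0, b=1.5):
    return a * b ** x                # y = ab^x

def polynomial3(x, a=1.0, b=-2.0, c=0.5, d=3.0):
    return a * x**3 + b * x**2 + c * x + d   # y = ax^3 + bx^2 + cx + d

def logarithmic(x, a=2.0, b=1.0):
    return a * math.log(x) + b       # y = a ln(x) + b

for x in (1, 2, 4, 8):
    print(x, linear(x), round(power(x), 3), round(exponential(x), 3),
          polynomial3(x), round(logarithmic(x), 3))
```

Printing the values over a doubling range of x makes the difference between a steady change (linear), a leveling-off change (logarithmic), and an accelerating change (exponential) easy to see.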
8) In Excel's Trendline tool, the value of the ________ gives the measure of fit of the line to the
data.
A) linear function
B) R-squared
C) moving average
D) set intercept
9) Which of the following is true of the R-squared (R^2) value in Excel's Trendline function?
A) A value of 1.0 for R^2 indicates maximum deviation of the data from the line.
B) If the value of R^2 is above 1.0, the line will be at a perfect fit for the data.
C) The value of R^2 will always be between -1 and 1.
D) As the value of R^2 gets higher, the line will be a better fit for the data.
10) Which of the following equations correctly expresses the relationship between the two
variables?
A) Value = (-181.16) + 13.493 × Number of years
B) Number of years = Value / 12.537
C) Value = (459.34 / Number of years) × 4.536
D) Number of years = (17.538 × Value) / (-157.49)
Copyright © 2016 Pearson Education, Inc.
11) What is the expected value for a 90 year-old piece of furniture?
A) $1002.45
B) $997.98
C) $934.56
D) $1033.21
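As an alternative to the Excel Trendline tool suggested in the hint, the linear fit for the antique furniture table can be sketched in plain Python using the least-squares formulas. The helper names fit_line and r_squared are illustrative assumptions, not part of the source material.

```python
# Antique furniture data from the table above.
years = [78, 91, 83, 159, 134, 210, 88, 178, 124, 72]
value = [930, 1010, 970, 1950, 1610, 2880, 980, 2010, 1370, 900]

def fit_line(x, y):
    """Return (intercept, slope) of the least-squares line through (x, y)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

def r_squared(x, y, intercept, slope):
    """Proportion of the variation in y explained by the fitted line."""
    y_bar = sum(y) / len(y)
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    sst = sum((yi - y_bar) ** 2 for yi in y)
    return 1 - sse / sst

b0, b1 = fit_line(years, value)
r2 = r_squared(years, value, b0, b1)
predicted_90 = b0 + b1 * 90   # expected value of a 90-year-old piece

print(f"Value = {b0:.2f} + {b1:.3f} x Number of years, R^2 = {r2:.4f}")
print(f"Predicted value at 90 years: {predicted_90:.2f}")
```

Because the sketch keeps full precision in the coefficients, its prediction can differ by a few cents from one computed with coefficients rounded as shown on an Excel chart.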
12) In a linear relationship, which of the following accounts for the many possible values of the
dependent variable that vary around the mean?
A) the coefficient of the dependent variable X
B) the value of the intercept β0
C) the random error term ε
D) the standard error S_YX
13) Which of the following is true about the observed errors associated with estimating the value
of the dependent variable using the regression line?
A) They are the horizontal distances between slopes and y-intercepts.
B) The errors are also referred to as critical values.
C) They are always maximized by the regression lines.
D) The errors can be negative or positive.
14) For a dependent variable Y, the error associated with the i-th observation is:
A) e_i = Y_i - Ŷ_i
B) Y_i = (e_i)^2 - Ŷ_i
C) (Ŷ_i)^2 e_i = Y_i
D) e_i = (Y_i + Ŷ_i)^2
Use the data given below to answer the following question(s).
Following is an extract from the database of a construction company. The table shows the
height of walls in feet and the cost of raising them. The estimated simple linear regression
equation is given as Ŷ = b0 + b1X. (Hint: Use Excel functions).
Height (ft)    Cost ($)
4 670
3 430
7 810
9 1100
6 790
8 880
5 760
11 1200
15) What is the value of the coefficient b0?
A) -2.25321
B) 0.010697
C) 254.8371
D) 86.81704
16) What is the value of the coefficient b1?
A) 86.81704
B) 254.8371
C) 0.010697
D) -2.14625
17) What is the estimated cost of raising a 10-foot wall?
A) 1505.786
B) 1103.578
C) 968.6109
D) 1123.008
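The coefficients of Ŷ = b0 + b1X for the wall data can also be computed outside Excel. The following is a minimal sketch in plain Python, assuming the usual least-squares formulas (the same quantities Excel's INTERCEPT and SLOPE functions return); it also lists the residuals to show that observed errors take both signs.

```python
# Wall height (ft) and cost ($) from the table above.
heights_ft = [4, 3, 7, 9, 6, 8, 5, 11]
costs = [670, 430, 810, 1100, 790, 880, 760, 1200]

n = len(heights_ft)
x_bar = sum(heights_ft) / n
y_bar = sum(costs) / n

# Least-squares estimates of the slope b1 and intercept b0.
sxx = sum((x - x_bar) ** 2 for x in heights_ft)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(heights_ft, costs))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

# Residuals e_i = Y_i - Y_hat_i: the vertical distances from the line.
residuals = [y - (b0 + b1 * x) for x, y in zip(heights_ft, costs)]

# Estimated cost for a wall height of 10 feet.
predicted_10ft = b0 + b1 * 10

print(f"b0 = {b0:.4f}, b1 = {b1:.5f}")
print("residuals:", [round(e, 2) for e in residuals])
print(f"estimated cost at 10 ft: {predicted_10ft:.3f}")
```

With an intercept in the model, the residuals of a least-squares line sum to zero up to rounding, which is why some must be positive and others negative.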
18) Which of the following statements is true when using the Excel Regression tool?
A) The range for the independent variable values must be specified in the box for the Input
Y Range.
B) Checking the option Constant is Zero forces the intercept to zero.
C) The Regression tool can be found in the Tools tab under Insert group.
D) Adding an intercept term reduces the analysis' fit to the data.
19) Which of the following generates a scatter chart in Excel with the values predicted by the
regression model included?
A) Trendline
B) Residual Plots
C) R Square
D) Line Fit Plots
20) Which of the following is true about the Excel output Multiple R?
A) It is often referred to as the coefficient of determination.
B) A value of 0 indicates positive correlation.
C) A negative slope of the regression line denotes a positive Multiple R.
D) It is another name for the sample correlation coefficient, r.
21) The R^2 value:
A) is the variability of the observed Y-values from the predicted values.
B) indicates that as the independent variable increases, the intercept term does too.
C) gives the proportion of variation in the dependent variable that is explained by the
independent variable.
D) transforms the cumulative probability scale (vertical axis) so that the graph of the cumulative
normal distribution is a straight line.
22) For a simple linear regression model, significance of regression is:
A) a measure of how well the regression line fits the data.
B) a hypothesis test of whether the true regression coefficient β1 is zero.
C) a statistic that modifies the value of R^2 by incorporating the sample size and the number of
explanatory variables in the model.
D) the variability of the observed Y-values from the predicted values.
23) Which of the following Excel functions is applied to test for significance of regression?
A) COVAR
B) ANOVA
C) SINH
D) TREND
24) While testing hypotheses for regression coefficients, the t-test for the slope is expressed as:
A) t =
B) t =
C) t =
D) t =
25) ________ provide information about the unknown values of the true regression coefficients,
accounting for sampling error.
A) Standard errors
B) Confidence intervals
C) Adjusted R Squares
D) P-values
26) Standard residuals:
A) help detect outliers that may bias the results of a regression analysis.
B) cause differences in the regression equation by changing the slope and intercept.
C) point out the ranges for the population intercept and slope at a 95% confidence level.
D) provide information for testing hypotheses associated with the intercept and slope.
27) A(n) ________ is an extreme value that is different from the rest of the data.
A) critical value
B) standard error
C) expected value
D) outlier
28) While checking for linearity by examining the residual plot, the residuals must:
A) exhibit a linear trend.
B) form a parabolic shape.
C) be randomly scattered.
D) be below the x-axis.
29) Which of the following is true when testing for normality of errors?
A) Normality is verified by inspecting for a bell-shaped distribution.
B) It is easier to evaluate normality with small sample sizes.
C) A scatter diagram of the whole data is always used to verify normality.
D) Errors are normally distributed when the scatter diagram shows a straight-line distribution.
30) ________ means that the variation about the regression line is constant for all values of the
independent variable.
A) Autocorrelation
B) Normality of errors
C) Homoscedasticity
D) Linearity
31) Which of the following helps in evaluation of autocorrelation?
A) Breusch-Pagan test
B) Durbin-Watson statistic
C) Hosmer-Lemeshow test
D) Cochran-Mantel-Haenszel statistics
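A statistic commonly used to check residuals for autocorrelation is the Durbin-Watson statistic, the sum of squared successive differences of the residuals divided by the sum of squared residuals. The sketch below assumes the residuals are already in time order; both residual lists are hypothetical illustration values.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic for residuals given in time order."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator

# Hypothetical residuals: a slow drift (successive errors are similar)
# versus a strict alternation (successive errors flip sign).
drifting = [2.0, 1.5, 1.0, -1.0, -1.5, -2.0]
alternating = [1.0, -1.0, 1.0, -1.0]

print(round(durbin_watson(drifting), 3))
print(round(durbin_watson(alternating), 3))
```

The statistic lies between 0 and 4. Values near 2 suggest little autocorrelation, values toward 0 suggest positive autocorrelation, and values toward 4 suggest negative autocorrelation.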
32) In multiple regression, R Square is referred to as the:
A) multiple correlation coefficient.
B) coefficient of autocorrelation.
C) coefficient of multiple determination.
D) multiple significance coefficient.
33) Which of the following is true about multiple linear regression?
A) It is a linear regression model with more than one dependent variable.
B) The regression coefficients are called fractional regression coefficients.
C) It uses least squares to estimate the intercept and slope coefficients.
D) The ANOVA tests for the significance of each variable separately.
34) When using the t-statistic in multiple regression to determine if a variable should be
removed:
A) R^2 will increase if the variable is removed.
B) if |t| > 1, the standard error will decrease.
C) a large number of independent variables is convenient.
D) if |t| < 1, the standard error will increase.
35) When two or more independent variables in the same regression model can predict each
other better than the dependent variable, the condition is referred to as ________.
A) autocorrelation
B) heteroscedasticity
C) multicollinearity
D) homoscedasticity
36) Which of the following is true about multicollinearity?
A) The effect of a dependent variable on another becomes difficult to isolate.
B) Regression coefficients become clearer and are easier to interpret.
C) P-values reduce significantly leading to rejection of null hypothesis.
D) It is best measured using the variance inflation factor (VIF) statistic.
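For a model with exactly two independent variables, the variance inflation factor reduces to VIF = 1 / (1 - r^2), where r is the correlation between the two predictors. The following minimal sketch assumes that two-predictor case; the predictor values x1, x2, and x3 are hypothetical.

```python
import math

def correlation(a, b):
    """Sample correlation coefficient between two equal-length lists."""
    n = len(a)
    a_bar = sum(a) / n
    b_bar = sum(b) / n
    sab = sum((ai - a_bar) * (bi - b_bar) for ai, bi in zip(a, b))
    saa = sum((ai - a_bar) ** 2 for ai in a)
    sbb = sum((bi - b_bar) ** 2 for bi in b)
    return sab / math.sqrt(saa * sbb)

def vif_two_predictors(a, b):
    """VIF for either predictor when the model has only these two."""
    r = correlation(a, b)
    return 1.0 / (1.0 - r ** 2)

# Hypothetical predictors: x2 is nearly 2 * x1, x3 is unrelated to x1.
x1 = [1, 2, 3, 4, 5]
x2 = [2.0, 4.1, 5.9, 8.2, 9.9]
x3 = [2, 1, 3, 1, 2]

vif_collinear = vif_two_predictors(x1, x2)
vif_unrelated = vif_two_predictors(x1, x3)
print(round(vif_collinear, 1), round(vif_unrelated, 3))
```

A VIF close to 1 indicates that the predictors carry separate information, while a very large VIF indicates that one predictor is almost a linear function of the other.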
37) Categorical variables that have been coded are called ________.
A) limited dependent variables
B) dummy variables
C) instrumental variables
D) observable variables
38) Interaction is:
A) the principle of having a model with maximum explanatory variables.
B) the process of coding categorical variables.
C) a measure to determine the correlation between dependent variables.
D) the dependence between two independent variables.
39) How many additional dummy variables are required if a categorical variable has 4 levels?
A) 2
B) 3
C) 1
D) 4
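Coding a categorical variable can be sketched as follows. The function dummy_code and the region labels are hypothetical; the sketch assumes the common convention of treating one level as the baseline, so a variable with k levels is represented by k - 1 columns of 0s and 1s.

```python
def dummy_code(values, baseline=None):
    """Code a categorical variable as 0/1 columns, omitting one baseline level."""
    levels = sorted(set(values))
    if baseline is None:
        baseline = levels[0]
    kept = [level for level in levels if level != baseline]
    rows = [[1 if v == level else 0 for level in kept] for v in values]
    return kept, rows

# Hypothetical categorical variable with four levels.
region = ["North", "South", "East", "West", "East", "North"]
columns, coded = dummy_code(region, baseline="East")

print(columns)
for label, row in zip(region, coded):
    print(label, row)
```

Observations at the baseline level are coded as all zeros, so their effect is absorbed into the intercept and no separate column is needed for that level.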
40) When a scatter chart of data shows a nonlinear relationship, the nonlinear model can be
expressed as:
A) Y = β0 + β1X + β2X^2 + ε
B) Y = β0 + β1X + (β2X)^2 + ε
C) Y = β0 + β1X + β2X
D) Y = β0 + β1X^2 + β2X^2 + ε
41) In a curvilinear regression model, the ________ represents the curvilinear effect.
A) intercept
B) error term
C) slope
D) R Square
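A second-order curvilinear model of the form Y = β0 + β1X + β2X^2 + ε can be estimated by least squares in the same way as a linear one, by treating X^2 as a second explanatory variable. The sketch below solves the normal equations directly; the function fit_quadratic is an illustrative assumption, and the data are hypothetical, generated without noise from Y = 5 + 2X - 0.3X^2 so that the recovered coefficients can be checked.

```python
def fit_quadratic(x, y):
    """Least-squares estimates [b0, b1, b2] for Y = b0 + b1*X + b2*X^2."""
    s = [sum(xi ** k for xi in x) for k in range(5)]          # sums of x^k
    t = [sum(yi * xi ** k for xi, yi in zip(x, y)) for k in range(3)]
    # Augmented matrix of the normal equations.
    a = [
        [s[0], s[1], s[2], t[0]],
        [s[1], s[2], s[3], t[1]],
        [s[2], s[3], s[4], t[2]],
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(3):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [ar - factor * ac for ar, ac in zip(a[r], a[col])]
    return [a[i][3] / a[i][i] for i in range(3)]

# Hypothetical noise-free data from Y = 5 + 2X - 0.3X^2.
x = [1, 2, 3, 4, 5, 6]
y = [5 + 2 * xi - 0.3 * xi ** 2 for xi in x]

b0, b1, b2 = fit_quadratic(x, y)
print(round(b0, 4), round(b1, 4), round(b2, 4))
```

The coefficient on the squared term, b2 here, is what bends the fitted line: a negative value produces a hill and a positive value produces a valley.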
II. CALCULATION QUESTIONS
Use the data given below to answer the following question(s).
Following is an extract from a firm's database detailing the number of hours spent on the
job by employees and their corresponding pay. (Note: Assume a level of significance of
0.05 wherever necessary.)
Hours spent on the job    Salary ($)
4 340
12 850
7 570
5 470
11 820
8 610
9 630
13 900
10 800
6 480
42) Is the number of hours spent on the job a statistically significant variable in explaining the variation in
pay of employees? (Hint: Use Regression tool).
43) Draw conclusions for test of hypothesis for regression coefficients.
44) Interpret the confidence intervals.
45) Interpret residual output.
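The quantities behind questions 42 to 45 can be reproduced outside the Excel Regression tool. The following is a minimal sketch in plain Python, assuming the standard simple-regression formulas for the slope's standard error, t-statistic, and confidence interval. The critical value 2.306 is taken from a t-table (two-tailed, 0.05 level, n - 2 = 8 degrees of freedom) and is hard-coded as an assumption rather than computed.

```python
import math

# Hours on the job and salary ($) from the table above.
hours = [4, 12, 7, 5, 11, 8, 9, 13, 10, 6]
salary = [340, 850, 570, 470, 820, 610, 630, 900, 800, 480]

n = len(hours)
x_bar = sum(hours) / n
y_bar = sum(salary) / n
sxx = sum((x - x_bar) ** 2 for x in hours)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(hours, salary))

slope = sxy / sxx
intercept = y_bar - slope * x_bar

# Residuals and the standard error of the estimate.
residuals = [y - (intercept + slope * x) for x, y in zip(hours, salary)]
sse = sum(e ** 2 for e in residuals)
std_error = math.sqrt(sse / (n - 2))

# t-test for H0: beta1 = 0 and a 95% confidence interval for the slope.
se_slope = std_error / math.sqrt(sxx)
t_stat = slope / se_slope
T_CRIT = 2.306  # assumed t-table value: two-tailed, alpha = 0.05, df = 8
ci_low = slope - T_CRIT * se_slope
ci_high = slope + T_CRIT * se_slope
significant = abs(t_stat) > T_CRIT

print(f"slope = {slope:.4f}, intercept = {intercept:.4f}")
print(f"t = {t_stat:.2f}, significant at 0.05: {significant}")
print(f"95% CI for slope: ({ci_low:.2f}, {ci_high:.2f})")
print("residuals:", [round(e, 1) for e in residuals])
```

If the confidence interval for the slope excludes zero, the t-test at the same significance level rejects the hypothesis that the slope is zero, so the two outputs always agree.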
III. TRUE/FALSE QUESTIONS
46) In predictive analysis models, a second-order polynomial has only one hill or valley.
47) The best-fitting line maximizes the residuals.
48) Creating a scatter chart with an added trendline is visually superior to the scatter chart
generated by line fit plots.
49) The standard error may be assumed to be large if the data are clustered close to the
regression line.
50) An increase in adjusted R^2 indicates that the regression model has improved.
51) A good regression model has the fewest number of explanatory variables providing an
adequate interpretation of the dependent variable.
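Adjusted R^2, referred to in statement 50, is usually computed as 1 - (1 - R^2)(n - 1)/(n - k - 1), where n is the sample size and k is the number of explanatory variables. The sketch below assumes that formula; the R^2 values passed in are hypothetical.

```python
def adjusted_r2(r2, n, k):
    """Adjusted R^2 for a model with n observations and k explanatory variables."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# Hypothetical comparison: adding a second variable raises R^2 slightly,
# but the penalty for the extra variable lowers adjusted R^2.
one_variable = adjusted_r2(0.900, n=10, k=1)
two_variables = adjusted_r2(0.905, n=10, k=2)

print(round(one_variable, 4), round(two_variables, 4))
```

Unlike R^2, which never decreases when a variable is added, adjusted R^2 falls when the added variable explains too little to justify the lost degree of freedom.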
IV. THEORY QUESTIONS
52) Why is regression analysis necessary in business? What categories of regression models are
used?
53) When are logarithmic functions used in predictive analysis?
54) While conducting regression analysis, how is constructing a normal probability plot useful?
55) Briefly explain the assumptions on which the statistical hypothesis tests associated with
regression analysis are predicated.
56) Outline the systematic approach to building good multiple regression models.
57) Explain the concept of curvilinear regression model.