Professional Documents
Culture Documents
Linear Regression
Chapter 13
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Learning Objectives
LO13-1 Explain the purpose of correlation analysis.
LO13-2 Calculate a correlation coefficient to test and interpret the
relationship between two variables.
LO13-3 Apply regression analysis to estimate the linear
relationship between two variables.
LO13-4 Evaluate the significance of the slope of the regression
equation.
LO13-5 Evaluate a regression equation’s ability to predict using
the standard estimate of the error and the coefficient of
determination.
LO13-6 Calculate and interpret confidence and prediction
intervals.
LO13-7 Use a log function to transform a nonlinear relationship.
13-2
LO13-1 Explain the purpose of
correlation analysis.
13-3
LO13-1
13-4
LO13-1
13-5
LO13-2 Calculate a correlation coefficient to test and
interpret the relationship between two variables.
13-6
LO13-2
13-7
LO13-2
13-8
LO13-2
Reject H0 if:
t > t/2,n-2 or t < -t/2,n-2
13-9
LO13-2
Numerator
13-10
LO13-2
Regression Analysis
Correlation Analysis tests for the strength and direction of the
relationship between two quantitative variables.
Y=a+bX
13-12
LO13-3
Regression Analysis
EXAMPLES
Assuming a linear relationship between the size of a home,
measured in square feet, and the cost to heat the home in
January, how does the cost vary relative to the size of the
home?
13-13
LO13-3
How does the size of a home, measured in number of square feet, relate to the cost to heat the
home in January? We would use the home size as, X, the independent variable to predict the
heating cost, and Y as the dependent variable.
Regression equation: Heating cost = a + b (home size)
How does the weight of a car relate to the car’s fuel efficiency? We would use car weight as, X,
the independent variable to predict the car’s fuel efficiency, and Y as the dependent variable.
13-14
LO13-3
Correlation coefficient:
13-17
LO13-3
Step 4: What is the predicted number of sales if someone makes 20 sales calls?
Number of Copiers Sold = 25.1792 = 19.9632 + 0.2608(20)
13-18
LO13-3
b
Number of Copiers Sold = 19.9800 + 0.2606 ( Number of Sales Calls)
* Note that the Excel differences in the values of a and b are due to rounding.
13-19
LO13-4 Evaluate the significance of the
slope of the regression equation.
13-20
LO13-4
Reject H0 if:
t > t/2,n-2 or t < -t/2,n-2
13-21
LO13-4
As in correlation analysis, please note that this statistical analysis does not
provide any evidence of a causal relationship. Another type of study is
needed to test that hypothesis.
13-23
LO13-5 Evaluate a regression equation’s ability to predict using the
standard estimate of the error and the coefficient of determination.
^
S(y - y) 2
sy. x =
n-2
13-24
LO13-5
^
S(y - y)2 587.11
sy. x = = = 45.16231 = 6.720
n-2 13
13-25
LO13-5
It ranges from 0 to 1.
It does not provide any information on the
direction of the relationship between the variables.
13-26
LO13-5
13-27
LO13-5
Re gressionSumofSquares 1738.89
R2 = = = 0.748
TotalSumofSquares 2326
13-28
LO13-6 Calculate and interpret
confidence and prediction intervals.
A confidence interval reports the interval estimate for the mean value
of Y for a given X.
A prediction interval reports the interval estimate for an individual
value of Y for a particular value of X.
13-29
LO13-6
13-30
LO13-6
Thus, the 95% confidence interval for all sales representatives who make
50 calls is from 27.3942 up to 38.6122. To interpret, let’s round the values.
For all sales representative who make 50 calls, the predicted mean number
of copiers sold is 33. The mean sales will range from 27 to 39 copiers.
13-31
LO13-6
13-32
LO13-6
Thus, the prediction interval of copiers sold by an individual sales person, such as Sheila
Baker, who makes 50 sales calls is from 17.442 up to 48.5644 copiers. Rounding these results,
the predicted number of copiers sold will be between 17 and 49. This interval is quite large. It is
much larger than the confidence interval for all sales representatives who made 50 calls. It is
logical, however, that there should be more variation in the sales estimate for an individual than
for the mean of a group.
13-33
LO13-6
Note that the width of the interval or the margin of error when predicting the
dependent variable is related to the standard error of the estimate. Also,
note that the prediction interval is wider because 1 is added to the sum
under the square root sign.
13-34
LO13-6
Confidence
Prediction Intervals
Intervals
13-35
LO13-7 Use a log function to
transform a nonlinear relationship.
13-36
LO13-7
13-37