Title: Chapter 5 Multiple Linear Regression
1Chapter 5Multiple Linear Regression
2Agenda
- Introduction
- Least Squares Estimation of the Parameters
- Matrix Approach
- Estimating ?2
- Properties of the least square estimators
- Hypothesis tests in multiple linear regression
- Confidence intervals
3Introduction
- Many applications of regression analysis involve
situations in which there are more than one
regressor variable. - A regression model that contains more than one
regressor variable is called a multiple
regression model.
4Introduction
- For example, suppose that the effective life of
a cutting tool depends on the cutting speed and
the tool angle. A possible multiple regression
model could be
where Y tool life x1 cutting speed x2 tool
angle
5Introduction
(a) The regression plane for the model E(Y)
50 10x1 7x2. (b) The contour plot
6Introduction
7Introduction
- Three-dimensional plot of the regression model
E(Y) 50 10x1 7x2 5x1x2. - The contour plot
8Introduction
- Three-dimensional plot of the regression model
E(Y) 800 10x1 7x2 8.5x12 5x22 4x1x2. - The contour plot
9Least Squares Estimation of the Parameters
10Least Squares Estimation of the Parameters
- The least squares function is given by
- The least squares estimates must satisfy
11Least Squares Estimation of the Parameters
- The least squares normal Equations are
- The solution to the normal Equations are the
least squares estimators of the regression
coefficients.
12Example 1
13Example 1
14Matrix of scatter plots for the wire bond pull
strength data
15Example 1
16Example 1
17Example 1
18Matrix Approach to Multiple Linear Regression
Suppose the model relating the regressors to the
response is
In matrix notation this model can be written as
19Matrix Approach to Multiple Linear Regression
where
20Matrix Approach to Multiple Linear Regression
We wish to find the vector of least squares
estimators that minimizes
The resulting least squares estimate is
21Matrix Approach to Multiple Linear Regression
22Example 2
23Example 2
24Example 2
25Example 2
26Example 2
27Example 2
28Minitab Practice
- Data file Example 5_2.xls
- Menu ? Stat ? Regression ? Regression
- response y
- predictors x1 x2
- ? Options Select variance inflation factors
- PRESS and predicted R-sq.
- Prediction intervals 8 275
- Select all options
- ? Results select In addition .
29(No Transcript)
30Estimating ?2
An unbiased estimator of ?2 is
31Properties of the Least Squares Estimators
Unbiased estimators
Covariance Matrix
32Properties of the Least Squares Estimators
Individual variances and covariances
In general,
33Hypothesis Tests in Multiple Linear Regression
1. Test for Significance of Regression
The appropriate hypotheses are
The test statistic is
34Hypothesis Tests in Multiple Linear Regression
1. Test for Significance of Regression
35Example 3
36Example 3
37Example 3
38Example 3
39Hypothesis Tests in Multiple Linear Regression
R2 and Adjusted R2
The coefficient of multiple determination
- For the wire bond pull strength data, we find
that R2 SSR/SST 5990.7712/6105.9447 0.9811. - Thus, the model accounts for about 98 of the
variability in the pull strength response.
40Hypothesis Tests in Multiple Linear Regression
R2 and Adjusted R2
The adjusted R2 is
- The adjusted R2 statistic penalizes the analyst
for adding terms to the model. - It can help guard against overfitting
(including regressors that are not really useful)
41Hypothesis Tests in Multiple Linear Regression
2. Tests on Individual Regression Coefficients
and Subsets of Coefficients
The hypotheses for testing the significance of
any individual regression coefficient
42Hypothesis Tests in Multiple Linear Regression
2. Tests on Individual Regression Coefficients
and Subsets of Coefficients
The test statistic is
- Reject H0 if t0 gt t?/2,n-p.
- This is called a partial or marginal test
43Example 4
44Example 4
45Hypothesis Tests in Multiple Linear Regression
The general regression significance test or the
extra sum of squares method
We wish to test the hypotheses
46Hypothesis Tests in Multiple Linear Regression
A general form of the model can be written
where X1 represents the columns of X associated
with ?1 and X2 represents the columns of X
associated with ?2
47Hypothesis Tests in Multiple Linear Regression
For the full model
If H0 is true, the reduced model is
48Hypothesis Tests in Multiple Linear Regression
The test statistic is
Reject H0 if f0 gt f?,r,n-p The test in Equation
(12-32) is often referred to as a partial F-test
49Example 5
50Example 5
51Example 5
52Confidence Intervals in Multiple Linear Regression
1. Confidence Intervals on Individual Regression
Coefficients
Definition
53Example 6
54Confidence Intervals in Multiple Linear Regression
2. Confidence Interval on the Mean Response
The mean response at a point x0 is estimated by
The variance of the estimated mean response is
55Confidence Intervals in Multiple Linear Regression
2. Confidence Interval on the Mean Response
Definition
56Example 7
57Example 7