Multiple and complex regression - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

Multiple and complex regression

Description:

Multiple and complex regression Extensions of simple linear regression Multiple regression models: predictor variables are continuous Analysis of variance: predictor ... – PowerPoint PPT presentation

Number of Views:79
Avg rating:3.0/5.0
Slides: 23
Provided by: Preferr544
Category:

less

Transcript and Presenter's Notes

Title: Multiple and complex regression


1
Multiple and complex regression
2
Extensions of simple linear regression
  • Multiple regression models predictor variables
    are continuous
  • Analysis of variance predictor variables are
    categorical (grouping variables),
  • But general linear models can include both
    continuous and categorical predictors

3
(No Transcript)
4
Relative abundance of C3 and C4 plants
  • Paruelo Lauenroth (1996)
  • Geographic distribution and the effects of
    climate variables on the relative abundance of a
    number of plant functional types (PFTs) shrubs,
    forbs, succulents, C3 grasses and C4 grasses.

5
(No Transcript)
6
data
73 sites across temperate central North America
Response variable
Predictor variables
  • Relative abundance of PTFs (based on cover,
    biomass, and primary production) for each site
  • Longitude
  • Latitude
  • Mean annual temperature
  • Mean annual precipitation
  • Winter () precipitation
  • Summer () precipitation
  • Biomes (grassland , shrubland)

7
Relative abundance transformed ln(dat1) because
positively skewed
8
Collinearity
  • Causes computational problems because it makes
    the determinant of the matrix of X-variables
    close to zero and matrix inversion basically
    involves dividing by the determinant (very
    sensitive to small differences in the numbers)
  • Standard errors of the estimated regression
    slopes are inflated

9
Detecting collinearlity
  • Check tolerance values
  • Plot the variables
  • Examine a matrix of correlation coefficients
    between predictor variables

10
Dealing with collinearity
  • Omit predictor variables if they are highly
    correlated with other predictor variables that
    remain in the model

11
Correlations
12
(No Transcript)
13
(lnC3) ßo ß1(lat) ß2(long) ß3(latxlong)
After centering both lat and long
14
Analysis of variance
Source of variation SS df MS
Regression S(yhat-Y)2 p S(yhat-Y)2 p
Residual S(yobs-yhat)2 n-p-1 S(yobs-yhat)2 n-p-1
Total S(yobs-Y)2 n-1
15
Matrix algebra approach to OLS estimation of
multiple regression models
  • YßXe
  • XXbXY
  • b(XX) -1 (XY)

16
Criteria for best fitting in multiple
regression with p predictors.
Criterion Formula
r2
Adjusted r2
Akaike Information Criteria AIC
Akaike Information Criteria AIC
17
Hierarchical partitioning and model selection
No pred Model r2 Adjr2 P AIC (R)
1 Lon 0.0006 -0.013 0.84 30.15
1 Lat 0.47 0.46 gt0.001 -16.16
2 Lon Lat 0.48 0.46 gt0.001 -15.25
3 Long Lat Lon x Lat 0.54 0.52 gt0.001 -22.55
18
R20.48
C3
Longitude
Latitude
Model Lat Long
19
(No Transcript)
20
45 Lat
35 Lat
Model Lat Long
21
The final forward model selection is
Step AIC-228.67 SQRT_C3 LAT MAP JJAMAP
DJFMAP Df Sum of Sq RSS AIC ltnonegt
2.7759 -228.67 LONG 1 0.0209705
2.7549 -227.23 MAT 1 0.0001829 2.7757
-226.68 Call lm(formula SQRT_C3 LAT MAP
JJAMAP DJFMAP) Coefficients (Intercept)
LAT MAP JJAMAP DJFMAP
-0.7892663 0.0391180 0.0001538 -0.8573419
-0.7503936
22
The final backward selection model is
Step AIC-229.32 SQRT_C3 LAT JJAMAP
DJFMAP Df Sum of Sq RSS
AIC ltnonegt 2.8279 -229.32 - DJFMAP
1 0.26190 3.0898 -224.85 - JJAMAP 1 0.31489
3.1428 -223.61 - LAT 1 2.82772 5.6556
-180.72 Call lm(formula SQRT_C3 LAT
JJAMAP DJFMAP) Coefficients (Intercept)
LAT JJAMAP DJFMAP -0.53148
0.03748 -1.02823 -1.05164
Write a Comment
User Comments (0)
About PowerShow.com