Title: STA 3024 Introduction to Statistical 2
1STA 3024Introduction to Statistical 2
- Chapter 12
- Simple Linear Regression
2Examples in Regression from Chapter 3
- GDP (K) and internet use ( population) by
country (p.105) - GDP (K) and CO2 emission (metric tons) by
country (p.112) - Car weight (lb) and gas mileage (MPG) (p.106)
- Sit up numbers and 40-yard dash speed (sec) of
female athletes (p. 125)
3The Model
4Correlation (r) and slope (b)
5The Graph
6A Simple Example
7The Fit
8The Fitted Straight Line
9SPSS Output (For input see Basic SPSS in the
class website)
ANOVA
10Coefficients
- Residual statistics
- Including mean and standard deviation information
on prediction and residuals - On way to do residual analysis is to output the
residuals with save option and plot them.
11Inference on the slope ?
12Good Model?
ANOVA Table 12.8 will be discussed in the next
chapter.
13Possible Patterns in Residuals (p.665)
14Prediction (Middle box p.622)
15Regression toward the Mean (p.600)
- Example. During the screening, we picked a group
of people with high blood pressure. Then their
blood pressures tends to be lower when they are
measured next week, even without treatment. - Reason If consider the measure during the
screening as x and the measure next week as y for
the same person. Then the pair (x, y) will be
highly corrected when plotted.
16Standardized Residual (p.616)
- It can be seen the discrepancy between the real
line and the estimate line varies at different
point of x. Thus the residual accuracy also
changes with x.
17Caution in Correlation (slope) Interpretation
- Outlier effect
- Stratification effect (Simpsons paradox,
Exercise 3.58, Murder trials in Florida,
1976-1987, p.144)
18Stratified data
19Simpsons paradox in regression (Ecological
fallacy, p. 606)
2012.5 Linearize a Nonlinear Model