Title: Correlation
1Correlation
2CORRELATION
Pearson's r
3Questions Asked by Regression Correlation
Analyses
- REGRESSION
- What is the estimated effect of X on Y?
- Is the slope steep or shallow?
- CORRELATION
- How good a fit are the data points to the
regression line? - How strong is the relationship between X and Y?
4Correlation Coefficient
- Produces a summary measure of how close points
fall to the line. - If points close to line there is little deviation
( ) little residual variance - Conversely, the greater the deviation the lower
the correlation. - Measures how well do the IV and DV co-vary
5Positive Correlation
6Negative Correlation
7Interpreting the Correlation Coefficient
8Co-Variance How well do X and Y go together,
co-vary?What other variables co-vary?
9Heritability of Height
10What other variables Co-Vary?
- Education and income?
- Political interest and voter turnout?
- Robberies and poverty?
11 How about the co-variation between the number
of churches in a city and the number of crimes?
12Why?
Correlation is not Causation!
13The Definitional Formula for Pearsons r
14Understanding the Correlation Coefficient
(r)Numerator degree to which X and Y vary
together.Denominator degree to which X and Y
vary separately (total amount of variance in X
and Y).
15r Ratio of Variances
- actual amount of variance
- r
- maximum possible amount of variance
- Key Question How well does the Independent
Variable predict the Dependent Variable?
16If you did not know the relationship between the
X and Y variables, what would be the best
predictor for Y ?
17Two Forms of Residual Variance
- If we did not know beta e.g., we did not know
the relationship between poverty and robberies,
the mean of Y would be best predictor - If we know the relationship know beta then
best predictor is beta, the residual variance is
18Regression Line
Mean of Y
19Example
- Dependent Variable Robberies per 100,000
- Independent VariablePercentage of families
below poverty line
20(No Transcript)
21(No Transcript)
22Interpreting r the correlation coefficient of
.359
- .359 is positive
- it is moderately strong
- The key Square the correlation coefficientr
.359 ? r2 .129
23Interpreting r2
- r2 explains 12.9 of the variance of X on Y.
- Beta (knowing poverty level) improves prediction
of robberies by 12.9 over the prediction using
mean of Y.
24Next Example
- Dependent Variable Murders per 100,000
- Independent Variable Percentage of families
below poverty line
25(No Transcript)
26(No Transcript)
27(No Transcript)
28Last Example
- Dependent Variable Number of prior convictions
- Independent Variable Sentence length in years
29 30Interpretation 72 of variance explained,
while 28 remain unexplained.