Title: Scatter Diagram
1Scatter Diagram
- a plot of paired data to determine or show a
relationship between two variables
2Scatter Diagram
3Linear Correlation
- The general trend of the points seems to follow a
straight line segment.
4Linear Correlation
5Non-Linear Correlation
6No Linear Correlation
7High Linear Correlation
8Low Linear Correlation
9Correlation Coefficient, r
- The correlation coefficient is a number that
indicates the strength of a linear relationship
between two variables, x and y. - -1ltrlt1
- If r1, there is a perfect positive linear
correlation. - If r0, there is no linear correlation.
- If r-1, there is a perfect negative linear
correlation. - The closer r is to 1 or -1, the better a line
describes the relationship between the two
variables x and y.
10Positive Correlation
y
x
11Negative Correlation
y
x
12Computation formula for r
13Example
Let x be a random variable representing wind
velocity and let y be a random variable
representing drift rate of sand.
14Example
15Computation
16Coefficient of Determination
- a measure of the proportion of the variation in y
that is explained by the regression line using x
as the predicting variable
17Formula for Coefficient of Determination
18Interpretation of r2
- If r 0.949, then what percent of the variation
in minutes (y) is explained by the linear
relationship with x, miles traveled? - What percent is explained by other causes?
19Interpretation of r2
- If r 0.949, then r2 .900601
- Approximately 90 percent of the variation in
drift rate (y) is explained by the linear
relationship with x, wind velocity. - Less than ten percent is explained by other
causes.
20Warning
- The correlation coefficient ( r) measures the
strength of the relationship between two
variables. - Just because two variables are related does not
imply that there is a cause-and-effect
relationship between them.
21Questions Arising
- Can we find a relationship
between x and y? - How strong is the relationship?
22When there appears to be a linear relationship
between x and y
- attempt to fit a line to the scatter diagram.
23When using x values to predict y values
- Call x the explanatory variable
- Call y the response variable
- A lurking variable is a variable that is neither
an explanatory or response variable. Yet, a
lurking variable may be responsible for changes
in both x and y.
24The Least Squares Line
- The sum of the squares of the vertical distances
from the points to the line is made as small as
possible.
25Least Squares Criterion
The sum of the squares of the vertical distances
from the points to the line is made as small as
possible.
26Equation of the Least Squares Line
a the y-intercept
b the slope
27Finding the slope
28Finding the y-intercept
29Find the Least Squares Line
30Finding the slope
31Finding the y-intercept
32The equation of the least squares line is
33The following point will always be on the least
squares line
34Graphing the least squares line
- Using two values in the range of x, compute two
corresponding y values. - Plot these points.
- Join the points with a straight line.
35Graphing y 30.9 1.7x
- Use (8.3, 16.9)
- (average of the xs, the average of the ys)
- Try x 5.
- Compute y y 2.8 1.7(5) 11.3
36Sketching the Line Using the Points (8.3, 16.9)
and (5, 11.3)
37Using the Equation of the Least Squares Line to
Make Predictions
- Choose a value for x (within the range of x
values). - Substitute the selected x in the least squares
equation. - Determine corresponding value of y.
38Predict the time to make a trip of 14 miles
- Equation of least squares line
- y 2.8 1.7x
- Substitute x 14
- y 2.8 1.7 (14)
- y 26.6
- According to the least squares equation, a trip
of 14 miles would take 26.6 minutes.
39Interpolation
- Using the least squares line to predict y values
for x values that fall between the points in the
scatter diagram
40Extrapolation
- Prediction beyond the range of observations