Scatter Diagram - PowerPoint PPT Presentation

1 / 40
About This Presentation
Title:

Scatter Diagram

Description:

A lurking variable is a variable that is neither an explanatory or response variable. ... Yet, a lurking variable may be responsible for changes in both x and y. ... – PowerPoint PPT presentation

Number of Views:730
Avg rating:3.0/5.0
Slides: 41
Provided by: lus72
Category:

less

Transcript and Presenter's Notes

Title: Scatter Diagram


1
Scatter Diagram
  • a plot of paired data to determine or show a
    relationship between two variables

2
Scatter Diagram
3
Linear Correlation
  • The general trend of the points seems to follow a
    straight line segment.

4
Linear Correlation
5
Non-Linear Correlation
6
No Linear Correlation
7
High Linear Correlation
8
Low Linear Correlation
9
Correlation Coefficient, r
  • The correlation coefficient is a number that
    indicates the strength of a linear relationship
    between two variables, x and y.
  • -1ltrlt1
  • If r1, there is a perfect positive linear
    correlation.
  • If r0, there is no linear correlation.
  • If r-1, there is a perfect negative linear
    correlation.
  • The closer r is to 1 or -1, the better a line
    describes the relationship between the two
    variables x and y.

10
Positive Correlation
y
x
11
Negative Correlation
y
x
12
Computation formula for r
  • Let nsample size. Then

13
Example
Let x be a random variable representing wind
velocity and let y be a random variable
representing drift rate of sand.
14
Example
15
Computation
16
Coefficient of Determination
  • a measure of the proportion of the variation in y
    that is explained by the regression line using x
    as the predicting variable

17
Formula for Coefficient of Determination
18
Interpretation of r2
  • If r 0.949, then what percent of the variation
    in minutes (y) is explained by the linear
    relationship with x, miles traveled?
  • What percent is explained by other causes?

19
Interpretation of r2
  • If r 0.949, then r2 .900601
  • Approximately 90 percent of the variation in
    drift rate (y) is explained by the linear
    relationship with x, wind velocity.
  • Less than ten percent is explained by other
    causes.

20
Warning
  • The correlation coefficient ( r) measures the
    strength of the relationship between two
    variables.
  • Just because two variables are related does not
    imply that there is a cause-and-effect
    relationship between them.

21
Questions Arising
  • Can we find a relationship
    between x and y?
  • How strong is the relationship?

22
When there appears to be a linear relationship
between x and y
  • attempt to fit a line to the scatter diagram.

23
When using x values to predict y values
  • Call x the explanatory variable
  • Call y the response variable
  • A lurking variable is a variable that is neither
    an explanatory or response variable. Yet, a
    lurking variable may be responsible for changes
    in both x and y.

24
The Least Squares Line
  • The sum of the squares of the vertical distances
    from the points to the line is made as small as
    possible.

25
Least Squares Criterion
The sum of the squares of the vertical distances
from the points to the line is made as small as
possible.
26
Equation of the Least Squares Line
  • y a bx

a the y-intercept
b the slope
27
Finding the slope
28
Finding the y-intercept
29
Find the Least Squares Line
30
Finding the slope
31
Finding the y-intercept
32
The equation of the least squares line is
  • y a bx
  • y 2.8 1.7x

33
The following point will always be on the least
squares line
34
Graphing the least squares line
  • Using two values in the range of x, compute two
    corresponding y values.
  • Plot these points.
  • Join the points with a straight line.

35
Graphing y 30.9 1.7x
  • Use (8.3, 16.9)
  • (average of the xs, the average of the ys)
  • Try x 5.
  • Compute y y 2.8 1.7(5) 11.3

36
Sketching the Line Using the Points (8.3, 16.9)
and (5, 11.3)
37
Using the Equation of the Least Squares Line to
Make Predictions
  • Choose a value for x (within the range of x
    values).
  • Substitute the selected x in the least squares
    equation.
  • Determine corresponding value of y.

38
Predict the time to make a trip of 14 miles
  • Equation of least squares line
  • y 2.8 1.7x
  • Substitute x 14
  • y 2.8 1.7 (14)
  • y 26.6
  • According to the least squares equation, a trip
    of 14 miles would take 26.6 minutes.

39
Interpolation
  • Using the least squares line to predict y values
    for x values that fall between the points in the
    scatter diagram

40
Extrapolation
  • Prediction beyond the range of observations
Write a Comment
User Comments (0)
About PowerShow.com