Scatterplots, Association, and Correlation - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

Scatterplots, Association, and Correlation

Description:

Strength. Does the plot follow the form very closely or is there a lot of ... strength is ... Measures the strength of a LINEAR association between two ... – PowerPoint PPT presentation

Number of Views:70
Avg rating:3.0/5.0
Slides: 31
Provided by: publicI
Category:

less

Transcript and Presenter's Notes

Title: Scatterplots, Association, and Correlation


1
Chapter 7
  • Scatterplots, Association, and Correlation

2
Examining Relationships
  • Relationship between two variables
  • Examples
  • Height and Weight
  • Alcohol and Body Temperature
  • SAT Verbal Score and SAT Math Score
  • High School GPA and College GPA

3
Two Types of Variables
  • Response Variable (Dependent)
  • Measures an outcome of the study
  • Explanatory Variable (Independent)
  • Used to explain the response variable.
  • Example Alcohol and Body Temp
  • Explanatory Variable Alcohol
  • Response Variable Body Temperature

4
Two Types of Variables
  • Does not mean that explanatory variable causes
    response variable
  • It helps explain the response
  • Sometimes there are no true response or
    explanatory variables
  • Ex. Height and Weight
  • SAT Verbal and SAT Math Scores

5
Graphing Two Variables
  • Plot of explanatory variable vs. response
    variable
  • Explanatory variable goes on horizontal axis (x)
  • Response variable goes on vertical axis (y)
  • If response and explanatory variables do not
    exist, you can plot the variables on either axis.
  • This plot is called a scatterplot
  • This plot can only be used if explanatory and
    response variables are both quantitative.

6
Scatterplots
  • Scatterplots show patterns, trends, and
    relationships.
  • When interpreting a scatterplot (i.e., describing
    the relationship between two variables) always
    look at the following
  • Overall Pattern
  • Form
  • Direction
  • Strength
  • Deviations from the Pattern
  • Outliers

7
Interpreting Scatterplots
  • Form
  • Is the plot linear or is it curved?
  • Strength
  • Does the plot follow the form very closely or is
    there a lot of scatter (variation)?

8
Interpreting Scatterplots
  • Direction
  • Is the plot increasing or is it decreasing?
  • Positively Associated
  • Above (below) average in one variable tends to be
    associated with above (below) average in another
    variable.
  • Negative Associated
  • Above (below) average in one variable tends to be
    associated with below (above) average in another
    variable.

9
Example Scatterplot
  • The following survey was conducted in the U.S.
    and in 10 countries of Western Europe to
    determine the percentage of teenagers who had
    used marijuana and other drugs.

10
Example Scatterplot
11
Example Scatterplot
12
Example Scatterplot
  • The variables are interchangeable in this
    example.
  • In this example, Percent of Marijuana is being
    used as the explanatory variable (since it is on
    the x-axis).
  • Percent of Other Drugs is being used as the
    response since it is on the y-axis.

13
Example - Scatterplot
  • The form is linear
  • The strength is fairly strong
  • The direction is positive since larger values on
    the x-axis yield larger values on the y-axis

14
Example - Scatterplot
  • Negative association
  • Outside temperature and amount of natural gas used

15
Correlation
  • The strength of the linear relationship between
    two quantitative variables can be described
    numerically
  • This numerical method is called correlation
  • Correlation is denoted by r

16
Correlation
  • A way to measure the strength of the linear
    relationship between two quantitative variables.

17
Correlation
  • Steps to calculate correlation
  • Calculate the mean of x and y
  • Calculate the standard deviation for x and y
  • Calculate
  • Plug all numbers into formula

18
Correlation
19
Calculating r.
  • Femur (x) 38 56 59 63 74
  • Humerus (y) 41 63 70 72 84
  • Set up a table with columns for x, y, ,
  • , , , and

20
Calculating r.
21
Calculating r
  • Recall
  • So,

22
Calculating r
  • Recall
  • So,

23
Calculating r.
  • Put everything into the formula

24
Properties of r
  • r has no units (i.e., just a number)
  • Measures the strength of a LINEAR association
    between two quantitative variables
  • If the data have a curvilinear relationship, the
    correlation may not be strong even if the data
    follow the curve very closely.

25
Properties of r
  • r always ranges in values from 1 to 1
  • r 1 indicates a straight increasing line
  • r -1 indicates a straight decreasing line
  • r 0 indicates no LINEAR relationship
  • As r moves away from 0, the linear relationship
    between variables is stronger

26
Properties of r
  • Changing the scale of x or y will not change the
    value of r
  • Not resistant to outliers
  • Strong correlation ? Causation
  • Strong linear relationship between two variables
    is NOT proof of a causal relationship!

27
Reading JMP Output
  • The following is some output from JMP where I
    considered Blood Alcohol Content and Number of
    Beers. The explanatory variable is the number of
    beers. Blood alcohol content is the response
    variable.

28
Reading JMP Output
29
Reading JMP Output
Summary of Fit
30
Reading JMP Output
  • RSquare r2
  • This means
  • I know this is positive because the scatterplot
    has a positive direction.
  • The Mean of the Response is the mean of the ys
    or
Write a Comment
User Comments (0)
About PowerShow.com