Linear Discriminant Analysis and Logistic Regression - PowerPoint PPT Presentation

1 / 17
About This Presentation
Title:

Linear Discriminant Analysis and Logistic Regression

Description:

Consider purchase data compared to a person's age. ... Even data points that are correctly predicted will contribute to the error calculation. ... – PowerPoint PPT presentation

Number of Views:227
Avg rating:3.0/5.0
Slides: 18
Provided by: HalFre9
Category:

less

Transcript and Presenter's Notes

Title: Linear Discriminant Analysis and Logistic Regression


1
Linear Discriminant Analysis and Logistic
Regression
2
Background
  • Linear Discriminant Analysis predicts a
    categorical variable based on one or more metric
    independent variables

3
Example
Consider purchase data compared to a persons
age. A 0 value for Purchase represents someone
who didnt buy, while a 1 represents someone who
did.
Data
Purchase
Age
4
Graph Interpretation
Potential customers who did purchase
Purchase
Potential customers who did not purchase
Age
5
Graphical Representation
A discriminant analysis fits a linear regression
to this data as though the categorical variable
was numerical.
Purchase
Age
6
Graphical Representation ctd.
Then the Discriminant Analysis determines a
cutoff score. For a single predictor variable,
this score is where the regression line is equal
to.5. Any data points to the left of the line
are predicted to be 0, while those to the right
are predicted to be 1. For this data, any
potential customer below the age of 41 is
predicted not to buy, while anyone older is
predicted to buy.
Purchase
Age
7
A 100 Accurate Discriminate Analysis
Even a discriminant analysis that provides
perfect separation between purchasers and
non-purchasers does not have a perfect R .
2
8
Classification Accuracy
This distance will lower the total R , even
though it is a correct classification.
2
Standard Error measures the distance of the
predicted value (the regression line) from the
observed values. Even data points that are
correctly predicted will contribute to the error
calculation. Classification accuracy is a better
measure.
9
Discriminant Analysis in StatTools
10
Discriminant Analysis in StatTools
11
StatTools Interpreting Output
Predicted Values
Actual values
Correct Predictions
12
StatTools Interpreting Output ctd.
False Positives
Predicted Values
Actual values
False Negatives
Overall Accuracy
13
Logistic Regression
A logistic regression fits a sigmoid, or S-shaped
curve instead of a straight line. On some
datasets, this will provide greater
classification accuracy.
14
Logistic Regression in StatTools
15
Logistic Regression in StatTools
16
StatTools Interpreting Output
Age is highly statistically significant
Overall Accuracy
17
Comparison
  • Discriminant Analysis
  • Can be used for dependent variables with more
    than 2 possible values
  • Logistic Regression
  • Less reliant on basic assumptions of the data
    like normality and constant variance
  • More accurate on borderline points for some
    datasets
Write a Comment
User Comments (0)
About PowerShow.com