Statistics and Data Analysis - PowerPoint PPT Presentation

1 / 55
About This Presentation
Title:

Statistics and Data Analysis

Description:

Random number generators,' e.g., coins, cards, computers ... An 'event' E will occur or not occur ... ( The four cards in the picture and one of the remaining 44. ... – PowerPoint PPT presentation

Number of Views:59
Avg rating:3.0/5.0
Slides: 56
Provided by: William7
Category:

less

Transcript and Presenter's Notes

Title: Statistics and Data Analysis


1
Statistics and Data Analysis
  • Professor William Greene
  • Stern School of Business
  • IOMS Department
  • Department of Economics

2
Statistics and Data Analysis
Part 3 Probability
3
CNN Poll Double-digit post-speech jump for Obama
plan Posted September 10th, 2009 0418 PM
ET WASHINGTON (CNN) Two out of three Americans
who watched President Barack Obama's health care
reform speech Wednesday night favor his health
care plans a 14-point gain among
speech-watchers, according to a CNN/Opinion
Research Corporation national poll of people who
tuned into Obama's address Wednesday night to a
joint session of Congress. Sixty-seven percent
of people questioned in the survey say they
support Obama's health care reform proposals that
the president outlined in his address, with 29
percent opposed. Those figures are almost
identical to a poll conducted immediately after
Bill Clinton's health care speech before Congress
in September, 1993. The audience for the speech
appears to be more Democratic than the U.S.
population as a whole. Because of this, the
results may favor Obama simply because more
Democrats than Republicans tuned into the speech.
The poll surveyed the opinions of people who
watched Wednesday night's speech, and does not
reflect the views of all Americans.
4
Probability Probable Agenda
1/52
  • Randomness and decision making
  • Quantifying randomness with probability
  • Types of probability Objective and Subjective
  • Rules (axioms) of probability
  • Probabilities of events
  • Compound events
  • Computation of probabilities
  • Independence
  • Joint events and conditional probabilities
  • Bayes Theorem

5
Decision Making Under Uncertainty
2/52
  • Understanding probability
  • Using probability to understand expected value
    and risk
  • Applications
  • Financial transactions at future dates
  • Travel mode (or time)
  • Product purchase
  • Insurance (and warranties)
  • Enter a market
  • Legal enterprises
  • Any others?
  • Life is full of uncertainty

6
What is Randomness?
3/52
  • A lack of information?
  • Can it be made to go away with enough
    information?

7
Probability
4/52
  • Quantifying randomness
  • The context An experiment that admits several
    possible outcomes
  • Some outcome will occur
  • The observer is uncertain which (or what) before
    the experiment takes place
  • Event space the set of possible outcomes.
    (Also called the sample space.)
  • Probability a measure of likelihood attached
    to the events in the event space. (Try to define
    probability without using a word that means
    probability.)

8
Assigning Probabilities to Rare Events
Colliding Bullets
9
Assigning Probabilities
Colliding Economists
10
Assign a Probability?
For all the criticism BP executives may deserve,
they are far from the only people to struggle
with such low-probability, high-cost events.
Nearly everyone does. These are precisely the
kinds of events that are hard for us as humans to
get our hands around and react to rationally,
On the other hand, when an unlikely event is all
too easy to imagine, we often go in the opposite
direction and overestimate the odds. After the
9/11 attacks, Americans canceled plane trips and
took to the road.
Quotes from Spillonomics Underestimating Risk By
DAVID LEONHARDT, New York Times Magazine, Sunday,
June 6, 2010, pp. 13-14.
11
Sources of Probability
5/52
  • Physical events mechanical. Random number
    generators, e.g., coins, cards, computers
  • Objective long run frequencies (the law of large
    numbers)
  • Subjective probabilities, e.g., sports betting,
    belief of the risk of flying. Assessments based
    on the accumulation of personal information.
  • Aggregation of subjective frequencies
    (parimutuel, sports betting lines, insurance,
    casinos, racetrack)
  • Mathematical models weather, options pricing
  • Extremely rare events can we really attach
    probabilities to these? (Found at Gettysburg, 2
    bullets that collided in midair. What is the
    probability?)

12
Rules (Axioms) of Probability
6/52
  • An event E will occur or not occur
  • P(E) is a number that equals the probability that
    E will occur.
  • By convention, 0 lt P(E) lt 1.
  • E' the event that E does not occur
  • P(E') the probability that E does not occur.

13
Essential Results for Probability
7/52
  • If P(E) 0, then E cannot (will not) occur
  • If P(E) 1, then E must (will) occur
  • E and E' are exhaustive either E or E' will
    occur.
  • Something will occur, P(E) P(E') 1
  • Only one thing can occur. If E occurs, then E'
    will not occur E and E' are exclusive.

14
Compound Outcomes (Events)
8/52
  • Define an event set of more than two possible
    equally likely elementary events.
  • Compound event An event that consists of a set
    of elementary events.
  • The compound event occurs if any of the
    elementary events occurs.

15
Counting Rule for Probabilities
9/52
  • Probabilities for compounds of atomistic equally
    likely events are obtained by counting.
  • P(Compound Event)

16
Compound Events
10/52
1 2 3
4 5 6 7
8 E A Random consumers random choice
of exactly one product Event(fruit) Event(berry
3) Event(fruity 6) Event(apple
8) P(Fruity) P(3) P(6) P(8) 1/8 1/8
1/8 3/8 P(Sweetened) P(HoneyNut 2)
P(Frosted 7) 1/8 1/8 1/4
17
Counting the Number of EventsPermutations and
Combinations
11/52
  • Permutations Number of possible arrangements
    of a set of N items
  • E.g., 4 kids, Allison, Julie, Betsy, Lesley. How
    many different lines with 3 of them?
  • AJB, ABJ, AJL, ALJ, ABL, ALB, all with Allison
    first
  • JAB, JBA, JAL, JLA, JBL, JLB, all with Julie
    first.
  • And so on 24 different lines in total.

18
Counting Permutations
12/52
  • Whats the rule?
  • N items in total
  • Choose sets of r items
  • Order matters
  • N possible first choices, then N-1 second, then
    N-2 third, and so on.
  • Nx (N-1)x(N-2)xx(N-r1)
  • 4 kids, 3 in line, 432 24 ways.

19
Permutations
13/52
20
Permutations
14/52
  • The number of ways to put N objects in order is
    N(N-1)(1) N! E.g., AJEL, ALEJ, AEJL, and so
    on. 24 possibilities
  • The number of ways to order r objects chosen out
    of N is

21
Permutations and Combinations
15/52
  • E.g., 8 Democratic presidential candidates How
    many ways can one order 2 of them? There are 8
    possibilities for the first and 7 for the second,
    so
  • 8(7)56 8!/(8-2)! 8!/6!

22
Combinations and Permutations
16/52
  • What if order doesnt matter?
  • E.g., out of A,J,E,L, 12 permutations of 2 are AJ
    AE AL JE JL EL LE LJ EJ LA EA JA. Here order
    matters
  • But suppose AJ and JA are the same event (order
    doesnt matter)? The list double counts.
  • The number of repetitions is the number of
    permutations of the r items, which is r!.

23
Combinations and Permutations
17/52
  • The number of combinations is the number of
    permutations when order does not matter.

24
Useful Results
18-19/52
25
Appplications Games of Chance Poker
20/52
  • In a 5 card hand from a deck of 52, there are
    5251504948)/(54321) different possible
    hands. (Order doesnt matter). 2,598,960
    possible hands.
  • How many of these hands have 4 aces? 48 the 4
    aces plus any of the remaining 48 cards.

26
Probability of 4 Aces
21/52
27
The Dead Mans Hand
22/52
  • The dead mans hand is 5 cards, 2 aces, 2 8s and
    some other 5th card (Wild Bill Hickok was
    holding this hand when he was shot in the back
    and killed in 1876.) The number of hands with
    two aces and two 8s is 44 1,584
  • The rest of the story claims that Hickok held all
    black cards (the bullets). The probability for
    this hand falls to only 44/2598960. (The four
    cards in the picture and one of the remaining
    44.)
  • Some claims have been made about the 5th card,
    but noone is sure there is no record.

http//en.wikipedia.org/wiki/Dead_man's_hand
28
Counting the Dead Mans Cards
23/52
The Aces 6 There are 6 possible pairs out of
A? A? A? A? (? ?) (??)
(??) (??) (??) (??) The 8s There are also
6 possible pairs out of 8? 8? 8? 8?
(? ?) (??) (??) (??) (??)
(??) There are 44 remaining cards in the deck
that are not aces and not 8s. The total number
of possible different hands is therefore 6(6)(44)
1,584. If he held the bullets (black cards),
then there are only (1)(1)(44) 44
combinations.There is a claim that the 5th card
was a diamond. This reduces the number
ofpossible combinations to (1)(1)(11).
29
Some Poker Hands
24-26/52
Full House 3 of one kind, 2 of another. (Also
called a boat.)
Royal Flush Top 5 cards in a suit
Flush 5 cards in a suit, not sequential
Straight Flush 5 sequential cards in the same
suit suit
Straight 5 cards in a numerical row, not the
same suit
4 of a kind plus any other card
30
Probabilities of 5 Card Poker Hands
27/52
Poker Hand        Different Combinations    
Probability Odds Against------------------------
--------------------------------------------------
Royal Straight Flush                 4       
.0000015391 649,7291Other Straight
Flush                36        .0000138517
72,1931 Straight Flush (Royal or other)
40 .0000153908 64,9731 Four of a
kind                     624       
.0002400960 4,1641Full House               
       3,744        .0014405762
6931Flush                           
5,108        .0019654015
5081Straight                       
10,200        .0039246468 2541Three of a
kind                 54,912        .0211284514
461Two Pairs                     
123,552        .0475390156 201One
Pair                     1,098,240       
.4225690276 1.41High card only (None of
above)  1,302,540        .5011773940
11Total                       
2,598,960       1.0000000000
http//www.durangobill.com/Poker.html
31
Odds (Ratios)
28/52
32
Odds vs. 5 Card Poker Hands
29/52
Poker Hand        Combinations    
Probability Odds Against------------------------
--------------------------------------------------
Royal Straight Flush                 4       
.0000015391 649,7291Other Straight
Flush                36        .0000138517
72,1931 Straight Flush (Royal or other)
40 .0000153908 64,9731 Four of a
kind                     624       
.0002400960 4,1641Full House               
       3,744        .0014405762
6931Flush                           
5,108        .0019654015
5081Straight                       
10,200        .0039246468 2541Three of a
kind                 54,912        .0211284514
461Two Pairs                     
123,552        .0475390156 201One
Pair                     1,098,240       
.4225690276 1.41High card only (None of
above)  1,302,540        .5011773940
11Total                       
2,598,960       1.0000000000
http//www.durangobill.com/Poker.html
33
Joint Events
30/52
  • Pairs (or groups) of events A and B
  • One or the other occurs A or B A ? B
  • Both events occur A and B A ? B
  • Independent events Occurrence of A does not
    affect the probability of B
  • An addition rule P(A ? B) P(A)P(B)-P(A ? B)
  • The product rule for independent events
  • P(A ? B)
    P(A)P(B)

34
Joint Events Pick a Card, Any Card
31/52
  • Event A Diamond P(Diamond) 13/52
  • 2? 3? 4? 5? 6? 7? 8? 9? 10? J? Q? K? A?
  • Event B Ace P(Ace) 4/52 A? A? A? A?
  • Event A or B Diamond or Ace P(Diamond or
    Ace) P(Diamond) P(Ace) P(Diamond Ace)
    13/52 4/52 1/52 16/52

35
Application
32/52
Survey of 27326 German Individuals over 5
yearsFrequency in black, sample proportion in
red.E.g., .041861144/27326, .5212314243/27326
36
The Addition Rule - Application
33/52
Survey of 27326 German Individuals over 5 years
An individual is drawn randomly from the sample
of 27,326 observations. P(Female or Insured)
P(Female) P(Insured) P(Female and Insured)
.47877
.88571 - .43691
.92757
37
Product Rule for Independent Events
34/52
  • If two events A and B are independent, the
    probability that both occur is P(A ?B) P(A)P(B)
  • Example I will fly to Washington (and back) for
    a meeting on Monday. I will use the train on
    Tuesday. P(Late if I fly) .6.
    P(Late if I take the train).2.
    Late or on time for the two days are
    independent.
  • What is the probability that I will miss at least
    one meeting?
  • P(Late Monday, Not late on Tuesday) .6(.8)
    .48
  • P(Not late Monday, Late Tuesday) .4(.2)
    .08
  • P(Late Monday and Late Tuesday) .6(.2)
    .12
  • P(Late at least once)
    .48.08.12 .68

38
Joint Events and Joint Probabilities
35/52
  • Marginal probability Probability for each
    event, without considering the other.
  • Joint probability Probability that two
    (several) events happen at the same time

39
Marginal and Joint Probabilities
36/52
Survey of 27326 German Individuals over 5
yearsConsider drawing an individual at random
from the sample.
Marginal Probabilities P(Male).52123,
P(Insured) .88571
Joint Probabilities P(Male and Insured) .44880
40
Conditional Probability
37/52
  • Conditional event occurrence of an event
    given that some other event has occurred.
  • Conditional probability Probability of an event
    given that some other event is certain to occur.
    Denoted P(AB) Probability of A will occur
    given B occurred.
  • Prob(AB) Prob(A and B) / Prob(B)

41
Conditional Probabilities
38/52
  • Company ESI sells two types of software, Basic
    and Advanced, to two markets, Government and
    Academic.Sales occur with the following
    probabilities
  • Academic Government To
    tal
  • Basic .4 .2 .6
  • Advanced .3 .1 .4
  • Total .7 .3 1.0
  • P(Basic Academic) .4 / .7
    .571
  • P(Government Advanced) .1 / .4 .25

42
Conditional Probabilities
39/52
An individual is drawn randomly from the sample
of 27,326 individuals in the German socioeconomic
panel.
P(UninsuredFemale) P(Uninsured and
Female)/P(Female) .04186/.47877.08743 P(MaleI
nsured) P(Male and Insured)/P(Insured)
.44880/.88571.50671
43
The Product Rule for Conditional Probabilities
40/52
  • For events A and B, P(A ?B)P(AB)P(B)
  • Example You draw a card from a well shuffled
    deck of cards, then a second one. What is the
    probability that the two cards will be a pair?
  • There are 13 cards. Let A1 be the card on the
    first draw and A2 be the second one. Then, P(A1
    ?A2) P(A1)P(A2A1).
  • For a pair of kings, P(K1) 1/13. P(K2K1)
    3/51.
  • P(K1 ?K2) (1/13)(3/51). There are 13 possible
    pairs, so P(Pair) 13(1/13)(3/51) 1/17.

44
Independent Events
41/52
  • Events are independent if the occurrence of one
    does not affect probabilities related to the
    other.
  • Events A and B are independent if P(AB) P(A).
    I.e., conditioning on B does not affect the
    probability of A.

45
Independent Events? Pick a Card, Any Card
42/52
  • P(Red card drawn) 26/52 1/2
  • P(Ace drawn) 4/52 1/13.
  • P(AceRed) (2/52) / (26/52) 1/13
  • P(Ace) P(AceRed) so Red Card and Ace are
    independent.

46
Independent Events?
43/52
  • Company ESI sells two types of software, Basic
    and Advanced, to two markets, Government and
    Academic.Sales occur randomly with the following
    probabilities
  • Academic Government To
    tal
  • Basic .4 .2 .6
  • Advanced .3 .1 .4
  • Total .7 .3 1.0
  • P(Basic Academic) .4 / .7
    .571 not equal to P(Basic).6
  • P(Government Advanced) .1 / .4 .25
    not equal to P(Govt) .3

47
Litigation Risk Analysis
44/52
P(Outcome Decision)
Decision
P(Result Outcome,DecisionL)
http//www.jenkens.com/Image/Jenkens/Content/The
Decision Tree.pdfsearch2222litigation
risk222Bgilchrist22
48
Litigation Risk Analysis
45/52
If we decide to LITIGATE, the probability we will
PREVAIL and FIND ASSET is P(Prevail,Find Asset)
P(Find AssetPrevail) P(Prevail) .5 .5 .25.
49
46/52
Litigation Risk Analysis Using Probabilities to
Determine a Strategy
Two paths to a favorable outcome. Probability
(upper) .7(.6)(.4) (lower) .5(.3)(.6) .168
.09 .258. How can I use this to decide
whether to litigate or not?
50
Using Conditional Probabilities Bayes Theorem
47/52
51
Using Bayes Theorem
48/52
If I choose a cookie from Bowl 1, the
probability it is chocolate chip is P(CC1)
P(CC and 1)/P(1) .125 / .5 .250
1/4 If you give me a chocolate chip cookie, what
is the probability it came from Bowl 1?
P(1CC) P(CC1)P(1)/P(CC)
(1/4)(1/2)/(3/8) 1/3
Example from http//en.wikipedia.org/wiki/Bayes'_t
heorem
52
Drug Testing
49/52
  • Data
  • P(Test correctly indicates disease).98
    (Sensitivity)
  • P(Test correctly indicates absence).95
    (Specificity)
  • P(Disease) .005 (Fairly rare)
  • Notation
  • test indicates disease, indicates no
    disease
  • D presence of disease, N absence of disease
  • Data
  • P(D) .005 (Incidence of the disease)
  • P(D) .98 (Correct detection of the
    disease)
  • P(N) .95 (Correct failure to detect the
    disease)
  • What are P(D) and P(N)? Note, P(D) the
    probability that a patient actually has the
    disease when the test says they do.

53
More Information
50/52
  • Deduce Since P(D).98, we know P(D).02
    because P(-D)P(D)1
  • P(D) is the P(False negative).
  • Deduce Since P(N).95, we know P(N).05
    because P(-N)P(N)1
  • P(N) is the P(False positive).
  • Deduce Since P(D).005, P(N).995 because
    P(D)P(N)1.

54
Now, Use Bayes Theorem
51/52
55
Summary
52/52
  • Randomness and decision making
  • Probability
  • Sources
  • Basic mathematics (the axioms)
  • Simple and compound events and constructing
    probabilities
  • Joint events
  • Independence
  • Addition and product rules for probabilities
  • Conditional probabilities and Bayes theorem
Write a Comment
User Comments (0)
About PowerShow.com