1. ?????? (Discrete Choice Models) - PowerPoint PPT Presentation

1 / 171
About This Presentation
Title:

1. ?????? (Discrete Choice Models)

Description:

– PowerPoint PPT presentation

Number of Views:155
Avg rating:3.0/5.0
Slides: 172
Provided by: edut1550
Category:

less

Transcript and Presenter's Notes

Title: 1. ?????? (Discrete Choice Models)


1
?????????????
??? ???????????
2
?????????????
  • 1. ?????? (Discrete Choice Models)
  • 2. ?????? (Sample Selection Models)
  • 3. ?????? (Duration Models)

3
1 Discrete Choice Models  
4
 
5
 

6
???????
??????(1?? 0???) ??(????) 1?? 0?? ???? ????? ??? ??? ?? ?? ???
0 0 35 ? 45?
0 0 25 ? 70?
1 0 84?12? 500? 41 ? 40?
1 0 84?12? 300? 46 ? 80?
1 1 84?12? 86?6? 350? 55 ? 150?
1 1 84?12? 87?2? 1000? 49 ? 270?
7
Econometric model
  • Single equation model
  • System of equations model
  • Simultaneous equation Model

8
Single equation model
Can be written as
9
System of equations model
10
Simultaneous equations
11
Microeconometrics
  • Discrete choice models
  • Sample selection models
  • Duration models

12

Discrete Choice Model
  • Probit Model Logit Model
  • Multinomial Choice Model
  • Multinomial Logit Model
  • Nested Logit Model
  • Mixed Logit Model
  • Multinomial Probit Model
  • Bivariated Probit Model
  • Multivariate Probit Model
  • Sequential Choice Model
  • Ordered Probit Model
  • Count Data

13
Sample Selection Model
  • Censored model
  • Sample selection model

14
Duration Model
  • Duration model
  • Split population model

15
Binary Choice Model
Individual i
Choose A
Dont Choose A
16
(No Transcript)
17
Binary Choice Model
(Unobserved variable)
18
Probit Model
N(0, 1)
Assume
19
Binary Choice Model
  • Boczar (1978, J. of Finance)

Personal loan debtor
Bank
Finance Company
20
Binary choice model
Obtain a credit from a bank
Obtain a credit from a financial company
21
The probability of choosing alternative 1 is
given by
22
Probit Model
23
The probability of choosing alternative 0 is
given by
24
Probit Model
25
Probit model
The loglikelihood for this model is given by
26
Properties of Maximum Likilihood Estimator
27
(No Transcript)
28
Probit, logit vs. OLS
29
Modeling Decision
  • This yes or no type decision leads to a dummy
    variable.
  • The dependent variable of our model is a dummy
    variable.
  • We will be modeling the probability function,
    P(Y1).

30
Simplest ModelLinear Probability Model
31
Picture of LPM
1
X
0
X0
X1
32
Problems of LPM
  • Predictions outside 0-1 range.
  • Heteroscedasticity
  • This can be solved and a estimated GLS estimator
    developed.
  • Coefficient Determination has little meaning.
  • Constant marginal effect.

33
Interpreting the Probit Model
34
The logit model
35
The Log-Likelihood function

36
LIMDEP Command   Read NVAR7Nobs200
filenames..   Regress LHSy1
RHSone,x1,x2,. Probit LHSy1
RHSone,x1,x2,. Logit LHSy1
RHSone,x1,x2,.
37
PROBIT, LOGIT Goodness of Fit Measures?
  • More often cited are R-square values based on
    likelihood ratios.
  • Maddala  
  • R2 1 - (LR / LUR) 2/n
  • McFadden R-square
  • R2 1 - (log(LUR ) / log(LR))

38
Jacobson and Roszbach (2003, Journal of Banking
Finance) ----- Bivariate Probit Model
Providing a loan?
Loan defaults?
Yes
No
Yes
No
39
Bivariate Probit Model
(if loan granted)
(if loan not granted)
(if loan does not default)
(if loan defaults)
40
Bivariate Probit model
41
Multivariate Probit Model
Cigarette
Alcohol
Marijuana
Cocaine
YES
No
No
YES
YES
YES
No
No
42
(No Transcript)
43
Multivariate Normality
44
Hausman and Wise (1978, Econometrica)
45
Multivariate Probit Model
  • J3, Clark (1961)
  • J4, Hausman and Wise (1978, Econometrica)
  • J gt 4
  • McFadden (1989, Econometrica)
  • ------ Simulation-Based Estimation
  • ------ high dimensional integrals
  • Stern (1997, Journal of Economic Literature)
  • ----- Simulated Maximum Likelihood Estimator
  • ----- Simulated Moment Estimator
  • ----- GHK simulator

46
LIMDEP CommandBivariated Probit Model   Read
NVAR7Nobs200 filenames..   Bivariate
Probit LHSY1, Y2
RHSone,x1,x2,.
RH2one,z1,z2,.
47
Multivariate Probit Model
48
Multinomial Choice Model Example Credit Card
Individual i
Alternatives
J

2
3
1
49
Multinomial Choice Model
50
Multinomial Choice Model
51
Multinomial Logit Model
Let
be the probabilities associated these m categories
( j1,2,.m-1)
52
If
53
McFadden 1973
54
Multinomial Logit Model
55
If the ith individual falls in the jth category
otherwise
56
Independence of Irrelevant Alternatives (IIA)

57
Ordered Probit Model
Example Blume, Lim, Mackinlay (1998, Jornal of
Finance) Corporate bond rating (????)
AAA
AA
A
BBB
58
Ordered Probit Model
N(0, 1)
59
(No Transcript)
60
????
?
????
?
????
?????? Never Fail
????? Eventually Fail
61
Sequential Choice Model Example???
Auction
No
Yes
No
Yes
Yes
No
62
Sequential Response Model
63
Sequential Choice Model
First Auction
No 1-F(ß1x)
YesF(ß1x)
No1-F(ß2x)
YesF(ß2x)
YesF(ß3x)
No1-F(ß3x)
64
Then the probabilities can be written as
65
Model Selection Joint decision vs.
Sequential decision
EXAMPLE
Bivariate Probit Model ? Multinomial Choice
Model? Ordered Probit Model? Sequential
Choice Model?
66
Model Selection
EXAMPLE
Ioannides and Rosenthal (1994, The Review of
Economics and Statistics) Estimating the
consumption and investment demand for housing and
their effect on housing tenure status
67
Multinomial Choice Model?
(?????)
(??????)
(??????)
(???????)
68
Ordered Probit Model?
Intensity of Utility
(???????)
(??????)
(??????)
(?????)
69
Sequential Choice Model?
???
??
??
???
70
Bivariate Probit Model ?
??
???
??
???
71
Bivariate Probit Model
72
Bivariate Probit model
73
Multivariate Probit Model
Cigarette
Alcohol
Marijuana
Cocaine
YES
No
No
YES
YES
YES
No
No
74
(No Transcript)
75
Multivariate Normality
76
Multivariate Probit Model
  • J3, Clark (1961)
  • J4, Hausman and Wise (1978, Econometrica)
  • J gt 4
  • McFadden (1989, Econometrica)
  • ------ Simulation-Based Estimation
  • ------ high dimensional integrals
  • Stern (1997, Journal of Economic Literature)
  • ----- Simulated Maximum Likelihood Estimator
  • ----- Simulated Moment Estimator
  • ----- GHK simulator

77
ApplicationSurvey bias models
  • Censored model Deaton and Irish (1984)
  • Probit model Bollinger and David (1997),
    Abrevaya and Hausman (1999a)
  • Multinomial logit model Hsiao and Sun (1999)
  • Ordered probit model Dustman and van Soest
    (2004)
  • Duration model Torelli and Trivellato (1993),
    Abrevaya and Hausman (1999b)

78
Bollinger and David (1997)
  • Binary choice model

Did not use the substance
Used the substance
Lied
Did not lie
Lied
Did not lie
79
Hsaio and Sun (1999)
  • Binary choice model

Did not use the substance
Used the substance
lied
Did not lie
80
Table 1 Replied
Substance Use qi qi yi(1 - wi
)
yi 1 yi 0
(used the substance) (did not use the substance)
wi 1 (lied) 0 Not Applicable
wi 1 (lied) (reply did not use) Not Applicable
wi 0 (did not lie) 1 0
wi 0 (did not lie) (reply used) (reply did not use)
81
Model 1 Uniform one-sided survey response bias
model
The loglikelihood for this model is given by
82
Model 2 Heterogeneous and independent one-sided
survey response bias model
83
Model 3 Heterogeneous and dependent one-sided
survey response bias model
Assume
84
Partial observability model Poirier (1980)
85
Partial observability model Poirier (1980)
  • Zero-inflated Poisson model
  • Double hurdle model
  • Split population model

86
Leung and Yu(2003) Empirical Analysis
  • 1998 U.S. National Household Survey on Drug Abuse
    (NHSDA)
  • Tobacco
  • Alcohol
  • Marijuana
  • Cocaine

87
Table 4 Model 1 Uniform One-sided Survey
Response Bias Model
Variable Tobacco Alcohol Marijuana Cocaine
------- ---- ---- ----
------- ---- ---- ----
a 0.1386 0.1614 0.0003 0.4701
-0.0586 -0.0192 -0.3202 -0.7591
Loglikelihood -13852.738 -15181.128 -6073.143 -1519.109
88
Count regression
  • Appropriate when the dependent variable
  • is a non-negative integer (0,1,2,3,)

89
  • Distributions and Models
  • Poisson Model
  • Negative Binomial Model
  • Zero-inflated Poisson Model
  • Zero-inflated Negative Binomial Model

90
Poisson Regression
91
Why not use linear regression?
  • Typical count data in health care
  • Large number of 0 values and small values
  • Discrete nature of data
  • Result
  • Unusual distribution

92
Normal distribution vsPoisson distribution
Bell shaped curve
Normal distribution
Poisson distribution
Not bell shapednext slide
Intensity of process
93
Poisson with ? 0.5
94
When Count Data Cannot be Treated Normally
95
When they probably can.
96
What happens when mean ? variance?
  • Overdispersion when variance gt mean
  • Sometimes called unobserved heterogeneity
  • Zero-Inflated More zeros than expected by
    Poisson distribution
  • Ex. If ?1 (mean1), then we expect 37 0s

97
Overdispersion
98
Poisson Regression models
Negative Binomial Regression models
u is Weibull distribution
99
Overdispersion and Zero Inflation
100
Zero-inflated Poisson
101
Example
  • Bao article
  • Predicting the use of outpatient mental health
    services do modeling approaches make a
    difference? Inquiry. 2002 Summer39(2)168-83.

102
Observed data
103
Poisson and Zero-Inflated Poisson
104
Negative Binomial Model
105
Zero-Inflated Negative Binomial Model
106
TOBIT Model
107
TOBIT MODEL


if
108
TOBIT MODEL
109
TOBIT MODEL
110
TOBIT MODEL
111
TOBIT MODEL
112
TOBIT MODEL
113
TOBIT MODEL
114
TOBIT MODEL
where
p.f.
115
TOBIT MODEL
let
116
TOBIT MODEL
NOTE
117
TOBIT MODEL
let
118
TOBIT MODEL
119
TOBIT MODEL
120
The log-likelihood function

121
Sample Selection Model
122
Self- Selection Model
123
Sample Selection Model
124
Heckmans Two-step Estimator (1979)
125
Application Heckmans Two-step Estimator vs.
Two-part model
Duan, Manning, Morris, Newhouse(1984,
JBES) Maddala (1985a, 1985b) Hay, Leu,
Fohrer(1987,JBES) Manning, Duan, Rogers (1987,
JE) Leung and Yu (1996, JE) Dow and Norton (2003,
HSORM) Dow and Norton (2005)
126
Sequential Choice Model?
???
??
??
???
0
127
Joint Choice Model
???
??
???
??
128
Duration Models
  • ? Censored Data
  • ? Unobserved Heterogeneity
  • ? Time-Varying Covariates

129
D
C

C
D
D
End of study
130
  • Hazard Rate

131
  • Survival Rate


Hazard Rate and Survival Rate

132
Duration Model
133
  • Distributions
  • Parametric
  • Expoential
  • Weibull
  • Log-normal
  • Log-logistic
  • Gamma
  • Semi-parametric
  • Coxs partial likelihood estimator

134
LIMDEP Command---Duration Model   Read
NVAR7Nobs200 filenames..   Survival
LHSln(time), status (exit1) RHSone,x1,x2,.
modelExponential Survival LHSln(time),
status (exit1) RHSone,x1,x2,.
modelWeibull Coxs Semiparametric
Estimator Survival LHSln(time), status
(exit1) RHSone,x1,x2,.
135
TOBIT MODEL


if
136
TOBIT MODEL
137
TOBIT MODEL
138
TOBIT MODEL
139
TOBIT MODEL
where
p.f.
140
TOBIT MODEL
let
141
TOBIT MODEL
NOTE
142
TOBIT MODEL
let
143
TOBIT MODEL
144
TOBIT MODEL
NOTE
by LHopital rule
145
Duration Model
D
C

C
D
D
End of study
146
Duration model
  • Censored data
  • Unobserved heterogeneity
  • Time-varying covariates

147
2.2 Hazard Analysis
148
Survival rate and Hazard rate
149
2.2 Nonparametric Hazard Analysis
  • Kaplan-Meier estimator
  • Life table estimator

150
Figure 2 Kaplan-Meier Estimates of Survival
Function
151
Figure 3 Life Table Estimates of Survival
Function
152
The density and survival functions
f (Ti?wi) the probability density function
of the failure time S
(ti?wi) the probability of survival
153
The specifications for f (Ti?xi) and S
(ti?xi)
  • Exponential
  • Weibull
  • Log-logistic
  • Log-normal

154
Figure 4 Life Table Estimates of Hazard
Functions
155
Eventually fail assumption
156
????
?
?
????
????
Eventually Fail Assumption
157
????
?
????
?
????
?????? Never Fail
????? Eventually Fail
158
Schmidt and Witte (1989) --- Split
population duration model
G (?xi) the probability of eventual failure f
(Ti?wi) the probability density function
of the failure time S (Ti?wi) the
probability of survival
159
Schmidt and Witte (1989) --- Likelihood function
160
4.3 Multivariate Split Population Duration Model
161
Multivariate probit model
162
Multivariate duration model
163
Unobserved heterogeneity
The frailty ( m 1,2) is assumed to follow
a gamma distribution with mean 1 and variance

164
Whether Part
  • individuals probability
    of eventual failure for a type k event (k
    1,2,3,4).
  • follows a Weibull distribution

165
Duration Part
  • Assume the survival function is log-logistic. The
    second frailty
  • enters the hazard function as

, and is the failure time or the
where
censored time, whichever is earlier.
166
The cumulative hazard, the survival function, and
the density function are
167
The likelihood function is given by

168
Partial likelihood Estimation
169
Person Event Time Li
1 1 3
2 2 8
3 10
4 15
5 3 21
6 4 30
7 5 32
8 52
9 52
10 52
170
B.3 Simultaneous Equations Models
  • M. J. Lee (1995, Journal of Applied Econometrics)

171
  • ????
Write a Comment
User Comments (0)
About PowerShow.com