Title: The World Distribution of Income (from Log-Normal Country Distributions)
1The World Distribution of Income (from Log-Normal
Country Distributions)
- Xavier Sala-i-Martin
- Columbia University
- June 2008
2Goal
- Estimate WDI
- Estimate Poverty Rates and Counts
- Estimate Income Inequality across the worlds
citizens
3Data
- GDP Per capita (PPP-Adjusted).
- We usually use these data as the mean of each
country/year distribution of income (for example,
when we estimate growth regressions) - Note I decompose China and India into Rural and
Urban - Use local surveys to get relative incomes of
rural and urban - Apply the ratio to PWT GDP and estimate per
capita income in Rural and Urban and treat them
as separate data points (as if they were
different countries) - Using GDP Per Capita we know
4GDP Per Capita Since 1970
5Annual Growth Rate of World Per Capita GDP
6ร-Non-Convergence 1970-2006
7s-Divergence (191 countries)
8Histogram Income Per Capita (countries)
9(No Transcript)
10(No Transcript)
11(No Transcript)
12(No Transcript)
13Adding Population Weights
14(No Transcript)
15(No Transcript)
16(No Transcript)
17(No Transcript)
18Back
19Population-Weighted ร-convergence (1970-2006)
20But NA Numbers do not show Personal Situation
Need Individual Income Distribution
- We can use Survey Data
- Problem
- Not available for every year
- Not available for every country
- Survey means do not coincide with NA means
21Surveys not available every year
- Can Interpolate Income Shares (they are slow
moving animals) - Regression
- Near-Observation
- Cubic Interpolation
- Others
22(No Transcript)
23(No Transcript)
24(No Transcript)
25Missing Countries
- Can approximate using neighboring countries
26Method Step 1 Interpolate
- Break up our sample of countries into
regions(World Bank region definitions). - Interpolate the quintile shares for country-years
with no data, according to the following scheme,
and in the following order - Group I countries with several years of
distribution data - We calculate quintile shares of years with no
income distribution data that are WITHIN the
range of the set of years with data by cubic
spline interpolation of the quintile share time
series for the country. - We calculate quintile shares of years with no
data that are OUTSIDE this range by assuming that
the share of each quintile rises each year after
the data time series ends by beta/2i, where i is
the number of years after the series ends, and
beta is the coefficient of the slope of the OLS
regression of the data time series on a constant
and on the year variable. This extrapolation
adjustment ensures that 1) the trend in the
evolution of each quintile share is maintained
for the first few years after data ends, and 2)
the shares eventually attain their all-time
average values, which is the best extrapolation
that we could make of them for years far outside
the range of our sample. - Group II countries with only one year of
distribution data. - We keep the single year of data, and impute the
quintile shares for other years to have the same
deviations from this year as does the average
quintile share time series taken over all Group I
countries in the given region, relative to the
year for which we have data for the given
country. Thus, we assume that the countrys
inequality dynamics are the same as those of its
region, but we use the single data point to
determine the level of the countrys income
distribution. - Group III countries with no distribution data.
- We impute the average quintile share time series
taken over all Group I countries in the given
region.
27Step 2 Find the s of the lognormal distribution
using least squares
28Step 3 Compute the resulting normal
distributions, and the poverty and inequality
statistics
29Step 4 (to generate confidence intervals)
Generate a new data set of quintiles
- Having obtained our point estimates, we obtain
our standard errors by reproducing our original
set of income distribution data by drawing
samples of the sample size given in the country
information sheets for the WIDER database from
each estimated lognormal distribution
corresponding to a country-year with data,
calculating the sample quintile shares for each
of these samples, discarding the sample
30Step 5 Repeat steps 1 through 4 using the
original values of s and ยต to generate samples in
step 4
- Repeat the steps 1 through 4 to generate a new
set of poverty and inequality measures for each
country-year and the world as a whole over the 34
years. We repeat the procedure N (300) times.
Note that we do not use our estimates to generate
income shares for country-years with no data, but
we obtain the data by the procedure described
above in order to keep the data-generating
process identical to the one we used to obtain
point estimates. Note also that in all
iterations, we generate our samples from the
lognormals with parameters given by the point
estimates we obtain from the true, rather than
synthetic data.
31Step 6 Find the mean and the standard deviation
of poverty and inequality measures
- Note that we have as many observations of the
poverty and inequality measures as we have
iterations of step 5. For this paper we used
N300. We can now estimate the mean and standard
deviation of these observations. If our
assumption about the nature of the sampling in
the surveys as roughly i.i.d., our assumption
that the country-year distributions are
lognormal, and our assumption that the
interpolation provides reasonable estimates of
quintile shares for country-years with no data
are all correct, the standard deviation of the
estimates for the N iterations should converge to
the population standard deviation of the
(complicated) estimator that we use to obtain our
point estimates.
32Results
33(No Transcript)
34(No Transcript)
35(No Transcript)
36(No Transcript)
37Back
38(No Transcript)
39(No Transcript)
40(No Transcript)
41(No Transcript)
42(No Transcript)
43(No Transcript)
44(No Transcript)
45(No Transcript)
46Back
47(No Transcript)
48(No Transcript)
49(No Transcript)
50(No Transcript)
51(No Transcript)
52(No Transcript)
53(No Transcript)
54(No Transcript)
55(No Transcript)
56Back
57(No Transcript)
58(No Transcript)
59(No Transcript)
60(No Transcript)
61(No Transcript)
62(No Transcript)
63(No Transcript)
64(No Transcript)
65(No Transcript)
66(No Transcript)
67(No Transcript)
68(No Transcript)
69(No Transcript)
70(No Transcript)
71(No Transcript)
72(No Transcript)
73(No Transcript)
74(No Transcript)
75(No Transcript)
76(No Transcript)
77(No Transcript)
78Back
79(No Transcript)
80(No Transcript)
81(No Transcript)
82(No Transcript)
83(No Transcript)
84(No Transcript)
85(No Transcript)
86(No Transcript)
87(No Transcript)
88(No Transcript)
89(No Transcript)
90(No Transcript)
91(No Transcript)
92(No Transcript)
93(No Transcript)
94(No Transcript)
95(No Transcript)
96(No Transcript)
97(No Transcript)
98Back
99(No Transcript)
100(No Transcript)
101(No Transcript)
102(No Transcript)
103(No Transcript)
104(No Transcript)
105(No Transcript)
106(No Transcript)
107(No Transcript)
108(No Transcript)
109(No Transcript)
110(No Transcript)
111(No Transcript)
112(No Transcript)
113(No Transcript)
114(No Transcript)
115Back
116(No Transcript)
117(No Transcript)
118(No Transcript)
119(No Transcript)
120(No Transcript)
121(No Transcript)
122Poverty Rates
123Rates or Headcounts?
- Veil of Ignorance Would you Prefer your children
to live in country A or B? - (A) 1.000.000 people and 500.000 poor (poverty
rate 50) - (B) 2.000.000 people and 666.666 poor (poverty
rate 33) - If you prefer (A), try country (C)
- (C) 500.000 people and 499.999 poor.
124Poverty Counts
125(No Transcript)
126(No Transcript)
127Gini and Atkinson Index (coef1)
128Sen Index (Income(1-gini))
129Atkinson Welfare Level
130MLD and Theil
131(No Transcript)
132(No Transcript)
133(No Transcript)
134MLD Decomposition (ttotal, wwithin, and
bbetween country inequality)
135Theil Decomposition (ttotal, wwithin, and
bbetween country inequality)
136Regional Analysis
137Sub Saharan Africa
138East Asia
139South Asia
140Latin America
141Middle East and North Africa
142Eastern Europe
143Former Soviet Union
144Counts (all regions, 1/day)
145(No Transcript)
146(No Transcript)
147(No Transcript)
148(No Transcript)
149(No Transcript)
150Sensitivity of Functional form Poverty Rates
(1/day) with Kernel, Normal, Gamma, Adjusted
Normal, Weibull distributions
151Sensitivity of Functional form Gini (1/day)
with Kernel, Normal, Gamma, Weibull distributions
152Sensitivity of GDP Source Poverty Rates
(1/day) with PWT, WB, and Maddison
153Sensitivity of Source of GDP Gini with PWT, WB,
and Maddison
154Sensitivity of Interpolation Method Poverty
Rates 1/day with Nearest, Linear, Cubic and
Baseline
155Sensitivity of Interpolation Method Gini with
Nearest, Linear, Cubic and Baseline
156Preliminary Results on Confidence Intervals with
Lognormal Gini
157Preliminary Results on Confidence Intervals with
Lognormal MLD