EC484: Econometric Analysis
Bootstrap
Taisuke Otsu
London School of Economics
2025/6
Contents
- Basic idea (Hansen Ch. 10.6-8)
- Bootstrap theory (Hansen Ch. 10.9-13, 16, 19, 21)
- Bootstrap for OLS and GMM (Hansen Ch. 10.27)
1. Basic idea
Example
• Consider wage equation estimated by CPS subsample
By delta method
t-test or CI for or is based on asymptotic approximation
Bootstrap provides alternative se and CI
Bootstrap algorithm
• Original sample for Y = log(wage) and X = educ
| i | 1 | 2 | 3 | 4 | 5 | 18 | 19 | 20 | |
| Yi | 3.64 | 3.71 | 2.65 | 2.82 | 3.50 | ··· | 3.18 | 3.59 | 3.14 |
| Xi | 18 | 18 | 13 | 16 | 16 | ··· | 16 | 18 | 16 |
• Bootstrap (re)sample is constructed by randomly drawing integers of size from with replacement
. E.g. If we draw, {16, 5, 17, 20, 20, … , 7, 1, 8}, then bootstrap sample is (same unit can appear)
By , we can compute and , say
• We can repeat this process as many as we want to get
Bootstrap variance and se
• Generally consider estimator θˆ and its bootstrap counterparts
• Bootstrap variance estimator of is
If θˆ is scalar, bootstrap standard error is
and normal-approximation bootstrap CI is
Percentile CI
Again is scalar. Several ways to construct CI
• Based on , compute its sample quantiles . Percentile bootstrap CI for is
• This CI is transformation respecting, i.e. for any monotone transform , its CI is given by
2. Bootstrap theory
Bootstrap
• Consider iid sample from population distribution function F
• Consider scalar estimator with cdf
• Usually is very complicated function depending on n, so need some approximation
Asymptotic approximation to is
• Bootstrap provides an alternative approximation to , that is
where is empirical distribution function (EDF)
• is called bootstrap distribution and bootstrap conducts inference based on
• Note: If is iid sample from , then satisfies
i.e. in ”bootstrap world”, plays role of population
Empirical distribution
• Popular choice for is empirical distribution function (EDF)
which is consistent and asymptotically normal
for each
• From EDF , bootstrap sample is obtained by drawing from observed sample with equal 1/n weights (with replacement)
Distribution of bootstrap observations
• Let be (scalar) random draw from EDF by original sample
• How can we think about distribution of There are two randomness: (i) randomness for resampling from and (ii) randomness of original sample
• To separate these randomness, introduce conditional probability and mean given
• Note: conditional distribution is always discrete (with support
Conditional mean and variance
• Conditional mean of is
• Conditional variance of is
Similarly for
and
Convergence for bootstrap statistics
• How can we think about convergence or asymptotic distribution for
• Definition: Bootstrap statistic converges in bootstrap probability to W as (denoted by if
. Note: “ p→” is (standard) convergence in probability for original sample
Property: If , then
Bootstrap WLLN
• Theorem: If is iid and , then
• Proof: Since implies , it is enough to show the first statement
Pick any ϵ > 0. By Markov inequality
where (sample var) satisfying σˆ2 p→ var[Zi ]
Asymptotic distribution
• For asymptotic distribution of , we need bootstrap version of convergence in distribution
• Definition: Bootstrap statistic converges in bootstrap distribution to W as (denoted by if
for each u at which is continuous
Bootstrap CLT
• Theorem: If is iid and , then
where
• Proof follows by applying Lindeberg-Feller CLT under conditional distribution given . We cannot use Lindeberg-Levy because conditional distribution varies with n
See Hansen, Ch. 10.14 for detail
Bootstrap CMT
• Theorem: If and is continuous at c, then
• Theorem: If and has set of discontinuity points such that , then
Consistency of bootstrap variance estimator
• Based on above tools, now consider bootstrap variance estimator of θˆ. Since itself typically does not have limiting distribution, suppose
for some sequence (typically and distribution ξ (typically normal)
• Three variance concepts
Under certain conditions (see Theorem 10.9 of Hansen-B)
Consistency of bootstrap percentile CI
• For percentile CI, suppose
where pdf ξ is symmetric around zero
• Then
where is -th quantile of
• Sketch: Let be cdf of . Since -th quantile of is , we can show
Thus
Percentile-t interval
• We can make better CI than Cpc which does not require symmetry and theoretically more accurate
• Consider t-statistic
Let be the -th quantile of
• If we know , then
• Thus (infeasible) exact CI is given by
Estimate by , which is
where is s.e. of computed by b-th resample)
Then percentile-t bootstrap CI is
which does not require symmetry of ξ
Remark: Higher order refinement
• Indeed percentile-t CI, Cpt, is more accurate than other methods in the sense that
• In contrast, percentile CI, , or asymptotic CI (say Casy) exhibit
• All CIs are asymptotically valid (in the sense of but approximation error of has smaller order
See Hansen, Ch. 10.20 for detail
Bootstrap test
• Consider testing H0 : θ = c against H1 : for scalar θ
For t-statistic , bootstrap counterpart is
• Note: should be centered at , not c. Because θˆ is true value in bootstrap world
• Bootstrap estimate for critical value is obtained by -th quantile of
Reject H0 if
Bootstrap p-value
We can also estimate p-value by bootstrap
• Recall that p-value is defined as
where is null distribution of |T |
Thus bootstrap estimate for p is
where is bootstrap distribution of
• By bootstrap algorithm is obtained as
Bootstrap bias estimation
• Let be an estimator of θ. We are interested in evaluation of bias
• Let . Then bias is written as E[Tn]
• Bootstrap counterpart of is
and bootstrap counterpart of E[Tn] is
• Based on for can be estimated by
• Given the estimated bias , bias corrected estimator is obtained as
Bootstrap MSE estimation
• Similarly MSE of θˆ can be estimated by
which is estimated by simulation as
3. Bootstrap for OLS and GMM
Bootstrap
• Consider projection model
• Projection coefficient is estimated by OLS
• Based on bootstrap sample , bootstrap counterpart of is given by (called pairs bootstrap)
Bootstrap CI and test are applicable
Bootstrap for OLS: Regression model
• Consider regression model
• Pairs bootstrap is still valid. But to improve precision, want to impose conditional moment restriction in the bootstrap world
• One way is to hold Xi fixed and draw to satisfy conditional moment restriction, that is
where ˆei is OLS residual and is iid draws (by computer) with and var[ξi ] = 1
It satisfies
Bootstrap for GMM
• Consider moment restriction model
• For just identified case (dim g = dim θ), all results above apply
• For over identified case (dim g > dim θ), there is some issue
• In bootstrap world, empirical distribution plays role of population, so bootstrap counterpart of is
where θˆ is GMM estimator, but generally
Confidence interval
Let be bootstrap resample from . Bootstrap counterpart of GMM estimator is
where and is optimal weight based on
• All methods based on are asymptotically valid
• However, in order to achieve higher-order refinement, we need to modify bootstrap method
Recentered bootstrap
• Hall & Horowitz (1996) suggested to compute bootstrap counterpart of by
i.e. use “recentered” moments to do GMM
• Percentile-t CI based on achieves higher-order refinement
Specification test
• Test validity of overidentified moment restrictions
. Test statistic (called J-statistic)
• To approximate distribution of J by bootstrap, we should use
If we use without recentering, it fails to approximate distribution of under H0