EC484: Econometric Analysis

Bootstrap

Taisuke Otsu

London School of Economics

2025/6

Contents

  1. Basic idea (Hansen Ch. 10.6-8)
  2. Bootstrap theory (Hansen Ch. 10.9-13, 16, 19, 21)
  3. Bootstrap for OLS and GMM (Hansen Ch. 10.27)

1. Basic idea

Example

• Consider wage equation estimated by CPS subsample

By delta method

t-test or CI for or is based on asymptotic approximation
Bootstrap provides alternative se and CI

Bootstrap algorithm

• Original sample for Y = log(wage) and X = educ

i12345181920
Yi3.643.712.652.823.50···3.183.593.14
Xi1818131616···161816

• Bootstrap (re)sample is constructed by randomly drawing integers of size from with replacement

. E.g. If we draw, {16, 5, 17, 20, 20, … , 7, 1, 8}, then bootstrap sample is (same unit can appear)

By , we can compute and , say
• We can repeat this process as many as we want to get

Bootstrap variance and se

• Generally consider estimator θˆ and its bootstrap counterparts
• Bootstrap variance estimator of is

If θˆ is scalar, bootstrap standard error is

and normal-approximation bootstrap CI is

Percentile CI

Again is scalar. Several ways to construct CI
• Based on , compute its sample quantiles . Percentile bootstrap CI for is

• This CI is transformation respecting, i.e. for any monotone transform , its CI is given by

2. Bootstrap theory

Bootstrap

• Consider iid sample from population distribution function F
• Consider scalar estimator with cdf

• Usually is very complicated function depending on n, so need some approximation
Asymptotic approximation to is

• Bootstrap provides an alternative approximation to , that is

where is empirical distribution function (EDF)

is called bootstrap distribution and bootstrap conducts inference based on
• Note: If is iid sample from , then satisfies

i.e. in ”bootstrap world”, plays role of population

Empirical distribution

• Popular choice for is empirical distribution function (EDF)

which is consistent and asymptotically normal

for each

• From EDF , bootstrap sample is obtained by drawing from observed sample with equal 1/n weights (with replacement)

Distribution of bootstrap observations

• Let be (scalar) random draw from EDF by original sample
• How can we think about distribution of There are two randomness: (i) randomness for resampling from and (ii) randomness of original sample
• To separate these randomness, introduce conditional probability and mean given

• Note: conditional distribution is always discrete (with support

Conditional mean and variance

• Conditional mean of is

• Conditional variance of is

Similarly for

and

Convergence for bootstrap statistics

• How can we think about convergence or asymptotic distribution for
• Definition: Bootstrap statistic converges in bootstrap probability to W as (denoted by if

. Note: “ p→” is (standard) convergence in probability for original sample
Property: If , then

Bootstrap WLLN

• Theorem: If is iid and , then

• Proof: Since implies , it is enough to show the first statement

Pick any ϵ > 0. By Markov inequality

where (sample var) satisfying σˆ2 p→ var[Zi ]

Asymptotic distribution

• For asymptotic distribution of , we need bootstrap version of convergence in distribution
• Definition: Bootstrap statistic converges in bootstrap distribution to W as (denoted by if

for each u at which is continuous

Bootstrap CLT

• Theorem: If is iid and , then

where

• Proof follows by applying Lindeberg-Feller CLT under conditional distribution given . We cannot use Lindeberg-Levy because conditional distribution varies with n
See Hansen, Ch. 10.14 for detail

Bootstrap CMT

• Theorem: If and is continuous at c, then

• Theorem: If and has set of discontinuity points such that , then

Consistency of bootstrap variance estimator

• Based on above tools, now consider bootstrap variance estimator of θˆ. Since itself typically does not have limiting distribution, suppose

for some sequence (typically and distribution ξ (typically normal)

• Three variance concepts

Under certain conditions (see Theorem 10.9 of Hansen-B)

Consistency of bootstrap percentile CI

• For percentile CI, suppose

where pdf ξ is symmetric around zero

• Then

where is -th quantile of

• Sketch: Let be cdf of . Since -th quantile of is , we can show

Thus

Percentile-t interval

• We can make better CI than Cpc which does not require symmetry and theoretically more accurate
• Consider t-statistic

Let be the -th quantile of
• If we know , then

• Thus (infeasible) exact CI is given by

Estimate by , which is

where is s.e. of computed by b-th resample)

Then percentile-t bootstrap CI is

which does not require symmetry of ξ

Remark: Higher order refinement

• Indeed percentile-t CI, Cpt, is more accurate than other methods in the sense that

• In contrast, percentile CI, , or asymptotic CI (say Casy) exhibit

• All CIs are asymptotically valid (in the sense of but approximation error of has smaller order
See Hansen, Ch. 10.20 for detail

Bootstrap test

• Consider testing H0 : θ = c against H1 : for scalar θ
For t-statistic , bootstrap counterpart is

• Note: should be centered at , not c. Because θˆ is true value in bootstrap world
• Bootstrap estimate for critical value is obtained by -th quantile of
Reject H0 if

Bootstrap p-value

We can also estimate p-value by bootstrap
• Recall that p-value is defined as

where is null distribution of |T |

Thus bootstrap estimate for p is

where is bootstrap distribution of

• By bootstrap algorithm is obtained as

Bootstrap bias estimation

• Let be an estimator of θ. We are interested in evaluation of bias
• Let . Then bias is written as E[Tn]
• Bootstrap counterpart of is

and bootstrap counterpart of E[Tn] is

• Based on for can be estimated by

• Given the estimated bias , bias corrected estimator is obtained as

Bootstrap MSE estimation

• Similarly MSE of θˆ can be estimated by

which is estimated by simulation as

3. Bootstrap for OLS and GMM

Bootstrap

• Consider projection model

• Projection coefficient is estimated by OLS

• Based on bootstrap sample , bootstrap counterpart of is given by (called pairs bootstrap)

Bootstrap CI and test are applicable

Bootstrap for OLS: Regression model

• Consider regression model

• Pairs bootstrap is still valid. But to improve precision, want to impose conditional moment restriction in the bootstrap world
• One way is to hold Xi fixed and draw to satisfy conditional moment restriction, that is

where ˆei is OLS residual and is iid draws (by computer) with and var[ξi ] = 1

It satisfies

Bootstrap for GMM

• Consider moment restriction model

• For just identified case (dim g = dim θ), all results above apply
• For over identified case (dim g > dim θ), there is some issue
• In bootstrap world, empirical distribution plays role of population, so bootstrap counterpart of is

where θˆ is GMM estimator, but generally

Confidence interval

Let be bootstrap resample from . Bootstrap counterpart of GMM estimator is

where and is optimal weight based on

• All methods based on are asymptotically valid
• However, in order to achieve higher-order refinement, we need to modify bootstrap method

Recentered bootstrap

• Hall & Horowitz (1996) suggested to compute bootstrap counterpart of by

i.e. use “recentered” moments to do GMM

• Percentile-t CI based on achieves higher-order refinement

Specification test

• Test validity of overidentified moment restrictions

. Test statistic (called J-statistic)

• To approximate distribution of J by bootstrap, we should use

If we use without recentering, it fails to approximate distribution of under H0