Background

Think of a continuous criterion for treatment.


Sharp Regression Discontinuity

It occurs when treatment is a deterministic and discontinuous function of covariate .

For example, (whether or not being treated) = if . otherwise.

An example would be the National Merit Scholarship in US awarded to students on a GPA-like basis ( ). The core assumption here is that, those around the threshold should have similar outcomes, except that those above received the scholarship.

The lack of this setting is that we could never observe two different treatment for the same covariate . Thus we could only need to extrapolate for , do causal inference around (near) the threshold . Because by taking the limit we could find the approximation.

Identification Assumption: Thus we need the covariate to be continuous!

Consider the function that we regress:

The only thing to be cared is that the relationship btw and . Here is not only correlated with , it is a deterministic function of .

除了用线性模型去拟合以外,我们还可以用非线性的模型去拟合,但是这个问题的缺陷就在于,如果你采取了一种没有那么灵活的策略去拟合,那就会带来模型的 misspecification 可能在阈值处产生一个假的”跳跃”,被你误认为是 treatment effect。

Parametric RDD

最常见的错误就是只写了一个普通的 OLS 回归:

这缺少了 running variable 多项式,导致它只是一个普通的 Dummy Variable regression.

正确的 specification 应该是:

Also see Nonparametric RDD


Reference

这条笔记主要会使用 EC423 Topic 2 作为参考资料。