Notes on causal inference with No Overlap – Regression Discontinuity

Here is my notes on regression discontinuity from Prof. Ding’s textbook (2024) and Prof. Imai’s lecture notes.

Motivation

We often cannot run a randomized experiment and have to use/design observational studies to find a setting where credible causal inference is possible.

The key is the knowledge of treatment assignment mechanism.

Regression discontinuity design (RD Design):

RD Design is a simple and widely used quasi-experimental design. The term “quasi experimental” is to emphasize that these approaches are still framed using concepts from randomized experiments but require econometric innovations to compensate for the lack of random treatment assignment.

  1. Sharp RD Design: treatment assignment is based on a deterministic rule (i.e. we have full knowledge of how treatment is assigned)

  2. Fuzzy RD Design: encouragement to receive treatment is based on a deterministic rule

Setting

  • Binary treatment Z{0,1}

  • Potential outcomes {Y(0),Y(1)}

  • There is a running variable XR such that Z=I(Xx0), where x0 is a pre-determined threshold. Note that, the treatment assignment is deterministic

  • The unconfoundedness assumption holds automatically Z{Y(1),Y(0)}X

  • The overlap assumption does not hold

e(X)=pr(Z=1X)=1(Xx0)=1 or 0

Identification

  • RD can identify a local average causal effect at the cutoff point x0

  • Estimand:

τ(x0)=E{Y(1)Y(0)X=x0}.
  • Assumption 1 (continuity assumption).
    1. E{Y(1)X=x} is continuous from the right at x0
    2. E{Y(0)X=x} is continuous from the left at x0
image-20250531113432572
  • We have (continuity)E{Y(1)X=x0}=limε0+E{Y(1)X=x0+ε}(def of Z)=limε0+E{Y(1)Z=1,X=x0+ε}=limε0+E(YZ=1,X=x0+ε), Similarly, E{Y(0)X=x0}=limε0+E(YZ=0,X=x0ε) So the local average causal effect at x0 can be identified by the difference of the two limits

  • Advantage: internal validity

  • Disadvantage: external validity

Key Theorem

Theorem 1.
Assume that the treatment is determined by Z=I(Xx0) where x0 is a predetermined threshold. Assume that E{Y(1)X=x} is continuous from the right at x0 and E{Y(0)X=x} is continuous from the left at x0. Then the local average treatment effect at X=x0 is identified by τ(x0)=limε0+E(YZ=1,X=x0+ε)limε0+E(YZ=0,X=x0ε)

τ(x0) is nonparametrically identified.

Reference

Ding, P. (2024). A First Course in Causal Inference. CRC Press.

Lecture notes: Regression Discontinuity Design

Chen Xing
Chen Xing
Founder & Data Scientist

Enjoy Life & Enjoy Work!

Related