Notes on causal inference with No Overlap – Regression Discontinuity

May 31, 2025 2 min read causal inference, econometrics, quantitative-marketing

Here are my notes on regression discontinuity, based on Prof. Ding’s textbook (2024) and Prof. Imai’s lecture notes.

Motivation

We often cannot run a randomized experiment, so we have to find or design an observational study in which credible causal inference is possible.

The key is knowledge of the treatment assignment mechanism.

Regression discontinuity design (RD Design):

RD Design is a simple and widely used quasi-experimental design. The term “quasi-experimental” emphasizes that these approaches are still framed using concepts from randomized experiments but require econometric innovations to compensate for the lack of random treatment assignment.

Sharp RD Design: treatment assignment is based on a deterministic rule (i.e. we have full knowledge of how treatment is assigned)
Fuzzy RD Design: encouragement to receive treatment is based on a deterministic rule

Setting

Binary treatment $Z \in \{0, 1\}$
Potential outcomes $\{Y(0), Y(1)\}$
There is a running variable $X \in \mathbb{R}$ such that $Z = I(X \geq x_{0})$, where $x_{0}$ is a predetermined threshold. Note that the treatment assignment is deterministic given $X$.
The unconfoundedness assumption holds automatically, because $Z$ is a deterministic function of $X$: $Z \perp\!\!\!\perp \{Y(1), Y(0)\} \mid X$
The overlap assumption does not hold, because the propensity score is always 0 or 1:

$$e(X) = \mathrm{pr}(Z = 1 \mid X) = I(X \geq x_{0}) \in \{0, 1\}$$
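
To make the lack of overlap concrete, here is a minimal simulation sketch. The cutoff value, sample size, seed, and the uniform distribution of the running variable are all assumptions chosen for illustration; only the assignment rule $Z = I(X \geq x_{0})$ comes from the setting above.

```python
import numpy as np

rng = np.random.default_rng(0)          # arbitrary seed (assumption)
x0 = 0.0                                # hypothetical cutoff
X = rng.uniform(-1.0, 1.0, size=1000)   # hypothetical running variable
Z = (X >= x0).astype(int)               # sharp RD rule: Z = I(X >= x0)

# Empirical share of treated units on each side of the cutoff.
share_treated_below = Z[X < x0].mean()
share_treated_above = Z[X >= x0].mean()
print(share_treated_below, share_treated_above)
```

Every unit below the cutoff is untreated and every unit at or above it is treated, so no value of $X$ has both treated and control units.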

Identification

RD can identify a local average causal effect at the cutoff point $x_{0}$
Estimand:

$$\tau(x_{0}) = E\{Y(1) - Y(0) \mid X = x_{0}\}.$$

Assumption 1 (continuity assumption).

1. $E\{Y(1) \mid X = x\}$ is continuous from the right at $x_{0}$
2. $E\{Y(0) \mid X = x\}$ is continuous from the left at $x_{0}$

We have

$$\begin{aligned}
E\{Y(1) \mid X = x_{0}\} &= \lim_{\varepsilon \to 0+} E\{Y(1) \mid X = x_{0} + \varepsilon\} && \text{(continuity)} \\
&= \lim_{\varepsilon \to 0+} E\{Y(1) \mid Z = 1, X = x_{0} + \varepsilon\} && \text{(definition of } Z\text{)} \\
&= \lim_{\varepsilon \to 0+} E(Y \mid Z = 1, X = x_{0} + \varepsilon).
\end{aligned}$$

Similarly,

$$E\{Y(0) \mid X = x_{0}\} = \lim_{\varepsilon \to 0+} E(Y \mid Z = 0, X = x_{0} - \varepsilon).$$

So the local average causal effect at $x_{0}$ can be identified by the difference of the two limits.
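Advantage: strong internal validity, because the treatment assignment rule is fully known.
Disadvantage: limited external validity, because the effect is identified only at the cutoff $x_{0}$.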
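
The two one-sided limits can be approximated in a finite sample by fitting a line on each side of the cutoff and comparing the fitted values at $x_{0}$. The sketch below is one common way to do this (local linear regression with a uniform kernel); it is not taken from the textbook or the lecture notes. The function name `rd_local_linear`, the bandwidth `h`, and the simulated data-generating process are all hypothetical choices for illustration.

```python
import numpy as np

def rd_local_linear(x, y, x0, h):
    """Difference of two one-sided linear fits at x0, using points within h of x0."""
    def intercept_at_cutoff(mask):
        # Regress y on (1, x - x0); the intercept estimates the one-sided limit at x0.
        design = np.column_stack([np.ones(mask.sum()), x[mask] - x0])
        coef, *_ = np.linalg.lstsq(design, y[mask], rcond=None)
        return coef[0]

    right = (x >= x0) & (x <= x0 + h)   # treated side, Z = 1
    left = (x < x0) & (x >= x0 - h)     # control side, Z = 0
    return intercept_at_cutoff(right) - intercept_at_cutoff(left)

# Simulated data (assumption): linear outcome model with a jump of 2.0 at the cutoff.
rng = np.random.default_rng(1)
n, x0, true_tau = 5000, 0.0, 2.0
x = rng.uniform(-1.0, 1.0, n)
z = (x >= x0).astype(float)
y = 1.0 + 0.5 * x + true_tau * z + rng.normal(0.0, 0.5, n)

tau_hat = rd_local_linear(x, y, x0, h=0.3)
print(tau_hat)  # should be close to true_tau, up to sampling noise
```

The bandwidth trades bias against variance: a smaller `h` uses only points near the cutoff but leaves fewer observations. In practice it is usually chosen by a data-driven rule rather than fixed by hand as it is here.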

Key Theorem

Theorem 1.

Assume that the treatment is determined by $Z = I(X \geq x_{0})$, where $x_{0}$ is a predetermined threshold. Assume that $E\{Y(1) \mid X = x\}$ is continuous from the right at $x_{0}$ and $E\{Y(0) \mid X = x\}$ is continuous from the left at $x_{0}$. Then the local average treatment effect at $X = x_{0}$ is identified by

$$\tau(x_{0}) = \lim_{\varepsilon \to 0+} E(Y \mid Z = 1, X = x_{0} + \varepsilon) - \lim_{\varepsilon \to 0+} E(Y \mid Z = 0, X = x_{0} - \varepsilon)$$

Because no functional form is assumed for the conditional means, $\tau(x_{0})$ is nonparametrically identified.
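
As a hypothetical worked case, suppose the conditional means happen to be linear in the running variable. The linear form and the symbols $\alpha_{z}, \beta_{z}$ are assumptions for illustration only; the theorem itself does not require them.

$$\begin{aligned}
E\{Y(1) \mid X = x\} &= \alpha_{1} + \beta_{1} x, \qquad E\{Y(0) \mid X = x\} = \alpha_{0} + \beta_{0} x, \\
\lim_{\varepsilon \to 0+} E(Y \mid Z = 1, X = x_{0} + \varepsilon) &= \alpha_{1} + \beta_{1} x_{0}, \\
\lim_{\varepsilon \to 0+} E(Y \mid Z = 0, X = x_{0} - \varepsilon) &= \alpha_{0} + \beta_{0} x_{0}, \\
\tau(x_{0}) &= (\alpha_{1} - \alpha_{0}) + (\beta_{1} - \beta_{0}) x_{0}.
\end{aligned}$$

In this special case the effect at the cutoff is the vertical gap between the two regression lines evaluated at $x_{0}$.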

Reference

Ding, P. (2024). A First Course in Causal Inference. CRC Press.

Prof. Imai’s lecture notes: Regression Discontinuity Design


Chen Xing

Founder & Data Scientist
