Notes on Control Function Method

Motivation

  • Endogeneity: When an explanatory variable (treatment) is correlated with the unobserved error term (e.g., due to omitted variables, measurement error, or simultaneity).
  • Consequence: Standard regression (e.g., OLS) yields biased estimates.
  • Goal: The control function (CF) approach “purges” endogeneity by modeling the correlation between the treatment and unobservables.

The control-function (CF) approach tackles this by explicitly modelling the source of endogeneity and then partialling it out. For linear models that modelling step turns out to be algebraically equivalent to 2SLS, but the real power of CF is that it extends seamlessly to nonlinear or limited‐dependent‐variable settings where 2SLS cannot be applied directly.

Isolate the problematic part of the error term correlated with treatment, then add it as a control variable to ’neutralize’ endogeneity.

How It Works

Outcome model: Y=β0+β1T+β2X+U

  • T: Endogenous treatment
  • X: Exogenous controls
  • U: Unobservables (correlated with T).

First-stage model: T=γ0+γ1Z+γ2X+V

  • Z: Instrumental variable (IV)
  • V: First-stage error.

CF Insight:

If U and V are correlated (i.e., Cov(U,V)0), we can decompose U into:
U=ρV+ϵ where ϵ is uncorrelated with T and V (by construction).

Estimation

  1. First Stage:

    • Regress T on Z and X:
      T^=γ^0+γ^1Z+γ^2X
    • Obtain the residual: V^=TT^.
  2. Second Stage:

    • Add V^ to the outcome model:
      Y=β0+β1T+β2X+ρV^+ϵ
    • Estimate via OLS.

Why this works:

  • V^ “controls for” the part of U correlated with T.
  • Once V^ is included, T becomes exogenous in the modified model (Cov(T,ϵ)=0).
  • β^1 is consistent for the causal effect.

Key Assumptions

  1. Instrument Validity:

    • Z is relevant: Cov(Z,T)0 (strong first stage).
    • Z is exogenous: Cov(Z,U)=0.
  2. Correct Functional Form:

    • Linearity in the first stage and control function (e.g., U=ρV+ϵ).
  3. Exclusion Restriction: Z affects Y only through T.

CF vs. Other Methods

Method Key Difference
2SLS Uses fitted values (T^); efficient but inconsistent under heteroscedasticity/nonlinearity.
Control Function Uses residuals (V^); more flexible for nonlinear models (e.g., probit).

Advantage of CF:

  • Directly models the endogeneity structure (via V^).
  • Extends to non-additive errors, discrete outcomes, and heteroscedastic settings.

Example

Problem: Estimate returns to education (T) on wages (Y), where ability (U) is unobserved and correlated with education.

  • IV: Distance to college (Z).

  • Steps:

    1. Regress: education distance + controls → get residuals V^.

    2. Regress: wages education + controls + V^.

  • Result: Coefficient on education is causal.

Generalization

Setting Control-function term Typical reference
Binary/probit Y with endogenous T Include h(v^) where h() is the generalized residual (Smith & Blundell, 1986) Rivers & Vuong (1988)
Heckman sample selection Inverse Mills ratio λ(v^) controls for non-random sample entry Heckman (1979)
Count models (Poisson, NB) Nonparametric sieve h(v^) or parametric polynomial Wooldridge (2015)
Semiparametric/ML partially-linear Learnt nuisance m(X); CF gives Neyman-orthogonal moment for Double ML Chernozhukov et al. (2018)

The principle is identical: obtain a residual that captures unobserved heterogeneity driving T, then include it (or a flexible transformation) in the structural equation.

Why This Matters

  • CF is essential when:

    • You have a valid IV and suspect omitted variable bias.
    • You work with nonlinear models (e.g., binary/duration outcomes).
  • Software Implementation:

    • Stata: ivregress 2sls (equivalent to CF in linear cases) or cmp for nonlinear.
    • R: ivreg (linear), controlfunction package.

Critical Caveats

  • Weak Instruments: If Z is weak, V^ is noisy → bias.
  • Functional Form Misspecification: If UρV+ϵ, CF fails.
  • No Magic Bullet: Validity of Z is untestable and must be justified theoretically.

Bottom Line

The CF approach harnesses IV residuals to “control” for endogeneity, converting an endogenous variable into a conditionally exogenous one. It’s a blend of IV intuition and regression control — powerful when assumptions hold.

Chen Xing
Chen Xing
Founder & Data Scientist

Enjoy Life & Enjoy Work!

Related