Big Picture of Debiased Machine Learning
Debiased machine learning (DML) is a generic recipe. The idea is to add a correction term to the plug-in estimator of the functional, which yields properties such as semi-parametric efficiency, double robustness, and Neyman orthogonality.

(Auto-)DML is a Method-of-Moments estimator
- built on debiased/orthogonal moment scores
Why it matters?
- targets model-selection and regularization bias from ML nuisance learners (e.g. Lasso)
- Neyman orthogonality: the parameter of interest is insensitive to first-order perturbations of the nuisance estimates
- double robustness
- asymptotic normality
Key Idea: Debiasing is achieved by adding a correction term to the plug-in estimator of the functional
Three representations:
θ0 = E[m(W, γ0)] = E[α0(D, X) γ0(D, X)] = E[α0(D, X) Y], where γ0(D, X) = E[Y | D, X] is the outcome regression; α0 is the Riesz representer (RR); m is a continuous linear functional of γ; W = (Y, D, X) is the data, containing treatment D and covariates X; Y is the observed outcome
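A minimal numeric sketch of the three representations for the ATE functional, computed with the true nuisances on simulated data (the data-generating process and all names below are illustrative choices, not from the source):

```python
# Three representations of theta0 = E[gamma0(1, X) - gamma0(0, X)] (ATE),
# evaluated with the TRUE nuisances on simulated data.
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
X = rng.normal(size=n)
pi = 1 / (1 + np.exp(-X))             # true propensity P(D=1 | X)
D = rng.binomial(1, pi)
gamma0 = lambda d, x: 2.0 * d + x     # true outcome regression E[Y | D, X]
Y = gamma0(D, X) + rng.normal(size=n)

# Representation 1: E[m(W, gamma0)] with m(W, g) = g(1, X) - g(0, X)
rep1 = np.mean(gamma0(1, X) - gamma0(0, X))

# Representation 2: E[alpha0(D, X) * gamma0(D, X)], where
# alpha0 = D/pi(X) - (1-D)/(1-pi(X)) is the Riesz representer
alpha0 = D / pi - (1 - D) / (1 - pi)
rep2 = np.mean(alpha0 * gamma0(D, X))

# Representation 3: E[alpha0(D, X) * Y]
rep3 = np.mean(alpha0 * Y)

print(rep1, rep2, rep3)  # all three estimate theta0 = 2
```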
Correct the residual using the RR:
θ̂ = (1/n) Σᵢ [ m(Wᵢ, γ̂) + α̂(Dᵢ, Xᵢ) (Yᵢ − γ̂(Dᵢ, Xᵢ)) ]
i.e. the plug-in term plus an RR-weighted residual correction
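The residual correction can be sketched numerically: with a deliberately biased outcome model but a correct RR, the correction term restores the target value (the data-generating process and the biased gamma_hat are our illustrative choices):

```python
# Sketch: the RR residual correction repairs a biased plug-in estimate.
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
X = rng.normal(size=n)
pi = 1 / (1 + np.exp(-X))
D = rng.binomial(1, pi)
Y = 2.0 * D + X + rng.normal(size=n)      # true ATE theta0 = 2

gamma_hat = lambda d, x: 1.2 * d + x      # biased outcome model (effect 1.2, not 2)
alpha_hat = D / pi - (1 - D) / (1 - pi)   # correct Riesz representer

plug_in = np.mean(gamma_hat(1, X) - gamma_hat(0, X))             # ~1.2, biased
debiased = plug_in + np.mean(alpha_hat * (Y - gamma_hat(D, X)))  # ~2.0
print(plug_in, debiased)
```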
How to construct the orthogonal moment function?

orthogonal moment function = identifying moment function + first-step influence function (FSIF)
identifying moment function:
- involves the outcome regression: m(W, γ) − θ
FSIF:
- corrects the residual using the Riesz representer (RR): α(D, X) (Y − γ(D, X))
- in the case of the ATE with binary treatment, the RR is the inverse-propensity-score term α0(D, X) = D/π(X) − (1 − D)/(1 − π(X))
- the RR can be characterized automatically; NO NEED to know its analytical form
- can use random forest and NNet learners of the RR
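One standard way the RR can be characterized "automatically" (in the spirit of auto-DML) is to minimize the Riesz loss E[α(W)² − 2 m(W, α)] over a dictionary of basis functions, which never uses the analytical form of α0. A toy sketch with a constant propensity of 0.3 so the truth is checkable; the dictionary and all names are illustrative assumptions:

```python
# Automatic Riesz representer estimation over a linear dictionary b(d, x):
# minimize rho' G rho - 2 rho' M, whose solution is rho = G^{-1} M.
# Toy truth: constant propensity 0.3, so alpha0(D) = D/0.3 - (1-D)/0.7.
import numpy as np

rng = np.random.default_rng(2)
n = 100_000
X = rng.normal(size=n)
D = rng.binomial(1, 0.3, size=n)

def b(d, x):
    """Illustrative dictionary of basis functions: [1, d, x]."""
    d = np.broadcast_to(d, x.shape)
    return np.column_stack([np.ones_like(x), d, x])

B = b(D, X)
# For the ATE, m(W, b) = b(1, X) - b(0, X)
G = B.T @ B / n                           # sample E[b b']
M = (b(1, X) - b(0, X)).mean(axis=0)      # sample E[m(W, b)]
rho = np.linalg.solve(G, M)

alpha_hat = B @ rho
print(rho)  # recovers alpha0: rho[0] ≈ -1/0.7, rho[0]+rho[1] ≈ 1/0.3, rho[2] ≈ 0
```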
Double Robustness

The score is zero in expectation when either the outcome regression γ or the Riesz representer α (e.g. the propensity model) is correctly specified
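The double robustness of the ATE score can be checked numerically: its mean stays near zero when either nuisance is wrong, but not when both are (the simulation design and the misspecified nuisances are our illustrative choices):

```python
# Double robustness of psi = gamma(1,X) - gamma(0,X) - theta0
#                            + alpha(D,X) * (Y - gamma(D,X)).
import numpy as np

rng = np.random.default_rng(3)
n = 200_000
X = rng.normal(size=n)
pi = 1 / (1 + np.exp(-X))
D = rng.binomial(1, pi)
Y = 2.0 * D + X + rng.normal(size=n)
theta0 = 2.0

gamma_true = lambda d, x: 2.0 * d + x
gamma_bad = lambda d, x: 0.5 * d           # misspecified outcome model
alpha_true = D / pi - (1 - D) / (1 - pi)
alpha_bad = D / 0.5 - (1 - D) / 0.5        # wrong (constant) propensity

def score(gamma, alpha):
    return gamma(1, X) - gamma(0, X) - theta0 + alpha * (Y - gamma(D, X))

s1 = score(gamma_bad, alpha_true).mean()   # ~0: correct alpha rescues gamma
s2 = score(gamma_true, alpha_bad).mean()   # ~0: correct gamma rescues alpha
s3 = score(gamma_bad, alpha_bad).mean()    # biased: both nuisances wrong
print(s1, s2, s3)
```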
Cross-fitting
- Why it matters?
- reduces overfitting bias: nuisance functions are fit on folds that exclude the observations being scored
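A sketch of cross-fitting for the ATE score: each fold is scored with nuisances fit only on the other fold. Simple sklearn learners stand in for the ML learners mentioned above; the data-generating process and all names are illustrative:

```python
# Cross-fitted debiased ATE estimate: nuisances for each fold are trained
# on the complementary fold, so no observation is scored by a model that saw it.
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression
from sklearn.model_selection import KFold

rng = np.random.default_rng(4)
n = 20_000
X = rng.normal(size=(n, 1))
pi = 1 / (1 + np.exp(-X[:, 0]))
D = rng.binomial(1, pi)
Y = 2.0 * D + X[:, 0] + rng.normal(size=n)   # true ATE = 2

XD = np.column_stack([X, D])
psi = np.empty(n)
for train, test in KFold(n_splits=2, shuffle=True, random_state=0).split(X):
    # fit nuisances on the training fold only
    out = LinearRegression().fit(XD[train], Y[train])
    prop = LogisticRegression().fit(X[train], D[train])
    # score the held-out fold
    p = np.clip(prop.predict_proba(X[test])[:, 1], 0.01, 0.99)
    g1 = out.predict(np.column_stack([X[test], np.ones(len(test))]))
    g0 = out.predict(np.column_stack([X[test], np.zeros(len(test))]))
    gD = out.predict(XD[test])
    alpha = D[test] / p - (1 - D[test]) / (1 - p)
    psi[test] = g1 - g0 + alpha * (Y[test] - gD)

theta_hat = psi.mean()
se = psi.std() / np.sqrt(n)
print(theta_hat, se)
```

The fold-wise scores are averaged at the end, and their standard deviation gives a plug-in standard error, which is where the asymptotic normality noted above pays off.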