Notes on Semiparametric Models

Motivation

  • Semiparametric models contain both a finite-dimensional parameter of interest (θ) and an infinite-dimensional nuisance parameter (η).

  • The goal is to estimate θ as efficiently as possible while filtering out the impact of η.

Key Concepts

1. Tangent Space

  • The tangent space consists of all possible local perturbations of the statistical model.

  • In parametric models, these directions are given by score functions (derivatives of the log-likelihood). In semiparametric models, the tangent space is typically an infinite-dimensional subspace of L2(P) (the space of square-integrable functions).

2. Nuisance Tangent Space (Tη)

  • This is the subset of the full tangent space that corresponds to variations in the nuisance parameter η, while holding the parameter of interest θ fixed.

  • It represents all the directions in which the nuisance part of the model can change and potentially affect the estimation of θ.

3. Orthogonal Complement of the Nuisance Tangent Space (Tη)

  • Defined as:

    Tη={hL2(P):h,g=0for all gTη}

    where the inner product , is typically given by covariance (or Fisher information).

  • This space contains directions that are “free” of the influence of the nuisance parameter. In other words, any variation in this space does not get “contaminated” by changes in η.

Why It Matters?

Efficient Estimation

  • In semiparametric estimation, constructing an estimator with the smallest possible variance (i.e., achieving the efficiency bound) involves ensuring that its influence function lies in Tη.

  • The influence function describes how an estimator responds to small changes in the data distribution.

  • By projecting any candidate influence function onto Tη, one removes the component due to the nuisance parameter, yielding the efficient influence function.

Practical Implication

  • This separation allows us to focus on the parameter of interest while systematically “filtering out” nuisance effects, leading to more precise (optimal) estimators.

Summary

  • Nuisance Tangent Space (Tη): Captures all the directions of change due to the nuisance parameter η.

  • Orthogonal Complement (Tη): Contains directions free from nuisance effects, representing pure variations in θ.

  • Efficient Influence Function: By projecting onto Tη, one obtains an influence function that is optimal, meaning that the corresponding estimator achieves the semiparametric efficiency bound.

Chen Xing
Chen Xing
Founder & Data Scientist

Enjoy Life & Enjoy Work!

Related