Notes on Semiparametric Models
Motivation
-
Semiparametric models contain both a finite-dimensional parameter of interest (
) and an infinite-dimensional nuisance parameter ( ). -
The goal is to estimate
as efficiently as possible while filtering out the impact of .
Key Concepts
1. Tangent Space
-
The tangent space consists of all possible local perturbations of the statistical model.
-
In parametric models, these directions are given by score functions (derivatives of the log-likelihood). In semiparametric models, the tangent space is typically an infinite-dimensional subspace of
(the space of square-integrable functions).
2. Nuisance Tangent Space ( )
-
This is the subset of the full tangent space that corresponds to variations in the nuisance parameter
, while holding the parameter of interest fixed. -
It represents all the directions in which the nuisance part of the model can change and potentially affect the estimation of
.
3. Orthogonal Complement of the Nuisance Tangent Space ( )
-
Defined as:
where the inner product
is typically given by covariance (or Fisher information). -
This space contains directions that are “free” of the influence of the nuisance parameter. In other words, any variation in this space does not get “contaminated” by changes in
.
Why It Matters?
Efficient Estimation
-
In semiparametric estimation, constructing an estimator with the smallest possible variance (i.e., achieving the efficiency bound) involves ensuring that its influence function lies in
. -
The influence function describes how an estimator responds to small changes in the data distribution.
-
By projecting any candidate influence function onto
, one removes the component due to the nuisance parameter, yielding the efficient influence function.
Practical Implication
- This separation allows us to focus on the parameter of interest while systematically “filtering out” nuisance effects, leading to more precise (optimal) estimators.
Summary
-
Nuisance Tangent Space (
): Captures all the directions of change due to the nuisance parameter . -
Orthogonal Complement (
): Contains directions free from nuisance effects, representing pure variations in . -
Efficient Influence Function: By projecting onto
, one obtains an influence function that is optimal, meaning that the corresponding estimator achieves the semiparametric efficiency bound.