In statistics, Hodges' estimator[1] (or the Hodges–Le Cam estimator[2]), named for Joseph Hodges, is a famous counterexample of an estimator which is "superefficient",[3] i.e. it attains smaller asymptotic variance than regular efficient estimators.
The existence of such a counterexample is the reason for the introduction of the notion of regular estimators.
Hodges' estimator improves upon a regular estimator at a single point.
In general, any superefficient estimator may surpass a regular estimator at most on a set of Lebesgue measure zero.
[4] Although Hodges discovered the estimator he never published it; the first publication was in the doctoral thesis of Lucien Le Cam.
is a "common" estimator for some parameter
: it is consistent, and converges to some asymptotic distribution
(usually this is a normal distribution with mean zero and variance which may depend on
is defined as[6] This estimator is equal to
It is not difficult to see that this estimator is consistent for
Thus this estimator has the same asymptotic distribution as
the rate of convergence becomes arbitrarily fast.
This estimator is superefficient, as it surpasses the asymptotic behavior of the efficient estimator
It is not true that the Hodges estimator is equivalent to the sample mean, but much better when the true mean is 0.
The correct interpretation is that, for finite
, the truncation can lead to worse square error than the sample mean estimator for
close to 0, as is shown in the example in the following section.
[8] Le Cam shows that this behaviour is typical: superefficiency at the point θ implies the existence of a sequence
is strictly larger than the Cramér-Rao bound.
For the extreme case where the asymptotic risk at θ is zero, the
[9] In general, superefficiency may only be attained on a subset of Lebesgue measure zero of the parameter space
[10] Suppose x1, ..., xn is an independent and identically distributed (IID) random sample from normal distribution N(θ, 1) with unknown mean but known variance.
Then the common estimator for the population mean θ is the arithmetic mean of all observations:
, where 1{...} denotes the indicator function.
The mean square error (scaled by n) associated with the regular estimator x is constant and equal to 1 for all θ's.
At the same time the mean square error of the Hodges' estimator
behaves erratically in the vicinity of zero, and even becomes unbounded as n → ∞.
This demonstrates that the Hodges' estimator is not regular, and its asymptotic properties are not adequately described by limits of the form (θ fixed, n → ∞).