In statistics, the jackknife (jackknife cross-validation) is a cross-validation technique and, therefore, a form of resampling. It is especially useful for bias and variance estimation. The jackknife pre-dates other common resampling methods such as the bootstrap. Given a sample of size $n$ , a jackknife estimator can be built by aggregating the parameter estimates from each subsample of size $(n-1)$ obtained by omitting one observation.^[1]

The jackknife technique was developed by Maurice Quenouille (1924–1973) from 1949 and refined in 1956. John Tukey expanded on the technique in 1958 and proposed the name "jackknife" because, like a physical jack-knife (a compact folding knife), it is a rough-and-ready tool that can improvise a solution for a variety of problems even though specific problems may be more efficiently solved with a purpose-designed tool.^[2]

The jackknife is a linear approximation of the bootstrap.^[2]

A simple example: mean estimation

The jackknife estimator of a parameter is found by systematically leaving out each observation from a dataset and calculating the parameter estimate over the remaining observations and then aggregating these calculations.

For example, if the parameter to be estimated is the population mean of random variable $x$ , then for a given set of i.i.d. observations ${\displaystyle x_{1},...,x_{n))$ the natural estimator is the sample mean:

{\bar {x))={\frac {1}{n))\sum _{i=1}^{n}x_{i}={\frac {1}{n))\sum _{i\in [n]}x_{i},

where the last sum used another way to indicate that the index $i$ runs over the set ${\displaystyle [n]=\{1,\ldots ,n\))$ .

Then we proceed as follows: For each $i\in [n]$ we compute the mean ${\displaystyle {\bar {x))_{(i)))$ of the jackknife subsample consisting of all but the $i$ -th data point, and this is called the $i$ -th jackknife replicate:

{\bar {x))_{(i)}={\frac {1}{n-1))\sum _{j\in [n],j\neq i}x_{j},\quad \quad i=1,\dots ,n.

It could help to think that these $n$ jackknife replicates ${\displaystyle {\bar {x))_{(1)},\ldots ,{\bar {x))_{(n)))$ give us an approximation of the distribution of the sample mean ${\bar {x))$ and the larger the $n$ the better this approximation will be. Then finally to get the jackknife estimator we take the average of these $n$ jackknife replicates:

{\bar {x))_{\mathrm {jack} }={\frac {1}{n))\sum _{i=1}^{n}{\bar {x))_{(i)}.

One may ask about the bias and the variance of ${\displaystyle {\bar {x))_{\mathrm {jack} ))$ . From the definition of ${\displaystyle {\bar {x))_{\mathrm {jack} ))$ as the average of the jackknife replicates one could try to calculate explicitly, and the bias is a trivial calculation but the variance of ${\displaystyle {\bar {x))_{\mathrm {jack} ))$ is more involved since the jackknife replicates are not independent.

For the special case of the mean, one can show explicitly that the jackknife estimate equals the usual estimate:

{\frac {1}{n))\sum _{i=1}^{n}{\bar {x))_{(i)}={\bar {x)).

This establishes the identity ${\bar {x))_{\mathrm {jack} }={\bar {x))$ . Then taking expectations we get $E[{\bar {x))_{\mathrm {jack} }]=E[{\bar {x))]=E[x]$ , so ${\displaystyle {\bar {x))_{\mathrm {jack} ))$ is unbiased, while taking variance we get $V[{\bar {x))_{\mathrm {jack} }]=V[{\bar {x))]=V[x]/n$ . However, these properties do not generally hold for parameters other than the mean.

This simple example for the case of mean estimation is just to illustrate the construction of a jackknife estimator, while the real subtleties (and the usefulness) emerge for the case of estimating other parameters, such as higher moments than the mean or other functionals of the distribution.

${\displaystyle {\bar {x))_{\mathrm {jack} ))$ could be used to construct an empirical estimate of the bias of ${\bar {x))$ , namely ${\widehat {\operatorname {bias} ))({\bar {x)))_{\mathrm {jack} }=c({\bar {x))_{\mathrm {jack} }-{\bar {x)))$ with some suitable factor $c>0$ , although in this case we know that ${\bar {x))_{\mathrm {jack} }={\bar {x))$ so this construction does not add any meaningful knowledge, but it gives the correct estimation of the bias (which is zero).

A jackknife estimate of the variance of ${\bar {x))$ can be calculated from the variance of the jackknife replicates ${\displaystyle {\bar {x))_{(i)))$ :^[3]^[4]

{\widehat {\operatorname {var} ))({\bar {x)))_{\mathrm {jack} }={\frac {n-1}{n))\sum _{i=1}^{n}({\bar {x))_{(i)}-{\bar {x))_{\mathrm {jack} })^{2}={\frac {1}{n(n-1)))\sum _{i=1}^{n}(x_{i}-{\bar {x)))^{2}.

The left equality defines the estimator ${\displaystyle {\widehat {\operatorname {var} ))({\bar {x)))_{\mathrm {jack} ))$ and the right equality is an identity that can be verified directly. Then taking expectations we get $E[{\widehat {\operatorname {var} ))({\bar {x)))_{\mathrm {jack} }]=V[x]/n=V[{\bar {x))]$ , so this is an unbiased estimator of the variance of ${\bar {x))$ .

Estimating the bias of an estimator

The jackknife technique can be used to estimate (and correct) the bias of an estimator calculated over the entire sample.

Suppose $\theta$ is the target parameter of interest, which is assumed to be some functional of the distribution of $x$ . Based on a finite set of observations ${\displaystyle x_{1},...,x_{n))$ , which is assumed to consist of i.i.d. copies of $x$ , the estimator ${\hat {\theta ))$ is constructed:

{\hat {\theta ))=f_{n}(x_{1},\ldots ,x_{n}).

The value of ${\hat {\theta ))$ is sample-dependent, so this value will change from one random sample to another.

By definition, the bias of ${\hat {\theta ))$ is as follows:

{\text{bias))({\hat {\theta )))=E[{\hat {\theta ))]-\theta .

One may wish to compute several values of ${\hat {\theta ))$ from several samples, and average them, to calculate an empirical approximation of $E[{\hat {\theta ))]$ , but this is impossible when there are no "other samples" when the entire set of available observations ${\displaystyle x_{1},...,x_{n))$ was used to calculate ${\hat {\theta ))$ . In this kind of situation the jackknife resampling technique may be of help.

We construct the jackknife replicates:

{\hat {\theta ))_{(1)}=f_{n-1}(x_{2},x_{3}\ldots ,x_{n})

{\hat {\theta ))_{(2)}=f_{n-1}(x_{1},x_{3},\ldots ,x_{n})

\vdots

{\hat {\theta ))_{(n)}=f_{n-1}(x_{1},x_{2},\ldots ,x_{n-1})

where each replicate is a "leave-one-out" estimate based on the jackknife subsample consisting of all but one of the data points:

{\hat {\theta ))_{(i)}=f_{n-1}(x_{1},\ldots ,x_{i-1},x_{i+1},\ldots ,x_{n})\quad \quad i=1,\dots ,n.

Then we define their average:

{\displaystyle {\hat {\theta ))_{\mathrm {jack} }={\frac {1}{n))\sum _{i=1}^{n}{\hat {\theta ))_{(i)))

The jackknife estimate of the bias of ${\hat {\theta ))$ is given by:

{\widehat {\text{bias))}({\hat {\theta )))_{\mathrm {jack} }=(n-1)({\hat {\theta ))_{\mathrm {jack} }-{\hat {\theta )))

and the resulting bias-corrected jackknife estimate of $\theta$ is given by:

{\hat {\theta ))_{\text{jack))^{*}={\hat {\theta ))-{\widehat {\text{bias))}({\hat {\theta )))_{\mathrm {jack} }=n{\hat {\theta ))-(n-1){\hat {\theta ))_{\mathrm {jack} }.

This removes the bias in the special case that the bias is $O(n^{-1})$ and reduces it to $O(n^{-2})$ in other cases.^[2]

Estimating the variance of an estimator

The jackknife technique can be also used to estimate the variance of an estimator calculated over the entire sample.

Literature

Notes

References

Cameron, Adrian; Trivedi, Pravin K. (2005). Microeconometrics : methods and applications. Cambridge New York: Cambridge University Press. ISBN 9780521848053.
Efron, Bradley; Stein, Charles (May 1981). "The Jackknife Estimate of Variance". The Annals of Statistics. 9 (3): 586–596. doi:10.1214/aos/1176345462. JSTOR 2240822.
Efron, Bradley (1982). The jackknife, the bootstrap, and other resampling plans. Philadelphia, PA: Society for Industrial and Applied Mathematics. ISBN 9781611970319.
Quenouille, Maurice H. (September 1949). "Problems in Plane Sampling". The Annals of Mathematical Statistics. 20 (3): 355–375. doi:10.1214/aoms/1177729989. JSTOR 2236533.
Quenouille, Maurice H. (1956). "Notes on Bias in Estimation". Biometrika. 43 (3–4): 353–360. doi:10.1093/biomet/43.3-4.353. JSTOR 2332914.
Tukey, John W. (1958). "Bias and confidence in not quite large samples (abstract)". The Annals of Mathematical Statistics. 29 (2): 614. doi:10.1214/aoms/1177706647.

Statistics

Descriptive statistics

Continuous data

Center	Mean Arithmetic Arithmetic-Geometric Cubic Generalized/power Geometric Harmonic Heronian Heinz Lehmer Median Mode
Dispersion	Average absolute deviation Coefficient of variation Interquartile range Percentile Range Standard deviation Variance
Shape	Central limit theorem Moments Kurtosis L-moments Skewness

Count data

Index of dispersion

Summary tables

Dependence

Graphics

Data collection

Study design	Effect size Missing data Optimal design Population Replication Sample size determination Statistic Statistical power
Survey methodology	Sampling Cluster Stratified Opinion poll Questionnaire Standard error
Controlled experiments	Blocking Factorial experiment Interaction Random assignment Randomized controlled trial Randomized experiment Scientific control
Adaptive designs	Adaptive clinical trial Stochastic approximation Up-and-down designs
Observational studies	Cohort study Cross-sectional study Natural experiment Quasi-experiment

Statistical inference

Statistical theory

Frequentist inference

Point estimation	Estimating equations Maximum likelihood Method of moments M-estimator Minimum distance Unbiased estimators Mean-unbiased minimum-variance Rao–Blackwellization Lehmann–Scheffé theorem Median unbiased Plug-in
Interval estimation	Confidence interval Pivot Likelihood interval Prediction interval Tolerance interval Resampling Bootstrap Jackknife
Testing hypotheses	1- & 2-tails Power Uniformly most powerful test Permutation test Randomization test Multiple comparisons
Parametric tests	Likelihood-ratio Score/Lagrange multiplier Wald

Specific tests

Z-test (normal) Student's t-test F-test
Goodness of fit	Chi-squared G-test Kolmogorov–Smirnov Anderson–Darling Lilliefors Jarque–Bera Normality (Shapiro–Wilk) Likelihood-ratio test Model selection Cross validation AIC BIC
Rank statistics	Sign Sample median Signed rank (Wilcoxon) Hodges–Lehmann estimator Rank sum (Mann–Whitney) Nonparametric anova 1-way (Kruskal–Wallis) 2-way (Friedman) Ordered alternative (Jonckheere–Terpstra) Van der Waerden test

Bayesian inference

Correlation	Pearson product-moment Partial correlation Confounding variable Coefficient of determination
Regression analysis	Errors and residuals Regression validation Mixed effects models Simultaneous equations models Multivariate adaptive regression splines (MARS)
Linear regression	Simple linear regression Ordinary least squares General linear model Bayesian regression
Non-standard predictors	Nonlinear regression Nonparametric Semiparametric Isotonic Robust Heteroscedasticity Homoscedasticity
Generalized linear model	Exponential families Logistic (Bernoulli) / Binomial / Poisson regressions
Partition of variance	Analysis of variance (ANOVA, anova) Analysis of covariance Multivariate ANOVA Degrees of freedom

Categorical / Multivariate / Time-series / Survival analysis

Categorical

Multivariate

Time-series

General	Decomposition Trend Stationarity Seasonal adjustment Exponential smoothing Cointegration Structural break Granger causality
Specific tests	Dickey–Fuller Johansen Q-statistic (Ljung–Box) Durbin–Watson Breusch–Godfrey
Time domain	Autocorrelation (ACF) partial (PACF) Cross-correlation (XCF) ARMA model ARIMA model (Box–Jenkins) Autoregressive conditional heteroskedasticity (ARCH) Vector autoregression (VAR)
Frequency domain	Spectral density estimation Fourier analysis Least-squares spectral analysis Wavelet Whittle likelihood

Survival

Survival function	Kaplan–Meier estimator (product limit) Proportional hazards models Accelerated failure time (AFT) model First hitting time
Hazard function	Nelson–Aalen estimator
Test	Log-rank test

Applications

Biostatistics	Bioinformatics Clinical trials / studies Epidemiology Medical statistics
Engineering statistics	Chemometrics Methods engineering Probabilistic design Process / quality control Reliability System identification
Social statistics	Actuarial science Census Crime statistics Demography Econometrics Jurimetrics National accounts Official statistics Population statistics Psychometrics
Spatial statistics	Cartography Environmental statistics Geographic information system Geostatistics Kriging