In statistics, a pivotal quantity or pivot is a function of observations and unobservable parameters such that the function's probability distribution does not depend on the unknown parameters (including nuisance parameters).^[1] A pivot need not be a statistic — the function and its 'value' can depend on the parameters of the model, but its 'distribution' must not. If it is a statistic, then it is known as an 'ancillary statistic'.

More formally,^[2] let $X=(X_{1},X_{2},\ldots ,X_{n})$ be a random sample from a distribution that depends on a parameter (or vector of parameters) $\theta$ . Let $g(X,\theta )$ be a random variable whose distribution is the same for all $\theta$ . Then $g$ is called a 'pivotal quantity' (or simply a 'pivot').

Pivotal quantities are commonly used for normalization to allow data from different data sets to be compared. It is relatively easy to construct pivots for location and scale parameters: for the former we form differences so that location cancels, for the latter ratios so that scale cancels.
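The cancellation described above can be illustrated with a minimal Python sketch. It assumes a location-scale model in which each observation is $\mu + \sigma Z_i$ with standard normal noise $Z_i$; the function names `sample`, `location_pivot` and `scale_pivot` are hypothetical and chosen only for this illustration.

```python
import random
import statistics

def sample(mu, sigma, n, seed):
    """Draw n values mu + sigma * Z_i, where Z_i is standard normal noise."""
    rng = random.Random(seed)
    return [mu + sigma * rng.gauss(0.0, 1.0) for _ in range(n)]

def location_pivot(xs, mu):
    # Difference: the location mu cancels (the scale is held fixed here).
    return statistics.fmean(xs) - mu

def scale_pivot(xs, sigma):
    # Ratio: the scale sigma cancels, and so does the location.
    return statistics.stdev(xs) / sigma

# Same underlying noise (same seed), different parameter values.
a = sample(mu=0.0, sigma=1.0, n=20, seed=1)
b = sample(mu=50.0, sigma=1.0, n=20, seed=1)
c = sample(mu=50.0, sigma=7.0, n=20, seed=1)

# Each pair agrees up to floating-point rounding.
print(location_pivot(a, 0.0), location_pivot(b, 50.0))
print(scale_pivot(a, 1.0), scale_pivot(c, 7.0))
```

Reusing the seed makes the point directly: once the parameters are subtracted or divided out, only the noise remains, so the resulting quantity cannot carry information about the parameter that cancelled.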

Pivotal quantities are fundamental to the construction of test statistics, because they allow the distribution of the statistic to be free of unknown parameters; for example, Student's t-statistic is used for a normal distribution with unknown variance (and mean). They also provide one method of constructing confidence intervals, and their use improves the performance of the bootstrap. In the form of ancillary statistics, they can be used to construct frequentist prediction intervals (predictive confidence intervals).

Examples

Normal distribution

One of the simplest pivotal quantities is the z-score. Given a normal distribution with mean $\mu$ and variance $\sigma ^{2}$, and an observation 'x', the z-score:

z={\frac {x-\mu }{\sigma }},

has distribution $N(0,1)$ – a normal distribution with mean 0 and variance 1. Similarly, since the 'n'-sample sample mean has sampling distribution $N(\mu ,\sigma ^{2}/n)$ , the z-score of the mean

z={\frac {{\overline {X}}-\mu }{\sigma /{\sqrt {n}}}}

also has distribution $N(0,1).$ Note that these functions depend on the parameters, so they can be computed only if the parameters are known (they are not statistics); their distribution, however, does not depend on the parameters.
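As a rough numerical check of this claim, the following Python sketch simulates the z-score of the mean under several parameter settings and reports its empirical mean and variance. The parameter values, sample size and function names are arbitrary choices made for illustration only.

```python
import math
import random
import statistics

def z_of_mean(mu, sigma, n, rng):
    """z-score of the mean of n draws from N(mu, sigma^2)."""
    xs = [rng.gauss(mu, sigma) for _ in range(n)]
    return (statistics.fmean(xs) - mu) / (sigma / math.sqrt(n))

def simulate(mu, sigma, n=5, reps=20000, seed=0):
    """Empirical mean and variance of the z-score over many replications."""
    rng = random.Random(seed)
    zs = [z_of_mean(mu, sigma, n, rng) for _ in range(reps)]
    return statistics.fmean(zs), statistics.pvariance(zs)

# Arbitrary illustrative parameter settings, each with its own seed.
settings = [(0.0, 1.0), (10.0, 0.5), (-3.0, 20.0)]
for seed, (mu, sigma) in enumerate(settings):
    mean_z, var_z = simulate(mu, sigma, seed=seed)
    print(f"mu={mu}, sigma={sigma}: mean {mean_z:.3f}, variance {var_z:.3f}")
```

Under the stated normal model, every setting should report a mean close to 0 and a variance close to 1, up to simulation error.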

Given $n$ independent, identically distributed (i.i.d.) observations $X=(X_{1},X_{2},\ldots ,X_{n})$ from the normal distribution with unknown mean $\mu$ and variance $\sigma ^{2}$, a pivotal quantity can be obtained from the function:

g(x,X)={\frac {x-{\overline {X}}}{s/{\sqrt {n}}}}

where

{\overline {X}}={\frac {1}{n}}\sum _{i=1}^{n}{X_{i}}

and

s^{2}={\frac {1}{n-1}}\sum _{i=1}^{n}{(X_{i}-{\overline {X}})^{2}}

are unbiased estimates of $\mu$ and $\sigma ^{2}$, respectively. The function $g(x,X)$ is the Student's t-statistic for a new value $x$, to be drawn from the same population as the already observed set of values $X$.

Using $x=\mu$, the function $g(\mu ,X)$ becomes a pivotal quantity, which follows Student's t-distribution with $\nu =n-1$ degrees of freedom. As required, even though $\mu$ appears as an argument to the function $g$, the distribution of $g(\mu ,X)$ does not depend on the parameters $\mu$ or $\sigma$ of the normal probability distribution that governs the observations $X_{1},\ldots ,X_{n}$.
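One practical consequence is that quantiles of $g(\mu ,X)$ can be obtained without knowing $\mu$ or $\sigma$, and then inverted into a confidence interval for $\mu$. The Python sketch below approximates a quantile by simulation rather than from t-tables, purely so that it needs only the standard library. The helper names and the data-generating values (mean 100, standard deviation 15) are made up for illustration.

```python
import math
import random
import statistics

def t_pivot(xs, mu):
    """g(mu, X) = (mean(X) - mu) / (s / sqrt(n))."""
    n = len(xs)
    mean = sum(xs) / n
    s2 = sum((x - mean) ** 2 for x in xs) / (n - 1)
    return (mean - mu) / math.sqrt(s2 / n)

def pivot_quantile(n, prob, reps=20000, seed=0):
    """Approximate a quantile of the pivot by Monte Carlo simulation.

    The pivot's distribution is parameter-free, so simulating with
    mu = 0 and sigma = 1 is enough.
    """
    rng = random.Random(seed)
    draws = sorted(
        t_pivot([rng.gauss(0.0, 1.0) for _ in range(n)], 0.0)
        for _ in range(reps)
    )
    return draws[int(prob * reps)]

def confidence_interval(xs, level=0.95):
    """Invert |g(mu, X)| <= q to obtain an interval for mu."""
    n = len(xs)
    q = pivot_quantile(n, 0.5 + level / 2.0)
    half_width = q * statistics.stdev(xs) / math.sqrt(n)
    centre = statistics.fmean(xs)
    return centre - half_width, centre + half_width

rng = random.Random(42)
data = [rng.gauss(100.0, 15.0) for _ in range(10)]  # made-up parameters
print(pivot_quantile(10, 0.975))   # simulated 97.5% quantile of the pivot
print(confidence_interval(data))   # approximate 95% interval for mu
```

For $n=10$ the simulated quantile should land near the tabulated value of about 2.26 for Student's t-distribution with 9 degrees of freedom; in practice one would take the quantile from the t-distribution directly.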

This can be used to compute a prediction interval for the next observation $X_{n+1};$ see Prediction interval: Normal distribution.

Bivariate normal distribution

In more complicated cases, it is impossible to construct exact pivots. However, having approximate pivots improves convergence to asymptotic normality.

Suppose a sample of size $n$ of vectors $(X_{i},Y_{i})'$ is taken from a bivariate normal distribution with unknown correlation $\rho$ .

An estimator of $\rho$ is the sample (Pearson, moment) correlation

r={\frac {{\frac {1}{n-1}}\sum _{i=1}^{n}(X_{i}-{\overline {X}})(Y_{i}-{\overline {Y}})}{s_{X}s_{Y}}}

where $s_{X}^{2},s_{Y}^{2}$ are sample variances of $X$ and $Y$. The sample statistic $r$ has an asymptotically normal distribution:

{\sqrt {n}}{\frac {r-\rho }{1-\rho ^{2}}}\Rightarrow N(0,1).

However, a variance-stabilizing transformation

z={\rm {tanh}}^{-1}r={\frac {1}{2}}\ln {\frac {1+r}{1-r}}

known as Fisher's 'z' transformation of the correlation coefficient, makes the distribution of $z$ asymptotically independent of the unknown parameters:

{\sqrt {n}}(z-\zeta )\Rightarrow N(0,1)

where $\zeta ={\rm {tanh}}^{-1}\rho$ is the corresponding distribution parameter. For finite sample sizes $n$, the distribution of the random variable $z$ is closer to normal than that of $r$. An even closer approximation to the standard normal distribution is obtained by using a better approximation for the exact variance: the usual form is

\operatorname {Var} (z)\approx {\frac {1}{n-3}}.
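Treating $z-\zeta$ as approximately normal with variance $1/(n-3)$ gives an approximate pivot that can be inverted into a confidence interval for $\rho$. The following Python sketch shows one way this might be done; the function name `fisher_interval` and the example inputs are illustrative assumptions, and the interval is only as good as the normal approximation.

```python
import math
from statistics import NormalDist

def fisher_interval(r, n, level=0.95):
    """Approximate confidence interval for rho from a sample correlation r."""
    if not -1.0 < r < 1.0 or n <= 3:
        raise ValueError("need -1 < r < 1 and n > 3")
    z = math.atanh(r)                # Fisher's z transformation of r
    se = 1.0 / math.sqrt(n - 3)      # from Var(z) ~ 1 / (n - 3)
    q = NormalDist().inv_cdf(0.5 + level / 2.0)
    # Interval for zeta, mapped back to the correlation scale with tanh.
    return math.tanh(z - q * se), math.tanh(z + q * se)

# Illustrative inputs: sample correlation 0.5 from 28 pairs.
print(fisher_interval(0.5, 28))
```

Because the interval is built on the $z$ scale and mapped back with $\tanh$, it always stays inside $(-1,1)$ and is generally not symmetric around $r$.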

Robustness

Main article: Robust statistics

From the point of view of robust statistics, pivotal quantities are robust to changes in the parameters — indeed, independent of the parameters — but not in general robust to changes in the model, such as violations of the assumption of normality. This is fundamental to the robust critique of non-robust statistics, often derived from pivotal quantities: such statistics may be robust within the family, but are not robust outside it.

References

^ Shao, J. (2008). "Pivotal quantities". Mathematical Statistics (2nd ed.). New York: Springer. pp. 471–477. ISBN 978-0-387-21718-5.
^ DeGroot, Morris H.; Schervish, Mark J. (2011). Probability and Statistics (4th ed.). Pearson. p. 489. ISBN 978-0-321-70970-7.
