

In statistics and econometrics, the multinomial probit model is a generalization of the probit model used when the dependent variable can fall into one of several possible categories. As such, it is an alternative to the multinomial logit model as a method of multiclass classification. It is not to be confused with the multivariate probit model, which is used to model several correlated binary outcomes jointly, that is, more than one dependent variable.

General specification

It is assumed that we have a series of observations Y_i, for i = 1...n, of the outcomes of multi-way choices from a categorical distribution of size m (there are m possible choices). Along with each observation Y_i is a set of k observed values x_1,i, ..., x_k,i of explanatory variables (also known as independent variables, predictor variables, features, etc.). Some examples:

The observed outcomes might be "has disease A, has disease B, has disease C, has none of the diseases" for a set of rare diseases with similar symptoms, and the explanatory variables might be characteristics of the patients thought to be pertinent (sex, race, age, blood pressure, body-mass index, presence or absence of various symptoms, etc.).
The observed outcomes are the votes of people for a given party or candidate in a multi-way election, and the explanatory variables are the demographic characteristics of each person (e.g. sex, race, age, income, etc.).

The multinomial probit model is a statistical model that can be used to predict the likely outcome of an unobserved multi-way trial given the associated explanatory variables. In the process, the model attempts to explain the relative effect of differing explanatory variables on the different outcomes.

Formally, the outcomes Y_i are described as being categorically-distributed data, where each outcome value h for observation i occurs with an unobserved probability p_i,h that is specific to the observation i at hand because it is determined by the values of the explanatory variables associated with that observation. That is:

Y_{i}|x_{1,i},\ldots ,x_{k,i}\ \sim \operatorname {Categorical} (p_{i,1},\ldots ,p_{i,m}),{\text{ for }}i=1,\dots ,n

or equivalently

\Pr[Y_{i}=h|x_{1,i},\ldots ,x_{k,i}]=p_{i,h},{\text{ for }}i=1,\dots ,n,

for each of m possible values of h.
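
As a small illustration of the categorical specification above, the following sketch draws one outcome per observation from given probabilities. The probability values are hypothetical and chosen only for the example; in the model they would be determined by the explanatory variables.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical probabilities p_{i,h} for n = 3 observations and m = 4 outcomes.
# Each row corresponds to one observation and sums to 1.
p = np.array([
    [0.10, 0.20, 0.30, 0.40],
    [0.70, 0.10, 0.10, 0.10],
    [0.25, 0.25, 0.25, 0.25],
])

# One categorical draw per observation; outcomes are labelled 1..m.
y = np.array([rng.choice(p.shape[1], p=row) + 1 for row in p])
```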

Latent variable model

Multinomial probit is often written in terms of a latent variable model:

{\begin{aligned}Y_{i}^{1\ast }&={\boldsymbol {\beta }}_{1}\cdot \mathbf {X} _{i}+\varepsilon _{1}\,\\Y_{i}^{2\ast }&={\boldsymbol {\beta }}_{2}\cdot \mathbf {X} _{i}+\varepsilon _{2}\,\\\ldots &\ldots \\Y_{i}^{m\ast }&={\boldsymbol {\beta }}_{m}\cdot \mathbf {X} _{i}+\varepsilon _{m}\,\\\end{aligned}}

where

{\boldsymbol {\varepsilon }}\sim {\mathcal {N}}(0,{\boldsymbol {\Sigma }})

Then

Y_{i}={\begin{cases}1&{\text{if }}Y_{i}^{1\ast }>Y_{i}^{2\ast },\ldots ,Y_{i}^{m\ast }\\2&{\text{if }}Y_{i}^{2\ast }>Y_{i}^{1\ast },Y_{i}^{3\ast },\ldots ,Y_{i}^{m\ast }\\\ldots &\ldots \\m&{\text{otherwise.}}\end{cases}}

That is,

Y_{i}=\arg \max _{h=1}^{m}Y_{i}^{h\ast }
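
The latent variable formulation can be sketched as a simulation: compute the m latent values for each observation, add jointly normal errors, and take the index of the largest. The function name `simulate_choices`, the coefficient values, and the covariance matrix below are assumptions made for illustration only, not part of any particular library.

```python
import numpy as np

def simulate_choices(X, betas, Sigma, rng):
    """Draw one choice per observation from the latent variable model.

    X     : (n, k) array of explanatory variables
    betas : (m, k) array, row h holds the coefficient vector beta_h
    Sigma : (m, m) covariance matrix of the errors
    """
    n, m = X.shape[0], betas.shape[0]
    # Deterministic part: entry (i, h) is beta_h . X_i
    mean_utility = X @ betas.T
    # Errors drawn from N(0, Sigma), one m-vector per observation
    eps = rng.multivariate_normal(np.zeros(m), Sigma, size=n)
    latent = mean_utility + eps
    # Y_i is the index of the largest latent value; +1 labels outcomes 1..m
    return latent.argmax(axis=1) + 1

# Hypothetical example with n = 5, k = 2, m = 3
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 2))
betas = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]])
Sigma = np.array([[1.0, 0.5, 0.0],
                  [0.5, 1.0, 0.0],
                  [0.0, 0.0, 1.0]])
y = simulate_choices(X, betas, Sigma, rng)
```

Because the errors are continuous, ties between latent values occur with probability zero, so the argmax is well defined.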

Note that this model allows for arbitrary correlation between the error variables, so it does not necessarily satisfy independence of irrelevant alternatives.

When $\boldsymbol{\Sigma}$ is the identity matrix (such that there is no correlation or heteroscedasticity), the model is called independent probit.
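
The effect of the covariance matrix on the choice probabilities can be illustrated by Monte Carlo simulation. The sketch below is a rough frequency estimate under assumed values, not an estimation procedure: the helper `choice_probabilities`, the zero mean utilities, and the correlation of 0.9 are all hypothetical.

```python
import numpy as np

def choice_probabilities(mean_utility, Sigma, n_draws, rng):
    """Estimate Pr[Y = h] for h = 1..m as the share of draws in which
    alternative h has the largest latent value."""
    m = len(mean_utility)
    eps = rng.multivariate_normal(np.zeros(m), Sigma, size=n_draws)
    winners = (np.asarray(mean_utility) + eps).argmax(axis=1)
    return np.bincount(winners, minlength=m) / n_draws

rng = np.random.default_rng(0)
mean_utility = np.zeros(3)  # three alternatives with equal deterministic parts

# Independent probit: identity covariance, so each alternative
# should win in roughly one third of the draws.
p_independent = choice_probabilities(mean_utility, np.eye(3), 200_000, rng)

# Errors of alternatives 1 and 2 strongly correlated (assumed rho = 0.9):
# the two behave like close substitutes, and alternative 3 wins more often.
Sigma_corr = np.array([[1.0, 0.9, 0.0],
                       [0.9, 1.0, 0.0],
                       [0.0, 0.0, 1.0]])
p_correlated = choice_probabilities(mean_utility, Sigma_corr, 200_000, rng)
```

Comparing `p_independent` with `p_correlated` shows how correlation between the errors shifts probability among alternatives even though the deterministic parts are unchanged.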

Estimation


For details on how the equations are estimated, see the article Probit model.
