In statistical mechanics and mathematics, a Boltzmann distribution (also called Gibbs distribution^[1]) is a probability distribution or probability measure that gives the probability that a system will be in a certain state as a function of that state's energy and the temperature of the system. The distribution is expressed in the form:

p_{i}\propto \exp \left(-{\frac {\varepsilon _{i}}{kT}}\right)

where $p_{i}$ is the probability of the system being in state $i$, $\exp$ is the exponential function, $\varepsilon _{i}$ is the energy of that state, and the constant $kT$ is the product of the Boltzmann constant $k$ and the thermodynamic temperature $T$. The symbol $\propto$ denotes proportionality (see § The distribution for the proportionality constant).

The term system here has a wide meaning; it can range from a single atom or a collection of a 'sufficient number' of atoms^[1] to a macroscopic system such as a natural gas storage tank. The Boltzmann distribution can therefore be used to solve a wide variety of problems. The distribution shows that states with lower energy will always have a higher probability of being occupied.

The ratio of probabilities of two states is known as the Boltzmann factor and characteristically only depends on the states' energy difference:

{\frac {p_{i}}{p_{j}}}=\exp \left({\frac {\varepsilon _{j}-\varepsilon _{i}}{kT}}\right)

The Boltzmann distribution is named after Ludwig Boltzmann, who first formulated it in 1868 during his studies of the statistical mechanics of gases in thermal equilibrium.^[2] Boltzmann's statistical work is borne out in his paper "On the Relationship between the Second Fundamental Theorem of the Mechanical Theory of Heat and Probability Calculations Regarding the Conditions for Thermal Equilibrium".^[3] The distribution was later investigated extensively, in its modern generic form, by Josiah Willard Gibbs in 1902.^[4]

The Boltzmann distribution should not be confused with the Maxwell–Boltzmann distribution or Maxwell–Boltzmann statistics. The Boltzmann distribution gives the probability that a system will be in a certain state as a function of that state's energy,^[5] while the Maxwell–Boltzmann distributions give the probabilities of particle speeds or energies in ideal gases. The distribution of energies in a one-dimensional gas, however, does follow the Boltzmann distribution.

The distribution

The Boltzmann distribution is a probability distribution that gives the probability of a certain state as a function of that state's energy and the temperature of the system to which the distribution is applied.^[6] It is given as $p_{i}={\frac {1}{Q}}\exp \left(-{\frac {\varepsilon _{i}}{kT}}\right)={\frac {\exp \left(-{\tfrac {\varepsilon _{i}}{kT}}\right)}{\displaystyle \sum _{j=1}^{M}\exp \left(-{\tfrac {\varepsilon _{j}}{kT}}\right)}}$

where:

$\exp()$ is the exponential function,
$p_{i}$ is the probability of state $i$,
$\varepsilon _{i}$ is the energy of state $i$,
$k$ is the Boltzmann constant,
$T$ is the absolute temperature of the system,
$M$ is the number of all states accessible to the system of interest,^[6]^[5]
$Q$ (denoted by some authors by $Z$) is the normalization denominator, which is the canonical partition function $Q=\sum _{j=1}^{M}\exp \left(-{\tfrac {\varepsilon _{j}}{kT}}\right)$. It results from the constraint that the probabilities of all accessible states must add up to 1.
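
The definition above can be evaluated directly for a finite list of state energies. The following is a minimal sketch, not a reference implementation: the function name `boltzmann_probabilities` and the three-state example system are hypothetical, and energies are assumed to be given in joules.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact SI value)


def boltzmann_probabilities(energies, temperature):
    """Return p_i = exp(-e_i / kT) / Q for state energies given in joules."""
    kT = K_B * temperature
    # Shift by the lowest energy before exponentiating. The shift cancels
    # between numerator and denominator and avoids overflow or underflow.
    e_min = min(energies)
    weights = [math.exp(-(e - e_min) / kT) for e in energies]
    q = sum(weights)  # partition function, up to the factor exp(-e_min / kT)
    return [w / q for w in weights]


# Hypothetical three-state system with energies 0, kT and 2 kT at 300 K.
T = 300.0
energies = [0.0, 1.0 * K_B * T, 2.0 * K_B * T]
probs = boltzmann_probabilities(energies, T)
print(probs)
```

Because the example energies are spaced by exactly $kT$, each probability is smaller than the previous one by a factor of $e$, and the three values sum to 1.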

Using Lagrange multipliers, one can prove that the Boltzmann distribution is the distribution that maximizes the entropy $S(p_{1},p_{2},\cdots ,p_{M})=-\sum _{i=1}^{M}p_{i}\log _{2}p_{i}$

subject to the normalization constraint that $\sum p_{i}=1$ and the constraint that $\sum p_{i}\varepsilon _{i}$ equals a particular mean energy value, except for two special cases. (These special cases occur when the mean value is either the minimum or maximum of the energies $\varepsilon _{i}$. In these cases, the entropy maximizing distribution is a limit of Boltzmann distributions where $T$ approaches zero from above or below, respectively.)
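
The maximization can be outlined as follows. This is a sketch rather than a full proof: the multipliers $\alpha$ and $\beta$ and the symbol $U$ for the prescribed mean energy are notation introduced here, and the natural logarithm is used in place of $\log _{2}$, which only rescales $S$ by a constant factor and leaves the maximizer unchanged.

```latex
\begin{aligned}
\mathcal{L} &= -\sum_{i=1}^{M} p_i \ln p_i
  - \alpha\left(\sum_{i=1}^{M} p_i - 1\right)
  - \beta\left(\sum_{i=1}^{M} p_i \varepsilon_i - U\right), \\
\frac{\partial \mathcal{L}}{\partial p_i} &= -\ln p_i - 1 - \alpha - \beta \varepsilon_i = 0
  \quad\Longrightarrow\quad p_i = e^{-1-\alpha}\, e^{-\beta \varepsilon_i}, \\
\sum_{i=1}^{M} p_i = 1 &\quad\Longrightarrow\quad
  e^{-1-\alpha} = \frac{1}{\sum_{j=1}^{M} e^{-\beta \varepsilon_j}} = \frac{1}{Q}.
\end{aligned}
```

Identifying $\beta =1/(kT)$ recovers the Boltzmann form; the value of $\beta$ is fixed by the mean-energy constraint.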

The partition function can be calculated if we know the energies of the states accessible to the system of interest. For atoms the partition function values can be found in the NIST Atomic Spectra Database.^[7]

The distribution shows that states with lower energy will always have a higher probability of being occupied than the states with higher energy. It can also give us the quantitative relationship between the probabilities of the two states being occupied. The ratio of probabilities for states $i$ and $j$ is given as ${\frac {p_{i}}{p_{j}}}=\exp \left({\frac {\varepsilon _{j}-\varepsilon _{i}}{kT}}\right)$

where:

$p_{i}$ is the probability of state $i$,
$p_{j}$ is the probability of state $j$,
$\varepsilon _{i}$ is the energy of state $i$,
$\varepsilon _{j}$ is the energy of state $j$.

The corresponding ratio of populations of energy levels must also take their degeneracies into account.
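
A short numerical sketch of the two ratios follows. It assumes the standard form in which the population ratio of two levels is the state ratio multiplied by the ratio of their degeneracies; the function name `population_ratio`, the degeneracy parameters `g_i` and `g_j`, and the example energies are hypothetical, with energies taken in electronvolts.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def population_ratio(e_i, e_j, temperature, g_i=1, g_j=1):
    """Return N_i / N_j for levels with energies e_i, e_j (in eV).

    With g_i = g_j = 1 this is the ratio of single-state probabilities
    p_i / p_j = exp((e_j - e_i) / kT). The degeneracies g_i and g_j count
    how many states share each level's energy (assumed inputs).
    """
    kT = K_B_EV * temperature
    return (g_i / g_j) * math.exp((e_j - e_i) / kT)


# Hypothetical example: an upper level 0.05 eV above the lower one at 300 K.
ratio_states = population_ratio(0.05, 0.0, 300.0)
# Same energies, but the upper level is assumed to be threefold degenerate.
ratio_levels = population_ratio(0.05, 0.0, 300.0, g_i=3, g_j=1)
print(ratio_states, ratio_levels)
```

The level ratio is three times the state ratio here, which shows how a degenerate upper level can hold a larger population than a single-state comparison suggests.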

The Boltzmann distribution is often used to describe the distribution of particles, such as atoms or molecules, over bound states accessible to them. If we have a system consisting of many particles, the probability of a particle being in state $i$ is practically the probability that, if we pick a random particle from that system and check what state it is in, we will find it is in state $i$ . This probability is equal to the number of particles in state $i$ divided by the total number of particles in the system, that is the fraction of particles that occupy state $i$ .

p_{i}={\frac {N_{i}}{N}}

where $N_{i}$ is the number of particles in state $i$ and $N$ is the total number of particles in the system. We may use the Boltzmann distribution to find this probability, which is, as we have seen, equal to the fraction of particles that are in state $i$. So the equation that gives the fraction of particles in state $i$ as a function of the energy of that state is^[5] ${\frac {N_{i}}{N}}={\frac {\exp \left(-{\frac {\varepsilon _{i}}{kT}}\right)}{\displaystyle \sum _{j=1}^{M}\exp \left(-{\tfrac {\varepsilon _{j}}{kT}}\right)}}$

This equation is of great importance to spectroscopy. In spectroscopy we observe a spectral line of atoms or molecules undergoing transitions from one state to another.^[5]^[8] In order for this to be possible, there must be some particles in the first state to undergo the transition. We can check whether this condition is fulfilled by calculating the fraction of particles in the first state. If it is negligible, the transition is very likely not observed at the temperature for which the calculation was done. In general, a larger fraction of molecules in the first state means a higher number of transitions to the second state.^[9] This gives a stronger spectral line. However, there are other factors that influence the intensity of a spectral line, such as whether it is caused by an allowed or a forbidden transition.
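
The temperature dependence of the populated fraction can be illustrated with a small calculation. This is a sketch under stated assumptions: the two-level system with a 1.0 eV gap is hypothetical and not taken from any measured spectrum, and the function name `state_fraction` is introduced here for illustration.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def state_fraction(energies_ev, index, temperature):
    """Return N_i / N for the state at `index`, with energies given in eV."""
    kT = K_B_EV * temperature
    e_min = min(energies_ev)  # shift for numerical stability; cancels in the ratio
    weights = [math.exp(-(e - e_min) / kT) for e in energies_ev]
    return weights[index] / sum(weights)


# Hypothetical system: the initial state of the transition lies 1.0 eV
# above the ground state.
levels = [0.0, 1.0]
for T in (300.0, 3000.0, 6000.0):
    print(T, state_fraction(levels, 1, T))
```

In this example the upper state is essentially unpopulated at 300 K, so a line starting from it would not be expected, while at several thousand kelvin a noticeable fraction of particles occupies it.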

The softmax function commonly used in machine learning is related to the Boltzmann distribution:

(p_{1},\ldots ,p_{M})=\operatorname {softmax} \left[-{\frac {\varepsilon _{1}}{kT}},\ldots ,-{\frac {\varepsilon _{M}}{kT}}\right]
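
The relation can be checked numerically. The sketch below uses a plain-Python softmax with the usual maximum-subtraction trick for numerical stability; the helper name `boltzmann_via_softmax` and the example energies, expressed in units where $kT=1$, are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax of a list of real numbers."""
    m = max(xs)  # subtracting the maximum does not change the result
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [v / total for v in exps]


def boltzmann_via_softmax(energies, kT):
    """Boltzmann probabilities as the softmax of the scaled negative energies."""
    return softmax([-e / kT for e in energies])


# Hypothetical energies in units where kT = 1.
energies = [0.0, 1.0, 2.0]
probs = boltzmann_via_softmax(energies, kT=1.0)

# Direct evaluation of exp(-e / kT) / Q for comparison.
weights = [math.exp(-e) for e in energies]
direct = [w / sum(weights) for w in weights]
print(probs, direct)
```

In machine-learning usage the role of $kT$ is often played by a "temperature" parameter that divides the softmax inputs, which is the same scaling as in the expression above.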

Generalized Boltzmann distribution

A distribution of the form

\Pr \left(\omega \right)\propto \exp \left[\sum _{\eta =1}^{n}{\frac {X_{\eta }x_{\eta }^{\left(\omega \right)}}{k_{B}T}}-{\frac {E^{\left(\omega \right)}}{k_{B}T}}\right]

is called a generalized Boltzmann distribution by some authors.^[10]

The Boltzmann distribution is a special case of the generalized Boltzmann distribution. The generalized Boltzmann distribution is used in statistical mechanics to describe the canonical ensemble, the grand canonical ensemble and the isothermal–isobaric ensemble. The generalized Boltzmann distribution is usually derived from the principle of maximum entropy, but there are other derivations.^[10]^[11]
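
As an illustration of how the ensembles named above fit this form, the following sketch writes out the usual identifications. The symbols $\mu$ (chemical potential), $N$ (particle number), $P$ (pressure) and $V$ (volume) are standard thermodynamic notation assumed here rather than taken from the text.

```latex
\begin{aligned}
\text{canonical } (n=0): &\quad
  \Pr(\omega) \propto \exp\left[-\frac{E^{(\omega)}}{k_B T}\right], \\
\text{grand canonical } (X_1=\mu,\ x_1^{(\omega)}=N^{(\omega)}): &\quad
  \Pr(\omega) \propto \exp\left[\frac{\mu N^{(\omega)} - E^{(\omega)}}{k_B T}\right], \\
\text{isothermal--isobaric } (X_1=-P,\ x_1^{(\omega)}=V^{(\omega)}): &\quad
  \Pr(\omega) \propto \exp\left[-\frac{E^{(\omega)} + P V^{(\omega)}}{k_B T}\right].
\end{aligned}
```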

The generalized Boltzmann distribution has the following properties:

It is the only distribution for which the entropy as defined by the Gibbs entropy formula matches the entropy as defined in classical thermodynamics.^[10]
It is the only distribution that is mathematically consistent with the fundamental thermodynamic relation where state functions are described by ensemble averages.^[11]

In statistical mechanics

Main articles: Canonical ensemble and Maxwell–Boltzmann statistics

The Boltzmann distribution appears in statistical mechanics when considering closed systems of fixed composition that are in thermal equilibrium (equilibrium with respect to energy exchange). The most general case is the probability distribution for the canonical ensemble. Some special cases (derivable from the canonical ensemble) show the Boltzmann distribution in different aspects:

Canonical ensemble (general case): The canonical ensemble gives the probabilities of the various possible states of a closed system of fixed volume, in thermal equilibrium with a heat bath. The canonical ensemble has a state probability distribution with the Boltzmann form.
Statistical frequencies of subsystems' states (in a non-interacting collection): When the system of interest is a collection of many non-interacting copies of a smaller subsystem, it is sometimes useful to find the statistical frequency of a given subsystem state, among the collection. The canonical ensemble has the property of separability when applied to such a collection: as long as the non-interacting subsystems have fixed composition, then each subsystem's state is independent of the others and is also characterized by a canonical ensemble. As a result, the expected statistical frequency distribution of subsystem states has the Boltzmann form.
Maxwell–Boltzmann statistics of classical gases (systems of non-interacting particles): In particle systems, many particles share the same space and regularly change places with each other; the single-particle state space they occupy is a shared space. Maxwell–Boltzmann statistics give the expected number of particles found in a given single-particle state, in a classical gas of non-interacting particles at equilibrium. This expected number distribution has the Boltzmann form.
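
The separability property described for non-interacting subsystems can be checked numerically. The sketch below assumes two hypothetical subsystems whose combined energy is simply the sum of their individual energies, in units where $kT=1$; the energy values are made up for illustration.

```python
import itertools
import math


def boltzmann(energies, kT):
    """Return Boltzmann probabilities for a list of state energies."""
    weights = [math.exp(-e / kT) for e in energies]
    q = sum(weights)
    return [w / q for w in weights]


kT = 1.0
sub_a = [0.0, 0.7]       # hypothetical energies of subsystem A
sub_b = [0.0, 0.3, 1.1]  # hypothetical energies of subsystem B

# Joint states of the combined system; with no interaction the total
# energy of a joint state is the sum of the two subsystem energies.
pairs = list(itertools.product(range(len(sub_a)), range(len(sub_b))))
joint = boltzmann([sub_a[i] + sub_b[j] for i, j in pairs], kT)

# Boltzmann distributions of each subsystem taken on its own.
p_a = boltzmann(sub_a, kT)
p_b = boltzmann(sub_b, kT)

# The joint probability should factorize into the product of the two.
max_diff = max(abs(joint[n] - p_a[i] * p_b[j]) for n, (i, j) in enumerate(pairs))
print(max_diff)
```

The factorization holds because the exponential of a sum is a product of exponentials, so the partition function of the combined system is the product of the subsystem partition functions. An interaction term in the total energy would break it.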

Although these cases have strong similarities, it is helpful to distinguish them as they generalize in different ways when the crucial assumptions are changed:

When a system is in thermodynamic equilibrium with respect to both energy exchange and particle exchange, the requirement of fixed composition is relaxed and a grand canonical ensemble is obtained rather than a canonical ensemble. On the other hand, if both composition and energy are fixed, then a microcanonical ensemble applies instead.
If the subsystems within a collection do interact with each other, then the expected frequencies of subsystem states no longer follow a Boltzmann distribution, and may not even have an analytical solution.^[12] The canonical ensemble can, however, still be applied to the collective states of the entire system considered as a whole, provided the entire system is in thermal equilibrium.
With quantum gases of non-interacting particles in equilibrium, the number of particles found in a given single-particle state does not follow Maxwell–Boltzmann statistics, and there is no simple closed form expression for quantum gases in the canonical ensemble. In the grand canonical ensemble the state-filling statistics of quantum gases are described by Fermi–Dirac statistics or Bose–Einstein statistics, depending on whether the particles are fermions or bosons, respectively.
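
The difference between the three state-filling statistics can be made concrete with the standard grand canonical expressions for the mean occupancy of a single-particle state. In this sketch the chemical potential `mu` and the function name `mean_occupancy` are notation introduced here, and the numerical inputs are hypothetical.

```python
import math


def mean_occupancy(energy, mu, kT, statistics):
    """Mean number of particles in a single-particle state of given energy.

    Uses the standard grand canonical forms with x = (energy - mu) / kT:
      Maxwell-Boltzmann: exp(-x)
      Fermi-Dirac:       1 / (exp(x) + 1)
      Bose-Einstein:     1 / (exp(x) - 1), which requires x > 0
    """
    x = (energy - mu) / kT
    if statistics == "maxwell-boltzmann":
        return math.exp(-x)
    if statistics == "fermi-dirac":
        return 1.0 / (math.exp(x) + 1.0)
    if statistics == "bose-einstein":
        if x <= 0.0:
            raise ValueError("Bose-Einstein occupancy requires energy > mu")
        return 1.0 / (math.exp(x) - 1.0)
    raise ValueError(f"unknown statistics: {statistics}")


# Hypothetical state lying 10 kT above the chemical potential.
for name in ("maxwell-boltzmann", "fermi-dirac", "bose-einstein"):
    print(name, mean_occupancy(10.0, 0.0, 1.0, name))
```

When the state lies many $kT$ above the chemical potential the occupancy is small and the three expressions nearly coincide, which is the regime in which the Boltzmann form is a good approximation for quantum gases.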

In mathematics

Main articles: Gibbs measure, Log-linear model, and Boltzmann machine

In more general mathematical settings, the Boltzmann distribution is also known as the Gibbs measure.
In statistics and machine learning, it is called a log-linear model.
In deep learning, the Boltzmann distribution is used as the sampling distribution of stochastic neural networks such as the Boltzmann machine, the restricted Boltzmann machine, energy-based models and the deep Boltzmann machine. The Boltzmann machine is considered an unsupervised learning model. As the number of nodes in a Boltzmann machine increases, implementing it in real-time applications becomes increasingly difficult, so a different architecture, the restricted Boltzmann machine, was introduced.
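
To make the connection to the Boltzmann machine concrete, the sketch below enumerates every configuration of a tiny network and assigns each a Boltzmann probability. It assumes the commonly used energy function with binary units, symmetric weights and a temperature of 1; the weights, biases and the function name `bm_energy` are hypothetical. Exhaustive enumeration is only feasible for very small networks, since the number of configurations doubles with each added unit.

```python
import itertools
import math


def bm_energy(state, weights, biases):
    """E(s) = -sum_{i<j} w_ij s_i s_j - sum_i b_i s_i for units s_i in {0, 1}."""
    n = len(state)
    pair_term = sum(
        weights[i][j] * state[i] * state[j]
        for i in range(n)
        for j in range(i + 1, n)
    )
    bias_term = sum(b * s for b, s in zip(biases, state))
    return -pair_term - bias_term


# Hypothetical three-unit network with symmetric weights and no self-connections.
weights = [
    [0.0, 1.0, -0.5],
    [1.0, 0.0, 0.25],
    [-0.5, 0.25, 0.0],
]
biases = [0.1, -0.2, 0.0]

# Enumerate all 2^3 configurations and normalize exp(-E) over them.
states = list(itertools.product([0, 1], repeat=3))
unnormalized = [math.exp(-bm_energy(s, weights, biases)) for s in states]
z = sum(unnormalized)  # partition function
probs = {s: u / z for s, u in zip(states, unnormalized)}

most_probable = max(probs, key=probs.get)
print(most_probable, probs[most_probable])
```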

In economics

The Boltzmann distribution can be introduced to allocate permits in emissions trading.^[13]^[14] The new allocation method using the Boltzmann distribution can describe the most probable, natural, and unbiased distribution of emissions permits among multiple countries.

The Boltzmann distribution has the same form as the multinomial logit model. As a discrete choice model, this is very well known in economics since Daniel McFadden made the connection to random utility maximization.^[15]
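
The correspondence can be written out explicitly. In this sketch $V_{i}$ denotes the systematic utility of alternative $i$ among $M$ alternatives, a symbol introduced here for illustration.

```latex
P(i) = \frac{\exp(V_i)}{\sum_{j=1}^{M} \exp(V_j)}
\qquad\text{compared with}\qquad
p_i = \frac{\exp\left(-\varepsilon_i / kT\right)}{\sum_{j=1}^{M} \exp\left(-\varepsilon_j / kT\right)},
\qquad V_i \leftrightarrow -\frac{\varepsilon_i}{kT}.
```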

References

^ ^a ^b Landau, Lev Davidovich & Lifshitz, Evgeny Mikhailovich (1980) [1976]. Statistical Physics. Course of Theoretical Physics. Vol. 5 (3 ed.). Oxford: Pergamon Press. ISBN 0-7506-3372-7. Translated by J.B. Sykes and M.J. Kearsley. See section 28
^ Boltzmann, Ludwig (1868). "Studien über das Gleichgewicht der lebendigen Kraft zwischen bewegten materiellen Punkten" [Studies on the balance of living force between moving material points]. Wiener Berichte. 58: 517–560.
^ "Archived copy" (PDF). Archived from the original (PDF) on 2021-03-05. Retrieved 2017-05-11.
^ Gibbs, Josiah Willard (1902). Elementary Principles in Statistical Mechanics. New York: Charles Scribner's Sons.
^ ^a ^b ^c ^d Atkins, P. W. (2010) Quanta, W. H. Freeman and Company, New York
^ ^a ^b McQuarrie, A. (2000). Statistical Mechanics. Sausalito, CA: University Science Books. ISBN 1-891389-15-7.
^ NIST Atomic Spectra Database Levels Form at nist.gov
^ Atkins, P. W.; de Paula, J. (2009). Physical Chemistry (9th ed.). Oxford: Oxford University Press. ISBN 978-0-19-954337-3.
^ Skoog, D. A.; Holler, F. J.; Crouch, S. R. (2006). Principles of Instrumental Analysis. Boston, MA: Brooks/Cole. ISBN 978-0-495-12570-9.
^ ^a ^b ^c Gao, Xiang; Gallicchio, Emilio; Roitberg, Adrian (2019). "The generalized Boltzmann distribution is the only distribution in which the Gibbs-Shannon entropy equals the thermodynamic entropy". The Journal of Chemical Physics. 151 (3): 034113. arXiv:1903.02121. Bibcode:2019JChPh.151c4113G. doi:10.1063/1.5111333. PMID 31325924. S2CID 118981017.
^ ^a ^b Gao, Xiang (March 2022). "The Mathematics of the Ensemble Theory". Results in Physics. 34: 105230. arXiv:2006.00485. Bibcode:2022ResPh..3405230G. doi:10.1016/j.rinp.2022.105230. S2CID 221978379.
^ A classic example of this is magnetic ordering. Systems of non-interacting spins show paramagnetic behaviour that can be understood with a single-particle canonical ensemble (resulting in the Brillouin function). Systems of interacting spins can show much more complex behaviour such as ferromagnetism or antiferromagnetism.
^ Park, J.-W., Kim, C. U. and Isard, W. (2012) Permit allocation in emissions trading using the Boltzmann distribution. Physica A 391: 4883–4890
^ The Thorny Problem Of Fair Allocation. Technology Review blog. August 17, 2011. Cites and summarizes Park, Kim and Isard (2012).
^ Amemiya, Takeshi (1985). "Multinomial Logit Model". Advanced Econometrics. Oxford: Basil Blackwell. pp. 295–299. ISBN 0-631-13345-3.
