This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: "Entropy" information theory – news · newspapers · books · scholar · JSTOR (February 2019) (Learn how and when to remove this message)

Information theory

Entropy Differential entropy Conditional entropy Joint entropy Mutual information Directed information Conditional mutual information Relative entropy Entropy rate Limiting density of discrete points
Asymptotic equipartition property Rate–distortion theory
Shannon's source coding theorem Channel capacity Noisy-channel coding theorem Shannon–Hartley theorem
v t e

In information theory, the entropy of a random variable is the average level of "information", "surprise", or "uncertainty" inherent to the variable's possible outcomes. Given a discrete random variable $X$ , which takes values in the set ${\mathcal {X))$ and is distributed according to $p\colon {\mathcal {X))\to [0,1]$ , the entropy is $\mathrm {H} (X):=-\sum _{x\in {\mathcal {X))}p(x)\log p(x),$ where $\Sigma$ denotes the sum over the variable's possible values. The choice of base for $\log$ , the logarithm, varies for different applications. Base 2 gives the unit of bits (or "shannons"), while base e gives "natural units" nat, and base 10 gives units of "dits", "bans", or "hartleys". An equivalent definition of entropy is the expected value of the self-information of a variable.^[1]

The concept of information entropy was introduced by Claude Shannon in his 1948 paper "A Mathematical Theory of Communication",^[2]^[3] and is also referred to as Shannon entropy. Shannon's theory defines a data communication system composed of three elements: a source of data, a communication channel, and a receiver. The "fundamental problem of communication" – as expressed by Shannon – is for the receiver to be able to identify what data was generated by the source, based on the signal it receives through the channel.^[2]^[3] Shannon considered various ways to encode, compress, and transmit messages from a data source, and proved in his source coding theorem that the entropy represents an absolute mathematical limit on how well data from the source can be losslessly compressed onto a perfectly noiseless channel. Shannon strengthened this result considerably for noisy channels in his noisy-channel coding theorem.

Entropy in information theory is directly analogous to the entropy in statistical thermodynamics. The analogy results when the values of the random variable designate energies of microstates, so Gibbs's formula for the entropy is formally identical to Shannon's formula. Entropy has relevance to other areas of mathematics such as combinatorics and machine learning. The definition can be derived from a set of axioms establishing that entropy should be a measure of how informative the average outcome of a variable is. For a continuous random variable, differential entropy is analogous to entropy. The definition $\mathbb {E} [-\log p(X)]$ generalizes the above.

Authority control databases
International	FAST
National	Spain France BnF data Germany Israel United States Japan Czech Republic

All figures in entropically compressed exabytes
Type of Information	1986	2007
Storage	2.6	295
Broadcast	432	1900
Telecommunications	0.281	65

Introduction

Example

Definition

Measure theory

Example

Characterization

Alternative characterization

Discussion

Alternative characterization via additivity and subadditivity

Discussion

Further properties

Aspects

Relationship to thermodynamic entropy

Data compression

Entropy as a measure of diversity

Entropy of a sequence

Limitations of entropy in cryptography

Data as a Markov process

Efficiency (normalized entropy)

Entropy for continuous random variables

Differential entropy

Limiting density of discrete points

Relative entropy

Use in number theory

Use in combinatorics

Loomis–Whitney inequality

Approximation to binomial coefficient

Use in machine learning

See also

References

Further reading

Textbooks on information theory

External links