Polarization identities

Any inner product on a vector space induces a norm by the equation

\|x\|={\sqrt {\langle x,x\rangle )).

The polarization identities reverse this relationship, recovering the inner product from the norm. Every inner product satisfies:

\|x+y\|^{2}=\|x\|^{2}+\|y\|^{2}+2\operatorname {Re} \langle x,y\rangle \qquad {\text{ for all vectors ))x,y.

Solving for $\operatorname {Re} \langle x,y\rangle$ gives the formula $\operatorname {Re} \langle x,y\rangle ={\frac {1}{2))\left(\|x+y\|^{2}-\|x\|^{2}-\|y\|^{2}\right).$ If the inner product is real then $\operatorname {Re} \langle x,y\rangle =\langle x,y\rangle$ and this formula becomes a polarization identity for real inner products.

Real vector spaces

If the vector space is over the real numbers then the polarization identities are:^[4]

{\begin{alignedat}{4}\langle x,y\rangle &={\frac {1}{4))\left(\|x+y\|^{2}-\|x-y\|^{2}\right)\\[3pt]&={\frac {1}{2))\left(\|x+y\|^{2}-\|x\|^{2}-\|y\|^{2}\right)\\[3pt]&={\frac {1}{2))\left(\|x\|^{2}+\|y\|^{2}-\|x-y\|^{2}\right).\\[3pt]\end{alignedat))

These various forms are all equivalent by the parallelogram law:^{[proof 1]}

2\|x\|^{2}+2\|y\|^{2}=\|x+y\|^{2}+\|x-y\|^{2}.

This further implies that ${\displaystyle L^{p))$ class is not a Hilbert space whenever $p\neq 2$ , as the parallelogram law is not satisfied. For the sake of counterexample, consider ${\displaystyle x=1_{A))$ and ${\displaystyle y=1_{B))$ for any two disjoint subsets $A,B$ of general domain ${\displaystyle \Omega \subset \mathbb {R} ^{n))$ and compute the measure of both sets under parallelogram law.

Complex vector spaces

For vector spaces over the complex numbers, the above formulas are not quite correct because they do not describe the imaginary part of the (complex) inner product. However, an analogous expression does ensure that both real and imaginary parts are retained. The complex part of the inner product depends on whether it is antilinear in the first or the second argument. The notation $\langle x|y\rangle ,$ which is commonly used in physics will be assumed to be antilinear in the first argument while $\langle x,\,y\rangle ,$ which is commonly used in mathematics, will be assumed to be antilinear its the second argument. They are related by the formula:

\langle x,\,y\rangle =\langle y\,|\,x\rangle \quad {\text{ for all ))x,y\in H.

The real part of any inner product (no matter which argument is antilinear and no matter if it is real or complex) is a symmetric bilinear map that for any $x,y\in H$ is always equal to:^[4]^{[proof 1]}

{\begin{alignedat}{4}R(x,y):&=\operatorname {Re} \langle x\mid y\rangle =\operatorname {Re} \langle x,y\rangle \\&={\frac {1}{4))\left(\|x+y\|^{2}-\|x-y\|^{2}\right)\\&={\frac {1}{2))\left(\|x+y\|^{2}-\|x\|^{2}-\|y\|^{2}\right)\\[3pt]&={\frac {1}{2))\left(\|x\|^{2}+\|y\|^{2}-\|x-y\|^{2}\right).\\[3pt]\end{alignedat))

It is always a symmetric map, meaning that^{[proof 1]}

R(x,y)=R(y,x)\quad {\text{ for all ))x,y\in H,

and it also satisfies:^{[proof 1]}

R(y,ix)=-R(x,iy)\quad {\text{ for all ))x,y\in H.

Thus

R(ix,y)=-R(x,iy)

, which in plain English says that to move a factor of

i

to the other argument, introduce a negative sign.

Proof of properties of $R$
Let $R(x,y):={\frac {1}{4))\left(\\|x+y\\|^{2}-\\|x-y\\|^{2}\right).$ Then ${\displaystyle 2\\|x\\|^{2}+2\\|y\\|^{2}=\\|x+y\\|^{2}+\\|x-y\\|^{2))$ implies $R(x,y)={\frac {1}{4))\left(\left(2\\|x\\|^{2}+2\\|y\\|^{2}-\\|x-y\\|^{2}\right)-\\|x-y\\|^{2}\right)={\frac {1}{2))\left(\\|x\\|^{2}+\\|y\\|^{2}-\\|x-y\\|^{2}\right)$ and $R(x,y)={\frac {1}{4))\left(\\|x+y\\|^{2}-\left(2\\|x\\|^{2}+2\\|y\\|^{2}-\\|x+y\\|^{2}\right)\right)={\frac {1}{2))\left(\\|x+y\\|^{2}-\\|x\\|^{2}-\\|y\\|^{2}\right).$ Moreover, $4R(x,y)=\\|x+y\\|^{2}-\\|x-y\\|^{2}=\\|y+x\\|^{2}-\\|y-x\\|^{2}=4R(y,x),$ which proves that $R(x,y)=R(y,x)$ . From $1=i(-i)$ it follows that $y-ix=i(-iy-x)=-i(x+iy)$ and $y+ix=i(-iy+x)=i(x-iy)$ so that $-4R(y,ix)=\\|y-ix\\|^{2}-\\|y+ix\\|^{2}=\\|(-i)(x+iy)\\|^{2}-\\|i(x-iy)\\|^{2}=\\|x+iy\\|^{2}-\\|x-iy\\|^{2}=4R(x,iy),$ which proves that $R(y,ix)=-R(x,iy).$ $\blacksquare$

Proof of properties of

R

Let

R(x,y):={\frac {1}{4))\left(\|x+y\|^{2}-\|x-y\|^{2}\right).

Then

{\displaystyle 2\|x\|^{2}+2\|y\|^{2}=\|x+y\|^{2}+\|x-y\|^{2))

implies

R(x,y)={\frac {1}{4))\left(\left(2\|x\|^{2}+2\|y\|^{2}-\|x-y\|^{2}\right)-\|x-y\|^{2}\right)={\frac {1}{2))\left(\|x\|^{2}+\|y\|^{2}-\|x-y\|^{2}\right)

and

R(x,y)={\frac {1}{4))\left(\|x+y\|^{2}-\left(2\|x\|^{2}+2\|y\|^{2}-\|x+y\|^{2}\right)\right)={\frac {1}{2))\left(\|x+y\|^{2}-\|x\|^{2}-\|y\|^{2}\right).

Moreover,

4R(x,y)=\|x+y\|^{2}-\|x-y\|^{2}=\|y+x\|^{2}-\|y-x\|^{2}=4R(y,x),

which proves that

R(x,y)=R(y,x)

From $1=i(-i)$ it follows that $y-ix=i(-iy-x)=-i(x+iy)$ and $y+ix=i(-iy+x)=i(x-iy)$ so that

-4R(y,ix)=\|y-ix\|^{2}-\|y+ix\|^{2}=\|(-i)(x+iy)\|^{2}-\|i(x-iy)\|^{2}=\|x+iy\|^{2}-\|x-iy\|^{2}=4R(x,iy),

which proves that

R(y,ix)=-R(x,iy).

\blacksquare

Unlike its real part, the imaginary part of a complex inner product depends on which argument is antilinear.

Antilinear in first argument

The polarization identities for the inner product $\langle x\,|\,y\rangle ,$ which is antilinear in the first argument, are

{\begin{alignedat}{4}\langle x\,|\,y\rangle &={\frac {1}{4))\left(\|x+y\|^{2}-\|x-y\|^{2}-i\|x+iy\|^{2}+i\|x-iy\|^{2}\right)\\&=R(x,y)-iR(x,iy)\\&=R(x,y)+iR(ix,y)\\\end{alignedat))

where $x,y\in H.$ The second to last equality is similar to the formula expressing a linear functional $\varphi$ in terms of its real part: $\varphi (y)=\operatorname {Re} \varphi (y)-i(\operatorname {Re} \varphi )(iy).$

Antilinear in second argument

The polarization identities for the inner product $\langle x,\ y\rangle ,$ which is antilinear in the second argument, follows from that of $\langle x\,|\,y\rangle$ by the relationship: $\langle x,\ y\rangle :=\langle y\,|\,x\rangle ={\overline {\langle x\,|\,y\rangle ))\quad {\text{ for all ))x,y\in H.$ So for any $x,y\in H,$ ^[4]

{\begin{alignedat}{4}\langle x,\,y\rangle &={\frac {1}{4))\left(\|x+y\|^{2}-\|x-y\|^{2}+i\|x+iy\|^{2}-i\|x-iy\|^{2}\right)\\&=R(x,y)+iR(x,iy)\\&=R(x,y)-iR(ix,y).\\\end{alignedat))

This expression can be phrased symmetrically as:^[5]

\langle x,y\rangle ={\frac {1}{4))\sum _{k=0}^{3}i^{k}\left\|x+i^{k}y\right\|^{2}.

Summary of both cases

Thus if $R(x,y)+iI(x,y)$ denotes the real and imaginary parts of some inner product's value at the point $(x,y)\in H\times H$ of its domain, then its imaginary part will be:

I(x,y)~=~{\begin{cases}~R({\color {red}i}x,y)&\qquad {\text{ if antilinear in the )){\color {red}1}{\text{st argument))\\~R(x,{\color {blue}i}y)&\qquad {\text{ if antilinear in the )){\color {blue}2}{\text{nd argument))\\\end{cases))

where the scalar

i

is always located in the same argument that the inner product is antilinear in.

Using $R(ix,y)=-R(x,iy)$ , the above formula for the imaginary part becomes:

I(x,y)~=~{\begin{cases}-R(x,{\color {black}i}y)&\qquad {\text{ if antilinear in the )){\color {black}1}{\text{st argument))\\-R({\color {black}i}x,y)&\qquad {\text{ if antilinear in the )){\color {black}2}{\text{nd argument))\\\end{cases))

Reconstructing the inner product

In a normed space $(H,\|\cdot \|),$ if the parallelogram law

{\displaystyle \|x+y\|^{2}~+~\|x-y\|^{2}~=~2\|x\|^{2}+2\|y\|^{2))

holds, then there exists a unique inner product

\langle \cdot ,\ \cdot \rangle

H

such that

\|x\|^{2}=\langle x,\ x\rangle

for all

x\in H.

^[4]^[1]

Proof

We will only give the real case here; the proof for complex vector spaces is analogous.

By the above formulas, if the norm is described by an inner product (as we hope), then it must satisfy

\langle x,\ y\rangle ={\frac {1}{4))\left(\|x+y\|^{2}-\|x-y\|^{2}\right)\quad {\text{ for all ))x,y\in H,

which may serve as a definition of the unique candidate

\langle \cdot ,\cdot \rangle

for the role of a suitable inner product. Thus, the uniqueness is guaranteed.

It remains to prove that this formula indeed defines an inner product and that this inner product induces the norm $\|\cdot \|.$ Explicitly, the following will be shown:

$\langle x,x\rangle =\|x\|^{2},\quad x\in H$
$\langle x,y\rangle =\langle y,x\rangle ,\quad x,y\in H$
$\langle x+z,y\rangle =\langle x,y\rangle +\langle z,y\rangle \quad {\text{ for all ))x,y,z\in H,$
$\langle \alpha x,y\rangle =\alpha \langle x,y\rangle \quad {\text{ for all ))x,y\in H{\text{ and all ))\alpha \in \mathbb {R}$

(This axiomatization omits positivity, which is implied by (1) and the fact that $\|\cdot \|$ is a norm.)

For properties (1) and (2), substitute: ${\textstyle \langle x,x\rangle ={\frac {1}{4))\left(\|x+x\|^{2}-\|x-x\|^{2}\right)=\|x\|^{2},}$ and $\|x-y\|^{2}=\|y-x\|^{2}.$

For property (3), it is convenient to work in reverse. It remains to show that

{\displaystyle \|x+z+y\|^{2}-\|x+z-y\|^{2}{\overset {?}{=))\|x+y\|^{2}-\|x-y\|^{2}+\|z+y\|^{2}-\|z-y\|^{2))

or equivalently,

2\left(\|x+z+y\|^{2}+\|x-y\|^{2}\right)-2\left(\|x+z-y\|^{2}+\|x+y\|^{2}\right){\overset {?}{=))2\|z+y\|^{2}-2\|z-y\|^{2}.

Now apply the parallelogram identity:

{\displaystyle 2\|x+z+y\|^{2}+2\|x-y\|^{2}=\|2x+z\|^{2}+\|2y+z\|^{2))

{\displaystyle 2\|x+z-y\|^{2}+2\|x+y\|^{2}=\|2x+z\|^{2}+\|z-2y\|^{2))

Thus it remains to verify:

{\displaystyle {\cancel {\|2x+z\|^{2))}+\|2y+z\|^{2}-({\cancel {\|2x+z\|^{2))}+\|z-2y\|^{2}){\overset {?}((}={))}2\|z+y\|^{2}-2\|z-y\|^{2))

{\displaystyle \|2y+z\|^{2}-\|z-2y\|^{2}{\overset {?}{=))2\|z+y\|^{2}-2\|z-y\|^{2))

But the latter claim can be verified by subtracting the following two further applications of the parallelogram identity:

{\displaystyle \|2y+z\|^{2}+\|z\|^{2}=2\|z+y\|^{2}+2\|y\|^{2))

{\displaystyle \|z-2y\|^{2}+\|z\|^{2}=2\|z-y\|^{2}+2\|y\|^{2))

Thus (3) holds.

It can be verified by induction that (3) implies (4), as long as $\alpha \in \mathbb {Z} .$ But "(4) when $\alpha \in \mathbb {Z}$ " implies "(4) when $\alpha \in \mathbb {Q}$ ". And any positive-definite, real-valued, $\mathbb {Q}$ -bilinear form satisfies the Cauchy–Schwarz inequality, so that $\langle \cdot ,\cdot \rangle$ is continuous. Thus $\langle \cdot ,\cdot \rangle$ must be $\mathbb {R}$ -linear as well.

Another necessary and sufficient condition for there to exist an inner product that induces a given norm $\|\cdot \|$ is for the norm to satisfy Ptolemy's inequality, which is:^[6]

\|x-y\|\,\|z\|~+~\|y-z\|\,\|x\|~\geq ~\|x-z\|\,\|y\|\qquad {\text{ for all vectors ))x,y,z.

Applications and consequences

If $H$ is a complex Hilbert space then $\langle x\mid y\rangle$ is real if and only if its imaginary part is $0=R(x,iy)={\frac {1}{4))\left(\Vert x+iy\Vert ^{2}-\Vert x-iy\Vert ^{2}\right)$ , which happens if and only if $\Vert x+iy\Vert =\Vert x-iy\Vert$ . Similarly, $\langle x\mid y\rangle$ is (purely) imaginary if and only if $\Vert x+y\Vert =\Vert x-y\Vert$ . For example, from $\|x+ix\|=|1+i|\|x\|={\sqrt {2))\|x\|=|1-i|\|x\|=\|x-ix\|$ it can be concluded that $\langle x|x\rangle$ is real and that $\langle x|ix\rangle$ is purely imaginary.

Isometries

If $A:H\to Z$ is a linear isometry between two Hilbert spaces (so $\|Ah\|=\|h\|$ for all $h\in H$ ) then

\langle Ah,Ak\rangle _{Z}=\langle h,k\rangle _{H}\quad {\text{ for all ))h,k\in H;

that is, linear isometries preserve inner products.

If $A:H\to Z$ is instead an antilinear isometry then

\langle Ah,Ak\rangle _{Z}={\overline {\langle h,k\rangle _{H))}=\langle k,h\rangle _{H}\quad {\text{ for all ))h,k\in H.

Relation to the law of cosines

The second form of the polarization identity can be written as

\|{\textbf {u))-{\textbf {v))\|^{2}=\|{\textbf {u))\|^{2}+\|{\textbf {v))\|^{2}-2({\textbf {u))\cdot {\textbf {v))).

This is essentially a vector form of the law of cosines for the triangle formed by the vectors ${\textbf {u))$ , ${\textbf {v))$ , and ${\textbf {u))-{\textbf {v))$ . In particular,

{\textbf {u))\cdot {\textbf {v))=\|{\textbf {u))\|\,\|{\textbf {v))\|\cos \theta ,

where

\theta

is the angle between the vectors

{\textbf {u))

and

{\textbf {v))

The equation is numerically unstable if u and v are similar because of catastrophic cancellation and should be avoided for numeric computation.

Derivation

The basic relation between the norm and the dot product is given by the equation

\|{\textbf {v))\|^{2}={\textbf {v))\cdot {\textbf {v)).

Then

{\begin{aligned}\|{\textbf {u))+{\textbf {v))\|^{2}&=({\textbf {u))+{\textbf {v)))\cdot ({\textbf {u))+{\textbf {v)))\\[3pt]&=({\textbf {u))\cdot {\textbf {u)))+({\textbf {u))\cdot {\textbf {v)))+({\textbf {v))\cdot {\textbf {u)))+({\textbf {v))\cdot {\textbf {v)))\\[3pt]&=\|{\textbf {u))\|^{2}+\|{\textbf {v))\|^{2}+2({\textbf {u))\cdot {\textbf {v))),\end{aligned))

and similarly

\|{\textbf {u))-{\textbf {v))\|^{2}=\|{\textbf {u))\|^{2}+\|{\textbf {v))\|^{2}-2({\textbf {u))\cdot {\textbf {v))).

Forms (1) and (2) of the polarization identity now follow by solving these equations for ${\textbf {u))\cdot {\textbf {v))$ , while form (3) follows from subtracting these two equations. (Adding these two equations together gives the parallelogram law.)

Generalizations

Symmetric bilinear forms

The polarization identities are not restricted to inner products. If $B$ is any symmetric bilinear form on a vector space, and $Q$ is the quadratic form defined by

Q(v)=B(v,v),

then

{\begin{aligned}2B(u,v)&=Q(u+v)-Q(u)-Q(v),\\2B(u,v)&=Q(u)+Q(v)-Q(u-v),\\4B(u,v)&=Q(u+v)-Q(u-v).\end{aligned))

The so-called symmetrization map generalizes the latter formula, replacing $Q$ by a homogeneous polynomial of degree $k$ defined by $Q(v)=B(v,\ldots ,v),$ where $B$ is a symmetric $k$ -linear map.^[7]

The formulas above even apply in the case where the field of scalars has characteristic two, though the left-hand sides are all zero in this case. Consequently, in characteristic two there is no formula for a symmetric bilinear form in terms of a quadratic form, and they are in fact distinct notions, a fact which has important consequences in L-theory; for brevity, in this context "symmetric bilinear forms" are often referred to as "symmetric forms".

These formulas also apply to bilinear forms on modules over a commutative ring, though again one can only solve for $B(u,v)$ if 2 is invertible in the ring, and otherwise these are distinct notions. For example, over the integers, one distinguishes integral quadratic forms from integral symmetric forms, which are a narrower notion.

More generally, in the presence of a ring involution or where 2 is not invertible, one distinguishes $\varepsilon$ -quadratic forms and $\varepsilon$ -symmetric forms; a symmetric form defines a quadratic form, and the polarization identity (without a factor of 2) from a quadratic form to a symmetric form is called the "symmetrization map", and is not in general an isomorphism. This has historically been a subtle distinction: over the integers it was not until the 1950s that relation between "twos out" (integral quadratic form) and "twos in" (integral symmetric form) was understood – see discussion at integral quadratic form; and in the algebraization of surgery theory, Mishchenko originally used symmetric L-groups, rather than the correct quadratic L-groups (as in Wall and Ranicki) – see discussion at L-theory.

Homogeneous polynomials of higher degree

Finally, in any of these contexts these identities may be extended to homogeneous polynomials (that is, algebraic forms) of arbitrary degree, where it is known as the polarization formula, and is reviewed in greater detail in the article on the polarization of an algebraic form.

v t e Hilbert spaces
Basic concepts	Adjoint Inner product and L-semi-inner product Hilbert space and Prehilbert space Orthogonal complement Orthonormal basis
Main results	Bessel's inequality Cauchy–Schwarz inequality Riesz representation
Other results	Hilbert projection theorem Parseval's identity Polarization identity (Parallelogram law)
Maps	Compact operator on Hilbert space Densely defined Hermitian form Hilbert–Schmidt Normal Self-adjoint Sesquilinear form Trace class Unitary
Examples	Cⁿ(K) with K compact & n<∞ Segal–Bargmann F

v t e Banach space topics
Types of Banach spaces	Asplund Banach list Banach lattice Grothendieck Hilbert Inner product space Polarization identity (Polynomially) Reflexive Riesz L-semi-inner product (B Strictly Uniformly) convex Uniformly smooth (Injective Projective) Tensor product (of Hilbert spaces)
Banach spaces are:	Barrelled Complete F-space Fréchet tame Locally convex Seminorms/Minkowski functionals Mackey Metrizable Normed norm Quasinormed Stereotype
Function space Topologies	Banach–Mazur compactum Dual Dual space Dual norm Operator Ultraweak Weak polar operator Strong polar operator Ultrastrong Uniform convergence
Linear operators	Adjoint Bilinear form operator sesquilinear (Un)Bounded Closed Compact on Hilbert spaces (Dis)Continuous Densely defined Fredholm kernel operator Hilbert–Schmidt Functionals positive Pseudo-monotone Normal Nuclear Self-adjoint Strictly singular Trace class Transpose Unitary
Operator theory	Banach algebras C-algebras Operator space Spectrum C-algebra radius Spectral theory of ODEs Spectral theorem Polar decomposition Singular value decomposition
Theorems	Anderson–Kadec Banach–Alaoglu Banach–Mazur Banach–Saks Banach–Schauder (open mapping) Banach–Steinhaus (Uniform boundedness) Bessel's inequality Cauchy–Schwarz inequality Closed graph Closed range Eberlein–Šmulian Freudenthal spectral Gelfand–Mazur Gelfand–Naimark Goldstine Hahn–Banach hyperplane separation Kakutani fixed-point Krein–Milman Lomonosov's invariant subspace Mackey–Arens Mazur's lemma M. Riesz extension Parseval's identity Riesz's lemma Riesz representation Robinson-Ursescu Schauder fixed-point
Analysis	Abstract Wiener space Banach manifold bundle Bochner space Convex series Differentiation in Fréchet spaces Derivatives Fréchet Gateaux functional holomorphic quasi Integrals Bochner Dunford Gelfand–Pettis regulated Paley–Wiener weak Functional calculus Borel continuous holomorphic Measures Lebesgue Projection-valued Vector Weakly / Strongly measurable function
Types of sets	Absolutely convex Absorbing Affine Balanced/Circled Bounded Convex Convex cone (subset) Convex series related ((cs, lcs)-closed, (cs, bcs)-complete, (lower) ideally convex, (Hx), and (Hwx)) Linear cone (subset) Radial Radially convex/Star-shaped Symmetric Zonotope
Subsets / set operations	Affine hull (Relative) Algebraic interior (core) Bounding points Convex hull Extreme point Interior Linear span Minkowski addition Polar (Quasi) Relative interior
Examples	Absolute continuity AC $ba(\Sigma )$ c space Banach coordinate BK Besov $B_{p,q}^{s}(\mathbb {R} )$ Birnbaum–Orlicz Bounded variation BV Bs space Continuous C(K) with K compact Hausdorff Hardy H^p Hilbert H Morrey–Campanato $L^{\lambda ,p}(\Omega )$ ℓ^p ${\displaystyle \ell ^{\infty ))$ L^p ${\displaystyle L^{\infty ))$ weighted Schwartz $S\left(\mathbb {R} ^{n}\right)$ Segal–Bargmann F Sequence space Sobolev W^k,p Sobolev inequality Triebel–Lizorkin Wiener amalgam $W(X,L^{p})$
Applications	Differential operator Finite element method Mathematical formulation of quantum mechanics Ordinary Differential Equations (ODEs) Validated numerics