In linear algebra, the transpose of a matrix is an operator which flips a matrix over its diagonal; that is, it switches the row and column indices of the matrix A by producing another matrix, often denoted by A^{T} (among other notations).^{[1]}
The transpose of a matrix was introduced in 1858 by the British mathematician Arthur Cayley.^{[2]} In the case of a logical matrix representing a binary relation R, the transpose corresponds to the converse relation R^{T}.
Note that this article assumes that matrices are taken over a commutative ring. These results may not hold in the non-commutative case. |
The transpose of a matrix A, denoted by A^{T},^{[3]} ^{⊤}A, A^{⊤}, ,^{[4]}^{[5]} A′,^{[6]} A^{tr}, ^{t}A or A^{t}, may be constructed by any one of the following methods:
Formally, the i-th row, j-th column element of A^{T} is the j-th row, i-th column element of A:
If A is an m × n matrix, then A^{T} is an n × m matrix.
In the case of square matrices, A^{T} may also denote the Tth power of the matrix A. For avoiding a possible confusion, many authors use left upperscripts, that is, they denote the transpose as ^{T}A. An advantage of this notation is that no parentheses are needed when exponents are involved: as (^{T}A)^{n} = ^{T}(A^{n}), notation ^{T}A^{n} is not ambiguous.
In this article this confusion is avoided by never using the symbol T as a variable name.
A square matrix whose transpose is equal to itself is called a symmetric matrix; that is, A is symmetric if
A square matrix whose transpose is equal to its negative is called a skew-symmetric matrix; that is, A is skew-symmetric if
A square complex matrix whose transpose is equal to the matrix with every entry replaced by its complex conjugate (denoted here with an overline) is called a Hermitian matrix (equivalent to the matrix being equal to its conjugate transpose); that is, A is Hermitian if
A square complex matrix whose transpose is equal to the negation of its complex conjugate is called a skew-Hermitian matrix; that is, A is skew-Hermitian if
A square matrix whose transpose is equal to its inverse is called an orthogonal matrix; that is, A is orthogonal if
A square complex matrix whose transpose is equal to its conjugate inverse is called a unitary matrix; that is, A is unitary if
Let A and B be matrices and c be a scalar.
If A is an m × n matrix and A^{T} is its transpose, then the result of matrix multiplication with these two matrices gives two square matrices: A A^{T} is m × m and A^{T} A is n × n. Furthermore, these products are symmetric matrices. Indeed, the matrix product A A^{T} has entries that are the inner product of a row of A with a column of A^{T}. But the columns of A^{T} are the rows of A, so the entry corresponds to the inner product of two rows of A. If p_{i j} is the entry of the product, it is obtained from rows i and j in A. The entry p_{j i} is also obtained from these rows, thus p_{i j} = p_{j i}, and the product matrix (p_{i j}) is symmetric. Similarly, the product A^{T} A is a symmetric matrix.
A quick proof of the symmetry of A A^{T} results from the fact that it is its own transpose:
See also: In-place matrix transposition |
On a computer, one can often avoid explicitly transposing a matrix in memory by simply accessing the same data in a different order. For example, software libraries for linear algebra, such as BLAS, typically provide options to specify that certain matrices are to be interpreted in transposed order to avoid the necessity of data movement.
However, there remain a number of circumstances in which it is necessary or desirable to physically reorder a matrix in memory to its transposed ordering. For example, with a matrix stored in row-major order, the rows of the matrix are contiguous in memory and the columns are discontiguous. If repeated operations need to be performed on the columns, for example in a fast Fourier transform algorithm, transposing the matrix in memory (to make the columns contiguous) may improve performance by increasing memory locality.
Ideally, one might hope to transpose a matrix with minimal additional storage. This leads to the problem of transposing an n × m matrix in-place, with O(1) additional storage or at most storage much less than mn. For n ≠ m, this involves a complicated permutation of the data elements that is non-trivial to implement in-place. Therefore, efficient in-place matrix transposition has been the subject of numerous research publications in computer science, starting in the late 1950s, and several algorithms have been developed.
See also: Transpose of a linear map |
As the main use of matrices is to represent linear maps between finite-dimensional vector spaces, the transpose is an operation on matrices that may be seen as the representation of some operation on linear maps.
This leads to a much more general definition of the transpose that works on every linear map, even when linear maps cannot be represented by matrices (such as in the case of infinite dimensional vector spaces). In the finite dimensional case, the matrix representing the transpose of a linear map is the transpose of the matrix representing the linear map, independently of the basis choice.
Main article: Transpose of a linear map |
Let X^{#} denote the algebraic dual space of an R-module X. Let X and Y be R-modules. If u : X → Y is a linear map, then its algebraic adjoint or dual,^{[8]} is the map u^{#} : Y^{#} → X^{#} defined by f ↦ f ∘ u. The resulting functional u^{#}(f) is called the pullback of f by u. The following relation characterizes the algebraic adjoint of u^{[9]}
where ⟨•, •⟩ is the natural pairing (i.e. defined by ⟨h, z⟩ := h(z)). This definition also applies unchanged to left modules and to vector spaces.^{[10]}
The definition of the transpose may be seen to be independent of any bilinear form on the modules, unlike the adjoint (below).
The continuous dual space of a topological vector space (TVS) X is denoted by X'. If X and Y are TVSs then a linear map u : X → Y is weakly continuous if and only if u^{#}(Y') ⊆ X', in which case we let ^{t}u : Y' → X' denote the restriction of u^{#} to Y'. The map ^{t}u is called the transpose^{[11]} of u.
If the matrix A describes a linear map with respect to bases of V and W, then the matrix A^{T} describes the transpose of that linear map with respect to the dual bases.
Main article: Bilinear form |
Every linear map to the dual space u : X → X^{#} defines a bilinear form B : X × X → F, with the relation B(x, y) = u(x)(y). By defining the transpose of this bilinear form as the bilinear form ^{t}B defined by the transpose ^{t}u : X^{##} → X^{#} i.e. ^{t}B(y, x) = ^{t}u(Ψ(y))(x), we find that B(x, y) = ^{t}B(y, x). Here, Ψ is the natural homomorphism X → X^{##} into the double dual.
Not to be confused with Hermitian adjoint. |
If the vector spaces X and Y have respectively nondegenerate bilinear forms B_{X} and B_{Y}, a concept known as the adjoint, which is closely related to the transpose, may be defined:
If u : X → Y is a linear map between vector spaces X and Y, we define g as the adjoint of u if g : Y → X satisfies
These bilinear forms define an isomorphism between X and X^{#}, and between Y and Y^{#}, resulting in an isomorphism between the transpose and adjoint of u. The matrix of the adjoint of a map is the transposed matrix only if the bases are orthonormal with respect to their bilinear forms. In this context, many authors however, use the term transpose to refer to the adjoint as defined here.
The adjoint allows us to consider whether g : Y → X is equal to u^{ −1} : Y → X. In particular, this allows the orthogonal group over a vector space X with a quadratic form to be defined without reference to matrices (nor the components thereof) as the set of all linear maps X → X for which the adjoint equals the inverse.
Over a complex vector space, one often works with sesquilinear forms (conjugate-linear in one argument) instead of bilinear forms. The Hermitian adjoint of a map between such spaces is defined similarly, and the matrix of the Hermitian adjoint is given by the conjugate transpose matrix if the bases are orthonormal.