The Pythagorean theorem
Formal theorems constructed in a formal language are a subset of the language's formulas. The set of formulas may be broadly divided into theorems and non-theorems. However, quite often, a formal system will simply define all of its formulas as theorems.[1]

A theorem is an idea, concept or abstraction token instances of which are formed using a string of symbols according to both the syntactic rules of a language (also called its grammar) and the transformation rules of a formal system. Specifically, a theorem is always the last formula of a derivation in some formal system each formula of which is a logical consequence of the formulas which came before it in the derivation. The initially accepted formulas in the derivation are called its axioms, and are the basis on which the theorem is derived. A set of theorems is called a theory.

What makes theorems useful and of interest is that they can be interpreted as true propositions and their derivations may be interpreted as a proof of the truth of the resulting expression. Very often, in mathematical logic a theorem is expressed in a formal language, in which case it may be called a formal theorem. The syntactic rules of the formal language may be called its formal grammar or formation rules. A set of formal theorems may be referred to as a formal theory. A theorem whose interpretation is a true statement about a formal system is called a metatheorem.

The concept of a theorem is fundamentally syntactic, in contrast to the notion of a "true proposition" in which semantics are introduced. Different deductive systems may be constructed so as to yield other interpretations, depending on the presumptions of the derivation rules (i.e. belief, justification or other modalities). The soundness of a formal system depends on whether or not all of its theorems are also tautologies. A tautology is a formula that is true under any possible interpretation. A formal system is considered semantically complete when all of its tautologies are also theorems.

Although they can be written in a completely symbolic form using, for example, propositional calculus, theorems are often expressed in a natural language such as English. The same is true of proofs, which are often expressed as logically organized and clearly worded informal arguments intended to demonstrate that a formal symbolic proof can be constructed. Such arguments are typically easier to check than purely symbolic ones — indeed, many mathematicians would express a preference for a proof that not only demonstrates the validity of a theorem, but also explains in some way why it is obviously true. In some cases, a picture alone may be sufficient to prove a theorem. Because theorems lie at the core of mathematics, they are also central to its aesthetics. Theorems are often described as being "trivial", or "difficult", or "deep", or even "beautiful". These subjective judgments vary not only from person to person, but also with time: for example, as a proof is simplified or better understood, a theorem that was once difficult may become trivial. On the other hand, a deep theorem may be simply stated, but its proof may involve surprising and subtle connections between disparate areas of mathematics. Fermat's Last Theorem is a particularly well-known example of such a theorem.

Formal and informal notions

Logically most theorems are of the form of an indicative conditional: if A, then B. Such a theorem does not state that B is always true, only that B must be true if A is true. In this case A is called the hypothesis of the theorem (note that "hypothesis" here is something very different from a conjecture) and B the conclusion. The theorem "If n is an even natural number then n/2 is a natural number" is a typical example in which the hypothesis is that n is an even natural number and the conclusion is that n/2 is also a natural number.

In order to be proven, a theorem must be expressible as a precise, formal statement. Nevertheless, theorems are usually expressed in natural language rather than in a completely symbolic form, with the intention that the reader will be able to produce a formal statement from the informal one. In addition, there are often hypotheses which are understood in context, rather than explicitly stated.

It is common in mathematics to choose a number of hypotheses that are assumed to be true within a given theory, and then declare that the theory consists of all theorems provable using those hypotheses as assumptions. In this case the hypotheses that form the foundational basis are called the axioms (or postulates) of the theory. The field of mathematics known as proof theory studies formal axiom systems and the proofs that can be performed within them.

A planar map with five colors such that no two regions with the same color meet. It can actually be colored in this way with only four colors. The four color theorem states that such colorings are possible for any planar map, but every known proof involves a computational search that is too long to check by hand.

Some theorems are "trivial," in the sense that they follow from definitions, axioms, and other theorems in obvious ways and do not contain any surprising insights. Some, on the other hand, may be called "deep": their proofs may be long and difficult, involve areas of mathematics superficially distinct from the statement of the theorem itself, or show surprising connections between disparate areas of mathematics.[2] A theorem might be simple to state and yet be deep. An excellent example is Fermat's Last Theorem, and there are many other examples of simple yet deep theorems in number theory and combinatorics, among other areas.

There are other theorems for which a proof is known, but the proof cannot easily be written down. The most prominent examples are the four color theorem and the Kepler conjecture. Both of these theorems are only known to be true by reducing them to a computational search which is then verified by a computer program. Initially, many mathematicians did not accept this form of proof, but it has become more widely accepted in recent years. The mathematician Doron Zeilberger has even gone so far as to claim that these are possibly the only nontrivial results that mathematicians have ever proved.[1] Many mathematical theorems can be reduced to more straightforward computation, including polynomial identities, trigonometric identities and hypergeometric identities.[3]

Syntax and semantics

Main articles: Syntax (logic) and Formal semantics

Derivation of a theorem

Main article: Formal proof

The notion of a theorem is deeply intertwined with its derivation. Given a particular very simple formal system (we shall call ours ) whose alphabet α consists only of two symbols { , } and whose formation rule for formulas is:

'Any string of symbols of which is at least 3 symbols long, and which is not infinitely long, is a formula of . Nothing else is a formula of .'

The single axiom of is:

' '

The transformation rule for is:

'Any occurrence of '' in a formula of may be replaced by an occurance of the string ' ' and the result is a formula (wff) of .'

Theorems in are defined as those formulae in of which a derivation can be constructed, the last line of which is that formula.

1) (Given as axiom)
2) (by applying the transformation rule)
3) (by applying the transformation rule)

Therefore ' ' is a theorem of . The string ' ' is not considered to be "true" or "proved," until it is given an interpretation it is merely derived.

Two metatheorems of are:

'Every theorem of begins with ''
'Every theorem of has exactly two triangles.'

Interpretation of a theorem

Main article: Interpretation (logic)

Given a different simple formal system (we shall call this one ) whose alphabet α consists only of three symbols { , , } and whose formation rule for formulas is:

'Any string of symbols of which is at least 6 symbols long, and which is not infinitely long, is a formula of . Nothing else is a formula of .'

The single axiom schema of is:

" * * " (where " * " is a metasyntactic variable standing for a finite string of " "s )

A formal proof can be constructed as follows:

(1)
(2)
(3)

In this example the theorem produced " " can be interpreted as menaing "One plus four equals five." A different interpretation would be to read it backwards as "Five minus four equals one."== Proof of a theorem ==

A derivation of a theorem can be interpreted as a proof of the truth of some proposition. To establish a proposition as a theorem, the existence of a line of reasoning from axioms in the system (and other, already established theorems) to the given statement must be demonstrated.

Although the proof is necessary to produce a theorem, it is not usually considered part of the theorem. And even though more than one proof may be known for a single theorem, only one proof is required to establish the theorem's validity. The Pythagorean theorem and the law of quadratic reciprocity are contenders for the title of theorem with the greatest number of distinct proofs.

The definition of theorems as elements of a formal language allows for results in proof theory that study the structure of formal proofs and the structure of provable formulas. The most famous result is Gödel's incompleteness theorem; by representing theorems about basic number theory as expressions in a formal language, and then representing this language within number theory itself, Gödel constructed examples of statements that are neither provable nor disprovable from axiomatizations of number theory.

Relation to theories

Main articles: Theory and Scientific theory

A theory is a collection of theorems. The main broad distinction to be made among different types of theories is whether or not their theorems are intended to be interpreted as representing a true proposition which is empirical or non-empirical. If every theorem of a theory is empirical, then it is a scientific theory (also called an empirical theory). If there is even one non-empirical theorem in the theory, it is a non-empirical theory.

Theorems in mathematics and logic are non-empirical because they have no content until given an interpretation. This is precisely why theorems of mathematics and logic are useful. They can be applied to any subject matter no matter what its content. If the content of a theory consists entirely of theorems whose interpretations are statements about observation and sense data, than it is a scientific theory.

It is not correct to say that a scientific theory is either true or false as its theorems may be. A theory is taken as a whole as strongly or weakly supported by the facts. A key attribute of a scientific theory is that it is falsifiable, that is, it makes predictions about the natural world that are testable by experiments. Any disagreement between prediction and experiment demonstrates the incorrectness of the scientific theory, or at least limits its accuracy or domain of validity. Mathematical theorems, on the other hand, are purely abstract formal statements: the proof of a theorem cannot involve experiments or other empirical evidence in the same way such evidence is used to support scientific theories.

The Collatz conjecture: one way to illustrate its complexity is to extend the iteration from the natural numbers to the complex numbers. The result is a fractal, which (in accordance with universality) resembles the Mandelbrot set.

Nonetheless, there is some degree of empiricism and data collection involved in the discovery of mathematical theorems. By establishing a pattern, sometimes with the use of a powerful computer, mathematicians may have an idea of what to prove, and in some cases even a plan for how to set about doing the proof. For example, the Collatz conjecture has been verified for start values up to about 2.88 × 1018. The Riemann hypothesis has been verified for the first 10 trillion zeroes of the zeta function. Neither of these statements is considered to be proven.

Such evidence does not constitute proof. For example, the Mertens conjecture is a statement about natural numbers that is now known to be false, but no explicit counterexample (i.e., a natural number n for which the Mertens function M(n) equals or exceeds the square root of n) is known: all numbers less than 1014 have the Mertens property, and the smallest number which does not have this property is only known to be less than the exponential of 1.59 × 1040, which is approximately 10 to the power 4.3 × 1039. Since the number of particles in the universe is generally considered to be less than 10 to the power 100 (a googol), there is no hope to find an explicit counterexample by exhaustive search at present.

Note that the word "theory" also exists in mathematics, to denote a body of mathematical axioms, definitions and theorems, as in, for example, group theory. There are also "theorems" in science, particularly physics, and in engineering, but they often have statements and proofs in which physical assumptions and intuition play an important role; the physical axioms on which such "theorems" are based are themselves falsifiable.

Terminology

Theorems are often indicated by several other terms: the actual label "theorem" is reserved for the most important results, whereas results which are less important, or distinguished in other ways, are named by different terminology.

There are other terms, less commonly used, which are conventionally attached to proven statements, so that certain theorems are referred to by historical or customary names. For examples:

A few well-known theorems have even more idiosyncratic names. The division algorithm is a theorem expressing the outcome of division in the natural numbers and more general rings. The Banach–Tarski paradox is a theorem in measure theory that is paradoxical in the sense that it contradicts common intuitions about volume in three-dimensional space.

An unproven statement that is believed to be true is called a conjecture (or sometimes a hypothesis, but with a different meaning from the one discussed above). To be considered a conjecture, a statement must usually be proposed publicly, at which point the name of the proponent may be attached to the conjecture, as with Goldbach's conjecture. Other famous conjectures include the Collatz conjecture and the Riemann hypothesis.

Layout

A theorem and its proof are typically laid out as follows:

Theorem (name of person who proved it and year of discovery, proof or publication).
Statement of theorem (sometimes called the proposition).
Proof.
Description of proof.
End mark.

The end of the proof may be signalled by the letters Q.E.D. meaning "quod erat demonstrandum" or by one of the tombstone marks "

□" or "

∎" meaning "End of Proof", introduced by Paul Halmos following their usage in magazine articles.

The exact style will depend on the author or publication. Many publications provide instructions or macros for typesetting in the house style.

It is common for a theorem to be preceded by definitions describing the exact meaning of the terms used in the theorem. It is also common for a theorem to be preceded by a number of propositions or lemmas which are then used in the proof. However, lemmas are sometimes embedded in the proof of a theorem, either with nested proofs, or with their proofs presented after the proof of the theorem.

Corollaries to a theorem are either presented between the theorem and the proof, or directly after the proof. Sometimes corollaries have proofs of their own which explain why they follow from the theorem.

Lore

It has been estimated that over a quarter of a million theorems are proved every year.[5]

The well-known aphorism, "A mathematician is a device for turning coffee into theorems", is probably due to Alfréd Rényi, although it is often attributed to Rényi's colleague Paul Erdős (and Rényi may have been thinking of Erdős), who was famous for the many theorems he produced, the number of his collaborations, and his coffee drinking.[6]

The classification of finite simple groups is regarded by some to be the longest proof of a theorem; it comprises tens of thousands of pages in 500 journal articles by some 100 authors. These papers are together believed to give a complete proof, and there are several ongoing projects to shorten and simplify this proof.[7]

See also

Notes

  1. ^ Godel, Escher, Bach: An Eternal Golden Braid, Douglas Hofstadter
  2. ^ See Deep Theorem, cited below.
  3. ^ Petkovsek et al. 1996.
  4. ^ The word law can also refer to an axiom, a rule of inference, or, in probability theory, a probability distribution.
  5. ^ Hoffman 1998, p. 204.
  6. ^ Hoffman 1998, p. 7.
  7. ^ An enormous theorem: the classification of finite simple groups, Richard Elwes, Plus Magazine, Issue 41 December 2006.

References