
In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules can be applied to a nonterminal symbol regardless of its context. In particular, in a context-free grammar, each production rule is of the form

$$A \to \alpha$$

with $A$ a single nonterminal symbol and $\alpha$ a string of terminals and/or nonterminals ($\alpha$ can be empty). Regardless of which symbols surround it, the single nonterminal $A$ on the left-hand side can always be replaced by $\alpha$ on the right-hand side. This distinguishes it from a context-sensitive grammar, which can have production rules of the form $\alpha A\beta \rightarrow \alpha \gamma \beta$ with $A$ a nonterminal symbol and $\alpha$, $\beta$, and $\gamma$ strings of terminal and/or nonterminal symbols.
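The defining property can be illustrated with a minimal Python sketch. The representation is an assumption made only for this illustration: a sentential form is a tuple of symbols, uppercase strings stand for nonterminals, and the helper names `is_context_free_rule` and `apply_rule` are hypothetical.

```python
# Illustrative convention (an assumption, not part of the definition):
# uppercase strings are nonterminals, everything else is a terminal.
def is_nonterminal(symbol):
    return symbol.isupper()


def is_context_free_rule(lhs, rhs):
    """A rule lhs -> rhs is context-free when lhs is exactly one nonterminal.

    rhs may be any tuple of symbols, including the empty tuple.
    """
    return len(lhs) == 1 and is_nonterminal(lhs[0])


def apply_rule(form, index, nonterminal, rhs):
    """Replace the nonterminal at position index by rhs.

    The symbols to the left and right of the position are never inspected,
    which is what "regardless of its context" means.
    """
    if form[index] != nonterminal:
        raise ValueError("rule does not apply at this position")
    return form[:index] + rhs + form[index + 1:]
```

For instance, `apply_rule(("x", "A", "y"), 1, "A", ("a", "b"))` yields `("x", "a", "b", "y")` whatever `"x"` and `"y"` are, while a rule with left-hand side `("a", "A", "b")` is rejected by `is_context_free_rule` because its left-hand side is longer than one symbol.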

A formal grammar is essentially a set of production rules that describe all possible strings in a given formal language. Production rules are simple replacements. For example, the first rule in the picture,

$$\langle \text{Stmt} \rangle \to \langle \text{Id} \rangle = \langle \text{Expr} \rangle ;$$

replaces $\langle \text{Stmt} \rangle$ with $\langle \text{Id} \rangle = \langle \text{Expr} \rangle ;$. There can be multiple replacement rules for a given nonterminal symbol. The language generated by a grammar is the set of all strings of terminal symbols that can be derived, by repeated rule applications, from some particular nonterminal symbol ("start symbol"). Nonterminal symbols are used during the derivation process, but do not appear in its final result string.
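The notion of a generated language can be made concrete with a small enumeration sketch in Python. Everything here is an assumption for illustration: the function name `generate`, the dictionary encoding of rules, the toy grammar S → aSb | ε, and the size cap used to keep the search finite. The cap is a heuristic and could discard valid derivations in grammars whose intermediate forms grow large before shrinking.

```python
from collections import deque


def generate(rules, start, max_len):
    """Return the terminal strings of length <= max_len derivable from start.

    rules maps each nonterminal to a list of right-hand sides (tuples of
    symbols). Symbols that are keys of rules are nonterminals; all others
    are terminals.
    """
    results = set()
    seen = {(start,)}
    queue = deque([(start,)])
    while queue:
        form = queue.popleft()
        # Locate the leftmost nonterminal, if any remains.
        pos = next((i for i, s in enumerate(form) if s in rules), None)
        if pos is None:
            # Only terminals are left: this is a string of the language.
            results.add("".join(form))
            continue
        for rhs in rules[form[pos]]:
            new_form = form[:pos] + rhs + form[pos + 1:]
            terminals = sum(1 for s in new_form if s not in rules)
            # Prune forms with too many terminals, or beyond an arbitrary
            # size cap that guarantees termination of this sketch.
            if terminals > max_len or len(new_form) > 2 * max_len + 2:
                continue
            if new_form not in seen:
                seen.add(new_form)
                queue.append(new_form)
    return results


# Toy grammar (hypothetical example): S -> a S b | empty string
rules = {"S": [("a", "S", "b"), ()]}
print(sorted(generate(rules, "S", 4), key=len))  # ['', 'ab', 'aabb']
```

The nonterminal `S` appears in the intermediate forms such as `aSb`, but never in the strings that are returned.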

Languages generated by context-free grammars are known as context-free languages (CFL). Different context-free grammars can generate the same context-free language. It is important to distinguish the properties of the language (intrinsic properties) from the properties of a particular grammar (extrinsic properties). The language equality question (do two given context-free grammars generate the same language?) is undecidable.

Context-free grammars arise in linguistics, where they are used to describe the structure of sentences and words in a natural language; they were invented by the linguist Noam Chomsky for this purpose. In computer science, their use grew as recursively defined concepts became more common. An early application was describing the structure of programming languages. A newer application is the document type definition, an essential part of the Extensible Markup Language (XML).^[2]

In linguistics, some authors use the term phrase structure grammar to refer to context-free grammars, whereby phrase-structure grammars are distinct from dependency grammars. In computer science, a popular notation for context-free grammars is Backus–Naur form, or BNF.

Example grammar:
S → Bb | Cc | Ee
B → Bb | b
C → C
D → Bd | Cd | d
E → Ee
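
The example grammar above is a convenient input for the productiveness and reachability checks named in the outline below. The following Python sketch is one way to compute them by fixed-point iteration and graph search; the encoding (one character per symbol, dictionary keys as nonterminals) and the function names `productive` and `reachable` are assumptions made for this illustration, with S taken as the start symbol.

```python
# The example grammar: keys are nonterminals, each right-hand side is a
# string with one character per symbol (lowercase letters are terminals).
GRAMMAR = {
    "S": ["Bb", "Cc", "Ee"],
    "B": ["Bb", "b"],
    "C": ["C"],
    "D": ["Bd", "Cd", "d"],
    "E": ["Ee"],
}


def productive(grammar):
    """Nonterminals that can derive at least one string of terminals."""
    found = set()
    changed = True
    while changed:  # repeat until no new productive nonterminal appears
        changed = False
        for nonterminal, alternatives in grammar.items():
            if nonterminal in found:
                continue
            # Productive if some alternative consists only of terminals
            # and nonterminals already known to be productive.
            if any(all(s not in grammar or s in found for s in rhs)
                   for rhs in alternatives):
                found.add(nonterminal)
                changed = True
    return found


def reachable(grammar, start):
    """Nonterminals that occur in some form derivable from start."""
    seen = {start}
    stack = [start]
    while stack:
        nonterminal = stack.pop()
        for rhs in grammar[nonterminal]:
            for symbol in rhs:
                if symbol in grammar and symbol not in seen:
                    seen.add(symbol)
                    stack.append(symbol)
    return seen


print(sorted(productive(GRAMMAR)))      # ['B', 'D', 'S']
print(sorted(reachable(GRAMMAR, "S")))  # ['B', 'C', 'E', 'S']
```

Under these assumptions, C and E are not productive (every rule for them reintroduces the same nonterminal), and D is productive but cannot be reached from S.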

Background

Formal definitions

Production rule notation

Rule application

Repetitive rule application

Context-free language

Examples

Words concatenated with their reverse

Well-formed parentheses

Well-formed nested parentheses and square brackets

Matching pairs

Distinct number of a's and b's

Second block of b's of double size

First-order logic formulas

Examples of languages that are not context free

Regular grammars

Derivations and syntax trees

Normal forms

Closure properties

Decidable problems

Parsing

Reachability, productiveness, nullability

Regularity and LL(k) checks

Emptiness and finiteness

Undecidable problems

Universality

Language equality

Language inclusion

Being in a lower or higher level of the Chomsky hierarchy

Grammar ambiguity

Language disjointness

Extensions

Subclasses

Linguistic applications

See also

References

Notes

Further reading

External links