Boolean grammar

Boolean grammars, introduced by Alexander Okhotin, are a class of formal grammars studied in formal language theory. They extend context-free grammars, the basic type of grammars, with the operations of conjunction and negation. In addition to these explicit operations, Boolean grammars retain the implicit disjunction expressed by giving several rules for a single nonterminal symbol; this disjunction is the only logical connective expressible in context-free grammars. Conjunction and negation can be used, in particular, to specify the intersection and complement of languages. An intermediate class, the conjunctive grammars, allows conjunction and disjunction but not negation.

The rules of a Boolean grammar are of the form

$$A \to \alpha_1 \And \ldots \And \alpha_m \And \lnot\beta_1 \And \ldots \And \lnot\beta_n $$

where $$A$$ is a nonterminal, $$m+n \ge 1$$ and $$\alpha_1$$, ..., $$\alpha_m$$, $$\beta_1$$, ..., $$\beta_n$$ are strings formed of symbols from the terminal alphabet $$\Sigma$$ and the set of nonterminals $$N$$. Informally, such a rule asserts that every string $$w$$ over $$\Sigma$$ that satisfies each of the syntactical conditions represented by $$\alpha_1$$, ..., $$\alpha_m$$ and none of the syntactical conditions represented by $$\beta_1$$, ..., $$\beta_n$$ therefore satisfies the condition defined by $$A$$.
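
As an illustration of rules of this form, consider the following small grammar over $$\Sigma = \{a, b\}$$, a commonly cited example written here in the notation above. Alternatives separated by $$\mid$$ stand for several rules for the same nonterminal, and $$\varepsilon$$ denotes the empty string:

$$\begin{aligned} S &\to \lnot AB \And \lnot BA \And C \\ A &\to XAX \mid a \\ B &\to XBX \mid b \\ C &\to XXC \mid \varepsilon \\ X &\to a \mid b \end{aligned}$$

Here $$C$$ describes all strings of even length, while $$AB$$ and $$BA$$ together describe the even-length strings whose two halves differ in at least one position. Under the informal reading of the rules, $$S$$ therefore describes the strings of even length whose halves coincide, that is, the language $$\{ww \mid w \in \{a, b\}^*\}$$, which is not context-free.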

There exist several formal definitions of the language generated by a Boolean grammar. They have one thing in common: if the grammar is represented as a system of language equations with union, intersection, complementation and concatenation, the languages generated by the grammar must constitute a solution of this system. The semantics differ in details: some define the languages using language equations, while others draw upon ideas from the field of logic programming. However, these nontrivial issues of formal definition are mostly irrelevant for practical considerations, and one can construct grammars according to the given informal semantics. The practical properties of the model are similar to those of conjunctive grammars, while the descriptional capabilities are further improved. In particular, some practically useful properties inherited from context-free grammars, such as efficient parsing algorithms, are retained.
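
The remark on parsing can be illustrated with a minimal sketch of a tabular recognizer in the style of the CYK algorithm. It assumes a restricted rule format chosen here for illustration: every rule is either a single terminal or a conjunction of positive and negated pairs of nonterminals, and the empty string is not handled. Because a pair of nonterminals can only split a string into two nonempty parts, membership of a substring depends only on strictly shorter substrings, so negation is well defined. The toy grammar, the dictionary layout and the names `TERMINAL_RULES`, `BINARY_RULES` and `recognizes` are all hypothetical.

```python
from functools import lru_cache

# Hypothetical toy grammar over {a, b} in the restricted format assumed above.
# S generates { a^n b^n : n >= 1 }.
# N generates the strings of length >= 2 that are NOT of that form.
TERMINAL_RULES = {
    "A": {"a"},
    "B": {"b"},
    "X": {"a", "b"},
    "Y": {"a", "b"},
}

# Each rule is (positive_conjuncts, negated_conjuncts); a conjunct is a
# pair of nonterminals. Several rules for one nonterminal mean disjunction.
BINARY_RULES = {
    "S": [([("A", "B")], []), ([("A", "T")], [])],
    "T": [([("S", "B")], [])],
    "Y": [([("X", "Y")], [])],          # Y: any nonempty string
    "N": [([("X", "Y")], [("A", "B"), ("A", "T")])],
}


def recognizes(start, text):
    """Return True if nonterminal `start` generates the nonempty string `text`."""

    @lru_cache(maxsize=None)
    def derives(nonterminal, i, j):
        # Does `nonterminal` generate text[i:j]? Requires j > i.
        if j - i == 1:
            return text[i] in TERMINAL_RULES.get(nonterminal, set())

        def splits(pair):
            # True if text[i:j] can be cut into two nonempty parts
            # generated by the two nonterminals of the pair.
            left, right = pair
            return any(
                derives(left, i, k) and derives(right, k, j)
                for k in range(i + 1, j)
            )

        # A rule applies if every positive conjunct holds and no negated one does.
        return any(
            all(splits(p) for p in positive)
            and not any(splits(q) for q in negated)
            for positive, negated in BINARY_RULES.get(nonterminal, [])
        )

    if not text:
        return False  # the empty string is outside the scope of this sketch
    return derives(start, 0, len(text))
```

For example, `recognizes("S", "aabb")` holds while `recognizes("N", "aabb")` does not, and `recognizes("N", "aab")` holds. With memoization, each combination of nonterminal and substring is evaluated once and tries every split point, which gives a running time cubic in the length of the input for a fixed grammar.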