User:Jaydavidmartin/Complexity class

In computational complexity theory, a complexity class is a set of computational problems of related resource-based complexity. The two most common resources considered are time and memory.

In general, a complexity class is defined in terms of a type of computational problem, a model of computation, and a bounded resource like time or memory. In particular, most complexity classes consist of decision problems that are solvable with a Turing machine, and are differentiated by their time or space (memory) requirements. For instance, the class P is the set of decision problems solvable by a deterministic Turing machine in polynomial time. There are, however, many complexity classes defined in terms of other types of problems (e.g. counting problems and function problems) and using other models of computation (e.g. probabilistic Turing machines, interactive proof systems, Boolean circuits, and quantum computers).

The study of the relationships between complexity classes is a major area of research in theoretical computer science. There are often general hierarchies of complexity classes; for example, it is known that a number of fundamental time and space complexity classes relate to each other in the following way: NL$$\subseteq$$P$$\subseteq$$NP$$\subseteq$$PSPACE$$\subseteq$$EXPTIME$$\subseteq$$EXPSPACE. However, many relationships are not yet known; for example, one of the most famous open problems in computer science concerns whether or not P equals NP. The relationships between classes often answer questions about the fundamental nature of computation. The P versus NP problem, for instance, is directly related to questions of whether nondeterminism adds any computational power to computers and whether problems whose solution can be quickly checked for correctness can also be quickly solved.

Background
Complexity classes are sets of related computational problems. They are defined in terms of the computational difficulty of solving the problems contained within them with respect to particular computational resources like time and memory. More formally, the definition of a complexity class consists of three things: a type of computational problem, a model of computation, and a bounded computational resource. In particular, most complexity classes consist of decision problems that can be solved by a Turing machine with bounded time or space resources. For example, the complexity class P is defined as the set of decision problems that can be solved by a deterministic Turing machine in polynomial time.

Computational problems
Intuitively, a computational problem is just a question that a computer is able to answer. For example, "is the natural number $$n$$ prime?" is a problem that a computer could solve. A computational problem is mathematically represented as the set of answers to the problem. In the primality example, the problem (call it $$PRIME$$) is represented by the set of all natural numbers that are prime: $$PRIME = \{ n \in \mathbb{N} | n \text{ is prime}\}$$. In the theory of computation, these answers are represented as strings; for example, in the primality example the natural numbers could be represented as strings of bits that represent binary numbers. For this reason, computational problems are often synonymously referred to as languages; for example, saying that the $$PRIME$$ problem is in the complexity class NP is equivalent to saying that the language $$PRIME$$ is in NP.

Decision problems
The most commonly analyzed problems in theoretical computer science are decision problems—the kinds of problems that can be posed as yes-no questions. The primality example above, for instance, is an example of a decision problem as it can be represented by the yes-no question "is the natural number $$n$$ prime". In terms of the theory of computation, a decision problem is represented as the set of input strings that a computer running a correct algorithm would answer "yes" to. In the primality example, $$PRIME$$ is the set of strings representing natural numbers that, when input into a computer running an algorithm that correctly tests for primality, the algorithm answers "yes, this number is prime" to. This "yes-no" format is often equivalently stated as "accept-reject"; that is, an algorithm "accepts" an input string if the answer to the decision problem is "yes" and "rejects" if the answer is "no".

While some problems cannot easily be expressed as decision problems, they nonetheless encompass a broad range of computational problems. Other types of problems that certain complexity classes are defined in terms of include function problems (e.g. FP), counting problems (e.g. #P), optimization problems, and promise problems (see section "Other types of problems").

Computational models
To make concrete the notion of a "computer", in theoretical computer science problems are analyzed in the context of a computational model. This is also directly relevant to making exact notions of computational resources like "time" and "memory". In computational complexity theory, complexity classes deal with the inherent resource requirements of problems and not the resource requirements that depend upon how a physical computer is implemented. For example, in the real world different computers may require different amounts of time and memory to solve the same problem because of the way that they have been engineered. Computational models provide a consistent representation of computation that enables the study of the inherent difficulty of problems.

The most commonly used computational model is the Turing machine. While other models exist and many complexity classes are defined in terms of them (see section "Other models of computation"), the Turing machine is used to define most basic complexity classes. With the Turing machine, instead of using standard units of time like the second (which make it impossible to disentangle running time from the speed of physical hardware) and standard units of memory like bytes, the notion of time is abstracted as the number of elementary steps that a Turing machine takes to solve a problem and the notion of memory is abstracted as the number of cells that are used on the machine's tape. These are explained in greater detail below.

It is also possible to use the Blum axioms to define complexity classes without referring to a concrete computational model, but this approach is less frequently used in complexity theory.

Deterministic Turing machines
A Turing machine is a mathematical model of a general computing machine. It is the most commonly used model in complexity theory, owing in large part to the fact that it is believed to be as powerful as any other model of computation and is easy to analyze mathematically. Importantly, it is believed that if a problem can be solved by an algorithm then there exists a Turing machine that solves the problem (this is known as the Church–Turing thesis).

Mechanically, a Turing machine (TM) manipulates symbols (generally restricted to the bits 0 and 1 to provide an intuitive connection to real-life computers) contained on an infinitely long strip of tape. The TM can read and write, one at a time, using a tape head. Operation is fully determined by a finite set of elementary instructions such as "in state 42, if the symbol seen is 0, write a 1; if the symbol seen is 1, change into state 17; in state 17, if the symbol seen is 0, write a 1 and change to state 6". The Turing machine starts with only the input string on its tape and blanks everywhere else. The TM accepts the input if it enters a designated accept state and rejects the input if it enters a reject state. The deterministic Turing machine (DTM) is the most basic type of Turing machine. It uses a fixed set of rules to determine its future actions (which is why it is called "deterministic").

A computational problem can then be defined in terms of a Turing machine as the set of input strings that a particular Turing machine accepts. For example, the primality problem $$PRIME$$ from above is the set of strings (representing natural numbers) that a Turing machine running an algorithm that correctly tests for primality accepts. A Turing machine is said to recognize a language (recall that "problem" and "language" are largely synonymous in computability theory) if it accepts all inputs that are in the language and is said to decide a language if it additionally rejects all inputs that are not in the language (certain inputs may cause a Turing machine to run forever, so decidability places the additional constraint over recognizability that the Turing machine halts on all inputs). A Turing machine that "solves" a problem is generally meant to mean one that decides the language.

Turing machines enable intuitive notions of "time" and "space". The time complexity of a TM on a particular input is the number of elementary steps that the Turing machine takes to reach either an accept or reject state. The space complexity is the number of cells on its tape that it uses to reach either an accept or reject state.

Nondeterministic Turing machines


A variant of the deterministic Turing machine (DTM) is the nondeterministic Turing machine (NTM). Intuitively, an NTM is just a regular Turing machine that has the added capability of being able to explore multiple possible future actions from a given state, and "choosing" a branch that accepts (if any accept). That is, while a DTM must follow only one branch of computation, an NTM can be imagined as a computation tree, branching into many possible computational pathways at each step (see image). If at least one branch of the tree halts with an "accept" condition, then the NTM accepts the input. In this way, an NTM can be thought of as simultaneously exploring all computational possibilities in parallel and selecting an ideal branch. NTMs are not meant to be physically realizable models, they are simply theoretically interesting abstract machines that give rise to a number of interesting complexity classes (which often do have physically realizable equivalent definitions).

DTMs can be viewed as a special case of NTMs that do not make use of the power of nondeterminism. Hence, every computation that can be carried out by a DTM can also be carried out by an equivalent NTM.

The time complexity of an NTM is the maximum number of steps that the NTM uses on any branch of its computation. Similarly, the space complexity of an NTM is the maximum number of cells that the NTM uses on any branch of its computation.

Resource bounds
Complexity classes group computational problems by their resource requirements. To do this, computational problems are differentiated by upper bounds on the maximum amount of resources the most efficient algorithm takes to solve them. More particularly, complexity classes are concerned with the rate of growth in resource requirements to solve a computational problem as the input size increases. For example, the amount of time it takes to solve problems in the complexity class P grows relatively slowly as the input size increases, while it grows comparatively quickly for problems in the complexity class EXPTIME (or more accurately, for problems in EXPTIME that are outside of P, since P$$\subseteq$$EXPTIME). This process is formalized using big O notation.

Note that the study of complexity classes is intended primarily to understand the inherent complexity required to solve computational problems. Complexity theorists are thus generally concerned with finding the smallest complexity class that a problem falls into and are therefore concerned with identifying which class a computational problem falls into using the most efficient algorithm. There may be an algorithm, for instance, that solves a particular problem in exponential time, but if the most efficient algorithm for solving the problem runs in polynomial time then the inherent time complexity of that problem is better described as polynomial.

Time bounds
The time complexity of an algorithm with respect to the Turing machine model is the number of steps it takes for a Turing machine to run an algorithm on a given input size. Formally, the time complexity for an algorithm implemented with a Turing machine $$M$$ is defined as the function $$t_M: \mathbb{N} \to \mathbb{N}$$, where $$t_M(n)$$ is the maximum number of steps that $$M$$ takes on any input of length $$n$$. For example, say the inputs to $$M$$ are binary numbers. Then there are, for instance, four inputs of size two: 00, 01, 10, and 11. Say that running $$M$$ on 00 takes ten steps, on 01 takes twelve steps, on 10 takes eight steps, and on 11 takes fifteen steps. The runtime $$t_M(2)$$ is the maximum of these four running times: $$t_M(2)=15$$.

However, complexity classes are concerned less with particular runtime values and more with the general class of function that the time complexity function falls into. For instance, is the time complexity a polynomial? A logarithmic function? An exponential function? Or another kind of function? Since exact time complexity functions are often complicated expressions, they are simplified using big O notation. This leads to the most basic sets of time complexity classes: DTIME and NTIME. These are defined as follows:
 * The time complexity class $$\mathsf{DTIME}(t(n))$$ is the collection of all problems that are decidable by an $$O(t(n))$$ time deterministic Turing machine.
 * The time complexity class $$\mathsf{NTIME}(t(n))$$ is the collection of all problems that are decidable by an $$O(t(n))$$ time nondeterministic Turing machine.

For example, if a problem $$X$$ can be solved by an algorithm running in time $$t(n)=8n^3+2n+4$$ then it is in DTIME$$(n^3)$$ since $$t(n) = 8n^3+2n+4 = O(n^3)$$. Notice that under big O notation it is also the case that $$8n^3+2n+4 = O(n^4)$$, $$8n^3+2n+4 = O(n^5)$$, and so on. This means that DTIME classes are generally not mutually exclusive but rather form a hierarchy: DTIME$$(n^3) \subseteq$$DTIME$$(n^4) \subseteq$$DTIME$$(n^5) \subseteq \cdots$$. This hierarchical nature appears frequently among complexity classes.

Space bounds
The space complexity of an algorithm with respect to the Turing machine model is the number of cells on the Turing machine's tape that are required to run an algorithm on a given input size. Formally, the space complexity of an algorithm implemented with a Turing machine $$M$$ is defined as the function $$s_M: \mathbb{N} \to \mathbb{N}$$, where $$s_M(n)$$ is the maximum number of cells that $$M$$ uses on any input of length $$n$$.

The most basic space complexity classes are defined as follows:
 * The space complexity class $$\mathsf{DSPACE}(s(n))$$ is the collection of all problems that are decidable by an $$O(s(n))$$ space deterministic Turing machine.
 * The space complexity class $$\mathsf{NSPACE}(s(n))$$ is the collection of all problems that are decidable by an $$O(s(n))$$ space nondeterministic Turing machine.

Basic complexity classes
ALL is the class of all decision problems. Many important complexity classes are defined by bounding the time or space used by an algorithm. Several important complexity classes defined in this manner are explained below.

Time complexity classes
Recall that the time complexity class $$\mathsf{DTIME}(t(n))$$ is the collection of all problems that are decidable by an $$O(t(n))$$ time deterministic Turing machine and $$\mathsf{NTIME}(t(n))$$ is the collection of all problems that are decidable by an $$O(t(n))$$ time nondeterministic Turing machine. Time complexity classes are often formally defined in terms of these two classes.

P and NP
P is the class of problems that are solvable by a deterministic Turing machine in polynomial time and NP is the class of problems that are solvable by a nondeterministic Turing machine in polynomial time. Or more formally,
 * $$\mathsf{P} = \bigcup_{k\in\mathbb{N}} \mathsf{DTIME}(n^k) $$
 * $$\mathsf{NP} = \bigcup_{k\in\mathbb{N}} \mathsf{NTIME}(n^k) $$

One way to think of the difference between P and NP is by defining both of them in terms of nondeterministic Turing machines. A problem $$X$$ is in NP by the definition that we've already seen: $$X \in \text{NP}$$ if there exists a polynomial-time nondeterministic Turing machine $$M$$ such that for all $$x \in \Sigma^*$$, $$x \in X$$ if at least one of $$M's$$ computational path accepts, and $$x \notin X$$ if none of $$M's$$ computational paths accepts. A language $$X$$ is in P, by contrast, if for all $$x \in X$$ all of $$M's$$ computational paths accept.

P is often said to be the class of problems that can be solved "quickly" or "efficiently" by a deterministic computer, since the time complexity of solving a problem in P increases relatively slowly with the input size.

An important characteristic of the class NP is that it can be equivalently defined as the class of problems whose solutions are verifiable by a deterministic Turing machine in polynomial time. That is, a language is in NP if there exists a deterministic polynomial time Turing machine, referred to as the verifier, that takes as input a string $$w$$ and a polynomial-size certificate string $$c$$, and accepts $$w$$ if $$w$$ is in the language and rejects $$w$$ if $$w$$ is not in the language. Intuitively, the certificate acts as a proof that the input $$w$$ is in the language. Formally:
 * NP is the class of languages $$L$$ for which there exists a polynomial-time deterministic Turing machine $$M$$ and a polynomial $$p$$ such that for all $$w \in \{0,1\}^*$$, $$w$$ is in $$L$$ if and only if there exists some $$c \in \{0,1\}^{p(|w|)}$$ such that $$M(w,c)$$ accepts.

This equivalence between the nondeterministic definition and the verifier definition highlights a fundamental connection between nondeterminism and solution verifiability. Furthermore, it also provides a useful method for proving that a language is in NP—simply identify a suitable certificate and show that it can be verified in polynomial time.

EXPTIME and NEXPTIME
EXPTIME is the class of decision problems solvable by a deterministic Turing machine in exponential time and NEXPTIME is the class of decision problems solvable by a nondeterministic Turing machine in exponential time. Or more formally,
 * $$\mathsf{EXPTIME} = \bigcup_{k\in\mathbb{N}} \mathsf{DTIME}(2^{n^k}) $$
 * $$\mathsf{NEXPTIME} = \bigcup_{k\in\mathbb{N}} \mathsf{NTIME}(2^{n^k}) $$

EXPTIME is a strict superset of P and NEXPTIME is a strict superset of NP. It is further the case that EXPTIME$$\subseteq$$NEXPTIME. It is not known whether this is proper, but if P=NP then EXPTIME must equal NEXPTIME.

Space complexity classes
Recall that the space complexity class $$\mathsf{DSPACE}(s(n))$$ is the collection of all problems that are decidable by an $$O(s(n))$$ space deterministic Turing machine and $$\mathsf{NSPACE}(s(n))$$ is the collection of all problems that are decidable by an $$O(s(n))$$ space nondeterministic Turing machine. Space complexity classes are often formally defined in terms of these two classes.

L and NL
While it is possible to define logarithmic time complexity classes, it turns out that these are extremely narrow classes as sublinear times do not even enable a Turing machine to read the entire input (in this case, because $$\log n < n $$). However, there are a meaningful number of problems that can be solved in logarithmic space. The definitions of these classes require a two-tape Turing machine so that it is possible for the machine to store the entire input (it can be shown that in terms of computability the two-tape Turing machine is equivalent to the single-tape Turing machine). In the two-tape Turing machine model, one tape is the input tape, which is read-only. The other is the work tape, which allows both reading and writing and is the tape on which the Turing machine performs computations. The space complexity of the Turing machine is measured as the number of cells that are used on the work tape.

L is then defined as the class of problems solvable in logarithmic space on a deterministic Turing machine and NL is the class of problems solvable in logarithmic space on a nondeterministic Turing machine. Or more formally,
 * $$\mathsf{L} = \mathsf{DSPACE}(\log n)$$
 * $$\mathsf{NL} = \mathsf{NSPACE}(\log n)$$

It is known that L$$\subseteq$$NL$$\subseteq$$P. However, it is not known whether any of these relationships is proper.

PSPACE and NPSPACE
The complexity classes PSPACE and NPSPACE are the space analogues to  P and  NP. That is, PSPACE is the class of problems solvable in polynomial space by a deterministic Turing machine and NPSPACE is the class of problems solvable in polynomial space by a nondeterministic Turing machine. More formally,
 * $$\mathsf{PSPACE} = \bigcup_{k\in\mathbb{N}} \mathsf{DSPACE}(n^k)$$
 * $$\mathsf{NPSPACE} = \bigcup_{k\in\mathbb{N}} \mathsf{NSPACE}(n^k)$$

While it is not known whether P=NP, Savitch's theorem famously showed that PSPACE=NPSPACE. It is also known that P$$\subseteq$$PSPACE. This follows intuitively from the fact that since writing to a cell on a Turing machine's tape is defined as taking one unit of time, then if that Turing machine operates in polynomial time it can only write to polynomially many cells. It is suspected that this subset is proper, but this has not been proven.

EXPSPACE and NEXPSPACE
The complexity classes EXPSPACE and NEXPSPACE are the space analogues to EXPTIME and NEXPTIME. That is, EXPSPACE is the class of problems solvable in exponetial space by a deterministic Turing machine and NEXPSPACE is the class of problems solvable in exponential space by a nondeterministic Turing machine. Or more formally,
 * $$\mathsf{EXPSPACE} = \bigcup_{k\in\mathbb{N}} \mathsf{DSPACE}(2^{n^k})$$
 * $$\mathsf{NEXPSPACE} = \bigcup_{k\in\mathbb{N}} \mathsf{NSPACE}(2^{n^k})$$

Savitch's theorem establishes that EXPSPACE=NEXPSPACE. This class is extremely broad: it is known to be a strict superset of PSPACE, NP, and P, and is believed to be a strict superset of EXPTIME.

Closure
Complexity classes have a variety of closure properties. For example, decision classes may be closed under negation, disjunction, conjunction, or even under all Boolean operations. Moreover, they might also be closed under a variety of quantification schemes. P, for instance, is closed under all Boolean operations, and under quantification over polynomially sized domains (though likely not closed over exponential sized domains). Closure properties can be helpful in separating classes—one possible route to separating two complexity classes is to find some closure property possessed by one and not by the other.

Each class X that is not closed under negation has a complement class co-X, which consists of the complements of the languages contained in X. Similarly, one can define the Boolean closure of a class, and so on; this is, however, less commonly done.

Closure properties are one of the key reasons many complexity classes are defined in the way that they are. Take, for example, a problem that can be solved in $$t(n)=n$$ time (that is, in linear time) and one that can be solved in, at best, $$t(n)=n^{1000}$$ time. Both of these problems are in P, yet the runtime of the second grows considerably faster than the runtime of the first as the input size increases. One might ask whether it would be better to define the class of "efficiently solvable" problems using some smaller polynomial bound, like $$n^2$$, rather than all polynomials, which allows for such large discrepancies. It turns out, however, that the polynomials are the smallest class of functions containing the linear functions that are closed under addition, multiplication, and composition. This means that the polynomials are the smallest class that enables the composition of "efficient algorithms"; that is, a polynomial-time algorithm that calls a polynomial-time subroutine still yields a polynomial-time algorithm. If the $$n^2$$ bound were utilized, however, then composing a constant number of "efficient" algorithms might result in a new algorithm that is not "efficient". (Note that the definition of P is also useful because, empirically, almost all problems in P that are practically useful do in fact have low order polynomial runtimes, and almost all problems outside of P that are practically useful do not have any known algorithms with small exponential runtimes, i.e. with $$O(c^n)$$ runtimes where $$c$$ is close to 1. )

Hardness and completeness
Many complexity classes are defined using the concept of a reduction. A reduction is a transformation of one problem into another problem. It captures the informal notion of a problem being at least as difficult as another problem. For instance, if a problem $$X$$ can be solved using an algorithm for $$Y$$, $$X$$ is no more difficult than $$Y$$, and we say that $$X$$ reduces to $$Y$$. There are many different types of reductions, based on the method of reduction, such as Cook reductions, Karp reductions and Levin reductions, and the bound on the complexity of reductions, such as polynomial-time reductions or log-space reductions.

The most commonly used reduction is a polynomial-time reduction. This means that the reduction process takes polynomial time. For example, the problem of squaring an integer can be reduced to the problem of multiplying two integers. This means an algorithm for multiplying two integers can be used to square an integer. Indeed, this can be done by giving the same input to both inputs of the multiplication algorithm. Thus we see that squaring is not more difficult than multiplication, since squaring can be reduced to multiplication.

This motivates the concept of a problem being hard for a complexity class. A problem $$X$$ is hard for a class of problems C if every problem in C can be reduced to $$X$$. Thus no problem in C is harder than $$X$$, since an algorithm for $$X$$ allows us to solve any problem in C. Of course, the notion of hard problems depends on the type of reduction being used. For complexity classes larger than P, polynomial-time reductions are commonly used. In particular, the set of problems that are hard for NP is the set of NP-hard problems.

If a problem $$X$$ is in C and is hard for C, then $$X$$ is said to be complete for C. This means that $$X$$ is the hardest problem in C (since there could be many problems that are equally hard, one might say that $$X$$ is as hard as the hardest problems in C). Thus the class of NP-complete problems contains the most difficult problems in NP, in the sense that they are the ones most likely not to be in P. Because the problem P = NP is not solved, being able to reduce a known NP-complete problem, Π2, to another problem, Π1, would indicate that there is no known polynomial-time solution for Π1. This is because a polynomial-time solution to Π1 would yield a polynomial-time solution to Π2. Similarly, because all NP problems can be reduced to the set, finding an NP-complete problem that can be solved in polynomial time would mean that P = NP.

Savitch's theorem
Savitch's theorem establishes that PSPACE = NPSPACE and EXPSPACE = NEXPSPACE. One central question of complexity theory is whether nondeterminism adds significant power to a computational model. This is central to the open P versus NP problem in the context of time. Savitch's theorem shows that for space, nondeterminism does not add significantly more power, where "significant" means the difference between polynomial and superpolynomial resource requirements (or, for EXPSPACE, the difference between exponential and superexponential). For example, Savitch's theorem proves that no problem that requires exponential space for a deterministic Turing machine can be solved by a nondeterministic polynomial space Turing machine.

Hierarchy theorems
By definition of DTIME, it follows that DTIME$$(n^{k_1})$$ is contained in DTIME$$(n^{k_2})$$ if $$k_1 \leq k_2 $$, since $$O(n^{k_1}) \subseteq O(n^{k_2})$$ if $$k_1 \leq k_2$$. However, the definition gives no indication of whether this inclusion is strict. For time and space requirements, the conditions under which the inclusion is strict are given by the time and space hierarchy theorems, respectively. They are called hierarchy theorems because they induce a proper hierarchy on the classes defined by constraining the respective resources. The hierarchy theorems enable one to make quantitative statements about how much more additional time or space is needed in order to increase the number of problems that can be solved.

The time hierarchy theorem states that
 * $$\mathsf{DTIME}\big(f(n) \big) \subsetneq \mathsf{DTIME} \big(f(n) \sdot \log^{2}(f(n)) \big)$$.

The space hierarchy theorem states that
 * $$\mathsf{DSPACE}\big(f(n)\big) \subsetneq \mathsf{DSPACE} \big(f(n) \sdot \log(f(n)) \big)$$.

The time and space hierarchy theorems form the basis for most separation results of complexity classes. For instance, the time hierarchy theorem establishes that P is strictly contained in EXPTIME, and the space hierarchy theorem establishes that L is strictly contained in PSPACE.

Other models of computation
While deterministic and non-deterministic Turing machines are the most commonly used models of computation, many complexity classes are defined in terms of other computational models. In particular,
 * A number of classes are defined using probabilistic Turing machines, including the classes BPP, PP, RP, and ZPP
 * A number of classes are defined using interactive proof systems, including the classes IP, MA, and AM
 * A number of classes are defined using Boolean circuits, including the classes P/poly and its subclasses NC and AC
 * A number of classes are defined using quantum Turing machines, including the classes BQP and QMA

These are explained in greater detail below.

Randomized computation
A number of important complexity classes are defined using the probabilistic Turing machine, a variant of the Turing machine that can toss random coins. These classes help to better describe the complexity of randomized algorithms.

A probabilistic Turing machine is similar to a deterministic Turing machine, except rather than following a single transition function (a set of rules for how to proceed at each step of the computation) it probabilistically selects between transition functions at each step. The standard definition of a probabilistic Turing machine specifies two transition functions, so that the selection of transition function at each step resembles a coin flip. The randomness introduced at each step of the computation introduces the potential for error; that is, strings that the Turing machine is meant to accept may on some occasions be rejected and strings that the Turing machine is meant to reject may on some occasions be accepted. As a result, the complexity classes based on the probabilistic Turing machine are defined in large part around the amount of error that is allowed. In particular, they are defined using an error probability $$\epsilon$$. A probabilistic Turing machine $$M$$ is said to recognize a language $$L$$ with error probability $$\epsilon$$ if:
 * 1) a string $$w$$ in $$L$$ implies that $$\text{Pr}[M \text{ accepts } w] \geq 1 - \epsilon$$
 * 2) a string $$w$$ not in $$L$$ implies that $$\text{Pr}[M \text{ rejects } w] \geq 1 - \epsilon$$

Important complexity classes


The fundamental randomized time complexity classes are ZPP, RP, co-RP, BPP, and PP.

The strictest class is ZPP (zero-error probabilistic polynomial time), the class of problems solvable in polynomial time by a probabilistic Turing machine with error probability 0. Intuitively, this is the strictest class of probabilistic problems because it demands no error whatsoever.

A slightly looser class is RP (randomized polynomial time), which maintains no error for strings not in the language but allows bounded error for strings in the language. More formally, a language is in RP if there is a probabilistic polynomial-time Turing machine $$M$$ such that if a string is not in the language then $$M$$ always rejects and if a string is in the language then $$M$$ accepts with a probability at least 1/2. The class co-RP is similarly defined except the roles are flipped: error is not allowed for strings in the language but is allowed for strings not in the language. Taken together, the classes RP and co-RP encompass all of the problems that can be solved by probabilistic Turing machines with one-sided error.

Loosening the error requirements further to allow for two-sided error yields the class BPP (bounded-error probabilistic polynomial time), the class of problems solvable in polynomial time by a probabilistic Turing machine with error probability less than 1/3 (for both strings in the language and not in the language). BPP is the most practically relevant of the probabilistic complexity classes—problems in BPP have efficient randomized algorithms that can be run quickly on real computers. BPP is also at the center of the important unsolved problem in computer science over whether P=BPP, which if true would mean that randomness does not increase the computational power of computers, i.e. any probabilistic Turing machine could be simulated by a deterministic Turing machine with at most polynomial slowdown.

The broadest class of efficiently-solvable probabilistic problems is PP (probabilistic polynomial time), the set of languages solvable by a probabilistic Turing machine in polynomial time with an error probability of less than 1/2 for all strings.

ZPP, RP and co-RP are all subsets of BPP, which in turn is a subset of PP. The reason for this is intuitive: the classes allowing zero error and only one-sided error are all contained within the class that allows two-sided error. ZPP relates to RP and co-RP in the following way: ZPP$$=$$RP$$\cap$$co-RP. That is, ZPP consists exactly of those problems that are in both RP and co-RP. Intuitively, this follows from the fact that RP and co-RP allow only one-sided error: co-RP does not allow error for strings in the language and RP does not allow error for strings not in the language. Hence, if a problem is in both RP and co-RP, then there must be no error for strings both in and not in the language (i.e. no error whatsoever), which is exactly the definition of ZPP. BPP is contained in PP since PP merely relaxes the error probability of BPP.

Important randomized space complexity classes include BPL, RL, and RLP.

Interactive proof systems
A number of complexity classes are defined using interactive proof systems. Interactive proofs generalize the proofs definition of the complexity class NP and yield insights into cryptography, approximation algorithms, and formal verification.

Interactive proof systems are abstract machines that model computation as the exchange of messages between two parties: a prover $$P$$ and a verifier $$V$$. The parties interact by exchanging messages, and an input string is accepted by the system if the verifier decides to accept the input on the basis of the messages it has received from the prover. The prover $$P$$ has unlimited computational power while the verifier has bounded computational power (the standard definition of interactive proof systems defines the verifier to be polynomially-time bounded). The prover, however, is untrustworthy (this prevents all languages from being trivially recognized by the proof system by having the computationally unbounded prover solve for whether a string is in a language and then sending a trustworthy "YES" or "NO" to the verifier), so the verifier must conduct an "interrogation" of the prover by "asking it" successive rounds of questions, accepting only if it develops a high degree of confidence that the string is in the language.

Important complexity classes
The class NP is a simple proof system in which the verifier is restricted to being a deterministic polynomial-time Turing machine and the procedure is restricted to one round (that is, the prover sends only a single, full proof—typically referred to as the certificate—to the verifier). Put another way, in the definition of the class NP (the set of decision problems for which the problem instances, when the answer is "YES", have proofs verifiable in polynomial time by a deterministic Turing machine) is a proof system in which the proof is constructed by an unmentioned prover and the deterministic Turing machine is the verifier. For this reason, NP can also be called dIP (deterministic interactive proof), though it is rarely referred to as such.

It turns out that NP captures the full power of interactive proof systems with deterministic (polynomial-time) verifiers because it can be shown that for any proof system with a deterministic verifier it is never necessary to need more than a single round of messaging between the prover and the verifier. Interactive proof systems that provide greater computational power over standard complexity classes thus require probabilistic verifiers, which means that the verifier's questions to the prover are computed using probabilistic algorithms. As noted in the section above on randomized computation, probabilistic algorithms introduce error into the system, so complexity classes based on probabilistic proof systems are defined in terms of an error probability $$\epsilon$$.

The most general complexity class arising out of this characterization is the class IP (interactive polynomial time), which is the class of all problems solvable by an interactive proof system $$(P,V)$$, where $$V$$ is probabilistic polynomial-time and the proof system satisfies two properties: for a language $$L \in $$ IP
 * 1) (Completeness) a string $$w$$ in $$L$$ implies $$\Pr[V \text{ accepts }w \text{ after interacting with } P] \ge \tfrac{2}{3}$$
 * 2) (Soundness) a string $$w$$ not in $$L$$ implies $$\Pr[V \text{ accepts }w \text{ after interacting with } P] \le \tfrac{1}{3}$$

An important feature of IP is that it equals PSPACE. In other words, any problem that can be solved by an interactive polynomial-time proof system can also be solved by a deterministic Turing machine with polynomial space resources, and vice versa.

A modification of the protocol for IP produces another important complexity class: AM (Arthur–Merlin protocol). In the definition of interactive proof systems used by IP, the prover was not able to see the coins utilized by the verifier in its probabilistic computation—it was only able to see the messages that the verifier produced with these coins. For this reason, the coins are called private random coins. The interactive proof system can be constrained so that the coins used by the verifier are public random coins; that is, the prover is able to see the coins. Formally, AM is defined as the class of languages with an interactive proof in which the verifier sends a random string to the prover, the prover responds with a message, and the verifier either accepts or rejects by applying a deterministic polynomial-time function to the message from the prover. AM can be generalized to AM[k], where k is the number of messages exchanged (so in the generalized form the standard AM defined above is AM[2]). However, it is the case that for all k$$\geq$$2, AM[k]=AM[2]. It is also the case that AM[k]$$\subseteq$$IP[k].

Other complexity classes defined using interactive proof systems include MIP (mutliprover interactive polynomial time) and QIP (quantum interactive polynomial time).

Boolean circuits
An alternative model of computation to the Turing machine is the Boolean circuit, a simplified model of the digital circuits used in modern computers. Not only does this model provide an intuitive connection between computation in theory and computation in practice, but it is also a natural model for non-uniform computation (that is, computation in which different input sizes within the same problem use different algorithms).

Formally, a Boolean circuit $$C$$ is a directed acyclic graph in which edges represent wires (which carry the bit values 0 and 1), the input bits are represented by source vertices (vertices with no incoming edges), and all non-source vertices represent logic gates (generally the AND, OR, and NOT gates). One logic gate is designated the output gate, and represents the end of the computation. The input/output behavior of a circuit $$C$$ with $$n$$ input variables is represented by the Boolean function $$f_C:\{0,1\}^n \to \{0,1\}$$; for example, on input bits $$x_1,x_2,...,x_n$$, the output bit $$b$$ of the circuit is represented mathematically as $$b = f_C(x_1,x_2,...,x_n)$$. The circuit $$C$$ is said to compute the Boolean function $$f_C$$.

Any particular circuit has a fixed number of input vertices, so it can only act on inputs of that size. Languages (the formal representations of decision problems), however, contain strings of differing lengths, so languages cannot be fully captured by a single circuit (in contrast to the Turing machine model, in which a language is fully described by a single Turing machine that can act on any input size). A language is instead represented by a circuit family. A circuit family is an infinite list of circuits $$(C_0,C_1,C_2,...)$$, where $$C_n$$ has $$n$$ input variables. A circuit family is said to decide a language $$L$$ if, for every string $$w$$, $$w$$ is in the language $$L$$ if and only if $$C_n(w)=1$$, where $$n$$ is the length of $$w$$. In other words, a string $$w$$ of size $$n$$ is in the language represented by the circuit family $$(C_0,C_1,C_2,...)$$ if the circuit $$C_n$$ (the only circuit in the family with $$n$$ input vertices) evaluates to 1 when $$w$$ is its input.

While complexity classes defined using Turing machines are described in terms of time complexity, circuit complexity classes are defined in terms of circuit size — the number of vertices in the circuit. The size complexity of a circuit family $$(C_0,C_1,C_2,...)$$ is the function $$f:\mathbb{N} \to \mathbb{N}$$, where $$f(n)$$ is the circuit size of $$C_n$$. The familiar function classes follow naturally from this; for example, a polynomial-size circuit family is one such that the function $$f$$ is a polynomial.

Important complexity classes
The complexity class P/poly is the set of languages that are decidable by polynomial-size circuit families. It turns out that there is a natural connection between circuit complexity and time complexity. Intuitively, a language with small time complexity (that is, requires relatively few sequential operations on a Turing machine), also has a small circuit complexity (that is, requires relatively few Boolean operations). Formally, it can be shown that if a language is in $$\mathsf{DTIME}(t(n))$$, where $$t$$ is a function $$t:\mathbb{N} \to \mathbb{N}$$, then it has circuit complexity $$O(t^2(n))$$. It follows directly from this fact that P$$\subseteq$$P/poly. In other words, any problem that can be solved in polynomial time by a deterministic Turing machine can also be solved by a polynomial-size circuit family. It is further the case that the inclusion is proper (P$$\subset$$P/poly) because there are undecidable problems that are in P/poly.

P/poly turns out to have a number of properties that make it highly useful in the study of the relationships between complexity classes. In particular, it is helpful in investigating problems related to P versus NP. For example, if there is any language in NP that is not in P/poly, then P$$\neq$$NP. P/poly is also helpful in investigating properties of the polynomial hierarchy. For example, if NP ⊆ P/poly, then PH collapses to $$\Sigma_2^{\mathsf P}$$. A full description of the relations between P/poly and other complexity classes is available at "Importance of P/poly". P/poly is also helpful in the general study of the properties of Turing machines, as the class can be equivalently defined as the class of languages recognized by a polynomial-time Turing machine with a polynomial-bounded advice function.

Two subclasses of P/poly that have interesting properties in their own right are NC and AC. These classes are defined not only in terms of their circuit size but also in terms of their depth. The depth of a circuit is the length of the longest directed path from an input node to the output node. The class NC is the set of languages that can be solved by circuit families that are restricted not only to having polynomial-size but also to having polylogarithmic depth. The class AC is defined similarly to NC, however gates are allowed to have unbounded fan-in (that is, the AND and OR gates can be applied to more than two bits). NC is a notable class because it can be equivalently defined as the class of languages that have efficient parallel algorithms.

Important complexity classes
Two important quantum complexity classes are BQP and QMA.

Other types of problems
While most complexity classes are sets of decision problems, there are also a number of complexity classes defined in terms of other types of problems. In particular, there are complexity classes consisting of counting problems, consisting of function problems, and consisting of promise problems. These are explained in greater detail below.

Counting problems
A counting problem asks not only whether a solution exists (as with a decision problem), but asks how many solutions exist. For example, the decision problem $$CYCLE$$ asks whether a particular graph $$G$$ has a simple cycle (the answer is a simple yes/no); the corresponding counting problem $$\#CYCLE$$ (pronounced "sharp cycle") asks how many simple cycles $$G$$ has. The output to a counting problem is thus a number, in contrast to the output for a decision problem, which is a simple yes/no (or accept/reject, 0/1, or other equivalent scheme). And whereas decision problems are represented mathematically as formal languages, counting problems are represented mathematically as functions: a counting problem is formalized as the function $$f:\{0,1\}^* \to \mathbb{N}$$ such that for an input $$w \in \{0,1\}^*$$, $$f(w)$$ is the number of solutions. For example, in the $$CYCLE$$ problem, the input is a graph $$G$$ and $$f(G)$$ is the number of simple cycles in $$G$$ (note that graphs can be represented as binary strings, so $$G \in \{0,1\}^*$$). Counting problems arise in a number of fields, including statistical estimation, statistical physics, network design, and economics.

Important complexity classes
#P (pronounced "sharp P") is an important complexity class of counting problems that can be thought of as the counting version of NP. The connection to NP arises from the fact that the number of solutions to a problem equals the number of accepting branches in a nondeterministic Turing machine's computation tree. #P is thus formally defined as follows:
 * #P is the set of all functions $$f:\{0,1\}^* \to \mathbb{N}$$ such that there is a polynomial time nondeterministic Turing machine $$M$$ such that for all $$w \in \{0,1\}^*$$, $$f(w)$$ equals the number of accepting branches in $$M$$'s computation graph on $$w$$.

And just as NP can be defined both in terms of nondeterminism and in terms of a verifier (i.e. as an interactive proof system), so too can #P be equivalently defined in terms of a verifer. Recall that a decision problem is in NP if there exists a polynomial-time checkable certificate to a given problem instance—that is, NP asks whether there exists a proof of membership for the input that can be checked for correctness in polynomial time. The class #P asks how many such certificates exist. In this context, #P is defined as follows:
 * #P is the set of functions $$f: \{0,1\}^* \to \mathbb{N}$$ such that there exists a polynomial $$p: \mathbb{N} \to \mathbb{N}$$ and a polynomial-time Turing machine $$V$$, called the verifier, such that for every $$w \in \{0,1\}^*$$, $$f(w)=\Big| \big\{c \in \{0,1\}^{p(|w|)} : V(w,c)=1 \big\}\Big| $$. In other words, $$f(w)$$ equals the size of the set containing all of the polynomial-size certificates.

Function problems
Counting problems are a subset of a broader class of problems called function problems. A function problem is a computational problem where a single output (of a total function) is expected for every input, but the output is more complex than that of a decision problem. For function problems, the output is not simply 'yes' or 'no'. The complexity class FP is the set of function problems that can be solved by a deterministic Turing machine in polynomial time.

Promise problems
Promise problems are a generalization of decision problems in which the input to a problem is guaranteed ("promised") to be from a particular subset of all possible inputs. Recall that with a decision problem $$L \subseteq \{0,1\}^*$$, an algorithm $$M$$ for $$L$$ must act (correctly) on every $$w \in \{0,1\}^*$$. A promise problem loosens the input requirement on $$M$$ by restricting the input to some subset of $$\{0,1\}^*$$.

Specifically, a promise problem is defined as a pair of non-intersecting sets $$(\Pi_{\text{ACCEPT}},\Pi_{\text{REJECT}})$$, where:
 * $$\Pi_{\text{ACCEPT}} \subseteq \{0,1\}^*$$ is the set of all inputs that are accepted.
 * $$\Pi_{\text{REJECT}} \subseteq \{0,1\}^*$$ is the set of all inputs that are rejected.

The input to an algorithm $$M$$ for a promise problem $$(\Pi_{\text{ACCEPT}},\Pi_{\text{REJECT}})$$ is thus $$\Pi_{\text{ACCEPT}} \cup \Pi_{\text{REJECT}}$$, which is called the promise. Strings in $$\Pi_{\text{ACCEPT}} \cup \Pi_{\text{REJECT}}$$ are said to satisfy the promise. By definition, $$\Pi_{\text{ACCEPT}}$$ and $$\Pi_{\text{REJECT}}$$ must be disjoint, i.e. $$\Pi_{\text{ACCEPT}} \cap \Pi_{\text{REJECT}} = \emptyset$$.

Within this formulation, it can be seen that decision problems are just the subset of promise problems with the trivial promise $$\Pi_{\text{ACCEPT}} \cup \Pi_{\text{REJECT}} = \{0,1\}^*$$. With decision problems it is thus simpler to simply define the problem as only $$\Pi_{\text{ACCEPT}}$$ (with $$\Pi_{\text{REJECT}}$$ implicitly being $$\{0,1\}^* / \Pi_{\text{ACCEPT}}$$), which throughout this page is denoted $$L$$ to emphasize that $$\Pi_{\text{ACCEPT}}=L$$ is a formal language.

Promise problems make for a more natural formulation of many computational problems (though they are generally more technically cumbersome). For instance, a computational problem could be something like "given a planar graph, determine whether or not..." This is often stated as a decision problem, with it assumed that there is some translation schema that takes every string $$s \in \{0,1\}^*$$ to a planar graph. However, it is more straightforward to define this as a promise problem in which the input is promised to be a planar graph.

Relation to complexity classes
Promise problems provide an alternate definition for standard complexity classes of decision problems. P, for instance, can be defined as a promise problem:
 * P is the class of promise problems that are solvable in deterministic polynomial time. That is, the promise problem $$(\Pi_{\text{ACCEPT}},\Pi_{\text{REJECT}})$$ is in P if there exists a polynomial-time algorithm $$M$$ such that:
 * For every $$x \in \Pi_{\text{ACCEPT}}, M(x)=1$$
 * For ever $$x \in \Pi_{\text{REJECT}}, M(x)=0$$

Classes of decision problems—that is, classes of problems defined as formal languages—thus translate naturally to promise problems, where a language $$L$$ in the class is simply $$L= \Pi_{\text{ACCEPT}}$$ and $$\Pi_{\text{REJECT}}$$ is implicitly $$\{0,1\}^* / \Pi_{\text{ACCEPT}}$$.

Formulating many basic complexity classes like P as promise problems provides little additional insight into their nature. However, promise problems have been useful to computer scientists studying certain classes. They have, for instance, played a key role in the study of SZK (statistical zero-knowledge).

Algebraic computation
Turing machines work on bits, which are equivalent to the integers $$\mathbb{N}$$. However, there are many algorithms which are most naturally described as operating over the real numbers $$\mathbb{R}$$ or the complex numbers $$\mathbb{C}$$. To express this theoretically, a computational model that is built around uncountable sets is needed. This is accomplished through a number of algebraic computational models that work over a field or ring) $\mathbb{F}$, the three most common being [[Straight-line program|algebraic straight-line programs, algebraic circuits, and algebraic computation trees.

Algebraic circuits are similar in nature to Boolean circuits, except they operate over a field $$\mathbb{F}$$ rather than bits $$\{0,1\}$$ and the gates of the circuit correspond to the arithmetic operations $$\{+,\times \} $$ rather than boolean operations.

An algebraic straight-line program over a field or ring $$\mathbb{F}$$ of length $$T$$ with input variables $$x_1,...,x_n \in \mathbb{F}$$ and built-in constants $$c_1,...,c_n \in \mathbb{F}$$ is a sequence of $$T$$ statements of the form $$y_i=z_1 OP z_{i_2}$$ for $$i=1,2,...T$$

Important complexity classes
The algebraic equivalents to P and NP are AlgP/Poly and AlgNP/Poly (these are sometimes also called VP and VPN).

Summary of relationships between complexity classes
The following table shows some of the classes of problems that are considered in complexity theory. If class X is a strict subset of Y, then X is shown below Y, with a dark line connecting them. If X is a subset, but it is unknown whether they are equal sets, then the line is lighter and is dotted. Technically, the breakdown into decidable and undecidable pertains more to the study of computability theory, but is useful for putting the complexity classes in perspective.