Constructive analysis

In mathematics, constructive analysis is mathematical analysis done according to some principles of constructive mathematics.

Introduction
The name of the subject contrasts with classical analysis, which in this context means analysis done according to the more common principles of classical mathematics. However, there are various schools of thought and many different formalizations of constructive analysis. Whether classical or constructive in some fashion, any such framework of analysis axiomatizes the real number line by some means, a collection extending the rationals and with an apartness relation definable from an asymmetric order structure. Center stage takes a positivity predicate, here denoted $$x > 0$$, which governs an equality-to-zero $$x\cong 0$$. The members of the collection are generally just called the real numbers. While this term is thus overloaded in the subject, all the frameworks share a broad common core of results that are also theorems of classical analysis.

Constructive frameworks for its formulation are extensions of Heyting arithmetic by types including $${\mathbb N}^{\mathbb N}$$, constructive second-order arithmetic, or strong enough topos-, type- or constructive set theories such as $${\mathsf{CZF}}$$, a constructive counter-part of $${\mathsf{ZF}}$$. Of course, a direct axiomatization may be studied as well.

Logical preliminaries
The base logic of constructive analysis is intuitionistic logic, which means that the principle of excluded middle $${\mathrm {PEM}}$$ is not automatically assumed for every proposition. If a proposition $$\neg\neg\exists x.\theta(x)$$ is provable, this exactly means that the non-existence claim $$\neg\exists x.\theta(x)$$ being provable would be absurd, and so the latter cannot also be provable in a consistent theory. The double-negated existence claim is a logically negative statement and implied by, but generally not equivalent to the existence claim itself. Much of the intricacies of constructive analysis can be framed in terms of the weakness of propositions of the logically negative form $$\neg\neg\phi$$, which is generally weaker than $$\phi$$. In turn, also an implication $$\big(\exists x.\theta(x)\big)\to \neg\forall x.\neg\theta(x)$$ can generally be not reversed.

While a constructive theory proves fewer theorems than its classical counter-part in its classical presentation, it may exhibit attractive meta-logical properties. For example, if a theory $${\mathsf {T}}$$ exhibits the disjunction property, then if it proves a disjunction $${\mathsf {T}}\vdash \phi\lor \psi$$ then also $${\mathsf {T}}\vdash \phi$$ or $${\mathsf {T}}\vdash \psi$$. Already in classical arithmetic, this is violated for the most basic propositions about sequences of numbers - as demonstrated next.

Undecidable predicates
A common strategy of formalization of real numbers is in terms of sequences or rationals, $${\mathbb Q}^{\mathbb N}$$ and so we draw motivation and examples in terms of those. So to define terms, consider a decidable predicate on the naturals, which in the constructive vernacular means $$\forall n. \big( Q(n)\lor\neg Q(n) \big)$$ is provable, and let $$\chi_Q\colon{\mathbb N}\to\{0, 1\}$$ be the characteristic function defined to equal $$0$$ exactly where $$Q$$ is true. The associated sequence $$q_n\,:=\,{\textstyle\sum}_{k=0}^n \chi_Q(n) / 2^{n+1}$$ is monotone, with values non-strictly growing between the bounds $$0$$ and $$1$$. Here, for the sake of demonstration, defining an extensional equality to the zero sequence $$(q\cong_\mathrm{ext} 0)\,:=\,\forall n. q_n=0$$, it follows that $$q\cong_\mathrm{ext} 0 \leftrightarrow\forall n. Q(n)$$. Note that the symbol "$$0$$" is used in several contexts here. For any theory capturing arithmetic, there are many yet undecided and even provenly independent such statements $$\forall n. Q(n)$$. Two $$\Pi_1^0$$-examples are the Goldbach conjecture and the Rosser sentence of a theory.

Consider any theory $${\mathsf{T}}$$ with quantifiers ranging over primitive recursive, rational-valued sequences. Already minimal logic proves the non-contradiction claim for any proposition, and that the negation of excluded middle for any given proposition would be absurd. This also means there is no consistent theory (even if anti-classical) rejecting the excluded middle disjunction for any given proposition. Indeed, it holds that
 * $${\mathsf{T}}\,\,\,\vdash\,\,\,\forall(x\in{\mathbb Q}^{\mathbb N}).\,\neg\neg\big((x\cong_\mathrm{ext} 0)\lor\neg(x\cong_\mathrm{ext} 0)\big)$$

This theorem is logically equivalent to the non-existence claim of a sequence for which the excluded middle disjunction about equality-to-zero would be disprovable. No sequence with that disjunction being rejected can be exhibited. Assume the theories at hand are consistent and arithmetically sound. Now Gödel's theorems mean that there is an explicit sequence $$g\in{\mathbb Q}^{\mathbb N}$$ such that, for any fixed precision, $${\mathsf{T}}$$ proves the zero-sequence to be a good approximation to $$g$$, but it can also meta-logically be established that $${\mathsf{T}}\,\nvdash\,(g\cong_\mathrm{ext} 0)$$ as well as $${\mathsf{T}}\,\nvdash\,\neg(g\cong_\mathrm{ext} 0)$$. Here this proposition $$g\cong_\mathrm{ext} 0$$ again amounts to the proposition of universally quantified form. Trivially
 * $${\mathsf{T}}+{\mathrm{PEM}}\,\,\,\vdash\,\,\,\forall(x\in{\mathbb Q}^{\mathbb N}).\,(x\cong_\mathrm{ext} 0)\lor\neg(x\cong_\mathrm{ext} 0)$$

even if these disjunction claims here do not carry any information. In the absence of further axioms breaking the meta-logical properties, constructive entailment instead generally reflects provability. Taboo statements that ought not be decidable (if the aim is to respect the provability interpretation of constructive claims) can be designed for definitions of a custom equivalence "$$\cong$$" in formalizations below as well. For implications of disjunctions of yet not proven or disproven propositions, one speaks of weak Brouwerian counterexamples.

Order vs. disjunctions
The theory of the real closed field may be axiomatized such that all the non-logical axioms are in accordance with constructive principles. This concerns a commutative ring with postulates for a positivity predicate $$x>0$$, with a positive unit and non-positive zero, i.e., $$1>0$$ and $$\neg(0>0)$$. In any such ring, one may define $$y > x\,:=\,(y - x > 0)$$, which constitutes a strict total order in its constructive formulation (also called linear order or, to be explicit about the context, a pseudo-order). As is usual, $$x < 0$$ is defined as $$0 > x$$.

This first-order theory is relevant as the structures discussed below are model thereof. However, this section thus does not concern aspects akin to topology and relevant arithmetic substructures are not definable therein.

As explained, various predicates will fail to be decidable in a constructive formulation, such as these formed from order-theoretical relations. This includes "$$\cong$$", which will be rendered equivalent to a negation. Crucial disjunctions are now discussed explicitly.

Trichotomy
In intuitonistic logic, the disjunctive syllogism in the form $$(\phi\lor\psi)\to(\neg\phi\to\psi)$$ generally really only goes in the $$\to$$-direction. In a pseudo-order, one has
 * $$\neg(x>0 \lor 0>x) \to x\cong 0$$

and indeed at most one of the three can hold at once. But the stronger, logically positive law of trichotomy disjunction does not hold in general, i.e. it is not provable that for all reals,
 * $$(x>0 \lor 0>x) \lor x\cong 0$$

See analytical ${\mathrm {LPO}}$. Other disjunctions are however implied based on other positivity results, e.g. $$(x + y > 0) \to (x>0 \lor y>0)$$. Likewise, the asymmetric order in the theory ought to fulfill the weak linearity property $$(y > x) \to (y > t \lor t > x)$$ for all $$t$$, related to locatedness of the reals.

The theory shall validate further axioms concerning the relation between the positivity predicate $$x > 0$$ and the algebraic operations including multiplicative inversion, as well as the intermediate value theorem for polynomial. In this theory, between any two separated numbers, other numbers exist.

Apartness
In the context of analysis, the auxiliary logically positive predicate
 * $$x\# y\,:=\,(x > y\lor y > x)$$

may be independently defined and constitutes an apartness relation. With it, the substitute of the principles above give tightness
 * $$\neg(x\# 0)\leftrightarrow(x\cong 0)$$

Thus, apartness can also function as a definition of "$$\cong$$", rendering it a negation. All negations are stable in intuitionistic logic, and therefore
 * $$\neg\neg(x\cong y)\leftrightarrow(x\cong y)$$

The elusive trichotomy disjunction itself then reads
 * $$(x\# 0) \lor \neg(x\# 0)$$

Importantly, a proof of the disjunction $$x\# y$$ carries positive information, in both senses of the word. Via $$(\phi\to\neg\psi)\leftrightarrow(\psi\to\neg\phi)$$ it also follows that $$x\# 0\to\neg(x\cong 0)$$. In words: A demonstration that a number is somehow apart from zero is also a demonstration that this number is non-zero. But constructively it does not follow that the doubly negative statement $$\neg(x\cong 0)$$ would imply $$x\# 0$$. Consequently, many classically equivalent statements bifurcate into distinct statement. For example, for a fixed polynomial $$p\in {\mathbb R}[x]$$ and fixed $$k\in {\mathbb N}$$, the statement that the $$k$$'th coefficient $$a_k$$ of $$p$$ is apart from zero is stronger than the mere statement that it is non-zero. A demonstration of former explicates how $$a_k$$ and zero are related, with respect to the ordering predicate on the reals, while a demonstration of the latter shows how negation of such conditions would imply to a contradiction. In turn, there is then also a strong and a looser notion of, e.g., being a third-order polynomial.

So the excluded middle for $$x\# 0$$ is apriori stronger than that for $$x\cong 0$$. However, see the discussion of possible further axiomatic principles regarding the strength of "$$\cong$$" below.

Non-strict partial order
Lastly, the relation $$0\ge x$$ may be defined by or proven equivalent to the logically negative statement $$\neg(x > 0)$$, and then $$x \le 0$$ is defined as $$0 \ge x$$. Decidability of positivity may thus be expressed as $$x > 0\lor 0\ge x$$, which as noted will not be provable in general. But neither will the totality disjunction $$x\ge 0 \lor 0\ge x$$, see also analytical ${\mathrm {LLPO}}$.

By a valid De Morgan's law, the conjunction of such statements is also rendered a negation of apartness, and so
 * $$(x\ge y \land y\ge x)\leftrightarrow (x\cong y)$$

The disjunction $$x > y \lor x\cong y$$ implies $$x\ge y$$, but the other direction is also not provable in general. In a constructive real closed field, the relation "$$\ge$$" is a negation and is not equivalent to the disjunction in general.

Variations
Demanding good order properties as above but strong completeness properties at the same time implies $${\mathrm {PEM}}$$. Notably, the MacNeille completion has better completeness properties as a collection, but a more intricate theory of its order-relation and, in turn, worse locatedness properties. While less commonly employed, also this construction simplifies to the classical real numbers when assuming $${\mathrm {PEM}}$$.

Invertibility
In the commutative ring of real numbers, a provably non-invertible element equals zero. This and the most basic locality structure is abstracted in the theory of Heyting fields.

Rational sequences
A common approach is to identify the real numbers with non-volatile sequences in $${\mathbb Q}^{\mathbb N}$$. The constant sequences correspond to rational numbers. Algebraic operations such as addition and multiplication can be defined component-wise, together with a systematic reindexing for speedup. The definition in terms of sequences furthermore enables the definition of a strict order "$$>$$" fulfilling the desired axioms. Other relations discussed above may then be defined in terms of it. In particular, any number $$x$$ apart from $$0$$, i.e. $$x\# 0$$, eventually has an index beyond which all its elements are invertible. Various implications between the relations, as well as between sequences with various properties, may then be proven.

Moduli
As the maximum on a finite set of rationals is decidable, an absolute value map on the reals may be defined and Cauchy convergence and limits of sequences of reals can be defined as usual.

A modulus of convergence is often employed in the constructive study of Cauchy sequences of reals, meaning the association of any $$\varepsilon > 0$$ to an appropriate index (beyond which the sequences are closer than $$\varepsilon$$) is required in the form of an explicit, strictly increasing function $$\varepsilon\mapsto N(\varepsilon)$$. Such a modulus may be considered for a sequence of reals, but it may also be considered for all the reals themselves, in which case one is really dealing with a sequence of pairs.

Bounds and suprema
Given such a model then enables the definition of more set theoretic notions. For any subset of reals, one may speak of an upper bound $$b$$, negatively characterized using $$x\le b$$. One may speak of least upper bounds with respect to "$$\le$$". A supremum is an upper bound given through a sequence of reals, positively characterized using "$$<$$". If a subset with an upper bound is well-behaved with respect to "$$<$$" (discussed below), it has a supremum.

Bishop's formalization
One formalization of constructive analysis, modeling the order properties described above, proves theorems for sequences of rationals $$x$$ fulfilling the regularity condition $$|x_n-x_m|\le \tfrac{1}{n}+\tfrac{1}{m}$$. An alternative is using the tighter $$2^{-n}$$ instead of $$\tfrac{1}{n}$$, and in the latter case non-zero indices ought to be used. No two of the rational entries in a regular sequence are more than $$\tfrac{3}{2}$$ apart and so one may compute natural numbers exceeding any real. For the regular sequences, one defines the logically positive loose positivity property as $$x > 0 \,:=\, \exists n. x_n > \tfrac{1}{n}$$, where the relation on the right hand side is in terms of rational numbers. Formally, a positive real in this language is a regular sequence together with a natural witnessing positivity. Further, $$x\cong y \,:=\, \forall n. |x_n-y_n| \le \tfrac{2}{n}$$, which is logically equivalent to the negation $$\neg\exists n. |x_n-y_n| > \tfrac{2}{n}$$. This is provably transitive and in turn an equivalence relation. Via this predicate, the regular sequences in the band $$|x_n| \le \tfrac{2}{n}$$ are deemed equivalent to the zero sequence. Such definitions are of course compatible with classical investigations and variations thereof were well studied also before. One has $$y > x$$ as $$(y - x) > 0$$. Also, $$x \ge 0$$ may be defined from a numerical non-negativity property, as $$x_n \geq -\tfrac{1}{n}$$ for all $$n$$, but then shown to be equivalent of the logical negation of the former.

Variations
The above definition of $$x\cong y$$ uses a common bound $$\tfrac{2}{n}$$. Other formalizations directly take as definition that for any fixed bound $$\tfrac{2}{N}$$, the numbers $$x$$ and $$y$$ must eventually be forever at least as close. Exponentially falling bounds $$2^{-n}$$ are also used, also say in a real number condition $$\forall n. |x_n-x_{n+1}|<2^{-n}$$, and likewise for the equality of two such reals. And also the sequences of rationals may be required to carry a modulus of convergence. Positivity properties may defined as being eventually forever apart by some rational.

Function choice in $${\mathbb N}^{\mathbb N}$$ or stronger principles aid such frameworks.

Coding
It is worth noting that sequences in $${\mathbb Q}^{\mathbb N}$$ can be coded rather compactly, as they each may be mapped to a unique subclass of $${\mathbb N}$$. A sequence rationals $$i\mapsto \tfrac{n_i}{d_i}(-1)^{s_i}$$ may be encoded as set of quadruples $$\langle i, n_i, d_i, s_i\rangle\in{\mathbb N}^4$$. In turn, this can be encoded as unique naturals $$2^i \cdot 3^{n_i}\cdot 5^{d_i}\cdot 7^{s_i}$$ using the fundamental theorem of arithmetic. There are more economic pairing functions as well, or extension encoding tags or metadata. For an example using this encoding, the sequence $$i\mapsto {\textstyle\sum}_{k=0}^i\tfrac{1}{k}$$, or $$1,2,\tfrac{5}{2},\tfrac{8}{3},\dots$$, may be used to compute Euler's number and with the above coding it maps to the subclass $$\{ 15, 90, 24300, 6561000,\dots\}$$ of $${\mathbb N}$$. While this example, an explicit sequence of sums, is a total recursive function to begin with, the encoding also means these objects are in scope of the quantifiers in second-order arithmetic.

Cauchy reals
In some frameworks of analysis, the name real numbers is given to such well-behaved sequences or rationals, and relations such as $$x\cong y$$ are called the equality or real numbers. Note, however, that there are properties which can distinguish between two $$\cong$$-related reals.

In contrast, in a set theory that models the naturals $${\mathbb N}$$ and validates the existence of even classically uncountable function spaces (and certainly say ${\mathsf{CZF}}$ or even $${\mathsf{ZFC}}$$) the numbers equivalent with respect to "$$\cong$$" in $${\mathbb Q}^{\mathbb N}$$ may be collected into a set and then this is called the Cauchy real number. In that language, regular rational sequences are degraded to a mere representative of a Cauchy real. Equality of those reals is then given by the equality of sets, which is governed by the set theoretical axiom of extensionality. An upshot is that the set theory will prove properties for the reals, i.e. for this class of sets, expressed using the logical equality. Constructive reals in the presence of appropriate choice axioms will be Cauchy-complete but not automatically order-complete.

Dedekind reals
In this context it may also be possible to model a theory or real numbers in terms of Dedekind cuts of $${\mathbb Q}$$. At least when assuming $${\mathrm{PEM}}$$ or dependent choice, these structures are isomorphic.

Interval arithmetic
Another approach is to define a real number as a certain subset of $${\mathbb Q}\times{\mathbb Q}$$, holding pairs representing inhabited, pairwise intersecting intervals.

Uncountability
Recall that the preorder on cardinals "$$\le$$" in set theory is the primary notion defined as injection existence. As a result, the constructive theory of cardinal order can diverge substantially from the classical one. Here, sets like $${\mathbb Q}^{\mathbb N}$$ or some models of the reals can be taken to be subcountable.

That said, Cantors diagonal construction proving uncountability of powersets like $${\mathcal P}{\mathbb N}$$ and plain function spaces like $${\mathbb Q}^{\mathbb N}$$ is intuitionistically valid. Assuming $${\mathrm {PEM}}$$ or alternatively the countable choice axiom, models of $${\mathbb R}$$ are always uncountable also over a constructive framework. One variant of the diagonal construction relevant for the present context may be formulated as follows, proven using countable choice and for reals as sequences of rationals:
 * For any two pair of reals $$a < b$$ and any sequence of reals $$x_n$$, there exists a real $$z$$ with $$ a < z < b $$ and $$ \forall (n \in {\mathbb N}). x_n\, \#\, z$$.

Formulations of the reals aided by explicit moduli permit separate treatments.

According to Kanamori, "a historical misrepresentation has been perpetuated that associates diagonalization with non-constructivity" and a constructive component of the diagonal argument already appeared in Cantor's work.

Category and type theory
All these considerations may also be undertaken in a topos or appropriate dependent type theory.

Principles
For practical mathematics, the axiom of dependent choice is adopted in various schools.

Markov's principle is adopted in the Russian school of recursive mathematics. This principle strengthens the impact of proven negation of strict equality. A so-called analytical form of it grants $$\neg(x\le 0)\to x>0$$ or $$\neg(x\cong 0)\to x\# 0$$. Weaker forms may be formulated.

The Brouwerian school reasons in terms of spreads and adopts the classically valid bar induction.

Anti-classical schools
Through the optional adoption of further consistent axioms, the negation of decidability may be provable. For example, equality-to-zero is rejected to be decidable when adopting Brouwerian continuity principles or Church's thesis in recursive mathematics. The weak continuity principle as well as $${\mathrm{CT}_0}$$ even refute $$x\ge 0 \or 0\ge x$$. The existence of a Specker sequence is proven from $${\mathrm{CT}_0}$$. Such phenomena also occur in realizability topoi. Notably, there are two anti-classical schools as incompatible with one-another. This article discusses principles compatible with the classical theory and choice is made explicit.

Theorems
Many classical theorems can only be proven in a formulation that is logically equivalent, over classical logic. Generally speaking, theorem formulation in constructive analysis mirrors the classical theory closest in separable spaces. Some theorems can only be formulated in terms of approximations.

The intermediate value theorem
For a simple example, consider the intermediate value theorem (IVT). In classical analysis, IVT implies that, given any continuous function f from a closed interval [a,b] to the real line R, if f(a) is negative while f(b) is positive, then there exists a real number c in the interval such that f(c) is exactly zero. In constructive analysis, this does not hold, because the constructive interpretation of existential quantification ("there exists") requires one to be able to construct the real number c (in the sense that it can be approximated to any desired precision by a rational number). But if f hovers near zero during a stretch along its domain, then this cannot necessarily be done.

However, constructive analysis provides several alternative formulations of IVT, all of which are equivalent to the usual form in classical analysis, but not in constructive analysis. For example, under the same conditions on f as in the classical theorem, given any natural number n (no matter how large), there exists (that is, we can construct) a real number cn in the interval such that the absolute value of f(cn) is less than 1/n. That is, we can get as close to zero as we like, even if we can't construct a c that gives us exactly zero.

Alternatively, we can keep the same conclusion as in the classical IVT—a single c such that f(c) is exactly zero—while strengthening the conditions on f. We require that f be locally non-zero, meaning that given any point x in the interval [a,b] and any natural number m, there exists (we can construct) a real number y in the interval such that |y - x| < 1/m and |f(y)| > 0. In this case, the desired number c can be constructed. This is a complicated condition, but there are several other conditions that imply it and that are commonly met; for example, every analytic function is locally non-zero (assuming that it already satisfies f(a) < 0 and f(b) > 0).

For another way to view this example, notice that according to classical logic, if the locally non-zero condition fails, then it must fail at some specific point x; and then f(x) will equal 0, so that IVT is valid automatically. Thus in classical analysis, which uses classical logic, in order to prove the full IVT, it is sufficient to prove the constructive version. From this perspective, the full IVT fails in constructive analysis simply because constructive analysis does not accept classical logic. Conversely, one may argue that the true meaning of IVT, even in classical mathematics, is the constructive version involving the locally non-zero condition, with the full IVT following by "pure logic" afterwards. Some logicians, while accepting that classical mathematics is correct, still believe that the constructive approach gives a better insight into the true meaning of theorems, in much this way.

The least-upper-bound principle and compact sets
Another difference between classical and constructive analysis is that constructive analysis does not prove the least-upper-bound principle, i.e. that any subset of the real line R would have a least upper bound (or supremum), possibly infinite. However, as with the intermediate value theorem, an alternative version survives; in constructive analysis, any located subset of the real line has a supremum. (Here a subset S of R is located if, whenever x < y are real numbers, either there exists an element s of S such that x < s, or y is an upper bound of S.) Again, this is classically equivalent to the full least upper bound principle, since every set is located in classical mathematics. And again, while the definition of located set is complicated, nevertheless it is satisfied by many commonly studied sets, including all intervals and all compact sets.

Closely related to this, in constructive mathematics, fewer characterisations of compact spaces are constructively valid—or from another point of view, there are several different concepts that are classically equivalent but not constructively equivalent. Indeed, if the interval [a,b] were sequentially compact in constructive analysis, then the classical IVT would follow from the first constructive version in the example; one could find c as a cluster point of the infinite sequence (cn)n∈N.