Function composition

In mathematics, function composition is an operation $∘$ that takes two functions $f$ and $g$, and produces a function $h = g  ∘  f$ such that $h(x) = g(f(x))$. In this operation, the function $g$ is applied to the result of applying the function $f$ to $x$. That is, the functions $f : X → Y$ and $g : Y → Z$ are composed to yield a function that maps $x$ in domain $X$ to $g(f(x))$ in codomain $Z$. Intuitively, if $z$ is a function of $y$, and $y$ is a function of $x$, then $z$ is a function of $x$. The resulting composite function is denoted $g ∘ f : X → Z$, defined by $(g ∘ f )(x) = g(f(x))$ for all $x$ in $X$.

The notation $g ∘ f$ is read as "$g$ of $f$ ", "$g$ after $f$ ", "$g$ circle $f$ ", "$g$ round $f$ ", "$g$ about $f$ ", "$g$ composed with $f$ ", "$g$ following $f$ ", "$f$ then $g$", or "$g$ on $f$ ", or "the composition of $g$ and $f$ ". Intuitively, composing functions is a chaining process in which the output of function $f$ feeds the input of function $g$.

The composition of functions is a special case of the composition of relations, sometimes also denoted by $$\circ$$. As a result, all properties of composition of relations are true of composition of functions, such as the property of associativity.

Composition of functions is different from multiplication of functions (if defined at all), and has some quite different properties; in particular, composition of functions is not commutative.

Examples

 * Composition of functions on a finite set: If $f = {(1, 1), (2, 3), (3, 1), (4, 2)}$, and $g = {(1, 2), (2, 3), (3, 1), (4, 2)}$, then $g ∘ f = {(1, 2), (2, 1), (3, 2), (4, 3)}$, as shown in the figure.
 * Composition of functions on an infinite set: If $f: R → R$ (where $R$ is the set of all real numbers) is given by $f(x) = 2x + 4$ and $g: R → R$ is given by $g(x) = x^{3}$, then:
 * If an airplane's altitude at time $t$ is $(f ∘ g)(x) = f(g(x)) = f(x^{3}) = 2x^{3} + 4$, and the air pressure at altitude $x$ is $(g ∘ f)(x) = g(f(x)) = g(2x + 4) = (2x + 4)^{3}$, then $a(t)$ is the pressure around the plane at time $t$.

Properties
The composition of functions is always associative—a property inherited from the composition of relations. That is, if $f$, $g$, and $h$ are composable, then $p(x)$. Since the parentheses do not change the result, they are generally omitted.

In a strict sense, the composition $(p ∘ a)(t)$ is only meaningful if the codomain of $f$ equals the domain of $g$; in a wider sense, it is sufficient that the former be an improper subset of the latter. Moreover, it is often convenient to tacitly restrict the domain of $f$, such that $f$ produces only values in the domain of $g$. For example, the composition $f ∘ (g ∘ h) = (f ∘ g) ∘ h$ of the functions $g ∘ f$ defined by $g ∘ f$ and $f : R → (−∞,+9]$ defined by $$g(x) = \sqrt x$$ can be defined on the interval $f(x) = 9 − x^{2}$.

The functions $g$ and $f$ are said to commute with each other if $g : [0,+∞) → R$. Commutativity is a special property, attained only by particular functions, and often in special circumstances. For example, $[−3,+3]$ only when $g ∘ f = f ∘ g$. The picture shows another example.

The composition of one-to-one (injective) functions is always one-to-one. Similarly, the composition of onto (surjective) functions is always onto. It follows that the composition of two bijections is also a bijection. The inverse function of a composition (assumed invertible) has the property that $|x| + 3 = |x + 3|$.

Derivatives of compositions involving differentiable functions can be found using the chain rule. Higher derivatives of such functions are given by Faà di Bruno's formula.

Composition monoids
Suppose one has two (or more) functions $x ≥ 0$ $(f ∘ g)^{−1} = g^{−1}∘ f^{−1}$ having the same domain and codomain; these are often called transformations. Then one can form chains of transformations composed together, such as $f: X → X,$. Such chains have the algebraic structure of a monoid, called a transformation monoid or (much more seldom) a composition monoid. In general, transformation monoids can have remarkably complicated structure. One particular notable example is the de Rham curve. The set of all functions $g: X → X$ is called the full transformation semigroup or symmetric semigroup on $X$. (One can actually define two semigroups depending how one defines the semigroup operation as the left or right composition of functions. )

If the transformations are bijective (and thus invertible), then the set of all possible combinations of these functions forms a transformation group; and one says that the group is generated by these functions. A fundamental result in group theory, Cayley's theorem, essentially says that any group is in fact just a subgroup of a permutation group (up to isomorphism).

The set of all bijective functions $f ∘ f ∘ g ∘ f$ (called permutations) forms a group with respect to function composition. This is the symmetric group, also sometimes called the composition group.

In the symmetric semigroup (of all transformations) one also finds a weaker, non-unique notion of inverse (called a pseudoinverse) because the symmetric semigroup is a regular semigroup.

Functional powers
If $f: X → X$, then $f: X → X$ may compose with itself; this is sometimes denoted as $Y ⊆ X$. That is:

More generally, for any natural number $f: X→Y$, the $n$th functional power can be defined inductively by $f^{ 2}$, a notation introduced by Hans Heinrich Bürmann and John Frederick William Herschel. Repeated composition of such a function with itself is called iterated function.
 * By convention, $(f ∘ f)(x) = f(f(x)) = f&thinsp;^{2}(x)$ is defined as the identity map on $(f ∘ f ∘ f)(x) = f(f(f(x))) = f&thinsp;^{3}(x)$'s domain, $(f ∘ f ∘ f ∘ f)(x) = f(f(f(f(x)))) = f&thinsp;^{4}(x)$.
 * If $n ≥ 2$ and $f&thinsp;^{n} = f ∘ f&thinsp;^{n−1} = f&thinsp;^{n−1} ∘ f$ admits an inverse function $f&thinsp;^{0}$ (sometimes called «minus first iteration»), negative functional powers $f&thinsp;$ are defined for $id_{X}$ as the negated power of the inverse function: $Y = X$.

Note: If $f$ takes its values in a ring (in particular for real or complex-valued $f: X → X$), there is a risk of confusion, as $f&thinsp;^{−1}$ could also stand for the $n$-fold product of $f$, e.g. $f&thinsp;^{−n}$. For trigonometric functions, usually the latter is meant, at least for positive exponents. For example, in trigonometry, this superscript notation represents standard exponentiation when used with trigonometric functions: $n > 0$. However, for negative exponents (especially &minus;1), it nevertheless usually refers to the inverse function, e.g., $f&thinsp;^{−n} = (f&thinsp;^{−1})^{n}$.

In some cases, when, for a given function $f$, the equation $f&thinsp;$ has a unique solution $g$, that function can be defined as the functional square root of $f$, then written as $f&thinsp;^{n}$.

More generally, when $f&thinsp;^{2}(x) = f(x) · f(x)$ has a unique solution for some natural number $sin^{2}(x) = sin(x) · sin(x)$, then $tan^{−1} = arctan ≠ 1/tan$ can be defined as $g ∘ g = f$.

Under additional restrictions, this idea can be generalized so that the iteration count becomes a continuous parameter; in this case, such a system is called a flow, specified through solutions of Schröder's equation. Iterated functions and flows occur naturally in the study of fractals and dynamical systems.

To avoid ambiguity, some mathematicians choose to use $g = f&thinsp;^{1/2}$ to denote the compositional meaning, writing $g^{n} = f$ for the $n$-th iterate of the function $n > 0$, as in, for example, $f&thinsp;^{m/n}$ meaning $g^{m}$. For the same purpose, $∘$ was used by Benjamin Peirce whereas Alfred Pringsheim and Jules Molk suggested $f(x)$ instead.

Alternative notations
Many mathematicians, particularly in group theory, omit the composition symbol, writing $f(x)$ for $f(x)$.

In the mid-20th century, some mathematicians decided that writing "$f(f(f(x)))$" to mean "first apply $f$, then apply $g$" was too confusing and decided to change notations. They write "$f(x)$" for "$f(x)$" and "$gf$" for "$g ∘ f$". This can be more natural and seem simpler than writing functions on the left in some areas – in linear algebra, for instance, when $x$ is a row vector and $f$ and $g$ denote matrices and the composition is by matrix multiplication. This alternative notation is called postfix notation. The order is important because function composition is not necessarily commutative (e.g. matrix multiplication). Successive transformations applying and composing to the right agrees with the left-to-right reading sequence.

Mathematicians who use postfix notation may write "$g ∘ f$", meaning first apply $f$ and then apply $g$, in keeping with the order the symbols occur in postfix notation, thus making the notation "$xf&thinsp;$" ambiguous. Computer scientists may write "$f(x)$" for this, thereby disambiguating the order of composition. To distinguish the left composition operator from a text semicolon, in the Z notation the ⨾ character is used for left relation composition. Since all functions are binary relations, it is correct to use the [fat] semicolon for function composition as well (see the article on composition of relations for further details on this notation).

Composition operator
Given a function $(xf)g$, the composition operator $g(f(x))$ is defined as that operator which maps functions to functions as $$C_g f = f \circ g.$$

Composition operators are studied in the field of operator theory.

In programming languages
Function composition appears in one form or another in numerous programming languages.

Multivariate functions
Partial composition is possible for multivariate functions. The function resulting when some argument $fg$ of the function $f$ is replaced by the function $g$ is called a composition of $f$ and $g$ in some computer engineering contexts, and is denoted $fg$ $$f|_{x_i = g} = f (x_1, \ldots, x_{i-1}, g(x_1, x_2, \ldots, x_n), x_{i+1}, \ldots, x_n).$$

When $g$ is a simple constant $b$, composition degenerates into a (partial) valuation, whose result is also known as restriction or co-factor.

$$f|_{x_i = b} = f (x_1, \ldots, x_{i-1}, b, x_{i+1}, \ldots, x_n).$$

In general, the composition of multivariate functions may involve several other functions as arguments, as in the definition of primitive recursive function. Given $f$, a $n$-ary function, and $n$ $m$-ary functions $f ; g$, the composition of $f$ with $g$, is the $m$-ary function $$h(x_1,\ldots,x_m) = f(g_1(x_1,\ldots,x_m),\ldots,g_n(x_1,\ldots,x_m)).$$

This is sometimes called the generalized composite or superposition of f with $C_{g}$. The partial composition in only one argument mentioned previously can be instantiated from this more general scheme by setting all argument functions except one to be suitably chosen projection functions. Here $x_{i}$ can be seen as a single vector/tuple-valued function in this generalized scheme, in which case this is precisely the standard definition of function composition.

A set of finitary operations on some base set X is called a clone if it contains all projections and is closed under generalized composition. A clone generally contains operations of various arities. The notion of commutation also finds an interesting generalization in the multivariate case; a function f of arity n is said to commute with a function g of arity m if f is a homomorphism preserving g, and vice versa i.e.: $$f(g(a_{11},\ldots,a_{1m}),\ldots,g(a_{n1},\ldots,a_{nm})) = g(f(a_{11},\ldots,a_{n1}),\ldots,f(a_{1m},\ldots,a_{nm})).$$

A unary operation always commutes with itself, but this is not necessarily the case for a binary (or higher arity) operation. A binary (or higher arity) operation that commutes with itself is called medial or entropic.

Generalizations
Composition can be generalized to arbitrary binary relations. If $f |_{x_{i} = g}$ and $g_{1}, ..., g_{n}$ are two binary relations, then their composition $g_{1}, ..., g_{n}$ is the relation defined as $g_{1}, ..., g_{n}$. Considering a function as a special case of a binary relation (namely functional relations), function composition satisfies the definition for relation composition. A small circle $g_{1}, ..., g_{n}$ has been used for the infix notation of composition of relations, as well as functions. When used to represent composition of functions $$(g \circ f)(x) \ = \ g(f(x))$$ however, the text sequence is reversed to illustrate the different operation sequences accordingly.

The composition is defined in the same way for partial functions and Cayley's theorem has its analogue called the Wagner–Preston theorem.

The category of sets with functions as morphisms is the prototypical category. The axioms of a category are in fact inspired from the properties (and also the definition) of function composition. The structures given by composition are axiomatized and generalized in category theory with the concept of morphism as the category-theoretical replacement of functions. The reversed order of composition in the formula $R ⊆ X × Y$ applies for composition of relations using converse relations, and thus in group theory. These structures form dagger categories. ''The standard "foundation" for mathematics starts with sets and their elements. It is possible to start differently, by axiomatising not elements of sets but functions between sets. This can be done by using the language of categories and universal constructions.''

''. . . the membership relation for sets can often be replaced by the composition operation for functions. This leads to an alternative foundation for Mathematics upon categories -- specifically, on the category of all functions. Now much of Mathematics is dynamic, in that it deals with morphisms of an object into another object of the same kind. Such morphisms (like functions) form categories, and so the approach via categories fits well with the objective of organizing and understanding Mathematics. That, in truth, should be the goal of a proper philosophy of Mathematics.''

- Saunders Mac Lane, Mathematics: Form and Function

Typography
The composition symbol $S ⊆ Y × Z$ is encoded as ; see the Degree symbol article for similar-appearing Unicode characters. In TeX, it is written.