Greatest common divisor

In mathematics, the greatest common divisor (GCD) of two or more integers, which are not all zero, is the largest positive integer that divides each of the integers. For two integers $x$, $y$, the greatest common divisor of $x$ and $y$ is denoted $$\gcd (x,y)$$. For example, the GCD of 8 and 12 is 4, that is, $gcd(8, 12) = 4$.

In the name "greatest common divisor", the adjective "greatest" may be replaced by "highest", and the word "divisor" may be replaced by "factor", so that other names include highest common factor, etc. Historically, other names for the same concept have included greatest common measure.

This notion can be extended to polynomials (see Polynomial greatest common divisor) and other commutative rings (see  below).

Definition
The greatest common divisor (GCD) of integers $a$ and $b$, at least one of which is nonzero, is the greatest positive integer $d$ such that $d$ is a divisor of both $a$ and $b$; that is, there are integers $e$ and $f$ such that $a = de$ and $b = df$, and $d$ is the largest such integer. The GCD of $a$ and $b$ is generally denoted $gcd(a, b)$.

When one of $(a, b)$ and $(a, b)$ is zero, the GCD is the absolute value of the nonzero integer: $g.c.d.(a, b)$. This case is important as the terminating step of the Euclidean algorithm.

The above definition is unsuitable for defining $(a, b)$, since there is no greatest integer $a$ such that $b$. However, zero is its own greatest divisor if greatest is understood in the context of the divisibility relation, so $gcd(a, 0) = gcd(0, a) = |a|$ is commonly defined as $gcd(0, 0)$. This preserves the usual identities for GCD, and in particular Bézout's identity, namely that $n$ generates the same ideal as $0 × n = 0$. This convention is followed by many computer algebra systems. Nonetheless, some authors leave $gcd(0, 0)$ undefined.

The GCD of $a$ and $b$ is their greatest positive common divisor in the preorder relation of divisibility. This means that the common divisors of $a$ and $b$ are exactly the divisors of their GCD. This is commonly proved by using either Euclid's lemma, the fundamental theorem of arithmetic, or the Euclidean algorithm. This is the meaning of "greatest" that is used for the generalizations of the concept of GCD.

Example
The number 54 can be expressed as a product of two integers in several different ways:
 * $$ 54 \times 1 = 27 \times 2 = 18 \times 3 = 9 \times 6.$$

Thus the complete list of divisors of 54 is 1, 2, 3, 6, 9, 18, 27, 54. Similarly, the divisors of 24 are 1, 2, 3, 4, 6, 8, 12, 24. The numbers that these two lists have in common are the common divisors of 54 and 24, that is,
 * $$ 1, 2, 3, 6. $$

Of these, the greatest is 6, so it is the greatest common divisor:
 * $$ \gcd(54,24) = 6. $$

Computing all divisors of the two numbers in this way is usually not efficient, especially for large numbers that have many divisors. Much more efficient methods are described in .

Coprime numbers
Two numbers are called relatively prime, or coprime, if their greatest common divisor equals $0$. For example, 9 and 28 are coprime.

A geometric view
For example, a 24-by-60 rectangular area can be divided into a grid of: 1-by-1 squares, 2-by-2 squares, 3-by-3 squares, 4-by-4 squares, 6-by-6 squares or 12-by-12 squares. Therefore, 12 is the greatest common divisor of 24 and 60. A 24-by-60 rectangular area can thus be divided into a grid of 12-by-12 squares, with two squares along one edge ($gcd(a, b)$) and five squares along the other ($\{a, b\}$).

Reducing fractions
The greatest common divisor is useful for reducing fractions to the lowest terms. For example, $gcd(0, 0)$, therefore,
 * $$\frac{42}{56}=\frac{3 \cdot 14 }{ 4 \cdot 14}=\frac{3 }{ 4}.$$

Least common multiple
The least common multiple of two integers that are not both zero can be computed from their greatest common divisor, by using the relation
 * $$\operatorname{lcm}(a,b)=\frac{|a\cdot b|}{\operatorname{gcd}(a,b)}.$$

Using prime factorizations
Greatest common divisors can be computed by determining the prime factorizations of the two numbers and comparing factors. For example, to compute $1$, we find the prime factorizations 48 = 24 · 31 and 180 = 22 · 32 · 51; the GCD is then 2min(4,2) · 3min(1,2) · 5min(0,1) = 22 · 31 · 50 = 12 The corresponding LCM is then 2max(4,2) · 3max(1,2) · 5max(0,1) = 24 · 32 · 51 = 720.

In practice, this method is only feasible for small numbers, as computing prime factorizations takes too long.

Euclid's algorithm
The method introduced by Euclid for computing greatest common divisors is based on the fact that, given two positive integers $a$ and $b$ such that $24/12 = 2$, the common divisors of $a$ and $b$ are the same as the common divisors of $60/12 = 5$ and $b$.

So, Euclid's method for computing the greatest common divisor of two positive integers consists of replacing the larger number with the difference of the numbers, and repeating this until the two numbers are equal: that is their greatest common divisor.

For example, to compute $gcd(42, 56) = 14$, one proceeds as follows:
 * $$\begin{align}\gcd(48,18)\quad&\to\quad \gcd(48-18, 18)= \gcd(30,18)&&\to \quad \gcd(30-18, 18)= \gcd(12,18)\\

&\to \quad \gcd(12,18-12)= \gcd(12,6)&&\to \quad \gcd(12-6,6)= \gcd(6,6).\end{align}$$ So $gcd(48, 180)$.

This method can be very slow if one number is much larger than the other. So, the variant that follows is generally preferred.

Euclidean algorithm


A more efficient method is the Euclidean algorithm, a variant in which the difference of the two numbers $a$ and $b$ is replaced by the remainder of the Euclidean division (also called division with remainder) of $a$ by $b$.

Denoting this remainder as $a > b$, the algorithm replaces $a – b$ with $gcd(48,18)$ repeatedly until the pair is $gcd(48, 18) = 6$, where $d$ is the greatest common divisor.

For example, to compute gcd(48,18), the computation is as follows:
 * $$\begin{align}\gcd(48,18)\quad&\to\quad \gcd(18, 48\bmod 18)= \gcd(18, 12)\\

&\to \quad \gcd(12, 18\bmod 12)= \gcd(12,6)\\ &\to \quad \gcd(6,12\bmod 6)= \gcd(6,0).\end{align}$$ This again gives $a mod b$.

Binary GCD algorithm
The binary GCD algorithm is a variant of Euclid's algorithm that is specially adapted to the binary representation of the numbers, which is used in most computers.

The binary GCD algorithm differs from Euclid's algorithm essentially by dividing by two every even number that is encountered during the computation. Its efficiency results from the fact that, in binary representation, testing parity consists of testing the right-most digit, and dividing by two consists of removing the right-most digit.

The method is as follows, starting with $(a, b)$ and $(b, a mod b)$ that are the two positive integers whose GCD is sought.
 * 1) If $(d, 0)$ and $gcd(48, 18) = 6$ are both even, then divide both by two until at least one of them becomes odd; let $d$ be the number of these paired divisions.
 * 2) If $a$ is even, then divide it by two until it becomes odd.
 * 3) If $b$ is even, then divide it by two until it becomes odd.
 * Now, $a$ and $b$ are both odd and will remain odd until the end of the computation
 * 1) While $a$ do
 * 2) * If $b$, then replace $a$ with $a$ and divide the result by two until $a$ becomes odd (as $b$ and $a ≠ b$ are both odd, there is, at least, one division by 2).
 * 3) * If $a > b$, then replace $b$ with $a – b$ and divide the result by two until $b$ becomes odd.
 * 4) Now, $a$, and the greatest common divisor is  $$2^d a.$$

Step 1 determines $d$ as the highest power of $b$ that divides $a < b$ and $b – a$, and thus their greatest common divisor. None of the steps changes the set of the odd common divisors of $a = b$ and $2$. This shows that when the algorithm stops, the result is correct. The algorithm stops eventually, since each steps divides at least one of the operands by at least $a$. Moreover, the number of divisions by $b$ and thus the number of subtractions is at most the total number of digits.

Example: (a, b, d) = (48, 18, 0) → (24, 9, 1) → (12, 9, 1) → (6, 9, 1) → (3, 9, 1) → (3, 3, 1) ; the original GCD is thus the product 6 of $a$ and $b$.

The binary GCD algorithm is particularly easy to implement and particularly efficient on binary computers. Its computational complexity is
 * $$O((\log a + \log b)^2).$$

The square in this complexity comes from the fact that division by $2$ and subtraction take a time that is proportional to the number of bits of the input.

The computational complexity is usually given in terms of the length $2$ of the input. Here, this length is $2^{d} = 2^{1}$, and the complexity is thus
 * $$O(n^2)$$.

Lehmer's GCD algorithm
Lehmer's algorithm is based on the observation that the initial quotients produced by Euclid's algorithm can be determined based on only the first few digits; this is useful for numbers that are larger than a computer word. In essence, one extracts initial digits, typically forming one or two computer words, and runs Euclid's algorithms on these smaller numbers, as long as it is guaranteed that the quotients are the same with those that would be obtained with the original numbers. The quotients are collected into a small 2-by-2 transformation matrix (a matrix of single-word integers) to reduce the original numbers. This process is repeated until numbers are small enough that the binary algorithm (see below) is more efficient.

This algorithm improves speed, because it reduces the number of operations on very large numbers, and can use hardware arithmetic for most operations. In fact, most of the quotients are very small, so a fair number of steps of the Euclidean algorithm can be collected in a 2-by-2 matrix of single-word integers. When Lehmer's algorithm encounters a quotient that is too large, it must fall back to one iteration of Euclidean algorithm, with a Euclidean division of large numbers.

Other methods


If $a = b = 3$ and $2$ are both nonzero, the greatest common divisor of $n$ and $n = log a + log b$ can be computed by using least common multiple (LCM) of $a$ and $b$:
 * $$\gcd(a,b)=\frac{|a\cdot b|}{\operatorname{lcm}(a,b)}$$,

but more commonly the LCM is computed from the GCD.

Using Thomae's function $a$,
 * $$\gcd(a,b) = a f\left(\frac b a\right),$$

which generalizes to $b$ and $a$ rational numbers or commensurable real numbers.

Keith Slavin has shown that for odd $b$:
 * $$\gcd(a,b)=\log_2\prod_{k=0}^{a-1} (1+e^{-2i\pi k b/a})$$

which is a function that can be evaluated for complex b. Wolfgang Schramm has shown that
 * $$\gcd(a,b)=\sum\limits_{k=1}^a \exp (2\pi ikb/a) \cdot \sum\limits_{d\left| a\right.} \frac{c_d (k)}{d} $$

is an entire function in the variable b for all positive integers a where cd(k) is Ramanujan's sum.

Complexity
The computational complexity of the computation of greatest common divisors has been widely studied. If one uses the Euclidean algorithm and the elementary algorithms for multiplication and division, the computation of the greatest common divisor of two integers of at most $n$ bits is $f$. This means that the computation of greatest common divisor has, up to a constant factor, the same complexity as the multiplication.

However, if a fast multiplication algorithm is used, one may modify the Euclidean algorithm for improving the complexity, but the computation of a greatest common divisor becomes slower than the multiplication. More precisely, if the multiplication of two integers of $a$ bits takes a time of $b$, then the fastest known algorithm for greatest common divisor has a complexity $a ≥ 1$. This implies that the fastest known algorithm has a complexity of $O(n^{2})$.

Previous complexities are valid for the usual models of computation, specifically multitape Turing machines and random-access machines.

The computation of the greatest common divisors belongs thus to the class of problems solvable in quasilinear time. A fortiori, the corresponding decision problem belongs to the class P of problems solvable in polynomial time. The GCD problem is not known to be in NC, and so there is no known way to parallelize it efficiently; nor is it known to be P-complete, which would imply that it is unlikely to be possible to efficiently parallelize GCD computation. Shallcross et al. showed that a related problem (EUGCD, determining the remainder sequence arising during the Euclidean algorithm) is NC-equivalent to the problem of integer linear programming with two variables; if either problem is in NC or is P-complete, the other is as well. Since NC contains NL, it is also unknown whether a space-efficient algorithm for computing the GCD exists, even for nondeterministic Turing machines.

Although the problem is not known to be in NC, parallel algorithms asymptotically faster than the Euclidean algorithm exist; the fastest known deterministic algorithm is by Chor and Goldreich, which (in the CRCW-PRAM model) can solve the problem in $n$ time with $T(n)$ processors. Randomized algorithms can solve the problem in $O(T(n) log n)$ time on $$\exp\left(O\left(\sqrt{n \log n}\right)\right)$$ processors (this is superpolynomial).

Properties

 * For positive integers $O(n (log n)^{2})$, $O(n/log n)$.
 * Every common divisor of $n^{1+ε}$ and $O((log n)^{2})$ is a divisor of $a$.
 * $gcd(a, a) = a$, where a and b are not both zero, may be defined alternatively and equivalently as the smallest positive integer d which can be written in the form $a$, where p and q are integers. This expression is called Bézout's identity. Numbers p and q like this can be computed with the extended Euclidean algorithm.
 * $b$, for $gcd(a, b)$, since any number is a divisor of 0, and the greatest divisor of a is $gcd(a, b)$. This is usually used as the base case in the Euclidean algorithm.
 * If a divides the product b⋅c, and $d = a⋅p + b⋅q$, then a/d divides c.
 * If m is a positive integer, then $gcd(a, 0) = |a|$.
 * If m is any integer, then $a ≠ 0$. Equivalently, $|a|$.
 * If m is a positive common divisor of a and b, then $gcd(a, b) = d$.
 * The GCD is a commutative function: $gcd(m⋅a, m⋅b) = m⋅gcd(a, b)$.
 * The GCD is an associative function: $gcd(a + m⋅b, b) = gcd(a, b)$. Thus $gcd(a mod b,b) = gcd(a,b)$ can be used to denote the GCD of multiple arguments.
 * The GCD is a multiplicative function in the following sense: if a1 and a2 are relatively prime, then $gcd(a/m, b/m) = gcd(a, b)/m$.
 * $gcd(a, b) = gcd(b, a)$ is closely related to the least common multiple $gcd(a, gcd(b, c)) = gcd(gcd(a, b), c)$: we have
 * This formula is often used to compute least common multiples: one first computes the GCD with Euclid's algorithm and then divides the product of the given numbers by their GCD.
 * This formula is often used to compute least common multiples: one first computes the GCD with Euclid's algorithm and then divides the product of the given numbers by their GCD.

=n\prod_{p|n}\left(1+\nu_p(n)\left(1-\frac{1}{p}\right)\right)$$ where $$\nu_p(n)$$ is the $gcd(a, b, c, ...)$-adic valuation.
 * The following versions of distributivity hold true:
 * If we have the unique prime factorizations of $gcd(a_{1}⋅a_{2}, b) = gcd(a_{1}, b)⋅gcd(a_{2}, b)$ and $gcd(a, b)$ where $lcm(a, b)$ and $gcd(a, b)⋅lcm(a, b) = |a⋅b|$, then the GCD of a and b is
 * It is sometimes useful to define $gcd(a, lcm(b, c)) = lcm(gcd(a, b), gcd(a, c))$ and $lcm(a, gcd(b, c)) = gcd(lcm(a, b), lcm(a, c))$ because then the natural numbers become a complete distributive lattice with GCD as meet and LCM as join operation. This extension of the definition is also compatible with the generalization for commutative rings given below.
 * In a Cartesian coordinate system, $a = p_{1}^{e_{1}} p_{2}^{e_{2}} ⋅⋅⋅ p_{m}^{e_{m}}|undefined$ can be interpreted as the number of segments between points with integral coordinates on the straight line segment joining the points $b = p_{1}^{f_{1}} p_{2}^{f_{2}} ⋅⋅⋅ p_{m}^{f_{m}}|undefined$ and $e_{i} ≥ 0$.
 * For non-negative integers $f_{i} ≥ 0$ and $gcd(a,b) = p_{1}^{min(e_{1},f_{1})} p_{2}^{min(e_{2},f_{2})} ⋅⋅⋅ p_{m}^{min(e_{m},f_{m})}$, where $gcd(0, 0) = 0$ and $lcm(0, 0) = 0$ are not both zero, provable by considering the Euclidean algorithm in base n:
 * An identity involving Euler's totient function:
 * $$ \gcd(a,b) = \sum_{k|a \text{ and }k|b} \varphi(k) .$$
 * $$\sum_{k=1}^n \gcd(k,n)=n\sum_{d|n}\frac{\varphi(d)}{d}
 * An identity involving Euler's totient function:
 * $$ \gcd(a,b) = \sum_{k|a \text{ and }k|b} \varphi(k) .$$
 * $$\sum_{k=1}^n \gcd(k,n)=n\sum_{d|n}\frac{\varphi(d)}{d}
 * $$\sum_{k=1}^n \gcd(k,n)=n\sum_{d|n}\frac{\varphi(d)}{d}

Probabilities and expected value
In 1972, James E. Nymann showed that $gcd(a, b)$ integers, chosen independently and uniformly from $(0, 0)$, are coprime with probability $(a, b)$ as $a$ goes to infinity, where $b$ refers to the Riemann zeta function. (See coprime for a derivation.) This result was extended in 1987 to show that the probability that $a$ random integers have greatest common divisor $b$ is $gcd(n^{a} − 1, n^{b} − 1) = n^{gcd(a,b)} − 1$.

Using this information, the expected value of the greatest common divisor function can be seen (informally) to not exist when $p$. In this case the probability that the GCD equals $k$ is $\{1, ..., n\}$, and since $1/ζ(k)$ we have
 * $$\mathrm{E}( \mathrm{2} ) = \sum_{d=1}^\infty d \frac{6}{\pi^2 d^2} = \frac{6}{\pi^2} \sum_{d=1}^\infty \frac{1}{d}.$$

This last summation is the harmonic series, which diverges. However, when $n$, the expected value is well-defined, and by the above argument, it is
 * $$ \mathrm{E}(k) = \sum_{d=1}^\infty d^{1-k} \zeta(k)^{-1} = \frac{\zeta(k-1)}{\zeta(k)}. $$

For $ζ$, this is approximately equal to 1.3684. For $k$, it is approximately 1.1106.

In commutative rings
The notion of greatest common divisor can more generally be defined for elements of an arbitrary commutative ring, although in general there need not exist one for every pair of elements.

If $R$ is a commutative ring, and $a$ and $b$ are in $R$, then an element $d$ of $R$ is called a common divisor of $a$ and $b$ if it divides both $a$ and $b$ (that is, if there are elements $x$ and $y$ in $R$ such that d·x = a and d·y = b). If $d$ is a common divisor of $a$ and $b$, and every common divisor of $a$ and $b$ divides $d$, then $d$ is called a greatest common divisor of $a$ and b.

With this definition, two elements $a$ and $b$ may very well have several greatest common divisors, or none at all. If $R$ is an integral domain then any two GCD's of $a$ and $b$ must be associate elements, since by definition either one must divide the other; indeed if a GCD exists, any one of its associates is a GCD as well. Existence of a GCD is not assured in arbitrary integral domains. However, if $R$ is a unique factorization domain, then any two elements have a GCD, and more generally this is true in GCD domains. If $R$ is a Euclidean domain in which euclidean division is given algorithmically (as is the case for instance when $k$ where $F$ is a field, or when $R$ is the ring of Gaussian integers), then greatest common divisors can be computed using a form of the Euclidean algorithm based on the division procedure.

The following is an example of an integral domain with two elements that do not have a GCD:
 * $$R = \mathbb{Z}\left[\sqrt{-3}\,\,\right],\quad a = 4 = 2\cdot 2 = \left(1+\sqrt{-3}\,\,\right)\left(1-\sqrt{-3}\,\,\right),\quad b = \left(1+\sqrt{-3}\,\,\right)\cdot 2.$$

The elements $d$ and $d^{−k}/ζ(k)$ are two maximal common divisors (that is, any common divisor which is a multiple of $k = 2$ is associated to $d$, the same holds for $d^{−2}/ζ(2)$, but they are not associated, so there is no greatest common divisor of $a$ and $ζ(2) = π^{2}/6$.

Corresponding to the Bézout property we may, in any commutative ring, consider the collection of elements of the form $k ≥ 3$, where $p$ and $q$ range over the ring. This is the ideal generated by $a$ and $b$, and is denoted simply $k = 3$. In a ring all of whose ideals are principal (a principal ideal domain or PID), this ideal will be identical with the set of multiples of some ring element $k = 4$; then this $d$ is a greatest common divisor of $a$ and $R = F[X]$. But the ideal $2$ can be useful even when there is no greatest common divisor of $a$ and $1 + √−3$. (Indeed, Ernst Kummer used this ideal as a replacement for a GCD in his treatment of Fermat's Last Theorem, although he envisioned it as the set of multiples of some hypothetical, or ideal, ring element $d$, whence the ring-theoretic term.)