Szemerédi–Trotter theorem

The Szemerédi–Trotter theorem is a mathematical result in the field of Discrete geometry. It asserts that given $n$ points and $m$ lines in the Euclidean plane, the number of incidences (i.e., the number of point-line pairs, such that the point lies on the line) is

$$O \left ( n^{2/3} m^{2/3} + n + m \right ).$$

This bound cannot be improved, except in terms of the implicit constants.

As for the implicit constants, it was shown by János Pach, Radoš Radoičić, Gábor Tardos, and Géza Tóth that the upper bound $$ 2.5n^{2/3} m^{2/3} + n + m$$ holds. Since then better constants are known due to better crossing lemma constants; the current best is 2.44. On the other hand, Pach and Tóth showed that the statement does not hold true if one replaces the coefficient 2.5 with 0.42.

An equivalent formulation of the theorem is the following. Given $n$ points and an integer $k ≥ 2$, the number of lines which pass through at least $k$ of the points is

$$O \left( \frac{n^2}{k^3} + \frac{n}{k} \right ).$$

The original proof of Endre Szemerédi and William T. Trotter was somewhat complicated, using a combinatorial technique known as cell decomposition. Later, László Székely discovered a much simpler proof using the crossing number inequality for graphs. (See below.)

The Szemerédi–Trotter theorem has a number of consequences, including Beck's theorem in incidence geometry and the Erdős-Szemerédi sum-product problem in additive combinatorics.

Proof of the first formulation
We may discard the lines which contain two or fewer of the points, as they can contribute at most $2m$ incidences to the total number. Thus we may assume that every line contains at least three of the points.

If a line contains $k$ points, then it will contain $k − 1$ line segments which connect two consecutive points along the line. Because $k ≥ 3$ after discarding the two-point lines, it follows that $k − 1 ≥ k/2$, so the number of these line segments on each line is at least half the number of incidences on that line. Summing over all of the lines, the number of these line segments is again at least half the total number of incidences. Thus if $e$ denotes the number of such line segments, it will suffice to show that

$$e = O \left ( n^{2/3} m^{2/3} + n + m \right).$$

Now consider the graph formed by using the $n$ points as vertices, and the $e$ line segments as edges. Since each line segment lies on one of $m$ lines, and any two lines intersect in at most one point, the crossing number of this graph is at most the number of points where two lines intersect, which is at most $m(m &minus; 1)/2$. The crossing number inequality implies that either $e ≤ 7.5n$, or that $m(m &minus; 1)/2 ≥ e^{3} / 33.75n^{2}$. In either case $e ≤ 3.24(nm)^{2/3} + 7.5n$, giving the desired bound


 * $$e = O \left ( n^{2/3} m^{2/3} + n + m \right ).$$

Proof of the second formulation
Since every pair of points can be connected by at most one line, there can be at most $n(n − 1)/2$ lines which can connect at $k$ or more points, since $k ≥ 2$. This bound will prove the theorem when $k$ is small (e.g. if $k ≤ C$ for some absolute constant $C$). Thus, we need only consider the case when $k$ is large, say $k ≥ C$.

Suppose that there are m lines that each contain at least $k$ points. These lines generate at least $mk$ incidences, and so by the first formulation of the Szemerédi–Trotter theorem, we have

$$mk = O \left ( n^{2/3} m^{2/3} + n + m \right ),$$

and so at least one of the statements $$mk = O( n^{2/3} m^{2/3} ), mk = O(n)$$, or $$mk = O(m)$$ is true. The third possibility is ruled out since $k$ was assumed to be large, so we are left with the first two. But in either of these two cases, some elementary algebra will give the bound $$m = O( n^2 / k^3 + n/k )$$ as desired.

Optimality
Except for its constant, the Szemerédi–Trotter incidence bound cannot be improved. To see this, consider for any positive integer $$N\in \mathbb{N}$$ a set of points on the integer lattice

$$P = \left \{ (a, b) \in \mathbb{Z}^2 \ : \ 1 \leq a \leq N; 1 \leq b \leq 2N^2 \right \},$$

and a set of lines

$$L = \left \{ (x, mx + b) \ : \ m, b \in \mathbb{Z}; 1 \leq m \leq N; 1 \leq b \leq N^2 \right \}.$$

Clearly, $$|P| = 2N^3$$ and $$|L| = N^3$$. Since each line is incident to $N$ points (i.e., once for each $$x \in \{1, \cdots, N\}$$), the number of incidences is $$N^4$$ which matches the upper bound.

Generalization to $$\mathbb{R}^d$$
One generalization of this result to arbitrary dimension, $$\mathbb{R}^d$$, was found by Agarwal and Aronov. Given a set of $n$ points, $S$, and the set of $m$ hyperplanes, $H$, which are each spanned by $S$, the number of incidences between $S$ and $H$ is bounded above by

$$O \left (m^{2/3}n^{d/3}+n^{d-1} \right ),$$

provided $$ n^{d-2} < m < n^{d}$$. Equivalently, the number of hyperplanes in $H$ containing $k$ or more points is bounded above by

$$O\left( \frac{n^d}{k^3} + \frac{n^{d-1}}{k} \right ).$$

A construction due to Edelsbrunner shows this bound to be asymptotically optimal.

József Solymosi and Terence Tao obtained near sharp upper bounds for the number of incidences between points and algebraic varieties in higher dimensions, when the points and varieties satisfy "certain pseudo-line type axioms". Their proof uses the Polynomial Ham Sandwich Theorem.

In $$\mathbb{C}^2$$
Many proofs of the Szemerédi–Trotter theorem over $$\mathbb{R}$$ rely in a crucial way on the topology of Euclidean space, so do not extend easily to other fields. e.g. the original proof of Szemerédi and Trotter; the polynomial partitioning proof and the crossing number proof do not extend to the complex plane.

Tóth successfully generalized the original proof of Szemerédi and Trotter to the complex plane $$\mathbb{C}^2$$ by introducing additional ideas. This result was also obtained independently and through a different method by Zahl. The implicit constant in the bound is not the same in the complex numbers: in Tóth's proof the constant can be taken to be $$10^{60}$$; the constant is not explicit in Zahl's proof.

When the point set is a Cartesian product, Solymosi and Tardos show that the Szemerédi-Trotter bound holds using a much simpler argument.

In finite fields
Let $$\mathbb{F}$$ be a field.

A Szemerédi-Trotter bound is impossible in general due to the following example, stated here in $$\mathbb{F}_p$$: let $$\mathcal{P} = \mathbb{F}_p\times \mathbb{F}_p$$ be the set of all $$p^2$$ points and let $$\mathcal{L}$$ be the set of all $$p^2$$ lines in the plane. Since each line contains $$p$$ points, there are $$p^3$$ incidences. On the other hand, a Szemerédi-Trotter bound would give $$O((p^2)^{2/3} (p^2)^{2/3} + p^2) = O(p^{8/3})$$ incidences. This example shows that the trivial, combinatorial incidence bound is tight.

Bourgain, Katz and Tao show that if this example is excluded, then an incidence bound that is an improvement on the trivial bound can be attained.

Incidence bounds over finite fields are of two types: (i) when at least one of the set of points or lines is `large' in terms of the characteristic of the field; (ii) both the set of points and the set of lines are `small' in terms of the characteristic.

Large set incidence bounds
Let $$q$$ be an odd prime power. Then Vinh showed that the number of incidences between $$n$$ points and $$m$$ lines in $$\mathbb{F}_q^2$$ is at most

$$\frac{nm}{q} + \sqrt{qnm}.$$

Note that there is no implicit constant in this bound.

Small set incidence bounds
Let $$\mathbb{F}$$ be a field of characteristic $$p\neq 2$$. Stevens and de Zeeuw show that the number of incidences between $$n$$ points and $$m$$ lines in $$\mathbb{F}^2$$ is

$$O\left(m^{11/15}n^{11/15}\right)$$

under the condition $$m^{-2}n^{13} \leq p^{15}$$ in positive characteristic. (In a field of characteristic zero, this condition is not necessary.) This bound is better than the trivial incidence estimate when $$m^{7/8} < n < m^{8/7}$$.

If the point set is a Cartesian Product, then they show an improved incidence bound: let $$\mathcal{P} = A\times B \subseteq \mathbb{F}^2$$ be a finite set of points with $$|A|\leq |B|$$ and let $$\mathcal{L} $$ be a set of lines in the plane. Suppose that $$|A||B|^2 \leq |\mathcal{L}|^3$$ and in positive characteristic that $$|A||\mathcal{L}|\leq p^2$$. Then the number of incidences between $$\mathcal{P}$$ and $$\mathcal{L} $$ is

$$O\left(|A|^{3/4}|B|^{1/2} |\mathcal{L}|^{3/4} + |\mathcal{L}| \right).$$

This bound is optimal. Note that by point-line duality in the plane, this incidence bound can be rephrased for an arbitrary point set and a set of lines having a Cartesian product structure.

In both the reals and arbitrary fields, Rudnev and Shkredov show an incidence bound for when both the point set and the line set has a Cartesian product structure. This is sometimes better than the above bounds.