CHSH inequality

In physics, the CHSH inequality can be used in the proof of Bell's theorem, which states that certain consequences of entanglement in quantum mechanics cannot be reproduced by local hidden-variable theories. Experimental verification of the inequality being violated is seen as confirmation that nature cannot be described by such theories. CHSH stands for John Clauser, Michael Horne, Abner Shimony, and Richard Holt, who described it in a much-cited paper published in 1969. They derived the CHSH inequality, which, as with John Stewart Bell's original inequality, is a constraint—on the statistical occurrence of "coincidences" in a Bell test—which is necessarily true if an underlying local hidden-variable theory exists. In practice, the inequality is routinely violated by modern experiments in quantum mechanics.

Statement
The usual form of the CHSH inequality is

where

$$a$$ and $$a'$$ are detector settings on side $$A$$, $$b$$ and $$b'$$ on side $$B$$, the four combinations being tested in separate subexperiments. The terms $$E(a,b)$$ etc. are the quantum correlations of the particle pairs, where the quantum correlation is defined to be the expectation value of the product of the "outcomes" of the experiment, i.e. the statistical average of $$A(a) \times B(b)$$, where $$A,B$$ are the separate outcomes, using the coding +1 for the '+' channel and &minus;1 for the '&minus;' channel. Clauser et al.'s 1969 derivation was oriented towards the use of "two-channel" detectors, and indeed it is for these that it is generally used, but under their method the only possible outcomes were +1 and &minus;1. In order to adapt to real situations, which at the time meant the use of polarised light and single-channel polarisers, they had to interpret '&minus;' as meaning "non-detection in the '+' channel", i.e. either '&minus;' or nothing. They did not in the original article discuss how the two-channel inequality could be applied in real experiments with real imperfect detectors, though it was later proved that the inequality itself was equally valid. The occurrence of zero outcomes, though, means it is no longer so obvious how the values of E are to be estimated from the experimental data.

The mathematical formalism of quantum mechanics predicts that the value of $$S$$ exceeds 2 for systems prepared in suitable entangled states and the appropriate choice of measurement settings (see below). The maximum violation predicted by quantum mechanics is $$2 \sqrt{2}$$ (Tsirelson's bound) and can be obtained from a maximal entangled Bell state.

Experiments
Many Bell tests conducted subsequent to Alain Aspect's second experiment in 1982 have used the CHSH inequality, estimating the terms using (3) and assuming fair sampling. Some dramatic violations of the inequality have been reported.

In practice most actual experiments have used light rather than the electrons that Bell originally had in mind. The property of interest is, in the best known experiments,  the polarisation direction, though other properties can be used. The diagram shows a typical optical experiment. Coincidences (simultaneous detections) are recorded, the results being categorised as '++', '+&minus;', '&minus;+' or '&minus;&minus;' and corresponding counts accumulated.

Four separate subexperiments are conducted, corresponding to the four terms $$E(a, b)$$ in the test statistic S ($$, above). The settings $a = 0°$, $a&prime; = 45°$, $b = 22.5°$, and $b&prime; = 67.5°$ are generally in practice chosen—the "Bell test angles"—these being the ones for which the quantum mechanical formula gives the greatest violation of the inequality.

For each selected value of $$a,b$$, the numbers of coincidences in each category $$\left\{ N_{++}, N_{--}, N_{+-}, N_{-+} \right\}$$ are recorded. The experimental estimate for $$E(a, b)$$ is then calculated as:

Once all the $$'s have been estimated, an experimental estimate of S (Eq. $$) can be found. If it is numerically greater than 2 it has infringed the CHSH inequality and the experiment is declared to have supported the quantum mechanics prediction and ruled out all local hidden-variable theories.

The CHSH paper lists many preconditions (or "reasonable and/or presumable assumptions") to derive the simplified theorem and formula. For example, for the method to be valid, it has to be assumed that the detected pairs are a fair sample of those emitted. In actual experiments, detectors are never 100% efficient, so that only a sample of the emitted pairs are detected. A subtle, related requirement is that the hidden variables do not influence or determine detection probability in a way that would lead to different samples at each arm of the experiment.

The CHSH inequality has been violated with photon pairs, beryllium ion pairs, ytterbium ion pairs, rubidium atom pairs, whole rubidium-atom cloud pairs, nitrogen vacancies in diamonds, and Josephson phase qubits.

Derivation
The original 1969 derivation will not be given here since it is not easy to follow and involves the assumption that the outcomes are all +1 or &minus;1, never zero. Bell's 1971 derivation is more general. He effectively assumes the "Objective Local Theory" later used by Clauser and Horne. It is assumed that any hidden variables associated with the detectors themselves are independent on the two sides and can be averaged out from the start. Another derivation of interest is given in Clauser and Horne's 1974 paper, in which they start from the CH74 inequality.

Bell's 1971 derivation
The following is based on page 37 of Bell's Speakable and Unspeakable, the main change being to use the symbol ‘E’ instead of ‘P’ for the expected value of the quantum correlation. This avoids any suggestion that the quantum correlation is itself a probability.

We start with the standard assumption of independence of the two sides, enabling us to obtain the joint probabilities of pairs of outcomes by multiplying the separate probabilities, for any selected value of the "hidden variable" λ. λ is assumed to be drawn from a fixed distribution of possible states of the source, the probability of the source being in the state λ for any particular trial being given by the density function ρ(λ), the integral of which over the complete hidden variable space is 1. We thus assume we can write: $$E(a, b) = \int \underline{A}(a, \lambda) \underline{B}(b, \lambda) \rho(\lambda) d\lambda$$ where A and B are the outcomes. Since the possible values of A and B are &minus;1, 0 and +1, it follows that:

Then, if a, a&prime;, b and b&prime; are alternative settings for the detectors,
 * $$\begin{align}

&E(a, b) - E\left(a, b'\right) \\

={} &\int \left[ \underline{A}(a, \lambda) \underline{B}(b, \lambda) - \underline{A}(a, \lambda) \underline{B}\left(b', \lambda\right) \right] \rho(\lambda) d\lambda \\

={} &\int \left[ \underline{A}(a, \lambda) \underline{B}(b, \lambda) - \underline{A}(a, \lambda) \underline{B}\left(b', \lambda\right) \pm \underline{A}(a, \lambda) \underline{B}(b, \lambda) \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \mp \underline{A}(a, \lambda) \underline{B}(b, \lambda) \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \right] \rho(\lambda) d\lambda\\

={} &\int \underline{A}(a, \lambda) \underline{B}(b, \lambda) \left[1 \pm \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \right] \rho(\lambda) d\lambda - \int \underline{A}(a, \lambda) \underline{B}\left(b', \lambda\right) \left[1 \pm \underline{A}\left(a', \lambda\right) \underline{B}(b, \lambda) \right] \rho(\lambda) d\lambda \end{align}$$

Taking absolute values of both sides, and applying the triangle inequality to the right-hand side, we obtain


 * $$\left| E(a, b) - E\left(a, b'\right) \right| \leq \left| \int \underline{A}(a, \lambda) \underline{B}(b, \lambda) \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \right] \rho(\lambda) d\lambda \right| + \left| \int \underline{A}(a, \lambda) \underline{B}\left(b', \lambda\right) \left[1 \pm \underline{A}\left(a', \lambda\right) \underline{B}(b, \lambda) \right] \rho(\lambda) d\lambda \right|$$

We use the fact that $$ \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \right] \rho(\lambda) $$ and $$ \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}(b, \lambda) \right] \rho(\lambda) $$ are both non-negative to rewrite the right-hand side of this as $$ \int \left| \underline{A}(a, \lambda) \underline{B}(b, \lambda) \right| \left| \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right)\right] \rho(\lambda) d\lambda \right| + \int \left| \underline{A}(a, \lambda) \underline{B}(b', \lambda) \right| \left| \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}(b, \lambda) \right] \rho(\lambda) d\lambda \right| $$

By ($$), this must be less than or equal to $$\int \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \right] \rho(\lambda) d\lambda + \int \left[ 1 \pm \underline{A}\left(a', \lambda\right) \underline{B}(b, \lambda) \right] \rho(\lambda) d\lambda$$ which, using the fact that the integral of $ρ(λ)$ is 1, is equal to $$2 \pm \left[ \int \underline{A}\left(a', \lambda\right) \underline{B}\left(b', \lambda\right) \rho(\lambda) d\lambda + \int \underline{A}\left(a', \lambda\right) \underline{B}(b, \lambda) \rho(\lambda) d\lambda \right]$$ which is equal to $$2 \pm \left[ E\left(a', b'\right) + E\left(a', b\right) \right]$$.

Putting this together with the left-hand side, we have: $$\left| E(a, b) - E\left(a, b'\right) \right| \; \leq 2 \; \pm \left[ E\left(a', b'\right) + E\left(a', b\right) \right]$$ which means that the left-hand side is less than or equal to both $$2 + \left[ E\left(a', b'\right) + E\left(a', b\right) \right]$$ and $$2 - \left[ E\left(a', b'\right) + E\left(a', b\right) \right]$$. That is: $$\left| E(a, b) - E\left(a, b'\right) \right| \; \leq \; 2 - \left| E\left(a', b'\right) + E\left(a', b\right) \right|$$ from which we obtain $$2 \;\geq\; \left| E(a, b) - E\left(a, b'\right) \right| + \left| E\left(a', b'\right) + E\left(a', b\right) \right| \;\geq\; \left| E(a, b) - E\left(a, b'\right) + E\left(a', b'\right) + E\left(a', b\right) \right| $$ (by the triangle inequality again), which is the CHSH inequality.

Derivation from Clauser and Horne's 1974 inequality
In their 1974 paper, Clauser and Horne show that the CHSH inequality can be derived from the CH74 one. As they tell us, in a two-channel experiment the CH74 single-channel test is still applicable and provides four sets of inequalities governing the probabilities p of coincidences.

Working from the inhomogeneous version of the inequality, we can write: $$- 1 \; \leq \; p_{jk}(a, b) - p_{jk}(a, b') + p_{jk}(a', b) + p_{jk}(a', b') - p_{jk}(a') - p_{jk}(b) \; \leq \; 0$$ where j and k are each '+' or '&minus;', indicating which detectors are being considered.

To obtain the CHSH test statistic S ($E$), all that is needed is to multiply the inequalities for which j is different from k by &minus;1 and add these to the inequalities for which j and k are the same.

Optimal violation by a general quantum state
In experimental practice, the two particles are not an ideal EPR pair. There is a necessary and sufficient condition for a two-qubit density matrix $$\rho$$ to violate the CHSH inequality, expressed by the maximum attainable polynomial Smax defined in $$. This is important in entanglement-based quantum key distribution, where the secret key rate depends on the degree of measurement correlations.

Let us introduce a 3×3 real matrix $$T_{\rho}$$ with elements $$t_{ij} = \operatorname{Tr}[\rho\cdot(\sigma_i \otimes \sigma_j)]$$, where $$\sigma_1, \sigma_2, \sigma_3$$ are the Pauli matrices. Then we find the eigenvalues and eigenvectors of the real symmetric matrix $$U_\rho = T_\rho^\text{T} T_\rho$$, $$ U_\rho \boldsymbol{e}_i = \lambda_i \boldsymbol{e}_i, \quad |\boldsymbol{e}_i| = 1, \quad i=1,2,3, $$ where the indices are sorted by $$\lambda_1 \geq \lambda_2 \geq \lambda_3$$. Then, the maximal CHSH polynomial is determined by the two greatest eigenvalues, $$ S_\text{max}(\rho) = 2\sqrt{\lambda_1 + \lambda_2}. $$

Optimal measurement bases
There exists an optimal configuration of the measurement bases a, a', b, b' for a given $$\rho$$ that yields Smax with at least one free parameter.

The projective measurement that yields either +1 or −1 for two orthogonal states $$|\alpha\rangle, |\alpha^\perp\rangle$$ respectively, can be expressed by an operator $$\Alpha = |\alpha\rangle\langle\alpha| - |\alpha^\perp\rangle\langle\alpha^\perp|$$. The choice of this measurement basis can be parametrized by a real unit vector $$\boldsymbol{a} \in \mathbb{R}^3, |\boldsymbol{a}|=1$$ and the Pauli vector $$\boldsymbol{\sigma}$$ by expressing $$\Alpha = \boldsymbol{a} \cdot \boldsymbol{\sigma}$$. Then, the expected correlation in bases a, b is $$ E(a,b) = \operatorname{Tr}[\rho(\boldsymbol{a} \cdot \boldsymbol{\sigma})\otimes(\boldsymbol{b} \cdot \boldsymbol{\sigma})] = \boldsymbol{a}^\text{T} T_\rho \boldsymbol{b}. $$ The numerical values of the basis vectors, when found, can be directly translated to the configuration of the projective measurements.

The optimal set of bases for the state $$\rho$$ is found by taking the two greatest eigenvalues $$\lambda_{1,2}$$ and the corresponding eigenvectors $$\boldsymbol{e}_{1,2}$$ of $$U_\rho$$, and finding the auxiliary unit vectors $$ \begin{align} \boldsymbol{c} &= \boldsymbol{e}_1 \cos\varphi + \boldsymbol{e}_2 \sin\varphi, \\ \boldsymbol{c}' &= \boldsymbol{e}_1 \sin\varphi - \boldsymbol{e}_2 \cos\varphi, \end{align} $$ where $$\varphi$$ is a free parameter. We also calculate the acute angle $$ \theta = \operatorname{arctan} \sqrt{\frac{\lambda_2 + \lambda_1 \tan^2{\varphi}}{\lambda_1 + \lambda_2 \tan^2{\varphi}}} $$ to obtain the bases that maximize $$, $$ \begin{align} \boldsymbol{a} &= T_\rho \boldsymbol{c}' / |T_\rho \boldsymbol{c}'|, \\ \boldsymbol{a}' &= T_\rho \boldsymbol{c} / |T_\rho \boldsymbol{c}|, \\ \boldsymbol{b} &= \boldsymbol{c} \cos\theta + \boldsymbol{c}' \sin\theta, \\ \boldsymbol{b}' &= \boldsymbol{c} \cos\theta - \boldsymbol{c}' \sin\theta. \end{align} $$

In entanglement-based quantum key distribution, there is another measurement basis used to communicate the secret key ($$\boldsymbol{a}_0$$ assuming Alice uses the side A). The bases $$\boldsymbol{a}_0, \boldsymbol{b}$$ then need to minimize the quantum bit error rate Q, which is the probability of obtaining different measurement outcomes (+1 on one particle and −1 on the other). The corresponding bases are $$ \begin{align} \boldsymbol{a}_0 &= T_\rho \boldsymbol{e}_1 / |T_\rho \boldsymbol{e}_1|, \\ \boldsymbol{b} &= \boldsymbol{e}_1. \end{align} $$ The CHSH polynomial S needs to be maximized as well, which together with the bases above creates the constraint $$\varphi = \pi/4$$.

CHSH game
The CHSH game is a thought experiment involving two parties separated at a great distance (far enough to preclude classical communication at the speed of light), each of whom has access to one half of an entangled two-qubit pair. Analysis of this game shows that no classical local hidden-variable theory can explain the correlations that can result from entanglement. Since this game is indeed physically realizable, this gives strong evidence that classical physics is fundamentally incapable of explaining certain quantum phenomena, at least in a "local" fashion.

In the CHSH game, there are two cooperating players, Alice and Bob, and a referee, Charlie. These agents will be abbreviated $$A, B, C$$ respectively. At the start of the game, Charlie chooses bits $$x,y \in \{0,1\}$$ uniformly at random, and then sends $$x$$ to Alice and $$y$$ to Bob. Alice and Bob must then each respond to Charlie with bits $$a,b \in \{0,1\}$$ respectively. Now, once Alice and Bob send their responses back to Charlie, Charlie tests if $$a \oplus b = x \land y$$, where ∧ denotes a logical AND operation and ⊕ denotes a logical XOR operation. If this equality holds, then Alice and Bob win, and if not then they lose.

It is also required that Alice and Bob's responses can only depend on the bits they see: so Alice's response $$a$$ depends only on $$x$$, and similarly for Bob. This means that Alice and Bob are forbidden from directly communicating with each other about the values of the bits sent to them by Charlie. However, Alice and Bob are allowed to decide on a common strategy before the game begins.

In the following sections, it is shown that if Alice and Bob use only classical strategies involving their local information (and potentially some random coin tosses), it is impossible for them to win with a probability higher than 75%. However, if Alice and Bob are allowed to share a single entangled qubit pair, then there exists a strategy which allows Alice and Bob to succeed with a probability of ~85%.

Optimal classical strategy
We first establish that any deterministic classical strategy has success probability at most 75% (where the probability is taken over Charlie's uniformly random choice of $$x,y$$). By a deterministic strategy, we mean a pair of functions $$f_A, f_B: \{0,1\} \mapsto \{0,1\}$$, where $$f_A$$ is a function determining Alice's response as a function of the message she receives from Charlie, and $$f_B$$ is a function determining Bob's response based on what he receives. To prove that any deterministic strategy fails at least 25% of the time, we can simply consider all possible pairs of strategies for Alice and Bob, of which there are at most 8 (for each party, there are 4 functions $$\{0,1\} \mapsto \{0,1\}$$). It can be verified that for each of those 8 strategies there is always at least one out of the four possible input pairs $$(x, y) \in \{0,1\}^2$$ which makes the strategy fail. For example, in the strategy where both players always answer 0, we have that Alice and Bob win in all cases except for when $$x=y=1$$, so using this strategy their win probability is exactly 75%.

Now, consider the case of randomized classical strategies, where Alice and Bob have access to correlated random numbers. They can be produced by jointly flipping a coin several times before the game has started and Alice and Bob are still allowed to communicate. The output they give at each round is then a function of both Charlie's message and the outcome of the corresponding coin flip. Such a strategy can be viewed as a probability distribution over deterministic strategies, and thus its success probability is a weighted sum over the success probabilities of the deterministic strategies. But since every deterministic strategy has a success probability of at most 75%, this weighted sum cannot exceed 75% either.

Optimal quantum strategy
Now, imagine that Alice and Bob share the two-qubit entangled state: $|\Phi\rangle = \frac{1}{\sqrt{2}}(|00\rangle + |11\rangle)$, commonly referred to as an EPR pair. Alice and Bob will use this entangled pair in their strategy as described below. The optimality of this strategy then follows from Tsirelson's bound.

Upon receiving the bit $$x$$ from Charlie, Alice will measure her qubit in the basis $$\{|0\rangle, |1\rangle\}$$ or in the basis $$\{|+\rangle, |-\rangle\}$$, conditionally on whether $$x=0$$ or $$x=1$$, respectively. She will then label the two possible outputs resulting from each measurement choice as $$a=0$$ if the first state in the measurement basis is observed, and $$a=1$$ otherwise.

Bob also uses the bit $$y$$ received from Charlie to decide which measurement to perform: if $$y=0$$ he measures in the basis $$\{|a_0\rangle, |a_1\rangle\}$$, while if $$y=1$$ he measures in the basis $$\{|b_0\rangle, |b_1\rangle\}$$, where$$\begin{aligned} &|a_0\rangle = \cos\theta\,|0\rangle + \sin\theta\,|1\rangle, \qquad &|b_0\rangle = \cos\theta\,|0\rangle - \sin\theta\,|1\rangle, \qquad \end{aligned}$$with $$\theta=\pi/8$$.
 * a_1\rangle = -\sin\theta\,|0\rangle + \cos\theta\,|1\rangle, \\
 * b_1\rangle = \sin\theta\,|0\rangle + \cos\theta\,|1\rangle,

To analyze the success probability, it suffices to analyze the probability that they output a winning value pair on each of the four possible inputs $$(x,y)$$, and then take the average. We analyze the case where $$x=y=0$$ here: In this case the winning response pairs are $$a = b = 0$$ and $$a=b=1$$. On input $$x=y=0$$, we know that Alice will measure in the basis $$|0\rangle, |1\rangle$$, and Bob will measure in the basis $$|a_0\rangle, |a_1\rangle$$. Then the probability that they both output 0 is the same as the probability that their measurements yield $$|0\rangle, |a_0\rangle$$ respectively, so precisely $|(\langle 0|\otimes \langle a_0|) |\Phi \rangle|^2 = \frac{1}{2}\cos^2\left(\frac{\pi}{8}\right)$. Similarly, the probability that they both output 1 is exactly $|(\langle 1|\otimes \langle a_1|) |\Phi \rangle|^2 = \frac{1}{2}\cos^2\left(\frac{\pi}{8}\right)$. So the probability that either of these successful outcomes happens is $\cos^2\left(\frac{\pi}{8}\right)$.

In the case of the 3 other possible input pairs, essentially identical analysis shows that Alice and Bob will have the same win probability of $$\cos^2\left(\frac{\pi}{8}\right)$$, so overall the average win probability for a randomly chosen input is $\cos^2\left(\frac{\pi}{8}\right)$. Since $\cos^2\left(\frac{\pi}{8}\right) \approx 85\%$, this is strictly better than what was possible in the classical case.

Modeling general quantum strategies
An arbitrary quantum strategy for the CHSH game can be modeled as a triple $$\mathcal{S} = \left(|\psi\rangle, (A_{0}, A_1), (B_0, B_1)\right)$$ where The optimal quantum strategy described above can be recast in this notation as follows: $$|\psi\rangle \in\mathbb{C}^2\otimes\mathbb{C}^2$$ is the EPR pair $|\psi\rangle = \frac{1}{\sqrt{2}}(|00\rangle + |11\rangle)$, the observable $$A_0 = Z$$ (corresponding to Alice measuring in the $$\{|0\rangle, |1\rangle\}$$ basis), the observable $$A_1 = X$$ (corresponding to Alice measuring in the $$\{|+\rangle, |-\rangle\}$$ basis), where $$X$$ and $$Z$$ are Pauli matrices. The observables $B_{0} = \frac{1}{\sqrt{2}}(X+Z)$ and $B_{1} = \frac{1}{\sqrt{2}}(Z-X)$  (corresponding to each of Bob's choice of basis to measure in). We will denote the success probability of a strategy $$\mathcal{S}$$ in the CHSH game by $$\omega^*_{\text{CHSH}}(\mathcal{S})$$, and we define the bias of the strategy $$\mathcal{S}$$ as $$\beta^*_{\text{CHSH}}(\mathcal{S}) := 2\omega^*_{\text{CHSH}}(\mathcal{S}) - 1$$, which is the difference between the winning and losing probabilities of $$\mathcal{S}$$.
 * $$|\psi\rangle \in \mathbb{C}^d \otimes \mathbb{C}^d$$ is a bipartite state for some $$d$$,
 * $$A_{0}$$ and $$A_{1}$$ are Alice's observables each corresponding to receiving $$x\in\{0,1\}$$ from the referee, and
 * $$B_{0}$$ and $$B_{1}$$ are Bob's observables each corresponding to receiving $$y\in\{0,1\}$$ from the referee.

In particular, we have $$ \beta^*_{\text{CHSH}}(\mathcal{S}) = \frac{1}{4} \sum_{x,y \in \{0,1\}} (-1)^{x\wedge y} \cdot \langle \psi | A_x\otimes B_y |\psi\rangle. $$ The bias of the quantum strategy described above is $\frac{1}{\sqrt{2}}$.

Tsirelson's inequality and CHSH rigidity
Tsirelson's inequality, discovered by Boris Tsirelson in 1980, states that for any quantum strategy $$\mathcal{S}$$ for the CHSH game, the bias $\beta^*_{\text{CHSH}}(\mathcal{S}) \leq \frac{1}{\sqrt{2}}$. Equivalently, it states that success probability $$\omega^*_{\text{CHSH}}(\mathcal{S}) \leq \cos^2\left(\frac{\pi}{8}\right) = \frac{1}{2} + \frac{1}{2\sqrt{2}}$$ for any quantum strategy $$\mathcal{S}$$ for the CHSH game. In particular, this implies the optimality of the quantum strategy described above for the CHSH game.

Tsirelsen's inequality establishes that the maximum success probability of any quantum strategy is $\cos^2\left(\frac{\pi}{8}\right)$, and we saw that this maximum success probability is achieved by the quantum strategy described above. In fact, any quantum strategy that achieves this maximum success probability must be isomorphic (in a precise sense) to the canonical quantum strategy described above; this property is called the rigidity of the CHSH game, first attributed to Summers and Werner. More formally, we have the following result:

$$

Informally, the above theorem states that given an arbitrary optimal strategy for the CHSH game, there exists a local change-of-basis (given by the isometries $$V, W$$) for Alice and Bob such that their shared state $$|\psi\rangle$$ factors into the tensor of an EPR pair $$|\Phi\rangle$$ and an additional auxiliary state $$|\phi\rangle$$. Furthermore, Alice and Bob's observables $$(A_0, A_1)$$ and $$(B_0, B_1)$$ behave, up to unitary transformations, like the $$Z$$ and $$X$$ observables on their respective qubits from the EPR pair. An approximate or quantitative version of CHSH rigidity was obtained by McKague, et al. who proved that if you have a quantum strategy $$\mathcal{S}$$ such that $\omega_{\text{CHSH}}(\mathcal{S}) = \cos^2\left(\frac{\pi}{8}\right) - \epsilon$ for some $$\epsilon > 0$$, then there exist isometries under which the strategy $$\mathcal{S}$$ is $$O(\sqrt{\epsilon})$$-close to the canonical quantum strategy. Representation-theoretic proofs of approximate rigidity are also known.

Applications
Note that the CHSH game can be viewed as a test for quantum entanglement and quantum measurements, and that the rigidity of the CHSH game lets us test for a specific entanglement as well as specific quantum measurements. This in turn can be leveraged to test or even verify entire quantum computations—in particular, the rigidity of CHSH games has been harnessed to construct protocols for verifiable quantum delegation, certifiable randomness expansion, and device-independent cryptography.