Polling system

In queueing theory, a discipline within the mathematical theory of probability, a polling system or polling model is a system where a single server visits a set of queues in some order. The model has applications in computer networks and telecommunications, manufacturing and road traffic management. The term polling system was coined at least as early as 1968 and the earliest study of such a system in 1957 where a single repairman servicing machines in the British cotton industry was modelled.

Typically it is assumed that the server visits the different queues in a cyclic manner. Exact results exist for waiting times, marginal queue lengths and joint queue lengths at polling epochs in certain models. Mean value analysis techniques can be applied to compute average quantities.

In a fluid limit, where a very large number of small jobs arrive the individual nodes can be viewed to behave similarly to fluid queues (with a two state process).

Model definition
A group of n queues are served by a single server, typically in a cyclic order 1, 2, …, n, 1, …. New jobs arrive at queue i according to a Poisson process of rate λi and are served on a first-come, first-served basis with each job having a service time denoted by an independent and identically distributed random variables Si.

The server chooses when to progress to the next node according to one of the following criteria: If a queueing node is empty the server immediately moves to serve the next queueing node.
 * exhaustive service, where a node continues to receive service until the buffer is empty.
 * gated service, where the node serves all traffic that was present at the instant that the server arrived and started serving, but subsequent arrivals during this service time must wait until the next server visit.
 * limited service, where a maximum fixed number of jobs can be served in each visit by the server.

The time taken to switch from serving node i − 1 and node i is denoted by the random variable di.

Utilization
Define ρi = λi E(Si) and write ρ = ρ1 + ρ2 + … + ρn. Then ρ is the long-run fraction of time the server spends attending to customers.

Expected waiting time
For gated service, the expected waiting time at node i is
 * $$\mathbb{E}(W_i) = \frac{1+\rho_i}{2} \mathbb{E}(C) + \frac{(1+\rho_i) \text{Var}(C_i)}{2 \mathbb{E}(C)}$$

and for exhaustive service
 * $$\mathbb{E}(W_i) = \frac{1-\rho_i}{2} \mathbb{E}(C) + \frac{(1-\rho_i) \text{Var}(C_{i+1})}{2 \mathbb{E}(C)}$$

where Ci is a random variable denoting the time between entries to node i and
 * $$\mathbb{E}(C) = \sum_{i=1}^n \frac{\mathbb{E}(d_i)}{1-\rho}$$

The variance of Ci is more complicated and a straightforward calculation requires solving n2 linear equations and n2 unknowns, however it is possible to compute from n equations.

Heavy traffic
The workload process can be approximated by a reflected Brownian motion in a heavily loaded and suitably scaled system if switching servers is immediate and a Bessel process when switching servers takes time.

Applications
Polling systems have been used to model Token Ring networks.