STAR model



In statistics, Smooth Transition Autoregressive (STAR) models are typically applied to time series data as an extension of autoregressive models, in order to allow for higher degree of flexibility in model parameters through a smooth transition.

Given a time series of data xt, the STAR model is a tool for understanding and, perhaps, predicting future values in this series, assuming that the behaviour of the series changes depending on the value of the transition variable. The transition might depend on the past values of the x series (similar to the SETAR models), or exogenous variables.

The model consists of 2 autoregressive (AR) parts linked by the transition function. The model is usually referred to as the STAR(p) models proceeded by the letter describing the transition function (see below) and p is the order of the autoregressive part. Most popular transition function include exponential function and first and second-order logistic functions. They give rise to Logistic STAR (LSTAR) and Exponential STAR (ESTAR) models.

AutoRegressive Models
Consider a simple AR(p) model for a time series yt
 * $$y_{t}=\gamma_{0}+\gamma_{1}y_{t-1}+\gamma_{2}y_{t-2}+...+\gamma_{p}y_{t-p}+\epsilon_{t}.\,$$

where:
 * $$ \gamma_{i}\,$$ for i=1,2,...,p are autoregressive coefficients, assumed to be constant over time;
 * $$ \epsilon_{t}\stackrel{\mathit{iid}}{\sim}WN(0;\sigma^{2})\, $$ stands for white-noise error term with constant variance.

written in a following vector form:
 * $$ y_{t}=\mathbf{X_{t}\gamma}+\sigma\epsilon_{t}.\,$$

where:
 * $$\mathbf{X_{t}}=(1,y_{t-1},y_{t-2},\ldots,y_{t-p})\,$$ is a column vector of variables;
 * $$\gamma \,$$ is the vector of parameters :$$\gamma_{0}, \gamma_{1},\gamma_{2},..., \gamma_{p}\,$$;
 * $$\epsilon_{t}\stackrel{\mathit{iid}}{\sim}WN(0;1)\,$$ stands for white-noise error term with constant variance.



STAR as an Extension of the AutoRegressive Model
STAR models were introduced and comprehensively developed by Kung-sik Chan and Howell Tong in 1986 (esp. p. 187), in which the same acronym was used. It originally stands for Smooth Threshold AutoRegressive. For some background history, see Tong (2011, 2012). The models can be thought of in terms of extension of autoregressive models discussed above, allowing for changes in the model parameters according to the value of a transition variable zt. Chan and Tong (1986) rigorously proved that the family of STAR models includes the SETAR model as a limiting case by showing the uniform boundedness and equicontinuity with respect to the switching parameter. Without this proof, to say that STAR models nest the SETAR model lacks justification. Unfortunately, whether one should use a SETAR model or a STAR model for one's data has been a matter of subjective judgement, taste and inclination in much of the literature. Fortunately, the test procedure, based on David Cox's test of separate family of hypotheses and developed by Gao, Ling and Tong (2018, Statistica Sinica, volume 28, 2857-2883) is now  available to address this issue. Such a test is important before adopting a STAR model because, among other issues, the parameter controlling its rate of switching is notoriously data-hungry.

Defined in this way, STAR model can be presented as follows:
 * $$	y_{t}=\mathbf{X_{t}}+ G(z_{t}, \zeta, c)\mathbf{X_{t}}+\sigma^{(j)}\epsilon_{t}\,$$

where:
 * $$ X_{t}=(1,y_{t-1},y_{t-2},...,y_{t-p})\,$$ is a column vector of variables;
 * $$G(z_{t}, \zeta, c)$$ is the transition function bounded between 0 and 1.

Basic Structure
They can be understood as two-regime SETAR model with smooth transition between regimes, or as continuum of regimes. In both cases the presence of the transition function is the defining feature of the model as it allows for changes in values of the parameters.

Transition Function
Three basic transition functions and the name of resulting models are:
 * first order logistic function - results in Logistic STAR (LSTAR) model:
 * $$G(z_{t}, \zeta, c) = (1+exp(-\zeta(z_{t}-c)))^{-1}, \zeta>0 $$


 * exponential function - results in Exponential STAR (ESTAR) model:
 * $$G(z_{t}, \zeta, c) = 1-exp(-\zeta(z_{t}-c)^{2}), \zeta>0 $$


 * second order logistic function:
 * $$G(z_{t}, \zeta, c) = (1+exp(-\zeta(z_{t}-c_{1})(z_{t}-c_{2})))^{-1}, \zeta>0 $$