Glossary
In this chapter, I decided to collect all the important definitions that are used in this monograph. I provide not only the specific definitions, but also the references to the sections where they appear.
A
- Additive component is the component that is added to the level of the data. e.g. additive seasonality implies that there is an increase in sales every January by \(x\) units in comparison with the baseline level (Section 3.1).
- Analytics is a systematic process of data analysis to support informed decisions (Section 1.1).
- Applied model or estimated model is the statistical model that is applied to the available sample of data (Section 1.4).
- Autocorrelation is the correlation of a variable with itself on a different observation (Subsection 8.3.2).
B
- Backcasting is the procedure of obtaining initial values of states by first applying the model to the original data and then to its values in reverse (Subsection 11.4.1).
D
- Dampening parameter is the parameter that shows what percentage from the captured trend component should be used in producing the one step ahead point forecast (Subsection 4.4.2).
- Data Generating Process is the mathematical model according to which the data was generated (Section 1.4). Only makes sense in case of simulated data.
- Demand intervals the values that show the distance between the demand sizes (see introduction to Chapter 13).
- Demand occurrence is the variable that tracks whether the demand occurred on the specific observation or not (see introduction to Chapter 13).
- Demand sizes are the non-zero values of demand. The term is typically used in the intermittent demand domain (see introduction to Chapter 13).
- Deterministic component is the component that does not evolve over time (Section 3.1). e.g. a deterministic trend is the constant trend that does not change its slope in the data (Subsection 3.3.5).
- Direct forecasting is a procedure of producing a specific h-steps-ahead forecast (e.g., 12 months ahead), skipping all intermediate steps (Section 11.3).
- Dynamic model is the model that has time varying elements in it. This could be time series components, previous values of variables or time varying parameters (see example for the latter in Section 10.3).
E
- Error or Error term is an unexplainable part of data (white noise) that happens due to exhistence of many small factors impacting the response variable (Section 3.1).
- Error measure or error metric is an indicator that captures performance of an approach in terms of some specific statistics (e.g. mean, median etc. Section 2.1).
- Explanatory, or exogenous, or input variable is the variable that impacts the response variable but is not impacted by it (Chapter 10).
F
- Fixed origin evaluation is the scheme when the forecasts performance is assessed only once for a specific test set based on the model estimated on a specific training set (Section 2.4).
- Forecast is a scientifically justified assertion about possible states of an object in the future (Section 1.1).
- Forecast error is the error calculated for the forecast produced on the specific horizon. Should not be confused with the error term or residuals (Section 11.3).
- Forecast horizon is the number of observations for which the forecasting is done (Section 1.3).
- Forecastability condition - A softer version of the “stability condition” that guarantees that with the increase of the sample size, the initial value of the state vector does not have an increasing impact on the final forecast (Section 5.4). e.g. the initial level should not have an increasing weight in the final forecast over time. Global level model satisfies this condition but does not satisfy the stability one.
- Forecasting is a process of producing forecasts (Section 1.1).
H
- Heteroscedasticity is the effect when the variance of the data (or the residuals) does not stay constant (Section 14.6).
I
- Identifiability is the property of a model being uniquely defined for time series. If several different combinations of parameters give exactly the same model fit and forecasts, such model would be considered unidentifiable (see some examples in Section 9.4).
- Intermittent demand is the demand that happens at irregular frequency (see introduction to Chapter 13).
- Invertibility condition - see “Stability condition” (Section 5.4).
L
- Level is the expected (average) demand for a specific time period (Section 3.1).
- Local level model is the model that has only the level component that evolves over time (Section 4.3).
- Local trend model is the model that has level and trend components, where the trend component changes over time (Subsection 4.4.1).
- Loss function or “Cost function” is the mathematical function that needs to be optimised to estimate parameters of a model (Sections 11.1, 11.2, and 11.3).
M
- Measurement equation is the equation that shows how components of time series form the structure of the data (Section 4.6).
- Measurement vector (measurement function) is the vector that describes how specifically the components form a structure in time series (Section 4.6). For example, trend component is added to the level.
- Method (forecasting method) is a particular way of doing something (Section 1.4, Dictionary (2021)).
- Model (statistical/forecasting model) is a mathematical representation of a real phenomenon with a complete specification of distribution and parameters (Section 1.4, Svetunkov and Boylan (2023a)).
- Multicollinearity is the effect when some of the explanatory variables of the model (or their combination) are linearly related with each other (Section 14.9).
- Multistep forecast or multiple-steps-ahead forecast is the forecast produced for 1 to \(h\) steps ahead (Section 11.3).
- Multiplicative component is the component that is multiplied by the level of the data. e.g. multiplicative seasonality implies that there is an increase in sales every January by \(x\)% in comparison with the baseline level (Section 3.1).
N
- Nonparametric is the approach that does not rely on assumptions of the model (Section 18.3). For example, prediction interval can be produced based on the collected in-sample multistep forecast errors, not assuming any distribution and not relying on analytical solutions for the conditional multistep variance.
O
- Outlier (not an outliner!) is the value that lies beyond some selected threshold. These typically imply that some important information was omitted from the model (Section 14.4).
P
- Parametric is the approach for producing a statistics based on the assumed model and distribution (Section 18.3). For example, this is the most restrictive way of generating prediction intervals.
- Partial autocorrelation is the autocorrelation that is cleared from all the interim effects between the observations (Subsection 8.3.3). e.g. a correlation between \(y_t\) and \(y_{t-2}\) without the possible effect on \(y_{t-1}\) on either of the two.
- Persistence vector (persistence function) is the vector that contains all the smoothing parameters (Section 4.6). It shows how the values of components are updated based on the observed forecast error.
- Point forecast is the trajectory produced by a model (Subsection 1.3.1).
R
- Random walk is the process that implies that the differences of the data have a constant level (Subsection 8.1.5), i.e. the only thing that impacts them is the error term. This is the model underlying the Naïve forecasting method (Subsection 3.3.1).
- Range is the metric that shows the width of the prediction interval (Section 2.2).
- Recursive forecasting refers to the approach, when multistep forecasts are produced iteratively, one after the other, typically based on a mathematical formula (see examples in Section 18.1.1). This is the default way of producing forecasts from dynamic models.
- Regression model is the model that captures a linear relation between the response and a set of the explanatory variables.
- Residuals are the estimated errors of the model. Whenever you apply a model to the data, you end up with the structure and residuals (Section 1.4).
- Response or output variable is the variable of interest that we want to forecast (Chapter 10).
- Rolling origin evaluation is the scheme when the forecasts performance is assessed iteratively on a moving holdout sample by adding observations to the in-sample. This is sometimes also called “Time series cross-validation” (Section 2.4).
S
- Seasonal normalisation is the process that guarantees that the seasonal indices either add up to zero - in case of the additive seasonality - or have a geometric mean of one - in case of the multiplicative one (Section 7.5).
- Seasonality is a pattern that repeats itself with a fixed periodicity (Section 3.1).
- Semiparametric is the approach which lifts some of assumptions of the model, but not all (Section 18.3). For example, prediction interval can still rely on normality, but use empirically evaluated multistep forecast variances.
- Smoothing parameter is the parameter that regulates the reactivity in the update of a time series component. Used in SES (Section 3.4) and ETS (Section 4.2).
- Stability condition (“invertibility” in case of ARIMA) is the restriction on the models parameters that guarantees that model forgets the old information and puts more weight on the recent observations (Section 5.4).
- State space model is the model that describes how different components make up the data and how those components evolve over time (Section 4.6).
- State vector is the vector that contains components (states) of time series (Section 4.6). Can include level, trend, seasonal components together with values of explanatory variables .
- Stationarity is the condition that guarantees that the unconditional moments of the model stay the same (Section 8.1. Can be weak, implying that only the expectation and the variance need to be constant and strong, implying in addition that the probability distribution does not change over time.
- Static model is the model, which structure does not change over time. An example of such model would be a Global level, a Global trend, or a regression model (see respective terms in the Glossary).
- Stochastic component is the component that can change its value over time. e.g. a stochastic trend can be positive in the beginning of the sample than slowly change its value and become negative by the end of the sample (Section 3.1).
T
- Transition equation is a system of equations describing how different components of time series interact with each other and evolve over time (Section 4.6).
- Transition matrix (transition function) is the matrix describing the interactions between the components (Section 4.6).
- Trend or growth is the average increase or decrease of the value over a period of time (Section 3.1).
- True model is the idealistic statistical model that is correctly specified (has all the necessary components in the correct form), and applied to the data in population” (Section 1.4, Svetunkov and Yusupova (2025)).
References
• Dictionary, 2021. Method. https://dictionary.cambridge.org/dictionary/english/method version: 2021-09-02
• Svetunkov, I., Boylan, J.E., 2023a. iETS: State Space Model for Intermittent Demand Forecastings. International Journal of Production Economics. 109013. https://doi.org/10.1016/j.ijpe.2023.109013
• Svetunkov, I., Yusupova, A., 2025. Statistics for business analytics. https://openforecast.org/sba/ version: 01.06.2025