TY - UNPB
T1 - State-of-the-art deep learning models are superior for time series forecasting and are applied optimally with iterative prediction methods
AU - Murray, Cathal
AU - Du Bois, Naomi
AU - Hollywood, L.E.
AU - Coyle, Damien
PY - 2023/2/28
Y1 - 2023/2/28
N2 - In recent years many new algorithms have been developed for applications in speech and image processing which may be repurposed for time series prediction. This paper presents a comprehensive comparative analysis of time series forecasting capabilities of eight such state-of-the-art algorithms – namely: Vanilla Long Short-Term Memory(V-LSTM) Gated Recurrent Unit (GRU), Bidirectional LSTM(BD-LSTM), Auto encoder (AE-LSTM), Convolutional Neural Network LSTM(CNN-LSTM), LSTM with convolutional encoder (ConvLSTM), Attention mechanism networks and the Transformer networks. Model performances across tendifferent benchmark datasets including fields of interests such as finance, weather and sales are evaluated. Direct and iterative prediction methods of forecasting are also comprehensively evaluated. For comprehensive and efficient model optimization, the asynchronous successive halving algorithm (ASHA) is applied in the training folds in a 10 fold cross validation framework. Statistical tests are used to comprehensively compare algorithm performances within and across datasets. We show that whilst there are differences between all models, the differences are insignificant for the top performing models which include the Transformer, Attention, V-LSTM, CNN-LSTM and CV-LSTM. However, the transformer model consistently produces the lowest prediction error. We also show that the iterative multistep ahead prediction method is optimal for long range prediction with these new algorithms.
AB - In recent years many new algorithms have been developed for applications in speech and image processing which may be repurposed for time series prediction. This paper presents a comprehensive comparative analysis of time series forecasting capabilities of eight such state-of-the-art algorithms – namely: Vanilla Long Short-Term Memory(V-LSTM) Gated Recurrent Unit (GRU), Bidirectional LSTM(BD-LSTM), Auto encoder (AE-LSTM), Convolutional Neural Network LSTM(CNN-LSTM), LSTM with convolutional encoder (ConvLSTM), Attention mechanism networks and the Transformer networks. Model performances across tendifferent benchmark datasets including fields of interests such as finance, weather and sales are evaluated. Direct and iterative prediction methods of forecasting are also comprehensively evaluated. For comprehensive and efficient model optimization, the asynchronous successive halving algorithm (ASHA) is applied in the training folds in a 10 fold cross validation framework. Statistical tests are used to comprehensively compare algorithm performances within and across datasets. We show that whilst there are differences between all models, the differences are insignificant for the top performing models which include the Transformer, Attention, V-LSTM, CNN-LSTM and CV-LSTM. However, the transformer model consistently produces the lowest prediction error. We also show that the iterative multistep ahead prediction method is optimal for long range prediction with these new algorithms.
KW - time series prediction
KW - forecasting
U2 - 10.2139/ssrn.4361707
DO - 10.2139/ssrn.4361707
M3 - Preprint
T3 - SSRN Electronic Journal
BT - State-of-the-art deep learning models are superior for time series forecasting and are applied optimally with iterative prediction methods
PB - SSRN
ER -