Please use this identifier to cite or link to this item: https://hdl.handle.net/11000/32294

An algorithm for automatic selection and combination of forecast models


Thumbnail

View/Open:
 1-s2.0-S0957417423021383-main.pdf

839,21 kB
Adobe PDF
Share:
Title:
An algorithm for automatic selection and combination of forecast models
Authors:
GARCIA AROCA, CARLOS  
Martínez Mayoral, Mª Asunción  
Morales-Socuéllamos, Javier
Segura-Heras, José Vicente  
Editor:
Elsevier
Department:
Departamentos de la UMH::Estadística, Matemáticas e Informática
Issue Date:
2023-09-18
URI:
https://hdl.handle.net/11000/32294
Abstract:
In this paper, we present an algorithm designed to automatically merge predictions from a collection of individual prediction methods coded in R. The algorithm employs varying weights and decision rules to ascertain the optimal amalgamation of these methods, with the aim of forecasting historical time series data while minimizing human intervention. The algorithm serves as an automated component within the artificial intelligence toolkit. The proposed algorithm (al), denoted as “alPCA” is founded on principal component analysis (PCA), hence the acronym. Commencing with 52 configurations of 11 distinct methods available in R, we calculate several loss functions: specifically, scaled Mean Absolute Percentage Error (sMAPE) and Mean Absolute Scaled Error (MASE) for both fitting (Training Phase) and prediction (Validation Phase), along with Root Mean Squared Error (RMSE) and Overall Weighted Average (OWA) solely for prediction (Validation Phase). We then employ PCA to reduce the error matrix derived from this data to one or two dimensions. Subsequently, the methods are ranked based on their proximity to the highest score. A probability distribution is fitted to this proximity metric, and utilizing the percentiles of these values, the optimal methods for combination are selected. We propose three categories of weights derived from the PCA scores, encompassing the fitting sMAPE (Training Phase) and the prediction sMAPE (Validation Phase), to facilitate the amalgamation process. This approach is applied to seven distinct univariate time series across diverse domains, including automobile sales, electricity production, and CO2 levels. Additionally, a set of 100 random monthly series from the M4 competition is included in the analysis. To assess the predictive precision of our algorithm, we compare its performance against three widely utilized combined prediction algorithms available in R. We evaluate the outcomes in Test Phase (unseen data) using four distinct loss functions and conduct a sensitivity analysis to gauge the algorithm’s robustness and efficacy across various specifications.
Keywords/Subjects:
Forecasting
Time series methods
Forecasting combination
Principal component analysis
Intelligent Manufacturing
Knowledge area:
CDU: Ciencias puras y naturales: Matemáticas
Type of document:
info:eu-repo/semantics/article
Access rights:
info:eu-repo/semantics/openAccess
DOI:
https://doi.org/10.1016/j.eswa.2023.121636
Appears in Collections:
Artículos Estadística, Matemáticas e Informática



Creative Commons ???jsp.display-item.text9???