Forecasting - 22 September 2023
R Programming language - on GitHub
Data provided from R fpp3 package.
This case study was conducted as part of the Forecasting module during the first year of the Graduate Diploma in Computer and Information Science. The implementation was carried out using the R programming language, leveraging its robust statistical capabilities to address complex forecasting challenges.
The study was structured into four distinct sections, each aimed at applying various forecasting techniques to real-world datasets:
The primary objective was to evaluate the performance of various forecasting techniques across diverse datasets, considering key accuracy metrics such as RMSE (Root Mean Squared Error) and MAPE (Mean Absolute Percentage Error). By analysing these metrics, the study aimed to identify the trade-offs between simplicity and accuracy in forecasting models and to understand how model complexity impacts predictive performance.
Analysing the forecasting accuracy metrics (RMSE and MAPE) for the models yielded the following insights:
In conclusion, while the Mean model offers superior accuracy, it fails to account for the data's underlying dynamics. The Linear Trend and Seasonal Dummies model and Multiple Linear Regression model, despite having higher error metrics, provide a more nuanced understanding of the data, making them valuable for capturing trends and seasonal variations. Thus, the choice of model should balance the need for accuracy with the complexity of the dataset and forecasting requirements.
The visualisations are presented in the order corresponding to the analysis process.