Submissions #52
ThomasFaria
started this conversation in
General
Replies: 1 comment
-
Add a few words about reproductibility |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Entry 1
RegARIMA model: automatic ARIMA modelling with detection of outliers and external regressors (financial series, macroeconomic series, toll traffic, google trends...). We use the RJdemetra package that provide an easy way to identify and estimate such models with high flexibility on parameters. RegARIMA models offer some advantages in this exercise :
• they are relatively simple, easy to interpret, and so provide a useful benchmark for more complex models.
• we allow for automatic detection of outliers on the estimation period in order to avoid large bias on coefficients. Without adding outliers, "covid" points would for instance totally distort coefficients. Outliers identified through this procedure could in addition be used in other methods
• computation time is really fast (a few seconds for the whole set of countries)
• external regressors can be selected independantly through standard variable selection.
Entry 2
Dynamic Factor Model: using several sources of data (financial series, macroeconomic series, surveys, toll traffic, google trends…) we estimate a dynamic factor model per country. Missing data are filled, collinearity is taken into account, data are scaled and log transformed when necessary. In order to reduce the estimation burden we restrict the number of factor to a maximum of 4. Estimation takes generally less than 2 seconds per country. Exact timing can be found in the actions of our open-source repository (42 sec for all countries).
Entry 3
Exponential smoothing models: forecasts are linear combinations of past values, with the weights decaying exponentially as the observations get older. Estimation takes 1 second by country. No data processing.
Entry 4
XGboost model: using several sources of data (financial series, macroeconomic series, surveys, toll traffic, google trends…) we construct a dataset in which each row corresponds to a value per country and per date with many explicative variables. On this basis, we train a XGboost model to predict the value of the following month.
Entry 5
LSTM model: using several sources of data (financial series, macroeconomic series, surveys, toll traffic, google trends…) we construct a dataset in which each row corresponds to a value per country and per date with many explicative variables. On this basis, we train a LSTM model to predict the value of the following month, hoping that it will learn from the time series.
Beta Was this translation helpful? Give feedback.
All reactions