49
views
0
recommends
+1 Recommend
3 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Deep Learning LSTM Approach to Predict COVD-19 Deaths in North Africa

      research-article
      , PhD 1
      Asia-Pacific Journal of Public Health
      SAGE Publications

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Introduction The countries of Tunisia, Algeria, and Morocco are part of the North Africa region, also called the Maghreb. As of October 16, 2022, there are 52413 COVID-19 related deaths reported for these countries despite the significant progress in vaccination. 1 A notable interest by scholars has emerged recently to model and to forecast the spread and the lethality of the pandemic and its impact in the Middle East and North Africa region. In a recent study, spatial panel-data models were used to identify the factors for the spike of COVID-19 infections in North Africa. 2 In another study, a statistical analysis was performed based on zero-inflation models and autoregressive conditional count models to forecast death counts with evidence from Tunisian data. 3 Furthermore, quantitative analyses including statistical modeling and deep learning methods have also been performed to forecast the pandemic outbreak in different parts of the world. 4 For instance, some authors presented long short-term memory (LSTM) based models to predict novel infections of the coronavirus in India, whereas in other studies deep learning methods were used to forecast new COVID-19 cases and death rates in Australia and Iran. 5 Using COVID-19 datasets of several countries including Brazil, Germany, Italy, Spain, United Kingdom, China, India, Israel, Russia, and United States, alternative deep learning methods were studied and their results were compared in terms of forecast performance. 6 In this paper, we contribute to this ongoing literature, by conducting a statistical analysis with publicly available data on the coronavirus death counts for the Maghreb countries, and we show that the method of deep learning with LSTM network outperforms time series autoregressive integrated moving average (ARIMA) models in terms of forecast accuracy for the pandemic deaths. Methods There are several deep learning methods in the literature, including LSTM neural network, Gated Recurrent Unit, Convolutional Neural Network, Deep Neural Network, Extreme Learning Machine and Multilayer Perceptron. 7 The main components of the LSTM are its gates which are given by the input gate, the forget gate, and the output gate. The input gate will control the inflow of new information into the cell. The forget gate will control the content of the memory, that is, the forget gate will decide if we want to forget a piece of information so we can store new information. The output gate will control when the information is used in the output from the cell. In this paper, we limit our empirical analysis to only one state-of-the-art method which is based on LSTM networks to predict COVID-19 deaths in the Maghreb. In general, an LSTM network solves the gradient vanishing problem which characterizes recurrent neural networks. It does so by modeling the long-term dependencies of a time series with an optimal lag length, and by allowing memory unit of the process to decide, remember, and forget information, accordingly, which can create connections between present and past data observations, and can compute mapping between input and output sequences. To apply the LSTM methodology, each country data are divided into two subsets, one for in-sample training and it includes 90% of the data, and the other 10% is for out of sample prediction. The adaptive moment estimation (ADAM) optimization algorithm is applied to the training data which is standardized to have zero mean and unit variance. The algorithm computes an exponential moving average of the gradient and its square with specified parameter values to control the decay rates. ADAM optimizer is popular in deep learning applications, and with the new updates to the learning rate, scholars addressed the shortcomings of its original algorithm and made it a more reliable optimizer. 8 In this paper, we compute the prediction values of deaths using the predictAndUpdateState MATLAB function. The final step is to compute the root mean square error of the forecasts and to plot the observed and the predicted values of COVID-19-related deaths for each North African country. Next, we fit the data to time series ARIMA models with R software and we calculate the root mean square error (RMSE) to compare the forecast accuracy of these models with deep learning LSTM models. An autoregressive integrated moving average model can be represented as follows: (1) φ ( B ) ( 1 − B ) d ( y t − μ ) = θ ( B ) ε t Where ψ ( B ) and θ ( B ) are polynomial functions in the backshift operator B, y t is the time series, d is the order of integration, and ε t is a white noise process. Results We collected publicly available data on death counts related to the pandemic for each of the three North African countries. The data sets which have no missing values, cover the period from March 24, 2020, to April 21, 2021, and they can be obtained online. 9 The computations for the LSTM model predictions are performed with MATLAB programs, and for ARIMA models we applied R software coding. Auto.arima function in R shows that the best fit for the pandemic death data is given by an ARIMA model of order (0,1,1) for Algeria, (1,1,2) for Morocco, and (2,2,3) for Tunisia. We run these models to compute postsample predictions and root mean square error for each country data. Figure 1 displays the observed and updated forecast values of LSTM models for each of the Maghreb countries, based on the last 10% of data, which is used for prediction, whereas the first subsample of 90% of the data is used for training. There are 394 daily observations in total for each COVID-19 death data and therefore the figure lists the last 39 observed and predicted values and the forecast errors obtained from deep learning LSTM method, from March 14, 2021, to April 21, 2021. Figure 1. Observed and predicted COVID-19 deaths in North Africa. Table 1 lists RMSE for both ARIMA and LSTM models and shows clear evidence of better forecast accuracy of the deep learning method compared to times series ARIMA models. The root mean square errors for each of Algeria, Morocco, and Tunisia COVID-19 death data are much smaller with long short-term memory networks than with the autoregressive integrated moving average time series methods. Table 1. RMSE for LSTM and ARIMA Model Forecasts of COVID-19 Deaths. Country COVID-19 death count(as of October 16, 2022) RMSE Deep learning LSTM network ARIMA model Algeria 6881 1.185 2.698 Morocco 16278 3.056 6.054 Tunisia 29254 25.440 31.212 Abbreviations: ARIMA, autoregressive integrated moving average; LSTM, long short-term memory; RMSE, root mean square error. Discussion As we enter the third year of the pandemic, COVID-19 deaths have so far exceeded 6 million worldwide, and people are still struggling to return to normalcy. Like other parts of the world, the North African countries have been negatively impacted socially and economically by the pandemic outbreak and active research has emerged to model and to predict accurately the outcomes of COVID-19 in the region. As a contribution to the ongoing literature on reliable statistical modeling of the pandemic outbreak, we present an empirical study based on deep learning LSTM methods to forecast the pandemic lethality in North Africa and to compare the forecast accuracy of these models with time series ARIMA models. Our study finds that methods based on deep learning networks provide more accurate forecasts than time series autoregressive integrated moving average models, with lower root mean square forecast errors. It is very important for health official and healthcare professionals to have access to accurate forecasts of COVID-19 lethality to implement measured and effective health policies. The findings of the paper show that deep learning networks have more accurate predictions of deaths related to the pandemic than time series models. In a related study based on COVID-19 data from the Gulf countries, it was found that state space models outperform LSTM networks in terms of forecast accuracy in presence of highly complex surveillance data. 10 One limitation of this study is that it does not include a comprehensive complexity analysis in order to verify whether the superiority of deep learning LSTM models over ARIMA models in terms of more accurate forecasts with lower root mean square errors may be explained by the notion of data complexity. Also, this paper used conventional recurrent neural networks which are only capable of training a single model, whereas a bidirectional LSTM network allows the information to be processed from a sequence of input data and from the reverse of that sequence, and therefore better predictions may be expected from bidirectional LSTM than from the conventional LSTM model. This could be a direction for future empirical research on the topic.

          Related collections

          Most cited references8

          • Record: found
          • Abstract: found
          • Article: not found

          Predictions for COVID-19 with Deep Learning Models of LSTM, GRU and Bi-LSTM

          COVID-19, responsible of infecting billions of people and economy across the globe, requires detailed study of the trend it follows to develop adequate short-term prediction models for forecasting the number of future cases. In this perspective, it is possible to develop strategic planning in the public health system to avoid deaths as well as managing patients. In this paper, proposed forecast models comprising autoregressive integrated moving average (ARIMA), support vector regression (SVR), long shot term memory (LSTM), bidirectional long short term memory (Bi-LSTM) are assessed for time series prediction of confirmed cases, deaths and recoveries in ten major countries affected due to COVID-19. The performance of models is measured by mean absolute error, root mean square error and r2_score indices. In the majority of cases, Bi-LSTM model outperforms in terms of endorsed indices. Models ranking from good performance to the lowest in entire scenarios is Bi-LSTM, LSTM, GRU, SVR and ARIMA. Bi-LSTM generates lowest MAE and RMSE values of 0.0070 and 0.0077, respectively, for deaths in China. The best r2_score value is 0.9997 for recovered cases in China. On the basis of demonstrated robustness and enhanced prediction accuracy, Bi-LSTM can be exploited for pandemic prediction for better planning and management.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods

            The first known case of Coronavirus disease 2019 (COVID-19) was identified in December 2019. It has spread worldwide, leading to an ongoing pandemic, imposed restrictions and costs to many countries. Predicting the number of new cases and deaths during this period can be a useful step in predicting the costs and facilities required in the future. The purpose of this study is to predict new cases and deaths rate one, three and seven-day ahead during the next 100 days. The motivation for predicting every n days (instead of just every day) is the investigation of the possibility of computational cost reduction and still achieving reasonable performance. Such a scenario may be encountered in real-time forecasting of time series. Six different deep learning methods are examined on the data adopted from the WHO website. Three methods are LSTM, Convolutional LSTM, and GRU. The bidirectional extension is then considered for each method to forecast the rate of new cases and new deaths in Australia and Iran countries. This study is novel as it carries out a comprehensive evaluation of the aforementioned three deep learning methods and their bidirectional extensions to perform prediction on COVID-19 new cases and new death rate time series. To the best of our knowledge, this is the first time that Bi-GRU and Bi-Conv-LSTM models are used for prediction on COVID-19 new cases and new deaths time series. The evaluation of the methods is presented in the form of graphs and Friedman statistical test. The results show that the bidirectional models have lower errors than other models. A several error evaluation metrics are presented to compare all models, and finally, the superiority of bidirectional methods is determined. This research could be useful for organisations working against COVID-19 and determining their long-term plans.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Deep learning via LSTM models for COVID-19 infection forecasting in India

              The COVID-19 pandemic continues to have major impact to health and medical infrastructure, economy, and agriculture. Prominent computational and mathematical models have been unreliable due to the complexity of the spread of infections. Moreover, lack of data collection and reporting makes modelling attempts difficult and unreliable. Hence, we need to re-look at the situation with reliable data sources and innovative forecasting models. Deep learning models such as recurrent neural networks are well suited for modelling spatiotemporal sequences. In this paper, we apply recurrent neural networks such as long short term memory (LSTM), bidirectional LSTM, and encoder-decoder LSTM models for multi-step (short-term) COVID-19 infection forecasting. We select Indian states with COVID-19 hotpots and capture the first (2020) and second (2021) wave of infections and provide two months ahead forecast. Our model predicts that the likelihood of another wave of infections in October and November 2021 is low; however, the authorities need to be vigilant given emerging variants of the virus. The accuracy of the predictions motivate the application of the method in other countries and regions. Nevertheless, the challenges in modelling remain due to the reliability of data and difficulties in capturing factors such as population density, logistics, and social aspects such as culture and lifestyle.
                Bookmark

                Author and article information

                Journal
                Asia Pac J Public Health
                Asia Pac J Public Health
                APH
                spaph
                Asia-Pacific Journal of Public Health
                SAGE Publications (Sage CA: Los Angeles, CA )
                1010-5395
                1941-2479
                2 December 2022
                2 December 2022
                : 10105395221141590
                Affiliations
                [1 ]University of Prince Edward Island, Charlottetown, PE, Canada
                Author notes
                [*]Sami Khedhiri, University of Prince Edward Island, 550 University Avenue, Charlottetown, PE C1A 4P3, Canada. Email: skh.upei@ 123456gmail.com
                Author information
                https://orcid.org/0000-0001-7496-3182
                Article
                10.1177_10105395221141590
                10.1177/10105395221141590
                9720417
                36461616
                884e0795-7306-43b2-b783-0723c72c12f3
                © 2022 APJPH

                This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License ( https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page ( https://us.sagepub.com/en-us/nam/open-access-at-sage).

                History
                Categories
                Short Communication
                Custom metadata
                corrected-proof
                ts1

                Comments

                Comment on this article