0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Prediction of hepatitis E using machine learning models

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Accurate and reliable predictions of infectious disease can be valuable to public health organizations that plan interventions to decrease or prevent disease transmission. A great variety of models have been developed for this task. However, for different data series, the performance of these models varies. Hepatitis E, as an acute liver disease, has been a major public health problem. Which model is more appropriate for predicting the incidence of hepatitis E? In this paper, three different methods are used and the performance of the three methods is compared.

          Methods

          Autoregressive integrated moving average(ARIMA), support vector machine(SVM) and long short-term memory(LSTM) recurrent neural network were adopted and compared. ARIMA was implemented by python with the help of statsmodels. SVM was accomplished by matlab with libSVM library. LSTM was designed by ourselves with Keras, a deep learning library. To tackle the problem of overfitting caused by limited training samples, we adopted dropout and regularization strategies in our LSTM model. Experimental data were obtained from the monthly incidence and cases number of hepatitis E from January 2005 to December 2017 in Shandong province, China. We selected data from July 2015 to December 2017 to validate the models, and the rest was taken as training set. Three metrics were applied to compare the performance of models, including root mean square error(RMSE), mean absolute percentage error(MAPE) and mean absolute error(MAE).

          Results

          By analyzing data, we took ARIMA(1, 1, 1), ARIMA(3, 1, 2) as monthly incidence prediction model and cases number prediction model, respectively. Cross-validation and grid search were used to optimize parameters of SVM. Penalty coefficient C and kernel function parameter g were set 8, 0.125 for incidence prediction, and 22, 0.01 for cases number prediction. LSTM has 4 nodes. Dropout and L2 regularization parameters were set 0.15, 0.001, respectively. By the metrics of RMSE, we obtained 0.022, 0.0204, 0.01 for incidence prediction, using ARIMA, SVM and LSTM. And we obtained 22.25, 20.0368, 11.75 for cases number prediction, using three models. For MAPE metrics, the results were 23.5%, 21.7%, 15.08%, and 23.6%, 21.44%, 13.6%, for incidence prediction and cases number prediction, respectively. For MAE metrics, the results were 0.018, 0.0167, 0.011 and 18.003, 16.5815, 9.984, for incidence prediction and cases number prediction, respectively.

          Conclusions

          Comparing ARIMA, SVM and LSTM, we found that nonlinear models(SVM, LSTM) outperform linear models(ARIMA). LSTM obtained the best performance in all three metrics of RSME, MAPE, MAE. Hence, LSTM is the most suitable for predicting hepatitis E monthly incidence and cases number.

          Related collections

          Most cited references14

          • Record: found
          • Abstract: not found
          • Article: not found

          Generalized autoregressive conditional heteroskedasticity

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Exponentially Weighted Moving Average

            J. Hunter (2018)
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Financial time series forecasting using independent component analysis and support vector regression

                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: MethodologyRole: Writing – original draft
                Role: ConceptualizationRole: Data curationRole: MethodologyRole: Writing – original draft
                Role: Formal analysisRole: MethodologyRole: Writing – review & editing
                Role: Data curationRole: MethodologyRole: Writing – review & editing
                Role: Data curationRole: Methodology
                Role: Data curationRole: Methodology
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2020
                17 September 2020
                : 15
                : 9
                : e0237750
                Affiliations
                [1 ] School of Data and Computer Science, Shandong Women’s Unversity, Jinan, Shandong, China
                [2 ] Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong Center for Disease Control and Prevention, Jinan, Shandong, China
                [3 ] Academy of Preventive Medicine, Shandong University, Jinan, Shandong, China
                Polytechnical Universidad de Madrid, SPAIN
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Author information
                http://orcid.org/0000-0002-5444-3822
                http://orcid.org/0000-0002-2560-1748
                Article
                PONE-D-20-15651
                10.1371/journal.pone.0237750
                7497991
                32941452
                f8256433-a39f-4be4-9484-8aa442162e75
                © 2020 Guo et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 25 May 2020
                : 1 August 2020
                Page count
                Figures: 8, Tables: 6, Pages: 12
                Funding
                Funded by: ZhiFei Disease Prevention and Control Technology Research Fund Project
                Award ID: LYH2017-08
                Award Recipient :
                Funded by: Open Research Fund of Shandong Provincial Key Laboratory Of Infectious Disease Control and Prevention, Shandong Center for Disease Control and Prevention
                Award ID: 2017KEYLAB01
                Award Recipient :
                Funded by: Shandong Medical Health Science and Technology Development Programs
                Award ID: 2018WS309
                Award Recipient :
                Funded by: Science and Technology Project for the Universities of Shandong Province
                Award ID: J18KB171
                Award Recipient :
                Funded by: Discipline Talent Team Cultivation Program of Shandong Women’s University
                Award ID: 1904
                Award Recipient :
                Funded by: Shandong Women’s University High level scientific research project Cultivation Fund
                Award ID: 2019GSPGJ07
                Award Recipient :
                This work was supported by ZhiFei Disease Prevention and Control Technology Research Fund Project (No. LYH2017-08) to YF, Open Research Fund of Shandong Provincial Key Laboratory Of Infectious Disease Control and Prevention, Shandong Center for Disease Control and Prevention (No. 2017KEYLAB01) to YG, Shandong Medical Health Science and Technology Development Programs (No. 2018WS309) to YF, Science and Technology Project for the Universities of Shandong Province (No. J18KB171) to YG, Discipline Talent Team Cultivation Program of Shandong Women’s University (No. 1904) to YG, and Shandong Women’s University High level scientific research project Cultivation Fund (No. 2019GSPGJ07) to YG.
                Categories
                Research Article
                Medicine and health sciences
                Medical conditions
                Infectious diseases
                Viral diseases
                Hepatitis
                Hepatitis E
                Medicine and Health Sciences
                Epidemiology
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Support Vector Machines
                Medicine and Health Sciences
                Epidemiology
                Infectious Disease Epidemiology
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Infectious Disease Epidemiology
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Research and Analysis Methods
                Mathematical and Statistical Techniques
                Statistical Methods
                Forecasting
                Physical Sciences
                Mathematics
                Statistics
                Statistical Methods
                Forecasting
                Physical Sciences
                Mathematics
                Operator Theory
                Kernel Functions
                Custom metadata
                According to Chinese laws and regulations, only government departments at or above the provincial level have the right to publish epidemic data. At present, due to the low conditions, Shandong Health Committee has only published parts of the data. The url of published data is http://wsjkw.shandong.gov.cn/. If readers want to reproduce our experiment, they can access our github ( https://github.com/guoyanhui03/dataset.git). All other relevant data are within the manuscript and Supporting Information files.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article