9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The research of ARIMA, GM(1,1), and LSTM models for prediction of TB cases in China

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background and objective

          Tuberculosis (Tuberculosis, TB) is a public health problem in China, which not only endangers the population’s health but also affects economic and social development. It requires an accurate prediction analysis to help to make policymakers with early warning and provide effective precautionary measures. In this study, ARIMA, GM(1,1), and LSTM models were constructed and compared, respectively. The results showed that the LSTM was the optimal model, which can be achieved satisfactory performance for TB cases predictions in mainland China.

          Methods

          The data of tuberculosis cases in mainland China were extracted from the National Health Commission of the People’s Republic of China website. According to the TB data characteristics and the sample requirements, we created the ARIMA, GM(1,1), and LSTM models, which can make predictions for the prevalence trend of TB. The mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) were applied to evaluate the effects of model fitting predicting accuracy.

          Results

          There were 3,021,995 tuberculosis cases in mainland China from January 2018 to December 2020. And the overall TB cases in mainland China take on a downtrend trend. We established ARIMA, GM(1,1), and LSTM models, respectively. The optimal ARIMA model is the ARIMA (0,1,0) × (0,1,0)12. The equation for GM(1,1) model was X(k+1) = -10057053.55e (-0.01k) + 10153178.55 the Mean square deviation ratio C value was 0.49, and the Small probability of error P was 0.94. LSTM model consists of an input layer, a hidden layer and an output layer, the parameters of epochs, learning rating are 60, 0.01, respectively. The MAE, RMSE, and MAPE values of LSTM model were smaller than that of GM(1,1) and ARIMA models.

          Conclusions

          Our findings showed that the LSTM model was the optimal model, which has a higher accuracy performance than that of ARIMA and GM (1,1) models. Its prediction results can act as a predictive tool for TB prevention measures in mainland China.

          Related collections

          Most cited references55

          • Record: found
          • Abstract: found
          • Article: not found

          Applications of Support Vector Machine (SVM) Learning in Cancer Genomics

          (2018)
          Machine learning with maximization (support) of separating margin (vector), called support vector machine (SVM) learning, is a powerful classification tool that has been used for cancer genomic classification or subtyping. Today, as advancements in high-throughput technologies lead to production of large amounts of genomic and epigenomic data, the classification feature of SVMs is expanding its use in cancer genomics, leading to the discovery of new biomarkers, new drug targets, and a better understanding of cancer driver genes. Herein we reviewed the recent progress of SVMs in cancer genomic studies. We intend to comprehend the strength of the SVM learning and its future perspective in cancer genomic applications.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Estimation of COVID-19 prevalence in Italy, Spain, and France

            At the end of December 2019, coronavirus disease 2019 (COVID-19) appeared in Wuhan city, China. As of April 15, 2020, >1.9 million COVID-19 cases were confirmed worldwide, including >120,000 deaths. There is an urgent need to monitor and predict COVID-19 prevalence to control this spread more effectively. Time series models are significant in predicting the impact of the COVID-19 outbreak and taking the necessary measures to respond to this crisis. In this study, Auto-Regressive Integrated Moving Average (ARIMA) models were developed to predict the epidemiological trend of COVID-19 prevalence of Italy, Spain, and France, the most affected countries of Europe. The daily prevalence data of COVID-19 from 21 February 2020 to 15 April 2020 were collected from the WHO website. Several ARIMA models were formulated with different ARIMA parameters. ARIMA (0,2,1), ARIMA (1,2,0), and ARIMA (0,2,1) models with the lowest MAPE values (4.7520, 5.8486, and 5.6335) were selected as the best models for Italy, Spain, and France, respectively. This study shows that ARIMA models are suitable for predicting the prevalence of COVID-19 in the future. The results of the analysis can shed light on understanding the trends of the outbreak and give an idea of the epidemiological stage of these regions. Besides, the prediction of COVID-19 prevalence trends of Italy, Spain, and France can help take precautions and policy formulation for this epidemic in other countries.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Application of the ARIMA model on the COVID-2019 epidemic dataset

              Coronavirus disease 2019 (COVID-2019) has been recognized as a global threat, and several studies are being conducted using various mathematical models to predict the probable evolution of this epidemic. These mathematical models based on various factors and analyses are subject to potential bias. Here, we propose a simple econometric model that could be useful to predict the spread of COVID-2019. We performed Auto Regressive Integrated Moving Average (ARIMA) model prediction on the Johns Hopkins epidemiological data to predict the epidemiological trend of the prevalence and incidence of COVID-2019. For further comparison or for future perspective, case definition and data collection have to be maintained in real time.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Writing – original draftRole: Writing – review & editing
                Role: Formal analysisRole: Writing – original draft
                Role: Formal analysisRole: Writing – original draft
                Role: Writing – original draft
                Role: Writing – original draft
                Role: ConceptualizationRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS One
                plos
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                23 February 2022
                2022
                : 17
                : 2
                : e0262734
                Affiliations
                [1 ] Department of Medical Administration, Sichuan Provincial Orthopedics Hospital, Chengdu, Sichuan, P.R. China
                [2 ] Department of Medical Administration, Sichuan Academy of Medical Sciences & Sichuan Provincial People’s Hospital, Chengdu, Sichuan, P.R. China
                [3 ] Department of Medical Administration, Sichuan Cancer Hospital & Institute, Chengdu, Sichuan, P.R. China
                [4 ] Department of Information and Statistics, The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, P.R. China
                [5 ] Department of Medical Administration, Luzhou People’s Hospital, Luzhou, Sichuan, P.R. China
                [6 ] School of Management, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan, P.R. China
                Instituto Nacional de Astrofisica Optica y Electronica, MEXICO
                Author notes

                Competing Interests: The authors declare no conflicts of interest.

                Author information
                https://orcid.org/0000-0003-1471-009X
                Article
                PONE-D-21-24787
                10.1371/journal.pone.0262734
                8865644
                35196309
                8626836f-d3fb-42a9-a670-c919200f903f
                © 2022 Zhao et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 30 July 2021
                : 4 January 2022
                Page count
                Figures: 7, Tables: 6, Pages: 18
                Funding
                Funded by: National Health Commission of the People’s Republic of China
                Award ID: YLZLXZ-2021-005
                Award Recipient :
                We state that the study and the paper were financially supported by the Hospital Management Institute, the National Health Commission of the People’s Republic of China [grant no. YLZLXZ-2021-005]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Bacterial Diseases
                Tuberculosis
                Medicine and Health Sciences
                Medical Conditions
                Tropical Diseases
                Tuberculosis
                People and Places
                Geographical Locations
                Asia
                China
                Research and Analysis Methods
                Mathematical and Statistical Techniques
                Statistical Methods
                Forecasting
                Physical Sciences
                Mathematics
                Statistics
                Statistical Methods
                Forecasting
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Medicine and Health Sciences
                Public and Occupational Health
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Viral Diseases
                Covid 19
                Engineering and Technology
                Signal Processing
                Autocorrelation
                Research and Analysis Methods
                Mathematical and Statistical Techniques
                Statistical Methods
                Autocorrelation
                Physical Sciences
                Mathematics
                Statistics
                Statistical Methods
                Autocorrelation
                Biology and Life Sciences
                Organisms
                Bacteria
                Actinobacteria
                Mycobacterium Tuberculosis
                Custom metadata
                All data TB cases were taken from National Health Commission of the People’s Republic of China ( http://www.nhc.gov.cn). Anyone meeting the requirements can gain access to them. The data were relatively uninvolved in detailed patient personal information. The authors confirm they did not have any special access privileges that other would not have.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article