1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Employing advanced supervised machine learning approaches for predicting micronutrient intake status among children aged 6–23 months in Ethiopia

      research-article

      Read this article at

          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Although micronutrients (MNs) are important for children’s growth and development, their intake has not received enough attention. MN deficiency is a significant public health problem, especially in developing countries like Ethiopia. However, there is a lack of empirical evidence using advanced statistical methods, such as machine learning. Therefore, this study aimed to use advanced supervised algorithms to predict the micronutrient intake status in Ethiopian children aged 6–23 months.

          Methods

          A total weighted of 2,499 children aged 6–23 months from the Ethiopia Demographic and Health Survey 2016 data set were utilized. The data underwent preprocessing, with 80% of the observations used for training and 20% for testing the model. Twelve machine learning algorithms were employed. To select best predictive model, their performance was assessed using different evaluation metrics in Python software. The Boruta algorithm was used to select the most relevant features. Besides, seven data balancing techniques and three hyper parameter tuning methods were employed. To determine the association between independent and targeted feature, association rule mining was conducted using the a priori algorithm in R software.

          Results

          According to the 2016 Ethiopia Demographic and Health Survey, out of 2,499 weighted children aged 12–23 months, 1,728 (69.15%) had MN intake. The random forest, catboost, and light gradient boosting algorithm outperformed in predicting MN intake status among all selected classifiers. Region, wealth index, place of delivery, mothers’ occupation, child age, fathers’ educational status, desire for more children, access to media exposure, religion, residence, and antenatal care (ANC) follow-up were the top attributes to predict MN intake. Association rule mining was identified the top seven best rules that most frequently associated with MN intake among children aged 6–23 months in Ethiopia.

          Conclusion

          The random forest, catboost, and light gradient boosting algorithm achieved a highest performance and identifying the relevant predictors of MN intake. Therefore, policymakers and healthcare providers can develop targeted interventions to enhance the uptake of micronutrient supplementation among children. Customizing strategies based on identified association rules has the potential to improve child health outcomes and decrease the impact of micronutrient deficiencies in Ethiopia.

          Related collections

          Most cited references50

          • Record: found
          • Abstract: found
          • Article: found

          Maternal and child undernutrition and overweight in low-income and middle-income countries

          The Lancet, 382(9890), 427-451
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View

            Background As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. Objective To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. Methods A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. Results The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. Conclusions A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning

              Model validation is the most important part of building a supervised model. For building a model with good generalization performance one must have a sensible data splitting strategy, and this is crucial for model validation. In this study, we conducted a comparative study on various reported data splitting methods. The MixSim model was employed to generate nine simulated datasets with different probabilities of mis-classification and variable sample sizes. Then partial least squares for discriminant analysis and support vector machines for classification were applied to these datasets. Data splitting methods tested included variants of cross-validation, bootstrapping, bootstrapped Latin partition, Kennard-Stone algorithm (K-S) and sample set partitioning based on joint X–Y distances algorithm (SPXY). These methods were employed to split the data into training and validation sets. The estimated generalization performances from the validation sets were then compared with the ones obtained from the blind test sets which were generated from the same distribution but were unseen by the training/validation procedure used in model construction. The results showed that the size of the data is the deciding factor for the qualities of the generalization performance estimated from the validation set. We found that there was a significant gap between the performance estimated from the validation set and the one from the test set for the all the data splitting methods employed on small datasets. Such disparity decreased when more samples were available for training/validation, and this is because the models were then moving towards approximations of the central limit theory for the simulated datasets used. We also found that having too many or too few samples in the training set had a negative effect on the estimated model performance, suggesting that it is necessary to have a good balance between the sizes of training set and validation set to have a reliable estimation of model performance. We also found that systematic sampling method such as K-S and SPXY generally had very poor estimation of the model performance, most likely due to the fact that they are designed to take the most representative samples first and thus left a rather poorly representative sample set for model performance estimation. Electronic supplementary material The online version of this article (10.1007/s41664-018-0068-2) contains supplementary material, which is available to authorized users.
                Bookmark

                Author and article information

                Contributors
                URI : https://loop.frontiersin.org/people/2173743/overviewRole: Role: Role: Role: Role: Role: Role: Role: Role: Role:
                URI : https://loop.frontiersin.org/people/2618422/overviewRole: Role:
                Role: Role:
                URI : https://loop.frontiersin.org/people/1947925/overviewRole: Role:
                URI : https://loop.frontiersin.org/people/1769542/overviewRole: Role:
                URI : https://loop.frontiersin.org/people/1402541/overviewRole: Role:
                URI : https://loop.frontiersin.org/people/2334914/overviewRole: Role:
                URI : https://loop.frontiersin.org/people/2414786/overviewRole: Role: Role:
                Journal
                Front Nutr
                Front Nutr
                Front. Nutr.
                Frontiers in Nutrition
                Frontiers Media S.A.
                2296-861X
                11 June 2024
                2024
                : 11
                : 1397399
                Affiliations
                [1] 1Department of Pediatrics and Child Health Nursing, School of Nursing, College of Medicine and Health Science, Woldia University , Woldia, Ethiopia
                [2] 2Department of Pediatrics and Child Health Nursing, School of Nursing, College of Medicine and Health Science, Ambo University , Ambo, Ethiopia
                [3] 3Department of Emergency and Critical Care Nursing, School of Nursing, College of Medicine and Health Science, Woldia University , Woldia, Ethiopia
                [4] 4Department of Health Informatics, School of Public Health, College of Medicine and Health Science, Woldia University , Woldia, Ethiopia
                Author notes

                Edited by: Hettie Carina Schönfeldt, University of Pretoria, South Africa

                Reviewed by: Eskezeia Y. Dessie, Cincinnati Children's Hospital Medical Center, United States

                Xie Junhao, University College Dublin, Ireland

                *Correspondence: Alemu Birara Zemariam, alexb7298@ 123456gmail.com
                Article
                10.3389/fnut.2024.1397399
                11198118
                38919392
                f009b49b-4539-41cd-87ec-d35d65767a44
                Copyright © 2024 Zemariam, Adisu, Habesse, Abate, Bizuayehu, Wondie, Alamaw and Ngusie.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 07 March 2024
                : 22 May 2024
                Page count
                Figures: 5, Tables: 3, Equations: 0, References: 53, Pages: 13, Words: 8320
                Funding
                The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
                Categories
                Nutrition
                Original Research
                Custom metadata
                Nutrition and Food Science Technology

                micronutrient supplementation,children,machine learning algorithm,prediction,ethiopia

                Comments

                Comment on this article