1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Development and validation of a machine learning-based framework for assessing metabolic-associated fatty liver disease risk

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          The existing predictive models for metabolic-associated fatty liver disease (MAFLD) possess certain limitations that render them unsuitable for extensive population-wide screening. This study is founded upon population health examination data and employs a comparison of eight distinct machine learning (ML) algorithms to construct the optimal screening model for identifying high-risk individuals with MAFLD in China.

          Methods

          We collected physical examination data from 5,171,392 adults residing in the northwestern region of China, during the year 2021. Feature selection was conducted through the utilization of the Least Absolute Shrinkage and Selection Operator (LASSO) regression. Additionally, class balancing parameters were incorporated into the models, accompanied by hyperparameter tuning, to effectively address the challenges posed by imbalanced datasets. This study encompassed the development of both tree-based ML models (including Classification and Regression Trees, Random Forest, Adaptive Boosting, Light Gradient Boosting Machine, Extreme Gradient Boosting, and Categorical Boosting) and alternative ML models (specifically, k-Nearest Neighbors and Artificial Neural Network) for the purpose of identifying individuals with MAFLD. Furthermore, we visualized the importance scores of each feature on the selected model.

          Results

          The average age (standard deviation) of the 5,171,392 participants was 51.12 (15.00) years, with 52.47% of the participants being females. MAFLD was diagnosed by specialized physicians. 20 variables were finally included for analyses after LASSO regression model. Following ten rounds of cross-validation and parameter optimization for each algorithm, the CatBoost algorithm exhibited the best performance, achieving an Area Under the Receiver Operating Characteristic Curve (AUC) of 0.862. The ranking of feature importance indicates that age, BMI, triglyceride, fasting plasma glucose, waist circumference, occupation, high density lipoprotein cholesterol, low density lipoprotein cholesterol, total cholesterol, systolic blood pressure, diastolic blood pressure, ethnicity and cardiovascular diseases are the top 13 crucial factors for MAFLD screening.

          Conclusion

          This study utilized a large-scale, multi-ethnic physical examination data from the northwestern region of China to establish a more accurate and effective MAFLD risk screening model, offering a new perspective for the prediction and prevention of MAFLD.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: not found
          • Article: not found

          Random Forests

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A new definition for metabolic associated fatty liver disease: an international expert consensus statement

            The exclusion of other chronic liver diseases including "excess" alcohol intake has until now been necessary to establish a diagnosis of metabolic dysfunction-associated fatty liver disease (MAFLD). However, given our current understanding of the pathogenesis of MAFLD and its rising prevalence, "positive criteria" to diagnose the disease are required. In this work, a panel of international experts from 22 countries propose a new definition for the diagnosis of MAFLD that is both comprehensive and simple, and is independent of other liver diseases. The criteria are based on evidence of hepatic steatosis, in addition to one of the following three criteria, namely overweight/obesity, presence of type 2 diabetes mellitus, or evidence of metabolic dysregulation. We propose that disease assessment and stratification of severity should extend beyond a simple dichotomous classification to steatohepatitis vs. non-steatohepatitis. The group also suggests a set of criteria to define MAFLD-associated cirrhosis and proposes a conceptual framework to consider other causes of fatty liver disease. Finally, we bring clarity to the distinction between diagnostic criteria and inclusion criteria for research studies and clinical trials. Reaching consensus on the criteria for MAFLD will help unify the terminology (e.g. for ICD-coding), enhance the legitimacy of clinical practice and clinical trials, improve clinical care and move the clinical and scientific field of liver research forward.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              MAFLD: A consensus-driven proposed nomenclature for metabolic associated fatty liver disease

              Fatty liver associated with metabolic dysfunction is common, affects a quarter of the population, and has no approved drug therapy. Although pharmacotherapies are in development, response rates appear modest. The heterogeneous pathogenesis of metabolic fatty liver diseases and inaccuracies in terminology and definitions necessitate a reappraisal of nomenclature to inform clinical trial design and drug development. A group of experts sought to integrate current understanding of patient heterogeneity captured under the acronym nonalcoholic fatty liver disease (NAFLD) and provide suggestions on terminology that more accurately reflects pathogenesis and can help in patient stratification for management. Experts reached consensus that NAFLD does not reflect current knowledge, and metabolic (dysfunction) associated fatty liver disease "MAFLD" was suggested as a more appropriate overarching term. This opens the door for efforts from the research community to update the nomenclature and subphenotype the disease to accelerate the translational path to new treatments.
                Bookmark

                Author and article information

                Contributors
                34160869@qq.com
                zhouyi@mail.sysu.edu.cn
                Journal
                BMC Public Health
                BMC Public Health
                BMC Public Health
                BioMed Central (London )
                1471-2458
                18 September 2024
                18 September 2024
                2024
                : 24
                : 2545
                Affiliations
                [1 ]Zhongshan School of Medicine, Sun Yat-sen University, ( https://ror.org/0064kty71) 74 Zhongshan 2nd Road, Yuexiu District, Guangzhou, 510080 Guangdong China
                [2 ]People’s Hospital of Xinjiang Uygur Autonomous Region, ( https://ror.org/02r247g67) 91 Tianchi Road, Urumqi, 830054 Xinjiang China
                [3 ]School of Computer Science, China University of Geosciences, ( https://ror.org/04gcegc37) Wuhan, Beihe, 430074 China
                Article
                19882
                10.1186/s12889-024-19882-z
                11412026
                39294603
                3c991097-0842-45b6-b092-66a199ac89be
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

                History
                : 26 October 2023
                : 26 August 2024
                Funding
                Funded by: the Province Natural Science Foundation of Xinjiang, China
                Award ID: 2016D01C330
                Funded by: the Key Research and Development Program of China
                Award ID: 2022YFC3601600
                Funded by: the National Natural Science Foundation of China (NSFC)
                Award ID: 61876194
                Funded by: the Province Natural Science Foundation of Guangdong
                Award ID: 2021A1515011897
                Funded by: the Science and Technology Innovation Special Project of Guangdong Province, China
                Award ID: 202011020004
                Categories
                Research
                Custom metadata
                © BioMed Central Ltd., part of Springer Nature 2024

                Public health
                metabolic-associated fatty liver disease,machine learning,screening model,prediction model

                Comments

                Comment on this article