5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Identification of endoplasmic reticulum stress-associated genes and subtypes for prediction of Alzheimer’s disease based on interpretable machine learning

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Introduction: Alzheimer’s disease (AD) is a severe dementia with clinical and pathological heterogeneity. Our study was aim to explore the roles of endoplasmic reticulum (ER) stress-related genes in AD patients based on interpretable machine learning.

          Methods: Microarray datasets were obtained from the Gene Expression Omnibus (GEO) database. We performed nine machine learning algorithms including AdaBoost, Logistic Regression, Light Gradient Boosting (LightGBM), Decision Tree (DT), eXtreme Gradient Boosting (XGBoost), Random Forest, K-nearest neighbors (KNN), Naïve Bayes, and support vector machines (SVM) to screen ER stress-related feature genes and estimate their efficiency of these genes for early diagnosis of AD. ROC curves were performed to evaluate model performance. Shapley additive explanation (SHAP) was applied for interpreting the results of these models. AD patients were classified using a consensus clustering algorithm. Immune infiltration and functional enrichment analysis were performed via CIBERSORT and GSVA, respectively. CMap analysis was utilized to identify subtype-specific small-molecule compounds.

          Results: Higher levels of immune infiltration were found in AD individuals and were markedly linked to deregulated ER stress-related genes. The SVM model exhibited the highest AUC (0.879), accuracy (0.808), recall (0.773), and precision (0.809). Six characteristic genes (RNF5, UBAC2, DNAJC10, RNF103, DDX3X, and NGLY1) were determined, which enable to precisely predict AD progression. The SHAP plots illustrated how a feature gene influence the output of the SVM prediction model. Patients with AD could obtain clinical benefits from the feature gene-based nomogram. Two ER stress-related subtypes were defined in AD, subtype2 exhibited elevated immune infiltration levels and immune score, as well as higher expression of immune checkpoint. We finally identified several subtype-specific small-molecule compounds.

          Conclusion: Our study provides new insights into the role of ER stress in AD heterogeneity and the development of novel targets for individualized treatment in patients with AD.

          Related collections

          Most cited references64

          • Record: found
          • Abstract: found
          • Article: not found

          clusterProfiler: an R package for comparing biological themes among gene clusters.

          Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            limma powers differential expression analyses for RNA-sequencing and microarray studies

            limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              GSVA: gene set variation analysis for microarray and RNA-Seq data

              Background Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets. Results To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments. Conclusions GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Pharmacol
                Front Pharmacol
                Front. Pharmacol.
                Frontiers in Pharmacology
                Frontiers Media S.A.
                1663-9812
                19 August 2022
                2022
                : 13
                : 975774
                Affiliations
                [1] 1 Department of Geriatric Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital , Fuzhou, China
                [2] 2 Fujian Provincial Center for Geriatrics, Fujian Provincial Hospital , Fuzhou, China
                [3] 3 Department of Gastroenterology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital , Fuzhou, China
                [4] 4 Department of Rheumatology and Immunology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital , Fuzhou, China
                [5] 5 Department of Nephrology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital , Fuzhou, China
                Author notes

                Edited by: Ana Rita Vaz, University of Lisbon, Portugal

                Reviewed by: Chuipu Cai, Shantou University, China

                Yuqing Yan, West Virginia University, United States.

                *Correspondence: Li Zhang, lilydoctor9902@ 123456163.com ; Zhihan Chen, han213@ 123456163.com
                [ † ]

                These authors have contributed equally to this work

                This article was submitted to Neuropharmacology, a section of the journal Frontiers in Pharmacology

                Article
                975774
                10.3389/fphar.2022.975774
                9438901
                36059957
                8fe8eb25-9b9e-4c11-8303-6081657cd269
                Copyright © 2022 Lai, Lin, Lin, Lin, Chen and Zhang.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 22 June 2022
                : 19 July 2022
                Funding
                Funded by: Natural Science Foundation of Fujian Province , doi 10.13039/501100003392;
                Funded by: Fujian Provincial Health Commission , doi 10.13039/501100014125;
                Categories
                Pharmacology
                Original Research

                Pharmacology & Pharmaceutical medicine
                alzheimer’s disease,er stress,machine learning,molecular subtypes,prediction model

                Comments

                Comment on this article