1
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      SEMdag: Fast learning of Directed Acyclic Graphs via node or layer ordering

      research-article
      , * ,
      PLOS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A Directed Acyclic Graph (DAG) offers an easy approach to define causal structures among gathered nodes: causal linkages are represented by arrows between the variables, leading from cause to effect. Recently, industry and academics have paid close attention to DAG structure learning from observable data, and many techniques have been put out to address the problem. We provide a two-step approach, named SEMdag(), that can be used to quickly learn high-dimensional linear SEMs. It is included in the R package SEMgraph and employs a two-stage order-based search using previous knowledge (Knowledge-based, KB) or data-driven method (Bottom-up, BU), under the premise that a linear SEM with equal variance error terms is assumed. We evaluated our framework’s for finding plausible DAGs against six well-known causal discovery techniques (ARGES, GES, PC, LiNGAM, CAM, NOTEARS). We conducted a series of experiments using observed expression (or RNA-seq) data, taking into account a pair of training and testing datasets for four distinct diseases: Amyotrophic Lateral Sclerosis (ALS), Breast cancer (BRCA), Coronavirus disease (COVID-19) and ST-elevation myocardial infarction (STEMI). The results show that the SEMdag() procedure can recover a graph structure with good disease prediction performance evaluated by a conventional supervised learning algorithm (RF): in the scenario where the initial graph is sparse, the BU approach may be a better choice than the KB one; in the case where the graph is denser, both BU an KB report high performance, with highest score for KB approach based on topological layers. Besides its superior disease predictive performance compared to previous research, SEMdag() offers the user the flexibility to define distinct structure learning algorithms and can handle high dimensional issues with less computing load. SEMdag() function is implemented in the R package SEMgraph, easily available at https://CRAN.R-project.org/package=SEMgraph.

          Related collections

          Most cited references53

          • Record: found
          • Abstract: not found
          • Article: not found

          Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Estimating the Dimension of a Model

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Sparse inverse covariance estimation with the graphical lasso.

              We consider the problem of estimating sparse graphs by a lasso penalty applied to the inverse covariance matrix. Using a coordinate descent procedure for the lasso, we develop a simple algorithm--the graphical lasso--that is remarkably fast: It solves a 1000-node problem ( approximately 500,000 parameters) in at most a minute and is 30-4000 times faster than competing methods. It also provides a conceptual link between the exact problem and the approximation suggested by Meinshausen and Bühlmann (2006). We illustrate the method on some cell-signaling data from proteomics.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: MethodologyRole: SoftwareRole: SupervisionRole: ValidationRole: Writing – original draft
                Role: Data curationRole: Formal analysisRole: InvestigationRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS One
                plos
                PLOS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2025
                8 January 2025
                : 20
                : 1
                : e0317283
                Affiliations
                [001] Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy
                Chinese Academy of Sciences, CHINA
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Author information
                https://orcid.org/0000-0001-6384-2561
                Article
                PONE-D-24-15014
                10.1371/journal.pone.0317283
                11709272
                39775401
                23e30c14-28c4-4c2c-9fa0-066c5e5f8764
                © 2025 Grassi, Tarantino

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 14 April 2024
                : 25 December 2024
                Page count
                Figures: 5, Tables: 4, Pages: 24
                Funding
                The author(s) received no specific funding for this work.
                Categories
                Research Article
                Computer and Information Sciences
                Information Theory
                Graph Theory
                Directed Graphs
                Directed Acyclic Graphs
                Physical Sciences
                Mathematics
                Graph Theory
                Directed Graphs
                Directed Acyclic Graphs
                Physical Sciences
                Mathematics
                Applied Mathematics
                Algorithms
                Research and Analysis Methods
                Simulation and Modeling
                Algorithms
                Medicine and Health Sciences
                Medical Conditions
                Infectious Diseases
                Viral Diseases
                Covid 19
                Medicine and Health Sciences
                Diagnostic Medicine
                Virus Testing
                Medicine and Health Sciences
                Medical Conditions
                Neurodegenerative Diseases
                Motor Neuron Diseases
                Amyotrophic Lateral Sclerosis
                Medicine and Health Sciences
                Neurology
                Neurodegenerative Diseases
                Motor Neuron Diseases
                Amyotrophic Lateral Sclerosis
                Medicine and Health Sciences
                Oncology
                Cancers and Neoplasms
                Breast Tumors
                Breast Cancer
                Physical Sciences
                Mathematics
                Statistics
                Statistical Noise
                Medicine and Health Sciences
                Epidemiology
                Medical Risk Factors
                Cancer Risk Factors
                Genetic Causes of Cancer
                Medicine and Health Sciences
                Oncology
                Cancer Risk Factors
                Genetic Causes of Cancer
                Custom metadata
                Code to reproduce all results of the analysis, together with the data used in this study can be found in the supplementary files available at: https://github.com/fernandoPalluzzi/SEMgraph/tree/master/SEMdag.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content141

                Most referenced authors1,402