0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Causal models and prediction in cell line perturbation experiments

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In cell line perturbation experiments, a collection of cells is perturbed with external agents and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computational models that can predict cellular responses to perturbations in silico. A central challenge for these models is to predict the effect of new, previously untested perturbations that were not used in the training data. Here we propose causal structural equations for modeling how perturbations effect cells. From this model, we derive two estimators for predicting responses: a Linear Regression (LR) estimator and a causal structure learning estimator that we term Causal Structure Regression (CSR). The CSR estimator requires more assumptions than LR, but can predict the effects of drugs that were not applied in the training data. Next we present Cellbox, a recently proposed system of ordinary differential equations (ODEs) based model that obtained the best prediction performance on a Melanoma cell line perturbation data set (Yuan et al. in Cell Syst 12:128–140, 2021). We derive analytic results that show a close connection between CSR and Cellbox, providing a new causal interpretation for the Cellbox model. We compare LR and CSR/Cellbox in simulations, highlighting the strengths and weaknesses of the two approaches. Finally we compare the performance of LR and CSR/Cellbox on the benchmark Melanoma data set. We find that the LR model has comparable or slightly better performance than Cellbox.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: found
          • Article: not found

          A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles

          We previously piloted the concept of a Connectivity Map (CMap), whereby genes, drugs, and disease states are connected by virtue of common gene-expression signatures. Here, we report more than a 1,000-fold scale-up of the CMap as part of the NIH LINCS Consortium, made possible by a new, low-cost, high-throughput reduced representation expression profiling method that we term L1000. We show that L1000 is highly reproducible, comparable to RNA sequencing, and suitable for computational inference of the expression levels of 81% of non-measured transcripts. We further show that the expanded CMap can be used to discover mechanism of action of small molecules, functionally annotate genetic variants of disease genes, and inform clinical trials. The 1.3 million L1000 profiles described here, as well as tools for their analysis, are available at https://clue.io.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Causal inference in statistics: An overview

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Reverse phase protein array: validation of a novel proteomic technology and utility for analysis of primary leukemia specimens and hematopoietic stem cells.

              Proteomics has the potential to provide answers in cancer pathogenesis and to direct targeted therapy through the comprehensive analysis of protein expression levels and activation status. The realization of this potential requires the development of new, rapid, high-throughput technologies for performing protein arrays on patient samples, as well as novel analytic techniques to interpret them. Herein, we describe the validation and robustness of using reverse phase protein arrays (RPPA) for the analysis of primary acute myelogenous leukemia samples as well as leukemic and normal stem cells. In this report, we show that array printing, detection, amplification, and staining precision are very high, reproducible, and that they correlate with traditional Western blotting. Using replicates of the same sample on the same and/or separate arrays, or using separate protein samples prepared from the same starting sample, the intra- and interarray reproducibility was extremely high. No statistically significant difference in protein signal intensities could be detected within the array setups. The activation status (phosphorylation) was maintained in experiments testing delayed processing and preparation from multiple freeze-thawed samples. Differences in protein expression could reliably be detected in as few as three cell protein equivalents. RPPA prepared from rare populations of normal and leukemic stem cells were successfully done and showed differences from bulk populations of cells. Examples show how RPPAs are ideally suited for the large-scale analysis of target identification, validation, and drug discovery. In summary, RPPA is a highly reliable, reproducible, high-throughput system that allows for the rapid large-scale proteomic analysis of protein expression and phosphorylation state in primary acute myelogenous leukemia cells, cell lines, and in human stem cells.
                Bookmark

                Author and article information

                Contributors
                jplong@mdanderson.org
                kimdo@mdanderson.org
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                7 January 2025
                7 January 2025
                2025
                : 26
                : 4
                Affiliations
                [1 ]Department of Biostatistics, The University of Texas MD Anderson Cancer Center, ( https://ror.org/04twxam07) Houston, TX USA
                [2 ]Biomedical Informatics, The University of Texas Health Science Center at Houston, ( https://ror.org/03gds6c39) Houston, TX USA
                [3 ]Faculty of Data Science, Shiga University, ( https://ror.org/01vvhy971) Hikone, Shiga Japan
                Article
                6027
                10.1186/s12859-024-06027-7
                11707890
                39773352
                a8e8f0b1-0d11-4879-895e-b7cfcd266a7d
                © The Author(s) 2025

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 20 February 2024
                : 27 December 2024
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000002, National Institutes of Health;
                Award ID: P30CA016672
                Award ID: P30CA016672
                Award Recipient :
                Categories
                Research
                Custom metadata
                © BioMed Central Ltd., part of Springer Nature 2025

                Bioinformatics & Computational biology
                causal inference,prediction,perturbation biology,systems biology

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content281

                Most referenced authors219