4
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Deep representation learning of chemical-induced transcriptional profile for phenotype-based drug discovery

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Artificial intelligence transforms drug discovery, with phenotype-based approaches emerging as a promising alternative to target-based methods, overcoming limitations like lack of well-defined targets. While chemical-induced transcriptional profiles offer a comprehensive view of drug mechanisms, inherent noise often obscures the true signal, hindering their potential for meaningful insights. Here, we highlight the development of TranSiGen, a deep generative model employing self-supervised representation learning. TranSiGen analyzes basal cell gene expression and molecular structures to reconstruct chemical-induced transcriptional profiles with high accuracy. By capturing both cellular and compound information, TranSiGen-derived representations demonstrate efficacy in diverse downstream tasks like ligand-based virtual screening, drug response prediction, and phenotype-based drug repurposing. Notably, in vitro validation of TranSiGen’s application in pancreatic cancer drug discovery highlights its potential for identifying effective compounds. We envisage that integrating TranSiGen into the drug discovery and mechanism research holds significant promise for advancing biomedicine.

          Abstract

          While chemical-induced transcriptional profiles reveal drug mechanisms, inherent noise limits their utility. Here, authors present TranSiGen, a deep representation learning model that denoises and reconstructs these profiles, demonstrating its efficacy in downstream drug discovery tasks.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: not found

          Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

          Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Differential expression analysis for sequence count data

            High-throughput sequencing assays such as RNA-Seq, ChIP-Seq or barcode counting provide quantitative readouts in the form of count data. To infer differential signal in such data correctly and with good statistical power, estimation of data variability throughout the dynamic range and a suitable error model are required. We propose a method based on the negative binomial distribution, with variance and mean linked by local regression and present an implementation, DESeq, as an R/Bioconductor package.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found
              Is Open Access

              The Cancer Genome Atlas Pan-Cancer analysis project.

              The Cancer Genome Atlas (TCGA) Research Network has profiled and analyzed large numbers of human tumors to discover molecular aberrations at the DNA, RNA, protein and epigenetic levels. The resulting rich data provide a major opportunity to develop an integrated picture of commonalities, differences and emergent themes across tumor lineages. The Pan-Cancer initiative compares the first 12 tumor types profiled by TCGA. Analysis of the molecular aberrations and their functional roles across tumor types will teach us how to extend therapies effective in one cancer type to others with a similar genomic profile.
                Bookmark

                Author and article information

                Contributors
                slzhang@simm.ac.cn
                lixutong@simm.ac.cn
                myzheng@simm.ac.cn
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                25 June 2024
                25 June 2024
                2024
                : 15
                : 5378
                Affiliations
                [1 ]GRID grid.9227.e, ISNI 0000000119573309, Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, , Chinese Academy of Sciences, ; 555 Zuchongzhi Road, Shanghai, 201203 China
                [2 ]University of Chinese Academy of Sciences, ( https://ror.org/05qbk4x57) No. 19A Yuquan Road, Beijing, 100049 China
                [3 ]School of Physical Science and Technology, ShanghaiTech University, ( https://ror.org/030bhh786) Shanghai, 201210 China
                [4 ]Lingang Laboratory, Shanghai, 200031 China
                [5 ]School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, ( https://ror.org/04c4dkn09) Hefei, 230026 China
                [6 ]School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, ( https://ror.org/05qbk4x57) Hangzhou, 310024 China
                Author information
                http://orcid.org/0000-0002-9167-4689
                http://orcid.org/0000-0001-9547-0643
                http://orcid.org/0000-0002-3323-3092
                Article
                49620
                10.1038/s41467-024-49620-3
                11199551
                38918369
                e3fc1fd7-4c56-4df6-9c0b-cbd1425b48ae
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 10 November 2023
                : 10 June 2024
                Funding
                Funded by: FundRef https://doi.org/10.13039/501100001809, National Natural Science Foundation of China (National Science Foundation of China);
                Award ID: T2225002
                Award ID: 82273855
                Award ID: 82204278
                Award Recipient :
                Funded by: SIMM-SHUTCM Traditional Chinese Medicine Innovation Joint Research Program (E2G805H) Shanghai Municipal Science and Technology Major Project National Key Research and Development Program of China (2023YFC2305904) National Key Research and Development Program of China (2022YFC3400504) The Large-scale Protein Preparation System at the National Facility for Protein Science in Shanghai (NFPS), Shanghai Advanced Research Institute, Chinese Academy of Science, China
                Funded by: The Youth Innovation Promotion Association CAS (2023296)
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2024

                Uncategorized
                computational models,virtual screening,virtual drug screening,machine learning,drug development

                Comments

                Comment on this article