1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Joint dimension reduction and clustering analysis of single-cell RNA-seq and spatial transcriptomics data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Dimension reduction and (spatial) clustering is usually performed sequentially; however, the low-dimensional embeddings estimated in the dimension-reduction step may not be relevant to the class labels inferred in the clustering step. We therefore developed a computation method, Dimension-Reduction Spatial-Clustering (DR-SC), that can simultaneously perform dimension reduction and (spatial) clustering within a unified framework. Joint analysis by DR-SC produces accurate (spatial) clustering results and ensures the effective extraction of biologically informative low-dimensional features. DR-SC is applicable to spatial clustering in spatial transcriptomics that characterizes the spatial organization of the tissue by segregating it into multiple tissue structures. Here, DR-SC relies on a latent hidden Markov random field model to encourage the spatial smoothness of the detected spatial cluster boundaries. Underlying DR-SC is an efficient expectation-maximization algorithm based on an iterative conditional mode. As such, DR-SC is scalable to large sample sizes and can optimize the spatial smoothness parameter in a data-driven manner. With comprehensive simulations and real data applications, we show that DR-SC outperforms existing clustering and spatial clustering methods: it extracts more biologically relevant features than conventional dimension reduction methods, improves clustering performance, and offers improved trajectory inference and visualization for downstream trajectory inference analyses.

          Related collections

          Most cited references72

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Integrated analysis of multimodal single-cell data

          Summary The simultaneous measurement of multiple modalities represents an exciting frontier for single-cell genomics and necessitates computational methods that can define cellular states based on multimodal data. Here, we introduce “weighted-nearest neighbor” analysis, an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities. We apply our procedure to a CITE-seq dataset of 211,000 human peripheral blood mononuclear cells (PBMCs) with panels extending to 228 antibodies to construct a multimodal reference atlas of the circulating immune system. Multimodal analysis substantially improves our ability to resolve cell states, allowing us to identify and validate previously unreported lymphoid subpopulations. Moreover, we demonstrate how to leverage this reference to rapidly map new datasets and to interpret immune responses to vaccination and coronavirus disease 2019 (COVID-19). Our approach represents a broadly applicable strategy to analyze single-cell multimodal datasets and to look beyond the transcriptome toward a unified and multimodal definition of cellular identity.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Integrating single-cell transcriptomic data across different conditions, technologies, and species

            Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Fast unfolding of communities in large networks

              Journal of Statistical Mechanics: Theory and Experiment, 2008(10), P10008
                Bookmark

                Author and article information

                Contributors
                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                08 July 2022
                29 March 2022
                29 March 2022
                : 50
                : 12
                : e72
                Affiliations
                Academy of Statistics and Interdisciplinary Sciences, East China Normal University , Shanghai, 200062, China
                Centre for Quantitative Medicine, Health Services & Systems Research , Duke-NUS Medical School, 169857, Singapore
                Centre for Quantitative Medicine, Health Services & Systems Research , Duke-NUS Medical School, 169857, Singapore
                Centre for Quantitative Medicine, Health Services & Systems Research , Duke-NUS Medical School, 169857, Singapore
                Center of Statistical Research and School of Statistics, Southwestern University of Finance and Economics , Chengdu, 611130, China
                Institute of Molecular and Cell Biology(IMCB), Agency of Science , Technology and Research(A*STAR), 138673, Singapore
                Department of Anatomical Pathology , Singapore General Hospital, 169856, Singapore
                Department of Biostatistics, University of Michigan , Ann Arbor, 48109, USA
                Academy of Statistics and Interdisciplinary Sciences, East China Normal University , Shanghai, 200062, China
                Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, School of Statistics, East China Normal University , Shanghai, 200062, China
                Centre for Quantitative Medicine, Health Services & Systems Research , Duke-NUS Medical School, 169857, Singapore
                Author notes
                To whom correspondence should be addressed. Email: jin.liu@ 123456duke-nus.edu.sg
                Correspondence may also be addressed to Xingjie Shi. Email: xjshi@ 123456fem.ecnu.edu.cn
                Correspondence may also be addressed to Xiang Zhou. Email: xzhousph@ 123456umich.edu
                Author information
                https://orcid.org/0000-0002-1259-6564
                https://orcid.org/0000-0002-4331-7599
                https://orcid.org/0000-0002-5707-2078
                Article
                gkac219
                10.1093/nar/gkac219
                9262606
                35349708
                00df640f-d51b-4d3c-add6-845aae4086cd
                © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 22 March 2022
                : 22 February 2022
                : 08 January 2022
                Page count
                Pages: 18
                Funding
                Funded by: Ministry of Education, Singapore;
                Award ID: MOE-T2EP20220-0009
                Award ID: MOE2018-T2-2-006
                Funded by: Natural Science Foundation of China, DOI 10.13039/501100001809;
                Award ID: 11931014
                Award ID: 12171229
                Award ID: 22ZR1420500
                Funded by: Natural Science Foundation of Shanghai, DOI 10.13039/100007219;
                Categories
                AcademicSubjects/SCI00010
                Narese/9
                Methods Online

                Genetics
                Genetics

                Comments

                Comment on this article