73
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Visualizing Single-Cell RNA-seq Data with Semisupervised Principal Component Analysis

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Single-cell RNA-seq (scRNA-seq) is a powerful tool for analyzing heterogeneous and functionally diverse cell population. Visualizing scRNA-seq data can help us effectively extract meaningful biological information and identify novel cell subtypes. Currently, the most popular methods for scRNA-seq visualization are principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE). While PCA is an unsupervised dimension reduction technique, t-SNE incorporates cluster information into pairwise probability, and then maximizes the Kullback–Leibler divergence. Uniform Manifold Approximation and Projection (UMAP) is another recently developed visualization method similar to t-SNE. However, one limitation with UMAP and t-SNE is that they can only capture the local structure of the data, the global structure of the data is not faithfully preserved. In this manuscript, we propose a semisupervised principal component analysis (ssPCA) approach for scRNA-seq visualization. The proposed approach incorporates cluster-labels into dimension reduction and discovers principal components that maximize both data variance and cluster dependence. ssPCA must have cluster-labels as its input. Therefore, it is most useful for visualizing clusters from a scRNA-seq clustering software. Our experiments with simulation and real scRNA-seq data demonstrate that ssPCA is able to preserve both local and global structures of the data, and uncover the transition and progressions in the data, if they exist. In addition, ssPCA is convex and has a global optimal solution. It is also robust and computationally efficient, making it viable for scRNA-seq cluster visualization.

          Related collections

          Most cited references11

          • Record: found
          • Abstract: found
          • Article: not found

          Single cell RNA Seq reveals dynamic paracrine control of cellular variation

          High-throughput single-cell transcriptomics offers an unbiased approach for understanding the extent, basis, and function of gene expression variation between seemingly identical cells. Here, we sequence single-cell RNA-Seq libraries prepared from over 1,700 primary mouse bone marrow derived dendritic cells (DCs) spanning several experimental conditions. We find substantial variation between identically stimulated DCs, in both the fraction of cells detectably expressing a given mRNA and the transcript’s level within expressing cells. Distinct gene modules are characterized by different temporal heterogeneity profiles. In particular, a “core” module of antiviral genes is expressed very early by a few “precocious” cells, but is later activated in all cells. By stimulating cells individually in sealed microfluidic chambers, analyzing DCs from knockout mice, and modulating secretion and extracellular signaling, we show that this response is coordinated via interferon-mediated paracrine signaling. Surprisingly, preventing cell-to-cell communication also substantially reduces variability in the expression of an early-induced “peaked” inflammatory module, suggesting that paracrine signaling additionally represses part of the inflammatory program. Our study highlights the importance of cell-to-cell communication in controlling cellular heterogeneity and reveals general strategies that multicellular populations use to establish complex dynamic responses.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning

            The SIMLR software identifies similarities between cells across a range of single-cell RNA-seq data, enabling effective dimension reduction, clustering and visualization.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Single Cell RNA-Sequencing of Pluripotent States Unlocks Modular Transcriptional Variation

              Summary Embryonic stem cell (ESC) culture conditions are important for maintaining long-term self-renewal, and they influence cellular pluripotency state. Here, we report single cell RNA-sequencing of mESCs cultured in three different conditions: serum, 2i, and the alternative ground state a2i. We find that the cellular transcriptomes of cells grown in these conditions are distinct, with 2i being the most similar to blastocyst cells and including a subpopulation resembling the two-cell embryo state. Overall levels of intercellular gene expression heterogeneity are comparable across the three conditions. However, this masks variable expression of pluripotency genes in serum cells and homogeneous expression in 2i and a2i cells. Additionally, genes related to the cell cycle are more variably expressed in the 2i and a2i conditions. Mining of our dataset for correlations in gene expression allowed us to identify additional components of the pluripotency network, including Ptma and Zfp640, illustrating its value as a resource for future discovery.
                Bookmark

                Author and article information

                Journal
                Int J Mol Sci
                Int J Mol Sci
                ijms
                International Journal of Molecular Sciences
                MDPI
                1422-0067
                12 August 2020
                August 2020
                : 21
                : 16
                : 5797
                Affiliations
                Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA 17033, USA; zliu3@ 123456phs.psu.edu
                Author information
                https://orcid.org/0000-0003-3052-9151
                Article
                ijms-21-05797
                10.3390/ijms21165797
                7460854
                32806757
                e00f85eb-5d56-404f-bf15-4b4aecdfb159
                © 2020 by the author.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 30 June 2020
                : 10 August 2020
                Categories
                Article

                Molecular biology
                scrna-seq visualization,semisupervised principal component analysis,dimension reduction,cluster visualization,nonlinear visualization

                Comments

                Comment on this article