46
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Single-nucleus cross-tissue molecular reference maps toward understanding disease gene function

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA sequencing methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross-tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue-specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, metabolic, and immune components of monogenic diseases and the biological processes involved in their pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex traits analyzed by genome-wide association studies.

          Cartography of human cells

          The function of disease genes active in different cell types is modulated to meet the needs of the different tissues and organs in which the cells reside. Resolving these differences is critical to understanding homeostasis and disease. However, single-cell atlases generated to date have largely focused on individual tissues. Eraslan et al . applied single-nucleus RNA sequencing to frozen, banked samples from eight healthy human organs from 16 donors and characterized cell populations across tissues, including tissue-resident myeloid and fibroblast populations, and their role in tissue support and immunity (see the Perspective by Liu and Zhang). Using this cross-tissue atlas, the authors linked specific cell populations to monogenic and polygenic diseases, suggesting cell- and tissue-specific programs. —LZ and DJ

          Abstract

          Optimized single-nucleus RNA sequencing of eight frozen, banked human tissues helps reveal cell types and mechanisms underlying monogenic diseases and complex traits.

          Abstract

          INTRODUCTION

          Understanding and treating disease requires deep, systematic characterization of different cells and their interactions across human tissues and organs, along with characterization of the genetic variants that causally contribute to disease risk. Recent studies have combined single-cell atlases of specific human tissues and organs with genes associated with human disease to relate risk variants to likely cells of action. ​​However, it has been challenging to extend these studies to profile multiple tissues and organs across the body, conduct studies at population scale, and integrate cell atlases from multiple organs to yield unified insights.

          RATIONALE

          Because of the pleiotropy and specificity of disease-associated variants, systematically relating variants to cells and molecular processes requires analysis across multiple tissues and individuals. Prior cell atlases primarily relied on fresh tissue samples from a single organ or tissue. Single-nucleus RNA sequencing (snRNA-seq) can be applied to frozen, archived tissue and captures cell types that do not survive dissociation across many tissues. Deep learning methods can integrate data across individuals and tissues by controlling for batch effects while preserving biological variation.

          RESULTS

          We established a framework for multitissue human cell atlases and generated an atlas of 209,126 snRNA-seq profiles from eight tissue types across 16 individuals, archived as frozen tissue as part of the Genotype-Tissue Expression (GTEx) project. We benchmarked four protocols and show how to apply them in a pooled setting to enable larger studies. We integrated the cross-tissue atlas using a conditional variational autoencoder, annotated it with 43 broad and 74 fine categories, and demonstrated its use to decipher tissue residency, such as a macrophage dichotomy and lipid associations that are preserved across tissues, and tissue-specific fibroblast features, including lung alveolar fibroblasts with likely roles in mechanosensation. We relate cells to human disease biology and disease-risk genes for both rare and common diseases, including rare muscle disease gene groups enriched in distinct subsets of myonuclei and nonmyocytes, and cell type–specific enrichment of expression and splicing quantitative trait locus (QTL) target genes mapped to genome-wide association study loci.

          CONCLUSION

          Our framework will empower large, cross-tissue population and/or disease studies at single-cell resolution. These frameworks and the cross-tissue perspective provided here will form a basis for larger-scale future studies to improve our understanding of cross-tissue and cross-individual variation of cellular phenotypes in relation to disease-associated genetic variation.

          Related collections

          Most cited references197

          • Record: found
          • Abstract: found
          • Article: not found

          Fast, sensitive, and accurate integration of single cell data with Harmony

          The emerging diversity of single cell RNAseq datasets allows for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. However, it is challenging to analyze them together, particularly when datasets are assayed with different technologies. Here, real biological differences are interspersed with technical differences. We present Harmony, an algorithm that projects cells into a shared embedding in which cells group by cell type rather than dataset-specific conditions. Harmony simultaneously accounts for multiple experimental and biological factors. In six analyses, we demonstrate the superior performance of Harmony to previously published algorithms. We show that Harmony requires dramatically fewer computational resources. It is the only currently available algorithm that makes the integration of ~106 cells feasible on a personal computer. We apply Harmony to PBMCs from datasets with large experimental differences, 5 studies of pancreatic islet cells, mouse embryogenesis datasets, and cross-modality spatial integration.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            SCANPY : large-scale single-cell gene expression data analysis

            Scanpy is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks. Its Python-based implementation efficiently deals with data sets of more than one million cells (https://github.com/theislab/Scanpy). Along with Scanpy, we present AnnData, a generic class for handling annotated data matrices (https://github.com/theislab/anndata).
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

              Heterogeneity and latent variables are now widely recognized as major sources of bias and variability in high-throughput experiments. The most well-known source of latent variation in genomic experiments are batch effects-when samples are processed on different days, in different groups or by different people. However, there are also a large number of other variables that may have a major impact on high-throughput measurements. Here we describe the sva package for identifying, estimating and removing unwanted sources of variation in high-throughput experiments. The sva package supports surrogate variable estimation with the sva function, direct adjustment for known batch effects with the ComBat function and adjustment for batch and latent variables in prediction problems with the fsva function.
                Bookmark

                Author and article information

                Contributors
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                (View ORCID Profile)
                Journal
                Science
                Science
                American Association for the Advancement of Science (AAAS)
                0036-8075
                1095-9203
                May 13 2022
                May 13 2022
                : 376
                : 6594
                Affiliations
                [1 ]Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
                [2 ]The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
                [3 ]Department of Ophthalmology, Harvard Medical School, Boston, MA 02115, USA.
                [4 ]Ocular Genomics Institute, Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA 02114, USA.
                [5 ]Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
                [6 ]Department of Genetics, Yale School of Medicine, New Haven, CT 06510, USA.
                [7 ]Department of Dermatology, Brigham and Women’s Hospital, Boston, MA 02115, USA.
                [8 ]Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
                [9 ]The Joint Pathology Center Gynecologic/Breast Pathology, Silver Spring, MD 20910, USA.
                [10 ]Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA.
                [11 ]Center for Cancer Research and Department of Pathology, Massachusetts General Hospital, Boston, MA 02114, USA.
                [12 ]Harvard Medical School, Boston, MA 02115, USA.
                [13 ]Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
                Article
                10.1126/science.abl4290
                35549429
                720dfb9a-217c-4ab4-8e10-aa4adb7a192e
                © 2022
                History

                Comments

                Comment on this article