0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Scbean: a python library for single-cell multi-omics data analysis

      brief-report

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Summary

          Single-cell multi-omics technologies provide a unique platform for characterizing cell states and reconstructing developmental process by simultaneously quantifying and integrating molecular signatures across various modalities, including genome, transcriptome, epigenome, and other omics layers. However, there is still an urgent unmet need for novel computational tools in this nascent field, which are critical for both effective and efficient interrogation of functionality across different omics modalities. Scbean represents a user-friendly Python library, designed to seamlessly incorporate a diverse array of models for the examination of single-cell data, encompassing both paired and unpaired multi-omics data. The library offers uniform and straightforward interfaces for tasks, such as dimensionality reduction, batch effect elimination, cell label transfer from well-annotated scRNA-seq data to scATAC-seq data, and the identification of spatially variable genes. Moreover, Scbean’s models are engineered to harness the computational power of GPU acceleration through Tensorflow, rendering them capable of effortlessly handling datasets comprising millions of cells.

          Availability and implementation

          Scbean is released on the Python Package Index (PyPI) ( https://pypi.org/project/scbean/) and GitHub ( https://github.com/jhu99/scbean) under the MIT license. The documentation and example code can be found at https://scbean.readthedocs.io/en/latest/.

          Related collections

          Most cited references23

          • Record: found
          • Abstract: found
          • Article: not found

          Comprehensive Integration of Single-Cell Data

          Single-cell transcriptomics has transformed our ability to characterize cell states, but deep biological understanding requires more than a taxonomic listing of clusters. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these datasets to better understand cellular identity and function. Here, we develop a strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities. After demonstrating improvement over existing methods for integrating scRNA-seq data, we anchor scRNA-seq experiments with scATAC-seq to explore chromatin differences in closely related interneuron subsets and project protein expression measurements onto a bone marrow atlas to characterize lymphocyte populations. Lastly, we harmonize in situ gene expression and scRNA-seq datasets, allowing transcriptome-wide imputation of spatial gene expression patterns. Our work presents a strategy for the assembly of harmonized references and transfer of information across datasets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Integrated analysis of multimodal single-cell data

            Summary The simultaneous measurement of multiple modalities represents an exciting frontier for single-cell genomics and necessitates computational methods that can define cellular states based on multimodal data. Here, we introduce “weighted-nearest neighbor” analysis, an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities. We apply our procedure to a CITE-seq dataset of 211,000 human peripheral blood mononuclear cells (PBMCs) with panels extending to 228 antibodies to construct a multimodal reference atlas of the circulating immune system. Multimodal analysis substantially improves our ability to resolve cell states, allowing us to identify and validate previously unreported lymphoid subpopulations. Moreover, we demonstrate how to leverage this reference to rapidly map new datasets and to interpret immune responses to vaccination and coronavirus disease 2019 (COVID-19). Our approach represents a broadly applicable strategy to analyze single-cell multimodal datasets and to look beyond the transcriptome toward a unified and multimodal definition of cellular identity.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              SCANPY : large-scale single-cell gene expression data analysis

              Scanpy is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks. Its Python-based implementation efficiently deals with data sets of more than one million cells (https://github.com/theislab/Scanpy). Along with Scanpy, we present AnnData, a generic class for handling annotated data matrices (https://github.com/theislab/anndata).
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                February 2024
                30 January 2024
                30 January 2024
                : 40
                : 2
                : btae053
                Affiliations
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                School of Life Science, Northwestern Polytechnical University , 710072 Xi'an, Shaanxi, China
                Population Health Sciences, German Center for Neurodegenerative Diseases (DZNE) , 53127 Bonn, Germany
                Department of Neurology, Faculty of Medicine, University of Bonn , 53105 Bonn, Germany
                School of Computer Science, Northwestern Polytechnical University , 710129 Xi'an, Shaanxi, China
                Population Health Sciences, German Center for Neurodegenerative Diseases (DZNE) , 53127 Bonn, Germany
                Author notes
                Corresponding authors. School of Computer Science, Northwestern Polytechnical University, 710129 Xi'an, Shaanxi, China. Populaion Health Science, German Center for Neurodegenerative Diseases, 53127 Bonn, Germany. E-mails: jhu@ 123456nwpu.edu.cn (J.H.) and Ahmad.Aziz@ 123456dzne.de  (A.A)

                Equal contribution by Haohui Zhang and Yuwei Wang.

                Author information
                https://orcid.org/0000-0002-6004-4174
                https://orcid.org/0000-0002-5728-6463
                https://orcid.org/0000-0002-3351-8020
                Article
                btae053
                10.1093/bioinformatics/btae053
                10868338
                38290765
                5cd5172a-588f-4fcd-aa40-c7e21c7abd5d
                © The Author(s) 2024. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 10 October 2023
                : 10 January 2024
                : 22 January 2024
                : 25 January 2024
                : 09 February 2024
                Page count
                Pages: 4
                Funding
                Funded by: National Natural Science Foundation of China, DOI 10.13039/501100001809;
                Award ID: 62072374
                Funded by: China Scholarship Council, DOI 10.13039/501100004543;
                Funded by: Fundamental Research Funds for the Central Universities, DOI 10.13039/501100012226;
                Categories
                Applications Note
                Gene Expression
                AcademicSubjects/SCI01060

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article