15
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Systematic evaluation of cell-type deconvolution pipelines for sequencing-based bulk DNA methylomes

      review-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          DNA methylation analysis by sequencing is becoming increasingly popular, yielding methylomes at single-base pair and single-molecule resolution. It has tremendous potential for cell-type heterogeneity analysis using intrinsic read-level information. Although diverse deconvolution methods were developed to infer cell-type composition based on bulk sequencing-based methylomes, systematic evaluation has not been performed yet. Here, we thoroughly benchmark six previously published methods: Bayesian epiallele detection, DXM, PRISM, csmFinder+coMethy, ClubCpG and MethylPurify, together with two array-based methods, MeDeCom and Houseman, as a comparison group. Sequencing-based deconvolution methods consist of two main steps, informative region selection and cell-type composition estimation, thus each was individually assessed. With this elaborate evaluation, we aimed to establish which method achieves the highest performance in different scenarios of synthetic bulk samples. We found that cell-type deconvolution performance is influenced by different factors depending on the number of cell types within the mixture. Finally, we propose a best-practice deconvolution strategy for sequencing data and point out limitations that need to be handled. Array-based methods—both reference-based and reference-free—generally outperformed sequencing-based methods, despite the absence of read-level information. This implies that the current sequencing-based methods still struggle with correctly identifying cell-type-specific signals and eliminating confounding methylation patterns, which needs to be handled in future studies.

          Related collections

          Most cited references60

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Cutadapt removes adapter sequences from high-throughput sequencing reads

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications

              Summary: A combination of bisulfite treatment of DNA and high-throughput sequencing (BS-Seq) can capture a snapshot of a cell's epigenomic state by revealing its genome-wide cytosine methylation at single base resolution. Bismark is a flexible tool for the time-efficient analysis of BS-Seq data which performs both read mapping and methylation calling in a single convenient step. Its output discriminates between cytosines in CpG, CHG and CHH context and enables bench scientists to visualize and interpret their methylation data soon after the sequencing run is completed. Availability and implementation: Bismark is released under the GNU GPLv3+ licence. The source code is freely available from www.bioinformatics.bbsrc.ac.uk/projects/bismark/. Contact: felix.krueger@bbsrc.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
                Bookmark

                Author and article information

                Contributors
                Journal
                Brief Bioinform
                Brief Bioinform
                bib
                Briefings in Bioinformatics
                Oxford University Press
                1467-5463
                1477-4054
                July 2022
                06 July 2022
                06 July 2022
                : 23
                : 4
                : bbac248
                Affiliations
                Division of Cancer Epigenomics , German Cancer Research (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany
                Faculty of Mathematics and Informatics , Heidelberg University, Im Neuenheimer Feld 205, 69120, Heidelberg, Germany
                Helmholtz AI , Helmholtz Zentrum München, Ingolstädter Landstraβ e 1, 85764, Neuherberg, Germany
                Helmholtz AI , Helmholtz Zentrum München, Ingolstädter Landstraβ e 1, 85764, Neuherberg, Germany
                Division of Cancer Epigenomics , German Cancer Research (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany
                Division of Cancer Epigenomics , German Cancer Research (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany
                Division of Cancer Epigenomics , German Cancer Research (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany
                Division of Cancer Epigenomics , German Cancer Research (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany
                Division of Cancer Epigenomics , German Cancer Research (DKFZ), Im Neuenheimer Feld 280, 69120, Heidelberg, Germany
                Author notes
                Corresponding authors. Yunhee Jeong, E-mail: y.jeong@ 123456dkfz-heidelberg.de ; Pavlo Lutsik, E-mail: p.lutsik@ 123456dkfz-heidelberg.de
                Article
                bbac248
                10.1093/bib/bbac248
                9294431
                35794707
                da4b3f7f-5455-415c-8cd9-daaa60ece35e
                © The Author(s) 2022. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

                History
                : 25 March 2022
                : 18 May 2022
                : 26 May 2022
                Page count
                Pages: 15
                Funding
                Funded by: German Network for Motor Neuron Diseases, DOI 10.13039/501100010764;
                Categories
                AcademicSubjects/SCI01060
                Review

                Bioinformatics & Computational biology
                dna methylomes,deconvolution,heterogeneity,sequencing,computational epigenetics

                Comments

                Comment on this article