0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Exhaustive identification of genome-wide binding events of transcriptional regulators

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genome-wide binding assays aspire to map the complete binding pattern of gene regulators. Common practice relies on replication—duplicates or triplicates—and high stringency statistics to favor false negatives over false positives. Here we show that duplicates and triplicates of CUT&RUN are not sufficient to discover the entire activity of transcriptional regulators. We introduce ICEBERG (Increased Capture of Enrichment By Exhaustive Replicate aGgregation), a pipeline that harnesses large numbers of CUT&RUN replicates to discover the full set of binding events and chart the line between false positives and false negatives. We employed ICEBERG to map the full set of H3K4me3-marked regions, the targets of the co-factor β-catenin, and those of the transcription factor TBX3, in human colorectal cancer cells. The ICEBERG datasets allow benchmarking of individual replicates, comparing the performance of peak calling and replication approaches, and expose the arbitrary nature of strategies to identify reproducible peaks. Instead of a static view of genomic targets, ICEBERG establishes a spectrum of detection probabilities across the genome for a given factor, underlying the intrinsic dynamicity of its mechanism of action, and permitting to distinguish frequent from rare regulation events. Finally, ICEBERG discovered instances, undetectable with other approaches, that underlie novel mechanisms of colorectal cancer progression.

          Graphical Abstract

          Graphical Abstract

          Related collections

          Most cited references74

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fast gapped-read alignment with Bowtie 2.

            As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              BEDTools: a flexible suite of utilities for comparing genomic features

              Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing web-based methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner. Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools Contact: aaronquinlan@gmail.com; imh4y@virginia.edu Supplementary information: Supplementary data are available at Bioinformatics online.
                Bookmark

                Author and article information

                Contributors
                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                24 April 2024
                18 March 2024
                18 March 2024
                : 52
                : 7
                : e40
                Affiliations
                Wallenberg Centre for Molecular Medicine, Linköping University , Linköping, Sweden
                Department of Biomedical and Clinical Sciences, Division of Molecular Medicine and Virology, Faculty of Medicine and Health Sciences, Linköping University , Linköping, Sweden
                Wallenberg Centre for Molecular Medicine, Linköping University , Linköping, Sweden
                Department of Biomedical and Clinical Sciences, Division of Molecular Medicine and Virology, Faculty of Medicine and Health Sciences, Linköping University , Linköping, Sweden
                Wallenberg Centre for Molecular Medicine, Linköping University , Linköping, Sweden
                Department of Biomedical and Clinical Sciences, Division of Molecular Medicine and Virology, Faculty of Medicine and Health Sciences, Linköping University , Linköping, Sweden
                Wallenberg Centre for Molecular Medicine, Linköping University , Linköping, Sweden
                Department of Biomedical and Clinical Sciences, Division of Molecular Medicine and Virology, Faculty of Medicine and Health Sciences, Linköping University , Linköping, Sweden
                Author notes
                To whom correspondence should be addressed. Tel: +46 13282462; Email: claudio.cantu@ 123456liu.se ; Twitter/X: @ClaudioCantu81

                The first two authors should be regarded as Joint First Authors.

                Author information
                https://orcid.org/0000-0002-5868-4797
                https://orcid.org/0000-0001-6912-0957
                https://orcid.org/0000-0003-1275-6027
                https://orcid.org/0000-0003-1547-5415
                Article
                gkae180
                10.1093/nar/gkae180
                11040144
                38499482
                6d0d3075-dc04-4e9a-bc38-ce498bdc2b59
                © The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 29 February 2024
                : 20 February 2024
                : 21 September 2023
                Page count
                Pages: 18
                Funding
                Funded by: Cancerfonden, DOI 10.13039/501100002794;
                Award ID: CAN 2018/542
                Award ID: 21 1572 Pj
                Funded by: Swedish Research Council, DOI 10.13039/501100004359;
                Award ID: 2021-03075
                Award ID: 2023-01898
                Funded by: Linköping University, DOI 10.13039/501100003945;
                Funded by: Additional Ventures, DOI 10.13039/100020415;
                Award ID: SVRF2021-1048003
                Funded by: Knut and Alice Wallenberg Foundation, DOI 10.13039/501100004063;
                Categories
                AcademicSubjects/SCI00010
                Narese/3
                Methods

                Genetics
                Genetics

                Comments

                Comment on this article