      Re-evaluation of G-quadruplex propensity with G4Hunter

      research-article


          Abstract

          Critical evidence for the biological relevance of G-quadruplexes (G4) has recently been obtained in seminal studies performed in a variety of organisms. Four-stranded G-quadruplex DNA structures are promising drug targets, as these non-canonical structures appear to be involved in a number of key biological processes. Given the growing interest in G4, accurate tools to predict the G-quadruplex propensity of a given DNA or RNA sequence are needed. Several algorithms, such as Quadparser, predict quadruplex-forming propensity. However, a number of studies have established that sequences not detected by these tools do form G4 structures (false negatives) and that other sequences predicted to form G4 structures do not (false positives). Here we report the development and testing of a radically different algorithm, G4Hunter, which takes into account the G-richness and G-skewness of a given sequence and gives a quadruplex propensity score as output. To validate this model, we tested it on a large dataset of 392 published sequences and experimentally evaluated the quadruplex-forming potential of 209 sequences, using a combination of biophysical methods to assess quadruplex formation in vitro. We experimentally validated the G4Hunter algorithm on a short complete genome, that of the human mitochondria (16.6 kb), because of its relatively high GC content and GC skewness as well as the biological relevance of these quadruplexes near instability hotspots. We then applied the algorithm to the genomes of a number of species, including humans, allowing us to conclude that the number of sequences capable of forming stable quadruplexes (at least in vitro) in the human genome is significantly higher, by a factor of 2–10, than previously thought.
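
          The abstract says only that the score reflects G-richness and G-skewness; it does not spell out the scoring rule. The following is a minimal Python sketch under an assumed scheme, not the authors' implementation: each G in a run of n consecutive Gs scores min(n, 4), each C scores the corresponding negative value, A/T/U score 0, and the propensity is the mean over the sequence. The function names `base_scores` and `propensity_score` and the cap of 4 are illustrative assumptions.

```python
from itertools import groupby


def base_scores(seq: str, cap: int = 4) -> list[int]:
    """Per-base scores under the ASSUMED scheme (not taken from the abstract).

    A G inside a run of n Gs scores min(n, cap); a C inside a run of n Cs
    scores -min(n, cap); every other base scores 0.
    """
    scores: list[int] = []
    for base, run in groupby(seq.upper()):
        n = len(list(run))  # length of this run of identical bases
        if base == "G":
            value = min(n, cap)
        elif base == "C":
            value = -min(n, cap)
        else:
            value = 0
        scores.extend([value] * n)
    return scores


def propensity_score(seq: str) -> float:
    """Mean per-base score; positive for G-rich, negative for C-rich strands."""
    scores = base_scores(seq)
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    telomeric = "GGGTTAGGGTTAGGGTTAGGG"  # four G-runs of length 3
    print(round(propensity_score(telomeric), 3))
```

          Under this scheme G-richness raises the score through long G-runs, while skewness enters through the sign: a C-rich strand scores negative, so a sequence and its reverse complement receive scores of equal magnitude and opposite sign.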

                Author and article information

                Journal
                Nucleic Acids Res
                nar
                Nucleic Acids Research
                Oxford University Press
                ISSN 0305-1048
                ISSN 1362-4962
                29 February 2016
                20 January 2016
                Volume: 44
                Issue: 4
                Pages: 1746-1759
                Affiliations
                [1] Université de Bordeaux, ARNA Laboratory, F-33000 Bordeaux, France
                [2] Inserm U1212, CNRS UMR 5320, IECB, F-33600 Pessac, France
                [3] CNRS-Université de Toulouse UMR5099, F-31000 Toulouse, France
                Author notes
                [*] To whom correspondence should be addressed. Tel: +33 5 4000 3022; Email: jean-louis.mergny@inserm.fr
                Correspondence may also be addressed to Laurent Lacroix. Tel: +33 5 6133 5992; Email: laurent.lacroix@inserm.fr
                These authors contributed equally to the paper as first authors.
                Article
                DOI: 10.1093/nar/gkw006
                PMCID: PMC4770238
                PMID: 26792894
                © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

                History
                Accepted: 3 January 2016
                Revised: 14 December 2015
                Received: 7 October 2015
                Page count
                Pages: 14
                Categories
                Genomics
                Genetics