22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Rearrangements of 2.5 Kilobases of Noncoding DNA from the Drosophila even-skipped Locus Define Predictive Rules of Genomic cis-Regulatory Logic

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Rearrangements of about 2.5 kilobases of regulatory DNA located 5′ of the transcription start site of the Drosophila even-skipped locus generate large-scale changes in the expression of even-skipped stripes 2, 3, and 7. The most radical effects are generated by juxtaposing the minimal stripe enhancers MSE2 and MSE3 for stripes 2 and 3 with and without small “spacer” segments less than 360 bp in length. We placed these fusion constructs in a targeted transformation site and obtained quantitative expression data for these transformants together with their controlling transcription factors at cellular resolution. These data demonstrated that the rearrangements can alter expression levels in stripe 2 and the 2–3 interstripe by a factor of more than 10. We reasoned that this behavior would place tight constraints on possible rules of genomic cis-regulatory logic. To find these constraints, we confronted our new expression data together with previously obtained data on other constructs with a computational model. The model contained representations of thermodynamic protein–DNA interactions including steric interference and cooperative binding, short-range repression, direct repression, activation, and coactivation. The model was highly constrained by the training data, which it described within the limits of experimental error. The model, so constrained, was able to correctly predict expression patterns driven by enhancers for other Drosophila genes; even-skipped enhancers not included in the training set; stripe 2, 3, and 7 enhancers from various Drosophilid and Sepsid species; and long segments of even-skipped regulatory DNA that contain multiple enhancers. The model further demonstrated that elevated expression driven by a fusion of MSE2 and MSE3 was a consequence of the recruitment of a portion of MSE3 to become a functional component of MSE2, demonstrating that cis-regulatory “elements” are not elementary objects.

          Author Summary

          Metazoan genes, including those of humans, contain large noncoding regions that are required for viability. Sequence variations in these regions are statistically associated with human disease, but the mechanisms underlying these associations are not well understood. These regions regulate transcription and are frequently larger than the gene's transcript by an order of magnitude. In this paper we attempt to elucidate the regulatory code of these noncoding segments of DNA by means of quantitative spatially resolved gene expression data and a computational model. The expression data comes from the early embryo of the fruit fly Drosophila melanogaster. We chose a family of DNA constructs to analyze that drive very different patterns of expression when very small changes in DNA sequence are made, reasoning that this sensitivity would reveal important properties of the regulatory code. The model reproduced the training data with precision greater than the expected accuracy of the training data itself. It was able to correctly predict from DNA sequence the expression of 44 segments of DNA from many genes and species.

          Related collections

          Most cited references58

          • Record: found
          • Abstract: found
          • Article: not found

          A systems approach to measuring the binding energy landscapes of transcription factors.

          A major goal of systems biology is to predict the function of biological networks. Although network topologies have been successfully determined in many cases, the quantitative parameters governing these networks generally have not. Measuring affinities of molecular interactions in high-throughput format remains problematic, especially for transient and low-affinity interactions. We describe a high-throughput microfluidic platform that measures such properties on the basis of mechanical trapping of molecular interactions. With this platform we characterized DNA binding energy landscapes for four eukaryotic transcription factors; these landscapes were used to test basic assumptions about transcription factor binding and to predict their in vivo function.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Transcriptional activation by recruitment.

            The recruitment model for gene activation stipulates that an activator works by bringing the transcriptional machinery to the DNA. Recent experiments in bacteria and yeast indicate that many genes can be activated by this mechanism. These findings have implications for our understanding of the nature of activating regions and their targets, and for the role of histones in gene regulation.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Predicting expression patterns from regulatory sequence in Drosophila segmentation.

              The establishment of complex expression patterns at precise times and locations is key to metazoan development, yet a mechanistic understanding of the underlying transcription control networks is still missing. Here we describe a novel thermodynamic model that computes expression patterns as a function of cis-regulatory sequence and of the binding-site preferences and expression of participating transcription factors. We apply this model to the segmentation gene network of Drosophila melanogaster and find that it predicts expression patterns of cis-regulatory modules with remarkable accuracy, demonstrating that positional information is encoded in the regulatory sequence and input factor distribution. Our analysis reveals that both strong and weaker binding sites contribute, leading to high occupancy of the module DNA, and conferring robustness against mutation; short-range homotypic clustering of weaker sites facilitates cooperative binding, which is necessary to sharpen the patterns. Our computational framework is generally applicable to most protein-DNA interaction systems.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Genet
                PLoS Genet
                plos
                plosgen
                PLoS Genetics
                Public Library of Science (San Francisco, USA )
                1553-7390
                1553-7404
                February 2013
                February 2013
                28 February 2013
                : 9
                : 2
                : e1003243
                Affiliations
                [1 ]Department of Ecology and Evolution, Chicago Center for Systems Biology, University of Chicago, Chicago, Illinois, United States of America
                [2 ]Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, New York, United States of America
                [3 ]Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
                [4 ]Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, São Paulo, Brazil
                [5 ]Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
                [6 ]Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America
                [7 ]Department of Statistics, Department of Molecular Genetics and Cell Biology, and Institute of Genomics and Systems Biology, University of Chicago, Chicago, Illinois, United States of America
                University of California Berkeley, United States of America
                Author notes

                The authors have declared that no competing interests exist.

                Wrote the code: CM JI A-RK. Formulated the transcription model: JR DHS A-RK. Generated the lines, obtained quantitative data, performed simulations, and analyzed the results: A-RK. Derived the diffusion limited Arrhenius equation: AFR. Identified non-mel stripe 2 enhancers: MZL A-RK. Performed SELEX experiments: NO. Conceived and designed the experiments: JR A-RK. Wrote the paper: JR A-RK DHS.

                Article
                PGENETICS-D-12-00674
                10.1371/journal.pgen.1003243
                3585115
                23468638
                c4c15dca-4cc9-42f7-8fcc-a07a51d3ea2e
                Copyright @ 2013

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 15 March 2012
                : 30 November 2012
                Page count
                Pages: 18
                Funding
                This work was supported by awards NIH RO1 OD010936 (formerly RR07801), NIH P50 GM081892, NIH R01 GM70444, and the University of Chicago ( http://www.uchicago.edu). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology
                Biochemistry
                Computational Biology
                Developmental Biology
                Evolutionary Biology
                Genetics
                Genomics
                Systems Biology
                Computer Science
                Computer Modeling
                Mathematics
                Applied Mathematics

                Genetics
                Genetics

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content13

                Cited by31

                Most referenced authors341