      Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels


          Abstract

          Background

          Although the overwhelming majority of genes found in angiosperms are members of gene families, and both gene and genome duplication are pervasive forces in plant genomes, some genes are sufficiently distinct from all other genes in a genome that they can be operationally defined as 'single copy'. Using the gene clustering algorithm MCL-tribe, we have identified a set of 959 genes that are single copy in each of the genomes of Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa. To characterize these genes, we have examined GO annotations, coding sequence length, number of exons, number of domains, and presence in distant lineages such as Selaginella and Physcomitrella, and have used phylogenetic analysis to estimate copy number in other seed plants and to demonstrate the phylogenetic utility of these genes. We then provide examples of how these genes may be used in phylogenetic analyses to reconstruct organismal history, both by using existing coverage in EST databases for seed plants and by de novo amplification via RT-PCR in the family Brassicaceae.
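The selection step described above can be illustrated with a minimal sketch. It assumes the clustering output has already been parsed into a mapping from cluster id to (genome, gene id) pairs, and it treats a cluster as "shared single copy" when it contains exactly one gene from each of the four genomes and nothing else. The input format, the cluster names, the gene ids and the function name `shared_single_copy` are hypothetical and are not taken from the article or from MCL-tribe itself.

```python
from collections import Counter

# The four genomes named in the abstract.
GENOMES = ("Arabidopsis", "Populus", "Vitis", "Oryza")


def shared_single_copy(clusters, genomes=GENOMES):
    """Return ids of clusters holding exactly one gene per genome and no other genes.

    `clusters` maps a cluster id to a list of (genome, gene_id) pairs.
    This layout is an assumption made for illustration only.
    """
    selected = []
    for cluster_id, members in clusters.items():
        counts = Counter(genome for genome, _gene in members)
        one_per_genome = all(counts.get(g, 0) == 1 for g in genomes)
        no_extra_members = len(members) == len(genomes)
        if one_per_genome and no_extra_members:
            selected.append(cluster_id)
    return selected


# Hypothetical toy clusters (gene ids are placeholders, not real loci).
clusters = {
    "tribe_1": [("Arabidopsis", "a1"), ("Populus", "p1"),
                ("Vitis", "v1"), ("Oryza", "o1")],
    # Two Arabidopsis members: not single copy.
    "tribe_2": [("Arabidopsis", "a2"), ("Arabidopsis", "a3"),
                ("Populus", "p2"), ("Vitis", "v2"), ("Oryza", "o2")],
    # No Oryza member: not shared across all four genomes.
    "tribe_3": [("Arabidopsis", "a4"), ("Populus", "p3"), ("Vitis", "v3")],
}

print(shared_single_copy(clusters))
```

Only `tribe_1` passes the filter in this toy input. A real analysis would also have to decide how to treat near-duplicate gene models and annotation errors, which this sketch does not attempt.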

          Results

          There are 959 single copy nuclear genes shared in Arabidopsis, Populus, Vitis and Oryza ["APVO SSC genes"]. The majority of these genes are also present in the Selaginella and Physcomitrella genomes. Public EST sets for 197 species suggest that most of these genes are present across a diverse collection of seed plants, and appear to exist as single or very low copy genes, though exceptions are seen in recently polyploid taxa and in lineages where there is significant evidence for a shared large-scale duplication event. Genes encoding proteins localized in organelles are more commonly single copy than expected by chance, but the evolutionary forces responsible for this bias are unknown.

          Regardless of the evolutionary mechanisms responsible for the large number of shared single copy genes in diverse flowering plant lineages, these genes are valuable for phylogenetic and comparative analyses. Eighteen of the APVO SSC genes were amplified in the Brassicaceae using RT-PCR and directly sequenced. Alignments of these sequences provide improved resolution of Brassicaceae phylogeny compared to recent studies using plastid and ITS sequences. An analysis of sequences from 13 APVO SSC genes from 69 species of seed plants, derived mainly from public EST databases, yielded a phylogeny that was largely congruent with prior hypotheses based on multiple plastid sequences. Whereas single gene phylogenies that rely on EST sequences have limited bootstrap support because each gene contributes little sequence information, concatenated alignments result in phylogenetic trees with strong bootstrap support for already established relationships. Overall, these single copy nuclear genes are promising markers for phylogenetics, and contain a greater proportion of phylogenetically informative sites than commonly used protein-coding sequences from the plastid or mitochondrial genomes.
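The concatenation idea mentioned above can be sketched as follows. This is a minimal illustration, assuming each gene alignment is a dictionary from taxon name to an already aligned sequence, and that a taxon missing from a gene (common with EST-derived data) is padded with gap characters. The function name `concatenate_alignments`, the taxon labels and the sequences are hypothetical, and this is not the authors' pipeline.

```python
def concatenate_alignments(alignments, missing="-"):
    """Join per-gene alignments into one supermatrix, padding absent taxa.

    `alignments` is a list of dicts mapping taxon -> aligned sequence.
    All sequences within one alignment must have the same length.
    """
    # Union of taxa across all genes, in a stable order.
    taxa = sorted({taxon for aln in alignments for taxon in aln})
    parts = {taxon: [] for taxon in taxa}
    for aln in alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within one alignment must have equal length")
        width = lengths.pop()
        for taxon in taxa:
            # Pad with the missing-data character when the taxon lacks this gene.
            parts[taxon].append(aln.get(taxon, missing * width))
    return {taxon: "".join(chunks) for taxon, chunks in parts.items()}


# Hypothetical toy alignments for two genes and three taxa.
gene_a = {"sp1": "ATGC", "sp2": "ATGA", "sp3": "ATGT"}
gene_b = {"sp1": "GGC", "sp3": "GGT"}  # sp2 has no sequence for this gene

combined = concatenate_alignments([gene_a, gene_b])
for taxon, seq in combined.items():
    print(taxon, seq)
```

Each taxon ends up with a row of the same total length, so the combined matrix can be passed to a tree-building program; the padded positions are treated as missing data.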

          Conclusions

          Putatively orthologous, shared single copy nuclear genes provide a vast source of new evidence for plant phylogenetics, genome mapping, and other applications, as well as a substantial class of genes for which functional characterization is needed. Preliminary evidence indicates that many of the shared single copy nuclear genes identified in this study may be well suited as markers for addressing phylogenetic hypotheses at a variety of taxonomic levels.

                Author and article information

                Journal
                BMC Evol Biol
                BMC Evolutionary Biology
                BioMed Central
                1471-2148
                2010
                24 February 2010
                Volume: 10
                Article number: 61
                Affiliations
                [1 ]Department of Biology and the Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA
                [2 ]Division of Biological Sciences, University of Missouri-Columbia, Columbia, MO 65211-7310, USA
                [3 ]Department of Plant Biology, University of Georgia, Athens, GA 30602-7271, USA
                [4 ]State Key Laboratory of Genetic Engineering, the Institute of Plant Biology, School of Life Sciences, Institutes of Biomedical Sciences, Center for Evolutionary Biology, Fudan University, 220 Handan Road, Shanghai 200433, PR China
                [5 ]BASF Plant Science, Research Triangle Park, NC, 27709, USA
                Article
                1471-2148-10-61
                10.1186/1471-2148-10-61
                2848037
                20181251
                b42bbcb9-ccd7-4b06-a0b4-000f41b4833b
                Copyright ©2010 Duarte et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 31 July 2008
                : 24 February 2010
                Categories
                Research article

                Evolutionary Biology
