5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Unraveling the Complex Hybrid Ancestry and Domestication History of Cultivated Strawberry

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Cultivated strawberry ( Fragaria × ananassa) is one of our youngest domesticates, originating in early eighteenth-century Europe from spontaneous hybrids between wild allo-octoploid species ( Fragaria chiloensis and Fragaria virginiana). The improvement of horticultural traits by 300 years of breeding has enabled the global expansion of strawberry production. Here, we describe the genomic history of strawberry domestication from the earliest hybrids to modern cultivars. We observed a significant increase in heterozygosity among interspecific hybrids and a decrease in heterozygosity among domesticated descendants of those hybrids. Selective sweeps were found across the genome in early and modern phases of domestication—59–76% of the selectively swept genes originated in the three less dominant ancestral subgenomes. Contrary to the tenet that genetic diversity is limited in cultivated strawberry, we found that the octoploid species harbor massive allelic diversity and that F. × ananassa harbors as much allelic diversity as either wild founder. We identified 41.8 M subgenome-specific DNA variants among resequenced wild and domesticated individuals. Strikingly, 98% of common alleles and 73% of total alleles were shared between wild and domesticated populations. Moreover, genome-wide estimates of nucleotide diversity were virtually identical in F. chiloensis,F. virginiana, and F. × ananassa (π = 0.0059–0.0060). We found, however, that nucleotide diversity and heterozygosity were significantly lower in modern F. × ananassa populations that have experienced significant genetic gains and have produced numerous agriculturally important cultivars.

          Related collections

          Most cited references113

          • Record: found
          • Abstract: not found
          • Article: not found

          Cutadapt removes adapter sequences from high-throughput sequencing reads

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

            Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS--the 1000 Genome pilot alone includes nearly five terabases--make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Detecting the number of clusters of individuals using the software structure: a simulation study

              The identification of genetically homogeneous groups of individuals is a long standing issue in population genetics. A recent Bayesian algorithm implemented in the software STRUCTURE allows the identification of such groups. However, the ability of this algorithm to detect the true number of clusters (K) in a sample of individuals when patterns of dispersal among populations are not homogeneous has not been tested. The goal of this study is to carry out such tests, using various dispersal scenarios from data generated with an individual-based model. We found that in most cases the estimated 'log probability of data' does not provide a correct estimation of the number of clusters, K. However, using an ad hoc statistic DeltaK based on the rate of change in the log probability of data between successive K values, we found that STRUCTURE accurately detects the uppermost hierarchical level of structure for the scenarios we tested. As might be expected, the results are sensitive to the type of genetic marker used (AFLP vs. microsatellite), the number of loci scored, the number of populations sampled, and the number of individuals typed in each sample.
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Mol Biol Evol
                Mol Biol Evol
                molbev
                Molecular Biology and Evolution
                Oxford University Press
                0737-4038
                1537-1719
                June 2021
                28 January 2021
                28 January 2021
                : 38
                : 6
                : 2285-2305
                Affiliations
                [1 ] Department of Plant Sciences, University of California , Davis, Davis, CA 95616, USA
                [2 ] IFAS Gulf Coast Research and Education Center, Department of Horticulture, University of Florida , Wimauma, FL 33598, USA
                [3 ] USDA-ARS, National Clonal Germplasm Repository , Corvallis, OR 92182, USA
                [4 ] Department of Horticultural Science, Michigan State University , East Lansing, MI 48824, USA
                Author notes
                Corresponding author: E-mail: sjknapp@ 123456ucdavis.edu
                Article
                msab024
                10.1093/molbev/msab024
                8136507
                33507311
                613e6639-0780-46de-89c4-c33e5898a1b2
                © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

                History
                Page count
                Pages: 21
                Funding
                Funded by: USDA NIFA Postdoctoral Research Fellowship;
                Award ID: #2018-67012-27980
                Funded by: USDA-ARS Small Fruit Crop Germplasm Committee;
                Award ID: #2072-21000-049-06-S
                Funded by: USDA, DOI 10.13039/100000199;
                Funded by: NIFA, DOI 10.13039/100005825;
                Award ID: #1009804
                Award ID: #2020-67013-30870
                Funded by: National Science Foundation, DOI 10.13039/100000001;
                Award ID: #2029959
                Categories
                Discoveries
                AcademicSubjects/SCI01130
                AcademicSubjects/SCI01180

                Molecular biology
                fragaria,polyploid,genome evolution,selection,nucleotide diversity,linkage disequilibrium

                Comments

                Comment on this article