70
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome-Assisted Prediction of Quantitative Traits Using the R Package sommer

      research-article
      *
      PLoS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Most traits of agronomic importance are quantitative in nature, and genetic markers have been used for decades to dissect such traits. Recently, genomic selection has earned attention as next generation sequencing technologies became feasible for major and minor crops. Mixed models have become a key tool for fitting genomic selection models, but most current genomic selection software can only include a single variance component other than the error, making hybrid prediction using additive, dominance and epistatic effects unfeasible for species displaying heterotic effects. Moreover, Likelihood-based software for fitting mixed models with multiple random effects that allows the user to specify the variance-covariance structure of random effects has not been fully exploited. A new open-source R package called sommer is presented to facilitate the use of mixed models for genomic selection and hybrid prediction purposes using more than one variance component and allowing specification of covariance structures. The use of sommer for genomic prediction is demonstrated through several examples using maize and wheat genotypic and phenotypic data. At its core, the program contains three algorithms for estimating variance components: Average information (AI), Expectation-Maximization (EM) and Efficient Mixed Model Association (EMMA). Kernels for calculating the additive, dominance and epistatic relationship matrices are included, along with other useful functions for genomic analysis. Results from sommer were comparable to other software, but the analysis was faster than Bayesian counterparts in the magnitude of hours to days. In addition, ability to deal with missing data, combined with greater flexibility and speed than other REML-based software was achieved by putting together some of the most efficient algorithms to fit models in a gentle environment such as R.

          Related collections

          Most cited references8

          • Record: found
          • Abstract: found
          • Article: not found

          Best linear unbiased estimation and prediction under a selection model.

          Mixed linear models are assumed in most animal breeding applications. Convenient methods for computing BLUE of the estimable linear functions of the fixed elements of the model and for computing best linear unbiased predictions of the random elements of the model have been available. Most data available to animal breeders, however, do not meet the usual requirements of random sampling, the problem being that the data arise either from selection experiments or from breeders' herds which are undergoing selection. Consequently, the usual methods are likely to yield biased estimates and predictions. Methods for dealing with such data are presented in this paper.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Marker-assisted selection: an approach for precision plant breeding in the twenty-first century.

            DNA markers have enormous potential to improve the efficiency and precision of conventional plant breeding via marker-assisted selection (MAS). The large number of quantitative trait loci (QTLs) mapping studies for diverse crops species have provided an abundance of DNA marker-trait associations. In this review, we present an overview of the advantages of MAS and its most widely used applications in plant breeding, providing examples from cereal crops. We also consider reasons why MAS has had only a small impact on plant breeding so far and suggest ways in which the potential of MAS can be realized. Finally, we discuss reasons why the greater adoption of MAS in the future is inevitable, although the extent of its use will depend on available resources, especially for orphan crops, and may be delayed in less-developed countries. Achieving a substantial impact on crop improvement by MAS represents the great challenge for agricultural scientists in the next few decades.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Mapping genes for complex traits in domestic animals and their use in breeding programmes.

              Genome-wide panels of SNPs have recently been used in domestic animal species to map and identify genes for many traits and to select genetically desirable livestock. This has led to the discovery of the causal genes and mutations for several single-gene traits but not for complex traits. However, the genetic merit of animals can still be estimated by genomic selection, which uses genome-wide SNP panels as markers and statistical methods that capture the effects of large numbers of SNPs simultaneously. This approach is expected to double the rate of genetic improvement per year in many livestock systems.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                6 June 2016
                2016
                : 11
                : 6
                : e0156744
                Affiliations
                [001]Department of Horticulture, University of Wisconsin, Madison, Wisconsin, Unites States of America
                Institute of Genetics and Developmental Biology, CHINA
                Author notes

                Competing Interests: The author has declared that no competing interests exist.

                Conceived and designed the experiments: GCP. Performed the experiments: GCP. Analyzed the data: GCP. Contributed reagents/materials/analysis tools: GCP. Wrote the paper: GCP.

                Author information
                http://orcid.org/0000-0002-7194-3837
                Article
                PONE-D-16-11570
                10.1371/journal.pone.0156744
                4894563
                27271781
                a090ec5e-6776-4676-a117-d53a4df47f99
                © 2016 Giovanny Covarrubias-Pazaran

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 20 March 2016
                : 18 May 2016
                Page count
                Figures: 4, Tables: 3, Pages: 15
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/501100003141, Consejo Nacional de Ciencia y Tecnología;
                Award Recipient :
                CONACYT (Mexico) funded PhD studies of GCP, but the funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Genetics
                Genomics
                Plant Genomics
                Biology and Life Sciences
                Biotechnology
                Plant Biotechnology
                Plant Genomics
                Biology and Life Sciences
                Plant Science
                Plant Biotechnology
                Plant Genomics
                Biology and Life Sciences
                Genetics
                Plant Genetics
                Plant Genomics
                Biology and Life Sciences
                Plant Science
                Plant Genetics
                Plant Genomics
                Physical Sciences
                Mathematics
                Applied Mathematics
                Algorithms
                Research and Analysis Methods
                Simulation and Modeling
                Algorithms
                Biology and Life Sciences
                Genetics
                Genomics
                Structural Genomics
                Biology and Life Sciences
                Agriculture
                Crop Science
                Crops
                Cereal Crops
                Wheat
                Biology and Life Sciences
                Organisms
                Plants
                Grasses
                Wheat
                Physical Sciences
                Mathematics
                Probability Theory
                Random Variables
                Covariance
                Biology and Life Sciences
                Agriculture
                Crop Science
                Crops
                Cereal Crops
                Maize
                Biology and Life Sciences
                Organisms
                Plants
                Grasses
                Maize
                Research and Analysis Methods
                Model Organisms
                Plant and Algal Models
                Maize
                Biology and Life Sciences
                Genetics
                Genomics
                Animal Genomics
                Amphibian Genomics
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Custom metadata
                All genotypic and phenotypic information used in this research are freely accessible and can be found in the R package documentation. The maize data can be accessed as data (cornHybrid), and data (wheatLines). The script for all analysis can be found in S1 File.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article