23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Training set optimization under population structure in genomic selection

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Key message

          Population structure must be evaluated before optimization of the training set population. Maximizing the phenotypic variance captured by the training set is important for optimal performance.

          Abstract

          The optimization of the training set (TRS) in genomic selection has received much interest in both animal and plant breeding, because it is critical to the accuracy of the prediction models. In this study, five different TRS sampling algorithms, stratified sampling, mean of the coefficient of determination (CDmean), mean of predictor error variance (PEVmean), stratified CDmean (StratCDmean) and random sampling, were evaluated for prediction accuracy in the presence of different levels of population structure. In the presence of population structure, the most phenotypic variation captured by a sampling method in the TRS is desirable. The wheat dataset showed mild population structure, and CDmean and stratified CDmean methods showed the highest accuracies for all the traits except for test weight and heading date. The rice dataset had strong population structure and the approach based on stratified sampling showed the highest accuracies for all traits. In general, CDmean minimized the relationship between genotypes in the TRS, maximizing the relationship between TRS and the test set. This makes it suitable as an optimization criterion for long-term selection. Our results indicated that the best selection criterion used to optimize the TRS seems to depend on the interaction of trait architecture and population structure.

          Electronic supplementary material

          The online version of this article (doi:10.1007/s00122-014-2418-4) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references27

          • Record: found
          • Abstract: found
          • Article: not found

          Best linear unbiased estimation and prediction under a selection model.

          Mixed linear models are assumed in most animal breeding applications. Convenient methods for computing BLUE of the estimable linear functions of the fixed elements of the model and for computing best linear unbiased predictions of the random elements of the model have been available. Most data available to animal breeders, however, do not meet the usual requirements of random sampling, the problem being that the data arise either from selection experiments or from breeders' herds which are undergoing selection. Consequently, the usual methods are likely to yield biased estimates and predictions. Methods for dealing with such data are presented in this paper.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Recovery of inter-block information when block sizes are unequal

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Mapping genes for complex traits in domestic animals and their use in breeding programmes.

              Genome-wide panels of SNPs have recently been used in domestic animal species to map and identify genes for many traits and to select genetically desirable livestock. This has led to the discovery of the causal genes and mutations for several single-gene traits but not for complex traits. However, the genetic merit of animals can still be estimated by genomic selection, which uses genome-wide SNP panels as markers and statistical methods that capture the effects of large numbers of SNPs simultaneously. This approach is expected to double the rate of genetic improvement per year in many livestock systems.
                Bookmark

                Author and article information

                Contributors
                ji66@cornell.edu
                Journal
                Theor Appl Genet
                Theor. Appl. Genet
                TAG. Theoretical and Applied Genetics. Theoretische Und Angewandte Genetik
                Springer Berlin Heidelberg (Berlin/Heidelberg )
                0040-5752
                1432-2242
                1 November 2014
                1 November 2014
                2015
                : 128
                : 145-158
                Affiliations
                [ ]Cornell University, Ithaca, NY USA
                [ ]Hard Winter Wheat Genetics Research Unit, USDA-ARS and Department of Agronomy, Kansas State University, 4011 Throckmorton, Manhattan, KS 66506 USA
                [ ]Limagrain Europe, CS3911, 63720 Chappes, France
                Author notes

                Communicated by Chris Carolin Schön.

                Article
                2418
                10.1007/s00122-014-2418-4
                4282691
                25367380
                4b12479d-b6d0-4a76-a573-190e45c10d84
                © The Author(s) 2014

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

                History
                : 7 May 2014
                : 12 October 2014
                Categories
                Original Paper
                Custom metadata
                © Springer-Verlag Berlin Heidelberg 2015

                Genetics
                Genetics

                Comments

                Comment on this article