6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome-wide association studies and genomic selection assays made in a large sample of cacao ( Theobroma cacao L.) germplasm reveal significant marker-trait associations and good predictive value for improving yield potential

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A genome-wide association study (GWAS) was undertaken to unravel marker-trait associations (MTAs) between SNP markers and phenotypic traits. It involved a subset of 421 cacao accessions from the large and diverse collection conserved ex situ at the International Cocoa Genebank Trinidad. A Mixed Linear Model (MLM) in TASSEL was used for the GWAS and followed by confirmatory analyses using GAPIT FarmCPU. An average linkage disequilibrium ( r 2 ) of 0.10 at 5.2 Mb was found across several chromosomes. Seventeen significant ( P ≤ 8.17 × 10 −5 (–log10 (p) = 4.088)) MTAs of interest, including six that pertained to yield-related traits, were identified using TASSEL MLM. The latter accounted for 5 to 17% of the phenotypic variation expressed. The highly significant association ( P ≤ 8.17 × 10 −5) between seed length to width ratio and TcSNP 733 on chromosome 5 was verified with FarmCPU ( P ≤ 1.12 × 10 −8). Fourteen MTAs were common to both the TASSEL and FarmCPU models at P ≤ 0.003. The most significant yield-related MTAs involved seed number and seed length on chromosome 7 ( P ≤ 1.15 × 10− 14 and P ≤ 6.75 × 10− 05, respectively) and seed number on chromosome 1 ( P ≤ 2.38 × 10 −05), based on the TASSEL MLM. It was noteworthy that seed length, seed length to width ratio and seed number were associated with markers at different loci, indicating their polygenic nature. Approximately 40 candidate genes that encode embryo and seed development, protein synthesis, carbohydrate transport and lipid biosynthesis and transport were identified in the flanking regions of the significantly associated SNPs and in linkage disequilibrium with them. A significant association of fruit surface anthocyanin intensity co-localised with MYB-related protein 308 on chromosome 4. Testing of a genomic selection approach revealed good predictive value (genomic estimated breeding values (GEBV)) for economic traits such as seed number (GEBV = 0.611), seed length (0.6199), seed width (0.5435), seed length to width ratio (0.5503), seed/cotyledon mass (0.6014) and ovule number (0.6325). The findings of this study could facilitate genomic selection and marker-assisted breeding of cacao thereby expediting improvement in the yield potential of cacao planting material.

          Related collections

          Most cited references87

          • Record: found
          • Abstract: found
          • Article: not found

          Detecting the number of clusters of individuals using the software structure: a simulation study

          The identification of genetically homogeneous groups of individuals is a long standing issue in population genetics. A recent Bayesian algorithm implemented in the software STRUCTURE allows the identification of such groups. However, the ability of this algorithm to detect the true number of clusters (K) in a sample of individuals when patterns of dispersal among populations are not homogeneous has not been tested. The goal of this study is to carry out such tests, using various dispersal scenarios from data generated with an individual-based model. We found that in most cases the estimated 'log probability of data' does not provide a correct estimation of the number of clusters, K. However, using an ad hoc statistic DeltaK based on the rate of change in the log probability of data between successive K values, we found that STRUCTURE accurately detects the uppermost hierarchical level of structure for the scenarios we tested. As might be expected, the results are sensitive to the type of genetic marker used (AFLP vs. microsatellite), the number of loci scored, the number of populations sampled, and the number of individuals typed in each sample.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Inference of Population Structure Using Multilocus Genotype Data

            We describe a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations. We assume a model in which there are K populations (where K may be unknown), each of which is characterized by a set of allele frequencies at each locus. Individuals in the sample are assigned (probabilistically) to populations, or jointly to two or more populations if their genotypes indicate that they are admixed. Our model does not assume a particular mutation process, and it can be applied to most of the commonly used genetic markers, provided that they are not closely linked. Applications of our method include demonstrating the presence of population structure, assigning individuals to populations, studying hybrid zones, and identifying migrants and admixed individuals. We show that the method can produce highly accurate assignments using modest numbers of loci—e.g., seven microsatellite loci in an example using genotype data from an endangered bird species. The software used for this article is available from http://www.stats.ox.ac.uk/~pritch/home.html.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              TASSEL: software for association mapping of complex traits in diverse samples.

              Association analyses that exploit the natural diversity of a genome to map at very high resolutions are becoming increasingly important. In most studies, however, researchers must contend with the confounding effects of both population and family structure. TASSEL (Trait Analysis by aSSociation, Evolution and Linkage) implements general linear model and mixed linear model approaches for controlling population and family structure. For result interpretation, the program allows for linkage disequilibrium statistics to be calculated and visualized graphically. Database browsing and data importation is facilitated by integrated middleware. Other features include analyzing insertions/deletions, calculating diversity statistics, integration of phenotypic and genotypic data, imputing missing data and calculating principal components.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Data curationRole: MethodologyRole: Writing – review & editing
                Role: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: Writing – review & editing
                Role: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: ValidationRole: Writing – original draftRole: Writing – review & editing
                Role: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: Writing – review & editing
                Role: Data curationRole: InvestigationRole: MethodologyRole: Writing – review & editing
                Role: SupervisionRole: Writing – original draftRole: Writing – review & editing
                Role: Formal analysisRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: MethodologyRole: Project administrationRole: SupervisionRole: ValidationRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS One
                plos
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                6 October 2022
                2022
                : 17
                : 10
                : e0260907
                Affiliations
                [1 ] Cocoa Research Centre, The University of The West Indies, St. Augustine, Trinidad and Tobago
                [2 ] Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), UMR AGAP Institute, Montpellier, France
                [3 ] AGAP Institute, CIRAD, INRAE, Institut Agro, University of Montpellier, Montpellier, France
                [4 ] Department of Food Production, Faculty of Food and Agriculture, The University of the West Indies, St Augustine, Trinidad and Tobago
                [5 ] Faculty of Food and Agriculture, The University of the West Indies, St Augustine, Trinidad and Tobago
                CSIR-Institute of Himalayan Bioresource Technology, INDIA
                Author notes

                Competing Interests: The authors declare that they have no competing interests.

                Author information
                https://orcid.org/0000-0002-4345-4358
                Article
                PONE-D-21-36485
                10.1371/journal.pone.0260907
                9536643
                36201531
                b6293eb8-52d2-41c2-8ae2-60a9c7e505ad
                © 2022 Bekele et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 17 November 2021
                : 13 September 2022
                Page count
                Figures: 7, Tables: 5, Pages: 35
                Funding
                Funded by: Government of Trinidad and Tobago
                Award Recipient :
                Funded by: Cocoa Research Association
                Award Recipient :
                Funded by: CIRAD
                Award Recipient :
                Financial support from the Government of Trinidad and Tobago and the Cocoa Research Association, UK, and CIRAD, France that facilitated collation of the phenotypic and genotypic data, respectively, is gratefully acknowledged. However, the study design, conduct of this research and preparation of the manuscript were not influenced by the funding agencies.
                Categories
                Research Article
                Biology and Life Sciences
                Plant Science
                Plant Anatomy
                Seeds
                Biology and life sciences
                Biochemistry
                Proteins
                DNA-binding proteins
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Human Genetics
                Genome-Wide Association Studies
                Biology and Life Sciences
                Genetics
                Single Nucleotide Polymorphisms
                Biology and Life Sciences
                Plant Science
                Plant Anatomy
                Fruit and Seed Anatomy
                Plant Embryo Anatomy
                Plant Cotyledon
                Biology and Life Sciences
                Plant Science
                Plant Anatomy
                Plant Embryo Anatomy
                Plant Cotyledon
                Biology and Life Sciences
                Developmental Biology
                Embryogenesis
                Plant Embryogenesis
                Plant Embryo Anatomy
                Plant Cotyledon
                Biology and Life Sciences
                Developmental Biology
                Plant Growth and Development
                Plant Development
                Plant Embryogenesis
                Plant Embryo Anatomy
                Plant Cotyledon
                Biology and Life Sciences
                Plant Science
                Plant Growth and Development
                Plant Development
                Plant Embryogenesis
                Plant Embryo Anatomy
                Plant Cotyledon
                Biology and Life Sciences
                Genetics
                Phenotypes
                Biology and Life Sciences
                Cell Biology
                Cellular Structures and Organelles
                Cell Membranes
                Membrane Proteins
                Biology and Life Sciences
                Developmental Biology
                Embryology
                Embryo Development
                Custom metadata
                DOIs that include the data needed to replicate the study S2 Table. Phenotypic data for 346 cacao accessions fully phenotyped and used to generate descriptive statistics http://dx.doi.org/10.13140/RG.2.2.29601.48484 S10 Table. Genotype data used in GWAS http://dx.doi.org/10.13140/RG.2.2.25826.61123.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article