0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      High-quality chromosome-scale genomes facilitate effective identification of large structural variations in hot and sweet peppers

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Pepper ( Capsicum annuum) is an important vegetable crop that has been subjected to intensive breeding, resulting in limited genetic diversity, especially for sweet peppers. Previous studies have reported pepper draft genome assemblies using short read sequencing, but their capture of the extent of large structural variants (SVs), such as presence–absence variants (PAVs), inversions, and copy-number variants (CNVs) in the complex pepper genome falls short. In this study, we sequenced the genomes of representative sweet and hot pepper accessions by long-read and/or linked-read methods and advanced scaffolding technologies. First, we developed a high-quality reference genome for the sweet pepper cultivar ‘Dempsey’ and then used the reference genome to identify SVs in 11 other pepper accessions and constructed a graph-based pan-genome for pepper. We annotated an average of 42 972 gene families in each pepper accession, defining a set of 19 662 core and 23 115 non-core gene families. The new pepper pan-genome includes informative variants, 222 159 PAVs, 12 322 CNVs, and 16 032 inversions. Pan-genome analysis revealed PAVs associated with important agricultural traits, including potyvirus resistance, fruit color, pungency, and pepper fruit orientation. Comparatively, a large number of genes are affected by PAVs, which is positively correlated with the high frequency of transposable elements (TEs), indicating TEs play a key role in shaping the genomic landscape of peppers. The datasets presented herein provide a powerful new genomic resource for genetic analysis and genome-assisted breeding for pepper improvement.

          Related collections

          Most cited references66

          • Record: found
          • Abstract: found
          • Article: not found

          TBtools - an integrative toolkit developed for interactive analyses of big biological data

          The rapid development of high-throughput sequencing techniques has led biology into the big-data era. Data analyses using various bioinformatics tools rely on programming and command-line environments, which are challenging and time-consuming for most wet-lab biologists. Here, we present TBtools (a Toolkit for Biologists integrating various biological data-handling tools), a stand-alone software with a user-friendly interface. The toolkit incorporates over 130 functions, which are designed to meet the increasing demand for big-data analyses, ranging from bulk sequence processing to interactive data visualization. A wide variety of graphs can be prepared in TBtools using a new plotting engine ("JIGplot") developed to maximize their interactive ability; this engine allows quick point-and-click modification of almost every graphic feature. TBtools is platform-independent software that can be run under all operating systems with Java Runtime Environment 1.6 or newer. It is freely available to non-commercial users at https://github.com/CJ-Chen/TBtools/releases.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fast model-based estimation of ancestry in unrelated individuals.

            Population stratification has long been recognized as a confounding factor in genetic association studies. Estimated ancestries, derived from multi-locus genotype data, can be used to perform a statistical correction for population stratification. One popular technique for estimation of ancestry is the model-based approach embodied by the widely applied program structure. Another approach, implemented in the program EIGENSTRAT, relies on Principal Component Analysis rather than model-based estimation and does not directly deliver admixture fractions. EIGENSTRAT has gained in popularity in part owing to its remarkable speed in comparison to structure. We present a new algorithm and a program, ADMIXTURE, for model-based estimation of ancestry in unrelated individuals. ADMIXTURE adopts the likelihood model embedded in structure. However, ADMIXTURE runs considerably faster, solving problems in minutes that take structure hours. In many of our experiments, we have found that ADMIXTURE is almost as fast as EIGENSTRAT. The runtime improvements of ADMIXTURE rely on a fast block relaxation scheme using sequential quadratic programming for block updates, coupled with a novel quasi-Newton acceleration of convergence. Our algorithm also runs faster and with greater accuracy than the implementation of an Expectation-Maximization (EM) algorithm incorporated in the program FRAPPE. Our simulations show that ADMIXTURE's maximum likelihood estimates of the underlying admixture coefficients and ancestral allele frequencies are as accurate as structure's Bayesian estimates. On real-world data sets, ADMIXTURE's estimates are directly comparable to those from structure and EIGENSTRAT. Taken together, our results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              OrthoFinder: phylogenetic orthology inference for comparative genomics

              Here, we present a major advance of the OrthoFinder method. This extends OrthoFinder’s high accuracy orthogroup inference to provide phylogenetic inference of orthologs, rooted gene trees, gene duplication events, the rooted species tree, and comparative genomics statistics. Each output is benchmarked on appropriate real or simulated datasets, and where comparable methods exist, OrthoFinder is equivalent to or outperforms these methods. Furthermore, OrthoFinder is the most accurate ortholog inference method on the Quest for Orthologs benchmark test. Finally, OrthoFinder’s comprehensive phylogenetic analysis is achieved with equivalent speed and scalability to the fastest, score-based heuristic methods. OrthoFinder is available at https://github.com/davidemms/OrthoFinder.
                Bookmark

                Author and article information

                Contributors
                Journal
                Hortic Res
                Hortic Res
                hr
                Horticulture Research
                Oxford University Press
                2662-6810
                2052-7276
                2022
                19 September 2022
                19 September 2022
                : 9
                : uhac210
                Affiliations
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Vegetable Research Division, National Institute of Horticultural and Herbal Science , Rural Development Administration, Jeonju 55365, Republic of Korea
                National Agrobiodiversity Center, National Institute of Agricultural Sciences , Rural Development Administration, Jeonju 54874, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology , Daejeon 34141, Republic of Korea
                Genomics Division, National Institute of Agricultural Sciences , Rural Development Administration, Jeonju 54874, Republic of Korea
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Department of Plant Sciences, University of California , Davis, CA 95616, USA
                Department of Plant Sciences, University of California , Davis, CA 95616, USA
                NRGene, 5 Golda Meir St. , Ness Ziona 7403649, Israel
                Top Seeds International Ltd. Moshav Sharona , 1523200, Israel
                BASF’s vegetable seeds business. Napoleonsweg 152 6083 AB Nunhem , the Netherlands
                Pilpel Seeds Ltd. Nes Ziona , 7414001, Israel
                Institute of Plant Science , Agricultural Research Organization, The Volcani Center, Rishon LeZion, Israel
                Department of Agriculture, Forestry and Bioresources, Research Institute of Agriculture and Life Sciences, Plant Genomics Breeding Institute, College of Agriculture and Life Sciences, Seoul National University , Seoul 08826, Republic of Korea
                Author notes

                These authors contributed equally to this work.

                Article
                uhac210
                10.1093/hr/uhac210
                9715575
                36467270
                f8a50007-6425-47a6-91dd-93087cfa6e4d
                © The Author(s) 2022. Published by Oxford University Press on behalf of Nanjing Agricultural University.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 15 April 2022
                : 13 September 2022
                : 01 December 2022
                Page count
                Pages: 13
                Categories
                Article
                AcademicSubjects/SCI01210
                AcademicSubjects/SCI01140

                Comments

                Comment on this article