32
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Summary

          Until recently, achieving a reference‐quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5‐Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high‐confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.

          Significance Statement

          This new release of bread wheat cv. Chinese Spring reference genome sequence, IWGSC RefSeq v2.1, features correction of assembly errors affecting approximately 10% of the prior IWGSC RefSeq v1.0 release using genome‐wide optical maps and filling of gaps with single‐molecule long‐reads as well as incorporating re‐annotation of TEs and re‐computation of gene coordinates. These refinements enhance the sequence utility for breeding and research applications.

          Related collections

          Most cited references47

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Basic local alignment search tool.

            A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score. Recent mathematical results on the stochastic properties of MSP scores allow an analysis of the performance of this method as well as the statistical significance of alignments it generates. The basic algorithm is simple and robust; it can be implemented in a number of ways and applied in a variety of contexts including straightforward DNA and protein sequence database searches, motif searches, gene identification searches, and in the analysis of multiple regions of similarity in long DNA sequences. In addition to its flexibility and tractability to mathematical analysis, BLAST is an order of magnitude faster than existing sequence comparison tools of comparable sensitivity.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Cutadapt removes adapter sequences from high-throughput sequencing reads

                Bookmark

                Author and article information

                Contributors
                jdvorak@ucdavis.edu
                mcluo@ucdavis.edu
                Journal
                Plant J
                Plant J
                10.1111/(ISSN)1365-313X
                TPJ
                The Plant Journal
                John Wiley and Sons Inc. (Hoboken )
                0960-7412
                1365-313X
                16 May 2021
                July 2021
                : 107
                : 1 ( doiID: 10.1111/tpj.v107.1 )
                : 303-314
                Affiliations
                [ 1 ] Department of Plant Sciences University of California Davis CA 95616 USA
                [ 2 ] GDEC Université Clermont Auvergne INRAE Clermont‐Ferrand 63000 France
                [ 3 ] Centre for AgriBioscience Agriculture Victoria AgriBio Bundoora VIC 3083 Australia
                [ 4 ] International Wheat Genome Sequencing Consortium Eau Claire WI 54701 USA
                [ 5 ] Crop Improvement and Genetics Research Unit USDA‐ARS Albany CA 94710 USA
                [ 6 ] Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Seeland Germany
                Author notes
                [*] [* ] For correspondence (e‐mail mcluo@ 123456ucdavis.edu ; jdvorak@ 123456ucdavis.edu ).

                [ * ]

                These authors contributed to this work.

                Author information
                https://orcid.org/0000-0003-1788-7288
                https://orcid.org/0000-0002-8784-4913
                https://orcid.org/0000-0001-6373-6013
                https://orcid.org/0000-0002-9744-5887
                Article
                TPJ15289
                10.1111/tpj.15289
                8360199
                33893684
                2ae043b2-09cc-410f-bde2-226ca98c0604
                © 2021 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd

                This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

                History
                : 12 April 2021
                : 17 November 2020
                : 19 April 2021
                Page count
                Figures: 5, Tables: 6, Pages: 12, Words: 9436
                Funding
                Funded by: US National Science Foundation
                Award ID: IOS‐1929053
                Funded by: USDA Agricultural Research Service CRIS
                Award ID: 2030‐21430‐014‐00‐D
                Funded by: French Government
                Award ID: ANR‐10‐BTBR‐03
                Categories
                Resource
                Resource
                Custom metadata
                2.0
                July 2021
                Converter:WILEY_ML3GV2_TO_JATSPMC version:6.0.5 mode:remove_FC converted:12.08.2021

                Plant science & Botany
                direct label and stain,pseudomolecule,transposable element,gene collinearity,hi‐c

                Comments

                Comment on this article