5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Development of genomic resources for Rhodes grass ( Chloris gayana), draft genome and annotated variant discovery

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genomic resources for grasses, especially warm-season grasses are limited despite their commercial and environmental importance. Here, we report the first annotated draft whole genome sequence for diploid Rhodes grass ( Chloris gayana), a tropical C4 species. Generated using long read nanopore sequencing and assembled using the Flye software package, the assembled genome is 603 Mbp in size and comprises 5,233 fragments that were annotated using the GenSas pipeline. The annotated genome has 46,087 predicted genes corresponding to 92.0% of the expected genomic content present via BUSCO analysis. Gene ontology terms and repetitive elements are identified and discussed. An additional 94 individual plant genotypes originating from three diploid and two tetraploid Rhodes grass cultivars were short-read whole genome resequenced (WGR) to generate a single nucleotide polymorphism (SNP) resource for the species that can be used to elucidate inter- and intra-cultivar relationships across both ploidy levels. A total of 75,777 high quality SNPs were used to generate a phylogenetic tree, highlighting the diversity present within the cultivars which agreed with the known breeding history. Differentiation was observed between diploid and tetraploid cultivars. The WGR data were also used to provide insights into the nature and evolution of the tetraploid status of the species, with results largely agreeing with the published literature that the tetraploids are autotetraploid.

          Related collections

          Most cited references63

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Fast and accurate short read alignment with Burrows–Wheeler transform

          Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ∼10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Welcome to the Tidyverse

              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              The variant call format and VCFtools

              Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: rd@sanger.ac.uk
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Plant Sci
                Front Plant Sci
                Front. Plant Sci.
                Frontiers in Plant Science
                Frontiers Media S.A.
                1664-462X
                04 September 2023
                2023
                : 14
                : 1239290
                Affiliations
                [1] 1 Agriculture Victoria Research, AgriBio, The Centre for AgriBioscience , Bundoora, VIC, Australia
                [2] 2 School of Applied Systems Biology, La Trobe University , Bundoora, VIC, Australia
                Author notes

                Edited by: Miloslava Fojtova, Masaryk University, Czechia

                Reviewed by: Chris S. Jones, International Livestock Research Institute, Ethiopia; Matej Lexa, Masaryk University, Czechia; Jan Skalák, Central European Institute of Technology (CEITEC), Czechia

                *Correspondence: Kellie Maybery-Reupert, kellie.maybery-reupert@ 123456agriculture.vic.gov.au
                Article
                10.3389/fpls.2023.1239290
                10507473
                37731974
                dcef39a6-1583-4702-9881-d109e4472102
                Copyright © 2023 Maybery-Reupert, Isenegger, Hayden and Cogan

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 13 June 2023
                : 18 August 2023
                Page count
                Figures: 3, Tables: 4, Equations: 0, References: 66, Pages: 10, Words: 5606
                Funding
                The authors declare that this study received funding from Agriculture Victoria, Dairy Australia and The Gardiner Dairy Foundation through the DairyBio initiative and La Trobe University. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article, or the decision to submit it for publication.
                Categories
                Plant Science
                Original Research
                Custom metadata
                Plant Genetics, Epigenetics and Chromosome Biology

                Plant science & Botany
                rhodes grass,genome sequence,genetic diversity,tetraploid,orphan crop
                Plant science & Botany
                rhodes grass, genome sequence, genetic diversity, tetraploid, orphan crop

                Comments

                Comment on this article