18
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      High contiguity Arabidopsis thaliana genome assembly with a single nanopore flow cell

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The handheld Oxford Nanopore MinION sequencer generates ultra-long reads with minimal cost and time requirements, which makes sequencing genomes at the bench feasible. Here, we sequence the gold standard Arabidopsis thaliana genome (KBS-Mac-74 accession) on the bench with the MinION sequencer, and assemble the genome using typical consumer computing hardware (4 Cores, 16 Gb RAM) into chromosome arms (62 contigs with an N50 length of 12.3 Mb). We validate the contiguity and quality of the assembly with two independent single-molecule technologies, Bionano optical genome maps and Pacific Biosciences Sequel sequencing. The new A. thaliana KBS-Mac-74 genome enables resolution of a quantitative trait locus that had previously been recalcitrant to a Sanger-based BAC sequencing approach. In summary, we demonstrate that even when the purpose is to understand complex structural variation at a single region of the genome, complete genome assembly is becoming the simplest way to achieve this goal.

          Abstract

          Long-read sequencing technologies facilitate efficient and high quality genome assembly. Here Michael et al. achieve a fast reference assembly for Arabidopsis thaliana KBS-Mac-74 accession using the handheld Oxford Nanopore MinION sequencer and consumer computing hardware, and demonstrate its usefulness in resolving complex structural variation.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: found
          • Article: not found

          Whole-genome sequencing of multiple Arabidopsis thaliana populations.

          The plant Arabidopsis thaliana occurs naturally in many different habitats throughout Eurasia. As a foundation for identifying genetic variation contributing to adaptation to diverse environments, a 1001 Genomes Project to sequence geographically diverse A. thaliana strains has been initiated. Here we present the first phase of this project, based on population-scale sequencing of 80 strains drawn from eight regions throughout the species' native range. We describe the majority of common small-scale polymorphisms as well as many larger insertions and deletions in the A. thaliana pan-genome, their effects on gene function, and the patterns of local and global linkage among these variants. The action of processes other than spontaneous mutation is identified by comparing the spectrum of mutations that have accumulated since A. thaliana diverged from its closest relative 10 million years ago with the spectrum observed in the laboratory. Recent species-wide selective sweeps are rare, and potentially deleterious mutations are more common in marginal populations.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Multiple reference genomes and transcriptomes for Arabidopsis thaliana

            Genetic differences between Arabidopsis thaliana accessions underlie the plant’s extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana.

              The genomes of individuals from the same species vary in sequence as a result of different evolutionary processes. To examine the patterns of, and the forces shaping, sequence variation in Arabidopsis thaliana, we performed high-density array resequencing of 20 diverse strains (accessions). More than 1 million nonredundant single-nucleotide polymorphisms (SNPs) were identified at moderate false discovery rates (FDRs), and approximately 4% of the genome was identified as being highly dissimilar or deleted relative to the reference genome sequence. Patterns of polymorphism are highly nonrandom among gene families, with genes mediating interaction with the biotic environment having exceptional polymorphism levels. At the chromosomal scale, regional variation in polymorphism was readily apparent. A scan for recent selective sweeps revealed several candidate regions, including a notable example in which almost all variation was removed in a 500-kilobase window. Analyzing the polymorphisms we describe in larger sets of accessions will enable a detailed understanding of forces shaping population-wide sequence variation in A. thaliana.
                Bookmark

                Author and article information

                Contributors
                tmichael@jcvi.org
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                7 February 2018
                7 February 2018
                2018
                : 9
                : 541
                Affiliations
                [1 ]GRID grid.469946.0, J. Craig Venter Institute, ; La Jolla, CA 92037 USA
                [2 ]ISNI 0000 0001 0662 7144, GRID grid.250671.7, Genomic Analysis Laboratory, , The Salk Institute for Biological Studies, ; La Jolla, CA 92037 USA
                [3 ]ISNI 0000 0001 1014 8330, GRID grid.419495.4, Max Planck Institute for Developmental Biology, ; 72076 Tübingen, Germany
                [4 ]ISNI 0000 0004 4910 6535, GRID grid.460789.4, Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, , Université Paris-Saclay, ; 78000 Versailles, France
                [5 ]ISNI 0000 0001 0662 7144, GRID grid.250671.7, Howard Hughes Medical Institute, , The Salk Institute for Biological Studies, ; La Jolla, CA 92037 USA
                [6 ]ISNI 0000 0004 0466 8542, GRID grid.418554.9, Present Address: Monsanto Company, ; Creve Coeur, MO 63141 USA
                Author information
                http://orcid.org/0000-0001-6272-2875
                http://orcid.org/0000-0003-3717-0137
                http://orcid.org/0000-0002-2114-7963
                Article
                3016
                10.1038/s41467-018-03016-2
                5803254
                29317637
                5cea0d2a-60ae-45c5-b1b4-eace38c2047c
                © The Author(s) 2018

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 23 June 2017
                : 11 January 2018
                Categories
                Article
                Custom metadata
                © The Author(s) 2018

                Uncategorized
                Uncategorized

                Comments

                Comment on this article