132
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns

      brief-report

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Deblur provides a rapid and sensitive means to assess ecological patterns driven by differentiation of closely related taxa. This algorithm provides a solution to the problem of identifying real ecological differences between taxa whose amplicons differ by a single base pair, is applicable in an automated fashion to large-scale sequencing data sets, and can integrate sequencing runs collected over time.

          ABSTRACT

          High-throughput sequencing of 16S ribosomal RNA gene amplicons has facilitated understanding of complex microbial communities, but the inherent noise in PCR and DNA sequencing limits differentiation of closely related bacteria. Although many scientific questions can be addressed with broad taxonomic profiles, clinical, food safety, and some ecological applications require higher specificity. Here we introduce a novel sub-operational-taxonomic-unit (sOTU) approach, Deblur, that uses error profiles to obtain putative error-free sequences from Illumina MiSeq and HiSeq sequencing platforms. Deblur substantially reduces computational demands relative to similar sOTU methods and does so with similar or better sensitivity and specificity. Using simulations, mock mixtures, and real data sets, we detected closely related bacterial sequences with single nucleotide differences while removing false positives and maintaining stability in detection, suggesting that Deblur is limited only by read length and diversity within the amplicon sequences. Because Deblur operates on a per-sample level, it scales to modern data sets and meta-analyses. To highlight Deblur’s ability to integrate data sets, we include an interactive exploration of its application to multiple distinct sequencing rounds of the American Gut Project. Deblur is open source under the Berkeley Software Distribution (BSD) license, easily installable, and downloadable from https://github.com/biocore/deblur.

          IMPORTANCE Deblur provides a rapid and sensitive means to assess ecological patterns driven by differentiation of closely related taxa. This algorithm provides a solution to the problem of identifying real ecological differences between taxa whose amplicons differ by a single base pair, is applicable in an automated fashion to large-scale sequencing data sets, and can integrate sequencing runs collected over time.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: found
          • Article: not found

          Basic local alignment search tool.

          A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score. Recent mathematical results on the stochastic properties of MSP scores allow an analysis of the performance of this method as well as the statistical significance of alignments it generates. The basic algorithm is simple and robust; it can be implemented in a number of ways and applied in a variety of contexts including straightforward DNA and protein sequence database searches, motif searches, gene identification searches, and in the analysis of multiple regions of similarity in long DNA sequences. In addition to its flexibility and tractability to mathematical analysis, BLAST is an order of magnitude faster than existing sequence comparison tools of comparable sensitivity.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            DADA2: High resolution sample inference from Illumina amplicon data

            We present DADA2, a software package that models and corrects Illumina-sequenced amplicon errors. DADA2 infers sample sequences exactly, without coarse-graining into OTUs, and resolves differences of as little as one nucleotide. In several mock communities DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Search and clustering orders of magnitude faster than BLAST.

              Biological sequence data is accumulating rapidly, motivating the development of improved high-throughput methods for sequence classification. UBLAST and USEARCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude faster than BLAST in practical applications, though sensitivity to distant protein relationships is lower. UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters. UCLUST offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of much larger datasets. Binaries are available at no charge for non-commercial use at http://www.drive5.com/usearch.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                mSystems
                mSystems
                msys
                msys
                mSystems
                mSystems
                American Society for Microbiology (1752 N St., N.W., Washington, DC )
                2379-5077
                7 March 2017
                Mar-Apr 2017
                : 2
                : 2
                : e00191-16
                Affiliations
                [a ]Department of Pediatrics, University of California San Diego, La Jolla, California, USA
                [b ]Department of Applied Mathematics, and Interdisciplinary Quantitative Biology Graduate Program, University of Colorado Boulder, Boulder, Colorado, USA
                [c ]Department of Computer Science and Engineering, University of California San Diego, La Jolla, California, USA
                [d ]Center for Microbiome Innovation, University of California San Diego, San Diego, California, USA
                Argonne National Laboratory
                Author notes
                Address correspondence to Rob Knight, robknight@ 123456ucsd.edu .

                A.A. and D.M. contributed equally to this article.

                Citation Amir A, McDonald D, Navas-Molina JA, Kopylova E, Morton JT, Zech Xu Z, Kightley EP, Thompson LR, Hyde ER, Gonzalez A, Knight R. 2017. Deblur rapidly resolves single-nucleotide community sequence patterns. mSystems 2:e00191-16. https://doi.org/10.1128/mSystems.00191-16.

                Article
                mSystems00191-16
                10.1128/mSystems.00191-16
                5340863
                28289731
                a39b35b5-bef5-4f0f-ac6a-ba38ade475d1
                Copyright © 2017 Amir et al.

                This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license.

                History
                : 9 December 2016
                : 14 December 2016
                Page count
                supplementary-material: 6, Figures: 5, Tables: 0, Equations: 0, References: 20, Pages: 7, Words: 4269
                Funding
                Funded by: Broad Institute, Inc.
                Award ID: 5230122-5500000792
                Award Recipient : Antonio Gonzalez Award Recipient : Rob Knight
                Funded by: Laureate Institute for Brain Research
                Award ID: 131349
                Award Recipient : Rob Knight
                Funded by: Valhalla Foundation
                Award Recipient : Evguenia Kopylova Award Recipient : Luke R. Thompson Award Recipient : Antonio Gonzalez Award Recipient : Rob Knight
                Funded by: HHS | National Institutes of Health (NIH) https://doi.org/10.13039/100000002
                Award ID: HHSN261201100105C
                Award Recipient : Daniel McDonald Award Recipient : Zhenjiang Zech Xu Award Recipient : Luke R. Thompson Award Recipient : Antonio Gonzalez
                Funded by: HHS | National Institutes of Health (NIH) https://doi.org/10.13039/100000002
                Award ID: P01DK078669-06A1
                Award Recipient : Evguenia Kopylova Award Recipient : Luke R. Thompson Award Recipient : Antonio Gonzalez Award Recipient : Rob Knight
                Funded by: HHS | National Institutes of Health (NIH) https://doi.org/10.13039/100000002
                Award ID: P30MH062512
                Award Recipient : Rob Knight
                Funded by: National Science Foundation (NSF) https://doi.org/10.13039/100000001
                Award ID: 1144086
                Award Recipient : Jamie Morton
                Funded by: National Science Foundation (NSF) https://doi.org/10.13039/100000001
                Award ID: 1144807
                Award Recipient : Eric P. Kightley
                Funded by: National Science Foundation (NSF) https://doi.org/10.13039/100000001
                Award ID: DBI-1565057
                Award Recipient : Rob Knight
                Funded by: Crohn's and Colitis Foundation of America (CCFA) https://doi.org/10.13039/100001063
                Award ID: 2928138A
                Award Recipient : Rob Knight
                Funded by: Alfred P. Sloan Foundation https://doi.org/10.13039/100000879
                Award ID: G-2015-13979
                Award Recipient : Embriette R. Hyde Award Recipient : Rob Knight
                Funded by: Alfred P. Sloan Foundation https://doi.org/10.13039/100000879
                Award ID: G-2015-13933
                Award Recipient : Rob Knight
                Funded by: National Multiple Sclerosis Society (National MS Society) https://doi.org/10.13039/100000890
                Award ID: 8911sc
                Award Recipient : Rob Knight
                Funded by: United States - Israel Binational Science Foundation (BSF) https://doi.org/10.13039/100006221
                Award ID: 2010287
                Award Recipient : Amnon Amir
                Funded by: U.S. Department of Justice (DOJ) https://doi.org/10.13039/100000074
                Award ID: 2015-DN-BX-K016
                Award Recipient : Embriette R. Hyde Award Recipient : Rob Knight
                Funded by: U.S. Department of Justice (DOJ) https://doi.org/10.13039/100000074
                Award ID: 2014-R2-CX-K411
                Award Recipient : Rob Knight
                Funded by: DOD | United States Navy | Office of Naval Research (ONR) https://doi.org/10.13039/100000006
                Award ID: N00014-15-1-2809
                Award Recipient : Amnon Amir Award Recipient : Luke R. Thompson Award Recipient : Rob Knight
                Funded by: UC | University of California, San Diego (UCSD) https://doi.org/10.13039/100007911
                Award ID: P30MH062512
                Award Recipient : Rob Knight
                Funded by: DOD | United States Army | MEDCOM | Congressionally Directed Medical Research Programs (CDMRP) https://doi.org/10.13039/100000090
                Award ID: W81XWH-15-1-0653
                Award Recipient : Amnon Amir Award Recipient : Rob Knight
                Funded by: Washington University in St. Louis (WUSTL) https://doi.org/10.13039/100007268
                Award ID: WU-15-284
                Award Recipient : Evguenia Kopylova Award Recipient : Luke R. Thompson Award Recipient : Antonio Gonzalez Award Recipient : Rob Knight
                Funded by: Washington University in St. Louis (WUSTL) https://doi.org/10.13039/100007268
                Award ID: WU-16-23
                Award Recipient : Rob Knight
                Funded by: UC | University of California, San Francisco (UCSF) https://doi.org/10.13039/100008069
                Award Recipient : Evguenia Kopylova Award Recipient : Luke R. Thompson Award Recipient : Antonio Gonzalez Award Recipient : Rob Knight
                Funded by: UC | University of California, San Francisco (UCSF) https://doi.org/10.13039/100008069
                Award ID: CA 1072-A-7
                Award Recipient : Rob Knight
                Funded by: Kenneth Rainin Foundation (KRF) https://doi.org/10.13039/100009576
                Award ID: 2015-537
                Award Recipient : Rob Knight
                Categories
                Observation
                Novel Systems Biology Techniques
                Editor's Pick
                Custom metadata
                March/April 2017

                dna sequencing,microbiome
                dna sequencing, microbiome

                Comments

                Comment on this article