1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The OceanDNA MAG catalog contains over 50,000 prokaryotic genomes originated from various marine environments

      data-paper

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Marine microorganisms are immensely diverse and play fundamental roles in global geochemical cycling. Recent metagenome-assembled genome studies, with particular attention to large-scale projects such as Tara Oceans, have expanded the genomic repertoire of marine microorganisms. However, published marine metagenome data is still underexplored. We collected 2,057 marine metagenomes covering various marine environments and developed a new genome reconstruction pipeline. We reconstructed 52,325 qualified genomes composed of 8,466 prokaryotic species-level clusters spanning 59 phyla, including genomes from the deep-sea characterized as deeper than 1,000 m (n = 3,337), low-oxygen zones of <90 μmol O2 per kg water (n = 7,884), and polar regions (n = 7,752). Novelty evaluation using a genome taxonomy database shows that 6,256 species (73.9%) are novel and include genomes of high taxonomic novelty, such as new class candidates. These genomes collectively expanded the known phylogenetic diversity of marine prokaryotes by 34.2%, and the species representatives cover 26.5–42.0% of prokaryote-enriched metagenomes. Thoroughly leveraging accumulated metagenomic data, this genome resource, named the OceanDNA MAG catalog, illuminates uncharacterized marine microbial ‘dark matter’ lineages.

          Abstract

          Measurement(s) microbial community
          Technology Type(s) marine metagenome
          Sample Characteristic - Organism Bacteria • Archaea
          Sample Characteristic - Environment marine biome

          Related collections

          Most cited references86

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Trimmomatic: a flexible trimmer for Illumina sequence data

          Motivation: Although many next-generation sequencing (NGS) read preprocessing tools already existed, we could not find any tool or combination of tools that met our requirements in terms of flexibility, correct handling of paired-end data and high performance. We have developed Trimmomatic as a more flexible and efficient preprocessing tool, which could correctly handle paired-end data. Results: The value of NGS read preprocessing is demonstrated for both reference-based and reference-free tasks. Trimmomatic is shown to produce output that is at least competitive with, and in many cases superior to, that produced by other tools, in all scenarios tested. Availability and implementation: Trimmomatic is licensed under GPL V3. It is cross-platform (Java 1.5+ required) and available at http://www.usadellab.org/cms/index.php?page=trimmomatic Contact: usadel@bio1.rwth-aachen.de Supplementary information: Supplementary data are available at Bioinformatics online.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fast gapped-read alignment with Bowtie 2.

            As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              featureCounts: an efficient general purpose program for assigning sequence reads to genomic features.

              Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.
                Bookmark

                Author and article information

                Contributors
                ynishimura@aori.u-tokyo.ac.jp
                Journal
                Sci Data
                Sci Data
                Scientific Data
                Nature Publishing Group UK (London )
                2052-4463
                17 June 2022
                17 June 2022
                2022
                : 9
                : 305
                Affiliations
                [1 ]GRID grid.26999.3d, ISNI 0000 0001 2151 536X, Atmosphere and Ocean Research Institute, , The University of Tokyo, ; Chiba, 277-8564 Japan
                [2 ]GRID grid.26999.3d, ISNI 0000 0001 2151 536X, Graduate School of Frontier Sciences, , The University of Tokyo, ; Chiba, 277-8563 Japan
                [3 ]GRID grid.26999.3d, ISNI 0000 0001 2151 536X, Collaborative Research Institute for Innovative Microbiology, , The University of Tokyo, ; Tokyo, 113-8657 Japan
                [4 ]GRID grid.410588.0, ISNI 0000 0001 2191 0132, Present Address: Research Center for Bioscience and Nanoscience (CeBN), Research Institute for Marine Resources Utilization, , Japan Agency for Marine-Earth Science and Technology (JAMSTEC), ; Yokosuka, Kanagawa 237–0061 Japan
                Author information
                http://orcid.org/0000-0002-8028-3228
                http://orcid.org/0000-0001-9667-6010
                Article
                1392
                10.1038/s41597-022-01392-5
                9205870
                35715423
                d379a5a0-15fc-40a9-869a-388a3dda19bb
                © The Author(s) 2022

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 6 October 2021
                : 12 May 2022
                Funding
                Funded by: JST, ACT-X Grant Number JPMJAX21BK The purpose of ACT-X is finding and developing superior young researchers to address important problems facing our country. Please see the homepage for more information: https://www.jst.go.jp/kisoken/act-x/en/about/index.html
                Funded by: FundRef https://doi.org/10.13039/501100001691, MEXT | Japan Society for the Promotion of Science (JSPS);
                Award ID: 18K19224
                Award ID: 18H04136
                Award ID: 21K19134
                Award Recipient :
                Categories
                Data Descriptor
                Custom metadata
                © The Author(s) 2022

                environmental microbiology,microbial communities

                Comments

                Comment on this article