6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Diverse patterns of correspondence between protist metabarcodes and protist metagenome-assembled genomes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Two common approaches to study the composition of environmental protist communities are metabarcoding and metagenomics. Raw metabarcoding data are usually processed into Operational Taxonomic Units (OTUs) or amplicon sequence variants (ASVs) through clustering or denoising approaches, respectively. Analogous approaches are used to assemble metagenomic reads into metagenome-assembled genomes (MAGs). Understanding the correspondence between the data produced by these two approaches can help to integrate information between the datasets and to explain how metabarcoding OTUs and MAGs are related with the underlying biological entities they are hypothesised to represent. MAGs do not contain the commonly used barcoding loci, therefore sequence homology approaches cannot be used to match OTUs and MAGs. We made an attempt to match V9 metabarcoding OTUs from the 18S rRNA gene (V9 OTUs) and MAGs from the Tara Oceans expedition based on the correspondence of their relative abundances across the same set of samples. We evaluated several metrics for detecting correspondence between features in these two datasets and developed controls to filter artefacts of data structure and processing. After selecting the best-performing metrics, ranking the V9 OTU/MAG matches by their proportionality/correlation coefficients and applying a set of selection criteria, we identified candidate matches between V9 OTUs and MAGs. In some cases, V9 OTUs and MAGs could be matched with a one-to-one correspondence, implying that they likely represent the same underlying biological entity. More generally, matches we observed could be classified into 4 scenarios: one V9 OTU matches many MAGs; many V9 OTUs match many MAGs; many V9 OTUs match one MAG; one V9 OTU matches one MAG. Notably, we found some instances in which different OTU-MAG matches from the same taxonomic group were not classified in the same scenario, with all four scenarios possible even within the same taxonomic group, illustrating that factors beyond taxonomic lineage influence the relationship between OTUs and MAGs. Overall, each scenario produces a different interpretation of V9 OTUs, MAGs and how they compare in terms of the genomic and ecological diversity they represent.

          Related collections

          Most cited references48

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Anvi’o: an advanced analysis and visualization platform for ‘omics data

          Advances in high-throughput sequencing and ‘omics technologies are revolutionizing studies of naturally occurring microbial communities. Comprehensive investigations of microbial lifestyles require the ability to interactively organize and visualize genetic information and to incorporate subtle differences that enable greater resolution of complex data. Here we introduce anvi’o, an advanced analysis and visualization platform that offers automated and human-guided characterization of microbial genomes in metagenomic assemblies, with interactive interfaces that can link ‘omics data from multiple sources into a single, intuitive display. Its extensible visualization approach distills multiple dimensions of information about each contig, offering a dynamic and unified work environment for data exploration, manipulation, and reporting. Using anvi’o, we re-analyzed publicly available datasets and explored temporal genomic changes within naturally occurring microbial populations through de novo characterization of single nucleotide variations, and linked cultivar and single-cell genomes with metagenomic and metatranscriptomic data. Anvi’o is an open-source platform that empowers researchers without extensive bioinformatics skills to perform and communicate in-depth analyses on large ‘omics datasets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Shotgun metagenomics, from sampling to analysis

            The promises and potential pitfalls of shotgun metagenomics, from experimental design to computational analyses, are reviewed.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Ocean plankton. Eukaryotic plankton diversity in the sunlit ocean.

              Marine plankton support global biological and geochemical processes. Surveys of their biodiversity have hitherto been geographically restricted and have not accounted for the full range of plankton size. We assessed eukaryotic diversity from 334 size-fractionated photic-zone plankton communities collected across tropical and temperate oceans during the circumglobal Tara Oceans expedition. We analyzed 18S ribosomal DNA sequences across the intermediate plankton-size spectrum from the smallest unicellular eukaryotes (protists, >0.8 micrometers) to small animals of a few millimeters. Eukaryotic ribosomal diversity saturated at ~150,000 operational taxonomic units, about one-third of which could not be assigned to known eukaryotic groups. Diversity emerged at all taxonomic levels, both within the groups comprising the ~11,200 cataloged morphospecies of eukaryotic plankton and among twice as many other deep-branching lineages of unappreciated importance in plankton ecology studies. Most eukaryotic plankton biodiversity belonged to heterotrophic protistan groups, particularly those known to be parasites or symbiotic hosts.
                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: MethodologyRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: MethodologyRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS One
                PLoS One
                plos
                PLOS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2024
                6 June 2024
                : 19
                : 6
                : e0303697
                Affiliations
                [1 ] Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
                [2 ] CNRS, FR2424, ABiMS, Station Biologique de Roscoff, Sorbonne Université, Roscoff, France
                [3 ] CNRS, UMR7144, AD2M, Station Biologique de Roscoff, Sorbonne Université, Roscoff, France
                CEA lRlG: Commissariat a l’energie atomique et aux energies alternatives lnstitut de Recherche Interdisciplinaire de Grenoble, FRANCE
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Author information
                https://orcid.org/0000-0001-8689-9907
                Article
                PONE-D-23-39763
                10.1371/journal.pone.0303697
                11156365
                38843225
                04112366-5ef5-406c-b203-ce8d3a54f59d
                © 2024 Zavadska et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 28 November 2023
                : 29 April 2024
                Page count
                Figures: 3, Tables: 0, Pages: 14
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100010663, H2020 European Research Council;
                Award ID: 949745
                Award Recipient :
                Funded by: Departament de Recerca i Universitats de la Generalitat de Catalunya
                Award ID: 2021 SGR 00751
                Award Recipient :
                This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 949745). We also acknowledge support from the Departament de Recerca i Universitats de la Generalitat de Catalunya (exp. 2021 SGR 00751).
                Categories
                Research Article
                Biology and Life Sciences
                Taxonomy
                Computer and Information Sciences
                Data Management
                Taxonomy
                Biology and Life Sciences
                Genetics
                Genomics
                Biology and Life Sciences
                Genetics
                Genomics
                Metagenomics
                Biology and Life Sciences
                Organisms
                Eukaryota
                Protists
                Biology and life sciences
                Molecular biology
                Molecular biology techniques
                Cloning
                DNA cloning
                Shotgun Sequencing
                Research and analysis methods
                Molecular biology techniques
                Cloning
                DNA cloning
                Shotgun Sequencing
                Biology and Life Sciences
                Molecular Biology
                Molecular Biology Techniques
                Sequencing Techniques
                Shotgun Sequencing
                Research and Analysis Methods
                Molecular Biology Techniques
                Sequencing Techniques
                Shotgun Sequencing
                Biology and life sciences
                Biochemistry
                Nucleic acids
                RNA
                Non-coding RNA
                Ribosomal RNA
                Biology and life sciences
                Biochemistry
                Ribosomes
                Ribosomal RNA
                Biology and life sciences
                Cell biology
                Cellular structures and organelles
                Ribosomes
                Ribosomal RNA
                Biology and Life Sciences
                Organisms
                Eukaryota
                Biology and life sciences
                Molecular biology
                Molecular biology techniques
                Sequencing techniques
                DNA sequencing
                Gene Sequencing
                Research and analysis methods
                Molecular biology techniques
                Sequencing techniques
                DNA sequencing
                Gene Sequencing
                Custom metadata
                The data on SMAG abundance and SMAG taxonomic assignment was taken from: https://doi.org/10.1016/j.xgen.2022.100123 Supplementary Data. SMAG sequences were obtained from: https://www.genoscope.cns.fr/tara/V9 raw abundance dataset for V9 metabarcodes organised at OTU level was obtained from: https://doi.org/10.5281/zenodo.7236051 All the intermediate and accessory datasets generated and used in our manuscript, along with the code and plots are available at https://github.com/beaplab/Protist_barcode_MAG_correspondence.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article