6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Application of machine learning techniques for creating urban microbial fingerprints

      research-article
      1 , 2 , 3 ,
      Biology Direct
      BioMed Central
      Microbiome, Machine learning, Public health, Urban, Bioinformatics, Microbiota

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Research has found that human associated microbial communities play a role in homeostasis and the disruption of these communities may be important in an array of medical conditions. However outside of the human body many of these communities remain poorly studied. The Metagenomics and Metadesign of the Subways and Urban Biomes (MetaSUB) International Consortium is characterizing the microbiomes of urban environments with the aim to improve design of mass transit systems. As part of the CAMDA 2018 MetaSUB Forensics Challenge 311 city microbiome samples were provided to create urban microbial fingerprints, as well as a further 3 mystery datasets for validation.

          Results

          MetaSUB samples were clustered using t-SNE in an unsupervised fashion to almost discrete groups, which upon inspection represented city of origin. Based on this clustering, geographically close metropolitan areas appear to display similar microbial profiles such as those of Auckland and Hamilton. Mystery unlabeled samples were provided part of the challenge. A random forest classifier built on the initial dataset of 311 samples was capable of correctly classifying 83.3% of the mystery samples to their city of origin. Random Forest analyses also identified features with the highest discriminatory power, ranking bacterial species such as Campylobacter jejuni and Staphylococcus argenteus as highly predictive of city of origin. The surface from which the sample was collected displayed little detectable impact on the microbial profiles in the data generated here. The proportion of reads classified per sample varied greatly and so de-novo assembly was applied to recover genomic fragments representing organisms not captured in reference databases.

          Conclusions

          Current methods can differentiate urban microbiome profiles from each other with relative ease. De-novo assembly indicated that the MetaSUB metagenomic data contains adequate depth to recover metagenomic assembled genomes and that current databases are not sufficient to fully characterize urban microbiomes. Profiles found here indicate there may be a relationship between geographical distance between areas and the urban microbiome composition although this will need further research. The impact of these different profiles on public health is currently unknown but the MetaSUB consortium is uniquely suited to evaluate these and provide a roadmap for the inclusion of urban microbiome information for city planning and public health policy.

          Reviewers

          This article was reviewed by Dimitar Vassilev, Eran Elhaik and Chengsheng Zhu.

          Electronic supplementary material

          The online version of this article (10.1186/s13062-019-0245-x) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references14

          • Record: found
          • Abstract: found
          • Article: not found

          Phylogeography's past, present, and future: 10 years after Avise, 2000.

          Approximately 20 years ago, Avise and colleagues proposed the integration of phylogenetics and population genetics for investigating the connection between micro- and macroevolutionary phenomena. The new field was termed phylogeography. Since the naming of the field, the statistical rigor of phylogeography has increased, in large part due to concurrent advances in coalescent theory which enabled model-based parameter estimation and hypothesis testing. The next phase will involve phylogeography increasingly becoming the integrative and comparative multi-taxon endeavor that it was originally conceived to be. This exciting convergence will likely involve combining spatially-explicit multiple taxon coalescent models, genomic studies of natural selection, ecological niche modeling, studies of ecological speciation, community assembly and functional trait evolution. This ambitious synthesis will allow us to determine the causal links between geography, climate change, ecological interactions and the evolution and composition of taxa across whole communities and assemblages. Although such integration presents analytical and computational challenges that will only be intensified by the growth of genomic data in non-model taxa, the rapid development of "likelihood-free" approximate Bayesian methods should permit parameter estimation and hypotheses testing using complex evolutionary demographic models and genomic phylogeographic data. We first review the conceptual beginnings of phylogeography and its accomplishments and then illustrate how it evolved into a statistically rigorous enterprise with the concurrent rise of coalescent theory. Subsequently, we discuss ways in which model-based phylogeography can interface with various subfields to become one of the most integrative fields in all of ecology and evolutionary biology.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Geospatial Resolution of Human and Bacterial Diversity with City-Scale Metagenomics

            SUMMARY The panoply of microorganisms and other species present in our environment influence human health and disease, especially in cities, but have not been profiled with metagenomics at a city-wide scale. We sequenced DNA from surfaces across the entire New York City (NYC) subway system, the Gowanus Canal, and public parks. Nearly half of the DNA (48%) does not match any known organism; identified organisms spanned 1,688 bacterial, viral, archaeal, and eukaryotic taxa, which were enriched for harmless genera associated with skin (e.g., Acinetobacter). Predicted ancestry of human DNA left on subway surfaces can recapitulate U.S. Census demographic data, and bacterial signatures can reveal a station’s history, such as marine-associated bacteria in a hurricane-flooded station. Some evidence of pathogens was found (Bacillus anthracis), but a lack of reported cases in NYC suggests that the pathogens represent a normal, urban microbiome. This baseline metagenomic map of NYC could help long-term disease surveillance, bioterrorism threat mitigation, and health management in the built environment of cities.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found
              Is Open Access

              The bright side of microbial dark matter: lessons learned from the uncultivated majority.

              Microorganisms are the most diverse and abundant life forms on Earth. Yet, in many environments, only 0.1-1% of them have been cultivated greatly hindering our understanding of the microbial world. However, today cultivation is no longer a requirement for gaining access to information from the uncultivated majority. New genomic information from metagenomics and single cell genomics has provided insights into microbial metabolic cooperation and dependence, generating new avenues for cultivation efforts. Here we summarize recent advances from uncultivated phyla and discuss how this knowledge has influenced our understanding of the topology of the tree of life and metabolic diversity.
                Bookmark

                Author and article information

                Contributors
                feargal.ryan@sahmri.com
                Journal
                Biol Direct
                Biol. Direct
                Biology Direct
                BioMed Central (London )
                1745-6150
                16 August 2019
                16 August 2019
                2019
                : 14
                : 13
                Affiliations
                [1 ]ISNI 0000000123318773, GRID grid.7872.a, APC Microbiome Ireland, , University College Cork, ; Cork, Ireland
                [2 ]GRID grid.430453.5, South Australian Health and Medical Research Institute, ; Adelaide, Australia
                [3 ]ISNI 0000 0004 0367 2697, GRID grid.1014.4, Flinders University, ; Adelaide, Australia
                Author information
                http://orcid.org/0000-0002-1565-4598
                Article
                245
                10.1186/s13062-019-0245-x
                6697990
                30646935
                3e4edbd3-cd93-4fd1-b2fb-dc4ac77d7f56
                © The Author(s). 2019

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 4 October 2018
                : 6 August 2019
                Categories
                Research
                Custom metadata
                © The Author(s) 2019

                Life sciences
                microbiome,machine learning,public health,urban,bioinformatics,microbiota
                Life sciences
                microbiome, machine learning, public health, urban, bioinformatics, microbiota

                Comments

                Comment on this article