3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      DCiPatho: deep cross-fusion networks for genome scale identification of pathogens

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Pathogen detection from biological and environmental samples is important for global disease control. Despite advances in pathogen detection using deep learning, current algorithms have limitations in processing long genomic sequences. Through the deep cross-fusion of cross, residual and deep neural networks, we developed DCiPatho for accurate pathogen detection based on the integrated frequency features of 3-to-7 k-mers. Compared with the existing state-of-the-art algorithms, DCiPatho can be used to accurately identify distinct pathogenic bacteria infecting humans, animals and plants. We evaluated DCiPatho on both learned and unlearned pathogen species using both genomics and metagenomics datasets. DCiPatho is an effective tool for the genomic-scale identification of pathogens by integrating the frequency of k-mers into deep cross-fusion networks. The source code is publicly available at https://github.com/LorMeBioAI/DCiPatho.

          Related collections

          Most cited references48

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

          Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Scikit-learn: Machine learning in python

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              The global burden of pathogens and pests on major food crops

              Crop pathogens and pests reduce the yield and quality of agricultural production. They cause substantial economic losses and reduce food security at household, national and global levels. Quantitative, standardized information on crop losses is difficult to compile and compare across crops, agroecosystems and regions. Here, we report on an expert-based assessment of crop health, and provide numerical estimates of yield losses on an individual pathogen and pest basis for five major crops globally and in food security hotspots. Our results document losses associated with 137 pathogens and pests associated with wheat, rice, maize, potato and soybean worldwide. Our yield loss (range) estimates at a global level and per hotspot for wheat (21.5% (10.1-28.1%)), rice (30.0% (24.6-40.9%)), maize (22.5% (19.5-41.1%)), potato (17.2% (8.1-21.0%)) and soybean (21.4% (11.0-32.4%)) suggest that the highest losses are associated with food-deficit regions with fast-growing populations, and frequently with emerging or re-emerging pests and diseases. Our assessment highlights differences in impacts among crop pathogens and pests and among food security hotspots. This analysis contributes critical information to prioritize crop health management to improve the sustainability of agroecosystems in delivering services to societies.
                Bookmark

                Author and article information

                Contributors
                Journal
                Brief Bioinform
                Brief Bioinform
                bib
                Briefings in Bioinformatics
                Oxford University Press
                1467-5463
                1477-4054
                July 2023
                30 May 2023
                30 May 2023
                : 24
                : 4
                : bbad194
                Affiliations
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                College of Artificial Intelligence, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Department of Computer Science, City University of Hong Kong , Hong Kong 999077, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization , Laboratory of Bio-interactions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                College of Artificial Intelligence, Nanjing Agricultural University , Nanjing 210095, Jiangsu, China
                Author notes
                Corresponding authors. Wei Xue, College of Artificial Intelligence, Nanjing Agricultural University, Nanjing 210095, Jiangsu, China. Tel.: 025-84395042; E-mail: xwsky@ 123456njau.edu.cn ; Zhong Wei, Jiangsu Provincial Key Laboratory for Organic Solid Waste Utilization, Laboratory of Biointeractions and Crop Health, Jiangsu Collaborative Innovation Center for Solid Organic Waste Resource Utilization, National Engineering Research Center for Organic-based Fertilizers, Joint International Research Laboratory of Soil Health, Nanjing Agricultural University, Nanjing 210095, Jiangsu, China Tel.: 025-84396864; E-mail: weizhong@ 123456njau.edu.cn
                Article
                bbad194
                10.1093/bib/bbad194
                10359081
                37249547
                fc3d639c-fef7-425e-aa21-4bdff7a2fcf5
                © The Author(s) 2023. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

                History
                : 1 March 2023
                : 12 April 2023
                : 30 April 2023
                Page count
                Pages: 10
                Funding
                Funded by: National Natural Science Foundation of China, DOI 10.13039/501100001809;
                Award ID: 42090062
                Award ID: 42090064
                Award ID: 42007038
                Award ID: 42277113
                Award ID: 31972504
                Funded by: Fundamental Research Funds for the Central Universities, DOI 10.13039/501100012226;
                Award ID: KYT2023001
                Award ID: XUEKEN2023039
                Award ID: XUEKEN2023044
                Funded by: Natural Science Foundation of Jiangsu Province, DOI 10.13039/501100004608;
                Award ID: SBK2023030230
                Funded by: China National Tobacco Corporation, DOI 10.13039/501100008862;
                Award ID: 110202101047(LS-07)
                Funded by: Jiangxi Branch of China National Tobacco Corporation;
                Award ID: 2021.01.010
                Categories
                Problem Solving Protocol
                AcademicSubjects/SCI01060

                Bioinformatics & Computational biology
                pathogen identification,k-mer frequency,metagenomics,deep cross-fusion networks

                Comments

                Comment on this article