2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genome-Based Drug Target Identification in Human Pathogen Streptococcus gallolyticus

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Streptococcus gallolysticus ( Sg) is an opportunistic Gram-positive, non-motile bacterium, which causes infective endocarditis, an inflammation of the inner lining of the heart. As Sg has acquired resistance with the available antibiotics, therefore, there is a dire need to find new therapeutic targets and potent drugs to prevent and treat this disease. In the current study, an in silico approach is utilized to link genomic data of Sg species with its proteome to identify putative therapeutic targets. A total of 1,138 core proteins have been identified using pan genomic approach. Further, using subtractive proteomic analysis, a set of 18 proteins, essential for bacteria and non-homologous to host (human), is identified. Out of these 18 proteins, 12 cytoplasmic proteins were selected as potential drug targets. These selected proteins were subjected to molecular docking against drug-like compounds retrieved from ZINC database. Furthermore, the top docked compounds with lower binding energy were identified. In this work, we have identified novel drug and vaccine targets against Sg, of which some have already been reported and validated in other species. Owing to the experimental validation, we believe our methodology and result are significant contribution for drug/vaccine target identification against Sg-caused infective endocarditis.

          Related collections

          Most cited references61

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          KEGG: new perspectives on genomes, pathways, diseases and drugs

          KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an encyclopedia of genes and genomes. Assigning functional meanings to genes and genomes both at the molecular and higher levels is the primary objective of the KEGG database project. Molecular-level functions are stored in the KO (KEGG Orthology) database, where each KO is defined as a functional ortholog of genes and proteins. Higher-level functions are represented by networks of molecular interactions, reactions and relations in the forms of KEGG pathway maps, BRITE hierarchies and KEGG modules. In the past the KO database was developed for the purpose of defining nodes of molecular networks, but now the content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases. The newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined. Furthermore, the DISEASE and DRUG databases have been improved by systematic analysis of drug labels for better integration of diseases and drugs with the KEGG molecular networks. KEGG is moving towards becoming a comprehensive knowledge base for both functional interpretation and practical application of genomic information.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            ZINC 15 – Ligand Discovery for Everyone

            Many questions about the biological activity and availability of small molecules remain inaccessible to investigators who could most benefit from their answers. To narrow the gap between chemoinformatics and biology, we have developed a suite of ligand annotation, purchasability, target, and biology association tools, incorporated into ZINC and meant for investigators who are not computer specialists. The new version contains over 120 million purchasable “drug-like” compounds – effectively all organic molecules that are for sale – a quarter of which are available for immediate delivery. ZINC connects purchasable compounds to high-value ones such as metabolites, drugs, natural products, and annotated compounds from the literature. Compounds may be accessed by the genes for which they are annotated as well as the major and minor target classes to which those genes belong. It offers new analysis tools that are easy for nonspecialists yet with few limitations for experts. ZINC retains its original 3D roots – all molecules are available in biologically relevant, ready-to-dock formats. ZINC is freely available at http://zinc15.docking.org.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Prediction of protein subcellular localization.

              Because the protein's function is usually related to its subcellular localization, the ability to predict subcellular localization directly from protein sequences will be useful for inferring protein functions. Recent years have seen a surging interest in the development of novel computational tools to predict subcellular localization. At present, these approaches, based on a wide range of algorithms, have achieved varying degrees of success for specific organisms and for certain localization categories. A number of authors have noticed that sequence similarity is useful in predicting subcellular localization. For example, Nair and Rost (Protein Sci 2002;11:2836-2847) have carried out extensive analysis of the relation between sequence similarity and identity in subcellular localization, and have found a close relationship between them above a certain similarity threshold. However, many existing benchmark data sets used for the prediction accuracy assessment contain highly homologous sequences-some data sets comprising sequences up to 80-90% sequence identity. Using these benchmark test data will surely lead to overestimation of the performance of the methods considered. Here, we develop an approach based on a two-level support vector machine (SVM) system: the first level comprises a number of SVM classifiers, each based on a specific type of feature vectors derived from sequences; the second level SVM classifier functions as the jury machine to generate the probability distribution of decisions for possible localizations. We compare our approach with a global sequence alignment approach and other existing approaches for two benchmark data sets-one comprising prokaryotic sequences and the other eukaryotic sequences. Furthermore, we carried out all-against-all sequence alignment for several data sets to investigate the relationship between sequence homology and subcellular localization. Our results, which are consistent with previous studies, indicate that the homology search approach performs well down to 30% sequence identity, although its performance deteriorates considerably for sequences sharing lower sequence identity. A data set of high homology levels will undoubtedly lead to biased assessment of the performances of the predictive approaches-especially those relying on homology search or sequence annotations. Our two-level classification system based on SVM does not rely on homology search; therefore, its performance remains relatively unaffected by sequence homology. When compared with other approaches, our approach performed significantly better. Furthermore, we also develop a practical hybrid method, which combines the two-level SVM classifier and the homology search method, as a general tool for the sequence annotation of subcellular localization.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Genet
                Front Genet
                Front. Genet.
                Frontiers in Genetics
                Frontiers Media S.A.
                1664-8021
                25 March 2021
                2021
                : 12
                : 564056
                Affiliations
                [1] 1Department of Bioinformatics and Biosciences, Capital University of Science and Technology , Islamabad, Pakistan
                [2] 2Department of Biological Sciences, National University of Medical Sciences , Rawalpindi, Pakistan
                [3] 3Department of Biochemistry, Bahauddin Zakariya University , Multan, Pakistan
                [4] 4Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University , Riyadh, Saudi Arabia
                [5] 5Department of Pharmacology, College of Pharmacy, King Saud University , Riyadh, Saudi Arabia
                [6] 6Department of Soil Science, College of Food and Agriculture Sciences, King Saud University , Riyadh, Saudi Arabia
                [7] 7Department of Pharmacognosy (MAPPRC), College of Pharmacy, King Saud University , Riyadh, Saudi Arabia
                Author notes

                Edited by: Debmalya Barh, Institute of Integrative Omics and Applied Biotechnology (IIOAB), India

                Reviewed by: Ashutosh Mani, Motilal Nehru National Institute of Technology Allahabad, India; Muhammad Tariq, University of Tabuk, Saudi Arabia; Nurnabi Azad Jewel, Shahjalal University of Science and Technology, Bangladesh; Muhammad Ilyas, Islamia College University, Pakistan

                *Correspondence: Riaz Ullah, rullah@ 123456ksu.edu.sa

                This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics

                Article
                10.3389/fgene.2021.564056
                8027347
                33841489
                13a79958-3d45-432a-9e74-1e5d966af3c8
                Copyright © 2021 Qureshi, Bakhtiar, Faheem, Shah, Bari, Mahmood, Sohaib, Mothana, Ullah and Jamal.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 20 May 2020
                : 16 February 2021
                Page count
                Figures: 14, Tables: 16, Equations: 0, References: 63, Pages: 20, Words: 0
                Categories
                Genetics
                Original Research

                Genetics
                streptococcus gallollyticus,infective endocarditis,pan-genome,subtractive proteomics,drug prioritization

                Comments

                Comment on this article